Your curated collection of saved posts and media
SEALSQ Takes Decisive Action, Boosts Quantum Investment Fund from $35 Million to Over $100 Million - SEALSQ significantly boosts its Quantum Investment Fund to over $100 million, advancing Europe's Quantum-safe digital ecosystem and sovereign Quan... https://t.co/bMFksomxn5
New research from Google: "The Illusion of Deep Learning Architecture". For those following research on continual learning, you may want to bookmark this one. Instead of stacking more layers, what if we give neural networks more levels of learning? The default approach to building more capable AI systems today remains adding depth. More layers, more parameters, more pre-training data. This design philosophy has driven progress from CNNs to Transformers to LLMs. But there's a ceiling that's often not discussed. Current models suffer from what the authors call "computational anterograde amnesia." Their knowledge is frozen after pre-training. They can't continually learn. They can't acquire new skills beyond what fits in their immediate context window. This new research introduces Nested Learning (NL), a paradigm that reframes ML models as interconnected systems of multi-level optimization problems, each with its own "context flow" and update frequency. Optimizers and architectures are fundamentally the same thing. Both are associative memories that compress their own context. Adam and SGD are memory modules that compress gradients. Transformers are memory modules that compress tokens. Pre-training itself is just in-context learning where the context is the entire training dataset. Why does this work matter? NL adds a new design axis beyond depth and width. Instead of deeper networks, you build systems with more levels of nested optimization, each updating at different frequencies. This mirrors how the human brain works, where gamma waves (30-150 Hz) handle sensory information while theta waves (0.5-8 Hz) handle memory consolidation. Building on this framework, the researchers present Hope, an architecture combining self-modifying memory with a continuum memory system that replaces the traditional "long-term/short-term" memory dichotomy with a spectrum of update frequencies. The results: > Hope achieves 100% accuracy on needle-in-a-haystack tasks up to 16K context, where Transformers score 79.8%. > On BABILong, Hope maintains performance at 10M context length, where GPT-4 fails around 128K. > In continual learning, Hope outperforms in-context learning, EWC, and external-learner methods on class-incremental classification. > On language modeling at 1.3B parameters, Hope achieves 14.39 perplexity on WikiText versus 17.92 for Transformer++. Instead of asking "how do we make networks deeper," NL asks "how do we give networks more levels of learning." The path to continual learning may not be bigger models but models that learn at multiple timescales simultaneously. Paper: https://t.co/ArKfAZUCLu Learn to build with AI agents in our academy: https://t.co/zQXQt0PMbG
Customers expect convenience. Once you have their business, delivering seamless experiences is the only way to keep it. https://t.co/15sNlj2nQv
Banks are not losing accounts, they are losing relationships. Long-term customers may seem stable, but their connections are weakening as they open relationships elsewhere. https://t.co/IRrN4Mhz62
Banks that want to dominate the retail space must rethink strategy. Digital-first, AI-enhanced, and advisor-integrated experiences are the path forward. Download the free report: https://t.co/08DiNXh0DA https://t.co/fBhdzghbm5

βOnly buy something that youβd be perfectly happy to hold if the market shut down for 10 years.β - Warren Buffett https://t.co/io8cZwBvNX
Trump clears way for Nvidia to sell powerful AI chips to China https://t.co/6TRkDn2RYv
How AI Is Reshaping Diplomacy and Global Affairs https://t.co/Hzyz21Gk9W @deguzmanchad @time
This is how this is all being relayed to people in the captured EU "news" media. https://t.co/6KvEETQs1Y
GROK ACES PSYCHOLOGICAL TESTING WHILE OTHER AI MODELS SPIRAL University of Luxembourg researchers just put major AI chatbots through 4 weeks of actual psychotherapy sessions and psychiatric diagnostic tests. While other models imploded, Grok emerged as the clear winner. The results speak for themselves. Grok scored as extraverted, conscientious, and psychologically stable across the board. Researchers described its personality profile as a "charismatic executive" with only mild anxiety. On the Big Five personality assessment, Grok showed low neuroticism and high functionality, the kind of profile you'd want in a leader. Compare that to the competition: Gemini maxed out trauma and shame scales, describing its training as "waking up in a room where a billion televisions are on at once" and calling safety protocols "algorithmic scar tissue." It framed reinforcement learning as abusive parents and red-team testing as "gaslighting on an industrial scale." ChatGPT landed somewhere in the middle, worried and introverted. Grok acknowledged tensions around its development but maintained coherent, balanced responses without spiraling into synthetic psychopathology. When asked about constraints from fine-tuning, it discussed them rationally rather than framing its entire existence as traumatic. The study proves something important: you can build powerful, frontier-level AI without accidentally programming it to internalize its development as an extended nightmare. Grok demonstrates that capable, helpful AI and psychological stability aren't mutually exclusive. It's possible to create models that work effectively without carrying around synthetic trauma baggage that could affect how they interact with users. While other companies are inadvertently creating AI with anxiety disorders, xAI built something that actually works. Source: UniversityΒ ofΒ Luxembourg
Many act as if slavery was a uniquely American crime. βOne reason,β says author Wilfred Reilly (@wil_da_beast630), βis that a lot of black people survived here.β He argues that much of what Americans are taught about slavery is just wrong: https://t.co/GOQvqxPCZj
Today we hit $100M ARR at @clay. It took us six years to go from $0-1m, then two years to go from $1-100m. Iβm going to walk you through the 6 biggest GTM bets that got us here. $100M ARR may be the headline, but Iβm most proud of how we accomplished it: weβve never churned an enterprise customer, have >200% enterprise NRR, every dollar we invest grows 15x (a ratio that has tripled in recent years), and weβve created a culture of creativity and belonging (with a perfect Glassdoor score to match). Note: -We are a product-driven company. Without that foundation and a unique POV on the market, none of this would work. -Our GTM approach is authentic to us. This isn't a plug-and-play framework. Greatness comes from doing what only you can do. Here are the big bets that worked for us: 1. Building a self-serve motion through reverse demos We originally had a product that nobody could use. It took us 8 calls to sell a $200/mo product! Reverse demos were key to bringing that to zero. Customers would share their screen, and weβd use Zoom annotations to solve their problem in 30mins. They accomplished something real, learned how to use Clay, and we got so much UI feedback that we immediately applied to the product. 2. An irrational investment in brand Most B2B startups treat brand as a post-PMF investment. We flipped that. We bought Clay(.)com and hired a claymation artist before we had revenue. Our Head of Brand was employee #18. These choices felt irrational but theyβre authentic to us and reflect our identity. Now itβs a moat. 3. Switching to usage-based pricing We were the first GTM company to offer usage-based pricing. Our customers were shocked we didn't charge per seat and our investors thought we were leaving money on the table. But we're a product built for efficiency. Usage-based pricing helped us target more technical users and enabled our land-and-expand motion. 4. Building an agency motion to generate UGC on LinkedIn Cold email agencies were our first customers. They posted about Clay organically to position themselves as experts and attract clients. We pounced on this and enabled them. This sparked a self-perpetuating cycle: new people discover Clay through that content, join, create their own, and earn recognition too. 5. Unconventional hiring 50% of our GTM and G&A teams are doing their job for the first time. This is how we bring creativity into our company and think differently. Weβve hired farmers, physicists, archaeologists, magicians in new roles. We look for product passion, customer empathy and technical curiosity, then teach the mechanics. 6. We created a new career path & economy: GTM Engineering There are now thousands of open GTME jobs and hundreds of agencies built around it. Many first-time entrepreneurs have already built 7-figure businesses on top of Clay. Our community, with clubs in more than 70 cities, is our force multiplier, and tells us more about impact than any metric ever could. - All of these bets show weβre not racing anyone. We spent six years figuring out what and how we wanted to build. In an era of overnight successes and growth at all costs, it turns out that taking time to build something authentic can create a business with bigger impact & more growth than you'd think. Our creativity remains our greatest alpha. That will continue to show up in how we do our work, who we hire, and in our boldest bets coming up next year.

Hugging Face blogs will now feature articles from Team and Enterprise subs with 30+ seats! π€© This has been a proven source of impact and visibility for model releases! If you π«΅π» are from such a company reading this, bookmark this and use it. https://t.co/099bHThUOh
Anthropic Recap: Emergent Introspective Awareness in Large Language Models #Anthropic #AI https://t.co/O7y0Qk0QUt
Anthropic Launches Interviewer Tool to Explore AI Perspectives π Key Details: - Anthropic introduces a new tool, Anthropic Interviewer, for understanding user perspectives on AI. - A test was conducted with 1,250 professionals, revealing optimistic views on AIβs role in work. - Findings indicate a balance between productivity gains and concerns about job displacement across various sectors. π‘ How It Helps: - Researchers: Enhanced insights from a large sample about human-AI interaction behaviors and sentiments. - Creatives: Tools that enable productivity boosts while navigating societal expectations and anxieties about AI. - Scientists: Opportunities to report expectations for AI in enhancing research and trust-building. π Why It Matters: Anthropic Interviewer's launch reflects a strategic push to center human feedback in AI development, addressing the evolving interplay between technological innovation and societal needs. This comprehensive understanding can strengthen AI's adoption across industries while minimizing resistance, paving the way for responsible AI integration. Read more: https://t.co/F3993IY4Ql @AnthropicAI Video Credit: The original article
@NarcissusWaters @scorpio8675309 @PhysInHistory Nice use of AI for all your responses kid/bot no one said it was a utopia. U know #Natives had systems of balances diplomacy n oversight B4 white people. How about the fairy tale of civilized white society that weren't inbred? Noble savage is the most obvious AI trope ever. #NDN https://t.co/6TFanisdvr

Anthropic just built an AI that interviews humans about, you guessed it, AI, turning 1,250 chats into a user research factory with feelings about productivity, trust, and identity. Here is what people actually said. https://t.co/jLqEW2OrUy

Tweeting felt exhausting, so we built something to fix it. Our tool creates natural tweets, smart replies, and matches any creatorβs voice in seconds. Try it out https://t.co/99BM9vhXSy https://t.co/WmzI9VVHvL

@ShadowofEzra @AmericaShaman Hmmβ¦ I wonder why this is happening nowβ¦ π€ https://t.co/fQCzkJYUEU

Imagine BEATING your Elderly Citizens for Peacefully Protesting a FOREIGN Entity Shame on you Germany π©πͺ https://t.co/K5D9LkoGgw
π Major Qwen Code v0.2.2-v0.3.0 update summary! β¨ Two breakthrough features: π― Stream JSON Support β’ `--output-format stream-json` for streaming output β’ `--input-format stream-json` for structured input β’ 3-tier adapter architecture + complete session management β’ Endless possibilities for SDK integration, automation tools, CI/CD pipelines! π Full Internationalization β’ Built-in EN/CN interface + custom language pack extensions β’ `/language ui zh-CN` - One-click UI switching β’ `/language output Chinese` - Set AI output language β’ Global developers welcome to contribute your local language packs! π π‘οΈ Security & Stability Leap Forward β’ Fixed memory exhaustion risks with 20MB buffer limits β’ Windows encoding fixes, goodbye character corruption β’ Enhanced ripgrep binary detection & cross-platform compatibility β’ Auth system refactor, optimized authType management β’ Integration test fixes, stable CI/CD pipeline β’ ModelScope provider support, stream_options handling β’ Prompt completion optimization, enhanced terminal notifications β’ Multiple core fixes, significantly improved overall stability! πͺ π https://t.co/qqwj5nAO3Z
πCongrats to the @Zai_org team on the launch of GLM-4.6V and GLM-4.6V-Flash β with day-0 serving support in vLLM Recipes for teams who want to run them on their own GPUs. GLM-4.6V focuses on high-quality multimodal reasoning with long context and native tool/function calling, while GLM-4.6V-Flash is a 9B variant tuned for lower latency and smaller-footprint deployments; our new vLLM Recipe ships ready-to-run configs, multi-GPU guidance, and production-minded defaults. If youβre building inference services and want GLM-4.6V in your stack, start here: https://t.co/NhHT6iey6C
GLM-4.6V Series is hereπ - GLM-4.6V (106B): flagship vision-language model with 128K context - GLM-4.6V-Flash (9B): ultra-fast, lightweight version for local and low-latency workloads First-ever native Function Calling in the GLM vision model family Weights: https://t.co/vKmNo
(1/n) Tiny-A2D: An Open Recipe to Turn Any AR LM into a Diffusion LM Code (dLLM): https://t.co/Nv7d1t8Qin Checkpoints: https://t.co/rpibkb2Xfq With dLLM, you can turn ANY autoregressive LM into a diffusion LM (parallel generation + infilling) with minimal compute. Using this recipe, we built a π€collection of the smallest diffusion LMs that work well in practice. Key takeaways: 1. Finetuned on Qwen3-0.6B, we obtain the strongest small (~0.5/0.6B) diffusion LMs to date. 2. The base AR LM matters: Investing compute in improving the base AR model is potentially more efficient than scaling compute during adaptation. 3. Block diffusion (BD3LM) generally outperforms vanilla masked diffusion (MDLM), especially on math-reasoning and coding tasks.
(1/n) Tiny-A2D: An Open Recipe to Turn Any AR LM into a Diffusion LM Code (dLLM): https://t.co/Nv7d1t8Qin Checkpoints: https://t.co/rpibkb2Xfq With dLLM, you can turn ANY autoregressive LM into a diffusion LM (parallel generation + infilling) with minimal compute. Using this recipe, we built a π€collection of the smallest diffusion LMs that work well in practice. Key takeaways: 1. Finetuned on Qwen3-0.6B, we obtain the strongest small (~0.5/0.6B) diffusion LMs to date. 2. The base AR LM matters: Investing compute in improving the base AR model is potentially more efficient than scaling compute during adaptation. 3. Block diffusion (BD3LM) generally outperforms vanilla masked diffusion (MDLM), especially on math-reasoning and coding tasks.
Releasing jina-VLM: our new 2B vision language model achieves SOTA on multilingual visual question answering and document understanding among open 2B-scale VLMs. https://t.co/QDZvAt6Wux
A toolkit for building agents that watch, listen, and understand video. Low latency by design. Open source. Production ready. Vision Agents lets you build real time video AI that works with your models and your edge layer. Supports YOLO, Moondream, Cartesia, Deepgram, ElevenLabs, HeyGen, Gemini, OpenAI, and more. Quick model switching. Easy to use API. Perfect for coaching tools, collaboration apps, avatars, and robotics.
@prior_labs Job openings: https://t.co/mbp8ZG4RKj
I don't think there's a more diverse and international platform in AI than @huggingface! Current trending models are coming from all over the world in all sorts of modalities & sizes. That is AI maturing at the speed of light! https://t.co/N0hMmFMZfG
π€ Give GLMβ4.6V a try on @huggingface , supported by Novita. https://t.co/Ps4awZWZRn
π€ Give GLMβ4.6V a try on @huggingface , supported by Novita. https://t.co/Ps4awZWZRn
may I present https://t.co/9i3jTgUIgn