Your curated collection of saved posts and media
discuss: https://t.co/uJHaaQkg5G
We just released TRL v0.26.0! It comes packed with updates: > Agent training with tools in GRPO > New CISPO & SAPO losses > Reasoning rewards > vLLM quantization in colocate mode > Dataset shuffling in SFT > Lots of NEW examples > Tons of fixes and documentation improvements https://t.co/Vt3dmI1sLU
Sharing the slides from a talk I gave this week on bridging the gap between research experiments and building production-ready models, based on our recent Smol Training Playbook. https://t.co/RmG53PytMv
We are now the #1 trending text-gen <256B size model on HuggingFace!! https://t.co/SyoOHjWfvH
We are now the #1 trending text-gen <256B size model on HuggingFace!! https://t.co/SyoOHjWfvH

π₯ Ultra-FineWeb-en-v1.4 is coming! 2.2T tokens fully open-sourced! The core training fuel for MiniCPM4 / 4.1, fully updated based on FineWeb v1.4.0: π What's New 1οΈβ£ Fresher Data: Added CommonCrawl snapshots from Apr 2024 - Jun 2025 to capture the latest world knowledge. 2οΈβ£ Easier Access: CC Dump Slices are here! No need to download the entire massive dataset anymore, fetch exactly what you need seamlessly. β‘ Highlights & Performance - Efficient Verification: Efficient Verification Strategy: Reduces data verification cost by 90% - High-Efficiency Filtering Pipeline: Optimizes selection of both positive and negative samples - Performance Gains: +3.613/+1.331 (Eng) & +1.98/+0.61 (Chn) vs. FineWeb/FineWeb-edu & Chinese FineWeb-edu-v2. Still high-quality cleaning. Still true to the open-source spirit. Welcome to download and test! π π Resources π€ Dataset: https://t.co/KluL5t2kUn π Paper: https://t.co/Kg9LLUqZgB π§© Classifier:https://t.co/oUfxrN6AmP π€ MiniCPM4:https://t.co/IQ82jD1PTi #UltraFineWeb #MiniCPM4 #AI #LLM #OpenBMB #UltraData

> llama-cli -hf org/model
> llama-cli -hf org/model

llama.cpp gets a new CLI (tested it and it's π₯) https://t.co/XKVicocKGC
llama.cpp gets a new CLI (tested it and it's π₯) https://t.co/XKVicocKGC

Even if you don't have a reachy mini (yet!), you can now creates apps thanks to our SDK, API and simulation and share them with the community. If you create simple apps in the coming days, I'll try them on my mini and share a video of them (+ you'll probably get good visibility as we're shipping a large number soon). Some ideas I had: - "What is love" Reachy mini plays "what is love?" and do the classic Jim Carrey head move - "Metronome app": Reachy's antennas turn into a metronome - "Relax app": Reachy plays relaxing music and does some calm zen moves - "Magic 8-Ball" Reachy answers a simple yes/no question by nodding or shaking its head based on a random outcome. - "Peek-a-Boo": Reachy stays hidden until an object (like a hand) gets close, then quickly pops its head up. - "Bless you": Reachy mini says "bless you" when you cough - "Take a picture": Reachy mini takes a picture of you - "Read": Reachy mini reads the paper you show him (using OCR) - "Face-tracking app" simple face-tracking app - Reachy follows your face as you move around - "Who's speaking": Reachy tries to detect who's speaking and turn/concentrate on them - "Describe the room": Reachy mini scans the room and describes it - "Describe what's on my screen": Reachy detects your laptop screen and describes what's on it - "Describe the object" you show an object, Reachy tells you what it is - "Where are?" Where are my keys, Reachy scans the room and point to where they are - "Dance app": Reachy mini does simple dances - "Translation app": you say something in English, Reachy repeats it in French So many ideas haha
Rnj-1-Instruct is now the #1 trending text generation model on HF! https://t.co/Gt5WGq9vLp
Rnj-1-Instruct is now the #1 trending text generation model on HF! https://t.co/Gt5WGq9vLp

Sharing a fun recipe for building a highly autonomous, moderately capable, and very UNreliable agent using the open source aisuite package that Rohit Prasad and I have been working on. With a few lines of code, you can give a frontier LLM a tool (like disk access or web search), prompt it with a high-level task (such as creating a snake game and saving as an HTML file, or carrying out deep research), and let the LLM loose and see what it does. Example in image. Caveat: This is not how practical agents are built today, since most need much more scaffolding (see my Agentic AI course to learn more), but is still interesting to experiment with. Longer write-up here: https://t.co/BdS8tGhnIy

A year ago, we verified a preview of an unreleased version of @OpenAI o3 (High) that scored 88% on ARC-AGI-1 at est. $4.5k/task Today, weβve verified a new GPT-5.2 Pro (X-High) SOTA score of 90.5% at $11.64/task This represents a ~390X efficiency improvement in one year https://t.co/9T47FdZ5Ry
A website that doesn't rank is a missed opportunity. Weβve added a complete SEO engine to Manus Web App so you can stop worrying about config and start getting traffic. Turn your site into a growth engine today. https://t.co/c8wZzgURBM https://t.co/EP7ITH4YV0

the Kalshi bets for Time Person of the Year being "AI" resolved to No because the actual winner was a bunch of AI executives they called "Architects of AI" https://t.co/TwOipqmjt1
Iβve watched this 200 times https://t.co/JhwGXUMsI6
Iβve watched this 200 times https://t.co/JhwGXUMsI6
https://t.co/ydnVN9sHcw
American Canto sold 1165 physical copies in its first week, per bookscan.
https://t.co/ydnVN9sHcw

Insane story about @JackPosobiec threatening a reporter over a Pentagon official's apparent cuckolding Goodreads account. https://t.co/FeQLirpVhV

.@OpenAIβs GPT-5.2 is now rolling out in public preview in GitHub Copilot. This model is focused on long context and front-end UI generation. Try it out in @code β¬οΈ https://t.co/0QSRnzTq5z
chilling "kill all white people." https://t.co/499gfozgJq
π¨πΈπ» BREAKING: ELON AND EL SALVADOR LAUNCH WORLDβS 1ST NATIONWIDE AI TUTORING PROGRAM! Elon and El Salvadorβs President Bukele are shaking up the world with a game-changing move. Elon tweeted, βGrok will be used nationwide by El Salvador for personalized education!β While xAI confirmed, βGrok for Education: xAI is thrilled to announce a partnership with El Salvador and Nayib Bukele to bring personalized Grok tutoring to every public-school student in the country, over 1 million children. The worldβs first nationwide AI tutor program.β While most of Latin America clings to outdated classrooms, chalkboards, overcrowded desks, and teachers stretched thin, El Salvador and Elon are diving into the future with AI. This isnβt just tech flexing; itβs personal. Imagine a kid in San Salvador getting a custom tutor via Grok, learning at their own pace, while neighbors in Guatemala or Honduras still memorize from dusty textbooks. Bukeleβs betting big on this, syncing with Elonβs vision to leapfrog education, potentially lifting 1M kids out of poverty with skills for tomorrowβs jobs. Latin peers like Mexico (stuck in teacher strikes) or Brazil (underfunded schools) look stuck in the past, proving El Salvadorβs bold step into a bright future, while others wallow in gray stagnation. This could spark a regional race, or leave laggards in the dust! Source: @elonmusk, @xai, @nayibbukele
Grok will be used nationwide by El Salvador for personalized education!

GPT-5.2 Thinking evals https://t.co/Kcnz3ZIwye

On GDPval, an eval measuring well-specified knowledge work tasks across 44 occupations, GPT-5.2 Thinking is our first model that performs at a human expert level. These tasks include making presentations, spreadsheets, and other artifacts. https://t.co/vyKSJrYLgG

GPT-5.2 Thinking Raises the bar for professional work: - State-of-the-art long-context reasoning - Major improvements in spreadsheet creation, analysis, and formatting - Early gains in slideshow creation

INSANE!!! A real-life βDavid vs. Goliath.β Do we really want ONE company having this much power to dictate what types of shows we watch??? βNetflix's equity value is nearly $40 billion greater than that of all other major media companies and theatrical exhibitors combined.β https://t.co/vbBtYIju1R
OmniPSD Layered PSD Generation with Diffusion Transformer https://t.co/IQnUyjCsZb
β΄ γγ€γγ£γγγ°ζ΄ζ°γγΎγγγΌγ η΅Άθ³ε注δΈοΌγͺγγ¨γδΊηΎγγζγγ¦γγ γγ£γHisasiε ηγγγ³γ‘γ³γγι γγΎγγγγγγγοΌοΌγγγγγγγγ¨γγγγγΎγβ¦οΌοΌοΌ(Β΄οΌΟοΌ`) https://t.co/KBfiqvxnxt https://t.co/iWr6eYtIMO
