Your curated collection of saved posts and media

Showing 32 posts · last 14 days · by score
emollick
@emollick
📅 Dec 11, 2025
114d ago
🆔19887978

Had early access to GPT-5.2. It's an impressive model. Here is GPT 5.2 Pro's version of "create a visually interesting shader that can run in twigl-dot-app make it like an infinite city of neo-gothic towers partially drowned in a stormy ocean with large waves," single shot. https://t.co/ZLeXZ7OIIn

emollick
@emollick
📅 Dec 11, 2025
114d ago
🆔17300796

In the more practical example: "build me a graph of humanity's last exam scores over time" which involved looking up and cross-referencing a lot of material and then generating something useful in one shot: (Ironically does not include GPT-5.2 since scores weren't public) https://t.co/FmeyzzdGHP

emollick
@emollick
📅 Sep 26, 2025
190d ago
🆔44348625

After reading it, this does seem like a big deal. Industry experts outlined important, real-world, hard tasks for AI to do. Other experts were asked to do the tasks themselves & yet others graded human & AI output. Models approached parity with humans & AI is getting better fast. https://t.co/z666YcNyH6

emollick
@emollick
📅 Dec 11, 2025
114d ago
🆔56263359

Whoa. This new GDPval score is a very big deal. Probably the most economically relevant measure of AI ability suggesting that in head-to-head competition with human experts on tasks that require 4-8 hours for a human to do, GPT-5.2 wins 71% of the time as judged by other humans https://t.co/M8NqSUXl6X

@emollick • Fri Sep 26 00:09

After reading it, this does seem like a big deal. Industry experts outlined important, real-world, hard tasks for AI to do. Other experts were asked to do the tasks themselves & yet others graded human & AI output. Models approached parity with humans & AI is getting

chehendriksen
@chehendriksen
📅 Dec 11, 2025
114d ago
🆔81214293

Note that 5.2 Thinking gets a lot of ties to get above the 50% mark, but 5.2 Pro has a 10-percentage-point lead on pure wins vs 5.2 Thinking, even if the total win-and-tie rate ends up being "only" 3.2% higher. Clearly 5.2 Pro delivers more robust economically valuable quality. https://t.co/mRIloK299p

AWSstartups
@AWSstartups
📅 Dec 11, 2025
114d ago
🆔16665130

🙌 A huge thank you from #AWS to the #startups, speakers, experts & innovators for creating an unforgettable #awsreinvent. ✨ We shared insights, discovered new possibilities & left feeling inspired by the brilliant minds shaping the future of technology. https://t.co/aEY6rrjRug https://t.co/72uMrvdFkS

SakanaAILabs
@SakanaAILabs
📅 Dec 11, 2025
114d ago
🆔68583254

On Tuesday, December 23, we are hosting the Sakana AI Finance × AI Study Session! 🐟🐟🐟 It is aimed at engineers and business professionals interested in the transformation of the financial industry as a whole.
• The latest insights in the finance domain and the process of deploying them in society
• The appeal of working alongside world-class researchers and engineers in a global environment
• Development practices for introducing AI into financial systems where safety is required
We hope to share these and other concrete details that you can only hear here! A networking reception is also planned from around 7:30 p.m. On the day, many of the engineers, project managers, and others who work on finance projects at Sakana AI will take part. If you are interested in creating the future of the financial industry together with AI, please feel free to join us. Details here → https://t.co/HsdEXwdC7Z

sama
@sama
📅 Dec 11, 2025
114d ago
🆔12947900

It is a very smart model, and we have come a long way since GPT-5.1: https://t.co/6FJG5FbOQG

afc
@afc
📅 Dec 11, 2025
114d ago
🆔33537918

We at @MeritechCapital are excited to be in business with @getserval. For too long, IT teams have been held back by static, brittle ITSM solutions that prevent them, and their employees, from performing at their best. AI-driven automation was always the promise, but no one delivered... until Serval. The company has grown revenue by 500% since closing its Series A only a few months ago and is used by some of the most forward-thinking IT teams today, including Clay, Together, Verkada, and many others. The company is also led by an exceptional team and CEO, @jakeserval. We're excited to be a small part of the future of ITSM with the Serval team. https://t.co/A62Bt8yuLA

runwayml
@runwayml
📅 Dec 11, 2025
114d ago
🆔58039638

Livestream begins at 12pm ET. Tune in, we'll be sharing in Real-time. https://t.co/eddEUQnvn9

shraddhac92
@shraddhac92
📅 Dec 11, 2025
114d ago
🆔13298908

Part 2 of @runwayml 4.5 out now! With prompts this time. Forgot to enter them in the main post last time.
Prompts used:
- hand held shot of a baby lion waking up next to his mom's face. Focus on the textures.
- First-person view of parkour runner's perspective. Sprint toward cliff edge, explosive takeoff, hands reaching out toward hot air balloon basket approaching fast. Balloon fills frame as jumper flies through air. Ground drops away far below. Adrenaline rush captured in motion. GoPro-style dynamic movement.
- macro shot of sloths claws picking up a blade of grass
- drone shot of bison crossing the nile river. Smooth cinematic motion
#research #creative #researchanddevelopment

GoogleDeepMind
@GoogleDeepMind
📅 Dec 11, 2025
114d ago
🆔39302079

Deep Research is the first agent released on the new Interactions API – offering a single endpoint for agentic workflows. Start building today ↓ https://t.co/FlgtzbDYj7

SawyerMerritt
@SawyerMerritt
📅 Dec 11, 2025
114d ago
🆔42114858

NEWS: Rivian has unveiled its Autonomy chip and Gen 3 Autonomy Computer, which the company says is designed to solve the needs of autonomous driving.
Chip:
• Multi-chip module
• TSMC 5nm
• Neural engine: Rivian designed
• 800+ TOPS
• In-house designed software stack
"RAP1 powers the company's third-generation Autonomy computer, the Autonomy Compute Module 3. Key specs include:
• 1600 sparse INT8 TOPS
• The processing power of 5 billion pixels per second.
• RAP1 features RivLink, a low latency interconnect technology allowing chips to be connected to multiply processing power, making it inherently extensible.
• RAP1 is enabled by an in-house developed AI compiler and platform software."
Rivian: "In addition to ACM3, Rivian plans to integrate LiDAR into future R2 models. LiDAR will augment the company's multi-modal sensor strategy, providing detailed, three-dimensional spatial data and redundant sensing, and improving real-time detection for the edge cases of driving. Our Gen 3 Autonomy hardware including ACM3 and LiDAR is currently undergoing validation and we expect it to ship on R2 models starting at the end of 2026."

Techmeme
@Techmeme
📅 Dec 11, 2025
114d ago
🆔54442604

OpenAI says GPT-5.2 Thinking hallucinates less than GPT-5.1 and has improved reliability for agentic AI needs; pre-release testers include Notion, Box, Shopify (@haydenfield / The Verge) https://t.co/I64lWbO1Om https://t.co/iLX7mjbATR 📥 Send tips! https://t.co/wlNZvXuhJs

Dr_Singularity
@Dr_Singularity
📅 Dec 11, 2025
114d ago
🆔63631630

This seems huge. Scientists at Caltech and Cedars-Sinai have built a new AI tool called NOBLE that can quickly and accurately create virtual versions of brain neurons. Basically, it helps researchers understand how the brain works a lot faster, which could eventually lead to better treatments for brain-related disorders.

@Caltech • Wed Dec 10 19:53

A team of scientists led by Caltech and Cedars-Sinai has developed a new artificial intelligence framework that can accurately, quickly, and efficiently create virtual models of brain neurons. https://t.co/mxRpp2jvTM

🔁 Scobleizer retweeted
OpenAI
@OpenAI
📅 Dec 11, 2025
114d ago
🆔62668275

GPT-5.2 Thinking evals https://t.co/Kcnz3ZIwye

❤️ 471 likes · 🔁 70 retweets
_akhaliq
@_akhaliq
📅 Dec 11, 2025
114d ago
🆔07120776

WonderZoom: Multi-Scale 3D World Generation https://t.co/6VAoAzZ4SO

_akhaliq
@_akhaliq
📅 Dec 11, 2025
114d ago
🆔33507420

discuss: https://t.co/FKh81wKg5f

_akhaliq
@_akhaliq
📅 Dec 11, 2025
114d ago
🆔36101154

Apple presents Learning Unmasking Policies for Diffusion Language Models https://t.co/UGSQIdxxB5

_akhaliq
@_akhaliq
📅 Dec 11, 2025
114d ago
🆔89955182

discuss: https://t.co/kCLhzs6dPO

victormustar
@victormustar
📅 Dec 11, 2025
114d ago
🆔70521827

🤯 MUST TRY: Qwen-Image-i2L skips the training loop entirely. 1-5 images in → LoRA weights out in seconds. ⬇️ Demo available on Hugging Face https://t.co/yuqEZ1GAnw

_akhaliq
@_akhaliq
📅 Dec 11, 2025
114d ago
🆔57809888

Towards a Science of Scaling Agent Systems https://t.co/JmCe3VLTby

_akhaliq
@_akhaliq
📅 Dec 11, 2025
114d ago
🆔69255899

discuss: https://t.co/uJHaaQkg5G

SergioPaniego
@SergioPaniego
📅 Dec 10, 2025
115d ago
🆔85947968

We just released TRL v0.26.0! It comes packed with updates:
> Agent training with tools in GRPO
> New CISPO & SAPO losses
> Reasoning rewards
> vLLM quantization in colocate mode
> Dataset shuffling in SFT
> Lots of NEW examples
> Tons of fixes and documentation improvements
https://t.co/Vt3dmI1sLU

LoubnaBenAllal1
@LoubnaBenAllal1
📅 Dec 11, 2025
114d ago
🆔74888249

Sharing the slides from a talk I gave this week on bridging the gap between research experiments and building production-ready models, based on our recent Smol Training Playbook. https://t.co/RmG53PytMv

essential_ai
@essential_ai
📅 Dec 10, 2025
114d ago
🆔35110383

We are now the #1 trending text-gen <256B size model on HuggingFace!! https://t.co/SyoOHjWfvH

🔁 huggingface retweeted
Essential AI
@essential_ai
📅 Dec 10, 2025
114d ago
🆔35110383

We are now the #1 trending text-gen <256B size model on HuggingFace!! https://t.co/SyoOHjWfvH

❤️ 69 likes · 🔁 9 retweets
OpenBMB
@OpenBMB
📅 Dec 10, 2025
115d ago
🆔77921144

🔥 Ultra-FineWeb-en-v1.4 is coming! 2.2T tokens fully open-sourced! The core training fuel for MiniCPM4 / 4.1, fully updated based on FineWeb v1.4.0:
🆕 What's New
1️⃣ Fresher Data: Added CommonCrawl snapshots from Apr 2024 - Jun 2025 to capture the latest world knowledge.
2️⃣ Easier Access: CC Dump Slices are here! No need to download the entire massive dataset anymore, fetch exactly what you need seamlessly.
⚡ Highlights & Performance
- Efficient Verification Strategy: Reduces data verification cost by 90%
- High-Efficiency Filtering Pipeline: Optimizes selection of both positive and negative samples
- Performance Gains: +3.613/+1.331 (Eng) & +1.98/+0.61 (Chn) vs. FineWeb/FineWeb-edu & Chinese FineWeb-edu-v2.
Still high-quality cleaning. Still true to the open-source spirit. Welcome to download and test! 🚀
🔗 Resources
🤗 Dataset: https://t.co/KluL5t2kUn
📄 Paper: https://t.co/Kg9LLUqZgB
🧩 Classifier: https://t.co/oUfxrN6AmP
🤖 MiniCPM4: https://t.co/IQ82jD1PTi
#UltraFineWeb #MiniCPM4 #AI #LLM #OpenBMB #UltraData

ggerganov
@ggerganov
📅 Dec 10, 2025
114d ago
🆔07435615

> llama-cli -hf org/model

🔁 huggingface retweeted
Georgi Gerganov
@ggerganov
📅 Dec 10, 2025
114d ago
🆔07435615

> llama-cli -hf org/model

❤️ 470 likes · 🔁 56 retweets
victormustar
@victormustar
📅 Dec 10, 2025
115d ago
🆔66967769

llama.cpp gets a new CLI (tested it and it's 🔥) https://t.co/XKVicocKGC
