Your curated collection of saved posts and media

Showing 14 posts ยท last 7 days ยท quality filtered
X
XianghuiXie
@XianghuiXie
๐Ÿ“…
Mar 15, 2026
7h ago
๐Ÿ†”66289235

Do you want a 3D character interacting with an object/pet/another person, following a desired action? Presenting Hoi3DGen: Generating High-Quality Human-Object-Interactions in 3D. Project: https://t.co/EE87KSjQCX Code: https://t.co/ddpLjciTWC https://t.co/QPTyXw45kk

Media 2
๐Ÿ–ผ๏ธ Media
๐Ÿ”Scobleizer retweeted
X
Xianghui Xie
@XianghuiXie
๐Ÿ“…
Mar 15, 2026
7h ago
๐Ÿ†”66289235
โญ0.32

Do you want a 3D character interacting with an object/pet/another person, following a desired action? Presenting Hoi3DGen: Generating High-Quality Human-Object-Interactions in 3D. Project: https://t.co/EE87KSjQCX Code: https://t.co/ddpLjciTWC https://t.co/QPTyXw45kk

โค๏ธ16
likes
๐Ÿ”6
retweets
S
Scobleizer
@Scobleizer
๐Ÿ“…
Mar 16, 2026
1h ago
๐Ÿ†”24269942
โญ0.40

All AI posters at GTC. This is not for human consumption. This video is for AI to watch. Click the grok button and talk to it about what it learned by seeing all the AI posters (highly technical) presented at @NVIDIAGTC tonight. Thanks NVIDIA for the badge and access. https://t.co/mKqIv1f6Dt

S
Scobleizer
@Scobleizer
๐Ÿ“…
Mar 16, 2026
1h ago
๐Ÿ†”32585940
โญ0.36

Wow. Grok watched this video and made a complete list of everything it saw: https://t.co/fqC1fuwhwX Do you have any idea how cool this is? It read every poster.

K
karpathy
@karpathy
๐Ÿ“…
Mar 16, 2026
1h ago
๐Ÿ†”46107835
โญ0.34

@Yulun_Du @ilyasut SGD is a ResNet too (the blocks of it are fwd+bwd), the residual stream is the weights so... ๐Ÿค” We're not taking the Attention is All You Need part literally enough? :D

๐Ÿ”omarsar0 retweeted
O
elvis
@omarsar0
๐Ÿ“…
Mar 15, 2026
13h ago
๐Ÿ†”07999894
โญ0.34

We mostly solved multi-node coordination decades ago in distributed computing. Turns out LLM teams face some of the same coordination problems today. Here is a really good read for anyone designing multi-agent systems. It applies distributed systems theory to LLM teams and finds the same O(nยฒ) communication bottlenecks, straggler delays, and consistency conflicts showing up directly. Decentralized teams wasted more rounds communicating without making progress, but they also recovered faster when individual agents stalled. How does this relate to distributed systems? The work attempts to evaluate LLM teams as distributed systems. It lays out a principled framework instead of trial and error for deciding when teams help, how many agents to use, and what coordination structure fits the task. Designing LLM teams without distributed systems principles is like building a cluster without understanding consensus protocols. Paper: https://t.co/klHzUFJL1R

โค๏ธ119
likes
๐Ÿ”29
retweets
_
_akhaliq
@_akhaliq
๐Ÿ“…
Mar 16, 2026
1h ago
๐Ÿ†”23785123

LookaheadKV Fast and Accurate KV Cache Eviction by Glimpsing into the Future without Generation paper: https://t.co/j8lLnqUARR https://t.co/URKtNQkFKx

Media 1Media 2
๐Ÿ–ผ๏ธ Media
_
_akhaliq
@_akhaliq
๐Ÿ“…
Mar 16, 2026
1h ago
๐Ÿ†”78945671

LMEB Long-horizon Memory Embedding Benchmark paper: https://t.co/fT3sEwCRgd https://t.co/lCyEY9tadB

Media 1Media 2
๐Ÿ–ผ๏ธ Media
_
_akhaliq
@_akhaliq
๐Ÿ“…
Mar 16, 2026
2h ago
๐Ÿ†”38022438

Multimodal OCR Parse Anything from Documents On document parsing benchmarks, it ranks second only to Gemini 3 Pro on our OCR Arena Elo leaderboard, surpasses existing open-source document parsing systems, and sets a new state of the art of 83.9 on olmOCR Bench. On structured graphics parsing, dots.mocr achieves higher reconstruction quality than Gemini 3 Pro across image-to-SVG benchmarks, demonstrating strong performance on charts, UI layouts, scientific figures, and chemical diagrams paper: https://t.co/d3MkBHMuWc

Media 1Media 2
๐Ÿ–ผ๏ธ Media
K
karpathy
@karpathy
๐Ÿ“…
Mar 16, 2026
2h ago
๐Ÿ†”37734847
โญ0.38

@ChristosTzamos Wait this is so awesome!! Both 1) the C compiler to LLM weights and 2) the logarithmic complexity hard-max attention and its potential generalizations. Inspiring!

H
HuggingPapers
@HuggingPapers
๐Ÿ“…
Mar 15, 2026
13h ago
๐Ÿ†”52394270

IBM released NLE: Non-autoregressive LLM-based ASR by Transcript Editing A non-autoregressive approach that formulates speech recognition as conditional transcript editing, achieving 27x speedup over autoregressive baselines with 5.67% WER. https://t.co/LtjPtUxf5a

Media 1
๐Ÿ–ผ๏ธ Media
H
HuggingPapers
@HuggingPapers
๐Ÿ“…
Mar 15, 2026
17h ago
๐Ÿ†”28856067

XSkill: Continual learning from experience and skills A dual-stream framework enabling multimodal agents to accumulate and reuse knowledge without parameter updates. Grounded in visual context, it distills structured workflows and tactical insights to improve reasoning and tool use.

Media 1
๐Ÿ–ผ๏ธ Media
Y
yujifan_0326
@yujifan_0326
๐Ÿ“…
Mar 15, 2026
16h ago
๐Ÿ†”31475423

(1/2) Glad to announce our OpenMAIC! ๐ŸŽ‰ Open-sourcing MAIC (Multi-Agent Interactive Classroom) from Tsinghua University โ€” LLM-driven multi-agent classroom for scalable & adaptive online education. ๐Ÿ—๏ธ Core Architecture: โœ… MAIC-Craft: Read (multimodal extraction) โ†’ Plan (course components + agent generation) โœ… Adaptive Engine: Cognitive student modeling + Token-level personalization (RAG + Bloom's/ZPD/UDL) โœ… Multi-Agent Classroom: 1 Student + N Agents (Teacher, Assistant, 4 Peer Archetypes) โœ… Manager Agent: Class state receptor for turn-taking orchestration ๐Ÿ”— Give it a try ๐Ÿ‘‰๐Ÿป GitHub: https://t.co/yicGCKsF1E #AI #EdTech #MultiAgent #LLM #Research #OpenSource #Tsinghua

Media 2
๐Ÿ–ผ๏ธ Media
O
omar_or_ahmed
@omar_or_ahmed
๐Ÿ“…
Mar 15, 2026
12h ago
๐Ÿ†”26958308
โญ0.38

Foundation models are losing in healthcare. Vertical models are winning. Here's why: GPT-4 can pass the USMLE. It can't write a discharge summary that Epic EHR accepts. That gap is worth billions.