Your curated collection of saved posts and media
Mistral Small 4 is out https://t.co/IdAowSpHpN
32x efficiency improvement in just the last 3 months, that's the crazy jump from GPT-5.2 to GPT-5.4! 37 cents/task is close to the human-level efficiency target of 24 cents/task. This was inconceivable a year ago, when o3 cost $4,500/task on ARC-AGI-1: a 12,000x improvement!
GPT-5.4 (High) has now cleared 90% on this benchmark at a cost of just $0.37/task. So that's a 32x efficiency improvement in the last three months, or 12,000x since December 2024.
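The headline multipliers can be sanity-checked with simple arithmetic, using only the per-task costs quoted in the posts above:

```python
# Cost-per-task figures quoted in the posts above (ARC-AGI).
o3_cost = 4500.00     # o3 on ARC-AGI-1, December 2024 ($/task)
gpt54_cost = 0.37     # GPT-5.4 (High) today ($/task)
human_target = 0.24   # stated human-level efficiency target ($/task)

overall = o3_cost / gpt54_cost
print(f"Since Dec 2024: {overall:,.0f}x cheaper")                  # ~12,162x, i.e. roughly 12,000x
print(f"Vs human target: {gpt54_cost / human_target:.2f}x above")  # ~1.54x, so not quite there yet
```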
Banger report from the Kimi team: Attention Residuals Residual connections made deep Transformers trainable. But they also force uncontrolled hidden-state growth with depth. This work proposes a cleaner alternative. It introduces Attention Residuals, which replace fixed residual accumulation with softmax attention over previous layer outputs. Instead of blindly summing everything, each layer selectively retrieves the earlier representations it actually needs. To keep this practical at scale, they add a blockwise version that compresses layers into block summaries, recovering most of the gains with minimal systems overhead. Why does it matter? Residual paths have barely changed across modern LLMs, even though they govern how information moves through depth. This paper shows that making the mixing content-dependent improves scaling laws, matches a baseline trained with 1.25x more compute, boosts GPQA-Diamond by +7.5 and HumanEval by +3.1, while keeping inference overhead under 2%. Paper: https://t.co/04IG6FDiVr Learn to build effective AI agents in our academy: https://t.co/1e8RZKs4uX
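A minimal numpy sketch of the idea as the post describes it: instead of adding a fixed sum of all earlier layer outputs, each layer attends over the stack of previous outputs with softmax and retrieves a content-dependent mixture. The function name, projection matrices `Wq`/`Wk`, and shapes are assumptions, not the paper's actual implementation:

```python
import numpy as np

def attention_residual(h, history, Wq, Wk):
    """Sketch of a content-dependent residual: mix earlier layer
    outputs via softmax attention over depth instead of summing them.
    h: (batch, seq, d); history: list of earlier outputs, each (batch, seq, d)."""
    hs = np.stack(history, axis=2)             # (batch, seq, depth, d)
    q = h @ Wq                                 # query from the current layer
    k = hs @ Wk                                # keys from earlier layers
    scores = np.einsum('bsd,bstd->bst', q, k) / np.sqrt(h.shape[-1])
    w = np.exp(scores - scores.max(-1, keepdims=True))
    w /= w.sum(-1, keepdims=True)              # softmax over depth
    mixed = np.einsum('bst,bstd->bsd', w, hs)  # selective retrieval of past layers
    return h + mixed
```

The blockwise variant in the paper would attend over compressed block summaries rather than every individual layer, which is what keeps the systems overhead under 2%.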
OmniForcing unlocks real-time joint audio-visual generation. Achieves ~25 FPS with 0.7s latency, a 35x speedup over offline diffusion models, by distilling bidirectional LTX-2 into a causal streaming generator while maintaining multi-modal fidelity. https://t.co/UGYGMyTQOs
Subagents are now available in Codex. You can accelerate your workflow by spinning up specialized agents to: • Keep your main context window clean • Tackle different parts of a task in parallel • Steer individual agents as work unfolds https://t.co/QJC2ZYtYcA
It's been about 20 years since I first started working on embeddings with Yann LeCun (siamese networks!), and I've been fascinated ever since. Gemini Embeddings 2 approaches the platonic ideal: native embedding of text, image, video, audio, and docs to a single space.
https://t.co/mIXzM657cR
@Nvidiadev MONDAY @ Booth #338: 2PM: Shaping the Future w/ @matthew_d_white • 3PM: TensorRT + PyTorch w/ Angela Yi & @narendasan • 4PM: DeepSpeed Trillion-Param Training w/ @PKUWZP • 5PM: PyTorch Export w/ Angela Yi • 6PM: Ray Distributed Computing w/ @robertnishihara #AI #GTC2025
Your AI agent can now generate videos. PixVerse CLI ships today: JSON output, 6 deterministic exit codes, full PixVerse v5.6, Sora2 and Veo 3.1, Nano Banana access from the terminal. Same account. Same credits. No new signup. -> Follow + Reply + RT = 300 Creds (72H ONLY)
this was one of the things i co-led at FAIR. back then, fb had ~2b users; embeddings of ~128d made it a 300b-1T parameter model depending on how you count entities (e.g. ad campaigns). at the time this was big, now it's medium. we trained it purely on distributed cpus
@RaiaHadsell Universal embeddings FTW. One of the flagship projects at FAIR was to "embed the world" (i.e. represent every entity on Facebook). The name was soon changed to "Filament", deployed internally, and eventually open-sourced as "PyTorch-BigGraph". The techniques were m
Covo Audio: an end-to-end audio language model from @TencentAI_News https://t.co/tic5cH1A39 • 7B • Audio → Audio in one model • Multi-speaker + voice transfer • Real-time full duplex conversations https://t.co/hFrsxQgzkT

Want to parse complex PDFs with SOTA accuracy, 100% locally? At just 0.9B parameters, you can drop GLM-OCR straight into LM Studio and run it on almost any machine! • 0.9B total parameters • Runs on < 1.5GB VRAM (or ~1GB quantized!) • Zero API costs • Total data privacy. Desktop document AI is officially here.
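Once a vision model is loaded, LM Studio serves it through an OpenAI-compatible local endpoint (default http://localhost:1234/v1), so OCR is one POST away. A stdlib-only sketch; the model identifier "glm-ocr" and the prompt text are assumptions, so match them to whatever model you actually have loaded:

```python
import base64
import json
import urllib.request

def build_ocr_payload(image_bytes: bytes, model: str = "glm-ocr") -> dict:
    """Build an OpenAI-style chat request with an inline base64 page image."""
    b64 = base64.b64encode(image_bytes).decode()
    return {
        "model": model,
        "messages": [{
            "role": "user",
            "content": [
                {"type": "text", "text": "Extract all text on this page as Markdown."},
                {"type": "image_url",
                 "image_url": {"url": f"data:image/png;base64,{b64}"}},
            ],
        }],
    }

def ocr_page(image_path: str, base_url: str = "http://localhost:1234/v1") -> str:
    """Send one page image to the local LM Studio server and return the text."""
    with open(image_path, "rb") as f:
        payload = build_ocr_payload(f.read())
    req = urllib.request.Request(
        f"{base_url}/chat/completions",
        data=json.dumps(payload).encode(),
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(req) as resp:
        return json.load(resp)["choices"][0]["message"]["content"]
```

For multi-page PDFs you would rasterize each page to an image first (e.g. with a PDF renderer of your choice) and call `ocr_page` per page.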
Yann LeCun is pumping out papers recently. "Temporal Straightening for Latent Planning": this paper shows that by straightening latent trajectories in a world model, Euclidean distance starts to reflect true reachable progress, so it's closer to geodesic/minimum-step distance. This makes gradient-based planning far more stable and effective without relying as heavily on expensive search.
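One plausible reading of "straightening" (an illustrative guess, not the paper's actual loss) is a curvature penalty: discourage direction changes between consecutive latent steps, so that trajectories become near-linear and Euclidean distance tracks step count:

```python
import numpy as np

def straightness_penalty(z):
    """Penalize direction change between consecutive latent steps.
    z: (T, d) trajectory of latents z_0..z_{T-1}. Returns 0 for a
    perfectly straight trajectory, up to 2 for full reversals."""
    v = np.diff(z, axis=0)                                # step vectors v_t = z_{t+1} - z_t
    v = v / (np.linalg.norm(v, axis=-1, keepdims=True) + 1e-8)
    cos = (v[:-1] * v[1:]).sum(-1)                        # cosine between successive steps
    return float((1.0 - cos).mean())
```

On straightened trajectories, a gradient-based planner can descend on plain `||z_goal - z_t||` because that distance approximates the number of reachable steps remaining.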
codex app automations: slack pending replies
Review Slack for the current user and update today's daily summary note in /Users/jasonliu/vault at agent/daily-summary-YYYY-MM-DD.md with a single section titled ## Pending Slack Replies. Use Slack search and thread reads across public channels, private channels, DMs, and group DMs to find conversations where the current user is mentioned, directly addressed, or has already participated, and where the latest substantive message is from someone else and the current user has not replied. Focus on recent activity, prioritizing today and the last 36 hours. Read candidate threads before including them. Exclude resolved threads, FYIs that do not need a response, and anything the user already answered later. Rewrite the ## Pending Slack Replies section on each run instead of appending duplicates. For each pending item include: who is waiting, channel or DM name, last message time in America/Los_Angeles, a one-line summary of the ask or blocker, and a short snippet. If a stable Slack link is available, include it. If nothing is pending, keep the section and write - None right now. Keep the rest of the note unchanged.
7 emerging memory architectures for AI agents: • Agentic Memory (AgeMem) • Memex • MemRL • UMA (Unified Memory Agent) • Pancake • Conditional memory • Multi-Agent Memory from a Computer Architecture Perspective https://t.co/5X5LxirSEx https://t.co/5Hi0Gn3aA4

Microsoft has released a free, open-source course: GitHub Copilot CLI for Beginners. Includes 8 chapters covering: • Walkthrough of installing Copilot CLI • Using context • Creating custom agents • Working with skills • Connecting MCP servers, and more. Start learning: https://t.co/IIbauw5L7K
Nvidia ruled the first wave of AI by powering the training of large models. But the next phase may look different. Inference, running AI at scale, is now growing much faster than training, and that's where real-world deployment happens. If the center of gravity in AI shifts there, the question becomes: will Nvidia stay as dominant in the next chapter? https://t.co/MdG0zqBUWj @RWhelanWSJ @WSJ