Your curated collection of saved posts and media

Showing 24 posts Β· last 30 days Β· by score
πŸ”ylecun retweeted
S
Stephanie Daigle
@sumiedaigle
πŸ“…
Feb 28, 2026
15d ago
πŸ†”67332706

@newrepublic @Prof_Sugon_Deez https://t.co/KTMPjgMkp1

Media 1
❀️85
likes
πŸ”8
retweets
πŸ–ΌοΈ Media
R
RpsAgainstTrump
@RpsAgainstTrump
πŸ“…
Mar 01, 2026
14d ago
πŸ†”17872173

JD Vance, October 2024: β€œOut interests, I think, very much, is a not going to war in Iran." β€œKamala Harris kinda likes war…They seem to be sleepwalking us into a war with Iran." https://t.co/ptmhOuJJhW

πŸ–ΌοΈ Media
πŸ”ylecun retweeted
K
Klara
@klara_sjo
πŸ“…
Feb 28, 2026
14d ago
πŸ†”65913281

This is the AI that will be taking our jobs https://t.co/nycRqJimm6

❀️3,779
likes
πŸ”605
retweets
πŸ–ΌοΈ Media
L
luke_metro
@luke_metro
πŸ“…
Feb 16, 2026
26d ago
πŸ†”69397732

sneak peek of Anthropic's 2026 Super Bowl ad https://t.co/6v5ZFiHYgO

Media 1
πŸ–ΌοΈ Media
πŸ”jeremyphoward retweeted
L
Luke Metro
@luke_metro
πŸ“…
Feb 16, 2026
26d ago
πŸ†”69397732

sneak peek of Anthropic's 2026 Super Bowl ad https://t.co/6v5ZFiHYgO

Media 1
❀️4,217
likes
πŸ”188
retweets
πŸ–ΌοΈ Media
I
InfiniAILab
@InfiniAILab
πŸ“…
Feb 18, 2026
25d ago
πŸ†”93728105

Video generation models are improving fastβ€”real-time autoregressive models now deliver high quality at low latency, and they’re quickly being adopted for world models and robotics applications. So what’s the problem? They’re still too slow on consumer hardware. πŸš€ What if we told you that we can get true real-time 16 FPS video generation on a single RTX 5090? (1.5-12x over FA 2/3/4 on 5090, H100, B200) Today we release MonarchRT πŸ¦‹, an efficient video attention that parameterizes attention maps as (tiled) Monarch matrices and delivers real E2E gains. πŸ“„ Paper: https://t.co/d1AAMIseow 🌐 Website: https://t.co/41mqriKekx πŸ”— GitHub: https://t.co/hp5iJttviA 🧡1/n

Media 2
πŸ–ΌοΈ Media
M
MayankMish98
@MayankMish98
πŸ“…
Feb 25, 2026
17d ago
πŸ†”22259079

We identified an issue with the Mamba-2 🐍 initialization in HuggingFace and FlashLinearAttention repository (dt_bias being incorrectly initialized). This bug is related to 2 main issues: 1. init being incorrect (torch.ones) if Mamba-2 layers are used in isolation without the Mamba2ForCausalLM model class (this has been already fixed: https://t.co/oahfxjIsKb). 2. Skipping initialization due to meta device init for DTensors with FSDP-2 (https://t.co/hLC8nnQFc3 will fix this issue upon merging). The difference is substantial. Mamba-2 seems to be quite sensitive to the initialization. Check out our experiments at the 7B MoE scale: https://t.co/n8iuUICRux Special thanks to @kevinyli_, @bharatrunwal2, @HanGuo97, @tri_dao and @_albertgu πŸ™ Also thanks to @SonglinYang4 for quickly helping in merging the PR.

Media 1Media 2
+1 more
πŸ–ΌοΈ Media
B
BlancheMinerva
@BlancheMinerva
πŸ“…
Feb 21, 2026
21d ago
πŸ†”25075757

@Kaivalya_in @milindmghosh Cohere: https://t.co/eQGWmi0eM6 Sarashina: https://t.co/plQG6qPAqT but it looks like the first in Japan was actually Stockmark100B which beat it by a few months: https://t.co/q3sGmwogg1

Media 1Media 2
+1 more
πŸ–ΌοΈ Media
B
BlancheMinerva
@BlancheMinerva
πŸ“…
Feb 21, 2026
21d ago
πŸ†”20011196

@paws4puzzles @milindmghosh Firstly, DeepSeek and Qwen have both released multiple much larger and more powerful models with MIT and Apache 2.0 licenses. Secondly, Falcon 180B doesn't have an Apache 2.0 license: https://t.co/lYDKI8ox1f

Media 1
πŸ–ΌοΈ Media
B
BlancheMinerva
@BlancheMinerva
πŸ“…
Feb 21, 2026
21d ago
πŸ†”74909085

@milindmghosh Update: the Stockmark 100B model by Stockmark is actually the first 100B model from Japan, coming out in May as opposed to Sarashina2 in November. This doesn't change the order becasue Cohere Command R+ came out in April. https://t.co/q3sGmwogg1

Media 1
πŸ–ΌοΈ Media
N
NatalieShapira
@NatalieShapira
πŸ“…
Feb 23, 2026
19d ago
πŸ†”57712396

In this amazing multidisciplinary collaboration, we report our early experience with the @openclaw -> https://t.co/THXYyajfQB

Media 1
πŸ–ΌοΈ Media
H
HeidyKhlaaf
@HeidyKhlaaf
πŸ“…
Feb 26, 2026
17d ago
πŸ†”85098665

Some real cognitive dissonance happening with takes saying "but Anthropic HAD to drop their safety measures, they're the good guys you see!" Anyway from our paper last year: https://t.co/d0yyWfx0fe

Media 1
πŸ–ΌοΈ Media
T
TheMidasProj
@TheMidasProj
πŸ“…
Feb 25, 2026
17d ago
πŸ†”36872300

A new filing just dropped in the Musk v. Altman case, and it may be the most brazen and cynical document OpenAI has produced yet. It's a motion to exclude the testimony of Stuart Russell, but their attacks blatantly contradict things @OpenAI itself has said for years. 🧡 https://t.co/WSPSpNiYqV

Media 1
πŸ–ΌοΈ Media
G
ggerganov
@ggerganov
πŸ“…
Jan 29, 2026
45d ago
πŸ†”93417540

@UnslothAI Btw, I have some anecdotal evidence that disabling thinking for GLM-4.7-Flash improves performance for agentic coding stuff. Haven't evaluated in detail yet (only opencode) as it takes time, but would be interest to know if you give it a try and share your observations. To disable thinking with llama.cpp add this to the llama-server command: --chat-template-kwargs "{\"enable_thinking\": false}" Here is my config for reference:

Media 1
πŸ–ΌοΈ Media
G
ggerganov
@ggerganov
πŸ“…
Jan 29, 2026
45d ago
πŸ†”44057045

Introducing LlamaBarn β€” a tiny macOS menu bar app for running local LLMs Open source, built on llama.cpp https://t.co/F1Z3DVl9Kg

Media 1
πŸ–ΌοΈ Media
G
ggerganov
@ggerganov
πŸ“…
Feb 20, 2026
23d ago
πŸ†”23203520

I am deeply thankful to the Hugging Face team for this opportunity. With their support I will be able to continue my work on the projects and I feel optimistic about the great stuff that we are going to create with the community! https://t.co/Gl95jBPhly

Media 1
πŸ–ΌοΈ Media
L
lmstudio
@lmstudio
πŸ“…
Feb 25, 2026
17d ago
πŸ†”47663779

Introducing LM Link ✨ Connect to remote instances of LM Studio, securely. πŸ” End-to-end encrypted πŸ“‘ Load models locally, use them on the go πŸ–₯️ Use local devices, LLM rigs, or cloud VMs Launching in partnership with @Tailscale Try it now: https://t.co/Vl2vr6HlF5

Media 1
πŸ–ΌοΈ Media
A
allen_ai
@allen_ai
πŸ“…
Jan 27, 2026
47d ago
πŸ†”89006865

Introducing Ai2 Open Coding Agentsβ€”starting with SERA, our first-ever coding models. Fast, accessible agents (8B–32B) that adapt to any repo, including private codebases. Train a powerful specialized agent for as little as ~$400, & it works with Claude Code out of the box. 🧡 https://t.co/dor94O62B9

Media 1
πŸ–ΌοΈ Media
T
Tim_Dettmers
@Tim_Dettmers
πŸ“…
Jan 27, 2026
46d ago
πŸ†”08711522

SERA was driven by a classic research pattern similar to QLoRA: if you are resource contraint, build efficiency first, then do the actual research. The most surprising thing: verifying coding data correctness is not helpful and adds overhead to synthetic data generation. https://t.co/O6dMEqY6fF

Media 1
πŸ–ΌοΈ Media
T
Tim_Dettmers
@Tim_Dettmers
πŸ“…
Jan 27, 2026
46d ago
πŸ†”82548999

This is very impactful: you can now distill frontier performance into small models that are specialized to private repositories. Companies can quickly and cheaply train on their data and have super-efficient deployments of 32B agents. https://t.co/03jsS6cWJ3

Media 1
πŸ–ΌοΈ Media
Y
YiqingXieNLP
@YiqingXieNLP
πŸ“…
Feb 23, 2026
19d ago
πŸ†”96614263

Training on issue-solving only does NOT guarantee transfer to other tasks. 🎨Introducing Hybrid-Gym - synthetic training tasks for generalization (https://t.co/IrqQszPEYm) +25.4% on SWE-Bench / +7.9% on SWT-Bench / +5.1% on Commit-0 with NO issue-solving / test-gen/... training https://t.co/U9xc0yNYv4

Media 1
πŸ–ΌοΈ Media
L
LaudeInstitute
@LaudeInstitute
πŸ“…
Feb 26, 2026
16d ago
πŸ†”48207176

Introducing Slingshots // TWO: Research that ships. 14 projects, six institutions – let’s meet the batch 🧡 https://t.co/g3LTeewbqC

Media 1
πŸ–ΌοΈ Media
S
stacygriggs
@stacygriggs
πŸ“…
Feb 03, 2026
39d ago
πŸ†”01519887

@ElToroDotCom just launched Keeping Up with the Joneses Social Index Score It’s a proprietary score + dataset that’s 4Γ— more predictive of a new-car purchase than traditional demo or intent data. Learn more πŸ‘‰ https://t.co/BOtJcoszIX Interested? Let’s talk at #NADA2026

Media 1
πŸ–ΌοΈ Media
S
stacygriggs
@stacygriggs
πŸ“…
Feb 03, 2026
39d ago
πŸ†”80526223

#NewProfilePic https://t.co/Iw58K7i1Gv

Media 1
πŸ–ΌοΈ Media