Your curated collection of saved posts and media

Showing 32 posts ยท last 14 days ยท by score
O
omarsar0
@omarsar0
๐Ÿ“…
Aug 22, 2025
249d ago
๐Ÿ†”99046283

Why it matters With comparable cost, Avengersโ€‘Pro outperforms GPTโ€‘5โ€‘medium by about 7% average accuracy; with comparable accuracy, it reduces cost by about 27%. Hitting ~90% of GPTโ€‘5โ€‘mediumโ€™s accuracy costs ~63% less. https://t.co/ZIbNOoc9Nx

Media 1
๐Ÿ–ผ๏ธ Media
O
omarsar0
@omarsar0
๐Ÿ“…
Aug 22, 2025
249d ago
๐Ÿ†”21093716

Setup Ensemble of 8 models (GPTโ€‘5โ€‘chat/medium, Claudeโ€‘4.1โ€‘opus/Sonnetโ€‘4, Geminiโ€‘2.5โ€‘pro/flash, Qwen3 235B and thinking). Evaluated on GPQAโ€‘Diamond, HLE, ARCโ€‘AGI, SimpleQA, LiveCodeBench, and ฯ„ยฒโ€‘bench. Pricing via OpenRouter informs perโ€‘cluster cost scoring. Avengers-Pro consistently outperforms or matches the strongest single models while lowering costs. With ฮฑ=1 it achieves the highest accuracy (66.66%, +7% over GPT-5-medium), and with ฮฑ=0.53 it matches GPT-5-mediumโ€™s accuracy at 27% less cost; notably, with ฮฑ=0.39 it sustains 90% of GPT-5-mediumโ€™s performance at 63% lower cost.

Media 1
๐Ÿ–ผ๏ธ Media
O
omarsar0
@omarsar0
๐Ÿ“…
Aug 22, 2025
249d ago
๐Ÿ†”90943884

Knobs that matter ฮฑ tunes performance vs efficiency; accuracy rises fast until ~0.6 while cost stays low until ~0.4 then climbs. Implementation uses kโ€‘means with k=60, Qwen3โ€‘embeddingโ€‘8B (4096โ€‘d) and topโ€‘p=4 nearest clusters at inference. https://t.co/ExTscN7AeU

Media 1
๐Ÿ–ผ๏ธ Media
O
omarsar0
@omarsar0
๐Ÿ“…
Aug 22, 2025
249d ago
๐Ÿ†”28178599

Routing behavior Routing decisions shift as the trade-off parameter ฮฑ increases. At low ฮฑ, Avengers-Pro heavily routes to cheaper Qwen3 and Qwen3-thinking models, but as ฮฑ rises, usage shifts toward GPT-5-medium and, eventually, higher-priced models like Gemini-2.5-pro and Claude-opus-4.1, which excel at complex reasoning. Paper: https://t.co/EQeo6wMGZm

Media 1Media 2
๐Ÿ–ผ๏ธ Media
O
omarsar0
@omarsar0
๐Ÿ“…
Aug 22, 2025
249d ago
๐Ÿ†”51592242

Obsidian has really impressed me, and I like that I can customize it, build my own plugins, and more. Been using it directly with Claude Code as well. I will share the extended breakdown for my academy subs next week: https://t.co/yHVttRKGM7 https://t.co/3OzdOX49dv

Media 1Media 2
๐Ÿ–ผ๏ธ Media
O
omarsar0
@omarsar0
๐Ÿ“…
Aug 22, 2025
249d ago
๐Ÿ†”20441468

How to bring your bookmarks to life with ambient agents. I just built a little Obsidian plugin to take links and summarize them in the background using Claude Code SDK. I've been exploring how to best bring agents to the tools I use every day. https://t.co/VOB4Mznanl

๐Ÿ–ผ๏ธ Media
K
koustuvsinha
@koustuvsinha
๐Ÿ“…
Aug 22, 2025
249d ago
๐Ÿ†”59429527

Its a wrap - the first in-person event for #MLRC2025 successfully concluded yesterday - we witnessed some of the best talks I have ever heard on reproducibility issues in AI, ranging from issues regarding leakage and irreproducibility in ML-based science (@random_walker), determinism issues and data access restrictions (@soumithchintala), evaluations and how to make them robust/reproducible (@BlancheMinerva), & real world challenges of deploying LLM agents in production (@jefrankle). The panel discussion among the speakers and @adinamwilliams led by @sayashk painted a picture of the shortcomings in reproducibility and the road ahead. We heard from best +outstanding paper awardees, and the poster session had a lot of footfall! Outstanding support by @PrincetonAInews for hosting the event, and thanks @AIatMeta and @OpenAI for sponsoring it! I hope our efforts further encourage reproducibility research, which in turn strengthen our understanding of ML science! Stay tuned for the release of talk recordings, and looking forward to the next iteration!

@adinamwilliams โ€ข Thu Aug 21 15:02

Awesome #MLRC2025 talks kicking us off this morning! I'm learning lots @repro_challenge about science with ML and reproducibility for real world applications (@random_walker), and software/firmware and data concerns for reproducibility (@soumithchintala) Slides coming soon! https

Media 1Media 2
+2 more
๐Ÿ–ผ๏ธ Media
A
AlecStapp
@AlecStapp
๐Ÿ“…
Aug 22, 2025
250d ago
๐Ÿ†”67424258

The moral panic over data center water usage is out of control. This is just not that much water: https://t.co/SnyQkBHa1R

@_brianpotter โ€ข Thu Aug 21 14:59

The US uses around 322 billion gallons of water each day. This week on Construction Physics, I look at where it all goes. https://t.co/aORBJPSa0B

Media 1
๐Ÿ–ผ๏ธ Media
๐Ÿ”random_walker retweeted
A
Alec Stapp
@AlecStapp
๐Ÿ“…
Aug 22, 2025
250d ago
๐Ÿ†”67424258

The moral panic over data center water usage is out of control. This is just not that much water: https://t.co/SnyQkBHa1R

Media 1
โค๏ธ3,475
likes
๐Ÿ”344
retweets
๐Ÿ–ผ๏ธ Media
D
DorotheaBaur
@DorotheaBaur
๐Ÿ“…
Aug 22, 2025
249d ago
๐Ÿ†”56395752

"we have the agency to shape [AI] as normal technology. We have the agency to ensure that the path through which it diffuses through society is not governed by the logic of the technology itself but rather by humans and institutions" @random_walker https://t.co/3W9GEd5hZM

Media 1
๐Ÿ–ผ๏ธ Media
L
llama_index
@llama_index
๐Ÿ“…
Aug 21, 2025
250d ago
๐Ÿ†”95237014

Introducing ๐˜ƒ๐—ถ๐—ฏ๐—ฒ-๐—น๐—น๐—ฎ๐—บ๐—ฎ to streamline your LlamaIndex development with context-aware coding agents. A command-line tool that that automatically configures your favorite coding agents with up-to-date context and best practices about LlamaIndex framework, LlamaCloud and workflows Running ๐˜ถ๐˜ท๐˜น ๐˜ท๐˜ช๐˜ฃ๐˜ฆ-๐˜ญ๐˜ญ๐˜ข๐˜ฎ๐˜ข@๐˜ญ๐˜ข๐˜ต๐˜ฆ๐˜ด๐˜ต ๐˜ด๐˜ต๐˜ข๐˜ณ๐˜ต๐˜ฆ๐˜ณ will automatically generates rule files for 16 of your favorite coding agents (including @cursor_ai, @claudeai Code and @github Copilot) to get started with building your awesome LlamaIndex-powered app right away, with all the relevant docs and info readily available to them. ๐Ÿ” Check out the repo: https://t.co/lsPL280fKk ๐Ÿฆ™ โ˜๏ธ Get started with LlamaCloud: https://t.co/zqUqveKp6b

Media 2
+1 more
๐Ÿ–ผ๏ธ Media
L
llama_index
@llama_index
๐Ÿ“…
Aug 22, 2025
249d ago
๐Ÿ†”29781401

Build durable workflows that persist across multiple runs ๐Ÿ‘‡ By default, LlamaIndex workflows are ephemeral - but production applications need persistence. Our new guide shows you three strategies: ๐Ÿ”„ Store data in workflow instances for simple persistence across multiple runs ๐Ÿ’พ Use the Context object's state store for async-safe, serializable workflow state that survives process restarts ๐Ÿšจ Implement external checkpointing to resume exactly where you left off โšก Bonus: inject dependencies directly into workflow steps to reduce boilerplate code Perfect for long-running document processing, multi-step AI agents, or any workflow that can't afford to start over from scratch. Learn how to write durable workflows: https://t.co/JnH9alMCoy

Media 1
๐Ÿ–ผ๏ธ Media
_
_willfalcon
@_willfalcon
๐Ÿ“…
Aug 20, 2025
251d ago
๐Ÿ†”93358631

For about ~4 years or so, we've had a dream to launch a GPU marketplace. This week it finally came true after a ton of hard work from the Lightning team. The vision is simple, enable you to use the best ML platform on any cloud of your choice. In the worst case, simply come get a GPU VM, ssh, and you're done. In the best case, you grow into the full enterprise-grade features we offer to manage teams, budgets, RBAC, observability, monitoring, etc... Please try it out and let me know your feedback!

๐Ÿ–ผ๏ธ Media
R
rasbt
@rasbt
๐Ÿ“…
Aug 23, 2025
248d ago
๐Ÿ†”34138989

I love this upgrade. Lightning AI is my go-to cloud compute platform due to its user-friendliness (great UI, persistent environment, multi-GPU and multi-node support, etc), and now the prices are also really great. An A100 for $1.55/hour through Lambda Labs or an H100 for $2.70 through Voltage Point. What's not to like! Disclaimer: This is NOT a sponsored post (I have never done those!). And I was NOT asked or contacted to say this; it's genuinely my preferred platform. I did work at Lightning AI at one time though.

@_willfalcon โ€ข Wed Aug 20 14:17

For about ~4 years or so, we've had a dream to launch a GPU marketplace. This week it finally came true after a ton of hard work from the Lightning team. The vision is simple, enable you to use the best ML platform on any cloud of your choice. In the worst case, simply come get

Media 1
๐Ÿ–ผ๏ธ Media
R
rasbt
@rasbt
๐Ÿ“…
Aug 23, 2025
248d ago
๐Ÿ†”46568256

@jasonth0 I guess different providers have different margins for different cards. For instance, GCP is cheaper than AWS for some cards, but more expensive for others etc. The trick here is you are not locked into using one of the providers but the one that offers your preferred GPU the cheapest at a given time.

Media 1
๐Ÿ–ผ๏ธ Media
R
rasbt
@rasbt
๐Ÿ“…
Aug 23, 2025
248d ago
๐Ÿ†”94257567

@jasonth0 Like GCP has cheaper L4's but AWS has cheaper T4's https://t.co/NLEHDUhRJG

Media 1
๐Ÿ–ผ๏ธ Media
R
rasbt
@rasbt
๐Ÿ“…
Aug 23, 2025
248d ago
๐Ÿ†”82169897

@rishabh063 Based on the main website (https://t.co/U1SRtfap6h) it seems like this is still supported; probably have to contact them though https://t.co/PCYZUo7UFw

Media 1Media 2
๐Ÿ–ผ๏ธ Media
M
Modular
@Modular
๐Ÿ“…
Aug 22, 2025
249d ago
๐Ÿ†”09461623

Go beyond correct code with our latest GPU puzzle! Puzzle #30 introduces systematic performance analysis to spot bottlenecks, understand memory system behavior, and optimize your code: https://t.co/b3ECUVgA0h

Media 1
๐Ÿ–ผ๏ธ Media
J
jxnlco
@jxnlco
๐Ÿ“…
Aug 22, 2025
249d ago
๐Ÿ†”52396118

if you want to see the full talk: https://t.co/vc6EX3cFuo

Media 1
๐Ÿ–ผ๏ธ Media
J
jxnlco
@jxnlco
๐Ÿ“…
Aug 23, 2025
249d ago
๐Ÿ†”79275734

https://t.co/21WhHBXLtf https://t.co/dqyfdvWWgi

Media 1
๐Ÿ–ผ๏ธ Media
G
GeoffreyHuntley
@GeoffreyHuntley
๐Ÿ“…
Aug 23, 2025
249d ago
๐Ÿ†”87081756

โŽฟ ย Read 20 lines (ctrl+r to expand) โบ Perfect! I now have a clear understanding of the codebase. https://t.co/ZnqdPUro6d

Media 1
๐Ÿ–ผ๏ธ Media
P
pitdesi
@pitdesi
๐Ÿ“…
Aug 23, 2025
248d ago
๐Ÿ†”89614595

a joyful brain glitch convinces me Iโ€™m watching a child ride a chicken skate across the rink https://t.co/2hKQOUolp4

๐Ÿ–ผ๏ธ Media
๐Ÿ”jxnlco retweeted
P
Sheel Mohnot
@pitdesi
๐Ÿ“…
Aug 23, 2025
248d ago
๐Ÿ†”89614595

a joyful brain glitch convinces me Iโ€™m watching a child ride a chicken skate across the rink https://t.co/2hKQOUolp4

โค๏ธ187
likes
๐Ÿ”7
retweets
๐Ÿ–ผ๏ธ Media
A
antonosika
@antonosika
๐Ÿ“…
Aug 23, 2025
248d ago
๐Ÿ†”71323278

I'm in SF and want to meet @swyx and users and get feedback on how the team can improve our product if you want to meet me or other vibe coding creators I'd love to meet you impromptu meetup (+ dinner) tomorrow, very informal, link below https://t.co/MgTq4MqJ8m

Media 1
๐Ÿ–ผ๏ธ Media
V
vig_xyz
@vig_xyz
๐Ÿ“…
Aug 23, 2025
248d ago
๐Ÿ†”46109965

first signup, feels cool. excited for the next couple months https://t.co/E5iLN5UX6n

Media 1
๐Ÿ–ผ๏ธ Media
๐Ÿ”jxnlco retweeted
V
Vignesh Mohankumar
@vig_xyz
๐Ÿ“…
Aug 23, 2025
248d ago
๐Ÿ†”46109965

first signup, feels cool. excited for the next couple months https://t.co/E5iLN5UX6n

Media 1
โค๏ธ7
likes
๐Ÿ”1
retweets
๐Ÿ–ผ๏ธ Media
G
gookedup
@gookedup
๐Ÿ“…
Aug 23, 2025
248d ago
๐Ÿ†”23218442

Life when u stop chud-maxxing https://t.co/CxijrChA80

Media 1Media 2
+2 more
๐Ÿ–ผ๏ธ Media
A
AravSrinivas
@AravSrinivas
๐Ÿ“…
Aug 22, 2025
249d ago
๐Ÿ†”39227746

Perplexity Max Subscribers can now use GPT-5-Thinking model for reasoning mode queries https://t.co/5lQaC8tNNy

Media 1
๐Ÿ–ผ๏ธ Media
A
AravSrinivas
@AravSrinivas
๐Ÿ“…
Aug 23, 2025
249d ago
๐Ÿ†”57999930

Update the Perplexity iOS app. You will be pleasantly surprised. Team has cooked. https://t.co/EawowNSyKg

Media 1
๐Ÿ–ผ๏ธ Media
J
jonathonstaff
@jonathonstaff
๐Ÿ“…
Aug 23, 2025
249d ago
๐Ÿ†”96980431

Earlier today, we launched a new version of our Perplexity iOS app, reimagined from the ground up and built with a lot of love. Swipe from the left access your library, swipe right to access Discover. Lots of intuitive gestures and motion. Can't wait for you to try it. https://t.co/aa72KgStzT

๐Ÿ–ผ๏ธ Media
A
AravSrinivas
@AravSrinivas
๐Ÿ“…
Aug 23, 2025
248d ago
๐Ÿ†”28641130

I have been dictating most of my queries instead of typing when on the iOS app. The new redesigned updated Perplexity iOS app has a chefโ€™s kiss on voice dictation UX. https://t.co/BcDExjoSxb

๐Ÿ–ผ๏ธ Media
A
AravSrinivas
@AravSrinivas
๐Ÿ“…
Aug 24, 2025
248d ago
๐Ÿ†”79977447

Apple-sque design of the library of past threads on the redesigned Perplexity iOS app. https://t.co/LiZhpw1PLc

๐Ÿ–ผ๏ธ Media