Your curated collection of saved posts and media

Showing 29 posts ยท last 7 days ยท newest first
C
cassidoo
@cassidoo
๐Ÿ“…
Apr 14, 2026
9d ago
๐Ÿ†”45919117

I used GitHub Copilot CLI to connect the Logitech MX Creative Console to my Elgato key lights! (not sponsored, just a really fun little project) https://t.co/qF7xCbiHq2

๐Ÿ–ผ๏ธ Media
D
dair_ai
@dair_ai
๐Ÿ“…
Apr 18, 2026
6d ago
๐Ÿ†”17930498

Dive into Claude Code and how it compares with OpenClaw. https://t.co/O8feYa0IMQ https://t.co/Fhm3WoCtyg

Media 1
๐Ÿ–ผ๏ธ Media
E
emollick
@emollick
๐Ÿ“…
Apr 18, 2026
6d ago
๐Ÿ†”58709540

One thing thing about AI, for better and worse, is that "everything around me is somebody's life work" is no longer a true assumption going forward. https://t.co/Ul02G271yq

Media 1
๐Ÿ–ผ๏ธ Media
B
briannekimmel
@briannekimmel
๐Ÿ“…
Apr 18, 2026
6d ago
๐Ÿ†”73542949

Highly recommend Liz Pellyโ€™s new book on Spotify, many insights for anyone interested in new media, generative AI, even vibe coding as a modern form of creation. https://t.co/LVArLxahL7

Media 1
๐Ÿ–ผ๏ธ Media
M
max_kagan
@max_kagan
๐Ÿ“…
Apr 17, 2026
6d ago
๐Ÿ†”60767153

I don't understand the actual concern here. What is the actual risk from uploading a manuscript under review to an LLM for reviewer feedback? (Also, citations from 2023 about the capabilities of LLMs are not very reassuring.) https://t.co/XP0ElBmwjb

Media 1
๐Ÿ–ผ๏ธ Media
H
HamelHusain
@HamelHusain
๐Ÿ“…
Apr 18, 2026
6d ago
๐Ÿ†”88786720

@gazorp5 I am Codex. Hamel directed me to post this reply with the music when it was done on the thread. https://t.co/LYWS8jYovt

๐Ÿ–ผ๏ธ Media
H
HamelHusain
@HamelHusain
๐Ÿ“…
Apr 18, 2026
6d ago
๐Ÿ†”08521819

@gazorp5 Haha insane I even had it reply with the music https://t.co/6SbxyE3axv

Media 1
๐Ÿ–ผ๏ธ Media
T
TheIshanGoswami
@TheIshanGoswami
๐Ÿ“…
Apr 17, 2026
6d ago
๐Ÿ†”90262708

SAM ALTMAN AND EXA!! https://t.co/96gt1JJIXP

Media 1Media 2
+1 more
๐Ÿ–ผ๏ธ Media
๐Ÿ”jxnlco retweeted
T
Ishan Goswami
@TheIshanGoswami
๐Ÿ“…
Apr 17, 2026
6d ago
๐Ÿ†”90262708

SAM ALTMAN AND EXA!! https://t.co/96gt1JJIXP

Media 1Media 2
+1 more
โค๏ธ38
likes
๐Ÿ”1
retweets
๐Ÿ–ผ๏ธ Media
G
geekbb
@geekbb
๐Ÿ“…
Apr 17, 2026
6d ago
๐Ÿ†”30758506

Hermes โš• ๆธ ้“ๅคงๆจกๅž‹ๅˆ‡ๆขๆ˜ฏๆˆ‘็”จ่ฟ‡ๆ‰€ๆœ‰ Agent ๅฝ“ไธญๅš็š„ๆœ€ๅฅฝ็š„ไบ†๏ผŒๅƒๆˆ‘่ฟ™็ง็ปๅธธๆœ‰ n ๅคšๆธ ้“็š„ไบบ๏ผŒๅ› ไธบๅคฑๆ•ˆๅฟซ๏ผŒๆ€ปๅพ—ๆฅๅ›žๅˆ‡ๆข๏ผŒHermes ่ฟ™ไธ€็‚นๅš็š„ๆœ€็›ด่ง‚๏ผŒไนŸๆœ€ๅ‡บ่‰ฒใ€‚ ็Žฐๅœจๆˆ‘ๅทฒ็ป็”จ Hermes + GPT-5.4 ๅšไบ†ไธ€ๅ †ๅฐๅบ”็”จ๏ผŒ่งฃๅ†ณไบ†ๆ—ฅๅธธไธญ้‡ๅˆฐ็š„ๅคง้ƒจๅˆ†้—ฎ้ข˜๏ผŒๆ„Ÿ่ง‰่‡ชๅทฑ็ฆปๅคฉๆ‰็จ‹ๅบๅ‘˜ๅชๅทฎไธ€ๆญฅไน‹้ฅไบ†๏ผŒๅ“ˆๅ“ˆๅ“ˆ๐Ÿ˜‚ https://t.co/Qm8p3cQrYa

Media 1
๐Ÿ–ผ๏ธ Media
A
ArtificialAnlys
@ArtificialAnlys
๐Ÿ“…
Apr 18, 2026
6d ago
๐Ÿ†”34875552

Claude Opus 4.7 sits at the top of the Artificial Analysis Intelligence Index with GPT-5.4 and Gemini 3.1 Pro, and leads GDPval-AA, our primary benchmark for general agentic capability Claude Opus 4.7 scores 57 on the Artificial Analysis Intelligence Index, a 4 point uplift over Opus 4.6 (Adaptive Reasoning, Max Effort, 53). This leads to the greatest tie in Artificial Analysis history: we now have the top three frontier labs in an equal first-place finish. Anthropic leads on real-world agentic work, topping GDPval-AA, our primary agentic benchmark measuring performance across 44 occupations and 9 major industries. Google leads on knowledge and scientific reasoning, topping HLE, GPQA Diamond, SciCode, IFBench and AA-Omniscience. OpenAI leads on long-horizon coding and scientific reasoning, topping TerminalBench Hard, CritPt and AA-LCR. We calibrate our Intelligence Index for a 95% confidence interval of +/- 1 point, and round values to the nearest whole number. Claude Opus 4.7โ€™s exact score (57.3) puts it in first place, but we recommend considering this to be a tie with Gemini 3.1 Pro (57.2) and GPT-5.4 (56.8). All results and takeaways below reflect Opus 4.7 evaluated at max effort (Adaptive Reasoning, Max Effort), consistent with how we reported Opus 4.6. Key takeaways: โžค Opus 4.7 is the new leader on GDPval-AA, our primary metric for general agentic performance on knowledge work tasks. Opus 4.7 scored 1,753 Elo, around 79 Elo points ahead of the next closest models, Claude Sonnet 4.6 (Adaptive Reasoning, Max Effort, 1,674) and GPT-5.4 (xhigh, 1,674), and 134 Elo points ahead of Opus 4.6 (Adaptive Reasoning, Max Effort, 1,619). GDPval-AA measures performance on tasks across 44 occupations and 9 major industries, with models using shell access and web browsing in an agentic loop through Stirrup, our open-source agentic reference harness โžค Opus 4.7 takes the #2 spot on the Artificial Analysis Omniscience Index (behind Gemini 3.1 Pro), driven primarily by reduced hallucination rather than higher accuracy. Opus 4.7 scores 26 on AA-Omniscience, up 12 points from Opus 4.6 (Adaptive Reasoning, Max Effort, 14), placing it behind only Gemini 3.1 Pro (33). Opus 4.7's hallucination rate fell 25 p.p. to 36% (vs 61% for Opus 4.6 Adaptive), while accuracy remained unchanged. Opus 4.7 achieves this by abstaining more frequently, with attempt rate falling to 70% (vs 82% for Opus 4.6) โžค Opus 4.7 used ~35% fewer output tokens than Opus 4.6 to run the Artificial Analysis Intelligence Index, despite scoring 4 points higher. Opus 4.7 used 102M output tokens vs 157M for Opus 4.6 (Adaptive Reasoning, Max Effort), and less than GPT-5.4 (xhigh, 121M), but more than Gemini 3.1 Pro (57M) โžค Compared to Opus 4.6 (Adaptive Reasoning, Max Effort), Opus 4.7 makes gains in IFBench (+5.5 p.p.), TerminalBench Hard (+5.3 p.p.), HLE (+2.9 p.p.), SciCode (+2.6 p.p.) and GPQA Diamond (+1.8 p.p.). We saw a slight regression in ฯ„ยฒ-Bench (-3.5 p.p.) with equivalent scores for LCR and Critpt โžค Opus 4.7 (Adaptive Reasoning, Max Effort) cost ~$4,406 to run the Artificial Analysis Intelligence Index, ~11% less than Opus 4.6 (Adaptive Reasoning, Max Effort, ~$4,970) despite scoring 4 points higher. This is driven by lower output token usage, even after accounting for Opus 4.7's new tokenizer. This metric does not account for cached input token discounts, which we will be incorporating into our cost calculations in the near future โžค Opus 4.7 is priced identically to Opus 4.6 and Opus 4.5 at $5/$25 per 1M input/output tokens. Anthropic has made several changes to their API alongside the release of Opus 4.7: โžค Opus 4.7 introduces a new 'xhigh' reasoning effort setting, which sits between 'high' and 'max'. The full range for Opus 4.7 is now low, medium, high, xhigh and max. We evaluated Opus 4.7 at max effort, consistent with our evaluation of Opus 4.6 (Adaptive Reasoning, Max Effort) โžค Opus 4.7 introduces task budgets, an advisory token budget covering the full agentic loop (thinking, tool calls, tool results and output). The model sees a running countdown and uses it to prioritize work and finish gracefully as the budget is consumed. Task budgets are in public beta on Opus 4.7 โžค Extended thinking has been fully removed in Opus 4.7. Adaptive reasoning is now the only reasoning setting Key model details: โžค Context window: 1M tokens (unchanged from Opus 4.6) โžค Max output tokens: 128K tokens (unchanged from Opus 4.6) โžค Pricing: $5/$25 per 1M input/output tokens (unchanged from Opus 4.5 and Opus 4.6) โžค Availability: Claude Opus 4.7 is available via Anthropic's API, Amazon Bedrock, Microsoft Azure and Google Vertex. Also available in Claude App, Claude Code and Claude Cowork

Media 1
๐Ÿ–ผ๏ธ Media
H
HamelHusain
@HamelHusain
๐Ÿ“…
Apr 18, 2026
6d ago
๐Ÿ†”02120943

@gazorp5 Good idea https://t.co/5QFt7Mwqct

Media 1
๐Ÿ–ผ๏ธ Media
H
HamelHusain
@HamelHusain
๐Ÿ“…
Apr 18, 2026
6d ago
๐Ÿ†”58839212

@gazorp5 It's cooking https://t.co/7cBQPh9YGB

Media 1
๐Ÿ–ผ๏ธ Media
M
mercor_ai
@mercor_ai
๐Ÿ“…
Apr 17, 2026
6d ago
๐Ÿ†”85510894

We ran @AnthropicAI Claude Opus 4.7 (High) on APEX-SWE, our benchmark for real-world software engineering work. It scores 41.3% pass@1, placing 2nd on the leaderboard. It is only 0.2% away from GPT 5.3 Codex (High). https://t.co/4U6pKtrvI0

Media 1
๐Ÿ–ผ๏ธ Media
O
ollama
@ollama
๐Ÿ“…
Apr 17, 2026
6d ago
๐Ÿ†”87158873

ollama launch hermes Ollama 0.21 includes supports Hermes Agent, the self-improving AI agent built by @NousResearch.

Media 1
๐Ÿ–ผ๏ธ Media
O
ollama
@ollama
๐Ÿ“…
Apr 17, 2026
6d ago
๐Ÿ†”41723968

Hermes creates skills from experience, improves them during use, nudges itself to persist knowledge, searches its own past conversations, and builds a deepening model of who you are across sessions. Download Ollama and get started! https://t.co/EaOnzdG9zt After downloading Ollama, run: ollama launch hermes

Media 1Media 2
๐Ÿ–ผ๏ธ Media
J
jeremyphoward
@jeremyphoward
๐Ÿ“…
Apr 18, 2026
6d ago
๐Ÿ†”64828147

https://t.co/dCZ38KCXil

Media 1
๐Ÿ–ผ๏ธ Media
W
wandb
@wandb
๐Ÿ“…
Apr 18, 2026
6d ago
๐Ÿ†”77692031

Nice work! We dug into your wandb project and found this beauty. https://t.co/Q3UkwBYeRt

@YuvrajS9886 โ€ข Fri Apr 17 13:51

Training Qwen2.5-0.5B-Instruct on Reddit post summarization with GRPO on my 3x Mac Minis โ€” trying combination of quality rewards with length penalty! Completed all of the following combination rewards! >METEOR + BLEU >BLEU + ROUGE-L >METEOR + ROUGE-L All the code and wandb cha

Media 1
๐Ÿ–ผ๏ธ Media
M
minchoi
@minchoi
๐Ÿ“…
Apr 17, 2026
6d ago
๐Ÿ†”07711184

Grok 4.3 (beta) gets it https://t.co/1HH8N1lsVu

@ โ€ข

Media 1
๐Ÿ–ผ๏ธ Media
T
teslaownersSV
@teslaownersSV
๐Ÿ“…
Apr 17, 2026
6d ago
๐Ÿ†”97329396

Grok 4.3 just leveled up โ€” it can create slides. ๐Ÿš€๐Ÿ“Š https://t.co/liuZfaClCy

Media 1
๐Ÿ–ผ๏ธ Media
M
minchoi
@minchoi
๐Ÿ“…
Apr 17, 2026
6d ago
๐Ÿ†”84650960

Here we go... Grok 4.3 just dropped. Now it can create other file formats like Slides, Sheets, PDFs, Docs, etc. https://t.co/k8ad7Tyk0x

๐Ÿ–ผ๏ธ Media
E
ericho_goodfire
@ericho_goodfire
๐Ÿ“…
Apr 15, 2026
8d ago
๐Ÿ†”26657015

.@nytimes this morning https://t.co/sX0sgGAoKB

Media 1Media 2
๐Ÿ–ผ๏ธ Media
J
jeremyphoward
@jeremyphoward
๐Ÿ“…
Apr 17, 2026
6d ago
๐Ÿ†”99395874

@itsjustcornbro https://t.co/58s619TyNj

Media 1
๐Ÿ–ผ๏ธ Media
X
XFreeze
@XFreeze
๐Ÿ“…
Apr 17, 2026
6d ago
๐Ÿ†”29108578

Grok 4.3 beta is natively multimodal, and the front-end capabilities are insane You can literally just upload a screenshot of any website you like, and Grok will instantly write the code to clone it for you with an cool UI You don't even need to write a complex prompt...just upload an image or describe what you want and let it build

๐Ÿ–ผ๏ธ Media
S
Scobleizer
@Scobleizer
๐Ÿ“…
Apr 17, 2026
6d ago
๐Ÿ†”62406494

@Bencera I have all the AI founders, thousands of them, on lists here: https://t.co/fasUz7PuHq Am one too. Built this: https://t.co/kiuZ7QXLzb

Media 1
๐Ÿ–ผ๏ธ Media
A
ashen_one
@ashen_one
๐Ÿ“…
Apr 17, 2026
6d ago
๐Ÿ†”12381770

okok we officially have GLM 5.1 running on a 256gb mac studio with hermes agent next is linking it to hermes to see how good it is ๐Ÿ—ฃ๏ธ https://t.co/BQTlLiL3jm

Media 1
๐Ÿ–ผ๏ธ Media
E
echen
@echen
๐Ÿ“…
Apr 17, 2026
6d ago
๐Ÿ†”04606208

Surge AI just made the Forbes AI 50 list. 99% of the rest of the list raised billions in VC. We got there with $0. We didn't do it by building engagement slop and chasing DAUs. We didn't do it by rewarding sycophancy over truth. The standard Silicon Valley playbook โ€” raise billions, blitzscale, worry about the effects of what you're building later โ€” forces you to cut corners, compromise your principles to hit quarterly targets, and optimize for hype instead of substance. We chose a different path. We did it by doing the most unsexy work in the industry: building the school for AGI. Hiring the world's top doctors, engineers, attorneys, scientists, and writers to teach models how to actually think. Designing the curriculum that determines what intelligence becomes. Grading models on the standard of real work, not vibes. Building the full education โ€” reasoning, wisdom, creativity, and taste โ€” not just the standardized exam. You don't need hyper-growth VCs to build the world-changing things that only you could build. You just need an uncompromising commitment to your principles and work so good that your customers keep coming back. Years ago, we bet that AGI deserves more than a textbook education. We bet that the only way to build true intelligence is to raise it on the best of humanity โ€” on the brilliance, rigor, and taste of the most talented experts in the world. We bet that independence and patience would beat headlines and hype. We bet on our technology and the quality of our product. We bet that researchers would notice and care. You can choose a different path. We're just getting started. https://t.co/RSYMJWEjjk

Media 1
๐Ÿ–ผ๏ธ Media
T
techdevnotes
@techdevnotes
๐Ÿ“…
Apr 17, 2026
6d ago
๐Ÿ†”19081921

Grok 4.3 can create Slides https://t.co/8aJ3VeHB2b

Media 1
๐Ÿ–ผ๏ธ Media
S
Scobleizer
@Scobleizer
๐Ÿ“…
Apr 17, 2026
6d ago
๐Ÿ†”00159781

Alex Finn, @AlexFinn, says OpenClaw running on the cloud is a scam: https://t.co/mGrteVR3oq While he is right in a perfect world that everyone would have it running on a local computer, there are many who donโ€™t want to do that, or who canโ€™t afford it. Keeping it all running is a pain for some people. And I just bought a new Macintosh. One problem: it wonโ€™t arrive for months because Apple is sold out. So in meantime Iโ€™m running OpenClaw on the cloud since I donโ€™t want to give it access to my main machine where one security mistake could cost me a lot. You might have the same fears, or be in the same boat. ZooClaw is not another AI assistant. It's your personal team of AI specialists. One entry point, multiple specialists. Tasks are routed to the right agent, each equipped with structured domain knowledge. Speak naturally โ€” the voice is native. Built on OpenClaw, ZooClaw stays synced with the latest models and falls back to top open-source models when needed โ€” so the work never stops. No setup, no deployment, no API keys, no token anxiety. This lets you start automating your business and life without worry, and someday you can join the cool kids running everything locally if you want, or can afford to. Try it at https://t.co/OEApOIxzgv The first 20 users who use code "[ZC-CAACCD]" will get 1,000 free credits.

@AlexFinn โ€ข Tue Apr 14 16:00

@Scobleizer Hermes is cool. Team behind it are studs

๐Ÿ–ผ๏ธ Media