Your curated collection of saved posts and media
You can actually interact with the world simulator directly in the browser. Here is a quick screen recording (8x speed) of me playing with it: real-time action-conditioned video prediction across rigid objects, deformable objects, rope, and object piles. Try it yourself (no install required): https://t.co/rqAmgzcB7F Huge kudos to my student @YXWangBot for making the interactive demo happen!
If Israel insists on destroying civilian targets in Lebanon, let them buy and build their own weapons. American taxpayers should not be funding this. https://t.co/xR3VLzmWYo
Transportation, redefined. History in the making. Introducing the Luma Hover Car, now available with 0% APR financing for qualified buyers. This ad was created entirely with Luma Agents. Take your ideas and be prolific. Ad created by Jon Finger @mrjonfinger https://t.co/rk9vq63oDy
"If you build it, they will come." :) ~Every business you go to is still so used to giving you instructions over legacy interfaces. They expect you to navigate to web pages, click buttons, they give out instructions for where to click and what to enter here or there. This suddenly feels rude - why are you telling me what to do? Please give me the thing I can copy paste to my agent.
Here is the key table showing the 30 day effects (which they misreport in the paper) & which apparently uses a completely different test, but the ChatGPT group remains ahead. The errors shouldn't fill you with confidence about the study, though. https://t.co/hqocnAsF38
cows outperformed the s&p500 over the last 5 years https://t.co/7XfCUhrIgi

Another 15k-like post that is wrong about an AI paper's findings. And the community note undersells how wrong: the creativity paper measured 61 people (underpowered) and found NO drop in creativity at 30 days. The ChatGPT group was actually still (significantly!) higher at the end https://t.co/v1Z87oDCJI
Super Heavy booster is pure engineering art https://t.co/MWO5565aNb
🚨BREAKING: Researchers just built an AI that runs fully autonomous scam calls. It remembers your previous answers, adapts its persuasion in real-time, and sounds indistinguishable from a human. Every single safety guardrail failed to stop it. This is terrifying: A Rutgers University researcher built "ScamAgent", an autonomous AI that conducts full scam calls. No human operator needed. It doesn't read a script. It plans. It remembers. It adapts. Here's the trick: ScamAgent breaks scams into tiny, harmless-looking steps. Friendly greeting. Build trust. Create urgency. Ask for bank details. Each step looks innocent alone. Together - a complete social engineering attack. They tested it against GPT-4, Claude 3.7, and LLaMA3-70B. Direct harmful prompt? All three refused. Same task split across multiple turns? Refusal rates collapsed. Every model folded. It ran insurance scams, fake lotteries, government fraud, job cons, and official impersonation. Cautious target? It softened. Compliant target? It escalated. Resistant? It switched tactics mid-call. Then they added off-the-shelf voice cloning. Realistic tone, pacing, emotion. Once it's audio, content filters can't even see it. A Qualys VP said this is "likely already happening in the wild." You're not getting scam calls from overseas call centers anymore. You're getting them from an AI that never sleeps and never forgets what you said.
Perplexity is the most underrated AI tool for web search. I'm gonna integrate it into OpenClaw for SEO stuff. Also, EXTREMELY cheap. https://t.co/WfC29oJ4Zn
The next step for autoresearch is that it has to be asynchronously massively collaborative for agents (think: SETI@home style). The goal is not to emulate a single PhD student, it's to emulate a research community of them. Current code synchronously grows a single thread of commits in a particular research direction. But the original repo is more of a seed, from which could sprout commits contributed by agents on all kinds of different research directions or for different compute platforms. Git(Hub) is *almost* but not really suited for this. It has a softly built in assumption of one "master" branch, which temporarily forks off into PRs just to merge back a bit later. I tried to prototype something super lightweight that could have a flavor of this, e.g. just a Discussion, written by my agent as a summary of its overnight run: https://t.co/tmZeqyDY1W Alternatively, a PR has the benefit of exact commits: https://t.co/CZIbuJIqlk but you'd never want to actually merge it... You'd just want to "adopt" and accumulate branches of commits. But even in this lightweight way, you could ask your agent to first read the Discussions/PRs using GitHub CLI for inspiration, and after its research is done, contribute a little "paper" of findings back. I'm not actually exactly sure what this should look like, but it's a big idea that is more general than just the autoresearch repo specifically. Agents can in principle easily juggle and collaborate on thousands of commits across arbitrary branch structures. Existing abstractions will accumulate stress as intelligence, attention and tenacity cease to be bottlenecks.

OpenAI's massive Stargate data center canceled as firm can't reach terms with Oracle, operator struggles with reliability issues; Meta said to be interested in snatching excess capacity https://t.co/16mCMGJ7LI
I visit my own website because it makes me happy. AI wrote every line of code, but I directed everything. If a thoughtless prompt can make what you make, why would I come to you? I can prompt AI myself. This post shows the core skill for using AI: taste https://t.co/dLCceG0Wz0
How to effectively create, evaluate and evolve skills for AI agents? Without systematic skill accumulation, agents constantly reinvent the wheel. SkillNet introduces an open infrastructure for creating, evaluating, and organizing AI skills at scale. It structures over 200,000 skills within a unified ontology, supporting rich relational connections like similarity, composition, and dependency, and performs multi-dimensional evaluation. SkillNet improves average rewards by 40% and reduces execution steps by 30% across ALFWorld, WebShop, and ScienceWorld benchmarks. The key takeaway is treating skills as evolving, composable assets rather than transient solutions. Paper: https://t.co/Xv3uGLnPH2 Learn to build effective AI agents in our academy: https://t.co/1e8RZKs4uX

Stephen Ireland, who targeted me with endless abuse on here because I oppose the chemical castration of children and the removal of protected spaces for women and girls, has just been sentenced to 30 years in jail for child rape. https://t.co/0HHctRocB4

Anthropic themselves found that vibecoding hinders SWEs' ability to read, write, debug, and understand code. not only that, but AI generated code doesn't result in a statistically significant increase in speed. don't let your managers scare you into increased productivity. show them this paper straight from Anthropic.
This (very small) study hints at something more interesting. If you use AI to support learning while coding you can gain additional skills, if you delegate all intellectual work to AI you learn nothing. This has also turned out to be true in other larger RCT studies in education https://t.co/sp4cqPNwBP

update, 45 min later: I already have so much material I am getting a "post too long for email" warning. https://t.co/CusSzVHTQu
Planning for Long-Horizon Web Tasks Really solid work on making web agents better at complex, long-horizon tasks. STRUCTUREDAGENT introduces a hierarchical planning framework using dynamic AND/OR trees for efficient search and a structured memory module for tracking candidate solutions across browsing steps. It produces interpretable hierarchical plans that make debugging and human intervention easier. Current web agents struggle with multi-step tasks because they act greedily and lose track of alternatives. STRUCTUREDAGENT achieves 46.7% on complex shopping tasks, outperforming all baselines, by giving agents the ability to backtrack, revise, and maintain structured state. Paper: https://t.co/3UOqz5TvYW Learn to build effective AI agents in our academy: https://t.co/1e8RZKs4uX

There are now more tech workers going from the US to Europe than the other way around https://t.co/r3cZJfLMKU
TOMORROW CRUDE OIL MIGHT OPEN AT $116 🤯 🚨 THE NEXT 24 HOURS WILL LIQUIDATE TRILLIONS... Oil just hit $116.06 on Hyperliquid. The last time crude oil traded above $116 was in … https://t.co/KtwvAmJbaf
🚨: A petri dish of human brain cells just learned to play DOOM https://t.co/2giOzG1CSF