Your curated collection of saved posts and media
CITP + @PrincetonSPIADC invite the public to a Congressional briefing on Monday 4/20 in Washington, DC. Experts include Arvind Narayanan @random_walker, Liat Krawczyk of NJ AI Hub, & Harry Holzer of @Georgetown. Registration requested: https://t.co/PRSS8HHQVC @PrincetonSPIA https://t.co/rLymYtUMre
itβs happening https://t.co/Y0fVNA3zGI
JUST IN: Use of AI in the office is reportedly creating a flood of βworkslopβ that takes longer to fix than do from scratch.
itβs happening https://t.co/Y0fVNA3zGI
Stanford just tested whether LexisNexis and Thomson Reutersβ AI legal research tools are really βhallucination-free,β as they claim. Spoiler: not even close. Hereβs what the study found. https://t.co/lb2CekFeWn
Introducing TIPS v2 πFoundational text-image encoder πΈCan be used as the base for different multimodal applications π€Apache 2.0 π§βπ³New pre-training recipes https://t.co/A6H93YJhNx

@Graham_dePenros Love. It's at https://t.co/kiuZ7QXLzb
Was able to get a slick native swift desktop app v1.0 up and running for Hermes agent today (credit to redsparklabs) Can I get a few people to alpha test it with me? Works great for me so far! π DM me! @Teknium @NousResearch Check out this beauty! https://t.co/voNAvaUHIT
This dataset was crafted with a fine-tuned @NousResearch Hermes 4.3 36B model run on a RTX 6000 Blackwell Server Edition. (We simply love @NousResearch but this, by no means, does it signify a partnership) PMI-relevant results: 60.6% TruthfulQA (Delta: +11.7% vs Qwen3.5-4B) & 71.5% HellaSwag on a 4B fine tuned model.
So excited to share that Google DeepMind is joining Station F in Paris!π₯ Over the years, I've had the great opportunity to collaborate with @roxannevarza and team. I'm now so excited to officially announce a partnership to collaborate with the French startup ecosystem. https://t.co/hERwqbKo7t
Just shipped **artifact-preview** for Hermes π₯ Like Claude Artifacts, build dashboards, games, UIs, get a full interactive preview that instantly opens in a live browser. Real clickable code, smooth refreshes on prompt edits. cc @Teknium https://t.co/7S9N1Nn9mX
You can now bring your Cloudflare Sandbox to use with the @OpenAI AgentsSDK Click "Deploy to Cloudflare" enter in your keys and you're good to go! -https://t.co/yFv5M4Xl1f
Build long-running agents with more control over agent execution. New capabilities in the Agents SDK: β’ Run agents in controlled sandboxes β’ Inspect and customize the open-source harness β’ Control when memories are created and where theyβre stored https://t.co/zPyuLup6b6
@techno0ptimist Depends on how I fudge the data, looks like ~half the caf content by the way I usually measure it (although the spot is tighter than usual, which is inflating the 'after' prediction a bit) https://t.co/Bk6JwCtHmM
Why did Scoble sell out? I'm starting to get a lot more sponsors who are willing to pay me to introduce their companies to you. You'll see more later today. I just wanted to say a few words about this. First: thank you. Last year my wife was laid off, so budgets are a lot tighter than previously. She has a new contracting job, but not making as much as she was. Second, I'm investing in new projects: https://t.co/8L5xphk0qQ is a big one. It costs hundreds per day to run and I can't afford to do that without sponsors. It reads 40,000 posts a day (runs three times a day) and builds a new kind of way to read the AI community here on X (I developed it because I can't keep up with 40,000 posts a day). Third, I have three employees now. @IrenaCronin helps me with our newsletter, which is thematic on AI issues and technologies coming: https://t.co/HHwYy7NoAl and @samlevin is managing the business side of my life. He's working with a hyper smart 22 year old who is automating the business side of my life (I can't keep up anymore with all the DMs and emails while traveling around the San Francisco Bay Area to develop new content. Fourth, I continue to pour hours every day into developing my lists here on X, which are the most complete of Tech Industry. Now that AI is coming to let you build personalized news services they are getting more and more important: https://t.co/9eRY65x3IQ I've never been paid for the thousands of hours it took to develop them, but many are using them on their @OpenClaw or @NousResearch Hermes agentic systems to build personalized news services out of them. I try every sponsor's product and turn down those that I don't like, which happens frequently. But taking sponsorship has changed me and what I'm doing here. I try not to, but it does. First of all, just having someone paying you money to consider them forces me to put a lot more effort into trying their product than I might otherwise give. That alone changes me. How that changes my relationship with you? I'm taking this all a lot more seriously, truth be told, as I try to continue building media businesses that cover innovation and, especially, the AI world. Please let me know if I get it wrong. And Typeless is a great example of this. It's a great product. Way better than Apple's own keyboard in many ways. I use it every day to talk with you and with my agents. Funny enough, I manually typed this whole post since I find sometimes it changes my writing to be a little too clean and have a little bit of an AI voice rather than my own. That said, if you try it out please use this link so they can track how many people come from my posts here: https://t.co/5G0XxaTTjL Greatly appreciate all of you, and will try to get the mix right. And on posts that are paid I'll always use the "paid partnership" marker that I used on both of these posts so you can know which ones are things I'm compensated for writing. Thanks for helping put food on three people's tables too. In today's world that is getting tougher and tougher, I know.
How do you learn to trust AI? When it works even in a noisy environment. This is @typelessdotcom. Faster than typing. And you donβt need to turn down the music to use it. https://t.co/f2Oh0awxNb

Our paper on Subliminal Learning was just published in Nature! Last July we released our preprint. It showed that LLMs can transmit traits (e.g. liking owls) through data that is unrelated to that trait (numbers that appear meaningless). Whatβs new?π§΅ https://t.co/Iiv9sgjJki
Research we co-authored on subliminal learningβhow LLMs can pass on traits like preferences or misalignment through hidden signals in dataβwas published today in @Nature. Read the paper: https://t.co/b1BYwcW9dH
Our paper on Subliminal Learning was just published in Nature! Last July we released our preprint. It showed that LLMs can transmit traits (e.g. liking owls) through data that is unrelated to that trait (numbers that appear meaningless). Whatβs new?π§΅ https://t.co/Iiv9sgjJki
Surprisingly, people see AI for what it is for the most part: product with some very fantastic but also relatively limited utility being oversold from the top down as a project to drain humanity from art, meaning from existence, and the bank accounts of millions of people. https://t.co/1MeLkcCMPr
Excited about the Agents SDK updates we just launched. Check out my cookbook on using it with sandboxes for code migration: https://t.co/Fz7cknz64d
To show off what you can do with @OpenAI Agent SDK + @modal, we built an ML research agent (inspired by @karpathy). It can: - Spin up GPU sandboxes of any shape - Run a pool of subagents - Persist memory - Snapshot state for fork/resume Here it is playing Parameter Golf: https://t.co/r7QhvNmdEq
Agents need computers. And they need a lot of them. Modal is an official sandbox provider for the @OpenAI Agents SDK. https://t.co/Lu4cesspYq
OpenAI x E2B: build your agents with the new OpenAI Agents SDK, powered by E2B sandboxes. We're excited to support OpenAI as a launch partner! The new @OpenAI Agents SDK will now get dedicated sandboxes - perfect for persistent, long-running agents. With E2B, you'll get a custom environment with resource isolation and security boundaries, with no infrastructure setup required. Your agents will be able to: - Edit files and run shell commands in isolated environments - Maintain temporary workspace state across steps - Produce artifacts you can review before publishing - Run multiple sandboxes in parallel for concurrent workloads - Generate frontend output with live preview URLs ... and more, with a few lines of code! Learn more and see the end-to-end example in the thread:
Another example, in this case with computer use: https://t.co/gtMkZUPQP9
Another example, in this case with computer use: https://t.co/gtMkZUPQP9
OpenAI x e2b: Build your agents with the new OpenAI Agents SDK, powered by @E2B Sandboxes. Excited to support @OpenAI as a launch partner! https://t.co/RsSw1HsF86
Hammer down https://t.co/Pu5UpUCRrm

@techno0ptimist OK so that totally works!* Significantly less caffeine after filtering through activated charcoal. Taste: less bitter but also less 'complex'. mild, not very good (but then neither was the reference, instant decaf with ~1250mg/L caffeine added). Color: grey π https://t.co/8jrwq9D6DH

hermes agent is becoming the general agent and the poll i dropped this morning confirmed it in public. as community admin i have been seeing this for weeks. someone posts a coding setup, someone posts an automation, someone posts an article they wrote with it. same tool, different category, every day. i left all of the above off the poll on purpose. the top reply under it is tek himself asking where that option is. then the community saying the same under him. the founder and his room answered the poll i did not let them answer, in public. hermes agent is not a coding agent. not a research agent. not an automation agent. it is the general agent. one tool running every category of work a builder does in a day. tek built one thing that does all of it. if you are still slotting hermes agent into one category you are using the wrong model. every week it gets closer to being the only one you need.
what do you mainly use hermes agent for?
When ChatGPT first launched, there was an enormous gender gap, with our anonymized data showing roughly 80% having typically male first names. That gap is now gone. https://t.co/kWQjCImyri
Agents need computers. And they need a lot of them. Modal is an official sandbox provider for the @OpenAI Agents SDK. https://t.co/Lu4cesspYq
Build long-running agents with more control over agent execution. New capabilities in the Agents SDK: β’ Run agents in controlled sandboxes β’ Inspect and customize the open-source harness β’ Control when memories are created and where theyβre stored https://t.co/zPyuLup6b6
Weβve been exploring what a Stream SDK could look likeβwhere agents & voice are always within reach @sandbar https://t.co/aVEdMpXuRy
AI is driving more open source contributions than ever. But as a maintainer, how do you filter the noise to find the people who actually want mentorship? Enter the 3 Cs framework. Start mentoring with intention (and without the burnout). https://t.co/csQioRMpBr
The Hermes Agent running on my NVIDIA DGX Spark has generated over $10,000 in partnership deals for BridgeMind. I now have a second DGX Spark arriving this weekend. Pairing them together for more compute. The goal is to run GLM 5.1 locally. A Hermes Agent running on a $5,000 machine just paid for its own hardware upgrade. We are living in insane times.
@theerealkdc @albustime Yes, their servers have been overloaded by hermes, see: the pink is mimo lol https://t.co/F19NO5wsSS
Introducing Shipsafe x Hermes Agents. Built on the powerful capabilities developed by @NousResearch, @Teknium1, You can now configure, deploy, and orchestrate AI agent teams to find vulnerabilities before attackers do. Watch the video to see our AI Phantom Team run a full security assessment. π