Your curated collection of saved posts and media
Photography is deeply personal. Itโs about people, relationships, and moments that matter โ shaping how we remember the world and the people around us. So excited to finally share what weโve been working on - a personalized photo generation and editing model designed for your own photography. The idea is simple: your photos should still feel like you - preserving your identity, your expression, your story, while giving you the freedom to shape the image around it, or reimagine it. What excites me most isnโt just the ability to generate something entirely new, but the chance to fix the small, real things โ the missed moment, the person who wasnโt there, the expression that didnโt quite land โ and finally arrive at the photo you had in your head all along.

Thanks @_akhaliq sharing our work! ๐We are exciting to introduce VideoCUA ๐ทA Large-scale video dataset to advance human-level computer-use agent. - Total 55 hours & 6 million frames - 10,000 human-demonstrated tasks - 87 desktop apps, recorded at 30fps ๐คFully open-source at huggingface https://t.co/u0mmoTw5zD
CUA-Suite Massive Human-annotated Video Demonstrations for Computer-Use Agents paper: https://t.co/hi1WnffQSY https://t.co/ACjYOPcDzH
Btw you may need to force refresh the page due to browser caching (Cmd + Shift + R or Ctrl+ Shift + R). And yes, you can now also sort by date, size, and name: https://t.co/pow8khbpr9
NEW: Elonโs attorneys are calling on Clinton-appointed Judge Charles Breyer to investigate the jury in Musk's recent Twitter acquisition case after jurors were mocking Musk in court. RIGGED https://t.co/4qxyTm9PSx
"In a sane world, what happens is the leadership of the United States sits down with the leadership in China and leadership around the world to work together so that we don't go over the edge and create a technology which could perhaps destroy humanity. " โ Bernie Sanders https://t.co/mkJOJyBpKD
After @Pinterest @Airbnb @NotionHQ @cursor_ai, today itโs @eoghan @intercom publicly sharing that theyโre finding it better, cheaper, faster to use and train open models themselves rather than use APIs for many tasks. And hundreds of other companies are doing the same without sharing. Ultimately, I believe the majority of AI workflows will be in-house based on open-source (vs API). It took much more time than we anticipated but itโs happening now!
LagerNVS Latent Geometry for Fully Neural Real-time Novel View Synthesis paper: https://t.co/d8Boz9XIGR https://t.co/KWZQdHYL4L
When we first released Trackio, we put together a barebones UI just to get it out. We always knew we'd come back and redesign the UI. Now that Trackio is downloaded 1M times every wk, we figured it was time to release a beautiful UI. Out now in Trackio 0.20, try it out!
Qworld Question-Specific Evaluation Criteria for LLMs paper: https://t.co/UJvFr6xdpD https://t.co/n7RfxIIyB9
CUA-Suite Massive Human-annotated Video Demonstrations for Computer-Use Agents paper: https://t.co/hi1WnffQSY https://t.co/ACjYOPcDzH
Frankly Iโm shocked that not many more American startups and big tech have noticed the massive opportunity and gap in the market for American open-source AI https://t.co/93mzlvoD53
Launched Gemini 3.1 Flash Live. Itโs capable of handling the nuances of live speech, like tone and interruptions, that are critical for real-world interactions. You can experience it on Gemini Live and Search Live! https://t.co/ZUWbEL3HIJ
I had a fun time chatting last week w/ @BoWang87, a leader in biomedical AI research We discussed his awesome work at Xaira Therapeutics (their latest X-CELL model), how AI can impact drug discovery + drug development, the importance of scaling in the biomedical domain, & more! https://t.co/KwMBzJYvzc
The input box was never the interface. It was the limitation. You donโt walk into an art fair and type prompts. You look. You feel. You understand. A new behavior was born: You look โ โShow me how YOU see this.โ Introducing: Chance Visual Agent โ Live Official AI Partner of Art Central โ the 100,000-visitor international art fair packed with complex, subjective real-world scenarios. The first experiment where AI stops answeringโฆ and starts seeing the world with us. โ https://t.co/ZpWIVBW7Bw What are you looking at first? ๐ #VisualAgent #ChanceAI @Chance_vision
AI multiplying startup capabilities by 10x to 20x? Quim Allard, CEO and founder of Iqana, explains how agentic workflows compress months of research into minutesโsmall teams scaling globally to serve the world's biggest banks and asset managers. https://t.co/jdnUJ6KEjA
This image model is now publicly available ๐ @PhotaLabs is insanely good at generating AI images that actually look like you (and your pets). And it can also edit or enhance real photos to fix flaws! I've made hundreds of photos on Phota. A few things to try ๐ https://t.co/nP9uBAPU3h
Truly blown away by a new AI image model launching this week โจ Finally, you can generate photos that actually look like you! It's so much better than everything I've tried - from LoRAs to NB Pro. Onboarding some early testers. DM or comment if you want access ๐ https://t.co/x

Yo this is the best! Thanks @_akhaliq and the team. I built my /daily-research-paper to recap all of the paper with /schedule from @claudeai. I got to catch up all of the core ideas in the field every morning. Cooooolllll! https://t.co/6dJz5bMsS7
HF Papers is the biggest infra for AI agents to do retrieval over arxiv introducing ๐๐ ๐๐๐๐๐๐ cli so that autoresearch can do semantic search & markdown retrieval of papers ๐๐ ๐๐๐๐๐๐ [๐๐๐๐๐๐, ๐๐๐๐] https://t.co/cTWM4GO9E6
Added lots of improvements to the LLM Architecture Gallery in the last 2 weeks. Imho the coolest one yet: A diff tool many of you were asking for! https://t.co/NO7z6XSRHS https://t.co/5ZCIg15ml6
Gemini 3.1 Flash Live is here! ๐ฅ Our new live audio model for building voice AI experiences. Now with better instruction following, better understanding of tone and interruptions, lower latency. Now available in the API https://t.co/v7vZhpnGUm
we're excited to have @Calclavia speaking at @aiDotEngineer singapore! henry is the founder of @SmitheryDotAI - which connects your agents to thousands of tools & skills previously, he was the co-founder of @jenni, scaling it to $7M+ ARR and 300K+ active users @swyx @ivanleomk @agrimsingh @aimuggle @unprofeshme
The Leftโs response was also disgusting https://t.co/bChpysVSDk
Introducing Cline Kanban: A standalone app for CLI-agnostic multi-agent orchestration. Claude and Codex compatible. npm i -g cline Tasks run in worktrees, click to review diffs, & link cards together to create dependency chains that complete large amounts of work autonomously. https://t.co/4HjvwSu4Mo
Today, we introduce Phota Studio and Phota API, powered by our photography model that brings flagship image model capabilities, personalized to you. With personalization, an image model stops being just playful and starts becoming useful for photography. With Phota Studio, you can: - Reimagine composition, lighting, or posture while still looking like yourself - Create editorial, stylized, and studio-quality portraits of yourself, or bring someone you love into the frame - Revive the blurred shot, bring in the person who missed the group photo, fix the awkward expression - all without losing what made the moment worth keeping With Phota API, you can finally build photo experiences where real people are the core. Marketing assets, editorial shoots, wedding photography: workflows that needed identity fidelity that GenAI couldn't deliver. Until now. Ultimately, we want to make compelling photographs accessible to everyone. Phota API and Phota Studio start to make that possible: empowering people to explore, imagine, and create without losing themselves in the image. With Phota Studio and Phota API, developers can build new photo experiences, while photographers and creators can explore a new kind of AI-native editing and generation. The next photo experience starts here!
OpenAI's latest repo has Claude as the third top contributor ๐ญ๐ https://t.co/LZy8QKMe5H
I was shocked to learn this is true https://t.co/lapWCLOpQZ
OpenAI's latest repo has Claude as the third top contributor ๐ญ๐ https://t.co/LZy8QKMe5H
AI inside Unity is getting real. ๐ Join us on Open Source Friday with Andy Tsen to dive into Unity MCP. We'll cover: ๐ค How AI agents talk to Unity ๐ฎ What "context" actually means in a game engine โ๏ธ How to start building AI-assisted workflows Set a reminder for tomorrow's stream ๐ https://t.co/ByFvpdTaUJ
One more step towards a world without language barriers. Real-time translations just became even easier in the Google Translate app!ย Now you can get live translations with headphones on iOS (itโs already available on Android!).ย ๐ง Also available in more markets and over 70 languages! ๐
every arxiv paper on https://t.co/mlJE5Cr4AP https://t.co/IlTclXMqz7

Amazing. https://t.co/LEN8O2hA2v
"Top Secret!" (1984) is super silly and funny. This is a classic action comedy parody film by the trio Zucker-Abrahams-Zucker (the team behind Airplane! and The Naked Gun). Val Kilmer plays the lead role โ this is also his film debut. https://t.co/DlzQUO7bzc