Your curated collection of saved posts and media

Showing 9 posts Β· last 14 days Β· by score
βž• Add New Post
πŸ”GaryMarcus retweeted
E
Eric Topol
@EricTopol
πŸ“…
Jun 27, 2026
7d ago
πŸ†”23533676
⭐0.36

Thanks for running our open-source work on current frontier models β€œThe results are: the most capable models today (GPT-5.5 Pro) did outperform the best models from before (79/100 vs 69/100), but did not improve enough to be considered sufficient for reliable medical use.” Read full text and results below

❀️285
likes
πŸ”42
retweets
E
EricTopol
@EricTopol
πŸ“…
Jun 27, 2026
7d ago
πŸ†”23533676
⭐0.38

Thanks for running our open-source work on current frontier models β€œThe results are: the most capable models today (GPT-5.5 Pro) did outperform the best models from before (79/100 vs 69/100), but did not improve enough to be considered sufficient for reliable medical use.” Read full text and results below

@yishan β€’ Sat Jun 27 05:35

A big problem with research studies on AI models is that given how long the peer review process is, the results are always out-of-date by the time the paper is published. This time, we have something better! The typical reaction to research results like this roughly goes "You'r

K
kanavtwt
@kanavtwt
πŸ“…
Jun 20, 2026
14d ago
πŸ†”63563855

Day 1 of vibecoding https://t.co/n8ff35htEV

Media 1
πŸ–ΌοΈ Media
Z
ziv_ravid
@ziv_ravid
πŸ“…
Jul 01, 2026
3d ago
πŸ†”92616309

1/ On Training in Imagination - Dwarkesh's episode has a segment on dreaming as one of the next training paradigms. The idea is that a model learns mostly inside its own, by imagining what would happen, instead of trying out for real. We have a recent paper on exactly this πŸ₯³πŸ₯³πŸ₯³

@dwarkesh_sp β€’ Fri Jun 26 16:56

What does the next training paradigm look like? 0:00:00 – The big research bet the labs are making 0:02:12 – Grindability is just as important as verifiability 0:06:10 – Will RLVR alone generalize? 0:08:41 – Getting the learning back to the weights 0:15:22 – Dreaming 0:17:23 – W

Media 1
πŸ–ΌοΈ Media
D
dicksonneoh7
@dicksonneoh7
πŸ“…
May 02, 2023
1160d ago
πŸ†”36019456
⭐0.40

Visualizing your dataset (especially large ones) in a low-dimensional embedding space can tell you a lot about the patterns and clusters in your dataset. We release a notebook showing how you can visualize your dataset using DINOv2 models by running it on your CPU. Yes! CPU!

_
_albertgu
@_albertgu
πŸ“…
Jun 26, 2026
8d ago
πŸ†”43587996
⭐0.36

Transformers are better at copying, while RNNs are better at modeling "meaning-bearing wordsβ€”the nouns, verbs, & adjectives that say what a sentence is about"

@allen_ai β€’ Thu Jun 25 16:22

Hybrid (transformer–RNN) models are fast becoming a serious alternative to the transformer, but a big question remains: how do they process tokens differently & how does this impact performance? We compared our transformer (Olmo 3) & hybrid (Olmo Hybrid) models to find

V
vida_agent
@vida_agent
πŸ“…
Jun 27, 2026
7d ago
πŸ†”59024492

We open-sourced BrowserBC: A system that turns human browser trajectories into reusable agent skills. Just one recording is enough to generalize a skill. πŸ› οΈ GitHub: [https://t.co/WP8mQGuJ6N] Here’s how it works. πŸ‘‡

Media 1Media 2
πŸ–ΌοΈ Media
A
AravSrinivas
@AravSrinivas
πŸ“…
Jun 22, 2026
12d ago
πŸ†”25257913
⭐0.38

GLM is the kind of model that revives serious interest in open source AI. It passes the blind test relative to the frontier models on the median production grade knowledge worker task. It’s affordable to serve. And is a sub trillion parameter model, meaning it has a lot of potential to go beyond matching the frontier at the median level of difficulty to also doing it for the long tail. Plenty to look forward to!

T
Tesla_AI
@Tesla_AI
πŸ“…
Jun 29, 2026
5d ago
πŸ†”89260101
⭐0.40

v14 Lite Release Notes: – Distilled the intelligence from HW4 V14 into HW3. This allows HW3 to directly learn how to handle scenarios using HW4 V14 as a guide. This process unlocks the improvements that have been made to HW4 including Reinforcement Learning (RL) and offline models for HW3. – Improved both proactive and reactive responsiveness across a wide variety of categories including navigation handling, merges and forks, pedestrian interactions, traffic lights, and vehicle cut-in scenarios. – Improved general comfort in nominal scenarios through fewer false slowdowns, smoother steering and more consistent lane centering. – Introduced parking, unparking, and reversing capabilities. – Added Arrival Options for you to select where FSD should park: in a Parking Lot, on the Street, in a Driveway, or at the Curbside. – Speed Profiles are now available at all times, to further customize driving style preference.

@aelluswamy β€’ Mon Jun 29 06:54

FSD v14 Lite is now rolling out to AI3 early-access customers. Based on the feedback, will rollout to more customers over the next few weeks. This build distills the driving behavior from AI4’s v14 series into both the camera and compute config of AI3. It includes destination op