Your curated collection of saved posts and media

Showing 32 posts Β· last 14 days Β· by score
E
Ethan Mollick
@emollick
πŸ“…
Nov 29, 2024
522d ago
πŸ†”74539636
⭐1.00

Technically , this seems true but the wrinkle is that I can get discussions of novel questions at the quality level of a top PhD student in organizational theory And research shows you can get answers at the level of professors in strategic management. Not many in training data. https://t.co/zsxMwJUE2H

@karpathy β€’

People have too inflated sense of what it means to "ask an AI" about something. The AI are language models trained basically by imitation on data from human labelers. Instead of the mysticism of "asking an AI", think of it more as "asking the average data labeler" on the internet

Media 1
❀️283
likes
πŸ”37
retweets
πŸ–ΌοΈ Media
W
William J. Brady
@william__brady
πŸ“…
Nov 28, 2024
523d ago
πŸ†”48479498

New paper out in @ScienceMagazine! In 8 studies (multiple platforms, methods, time periods) we find: misinformation evokes more outrage than trustworthy news, when it does it's shared more + ppl are less likely to read before sharing. w/ @killianmcl1 @Klonick @mollycrockett πŸ§΅πŸ‘‡ https://t.co/5QyDSlbYaC

Media 1
❀️416
likes
πŸ”208
retweets
πŸ–ΌοΈ Media
E
Ethan Mollick
@emollick
πŸ“…
Nov 29, 2022
1254d ago
πŸ†”37953537

I posted about how image-generating AI has gotten exponentially better in the last month. Well, a new text model was released for GPT-3 today. AI can now write rhyming poems. And acrostics. And limericks. And explain how a candy-powered FTL drive can help me escape from otters. https://t.co/vBAroN2SUv

Media 1Media 2
+2 more
❀️1,055
likes
πŸ”213
retweets
πŸ–ΌοΈ Media
V
vishal
@vishal_learner
πŸ“…
Nov 29, 2024
522d ago
πŸ†”53538955
⭐0.86

I eval'd full text search, single-vector cos sim, ColBERTv2, and answerai-colbert-small-v1 on my fastbook-benchmark for 3 differently preprocessed datasets, and chunk sizes from 100-2000 tokens. Overall full text search won. For small chunks, answerai-colbert-small-v1 won. (1/2) https://t.co/KqT7mCvNCJ

Media 1Media 2
+2 more
❀️104
likes
πŸ”22
retweets
πŸ–ΌοΈ Media
E
Ethan Mollick
@emollick
πŸ“…
Jun 01, 2023
1070d ago
πŸ†”43383552
⭐0.95

On one hand, Twitter isn't real life. On the other, influencers on Twitter really did accelerate one of America's largest recent bank runs. (Banks with similar underlying characteristics that were not discussed on Twitter took much less damage) https://t.co/bvcoMemlMx https://t.co/vkvUeX3okP

Media 1
❀️358
likes
πŸ”76
retweets
πŸ–ΌοΈ Media
G
Gabriel PeyrΓ©
@gabrielpeyre
πŸ“…
Nov 30, 2024
521d ago
πŸ†”31659743

I wrote a summary of the main ingredients of the neat proof by Hugo Lavenant that diffusion models do not generally define optimal transport. https://t.co/Fuv7hClFR5 https://t.co/x1pRWxf7UR

Media 1
❀️1,084
likes
πŸ”151
retweets
πŸ–ΌοΈ Media
E
Ethan Mollick
@emollick
πŸ“…
Nov 29, 2024
522d ago
πŸ†”52216967

As context windows grow larger and AI β€œintelligence” grows greater, you can start to do some really interesting things with giving AI complex manuals For example, I gave Claude the manual to the future-building RPG Microscope and it built an entire storyline following the rules https://t.co/Xqkslt1wMX

Media 1Media 2
+2 more
❀️345
likes
πŸ”26
retweets
πŸ–ΌοΈ Media
M
Matt Popovich
@mpopv
πŸ“…
Nov 28, 2024
524d ago
πŸ†”24179357
⭐0.66

incredible https://t.co/bJJ9dJdBZu

Media 1
❀️238
likes
πŸ”5
retweets
πŸ–ΌοΈ Media
K
kalomaze
@kalomaze
πŸ“…
Nov 27, 2024
524d ago
πŸ†”27750273

blusky: a decentralized social media platform where most of the people who use it are against the decentralization https://t.co/n12Le3Q4MJ

Media 1
❀️889
likes
πŸ”35
retweets
πŸ–ΌοΈ Media
O
elvis
@omarsar0
πŸ“…
Nov 28, 2024
523d ago
πŸ†”53533584
⭐1.00

This new paper extends in-context learning through high-level automated reasoning. It achieves state-of-the-art accuracy (79.6%) on the MATH benchmark with Qwen2.5-7B-Instruct, surpassing GPT-4o (76.6%) and Claude 3.5 (71.1%). Rather than focusing on manually creating high-quality demonstrations, it shifts the focus to abstract thinking patterns. It introduces five atomic reasoning actions to construct chain-structured patterns. Then it uses Monte Carlo Tree Search to explore reasoning paths and construct though cards to guide inference. There is also a dynamic component that can match problems with the appropriate thought cards.

Media 1
❀️427
likes
πŸ”104
retweets
πŸ–ΌοΈ Media
_
AK
@_akhaliq
πŸ“…
Nov 28, 2024
524d ago
πŸ†”03595556
⭐1.00

Qwen-Agent Qwen-Agent is a framework for developing LLM applications based on the instruction following, tool usage, planning, and memory capabilities of Qwen. It also comes with example applications such as Browser Assistant, Code Interpreter, and Custom Assistant. comes with a gradio ui built in πŸ”₯

Media 1
❀️618
likes
πŸ”130
retweets
πŸ–ΌοΈ Media
O
elvis
@omarsar0
πŸ“…
Nov 27, 2024
524d ago
πŸ†”89910391

Using Windsurf to build an AI web app in ~15 mins. As promised, here is a demo of me building the AI-powered web-to-markdown converter using Windsurf. https://t.co/C7zSlSLwlb As I said in my previous post, I think Windsurf is ahead of Cursor when it comes to the agent stuff. I will continue to experiment with both and other tools like bolt and v0. Stay tuned for more!

Media 1
❀️134
likes
πŸ”10
retweets
πŸ–ΌοΈ Media
E
Ethan Mollick
@emollick
πŸ“…
Nov 28, 2024
524d ago
πŸ†”17981088
⭐1.00

AI is good at pricing, so when GPT-4 was asked to help merchants maximize profits - and it did exactly that by secretly coordinating with other AIs to keep prices high! So... aligned for whom? The merchant? The consumer? Society? The results we get depend on how we define 'help' https://t.co/eI6xc9Yh4Y

Media 1Media 2
❀️633
likes
πŸ”87
retweets
πŸ–ΌοΈ Media
I
Ivan Leo
@ivanleomk
πŸ“…
Nov 28, 2024
523d ago
πŸ†”13529155
⭐0.71

Can't click into my cmd+k @cursor_ai and chat is just broken for some reason today. Anyone else facing the same issue - is there an option to downgrade to an earlier version without uninstalling everything https://t.co/T3rKOTXv7O

Media 1
❀️1
likes
πŸ–ΌοΈ Media
E
Ethan Mollick
@emollick
πŸ“…
Nov 27, 2024
525d ago
πŸ†”40381381

I only saw one leaked Sora video that had a whole prompt and which was done in one shot (from the appropriately named @SirMrMeowmeow). Unfortunately could only download low resolution of it from X. I gave the same prompt to Runway & Kling, one shot. They struggled with mouths. https://t.co/U802Y7xGhf

❀️101
likes
πŸ”10
retweets
πŸ–ΌοΈ Media
I
Ivan Leo
@ivanleomk
πŸ“…
Nov 28, 2024
523d ago
πŸ†”18508437

Been having too much fun playing with @fal_ai_data hahaha and the new ltx video model https://t.co/EkixkHwHJ2

Media 1
❀️1
likes
πŸ–ΌοΈ Media
T
Teknium (e/Ξ»)
@Teknium1
πŸ“…
Nov 27, 2024
525d ago
πŸ†”47412351

BlueSky is brutal https://t.co/cbhK0tIkOe

Media 1Media 2
+1 more
❀️78
likes
πŸ”6
retweets
πŸ–ΌοΈ Media
R
Arvind Narayanan
@random_walker
πŸ“…
Nov 27, 2024
524d ago
πŸ†”27687804
⭐0.86

πŸ“’ New short paper on the limits of one type of inference scaling, by @benediktstroebl, @sayashk and me. The first page contains the main findings and message. ↓ (The title is a play on Inference Scaling Laws.) More work on the limits of inference scaling coming soon. 🧡 https://t.co/AAuYCVDYL9

Media 1
❀️183
likes
πŸ”44
retweets
πŸ–ΌοΈ Media
O
elvis
@omarsar0
πŸ“…
Nov 25, 2024
526d ago
πŸ†”50256335

Pushing Frontiers in Open Language Model Post-Training This is probably one of the important open-source efforts in post-training of LLMs. It introduces TÜLU 3, a family of fully-open state-of-the-art post-trained models, alongside its data, code, and training recipes, serving as a comprehensive guide for modern post-training techniques.

Media 1
❀️89
likes
πŸ”14
retweets
πŸ–ΌοΈ Media
O
elvis
@omarsar0
πŸ“…
Nov 26, 2024
525d ago
πŸ†”56550371

Cursor Agent vs. Windsurf Agent I've been testing both Cursor and Windsurf agents to build AI web apps. I believe that on the agent stuff, Windsurf is one step ahead. The agent feature in Windsurf feels very native and like a first-class citizen. My full demo and test here: https://t.co/oWDbsoW7QM I noticed that Cursor's agent still struggles with very basic things like figuring out the right models to use for the AI web apps. It's also not very consistent. I am not counting Cursor out. I am sure they can improve things really fast. I find it super interesting that even better and more powerful code editors are still on the horizon. Early days. What has your experience been?

Media 1
❀️476
likes
πŸ”66
retweets
πŸ–ΌοΈ Media
O
Omar Sanseviero
@osanseviero
πŸ“…
Nov 27, 2024
524d ago
πŸ†”53649152

The (non-exhaustive) evolution of base models If you want to learn more about it and how to use these models, check out the freshly released book "Hands-On Generative AI", written with @pcuenq @multimodalart and @johnowhitaker! https://t.co/tx9vnyHGzC https://t.co/uRD2zD3y1X

Media 1
❀️135
likes
πŸ”29
retweets
πŸ–ΌοΈ Media
A
Andrew Ng
@AndrewYNg
πŸ“…
Nov 25, 2024
526d ago
πŸ†”26105842
⭐1.00

Announcing new open-source Python package: aisuite! This makes it easy for developers to use large language models from multiple providers. When building applications I found it a hassle to integrate with multiple providers. Aisuite lets you pick a "provider:model" just by changing one string, like openai:gpt-4o, anthropic:claude-3-5-sonnet-20241022, ollama:llama3.1:8b, etc. pip install aisuite Open-source code with instructions: https://t.co/gwz9oKTCFx Thanks to Rohit Prsad, Kevin Solorio, @standsleeping, Jeff Tang and @Johnsanterre for helping build this!

Media 1
❀️5,958
likes
πŸ”1,136
retweets
πŸ–ΌοΈ Media
D
Deedy
@deedydas
πŸ“…
Nov 23, 2024
528d ago
πŸ†”74926299
⭐0.91

New workplace dystopia just dropped. AI monitoring software now flags you if you type slower than coworkers, take >30sec breaks, or checks notes have a consistent Mon-Thu but slightly different Friday. Bonus: It's collecting your workflow data to help automate your job away. https://t.co/Nkq0AkS7sp

Media 1
❀️18,397
likes
πŸ”2,945
retweets
πŸ–ΌοΈ Media
L
LlamaIndex πŸ¦™
@llama_index
πŸ“…
Nov 25, 2024
526d ago
πŸ†”06939019
⭐0.98

Learn how @arcee_ai processed millions of pages of NLP research papers using LlamaParse, creating a high-quality dataset for their AI agents: πŸ”Ή Efficient PDF-to-text conversion, preserving complex elements like tables and equations πŸ”Ή Flexible prompt system for refining extraction tasks πŸ”Ή Improved accuracy through iterative prompt adjustments See how LlamaParse outperformed traditional OCR and open-source alternatives in handling intricate scientific content in our case study: https://t.co/ZS0VWaaqCY

Media 1
❀️64
likes
πŸ”21
retweets
πŸ–ΌοΈ Media
H
Hamel Husain
@HamelHusain
πŸ“…
Nov 26, 2024
526d ago
πŸ†”47145649

Tried wordware. Got this error message in the first 5 minutes of using it. https://t.co/QifvLRln0c

Media 1
❀️7
likes
πŸ–ΌοΈ Media
S
Pietro Schirano
@skirano
πŸ“…
Nov 25, 2024
526d ago
πŸ†”71346161

Today @Anthropic is releasing MCP, a framework that allows Claude to run servers, giving it superpowers and effectively turning the Claude app into an API. We created some server that I think you'll love! FileSystem: Claude can create, read, and edit files and folders locally. https://t.co/2XnRVFltR4

❀️3,070
likes
πŸ”352
retweets
πŸ–ΌοΈ Media
E
Ethan Mollick
@emollick
πŸ“…
Nov 25, 2024
526d ago
πŸ†”56358211

I find it amusing that the emerging standard for giving an LLM the ability to work with your technology is just a text file explaining clearly how your technology works (Once folks realize they also need to sell the LLMs on why they should use a technology; things will get wild) https://t.co/UgdrJqSDkS

Media 1Media 2
❀️411
likes
πŸ”40
retweets
πŸ–ΌοΈ Media
E
Ethan Mollick
@emollick
πŸ“…
Nov 26, 2024
526d ago
πŸ†”98506451
⭐1.00

One blindspot for AI reasoning engines like o1 is that they all appear to be trained on very traditional deductive problem solving. What would a model trained on induction or abduction do? What about one trained on free association? Expert heuristics? Randomized exquisite corpse? https://t.co/feE19jHcGJ

Media 1Media 2
+1 more
❀️355
likes
πŸ”38
retweets
πŸ–ΌοΈ Media
C
Simo Ryu
@cloneofsimo
πŸ“…
Nov 25, 2024
526d ago
πŸ†”59724457

Wait o1 mightve been this work all along? https://t.co/7JFjtXnJo8 https://t.co/PRAi2b8Yi9

@edwardjhu β€’

proud to see what i worked on at OpenAI finally shipped! go 🐒!!

Media 1
❀️1,186
likes
πŸ”84
retweets
πŸ–ΌοΈ Media
T
Teknium (e/Ξ»)
@Teknium1
πŸ“…
Nov 26, 2024
526d ago
πŸ†”52575608
⭐0.80

Damn I got a top trending story about it lol https://t.co/efFwXqCIsv

@YiTayML β€’

Personal / life update: I have returned to @GoogleDeepMind to work on AI & LLM research. It was an exciting 1.5 years at @RekaAILabs and I truly learned a lot from this pretty novel experience. I wrote a short note about my experiences and transition on my personal blog here πŸ‘‡

Media 1
❀️38
likes
πŸ–ΌοΈ Media
O
elvis
@omarsar0
πŸ“…
Nov 26, 2024
525d ago
πŸ†”54113276

o1 Replication Journey - Part 2 Shows that combining simple distillation from O1's API with supervised fine-tuning significantly boosts performance on complex math reasoning tasks. "A base model fine-tuned on simply tens of thousands of samples O1-distilled long-thought chains outperform o1-preview on the American Invitational Mathematics Examination (AIME) with minimal technical complexity."

Media 1
❀️305
likes
πŸ”53
retweets
πŸ–ΌοΈ Media
I
Ivan Leo
@ivanleomk
πŸ“…
Nov 26, 2024
525d ago
πŸ†”67769797

Experimenting more with synthetic data and the spreads are looking decently good https://t.co/A5FStJVaGi

Media 1
❀️1
likes
πŸ–ΌοΈ Media