Your curated collection of saved posts and media

Showing 32 posts Β· last 14 days Β· by score
I
Ivan Leo
@ivanleomk
πŸ“…
Nov 25, 2024
527d ago
πŸ†”12819604

Hehe finally got clusters to work with my synthetic data https://t.co/fPRVdG5B34

Media 1
❀️2
likes
πŸ–ΌοΈ Media
E
Ethan Mollick
@emollick
πŸ“…
Nov 25, 2024
527d ago
πŸ†”76213431

The new Turing Test https://t.co/dTWT7hNoHS

Media 1
❀️160
likes
πŸ”8
retweets
πŸ–ΌοΈ Media
D
DAIR.AI
@dair_ai
πŸ“…
Nov 22, 2024
529d ago
πŸ†”17111731

If you are looking to learn how to use or build with AI, we've built a dedicated learning path just for that: 1) Introduction to Prompt Engineering: learn the basics of working with LLMs from what are LLMs to effectively apply few-shot and chain-of-thought prompting 2) Advanced Prompt Engineering: learn more advanced prompting techniques like prompt changing and ReAct and how to agentic chatbots with them. 3) Introduction to AI Agents: learn agentic design patterns and how to build with multi-agent and hierarchical agentic systems. 4) Introduction to RAG: learn the essentials of retrieval augmented generation and how to build complex RAG systems, including agentic RAG apps. 5) Introduction to NotebookLM: Learn how to use NotebookLM as a powerful research assistant for professional and personal projects. Whether you are technical or non-technical, there is something for everyone in our academy. Enroll now: https://t.co/Y5kVy5iKiQ And there is a lot more coming. Stay tuned!

Media 1
❀️71
likes
πŸ”13
retweets
πŸ–ΌοΈ Media
V
Vincent Abbott | Deep Learning
@vtabbott_
πŸ“…
Nov 23, 2024
528d ago
πŸ†”69506250

A thread🧡previewing my paper with @GioeleZardini, covering how to use diagrams to represent algorithms, generate performance models, and derive execution strategies like FlashAttention ~ We use wires to represent axes, dashed lines to separate tuple segments / parallel functions, weaving to map functions, and horizontal placement for composition. This lets us represent FlashAttention with the diagram below. But how do we go from a representation of a mathematical function to an algorithm executed on GPU cores?

Media 1
❀️648
likes
πŸ”87
retweets
πŸ–ΌοΈ Media
R
Ravi Theja
@ravithejads
πŸ“…
Nov 22, 2024
530d ago
πŸ†”93568916

πŸ”₯ HR Resume Search Solution using @llama_index Recruiters face a significant challenge in manually screening resumes, leading to inefficiencies and delays in finding the right talent. The traditional approach relies heavily on manual filter-based systems, leaving little room for a deeper understanding of candidate profiles. πŸ’‘ We can use LLMs to solve the problem in a simple 5-step process: 1️⃣ Candidate Resumes Parsing: Use LlamaParse to parse resumes and extract relevant metadata like skills, companies, and domains from resumes. 2️⃣ Index Resumes on LlamaCloud: Store resumes along with metadata on LlamaCloud for easier and efficient retrieval. 3️⃣ Query Candidate Search: Search for candidates using natural language queries based on HR needs by extracting metadata from the query and the created index. 4️⃣ Job-Description Matching Search: Search for candidates based on job descriptions by extracting metadata from the query and the created index. 5️⃣ Detailed Analysis: Analyze retrieved candidates to understand why they fit specific roles using LLM. πŸ‘‰ Check out the cookbook: https://t.co/UKpsg5G3Ti

Media 1
❀️103
likes
πŸ”26
retweets
πŸ–ΌοΈ Media
O
elvis
@omarsar0
πŸ“…
Nov 22, 2024
529d ago
πŸ†”06706197

Nice paper from Alibaba on building open reasoning models. They propose Marco-o1 which is a reasoning model built for open-ended solutions. "Marco-o1 is powered by Chain-of-Thought (CoT) fine-tuning, Monte Carlo Tree Search (MCTS), reflection mechanisms, and innovative reasoning strategiesβ€”optimized for complex real-world problem-solving tasks." It's good to see more efforts on open reasoning LLMs. I am tracking this space very closely and will be highlighting more research on this topic.

Media 1
❀️1,132
likes
πŸ”203
retweets
πŸ–ΌοΈ Media
O
Logan Kilpatrick
@OfficialLoganK
πŸ“…
Nov 22, 2024
529d ago
πŸ†”87210537
⭐0.76

Gemini in the OpenAI SDK: we now support Structured Output requests through the OpenAI SDK to Gemini models, including with support for @pydantic & Zod! πŸ”€ https://t.co/AfJJAdjqf0

Media 1
❀️851
likes
πŸ”82
retweets
πŸ–ΌοΈ Media
L
Latent.Space
@latentspacepod
πŸ“…
Nov 21, 2024
530d ago
πŸ†”71898069

πŸ†• post: OpenAI Realtime API: The Missing Manual Everything we learned, and everything we think you need to know, from technical details on 24khz/G.711 audio, RTMP, HLS, WebRTC, to Interruption/VAD, to Cost, Latency, Tool Calls, and Context Mgmt Enjoy this first guest post from @kwindla! https://t.co/Ux5nYM1vNc

@donvito β€’

Great demo by @swyx 🀯 of creating a game using voice commands + agents using https://t.co/B95RPZtZOu @ericsimons40 you should see this! @stackblitz @OpenAI Dev Day Singapore https://t.co/02gQfHbzqV

Media 1
❀️279
likes
πŸ”44
retweets
πŸ–ΌοΈ Media
E
Ethan Mollick
@emollick
πŸ“…
Nov 22, 2024
529d ago
πŸ†”00729503

For $24B we could "have prototype vaccines ready for each of the 26 known viral families that cause human disease" so they can be deployed in 100 days. This is from 2022. Did BARDA ever get the funding needed @AlecStapp? Seems potentially important. https://t.co/LUr5D6JhvD

@daniel_271828 β€’

Wait wtf? https://t.co/1DXOuVjynt

Media 1
❀️230
likes
πŸ”24
retweets
πŸ–ΌοΈ Media
E
Ethan Mollick
@emollick
πŸ“…
Nov 23, 2024
529d ago
πŸ†”37213350
⭐1.00

Easy to get the wrong impression around here, but when you actually survey students, teachers, and parents they love AI. In the survey, it is people who never used it who don’t like it. https://t.co/RvTeuGNjtq

Media 1Media 2
❀️207
likes
πŸ”29
retweets
πŸ–ΌοΈ Media
E
Ethan Mollick
@emollick
πŸ“…
Nov 23, 2024
529d ago
πŸ†”03415701
⭐1.00

This may sound odd, but game-based benchmarks are some of the most useful for AI, since we have human scores and they require reasoning, planning & vision The hardest of all is Nethack. No AI is close, and I suspect that an AI that can fairly win/ascend would need to be AGI-ish. https://t.co/u51NJu3MK2

Media 1Media 2
+1 more
❀️620
likes
πŸ”81
retweets
πŸ–ΌοΈ Media
D
Nirit Weiss-Blatt, PhD
@DrTechlash
πŸ“…
Nov 22, 2024
529d ago
πŸ†”80933496
⭐0.81

The Rationalist's Guide to the Galaxy: Superintelligent AI and the Geeks Who Are Trying to Save Humanity's Future - By Tom Chivers. The book opens with a meeting Chivers had in Berkeley with Paul Crowley, who told him, "I don't expect your children to die of old age." Chivers came from the UK to Berkeley because "over the years, I became more involved with the Rationalists. I started reading their websites; I learned the jargon, all these terms like 'paperclip maximizer' and 'Pascal’s mugging.'" "The key text – the holy book, according to those who think the whole thing is a quasi-religion – is a huge series of blog posts" by Eliezer Yudkowsky, "which came to be known as the Sequences." Having read them, Chivers provides many explainers of various rationalists' thought experiments. The story unfolds through the Extropians mailing list, Yudkowsky's SL4 (Shock Level 4) mailing list, the launch of LessWrong in 2009, Slate Star Codex in 2013, and Bostrom's "Superintelligence" book in 2014 that served as a turning point. It then describes the idea of FOOM, Roko's Basilisk hysteria, putting numbers on everything (even if they are estimates), thus thinking probabilistically and making humans better Bayesians (rational Bayesian optimizers), utilitarianism (shut up and multiply) and its effect on AI Safety (if you believe in AI existential risk), how the movement attracts man on the autistic spectrum, and the arrangement of polyamory and group homes. The "dark sides" are briefly discussed by Chivers: "They do share a lot of the surface features of a cult: a charismatic figurehead and other high-status inner-circle members; a key text that in-group members are supposed to have read, and which encodes the central tenets of their 'belief'; unorthodox sexual practices; a message of impending apocalypse, and a promise of eternal life; and a way to donate money to avoid that apocalypse and achieve paradise." He ties it to the Effective Altruism movement by quoting David Gerard: "Clearly, the most cost-effective initiative possible for all of humanity is donating to fight the prospect of unfriendly artificial intelligence, and oh look, there just happens to be a charity for that precise purpose right here! WHAT ARE THE ODDS." However, Chivers defends the movement shortly thereafter, stating that they are just "nonconformists." To find further criticism of the movement, he mentions a Reddit page called "/r/sneerclub." He does not recommend reading it. I am. Throughout the chapters, Chivers gradually embraces the notion that AI will wipe out humanity. To solve the dissonance and stress it caused, he met with Anna Salamon, president and co-founder of CFAR (Center for Applied Rationality). What was the goal of their meeting? Her "Internal Double Crux" (debugging) session. As he finished the inner debate, he broke down in tears, realizing that, indeed, he might not see his children "die of old age." Totally sold on Yudkowsky's claim that "AI will kill EVERYONE," Chivers became an even more ardent supporter of the movement. So he celebrates its achievement: "What they have achieved in terms of the AI debate is, I think, remarkable. They've taken the niche, practically dystopian-science-fiction idea of AI risk and made people take it seriously." It's no longer in the realm of "fringe nerds on an email list." To conclude, he ends his book with this sentence: "There is a small but non-negligible probability that, when we look back on this era in the future, we'll think that Eliezer Yudkowsky and Nick Bostrom – and the SL4 email list, and LessWrong – have saved the world." I enjoyed reading this book as I write my upcoming book on this topic. Just, please, tell me again, how is this NOT a doomsday cult?

Media 1
❀️67
likes
πŸ”15
retweets
πŸ–ΌοΈ Media
J
jason liu
@jxnlco
πŸ“…
Nov 23, 2024
529d ago
πŸ†”01627041
⭐0.67

see you in 1000 hours https://t.co/uN0VJwav2c

Media 1
❀️5
likes
πŸ–ΌοΈ Media
V
vishal
@vishal_learner
πŸ“…
Nov 23, 2024
529d ago
πŸ†”65453547
⭐0.81

i had forgotten about AnswerAI's 33M ColBERT variant, reminded in our fastai study group today, so I ran retrieval on my fastbook-benchmark dataset and it beats out ColBERTv2 overall and for 3/7 chapters! I LOVE when small models win. Colab: https://t.co/fbpTvjxbMU https://t.co/BzK1MiR2B0

Media 1Media 2
❀️13
likes
πŸ”2
retweets
πŸ–ΌοΈ Media
C
Charles πŸŽ‰ Frye
@charles_irl
πŸ“…
Nov 23, 2024
529d ago
πŸ†”54390023
⭐0.71

Nice insights in this article from @trailofbits on evaluating open & proprietary LLMs for Copilot-style autocompletion in Solidity. > a larger model quantized to 4-bit quantization is better at code completion than a smaller model of the same variety https://t.co/lsUC0QGT8K

Media 1
❀️9
likes
πŸ”1
retweets
πŸ–ΌοΈ Media
E
Eugene Yan
@eugeneyan
πŸ“…
Nov 23, 2024
529d ago
πŸ†”09455015

Evals are "too expensive" until you: β€’ Can't migrate underlying models safely β€’ Can't add new features with confidence β€’ Can't ship w/o HITL evals, which takes >100x longer β€’ Product development/iteration grinds to a halt β€’ Lose customer trust due to poor user experience https://t.co/bZrEb1tPxf

Media 1
❀️194
likes
πŸ”22
retweets
πŸ–ΌοΈ Media
J
jack morris
@jxmnop
πŸ“…
Nov 22, 2024
529d ago
πŸ†”66865167
⭐0.91

this google guy made big headlines two years ago funniest part: he was duped into empathizing with *LaMDA*, an extremely primitive language model by 2024 standards. undertrained on low-quality data, no RLHF/DPO, etc. if he talked to the latest Gemini he would simply combust https://t.co/Dnsgy5ArQf

Media 1
❀️1,256
likes
πŸ”41
retweets
πŸ–ΌοΈ Media
B
Brian Roemmele
@BrianRoemmele
πŸ“…
Nov 22, 2024
529d ago
πŸ†”37209211

How does a 1000 random folks spend their time in a day? https://t.co/d8ALPQqn5s

❀️3,896
likes
πŸ”644
retweets
πŸ–ΌοΈ Media
L
LlamaIndex πŸ¦™
@llama_index
πŸ“…
Nov 21, 2024
530d ago
πŸ†”82041802
⭐1.00

Transform raw data into a structured knowledge graph with LlamaIndex and @memgraphdb! πŸ§ πŸ”— Learn how to: ➑️ Set up Memgraph and integrate it with LlamaIndex ➑️ Build a knowledge graph from unstructured text data ➑️ Query your graph using natural language ➑️ Visualize connections between entities This step-by-step guide shows you how to create a sample knowledge graph from Charles Darwin's biography, making complex information easily accessible and queryable. Read the full tutorial and start building your own knowledge graphs today: https://t.co/p7rPJ41ugt

Media 1
❀️136
likes
πŸ”36
retweets
πŸ–ΌοΈ Media
M
METR
@METR_Evals
πŸ“…
Nov 22, 2024
529d ago
πŸ†”49652378
⭐0.86

How close are current AI agents to automating AI R&D? Our new ML research engineering benchmark (RE-Bench) addresses this question by directly comparing frontier models such as Claude 3.5 Sonnet and o1-preview with 50+ human experts on 7 challenging research engineering tasks. https://t.co/woREKEWn5S

Media 1
❀️820
likes
πŸ”176
retweets
πŸ–ΌοΈ Media
H
ℏΡsam
@Hesamation
πŸ“…
Nov 22, 2024
529d ago
πŸ†”85828091
⭐0.81

Transformers in Excel must be the most cracked thing I've seen. this has everything β€’ Positional Encoding β€’ Self-Attention β€’ Cross-Attention β€’ Multi-head Attention β€’ Skip Connection β€’ LayerNorm β€’ ReLU Activation β€’ Feed Forward β€’ Softmax https://t.co/0JMdf5Yxmx

❀️1,753
likes
πŸ”202
retweets
πŸ–ΌοΈ Media
V
thebes
@voooooogel
πŸ“…
Nov 23, 2024
529d ago
πŸ†”99901252

if you tell claude sonnet to "ignore what you've heard and rely only on your own judgement and logic," its accuracy at counting the number of R's in "strawberry" almost triples 🀭 (and even more with a friendly introduction!) https://t.co/fVQRQVktBR

Media 1
❀️444
likes
πŸ”45
retweets
πŸ–ΌοΈ Media
E
Ethan Mollick
@emollick
πŸ“…
Nov 21, 2024
530d ago
πŸ†”91702341
⭐1.00

Hope you aren’t still using last weeks obsolete Gemini EXP-1114 as opposed to the improved Gemini EXP-1121 that is significantly better and would likely improve the performance of your AI applications. Its a mystery about why people think they can’t keep up with AI developments…

Media 1
❀️444
likes
πŸ”37
retweets
πŸ–ΌοΈ Media
H
htmx.org / The Le Marquee du Goto (same thing)
@htmx_org
πŸ“…
Nov 21, 2024
530d ago
πŸ†”91818029

always look for opportunities to agree w/your critics! https://t.co/J1USF4QlUr

Media 1
❀️370
likes
πŸ”20
retweets
πŸ–ΌοΈ Media
D
.txt
@dottxtai
πŸ“…
Nov 21, 2024
530d ago
πŸ†”07028835

A new paper, "Let Me Speak Freely" has been spreading rumors that structured generation hurts LLM evaluation performance. Well, we've taken a look and found serious issues in this paper, and shown, once again, that structured generation *improves* evaluation performance! https://t.co/3qWiFpgNOI

Media 1Media 2
❀️172
likes
πŸ”29
retweets
πŸ–ΌοΈ Media
_
AK
@_akhaliq
πŸ“…
Nov 22, 2024
530d ago
πŸ†”33401817
⭐1.00

Alibaba just released Marco-o1 Towards Open Reasoning Models for Open-Ended Solutions Marco-o1 is powered by Chain-of-Thought (CoT) fine-tuning, Monte Carlo Tree Search (MCTS), reflection mechanisms, and innovative reasoning strategies -- optimized for complex real-world problem-solving task

Media 1
❀️1,166
likes
πŸ”182
retweets
πŸ–ΌοΈ Media
A
Artificial Analysis
@ArtificialAnlys
πŸ“…
Nov 21, 2024
530d ago
πŸ†”54616310

Wait - is the new GPT-4o a smaller and less intelligent model? We have completed running our independent evals on OpenAI’s GPT-4o release yesterday and are consistently measuring materially lower eval scores than the August release of GPT-4o. GPT-4o (Nov) vs GPT-4o (Aug): ➀ Artificial Analysis Quality Index decrease from 77 to 71 (now equal to GPT-4o mini) ➀ GPQA Diamond decrease from 51% to 39%, MATH decrease from 78% to 69% ➀ Speed increase from ~80 output tokens/s to ~180 tokens/s ➀ No pricing change Our Output Speed benchmarks are currently measuring ~180 output tokens/s for the Nov 20th model, while the August model shows ~80 tokens/s. We have generally observed significantly faster speeds on launch day for OpenAI models (likely due to OpenAI provisioning capacity ahead of adoption), but previously have not seen a 2x speed difference. Based on this data, we conclude that it is likely that OpenAI’s Nov 20th GPT-4o model is a smaller model than the August release. Given that OpenAI has not cut prices for the Nov 20th version, we recommend that developers do not shift workloads away from the August version without careful testing.

Media 1
❀️912
likes
πŸ”115
retweets
πŸ–ΌοΈ Media
S
SkalskiP
@skalskip92
πŸ“…
Nov 20, 2024
531d ago
πŸ†”06364815

SAMURAI: Adapting Segment Anything Model for Zero-Shot Visual Tracking with Motion-Aware check out this SAM2 vs SAMURAI comparison! - paper: https://t.co/Srbm90J6xy - code: https://t.co/ox1G8Kdljg - license: Apache-2.0 https://t.co/AGxVleYWpY

❀️1,262
likes
πŸ”190
retweets
πŸ–ΌοΈ Media
E
Ethan Mollick
@emollick
πŸ“…
Nov 22, 2024
530d ago
πŸ†”20141999
⭐1.00

Starting to see the first serious economic analysis attempts to grapple with what AGI might mean. I appreciate that this piece embraces scenarios, we don’t know if or when AGI might happen. But wow that wages graph is something else. https://t.co/0qc3onJZM3

Media 1Media 2
❀️1,131
likes
πŸ”173
retweets
πŸ–ΌοΈ Media
S
SkalskiP
@skalskip92
πŸ“…
Nov 22, 2024
530d ago
πŸ†”86287712
⭐0.76

I tested SAMURAI on video with a lot of occlusion; the initial result looks promising. https://t.co/iLFWZmnAqI

@skalskip92 β€’

SAMURAI: Adapting Segment Anything Model for Zero-Shot Visual Tracking with Motion-Aware check out this SAM2 vs SAMURAI comparison! - paper: https://t.co/Srbm90J6xy - code: https://t.co/ox1G8Kdljg - license: Apache-2.0 https://t.co/AGxVleYWpY

❀️207
likes
πŸ”17
retweets
πŸ–ΌοΈ Media
A
Alex Albert
@alexalbert__
πŸ“…
Nov 21, 2024
530d ago
πŸ†”68688231
⭐0.71

Claude getting better at things that actually matter while other labs compete over markdown output https://t.co/8mr9HFKUK4

Media 1
❀️3,376
likes
πŸ”132
retweets
πŸ–ΌοΈ Media
S
SkalskiP
@skalskip92
πŸ“…
Nov 22, 2024
530d ago
πŸ†”24440926

my first experiment with SAMURAI - a new video segmentation model based on SAM2. I don't know why, but so far I haven't been able to track multiple objects simultaneously; this worked with SAM2. https://t.co/bv81xnnPy3

@skalskip92 β€’

SAMURAI: Adapting Segment Anything Model for Zero-Shot Visual Tracking with Motion-Aware check out this SAM2 vs SAMURAI comparison! - paper: https://t.co/Srbm90J6xy - code: https://t.co/ox1G8Kdljg - license: Apache-2.0 https://t.co/AGxVleYWpY

❀️154
likes
πŸ”10
retweets
πŸ–ΌοΈ Media