Your curated collection of saved posts and media

Showing 32 posts ยท last 14 days ยท by score
_
AK
@_akhaliq
๐Ÿ“…
Wed
๐Ÿ†”83886083

๐ŸŽGPT or iGPT https://t.co/EF1baAIINN https://t.co/Qa24tJhecn

Media 1
โค๏ธ48
likes
๐Ÿ”8
retweets
๐Ÿ–ผ๏ธ Media
L
Tom Lieberum ๐Ÿ”Ž
@lieberum_t
๐Ÿ“…
Wed
๐Ÿ†”92370180

Mech interp has been very successful in tiny models, but does it scale? โ€ฆKinda! Our new @GoogleDeepMind paper studies how Chinchilla70B can do multiple-choice Qs, focusing on picking the correct letter. Small model techniques mostly work but it's messy!๐Ÿงตhttps://t.co/SLFEOqltYR https://t.co/LT29ry8o9t

Media 1
โค๏ธ197
likes
๐Ÿ”39
retweets
๐Ÿ–ผ๏ธ Media
O
elvis
@omarsar0
๐Ÿ“…
Wed
๐Ÿ†”06606848

How is ChatGPTโ€™s behavior changing over time? If you are developing with LLMs or in this case GPT-3.5 or GPT-4, it's definitely worth taking a look at this report. There is suspicion in the AI community that models like GPT-4 are changing/degrading in performance and behavior.โ€ฆ https://t.co/Z7J5pKpRnm https://t.co/a5csuHKaUf

Media 1
โค๏ธ187
likes
๐Ÿ”41
retweets
๐Ÿ–ผ๏ธ Media
_
AK
@_akhaliq
๐Ÿ“…
Sat
๐Ÿ†”09506818

Localizing Object-level Shape Variations with Text-to-Image Diffusion Models @Gradio demo is out on @huggingface demo: https://t.co/D2hMIVgess https://t.co/Q6iM9mrcLp

Media 1
โค๏ธ212
likes
๐Ÿ”40
retweets
๐Ÿ–ผ๏ธ Media
M
Marko ๐ŸŽฉ
@markopolojarvi
๐Ÿ“…
Wed
๐Ÿ†”19828480

.@huggingface LLM leaderboard is saying fine-tuned 30b llama v1 beats 70b llama v2 chat. I have a creeping feeling that something in our LLM benchmarks is not working that great. https://t.co/OUwfXwHKkn

Media 1
โค๏ธ18
likes
๐Ÿ”2
retweets
๐Ÿ–ผ๏ธ Media
A
abhishek
@abhi1thakur
๐Ÿ“…
Wed
๐Ÿ†”96835841

LLAMA-v2 training successfully on Google Colab's free version! "pip install autotrain-advanced" ๐Ÿ’ฅ Yes, you can also use your local machine! https://t.co/VOvocAQ46c

Media 1Media 2
โค๏ธ1,442
likes
๐Ÿ”248
retweets
๐Ÿ–ผ๏ธ Media
E
Ethan Mollick
@emollick
๐Ÿ“…
Wed
๐Ÿ†”40761601

So the suspicions about the dumbing-down of GPT-4 may actually be right! Here is some initial hard evidence that GPT-4 is actually getting less capable (and GPT-3.5 is getting more so), since launch. Also, why it is hard to build on AI, when model abilities are quietly changed. https://t.co/xyGyPdhhIE https://t.co/uszx7m0qV9

Media 1
โค๏ธ1,975
likes
๐Ÿ”493
retweets
๐Ÿ–ผ๏ธ Media
_
AK
@_akhaliq
๐Ÿ“…
Wed
๐Ÿ†”53345024

Augmenting CLIP with Improved Visio-Linguistic Reasoning paper page: https://t.co/PHbgZCUuRi Image-text contrastive models such as CLIP are useful for a variety of downstream applications including zero-shot classification, image-text retrieval and transfer learning. However,โ€ฆ https://t.co/Eu1TgBgmyb https://t.co/JP6iuPXSMV

Media 1
โค๏ธ135
likes
๐Ÿ”26
retweets
๐Ÿ–ผ๏ธ Media
L
LlamaIndex ๐Ÿฆ™ (GPT Index)
@llama_index
๐Ÿ“…
Tue Jul 18
๐Ÿ†”96991749

๐Ÿฆ™x๐Ÿฆ™ = ๐Ÿ’ช Experiment with Llama 2 now via LlamaIndex! We made a special release (v0.7.10.post1) to help you get started super easily ๐Ÿ‘‡ https://t.co/ddLURZwmPG https://t.co/CkfBHHE1Cb

Media 1
โค๏ธ114
likes
๐Ÿ”18
retweets
๐Ÿ–ผ๏ธ Media
O
Omar Sanseviero
@osanseviero
๐Ÿ“…
Tue Jul 18
๐Ÿ†”82593292
โญ0.84

Many are asking how Llama 2 compares to other popular models. It is clearly better compared to other models of similar size and the best OS model based on the benchmarks! ๐Ÿ”ฅ Compare against dozens of other models: https://t.co/szZabnc2bY See screenshot below โคต๏ธ https://t.co/ifblNapfwW

Media 1
โค๏ธ73
likes
๐Ÿ”16
retweets
๐Ÿ–ผ๏ธ Media
A
Anthropic
@AnthropicAI
๐Ÿ“…
Tue Jul 18
๐Ÿ†”83229189

When language models โ€œreason out loud,โ€ itโ€™s hard to know if their stated reasoning is faithful to the process the model actually used to make its prediction. In two new papers, we measure and improve the faithfulness of language modelsโ€™ stated reasoning. https://t.co/eumrl2gxk1

Media 1
โค๏ธ729
likes
๐Ÿ”127
retweets
๐Ÿ–ผ๏ธ Media
C
Thomas Capelle
@capetorch
๐Ÿ“…
Tue Jul 18
๐Ÿ†”49099008

Just realising that training Llama 2 70B emits as much CO2 as a full transatlantic flight from New York <-> Paris (back and forth) ๐Ÿ˜Ž GPUs are getting pretty efficient! also, this can be offset; planes don't. https://t.co/cVxtS0WQcJ

Media 1
โค๏ธ85
likes
๐Ÿ”5
retweets
๐Ÿ–ผ๏ธ Media
_
AK
@_akhaliq
๐Ÿ“…
Tue Jul 18
๐Ÿ†”84721920
โญ1.00

Measuring Faithfulness in Chain-of-Thought Reasoning paper: https://t.co/cSSNX0zkOK Large language models (LLMs) perform better when they produce step-by-step, โ€œChain-ofThoughtโ€ (CoT) reasoning before answering a question, but it is unclear if the stated reasoning is a faithfulโ€ฆ https://t.co/qGLFZb3Y5G https://t.co/gL0JSXRRN3

Media 1
โค๏ธ156
likes
๐Ÿ”44
retweets
๐Ÿ–ผ๏ธ Media
_
AK
@_akhaliq
๐Ÿ“…
Tue Jul 18
๐Ÿ†”85542404

Meta releases Llama 2: Open Foundation and Fine-Tuned Chat Models paper: https://t.co/bhG3W56DCW blog: https://t.co/iPHa0PL0DU develop and release Llama 2, a collection of pretrained and fine-tuned large language models (LLMs) ranging in scale from 7 billion to 70 billionโ€ฆ https://t.co/bMAVVNjQbU https://t.co/5Zqk3QAq6m

โค๏ธ2,235
likes
๐Ÿ”599
retweets
๐Ÿ–ผ๏ธ Media
Y
Yusuf Mehdi
@yusuf_i_mehdi
๐Ÿ“…
Tue Jul 18
๐Ÿ†”87526658

Announcing Bing Chat Enterprise, the AI-powered chat for work with commercial data protection! Now you can take full advantage of Generative AI creativity at work with the confidence your confidential work information wonโ€™t leak outside your company. https://t.co/8q47eijGzl https://t.co/OrkZbdLeIi

โค๏ธ223
likes
๐Ÿ”53
retweets
๐Ÿ–ผ๏ธ Media
N
Nathan Lambert
@natolambert
๐Ÿ“…
Tue Jul 18
๐Ÿ†”51901953

@_akhaliq If you're looking for long form technical analysis of the paper -- ie what you need to actually know, here's my piece: https://t.co/NwDokIYq3W https://t.co/MS3QfMunBi

Media 1
โค๏ธ38
likes
๐Ÿ”7
retweets
๐Ÿ–ผ๏ธ Media
G
Gradio
@Gradio
๐Ÿ“…
Mon
๐Ÿ†”26695686

BIG NEWS ๐Ÿฅณ๐ŸŽˆ Building Chatbots apps just got wayyy easier: announcing the new ๐™ฒ๐š‘๐šŠ๐š๐™ธ๐š—๐š๐šŽ๐š›๐š๐šŠ๐šŒ๐šŽ class ๐Ÿ™Œ The *fastest* way to build to build a Chatbot UI in Python -- including streaming, undo/retry, API, all out of the box! Let's take a look at a few examples... https://t.co/vsoWnmN7uP

Media 1
โค๏ธ478
likes
๐Ÿ”96
retweets
๐Ÿ–ผ๏ธ Media
I
INDRAJEET
@indrajeet877
๐Ÿ“…
Mon
๐Ÿ†”93036035

Incredible to see big tech's massive contributions to open-source #AI on @huggingface ๐Ÿš€๐Ÿค– 1. @Meta: 689 models including the likes of MusicGen๐ŸŽต, Galactica๐ŸŒŒ, Wav2Vec๐ŸŽ™๏ธ, RoBERTa๐Ÿ“š! 2. @Google: 591 models powering AI with BERT๐Ÿฆœ, Flan๐Ÿฎ, T5๐Ÿ”ข, mobilnet๐Ÿ“ฑ... 3. @Microsoft: 252โ€ฆ https://t.co/68Xp0DtauI https://t.co/mIVJ7eAC30

Media 1
โค๏ธ204
likes
๐Ÿ”43
retweets
๐Ÿ–ผ๏ธ Media
D
Zara Safsovski
@drfzs
๐Ÿ“…
Sat
๐Ÿ†”06058240

First attempt at using #PIKALABS @pika_labs. #aivideo #texttovideo #AIgirl #AIcommunity https://t.co/4wpVHrsaQe https://t.co/Lu5SbJ6ZsE

โค๏ธ70
likes
๐Ÿ”6
retweets
๐Ÿ–ผ๏ธ Media
T
Tom Jobbins
@TheBlokeAI
๐Ÿ“…
Fri
๐Ÿ†”33173266

The other day I discovered a little environment variable buried in the @huggingface Hub Python docs: ๐™ท๐™ต_๐™ท๐š„๐™ฑ_๐™ด๐™ฝ๐™ฐ๐™ฑ๐™ป๐™ด_๐™ท๐™ต_๐šƒ๐š๐™ฐ๐™ฝ๐š‚๐™ต๐™ด๐š It has changed my life! Docs say 2x faster, but in my testing it's 3-5x faster ๐Ÿš€๐Ÿ˜ (and it's just as fast for uploads!) https://t.co/u1XVULnidb

Media 1Media 2
โค๏ธ232
likes
๐Ÿ”49
retweets
๐Ÿ–ผ๏ธ Media
E
Enrico Shippole
@EnricoShippole
๐Ÿ“…
Fri
๐Ÿ†”20256000

Introducing LLongMA, a series of OpenLLaMA models, trained at 8k context length using linear positional interpolation scaling. The model was trained in collaboration with @theemozilla of @NousResearch and Kaiokendev. https://t.co/uMCy5Da14Z

Media 1
โค๏ธ108
likes
๐Ÿ”22
retweets
๐Ÿ–ผ๏ธ Media
M
merve
@mervenoyann
๐Ÿ“…
Fri
๐Ÿ†”65845763

Text-to-Video task page just landed at @huggingface Tasks ๐Ÿค— In this page you will learn how you can generate videos from text ๐Ÿฟ๐ŸŽฅ Get started here ๐Ÿ‘‰ https://t.co/rCvsEVSavw ๐ŸŒŠ https://t.co/NQboUEsMTS

Media 1
โค๏ธ170
likes
๐Ÿ”43
retweets
๐Ÿ–ผ๏ธ Media
A
Adina Yakup
@AdeenaY8
๐Ÿ“…
Fri
๐Ÿ†”21294080

ไฝ ๆ”ถๅˆฐไปŠๅคฉ็š„Daily Papersไบ†ๅ—๏ผŸ๐Ÿ‘€ ่ฎข้˜…๐Ÿ‘‰ https://t.co/W7m6clPrtf https://t.co/RZtmaguAw7

Media 1
โค๏ธ18
likes
๐Ÿ”4
retweets
๐Ÿ–ผ๏ธ Media
O
elvis
@omarsar0
๐Ÿ“…
Fri
๐Ÿ†”44404994

Today completes the first cohort of our new course on Prompt Engineering for LLMs. Participants worked on projects like customer support email generation, prompt injection detectors, LLM-based evaluators, and LLM-powered conversational assistants. The discussions in this cohortโ€ฆ https://t.co/6iOdsGzUkp https://t.co/ASaS2XH3tq

Media 1
โค๏ธ182
likes
๐Ÿ”27
retweets
๐Ÿ–ผ๏ธ Media
B
John Backus
@backus
๐Ÿ“…
Sat
๐Ÿ†”93516544

The code interpreter feature on ChatGPT is the most mind blowing thing I've seen yet. All I did was upload a CSV of SF crime data and ask it to visualize trends(!!) https://t.co/pkFdPqgAzb

Media 1Media 2
+2 more
โค๏ธ2,992
likes
๐Ÿ”469
retweets
๐Ÿ–ผ๏ธ Media
_
AK
@_akhaliq
๐Ÿ“…
Fri
๐Ÿ†”22870784
โญ1.00

VideoGLUE: Video General Understanding Evaluation of Foundation Models paper page: https://t.co/Y97nZAXGm9 We evaluate existing foundation models video understanding capabilities using a carefully designed experiment protocol consisting of three hallmark tasks (actionโ€ฆ https://t.co/WJxQbZNRc2 https://t.co/sgnEheZKj2

Media 1
โค๏ธ110
likes
๐Ÿ”25
retweets
๐Ÿ–ผ๏ธ Media
_
AK
@_akhaliq
๐Ÿ“…
Fri
๐Ÿ†”96118792

mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding paper page: https://t.co/t52BJ76bYv Document understanding refers to automatically extract, analyze and comprehend information from various types of digital documents, such as a web page.โ€ฆ https://t.co/J6lkXD3dSc https://t.co/Xk2P2cUaTX

Media 1
โค๏ธ81
likes
๐Ÿ”19
retweets
๐Ÿ–ผ๏ธ Media
K
Kevin Black
@kvablack
๐Ÿ“…
Wed
๐Ÿ†”21289216

The biggest problem with our RL diffusion paper was that nobody could run our Jax+TPU code. No more! I've reimplemented DDPO in PyTorch, plus replicated our results using LoRA for low-memory training! Links below ๐Ÿ‘‡ https://t.co/J7J51mpPot

โค๏ธ291
likes
๐Ÿ”43
retweets
๐Ÿ–ผ๏ธ Media
A
Aravind Srinivas
@AravSrinivas
๐Ÿ“…
Thu Jul 06
๐Ÿ†”79134977

A lot of users just wanted to just ask on perplexity via their search bar on Chrome thanks to muscle memory. We listened and have it ready now! Install the Perplexity Default Search Chrome Plugin and all your searches are now just Perplexity queries! https://t.co/tTTf2Kh3gv https://t.co/wpa7Qi1ml2

โค๏ธ121
likes
๐Ÿ”14
retweets
๐Ÿ–ผ๏ธ Media
G
Gabriel Peyrรฉ
@gabrielpeyre
๐Ÿ“…
Thu Jul 06
๐Ÿ†”73553153

The hairy ball theorem states that in odd dimension d, vector fields on the tangent plane of a (d-1)-sphere necessarily contain a singular point (where it vanishes). https://t.co/5UNiMnjI42 https://t.co/I6RsLEvzry

โค๏ธ917
likes
๐Ÿ”172
retweets
๐Ÿ–ผ๏ธ Media
F
Julian Bilcke
@flngr
๐Ÿ“…
Thu Jul 06
๐Ÿ†”18351872
โญ0.74

Made a quick demo reel for the future HD version of the AI Web TV stream ๐Ÿ‘€ https://t.co/sF4GVqsOJh

โค๏ธ46
likes
๐Ÿ”9
retweets
๐Ÿ–ผ๏ธ Media
_
AK
@_akhaliq
๐Ÿ“…
Thu Jul 06
๐Ÿ†”61685250

The https://t.co/4n2b8GFMaM email just went out https://t.co/2GUVYgiXbs

Media 1
โค๏ธ54
likes
๐Ÿ”8
retweets
๐Ÿ–ผ๏ธ Media