_

AK

@_akhaliq

📅

Wed

🆔83886083

🍎GPT or iGPT https://t.co/EF1baAIINN https://t.co/Qa24tJhecn

❤️48

likes

🔁8

retweets

🖼️ Media

View Details View on X ↗

L

Tom Lieberum 🔎

@lieberum_t

📅

Wed

🆔92370180

Mech interp has been very successful in tiny models, but does it scale? …Kinda! Our new @GoogleDeepMind paper studies how Chinchilla70B can do multiple-choice Qs, focusing on picking the correct letter. Small model techniques mostly work but it's messy!🧵https://t.co/SLFEOqltYR https://t.co/LT29ry8o9t

❤️197

likes

🔁39

retweets

🖼️ Media

View Details View on X ↗

O

elvis

@omarsar0

📅

Wed

🆔06606848

How is ChatGPT’s behavior changing over time? If you are developing with LLMs or in this case GPT-3.5 or GPT-4, it's definitely worth taking a look at this report. There is suspicion in the AI community that models like GPT-4 are changing/degrading in performance and behavior.… https://t.co/Z7J5pKpRnm https://t.co/a5csuHKaUf

❤️187

likes

🔁41

retweets

🖼️ Media

View Details View on X ↗

_

AK

@_akhaliq

📅

Sat

🆔09506818

Localizing Object-level Shape Variations with Text-to-Image Diffusion Models @Gradio demo is out on @huggingface demo: https://t.co/D2hMIVgess https://t.co/Q6iM9mrcLp

❤️212

likes

🔁40

retweets

🖼️ Media

View Details View on X ↗

M

Marko 🎩

@markopolojarvi

📅

Wed

🆔19828480

.@huggingface LLM leaderboard is saying fine-tuned 30b llama v1 beats 70b llama v2 chat. I have a creeping feeling that something in our LLM benchmarks is not working that great. https://t.co/OUwfXwHKkn

❤️18

likes

🔁2

retweets

🖼️ Media

View Details View on X ↗

A

abhishek

@abhi1thakur

📅

Wed

🆔96835841

LLAMA-v2 training successfully on Google Colab's free version! "pip install autotrain-advanced" 💥 Yes, you can also use your local machine! https://t.co/VOvocAQ46c

❤️1,442

likes

🔁248

retweets

🖼️ Media

View Details View on X ↗

E

Ethan Mollick

@emollick

📅

Wed

🆔40761601

So the suspicions about the dumbing-down of GPT-4 may actually be right! Here is some initial hard evidence that GPT-4 is actually getting less capable (and GPT-3.5 is getting more so), since launch. Also, why it is hard to build on AI, when model abilities are quietly changed. https://t.co/xyGyPdhhIE https://t.co/uszx7m0qV9

❤️1,975

likes

🔁493

retweets

🖼️ Media

View Details View on X ↗

_

AK

@_akhaliq

📅

Wed

🆔53345024

Augmenting CLIP with Improved Visio-Linguistic Reasoning paper page: https://t.co/PHbgZCUuRi Image-text contrastive models such as CLIP are useful for a variety of downstream applications including zero-shot classification, image-text retrieval and transfer learning. However,… https://t.co/Eu1TgBgmyb https://t.co/JP6iuPXSMV

❤️135

likes

🔁26

retweets

🖼️ Media

View Details View on X ↗

L

LlamaIndex 🦙 (GPT Index)

@llama_index

📅

Tue Jul 18

🆔96991749

🦙x🦙 = 💪 Experiment with Llama 2 now via LlamaIndex! We made a special release (v0.7.10.post1) to help you get started super easily 👇 https://t.co/ddLURZwmPG https://t.co/CkfBHHE1Cb

❤️114

likes

🔁18

retweets

🖼️ Media

View Details View on X ↗

O

Omar Sanseviero

@osanseviero

📅

Tue Jul 18

🆔82593292

⭐0.84

Many are asking how Llama 2 compares to other popular models. It is clearly better compared to other models of similar size and the best OS model based on the benchmarks! 🔥 Compare against dozens of other models: https://t.co/szZabnc2bY See screenshot below ⤵️ https://t.co/ifblNapfwW

❤️73

likes

🔁16

retweets

🖼️ Media

View Details View on X ↗

A

Anthropic

@AnthropicAI

📅

Tue Jul 18

🆔83229189

When language models “reason out loud,” it’s hard to know if their stated reasoning is faithful to the process the model actually used to make its prediction. In two new papers, we measure and improve the faithfulness of language models’ stated reasoning. https://t.co/eumrl2gxk1

❤️729

likes

🔁127

retweets

🖼️ Media

View Details View on X ↗

C

Thomas Capelle

@capetorch

📅

Tue Jul 18

🆔49099008

Just realising that training Llama 2 70B emits as much CO2 as a full transatlantic flight from New York <-> Paris (back and forth) 😎 GPUs are getting pretty efficient! also, this can be offset; planes don't. https://t.co/cVxtS0WQcJ

❤️85

likes

🔁5

retweets

🖼️ Media

View Details View on X ↗

_

AK

@_akhaliq

📅

Tue Jul 18

🆔84721920

⭐1.00

Measuring Faithfulness in Chain-of-Thought Reasoning paper: https://t.co/cSSNX0zkOK Large language models (LLMs) perform better when they produce step-by-step, “Chain-ofThought” (CoT) reasoning before answering a question, but it is unclear if the stated reasoning is a faithful… https://t.co/qGLFZb3Y5G https://t.co/gL0JSXRRN3

❤️156

likes

🔁44

retweets

🖼️ Media

View Details View on X ↗

_

AK

@_akhaliq

📅

Tue Jul 18

🆔85542404

Meta releases Llama 2: Open Foundation and Fine-Tuned Chat Models paper: https://t.co/bhG3W56DCW blog: https://t.co/iPHa0PL0DU develop and release Llama 2, a collection of pretrained and fine-tuned large language models (LLMs) ranging in scale from 7 billion to 70 billion… https://t.co/bMAVVNjQbU https://t.co/5Zqk3QAq6m

❤️2,235

likes

🔁599

retweets

🖼️ Media

View Details View on X ↗

Y

Yusuf Mehdi

@yusuf_i_mehdi

📅

Tue Jul 18

🆔87526658

Announcing Bing Chat Enterprise, the AI-powered chat for work with commercial data protection! Now you can take full advantage of Generative AI creativity at work with the confidence your confidential work information won’t leak outside your company. https://t.co/8q47eijGzl https://t.co/OrkZbdLeIi

❤️223

likes

🔁53

retweets

🖼️ Media

View Details View on X ↗

N

Nathan Lambert

@natolambert

📅

Tue Jul 18

🆔51901953

@_akhaliq If you're looking for long form technical analysis of the paper -- ie what you need to actually know, here's my piece: https://t.co/NwDokIYq3W https://t.co/MS3QfMunBi

❤️38

likes

🔁7

retweets

🖼️ Media

View Details View on X ↗

G

Gradio

@Gradio

📅

Mon

🆔26695686

BIG NEWS 🥳🎈 Building Chatbots apps just got wayyy easier: announcing the new 𝙲𝚑𝚊𝚝𝙸𝚗𝚝𝚎𝚛𝚏𝚊𝚌𝚎 class 🙌 The *fastest* way to build to build a Chatbot UI in Python -- including streaming, undo/retry, API, all out of the box! Let's take a look at a few examples... https://t.co/vsoWnmN7uP

❤️478

likes

🔁96

retweets

🖼️ Media

View Details View on X ↗

I

INDRAJEET

@indrajeet877

📅

Mon

🆔93036035

Incredible to see big tech's massive contributions to open-source #AI on @huggingface 🚀🤖 1. @Meta: 689 models including the likes of MusicGen🎵, Galactica🌌, Wav2Vec🎙️, RoBERTa📚! 2. @Google: 591 models powering AI with BERT🦜, Flan🍮, T5🔢, mobilnet📱... 3. @Microsoft: 252… https://t.co/68Xp0DtauI https://t.co/mIVJ7eAC30

❤️204

likes

🔁43

retweets

🖼️ Media

View Details View on X ↗

D

Zara Safsovski

@drfzs

📅

Sat

🆔06058240

First attempt at using #PIKALABS @pika_labs. #aivideo #texttovideo #AIgirl #AIcommunity https://t.co/4wpVHrsaQe https://t.co/Lu5SbJ6ZsE

❤️70

likes

🔁6

retweets

🖼️ Media

View Details View on X ↗

T

Tom Jobbins

@TheBlokeAI

📅

Fri

🆔33173266

The other day I discovered a little environment variable buried in the @huggingface Hub Python docs: 𝙷𝙵_𝙷𝚄𝙱_𝙴𝙽𝙰𝙱𝙻𝙴_𝙷𝙵_𝚃𝚁𝙰𝙽𝚂𝙵𝙴𝚁 It has changed my life! Docs say 2x faster, but in my testing it's 3-5x faster 🚀😍 (and it's just as fast for uploads!) https://t.co/u1XVULnidb

❤️232

likes

🔁49

retweets

🖼️ Media

View Details View on X ↗

E

Enrico Shippole

@EnricoShippole

📅

Fri

🆔20256000

Introducing LLongMA, a series of OpenLLaMA models, trained at 8k context length using linear positional interpolation scaling. The model was trained in collaboration with @theemozilla of @NousResearch and Kaiokendev. https://t.co/uMCy5Da14Z

❤️108

likes

🔁22

retweets

🖼️ Media

View Details View on X ↗

M

merve

@mervenoyann

📅

Fri

🆔65845763

Text-to-Video task page just landed at @huggingface Tasks 🤗 In this page you will learn how you can generate videos from text 🍿🎥 Get started here 👉 https://t.co/rCvsEVSavw 🌊 https://t.co/NQboUEsMTS

❤️170

likes

🔁43

retweets

🖼️ Media

View Details View on X ↗

A

Adina Yakup

@AdeenaY8

📅

Fri

🆔21294080

你收到今天的Daily Papers了吗？👀 订阅👉 https://t.co/W7m6clPrtf https://t.co/RZtmaguAw7

❤️18

likes

🔁4

retweets

🖼️ Media

View Details View on X ↗

O

elvis

@omarsar0

📅

Fri

🆔44404994

Today completes the first cohort of our new course on Prompt Engineering for LLMs. Participants worked on projects like customer support email generation, prompt injection detectors, LLM-based evaluators, and LLM-powered conversational assistants. The discussions in this cohort… https://t.co/6iOdsGzUkp https://t.co/ASaS2XH3tq

❤️182

likes

🔁27

retweets

🖼️ Media

View Details View on X ↗

B

John Backus

@backus

📅

Sat

🆔93516544

The code interpreter feature on ChatGPT is the most mind blowing thing I've seen yet. All I did was upload a CSV of SF crime data and ask it to visualize trends(!!) https://t.co/pkFdPqgAzb

+2 more

❤️2,992

likes

🔁469

retweets

🖼️ Media

View Details View on X ↗

_

AK

@_akhaliq

📅

Fri

🆔22870784

⭐1.00

VideoGLUE: Video General Understanding Evaluation of Foundation Models paper page: https://t.co/Y97nZAXGm9 We evaluate existing foundation models video understanding capabilities using a carefully designed experiment protocol consisting of three hallmark tasks (action… https://t.co/WJxQbZNRc2 https://t.co/sgnEheZKj2

❤️110

likes

🔁25

retweets

🖼️ Media

View Details View on X ↗

_

AK

@_akhaliq

📅

Fri

🆔96118792

mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding paper page: https://t.co/t52BJ76bYv Document understanding refers to automatically extract, analyze and comprehend information from various types of digital documents, such as a web page.… https://t.co/J6lkXD3dSc https://t.co/Xk2P2cUaTX

❤️81

likes

🔁19

retweets

🖼️ Media

View Details View on X ↗

K

Kevin Black

@kvablack

📅

Wed

🆔21289216

The biggest problem with our RL diffusion paper was that nobody could run our Jax+TPU code. No more! I've reimplemented DDPO in PyTorch, plus replicated our results using LoRA for low-memory training! Links below 👇 https://t.co/J7J51mpPot

❤️291

likes

🔁43

retweets

🖼️ Media

View Details View on X ↗

A

Aravind Srinivas

@AravSrinivas

📅

Thu Jul 06

🆔79134977

A lot of users just wanted to just ask on perplexity via their search bar on Chrome thanks to muscle memory. We listened and have it ready now! Install the Perplexity Default Search Chrome Plugin and all your searches are now just Perplexity queries! https://t.co/tTTf2Kh3gv https://t.co/wpa7Qi1ml2

❤️121

likes

🔁14

retweets

🖼️ Media

View Details View on X ↗

G

Gabriel Peyré

@gabrielpeyre

📅

Thu Jul 06

🆔73553153

The hairy ball theorem states that in odd dimension d, vector fields on the tangent plane of a (d-1)-sphere necessarily contain a singular point (where it vanishes). https://t.co/5UNiMnjI42 https://t.co/I6RsLEvzry