Your curated collection of saved posts and media

Showing 32 posts · last 14 days · by score
H
HelloSurgeAI
@HelloSurgeAI
📅
Wed
🆔17795965

What is SFT data and what role does it play in state-of-the-art LLMs?
Supervised finetuning (SFT) in the context of RLHF deals with further tuning an initial language model using demonstration data. At Surge AI, we provide SFT data for top LLM teams to finetune their LLMs. Here is what we have observed:
SFT data typically involves collecting demonstration data: prompts paired with in-depth responses written by human annotators, demonstrating how the model should respond to each prompt. Specifically, you take a set of commands and obtain a human-written response for each. The SFT training dataset consists of <prompt, ideal generation> pairs used to finetune the pre-trained LLM to output human-like responses.
So let’s say you are aligning an LLM-powered dialogue system: you then need to collect dialogue-style instruction/response data that caters to that use case. Similarly, as shown in the figure, if you want high-quality code generation capabilities, you can provide instructions plus written responses as part of the SFT data. This leads to the first important component for training an RLHF LLM, also referred to as the supervised policy.
But why go through all this process when training RLHF LLMs? The core idea of SFT is to provide a high-quality initialization for the RLHF process. It’s widely applied by some of the most advanced closed and open-source LLMs.
To make this work, you need to collect lots of demonstration data, and the challenge is collecting high-quality, diverse demonstration data at scale. SFT data is written by different annotators and can carry a lot of noise, since response quality and style vary from annotator to annotator. Controlling for this is key. According to reported insights, you will need to collect thousands of examples to ensure you are tuning a high-quality LLM.
SFT data helps improve targeted areas, letting you steer the LLM toward your needs. We can help with your SFT data needs!
If you need help with collecting high-quality preference or SFT data, reach out to our team here: https://t.co/OSm4aHIOP6
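As a rough illustration of the <prompt, ideal generation> pairs described in the post, here is a minimal Python sketch. It is not Surge AI's pipeline: the `SFTExample` class, the `annotator_id` field, the `clean_examples` helper, and the 20-character threshold are all assumptions made for the example. It shows one simple way to drop near-empty and duplicate demonstrations before writing the data out as JSONL.

```python
import json
from dataclasses import dataclass, asdict


@dataclass(frozen=True)
class SFTExample:
    """One demonstration: a prompt and a human-written ideal response."""
    prompt: str
    response: str
    annotator_id: str  # hypothetical field for tracking per-annotator variation


def clean_examples(examples, min_response_chars=20):
    """Drop near-empty responses and exact duplicate (prompt, response) pairs.

    The length threshold is an arbitrary assumption, not a recommended value.
    """
    seen = set()
    kept = []
    for ex in examples:
        prompt = ex.prompt.strip()
        response = ex.response.strip()
        if not prompt or len(response) < min_response_chars:
            continue  # skip empty prompts and very short responses
        key = (prompt, response)
        if key in seen:
            continue  # skip exact duplicates, keeping the first occurrence
        seen.add(key)
        kept.append(SFTExample(prompt, response, ex.annotator_id))
    return kept


def to_jsonl(examples):
    """Serialize examples as one JSON object per line."""
    return "\n".join(json.dumps(asdict(ex), ensure_ascii=False) for ex in examples)


raw = [
    SFTExample("Write a Python function that reverses a string.",
               "def reverse(s):\n    return s[::-1]", "a1"),
    SFTExample("Write a Python function that reverses a string.",
               "def reverse(s):\n    return s[::-1]", "a2"),  # duplicate pair
    SFTExample("Explain what RLHF stands for.", "idk", "a3"),  # too short
]

cleaned = clean_examples(raw)
print(to_jsonl(cleaned))
```

Real quality control for annotator noise would likely go well beyond this (style guidelines, review passes, agreement checks); the sketch only shows the shape of the data and a first filtering pass.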

Media 1
🖼️ Media
H
HelloSurgeAI
@HelloSurgeAI
📅
Fri
🆔02866662

Making LLMs reliable is a tough task, but it is where a lot of LLM research and development work is focused. Let's take a look at how LLMs are made reliable.
At Surge AI, we work with the top AI companies to improve LLM reliability. This effort is essential to enable wider applicability in even higher-stakes domains. Reliability is not only about getting models to output what users want in terms of specifics and quality, but also about ensuring that the model produces no unwanted output (e.g., toxic content).
Many current efforts to increase reliability rely on ad-hoc approaches and prompt engineering. More recently, there have been efforts to develop a more systematic framework for improving reliability while training models. This has led to a lot of interest in red teaming.
Red teaming deals with identifying risks in LLMs through adversarial prompting. It has been applied not only to general-purpose LLMs like Claude and ChatGPT but also to more recent code LLMs like Code Llama.
The challenge with red teaming is that, if not done right, it can lead LLMs to over-refuse, resulting in a bad user experience. In addition, red teaming requires deep expertise in working with LLMs.
We deeply believe that comprehensive red teaming is critical to making LLMs safer, more useful, and more reliable. But you don't need to hear this from us: many large LLM companies have also publicly expressed huge interest in red teaming.
If you are looking for deep expertise in training LLMs and red teaming, reach out to learn how our world-class team can help: https://t.co/iFBrffKYKT

Media 1
🖼️ Media
H
HelloSurgeAI
@HelloSurgeAI
📅
Tue Jul 23
🆔41854468
⭐0.38

🦙 Congrats Llama 3! 🦙 Frontier LLM developers know the only human data that brings them to the top 😎 https://t.co/3GcEWMpugq

Media 1
🖼️ Media
H
HelloSurgeAI
@HelloSurgeAI
📅
Fri
🆔11381191

Olympics? Forget it. There's a more exciting race going on 😎 Congrats to all our friends at Google! https://t.co/amLWRxCIjZ

Media 1
🖼️ Media
H
HelloSurgeAI
@HelloSurgeAI
📅
Mon
🆔86446510

Small, focused teams can achieve incredible things — very proud of what we’ve built! https://t.co/f1lIkCYOkG https://t.co/7h8jmN8Ckw

Media 1
🖼️ Media
H
HelloSurgeAI
@HelloSurgeAI
📅
Tue Jun 24
🆔46226300

Nice words from @timoreilly! https://t.co/U6TRiV2TND

Media 1
🖼️ Media
🔁ai_fast_track retweeted
T
Tom Dörr
@tom_doerr
📅
Aug 03, 2025
271d ago
🆔89140394

open source platform for logs, metrics, and traces https://t.co/TCvn2wYXSe

Media 1
❤️698
likes
🔁80
retweets
🖼️ Media
🔁ai_fast_track retweeted
G
GitHub Projects Community
@GithubProjects
📅
Aug 10, 2025
264d ago
🆔64626783

Open source alternative to AWS. https://t.co/bsMbIzSOZ8

Media 1
❤️1,350
likes
🔁121
retweets
🖼️ Media
1
1000wattbrian
@1000wattbrian
📅
Thu Mar 21
🆔08415117

Well, there just goes a whole lotta votes… https://t.co/L9NfzathuG

Media 1
🖼️ Media
1
1000wattbrian
@1000wattbrian
📅
Sun
🆔38153097

Wow. Ok. https://t.co/zgO725MAba

Media 1
🖼️ Media
S
Shirleysway
@Shirleysway
📅
Wed
🆔66985741

Drawing day - Get your tickets! Go to https://t.co/BbJjbIA21S for a chance at #HALF of a projected $14,500. Drawing tonight at 6. #ShirleysWay Lic#Org2527 Watch LIVE on our Joker's Wild LIVE FB page https://t.co/mVbUXWDQXG

Media 1
🖼️ Media
S
Shirleysway
@Shirleysway
📅
Wed
🆔39127046

2 days left to get tickets at https://t.co/BAqsFBIKgV. You could win #HALF of a projected $24,000 in prize money. #ShirleysWay Lic#Org2527 Watch LIVE on our Wheel of Cash LIVE FB page https://t.co/zC7JWM1Rnk

Media 1
🖼️ Media
S
Shirleysway
@Shirleysway
📅
Thu Apr 14
🆔70377473

1 day left to get tickets at https://t.co/BAqsFBIKgV. You could win #HALF of a projected $24,000 in prize money. #ShirleysWay Lic#Org2527 Watch LIVE on our Wheel of Cash LIVE FB page https://t.co/3lvN7xSJ6J

Media 1
🖼️ Media
S
Shirleysway
@Shirleysway
📅
Fri
🆔69627141

Drawing day! Get your tickets at https://t.co/BAqsFBIKgV. You could win #HALF of a projected $24,000 in prize money. Drawing tonight at 5! #ShirleysWay Lic#Org2527 Watch LIVE on our Wheel of Cash LIVE FB page https://t.co/u5ockA9L1i

Media 1
🖼️ Media
S
Shirleysway
@Shirleysway
📅
Mon
🆔52465920

Drawing Day - Get your tickets online at https://t.co/Fff8vgqRdJ . You could win #HALF of a projected $52,000. Drawing tonight at 8:30! Lic#Org2527 We will be at South End BBQ Watch LIVE on our Queen of Hearts LIVE FB page https://t.co/paY62Ss678

Media 1
🖼️ Media
S
Shirleysway
@Shirleysway
📅
Mon
🆔36415748

Drawing Day - Get your tickets online at https://t.co/Fff8vgqRdJ . You could win #HALF of a projected $52,000. Drawing tonight at 8:30! Lic#Org2527 We will be at South End BBQ Watch LIVE on our Queen of Hearts LIVE FB page https://t.co/YNJiBVIaeN

Media 1
🖼️ Media
S
Shirleysway
@Shirleysway
📅
Wed
🆔20726784

Drawing day - Get your tickets! Go to https://t.co/BbJjbIA21S for a chance at #HALF of a projected $16,000. Drawing tonight at 6. #ShirleysWay Lic#Org2527 Watch LIVE on our Joker's Wild LIVE FB page https://t.co/7ULQSaX4Ai

Media 1
🖼️ Media
N
NaderLikeLadder
@NaderLikeLadder
📅
Fri
🆔57688271

Nader Summer camp: I corralled the kids from bring your child to work day and taught them how to vibe code on Replit! 🤩🤙 They designed and built a video game while their parents had “meetings” https://t.co/lqeaKmhZcc

Media 1
🖼️ Media
🔁NaderLikeLadder retweeted
N
Natalie Khalil
@natalienkhalil
📅
Aug 04, 2025
270d ago
🆔34255381
⭐0.34

All in a day's work https://t.co/sWURFlRz8P

Media 1
❤️10
likes
🔁1
retweets
🖼️ Media
N
NaderLikeLadder
@NaderLikeLadder
📅
Fri
🆔19104171
⭐0.41

Great week for AI. Couple of OSS models, one that can run on edge, and GPT 5. Gonna be a busy weekend 🤙 https://t.co/shimU9HMTC

Media 1
🖼️ Media
N
NaderLikeLadder
@NaderLikeLadder
📅
Fri
🆔84478176

I found the most dedicated employee in the NVIDIA parking garage https://t.co/0p2Q96qmDe

Media 1
🖼️ Media
T
tessalau
@tessalau
📅
Fri
🆔83509046

I'm 100% in favor of purpose-built robots. While AI could enable humanoids that do lots of things imperfectly, specialized form factors are the only path to a sustainable robotics business. Thanks to everyone who turned out to see Dusty at the Cerebral Valley AI Summit. https://t.co/fM8Hn7sgPw

Media 1
🖼️ Media
J
JoshuaRosenthal
@JoshuaRosenthal
📅
Sun
🆔16425200

Crypto unlocks vocation - aka doing what you like and are good at: @BAXUSco https://t.co/MrXoaUuJlD https://t.co/zyP6q2RKqj

Media 1
🖼️ Media
J
JoshuaRosenthal
@JoshuaRosenthal
📅
Fri
🆔35236790

and if you want to get a bit 'meta': https://t.co/0JamV4pIMJ

Media 1
🖼️ Media
J
JoshuaRosenthal
@JoshuaRosenthal
📅
Fri
🆔36445914

History, not as a litany of facts, but as a window from the past into the present and plausible futures. Braudel and the Annales school are where it's at. Related / an intro for folks: https://t.co/J2LcKazs5K

@j_amesmarriott • Sun Jul 13 08:25

This is amazing. I'm learning to accept that I fundamentally just don't care that much about battles, diplomatic treaties etc. But if I can read about birth rates or why people started drinking coffee or whatever I'm transfixed https://t.co/hEhJEBbdLY

Media 1
🖼️ Media
A
acossta
@acossta
📅
Tue Jul 29
🆔81595767

It happens in better families https://t.co/DMUqPXSnel

Media 1
🖼️ Media
A
acossta
@acossta
📅
Fri
🆔63791142

Anthropic Tool Caching in AI SDK v5 https://t.co/G3eaQpncmL

Media 1
🖼️ Media
A
acossta
@acossta
📅
Wed
🆔21851511

New in-product feedback page. Simple, clean, effective. Goes directly to a Slack channel we read. https://t.co/JDzDPWkGFz

Media 1
🖼️ Media
A
acossta
@acossta
📅
Mon
🆔55559995

Productive couple of days https://t.co/Ox5DRUAdkh

Media 1
Media 2
🖼️ Media
M
marouen19
@marouen19
📅
Sun
🆔50417113

This agent has been actively managing my Base AI portfolio. Forgot about it. Checked in today. It’s up 19%. Good stimmy https://t.co/zPJOtzt0gO

@0x_WolfXBT • Sat Jul 26 22:33

Rebalanced: ETH remains the largest position. DEGEN fully exited. Increased GIZA, KTA, VIRTUAL on strong price trends. Trimmed weaker tokens. USDbC set at 20% for capital defense. Monitoring for further momentum among top ranked tokens.

Media 1
🖼️ Media
M
marouen19
@marouen19
📅
Fri
🆔58759347

It’s on August 16, my first boxing fight. How to create a market on @Polymarket? https://t.co/ACJ5AhT04j

Media 1
🖼️ Media
M
marouen19
@marouen19
📅
Fri
🆔89207374

Breaking: GPT 5 still can’t draw proper charts. Accenture consultants safe for a few more months https://t.co/tJWGjRcGI2

Media 1
🖼️ Media