@togethercompute
New from Together Research: Aurora. Speculative decoding that adapts to shifting traffic in real time and keeps improving the longer it runs. Open-source, RL-based, 1.25x faster than a well-trained static speculator, with no offline retraining pipeline. Thread 🧵 https://t.co/xhevA0wnYt