@jerryjliu0
The recently released @huggingface text-embeddings-inference server is game-changing: Get production-scale serving w/ distributed tracing for any BERT model, and it’s blazing fast⚡️ What else is blazing fast? @LoganMarkewich adding the @llama_index integration in an hour 👇: https://t.co/oDpb4OY26U