Generative AI

Fast, Secure and Reliable: Enterprise-grade LLM Inference

After a whirlwind year of developments in 2023, many enterprises are eager to adopt increasingly capable generative AI models to supercharge their...
Generative AI

Integrating NVIDIA TensorRT-LLM with the Databricks Inference Stack

Over the past six months, we've been working with NVIDIA to get the most out of their new TensorRT-LLM library. TensorRT-LLM provides an easy-to-use Python interface that integrates with a web server for fast, efficient LLM inference. In this post, we highlight some key areas where our collaboration with NVIDIA has been particularly important.
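For a feel of that Python interface, here is a minimal sketch using TensorRT-LLM's high-level LLM API; the model name, prompts, and sampling settings are illustrative placeholders, not details from the post.

```python
# Minimal sketch of TensorRT-LLM's high-level Python API
# (assumes the tensorrt_llm package is installed). The model name
# and sampling settings are illustrative, not from the post.
from tensorrt_llm import LLM, SamplingParams

# Build/load a TensorRT engine for the model and prepare it for inference.
llm = LLM(model="TinyLlama/TinyLlama-1.1B-Chat-v1.0")

sampling_params = SamplingParams(temperature=0.8, top_p=0.95)

# Batched generation; each result carries the generated text.
for output in llm.generate(
    ["What is a Mixture of Experts model?", "Explain KV caching."],
    sampling_params,
):
    print(output.outputs[0].text)
```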
Engineering blog

Introducing Mixtral 8x7B with Databricks Model Serving

Today, Databricks is excited to announce support for Mixtral 8x7B in Model Serving. Mixtral 8x7B is a sparse Mixture of Experts (MoE)...
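As a rough sketch of what querying such an endpoint looks like, the snippet below sends a chat-style request to a Databricks Model Serving endpoint over REST; the workspace URL and endpoint name are placeholder assumptions, not taken from the announcement.

```python
# Hedged sketch of querying a Mixtral 8x7B endpoint on Databricks
# Model Serving over REST. The workspace URL and endpoint name below
# are placeholders; the chat-style payload assumes an instruct variant.
import os
import requests

WORKSPACE_URL = "https://<your-workspace>.cloud.databricks.com"  # placeholder
ENDPOINT = "databricks-mixtral-8x7b-instruct"  # assumed endpoint name

response = requests.post(
    f"{WORKSPACE_URL}/serving-endpoints/{ENDPOINT}/invocations",
    headers={"Authorization": f"Bearer {os.environ['DATABRICKS_TOKEN']}"},
    json={
        "messages": [
            {"role": "user", "content": "What is a Mixture of Experts model?"}
        ],
        "max_tokens": 128,
    },
)
response.raise_for_status()
print(response.json())
```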
Generative AI

LLM Inference Performance Engineering: Best Practices

In this blog post, the MosaicML engineering team shares best practices for how to capitalize on popular open source large language models (LLMs)...
Generative AI

Mosaic LLMs: GPT-3 quality for <$500k

September 29, 2022 by Abhi Venigalla and Linden Li in Mosaic Research
Training large language models (LLMs) costs less than you think. Using the MosaicML platform, we show how fast, cheap, and easy it is...
Generative AI

Mosaic LLMs (Part 1): Billion-Parameter GPT Training Made Easy

August 11, 2022 by Abhi Venigalla and Linden Li in Mosaic Research
In Part 1 of this LLM blog post series, we use the MosaicML platform to train vanilla GPT-3 models up to 1.3B params...