Brian Chu

Brian Chu's posts

Generative AI

September 19, 2024 / 4 min read

Fine-tuning Llama 3.1 with Long Sequences

Mosaic Research

July 1, 2024 / Less than a minute

Training MoEs at Scale with PyTorch and Databricks