🧶Hi, I'm Sherlock!

Greetings, readers, welcome to my blog!

I am Sherlock, a research engineer specializing in LLMs, computer vision, and deep learning. Recently, I have been delving into various aspects of LLMs, including pre-training, SFT, RLHF, and the related infrastructure.

Featured

LLM Quantization Review
Posted on: October 2, 2023 at 12:00 AM
This blog post provides an overview of the fundamental concepts of quantization, as well as a review of mainstream quantization methods in the context of LLMs.

Recent Posts

8-bit KV Cache
Posted on: January 24, 2024 at 12:00 AM
This blog post introduces KV cache quantization for LLM inference.
SmoothQuant and AWQ
Posted on: October 8, 2023 at 12:00 AM
This blog post compares *SmoothQuant* and *AWQ*, covering their differences and their code implementations.
LLM Quantization Review
Posted on: October 2, 2023 at 12:00 AM
This blog post provides an overview of the fundamental concepts of quantization, as well as a review of mainstream quantization methods in the context of LLMs.
Triton Tutorial #2
Posted on: September 20, 2023 at 12:00 AM
Third blog post of the Triton tutorial series: GEMM and autotune.

All Posts
