LLM Quantization Review

This blog post provides an overview of the fundamental concepts of quantization, as well as a review of mainstream quantization methods in the context of LLMs.

October 2, 2023 · 11 min · Sherlock

SmoothQuant and AWQ

This blog post compares SmoothQuant and AWQ differences and their code implementation.

October 8, 2023 · 12 min · Sherlock