Tag:gptq
All the articles with the tag "gptq".
GPTQ Math Derivation
Posted on:September 9, 2023 at 12:00 AMThis blog post traces the development of GPTQ, starting from its roots in OBD, through OBS, and finally to OBC.
GPTQ Code Implementation
Posted on:September 18, 2023 at 12:00 AMThis blog post delved into the code implementation of the GPTQ quantization process, using the Llama model as a case study.
LLM Quantization Review
Posted on:October 2, 2023 at 12:00 AMThis blog post provides an overview of the fundamental concepts of quantization, as well as a review of mainstream quantization methods in the context of LLMs.