view article Article Binary and Scalar Embedding Quantization for Significantly Faster & Cheaper Retrieval +1 aamirshakir, tomaarsen, SeanLee97 โข Mar 22, 2024 โข 135
view article Article Making LLMs lighter with AutoGPTQ and transformers +4 marcsun13, fxmarty, PanEa, qwopqwop, ybelkada, TheBloke โข Aug 23, 2023 โข 64
view article Article Introduction to Quantization cooked in ๐ค with ๐๐งโ๐ณ merve โข Aug 25, 2023 โข 40