AutoGPTQ is a Python library that simplifies GPTQ-based weight quantization for large language models. It reduces model sizes by 4x with minimal accuracy loss, making it possible to run large models on consumer GPUs for inference.
AutoGPTQ — Easy-to-Use GPTQ Quantization for Large Language Models
AutoGPTQ is a Python library that simplifies GPTQ-based weight quantization for large language models. It reduces model sizes by 4x with minimal accuracy loss, making it possible to run large models on consumer GPUs for inference.
Este activo puede ser leído e instalado directamente por agents
TokRepo expone un comando CLI universal, contrato de instalación, metadata JSON, plan según adaptador y contenido raw para que los agents evalúen compatibilidad, riesgo y próximos pasos.
npx tokrepo install 80f47f36-54af-11f1-9bc6-00163e2b0d79Discusión
Activos relacionados
React Hook Form — Performant Forms with Easy Validation
React Hook Form is a performant, flexible and extensible forms library for React with easy-to-use validation. Minimizes re-renders by leveraging uncontrolled components and refs, achieving the fastest form library for React.
D2 — Declarative Diagram Scripting Language
A modern diagram scripting language that turns text into diagrams, offering a readable syntax for architecture, flow, and sequence diagrams rendered from code.
Poetry — Python Packaging and Dependency Management Made Easy
Poetry is a comprehensive tool for Python dependency management and packaging. It handles virtual environments, dependency resolution, lockfiles, and publishing to PyPI — all through a single, intuitive command-line interface.
EasyOCR — Ready-to-Use OCR with 80+ Language Support
A Python library for optical character recognition supporting 80+ languages with a two-line API. Built on PyTorch with CRAFT detection and CRNN recognition.