Lorbus/Qwen3.6-27B-int4-AutoRound

🧠 AI ModelLorbus

High-performance 4-bit quantized Qwen3.6-27B model optimized for efficient image-to-text inference.

The Lorbus/Qwen3.6-27B-int4-AutoRound model represents a significant step forward in making large-scale vision-language models accessible. By utilizing AutoRound quantization (w4g128, w4a16), the model compresses the original 27B parameter architecture into a 4-bit format. This technical optimization allows the model to run on hardware with significantly less VRAM than the full-precision version, while the AutoRound algorithm ensures that the degradation in model intelligence is kept to an absolute minimum. It supports the transformers and safetensors formats, ensuring seamless integration with modern AI stacks like vLLM. This model is specifically tuned for complex image-text-to-text pipelines, enabling advanced multimodal reasoning, image captioning, and visual question answering in a lightweight, high-speed package.

💡Highlights

├─4-bit AutoRound quantization
├─Optimized w4g128, w4a16 weights
└─High-speed image-to-text inference

🎯For

├─AI Researchers
├─ML Engineers
└─Edge Computing Developers

🔗Links

└─HuggingFace Repository