sentence-transformers/multi-qa-MiniLM-L6-cos-v1
🧠 AI Model · sentence-transformers
A high-performance, lightweight BERT-based model optimized for semantic search and multi-QA retrieval tasks.
The multi-qa-MiniLM-L6-cos-v1 model is a MiniLM variant fine-tuned for multi-QA retrieval: it embeds questions and passages so that a question lands close to the passages that answer it. Its 6-layer transformer structure balances embedding quality against inference latency. Training pulls the embeddings of questions and their corresponding answers together under cosine similarity, so relevant documents tend to rank higher in search results. The model is available in PyTorch, TensorFlow, ONNX, and Safetensors formats, which eases integration into different production environments. Its small footprint makes it well suited to edge deployment and to high-throughput search pipelines where computational resources are constrained.
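The retrieval flow described above (embed the question, embed the candidate passages, rank by cosine similarity) can be sketched roughly as follows. This is a minimal illustration, not the model authors' reference code. The helper names `cosine_similarity`, `rank_passages`, `search`, `load_encoder`, and `demo` are hypothetical and introduced only for this example, and `load_encoder` assumes the third-party `sentence-transformers` package is installed. The ranking logic itself uses only the standard library, so it can be tried with any function that maps texts to vectors.

```python
import math
from typing import Callable, List, Sequence, Tuple

MODEL_NAME = "sentence-transformers/multi-qa-MiniLM-L6-cos-v1"

# Hypothetical type alias: anything that turns a list of texts into vectors.
Encoder = Callable[[List[str]], List[List[float]]]


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine of the angle between two vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def rank_passages(
    query_vec: Sequence[float], passage_vecs: Sequence[Sequence[float]]
) -> List[Tuple[int, float]]:
    """Return (passage index, score) pairs, most similar first."""
    scored = [(i, cosine_similarity(query_vec, v)) for i, v in enumerate(passage_vecs)]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


def search(query: str, passages: Sequence[str], encode: Encoder) -> List[Tuple[str, float]]:
    """Embed the query and passages, then rank passages by cosine similarity."""
    query_vec = encode([query])[0]
    passage_vecs = encode(list(passages))
    return [(passages[i], score) for i, score in rank_passages(query_vec, passage_vecs)]


def load_encoder(model_name: str = MODEL_NAME) -> Encoder:
    """Wrap the model as an Encoder. Assumes `pip install sentence-transformers`."""
    from sentence_transformers import SentenceTransformer  # third-party import

    model = SentenceTransformer(model_name)  # downloads weights on first use
    return lambda texts: model.encode(texts).tolist()


def demo() -> None:
    """Not called automatically: running it downloads the model."""
    passages = [
        "London has a population of around nine million people.",
        "London is known for its financial district.",
    ]
    for passage, score in search("How many people live in London?", passages, load_encoder()):
        print(f"{score:.3f}  {passage}")
```

Calling `demo()` prints the passages with their similarity scores, highest first. For larger collections, a vectorized similarity computation or an approximate nearest-neighbor index would likely be a better fit than this pure-Python loop.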
💡 Highlights
- 6-layer MiniLM architecture
- Optimized for semantic search
- Multi-format support (ONNX, PyTorch, TensorFlow)
🎯 For
- NLP Engineers
- Search Developers