Production inference for encoder models with vLLM plugins.
Covers ColBERT, ColPali, GLiNER, GLiNER2, and similar models. See github.com/ddickmann/vllm-factory
- fastino/gliner2-large-v1
  0.5B • 356k • 83
- knowledgator/gliner-x-large
  Token Classification • 0.9B • 409 • 45
- LiquidAI/LFM2-ColBERT-350M
  Sentence Similarity • 0.4B • 77.1k • 131
- VAGOsolutions/SauerkrautLM-Multi-Reason-ModernColBERT
  Sentence Similarity • 0.1B • 75.8k • 13