Production inference for encoder models with vLLM plugins.
colbert, colpali, GLiNER, GLiNER2 etc. — github.com/ddickmann/vllm-factory
-
fastino/gliner2-large-v1
0.5B • Updated • 344k • 83 -
knowledgator/gliner-x-large
Token Classification • 0.9B • Updated • 407 • 45 -
LiquidAI/LFM2-ColBERT-350M
Sentence Similarity • 0.4B • Updated • 82k • 131 -
VAGOsolutions/SauerkrautLM-Multi-Reason-ModernColBERT
Sentence Similarity • 0.1B • Updated • 69.4k • • 12