Oliver Pfaffel
OliP
AI & ML interests
None yet
Organizations
2024 Papers of the year
LLM Deployment
- Paused272
Llm Pricing
π272Display a React app with TypeScript
- RunningFeatured1.05k
Can You Run It? LLM version
π1.05kCheck if your GPU can run a chosen LLM model
-
Towards Efficient Generative Large Language Model Serving: A Survey from Algorithms to Systems
Paper β’ 2312.15234 β’ Published β’ 3 -
EfficientQAT: Efficient Quantization-Aware Training for Large Language Models
Paper β’ 2407.11062 β’ Published β’ 10
Long-Context
-
LazyLLM: Dynamic Token Pruning for Efficient Long Context LLM Inference
Paper β’ 2407.14057 β’ Published β’ 46 -
ChatQA 2: Bridging the Gap to Proprietary LLMs in Long Context and RAG Capabilities
Paper β’ 2407.14482 β’ Published β’ 26 -
NeedleBench: Can LLMs Do Retrieval and Reasoning in 1 Million Context Window?
Paper β’ 2407.11963 β’ Published β’ 44
Special LMs <10B
-
Salesforce/xLAM-1b-fc-r
Text Generation β’ 1B β’ Updated β’ 2.6k β’ 59 -
AI-MO/NuminaMath-7B-TIR
Text Generation β’ 7B β’ Updated β’ 282 β’ 351 -
google/shieldgemma-9b
Text Generation β’ 9B β’ Updated β’ 4.82k β’ β’ 28 -
meta-llama/Llama-Guard-3-8B
Text Generation β’ 8B β’ Updated β’ 65.8k β’ β’ 298
Evaluation
-
Self-Taught Evaluators
Paper β’ 2408.02666 β’ Published β’ 28 -
Michelangelo: Long Context Evaluations Beyond Haystacks via Latent Structure Queries
Paper β’ 2409.12640 β’ Published β’ 4 -
openai/MMMLU
Viewer β’ Updated β’ 393k β’ 11.7k β’ 522 -
HelloBench: Evaluating Long Text Generation Capabilities of Large Language Models
Paper β’ 2409.16191 β’ Published β’ 41
Coding
-
SciCode: A Research Coding Benchmark Curated by Scientists
Paper β’ 2407.13168 β’ Published β’ 17 -
OpenDevin: An Open Platform for AI Software Developers as Generalist Agents
Paper β’ 2407.16741 β’ Published β’ 78 -
CodexGraph: Bridging Large Language Models and Code Repositories via Code Graph Databases
Paper β’ 2408.03910 β’ Published β’ 18 -
Diversity Empowers Intelligence: Integrating Expertise of Software Engineering Agents
Paper β’ 2408.07060 β’ Published β’ 41
Leading Leaderboards
- Running on CPU Upgrade14k
Open LLM Leaderboard
π14kTrack, rank and evaluate open LLMs and chatbots
- Running on CPU Upgrade7.43k
MTEB Leaderboard
π₯7.43kEmbedding Leaderboard
- Running4.9k
Arena Leaderboard
π4.9kView the LMArena leaderboard in fullβscreen
- RunningAgents230
BigCodeBench Leaderboard
π₯230Explore code-generation model leaderboards and task details
2023 (and before) Papers of the Year
-
Hiera: A Hierarchical Vision Transformer without the Bells-and-Whistles
Paper β’ 2306.00989 β’ Published β’ 1 -
Direct Preference Optimization: Your Language Model is Secretly a Reward Model
Paper β’ 2305.18290 β’ Published β’ 66 -
Scalable Diffusion Models with Transformers
Paper β’ 2212.09748 β’ Published β’ 17 -
Matryoshka Representation Learning
Paper β’ 2205.13147 β’ Published β’ 27
Vision-Language
-
EVLM: An Efficient Vision-Language Model for Visual Understanding
Paper β’ 2407.14177 β’ Published β’ 45 -
ChartGemma: Visual Instruction-tuning for Chart Reasoning in the Wild
Paper β’ 2407.04172 β’ Published β’ 25 -
facebook/chameleon-7b
Image-Text-to-Text β’ 7B β’ Updated β’ 238k β’ 201 -
vidore/colpali
Visual Document Retrieval β’ Updated β’ 6.25k β’ 479
Audio
πΆοΈ Spaces
Applications
-
Integrating Large Language Models into a Tri-Modal Architecture for Automated Depression Classification
Paper β’ 2407.19340 β’ Published β’ 58 -
MedTrinity-25M: A Large-scale Multimodal Dataset with Multigranular Annotations for Medicine
Paper β’ 2408.02900 β’ Published β’ 31 -
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery
Paper β’ 2408.06292 β’ Published β’ 128
NewGen small LMs
Leading Leaderboards
- Running on CPU Upgrade14k
Open LLM Leaderboard
π14kTrack, rank and evaluate open LLMs and chatbots
- Running on CPU Upgrade7.43k
MTEB Leaderboard
π₯7.43kEmbedding Leaderboard
- Running4.9k
Arena Leaderboard
π4.9kView the LMArena leaderboard in fullβscreen
- RunningAgents230
BigCodeBench Leaderboard
π₯230Explore code-generation model leaderboards and task details
2024 Papers of the year
2023 (and before) Papers of the Year
-
Hiera: A Hierarchical Vision Transformer without the Bells-and-Whistles
Paper β’ 2306.00989 β’ Published β’ 1 -
Direct Preference Optimization: Your Language Model is Secretly a Reward Model
Paper β’ 2305.18290 β’ Published β’ 66 -
Scalable Diffusion Models with Transformers
Paper β’ 2212.09748 β’ Published β’ 17 -
Matryoshka Representation Learning
Paper β’ 2205.13147 β’ Published β’ 27
LLM Deployment
- Paused272
Llm Pricing
π272Display a React app with TypeScript
- RunningFeatured1.05k
Can You Run It? LLM version
π1.05kCheck if your GPU can run a chosen LLM model
-
Towards Efficient Generative Large Language Model Serving: A Survey from Algorithms to Systems
Paper β’ 2312.15234 β’ Published β’ 3 -
EfficientQAT: Efficient Quantization-Aware Training for Large Language Models
Paper β’ 2407.11062 β’ Published β’ 10
Vision-Language
-
EVLM: An Efficient Vision-Language Model for Visual Understanding
Paper β’ 2407.14177 β’ Published β’ 45 -
ChartGemma: Visual Instruction-tuning for Chart Reasoning in the Wild
Paper β’ 2407.04172 β’ Published β’ 25 -
facebook/chameleon-7b
Image-Text-to-Text β’ 7B β’ Updated β’ 238k β’ 201 -
vidore/colpali
Visual Document Retrieval β’ Updated β’ 6.25k β’ 479
Long-Context
-
LazyLLM: Dynamic Token Pruning for Efficient Long Context LLM Inference
Paper β’ 2407.14057 β’ Published β’ 46 -
ChatQA 2: Bridging the Gap to Proprietary LLMs in Long Context and RAG Capabilities
Paper β’ 2407.14482 β’ Published β’ 26 -
NeedleBench: Can LLMs Do Retrieval and Reasoning in 1 Million Context Window?
Paper β’ 2407.11963 β’ Published β’ 44
Audio
Special LMs <10B
-
Salesforce/xLAM-1b-fc-r
Text Generation β’ 1B β’ Updated β’ 2.6k β’ 59 -
AI-MO/NuminaMath-7B-TIR
Text Generation β’ 7B β’ Updated β’ 282 β’ 351 -
google/shieldgemma-9b
Text Generation β’ 9B β’ Updated β’ 4.82k β’ β’ 28 -
meta-llama/Llama-Guard-3-8B
Text Generation β’ 8B β’ Updated β’ 65.8k β’ β’ 298
πΆοΈ Spaces
Evaluation
-
Self-Taught Evaluators
Paper β’ 2408.02666 β’ Published β’ 28 -
Michelangelo: Long Context Evaluations Beyond Haystacks via Latent Structure Queries
Paper β’ 2409.12640 β’ Published β’ 4 -
openai/MMMLU
Viewer β’ Updated β’ 393k β’ 11.7k β’ 522 -
HelloBench: Evaluating Long Text Generation Capabilities of Large Language Models
Paper β’ 2409.16191 β’ Published β’ 41
Applications
-
Integrating Large Language Models into a Tri-Modal Architecture for Automated Depression Classification
Paper β’ 2407.19340 β’ Published β’ 58 -
MedTrinity-25M: A Large-scale Multimodal Dataset with Multigranular Annotations for Medicine
Paper β’ 2408.02900 β’ Published β’ 31 -
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery
Paper β’ 2408.06292 β’ Published β’ 128
Coding
-
SciCode: A Research Coding Benchmark Curated by Scientists
Paper β’ 2407.13168 β’ Published β’ 17 -
OpenDevin: An Open Platform for AI Software Developers as Generalist Agents
Paper β’ 2407.16741 β’ Published β’ 78 -
CodexGraph: Bridging Large Language Models and Code Repositories via Code Graph Databases
Paper β’ 2408.03910 β’ Published β’ 18 -
Diversity Empowers Intelligence: Integrating Expertise of Software Engineering Agents
Paper β’ 2408.07060 β’ Published β’ 41