Mariusj G
MariusjG
AI & ML interests
None yet
Organizations
None yet
LLM Papers
- DeBERTa: Decoding-enhanced BERT with Disentangled Attention
  Paper • 2006.03654 • Published • 3
- BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
  Paper • 1810.04805 • Published • 29
- RoBERTa: A Robustly Optimized BERT Pretraining Approach
  Paper • 1907.11692 • Published • 10
- Language Models are Few-Shot Learners
  Paper • 2005.14165 • Published • 20
OCR
models 0
None public yet
datasets 0
None public yet