artefactory 's Collections

MLM versus CLM for NLP tasks

Related paper: "Should We Still Pretrain Encoders with Masked Language Modeling?"