Papers
arxiv:2011.06993

FLERT: Document-Level Features for Named Entity Recognition

Published on Nov 13, 2020
Authors:

Abstract

Transformer-based models for named entity recognition are evaluated for capturing document-level features, leading to new state-of-the-art scores on CoNLL-03 benchmark datasets.

AI-generated summary

Current state-of-the-art approaches for named entity recognition (NER) typically consider text at the sentence-level and thus do not model information that crosses sentence boundaries. However, the use of transformer-based models for NER offers natural options for capturing document-level features. In this paper, we perform a comparative evaluation of document-level features in the two standard NER architectures commonly considered in the literature, namely "fine-tuning" and "feature-based LSTM-CRF". We evaluate different hyperparameters for document-level features such as context window size and enforcing document-locality. We present experiments from which we derive recommendations for how to model document context and present new state-of-the-art scores on several CoNLL-03 benchmark datasets. Our approach is integrated into the Flair framework to facilitate reproduction of our experiments.

Community

Sign up or log in to comment

Get this paper in your agent:

hf papers read 2011.06993
Don't have the latest CLI?
curl -LsSf https://hf.co/cli/install.sh | bash

Models citing this paper 25

Browse 25 models citing this paper

Datasets citing this paper 1

Spaces citing this paper 41

Collections including this paper 1