arxiv:2011.06993

FLERT: Document-Level Features for Named Entity Recognition

Published on Nov 13, 2020

flair

Upvote

Authors:

Stefan Schweter ,

Abstract

Transformer-based models for named entity recognition are evaluated for capturing document-level features, leading to new state-of-the-art scores on CoNLL-03 benchmark datasets.

AI-generated summary

Current state-of-the-art approaches for named entity recognition (NER) typically consider text at the sentence-level and thus do not model information that crosses sentence boundaries. However, the use of transformer-based models for NER offers natural options for capturing document-level features. In this paper, we perform a comparative evaluation of document-level features in the two standard NER architectures commonly considered in the literature, namely "fine-tuning" and "feature-based LSTM-CRF". We evaluate different hyperparameters for document-level features such as context window size and enforcing document-locality. We present experiments from which we derive recommendations for how to model document context and present new state-of-the-art scores on several CoNLL-03 benchmark datasets. Our approach is integrated into the Flair framework to facilitate reproduction of our experiments.

View arXiv page View PDF GitHub 14.4k Add to collection

Community

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment

Upvote

Get this paper in your agent:

hf papers read 2011.06993

Don't have the latest CLI?

curl -LsSf https://hf.co/cli/install.sh | bash

Models citing this paper 25

Browse 25 models citing this paper

FLERT: Document-Level Features for Named Entity Recognition

Abstract

Community

Models citing this paper 25

Datasets citing this paper 1

Spaces citing this paper 41

Collections including this paper 1