ModernBERT-large-madon-arg-detection

This model is a fine-tuned version of ModernBERT-large for Czech legal argument detection. It was introduced in the paper "Mining Legal Arguments to Study Judicial Formalism" (arXiv:2512.11374).

The model is part of the MADON project, which focuses on detecting and classifying judicial reasoning in Czech court decisions. This specific model corresponds to Task 1 in the paper: detecting whether a paragraph in a legal decision is argumentative or non-argumentative.

Model Description

The model was adapted to the Czech legal domain through continued pretraining on a corpus of over 300,000 court decisions and fine-tuned on the MADON dataset. In the paper's evaluation, this model achieved a Balanced F1 score of 82.6% for argument detection.

Usage

You can use this model with the transformers library to classify a paragraph of a Czech court decision as argumentative or non-argumentative:

from transformers import AutoModelForSequenceClassification, AutoTokenizer, pipeline

model_id = "TrustHLT/ModernBERT-large-madon-arg-detection"

# Load the fine-tuned classifier and its tokenizer from the Hugging Face Hub
model = AutoModelForSequenceClassification.from_pretrained(model_id)
tokenizer = AutoTokenizer.from_pretrained(model_id)

pipe = pipeline("text-classification", model=model, tokenizer=tokenizer)

text = "This is a legal paragraph"  # Replace with a paragraph from a Czech court decision

# Prints a list with one {"label": ..., "score": ...} dict for the paragraph
print(pipe(text))
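
Because the model classifies one paragraph at a time, applying it to a full decision means splitting the text into paragraphs first and then keeping the ones predicted as argumentative. The following is a minimal sketch of that post-processing step, not part of the released code. The helper names (`split_paragraphs`, `select_argumentative`, `stub_classifier`), the blank-line paragraph convention, the label name `"LABEL_1"`, and the 0.5 threshold are all assumptions made for illustration; check `model.config.id2label` for the label names the model actually uses. A stub classifier stands in for the real pipeline so the sketch runs offline.

```python
from typing import Callable, Dict, List

# Any callable that maps a list of texts to one {"label", "score"} dict per text,
# which is the shape a transformers text-classification pipeline returns for list input.
Classifier = Callable[[List[str]], List[Dict[str, object]]]


def split_paragraphs(decision: str) -> List[str]:
    """Split a decision into paragraphs on blank lines (an assumed convention)."""
    normalized = decision.replace("\r\n", "\n")
    return [p.strip() for p in normalized.split("\n\n") if p.strip()]


def select_argumentative(
    paragraphs: List[str],
    classify: Classifier,
    positive_label: str = "LABEL_1",  # hypothetical label name, verify via model.config.id2label
    threshold: float = 0.5,           # hypothetical cut-off on the predicted score
) -> List[str]:
    """Return the paragraphs whose top prediction is the positive label."""
    if not paragraphs:
        return []
    predictions = classify(paragraphs)
    return [
        paragraph
        for paragraph, pred in zip(paragraphs, predictions)
        if pred["label"] == positive_label and float(pred["score"]) >= threshold
    ]


def stub_classifier(texts: List[str]) -> List[Dict[str, object]]:
    """Offline stand-in for the pipeline: flags texts containing 'protože' ('because')."""
    return [
        {"label": "LABEL_1", "score": 0.9}
        if "protože" in t.lower()
        else {"label": "LABEL_0", "score": 0.8}
        for t in texts
    ]


if __name__ == "__main__":
    # Illustrative sentences only, not taken from a real decision.
    decision = (
        "Soud zamítl žalobu.\n\n"
        "Žaloba není důvodná, protože žalobce nedoložil svá tvrzení."
    )
    for paragraph in select_argumentative(split_paragraphs(decision), stub_classifier):
        print(paragraph)
```

To use the real model instead of the stub, pass a wrapper around the pipeline created above, for example `lambda texts: pipe(texts, truncation=True)`, so that paragraphs longer than the model's maximum input length are truncated rather than rejected.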

Citation

If you find this model useful, please cite:

@article{madon2025,
  title={Mining Legal Arguments to Study Judicial Formalism},
  author={Anonymous},
  journal={arXiv preprint arXiv:2512.11374},
  year={2025}
}