Model description:

Model: microsoft/mdeberta-v3-base

Dataset: TASTEset

Unshuffled ratio: ['0']

Shuffled ratio: ['1']

Best exact match epoch: 8

Best exact match: 93.09

Best epoch: 8

Drop duplicates: ['1']

Max epochs = 10

Optimizer lr = 3e-05

Optimizer eps = 1e-08

Batch size = 8

Dataset path = pgajo/mdeberta_EW-TT-PE_U0_S1_DROP1

Results

epoch train_loss train_f1 train_exact dev_loss dev_f1 dev_exact test_loss test_f1 test_exact
1 1.95 48.6 40.98 0.46 85.03 81.22 0 0 0
2 0.36 88.68 85.63 0.39 90.41 89.23 0 0 0
3 0.25 91.68 89.5 0.36 90.08 88.12 0 0 0
4 0.13 95.32 94.4 0.29 91.88 90.61 0 0 0
5 0.09 97 96.27 0.3 93.72 92.54 0 0 0
6 0.09 97.04 96.34 0.37 91.44 89.78 0 0 0
7 0.07 97.2 96.61 0.29 92.68 91.99 0 0 0
8 0.05 98.34 98.06 0.33 93.5 93.09 0 0 0
9 0.03 98.67 98.55 0.35 93.67 91.99 0 0 0
10 0.02 99.33 98.96 0.4 93.54 92.54 0 0 0
Downloads last month
4
Safetensors
Model size
0.3B params
Tensor type
F32
Inference Providers NEW
This model isn't deployed by any Inference Provider. 馃檵 Ask for provider support