COMET-early-exit
Collection
Models introduced in the paper Early-Exit and Instant Confidence Translation Quality Estimation https://github.com/zouharvi/COMET-early-exit
•
4 items
•
Updated
This model is based on COMET-early-exit, which is a fork but not compatible with original Unbabel's COMET. To run the model, you need to first install this version of COMET either with:
pip install "git+https://github.com/zouharvi/COMET-early-exit#egg=comet-early-exit&subdirectory=comet_early_exit"
or in editable mode:
git clone https://github.com/zouharvi/COMET-early-exit.git
cd COMET-early-exit
pip3 install -e comet_early_exit
This model specifically makes prediction at each of the 25 layers, both the score and the confidence. This time, the confidence is the absolute error with respect to the final layer's prediction.
model = comet_early_exit.load_from_checkpoint(comet_early_exit.download_model("zouharvi/COMET-instant-self-confidence"))
data = [
{
"src": "Can I receive my food in 10 to 15 minutes?",
"mt": "Moh bych obdržet jídlo v 10 do 15 minut?",
},
{
"src": "Can I receive my food in 10 to 15 minutes?",
"mt": "Mohl bych dostat jídlo během 10 či 15 minut?",
}
]
model_output = model.predict(data, batch_size=8, gpus=1)
# print predictions at 5th, 12th, and last layer
print("scores", model_output["scores"][0][5], model_output["scores"][0][12], model_output["scores"][0][-1])
print("estimated errors", model_output["confidences"][0][5], model_output["confidences"][0][12], model_output["confidences"][0][-1])
# two top-level outputs
assert len(model_output["scores"]) == 2 and len(model_output["confidences"]) == 2
# each output contains prediction per each layer
assert all(len(l) == 25 for l in model_output["scores"]) and all(len(l) == 25 for l in model_output["confidences"])
Outputs (formatted):
scores 75.60 86.60 85.74
estimated errors 10.48 3.52 0.83
This model is based on the work Early-Exit and Instant Confidence Translation Quality Estimation which can be cited as:
@misc{zouhar2025earlyexitinstantconfidencetranslation,
title={Early-Exit and Instant Confidence Translation Quality Estimation},
author={Vilém Zouhar and Maike Züfle and Beni Egressy and Julius Cheng and Jan Niehues},
year={2025},
eprint={2502.14429},
archivePrefix={arXiv},
primaryClass={cs.CL},
url={https://arxiv.org/abs/2502.14429},
}
Base model
FacebookAI/xlm-roberta-large