VTC 2.1

This repository contains only the model weights. For more information on how to use the model, see the LAAC-LSCP/VTC repository.

Citation

To cite this work, please use the following BibTeX entry.

@misc{charlot2025babyhubertmultilingualselfsupervisedlearning,
    title={BabyHuBERT: Multilingual Self-Supervised Learning for Segmenting Speakers in Child-Centered Long-Form Recordings}, 
    author={Théo Charlot and Tarek Kunze and Maxime Poli and Alejandrina Cristia and Emmanuel Dupoux and Marvin Lavechin},
    year={2025},
    eprint={2509.15001},
    archivePrefix={arXiv},
    primaryClass={eess.AS},
    url={https://arxiv.org/abs/2509.15001}, 
}

Changelog

To retrieve a specific version:

git clone --branch v2.1 --single-branch https://huggingface.co/coml/VTC-2

10265c5: VTC 2.1 - tag: v2.1
91e67b5: VTC 2.0 - tag: v2.0
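
The tag-to-commit mapping above can be turned into a small sanity check. The following is a minimal sketch in POSIX shell; the helper names `vtc_expected_commit` and `vtc_clone_command` are hypothetical and are not part of this repository or of LAAC-LSCP/VTC. It only prints the clone command for a known tag and the short commit hash the changelog lists for it.

```shell
#!/bin/sh
# Hypothetical helpers for illustration; not provided by the VTC project.
REPO_URL="https://huggingface.co/coml/VTC-2"

# Print the short commit hash that the changelog lists for a given tag.
vtc_expected_commit() {
  case "$1" in
    v2.1) echo "10265c5" ;;
    v2.0) echo "91e67b5" ;;
    *) echo "unknown tag: $1" >&2; return 1 ;;
  esac
}

# Print the clone command for a tag, refusing tags not in the changelog.
vtc_clone_command() {
  vtc_expected_commit "$1" >/dev/null || return 1
  echo "git clone --branch $1 --single-branch $REPO_URL"
}
```

After cloning, running `git -C VTC-2 rev-parse --short HEAD` and comparing the result with `vtc_expected_commit v2.1` should confirm that the expected version was checked out, assuming the clone landed in the default `VTC-2` directory. Git may print a longer abbreviation than the seven characters used in the changelog, so a prefix comparison is safer than strict equality.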

Model tree for coml/VTC-2

Base model: coml/BabyHuBERT. This model is one of 3 models fine-tuned from it.

Paper for coml/VTC-2

BabyHuBERT: Multilingual Self-Supervised Learning for Segmenting Speakers in Child-Centered Long-Form Recordings (arXiv:2509.15001, published Sep 18, 2025)