BabyHuBERT: Multilingual Self-Supervised Learning for Segmenting Speakers in Child-Centered Long-Form Recordings
Paper • 2509.15001 • Published
This repository only contains the model weights. For more informations on how to use the model, please look over at the LAAC-LSCP/VTC repository.
To cite this work, please use the following bibtex.
@misc{charlot2025babyhubertmultilingualselfsupervisedlearning,
title={BabyHuBERT: Multilingual Self-Supervised Learning for Segmenting Speakers in Child-Centered Long-Form Recordings},
author={Théo Charlot and Tarek Kunze and Maxime Poli and Alejandrina Cristia and Emmanuel Dupoux and Marvin Lavechin},
year={2025},
eprint={2509.15001},
archivePrefix={arXiv},
primaryClass={eess.AS},
url={https://arxiv.org/abs/2509.15001},
}
To retrieve a specific version:
git clone --branch v2.1 --single-branch https://huggingface.co/coml/VTC-2
10265c5: VTC 2.1 - tag: v2.191e67b5: VTC 2.0 - tag: v2.0Base model
coml/BabyHuBERT