Papers
arxiv:2402.01808

KS-Net: Multi-band joint speech restoration and enhancement network for 2024 ICASSP SSI Challenge

Published on Feb 2, 2024
Authors:
,
,
,
,
,
,
,
,
,

Abstract

The proposed speech restoration and enhancement system uses a complex-domain GAN and multi-band fusion module, achieving top rankings in both real-time and non-real-time tracks of the ICASSP 2024 SSI Challenge.

This paper presents the speech restoration and enhancement system created by the 1024K team for the ICASSP 2024 Speech Signal Improvement (SSI) Challenge. Our system consists of a generative adversarial network (GAN) in complex-domain for speech restoration and a fine-grained multi-band fusion module for speech enhancement. In the blind test set of SSI, the proposed system achieves an overall mean opinion score (MOS) of 3.49 based on ITU-T P.804 and a Word Accuracy Rate (WAcc) of 0.78 for the real-time track, as well as an overall P.804 MOS of 3.43 and a WAcc of 0.78 for the non-real-time track, ranking 1st in both tracks.

Community

Sign up or log in to comment

Get this paper in your agent:

hf papers read 2402.01808
Don't have the latest CLI?
curl -LsSf https://hf.co/cli/install.sh | bash

Models citing this paper 0

No model linking this paper

Cite arxiv.org/abs/2402.01808 in a model README.md to link it from this page.

Datasets citing this paper 0

No dataset linking this paper

Cite arxiv.org/abs/2402.01808 in a dataset README.md to link it from this page.

Spaces citing this paper 0

No Space linking this paper

Cite arxiv.org/abs/2402.01808 in a Space README.md to link it from this page.

Collections including this paper 1