Fast-ULCNet

Abstract

Single-channel speech enhancement algorithms are often used in resource-constrained embedded devices, where low latency and low-complexity designs are essential. We propose Fast-ULCNet, an adaptation of ULCNet that replaces GRU layers with FastGRNNs, reducing computational latency and complexity. To address state drifting in long signals, we introduce a trainable complementary filter. Fast-ULCNet matches state-of-the-art performance while cutting model size by over 50% and reducing latency by 34% on average.

Audio Examples

This demonstration presents a collection of audio examples designed to allow visitors to evaluate and compare different noise reduction techniques. All the clips have been chosen from the public test set available on the 2020 DNS Challenge repository. For each audio sample, we provide the original noisy recording alongside two enhanced versions: one processed with the state-of-the-art model ULCNet, and the other processed with our proposed method, Fast-ULCNet.

Example 1

Engine noise

Noisy

ULCNet

Fast-ULCNet (ours)

Example 2

Drill noise

Noisy

ULCNet

Fast-ULCNet (ours)

Example 3

Electronic music

Noisy

ULCNet

Fast-ULCNet (ours)

Example 4

Babble noise

Noisy

ULCNet

Fast-ULCNet (ours)

Example 5

Dog barking noise

Noisy

ULCNet

Fast-ULCNet (ours)

Conditions of Use

All material used for training and testing is publicly available on the DNS Challenge GitHub repository.
Samples are for testing and demonstration of noise reduction techniques only.
No warranties are expressed or implied.
Do not redistribute or modify outside of evaluation without permission.
Include this notice when sharing links to this page.