APSIPA Transactions on Signal and Information Processing > Vol 14 > Issue 1

Analysis and Extension of Noisy-target Training for Unsupervised Target Signal Enhancement

Takuya Fujimura, Nagoya University, Japan, fujimura.takuya@g.sp.m.is.nagoya-u.ac.jp , Tomoki Toda, Nagoya University, Japan
 
Suggested Citation
Takuya Fujimura and Tomoki Toda (2025), "Analysis and Extension of Noisy-target Training for Unsupervised Target Signal Enhancement", APSIPA Transactions on Signal and Information Processing: Vol. 14: No. 1, e12. http://dx.doi.org/10.1561/116.20250018

Publication Date: 12 Jun 2025
© 2025 T. Fujimura and T. Toda
 
Subjects
Audio signal processing,  Enhancement
 
Keywords
Target signal enhancementunsupervised learningdenoisingdereverberationdeclipping
 

Share

Open Access

This is published under the terms of CC BY-NC.

Downloaded: 14 times

In this article:
Introduction 
NyTT and its Related Works 
Motivation and Content of the Investigation 
Experimental Analysis in the Denoising Task 
Experimental Analysis in the Dereverberation Task 
Experimental Analysis in the Declipping Task 
Conclusion 
References 

Abstract

Deep neural network-based target signal enhancement (TSE) is usually trained in a supervised manner using clean target signals. However, collecting clean target signals is costly and such signals are not always available. Thus, it is desirable to develop an unsupervised method that does not rely on clean target signals. Among various studies on unsupervised TSE methods, Noisy-target Training (NyTT) has been established as a fundamental method. NyTT simply replaces clean target signals with noisy ones in the typical supervised training, and it has been experimentally shown to achieve TSE. Despite its effectiveness and simplicity, its mechanism and detailed behavior are still unclear. In this paper, to advance NyTT and, thus, unsupervised methods as a whole, we analyze NyTT from various perspectives. We experimentally demonstrate the mechanism of NyTT, the desirable conditions, and the effectiveness of utilizing noisy signals in situations where a small number of clean target signals are available. Furthermore, we propose an improved version of NyTT based on its properties and explore its capabilities in the dereverberation and declipping tasks, beyond the denoising task.

DOI:10.1561/116.20250018