Continual Speaker Identity Unlearning with Minimal Interference
A new framework called CORTIS has been developed for continual speaker identity unlearning in zero-shot text-to-speech systems. This method addresses the limitations of existing techniques that assume all unlearning requests occur simultaneously. CORTIS allows for the sequential removal of speaker identities without reintroducing previously unlearned speakers, enhancing privacy protection.
- ▪CORTIS stands for Cumulative ORThogonal Identity Suppression.
- ▪The framework does not require access to previously unlearned speaker data.
- ▪CORTIS outperforms previous methods by maintaining the privacy of previously unlearned speakers during sequential requests.
Opening excerpt (first ~120 words) tap to expand
Computer Science > Sound arXiv:2605.25962 (cs) [Submitted on 25 May 2026] Title:Continual Speaker Identity Unlearning with Minimal Interference Authors:Jinju Kim, Yunsung Kang, Gyeong-Moon Park, Jong Hwan Ko View a PDF of the paper titled Continual Speaker Identity Unlearning with Minimal Interference, by Jinju Kim and 3 other authors View PDF HTML (experimental) Abstract:Machine unlearning removes designated concepts or knowledge from pre-trained models. Recent work has extended this paradigm to speaker identity unlearning in zero-shot text-to-speech (ZS-TTS), the task of selectively erasing a model's ability to replicate a speaker's voice.
…
Excerpt limited to ~120 words for fair-use compliance. The full article is at arXiv.org.