iZotope Research

Our mission as the iZotope research team is to secure the lasting technological competitiveness and scientific leadership of iZotope and its products by inventing novel signal processing technology. Read our latest research below.

Research Highlights

Analyzing a Unique Pingable Circuit: The Gamelan Resonator

This paper offers a study of the circuits developed by artist Paul DeMarinis for the touring version of his work Pygmy Gamelan.

Kurt James Werner and Ezra J. Teboul

Zero-shot Singing Voice Conversion

In this paper, we propose the use of speaker embedding networks to perform zero-shot singing voice conversion, and suggest two architectures for its realization.

Shahan Nercessian

Moog Ladder Filter Generalizations Based on State Variable Filters

In this paper, we propose a new style of continuous-time filter design composed of a cascade of 2nd-order state variable filters (SVFs) and a global feedback path.

Kurt James Werner and Russell McClellan

Speech Dereverberation using Recurrent Neural Networks

In this paper, we show how a simple reformulation allows us to adapt blind source separation techniques to the problem of speech dereverberation and, accordingly, train a bidirectional recurrent neural network (BRNN) for this task.

Shahan Nercessian and Alexey Lukin

Energy-Preserving Time-Varying Schroeder Allpass Filters and Multichannel Extensions

We propose time-varying Schroeder allpass filters and Gerzon allpass reverberators that remain energy preserving irrespective of arbitrary variation of their allpass gains or feedback matrices over time.

Kurt James Werner, François G. Germain, and Cory S. Goldsmith

Blind Arbitrary Reverb Matching

We propose a model architecture for performing reverb matching and provide subjective experimental results suggesting that the reverb matching model can perform as well as a human.

Andy Sarroff and Roth Michaels

Research Bibliography

Kurt James Werner and Ezra J. Teboul. “Analyzing a Unique Pingable Circuit: The Gamelan Resonator.” Proceedings of the 151st Convention of the Audio Engineering Society. Las Vegas, NV and Online. October 11–13, 2021.

Shahan Nercessian. “End-to-end voice conversion using a DDSP vocoder.” Proceedings of the 2021 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA). New Paltz, NY. October 17–20, 2021.

Niccolò Pretto, Nadir Dalla Pozza, Alberto Padoan, Anthony Chmiel, Kurt James Werner, Alessandra Micalizzi, Emery Schubert, Antonio Rodà, Simone Milani, and Sergio Canazza. “A workflow and novel digital filters for compensating speed and equalization errors on digitized audio open-reel tapes.” Proceedings of Audio Mostly. Trento, Italy. September 1–3, 2021.

Kurt James Werner. “An equivalent circuit interpretation of antiderivative antialiasing.” Proceedings of the International Conference on Digital Audio Effects (DAFx-21). Vienna, Austria. September 7–11, 2021.

Kurt James Werner, François G. Germain, and Cory S. Goldsmith. “Energy-preserving time-varying Schroeder allpass filters and multichannel extensions.” Journal of the Audio Engineering Society (JAES). Vol. 69, Issue 7/8, pp. 465–485. July 2021.

Shahan Nercessian, Andy Sarroff, and Kurt James Werner. “Lightweight and interpretable neural modeling of an audio distortion effect using hyper-conditioned differentiable biquads.” International Conference on Acoustics, Speech, and Signal Processing (ICASSP). Toronto, Canada. June 6–11, 2021.

Shahan Nercessian. “Improved zero-shot voice conversion using explicit conditioning signals.” Proceedings of Interspeech 2020. Shanghai, China. October 25–29, 2020.

Shahan Nercessian. “Zero-shot singing voice conversion.” Proceedings of the 21st International Society for Music Information Retrieval (ISMIR) Conference. Montreal, Canada. October 11–15, 2020.

Shahan Nercessian. “Neural parametric equalizer matching using differentiable biquads.” Proceedings of the International Conference on Digital Audio Effects (DAFx-20). Vienna, Austria. September 8–12, 2020.

Andy Sarroff and Roth Michaels. “Blind arbitrary reverb matching.” Proceedings of the International Conference on Digital Audio Effects (DAFx-20). Vienna, Austria. September 8–12, 2020.

Kurt James Werner. “Energy-preserving time-varying Schroeder allpass filters.” Proceedings of the International Conference on Digital Audio Effects (DAFx-20). Vienna, Austria. September 8–12, 2020.

Kurt James Werner and Russell McClellan. “Moog ladder filter generalizations based on state variable filters.” Proceedings of the International Conference on Digital Audio Effects (DAFx-20). Vienna, Austria. September 8–12, 2020.

Shahan Nercessian. “Voice conversion using phonetic information and WaveRNN.” Speech and Audio in the Northeast (SANE). New York, NY. October 24, 2019.

Shahan Nercessian and Alexey Lukin. “Speech dereverberation using recurrent neural networks.” Proceedings of the 22nd International Conference on Digital Audio Effects (DAFx-19). Birmingham, UK. September 2–4, 2019.

Gordon Wichern and Alexey Lukin. “Removing lavalier microphone rustle with recurrent neural networks.” Proceedings of the International Conference on Digital Audio Effects (DAFx-18). Aveiro, Portugal. September 4–8, 2018.

Gordon Wichern and Alexey Lukin. “Low-latency approximation of bidirectional recurrent networks for speech denoising.” Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA). New Paltz, NY. October 15–18, 2017.

Alexey Lukin, Russell McClellan, and Aaron S. Wishnick. “A two-pass algorithm for automatic loudness correction.” Proceedings of the 141st Convention of the Audio Engineering Society (AES). Los Angeles, CA. September 29–October 2, 2016.

Gordon Wichern, Hannah Robertson, and Aaron S. Wishnick. “Quantitative analysis of masking in multitrack mixes using loudness loss.” Proceedings of the 141st Convention of the Audio Engineering Society (AES). Los Angeles, CA. September 29–October 2, 2016.

Gordon Wichern, Aaron S. Wishnick, Alexey Lukin, and Hannah Robertson. “Comparison of loudness features for automatic level adjustment in mixing.” Proceedings of the 139th Convention of the Audio Engineering Society (AES). New York, NY. October 29–November 1, 2015.

Aaron Wishnick. “Time-varying filters for musical applications.” Proceedings of the International Conference on Digital Audio Effects (DAFx-14). Erlangen, Germany. September 1–5, 2014.
