Making crystal-clear audio a reality on smaller devices

19 Jun 2025

Abstract 3D data visualization with colorful dots and a wavy grid, representing sound waves..

Speech processing is advancing rapidly, with new techniques emerging to create clearer and more natural audio. One ongoing challenge is speech denoising—removing unwanted background noise while keeping speech clear. To address this issue, we've introduced an updated method that improves audio quality and lowers processing requirements. This new technique uses a knowledge distillation process that makes it possible to achieve better speech enhancement on smaller everyday devices like smartphones, hearing aids, and smart glasses.

The path to smarter audio

Traditional methods for cleaning speech from noisy environments often require heavy computing power. Even recent AI-based tools can struggle to offer a good balance between performance and efficiency, making them less practical for real-time use on smaller, resource-limited devices. Our new approach addresses these challenges by teaching lighter models to match the performance of larger, more complex systems. It uses a cosine distance-based method that emphasizes the overall direction of audio features instead of requiring exact values. Or to put it another way, it helps the system grasp the essential elements of clear speech without overcomplicating the learning process.

Our innovation combines several key features: a cosine similarity method enabling flexible learning transfer, efficient handling of different model setups using linear bottleneck techniques, and consistent performance across various conditions. Testing in controlled settings has shown that our lightweight models deliver performance close to their more resource-intensive counterparts.

Even small improvements in audio clarity can lead to significantly better communication in real-life applications and this new method could benefit many sectors. Mobile devices can enjoy improved noise cancellation, hearing aids can provide clearer sound with less power, and teleconference systems benefit from better audio quality overall. This work isn't just a minor technical upgrade; it represents a shift towards more efficient speech processing. For consumers, this means clearer audio without needing costly hardware updates. For developers, it offers a more adaptable way to add speech enhancement features. For the industry, it opens the door to more accessible and efficient audio solutions.

Our research shows that fresh approaches to knowledge distillation can address common audio challenges in practical ways. As voice-enabled devices become more common, such improvements in speech enhancement technology are increasingly important for ensuring effective communication in our connected world.

Find out more in our paper: Knowledge Distillation for Speech Denoising by Latent Representation Alignment with Cosine Distance

About Konstantinos Drosos

Konstantinos (Kostas) Drosos is a principal audio machine learning scientist at Nokia. He is the author or co-author of over 50 scientific papers and an acting reviewer in various journals and international conferences, and is considered as a pioneer in different audio machine learning tasks. He is involved in the research and development of deep learning based methods in OZO Audio.

Connect with Kostas on LinkedIn

About Mikko Heikkinen

Mikko Heikkinen is a principal software engineer at Nokia with a broad background in developing advanced multimedia technologies. He is a trusted software generalist currently contributing to the development of OZO audio technologies. He holds several granted patents and patent applications and conducts research in machine learning applied to audio processing.

Connect with Mikko on LinkedIn

Article tags

Immersive Audio AI Standardization

Select your country

Making crystal-clear audio a reality on smaller devices

The path to smarter audio

About Konstantinos Drosos

About Mikko Heikkinen

Article tags

How AI is Revolutionizing Spatial Audio

Transparent AI in Nokia’s audio product development

Looking for Nokia licensed products support?

Looking for Nokia licensed products support?

Select your country

Making crystal-clear audio a reality on smaller devices

The path to smarter audio

About Konstantinos Drosos

About Mikko Heikkinen

Article tags

Related posts

How AI is Revolutionizing Spatial Audio

Transparent AI in Nokia’s audio product development