Immersive Voice and Audio Services platform
Nokia Immersive Voice is a complete, end-to-end spatial audio solution for experiencing natural, real-time voice communication with the IVAS (Immersive Voice and Audio Services) codec. It works on mobile devices, XR glasses, and other multi-microphone devices enabling realistic, location-aware conversations.
How Nokia Immersive Voice works
First, it’s important to note that spatial audio capture and playback are not covered by the IVAS standard. That’s where Nokia Immersive Voice provides the necessary tools.
Capturing spatial audio requires access to a device’s integrated microphones. Listening to immersive audio is possible on any playback device—whether with headphones or multi-loudspeaker systems.
1
The spatial analysis algorithm of Nokia Immersive Voice reads microphone inputs, reduces unwanted noise, and processes the audio into MASA format.
2
MASA data is encoded with the IVAS codec and transmitted to the receiving side, where it’s rendered for playback with advanced Acoustic Echo Cancellation (AEC), designed specifically for spatial audio.
3
The listener hears audio that reflects the real-world spatial scene. Additional controls allow for ambience adjustment, orientation selection, and head-tracking—giving users flexibility in how they experience the conversation.
What you can experience
Nokia Immersive Voice is designed as a test and evaluation platform—helping industry players explore how IVAS can be applied in different scenarios.
One-to-one calls
Share the sounds of your environment and hear exactly what your counterpart is hearing. Or reduce background noise for clarity.
Potential use cases:
- Sharing the ambience of a nature hike or a city street
- Talking while streaming or watching content together
Multi-party calls
Hear group call participants from distinct directions and adjust the balance between voices and background ambience.
Potential use cases:
- Team meetings and project calls
- Live content streaming for remote audiences
- Customer support services
Audio and video conferencing
Voices come from consistent directions, improving clarity and reducing listening fatigue. Spatial mixing ensures everyone is heard clearly.
Potential use cases:
- Team and training sessions
- Watch parties and shared events
- One-way streaming or broadcasting
XR communication
Combines extended reality with spatial audio for location- and movement-aware communication.
Potential use cases:
- Co-designing and collaborative creation
- Remote guidance and troubleshooting
- Telepresence applications
- Mission-critical communication in industrial and public sector settings
Want to hear more?
Contact our experts to explore how Nokia Immersive Voice can help evaluate and shape the future of IVAS-powered communication.