Features

Natural listening experience

Support for high-quality audio range, audio playback, background blur, air attenuation and more, perfectly simulating a natural listening experience.

Highest fidelity 3D audio

Supports 48kHz full-band sampling and allows listeners to pinpoint both the direction and distance of a voice coming from the speaker.

Low latency

Low latency, low power consumption, and efficient processing modes preserve the real-time experience.

Cross-platform support

Agora’s streaming 3D audio API has support for Web, iOS, Android, Mac, Windows, Unity, React Native, and Electron.

Global scalability

Scale from 1:1 to millions of users on the network that annually powers hundreds of billions of minutes of real-time video to users in over 200 countries and regions.

Your vision, unrestricted.

With Interactive Whiteboard, you can build a collaborative app fast—with custom branding and full of features. Our platform makes it easy to create a customized and engaging learning environment.

Flexible APIs support custom branding and extensive digital whiteboard features.
Easily integrate real-time voice and video calling, interactive streaming and signaling.
Save users’ bandwidth by preloading, sharing, and annotating files, and retain all the dynamic content.

And have peace of mind with HIPAA, GDPR, and CCPA compliance.

Deliver a more natural audio experience

Make your product stand out with Agora’s 3D Spatial Audio API that boosts user engagement.

Deliver a more realistic audio experience

Replicate how we hear sound in the real world for a more natural experience that makes users feel like they are in the same room.

Deliver a more realistic audio experience

Integrate quickly and easily

Quickly make your user experience more immersive by activating Agora’s 3D Spatial Audio extension that works seamlessly with our video, voice, and streaming products.

Integrate quickly and easily

Give users the best audio quality

Allow your audience to hear deeper nuances of music and spoken word with superior audio that elevates the quality of the user’s entire experience.

Give users the best audio quality

Agora Media Services

Recording

Record audio streams, video streams and web pages for archive, review, or distribution.

Cloud Recording Documentation

On-Premise Recording Documentation

Webpage Recording Documentation

Media Gateway

Directly push media streams into Agora voice and video channels using the RTMP/SRT protocol and enable advanced transcoding processing on media streams to facilitate distribution.

Go to Documentation

Cloud Transcoding

Beta

Obtain audio and video source streams from hosts in RTC channels and perform transcoding, audio mixing, and video compositing.

Go to Documentation

Media Pull

Add additional engagement to your Agora sessions by pulling live or recorded video and audio content and ingesting directly into your Agora channel.

Go to Documentation

Media Push

Expand your audience with hybrid engagement experiences by pushing audio and video streams from Agora channels to Content Delivery Networks (CDN).

Go to Documentation

Made for developers

Your Code

Agora SDK

Customize your experience from the start with our flexible SDK.

Go to Docs

Your Code

Agora SDK

Build and integrate real-time video into your app with the most flexibility and customization using Agora's Video SDK.

Go to Docs

NO CODE

App Builder

Agora’s App Builder is the fastest and easiest way to real-time video into your product using our no-code visual designer.

Go to Docs

low code

Agora UI Kit

Add real-time video to your app with only a few lines of code using low-code UI Kit libraries.

Go to Docs

your code

Agora SDK

Customize your experience from the start with our flexible SDK.

Go to Docs

low code

Agora UI Kit

Integrate real-time communication and streaming using only a few lines of code with low-code UIKit libraries.

Go to Docs

Documentation

This project presents you a set of API examples to help you understand how to use Agora APIs.

View documentation on how to set up 3D Spatial Audio.

Go to Docs

Activate Extension

Activate the AI Noise Suppression extension on the Agora Console.

Activate the 3D Spatial Audio extension in the Agora Console.

Go to Console

your code

Agora SDK

Build and integrate Live Streaming with the most flexibility and full customization using Agora's Video SDK.

Go to Docs

NO code

App Builder

Agora’s App Builder is the fastest and easiest way to add real-time voice chat, video chat, and live streaming into your product.

Go to Docs

your code

Agora SDK

Build and integrate real-time visual collaboration features into your application with the most flexibility and full customization using Agora's Interactive Whiteboard SDK.

Go to Docs

LOW code

Fastboard

Build real-time visual collaboration faster with a pre-built UI and the ability to include custom plug ins.

Try it Now

Security, privacy and compliance

Agora is certified to the ISO/IEC 27001, 27017, 27018, 27701 and SOC 2 security standards and meets privacy regulations like GDPR, CCAP, COPPA, and HIPAA. Agora doesn’t collect or store any end-user data aside from Internet Protocol (IP) addresses and operational information necessary for providing our services.

ISO 27001:2022

ISO 27017:2015

ISO 27018:2019

ISO 27701:2019

HIPAA

GDPR

SOC2 Type1&2

CCPA

COPPA

Integrated chipset and module

By building our Conversational AI technology into RiseLink's high-performance IoT chip modules, the turnkey solution makes it easy to integrate voice AI into any connected toy.

“With Agora’s conversational AI technology and our optimized AI hardware, we’re enabling the next generation of toys to think, respond, and interact naturally. We are excited to usher in the future of robotics and toys, ones that can react to the environment around them and interact fluently with users.”

Pengfei Zhang

CEO, Riselink

Use cases

Provide an exceptional immersive sound experience

View All

A livecast of a gaming session with three players.

Livecasting

Create a more personal environment, as if friends are sharing same physical space.

A man is a on conference call next to several others, powered by 3D spatial audio which allows participants to hear him clearly.

Meetings / Conference calls

Make meetings more productive by allowing participants to focus on the main speaker—not background noises

A young child on a live video call on a laptop with his teacher and immersed in the lesson.

Education

Enrich the learning experience by making it more personal and memorable—as if the teacher is sitting next to the student.

A video of a live musical concert, providing an immersive experience and allowing listeners to enjoy the nuances in every note.

Music Streaming

Provide a fully immersive experience allowing listeners to enjoy the nuances in every note.

Robopoet's Fuzzoo, an AI companion robot, leverages Agora's ConvoAI Device Kit to deliver real-time emotional support and personalized interaction.

"Agora’s AI technology enables toys and robots to interact in a way that feels natural and engaging. With real-time voice processing, emotional AI, and advanced speech capabilities, Agora makes seamless human-machine interaction possible and ensures exceptional performance and reliability."

Yuna Pan

Co-Founder and CTO

Frequently asked questions

How does Agora improve the experience in comparison with other solutions for voice interaction with AI?

Agora enables more natural voice conversations with AI, thanks to low-latency responses and real-time interruption handling. Agora’s built-in background noise suppression, echo cancelation, and selective attention locking allow AI to hear the user clearly in any environment. Agora’s global real-time network ensures connectivity and performance in any location.

What LLMs can be connected to Agora’s conversational AI platform?

Agora's Conversational AI Engine offers support for a wide range of large language models (LLMs), including:

OpenAI
OpenAI Realtime API
Azure OpenAI
Google Gemini
Google Vertex AI
Anthropic Claude
Dify
Custom LLM

Review our documentation on connecting LLMs here: https://docs.agora.io/en/conversational-ai/models/llm/overview

What automatic-speech-recognition (ASR) / speech-to text (STT) models are supported?

Agora’s Conversational AI Engine currently supports the following ASR providers:

ARES (default)
Microsoft Azure
Deepgram

Review our documentation on connecting ASR models here: https://docs.agora.io/en/conversational-ai/models/asr/overview

What text-to-speech (TTS) models are supported?

Agora’s Conversational AI Engine currently supports the following TTS providers:

Microsoft Azure
ElevenLabs
Cartesia (Beta)
OpenAI (Beta)
Hume AI (Beta)

Review our documentation on connecting TTS models here: https://docs.agora.io/en/conversational-ai/models/tts/overview

What avatar providers are supported?

Agora’s Conversational AI Engine currently supports the following AI avatar providers:

Akool (Beta)
HeyGen (Alpha)

Review our documentation on connecting avatar providers here: https://docs.agora.io/en/conversational-ai/models/avatar/overview

What additional technology is required to implement a voice AI agent?

To implement a voice AI agent, you need to connect an LLM and a text-to-speech service to Agora’s Conversational AI Engine. This enables full customization of the experience, with the LLM and voice of your choice.

What is a “chained” or “cascade” model” in relation to conversational voice AI?

The chained or cascade model refers to the processing flow of the user’s voice being processed by automatic speech recognition (ASR) technology that converts speech to text, then that text being processed by the LLM, then the LLM’s response being processed by text-to-speech technology and ultimately outputting the AI agent’s voice response.

Does Agora’s Conversational AI Engine enable the creation of an AI model or LLM?

No, Agora’s Conversational AI Engine requires an existing AI model or LLM. The Engine enables customized voice interaction with the LLM but is not capable of creating or training an LLM.

TEN

App Builder

Flexible Classroom

Download SDKs

Support Plans and Pricing

3D Spatial Audio

3D Spatial Audio

Features

Natural listening experience

Highest fidelity 3D audio

Low latency

Cross-platform support

Global scalability

Talk to a voice agent powered by the Conversational AI Engine

Your vision, unrestricted.

See OpenAI's Realtime API in action

Deliver a more natural audio experience

Deliver a more realistic audio experience

Deliver a more realistic audio experience

Integrate quickly and easily

Integrate quickly and easily

Give users the best audio quality

Give users the best audio quality

Recording options for:

Agora Media Services

Made for developers

Quickstart guide

How the Conversational AI Engine works

Made for developers

Agora SDK

Agora SDK

App Builder

Agora UI Kit

Agora SDK

Agora UI Kit

Documentation

Documentation

Activate Extension

Agora SDK

App Builder

Agora SDK

Fastboard

Integrated chipset and module

Provide an exceptional immersive sound experience

Livecasting

Meetings / Conference calls

Education

Music Streaming

Fastboard

Frequently asked questions

FAQs

FAQs

FAQs

FAQs

FAQs

Get started with 10,000 free minutes

Talk to Us

Developer Resources