Audio Front End Algorithm
Audio Front End Algorithm
Professional-Grade Speech Enhancement, Empowering Intelligent Voice Interactions
Solution Selection
Overview
AFE (Audio Front-End) is a professional-grade speech enhancement algorithm independently developed by Realtek. It employs multi-dimensional signal processing techniques to significantly improve speech clarity and signal-to-noise ratio (SNR), delivering robust audio solutions for speech recognition and real-time communication.
Professional Signal Processing
AEC (Acoustic Echo Cancellation)
Dual-stage linear cancellation + residual suppression for effective echo removal
BF (Beamforming)
Multi-microphone spatial filtering for targeted speech enhancement
NS (Noise Suppression)
Supports signal processing and neural network two modes for noise reduction
AGC (Automatic Gain Control)
Fixed + adaptive gain adjustment for stable output levels
SSL (Sound Source Localization)
360° directional tracking with microphone arrays
Advantages
Full-Scenario Coverage
- Supports speech recognition (AFE_FOR_ASR) and voice communication (AFE_FOR_COM) modes
- Compatible with near-field and far-field pickup
- Adaptable to 1mic/2mic/3mic microphone array configurations
Flexible Customization
- Standard array spacings (dual-mic: 30mm/50mm/70mm; three-mic: 50mm)
- Fully configurable algorithm parameters for scenario-specific optimization
- Custom algorithm development services
Typical Applications
Speech Recognition
- Applied in human-machine interaction systems, cowork with KWS/ASR modules.
- Smart speakers, voice controlled home devices, smart toys.
- Enhances speech recognition robustness and prioritizes minimal speech distortion.
Voice Communication
- Applied in real-time human communication systems.
- Conference systems, phone calls, doorbell intercom.
- Optimizes voice quality and emphasizes low latency and low echo/noise leakage.
Speech Recognition
- Applied in human-machine interaction systems, cowork with KWS/ASR modules.
- Smart speakers, voice controlled home devices, smart toys.
- Enhances speech recognition robustness and prioritizes minimal speech distortion.
Voice Communication
- Applied in real-time human communication systems.
- Conference systems, phone calls, doorbell intercom.
- Optimizes voice quality and emphasizes low latency and low echo/noise leakage.
Recommended ICs
| Features | Filter | RTL872xD | RTL8721Dx | RTL8721F | RTL8720E | RTL8710E | RTL8726E | RTL8713E | RTL8730E | RTL8735B |
|---|---|---|---|---|---|---|---|---|---|---|
| Application Processor |
Cortex-M | Cortex-M | Cortex-M | Cortex-M | Cortex-M | Cortex-M | Cortex-M | Cortex-A | Cortex-M | |
| DSP | ||||||||||
| ISP | ||||||||||
| TrustZone | ||||||||||
| Dual Band | ||||||||||
| Wi-Fi6 | ||||||||||
| R-MESH | ||||||||||
| Ultra-low power | ||||||||||
| Ethernet | ||||||||||
| BT Dual Mode | ||||||||||
| HMI | ||||||||||
| Audio ADC | ||||||||||
| Audio DAC | ||||||||||
| USB | ||||||||||
|
BT Dedicated Antenna |
| Feature | RTL8721Dx | RTL8726E | RTL8713E | RTL8730E |
|---|---|---|---|---|
| AFE Single MIC (Speech Recognition Mode) | ||||
| AFE Single MIC (Voice Communication Mode) | ||||
| AFE Dual MIC (Speech Recognition Mode) | ||||
| AFE Three MIC (Speech Recognition Mode) | ||||
| AEC (Speech Recognition Mode) | ||||
| AEC (Voice Communication Mode) | ||||
| BF (Speech Recognition Mode) | ||||
| BF (Voice Communication Mode) | ||||
| NS (Speech Recognition Mode) | ||||
| NS (Voice Communication Mode) | ||||
| AGC (Speech Recognition Mode) | ||||
| AGC (Voice Communication Mode) | ||||
| SSL (Speech Recognition Mode) | ||||
| SSL (Voice Communication Mode) | ||||
| KWS Fixed Keyword | ||||
| KWS User-defined Keyword | ||||
| VAD | ||||
| ASR |


