(you may need to press Ctrl+Shift+R, or Cmd+Shift+R on macOS, to refresh the page)
RTC front-end processing (3A): (general) speech enhancement, echo cancellation
audio compression (codec): audio coding (including speech, music, and noise), BWE, and PLC
generative AI for speech: text-to-speech, voice conversion, and LLMs applied to TTS
microphone array processing: beamforming, neural beamformers, and multi-speaker DOA estimation