Frequently Asked Questions
How do voice assistants handle background noise?
Voice assistants use a combination of hardware (directional microphone arrays) and software. In software, Voice Activity Detection (VAD) is used to filter out non-speech audio, and ambient noise calibration (like in PyAudio/SpeechRecognition) establishes a baseline volume threshold to ignore static hums.
What is the difference between ASR and NLP in a voice pipeline?
ASR (Automatic Speech Recognition) simply converts the acoustic soundwaves into raw text string (e.g., hearing the sounds and writing "turn on the lights"). NLP (Natural Language Processing) takes that raw text string and extracts the meaning or intent from it (e.g., identifying the command is `intent_lights_on`).