What is the difference between Pitch and Frequency?

Frequency is the objective, mathematical measurement of how fast a wave vibrates (e.g., 440 cycles per second). Pitch is the subjective, human *perception* of that frequency (e.g., hearing the note 'A').

Why do we use Decibels instead of just regular numbers for volume?

Because human hearing is logarithmic, not linear. A sound that has 10 times the acoustic power only sounds about 'twice' as loud to us. Decibels (dB) match our logarithmic perception of loudness much better than linear scales.

Can AI 'hear' ultrasound?

Yes. As long as the microphone can capture the high-frequency vibrations and the digital sample rate is high enough to record it, an AI model can process frequencies far above the 20,000 Hz human limit.

🚀 LEVEL UP TO SENIOR:Unlock 500+ Advanced Practical Challenges & Exercises.

🎓 COURSERA PARTNER:Earn professional Google, Meta, and IBM certificates to supercharge your resume.

Tutorials

HTML MASTER CLASS /// LEARN TAGS /// BUILD STRUCTURE /// SEMANTIC WEB /// HTML MASTER CLASS /// LEARN TAGS ///

⚡ Total XP: 0|💻 artificialintelligence XP: 0

Intro to Sound Waves

Master the fundamental properties of sound. Learn the relationship between frequency and pitch, amplitude and volume, and discover how continuous atmospheric pressure changes are visualized as time-domain waveforms.

LOADING ENGINE...

Skill Matrix

UNLOCK NODES BY LEARNING NEW TAGS.

Waves Hub

Sonic foundations.

Quick Quiz //

Which property determines if a sound is a low rumble or a high squeak?

Audio AI begins with the physics of air. Before we can train a model to recognize speech, we must understand how sound travels and is measured.

1Waves of Pressure

Sound is a mechanical wave that results from the back-and-forth vibration of particles in a medium. These vibrations create alternating periods of high pressure (Compressions) and low pressure (Rarefactions). When these pressure changes hit our eardrums, our brain interprets them as sound. In the digital world, we simplify this into a graph called a Waveform, where the X-axis is time and the Y-axis is the instantaneous amplitude of that pressure. Understanding this physical reality is the first step before we can start applying algorithms to it.

—

// Basic Waveform Representation Concept
const sampleRate = 44100; // Hz
const duration = 1.0; // seconds
const numSamples = sampleRate * duration;

// A raw pressure array representing the wave
let audioBuffer = new Float32Array(numSamples);

// We measure the displacement (pressure)
// at each distinct point in time.

localhost:3000

localhost:3000/wave-viewer

Wave Analyzer

Format: RAW PCM

Buffer Size: 44,100 samples

Status: Analyzing Pressure...

2The Dimensions of Audio

We define sound using two primary dimensions. Frequency is the speed of the vibration, measured in Hertz (Hz) (cycles per second). It determines the Pitch—high frequencies sound like whistles, while low frequencies sound like thunder. Amplitude is the 'strength' of the vibration, measured in Decibels (dB). It determines the Loudness. Understanding these two properties is critical for Digital Signal Processing (DSP), as they allow us to filter, amplify, and transform sound mathematically. For example, if you want to remove background AC noise, you use a filter targeting its specific frequency.

—

// Simple Sine Wave Generator
function generateSineWave(freqHz, duration, amplitude) {
  let buffer = [];
  for (let i = 0; i < duration * 44100; i++) {
    let t = i / 44100;
    // Math.sin(2 * PI * f * t)
    buffer.push(amplitude * Math.sin(2 * Math.PI * freqHz * t));
  }
  return buffer;
}

localhost:3000

localhost:3000/synth

Tone Generator

Frequency: 440 Hz (Note: A4)

Amplitude: 0.8 (Loud)

Output: Pure Sine Tone

3The Time Domain Interface

When we look at audio in its raw state, we are viewing it in the Time Domain. This is the classic wavy line you see in audio editors. While the time-domain view is perfect for seeing the rhythm, the silence gaps, and the volume envelopes of a signal, it's actually quite difficult for AI models to extract complex features like 'What vowel is being spoken?' or 'Is this a guitar?'. To solve that, we eventually convert this time-domain wave into the frequency domain using Fourier Transforms. But everything starts here, with the raw, temporal wave.

—

// Time Domain Analysis Concept
function calculateEnergy(audioBuffer) {
  let sum = 0;
  for (let sample of audioBuffer) {
    sum += sample * sample; // Square the amplitude
  }
  return sum / audioBuffer.length; // Mean Square Energy
}

localhost:3000

localhost:3000/energy-monitor

📈

Temporal Energy

RMS Level: -12 dBFS