Digital Audio Representation

Sound is a natural phenomenon that travels through the air as vibrations. Humans hear sound as continuous waves, meaning sound exists in an analog form in the real world. However, computers cannot directly understand analog signals. Instead, computers store, process and transmit sound in digital format. So how can we represent sound in a digital format? That's what we are going to talk about today.

What is Audio?

Audio refers to sound signals that can be heard by humans. These signals are created when objects vibrate, producing sound waves that travel through a medium such as air. These sound waves are continuous and vary in:

Amplitude : Determines how loud or soft the sound is
Frequency : Determines the pitch (how high or low the sound is)

Why do we need digital representation?

Unlike humans, computers and digital devices cannot process analog signals directly. Digital systems work with binary data (1s and 0s). Therefore, analog signals must be converted into digital signals so that computers can,

Store audio files
Edit and process audio
Transmit audio over the internet
Compress audio for efficient storage
Improve sound quality

How is audio converted into digital form?

We can convert analog sound into digital form through a process called Analog to digital conversion (ADC). This process involves three main steps.

1) Sampling

Sampling is the process of measuring the amplitude of a sound wave at regular time intervals.
The number of samples taken per second is called the sampling rate. It's measured in Hertz (Hz). A higher sampling rate means the digital audio is more accurate. A higher sampling rate captures more details but it also requires more storage space.

CD-quality audio uses a sampling rate of 44,100 samples per second (44.1 kHz)

2) Quantization

Quantization is the process of assigning numerical values to each sampled amplitude.

Each sampled value is rounded to the nearest value within a fixed range. And the number of possible values depends on the bit depth.

Bit depth refers to how many bits we use to represent each numerical value.

8-bit audio : 256 possible amplitude levels
16-bit audio : 65,536 possible amplitude levels (Used in CD)
24-bit audio : 16,777,216 possible amplitude levels (Used for professional recordings/mixing)

3) Encoding

Encoding is the process of converting those quantized values into binary numbers.

These binary values are then stored as digital audio files (WAV, MP3, AAC)

Key factors affecting digital audio quality

Sampling rate - A higher sampling rate captures more detail and improves audio accuracy
Bit depth - Higher bit depth improves sound clarity and reduces noise
Compression - Audio files are usually compressed to reduce file size

Lossy compression: Removes some audio data to reduce file size (MP3)
Lossless compression: Preserves all original data (FLAC)

Advantages of digital audio

Easy to store and duplicate
High sound quality
Noise reduction and editing capabilities
Efficient transmission over networks
Long term durability without degradation

Application of digital audio

Music streaming services
Video conferencing
Film and television production
Gaming
Telecommunications

TechTalks

Search This Blog