All Topics  
Psychoacoustics

 

   Email Print
   Bookmark   Link






 

Psychoacoustics



 
 
Psychoacoustics is the study of subjective human perception
Perception

In psychology and the cognitive sciences, perception is the process of attaining awareness or understanding of sense information. It is a task far more complex than was imagined in the 1950s and 1960s, when it was predicted that building perceiving machines would take about a decade, a goal which is still very far from fruition....
 of sound
Sound

Sound is vibration transmitted through a solid, liquid, or gas, composed of frequencies within the range of hearing and of a threshold of hearing to be heard, or the sensation stimulated in organs of hearing by such vibrations....
s. Alternatively it can be described as the study of the psychological correlates of the physical parameters of acoustics
Acoustics

Acoustics is the interdisciplinary science that deals with the study of sound, ultrasound and infrasound . A scientist who works in the field of acoustics is an acoustician....
.

Background
Hearing is not a purely mechanical phenomenon of wave propagation, but is also a sensory and perceptual event. When a person hears something, that something arrives at the ear
Ear

The ear is the sense organ that detects sounds. The vertebrate ear shows a common biology from fish to humans, with variations in structure according to order and species....
 as a mechanical sound wave traveling through the air, but within the ear it is transformed into neural action potentials.






Discussion
Ask a question about 'Psychoacoustics'
Start a new discussion about 'Psychoacoustics'
Answer questions from other users
Full Discussion Forum



Recent Posts









Encyclopedia


Psychoacoustics is the study of subjective human perception
Perception

In psychology and the cognitive sciences, perception is the process of attaining awareness or understanding of sense information. It is a task far more complex than was imagined in the 1950s and 1960s, when it was predicted that building perceiving machines would take about a decade, a goal which is still very far from fruition....
 of sound
Sound

Sound is vibration transmitted through a solid, liquid, or gas, composed of frequencies within the range of hearing and of a threshold of hearing to be heard, or the sensation stimulated in organs of hearing by such vibrations....
s. Alternatively it can be described as the study of the psychological correlates of the physical parameters of acoustics
Acoustics

Acoustics is the interdisciplinary science that deals with the study of sound, ultrasound and infrasound . A scientist who works in the field of acoustics is an acoustician....
.

Background


Hearing is not a purely mechanical phenomenon of wave propagation, but is also a sensory and perceptual event. When a person hears something, that something arrives at the ear
Ear

The ear is the sense organ that detects sounds. The vertebrate ear shows a common biology from fish to humans, with variations in structure according to order and species....
 as a mechanical sound wave traveling through the air, but within the ear it is transformed into neural action potentials. These nerve pulses then travel to the brain where they are perceived. Hence, in many problems in acoustics
Acoustics

Acoustics is the interdisciplinary science that deals with the study of sound, ultrasound and infrasound . A scientist who works in the field of acoustics is an acoustician....
, such as for audio processing, it is advantageous to take into account not just the mechanics of the environment, but also the fact that both the ear and the brain are involved in a person’s listening experience.

The inner ear
Inner ear

The inner ear is the labyrinth , a system of passages comprising two main functional parts:* the organ of hearing, or cochlea* and the vestibular apparatus, the organ of balance that consists of three semicircular canals and the Vestibule of the ear....
, for example, does significant signal processing
Signal processing

Signal processing is the analysis, interpretation, and manipulation of signal . Signals of interest include: audio signal processing, , time-varying measurement values and sensor data, for example biological data such as electrocardiograms, control system signals, telecommunication transmission signals such as radio signals, and many others....
 in converting sound waveform
Waveform

Waveform means the shape and form of a signal such as a wave moving in a solid, liquid or gaseous medium.In many cases the medium in which the wave is being propagated does not permit a direct visual image of the form....
s into neural stimulus, so certain differences between waveforms may be imperceptible. MP3
MP3

MPEG-1 Audio Layer 3, more commonly referred to as MP3, is a digital audio Encoder format using a form of lossy data compression. It is a common audio format for consumer audio storage, as well as a de facto standard encoding for the transfer and playback of music on digital audio players....
 and other audio compression
Audio compression

Audio compression can mean two things:* Audio data compression - in which the amount of data in a recorded waveform is reduced for transmission....
 techniques make use of this fact. In addition, the ear has a nonlinear response to sounds of different loudness
Loudness

Loudness is the quality of a sound that is the primary psychological correlate of physical strength .Loudness, a subjective measure, is often confused with objective measures of sound pressure such as decibels or sound intensity....
 levels. Telephone networks and audio noise reduction
Noise reduction

Noise reduction is the process of removing noise from a signal . Noise reduction techniques are conceptually very similar regardless of the signal being processed, however A priori and a posteriori knowledge of the characteristics of an expected signal can mean the implementations of these techniques vary greatly depending on the type of si...
 systems make use of this fact by nonlinearly compressing data samples before transmission, and then expanding them for playback. Another effect of the ear's nonlinear response is that sounds that are close in frequency produce phantom beat notes, or intermodulation
Intermodulation

Intermodulation or intermodulation distortion , or intermod for short, is the result of two or more Signal of different frequencies being Frequency mixer together, forming additional signals at frequencies that are not, in general, at harmonic frequencies of either....
 distortion products.

Limits of perception


The human ear can nominally hear sounds in the range 20 Hz
Hertz

The hertz is a measure of frequency per unit of time, or the number of list of cycles per second. It is the SI base unit of frequency in the International System of Units , and is used worldwide in both general-purpose and scientific contexts....
 to 20,000 Hz (20 kHz). This upper limit tends to decrease with age, most adults being unable to hear above 16 kHz. The ear itself does not respond to frequencies below 20 Hz, but these can be perceived via the body's sense of touch. Some recent research has also demonstrated a hypersonic effect
Hypersonic effect

The hypersonic effect is a term coined to describe the phenomenon reported in some scientific studies, which demonstrate that although humans cannot consciously hear sounds at frequencies above [approximately] 20 Hertz, the presence or absence of those frequencies has a measurable effect on their psychological reaction....
 which is that although sounds above 20 kHz cannot consciously be heard, they can have an effect on the listener.

Frequency resolution of the ear is 0.36 Hz within the octave of 1,000–2,000 Hz. That is, changes in pitch larger than 0.36 Hz can be perceived in a clinical setting. However, even smaller pitch differences can be perceived through other means. For example, the interference of two pitches can often be heard as a (low-)frequency difference pitch. This effect of phase
Phase (waves)

The phase of an oscillation or wave is the fraction of a complete cycle corresponding to an offset in the displacement from a specified reference point at time t = 0....
 variance upon the resultant sound is known as 'beating
Beat (acoustics)

In acoustics, a beat is an interference between two sounds of slightly different frequency, perceived as periodic variations in volume whose rate is the difference between the two frequencies....
'.

The semitone
Semitone

A semitone, also called a half step or a half tone,Aaron Copland, Leonard Bernstein, and others use "half tone".One source says that step is "chiefly US", and that half-tone is "chiefly N....
 scale used in Western musical notation is not a linear frequency scale but logarithmic. Other scales have been derived directly from experiments on human hearing perception, such as the mel scale
Mel scale

The mel scale, proposed by Stanley Smith Stevens, John Volkman and Edwin Newman in 1937 is a perceptual Scale of pitch es judged by listeners to be equal in distance from one another....
 and Bark scale
Bark scale

The Bark scale is a psychoacoustics scale proposed by Eberhard Zwicker in 1961. It is named after Heinrich Barkhausen who proposed the first subjective measurements of loudness....
 (these are used in studying perception, but not usually in musical composition), and these are approximately logarithmic in frequency at the high-frequency end, but nearly linear at the low-frequency end.

The "intensity" range of audible sounds is enormous. Our ear drums are sensitive only to variations in the sound pressure, but can detect pressure changes as small as 2×10–10 atm
Atmosphere (unit)

The standard atmosphere is an international reference pressure defined as 101,325 Pascal and formerly used as unit of pressure . For practical purposes it has been replaced by the Bar which is 100,000 Pa....
 and as great or greater than 1 atm. For this reason, Sound Pressure Level
Sound pressure

Sound pressure is the local pressure deviation from the ambient pressure caused by a sound wave. Sound pressure can be measured using a microphone in air and a hydrophone in water....
 is also measured logarithm
Logarithm

In mathematics, the logarithm of a number to a given base is the Power or exponent to which the base must be raised in order to produce the number....
ically, with all pressures referenced to 1.97385×10–10 atm
Atmosphere (unit)

The standard atmosphere is an international reference pressure defined as 101,325 Pascal and formerly used as unit of pressure . For practical purposes it has been replaced by the Bar which is 100,000 Pa....
. The lower limit of audibility is therefore defined as 0 dB
Decibel

The decibel is a logarithmic units of measurement that expresses the magnitude of a physical quantity relative to a specified or implied reference level....
, but the upper limit is not as clearly defined. While 1 atm
Atmosphere (unit)

The standard atmosphere is an international reference pressure defined as 101,325 Pascal and formerly used as unit of pressure . For practical purposes it has been replaced by the Bar which is 100,000 Pa....
 (191 dB) is the largest pressure variation an undistorted sound wave can have in Earth's atmosphere
Earth's atmosphere

The Earth's atmosphere is a layer of gases surrounding the planet Earth that is retained by the Earth's gravity. Dry air contains roughly 78.08% nitrogen, 20.95% oxygen, 0.93% argon, 0.038% Carbon dioxide in the Earth's atmosphere, and trace amounts of other gases....
, larger sound waves can be present in other atmosphere
Atmosphere

An atmosphere is a layer of gases that may surround a material body of sufficient mass, by the gravity of the body, and are retained for a longer duration if gravity is high and the atmosphere's temperature is low....
s, or on Earth in the form of shock waves
Shock Waves

Shock Waves, also known as Almost Human and Death Corps, is a horror movie from 1977 directed by Ken Wiederhorn. It features Peter Cushing as a Nazi commander who commanded a division of SS soldiers - 'Der Toten Korps', who can 'live' underwater - intended to man U-boats which need no oxygen and never need to surface....
. The upper limit is more a question of the limit where the ear will be physically harmed or with the potential to cause a hearing disability
Hearing impairment

A hearing impairment is a full or partial decrease in the ability to detect or understand sounds.Caused by a wide range of biological and environmental factors, loss of hearing can happen to any organism that perceives sound....
. This limit also depends on the time exposed to the sound. The ear can be exposed to short periods in excess of 120 dB without permanent harm — albeit with discomfort and possibly pain; but long term exposure to sound levels over 80 dB can cause permanent hearing loss.

A more rigorous exploration of the lower limits of audibility determines that the minimum threshold at which a sound can be heard is frequency dependent. By measuring this minimum intensity for testing tones of various frequencies, a frequency dependent Absolute Threshold of Hearing
Absolute threshold of hearing

The absolute threshold of hearing is the minimum sound level of a pure tone that an average ear with normal Hearing can hear in a noiseless environment....
 (ATH) curve may be derived. Typically, the ear shows a peak of sensitivity (i.e., its lowest ATH) between 1 kHz and 5 kHz, though the threshold changes with age, with older ears showing decreased sensitivity above 2 kHz.

The ATH is the lowest of the equal-loudness contour
Equal-loudness contour

An equal-loudness contour is a measure of sound pressure , over the frequency spectrum, for which a listener perceives a constant loudness when presented with pure steady tones....
s. Equal-loudness contours indicate the sound pressure level (dB), over the range of audible frequencies, which are perceived as being of equal loudness. Equal-loudness contours were first measured by Fletcher and Munson at Bell Labs
Bell Labs

Bell Laboratories is the research organization of Alcatel-Lucent and previously of the American Telephone & Telegraph Company .Bell Laboratories has had its headquarters at Berkeley Heights, New Jersey, and it has research and development facilities throughout the world....
 in 1933 using pure tones reproduced via headphones, and the data they collected are called Fletcher-Munson curves. Because subjective loudness was difficult to measure, the Fletcher-Munson curves were averaged over many subjects.

Robinson and Dadson refined the process in 1956 to obtain a new set of equal-loudness curves for a frontal sound source measured in an anechoic chamber
Anechoic chamber

An anechoic chamber is a shielded room designed to attenuate sound or electromagnetic energy. Anechoic chambers were originally used in the context of absorbing Acoustics echoes caused by internal reflections of a room, but more recently anechoic chambers have also been used to provide a shielded environment for radio frequency and microw...
. The Robinson-Dadson curves were standardized as ISO
International Organization for Standardization

The International Organization for Standardization , widely known as ISO , is an international standard-setting body composed of representatives from various national standards organizations....
 226 in 1986. In 2003, ISO 226 was revised as equal-loudness contour
Equal-loudness contour

An equal-loudness contour is a measure of sound pressure , over the frequency spectrum, for which a listener perceives a constant loudness when presented with pure steady tones....
 using data collected from 12 international studies.

Masking effects

In some situations an otherwise clearly audible sound can be masked by another sound. For example, conversation at a bus stop can be completely impossible if a loud bus is driving past. This phenomenon is called masking. A weaker sound is masked if it is made inaudible in the presence of a louder sound. The masking phenomenon occurs because any loud sound will distort the Absolute Threshold of Hearing, making quieter, otherwise perceptible sounds inaudible.

If two sounds occur simultaneously and one is masked by the other, this is referred to as simultaneous masking
Simultaneous masking

In acoustics, simultaneous masking is masking between two concurrent sounds. Sometimes called frequency masking since it is often observed when the sounds share a frequency band e.g....
. Simultaneous masking is also sometimes called frequency masking. The tonality of a sound partially determines its ability to mask other sounds. A sinusoidal
Siné

Maurice Sinet, known as Sin? is a France cartoonist.As a young man he studied drawing and graphic arts, earning his life as a cabaret singer....
 masker, for example, requires a higher intensity to mask a noise-like maskee than a loud noise-like
Noise

In common use, the word noise means unwanted sound or noise pollution. In electronics noise can refer to the electronic signal corresponding to acoustic noise or the electronic signal corresponding to the noise commonly seen as 'Noise ' on a degraded television or video image....
 masker does to mask a sinusoid. Computer models which calculate the masking caused by sounds must therefore classify their individual spectral peaks according to their tonality.

Similarly, a weak sound emitted soon after the end of a louder sound is masked by the louder sound. Even a weak sound just before a louder sound can be masked by the louder sound. These two effects are called forward and backward temporal masking
Temporal masking

Temporal masking or "non-simultaneous masking" occurs when a sudden stimulus sound makes loudness other sounds which are present immediately preceding or following the stimulus....
, respectively.

'Phantom' fundamentals


Low pitches
Pitch (psychophysics)

Pitch is the property of a sound that allows the construction of melodies; pitches are compared as "higher" and "lower", and are quantified as frequency , corresponding very nearly to the repetition rate of sound waves....
 can sometimes be heard when there is no apparent source or component of that frequency. This perception is due to the brain interpreting repetition patterns determined by the differences of audible harmonics that are present. A harmonic series
Harmonic series (music)

Definite pitch musical instruments are often based on an approximate harmonic oscillator such as a string or a column of air, which oscillates at numerous frequencies simultaneously....
 of pitches that are related 2×f, 3×f, 4×f, 5×f, etc, give human hearing the psychoacoustic impression that the pitch 1×f is present. This phenomenon is used by some pro audio manufacturers to allow sound systems to seem to produce notes that are lower in pitch than they are capable of reproducing.

Software


The psychoacoustic model provides for high quality lossy signal compression
Lossy data compression

A lossy compression method is one where data compression and then decompressing it retrieves data that may well be different from the original, but is close enough to be useful in some way....
 by describing which parts of a given digital audio signal can be removed (or aggressively compressed) safely - that is, without significant losses in the (consciously) perceived quality of the sound.

It can explain how a sharp clap of the hands might seem painfully loud in a quiet library, but is hardly noticeable after a car backfires on a busy, urban street. This provides great benefit to the overall compression ratio, and psychoacoustic analysis routinely leads to compressed music files that are 1/10 to 1/12 the size of high quality original masters with very little discernible loss in quality. Such compression is a feature of nearly all modern audio compression formats. Some of these formats include MP3
MP3

MPEG-1 Audio Layer 3, more commonly referred to as MP3, is a digital audio Encoder format using a form of lossy data compression. It is a common audio format for consumer audio storage, as well as a de facto standard encoding for the transfer and playback of music on digital audio players....
, Ogg Vorbis, AAC
Advanced Audio Coding

Advanced Audio Coding is a standardized, lossy data compression Audio data compression and encoder scheme for digital audio. Designed to be the successor of the MP3 format, AAC generally achieves better sound quality than MP3 at many bit rates....
, WMA
Windows Media Audio

Windows Media Audio is an audio data compression technology developed by Microsoft. The name can be used to refer to its audio file format or its audio codecs....
, MPEG-1 Layer II (used for digital audio broadcasting
Digital audio broadcasting

Digital Audio Broadcasting , also known as EUREKA, is a digital radio technology for broadcasting radio stations, used in several countries, particularly in the UK and Europe....
 in several countries) and ATRAC
ATRAC

Adaptive Transform Acoustic Coding is a family of proprietary audio codec developed by Sony. MiniDisc was the first commercial product to incorporate ATRAC in 1992....
, the compression used in MiniDisc
MiniDisc

A MiniDisc is a magneto-optical disc-based data storage device initially intended for storage of up to 80 minutes of digitized sound. Today, in the form of Hi-MD, it has developed into a general-purpose storage medium in addition to greatly expanding its audio roots....
 and Walkman
Walkman

Walkman is an audio cassette player used to market its portable Audio frequency and video players. The original Walkman introduced a change in music listening habits, allowing people to carry their own choice of music with them....
.

Psychoacoustics is based heavily on human anatomy
Human anatomy

Human anatomy, which, with physiology and biochemistry, is a complementary basic medical science is primarily the scientific study of the morphology of the adult human body....
, especially the ear's limitations in perceiving sound as outlined previously. To summarize, these limitations are:

  • High frequency limit
    High frequency limit

    The high frequency limit of hearing is the upper extent to which a particular animal can perceive sound.Perhaps the most commonly known aspect of the psychoacoustic model is that humans cannot hear frequency above and below certain thresholds; in fact, most humans can only hear frequencies between 20 Hz and 20,000 Hz ....
  • Absolute threshold of hearing
    Absolute threshold of hearing

    The absolute threshold of hearing is the minimum sound level of a pure tone that an average ear with normal Hearing can hear in a noiseless environment....
  • Temporal masking
    Temporal masking

    Temporal masking or "non-simultaneous masking" occurs when a sudden stimulus sound makes loudness other sounds which are present immediately preceding or following the stimulus....
  • Simultaneous masking
    Simultaneous masking

    In acoustics, simultaneous masking is masking between two concurrent sounds. Sometimes called frequency masking since it is often observed when the sounds share a frequency band e.g....


Given that the ear will not be at peak perceptive capacity when dealing with these limitations, a compression algorithm can assign a lower priority to sounds outside the range of human hearing. By carefully shifting bits away from the unimportant components and toward the important ones, the algorithm ensures that the sounds a listener is most likely to perceive are of the highest quality.

Music

Psychoacoustics include topics and studies which are relevant to music psychology
Music psychology

Music psychology, or the psychology of music, may be regarded either as a branch of psychology or as a branch of musicology. It aims to explain and understand musical behavior and musical experience....
. Theorists such as Benjamin Boretz
Benjamin Boretz

Benjamin Boretz is a twentieth- and twenty-first-century United States composer and Music theory.He was born in Brooklyn, New York and studied composition at Brandeis University with Arthur Berger, at the Aspen Music School with Darius Milhaud, at UCLA with Lukas Foss, and at Princeton with Milton Babbitt and Roger Sessions....
 consider some of the results of psychoacoustics to be meaningful only in a musical context.

Applied psychoacoustics

Psychoacoustics is presently applied within many fields from software development, where developers map proven and experimental mathematical patterns; in digital signal processing, where many audio compression codecs such as MP3
MP3

MPEG-1 Audio Layer 3, more commonly referred to as MP3, is a digital audio Encoder format using a form of lossy data compression. It is a common audio format for consumer audio storage, as well as a de facto standard encoding for the transfer and playback of music on digital audio players....
 use a psychoacoustic model to increase compression ratios; in the design of (high end) audio systems for accurate reproduction of music in theatres and homes; as well as defense systems where scientists have experimented with limited success in creating new acoustic weapons, which emit frequencies that may impair, harm, or kill (see ). It is also applied today within music, where musicians and artists continue to create new auditory experiences by masking unwanted frequencies of instruments, causing other frequencies to be enhanced. Yet another application is in design of small or lower-quality loudspeakers, which use the phenomenon of missing fundamental
Missing fundamental

A sound is said to have a missing fundamental, suppressed fundamental, or phantom fundamental when its overtones suggest a fundamental frequency but the sound lacks a component at the fundamental frequency itself....
s to give the effect of low frequency bass notes that the system, due to frequency limitations, cannot actually reproduce (see references).

See also


Footnotes


Notations

  • E. Larsen and R.M. Aarts (2004), , J. Wiley.
  • E. Larsen and R.M. Aarts (2002), , J. Audio Eng. Soc., March, 50 (3), pp. 147-164.
  • T. Oohashi, N. Kawai, E. Nishina, M. Honda, R. Yagi, S. Nakamura, M. Morimoto, T. Maekawa, Y. Yonekura, and H. Shibasaki. The role of biological system other than auditory air-conduction in the emergence of the hypersonic effect http://dx.doi.org/10.1016/j.brainres.2005.12.096. Brain Research, 1073:339–347, February 2006.


External links