|
Sensible Stereo Recording-- Auditory Scene creation -- Recording what we hear -- Experimental results -- Theory -- SRA --
Under construction: Creating the Gestalt of the Auditory ScenePage 1 - Page 2 - Page 3 - Page 4
1.3.1 The sound of a single loudspeaker in a roomLet's look at the simple case of a single loudspeaker in a reverberant room. Wall reflections, FIG. 16, can be modeled by image sources, which replace the walls. FIG 17 shows the intersection of three orthogonal mirror surfaces as in a room corner. It can be seen that there are seven images of the single loudspeaker, all radiating sound but to varying degree. If the ceiling were added to the three walls, then there would be images added in the ceiling corner, and their images in the floor corner, and so on. If a second side wall and a front wall are added so that the room is closed and has eight corners, then the total number of image sources becomes very large. But as the image distances from the corner increases from 1st order, to 2nd order, to 3rd order reflections and so on, their strength also decreases due absorption and diffusion by the wall surface properties. Still, as long as there are reflections there is never just a single loudspeaker in the room, but many copies and seen in different orientations. The situation is not unlike having another person in the room who is talking to you. She too has many acoustic images and if the room is very reverberant she may be hard to understand. In an anechoic room there is no issue with intelligibility, but we feel uneasy. Between those two extremes of reverberation lies a comfort zone for normal living spaces. Cathedrals, concert halls, auditoria or office spaces have their unique and different requirements for sound reverberation. What applies to acoustically large spaces does not necessarily translate to living rooms, which are considered as acoustically small below their Schroeder frequency, i.e. typically below 200 Hz. What constitutes an acoustically comfortable living room? To me it is a room with RT60 around 500ms, which is much more live than a German recording studio built to the RT60 standard of 250ms. Even Tonmeisters agree that this is not an environment to listen for pleasure, but a working space to analyze the recording. They speculated that the PLUTO loudspeaker would not be suitable in the studio, but they would like it in a living room for enjoyment and checking how their recording translates.
As an experiment to show the interaction of a single loudspeaker with a room I will use a 1 kHz toneburst, FIG. 18, applied to the left ORION, FIG. 19, and then applied to the left PLUTO after it has been moved into the previous location of the ORION. The loudspeaker outputs are recorded with a stereo microphone from the listening position A in FIG. 4.
FIG. 20 Direct and reflected signal response during the first 200ms as recorded from listening position A for ORION and PLUTO. Though ORION and PLUTO reproduce the same 1 kHz burst the response of the room is stronger and different for PLUTO as it is for ORION. PLUTO radiates omni-directional while ORION radiates bi-directional in dipole fashion. Reflected signals that arrive within 10 ms of the direct sound from nearby surfaces or objects add to the burst received by the microphone and cannot be resolved visually. To do so would require to increase the burst frequency, e.g. to 3 kHz for a 3.3 ms duration test signal. The room response at the higher burst frequency would be indicative of the radiation pattern of the loudspeakers at the higher frequencies and the reflectivity of the room around 3 kHz. In general it should be noted that the reflection pattern and the reverberant field at the listening position is a direct function of the 3-D polar response of the loudspeaker and its variation with frequency. A corresponding situation occurs in the concert hall, where any musical instrument that is being played determines the gestalt of its direct and reverberated sound at the recording microphone location due to its directivity. Thus, when a musician is recorded in an isolation booth and reverberation added later, it is highly unlikely that the reverberation will have the natural characteristics of the same musical instrument in a particular venue. The reflections in FIG. 20 are above the threshold of detection in FIG. 9. They are below the drift threshold in FIG. 10 for the first 50 ms or so but not for greater delays. Since we are dealing with a large number of reflections and a situation that is quite different from the cases that were investigated in those two figures above, one must be careful when drawing conclusions about how we are likely to perceive the acoustic event. You can apply the 1 kHz burst test signal to your own loudspeaker and hear for yourself. I generated the burst using the f(x) Expression Evaluator in Goldwave under Tools. The expression is: (0.5-0.5*cos(2*pi*t*f/x))*sin(2*pi*t*f) with x=10 and f=1000. It creates a 1000 Hz sinewave, which is 100% amplitude modulated by a 100 Hz raised-cosine wave. I copied one modulation cycle and pasted it three times at 500 ms intervals into a New wave of 2 s duration to generate Track 05-LL-1kHz_bursts_500ms.wav. You can use this technique to generate tonebursts of different frequency, number of cycles or repetition rate for testing purposes, particularly for high level stress testing of devices that might otherwise be damaged by overheating from continuous signals. a - Track 05-LL-1kHz_bursts_500ms.wav b - Track 06-LL-1kburst-orion-pluto.mp3 c - Track 07-LL-anechoic-female-male-music.wav d - Track 08-LL-left_orion.mp3 e - Track 09-LL-left_pluto.mp3 f - Track 10-LL-left_SL.mp3 g - Track 11-LL-left_orion-SL_replay.mp3 h - Track 12-LL-left_SL_EL.mp3 Note that my voice appears at the loudspeaker, not 30 cm in front of you, as was the distance from my mouth to the microphone. If this were a true 3-D reproduction, then my voice would have been this close to you. Instead it is located at the loudspeaker. We process directional and distance cues in the direct and reflected sounds of any source of sound. In this case we localize the voice at the loudspeaker. My voice has little recorded reverberation, but the voice of my wife does and is at lower volume. In the AS she is placed at some distance from me. The real distance to the loudspeaker from where ever I am in the room is also the minimum distance between me and the AS when I close my eyes. Last night I attended a concert, FIG.1, and made some
related observations about single loudspeaker sound reproduction. The lecturer
of the pre-concert music talk had a microphone. A loudspeaker high above the
stage amplified his voice but the volume was kept low and the precedence effect
[2] prevented me from localizing the loudspeaker as a source separate from the
speaking person. Nevertheless it was obvious to me that a loudspeaker was
involved. His voice had an unnatural overlay due to the strong reverberation of
the loudspeaker's output by the hall. Furthermore the reverberation was colored
by the polar response of the loudspeaker. I have often wondered how we recognize whether a sound was made by a real instrument or by a loudspeaker, especially when the sound comes from some distance, like through the hallway in a building or from an open window. It must be the reverberant sound, which is characteristic for every instrument or source. The physical construction, size and geometry of a source determines its radiation pattern. Since we live in reflective environments each source of sounds assumes a gestalt. We recognize the source by its gestalt even when the environment changes. We can differentiate a piano from the loudspeaker reproduction of a piano even when the sound that we hear has lost high or low frequencies or envelope distortion in its acoustic transmission from a distant place. I draw several conclusions from the listening tests for sound reproduction from a single loudspeaker in a reverberant space:
The polar response of the loudspeaker, the spatial context in the recording and the listening room are important contributors to creating the Gestalt of the AS. With currently used methods of sound recording and reproduction we are merely trying to fool the mind. Therefore it should be important to understand the cues or tricks that we fall for and to avoid those that destroy the illusion. Physical reconstruction of the wave field that existed at some location in space during the original acoustic event is still at the experimental stage. If realized it could give us a taste of time travel. Stereo loudspeakers in a room are not up to that task, only binaural is, FIG. 8.
1.3.2 The sound of two loudspeakers in a room
I am in awe of how the brain has evolved to perceive, locate and recognize multiple, individual sources of sound in environments of greatly different reflective properties. To quote from Bregman [20]: "Sound is a pattern of pressure waves moving through
the air, each sound producing event creating its own wave pattern. The human brain recognizes these patterns as indicative of the
events that give rise to them: a car going by, a violin playing, a woman
speaking, and so on. Unfortunately by the time the sound has reached the ear,
the wave patterns arriving from the individual events have been added together
in the air so that the pressure wave that reaches the eardrum is the sum of the
pressure patterns coming from the individual events. The summed pressure wave
need not resemble the wave patterns of the individual sounds. Physics Today carried in 2011 an article, "Listening in on the listening brain", which indicates that hearing the eardrum signals involves a bi-directional stream of information - from inner ear to brain and from brain to inner ear. A short 0.1 ms click stimulates approximately 7 ms of brain activity. How to make sense of two loudspeakers emitting identical or nearly identical wave patterns in a reflective environment, as with stereo, would seem to be a particularly difficult and confusing problem in hearing. In evolutionary terms, there is no natural precedence for such situation. Maybe a wolf pack can create a similar acoustic event. Hearing phantom sources is like an escape from reality, from hearing the two loudspeakers, which are readily recognized, and the familiar room. Hiding loudspeakers and room should therefore be the challenge for every loudspeaker designer and recording engineer in order to create a believable auditory illusion, an Auditory Scene of a different reality, that of listening to musicians performing Copland's Organ Symphony in Davies Symphony Hall, for example.
i - Track 05-LL-1kHz_bursts_500ms.wav j - Track 06-LL-1kburst-orion-pluto.mp3 k - Track 07-LL-anechoic-female-male-music.wav l - Track 08-LL-left_orion.mp3 m - Track 09-LL-left_pluto.mp3 n - Track 10-LL-left_SL.mp3 o - Track 11-LL-left_orion-SL_replay.mp3 p - Track 12-LL-left_SL_EL.mp3
---------------------------------------------------------------------------------------------------------------------- impulse response inside the brain
-- Auditory Scene creation -- Recording what we hear -- Experimental results -- Theory -- SRA -- |