聲音的質(zhì)量是指經(jīng)傳輸、處理后音頻信號(hào)的保真度。目前,業(yè)界公認(rèn)的聲音質(zhì)量標(biāo)準(zhǔn)分為4級(jí),即數(shù)字激光唱盤(pán)CD-DA質(zhì)量,其信號(hào)帶寬為10Hz~20kHz;調(diào)頻廣播FM質(zhì)量,其信號(hào)帶寬為20Hz~15kHz;調(diào)幅廣播AM質(zhì)量,其信號(hào)帶寬為50Hz~7kHz;電話(huà)的話(huà)音質(zhì)量,其信號(hào)帶寬為200Hz~3400Hz??梢?jiàn),數(shù)字激光唱盤(pán)的聲音質(zhì)量,電話(huà)的話(huà)音質(zhì)量。除了頻率范圍外,人們往往還用其它方法和指標(biāo)來(lái)進(jìn)一步描述不同用途的音質(zhì)標(biāo)準(zhǔn)。
The quality of sound refers to the fidelity of audio signal after transmission and processing. At present, the recognized sound quality standard in the industry is divided into four levels, namely, the quality of digital compact disc CD-DA, and its signal bandwidth is 10Hz ~ 20kHz; FM broadcasting quality, with signal bandwidth of 20Hz ~ 15KHz; AM broadcast quality, with signal bandwidth of 50Hz ~ 7kHz; The voice quality of the telephone, and its signal bandwidth is 200Hz ~ 3400Hz. It can be seen that the sound quality of digital compact disc is the highest and that of telephone is the lowest. In addition to the frequency range, people often use other methods and indicators to further describe the sound quality standards for different purposes.
音響
sound
對(duì)模擬音頻來(lái)說(shuō),再現(xiàn)聲音的頻率成分越多,失真與干擾越小,聲音保真度越高,音質(zhì)也越好。如在通信科學(xué)中,聲音質(zhì)量的等級(jí)除了用音頻信號(hào)的頻率范圍外,還用失真度、信噪比等指標(biāo)來(lái)衡量。
For analog audio, the more frequency components of the reproduced sound, the less distortion and interference, the higher the sound fidelity and the better the sound quality. For example, in communication science, the level of sound quality is measured not only by the frequency range of audio signal, but also by indicators such as distortion and signal-to-noise ratio.
對(duì)數(shù)字音頻來(lái)說(shuō),再現(xiàn)聲音頻率的成分越多,誤碼率越小,音質(zhì)越好。通常用數(shù)碼率(或存儲(chǔ)容量)來(lái)衡量,取樣頻率越高、量化比特?cái)?shù)越大,聲道數(shù)越多,存儲(chǔ)容量越大,當(dāng)然保真度就高,音質(zhì)就好。
For digital audio, the more components of the reproduced sound frequency, the smaller the bit error rate and the better the sound quality. Usually measured by digital rate (or storage capacity), the higher the sampling frequency, the larger the number of quantization bits, the more channels, and the larger the storage capacity. Of course, the higher the fidelity and the better the sound quality.
聲音的類(lèi)別特點(diǎn)不同,音質(zhì)要求也不一樣。如,語(yǔ)音音質(zhì)保真度主要體現(xiàn)在清晰、不失真、再現(xiàn)平面聲象;樂(lè)音的保真度要求較高,營(yíng)造空間聲象主要體現(xiàn)在用多聲道模擬立體環(huán)繞聲,或虛擬雙聲道3D環(huán)繞聲等方法,再現(xiàn)原來(lái)聲源的一切聲象。
The category and characteristics of sound are different, and the sound quality requirements are also different. For example, the fidelity of voice quality is mainly reflected in clarity, no distortion and reproduction of plane sound image; The fidelity of music is required to be high. The creation of spatial sound image is mainly reflected in the use of multi-channel simulation of three-dimensional surround sound, or virtual two-channel 3D surround sound and other methods to reproduce all sound images of the original sound source.

音頻信號(hào)的用途不同,采用壓縮的質(zhì)量標(biāo)準(zhǔn)也不一樣。如,電話(huà)質(zhì)量的音頻信號(hào)采用ITU-TG?711標(biāo)準(zhǔn),8kHz取樣,8bit量化,碼率64Kbps。AM廣播采用ITU-TG?722標(biāo)準(zhǔn),16kHz取樣,14bit量化,碼率224Kbps。高保真立體聲音頻壓縮標(biāo)準(zhǔn)由ISO和ITU-T聯(lián)合制訂,CD11172-3MPEG音頻標(biāo)準(zhǔn)為48kHz、44.1kHz、32kHz取樣,每聲道數(shù)碼率32Kbps~448Kbps,適合CD-DA光盤(pán)用。
Audio signals are used for different purposes, and the quality standards of compression are also different. For example, ITU-TG? 711 standard, 8kHz sampling, 8bit quantization, code rate 64Kbps. ITU-TG? 722 standard, 16KHz sampling, 14bit quantization, code rate 224kbps. The high fidelity stereo audio compression standard is jointly formulated by ISO and ITU-T. the cd11172-3mpeg audio standard is 48Khz, 44.1KHz and 32kHz sampling, and the digital rate of each channel is 32kbps ~ 448kbps, which is suitable for CD-DA discs.
對(duì)聲音質(zhì)量要求過(guò)高,則設(shè)備復(fù)雜;反之,則不能滿(mǎn)足應(yīng)用。一般以"夠用,又不浪費(fèi)"為原則。
If the requirements for sound quality are too high, the equipment is complex; On the contrary, it cannot meet the application requirements. It is generally based on the principle of "enough without waste".
語(yǔ)音音質(zhì):評(píng)定語(yǔ)音編碼的質(zhì)量目前常用的是主觀評(píng)定,即以主觀打分 (MOS)來(lái)度量,它分為以下五級(jí):5(優(yōu)),不察覺(jué)失真;4(良),剛察覺(jué)失真,但不討厭;3(中),察覺(jué)失真,稍微討厭;2(差),討厭,但不令人反感;1(劣),極其討厭,令人反感。一般再現(xiàn)語(yǔ)音頻率若達(dá)7kHz以上,MOS可評(píng)5分。這種評(píng)價(jià)標(biāo)準(zhǔn)廣泛應(yīng)用于多媒體技術(shù)和通信中,如可視電話(huà)、電視會(huì)議、語(yǔ)音電子郵件、語(yǔ)音信箱等。
Speech quality: subjective evaluation is commonly used to evaluate the quality of speech coding, that is, it is measured by subjective scoring (MOS), which is divided into the following five levels: 5 (excellent), undetected distortion; 4 (good), just aware of distortion, but not annoying; 3 (middle), perceived distortion, slightly annoying; 2 (poor), annoying, but not offensive; 1 (inferior), extremely annoying and disgusting. Generally, if the reproduction speech frequency reaches more than 7kHz, MOS can be rated as 5 points. This evaluation standard is widely used in multimedia technology and communication, such as videophone, video conference, voice email, voice mailbox and so on.
樂(lè)音音質(zhì):樂(lè)音音質(zhì)的優(yōu)劣取決于多種因素,如聲源特性(聲壓、頻率、頻譜等)、音響器材的信號(hào)特性(如失真度、頻響、動(dòng)態(tài)范圍、信噪比、瞬態(tài)特性、立體聲分離度等)、聲場(chǎng)特性(如直達(dá)聲、前期反射聲、混響聲、兩耳間互相關(guān)系數(shù)、基準(zhǔn)振動(dòng)、吸聲率等)、聽(tīng)覺(jué)特性(如響度曲線(xiàn)、可聽(tīng)范圍、各種聽(tīng)感)等。所以,對(duì)音響設(shè)備再現(xiàn)音質(zhì)的評(píng)價(jià)難度較大。
Music sound quality: the quality of music sound depends on many factors, such as sound source characteristics (sound pressure, frequency, spectrum, etc.), signal characteristics of sound equipment (such as distortion, frequency response, dynamic range, signal-to-noise ratio, transient characteristics, stereo separation, etc.), sound field characteristics (such as direct sound, early reflection, reverberation, cross-correlation coefficient between ears, reference vibration, sound absorption rate, etc.) Auditory characteristics (such as loudness curve, audible range, various auditory senses), etc. Therefore, it is difficult to evaluate the reproduction sound quality of audio equipment.
我們通常用下列兩種方法:一是使用儀器測(cè)試技術(shù)指標(biāo);二是憑主觀聆聽(tīng)各種音效。由于樂(lè)音音質(zhì)屬性復(fù)雜,主觀評(píng)價(jià)的個(gè)人色彩較濃,而現(xiàn)有的音響測(cè)試技術(shù)又只能從某些側(cè)面反映其保真度。所以,迄今為止,還沒(méi)有一個(gè)能真正定量反映樂(lè)音音質(zhì)保真度的國(guó)際公認(rèn)的評(píng)價(jià)標(biāo)準(zhǔn)。但也有報(bào)道,國(guó)際電信聯(lián)盟(ITU-T)近期已批準(zhǔn)一種客觀評(píng)價(jià)音質(zhì)的被稱(chēng)之為電子耳的新型測(cè)量方法,可對(duì)任何音響器材的音質(zhì)進(jìn)行客觀聽(tīng)音評(píng)價(jià),也可用于檢測(cè)電話(huà)通訊語(yǔ)音編碼系統(tǒng)的缺陷。
We usually use the following two methods: one is to use instruments to test technical indicators; Second, listen to various sound effects subjectively. Due to the complexity of music quality attributes, the personal color of subjective evaluation is strong, and the existing sound testing technology can only reflect its fidelity from some aspects. So far, there is no internationally recognized evaluation standard that can truly quantitatively reflect the fidelity of music quality. However, it is also reported that the International Telecommunication Union (ITU-T) has recently approved a new measurement method called electronic ear, which can objectively evaluate the sound quality of any audio equipment, and can also be used to detect the defects of telephone communication speech coding system.
通常,據(jù)樂(lè)音音質(zhì)聽(tīng)感三要素,即響度、音調(diào)和愉快感的變化和組合來(lái)主觀評(píng)價(jià)音質(zhì)的各種屬性,如低頻響亮為聲音豐滿(mǎn),高頻響亮為聲音明亮,低頻微弱為聲音平滑,高頻微弱為聲音清澄。下面結(jié)合聲源、聲場(chǎng)及信號(hào)特性介紹幾種典型的聽(tīng)感。
Generally, according to the changes and combinations of the three elements of musical sound quality, namely loudness, tone and pleasure, various attributes of sound quality are subjectively evaluated. For example, low-frequency loudness is full sound, high-frequency loudness is bright sound, low-frequency weakness is smooth sound, and high-frequency weakness is clear sound. Next, several typical listening senses are introduced in combination with sound source, sound field and signal characteristics.
立體感。主要由聲音的空間感(環(huán)繞感)、定位感(方向感)、層次感(厚度感)等所構(gòu)成的聽(tīng)感,具有這些聽(tīng)感的聲音稱(chēng)為立體聲。自然界的各種聲場(chǎng)本身都是富有立體感的,它是模擬聲源聲象重要的一個(gè)特征。德?波爾效應(yīng)證明,人耳的生理特點(diǎn)是:人耳在兩聲源的對(duì)稱(chēng)軸上,當(dāng)聲壓差△p=0dB和時(shí)間差△t=0ms時(shí),感覺(jué)兩聲源聲象相同,分不出有兩個(gè)聲源;而當(dāng)△p>15dB或△t>3ms時(shí),人耳就感覺(jué)到有兩個(gè)聲源,聲像往聲壓大或?qū)暗穆曉匆苿?dòng),每5dB的聲壓差相當(dāng)于lms的時(shí)間差。哈斯效應(yīng)又進(jìn)一步證明,當(dāng)△t=5ms~35ms時(shí),人耳感到有兩個(gè)聲源;而當(dāng)近次反射聲、滯后直達(dá)聲或兩個(gè)聲源的時(shí)間差△t>50ms時(shí),即使一次反射聲(又稱(chēng)近次或前期反射聲)或滯后聲的響度比直達(dá)聲或?qū)奥暤捻懚却笤S多倍,聲源方位仍由直達(dá)聲或?qū)奥暃Q定。
Three dimensional sense. The sense of hearing is mainly composed of the sense of space (sense of surround), the sense of positioning (sense of direction), the sense of hierarchy (sense of thickness), etc. the sound with these sense of hearing is called stereo. All kinds of sound fields in nature are full of three-dimensional sense. It is the most important feature of simulating the sound image of sound source. virtue? Bohr effect proves that the physiological characteristics of human ear are as follows: when the sound pressure difference △ P = 0dB and time difference △ t = 0ms are on the symmetry axis of two sound sources, the human ear feels that the sound images of the two sound sources are the same and there are no two sound sources; When △ P > 15dB or △ T > 3MS, the human ear feels that there are two sound sources. The sound image moves to the sound source with high sound pressure or in front of the guide. The sound pressure difference of every 5dB is equivalent to the time difference of LMS. Haas effect further proves that when △ t = 5ms ~ 35ms, human ears feel that there are two sound sources; When the time difference △ t between the near reflected sound, the delayed direct sound or the two sound sources is more than 50ms, even if the loudness of the primary reflected sound (also known as the near or early reflected sound) or the delayed sound is many times greater than that of the direct sound or the leading sound, the sound source orientation is still determined by the direct sound or the leading sound.
根據(jù)人耳的這個(gè)生理特點(diǎn),只要通過(guò)對(duì)聲音的強(qiáng)度、延時(shí)、混響、空間效應(yīng)等進(jìn)行適當(dāng)控制和處理,在兩耳人為的制造具有一定的時(shí)間差△t、相位差△θ、聲壓差△P的聲波狀態(tài),并使這種狀態(tài)和原聲源在雙耳處產(chǎn)生的聲波狀態(tài)完全相同,人就能真實(shí)、完整地感受到重現(xiàn)聲音的立體感。與單聲道聲音相比,立體聲通常具有聲象分散、各聲部音量分布得當(dāng)、清晰度高、背景噪聲低的特點(diǎn)。
According to the physiological characteristics of the human ear, as long as the sound intensity, delay, reverberation and spatial effect are properly controlled and processed, there is a certain time difference △ T and phase difference △ in the artificial manufacturing of the two ears θ、 The sound wave state of sound pressure difference △ P, and make this state exactly the same as the sound wave state generated by the original sound source at both ears, so that people can truly and completely feel the three-dimensional sense of reproduced sound. Compared with mono sound, stereo usually has the characteristics of sound image dispersion, proper volume distribution of each sound part, high definition and low background noise.