US20130136282A1
2013-05-30
13/489,281
2012-06-05
The present invention features systems for enhancing audio signals to correct across a spectrum of frequencies according to a model of the spectral characteristics of hearing loss. The methods of the present invention estimate hearing loss using a linear function of the critical band center frequency. The systems of the present invention compute a user-determined degree of correction to sounds at varying frequencies, allowing a listener to hear sounds, across varying frequencies, as the listener wishes to hear them without needing to raise the volume of the sounds to potentially damaging levels. Systems may be incorporated into apparatuses including but not limited to personal communications devices, virtual audio ports/channels and media players.
Get notified when new applications in this technology area are published.
H03G9/005 » CPC main
Combinations of two or more types of control, e.g. gain control and tone control of digital or coded signals
H03G3/32 » CPC further
Gain control in amplifiers or frequency changers without distortion of the input signal; Automatic control in amplifiers having semiconductor devices the control being dependent upon ambient noise level or sound level
H03G9/025 » CPC further
Combinations of two or more types of control, e.g. gain control and tone control in untuned amplifiers frequency-dependent volume compression or expansion, e.g. multiple-band systems
H03G3/20 IPC
Gain control in amplifiers or frequency changers without distortion of the input signal Automatic control
H04R25/00 IPC
Deaf-aid sets, i.e. electro-acoustic or electro-mechanical hearing aids; Electric tinnitus maskers providing an auditory perception
The present non-provisional patent application claims benefit to the earlier priority date of provisional patent application Ser. No. 61/564,945, filed Nov. 30, 2011, and application Ser. No. 61/564,942, filed Nov. 30, 2011, the disclosures of which are incorporated in their entirety herein by reference.
The present invention relates to systems, apparatuses, methods, and computer-readable media in the field of audio processing for the correction of hearing loss and for personalization of sound. The present invention may include but is not limited to mobile phones (and the like), headsets (and the like), virtual audio ports (and the like) and music players (and the like).
Hearing loss is a common result of aging for most humans which can also be produced by excessive exposure to loud noises. Most humans begin to lose awareness of high frequency noises in their twenties. Hearing loss starts in the higher frequency register and gradually progresses to lower registers. Thus individuals at many age ranges may feel the need to personalize the spectrum composition of sound to tune it to their personal preferences and profiles of hearing registers.
Iso-loudness contours are most often displayed in terms of decibel (âdBâ) intensity versus log frequency. The log frequency axis provides excessive emphasis on the lower frequencies and less emphasis on high frequencies where audiological damage most often occurs in sensioneural hearing loss.
It can be observed that everyone is âimpairedâ at higher frequencies unless the volume of sounds at those frequencies are very loud. Most of the spice of music is found in the high frequency regionâsibilant speech, breathiness, cymbals, etc. Further, nearly everyone enjoys music more when it is played loudly. That gives that the opportunity to hear this musical spice. By using the present invention, users can gain a sense of enjoyment without needing to raise the volume of music and sounds to potentially damaging levels.
When offered the opportunity to listen through the corrective systems of the present invention, based on the hearing equations discussed herein, people with very normal hearing uniformly prefer some degree of modulation of an audio signal at varying frequencies.
Since hearing loss progresses exponentially from higher to lower frequencies for all individuals, but the amount of hearing loss varies from one person to another, the linear function will hold for most people, but the slope of the line will vary from one individual to another.
In some prior art systems, an individualized audiogram is used to measure hearing loss for a particular individual. The sound can then be spectrally corrected using this stored audiogram pattern to reproduce a spectral correct sound in the individual's perception. This method is disadvantageous since it requires a significant number of parameters to be stored and these parameters are cumbersome and difficult to obtain and adjust accurately, even when performed by a medical professional.
The present invention features systems for enhancing audio signals to adjust sound across a spectrum of frequencies according to a model of the spectral characteristics of hearing loss, using a single tunable input. The methods of the present invention estimate threshold elevations which can correspond to hearing loss and also personalized sound, from a function of critical band and a single user input parameter, where a changein the user input parameter relates monotonically to a change in the slope with respect to frequency of the estimated hearing loss pattern. The systems of the present invention compute a user-determined degree of correction to sounds at varying frequencies, allowing a listener to hear sounds, across varying frequencies, as the listener wishes to hear them without needing to raise the volume of the sounds to potentially damaging levels. Systems may be incorporated into apparatuses including but not limited to mobile phones, headsets, virtual audio ports and media players.
Allowing a user to control the slope of the correction pattern through a user adjustable setting allows the user to tune audio corrections, across a frequency spectrum, to his or her taste through a single adjustment.
Any feature or combination of features described herein are included within the scope of the present invention provided that the features included in any such combination are not mutually inconsistent as will be apparent from the context, this specification, and the knowledge of one of ordinary skill in the art. Additional advantages and aspects of the present invention are apparent in the following detailed description.
The following information regarding terms is non-limiting and exemplary in nature for the purpose of understanding the spirit of the invention
C îą îą B îą îą R îą ( F kHz ) = 26.81 1 + 1.960 F kHz - 0.53
G=g0+(PâT)*(1/Râ1)
P SIG α + P NOISE α - P THRQ α α P SIG
FIG. 1 is a drawing of a possible embodiment for the systems architecture of the present invention.
FIG. 2 is a drawing of a possible embodiment for the systems architecture of the present invention, using analog circuitry for pre and post processing of the signal.
FIG. 3 is a drawing of a possible embodiment for the systems architecture of the present invention, using a digital filter bank for the Frequency Analysis Module.
FIG. 4 Illustrates multiple possible methods for estimating hearing loss to be used to compute hearing loss corrections.
FIG. 5 is a drawing of a possible process flow for the present invention using a series implementation.
FIG. 6 is a drawing of a possible process flow for the present invention using a parallel implementation.
FIG. 7 is a drawing of a possible process flow for the present invention using complete Fourier transforms of the audio device instead of filter banks.
FIG. 8 is a drawing of one alternate embodiment of the apparatus for the present invention where the present invention is effective for generating a spectrum of personalized threshold elevations.
FIG. 9 is a drawing of a second alternate embodiment of the apparatus for the present invention dealing with ambient noise.
FIG. 10 is a drawing of a third alternate embodiment on computer-readable medium.
As shown in FIG. 1, a possible embodiment of the present invention comprises an audio device, an amplifier, a speaker and a microprocessor where a number of software applications are executed by a microprocessor. As shown, said software applications can comprise a series of converters and computational applications for calculating and generating the complex mathematical values that help to enable the invention along with the physical components which can be configured to communicate and operate with the same.
As shown in FIG. 1, in some embodiments, a process flow for the present invention comprises the following steps:
C îą îą B îą îą R îą ( F kHz ) = 26.81 1 + 1.960 F kHz - 0.53
X0=[FâY)]Ăb
In some embodiments, as shown in FIG. 2, the apparatus 100 comprises analog circuitry which preprocesses the signal before inputting it to the microprocessor, including an analog filter bank, which is then input to the microprocessor 01 which performs only the gain computations, which are then output and used by analog op-amp and summer block circuits. In this case, the microprocessor may only configured to execute applications to compute the corrective gains. A possible process flow for this embodiment comprises the following steps:
In some embodiments, as shown in FIG. 3, the apparatus comprises an audio device, an amplifier, a speaker and a microprocessor where a number of software applications are executed by a microprocessor. Instead of using a digital filter bank, however, it may be advantageous to use an explicit computation of the Fourier transform. In this case, the microprocessor may be configured to execute applications for:
A possible process flow for this embodiment may comprise the following steps:
In some embodiments, various other measures of critical band frequency may be used, including Bark, frequency, Mel frequency, and others.
In some embodiments, all or part of the present invention could be implemented using a parallel processing architecture such that some or all of the set of computations from step 3 to 5 above may be computed simultaneously using multiple processing units.
In some embodiments, the point where the estimate of X0 is computed, may be earlier in the sequence relative to processing of the source signal.
In some embodiments, analog-to-digital and digital-to-analog converters can be assumed to exist at interface points between analog elements and digital elements of any embodiment.
In some embodiments, programmable logic devices, including but not limited to FPGAs, may be used to implement parts of the processing shown in FIG. 2, with appropriate interfaces implemented between the microprocessor and the programmable logic devices.
In some embodiments, such as in FIG. 3, FFT co-processors could be used to perform the function of the filter banks, such that the rest of the apparatus operates at each point in the frequency domain instead of on a number of channels.
In some embodiments, the user adjustable input, sometimes referred to herein as the âinput deviceâ, âinput sourceâ or âinput moduleâ is connected to a potentiometer or a variable resistor, which thus produces an output voltage on the electronic device which can be read by the microprocessor through an A/D converter.
In some embodiments, the user adjustable input, sometimes referred to herein as âinput deviceâ, âinput sourceâ or âinput moduleâ may also be a stored setting that is adjusted through an electronic menu system using buttons to select menu parameters, or a touchscreen device in which buttons and inputs are detected when the users touches the screen or uses an implement to touch the screen.
In some embodiments, the user selectable input, sometimes referred to herein as âinput deviceâ, âinput sourceâ or âinput moduleâ might also be controlled through a voice command menu, for use by physically disabled people who are unable to adjust a physical input device.
In some embodiments, as explained by FIG. 5 and its description, the present invention can be enabled on a computer-readable medium 70 storing a set of instructions executable by one or more microprocessors, where the computer-readable medium 70 automatically adjusts a sound signal across a spectrum of frequencies according to a user adjustable setting, wherein the value of the gains are computed according to a psychoacoustic model, for which the model takes an estimated threshold elevation X0, which represents the user's hearing impairment, or threshold elevation corresponding to a preferred sound adjustment, at a given frequency F, where X0 is estimated using a linear function of the critical band center frequency F, where the Linear Slope equation is
X0=[FâY)]Ăb
where X0 is the threshold elevation, the b is the dB/critical band ratio, a setting adjustable by the user 03, where Y is a value selected from a range of 2-3 Bark, and where F is the current critical band center frequency;
The computer-readable medium comprising:
C îą îą B îą îą R îą ( F kHz ) = 26.81 1 + 1.960 F kHz - 0.53
Alternatively, in some embodiments, as shown in FIG. 7, explicit computation of the Fourier transform may be used to determine appropriate gains across the full frequency spectrum. This embodiment can be enabled on a computer-readable medium 70 storing a set of instructions executable by one or more microprocessors, where the computer-readable medium 70 automatically adjusts proper hearing corrections across the frequency spectrum according to a user adjustable setting, wherein the value of the gains are computed according to a psychoacoustic model, for which the model takes an estimated hearing loss X0, which represents the user's hearing impairment at a given frequency F,
the computer-readable medium comprising:
In some embodiments, as shown in FIGS. 5 and 6, the present invention may be implemented either in parallel or in series, or with some parts of the process implemented in parallel and others implemented in series.
In some embodiments, use of a parallel processing device to compute gains for multiple channels simultaneously may improve processing speed
In some embodiments, steps for converting analog audio input signals to digital input signals can be bypassed where the invention utilizes digital input audio devices capable of receiving digital audio signals and transmitting the same to the processor.
The present invention is concerned with the estimation of hearing loss using a simple adjustable setting and its implementation in audio devices, and may be used in combination with many different possible psychoacoustic models. The inputs to these models may be in units of dBFS, dBSPL, or dBHL or a number of other measures of sound intensity, and the model may output gains in dBFS, dBHL, or dBSPL and be converted to another unit before being applied to the source.
The present invention, in some embodiments, may be combined with a number of possible known psychoacoustic models, derived from the audiology literature, which are used to compute gains needed to amplify sound to overcome the effects of ambient noise on sound perception or gains needed to account for preferred threshold elevations computed for users' of normal hearing. The gains may be computed over the entire spectrum or by dividing the spectrum up into any number of smaller bandwidth or frequency components.
An example of a psychoacoustic model which may be combined with the present invention may be found in works such as (Moore, Brian C. et al., âA model for the prediction of thresholds, loudness and partial loudnessâ, Journal of the Audio Engineering Society, JAES Volume 45 Issue 4 pp. 224-240; April (1997)) (Also available at http://www.aes.org/e-lib/browse.cfm?elib=10272) and (Rosengrad, Peniah, S., âRelationship Between Measures Related to the Cochlear Active Mechanism and Speech Reception Thresholds in backgrounds with and without Spectral and/or Temporal Fluctuationsâ PhD Thesis MIT (2004)) (also available at http://hdl.handle.net/1721.1/28598). These models define a mathematical relationship between the sound impinging on the ear and the apparent loudness of the sound as perceived by a human.
The models above may be used to derive a formula for the gains needed to amplify sound and the formula may be written:
P SIG α + P NOISE α - P THRQ α α P SIG
In some embodiments, depending on the nature of the psychoacoustic model used, additional analog processing may occur in analog implementations, such as various circuitry which can estimate the power of the signal in the various frequency channels before inputting this information to the microprocessor.
In some embodiments, the present invention is concerned with the use of a single user input to control the computation with respect to frequency of the resulting hearing loss estimate, this computation being facilitated by the fact that a typical hearing loss profile is linear with respect to critical band center frequency to within +/â10%. In one exemplary embodiment, as in FIG. 1, an explicit linear computation is used to estimate the hearing loss. It will be apparent to those skilled in the art that there are other methods to perform this computation, for instance by using the user input to interpolate between stored patterns of increasing slope, as in FIG. 4. So long as the stored patterns are linear with respect to critical band center frequency (to within +/â10%) and the user input relates monotonically to the slope of the hearing loss estimate with respect to frequency, these modifications do not exceed the scope of this invention.
In embodiments where a pattern library or lookup table is used (FIG. 4 (middle)), the elements in the pattern library of lookup table may be obtained by collecting audiograms across a significant population and finding mean hearing loss patterns for varying degrees of hearing loss, thereby producing patterns of varying slope with respect to critical band center frequency.
In some embodiments, adjustments to the underlying linear estimation may be added to the linear equation to produce variations in the hearing loss. For instance, these adjustments may be derived from higher order polynomial equation that modifies the estimated hearing loss, where the adjustments are within +/â10% of the linear slope with respect to critical band center frequency.
In some embodiments, adjustments to the underlying linear estimation may be added to the linear equation or pattern library, to produce variations in the hearing loss to more accurately compensate for the average hearing loss profile of humans. For instance, adjustments derived from collecting audiograms across a large population.
In some embodiments, adjustments to the underlying linear estimation may be added to the linear equation or pattern library, to produce variations in the hearing loss to more accurately compensate for an individual's hearing loss profile. For instance, these adjustments may be derived from an audiogram for that individual.
In some embodiments, a library of stored hearing loss patterns, for which a varying slope, linear in critical band center frequency, could be fit to each pattern, may be stored in memory, with the user input selecting between or interpolating between stored patterns, where the patterns are arranged in order of increasing or decreasing slope, such that monotonic changes in the user input are translated into monotonic changes in the slope of the pattern being used, and the patterns remain linear in critical band center frequency to within +/â5 dBHL.
In some embodiments, a lookup table may be stored in memory, where for each quantized value of the frequency and user input, a hearing loss is returned, where the hearing loss varies monotonically with respect to both the frequency and user input. (Alternatively, the hearing loss returned is linear with respect to critical band center frequency to within +/â5 dBHL and the slope is monotonically increasing with respect to user input).
In some embodiments, the user adjustable input may be a setting on an electronic device such as a cell phone or music player, which the user modifies through a touch screen menu, trackpad, or other instrument which is used with the electronic device.
In an alternative embodiment, the hearing loss may be derived from an analysis of the ambient noise environment of the user. In this case, the user may be a person of normal hearing, who when subjected to ambient noise is less able to hear frequency components of the signal of interest that coincide with the frequency composition of the noise. In this instance, amplifying the sound volume in a spectrally varying manner enables a normal user to hear the spectral composition of the signal of interest properly over ambient noise.
In some embodiments the user-adjustable input may be a setting on a computer, or in a software application, which the user modifies using a push button, scrollbar, or other GUI input.
In some embodiments the present invention may be accessed via a web application or interface, where this web application resides on a web page, an electronic device such as a mobile phone, or any other general computing device.
The present invention features an apparatus for enhancing an audio signal. The apparatus, or audio device as claimed, (e.g., a mobile phone) may, for example, comprise a standard mobile phone receiver, a standard mobile phone microphone, and a standard mobile phone speaker, all of which are well known to one of ordinary skill in the art. The receiver can function to extract an amplitude of a source signal at a given frequency (or within a frequency range).
In some embodiments, the systems of the present invention can evaluate sounds within pre-determined ranges of frequencies, e.g., any appropriate set or group of ranges. Microphones, and/or receivers and/or the like can collect information for the particular frequency range (the pre-determined frequency range). In some embodiments, a first range is 500 Hz and below, a second range is between 500 Hz and 2 kHz, and a third range is above 2 kHz. In some embodiments a first range is 1 kHz and below and a second range is above 1 kHz. The present invention is not limited to the aforementioned ranges.
As shown in FIG. 8, in some broad embodiments, the present invention features an apparatus 700, effective for selecting a plurality of threshold elevations for a given user, without requiring individual measurement of each threshold elevation. As such, the apparatus calculates a plurality of spectrally varying threshold elevations across a plurality of channels, where the threshold elevations are computed from a single parameter set by a user-adjustable setting, each of the threshold elevations being represented as X0, in dBHL at a given frequency F.
In some embodiments, each of the threshold elevations can be used to determine corresponding correction gains.
In some embodiments, the apparatus further comprises an input device 703, which allows a user to set a single parameter for a personalized sound adjustment model 706. Said input device can comprise a personal computer, a tablet, laptop, smartphone, headset, or any further hardware that would be understood by one skilled in the art to be suitable for having controls for adjusting the parameter of the present invention and being in electronic communication with the microprocessor and other components of the apparatus. In some embodiments, the input device can simply be a software application that allows the user to adjust the single parameter.
The advantages of a single parameter for generating a plurality of threshold elevations, whether said threshold elevations are computed for users of normal hearing or users with impaired hearing, are many. For example, the present invention eliminates the need to complete audio tone tests in order to determine, individually, a user's hearing deficiency at various frequencies. As another example, a user with normal hearing, utilizing the present invention, would not need to adjust the volume of source signals or determine threshold elevations for multiple frequencies. Instead, the present invention, using a single user-adjustable parameter, estimates the user's preferred threshold elevations across a spectrum of frequencies all based on relationships between complex mathematical equations and the physical nature and characteristics of the human ear.
In some embodiments, the present invention is used in place of the traditional point-by-point tone audiology tone test.
In some embodiments, the apparatus comprises a microprocessor 701, in electronic communication with the input device 703, the microprocessor 701 configured to execute an application comprising the personalized sound adjustment model 706, where the personalized sound adjustment model 706 generates a spectrum of threshold elevations corresponding to a plurality of frequency components, based on the parameter set to the input device 703.
As the previous embodiment demonstrates, the present invention can generate a spectrum of threshold elevations without applying gains to the same. The benefit of this broader embodiment is a system that generates a spectrum of threshold elevations without requiring individual measurement of each, as has previously been the case with, for example, tone-tests. This embodiment does not require gains components to be novel and inventive. Thus, in some embodiments, additional components for applying gains corresponding to threshold elevations established by the present invention are not claimed.
In some embodiments, the user sets the single parameter via the input device 703, and thereafter, the input device 703 transmits information about the single parameter to the microprocessor 701, whereupon the microprocessor 701 applies the parameter to the personalized sound adjustment model 706, whereupon the personalized sound adjustment model 706, using the single parameter, generates a spectrum of threshold elevations, per frequency component, and where said threshold elevations are effective for identifying threshold levels of hearing for users with impaired hearing or for users with normal hearing who wish to optimize their perceived hearing.
As shown in FIG. 9, in some embodiments, the present invention comprises an apparatus 800, effective for measuring the power spectrum of the ambient noise in the environment of the listener where the apparatus calculates a plurality of spectrally varying threshold elevations across a plurality of channels, where the threshold elevations are derived from the power spectrum of the ambient noise, each of the threshold elevations being represented as X0, in dBHL at a given frequency F, and where each of the threshold elevations can be used to determine corresponding correction gains, the apparatus comprising: a microprocessor 801 configured to execute an application comprising a sound adjustment model 806, where the sound adjustment model 806 generates a spectrum of threshold elevations corresponding to a power spectrum of an ambient noise.
In some embodiments, the microprocessor 801 executes the personalized sound adjustment model 806 and generates a spectrum of threshold elevations corresponding a power spectrum of an ambient noise and where said threshold elevations are effective for determining appropriate gains to be applied to an audio signal of interest in order to correct the audio signal to account for the ambient noise.
In some embodiments, the apparatus comprises an audio device 02, configured to identify an audio signal x(t) where the audio signal is a broadband audio signal of interest, the audio device operatively connected to the microprocessor 01. The audio device can be any device capable of extracting or identifying an audio signal, for example, a microphone, receiver, pickup device, and the like. The audio device of the present invention is not limited to the aforementioned examples.
In some embodiments, the apparatus comprises a frequency analysis module 06 configured to extract a plurality of frequency components from the audio signal. See Glossary of Terms. Hardware and software tools for frequency analysis are readily available and commonly understood by those skilled in the art.
In some embodiments, the apparatus comprises a kHz frequency to critical band center frequency converter component 05. See Glossary of Terms.
In some embodiments, the apparatus comprises a psychoacoustic model 07 which, using the threshold elevations computed for each frequency component, determines corresponding gains for each frequency component of the audio signal.
In some embodiments, the apparatus comprises a gain block component 08 which applies gains to each frequency component of the audio signal.
In some embodiments, the apparatus comprises a summer block component. 09 which reconstitutes the audio signal from the frequency components, with the corrected gains.
In some embodiments, the audio device 02 extracts the audio signal x(t), which is then input to the microprocessor 01 through an analog to digital converter,
In some embodiments, the present invention comprises an apparatus, effective for selecting a plurality of threshold elevations for a given user, without requiring individual measurement of each threshold elevation, combined with a gain system and corresponding components which can apply a plurality of correction gains to an audio signal where the gains correspond to the threshold elevations and where the threshold elevations are computed from a single parameter set by a user-adjustable setting.
In some embodiments, the present invention can comprise an audio device 02, configured to identify an audio signal x(t), where the audio signal is a broadband audio signal of interest, the audio device operatively connected to a microprocessor 01.
In some embodiments, the present invention can comprise an input device 03, operatively connected to the microprocessor 01, controllable by a user, which allows the user to adjust a parameter.
In some embodiments, the present invention can comprise the microprocessor 01, configured to execute applications for a frequency analysis module, a kHZ frequency to critical band center frequency converter component, a personalized sound adjustment model, a psychoacoustic model and a gain block component.
In some embodiments, the frequency analysis module is 06 configured to extract a plurality of frequency components from the audio signal.
In some embodiments, the personalized sound adjustment model 06, parameterized by the user adjustable input 03 and the parameter, chosen by the user, establishes threshold elevations per frequency component where said threshold elevations are effective for users with impaired hearing or for users with normal hearing who wish to optimize their perceived hearing.
In some embodiments, the psychoacoustic model 07 uses the threshold elevations for each frequency component and computes corresponding gains per each frequency component of the audio signal.
In some embodiments, the gain block component 08 applies gains to each frequency component of the audio signal.
In some embodiments, the frequency synthesis module 09 reconstitutes the audio signal from the frequency components with the correction gains.
In some embodiments, the audio device 02 extracts the audio signal x(t), which is then input to the microprocessor 01 through an analog to digital converter, whereupon, the microprocessor 01 uses the frequency analysis module 04, to decompose the audio signal into a plurality of frequency components, whereupon, for each component, the equivalent critical band center frequency F is computed 05; whereupon, the personalized sound adjustment model 06 generates, for each frequency component, a threshold elevation value for X0, using the user adjustable input 03 as a parameter b of the model, whereupon the microprocessor 01 calculates, for each frequency component, a correction gain ÎX for the audio signal, using the psychoacoustic model 07, whereupon the microprocessor 01 applies the corrective gain to each frequency component of the audio signal 08, whereupon the microprocessor 01 reconstitutes the corrected audio signal by summing the frequency components, producing xout(t) 09.
In some embodiments, the Frequency Analysis Module is composed of a digital filter bank of IIR filters, which decompose the signal of interest into separate frequency sub-bands.
In some embodiments, the frequency sub-bands correspond to critical bands of hearing.
In some embodiments, the Frequency Analysis Module is an FFT co-processor, which provides Fourier transform components of the signal of interest.
In some embodiments, the input device 03 allows the user to select a parameter b, which parameterizes the personalized sound adjustment model 06, thereby allowing the user to modify the output of the personalized sound adjustment model 06.
In some embodiments, the personalized sound adjustment model 06 is a linear function of the critical band center frequency F, where the Linear Slope equation is
X0=[FâY)]Ăb
In some embodiments, Y is a value in frequency instead of Bark and can be a value selected from a range of 100-110. In some embodiments, Y is a value selected from a range of 100-400. In some embodiments, Y is a value selected from a range of 100-120. In some embodiments, Y is a value selected from a range of 110-130. In some embodiments, Y is a value selected from a range of 100-150. In some embodiments, Y is a value selected from a range of 100-300. In some embodiments, Y is a value selected from a range of 120-160. In some embodiments, Y is a value selected from a range of 130-170. In some embodiments, Y is a value selected from a range of 140-180. In some embodiments, Y is a value selected from a range of 150-200. In some embodiments, Y is a value selected from a range of 160-210. In some embodiments, Y is a value selected from a range of 170-220. In some embodiments, Y is a value selected from a range of 180-230. In some embodiments, Y is a value selected from a range of 190-240. In some embodiments, Y is a value selected from a range of 200-250. In some embodiments, Y is a value selected from a range of 150-250. In some embodiments, Y is a value selected from a range of 160-260. In some embodiments, Y is a value selected from a range of 170-270. In some embodiments, Y is a value selected from a range of 180-280. In some embodiments, Y is a value selected from a range of 190-290. In some embodiments, Y is a value selected from a range of 0-500. In some embodiments, Y is a value of 250+/â10%.
In some embodiments, the personalized sound adjustment model 06 is composed of a pattern library which is stored in long term memory, where the patterns are composed of points which measure threshold elevations, X0, with respect to frequency, of which 2/3 are within +/â5 dBHL of a mean squared error linear fit to the pattern with respect to critical band center frequency, where the parameter b is used to select a pattern or interpolate between points in the pattern, and the frequency is used to select an element of the pattern or interpolate between elements, and where the user adjustable input (b) 03 bears a monotonic relationship to the selected X0 for each critical band center frequency. In some embodiments, the patterns are composed of points which measure threshold elevations, X0, with respect to frequency, of which 0-20% are within +/â5 dBHL of a mean squared error linear fit to the pattern with respect to critical band center frequency. In some embodiments, the patterns are composed of points which measure threshold elevations, X0, with respect to frequency, of which 20-40% are within +/â5 dBHL of a mean squared error linear fit to the pattern with respect to critical band center frequency. In some embodiments, the patterns are composed of points which measure threshold elevations, X0, with respect to frequency, of which 40-60% are within +/â5 dBHL of a mean squared error linear fit to the pattern with respect to critical band center frequency. In some embodiments, the patterns are composed of points which measure threshold elevations, X0, with respect to frequency, of which 60-80% are within +/â5 dBHL of a mean squared error linear fit to the pattern with respect to critical band center frequency. In some embodiments, the patterns are composed of points which measure threshold elevations, X0, with respect to frequency, of which 80-100% are within +/â5 dBHL of a mean squared error linear fit to the pattern with respect to critical band center frequency. In some embodiments, the patterns are composed of points which measure threshold elevations, X0, with respect to frequency, of which 66% are within +/â5 dBHL of a mean squared error linear fit to the pattern with respect to critical band center frequency. In some embodiments, the patterns are composed of points which measure threshold elevations, X0, with respect to frequency, of which 66% are within +/â1 dBHL of a mean squared error linear fit to the pattern with respect to critical band center frequency. In some embodiments, the patterns are composed of points which measure threshold elevations, X0, with respect to frequency, of which 66% are within +/â2 dBHL of a mean squared error linear fit to the pattern with respect to critical band center frequency. In some embodiments, the patterns are composed of points which measure threshold elevations, X0, with respect to frequency, of which 66% are within 3-6 dBHL of a mean squared error linear fit to the pattern with respect to critical band center frequency. In some embodiments, the patterns are composed of points which measure threshold elevations, X0, with respect to frequency, of which 66% are within 6-10 dBHL of a mean squared error linear fit to the pattern with respect to critical band center frequency.
In some embodiments, the personalized sound adjustment model 06 is composed of a lookup table indexed by values of parameter b and frequency, where the threshold elevation, X0 is found by selecting the corresponding element of the lookup table or interpolating between points in the lookup table.
In some embodiments, the personalized sound adjustment model 06 computes the threshold elevation by estimating a power spectrum of an ambient noise environment in which the user is situated.
In some embodiments, the corrective gains are computed in parallel for multiple channels using multiple processing units.
In some embodiments, the apparatus includes a speaker 11 and an amplifier 10 for outputting the audio signal.
In some embodiments, the apparatus is a mobile phone, media player, or virtual audio port.
Any appropriate psychoacoustic model may be used in accordance with the present invention. In some embodiments, although not limited to the same, the psychoacoustic model 07 comprises a formula which computes the gain as a function of the signal and ambient noise levels which is needed to make the sound appear as loud as if the noise was not present, this formula comprising:
P SIG α + P NOISE α - P THRQ α α P SIG
In some embodiments, the present invention is a method of allowing a user to select a preferred sound adjustment spectrum, using a linear function of the critical band center frequency to select a threshold elevation, by frequency, where the linear function is
X0=[FâY]Ăb
In some embodiments, adjustments are made to the underlying linear function, where the adjustments are corrections to selected threshold elevations and are stored in long term memory, where these adjustments are derived from an audiogram for an individual.
In some embodiments, adjustments are made to the underlying linear function, where these adjustments are corrections to the selected threshold elevations that are stored in long term memory, where these adjustments are derived from collecting audiograms from a large population.
In some embodiments, adjustments are made to the underlying linear function, where the adjustments are derived from a higher order function that is combined with the threshold elevation, where the resulting curve results in a function wherein 2/3 of the personalized sound adjustment model remains within +/â5 dB of a linear mean squared error fit to the curve with respect to critical band center frequency.
In some embodiments, the present invention enables a user to select a personalized sound adjustment spectrum using a pattern library which is stored in long term memory, where the patterns are composed of points which comprise a spectrum of threshold elevations X0 with respect to frequency, of which 2/3 are within +/â5 dBHL of a mean squared error linear fit to the pattern with respect to critical band center frequency, where a parameter b, adjustable by the user, is used to select a pattern or interpolate between points in the pattern, and the frequency is used to select an element of the pattern or interpolate between elements, and where the parameter b bears a monotonic relationship to the selected threshold elevation X0 for each critical band center frequency.
In some embodiments, the present invention allows a user to select a spectrum of desired sound adjustments using a lookup table indexed by values of user selected parameter b, and frequency, where the threshold elevation X0 is found by selecting the corresponding element of the lookup table or interpolating between points in the lookup table, where b is adjustable by the user.
In some embodiments, the present invention comprises a method of selecting a spectrum of desired sound adjustments, by measuring the power spectrum of the ambient noise in the environment of the listener.
In some embodiments, the present invention comprises a computer-readable medium 70 storing a set of instructions executable by one or more microprocessors, where the computer-readable medium 70 automatically personalizes an audio signal by applying gains to the signal across a plurality of channels according to a user adjustable setting, where threshold elevations are computed from a single parameter set by a user-adjustable setting, each of the threshold elevations being represented as X0, in dBHL at a given frequency F.
In some embodiments, the present invention comprises code for extracting an input audio signal x(t) 02.
In some embodiments, the present invention comprises code for performing a frequency analysis on x(t) 03, producing a plurality of frequency components.
In some embodiments, the present invention comprises code for solving a plurality of gains corresponding to each of the frequency components, which comprises:
In some embodiments, the present invention comprises code for applying the gains ÎX to the various frequency components of the source signal 08.
In some embodiments, the present invention comprises code for reconstituting the personalized signal using the frequency synthesis module 09.
In some embodiments, as shown in FIG. 10, the present invention comprises a computer-readable medium 970 storing a set of instructions executable by one or more microprocessors, where the computer-readable medium 970 is effective for selecting a plurality of threshold elevations for a given user, without requiring individual measurement of each threshold elevation, where the computer-readable medium 970 calculates a plurality of spectrally varying threshold elevations across a plurality of channels, where the threshold elevations are computed from a single parameter set by a user-adjustable setting, each of the threshold elevations being represented as X0, in dBHL at a given frequency F, and where each of the threshold elevations can be used to determine corresponding gains
In some embodiments, the present invention comprises code for: an input module 903, which allows a user to set a single parameter for a personalized sound adjustment model 906; and a microprocessor 901, in electronic communication with the input module 903, the microprocessor 901 configured to execute an application comprising the personalized sound adjustment model 906, where the personalized sound adjustment model 906 generates a spectrum of threshold elevations corresponding to a plurality of frequency components, based on the parameter set to the input module 903.
In some embodiments, the user sets the single parameter via the input module 903, and thereafter, the input module 903 transmits information about the single parameter to the microprocessor 901, whereupon the microprocessor 901 applies the parameter to the personalized sound adjustment model 906, whereupon the personalized sound adjustment model 906, using the single parameter, generates a spectrum of threshold elevations, per frequency component, and where said threshold elevations are effective for identifying threshold levels of hearing for users with impaired hearing or for users with normal hearing who wish to optimize their perceived hearing.
In some embodiments, the Frequency Analysis Module is a digital filter banks composed of IIR filters, and the Frequency Synthesis Module is a summer block.
In some embodiments, the various channels correspond to critical bands of hearing.
In some embodiments, the Frequency Analysis Module performs a Fourier transform on the input signals producing X(f), where the power of each component is found by squaring the Fourier component, and the Frequency Synthesis Module performs an inverse Fourier transform, resulting in output signal xout(t).
In some embodiments, the computer-readable medium stores a set of instructions where the instruction include code to obtain a user input b which acts as a parameter of the personalized sound adjustment model.
In some embodiments, the computer-readable medium stores a set of instructions wherein the personalized sound adjustment model 06 uses a linear function of the critical band center frequency to select a spectrum of threshold elevations, where the Linear Slope equation is
X0=[FâY]Ăb
In some embodiments, the computer-readable medium stores a set of instructions wherein the personalized sound adjustment model 06 uses a pattern library which is stored in long term memory, where the patterns are composed of points which measure threshold elevations X0 with respect to frequency, of which 2/3 are within +/â5 dBHL of a mean squared error linear fit to the pattern with respect to critical band center frequency, where the parameter b is used to select a pattern or interpolate between points in the pattern, and the frequency is used to select an element of the pattern or interpolate between elements, and where the user adjustable input (b) 03 bears a monotonic relationship to the selected X0 for each critical band center frequency.
In some embodiments, the computer-readable medium stores a set of instructions, wherein the personalized sound adjustment model 06 uses a lookup table indexed by values of parameter b and frequency, where threshold elevation X0 is found by selecting the corresponding element of the lookup table or interpolating between points in the lookup table.
In some embodiments, the computer-readable medium stores a set of instructions wherein personalized sound adjustment model 06 computes the threshold elevation by estimating a power spectrum of an ambient noise environment in which the user is situated, where the ambient noise is measured by a second audio device in the environment of the user, and a second frequency analysis module is used to break down the ambient noise into spectral components.
In some embodiments, the computer-readable medium stores a set of instructions wherein the audio signal contains more than one frequency component and steps (d) through (e) are computed in parallel for all channels, before proceeding to step (f).
In some embodiments, the computer-readable medium stores a set of instructions wherein the audio signal contains more than one frequency component and steps (d) through (e) are repeated for each component in series, before proceeding to step (f).
In some embodiments, the computer-readable medium stores a set of instructions wherein parts of the process are performed for each frequency channel or component in series and other parts are performed in parallel.
Without wishing to limit the present invention to any theory or mechanism, it is believed that the present invention is advantageous because the present invention features a maximum output volume, for example the source signal will be amplified to only a certain degree. This can help protect against damage to the user's hearing.
Various modifications of the invention, in addition to those described herein, will be apparent to those skilled in the art from the foregoing description. Such modifications are also intended to fall within the scope of the invention. Each reference cited in the present application is incorporated herein by reference in its entirety.
Although there has been shown and described the preferred embodiment of the present invention, it will be readily apparent to those skilled in the art that modifications may be made thereto which do not exceed the scope of the invention.
1. An apparatus 700, effective for selecting a plurality of threshold elevations for a given user, without requiring individual measurement of each threshold elevation, where the apparatus calculates a plurality of spectrally varying threshold elevations across a plurality of channels and where the threshold elevations are computed from a single parameter set by a user-adjustable input device 703, each of the threshold elevations being represented as X0, in dBHL at a given frequency F, and where each of the threshold elevations can be used to determine corresponding correction gains, the apparatus comprising:
a. the input device 703, which allows a user to set the single parameter for a personalized sound adjustment model 706; and
b. a microprocessor 701, in electronic communication with the input device 703, the microprocessor 701 configured to execute an application comprising the personalized sound adjustment model 706, where the personalized sound adjustment model 706 generates a spectrum of threshold elevations corresponding to a plurality of frequency components, based on the parameter set to the input device 703;
wherein the user sets the single parameter via the input device 703, and thereafter, the input device 703 transmits information about the single parameter to the microprocessor 701, whereupon the microprocessor 701 applies the parameter to the personalized sound adjustment model 706, whereupon the personalized sound adjustment model 706, using the single parameter, generates a spectrum of threshold elevations, per frequency component, and where said threshold elevations are effective for identifying threshold levels of hearing for users with impaired hearing or for users with normal hearing who wish to optimize their perceived hearing.
2. An apparatus 800, effective for measuring the power spectrum of the ambient noise in the environment of the listener where the apparatus calculates a plurality of spectrally varying threshold elevations across a plurality of channels, where the threshold elevations are derived from the power spectrum of the ambient noise, each of the threshold elevations being represented as X0, in dBHL at a given frequency F, and where each of the threshold elevations can be used to determine corresponding correction gains, the apparatus comprising: a microprocessor 801 configured to execute an application comprising a sound adjustment model 806, where the sound adjustment model 806 generates a spectrum of threshold elevations corresponding to a power spectrum of an ambient noise;
wherein the microprocessor 801 executes the personalized sound adjustment model 806 and generates a spectrum of threshold elevations corresponding a power spectrum of an ambient noise and where said threshold elevations are effective for determining appropriate gains to be applied to an audio signal of interest in order to correct the audio signal to account for the ambient noise.
3. The apparatus of claim 1-2, wherein the apparatus further comprises:
a. an audio device 702, 802, configured to identify an audio signal x(t) where the audio signal is a broadband audio signal of interest, the audio device operatively connected to the microprocessor 702, 802;
b. a frequency analysis module 704, 804 configured to extract a plurality of frequency components from the audio signal;
c. a kHz frequency to critical band center frequency converter component 705, 805;
d. a psychoacoustic model 707, 807 which, using the threshold elevation for each frequency component, computes corresponding gains for each frequency component of the audio signal;
e. a gain block component 708, 808 which applies gains to each frequency component of the audio signal; and a
f. a frequency synthesis module 709, 809 which reconstitutes the audio signal from the frequency components, with the corrected gains;
wherein the audio device 702, 802 extracts the audio signal x(t), which is then input to the microprocessor 701, 801 through an analog to digital converter,
whereupon, the microprocessor 701, 801 uses the frequency analysis module 704, 804, to decompose the audio signal into a plurality of frequency components,
whereupon, for each component, the equivalent critical band center frequency F is computed 705, 805,
and whereupon, the personalized sound adjustment model 706, 806 generates, for each frequency component, a threshold elevation value for X0,
whereupon the microprocessor 701, 801 calculates, for each frequency component, a correction gain ÎX for the audio signal, using the psychoacoustic model 707, 807 and the corresponding threshold elevation,
whereupon the microprocessor 701, 801 applies the corrective gain to each frequency component of the audio signal,
whereupon the microprocessor 701, 801 reconstitutes the corrected audio signal by summing the frequency components, producing xout(t) 709, 809.
4. An apparatus 200, effective for selecting a plurality of threshold elevations for a given user, without requiring individual measurement of each threshold elevation, and thereafter applying a plurality of correction gains to an audio signal where the gains correspond to the threshold elevations, where the threshold elevations are computed from a single parameter set by a user-adjustable input device 203, each of the threshold elevations being represented as X0, in dBHL at a given frequency F, the apparatus comprising:
a. an audio device 202, configured to identify the audio signal x(t), where the audio signal is a broadband audio signal of interest, the audio device operatively connected to a microprocessor 201;
b. the input device 203, operatively connected to the microprocessor 201, controllable by a user, which allows the user to adjust a parameter;
c. the microprocessor 201, configured to execute applications for:
i. a frequency analysis module 204 configured to extract a plurality of frequency components from the audio signal;
ii. a kHz frequency to critical band center frequency converter component 205;
iii. a personalized sound adjustment model 206, parameterized by the user adjustable input 203 and the parameter, chosen by the user, which establishes threshold elevations per frequency component and where said threshold elevations are effective for users with impaired hearing or for users with normal hearing who wish to optimize their perceived hearing;
iv. a psychoacoustic model 207 which, using the threshold elevations for each frequency component, computes corresponding gains per each frequency component of the audio signal;
v. a gain block component 208 which applies gains to each frequency component of the audio signal;
vi. a frequency synthesis module 209 which reconstitutes the audio signal from the frequency components, with the correction gains;
whereupon, the audio device 202 extracts the audio signal x(t), which is then input to the microprocessor 201 through an analog to digital converter,
whereupon, the microprocessor 201 uses the frequency analysis module 204, to decompose the audio signal into a plurality of frequency components,
whereupon, for each component, the equivalent critical band center frequency F is computed 205,
whereupon, the personalized sound adjustment model 206 generates, for each frequency component, a threshold elevation value for X0, using the user adjustable input 203 as a parameter b of the model,
whereupon the microprocessor 201 calculates, for each frequency component, a correction gain ÎX for the audio signal, using the psychoacoustic model 207,
whereupon the microprocessor 201 applies the corrective gain to each frequency component of the audio signal 208,
whereupon the microprocessor 201 reconstitutes the corrected audio signal by summing the frequency components, producing xout(t) 209.
5. The apparatus of claims 3-4, wherein the Frequency Analysis Module comprises a digital filter bank of filters, which decompose the signal of interest into separate frequency sub-bands.
6. The apparatus of claim 5, where the frequency sub-bands correspond to critical bands of hearing.
7. The apparatus of claim 3-4, where the Frequency Analysis Module is an FFT co-processor, which provides Fourier transform components of the signal of interest.
8. The apparatus of claim 5, where the frequency components are grouped together to approximate sub-bands correspond to critical bands of hearing.
9. The apparatus of claim 1 and 4, where the input device 203 allows the user to select a parameter b, which parameterizes the personalized sound adjustment model 206, thereby allowing the user to modify the output of the personalized sound adjustment model 206.
10. The apparatus of claims 1, 2 and 4, wherein the personalized sound adjustment model is a linear function of the critical band center frequency F, where the Linear Slope equation is
X0=[FâY]Ăb
where X0 is the threshold elevation at the Frequency F, the parameter, b, adjustable by the user input parameter b, represents the slope of the line in dBHL/critical band, where Y is a value selected from a range of 2-3 Bark, and where F is the critical band center frequency.
11. The apparatus of claims 1, 2 and 4, wherein the personalized sound adjustment model is composed of a pattern library which is stored in long term memory, where the patterns are comprised of points which measure threshold elevations, X0, with respect to frequency, of which 2/3 are within +/â5 dBHL of a mean squared error linear fit to the pattern with respect to critical band center frequency, where the parameter b is used to select a pattern or interpolate between points in the pattern, and the frequency is used to select an element of the pattern or interpolate between elements, and where the user adjustable input (b) bears a monotonic relationship to the selected X0 for each critical band center frequency.
12. The apparatus of claims 1, 2 and 4, wherein the personalized sound adjustment model is composed of a lookup table indexed by values of the parameter b and frequency, where the threshold elevation, X0 is found by selecting the corresponding element of the lookup table or interpolating between points in the lookup table.
13. The apparatus of claim 2, wherein the personalized sound adjustment model computes the threshold elevation by estimating a power spectrum of an ambient noise environment in which the user is situated.
14. The apparatus of claim 3-4, wherein the corrective gains are computed in parallel for multiple channels using multiple processing units.
15. The apparatus of claim 3-4, wherein the apparatus includes a speaker and an amplifier for outputting the audio signal.
16. The apparatus, of claim 3 or 4 wherein the apparatus is a mobile phone, media player, headset, or virtual audio port.
17. The apparatus of claim 3 and 4, wherein the psychoacoustic model comprises a formula which computes the gain as a function of the signal and ambient noise levels which is needed to make the sound appear as loud as if the noise was not present, the formula comprising:
P SIG α + P NOISE α - P THRQ α α P SIG
where G is the gain ratio, PSIG is the signal intensity at a frequency in units of power, PNOISE is the signal intensity of the background noise, PTHRQ is the absolute threshold of hearing, and α=0.2 is a constant.
18-39. (canceled)