GritTec laboratory updates speaker identification technology
Each example of audio file is described by the acoustic voice model, error model (FAR, FRR, EER) and noise model, describing surrounding noises and channel distortion, existing in audio file. For the full description of each speaker card it is sufficiently 1 - 3 audio files with the speaker voice, recorded for different telephone lines and duration of each one not less than 60 sec.
In algorithmic part of speaker identification technology it was added tone and music detectors. Detector of tone signals is intended for detection of DTMF, CPTD, UMTD and other similar signals. Detector of music is intended for detection of musical accompanying, playing during waiting of connection between telephone speakers.
Technology of building statistical voice models and its re-estimation (with S-states) was updated in speaker card module. Comparative analysis has shown that using the updating voice models greatly enlarges account of "true" and "false" spots and increases probability of definition of Acceptance and Rejection.
Testing of updating speaker identifications technology was conducted on the real telephone records and on specialized sound base LDC96S61 of English telephone records given by LDC consortium (Linguistic Data Consortium).
Renovations and optimization of architecture of program identification modules for using in multi-threading mode were made in software code. At the renovation of program modules architecture of modules was structured on the functionality of each modules. Developing architecture of program modules supposes buildings a client-server applications and identification server by end developers. In identification server identification of unknown speaker is made in the threading mode - independently for each other.
At present automatic speaker identification technology is available for Intel platform as SDK library with examples of MS VC++ projects.
Glossary:
FRR - False Rejection Rate;
FAR - False Acceptance Rate;
EER - Error Equal Rate: EER = FRR = FAR;
DTMF - Dual Tone Modulated Frequency;
UMTD - Universal Multy Tone Detection;
CPTD - Call Progress Tone Detection.
About GritTec
GritTec Laboratory specializes on research and development of algorithms and technologies in the field of speech and audio processing. GritTec's research is focused on speech enhancement, speech concealment, voice biometric, speech recognition, speech synthesis and other speech and audio technologies.
tel: +7 495 796 24 18
email: info@grittec.com
url: http://www.grittec.com
Company:
GritTec laboratory
Related press releases
-
GritTec has presented a new software product of GritTec Speaker-ID: The mobile c...
[2010-06-22 10:44:35]
GritTec has presented release of new software product of voice identification - GritTec Speaker-ID: The mobile client (Version 1,00) on the base of the text independent speaker identification engine G... -
GritTec Laboratory and Delma Technologies sign reseller agreement
[2008-01-27 17:35:10]
GritTec Laboratory (GritTec Ltd.), a developer of speech technologies, and Delma Technologies (Delma Technologies UK Ltd) manufacturing company making voice and communications monitoring and recording... -
GritTec laboratory updates speaker identification technology
[2007-09-09 16:10:49]
Automatic text independent speaker identification technology is intended for automatic identification of a speech signal of unknown voice by paired comparing with 'speaker cards', existing in the data... -
GritTec has updated high level API of GritTec's Speaker-ID SDK up to version 2,9...
[2010-11-09 01:49:56]
In this version GritTec has optimised handler of system events with returning errors codes for the general functions and procedures of high level GritTec's Speaker-ID SDK. Efficiency of the system ev... -
GritTec laboratory enables Online Store of the software products
[2007-07-24 19:36:06]
GritTec laboratory enables Online Store of the software products. Such online store makes marketing strategy more effective and permits to promote GritTecs technology to market of hi-tech technologies... -
GritTec laboratory announced a new product of Pitch Shift technology for audio a...
[2009-02-04 12:30:05]
GritTec's Pitch Shift technology is used for high quality of pitch scale modification (changing the harmonics structure) of speech and audio signals. Pitch shift technology can be effectively used for... -
GritTec laboratory announced Sample Rate Converter for audio applications
[2008-06-25 13:49:35]
GritTec's sample rate converter (SRC) is technology used for changing sampling rate in speech and audio signals. Principle of functioning algorithm is based on methods of interpolation. GritTec's SRC ... -
GritTec laboratory announced new version of dual microphone array solution of sp...
[2007-09-30 17:26:23]
Dual microphone array solution of speech enhancement is used for the suppression of external hindrances and surrounding noises in chosen direction source of the speech. The sources of external hindran... -
GritTec laboratory updates Speech Enhancement technology on the base of dual mic...
[2008-04-04 06:56:24]
In the updated version of dual microphone array (DMA) module Multiple Canceller (MC) updating has been made. Also integration DMA with technology on the basis of Noise Cancellation (NC) has been made ... -
GritTec Ltd. announced new version of active noise cancellation technology of sp...
[2009-07-03 04:38:45]
GritTec's Noise cancellation (NC) technology is intended for reducing the external hindrances and background noises in speech signal. In new version of GritTec's Noise Cancellation (Version 2.00) deve...
English
German
French
Spanish
Russian
Romanian




Intel® Processor Identification Utility - Bootable 1.0