GritTec laboratory updates speaker identification technology
The automatic text-independent speaker identification technology identifies an unknown voice in a speech signal by pairwise comparison with the 'speaker cards' stored in the system database. The comparison counts 'true' and 'false' spots (spots of correspondence) and then determines the probability of Acceptance and Rejection. Besides information about the speaker (first name, last name, birthday, gender, and so on), each speaker card contains sample audio files of the speaker's voice.
Each sample audio file is described by an acoustic voice model, an error model (FAR, FRR, EER), and a noise model that describes the surrounding noise and channel distortion present in the file. One to three audio files of the speaker's voice, recorded over different telephone lines and each at least 60 seconds long, are sufficient to fully describe a speaker card.
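To illustrate the error model terms, the following minimal Python sketch computes FAR and FRR at a given threshold and approximates the EER. The scores, the rule "accept when the score is at least the threshold", and all function names are assumptions made for illustration; the release does not describe how GritTec builds its error model.

```python
def error_rates(genuine, impostor, threshold):
    """Return (FAR, FRR) at a given acceptance threshold.

    Assumption: a comparison is accepted when its score >= threshold.
    """
    far = sum(s >= threshold for s in impostor) / len(impostor)
    frr = sum(s < threshold for s in genuine) / len(genuine)
    return far, frr


def equal_error_rate(genuine, impostor):
    """Approximate the EER by trying every observed score as a threshold
    and keeping the one where FAR and FRR are closest to each other."""
    best = None
    for t in sorted(set(genuine) | set(impostor)):
        far, frr = error_rates(genuine, impostor, t)
        gap = abs(far - frr)
        if best is None or gap < best[0]:
            best = (gap, t, (far + frr) / 2)
    _, threshold, eer = best
    return threshold, eer


# Made-up similarity scores, for illustration only.
genuine_scores = [0.9, 0.8, 0.7, 0.4]    # same-speaker comparisons
impostor_scores = [0.1, 0.2, 0.3, 0.6]   # different-speaker comparisons
threshold, eer = equal_error_rate(genuine_scores, impostor_scores)
```

With these toy scores, FAR and FRR meet at a threshold of 0.6, where one impostor in four is accepted and one genuine speaker in four is rejected.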
Tone and music detectors were added to the algorithmic part of the speaker identification technology. The tone detector detects DTMF, CPTD, UMTD and other similar signals. The music detector detects music played while callers wait to be connected.
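The release does not say how the tone detector works. One common way to detect DTMF tones is the Goertzel algorithm, sketched below in Python as an assumed stand-in rather than GritTec's method. The 8 kHz sample rate, the 50 ms block length and the function names are illustrative assumptions; the frequency table is the standard DTMF keypad layout.

```python
import math

# Standard DTMF row and column frequencies (Hz) and keypad layout.
LOW_FREQS = (697, 770, 852, 941)
HIGH_FREQS = (1209, 1336, 1477, 1633)
KEYPAD = ("123A", "456B", "789C", "*0#D")


def goertzel_power(samples, sample_rate, freq):
    """Power of a single frequency in a block of samples (Goertzel recurrence)."""
    coeff = 2.0 * math.cos(2.0 * math.pi * freq / sample_rate)
    s_prev, s_prev2 = 0.0, 0.0
    for x in samples:
        s = x + coeff * s_prev - s_prev2
        s_prev2, s_prev = s_prev, s
    return s_prev * s_prev + s_prev2 * s_prev2 - coeff * s_prev * s_prev2


def detect_dtmf(samples, sample_rate=8000):
    """Return the key whose row and column tones are strongest in the block.

    No energy threshold is applied, so this sketch always returns some key,
    even for silence or speech.
    """
    low = max(LOW_FREQS, key=lambda f: goertzel_power(samples, sample_rate, f))
    high = max(HIGH_FREQS, key=lambda f: goertzel_power(samples, sample_rate, f))
    return KEYPAD[LOW_FREQS.index(low)][HIGH_FREQS.index(high)]


def synth_key(key, sample_rate=8000, duration=0.05):
    """Synthesize a clean two-tone test signal for one keypad key."""
    row = next(r for r, keys in enumerate(KEYPAD) if key in keys)
    col = KEYPAD[row].index(key)
    lo, hi = LOW_FREQS[row], HIGH_FREQS[col]
    n = int(sample_rate * duration)
    return [math.sin(2 * math.pi * lo * i / sample_rate)
            + math.sin(2 * math.pi * hi * i / sample_rate)
            for i in range(n)]


detected = detect_dtmf(synth_key("5"))
```

A practical detector would also compare the tone power against the total block energy before reporting a key, so that speech and music are not mistaken for tones.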
The technology for building statistical voice models and re-estimating them (with S states) was updated in the speaker card module. Comparative analysis has shown that the updated voice models greatly increase the count of "true" and "false" spots and raise the probability of determining Acceptance and Rejection.
The updated speaker identification technology was tested on real telephone recordings and on LDC96S61, a specialized corpus of English telephone recordings provided by the LDC (Linguistic Data Consortium).
The architecture of the identification modules was reworked and optimized in the software code for use in multi-threaded mode. During the rework, the modules were restructured according to the function of each module. The resulting architecture lets end developers build client-server applications and an identification server. In the identification server, each unknown speaker is identified in a separate thread, independently of the others.
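The threading model described here can be sketched as follows. This is a minimal Python illustration, not the SDK's API: `toy_score`, `identify`, `identify_all`, the feature vectors and the card names are all hypothetical, and the distance-based score only stands in for the real voice-model comparison.

```python
from concurrent.futures import ThreadPoolExecutor


def toy_score(features, card_features):
    """Placeholder similarity: negative squared distance between two
    feature vectors. It stands in for the real voice-model comparison."""
    return -sum((a - b) ** 2 for a, b in zip(features, card_features))


def identify(features, speaker_cards):
    """Compare one unknown recording with every speaker card and
    return the name of the best-scoring card."""
    return max(speaker_cards,
               key=lambda name: toy_score(features, speaker_cards[name]))


def identify_all(recordings, speaker_cards, workers=4):
    """Submit each recording as its own task to a thread pool. The tasks
    share only the read-only card database, so they run independently."""
    with ThreadPoolExecutor(max_workers=workers) as pool:
        futures = {rec_id: pool.submit(identify, feats, speaker_cards)
                   for rec_id, feats in recordings.items()}
        return {rec_id: fut.result() for rec_id, fut in futures.items()}


# Made-up data, for illustration only.
speaker_cards = {"alice": [1.0, 0.0], "bob": [0.0, 1.0]}
recordings = {"call-1": [0.9, 0.1], "call-2": [0.2, 0.8]}
results = identify_all(recordings, speaker_cards)
```

The sketch shows only the structure of independent per-speaker tasks. In standard CPython the global interpreter lock limits the speed-up for CPU-bound scoring, which is one reason a native SDK library is a natural fit for this kind of server.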
At present, the automatic speaker identification technology is available for the Intel platform as an SDK library with example MS VC++ projects.
Glossary:
FRR - False Rejection Rate;
FAR - False Acceptance Rate;
EER - Equal Error Rate, the error rate at the operating point where FRR = FAR;
DTMF - Dual-Tone Multi-Frequency;
UMTD - Universal Multi-Tone Detection;
CPTD - Call Progress Tone Detection.
About GritTec
GritTec Laboratory specializes in the research and development of algorithms and technologies in the field of speech and audio processing. GritTec's research is focused on speech enhancement, speech concealment, voice biometrics, speech recognition, speech synthesis and other speech and audio technologies.
tel: +7 495 796 24 18
email: info@grittec.com
url: http://www.grittec.com
Company: GritTec laboratory