Speaker identification Unlike speaker verification, where a claim of an identity is accepted or rejected based on the speaker's voice, our research work on identification lets the computer identify who is talking, from a large number of enrolled speakers, based on a small sample of his or her voice. Again applications are limitless. For example, on a personal computer, speaker identification can help you share your computer with other people and still always have the screen and the icons configured the way you prefer. Speaker classification Speaker classification involves the ability to handle a population of an unknown number of unknown speakers to
- detect whenever a speaker changes;
- regroup speech segments spoken by the same person;
- cluster speakers who speak similarly (e.g. same accent).
Research direction Our focus is on text-independent technology, which is designed to work regardless of what the speaker is saying, or even regardless of the language. One benefit of this approach is that the user doesn't have to say anything special in order to be registered in the system's voice database - a few seconds of speech will do. A second benefit is that the user doesn't have to say anything special in order to be recognized. The applications we have talked about in this section are just now becoming a reality in our laboratories. We are currently researching ways to extend our capabilities to very large populations. What is possible today for a few thousand speakers, will one day be possible on a scale of millions of speakers. |