June 28th, 2004 BioSecure, SecurePhone 1 Automatic Speaker Verification : Technologies, Evaluations...

Automatic Speaker Verification: Technologies, Evaluations and Possible Future

Gérard CHOLLET, CNRS-LTCI, GET-ENST

chollet@tsi.enst.fr

Biometrics in Current Security Environments

Outline

• State of affairs (tasks, security, forensics,…)
• Speaker characteristics in the speech signal
• Automatic Speaker Verification:
  • Decision theory
  • Text dependent / Text independent
• Imposture (occasional, dedicated)
• Voice transformations
• Audio-visual speaker verification
• Evaluations (algorithms, field tests, ergonomics,…)
• Conclusions, Perspectives

Why should a computer recognize who is speaking?

• Protection of individual property (habitation, bank account, personal data, messages, mobile phone, PDA,...)
• Limited access (secured areas, databases)
• Personalization (only respond to its master’s voice)
• Locate a particular person in an audio-visual document (information retrieval)
• Who is speaking in a meeting?
• Is a suspect the criminal? (forensic applications)

Tasks in Automatic Speaker Recognition

• Speaker verification (Voice Biometric)
  • Are you really who you claim to be?
• Identification (Speaker ID):
  • Is this speech segment coming from a known speaker?
  • How large is the set of speakers (population of the world)?
• Speaker detection, segmentation, indexing, retrieval, tracking:
  • Looking for recordings of a particular speaker
• Combining Speech and Speaker Recognition
  • Adaptation to a new speaker, speaker typology
  • Personalization in dialogue systems

Applications

• Access Control
  • Physical facilities, Computer networks, Websites
• Transaction Authentication
  • Telephone banking, e-Commerce
• Speech data Management
  • Voice messaging, Search engines
• Law Enforcement
  • Forensics, Home incarceration

Voice Biometric

Advantages
• Often the only modality over the telephone
• Low cost (microphone, A/D), ubiquity
• Possible integration on a smart (SIM) card
• Natural bimodal fusion: speaking face

Disadvantages
• Lack of discretion
• Possibility of imitation and electronic imposture
• Lack of robustness to noise, distortion,…
• Temporal drift

Speaker Identity in Speech

Differences in
• Vocal tract shapes and muscular control
• Fundamental frequency (typical values)
  • 100 Hz (Male), 200 Hz (Female), 300 Hz (Child)
• Glottal waveform
• Phonotactics
• Lexical usage

The differences between the voices of twins are a limit case.

Voices can also be imitated or disguised.

[Figure: spectral envelope of /i:/ for Speaker A and Speaker B]

Speaker Identity

• Segmental factors (~30 ms)
  • Glottal excitation: fundamental frequency, amplitude, voice quality (e.g., breathiness)
  • Vocal tract: characterized by its transfer function and represented by MFCCs (Mel-Frequency Cepstral Coefficients)
• Suprasegmental factors
  • Speaking speed (timing and rhythm of speech units)
  • Intonation patterns
  • Dialect, accent, pronunciation habits

Acoustic features

Short-term spectral analysis

Intra- and inter-speaker variability

Speaker Verification

Typology of approaches (EAGLES Handbook)
• Text dependent
  • Public password
  • Private password
  • Customized password
  • Text prompted
• Text independent
• Incremental enrolment
• Evaluation

History of Speaker Recognition

Current approaches

HMM structure depends on the application

Gaussian Mixture Model

Parametric representation of the probability distribution of observations

Gaussian Mixture Models

8 Gaussians per mixture
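
A Gaussian mixture represents the density of an observation x as a weighted sum of Gaussians, p(x) = sum_k w_k N(x; mu_k, Sigma_k). The sketch below evaluates that density in the log domain. It is a minimal illustration, not the system described in these slides: diagonal covariances are an assumption, and the function and parameter names are hypothetical.

```python
import math


def gmm_log_likelihood(x, weights, means, variances):
    """Log-density of one feature vector x under a diagonal-covariance GMM.

    Assumption: each component has a diagonal covariance, given as a list
    of per-dimension variances. All names here are illustrative.
    """
    log_terms = []
    for w, mu, var in zip(weights, means, variances):
        # A diagonal Gaussian factorizes into independent 1-D Gaussians,
        # so its log-density is a sum over dimensions.
        log_gauss = sum(
            -0.5 * (math.log(2.0 * math.pi * v) + (xi - m) ** 2 / v)
            for xi, m, v in zip(x, mu, var)
        )
        log_terms.append(math.log(w) + log_gauss)
    # Log-sum-exp over components, for numerical stability.
    peak = max(log_terms)
    return peak + math.log(sum(math.exp(t - peak) for t in log_terms))


def average_log_likelihood(frames, weights, means, variances):
    """Mean per-frame log-likelihood of a sequence of feature vectors."""
    total = sum(gmm_log_likelihood(f, weights, means, variances) for f in frames)
    return total / len(frames)
```

With 8 Gaussians per mixture, `weights`, `means` and `variances` would each hold 8 entries; a score for an utterance is then the average over its short-term feature frames.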

Two types of errors:
• False rejection (a client is rejected)
• False acceptance (an impostor is accepted)

Decision theory: given an observation O and a claimed identity
• H0 hypothesis: it comes from an impostor
• H1 hypothesis: it comes from our client

H1 is chosen if and only if P(H1|O) > P(H0|O),

which can be rewritten (using Bayes' law) as

P(O|H1) / P(O|H0) > P(H0) / P(H1)
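
The decision rule above can be sketched in the log domain, where the comparison of posteriors becomes a log-likelihood ratio tested against a threshold fixed by the priors. This is a minimal sketch: the function name, the argument names and the default prior of 0.5 are assumptions made for illustration.

```python
import math


def accept_claim(loglik_client, loglik_impostor, prior_client=0.5):
    """Return True when H1 (client) is a posteriori more probable than H0.

    loglik_client   : log P(O|H1), e.g. from a client model (assumed given)
    loglik_impostor : log P(O|H0), e.g. from a world model (assumed given)
    prior_client    : P(H1); P(H0) is taken as 1 - P(H1)
    """
    # P(H1|O) > P(H0|O)  <=>  log P(O|H1) - log P(O|H0) > log(P(H0) / P(H1))
    log_threshold = math.log((1.0 - prior_client) / prior_client)
    return (loglik_client - loglik_impostor) > log_threshold
```

A small client prior raises the threshold, so the same observation can be accepted under equal priors and rejected when impostor attempts are assumed to be far more frequent.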

Decision theory for identity verification

Signal detection theory

Decision

Distribution of scores

Detection Error Tradeoff (DET) Curve
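
A DET curve is obtained by sweeping the decision threshold over the client and impostor score distributions and recording the two error rates at each setting (the curve is conventionally plotted on normal-deviate axes, which this sketch does not do). The code below is a minimal sketch under assumed conventions: higher scores favour the client, a claim is accepted when the score is at or above the threshold, and all names are hypothetical.

```python
def error_rates(client_scores, impostor_scores, threshold):
    """False-rejection and false-acceptance rates at one threshold."""
    # A client trial scoring below the threshold is a false rejection.
    fr = sum(s < threshold for s in client_scores) / len(client_scores)
    # An impostor trial scoring at or above the threshold is a false acceptance.
    fa = sum(s >= threshold for s in impostor_scores) / len(impostor_scores)
    return fr, fa


def det_points(client_scores, impostor_scores):
    """(threshold, FR, FA) triples, one per distinct observed score."""
    thresholds = sorted(set(client_scores) | set(impostor_scores))
    return [(t, *error_rates(client_scores, impostor_scores, t)) for t in thresholds]


def approximate_eer(client_scores, impostor_scores):
    """Operating point where FR and FA are closest; returns (rate, threshold)."""
    t, fr, fa = min(
        det_points(client_scores, impostor_scores),
        key=lambda p: abs(p[1] - p[2]),
    )
    return (fr + fa) / 2.0, t
```

The equal error rate returned here is only an approximation on small score sets, since FR and FA change in steps and may never be exactly equal.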

Evaluation

• Decision cost (FA, FR, priors, costs,…)
• Receiver Operating Characteristic Curve
• Reference systems (open software)
• Evaluations (algorithms, field trials, ergonomics,…)
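
A decision cost combines the two error rates with the prior of a client trial and the cost attached to each error type. The sketch below uses the usual expected-cost form; the function name, the parameter names and the numeric values in the usage note are illustrative assumptions, not figures from these slides.

```python
def detection_cost(p_fr, p_fa, prior_client, cost_fr=1.0, cost_fa=1.0):
    """Expected cost of a verification system at one operating point.

    p_fr, p_fa   : false-rejection and false-acceptance rates
    prior_client : assumed prior probability that a trial is a true client
    cost_fr/fa   : assumed application-dependent cost of each error type
    """
    cost_of_rejections = cost_fr * p_fr * prior_client
    cost_of_acceptances = cost_fa * p_fa * (1.0 - prior_client)
    return cost_of_rejections + cost_of_acceptances
```

For example, `detection_cost(0.1, 0.02, prior_client=0.01, cost_fr=10.0, cost_fa=1.0)` weighs a 10% false-rejection rate against a 2% false-acceptance rate when clients are rare and rejections are ten times as costly.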

National Institute of Standards & Technology (NIST)

Speaker Verification Evaluations

• Annual evaluation since 1995
• Common paradigm for comparing technologies

NIST evaluations: Results

ENST 2003

Combining Speech Recognition and Speaker Verification

• Speaker independent phone HMMs
• Selection of segments or segment classes which are speaker specific
• Preliminary evaluations are performed on the NIST extended data set (one hour of training data per speaker)

ALISP data-driven speech segmentation

Searching in client and world speech dictionaries for speaker verification purposes

Fusion

Fusion results

Speaking Faces: Motivations

• A person speaking in front of a camera offers 2 modalities for identity verification (speech and face).
• The sequence of face images and the synchronisation of speech and lip movements could be exploited.
• Imposture is much more difficult than with single modalities.
• Many PCs, PDAs, mobile phones are equipped with a camera. Audio-Visual Identity Verification will offer non-intrusive security for e-commerce, e-banking,…

Talking Face Recognition (hybrid verification)

Lip features

Tracking lip movements

A talking face model

Using Hidden Markov Models (HMMs)

Acoustic parameters

Visual parameters

Morphing, avatars

Conclusions, Perspectives

Deliberate imposture is a challenge for speech only systems

Verification of identity based on features extracted from talking faces should be developed

Common databases and evaluation protocols are necessary

Free access to reference systems will facilitate future developments
