ENGLISH |
Liste des publications de "Dumouchel, Pierre"Nombre de documents archivés : 125. Article publié dans une revue, révisé par les pairs
Alam, Md Jahangir, Gupta, Vishwa, Kenny, Patrick et Dumouchel, Pierre.
2015.
« Speech recognition in reverberant and noisy environments employing multiple feature extractors and i-vector speaker adaptation ».
EURASIP Journal on Advances in Signal Processing, vol. 2015, nº 1.
Attabi, Yazid et Dumouchel, Pierre.
2013.
« Anchor models for emotion recognition from speech ».
IEEE Transactions on Affective Computing, vol. 4, nº 3.
pp. 280-290.
Cardinal, Patrick, Dumouchel, Pierre et Boulianne, Gilles.
2013.
« Large vocabulary speech recognition on parallel architectures ».
IEEE Transactions on Audio, Speech, and Language Processing, vol. 21, nº 11.
pp. 2290-2300.
Dehak, Najim, Dumouchel, Pierre et Kenny, Patrick.
2007.
« Modeling prosodic features with joint factor analysis for speaker verification ».
IEEE Transactions on Audio, Speech, and Language Processing, vol. 15, nº 7.
pp. 2095-2103.
Dehak, Najim, Kenny, Patrick, Dehak, Reda, Dumouchel, Pierre et Ouellet, Pierre.
2011.
« Front-end factor analysis for speaker verification ».
IEEE Transactions on Audio, Speech, and Language Processing, vol. 19, nº 4.
pp. 788-798.
Gupta, Vishwa, Kenny, Patrick, Ouellet, Pierre, Boulianne, Gilles et Dumouchel, Pierre.
2007.
« Combining gaussianized/non-gaussianized features to improve speaker diarization of telephone conversations ».
IEEE Signal Processing Letters, vol. 14, nº 12.
pp. 1040-1043.
Hill, Edward, Han, David, Dumouchel, Pierre, Dehak, Najim, Quatieri, Thomas, Moehs, Charles, Oscar-Berman, Marlene, Giordano, John, Simpatico, Thomas, Barh, Debmalya et Blum, Kenneth.
2013.
« Long term SuboxoneTM emotional reactivity as measured by automatic detection in speech ».
[Article scientifique]. PLoS ONE, vol. 8, nº 7.
Kenny, P., Ouellet, P., Dehak, N., Gupta, V. et Dumouchel, Pierre.
2008.
« A study of interspeaker variability in speaker verification ».
IEEE Transactions on Audio, Speech and Language Processing, vol. 16, nº 5.
pp. 980-988.
Kenny, Patrick, Boulianne, Gilles et Dumouchel, Pierre.
2005.
« Eigenvoice modeling with sparse training data ».
IEEE Transactions on Speech and Audio Processing, vol. 13, nº 3.
pp. 345-354.
Kenny, Patrick, Boulianne, Gilles, Ouellet, Pierre et Dumouchel, Pierre.
2007.
« Joint factor analysis versus eigenchannels in speaker recognition ».
IEEE Transactions on Audio, Speech, and Language Processing, vol. 15, nº 4.
pp. 1435-1447.
Kenny, Patrick, Boulianne, Gilles, Ouellet, Pierre et Dumouchel, Pierre.
2004.
« Speaker adaptation using an eigenphone basis ».
IEEE Transactions on Speech and Audio Processing, vol. 12, nº 6.
579-589 .
Kenny, Patrick, Boulianne, Gilles, Ouellet, Pierre et Dumouchel, Pierre.
2007.
« Speaker and session variability in GMM-based speaker verification ».
IEEE Transactions on Audio, Speech, and Language Processing, vol. 15, nº 4.
pp. 1448-1460.
Ouali, Chahid, Dumouchel, Pierre et Gupta, Vishwa.
2016.
« A spectrogram-based audio fingerprinting system for content-based copy detection ».
Multimedia Tools and Applications, vol. 75, nº 15.
pp. 9145-9165.
Ouali, Chahid, Dumouchel, Pierre et Gupta, Vishwa.
2016.
« Fast audio fingerprinting system using GPU and a clustering-based technique ».
IEEE/ACM Transactions on Audio, Speech and Language Processing, vol. 24, nº 6.
pp. 1106-1118.
Senoussaoui, Mohammed, Kenny, Patrick, Stafylakis, Themos et Dumouchel, Pierre.
2014.
« A Study of the Cosine Distance-Based Mean Shift for Telephone Speech Diarization ».
IEEE Transactions on Audio, Speech, and Language Processing, vol. 22, nº 1.
pp. 217-227. Compte rendu de conférence
Alam, M. J., Kenny, P., Dumouchel, P. et O'Shaughnessy, D..
2014.
« Robust feature extractors for continuous speech recognition ».
In 22nd European Signal Processing Conference, EUSIPCO 2014 (Lisbon, Portugal, Sept. 01-05, 2014)
pp. 944-948.
European Signal Processing Conference, EUSIPCO.
Alam, M. J., Kenny, P., Dumouchel, P. et O'Shaughnessy, D..
2014.
« Robust speech recognition using warped DFT-based cepstral features in clean and multistyle training ».
In 22nd European Signal Processing Conference, EUSIPCO 2014 (Lisbon, Portugal, Sept. 01-05, 2014)
pp. 1791-1795.
European Signal Processing Conference, EUSIPCO.
Alam, M. J., Kenny, P., Dumouchel, Pierre et O'Shaughnessy, D..
2014.
« Noise spectrum estimation using Gaussian mixture model-based speech presence probability for robust speech recognition ».
In Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH (Singapor, Singapor, Sept. 14-18, 2014)
pp. 2759-2763.
International Speech and Communication Association.
Alam, Md Jahangir, Attabi, Yazid, Dumouchel, Pierre, Kenny, Patrick et O'Shaughnessy, D..
2013.
« Amplitude modulation features for emotion recognition from speech ».
In Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH (Lyon, France, Aug. 25-29, 2013)
pp. 2420-2424.
International Speech and Communication Association.
Alam, Md Jahangir, Attabi, Yazid, Kenny, Patrick, Dumouchel, Pierre et O'Shaughnessy, Douglas.
2014.
« Automatic emotion recognition from cochlear implant-like spectrally reduced speech ».
In Ambient Assisted Living and Daily Activities ; 6th International Work-Conference, IWAAL 2014, Belfast, UK, December 2-5, 2014. Proceedings (Belfast, UK, Dec. 2-5, 2014)
Coll. « Lecture Notes in Computer Science », vol. 8868.
pp. 332-340.
Springer Verlag.
Alam, Md Jahangir, Kenny, Patrick, Ouellet, Pierre, Stafylakis, Themos et Dumouchel, Pierre.
2014.
« Supervised/unsupervised voice activity detectors for textdependent speaker recognition on the RSR2015 corpus ».
In Proceeding of Odyssey 2014: Speaker and Language Recognition Workshop (Joensuu, Finland, June 16-19, 2014)
pp. 123-130.
International Speech Communication Association.
Attabi, Yazid, Alam, Md Jahangir, Dumouchel, Pierre, Kenny, Patrick et O'Shaughnessy, Douglas.
2013.
« Multiple windowed spectral features for emotion recognition ».
In 2013 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (Vancouver, BC, Canada, May 26-31, 2013)
pp. 7527-7531.
Institute of Electrical and Electronics Engineers.
Attabi, Yazid et Dumouchel, Pierre.
2012.
« Anchor models and WCCN normalization for speaker trait classification ».
In 13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012 (Portland, OR, USA, Sept. 9-12, 2012)
pp. 522-525.
International Speech Communications Association.
Attabi, Yazid et Dumouchel, Pierre.
2011.
« Automatic emotion recognition from speech a PHD research proposal ».
In 4th International Conference on Affective Computing and Intelligent Interaction (ACII) (Memphis, TN, USA, Oct. 9-12, 2011)
Coll. « Lecture Notes in Computer Science », vol. 6975.
pp. 191-199.
Berlin, Germany : Springer Verlag.
Attabi, Yazid et Dumouchel, Pierre.
2012.
« Emotion recognition from children's speech using anchor models ».
In 3rd International Workshop on Child, Computer and Interaction (WOCCI) (Portland, OR, USA, Sept. 14, 2012)
pp. 82-86.
International Society for Computers and Their Applications (ISCA).
Attabi, Yazid et Dumouchel, Pierre.
2012.
« Emotion recognition from speech: WOC-NN and class-interaction ».
In 2012 11th International Conference on Information Science, Signal Processing and their Applications (ISSPA) (Montreal, QC, Canada, July 2-5, 2012)
pp. 126-131.
Washington, DC : IEEE Computer Society.
Attabi, Yazid et Dumouchel, Pierre.
2011.
« Weighted ordered classes – nearest neighbors: a new framework for automatic emotion recognition from Speech ».
In 12th International Conference of Interspeech (Interspeech) (Florence, Italy, Aug. 28-31, 2011)
pp. 3125-3128.
France : International Speech and Communication Association.
Boucher, Patrice, Dufour, Pierre, Plusquellec, Pierrich, Dehak, Najim, Dumouchel, Pierre et Cardinal, Patrick.
2016.
« PHYSIOSTRESS: A multimodal corpus of data on acute stress and physiological activation ».
In Workshop on Multimodal Corpora : Computer vision and language processing (MMC 2016) (Portoroz, Slovenia, May 23-28, 2016)
pp. 45-48.
European Language Resources Association (ELRA).
Boufaden, Narjès et Dumouchel, Pierre.
2008.
« Leveraging emotion detection using emotions form yes-no answers ».
In 9th International Conference of Speech Communication Association 2008 (INTERSPEECH) (Brisbane, Australia, Sept. 22-26, 2008)
pp. 241-244.
International Speech Communications Association.
Boufaden, Narjès, Hoang, Truong Le et Dumouchel, Pierre.
2007.
« Détection et prédiction de la satisfaction des usagers dans les dialogues personne-machine ».
In Actes sur le Traitement Automatique des Langues Naturelles (TALN) (Toulouse, France, 5-8 juin 2007)
Montréal, Qué., Canada : CRIM.
Boulianne, G., Brousseau, J., Ouellet, P. et Dumouchel, P..
2000.
« Parlable: The CRIM transducer-based ASR system ».
In RFIA 2000 : 12e congrès francophone AFRIF-AFIA Reconnaissance des Formes et Intelligence Artificielle, Paris 1-3 fév. 2000, Salons de l'Aveyron (Paris, France, Feb. 1-3, 2000)
Paris, France : TELECOM Paris.
Boulianne, G., Brousseau, J., Ouellet, P. et Dumouchel, Pierre.
2000.
« French large vocabulary recognition with cross-word phonology transducers ».
In Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2000) (Istanbul, Turkey, June 5-9, 2000)
pp. 1675-1678.
Piscataway, N. J., USA : IEEE.
Boulianne, Gilles, Beaumont, Jean-François, Cardinal, Patrick, Comeau, Michel, Ouellet, Pierre et Dumouchel, Pierre.
2003.
« Automatic segmentation of film dialogues into phonemes and graphemes ».
In Proceedings of the 8th European Conference on Speech Communication and Technology (Eurospeech 2003) (Geneva, Switzerland, Sept. 1-4, 2003)
pp. 1241-1244.
International Speech and Communication Association.
Boulianne, Gilles, Brousseau, Julie, Talbot, Nathalie et Dumouchel, Pierre.
1999.
« Experiments in constrained maximum likelihood extraction of temporal features for speech recognition ».
In Eurospeech 99: 6th European Conference on Speech Communication and Technology: Budapest, Hungary, September 5-9, 1999 (Budapest, Hungary, Sept. 5-9, 1999)
pp. 1083-1086.
Bonn, Germany : ESCA.
Boulianne, Gilles et Dumouchel, Pierre.
2001.
« Out-of-vocabulary word modeling using multiple lexical fillers ».
In IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU) (Madonna di Campiglio, Italy, Dec. 9-13, 2001)
pp. 226-229.
Institute of Electrical and Electronics Engineers Inc..
Boulianne, Gilles et Dumouchel, Pierre.
2013.
« Unsupervised topic model for broadcast program segmentation ».
In 2013 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (Vancouver, BC, Canada, May 26-31, 2013)
pp. 8455-8459.
Institute of Electrical and Electronics Engineers.
Boulianne, Gilles, Ouellet, Pierre et Dumouchel, Pierre.
2001.
« A transducer approach to word graph generation ».
In EUROSPEECH 2001 - SCANDINAVIA - 7th European Conference on Speech Communication and Technology (Aalborg, Denmark, Sept. 3-7, 2001)
pp. 1595-1598.
International Speech Communication Association.
Boutin, Simon, Tremblay, Réal, Cardinal, Patrick, Peters, Doug et Dumouchel, Pierre.
2015.
« Audio quotation marks for natural language understanding ».
In INTERSPEECH 2015. 16th Annual Conference of the International Speech Communication Association (Dresden, Germany, Sept. 6-10, 2015)
pp. 1349-1352.
International Speech Communication Association.
Brodeur, David, Grondin, François, Attabi, Yazid, Dumouchel, Pierre et Michaud, François.
2016.
« Integration framework for speech processing with live visualization interfaces ».
In 25th IEEE International Symposium on Robot and Human Interactive Communication, RO-MAN 2016 (New York, NY, USA, Aug. 26-31, 2016)
pp. 144-150.
Institute of Electrical and Electronics Engineers Inc..
Brousseau, J., Dumochel, P., Talbot, N. et Tadj, C..
2000.
« Phone and parameter based approaches to topic spotting ».
In RFIA 2000 : 12e congrès francophone AFRIF-AFIA Reconnaissance des Formes et Intelligence Artificielle, Paris 1-3 fév. 2000, Salons de l'Aveyron (Paris, France, Feb. 1-3, 2000)
Paris, France : TELECOM Paris.
Cardinal, Patrick, Boulianne, Gilles et Dumouchel, Pierre.
2012.
« The A* speech recognition system on parallel architectures ».
In 2012 11th International Conference on Information Science, Signal Processing and their Applications (ISSPA) (Montreal, QC, Canada, July 2-5, 2012)
pp. 108-113.
Washington, DC : IEEE Computer Society.
Cardinal, Patrick, Boulianne, Gilles et Dumouchel, Pierre.
2012.
« Using A* for the parallelization of speech recognition systems ».
In 2012 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) (Kyoto, Japan, Mar. 25-30, 2012)
pp. 4433-4436.
Piscataway, NJ : Institute of Electrical and Electronics Engineers Inc..
Cardinal, Patrick, Dumouchel, Pierre et Boulianne, Gilles.
2009.
« Using parallel architectures in speech recognition ».
In INTERSPEECH 2009 - 10th Annual Conference of the International Speech Communication Association (Brighton, UK, Sept. 6-10, 2009)
pp. 3039-3042.
International Speech and Communication Association.
Cardinal, Patrick, Dumouchel, Pierre, Boulianne, Gilles et Comeau, Michel.
2008.
« GPU accelerated acoustics likelihood computations ».
In 9th Annual Conference of the International Speech Communication Association (INTERSPEECH) (Brisbane, Australia, Sept. 22-26, 2008)
pp. 964-967.
Bonn, Germany : International Speech Communication Association.
Dehak, Najim, Dehak, Reda, Kenny, Patrick, Brummer, Niko, Ouellet, Pierre et Dumouchel, Pierre.
2009.
« Support vector machines versus fast scoring in the low-dimensional total variability space for speaker verification ».
In 10th Annual Conference of the International Speech Communication Association (INTERSPEECH) (Brighton, England, Sept. 6-10, 2009)
pp. 1527-1530.
ISCA-INST Speech Communication Assoc..
Dehak, Najim, Dehak, Réda, Kenny, Patrick et Dumouchel, Pierre.
2008.
« Comparison between factor analysis and GMM support vector machines for speaker verification ».
In Odyssey : the Speaker and Language Recognition Workshop (Stellenbosch, South Africa, Jan. 21-24, 2008)
Dehak, Najim, Kenny, Patrick, Dehak, Reda, Glembek, Ondrej, Dumouchel, Pierre, Burget, Lukas, Hubeika, Valiantsina et Castaldo, Fabio.
2009.
« Support vector machines and joint factor analysis for speaker verification ».
In IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (Taipei, Taiwan, April 19-24, 2009)
pp. 4237-4240.
Piscataway, NJ, USA : Institute of Electrical and Electronics Engineers.
Dehak, Najim, Kenny, Patrick et Dumouchel, Pierre.
2007.
« Continuous prosodic features and formant modeling with joint factor analysis for speaker verification ».
In 8th Annual Conference of the International Speech Communication Association (Interspeech) (Antwerp, Belgium, Aug. 27-31, 2007)
pp. 853-856.
Bonn, Germany : International Speech Communication Association.
Dehak, R., Dehak, N., Kenny, P. et Dumouchel, Pierre.
2008.
« Linear and non linear kernel GMM support vector machines for speaker verification ».
In 8th Annual Conference of the International Speech Communication Association (Interspeech) (Antwerp, Belgium, Aug. 27-31, 2007)
pp. 733-736.
Bonn, Germany : International Speech Communication Association.
Dehak, Réda, Dehak, Najim, Kenny, Patrick et Dumouchel, Pierre.
2008.
« Kernel combination for SVM speaker verification ».
In Odyssey : the Speaker and Language Recognition Workshop (Stellenbosch, South Africa, Jan. 21-24, 2008)
Dehak, Réna, Dehak, Najim, Kenny, Patrick et Dumouchel, Pierre.
2007.
« Linear and non linear kernel GMM super vector machines for speaker verification ».
In Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH (Antwerp, Belgium, Aug. 27-31, 2007)
pp. 733-736.
ISCA.
Dumouchel, P..
1994.
« Suprasegmental features and continuous speech recognition ».
In Proceedings of ICASSP '94. IEEE International Conference on Acoustics, Speech and Signal Processing (Adelaide, Australia, Apr. 19-22, 1994)
II177-II180.
Institute of Electrical and Electronics Engineers Inc..
Dumouchel, P., Gupta, V., Lennig, M. et Mermelstein, P..
1988.
« Three probabilistic language models for a large-vocabulary speech recognizer ».
In IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (New York City, NY, USA, Apr. 11-14, 1988)
pp. 513-516.
New York, NY, USA : IEEE.
Dumouchel, P. et O'Shaughnessy, D..
1995.
« Segmental intensity and HMM modelling ».
In Canadian Conference on Electrical and Computer Engineering, 1995 (Montreal, QC, Canada, Sept. 5-8, 1995)
pp. 995-998.
IEEE.
Dumouchel, Pierre, Dehak, Najim, Attabi, Yazid, Dehak, Reda et Boufaden, Narjes.
2009.
« Cepstral and long-term features for emotion recognition ».
In 10th Annual Conference of the International Speech Communication Association (INTERSPEECH) (Brighton, England, Sept. 6-10, 2009)
pp. 344-347.
ISCA-INST Speech Communication Assoc..
Dumouchel, Pierre, Vergin, Rivarol, O'Shaughnessy, Douglas et Rouat, Jean.
1997.
« La reconnaissance de la parole en français ».
In Les techniques d'intelligence artificielle appliquées aux technologies de l'information (Montréal, QC, Canada, 15-16 mai 1997)
Coll. « Cahiers Scientifiques », vol. 90.
pp. 77-86.
Montréal, Canada : Association Canadienne-Française pour l'Avancement des Sciences .
Fagundes, Rubem Dutra Ribeiro, Corrêa, Juarez Sagebin et Dumouchel, Pierre.
2002.
« A new phonetic model for continuous speech recognition systems ».
In 6th International Conference on Signal Processing (ICSP) (Beijing, China, Aug. 26-30, 2002)
pp. 572-575.
Institute of Electrical and Electronics Engineers Inc..
Gupta, V., Boulianne, G., Kenny, P. et Dumouchel, Pierre.
2008.
« Advertisement detection in french broadcast news using acoustic repetition and gaussian mixture models ».
In 9th Annual Conference of the International Speech Communication Association (INTERSPEECH) (Brisbane, Australia, Sept. 22-26, 2008)
pp. 2538-2541.
Bonn, Germany : International Speech Communication Association.
Gupta, V., Boulianne, G., Kenny, P., Ouellet, P. et Dumouchel, Pierre.
2008.
« Speaker diarization of french broadcast news ».
In IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (Las Vegas, NV, USA, March 31-April 4, 2008)
pp. 4365-4368.
Piscataway, NJ, USA : Institute of Electrical and Electronics Engineers.
Gupta, Vishwa, Kenny, Patrick, Ouellet, Pierre, Boulianne, Gilles et Dumouchel, Pierre.
2008.
« Multiple feature combination to improve speaker diarization of telephone conversations ».
In IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU) (Kyoto, Japan, Dec. 9-13, 2007)
pp. 705-710.
Piscataway, NJ, USA : Institute of Electrical and Electronics Engineers.
Hill, Edward, Dumouchel, Pierre et Moehs, Charles.
2011.
« An evidence-based toolset to Capture, Measure, and Assess Emotional Health ».
In 16th Annual CyberPsychology and CyberTherapy Conference (Gatineau, Canada, June 19-22, 2011)
Kenny, P., Boulianne, G. et Dumouchel, P..
2002.
« Maximum likelihood estimation of eigenvoices and residual variances for large vocabulary speech recognition tasks ».
In 7th International Conference on Spoken Language Processing, ICSLP 2002 (Denver, CO, USA, Sept. 16-20, 2002)
pp. 57-60.
International Speech Communication Association.
Kenny, P. et Dumouchel, P..
2004.
« Disentangling speaker and channel effects in speaker verification ».
In IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (Montréal, QC, Canada, May 17-21, 2004)
I37-I40.
Kenny, P., Mihoubi, M. et Dumouchel, P..
2003.
« New MAP estimators for speaker recognition ».
In EUROSPEECH 2003 - 8th European Conference on Speech Communication and Technology (Geneva, Switzerland, Sept. 01-04, 2003)
pp. 2961-2964.
International Speech Communication Association.
Kenny, Patrick, Boulianne, Gilles et Dumouchel, Pierre.
2000.
« Bayesian adaptation revisited ».
In Proceedings of ISCA Tutorial and Research Workshop (ITRW) ASR2000-Automatic Speech Recognition: Challenges for the new Millenium (Paris, France, 18-20 sept. 2000)
pp. 112-119.
Kenny, Patrick, Boulianne, Gilles et Dumouchel, Pierre.
2001.
« What is the best type of prior distribution for EMAP speaker adaptation? ».
In EUROSPEECH 2001 - SCANDINAVIA - 7th European Conference on Speech Communication and Technology (Aalborg, Denmark, Sept. 3-7, 2001)
pp. 1207-1210.
International Speech Communication Association.
Kenny, Patrick, Boulianne, Gilles, Ouellet, Pierre et Dumouchel, Pierre.
2006.
« Improvements in factor analysis based speaker verification ».
In IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (Toulouse, France, May 14-19, 2006)
I113-I116.
Kenny, Patrick, Boulianne, Gilles, Ouellet, Pierre et Dumouchel, Pierre.
2006.
« The geometry of the channel space in GMM-based speaker recognition ».
In IEEE Odyssey-The Speaker and Language Recognition Workshop (San Juan, PR, USA, June 28-30, 2006)
Piscataway, N.J. : Institute of Electrical and Electronics Engineers.
Kenny, Patrick, Boulianne, Gilles, Quellet, Pierre et Dumouchel, Pierre.
2005.
« Factor analysis simplified ».
In IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) (Philadelphia, PA, USA, Mar. 18-23, 2005)
pp. 637-640.
Piscataway, NJ, USA : Institute of Electrical and Electronics Engineers.
Kenny, Patrick, Dehak, Najim, Dehak, Réda, Gupta, Vishwa et Dumouchel, Pierre.
2008.
« The role of the speaker factors in the NIST extended data task ».
In Odyssey : the Speaker and Language Recognition Workshop (Stellenbosch, South Africa, Jan. 21-24, 2008)
Kenny, Patrick, Dehak, Najim, Ouellet, Pierre, Gupta, Vishwa et Dumouchel, Pierre.
2008.
« Development of the primary CRIM system for the NIST 2008 speaker recognition evaluation ».
In 9th Annual Conference of the International Speech Communication Association (INTERSPEECH) (Brisbane, Australia, Sept. 22-26, 2008)
pp. 1401-1404.
Bonn, Germany : International Speech Communication Association.
Kenny, Patrick et Dumouchel, Pierre.
2004.
« Experiments in speaker verification using factor analysis likelihood ratios ».
In ODYSSEY 2004 - The Speaker and Language Recognition Workshop (Toledo, Spain, May 31-June 3, 2004)
Kenny, Patrick, Gupta, Vishwa, Boulianne, Gilles, Ouellet, Pierre et Dumouchel, Pierre.
2006.
« Feature normalization using smoothed mixture transformations ».
In Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH (Pittsburgh, PA, USA, Sept. 17-21, 2006)
pp. 25-28.
United Kingdom : DUMMY PUBID.
Kenny, Patrick, Stafylakis, Themos, Ouellet, Pierre, Alam, Md Jahangir et Dumouchel, Pierre.
2013.
« PLDA for speaker verification with utterances of arbitrary duration ».
In 2013 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (Vancouver, BC, Canada, May 26-31, 2013)
pp. 7649-7653.
Institute of Electrical and Electronics Engineers.
Mihoubi, M., Boulianne, G. et Dumouchel, P..
2003.
« Discriminative training and maximum likelihood detector for speaker identification ».
In EUROSPEECH 2003 - 8th European Conference on Speech Communication and Technology (Geneva, Switzerland, Sept. 01-04, 2003)
pp. 2657-2660.
International Speech Communication Association.
Mihoubi, M., Dumouchel, P. et O'Shaughnessy, D..
2004.
« The use of typical sequences for robust speaker identification ».
In INTERSPEECH 2004 - 8th International Conference on Spoken Language Processing, ICSLP (Jeju Island, South Korea, Oct. 04-08, 2004)
pp. 2349-2352.
International Speech Communication Association.
Mihoubi, M., O'Shaughnessy, D. et Dumouchel, P..
2005.
« Relevant information extraction for discriminative training applied to speaker identification ».
In Interspeech 2006 - 9th European Conference on Speech Communication and Technology (Lisbon, Portugal, Sept. 04-08, 2005)
pp. 3097-3100.
International Speech Communication Association.
Ouali, C., Dumouchel, Pierre et Gupta, V..
2014.
« A robust audio fingerprinting method for content-based copy detection ».
In 2014 12th International Workshop on Content-Based Multimedia Indexing (CBMI) (Klagenfurt, Austria, June 18-20, 2014)
Piscataway, N. J., USA : IEEE.
Ouali, C., Dumouchel, Pierre et Gupta, V..
2014.
« Robust features for content-based audio copy detection ».
In Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH (Singapor, Singapor, Sept. 14-18, 2014)
pp. 2395-2399.
International Speech and Communication Association.
Ouali, Chahid, Dumouchel, Pierre et Gupta, Vishwa.
2015.
« Efficient spectrogram-based binary image feature for audio copy detection ».
In 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (South Brisbane, QLD, Australia, Apr. 19-24, 2015)
pp. 1792-1796.
Piscataway, NJ, USA : IEEE.
Ouali, Chahid, Dumouchel, Pierre et Gupta, Vishwa.
2015.
« GPU implementation of an audio fingerprints similarity search algorithm ».
In 2015 13th International Workshop on Content-Based Multimedia Indexing (CBMI) (Prague, Czech Republic, June 10-12, 2015)
Piscataway, NJ, USA : IEEE.
Ouali, Chahid, Dumouchel, Pierre et Gupta, Vishwa.
2017.
« Robust video fingerprints using positions of salient regions ».
In IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) (New Orleans, LA, USA, Mar. 05-09, 2017)
pp. 3041-3045.
IEEE.
Ouali, Chalid, Dumouchel, Pierre et Gupta, Vishwa.
2015.
« Content-based multimedia copy detection ».
In 2015 IEEE International Symposium on Multimedia (ISM) (Miami, FL, USA, Dec. 14-16, 2015)
pp. 597-600.
Los Alamitos, CA, USA : IEEE Computer Society.
Ouellet, P., Tadj, C. et Dumouchel, P..
1998.
« Dialog and prosodic models for text-independent speaker identification ».
In RLA2C Speaker Recognition and its Commercial and Forensic Applications (Avignon, France, Apr. 20-23, 1998)
pp. 41-44.
Paris, France : Centre national de la recherche scientifique.
Senoussaoui, Mohamed, Kenny, Patrick, Brümmer, Niko, de Villiers, Edward et Dumouchel, Pierre.
2011.
« Mixture of PLDA models in I-vector space for gender-independent speaker recognition ».
In 12th International Conference of Interspeech (Interspeech) (Florence, Italy, Aug. 28-Sept. 1, 2011)
pp. 25-28.
France : International Speech and Communication Association.
Senoussaoui, Mohammed, Dehak, Najim, Kenny, Patrick, Dehak, Reda et Dumouchel, Pierre.
2012.
« First attempt of boltzmann machines for speaker verification ».
In Speaker and Language Recognition Workshop, Odyssey 2012 (Singapore, Singapore, June 25-28, 2012)
pp. 117-121.
Chinese and Oriental Languages Information Processing Society (COLIPS), Speaker and Language Characterization SIG.
Senoussaoui, Mohammed, Kenny, Patrick, Dehak, Najim et Dumouchel, Pierre.
2010.
« An i-vector extractor suitable for speaker recognition with both microphone and telephone speech ».
In Odyssey 2010 : The Speaker and Language Recognition Workshop (Brno, Czech Republic, June 28-July 1, 2010)
Senoussaoui, Mohammed, Kenny, Patrick, Dumouchel, Pierre et Castaldo, Fabio.
2011.
« Well-calibrated heavy tailed Bayesian speaker verification for microphone speech ».
In 36th IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) (Prague, Czech republic, May 22-27, 2011)
pp. 4824-4827.
Piscataway, NJ, USA : Institute of Electrical and Electronics Engineers.
Senoussaoui, Mohammed, Kenny, Patrick, Dumouchel, Pierre et Dehak, Najim.
2013.
« New cosine similarity scorings to implement gender-independent speaker verification ».
In Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH (Lyon, France, Aug. 25-29, 2013)
pp. 2773-2777.
International Speech and Communication Association.
Senoussaoui, Mohammed, Kenny, Patrick, Dumouchel, Pierre et Stafylakis, Themos.
2013.
« Efficient iterative mean shift based cosine dissimilarity for multi-recording speaker clustering ».
In 2013 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (Vancouver, BC, Canada, May 26-31, 2013)
pp. 7712-7715.
Institute of Electrical and Electronics Engineers.
Smaili, N., Cardinal, P., Boulianne, G. et Dumouchel, P..
2002.
« Disambiguation of finite-state transducers ».
In Proceedings of the 19th International Conference on Computational Linguistics (COLING2002) (Taipei, Taiwan, Aug. 26-30, 2002)
Association for Computational Linguistics (ACL).
Stafylakis, T., Katsouros, V., Kenny, P. et Dumouchel, P..
2012.
« Mean shift algorithm for exponential families with applications to speaker clustering ».
In Speaker and Language Recognition Workshop, Odyssey 2012 (Singapore, Singapore, June 25-28, 2012)
pp. 324-329.
Chinese and Oriental Languages Information Processing Society (COLIPS), Speaker and Language Characterization SIG.
Stafylakis, T., Kenny, P., Ouellet, P., Perez, J., Kockmann, M. et Dumouchel, Pierre.
2013.
« Text-dependent speaker recognition using PLDA with uncertainty propagation ».
In Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH (Lyon, France, Aug. 25-29, 2013)
pp. 3684-3688.
International Speech and Communication Association.
Stafylakis, Themos, Katsouros, Vassilis, Kenny, Patrick et Dumouchel, Pierre.
2012.
« A mean shift algorithm for manifolds of exponential families ».
In 2012 11th International Conference on Information Science, Signal Processing and their Applications (ISSPA) (Montreal, QC, Canada, July 2-5, 2012)
pp. 511-516.
Washington, DC : IEEE.
Stafylakis, Themos, Kenny, Patrick, Gupta, Vishwa et Dumouchel, Pierre.
2013.
« Compensation for inter-frame correlations in speaker diarization and recognition ».
In 2013 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (Vancouver, BC, Canada, May 26-31, 2013)
pp. 7731-7735.
Institute of Electrical and Electronics Engineers.
Stafylakis, Themos, Kenny, Patrick, Senoussaoui, Mohammed et Dumouchel, Pierre.
2012.
« PLDA using Gaussian restricted Boltzmann machines with application to speaker verification ».
In 13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012 (Portland, OR, USA, Sept. 9-12, 2012)
pp. 1690-1693.
International Speech Communications Association.
Stafylakis, Themos, Kenny, Patrick, Senoussaoui, Mohammed et Dumouchel, Pierre.
2012.
« Preliminary investigation of Boltzmann machine classifiers for speaker recognition ».
In Speaker and Language Recognition Workshop, Odyssey 2012 (Singapore, Singapore, June 25-28, 2012)
pp. 109-116.
Chinese and Oriental Languages Information Processing Society (COLIPS), Speaker and Language Characterization SIG.
Tadj, Chakib, Dumouchel, Pierre et Fang, Yu.
1998.
« N-Best GMM's for speaker identification ».
In Eurospeech 97: 5th European Conference on Speech Communication and Technology: Rhodes, Greece, 22-25 September, 1997 (Rhodes, Greece, Sept. 22-25, 1997)
pp. 2295-2298.
ESCA.
Tadj, Chakib, Dumouchel, Pierre, Mihoubi, Mohamed et Ouellet, Pierre.
1999.
« Environment adaptation and long term parameters in speaker identication ».
In Eurospeech 99: 6th European Conference on Speech Communication and Technology: Budapest, Hungary, September 5-9, 1999 (Budapest, Hungary, Sept. 5-9, 1999)
pp. 1015-1018.
Bonn, Germany : ESCA.
Tadj, Chakib, Dumouchel, Pierre et Ouellet, Pierre.
1998.
« GMM based speaker identification using training-time-dependent number of mixtures ».
In IEEE International Conference on Acoustics, Speech and Signal Processing (Seattle, WA, USA, May 12-15, 1998)
pp. 761-764.
Piscataway, NJ, USA : Institute of Electrical and Electronics Engineers.
Tadj, Chakib, Dumouchel, Pierre et Poirier, Frank.
1998.
« FDVQ based keyword spotter which incorporates a semi-supervised learning for primary processing ».
In Eurospeech 97: 5th European Conference on Speech Communication and Technology: Rhodes, Greece, 22-25 September, 1997 (Rhodes, Greece, Sept. 22-25, 1997)
pp. 2799-2802.
ESCA.
Talon, Marie-Hélène, Dumouchel, Pierre et O'Shaughnessy, Douglas.
1997.
« Modélisation du langage pour la reconnaissance de la parole : une approche stochastique non-déterministe à contexte de longueur variable ».
In Les techniques d'intelligence artificielle appliquées aux technologies de l'information (Montréal, QC, Canada, 15-16 mai 1997)
Coll. « Cahiers Scientifiques », vol. 90.
pp. 194-205.
Montréal, Canada : Association Canadienne-Française pour l'Avancement des Sciences .
Vergin, Rivarol, O'Shaughnessy, Douglas et Dumouchel, Pierre.
1999.
« Toward parametric representation of speech for speaker recognition systems ».
In Eurospeech 99: 6th European Conference on Speech Communication and Technology: Budapest, Hungary, September 5-9, 1999 (Budapest, Hungary, Sept. 5-9, 1999)
pp. 795-798.
Bonn, Germany : ESCA. Chapitre de livre
Boulianne, Gilles, Beaumont, Jean-François, Boisvert, Maryse, Brousseau, Julie, Cardinal, Patrick, Chapdelaine, Claude, Comeau, Michel, Ouellet, Pierre, Osterrath, Frédéric et Dumouchel, Pierre.
2010.
« Shadow speaking for real-time closed-captioning of TV broadcasts in french ».
In
Listening to subtitles : subtitles for the deaf and hard of hearing.
New York, NY, USA : Peter Lang International Academic Professional Publishers.
Dumouchel, Pierre, Boulianne, Gilles et Brousseau, Julie.
2011.
« Measures for quality of closed captioning ».
In
Audiovisual translation in close-up : practical and theoretical approaches.
New York, NY, USA : Peter Lang International Academic Professional Publishers.
Hill, Edward, Dumouchel, Pierre et Moehs, Charles.
2011.
« An evidence-based toolset to capture, measure and assess emotional health ».
In
Annual Review of Cybertherapy and Telemedicine 2011.
Coll. « Studies in health technology and informatics », vol. 167.
pp. 176-181. IOS Press. CommunicationCheriet, Mohamed, Crevier, Daniel, de Guise, Jacques A., Doré, Sylvie, Dumouchel, Pierre, Lepage, Richard, Noumeir, Rita et Sabourin, Robert. 1996. « Présentation des activités du laboratoire d'imagerie, de vision et d'intelligence artificielle ». Communication lors de la conférence : Conférence-midi du Département de Génie Électrique de l'ÉTS (Montréal, QC, Canada, 15 févr. 1996). Dumouchel, Pierre. 2005. « CRIM's work on speech recognition and captioning ». Communication lors de la conférence : Commonwealth Hansard Editors Association Conference (AB, Canada, Aug. 8, 2005). Dumouchel, Pierre. 2023. « Discours de clôture de la conférence ». Communication lors de la conférence : 4é édition du forum MobiliT.AI (Toulouse, France, 30-31 mai 2023). Dumouchel, Pierre. 2007. « E-inclusion : la recherche au service des sourds et des aveugles ». Communication lors de la conférence : 75e Congrès de l'ACFAS (Trois-Rivières, QC, Canada, 7-11 mai 2007). Dumouchel, Pierre. 2005. « E-inclusion, a new canadian initiative ». Communication lors de la conférence : E-Challenges (Ljubljana, Slovénia, Oct. 19-21, 2005). Dumouchel, Pierre. 2005. « Exploiting audio-visual metadata ». Communication lors de la conférence : Canadian Metadata Forum (Ottawa, ON, Canada, Sept. 27-28, 2005). Dumouchel, Pierre. 2000. « Keynote lecturer ». Communication lors de la conférence : ED-MEDIA 2000 : World Conference on Educational Multimedia, Hypermedia and Telecommunications (Montreal, QC, Canada, July, 2000). Dumouchel, Pierre. 2005. « Real-time closed captioning ». Communication lors de la conférence : Canadian Hard of Hearing Association Conference (Kelowna, BC, Canada, June 2-5, 2005). Dumouchel, Pierre et O'Shaughnessy, D.. 1995. « Segmental duration and HMM modelling ». Communication lors de la conférence : ESCA EUROSPEECH 1995 4th European Conference on Speech Communication (Madrid, Spain, Sept., 1995). Dumouchel, Pierre, Vergin, R. et O'Shaughnessy, D.. 1997. « La reconnaissance automatique de la parole en français ». Communication lors de la conférence : 65e Congrès de l'ACFAS (Trois-Rivières, QC, Canada, mai 1997). Kenny, P., Boulianne, G., Ouellet, P. et Dumouchel, Pierre. 2006. « The geometry of the channel space in GMM-based speaker recognition ». Communication lors de la conférence : IEEE Odyssey-The Speaker and Language Recognition Workshop (San Juan, PR, USA, June 28-30, 2006). Kenny, P., Gupta, V., Boulianne, G., Ouellet, P. et Dumouchel, Pierre. 2006. « Feature normalization using smoothed mixture transformations ». Communication lors de la conférence : International Conference on Spoken Language Processing Proceedings (INTERSPEECH) (Pittsburgh, PA, USA, Sept. 17-21, 2006). Lepage, Richard, Noumeir, Rita et Dumouchel, Pierre. 1996. « Présentation du colloque "L'intelligence artificielle dans les technologies de l'information" ». Communication lors de la conférence : 25e Canadian Science Writers' Association Annual Conference (Montréal, QC, Canada, May, 1996). Ouellet, P., Boulianne, G., Brousseau, J. et Dumouchel, Pierre. 2000. « Reconnaissance de la parole par composition de transducteurs à états finis ». Communication lors de la conférence : ACFAS : Colloque sur la Modélisation du Monde Réel (Montréal, QC, Canada, mai 2000). Tadj, Chakib, Dumouchel, Pierre, Mihoubi, M. et Ouellet, P.. 1999. « Environment adaptation and long term parameters in speaker identification ». Communication lors de la conférence : European Speech Communication Association (EuroSpeech) (Budapest, Hungary, Sept., 1999). Talon, M.-H., Dumouchel, Pierre et O'Shaughnessy, D.. 1997. « Modélisation du langage pour la reconnaissance de la parole: une approche stochastique non-déterministe à contexte de longueur variable ». Communication lors de la conférence : 65e Congrès de l'ACFAS (Trois-Rivières, QC, Canada, mai 1997). Talon, M.-H., Pierre, Dumouchel et O'Shaughnessy, D.. 1996. « Modélisation du langage pour la reconnaissance de la parole: une approche stochastique non-déterministe à contexte de longueur convenable ». Communication lors de la conférence : Colloque multidisciplinaire L'intelligence artificielle dans les technologies de l'information tenu dans le cadre du Congrès de l'ACFAS (Montréal, QC, Canada, 13-17 mai 1996). Vergin, R., Dumouchel, Pierre et O'Shaughnessy, D.. 1996. « La reconnaissance automatique de la parole en français québécois ». Communication lors de la conférence : Colloque multidisciplinaire L'intelligence artificielle dans les technologies de l'information tenu dans le cadre du Congrès de l'ACFAS (Montréal, QC, Canada, 13-17 mai 1996). BrevetLe Devehat, Yannick, Perron, David, Fraysse, Olivier, Dumouchel, Pierre, Landry, René Jr et Rivest, François (inventeurs) 2 août 2011. « Method of improving successful recognition of genuine acoustic authentication devices ». Identita Technologies International SRL (titulaire(s)). Brevet américain US 7,992,067. |