List of publications by "Cardinal, Patrick"
Number of archived documents: 61.
2023
Praveen, R. Gnana, Cardinal, Patrick et Granger, Eric.
2023.
« Audio-visual fusion for emotion recognition in the valence-arousal space using joint cross-attention ».
IEEE Transactions on Biometrics, Behavior, and Identity Science, vol. 5, nº 3.
pp. 360-373.
Praveen, R. Gnana, Granger, Eric et Cardinal, Patrick.
2023.
« Recursive joint attention for audio-visual fusion in regression based emotion recognition ».
In IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (Rhodes Island, Greece, June 04-10, 2023)
Institute of Electrical and Electronics Engineers Inc..
Simard, Guillaume, Melançon, Cédric, Cardinal, Patrick et Gascon-Samson, Julien.
2023.
« Performance characterization of MQTT brokers in a device-local edge deployment ».
In MiddleWEdge 2023 - Proceedings of the 2nd International Workshop on Middleware for the Edge (Bologna, Italy, Dec. 11, 2023)
pp. 13-18.
Association for Computing Machinery, Inc.
2022
Baril, Guillaume, Cardinal, Patrick et Koerich, Alessandro Lameiras.
2022.
« Named entity recognition for audio de-identification ».
In International Joint Conference on Neural Networks (IJCNN) (Padua, Italy, July 18-23, 2022)
Institute of Electrical and Electronics Engineers Inc..
Esmaeilpour, Mohammad, Cardinal, Patrick et Koerich, Alessandro Lameiras.
2022.
« From environmental sound representation to robustness of 2D CNN models against adversarial attacks ».
Applied Acoustics, vol. 195.
Esmaeilpour, Mohammad, Cardinal, Patrick et Koerich, Alessandro Lameiras.
2022.
« Towards robust speech-to-text adversarial attack ».
In 47th IEEE International Conference on Acoustics, Speech, and Signal Processing (Singapore, Singapore, May 23-27, 2022)
pp. 2869-2873.
Institute of Electrical and Electronics Engineers Inc..
Esmaeilpour, Mohammad, Cardinal, Patrick et Lameiras Koerich, Alessandro.
2022.
« Multidiscriminator sobolev defense-GAN against adversarial attacks for end-to-end speech systems ».
IEEE Transactions on Information Forensics and Security, vol. 17.
pp. 2044-2058.
Esmaeilpour, Mohammad, Chaalia, Nourhene, Abusitta, Adel, Devailly, François-Xavier, Maazoun, Wissem et Cardinal, Patrick.
2022.
« Bi-discriminator GAN for tabular data synthesis ».
Pattern Recognition Letters, vol. 159.
pp. 204-210.
Esmaeilpour, Mohammad, Chaalia, Nourhene et Cardinal, Patrick.
2022.
« RSD-GAN: Regularized sobolev defense GAN against speech-to-text adversarial attacks ».
IEEE Signal Processing Letters, vol. 29.
pp. 1998-2002.
Praveen, R. G., de Melo, W. C., Ullah, N., Aslam, H., Zeeshan, O., Denorme, T., Pedersoli, M., Koerich, A. L., Bacon, S., Cardinal, P. et Granger, E..
2022.
« A joint cross-attention model for audio-visual fusion in dimensional emotion recognition ».
In IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) (New Orleans, LA, USA, June 19-20, 2022)
pp. 2485-2494.
Piscataway, NJ, USA : IEEE.
2021
Chabot, Philippe, Bouserhal, Rachel E., Cardinal, Patrick et Voix, Jérémie.
2021.
« Detection and classification of human-produced nonverbal audio events ».
Applied Acoustics, vol. 171.
Esmaeilpour, Mohammad, Cardinal, Patrick et Lameiras Koerich, Alessandro.
2021.
« Class-conditional defense GAN against end-to-end speech attacks ».
In IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) (Toronto, ON, Canada - Online, June 06-11, 2021)
pp. 2565-2569.
Institute of Electrical and Electronics Engineers Inc..
Esmaeilpour, Mohammad, Cardinal, Patrick et Lameiras Koerich, Alessandro.
2021.
« Cyclic defense GAN against speech adversarial attacks ».
IEEE Signal Processing Letters, vol. 28.
pp. 1769-1773.
Praveen, R. Gnana, Granger, Eric et Cardinal, Patrick.
2021.
« Cross attentional audio-visual fusion for dimensional emotion recognition ».
In 16th IEEE International Conference on Automatic Face and Gesture Recognition (FG 2021) (Jodhpur, India, Dec. 15-18, 2021)
Institute of Electrical and Electronics Engineers Inc..
Rajasekhar, Gnana Praveen, Granger, Eric et Cardinal, Patrick.
2021.
« Deep domain adaptation with ordinal regression for pain assessment using weakly-labeled videos ».
Image and Vision Computing, vol. 110.
Saadati, Mirmohammad, Pedersoli, Marco, Cardinal, Patrick et Oliver, Peter.
2021.
« RADARSAT-2 Synthetic-Aperture radar land cover segmentation using deep convolutional neural networks ».
In Pattern Recognition. ICPR International Workshops and Challenges, Virtual Event, January 10-15, 2021, Proceedings Part VIII (Milan, Italy, Jan. 10-15, 2021)
Coll. « Lecture Notes in Computer Science », vol. 12668.
pp. 106-117.
Springer.
2020
Dufour, Marie-Michèle, Lanovaz, Marc J. et Cardinal, Patrick.
2020.
« Artificial intelligence for the measurement of vocal stereotypy ».
Journal of the Experimental Analysis of Behavior, vol. 114, nº 3.
pp. 368-380.
Esmaeilpour, Mohammad, Cardinal, Patrick et Lameiras Koerich, Alessandro.
2020.
« A robust approach for securing audio classification against adversarial attacks ».
IEEE Transactions on Information Forensics and Security, vol. 15, nº 1.
pp. 2147-2159.
Esmaeilpour, Mohammad, Cardinal, Patrick et Lameiras Koerich, Alessandro.
2020.
« Unsupervised feature learning for environmental sound classification using Weighted Cycle-Consistent Generative Adversarial Network ».
Applied Soft Computing, vol. 86.
Praveen, Gnana R., Granger, Eric et Cardinal, Patrick.
2020.
« Deep weakly supervised domain adaptation for pain localization in videos ».
In 15th IEEE International Conference on Automatic Face and Gesture Recognition (FG) (Buenos Aires, Argentina, Nov. 16-20, 2020)
pp. 473-480.
IEEE Computer Society.
Sallo, Raymel Alfonso, Esmaeilpour, Mohammad et Cardinal, Patrick.
2020.
« Adversarially training for audio classifiers ».
In 25th International Conference on Pattern Recognition (ICPR) (Milan, Italy, Jan. 10-15, 2021)
pp. 9569-9576.
Piscataway, NJ, USA : IEEE.
Senoussaoui, Mohammed, Sarria-Paja, Milton O., Cardinal, Patrick, Falk, Tiago H. et Michaud, François.
2020.
« State-of-the-art speaker recognition methods applied to speakers with dysarthria ».
In Voice Technologies for Speech Reconstruction and Enhancement.
Coll. « Speech Technology and Text Mining in Medicine and Health Care », vol. 6.
pp. 7-34.
Boston; Berlin : De Gruyter.
2019
Abdoli, Sajjad, Cardinal, Patrick et Lameiras Koerich, Alessandro.
2019.
« End-to-end environmental sound classification using a 1D convolutional neural network ».
Expert Systems with Applications, vol. 136.
pp. 252-263.
Aminbeidokhti, Masih, Pedersoli, Marco, Cardinal, Patrick et Granger, Eric.
2019.
« Emotion recognition with spatial attention and temporal softmax pooling ».
In Image Analysis and Recognition : 16th International Conference, ICIAR : Proceedings (Waterloo, ON, Canada, Aug. 27-29, 2019)
Coll. « Lecture Notes in Computer Science », vol. 11662.
pp. 323-331.
Cham, Switzerland : Springer International Publishing.
Bouserhal, Rachel, Sarria-Paja, Milton, Cardinal, Patrick et Voix, Jérémie.
2019.
« Classification of nonverbal human produced audio events: a pilot study ».
Conference presentation: National Hearing Conservation Association Annual Conference 2019 (Grapevine, TX, USA, Feb. 07-09, 2019).
Lanovaz, Marc J., Cardinal, Patrick et Francis, Mary.
2019.
« Using a visual structured criterion for the analysis of alternating-treatment designs ».
Behavior Modification, vol. 43, nº 1.
pp. 115-131.
Lanovaz, Marc J., Turgeon, Stéphanie, Cardinal, Patrick et Wheatley, Tara L..
2019.
« Using single-case designs in practical settings: Is within-subject replication always necessary? ».
Perspectives on Behavior Science, vol. 42, nº 1.
pp. 153-162.
Ortega, Juan D. S., Cardinal, Patrick et L. Koerich, Alessandro.
2019.
« Emotion recognition using fusion of audio and video features ».
In IEEE International Conference on Systems, Man and Cybernetics (SMC) (Bari, Italy, Oct. 06-09, 2019)
pp. 3847-3852.
Institute of Electrical and Electronics Engineers Inc..
2018
Bouserhal, Rachel E., Chabot, Philippe, Sarria-Paja, Milton, Cardinal, Patrick et Voix, Jérémie.
2018.
« Classification of nonverbal human produced audio events: A pilot study ».
In 19th Annual Conference of the International Speech Communication Association (INTERSPEECH 2018) (Hyderabad, India, Sept. 02-06, 2018)
pp. 1512-1516.
International Speech Communication Association.
Bouserhal, Rachel, Sarria-Paja, Milton, Cardinal, Patrick et Voix, Jérémie.
2018.
« Classification of nonverbal human produced audio events: a pilot study ».
Conference presentation: Workshop on machine hearing and learning (Montreal, QC, Canada, Sept. 21, 2018).
2017
Verduyckt, I., Cardinal, P., Loubnani, A. et Alpan, A..
2017.
« MyOrtho – A vocal coach application with visual feed-back for monitoring and storing of patient progress in a home environment ».
In 10th International Workshop Models and Analysis of Vocal Emissions for Biomedical Applications (Firenze, Italy, Dec. 13-15, 2017)
pp. 31-34.
Firenze University Press.
2016
Ali, Ahmed, Dehak, Najim, Cardinal, Patrick, Khurana, Sameer, Yella, Sree Harsha, Glass, James, Bell, Peter et Renals, Steve.
2016.
« Automatic dialect detection in Arabic broadcast speech ».
In 17th Annual Conference of the International Speech Communication Association (INTERSPEECH) (San Francisco, CA, USA, Sept. 08-16, 2016)
pp. 2934-2938.
Baixas, France : International Speech and Communication Association.
Boucher, Patrice, Dufour, Pierre, Plusquellec, Pierrich, Dehak, Najim, Dumouchel, Pierre et Cardinal, Patrick.
2016.
« PHYSIOSTRESS: A multimodal corpus of data on acute stress and physiological activation ».
In Workshop on Multimodal Corpora : Computer vision and language processing (MMC 2016) (Portoroz, Slovenia, May 23-28, 2016)
pp. 45-48.
European Language Resources Association (ELRA).
Senoussaoui, Mohammed, Cardinal, Patrick, Dehak, Najim et Koerich, Alessandro L..
2016.
« Native language detection using the i-vector framework ».
In 17th Annual Conference of the International Speech Communication Association (INTERSPEECH) (San Francisco, CA, USA, Sept. 08-16, 2016)
pp. 2398-2402.
Baixas, France : International Speech and Communication Association.
2015
Boutin, Simon, Tremblay, Réal, Cardinal, Patrick, Peters, Doug et Dumouchel, Pierre.
2015.
« Audio quotation marks for natural language understanding ».
In INTERSPEECH 2015. 16th Annual Conference of the International Speech Communication Association (Dresden, Germany, Sept. 6-10, 2015)
pp. 1349-1352.
International Speech Communication Association.
Cardinal, Patrick, Dehak, Najim, Koerich Lameiras, Alessandro, Alam, Jahangir et Boucher, Patrice.
2015.
« ETS System for AV+EC 2015 Challenge ».
In Proceedings of the 5th International Workshop on Audio/Visual Emotion Challenge (Brisbane, Australia, Oct. 26-30, 2015)
pp. 17-23.
ACM.
Cardinal, Patrick, Dehak, Najim, Zhang, Yu et Glass, James.
2015.
« Speaker adaptation using the I-vector technique for bottleneck features ».
In INTERSPEECH 2015. 16th Annual Conference of the International Speech Communication Association (Dresden, Germany, Sept. 6-10, 2015)
pp. 2867-2871.
International Speech Communication Association.
2014
Ali, Ahmed, Zhang, Yifan, Cardinal, Patrick, Dehak, Najim, Vogel, Stephan et Glass, James.
2014.
« A complete KALDI recipe for building Arabic speech recognition systems ».
In 2014 IEEE Spoken Language Technology Workshop (SLT) (South Lake Tahoe, NV, USA, Dec. 7-10, 2014)
pp. 525-529.
IEEE.
Cardinal, Patrick, Ali, Ahmed, Dehak, Najim, Zhang, Yu, Al Hanai, Tuka, Zhang, Yifan, Glass, James R. et Vogel, Stephan.
2014.
« Recent advances in ASR applied to an Arabic transcription system for Al-Jazeera ».
In INTERSPEECH 2014. 15th Annual Conference of the International Speech Communication Association (Singapore, Singapore, Sept. 14-18, 2014)
pp. 2088-2092.
International Speech Communication Association.
Gupta, Vishwa, Boulianne, Gilles et Cardinal, Patrick (inventors).
September 9, 2014.
« Content based audio copy detection ».
Centre de Recherche Informatique de Montréal (CRIM) (assignee).
U.S. Patent US 8,831,760.
2013
Cardinal, Patrick.
2013.
« Speech recognition on multi-core processors and GPUs ».
Doctoral thesis.
Montréal (Québec), École de technologie supérieure, 145 p.
Cardinal, Patrick, Dumouchel, Pierre et Boulianne, Gilles.
2013.
« Large vocabulary speech recognition on parallel architectures ».
IEEE Transactions on Audio, Speech, and Language Processing, vol. 21, nº 11.
pp. 2290-2300.
2012
Cardinal, Patrick, Boulianne, Gilles et Dumouchel, Pierre.
2012.
« The A* speech recognition system on parallel architectures ».
In 2012 11th International Conference on Information Science, Signal Processing and their Applications (ISSPA) (Montreal, QC, Canada, July 2-5, 2012)
pp. 108-113.
Washington, DC : IEEE Computer Society.
Cardinal, Patrick, Boulianne, Gilles et Dumouchel, Pierre.
2012.
« Using A* for the parallelization of speech recognition systems ».
In 2012 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) (Kyoto, Japan, Mar. 25-30, 2012)
pp. 4433-4436.
Piscataway, NJ : Institute of Electrical and Electronics Engineers Inc..
Gupta, Vishwa Nath, Boulianne, Gilles et Cardinal, Patrick.
2012.
« CRIM’s content-based audio copy detection system for TRECVID 2009 ».
Multimedia Tools and Applications, vol. 60, nº 2.
pp. 371-387.
2010
Boulianne, Gilles, Beaumont, Jean-François, Boisvert, Maryse, Brousseau, Julie, Cardinal, Patrick, Chapdelaine, Claude, Comeau, Michel, Ouellet, Pierre, Osterrath, Frédéric et Dumouchel, Pierre.
2010.
« Shadow speaking for real-time closed-captioning of TV broadcasts in french ».
In Listening to subtitles : subtitles for the deaf and hard of hearing.
New York, NY, USA : Peter Lang International Academic Professional Publishers.
Cardinal, Patrick, Gupta, Vishwa et Boulianne, Gilles.
2010.
« Content-based advertisement detection ».
In INTERSPEECH 2010. 11th Annual Conference of the International Speech Communication Association (Chiba, Makuhari, Japan, Sept. 26-30, 2010)
pp. 2214-2217.
International Speech Communication Association.
Gupta, Vishwa, Boulianne, Gilles et Cardinal, Patrick.
2010.
« CRIM's content-based audio copy detection system for TRECVID 2009 ».
In 2010 International Workshop on Content-Based Multimedia Indexing (CBMI) (Grenoble, France, June 23-25, 2010)
IEEE.
Gupta, Vishwa, Boulianne, Gilles et Cardinal, Patrick.
2010.
« Content-based audio copy detection using nearest-neighbor mapping ».
In 2010 IEEE International Conference on Acoustics Speech and Signal Processing (ICASSP) (Dallas, TX, USA, Mar. 14-19, 2010)
pp. 261-264.
IEEE.
2009
Cardinal, Patrick et Boulianne, Gilles.
2009.
« Real-time correction of closed-captions ».
In INTERSPEECH 2009 - 10th Annual Conference of the International Speech Communication Association (Brighton, UK, Sept. 6-10, 2009)
pp. 1447-1450.
International Speech and Communication Association.
Cardinal, Patrick, Dumouchel, Pierre et Boulianne, Gilles.
2009.
« Using parallel architectures in speech recognition ».
In INTERSPEECH 2009 - 10th Annual Conference of the International Speech Communication Association (Brighton, UK, Sept. 6-10, 2009)
pp. 3039-3042.
International Speech and Communication Association.
Héritier, Maguelonne, Gupta, Vishwa, Gagnon, Langis, Boulianne, Gilles, Foucher, Samuel et Cardinal, Patrick.
2009.
« CRIM's content-based copy detection system for TRECVID ».
In 2009 TREC Video Retrieval Evaluation Notebook Papers (Gaithersburg, MD, USA, Nov. 16, 2009)
National Institute of Standards and Technology.
2008
Cardinal, Patrick, Dumouchel, Pierre, Boulianne, Gilles et Comeau, Michel.
2008.
« GPU accelerated acoustics likelihood computations ».
In 9th Annual Conference of the International Speech Communication Association (INTERSPEECH) (Brisbane, Australia, Sept. 22-26, 2008)
pp. 964-967.
Bonn, Germany : International Speech Communication Association.
2007
Cardinal, P., Boulianne, G., Comeau, M. et Boisvert, M..
2007.
« Real-time correction of closed captions ».
In Proceedings of the 45th Annual Meeting of the Association of Computational Linguistics (Prague, Czech Republic, June 24-29, 2007)
pp. 113-116.
Association for Computational Linguistics (ACL).
2006
Boulianne, G., Beaumont, J.-F., Boisvert, M., Brousseau, J., Cardinal, P., Chapdelaine, C., Comeau, M., Ouellet, P. et Osterrath, F..
2006.
« Computer-assisted closed-captioning of live TV broadcasts in French ».
In INTERSPEECH 2006 : ICSLP ; Proceedings of the Ninth International Conference on Spoken Language Processing (Pittsburgh, PA, USA, Sept. 17-21, 2006)
pp. 273-276.
International Speech and Communication Association.
Cardinal, Patrick.
2006.
« E-Inclusion core speech forward-backward algorithm ».
Coll. « Collection scientifique et technique », vol. CRIM-06/05-05.
Centre de recherche informatique de Montréal. 6 p.
2005
Cardinal, Patrick, Boulianne, Gilles et Comeau, Michel.
2005.
« Segmentation of recordings based on partial transcriptions ».
In Proceedings of the 9th European Conference on Speech Communication and Technology (Interspeech'2005-Eurospeech) (Lisbon, Portugal, Sept. 4-8, 2005)
pp. 3345-3348.
International Speech and Communication Association.
2003
Boulianne, Gilles, Beaumont, Jean-François, Cardinal, Patrick, Comeau, Michel, Ouellet, Pierre et Dumouchel, Pierre.
2003.
« Automatic segmentation of film dialogues into phonemes and graphemes ».
In Proceedings of the 8th European Conference on Speech Communication and Technology (Eurospeech 2003) (Geneva, Switzerland, Sept. 1-4, 2003)
pp. 1241-1244.
International Speech and Communication Association.
Brousseau, Julie, Beaumont, Jean-François, Boulianne, Gilles, Cardinal, Patrick, Chapdelaine, Claude, Comeau, Michel, Osterrath, Frédéric et Ouellet, Pierre.
2003.
« Automated closed-captioning of live TV broadcast news in French ».
In Proceedings of the 8th European Conference on Speech Communication and Technology (Eurospeech 2003) (Geneva, Switzerland, Sept. 1-4, 2003)
pp. 1245-1248.
International Speech and Communication Association.
Cardinal, Patrick.
2003.
« Finite-state transducers and speech recognition ».
Master's thesis.
McGill University.
2002
Smaili, N., Cardinal, P., Boulianne, G. et Dumouchel, P..
2002.
« Disambiguation of finite-state transducers ».
In Proceedings of the 19th International Conference on Computational Linguistics (COLING2002) (Taipei, Taiwan, Aug. 26-30, 2002)
Association for Computational Linguistics (ACL).