direkt zum Inhalt springen

direkt zum Hauptnavigationsmenü

Sie sind hier

TU Berlin

Inhalt des Dokuments

Prof. Michael Wagner

Lupe

Research Field:

Speech science and technology

Research Topics:

Speaker recognition and characterisation, forensic speaker verification, face recognition, multimodal methods and fusion


Biography


Michael Wagner was born in Münster, Germany, studied physics and mathematics at the Universities of Münster and Munich, and in 1973 received his Diplomphysiker degree from Ludwig-Maximilians-Universität in Munich with a thesis on the computer simulation of an elementary particle spectrometer. In 1979 he received his PhD in computer science from the Australian National University with a thesis on the acoustic analysis of speaker characteristics. Dr Wagner held research and teaching positions at the Technical University of Munich, National University of Singapore, Nixdorf AG, the University of Wollongong, the Australian Defence Force Academy, and the Australian National University. Since 1996 he has held the Chair in Computing of the University of Canberra, where at various times he has been Head of the School of Computing, Head of the Discipline of Software Engineering, Director of the Human-Computer Communication Lab and Director of the National Centre for Biometric Studies. He has also been a visiting researcher at the Universities of Amsterdam, Hong Kong and Duisburg, and at Siemens Research Laboratories in Munich. Michael Wagner is the author of more than 120 refereed publications in the field of speech science and technology.

Address

Quality and Usability Lab
Deutsche Telekom Laboratories
TU Berlin
Ernst-Reuter-Platz 7
D-10587 Berlin, Germany
Tel:  +49 176 27093965
Fax: +49 30 8353 58409

Publications

MICHAEL WAGNER PUBLICATIONS

Refereed Papers

2014 

  • Wagner, M., “Liveness Assurance in Voice Authentication,” Encyclopedia of Biometrics. Springer, in print-2014.
  • Fernández Gallardo, L., Wagner, M., and Möller, S., “Spectral sub-band analysis of speaker verification employing narrowband and wideband speech,” in Proceedings of Odyssey 2014: The Speaker and Language Recognition Workshop, Joensuu, Finland,2014.
  • Chetty, G., Wagner, M., and Goecke, R., “A multilevel fusion approach for audiovisual emotion recognition,” in Advances in emotion recognition, Konar, A. and Chakraborty, A.,Eds. John Wiley & Sons, 2014, pp. 1–25.
  • Alghowinem, S., AlShehri, M., Goecke, R., and Wagner, M., “Exploring Eye Activity as an Indication of Emotional States Using an Eye-tracking Sensor,” in Intelligent Systems for Science and Information: Extended and Selected Results from the Science and Information Conference 2013, vol. 542, Springer, 2014, pp. 261–276.
  • Alghowinem, S., Alghuwinem, S., Alshehri, M., Alwabil, A., Goecke, R., and Wagner, M., “Design of an Emotion Elicitation Framework for Arabic Speakers,” in Proceedings of the 16th International Conference on Human-Computer Interaction, HCI International 2014, Heraklion, Greece, 2014.

2013

  • Wagner, M., “Biometric person authentication - the quiet revolution in computer security and forensic science [Invited Keynote Address],” in Proceedings of the 8th International Conference on Information Technology in Asia, CITA-13, Kuching, Malaysia, 2013.
  • Wagner, M., “Biometric person authentication - strengthening our defences in the face of a computer security crisis [Invited Keynote Address],” in Proceedings of the 3rd International Conference on Software Engineering and Computer Science, ICSECS-2013, Kuantan, Malaysia, 2013.
  • Vandyke, D., Wagner, M., and Goecke, R., “Voice source waveforms for utterance level speaker identification using support vector machines,” in Proceedings of the 8th International Conference on Information Technology in Asia, CITA-13, Kuching, Malaysia, 2013, pp. 31–37.
  • Vandyke, D., Wagner, M., and Goecke, R., “R-norm: Improving interspeaker variability modelling at the score level via regression score normalisation,” in Proceedings of the 14th Annual Conference of the International Speech Communication Association, Interspeech-2013, Lyon, France, 2013, pp. 3117–3121.
  • Vandyke, D., Rose, P., and Wagner, M., “The voice source in forensic voice comparison: a likelihood-ratio based investigation with the challenging yafm database,” in Proceedings of the International Association of Forensic Phonetics and Acoustics, Tampa, Florida, USA, 2013.
  • Joshi, J., Goecke, R., Alghowinem, S., Dhall, A., Wagner, M., Epps, J., Parker G., and Breakspear, M., “Multimodal Assistive Technologies for Depression Diagnosis and Monitoring,” Journal on Multimodal User Interfaces, vol. 7, no. 3, pp. 217–228, 2013.
  • Fernández Gallardo, L., Wagner, M., and Möller, S., “Transmission Channel Effects on Human Speaker Identification in Multi-Party Conference Calls,” in Proceedings of the 8th International Conference on Information Technology in Asia, CITA-13, Kuching, Malaysia, 2013, pp. 38–43.
  • Fernández Gallardo, L., Möller, S., and Wagner, M., “Human speaker identification of known voices transmitted through different user interfaces and transmission channels,” in Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP-2013, Vancouver, Canada, 2013, pp. 7775–7779.
  • Dhall, A., Goecke, R., Joshi, J., Wagner, M., and Gedeon, T., “Emotion Recognition In The Wild Challenge 2013,” in Proceedings of the 15th ACM International Conference on Multimodal Interaction, ICMI2013, Sydney, Australia, 2013, pp. 509–516. 
  • Alghowinem, S., Wagner, M., and Goecke, R., “AusTalk - The Australian Speech Database: Design Framework, Recording Experience and Localisation,” in Proceedings of the 8th International Conference on Information Technology in Asia, CITA-13, Kuching, Malaysia, 2013, pp. 24–30.
  • Alghowinem, S., Goecke, R., Wagner, M., Parker, G., and Breakspear, M., “Head Pose and Movement Analysis as an Indicator of Depression,” in Proceedings of the 5th iannual Humaine Association Conference on Affective Computing and Intelligent Interaction, ACII2013, Geneva, Switzerland, 2013.
  • Alghowinem, S., Goecke, R., Wagner, M., Parker, G., and Breakspear, M., “Eye Movement Analysis for Depression Detection,” in Proceedings of the 2013 IEEE International Conference on Image Processing, ICIP2013, Melbourne, Australia, 2013.
  • Alghowinem, S., Goecke, R., Wagner, M., Epps, J., Parker, G., and Breakspear, M., “Characterising Depressed Speech for Classification,” in Proceedings of the 14th Annual Conference of the International Speech Communication Association, Interspeech-2013, Lyon, France, 2013, pp. 2534–2538.
  • Alghowinem, S., Goecke, R., Wagner, M., Epps, J., Gedeon, T., Breakspear, M., and Parker G., “A comparative study of different classifiers for detecting depression from spontaneous speech,” in Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP-2013, Vancouver, Canada, 2013, pp. 8022–8026. 
  • Alghowinem, S., Goecke, R., Wagner, M., Epps, J., Breakspear, M., and Parker G., “Detecting depression: a comparison between spontaneous and read speech,” in Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP-2013, Vancouver, Canada, 2013, pp. 7547–7551.

2012

  • Wagner, M., “Multibiometric authentication,” in Advanced topics in biometrics, Li, H., Toh, K.-A., and Li, L., Eds. World Scientific Publishers, 2012, pp. 419–434.
  • Wagner, M., “Automatic speaker identification using the magnitude and phase spectra of inverse-filtered voiced speech,” in Quantitative approaches to problems in linguistics, Donohue, C., Ishihara, S., and Steed, W., Eds. Munich, Germany: Lincom Europa, 2012, pp. 197–205.
  • Vandyke, D., Wagner, M., and Goecke, R., “Speaker identification using glottal-source waveforms and support-vector-machine modelling,” in Proceedings of the 14th Australasian International Conference on Speech Science and Technology, SST-2012, Sydney, Australia, 2012, pp. 49–52.
  • Fernández Gallardo, L., Wagner, M., and Möller, S., “Analysis of automatic speaker verification performance over different narrowband and wideband telephone channels,” in Proceedings of the 14th Australasian International Conference on Speech Science and Technology, SST-2012, Sydney, Australia, 2012, pp. 157–160.
  • Fernández Gallardo, L., Möller, S., and Wagner, M., “Comparison of human speaker identification of known voices transmitted through narrowband and wideband communication systems,” in ITG-Fachbericht 236: Sprachkommunikation, Proceedings of the 10th ITG Symposium, Braunschweig, Germany, 2012, pp. 219–222.
  • Alghowinem, S., Goecke, R., Wagner, M., Epps, J., Breakspear, M., and Parker, G., “From joyous to clinically depressed: mood detection using spontaneous speech,” in Proceedings of the 25th International FLAIRS Conference, FLAIRS-25, Marco Island, USA, 2012.

2011

  • Polzehl, T, Schmitt, A., Metze, M., and Wagner, M., “Anger recognition in speech using acoustic and linguistic cues,” Speech Communication, vol. 53, no. 9–10, pp. 1198–1209, 2011.
  • Burnham, D., Estival, D., Fazio, S., Viethen, J., Cox, F., Dale, R., Cassidy, S., Epps, J., Togneri, R., Wagner, M., Kinoshita, Y., Goecke, R., Arciuli, J., Onslow, M., Lewis, T., Butcher, A., and Hajek, J., “Building an audio-visual corpus of Australian English: large corpus collection with an economical portable and replicable black box,” in Proceedings of the 12th Annual Conference of the International Speech Communication Association, Interspeech-2011, Firenze, Italy, 2011, pp. 841–844.

2010

  • Wagner, M., Tran, D., Togneri, R., Rose, P., Powers, D., Onslow, M., Loakes, D., Lewis, T., Kuratate, T., Kinoshita, Y., Kemp, N., Ishihara, S., Ingram, J., Hajek, J., Grayden, D., Goecke, R., Fletcher, J., Estival, D., Epps, J., Dale, R., Cutler, A., Cox, F., Chetty, G., Cassidy, S., Butcher, A., Burnham, D., Bird, S., Best, C., Bennamoun, M., Arciuli, J., and Ambikairajah, E., “The Big Australian Speech Corpus (The Big ASC),” in Proceedings of the 13th Australasian International Conference on Speech Science and Technology, Melbourne, 2010, pp. 166–170.
  • Norris, M. and Wagner, M., “Age-group and gender classification through class-dependent phone recognition,” in Proceedings of the 13th Australasian International Conference on Speech Science and Technology, Melbourne, 2010, pp. 38–41. 
  • Farrús, M., Wagner, M., Erro Eslava, D., and Hernando, J., “Automatic speaker recognition as a measurement of voice imitation and conversion,” Journal of Speech, Language and the Law, vol. 17, no. 1, pp. 119–142, 2010.

2009

  • Wagner, M. and Chetty, G., “Liveness assurance in face authentication,” Encyclopedia of Biometrics, vol. 2, 2 vols. Springer, New York, USA, pp. 908–915, 2009. 
  • Wagner, M., “Liveness assurance in voice authentication,” Encyclopedia of Biometrics, vol. 2, 2 vols. Springer, New York, USA, pp. 916–924, 2009.
  • Polzehl, T, Sundaram, S., Ketabdar, H., Wagner, M., and Metze, F., “Emotion classification in children’s speech using fusion of acoustic and linguistic features,” in Proceedings of the 10th Annual Conference of the International Speech Communication Association, Interspeech 2009, Brighton, UK, 2009, pp. 340–343.
  • Metze, F., Polzehl, T, and Wagner, M., “Fusion of acoustic and linguistic speech features for emotion detection,” in Proceedings of the 3rd IEEE International Conference on Semantic Computing, ICSC 2009, Berkeley, USA, 2009, pp. 153–160.
  • Chetty, G. and Wagner, M., “Multimodal speaker verification using ancillary known speaker characteristics such as gender or age,” in Proceedings of the 10th Annual Conference of the International Speech Communication Association, Interspeech 2009, Brighton, UK, 2009, pp. 1167–1170.
  • Chetty, G. and Wagner, M., “Biometric person authentication with liveness detection based on audio-visual fusion,” International Journal of Biometrics, vol. 1, no. 4, pp. 463–478, 2009.
  • Chetty, G., Goecke, R., and Wagner, M., “Audio-visual mutual dependency models for biometric liveness checks,” in Proceedings of the International Conference on Auditory-Visual Speech Processing, AVSP 2009, Norwich, UK, 2009, pp. 32–37.
  • Burnham, D., Ambikairajah, E., Arciuli, J., Bennamoun, M., Best, C., Bird, S., Butcher, A., Cassidy, S., Chetty, G., Cox, F., Cutler, A., Dale, R., Epps, J., Fletcher, J., Goecke, R., Grayden, D., Hajek, J., Ingram, J., Ishihara, S., Kemp, N., Kinoshita, Y., Kuratate, T., Lewis, T., Loakes, D., Onslow, M., Powers, D., Rose, P., Togneri, R., Tran, D., and Wagner, M., “A blueprint for a comprehensive Australian English auditory-visual speech corpus,” in Selected Proceedings of the 2008 HSCNet Workshop on Designing theAustralian National Corpus, Sommerville, USA, 2009.
  • Asthana, A., Saragih, J., Wagner, M., and Goecke, R., “Evaluating AAM fitting methods for facial expression recognition,” in Proceedings of the 3rd International Conference on Affective Computing and Intelligent Interaction, ACII 2009, Amsterdam, Netherlands, 2009, pp. 1–8.


2008

  • Fletcher, J., Loakes, D., Goecke, R., Burnham, D., and Wagner, M., Eds., Proceedings of Interspeech-2008. Brisbane: International Speech Communication Association, 2008.
  • Farrús, M., Wagner, M., Anguita, J., and Hernando, J., “Robustness of prosodic features to voice imitation,” in Proceedings of the 9th Annual Conference of the International  peech Communication Association, Interspeech 2008, Brisbane, Australia, 2008, pp. 613–616.
  • Farrús, M., Wagner, M., Anguita, J., and Hernando, J., “How vulnerable are prosodic features to professional imitators?,” in Proceedings of Odyssey 2008: The Speaker and Language Recognition Workshop, Stellenbosch, South Africa, 2008, p. Paper 002. 
  • Chetty, G. and Wagner, M., “Robust face-voice based speaker identity verification using multilevel fusion,” Image and Vision Computing, vol. 26, no. 9, pp. 1249–1260, 2008.
  • Chetty, G. and Wagner, M., “Reliability score based multimodal fusion for biometric person authentication,” in Proceedings of the American Conference on Applied Mathematics, MATH 2008, Cambridge, USA, 2008, pp. 313–324.
  • Chetty, G. and Wagner, M., “Audio-visual multilevel fusion for speech and speaker recognition,” in Proceedings of the 9th Annual Conference of the International Speech Communication Association, Interspeech-2008, Brisbane, 2008, pp. 379–382.
  • Chetty, G. and Wagner, M., “A robust spatio-temporal face modelling approach using 3D multimodal fusion for biometric security applications,” in Proceedings of SPIE: Biometric Technology for Human Identification V, Orlando, USA, 2008, vol. 6944, p. Paper 08.
  • Girija Chetty and Michael Wagner, "A Multilevel Fusion Approach for Audiovisual Emotion Recognition", Proc. Audiovisual Speech Processing 2008, 26-29 September, Moreton Island, Australia.
  • Mireia Farrus, Michael Wagner, Jan Anguita and Javier Hernando, "Robustness of prosodic features to voice imitation", Proc. Interspeech-2008, 22-26 September, Brisbane, Australia, pp 613-616.
  • Girija Chetty and Michael Wagner, "Audio-Visual Multilevel Fusion for Speech and Speaker Recognition", Proc. Interspeech-2008, 22-26 September, Brisbane, Australia, pp 379-382.
  • Girija Chetty and Michael Wagner, "Robust Face-Voice Based Speaker Identity Verification Using Mulitilevel Fusion", Image and Vision Computing, vol 26 (2009) 1249-1260.
  • Mireia Farrus, Michael Wagner, Jan Anguita and Javier Hernando, "How vulnerable are prosodic features to professional imitators", Proc. IEEE Workshop on Speaker and language Recognition, Odyssey 2008, Stellenbosch, South Africa.

2007

  • Girija Chetty and Michael Wagner, "Audio Visual Speaker Identity Verification based on Lip Motion Features", Proc. Interspeech 2007, Aug. 27-31, Antwerp, Belgium.
  • Girija Chetty and Michael Wagner, " Audio Visual Speaker Identity Verification based on Cross Modal Fusion, " Proc. AVSP 2007, Sept. 1 -3, Tilburg, Netherlands.
  • Girija Chetty and Michael Wagner, "Spatiotemporal modelling of faces based on multilevel fusion", Proc. Image and Vision Computing New Zealand Conference, 5th -7th Dec. 2007, Waikato, New Zealand.
  • Girija Chetty and Michael Wagner , Springer Verlag LNCS Book Series - Book Title - Pattern Recognition and Machine Intelligence, Chapter Title:" Audio Visual Speaker Verification using hybrid fusion of cross-modal features" Dec. 18-22, 2007, Calcutta, India.
  • Girija Chetty and Michael Wagner, "A Robust Speaking Face Modelling Approach Based On Multilevel Fusion", Proc. IEEE conference on Digital Image Computing and Applications, 3-5 Dec. 2007, Adelaide, Australia.

 2006

  • Chetty, G. and Wagner M., "Speaking faces for face-voice speaker identity verification", Proc. Interspeech 2006 - Int Conf on Spoken Language Processing, Pittsburgh, September 2006, Paper Mon3A1O-6, 2006.
  • Wagner, M., Summerfield, C., Dunstone, T., Summerfield, R., Moss, J., An evaluation of ‘commercial off-the-shelf' speaker verification systems, Proc IEEE Speaker and Language Recognition Workshop Odyssey 2006, June 2006, CD-ROM, no page numbers.
  • Chetty, G. and Wagner M., "Multilevel liveness verification for face-voice biometric authentication", Proc. IEEE Biometrics Symposium, Special Session on research, Biometric Consortium Conference, Baltimore, September 2006, CD-ROM, no page numbers, 2006.
  • Chetty, G. and Wagner M., "Face-voice authentication based on 3D face models", in P.J. Narayanan, S.K. Nayar and H.-Y. Shum (eds), Computer Vision - ACCV 2006, Springer Verlag, pp 559-568, 2006.
  • Wagner M., "Speaker Verification Using the Shape of the Glottal Excitation Function for Vowels ", Proc 11th Australasian Int Conf on Speech Science & Technology, 2006 pp 233-238.
  • Chetty, G. and Wagner M., " UCBN: A New Audio-Visual Broadcast News Corpus for Multimodal Speaker Verification Studies ", Proc 11th Australasian Int Conf on Speech Science & Technology, 2006, pp 281-286.
  • Chetty, G. and Wagner M., "Audio-visual multimodal fusion for biometric person authentication and liveness verification, in F. Chen & J. Epps (eds), Multimodal User Interaction, Conferences in Research and Practice in Information Technology, Vol. 57, Australian Computer Society, pp 17-24, 2006.

2005

  • Chetty, G. and Wagner M., "Audio-video biometric system with liveness checks", Proceeding IVCNZ'05 Conference, pp. 132-137, 2005.
  • Chetty, G. and Wagner M., "Audio-video person authentication based on 3D facial feature warping", Proc. Digital Image Computing: Techniques and Applications (DICTA 2005), Cairns, 2005, CD-ROM, no page numbers.
  • Chetty, G. and Wagner M., "Investigating feature-level fusion for checking liveness in face-voice authentication", Proc. 8th IEEE Symposium on Signal Processing and its Applications, Sydney, 2005, pp 66-69.
  • Chetty, G. and Wagner M., "Liveness detection using cross-modal correlations in face-voice person authentication", Proc. Eurospeech 2005, Geneva, 2005, pp 2181-2184.
  • Lau, Y.W., Wagner M and Tran D., "Testing Voice Mimicry with the YOHO Speaker Verification Corpus", R. Khosla et al. (Eds.): KES 2005, LNAI 3684, pp. 15-21, 2005
  • Zhang, F. and Wagner M., "Effects of F0 feedback on the learning of Chinese tones by native speakers of English", Proc. Eurospeech 2005, Geneva, pp 181-184.

2004

  • Huang, X., Madoc, A.C. and Wagner M., "Noise Removal for Images by Wavelet-Based Bayesian Estimator via Levy Process Analysis", Proc. IEEE Int Conf on Multimedia and Expo, Paper TP2-1, 2004.
  • Tran, D., Wagner, M., Lau, Y.W., Gen, M., Fuzzy methods for voice-based person authentication, IEEJ (Institute of Electrical Engineers of Japan) Transactions on Electronics, Information and Systems, vol. 124, no. 10, pp. 1958-1963, October 2004.
  • Lau, Y.W., Tran D. and Wagner M., Vulnerability of speaker verification to voice mimicking, Proc. Int Symp on Intelligent Multimedia, Video & Speech Processing ISIMP'2004, Hong Kong, 20-22 Oct 2004, pp 145-148.
  • Chetty, G. and Wagner, M., "'Liveness' Verification in Audio-Video Authentication", Proc. Int Conf on Spoken Language Processing ICSLP-04, Paper Spec3603p6.
  • Millar, J.B., Wagner, M., Goecke, R., Aspects of Speaking-Face Data Corpus Design Methodology, Proc. Int Conf on Spoken Language Processing ICSLP-04, Paper Spec3601o7.
  • Chetty, G. and Wagner, M., "Automated lip feature extraction for liveness verification in audio-video authentication", Proc. Image and Vision Computing 2004, New Zealand, pp 17-22.
  • Chetty, G. and Wagner, M., "Liveness Verification in Audio-Video Speaker Authentication", Proc. 10th Aust Int Speech Sci and Tech Conf SST-2004, Sydney, pp 358-363, 2004.

2003

  • Lau, Y.W., Tran D. and Wagner M., "Fuzzy Normalisation Methods for Utterance Verification", Proc. Asia Pacific Symposium on Intelligent and Evolutionary Systems: Technology and Applications, pp39-43, Japan, 2003.

2002

  • Tran D. and Wagner M., "Fuzzy C-Means Clustering-Based Speaker Verification", Lecture Notes in Computer Science: Advances in Soft Computing - AFSS 2002, N.R.Pal, M.Sugeno (Eds.), pp. 318-324.
  • Tran D. and Wagner M., "Noise Clustering-Based Speaker Verification", Lecture Notes in Computer Science: Advances in Soft Computing - AFSS 2002, N.R.Pal, M.Sugeno (Eds.), pp. 325-331.
  • Tran D. and Wagner M., "Generalised Fuzzy Hidden Markov Models for Speech Recognition", Lecture Notes in Computer Science: Advances in Soft Computing - AFSS 2002, N.R.Pal, M.Sugeno (Eds.), pp. 345-351.
  • D. Tran, M. Wagner, Fuzzy Clustering Methods in Speaker Verification, Int J of Pattern Recognition and AI, vol.16, no 7, pp 913-925, 2002.
  • P. Collings, D. Walker, M. Wagner, Developing Mental Models and New Work Practices: an Evaluation of a State-of-the-Art Commercial Speech Recognition System, Proc. Human Factors Conference HF-2002, Melbourne, pp 25-27, 2002.
  • P. Collings, D. Walker, M. Wagner, Usability Evaluation of a Commercial Dictation System, Proc. 9th Aust Int Speech Sci and Tech Conf SST-2002, Melbourne, pp 479-484, 2002.
  • B. Kraal, M. Wagner, P. Collings, Improving the User Interface of Dictation Software, Proc. 9th Aust Int Speech Sci and Tech Conf SST-2002, Melbourne, pp 22-27, 2002.
  • D. Tran, M. Wagner, Fuzzy Modelling Techniques for Speech Recognition, Proc. 9th Aust Int Speech Sci and Tech Conf SST-2002, Melbourne, pp 473-478, 2002.
  • Tran D. and Wagner M., Fuzzy Entropy Models for Cluster Analysis and Speech Recognition, Proc Joint Third Int Conf on Intelligent Technologies and Third Vietnam-Japan Symposium on Fuzzy Systems and Applications, pp 338-343.
  • Tran D. and Wagner M., Bagging-Fuzzy Entropy Models for Speech and Speaker Recognition, Proc Joint Third Int Conf on Intelligent Technologies and Third Vietnam-Japan Symposium on Fuzzy Systems and Applications, pp 344-347.

2001

  • T. Pham, M. Wagner, D. Clark, Applications of genetic algorithms, geostatistics and fuzzy c-means clustering to image segmentation, Proc Congress on Evolutionary Computation CEC2001, pp 741-746, 2001.
  • D. Tran, M. Wagner, A Generalised Normalisation Method for Speaker Verification, Proc. Odyssey 2001 Workshop on Speaker Recognition, pp 73-76, 2001.
  • D. Tran, M. Wagner, A Proposed Fuzzy Pattern Verification System, Proc FUZZ-IEEE Conf., pp 932-935, 2001.
  • D. Tran, M. Wagner, Noise Clustering Approach to Speaker Verification, Proc. 5th World Multiconference on Systemetics, Cybernetics and Informatics, SCI'01, pp 439-444, 2001.

2000

  • T. Pham, M. Wagner, Similarity normalization for speaker verification by fuzzy fusion, Pattern Recognition, vol 32:2, pp 309-315, 2000.
  • T. Pham, M. Wagner, Speaker verification with fuzzy fusion and genetic optimization, Int J Pattern Recognition and Artificial Intelligence, vol 14:8, pp 1025-1038, 2000.
  • T. Pham, S. Shaw, D. Clark, M. Wagner, A neuro-fuzzy fusion of multiple classifiers, Proc. 3rd Int Conf Computer Vision, Pattern Recognition & Image Processing, pp 370-373, 2000.
  • T. Pham, M. Wagner, Abstraction and tolerance of imprecision in formal specification, 7th Int Conf on Fuzzy Theory and Technology, JCIS2000/FTT200, pp 232-235, 2000.
  • T. Pham, M. Wagner, Information based speaker verification, Proc 15th Int Conf on Pattern Recognition ICPR2000, pp 282-285, 2000.
  • T. Pham, M. Wagner, Image restoration by fuzzy convex ordinary kriging, Proc IEEE Int Conf on Image Processing ICIP2000, pp 1102-1105, 2000.
  • Tran D. and Wagner M., "A General Approach to Hard, Fuzzy, and Probabilistic Models for Pattern Recognition", Advances in Intelligent Systems: Theory and Applications, M. Mohammadian (ed.), pp. 244-251, 2000, IOS Press, Netherlands.
  • Tran D. and Wagner M., "Frame-Level Hidden Markov Models", Advances in Intelligent Systems: Theory and Applications, M. Mohammadian (ed.), pp. 252-259, 2000, IOS Press, Netherlands.
  • D. Tran, M. Wagner, Fuzzy Entropy Hidden Markov Models for Speech Recognition, Int Conf on Spoken Language Processing (ICSLP2000), pp 421-424, 2000.
  • D. Tran, M. Wagner, Fuzzy Normalisation Methods for Speaker Verification, Int Conf on Spoken Language Processing (ICSLP2000), pp 446-449, 2000.
  • D. Tran, M. Wagner, A Proposed Likelihood Transformation for Speaker Verification, Int Conf on Acoustics, Speech & Signal Processing (ICASSP'2000), pp 1069-1072, 2000.
  • D. Tran, M. Wagner, An Application of Fuzzy Entropy Clustering In Speaker Identification, Proc Joint Conf on Information Sciences 2000 (Fuzzy Theory and Technology Track), vol. 1, pp. 228-231, 2000, Atlantic City, 2000.
  • D. Tran, M. Wagner, T. Pham, Hard Gaussian Mixture Models for Speaker Recognition, Proc 4th World Multiconference on Systemetics, Cybernetics and Informatics/6th Int Conf Inf Sys Analysis and Synthesis, pp 608-613, 2000.
  • D. Tran, M. Wagner, T. Pham, Hard Hidden Markov Models for Speech Recognition, Proc 4th World Multiconference on Systemetics, Cybernetics and Informatics/6th Int Conf Inf Sys Analysis and Synthesis, pp 614-619, 2000.
  • D. Tran, M. Wagner, Fuzzy Entropy Clustering, PROC FUZZ-IEEE Conf, pp 152-157, 2000.

1999

  • T. Pham, M. Wagner, Ambiguity reduction in speaker identification using relaxation labelling, Pattern Recognition, vol. 32:7, pp 1249-1254, 1999.
  • T. Pham, M. Wagner, Speaker verification with fuzzy fusion and genetic optimization, Int J Advanced Computational Intelligence, vol 3:6, pp 451-456, 1999.
  • T. Pham, M. Wagner, Specification of fuzzy systems with fuzzy Z, Proc 3rd World Multiconference on Systemics, Cybernetics and Informatics/5th Int Conf Information Systems analysis and Synthesis SCI'00/ISAS'99, pp 383-389, 1999.
  • T. Pham, M. Wagner, Filtering noisy images using kriging, Proc 5th Int Symp on Signal Processing and its Applications ISSPA'99, pp 427-430, 1999.
  • T. Pham, M. Wagner, Fuzzy kriging filter for image restoration, 3rd Int Conf on Knowledge-Based Intelligent Information Engineering Systems KES'99, pp 333-336.
  • D. Tran, M. Wagner, T. Zheng, State mixture modelling applied to speech and speaker recognition, in the Pattern Recognition in Practice VI, a special issue of the Journal of Pattern Recognition Letters, vol. 20, no. 11-13, pp. 1449-1456, 1999.
  • D. Tran, M. Wagner, Hidden Markov models using fuzzy estimation, in Proceedings of the EUROSPEECH'99 Conference, vol. 6, pp. 2749-2752, 1999.
  • D. Tran, M. Wagner, Fuzzy expectation-maximisation algorithm for speech and speaker recognition, in Proceedings of the 18th International Conference of the North American Fuzzy Information Society (NAFIPS'99), pp. 421-425, 1999.
  • D. Tran, M. Wagner, Fuzzy hidden Markov models for speech and speaker recognition, in Proceedings of the 18th International Conference of the North American Fuzzy Information Society (NAFIPS'99), pp. 426-430, 1999.
  • T. Pham, D. Tran, M. Wagner, Optimal fuzzy information fusion for speaker verification, in Proceedings of the Computation Intelligence Methods and Applications (CIMA'99) Conference, pp. 141-146, 1999.
  • D. Tran, M. Wagner, Fuzzy approach to Gaussian mixture models and generalised Gaussian mixture models, in Proceedings of the Computation Intelligence Methods and Applications (CIMA'99) Conference, pp. 154-158, 1999.
  • D. Tran, M. Wagner, A robust clustering approach to fuzzy Gaussian mixture models for speaker identification, in Proceedings of the Third International Conference on Knowledge-Based Intelligent Information Engineering Systems (KES'99), pp. 337-340.
  • D. Tran, M. Wagner, T. Zheng, A Fuzzy Approach to Statistical Models in Speech and Speaker Recognition, in Proceedings of the FUZZ-IEEE'99 Conference, vol. 3, pp. 1275-1280, 1999.
  • D. Tran, M. Wagner, T. Zheng, Fuzzy nearest prototype classifier applied to speaker identification", in Proceedings of the European Symposium on Intelligent Techniques (ESIT'99) on CD-ROM, abstract on page 34, 1999.
  • D. Tran, T. Pham, M. Wagner, Speaker recognition using Gaussian mixture models and relaxation labeling, in Proceedings of the 3rd World Multiconference on Systemetics, Cybernetics and Informatics/ The 5th Int. Conf. Information Systems Analysis and Synthesis (SCI/ISAS99), vol. 6, pp. 383-389, 1999.
  • T. Van Le, D. Tran, M. Wagner, Fuzzy evolutionary programming for hidden Markov modelling in speaker identification, in Proceedings of the Congress on Evolutionary Computation'99, Washington DC, pp. 812-815, 1999.

1998

  • T. Pham, M. Wagner, A geostatistical model for linear prediction analysis of speech, Pattern Recognition , vol 31:12, pp 1981-1991, 1998.
  • M. Barlow, M. Wagner, Measuring the dynamic encoding of speaker identity and dialect in prosodic parameters, in Robert H Mannell and Jordi Robert-Ribes (ed), 5th International Conference on Spoken Language Processing, ICSLP '98, 1998.
  • T. Pham, M. Wagner, Speaker Identification Using Relaxation Labeling, in Robert H Mannell and Jordi Robert-Ribes (ed), 5th International Conference on Spoken Language Processing, ICSLP '98, pp 209 - 212, 1998
  • D. Tran, M. Wagner, T. Van Le, A Proposed Decision Rule for Speaker Recognition Based on Fuzzy C-Means Clustering, in Robert H Mannell and Jordi Robert-Ribes (ed), 5th International Conference on Spoken Language Processing, ICSLP '98, pp 755 - 758, 1998
  • D. Tran, T. Van Le, M. Wagner, Fuzzy Gaussian Mixture Models for Speaker Recognition, in Robert H Mannell and Jordi Robert-Ribes (ed), 5th International Conference on Spoken Language Processing, ICSLP '98, pp 759 - 762, 1998
  • T. Pham, M. Wagner, Fuzzy-Integration Based Normalization for Speaker Verification, in Robert H Mannell and Jordi Robert-Ribes (ed), 5th International Conference on Spoken Language Processing, ICSLP '98, pp 3273 - 3276, 1998
  • T. Pham, D. Tran, M. Wagner, Speaker Verification Using Relaxation Labeling, in Jean-Francois Bonastre (ed), Proc ESCA Workshop on Speaker Recognition and its Commercial and Forensic Applications, RLA2C, pp 29 - 32, 1998.
  • D. Tran, M. Do, M. Wagner, T. Van Le, Proposed Decision Rule for Speaker Identification Based on a Posteriori Probability, in Jean-Francois Bonastre (ed), Proc ESCA Workshop on Speaker Recognition and its Commercial and Forensic Applications, RLA2C, pp 85-88, 1998.
  • M. Do, M. Wagner, Speaker Recognition With Small Training Requirements Using A Combination of VQ and DHMM, in Jean-Francois Bonastre (ed), Proc ESCA Workshop on Speaker Recognition and its Commercial and Forensic Applications, RLA2C, pp 169 - 172, 1998
  • T. Pham, M. Wagner, Application of Geostatistics to Linear Predictive Coding of Speech, Proc Computational Engineering in Systems Applications , CESA'98, pp 220-225, 1998.
  • D. Tran, M. Wagner, Fuzzy Gaussian Mixture Models for Speaker Recognition, in the special issue of the Australian Journal of Intelligent Information Processing Systems (AJIIPS), vol. 5, no. 4, pp. 293-300, 1998.
  • D. Tran, M. Wagner, T. Pham, Minimum classifier error and relaxation labelling for speaker recognition, in Proceedings of the Speech Computer Workshop, St Petersburg, (Specom 98), pp. 229-232, 1998.

1997

  • T. Pham, M. Wagner, Linear Prediction Analysis Of Speech Based On Geostatistics, in N Harle, M Deriche, B Boashash (ed), Workshop on Signal Processing Applications WoSPA'97, pp 47 - 50, 1997

1996

  • M. Wagner, Combined speech-recognition/speaker-verification system with modest training requirements, Proc. 6th Austr Int Conf on Speech Science and Technology, SST-96, Adelaide, 139-143, 1996.
  • K. Barrelle, W. Laverty, R.D. Henderson, J. Gough, M. Wagner, M. Hiron, User verifi­cation through pointing characteristics: An exploratory examination. International Journal of Human-Computer Studies, 45, pp 47-57, 1996.

1995

  • R. Napier, W. Laverty, D. Mahar, R. Henderson, M. Hiron, M. Wagner, Keyboard user verification: toward an accurate, efficient and ecologically valid algorithm, Proc. Austr. Comp. Sc. Conf., ACSC'95, Adelaide, pp. 407-412, 1995.
  • K. Barrelle, W. Laverty, R. Henderson, J. Gough, M. Wagner, M. Hiron, User verifica­tion through indirect pointing device control characteristics: an exploratory examination, Int. Conf on Human Computer Interaction, HCI International 95, Tokyo, 1995.
  • M. Wagner, J.S. Mason, J.B. Millar, Speaker identification using vector quantisation with codeword-specific derivative coding, Proc 4th European Conference on Speech Communica­tion and Technology, Vol. 1, pp 383-386, 1995.
  • R. Napier, W. Laverty, D. Mahar, R.D. Henderson, M. Hiron, M. Wagner, Keyboard user verification: Toward an accurate, efficient, and ecologically valid algorithm. International Jour­nal of Human-Computer Studies, 43, pp 213-222, 1995.
  • D. Mahar, R. Napier, R.D. Henderson, W. Laverty, K. Lawrie, M. Hiron, M. Wagner, Op­timising digraph-latency based biometric typist verification systems: Inter and Intra typist dif­ferences in digraph latency distribution. International Journal of Human-Computer Studies. 43, pp 579-592, 1995.

1994

  • M. Wagner, F. Chen, I. Macleod, B. Millar, S. Ran, A. Tridgell, X. Zhu, Analysis of type-II errors for VQ distortion based speaker verification, Proc. ESCA Workshop on Auto­matic Speaker Recognition, Identification and Verification, Martigny, pp. 83-86, 1994.
  • X. Zhu, Y. Gao, S. Ran, F. Chen, I. Macleod, B. Millar, M. Wagner, Text-independent speaker recognition using VQ, mixture-Gaussian VQ and ergodic HMMs, Proc. ESCA Workshop on Automatic Speaker Recognition, Identification and Verification, Martigny, pp. 83-86, 1994.
  • X. Zhu, B. Millar, I. Macleod, M. Wagner, F. Chen, S. Ran, A comparative study of mixture-Gaussian VQ, ergodic HMMs and left-to-right HMMs for speaker recognition, Proc. Int. Symp. on Speech, Image Proc. and Neural Networks, ISSIPNN'94, Hong Kong, pp. 618-621, 1994.
  • D. Mahar, R. Henderson, W. Laverty, K. Lawrie, M. Hiron, J. Gough, M. Wagner, Typ­ist identity verification: a comparison of the utility of overall reference profile and di­graph-specific estimates of digraph latency variability, Proc. Int. Conf. on Human Com­puter Interface, HCI'94, Glasgow, 1994.
  • H. Tang, X. Zhu, I. Macleod, B. Millar, M. Wagner, A dynamic-window weighted-rms averaging filter applied to speaker identification, Proc. Int. Conf. on Spoken Lang. Proc., ICSLP'94, Yokohama, pp. 1603-1606, 1994.
  • F. Chen, B. Millar, M. Wagner, Hybrid-threshold approach in text-independent speaker verification, Proc. Int. Conf. on Spoken Lang. Proc., ICSLP'94, Yokohama, pp. 1855-1858, 1994.
  • R. Napier, D. Mahar, R. Henderson, W. Laverty, M. Hiron, J. Gough, M. Wagner, Typist identity verification: a comparison of the utility of the overall reference profile and di­graph-specific estimates of digraph latency variability, Proc. Int. Conf. on Computer Hu­man Interaction, OZCHI'94, Melbourne, 1994.
  • J.B. Millar, F. Chen, I. Macleod, S. Ran, H. Tang, M. Wagner, X. Zhu, Overview of speaker verification studies towards technology for robust user-conscious secure transac­tions, Proc. 5th Austr. Int. Conf. on Speech Sc. and Techn., SST-94, Perth, pp. 744-749, 1994.
  • J.B. Millar, F. Chen, M. Wagner, The efficacy of cohort normalisation in a speaker verifi­cation task under different types of speech signal variance, Proc. 5th Austr. Int. Conf. on Speech Sc. and Techn., SST-94, Perth, pp. 850-855, 1994.
  • S. Ran, J.B. Millar, W. Laverty, I. Macleod, M. Wagner, X. Zhu, Speaker recognition using continuous ergodic HMMs, Proc. 5th Austr. Int. Conf. on Speech Sc. and Techn., SST-94, Perth, pp. 706-711, 1994.
  • X. Zhu, J.B. Millar, I. Macleod, M. Wagner, Speaker verification: beyond the absolute threshold, Proc. 5th Austr. Int. Conf. on Speech Sc. and Techn., SST-94, Perth, pp. 756-761, 1994.
  • H. Tang, X. Zhu, J.B. Millar, I. Macleod, M. Wagner, Robust speaker verification in noisy environments, Proc. 5th Austr. Int. Conf. on Speech Sc. and Techn., SST-94, Perth, pp. 768-773, 1994.
  • S. Ran, W. Laverty, J.B. Millar, M. Wagner, Estimation of false-acceptance rate in speaker veri­fication, Proc. 5th Austr. Int. Conf. on Speech Sc. and Techn., SST-94, Perth, pp. 762-767, 1994.

1992

  • H. Oasa, M. Wagner, Evaluation of auditory models as preprocessors for automatic speech recognition, Proc. 4th Austr. Int. Conf. on Speech Sc. and Techn., SST-92, Bris­bane, pp. 585-590, 1992.
  • S. Sampath, D. Slater, M. Wagner, Acoustic analysis of diphthongs of English spoken by non-native speakers, Proc. 4th Austr. Int. Conf. on Speech Sc. and Techn., SST-92, Bris­bane, pp. 34-39, 1992.

1990

  • M. Wagner, R.I. McKay, S. Sampath, D.B. Slater, Modelling prosody parameters for declarative English sentence structures, in M. Lagunas et al. [eds], Signal Processing V, EUSIPCO-90, Barcelona, pp. 1135-1138, North Holland, 1990.
  • M. Wagner, R.I. McKay, S. Sampath, D.B. Slater, Modelling the prosody of simple Eng­lish sentences using Hidden Markov Models, Proc. 3rd Austr. Int. Conf. on Speech Sc. and Techn., SST-90, Melbourne, pp. 180-185, 1990.


1988

  • M. Wagner, Spoken syllable recognition using the Standard Chinese vocabulary, Proc. 11th Aust. Comp. Sc. Conf., ACSC-11, Brisbane, pp. 125-134, 1988.
  • M. Wagner, A study of syllable timing based on a database of Chinese monosyllables, in J.L. Lacoume et al. [eds], Signal Processing IV, EUSIPCO-88, Grenoble, pp.551-554, North Holland, 1988.
  • M. Barlow, M. Wagner, Prosody as a basis for determining speaker characteristics, Proc. 2nd Austr. Int. Conf. on Speech Sc. and Techn., SST-88, Sydney, pp. 80-85, 1988.
  • Lagos, M. Wagner, An integrated audio signal interface for use in the teaching labora­tory, Proc. 2nd Austr. Int. Conf. on Speech Sc. and Techn., SST-88, Sydney, pp. 244-247, 1988.

1987

  • M. Wagner, Speech recognition experiments with the syllable inventory of Standard Chi­nese, Speech Communication, 6, pp.363-369, 1987.

1986

  • M. Wagner, W. Wang, H. Ho, M. O'Kane, Isolated word recognition of the complete vo­cabulary of spoken Chinese, Proc. IEEE Int. Conf. Acoust. Speech and Signal Proc., ICASSP-86, Tokyo, pp. 701-704, 1986.
  • M. O'Kane, J. Gillis, P. Rose, M. Wagner, Deciphering speech waveforms, Proc. IEEE Int. Conf. Acoust. Speech and Signal Proc., ICASSP-86, Tokyo, pp. 2227-2230, 1986.
  • M. Wagner, J. Fulcher, An IBM-PC based speech research workstation, Proc. 1st Austr. Conf. on Speech Sc. and Techn., SST-86, Canberra, pp. 204-209, 1986.
  • M. Barlow, M. Wagner, Effects of acoustic parameter alteration upon perceived speaker characteristics, Proc. 1st Austr. Conf. on Speech Sc. and Techn., SST-86, Canberra, pp. 240-245, 1986.
  • M. Wagner, Speech recognition experiments with the syllable inventory of Standard Chi­nese, Proc. 1st Austr. Conf. on Speech Sc. and Techn., SST-86, Canberra, pp. 310-315, 1986.

1983

  • J.B. Millar, M. Wagner, The automatic analysis of acoustic variance in speech, Language and Speech, vol. 26, pp. 145-158, 1983.
  • M. Wagner, Linear predictive coding of speech using very long analysis windows, in H.W. Schuessler [ed.], Signal Processing II, EUSIPCO-83, Erlangen, pp.323-326, North Holland, 1983.

1982

  • M. Wagner, Formant extraction algorithm in error, IEEE Trans. Acoust. Speech and Sig­nal Proc., vol. ASSP-30, p. 520, 1982.

1981

  • M. Wagner, Automatic labelling of continuous speech with a given phonetic transcription using dynamic programming algorithms, Proc. IEEE Int. Conf. Acoust. Speech and Sig­nal Proc., ICASSP-81, Atlanta, pp. 1156-1159, 1981.

1980

  • M. Wagner, Bestimmung von Sprechereigenschaften in fließender Sprache [Determination of speaker characteristics in continuous speech], in "Fortschritte der Akustik", Proc. Conf. German Acoust. Soc., DAGA-78, München, VDE Verlag, Berlin, pp. 719-722, 1980.
  • J.B. Millar, H. Oasa-Stoycheff, M. Wagner, Towards modelling of speaker characteris­tics, J. Acoust. Soc. Am., vol. 67, Suppl.1, p.94, 1980.

1978

  • M. Wagner, Computers that understand speech, Proc. 1st Austr. Comp. Soc. Conf., ACSC-1, Canberra, pp. 2166-2175, 1978.

 

 

 

Edited Conference Proceedings

  • 1. J. Fletcher, D. Loakes, R. Goecke, D.Burnham, M. Wagner [eds], Proc. Interspeech-2008, 22-26 September, Brisbane, Australia.
  • 2. M. Wagner [ed.], Proc. 2nd Austr. Int. Conf. on Speech Sc. and Techn., SST-88, Syd­ney, Austr. Speech Sc. and Techn. Ass., Canberra, 1988.
  • 3. M. Wagner [ed.], Proc. 1st Austr. Conf. on Speech Sc. and Techn., SST-86, Canberra, Austr. Speech Sc. and Techn. Ass., Canberra, 1986.

 

 

Book Chapters

  • 1. T. Pham, M. Wagner, M. Mohammadian, Towards Fuzzy Z, in M. Mohammadian [ed], New frontiers in computational intelligence and its applications, IOS Press, Netherlands, 2000.
  • 2. M. Wagner, Communicating with computers using speech, in H. Ong et al [eds], Applied research and its management, pp 87-96, National University of Singapore, 1986.
  • 3. M. Wagner, Speech analysis, in J.E. Clark [ed], An introduction to speech science and technology, Australian Speech Science and Technology Association, 1986. Second and third editions published in 1988 and 1990.
  • 4. M. Wagner, Automatic speech recognition, in J.E. Clark [ed], An introduction to speech science and technology, Australian Speech Science and Technology Association, 1986. Second and third editions published in 1988 and 1990.

 

 

Other Publications

  • 1. M. Wagner, Spoken-Language Technology, Speech Science & Human Communication, Invited Paper, ConCom 2005, University of New England, Armidale, December 2005.
  • 2. M. Wagner, Computers of the 1980s - our servants, competitors or masters? Singapore Scientist, vol. 8, no 3, pp 6-9, 1982.
  • 3. M. Wagner, J.B. Millar, Experimental software speech synthesiser, Proc. DECUS Conf., vol. 3, no 5, pp. 1565-1568, 1977.

Theses

  • 1. M. Wagner, Berechnung der Auflösung eines supraleitenden Paarspektrometers hoher Ausbeute [Computation of the resolution of a supraconducting high-yield pair spectrome­ter], Diplomarbeit [MSc thesis], Dept of Physics, Universität München, 1973.
  • 2. M. Wagner, The application of a learning technique for the identification of speaker char­acteristics in continuous speech, PhD thesis, Australian National University, 1978.

Technical Reports

  • 1. M. Wagner, Experimental software speech synthesiser, Tech. Rep. No 2, Computing Re­search Group, Australian National University, 1978.
  • 2. M. Wagner, Signal interface software for the IBM PC: ADC and DAC using double-buff­ered DMA, Tech. Rep. CS 88/1, Dept of Computer Science, University College, University of NSW, 1988.

Patents

  • 1. S. Ran, W. Laverty, B. Millar, I. Macleod, M. Wagner, Method for estimation of false acceptance rate verification, Provisional Patent No PM 9829/94, Patents Act 1990.
  • 2. X. Zhu, B. Millar, I. Macleod, M. Wagner, Method for verification, Provisional Patent No PM 9831/94, Patents Act 1990.

 

 

Zusatzinformationen / Extras

Direktzugang

Schnellnavigation zur Seite über Nummerneingabe