Professor Thomas Hain

School of Computer Science

Professor of Speech and Audio Technology

Director of UKRI CDT in Speech and Language Technologies

Director of Centre for Speech and Language Technology (VoiceBase)

Member of the Speech and Hearing (SpandH) research group

Thomas Hain profile photo
Profile picture of Thomas Hain profile photo
t.hain@sheffield.ac.uk
+44 114 222 1836

Full contact details

Professor Thomas Hain
School of Computer Science
Regent Court (DCS)
211 Portobello
91Ö±²¥
S1 4DP
Profile

Thomas Hain obtained the degree 'Dipl.-Ing' in Electrical/Communication Engineering in 1994 from the University of Technology, Vienna. He joined the Speech Technology Group at Philips Speech Processing which he left in a senior position.

In 1997 he joined the Speech, Vision and Robotics Group at the Cambridge University Engineering Department as Research Associate and PhD Student. He took up a Lectureship at the SVR group in 2001.

In 2004 he joined the Speech and Hearing Group to work as Lecturer in Computer Science. He was promoted to Senior Lecturer in 2008 and Reader in 2011.

Research interests

Thomas' research interests cover many areas in natural language processing, speech, audio and multimedia technology, machine learning, and complex system optimisation and design.

His interests include: large vocabulary continuous speech recognition, non-linear methods in speech processing, low bit-rate speech coding, machine learning, multi-modal systems, image classification, microphone arrays, system and resource optimisation.

Publications

Books

  • Young SJ, Evermann G, Gales MJF, Hain T, Kershaw D, Moore GL, Odell JJ, Ollason D, Povey D, Valtchev V & Woodland PC (2004) The HTK Book. Cambridge, England: Cambridge University Engineering Department. RIS download Bibtex download
  • Young S, Evermann G, Gales M, Hain T, Kershaw D, Xunying L, Moore G, Odell J, Ollason D, Povey D , Ragni A et al () The HTK Book (for HTK Version 3.5, documentation alpha version). Cambridge University Engineering Department: Cambridge University Engineering Department. RIS download Bibtex download

Journal articles

  • Hasan M, Jefferson N, Hain T & Dawson J (2022) . Computer Speech and Language, 74. RIS download Bibtex download
  • Ravenscroft W, Goetze S & Hain T (2022) . Frontiers in Signal Processing, 2. RIS download Bibtex download
  • Shi Y, Huang Q & Hain T (2021) . Neural Netw, 142, 329-339. RIS download Bibtex download
  • El Hannani A, Errattahi R, Salmam FZ, Hain T & Ouahmane H (2021) . Journal of Big Data, 8. RIS download Bibtex download
  • Errattahi R, Hannani AE, Hain T & Ouahmane H (2019) . Computer Speech & Language, 55, 187-199. RIS download Bibtex download
  • Deena S, Hasan M, Doulaty M, Saz O & Hain T (2019) . IEEE/ACM Transactions on Audio, Speech and Language Processing, 27(3), 572-582. RIS download Bibtex download
  • Saz Torralba O, Deena S, Hasan M, Khaliq B, Milner R, Ng RWM, Olcoz J & Hain T (2018) . Multimedia Tools and Applications, 77(23), 30533-30550. RIS download Bibtex download
  • Ng W, Nicolao M & Hain T (2017) . Computer Speech and Language, 46, 327-342. RIS download Bibtex download
  • Saz O & Hain T (2017) . Computer Speech and Language, 41, 180-194. RIS download Bibtex download
  • Kamper H, De Wet F, Hain T & Niesler T (2014) . Computer Speech and Language, 28(6), 1255-1268. RIS download Bibtex download
  • Fox C & Hain T (2013) . ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, 8086-8090. RIS download Bibtex download
  • Gibson M & Hain T (2012) . IEEE Transactions on Audio, Speech and Language Processing, PP(99). RIS download Bibtex download
  • Furui S, Fiscus J, Friedland G & Hain T (2012) . IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 20(2), 353-355. RIS download Bibtex download
  • Lecorvé G, Dines J, Hain T & Motlicek P (2012) Supervised and unsupervised Web-based language model domain adaptation. 13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012, 1, 182-185. RIS download Bibtex download
  • Alharbi G & Hain T (2012) . 2012 IEEE Workshop on Spoken Language Technology, SLT 2012 - Proceedings, 398-403. RIS download Bibtex download
  • Gibson M & Hain T (2012) . ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, 4341-4344. RIS download Bibtex download
  • Hain T, Burget L, Dines J, Garner PN, Grezl F, el Hannani A, Huijbregts M, Karafiat M, Lincoln M & Wan V (2011) . IEEE Transactions on Audio, Speech and Language Processing. RIS download Bibtex download
  • Gibson M & Hain T (2010) . IEEE Transactions on Audio, Speech and Language Processing, 18(6), 1269-1279. RIS download Bibtex download
  • El Hannani A & Hain T (2010) . IEEE SIGNAL PROC LET, 17(1), 95-98. RIS download Bibtex download
  • Karafiát M, Burget L, Hain T & ÄŒernocký J (2008) Discrimininative training of narrow band - Wide band adapted systems for meeting recognition. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, 1217-1220. RIS download Bibtex download
  • Hain T, El Hannani A, Wrigley SN & Wan V (2008) Automatic speech recognition for scientific purposes - WebASR. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, 504-507. RIS download Bibtex download
  • Karafiát M, Burget L, Hain T & ÄŒernocký J (2008) Discrimininative training of narrow band - Wide band adapted systems for meeting recognition. INTERSPEECH 2008 - 9th Annual Conference of the International Speech Communication Association, 1217-1220. RIS download Bibtex download
  • Karafiát M, Burget L, ÄŒernocký J & Hain T (2007) Application of CMLLR in narrow band wide band adapted systems. International Speech Communication Association - 8th Annual Conference of the International Speech Communication Association, Interspeech 2007, 4, 2860-2863. RIS download Bibtex download
  • Renais S, Hain T & Boudard H (2007) Recognition and understanding of meetings the AMI and AMIDA projects. 2007 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2007, Proceedings, 238-247. RIS download Bibtex download
  • Hain T, Burget L, Dines J, Garau G, Wan V, Karafiat M, Vepa J & Lincoln M (2007) . ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, 4. RIS download Bibtex download
  • Hain T, Woodland PC, Evermann G, Gales MJF, Liu X, Moore GL, Povey D & Wang L (2006) Corrections to "Automatic Transcription of Conversational Telephone Speech".. IEEE Trans. Speech Audio Process., 14, 727-727. RIS download Bibtex download
  • Wan V & Hain T (2006) Strategies for language model web-data collection. ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, 1. RIS download Bibtex download
  • Hain T, Woodland PC, Evermann G, Gales MJF, Liu XY, Moore GL, Povey D & Wang L (2005) . IEEE T SPEECH AUDI P, 13(6), 1173-1185. RIS download Bibtex download
  • Hain T (2005) . SPEECH COMMUNICATION, 46(2), 171-188. RIS download Bibtex download

Chapters

  • Saenz JAL & Hain T (2021) , Statistical Language and Speech Processing (pp. 61-72). RIS download Bibtex download
  • Hain T & Garner PN (2012) , Multimodal Signal Processing (pp. 56-83). Cambridge University Press RIS download Bibtex download
  • Renals S & Hain T (2010) In Clark A, Fox C & Lappin S (Ed.), The Handbook of Computational Linguistics and Natural Language Processing (pp. 299-332). Wiley-Blackwell RIS download Bibtex download
  • Moore D, Dines J, Doss MM, Vepa J, Cheng O & Hain T (2006) (pp. 285-296). RIS download Bibtex download
  • Carletta J, Ashby S, Bourban S, Guillemot M, Kronenthal M, Lathoud G, Lincoln M, McCowan I, Hain T, Kraaij W , Post W et al (2005) , Machine Learning for Multimodal Interaction, Lecture Notes in Computer Science (pp. 28-39). Edinburgh: Springer. RIS download Bibtex download

Conference proceedings papers

  • Park C, Chen M & Hain T (2024) Automatic Speech Recognition System-Independent Word Error Rate Estimation. 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation, LREC-COLING 2024 - Main Conference Proceedings (pp 1979-1987) RIS download Bibtex download
  • Do CT, Imai S, Doddipatla R & Hain T (2024) Improving Accented Speech Recognition using Data Augmentation based on Unsupervised Text-to-Speech Synthesis. European Signal Processing Conference (pp 136-140) RIS download Bibtex download
  • Park C, Kang H & Hain T (2024) Character Error Rate Estimation for Automatic Speech Recognition of Short Utterances. European Signal Processing Conference (pp 131-135) RIS download Bibtex download
  • Meghanani A & Hain T (2024) Improving Acoustic Word Embeddings through Correspondence Training of Self-supervised Speech Representations. EACL 2024 - 18th Conference of the European Chapter of the Association for Computational Linguistics, Proceedings of the Conference, Vol. 1 (pp 1959-1967) RIS download Bibtex download
  • Mogridge R, Close G, Sutherland R, Hain T, Barker J, Goetze S & Ragni A (2024) Non-Intrusive Speech Intelligibility Prediction for Hearing-Impaired Users Using Intermediate ASR Features and Human Memory Models.. ICASSP (pp 306-310) RIS download Bibtex download
  • Close G, Hain T & Goetze S (2024) Hallucination in Perceptual Metric-Driven Speech Enhancement Networks. European Signal Processing Conference (pp 21-25) RIS download Bibtex download
  • Sutherland R, Close G, Hain T, Goetze S & Barker J (2024) Using Speech Foundational Models in Loss Functions for Hearing Aid Speech Enhancement. European Signal Processing Conference (pp 421-425) RIS download Bibtex download
  • Farooq MU, Ahmad R & Hain T (2023) . 2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 16 December 2023 - 20 December 2023. RIS download Bibtex download
  • Farooq MU, Ahmad R & Hain T (2023) MUST: A Multilingual Student-Teacher Learning approach for low-resource speech recognition RIS download Bibtex download
  • Close G, Hain T & Goetze S (2023) . 2023 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), 22 October 2023 - 25 October 2023. RIS download Bibtex download
  • Islam E, Park C & Hain T (2023) . 9th Workshop on Speech and Language Technology in Education (SLaTE) Proceedings (pp 151-155). Dublin, Ireland, 18 August 2023 - 18 August 2023. RIS download Bibtex download
  • Ahmad R, Jalal MA, Umar Farooq M, Ollerenshaw A & Hain T (2023) . ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 4 June 2023 - 10 June 2023. RIS download Bibtex download
  • Close G, Ravenscroft W, Hain T & Goetze S (2023) . ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 4 June 2023 - 10 June 2023. RIS download Bibtex download
  • Close G, Hain T & Goetze S (2023) PAMGAN+/-: Improving phase-aware speech enhancement performance via expanded discriminator training. AES Convention Europe 2023: 154th Audio Engineering Society Conference (pp 10656). Espoo, Helsinki, FInland, 13 May 2023 - 13 May 2023. RIS download Bibtex download
  • Ravenscroft W, Goetze S & Hain T (2023) . ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Rhodes Island, Greece, 4 June 2023 - 4 June 2023. RIS download Bibtex download
  • Close G, Hain T & Goetze S (2023) . IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, Vol. 2023-October RIS download Bibtex download
  • Park B, Park C & Li G (2022) . 2022 29th IEEE International Conference on Electronics, Circuits and Systems (ICECS), 24 October 2022 - 26 October 2022. RIS download Bibtex download
  • Close G, Hain T & Goetze S (2022) Non-intrusive Speech Intelligibility Estimated By Metric Prediction for Hearing Impaired Individuals for the Clarity Prediction Challenge 1. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, 18 September 2022 - 22 September 2022. RIS download Bibtex download
  • Park C, Ahmad R & Hain T (2022) . ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 23 May 2022 - 27 May 2022. RIS download Bibtex download
  • Lopez Saenz JA & Hain T (2022) . ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (pp 7267-7271) RIS download Bibtex download
  • Close G, Hain T & Goetze S (2022) MetricGAN+/-: Increasing Robustness of Noise Reduction on Unseen Data. Proc. 30th European Signal Processing Conference, EUSIPCO 2022. Belgrade, Serbia, 29 August 2022 - 2 September 2022. RIS download Bibtex download
  • Saenz JAL, Jalal MA, Milner R & Hain T (2021) . 2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 13 December 2021 - 17 December 2021. RIS download Bibtex download
  • Chen M, Shi Y & Hain T (2021) . ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 6 June 2021 - 11 June 2021. RIS download Bibtex download
  • Do C-T, Doddipatla R & Hain T (2021) . ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 6 June 2021 - 11 June 2021. RIS download Bibtex download
  • Huang Q & Hain T (2021) . ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (pp 6473-6477). Toronto, ON, Canada, 6 June 2021 - 11 June 2021. RIS download Bibtex download
  • Shi Y & Hain T (2021) . 2021 IEEE Spoken Language Technology Workshop (SLT) (pp 758-765). Shenzhen, China, 19 January 2021 - 22 January 2021. RIS download Bibtex download
  • Do C-T, Zhang S & Hain T (2021) . 2020 28th European Signal Processing Conference (EUSIPCO), 18 January 2021 - 21 January 2021. RIS download Bibtex download
  • Shi Y & Hain T (2021) , Vol. 00 (pp 750-757) RIS download Bibtex download
  • Huang S, Chen M, Xu Y, Ke D & Hain T (2021) (pp 559-573) RIS download Bibtex download
  • Friedl K, Rizos G, Stappen L, Hasan M, Specia L, Hain T & Schuller BW (2021) Uncertainty Aware Review Hallucination for Science Article Classification. Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021 (pp 5004-5009) RIS download Bibtex download
  • Shi Y, Huang Q & Hain T (2020) . The Speaker and Language Recognition Workshop (Odyssey 2020) (pp 451-458). Tokyo, Japan, 1 November 2020 - 5 November 2020. RIS download Bibtex download
  • Jalal MA, Milner R, Hain T & Moore RK (2020) . Interspeech 2020 (pp 4084-4088). Shanghai, China, 25 October 2020 - 29 October 2020. RIS download Bibtex download
  • Shi Y, Huang Q & Hain T (2020) . Proceedings of Interspeech 2020 (pp 2992-2996). Shanghai, China, 25 October 2020 - 29 October 2020. RIS download Bibtex download
  • Huang Q & Hain T (2020) . Proceedings of Interspeech 2020 (pp 4611-4615). Shanghai, China, 25 October 2020 - 29 October 2020. RIS download Bibtex download
  • Shi Y, Huang Q & Hain T (2020) . Proceedings of Interspeech 2020 (pp 1530-1534). Shanghai, China, 25 October 2020 - 29 October 2020. RIS download Bibtex download
  • Shi Y, Huang Q & Hain T (2020) . ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (pp 7579-7583). Barcelona, Spain (virtual), 4 May 2020 - 8 May 2020. RIS download Bibtex download
  • Sailor HB, Deena S, Jalal MA, Lileikyte R & Hain T (2019) . 2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 14 December 2019 - 18 December 2019. RIS download Bibtex download
  • Jalal MA, Moore RK & Hain T (2019) . 2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 14 December 2019 - 18 December 2019. RIS download Bibtex download
  • Milner R, Jalal MA, Ng RWM & Hain T (2019) . 2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 14 December 2019 - 18 December 2019. RIS download Bibtex download
  • Hain T & Schuller B (2019) Message from the technical program chairs. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, Vol. 2019-September (pp 13-15) RIS download Bibtex download
  • Errattahi R, Deena S, El Hannani A, Ouahmane H & Hain T (2018) . 2018 IEEE Spoken Language Technology Workshop (SLT), 18 December 2018 - 21 December 2018. RIS download Bibtex download
  • Loweimi E, Barker JP & Hain T (2018) . 2018 IEEE International Conference on Acoustics, Speech and Signal Processing Proceedings, 15 April 2018 - 20 April 2018. RIS download Bibtex download
  • Nicolao M, Sanders M & Hain T (2018) . Proceedings of Interspeech 2018 (pp 1666-1670), 2 September 2018 - 6 September 2018. RIS download Bibtex download
  • Errattahi R, El Hannani A, Hain T & Ouahmane H (2018) . 2018 4th International Conference on Advanced Technologies for Signal and Image Processing (ATSIP), 21 March 2018 - 24 March 2018. RIS download Bibtex download
  • Deena S, Ng RWM, Madhyastha P, Specia L & Hain T (2017) . 2017 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 16 December 2017 - 20 December 2017. RIS download Bibtex download
  • Salil Deena , Raymond W.M. Ng , Pranava Madhyashta , Lucia Specia & Thomas Hain (2017) . Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH (pp 2715-2719) RIS download Bibtex download
  • Loweimi E, Barker , Saz Torralba O & Hain (2017) . Proceedings of the Annual Conference of the International Speech Communication Association RIS download Bibtex download
  • Milner R & Hain T (2017) . Proceedings - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing (pp 4925-4929) RIS download Bibtex download
  • Wu C, Ng RWM, Torralba OS & Hain T (2017) . International Conference on Systems, Signals and Image Processing (IWSSIP) RIS download Bibtex download
  • (2017) . Interspeech 2017 RIS download Bibtex download
  • Ng RWM, Kwan ACM, Lee T & Hain T (2017) . 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 5 March 2017 - 9 March 2017. RIS download Bibtex download
  • Loweimi E, Barker J & Hain T (2017) . ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings (pp 5310-5314) RIS download Bibtex download
  • Errattahi R, Hannani AE, Ouahmane H & Hain T (2016) . 2016 IEEE/ACS 13th International Conference of Computer Systems and Applications (AICCSA), 29 November 2016 - 2 December 2016. RIS download Bibtex download
  • Saz O, Doulaty M, Deena S, Milner R, Ng RWM, Hasan M, Liu Y & Hain T (2016) . 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2015 - Proceedings (pp 624-631) RIS download Bibtex download
  • Olcoz J, Saz O & Hain T (2016) . Proceedings of the 17th Annual Conference of the International Speech Communication Association (Interspeech) RIS download Bibtex download
  • Hain T, Christian J, Saz O, Deena S, Hasan M, Ng RWM, Milner R, Doulaty M & Liu Y (2016) . Proceedings of the 17th Annual Conference of the International Speech Communication Association (Interspeech) (pp 1613-1617) RIS download Bibtex download
  • Casanueva I, Hain T, Nicolao M & Green P (2016) . Proceedings of the 17th Annual Meeting of the Special Interest Group on Discourse and Dialogue, September 2016 - September 2016. RIS download Bibtex download
  • Loweimi E, Barker J & Hain T (2016) . Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, Vol. 08-12-September-2016 (pp 3798-3802) RIS download Bibtex download
  • Ng R, Hain T & Chettri B (2016) . Combining weak tokenisers for phonotactic language recognition in a resource-constrained setting (pp 2939-2943), 9 September 2016 - 12 September 2016. RIS download Bibtex download
  • Loweimi E (2016) . USES RIS download Bibtex download
  • Ng W, Nicolao M, Saz O, Hasan M, Chettri B, Doulaty M, Lee T & Hain T (2016) . Proceedings of The Speaker and Language Recognition Workshop Odyssey 2016 RIS download Bibtex download
  • Nicolao M, Christensen H, Cunningham S, Green P & Hain T (2016) A framework for collecting realistic recordings of dysarthric speech - The homeService corpus. Proceedings of the 10th International Conference on Language Resources and Evaluation, LREC 2016 (pp 1993-1997) RIS download Bibtex download
  • Ng RWM, Shah K, Specia L & Hain T (2016) . ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, Vol. 2016-May (pp 6120-6124) RIS download Bibtex download
  • Milner R & Hain T (2016) . 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 20 March 2016 - 25 March 2016. RIS download Bibtex download
  • Alharbi G & Hain T (2016) The OpenCourseWare metadiscourse (OCWMD) corpus. Proceedings of the 10th International Conference on Language Resources and Evaluation, LREC 2016 (pp 1770-1776) RIS download Bibtex download
  • (2016) . Interspeech 2016 RIS download Bibtex download
  • Milner R, Saz O, Deena S, Doulaty M, Ng RWM & Hain T (2015) . 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU), 13 December 2015 - 17 December 2015. RIS download Bibtex download
  • Loweimi E, Barker J & Hain T (2015) Source-filter separation of speech signal in the phase domain. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, Vol. 2015-January (pp 598-602) RIS download Bibtex download
  • Doulaty M, Saz O & Hain T (2015) Unsupervised domain discovery using latent dirichlet allocation for acoustic modelling in speech recognition. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, Vol. 2015-January (pp 3640-3644) RIS download Bibtex download
  • Doulaty M, Saz O, Ng RWM & Hain T (2015) . 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU), 13 December 2015 - 17 December 2015. RIS download Bibtex download
  • Bell P, Gales MJF, Hain T, Kilgour J, Lanchantin P, Liu X, McParland A, Renals S, Saz O, Wester M & Woodland PC (2015) . 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU), 13 December 2015 - 17 December 2015. RIS download Bibtex download
  • Casanueva I, Hain T, Christensen H, Marxer R & Green P (2015) . Proceedings of the 16th Annual Meeting of the Special Interest Group on Discourse and Dialogue, September 2015 - September 2015. RIS download Bibtex download
  • Doulaty M, Saz O & Hain T (2015) Data-selective transfer learning for multi-domain speech recognition. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, Vol. 2015-January (pp 2897-2901) RIS download Bibtex download
  • Liu Y, Karanasou P & Hain T (2015) . 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 19 April 2015 - 24 April 2015. RIS download Bibtex download
  • Nicolao M, Beeston AV & Hain T (2015) . 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 19 April 2015 - 24 April 2015. RIS download Bibtex download
  • Liu Y, Karanasou P, Hain T & IEEE (2015) AN INVESTIGATION INTO SPEAKER INFORMED DNN FRONT-END FOR LVCSR. 2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP) (pp 4300-4304) RIS download Bibtex download
  • Ng RWM, Shah K, Aziz W, Specia L, Hain T & IEEE (2015) QUALITY ESTIMATION FOR ASR K-BEST LIST RESCORING IN SPOKEN LANGUAGE TRANSLATION. 2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP) (pp 5226-5230) RIS download Bibtex download
  • AlHarbi G, Ng RWM & Hain T (2015) Annotating meta-discourse in academic lectures from different disciplines.. SLaTE (pp 161-166) RIS download Bibtex download
  • Saz O, Doulaty M, Deena S, Milner R, Ng RWM, Hasan M, Liu Y & Hain T (2015) The 2015 sheffield system for transcription of Multi-Genre Broadcast media.. ASRU (pp 624-631) RIS download Bibtex download
  • Christensen H, Nicolao M, Cunningham S, Green P, Deena S & Hain T (2015) . IET International Conference on Technologies for Active and Assisted Living (TechAAL) RIS download Bibtex download
  • Hasan M, Doddipatla R & Hain T (2015) Noise-matched training of CRF based sentence end detection models. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, Vol. 2015-January (pp 349-353) RIS download Bibtex download
  • Ng RWM, Shah K, Specia L & Hain T (2015) A study on the stability and effectiveness of features in quality estimation for spoken language translation. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, Vol. 2015-January (pp 2257-2261) RIS download Bibtex download
  • Loweimi E, Doulaty M, Barker J & Hain T (2015) (pp 173-184) RIS download Bibtex download
  • Ng RWM, Shah K, Aziz W, Specia L & Hain T (2015) . 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 19 April 2015 - 24 April 2015. RIS download Bibtex download
  • AlHarbi G & Hain T (2015) Using Topic Segmentation Models for the Automatic Organisation of MOOCs resources.. EDM (pp 524-527) RIS download Bibtex download
  • Zhang P, Liu Y & Hain T (2014) . 2014 IEEE Spoken Language Technology Workshop (SLT), 7 December 2014 - 10 December 2014. RIS download Bibtex download
  • Liu Y, Zhang P & Hain T (2014) . 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) RIS download Bibtex download
  • Zhang P, Liu Y & Hain T (2014) . 2014 IEEE Workshop on Spoken Language Technology, SLT 2014 - Proceedings (pp 141-146) RIS download Bibtex download
  • Saz O, Doulaty M & Hain T (2014) . 2014 IEEE Spoken Language Technology Workshop (SLT), 7 December 2014 - 10 December 2014. RIS download Bibtex download
  • Christensen H, Casanueva I, Cunningham S, Green P & Hain T (2014) . 2014 IEEE Spoken Language Technology Workshop (SLT), 7 December 2014 - 10 December 2014. RIS download Bibtex download
  • Ng RWM, Doulaty M, Doddipatla R, Aziz W, Shah K, Saz O, Hasan M, AlHarbi G, Specia L & Hain T (2014) The USFD SLT System for IWSLT 2014. IWSLT RIS download Bibtex download
  • Fox C & Hain T (2014) Extending Limabeam with discrimination and coarse gradients. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH (pp 2440-2444) RIS download Bibtex download
  • Hasan M, Doddipatla R & Hain T (2014) Multi-pass sentence-end detection of lecture speech. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH (pp 2902-2906) RIS download Bibtex download
  • Doddipatla R, Hasan M & Hain T (2014) Speaker dependent bottleneck layer training for Speaker adaptation in automatic speech recognition. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH (pp 2199-2203) RIS download Bibtex download
  • Casanueva I, Christensen H, Hain T & Green P (2014) Adaptive speech recognition and dialogue management for users with speech disorders. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH (pp 1033-1037) RIS download Bibtex download
  • Saz O & Hain T (2014) . 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 4 May 2014 - 9 May 2014. RIS download Bibtex download
  • Saz O & Hain T (2013) Asynchronous factorisation of speaker and background with feature transforms in speech recognition. Proceedings of the 14th Annual Conference of the International Speech Communication Association (Interspeech) (pp 1238-1242), 25 August 2013 - 29 August 2013. RIS download Bibtex download
  • Ng R, Cohn T & Hain T (2013) Adaptation of lecture speech recognition system with machine translation output. Proceedings of the 38th IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP). Vancouver, Canada RIS download Bibtex download
  • Fox C, Liu Y, Zwyssig E & Hain T (2013) The 91Ö±²¥ Wargames Corpus. 14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5 (pp 1115-1119) RIS download Bibtex download
  • Saz O & Hain T (2013) Asynchronous Factorisation of Speaker and Background with Feature Transforms in Speech Recognition. 14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5 (pp 1237-1241) RIS download Bibtex download
  • Christensen H, Green P & Hain T (2013) Learning speaker-specific pronunciations of disordered speech. 14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5 (pp 1158-1162) RIS download Bibtex download
  • Christensen H, Aniol MB, Bell P, Green P, Hain T, King S & Swietojanski P (2013) Combining in-domain and out-of-domain speech data for automatic recognition of disordered speech. 14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5 (pp 3609-3612) RIS download Bibtex download
  • Lanchantin P, Bell PJ, Gales MJF, Hain T, Liu X, Long Y, Quinnell J, Renals S, Saz O, Seigel MS , Swietojanski P et al (2013) Automatic transcription of multi-genre media archives. CEUR Workshop Proceedings, Vol. 1012 (pp 26-31) RIS download Bibtex download
  • Christensen H, Green P & Hain T (2013) Learning speaker-specific pronunciations of disordered speech. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH (pp 1159-1163) RIS download Bibtex download
  • Christensen H, Aniol MB, Bell P, Green P, Hain T, King S & Swietojanski P (2013) Combining in-domain and out-of-domain speech data for automatic recognition of disordered speech. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH (pp 3642-3645) RIS download Bibtex download
  • Fox C, Liu Y, Zwyssig E & Hain T (2013) The 91Ö±²¥ Wargames Corpus.. 14th Annual Conference of the International Speech Communication Association (Interspeech 2013). Lyon, France, 25 August 2013 - 29 August 2013. RIS download Bibtex download
  • Christensen H, Casanuevo I, Cunningham S, Green P & Hain T (2013) HomeService: Voice-enabled assistive technology in the home using cloud-based automatic speech recognition. SLPAT 2013 - 4th Workshop on Speech and Language Processing for Assistive Technologies, SLPAT 2013, Workshop Proceedings (pp 29-34) RIS download Bibtex download
  • Al-Shareef S & Hain T (2012) CRF-based diacritisation of colloquial Arabic for automatic speech recognition. 13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012, Vol. 3 (pp 1822-1825) RIS download Bibtex download
  • Ng RWM, Hain T & Hirose K (2012) An alignment matching method to explore pseudosyllable properties across different corpora. 13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012, Vol. 1 (pp 862-865) RIS download Bibtex download
  • Christensen H, Cunningham S, Fox C, Green P & Hain T (2012) A comparative study of adaptive, automatic recognition of disordered speech. 13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012, Vol. 2 (pp 1774-1777) RIS download Bibtex download
  • Kamper H, de Wet F, Hain T & Niesler T (2012) RESOURCE DEVELOPMENT AND EXPERIMENTS IN AUTOMATIC SOUTH AFRICAN BROADCAST NEWS TRANSCRIPTION. 3rd Workshop on Spoken Language Technologies for Under-Resourced Languages, SLTU 2012 (pp 102-106) RIS download Bibtex download
  • Tucker R, Fry D, Wan V, Wrigley S & Hain T (2011) Extending Audio Notetaker to Browse WebASR Transcriptions. ±õ²Ô³Ù±ð°ù²õ±è±ð±ð³¦³ó’11 RIS download Bibtex download
  • Marino D & Hain T (2011) An Analysis of Automatic Speech Recognition with Multiple Microphones. ±õ²Ô³Ù±ð°ù²õ±è±ð±ð³¦³ó’11. Florence RIS download Bibtex download
  • Wrigley SN & Hain T (2011) Web-based automatic speech recognition service - webASR. ±õ²Ô³Ù±ð°ù²õ±è±ð±ð³¦³ó’11 RIS download Bibtex download
  • Al-Shareef S & Hain T (2011) An Investigation in Speech Recognition for Colloquial Arabic. ±õ²Ô³Ù±ð°ù²õ±è±ð±ð³¦³ó’11 RIS download Bibtex download
  • Wrigley SN & Hain T (2011) Making an automatic speech recognition service freely available on the web. ±õ²Ô³Ù±ð°ù²õ±è±ð±ð³¦³ó’11 RIS download Bibtex download
  • Kempton T, Moore RK & Hain T (2011) Cross-language phone recognition when the target language phoneme inventory is not known. ±õ²Ô³Ù±ð°ù²õ±è±ð±ð³¦³ó’11. Florence RIS download Bibtex download
  • Tucker R, Fry D, Wan V, Wrigley S, Hain T & Assoc ISC (2011) Extending Audio Notetaker to Browse WebASR Transcriptions. 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5 (pp 3336-+) RIS download Bibtex download
  • Hain T & Renals S (2010) Meeting Recognition. Tutorial interspeech 2010 RIS download Bibtex download
  • Hain T, Burget L, Dines J, Garner PN, El Hannani A, Huijbregts M, Karafiat M, Lincoln M & Wan V (2010) The AMIDA 2009 Meeting Transcription System. 11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-4 (pp 358-361) RIS download Bibtex download
  • Hain T, Burget L, Dines J, Garner PN, el Hannani A, Huijbregts M, Karafiat M, Lincoln M & Wan V (2010) The AMIDA 2009 Meeting Transcription System. ±õ²Ô³Ù±ð°ù²õ±è±ð±ð³¦³ó’10 (pp 358-361) RIS download Bibtex download
  • Garner PN, Dines J, Hain T, El Hannani A, Karafiar M, Korchagin D, Lincoln M, Wan V, Zhang L & ASSOC I-ISC (2009) Real-Time ASR from Meetings. INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5 (pp 2067-+) RIS download Bibtex download
  • Garner PN, Dines J, Hain T, El Hannani A, Karafiát M, Korchagin D, Lincoln M, Wan V & Zhang L (2009) Real-time ASR from meetings. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH (pp 2119-2122) RIS download Bibtex download
  • Renals S, Hain T, Bourlard H & IEEE (2008) Interpretation of multiparty meetings the AMI and AMIDA projects. 2008 HANDS-FREE SPEECH COMMUNICATION AND MICROPHONE ARRAYS (pp 116-+) RIS download Bibtex download
  • Renals S, Hain T & Bourlard H (2008) . 2008 Hands-free Speech Communication and Microphone Arrays, Proceedings, HSCMA 2008 (pp 115-118) RIS download Bibtex download
  • Hain T, El Hannani A, Wrigley SN & Wan V (2008) Automatic speech recognition for scientific purposes - webASR. INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5 (pp 504-507) RIS download Bibtex download
  • Wan V, Dines J, El Hannani A & Hain T (2008) BOB: A LEXICON AND PRONUNCIATION DICTIONARY GENERATOR. 2008 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY: SLT 2008, PROCEEDINGS (pp 217-220) RIS download Bibtex download
  • Hain T, Burget L, Dines J, Garau G, Karafiat M, van Leeuwen D, Lincoln M & Wan V (2008) The 2007 AMI(DA) system for meeting transcription. MULTIMODAL TECHNOLOGIES FOR PERCEPTION OF HUMANS, Vol. 4625 (pp 414-428) RIS download Bibtex download
  • Karafiat M, Burget L, Hain T & Cernocky J (2007) Application of CMLLR in narrow band wide band adapted systems. ±õ²Ô³Ù±ð°ù²õ±è±ð±ð³¦³ó’07 (pp 282-285). Antwerp RIS download Bibtex download
  • Gibson M & Hain T (2007) Temporal Masking for Unsupervised Minimum Bayes Risk Speaker Adaptation. INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4 (pp 1577-1580) RIS download Bibtex download
  • Hain T, Burget L, Dines J, Garau G, Karafiat M, Lincoln M, Vepa J & Wan V (2007) The AMI system for the transcription of speech in meetings. 2007 IEEE International Conference on Acoustics, Speech, and Signal Processing, Vol IV, Pts 1-3 (pp 357-360) RIS download Bibtex download
  • Dines J, Vepa J & Hain T (2006) The segmentation of multi-channel meeting recordings for automatic speech recognition. INTERSPEECH 2006 and 9th International Conference on Spoken Language Processing, INTERSPEECH 2006 - ICSLP, Vol. 3 (pp 1213-1216) RIS download Bibtex download
  • Gibson M & Hain T (2006) Hypothesis Spaces For Minimum Bayes Risk Training In Large Vocabulary Speech Recognition. INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5 (pp 2406-2409) RIS download Bibtex download
  • Uraga E & Hain T (2006) Automatic Speech Recognition Experiments with Articulatory Data. INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5 (pp 353-356) RIS download Bibtex download
  • Al-Hames M, Hain T, Cernocky J, Schreiber S, Poel M, Muller R, Marcel S, van Leeuwen D, Odobez JM, Ba S , Bourlard H et al (2006) Audio-visual processing in meetings: Seven questions and current AMI answers. Machine Learning for Multimodal Interaction, Vol. 4299 (pp 24-35) RIS download Bibtex download
  • Wan V & Hain T (2006) Strategies for language model web-data collection. 2006 IEEE International Conference on Acoustics, Speech, and Signal Processing, Vol I, Proceedings (pp 1069-1072). Toulouse, FRANCE, 14 May 2006 - 19 May 2006. RIS download Bibtex download
  • Wan V & Hain T (2006) Strategies for language model web-data collection. 2006 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-13 (pp 1069-1072) RIS download Bibtex download
  • Hain T, Burget L, Dines J, Garau G, Karafiat M, Lincoln M, Vepal J & Wan V (2006) The AMI meeting transcription system: Progress and performance. Machine Learning for Multimodal Interaction, Vol. 4299 (pp 419-431) RIS download Bibtex download
  • McCowan I, Carletta J, Kraaij W, Ashby S, Bourban S, Flynn M, Guillemot M, Hain T, Kadlec J, Karaiskos V , Kronenthal M et al (2005) The AMI Meeting Corpus. 5th International Conference on Methods and Techniques in Behavioral Research RIS download Bibtex download
  • Garau G, Renals S & Hain T (2005) Applying vocal tract length normalization to meeting recordings. 9th European Conference on Speech Communication and Technology (pp 265-268) RIS download Bibtex download
  • Hain T, Dines J, Garau G, Karafiat M, Moore D, Wan V, Ordelman R & Renals S (2005) Transcription of conference room meetings: An investigation. 9th European Conference on Speech Communication and Technology (pp 1661-1664) RIS download Bibtex download
  • Hain T, Burget L, Dines J, McCowan I, Garau G, Karafiat M, Lincoln M, Moore D, Wan V, Ordelman R & Renals S (2005) The development of the AMI system for the transcription of speech in meetings. MACHINE LEARNING FOR MULTIMODAL INTERACTION, Vol. 3869 (pp 344-356) RIS download Bibtex download
  • Hain T, Burget L, Dines J, Garau G, Karafiat M, Lincoln M, McCowan I, Moore D, Wan V, Ordelman R & Renals S (2005) The 2005 AMI system for the transcription of speech in meetings. MACHINE LEARNING FOR MULTIMODAL INTERACTION, Vol. 3869 (pp 450-462) RIS download Bibtex download
  • Kim DY, Gales MJF, Chan HY, Woodland PC, Umesh S & Hain T (2004) Progress in Broadcast News English Transcription. EARS STT Technical Meeting 2004. Montreal, Canada RIS download Bibtex download
  • Woodland PC, Chan HY, Evermann G, Gales MJF, Hain T, Jia B, Kim DY, Liu X, Mrva D, Sim KC , Tranter SE et al (2004) Cambridge STT Overview. EARS Mid-year Meeting 2004 RIS download Bibtex download
  • Kim DY, Umesh S, Gales MJF, Hain T & Woodland PC (2004) Using VTLN for Broadcast News Transcription. ±õ°ä³§³¢±Ê’04. Cambridge University, UK RIS download Bibtex download
  • Evermann G, Chan HY, Gales MJF, Hain T, Liu X, Mrva D, Wang L & Woodland PC (2004) Development of the 2003 CU-HTK conversational telephone speech transcription system. ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, Vol. 1 RIS download Bibtex download
  • Evermann G, Chan HY, Gales MJF, Hain T, Liu X, Mrva D, Wang L & Woodland P (2004) Development of the 2003 CU-HTK Conversational Telephone Speech transcription system. 2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS (pp 249-252) RIS download Bibtex download
  • Hain T (2003) Single Pronunciation Dictionaries - Construction and Performance. EARS STT Technical Meeting 2004 RIS download Bibtex download
  • Kim DY, Evermann G, Hain T, Mrva D, Tranter SE, Wang L & Woodland PC (2003) 2003 CU-HTK Broadcast News English System Development. Rich Transcription Workshop 2003s RIS download Bibtex download
  • Woodland PC, Chan HY, Evermann G, Gales MJF, Hain T, Kim DY, Liu X, Mrva D, Povey D, Tranter SE , Wang L et al (2003) 2003 CU-HTK English CTS Systems. Rich Transcription Workshop 2003s. Boston, Ma RIS download Bibtex download
  • Jia B, Sim KC, Gales MJF, Hain T, Liu X, Woodland PC & Yu K (2003) CU-HTK RT-03 Mandarin CTS System. Rich Transcription Workshop 2003 RIS download Bibtex download
  • Woodland PC, Evermann G, Gales MJF, Hain T, Chan HY, Jia B, Kim DY, Liu X, Mrva D, Povey D , Sim KC et al (2003) Recent Experiments with HTK Broadcast News and Conversational Telephone Systems. EARS Mid-year meeting 2003 RIS download Bibtex download
  • Kim DY, Evermann G, Hain T, Mrva D, Tranter SE, Wang L & Woodland P (2003) Recent advances in broadcast news transcription. ASRU'03: 2003 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING ASRU '03 (pp 105-110) RIS download Bibtex download
  • Hain T (2002) Implicit Pronunciation Modelling in ASR. ITRW PMLA 2002. Estes Park, Colorado RIS download Bibtex download
  • Woodland PC, Evermann G, Gales MJF, Hain T, Liu X, Moore GL, Povey D & Wang L (2002) CU-HTK APRIL 2002 SWITCHBOARD SYSTEM. Rich Transcription Workshop 2002. Vienna, VA RIS download Bibtex download
  • Hain T, Woodland PC, Evermann G & Povey D (2001) New features in the CU-HTK system for transcription of conversational telephone speech. 2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS (pp 57-60) RIS download Bibtex download
  • Hain T, Woodland PC, Evermann G & Povey D (2000) The CU-HTK March 2000 HUB5E Transcription System. Speech Transcription Workshop 2000. College Park, Maryland RIS download Bibtex download
  • Hain T & Woodland PC (2000) Modelling sub-phone insertions and deletions in continuous speech recognition. ICSLP 2000 RIS download Bibtex download
  • Hain T & Woodland PC (1999) Dynamic HMM selection for continuous speech recognition. ·¡³Ü°ù´Ç²õ±è±ð±ð³¦³ó’99 (pp 1327-1330). Budapest RIS download Bibtex download
  • Woodland PC, Hain T, Moore GL, Niesler TR, Povey D, Tuerk A & Whittaker EWD (1999) The 1998 HTK Broadcast News Transcription System: Development and Results. 1999 DARPA Broadcast News Transcription and Understanding Workshop. Herndon, VA RIS download Bibtex download
  • Odell JJ, Woodland PC & Hain T (1999) The CUHTK-Entropic 10xRT Broadcast News Transcription System. 1999 DARPA Broadcast News Transcription and Understanding Workshop (pp 271-275). Herndon, VA RIS download Bibtex download
  • Woodland PC, Odell JJ, Hain T, Moore GL, Niesler TR, Tuerk A & Whittaker EWD (1999) Improvements in Accuracy and Speed in the HTK Broadcast News Transcription System. ·¡³Ü°ù´Ç²õ±è±ð±ð³¦³ó’99 RIS download Bibtex download
  • Hain T & Woodland PC (1999) Hidden model sequences. Hub5 Workshop’99 RIS download Bibtex download
  • Hain T & Woodland PC (1999) RECENT EXPERIMENTS WITH THE CU-HTK HUB5 SYSTEM. Hub5 Workshop’99 RIS download Bibtex download
  • Hain T, Woodland PC, Niesler TR & Whittaker EWD (1999) The 1998 HTK system for transcription of conversational telephone speech. ICASSP '99: 1999 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS VOLS I-VI (pp 57-60) RIS download Bibtex download
  • Hain T & Woodland PC (1998) SEGMENTATION AND CLASSIFICATION OF BROADCAST NEWS AUDIO. ±õ°ä³§³¢±Ê’98 RIS download Bibtex download
  • Woodland PC, Hain T, Johnson SE, Niesler TR, Tuerk A, Whittaker EWD & Young SJ (1998) The 1997 HTK Broadcast News Transcription System. 1998 DARPA Broadcast News Transcription and Understanding Workshop (pp 41-48) RIS download Bibtex download
  • Hain T & Woodland PC (1998) CU-HTK Acoustic modeling experiments. Hub5 Workshop 98 RIS download Bibtex download
  • Hain T, Johnson SE, Tuerk A, Woodland PC & Young SJ (1998) Segment Generation and Clustering in the HTK Broadcast News Transcription System. 1998 DARPA Broadcast News Transcription and Understanding Workshop (pp 133-137) RIS download Bibtex download
  • Woodland PC, Hain T, Johnson SE, Niesler TR, Tuerk A & Young SJ (1998) Experiments in broadcast news transcription. PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6 (pp 909-912) RIS download Bibtex download
  • Huertgen B & Hain T (1994) . ±õ°ä´¡³§³§±Ê’94 (pp 561-564) RIS download Bibtex download
  • Chen M, Zhang H, Li Y, Luo J, Wu W, Ma Z, Bell P, Lai C, Reiss JD, Wang L , Woodland PC et al () . The Speaker and Language Recognition Workshop (Odyssey 2024) RIS download Bibtex download
  • Ravenscroft W, Close G, Goetze S, Hain T, Soleymanpour M, Chowdhury A & Fuhs MC () Transcription-free fine-tuning of speech separation models for noisy and reverberant multi-speaker automatic speech recognition. Proceedings of Interspeech 2024. Kos Island, Greece, 1 September 2024 - 1 September 2024. RIS download Bibtex download
  • Ravenscroft W, Goetze S & Hain T () Combining Conformer and Dual-Path-Transformer Networks for Single Channel Noisy Reverberant Speech Separation. ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) RIS download Bibtex download
  • Mogridge R, Close G, Sutherland R, Hain T, Barker J, Goetze S & Ragni A () Non-intrusive speech intelligibility prediction for hearing-impaired users using intermediate ASR features and human memory models. 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP2024). Seoul, Korea, 14 April 2024 - 14 April 2024. RIS download Bibtex download
  • Ahmad R, Farooq MU & Hain T () PROGRESSIVE UNSUPERVISED DOMAIN ADAPTATION FOR ASR USING ENSEMBLE MODELS AND MULTI-STAGE TRAINING. 2024 IEEE International Conference on Acoustics, Speech and Signal Processing RIS download Bibtex download
  • Meghanani A & Hain T () SCORE: Self-supervised correspondence fine-tuning for improved content representations. 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2024). Seoul, Korea, 14 April 2024 - 14 April 2024. RIS download Bibtex download
  • Hain T, Close G, Ravenscroft W & Goetze S () MULTI-CMGAN+/+: LEVERAGING MULTI-OBJECTIVE SPEECH QUALITY METRIC PREDICTION FOR SPEECH ENHANCEMENT. 2024 IEEE International Conference on Acoustics, Speech and Signal Processing RIS download Bibtex download
  • Hain T, Ragni A & Protima NS () Adapting pretrained models for adult to child voice conversion, EUSIPCO 2023. Adapting pretrained models for adult to child voice conversion RIS download Bibtex download
  • Hain T, Ollerenshaw A & Md Asif J () Probing Statistical Representations. Probing Statistical Representations for End-to-End ASR RIS download Bibtex download
  • Ravenscroft JW, Goetze S & Hain T () On time domain conformer models for monaural speech separation in noisy reverberant acoustic environments. Proceedings of the IEEE Workshop on Automatic Speech Recognition and Understanding. Beitou, Taipei, 16 December 2023 - 16 December 2023. RIS download Bibtex download
  • Hain T & Meghanani A () DERIVING TRANSLATIONAL ACOUSTIC SUB-WORD EMBEDDINGS. Proceedings of the IEEE RIS download Bibtex download
  • Hain T, Islam E & Nomo Sudro P () SIMULATION OF TEACHER-LEARNER INTERACTION IN ENGLISH LANGUAGE PRONUNCIATION LEARNING. IEEE RIS download Bibtex download
  • Close GL, Ravenscroft W, Hain T & Goetze S () . 7th International Workshop on Speech Processing in Everyday Environments (CHiME 2023) RIS download Bibtex download
  • Do C-T, Doddipatla R, Li M & Hain T () . INTERSPEECH 2023 RIS download Bibtex download
  • Hain T & Farooq MU () Learning Cross-lingual Mappings for Data Augmentation to Improve Low-Resource Speech Recognition. INTERSPEECH 2023. Dublin, Ireland, 20 August 2023 - 20 August 2023. RIS download Bibtex download
  • Ravenscroft J, Goetze S & Hain T () On data sampling strategies for training neural network speech separation models. 2023 31st European Signal Processing Conference (EUSIPCO). Helsinki, Finland, 4 September 2023 - 4 September 2023. RIS download Bibtex download
  • Hain T, Goetze S & Ravenscroft W () Utterance Weighted Multi-Dilation Temporal Convolutional Networks for Monaural Speech Dereverberation. Utterance Weighted Multi-Dilation Temporal Convolutional Networks for Monaural Speech Dereverberation, 1 September 2022. RIS download Bibtex download
  • Ravenscroft W, Goetze S & Hain T () Receptive Field Analysis of Temporal Convolutional Networks for Monaural Speech Dereverberation. IEEE 30th European Signal Processing Conference RIS download Bibtex download
  • Farooq M, Haniya Narayana DA & Hain T () Non-Linear Pairwise Language Mappings for Low-Resource Multilingual Acoustic Model Fusion. Interspeech 2022, 23rd Annual Conference of the International Speech Communication Association RIS download Bibtex download
  • Farooq M & Hain T () Investigating the Impact of Cross-lingual Acoustic-Phonetic Similarities on Multilingual Speech Recognition. Interspeech 2022, 23rd Annual Conference of the International Speech Communication Association RIS download Bibtex download
  • Ollerenshaw A, Jalal MA & Hain T () . Interspeech 2021 RIS download Bibtex download
  • Jalal MA, Milner R & Hain T () . Interspeech 2020 RIS download Bibtex download
  • Stappen L, Rizos G, Hasan M, Hain T & Schuller BW () . Interspeech 2020 RIS download Bibtex download
  • Chen M & Hain T () . Interspeech 2020 RIS download Bibtex download
  • Sailor HB & Hain T () . Interspeech 2020 RIS download Bibtex download
  • Huang Q & Hain T () . Interspeech 2019 RIS download Bibtex download
  • Doulaty M & Hain T () . Interspeech 2019 RIS download Bibtex download
  • Jalal MA, Loweimi E, Moore RK & Hain T () . Interspeech 2019 RIS download Bibtex download
  • Loweimi E, Barker J & Hain T () . Interspeech 2018 RIS download Bibtex download
  • Loweimi E, Barker J & Hain T () . Interspeech 2017 RIS download Bibtex download
  • Deena S, Hasan M, Doulaty M, Saz O & Hain T () . Interspeech 2016 RIS download Bibtex download
  • Al-Shareef S & Hain T () . Interspeech 2016 RIS download Bibtex download
  • Liu Y, Fox C, Hasan M & Hain T () . Interspeech 2016 RIS download Bibtex download
  • Casanueva I, Hain T & Green P () . Interspeech 2016 RIS download Bibtex download
  • Milner R & Hain T () . Interspeech 2016 RIS download Bibtex download
  • Doulaty M, Saz O, Ng RWM & Hain T () . Interspeech 2016 RIS download Bibtex download
  • Loweimi E, Barker J & Hain T () . USES Conference Proceedings RIS download Bibtex download
  • Woodland PC, Odell JJ, Hain T, Moore GL, Niesler TR, Tuerk A & Whittaker EWD () . 6th European Conference on Speech Communication and Technology (pp 1043-1046) RIS download Bibtex download

Reports

  • Close G, Hollands S, Goetze S & Hain T (2022) Clarity Prediction Challenge 1 Entry: Non-intrusive Speech Intelligibility Metric Prediction - Technical Report RIS download Bibtex download
  • el Hannani A & Hain T (2011) Data Dependence of Speech Decoder Parameters RIS download Bibtex download
  • Gibson M & Hain T (2011) Confidence-informed unsupervised Minimum Bayes Risk acoustic model adaptation RIS download Bibtex download
  • Hain T, Dines J & McCowan I (2006) Conversational multi-party speech recognition using remote microphones RIS download Bibtex download
  • Hain T, Woodland PC, Evermann G, Liu X, Moore GL, Povey D & Wang L (2003) Automatic Transcription of Conversational Telephone Speech. Development of the CU-HTK 2002 System RIS download Bibtex download

Theses / Dissertations

  • Hain T (2001) Hidden Model Sequence Models for Automatic Speech Recognition. RIS download Bibtex download
  • Hain T (1993) On the Use of Iterated Function Systems for Coding of Grayscale Images. RIS download Bibtex download

Datasets

  • Nicolao M, Hain T, Christensen H, Green P & Cunningham S . RIS download Bibtex download
  • Deena S, Hasan M, Bashkand MD, Torralba OS & Hain T . RIS download Bibtex download
  • Torralba OS, Hain T & Martinez JO . RIS download Bibtex download
  • Torralba OS, Hain T, Deena S, Bashkand MD, Hasan M, Ng WM, Milner R & Liu Y . RIS download Bibtex download
  • Torralba OS & Hain T . RIS download Bibtex download
  • Torralba OS, Hain T, Deena S, Bashkand MD, Khaliq B, Ng WM, Milner R, Hasan M & Martinez JO . RIS download Bibtex download
  • Deena S, Hasan M, Bashkand MD, Torralba OS & Hain T . RIS download Bibtex download
  • Liu Y, Hain T & Hasan M . RIS download Bibtex download
  • Ng WM, Specia L, Hain T & Shah K . RIS download Bibtex download

Other

  • Ng WM, Kwan ACM, Lee T & Hain T () . RIS download Bibtex download

Preprints

  • Sutherland R, Close G, Hain T, Goetze S & Barker J (2024) Using Speech Foundational Models in Loss Functions for Hearing Aid Speech Enhancement, arXiv. RIS download Bibtex download
  • Do C-T, Imai S, Doddipatla R & Hain T (2024) , arXiv. RIS download Bibtex download
  • Ravenscroft W, Close G, Goetze S, Hain T, Soleymanpour M, Chowdhury A & Fuhs MC (2024) Transcription-Free Fine-Tuning of Speech Separation Models for Noisy and Reverberant Multi-Speaker Automatic Speech Recognition, arXiv. RIS download Bibtex download
  • Meghanani A & Hain T (2024) , arXiv. RIS download Bibtex download
  • Chen M, Zhang H, Li Y, Luo J, Wu W, Ma Z, Bell P, Lai C, Reiss J, Wang L , Woodland PC et al (2024) , arXiv. RIS download Bibtex download
  • Park C, Chen M & Hain T (2024) Automatic Speech Recognition System-Independent Word Error Rate Estimation. RIS download Bibtex download
  • Close G, Hain T & Goetze S (2024) Hallucination in Perceptual Metric-Driven Speech Enhancement Networks, arXiv. RIS download Bibtex download
  • Meghanani A & Hain T (2024) , arXiv. RIS download Bibtex download
  • Meghanani A & Hain T (2024) , arXiv. RIS download Bibtex download
  • Ahmad R, Farooq MU & Hain T (2024) Progressive unsupervised domain adaptation for ASR using ensemble models and multi-stage training. RIS download Bibtex download
  • Mogridge R, Close G, Sutherland R, Hain T, Barker J, Goetze S & Ragni A (2024) Non-Intrusive Speech Intelligibility Prediction for Hearing-Impaired Users using Intermediate ASR Features and Human Memory Models, arXiv. RIS download Bibtex download
  • Close G, Ravenscroft W, Hain T & Goetze S (2023) Multi-CMGAN+/+: Leveraging Multi-Objective Speech Quality Metric Prediction for Speech Enhancement. RIS download Bibtex download
  • Park C, Lu C, Chen M & Hain T (2023) Fast Word Error Rate Estimation Using Self-Supervised Representations For Speech And Text. RIS download Bibtex download
  • Ravenscroft W, Goetze S & Hain T (2023) , arXiv. RIS download Bibtex download
  • Close G, Hain T & Goetze S (2023) , arXiv. RIS download Bibtex download
  • Close G, Hain T & Goetze S (2023) , arXiv. RIS download Bibtex download
  • Ollerenshaw A, Jalal MA, Milner R & Hain T (2023) , arXiv. RIS download Bibtex download
  • Ravenscroft W, Goetze S & Hain T (2023) , arXiv. RIS download Bibtex download
  • Ahmad R, Jalal MA, Farooq MU, Ollerenshaw A & Hain T (2023) Towards domain generalisation in ASR with elitist sampling and ensemble knowledge distillation, arXiv. RIS download Bibtex download
  • Ollerenshaw A, Jalal MA & Hain T (2022) Dynamic Kernels and Channel Attention with Multi-Layer Embedding Aggregation for Speaker Verification, arXiv. RIS download Bibtex download
  • Ollerenshaw A, Jalal MA & Hain T (2022) Probing Statistical Representations For End-To-End ASR, arXiv. RIS download Bibtex download
  • Ravenscroft W, Goetze S & Hain T (2022) Deformable Temporal Convolutional Networks for Monaural Noisy Reverberant Speech Separation. RIS download Bibtex download
  • Park C, Ahmad R & Hain T (2022) Unsupervised data selection for Speech Recognition with contrastive loss ratios. RIS download Bibtex download
  • Farooq MU, Narayana DAH & Hain T (2022) Non-Linear Pairwise Language Mappings for Low-Resource Multilingual Acoustic Model Fusion. RIS download Bibtex download
  • Farooq MU & Hain T (2022) Investigating the Impact of Cross-lingual Acoustic-Phonetic Similarities on Multilingual Speech Recognition. RIS download Bibtex download
  • Milner R, Jalal MA, Ng RWM & Hain T (2022) A cross-corpus study on speech emotion recognition. RIS download Bibtex download
  • Ollerenshaw A, Jalal MA & Hain T (2022) Insights on Neural Representations for End-to-End Speech Recognition, arXiv. RIS download Bibtex download
  • Ravenscroft W, Goetze S & Hain T (2022) Utterance Weighted Multi-Dilation Temporal Convolutional Networks for Monaural Speech Dereverberation. RIS download Bibtex download
  • Ravenscroft W, Goetze S & Hain T (2022) Receptive Field Analysis of Temporal Convolutional Networks for Monaural Speech Dereverberation. RIS download Bibtex download
  • Chen M, Zhou Y, Huang H & Hain T (2022) Efficient Non-Autoregressive GAN Voice Conversion using VQWav2vec Features and Dynamic Convolution. RIS download Bibtex download
  • Close G, Hain T & Goetze S (2022) MetricGAN+/-: Increasing Robustness of Noise Reduction on Unseen Data. RIS download Bibtex download
  • Do C-T, Doddipatla R & Hain T (2021) Multiple-hypothesis CTC-based semi-supervised adaptation of end-to-end speech recognition, arXiv. RIS download Bibtex download
  • Chen M, Shi Y & Hain T (2020) Towards Low-Resource StarGAN Voice Conversion using Weight Adaptive Instance Normalization. RIS download Bibtex download
  • Chen M & Hain T (2020) Unsupervised Acoustic Unit Representation Learning for Voice Conversion using WaveNet Auto-encoders. RIS download Bibtex download
  • Doulaty M, Saz O, Ng RWM & Hain T (2016) Automatic Genre and Show Identification of Broadcast Media, arXiv. RIS download Bibtex download
  • Saz O, Doulaty M, Deena S, Milner R, Ng RWM, Hasan M, Liu Y & Hain T (2015) The 2015 91Ö±²¥ System for Transcription of Multi-Genre Broadcast Media, arXiv. RIS download Bibtex download
  • Doulaty M, Saz O, Ng RWM & Hain T (2015) Latent Dirichlet Allocation Based Organisation of Broadcast Media Archives for Deep Neural Network Adaptation, arXiv. RIS download Bibtex download
  • Saz O, Doulaty M & Hain T (2015) Background-tracking Acoustic Features for Genre Identification of Broadcast Shows, arXiv. RIS download Bibtex download
  • Doulaty M, Saz O & Hain T (2015) Data-selective Transfer Learning for Multi-Domain Speech Recognition, arXiv. RIS download Bibtex download
  • Doulaty M, Saz O & Hain T (2015) Unsupervised Domain Discovery using Latent Dirichlet Allocation for Acoustic Modelling in Speech Recognition, arXiv. RIS download Bibtex download
  • Hain T & Farooq MU () . RIS download Bibtex download
  • Ng RWM, Doulaty M, Doddipatla R, Aziz W, Shah K, Saz O, Hasan M, AlHarbi G, Specia L & Hain T () The USFD Spoken Language Translation System for IWSLT 2014. RIS download Bibtex download
Grants

Current grants

  • Automatic voice conversion for transforming professional adult voice actors to artificial child voice actors, Innovate UK, 01/2021 - 01/2023, £173,605, as PI
  • , EPSRC, 04/2019 to 09/2027, £5,508,850, as PI
  • VoiceBase Centre, VoiceBase Inc., 04/2018 - 03/2022, £1,499,972, as PI
  • WFST-based integration of ASR and MT in Spoken Language Translation, Google, 03/2014 to 12/2022, £63,588, as PI

Previous grants

  • MAUDIE: Multimedia Analysis for Unsupervised Dubbing In Entertainment, Innovate UK, 05/2018 to 07/2021, £393,115, as PI
  • TUTO II: Reading skills tutoring system, ITSLANGUAGE BV, 08/2017 to 12/2019, £121,439, as PI
  • Sound Source Separation Based on Deep Learning, Industrial, 05/2019 - 04/2020, £48,000, as PI
  • Acoustic correlates of emotions for automatic recognition, Industrial, 10/2018 to 09/2019, £48,900, as PI
  • Bridge Project, VoiceBase Inc., 09/2017 to 03/2018, £61,200, as PI
  • STATUS IV: Speech Technology and Translation Universal Survey, Defence Science and Technology Laboratory, 01/2017 to 10/2017, £60,000, as PI
  • TUTO: Reading skills tutoring system, ITSLANGUAGE BV, 09/2016 to 08/2017, £61,983, as PI
  • STATUS III: Speech Technology and Translation Universal Survey, Defence Science and Technology Laboratory, 01/2015 to 07/2016, £78,684, as PI
  • STATUS II: Speech Technology and Translation Universal Survey, Defence Science and Technology Laboratory, 11/2013 to 05/2014, £98,982, as PI
  • ItsLanguage, ITSLANGUAGE BV, 11/2012 to 03/2015, £68,333, as PI
  • German System Adaptation, ITSLANGUAGE BV, 11/2012 to 03/2015, £42,373, as PI
  • DocuMeet: Transcription, summarisation and documentation of meetings using advanced speech technologies, indexing and browsing capabilities, European Commission - FP7, 11/2012 to 10/2014, £368,433, as PI
  • STATUS: Speech Technology and Translation Universal Survey, Defence Science and Technology Laboratory, 10/2012 to 08/2013, £73,726, as PI
  • A Joint Model of Spoken Language Translation, Google, 09/2011 to 12/2016, £43,014, as PI
  • , EPSRC, 05/2011 to 07/2016, £1,798,665, as PI
  • Unsupervised Domain Adaptation, CISCO, 11/2010 to 04/2012, £121,745, as PI
  • , European Commission - FP6, 10/2006 to 12/2009, £467,074, as PI
  • , European Commission - FP6, 10/2006 to 12/2009, £345,350, as PI
Professional activities and memberships
  • Head of the  research group
  • Editorial Board member, 
  • Associate Editor, 
  • Organising committee member, ASRU 2013
  • Area Chair, Interspeech 2014, Speech Recognition - Signal Processing, Acoustic Modelling, Robustness and Adaptation.
  • Area Chair, ICPR 2014, Track 3 Image, Speech. Signal and Video Processing
  • Programme Committee,