My research spans two broad areas: Natural Language Processing and Machine Learning. Enabling computers to understand human language has long been a central goal of AI. Human language is an intricate system: each sentence has its own grammatical structure, interconnected references, and set of possible meanings. The field of Natural Language Processing (NLP) aims to build computational models of language in order to make predictions from real-world textual data. Example applications of NLP include machine translation, information extraction, and question answering. Tools developed for these problems are increasingly part of daily life, from speech and dialogue systems on mobile devices to structured search on the web and real-time translation. NLP is a rich intersection of formal modelling, applied algorithms, and scalable data systems, and has served as an important application domain for related fields such as Machine Learning (ML).

My research aligns with the Machine Learning flagship in our faculty. In the past, I have organised the following reading groups in our faculty: Deep Learning Reading Group, The Machine Learning Book Reading Group, and Natural Language Processing Reading Group.

Research Areas

  • Deep learning methods, particularly why they work and how to use them well.
  • Structured prediction, particularly predicting complex linguistic structure.
  • Discourse, semantics, syntax, and morphology; particularly for machine translation.
  • Advanced data structures (e.g. succinct suffix trees) for large-scale NLP problems, such as language modelling.
  • Learning with limited amounts of supervision and large amounts of unannotated data; learning across different domains of data.
  • Learning and inference in probabilistic graphical models, particularly for NLP problems.
  • Non-parametric Bayesian models, particularly for NLP.
  • Reinforcement learning, Markov decision processes, and multi-armed bandits.
  • Dialogue systems, particularly using deep learning approaches.
  • Learning programs from data, particularly with deep learning.


Current and Past Projects

  • Improving the quality of primary health care for TAC clients. 2016-2017. Transport Accident Commission (TAC). With Behrooz Hassani Mahmooei, Ann Nicholson et al.; $174k.

  • Learning Deep Semantics for Automatic Translation between Human Languages. 2016-2019. ARC Discovery. With Trevor Cohn; $450k.

  • Data Analysis of Victoria Police Incident and Injuries Data. 2016. Victoria Police. With Behrooz Hassani Mahmooei and Carlyn Muir; $65k.

  • Large-scale qualitative data analysis. 2014-2015. Victoria Government. With Mark Carman, Wray Buntine, and Geoff Webb; $65k.

  • Scalable Semi-Supervised Learning for Structured Prediction. 2014-2015. NICTA. With Kim Marriot and Geoff Webb; $49k.