1. 2018. Crowdsourcing Critical Appraisal of Research Evidence (CrowdCARE) was found to be a valid approach to assessing clinical research quality. In Journal of Clinical Epidemiology,
  2. 2018. Integrating automatic transcription into the language documentation workflow: Experiments with Na data and the PERSEPHONE toolkit. In Language Documentation and Conservation,
  3. 2018. Discourse-aware rumour stance classification in social media using sequential classifiers. In Information Processing & Management,
  4. 2016. Fast, Small and Exact: Infinite-order Language Modelling with Compressed Suffix Trees. In Transactions of ACL (TACL),
  5. 2015. Learning Adaptive Structural Kernels for Natural Language Processing. In Transactions of ACL (TACL),
  6. 2014. Day trading profit maximization with multi-task learning and technical analysis. In Machine Learning, December.
  7. 2015. A Bayesian non-Linear Method for Feature Selection in Machine Translation Quality Estimation. In Machine Translation, June, Volume 29, Issue 2.
  8. 2013. An abstractive approach to sentence compression. In ACM Transactions on Intelligent Systems and Technology, 4.
  9. 2010. Inducing tree-substitution grammars. In Journal of Machine Learning Research, 11.
  10. 2008. Constructing corpora for development and evaluation of paraphrase systems. In Computational Linguistics, 34.
  11. 2009. Sentence compression as tree transduction. In Journal of Artificial Intelligence Research (JAIR), 34.
Conferences and Transactions
  1. 2019. Exploiting Worker Correlation for Label Aggregation in Crowdsourcing. In Proceedings of ICML.
  2. 2019. Multilingual NER Transfer for Low-resource Languages. In Proceedings of ACL.
  3. 2019. Semi-supervised Stochastic Domain Adaptation using Variational Inference. In Proceedings of ACL.
  4. 2019. Putting Evaluation in Context: Contextual Embeddings improve Machine Translation Evaluation. In Proceedings of ACL (short).
  5. 2019. Contextualization of Morphological Inflection. In Proceedings of NAACL (short).
  6. 2019. Truth inference at scale: A Bayesian model for adjudicating highly redundant crowd annotations. In Proceedings of WWW.
  7. 2019. A unified neural architecture for instrumental audio tasks. In Proceedings of ICASSP.
  8. 2018. Natural Language Processing Not-At-All from Scratch: Evaluating The Utility of Hand-crafted Features in Deep Learning. In Proceedings of EMNLP.
  9. 2018. Semi-supervised User Geolocation via Graph Convolutional Networks. In Proceedings of ACL.
  10. 2018. A Stochastic Decoder for Neural Machine Translation. In Proceedings of ACL.
  11. 2018. Graph-to-Sequence Learning using Gated Graph Neural Networks. In Proceedings of ACL.
  12. 2018. Deep-speare: A joint neural model of poetic language, meter and rhyme. In Proceedings of ACL.
  13. 2018. Narrative Modeling with Memory Chains and Semantic Supervision. In Proceedings of ACL (short).
  14. 2018. Towards Robust and Privacy-preserving Text Representations. In Proceedings of ACL (short).
  15. 2018. Content-based Popularity Prediction of Online Petitions Using a Deep Regression Model. In Proceedings of ACL (short).
  16. 2018. Hierarchical Structured Model for Fine-to-coarse Manifesto Text Analysis. In Proceedings of NAACL.
  17. 2018. Recurrent Entity Networks with Delayed Memory Update for Targeted Aspect-based Sentiment Analysis. In Proceedings of NAACL (short).
  18. 2018. What's in a Domain? Learning Domain-Robust Text Representations Using Adversarial Training. In Proceedings of NAACL (short).
  19. 2018. Evaluation Phonemic Transcription of Low-Resource Tonal Languages for Language Documentation. In Proceedings of LREC.
  20. 2017. Capturing Long-range Contextual Dependencies with Memory-enhanced Conditional Random Fields. In Proceedings of IJCNLP.
  21. 2017. End-to-end Network for Twitter Geolocation Prediction and Hashing. In Proceedings of IJCNLP.
  22. 2017. Learning Kernels over Strings using Gaussian Processes. In Proceedings of IJCNLP.
  23. 2017. Continuous Representation of Location for Geolocation and Lexical Dialectology using Mixture Density Networks. In Proceedings of EMNLP.
  24. 2017. Learning how to Active Learn: A Deep Reinforcement Learning Approach. In Proceedings of EMNLP.
  25. 2017. Towards Decoding as Continuous Optimisation in Neural Machine Translation. In Proceedings of EMNLP.
  26. 2017. Sequencing bias in Crowd-sourced annotation for NLP. In Proceedings of EMNLP.
  27. 2017. Modelling the Working Week for Multi-Step Forecasting using Gaussian Process Regression. In Proceedings of IJCAI.
  28. 2017. Compressed Nonparametric Language Modelling. In Proceedings of IJCAI.
  29. 2017. Topically Driven Neural Language Model. In Proceedings of ACL.
  30. 2017. Model Transfer for Tagging Low-resource Languages without Bilingual Corpora. In Proceedings of ACL.
  31. 2017. A Neural Model for User Geolocation and Lexical Dialectology. In Proceedings of ACL.
  32. 2017. Longitudinal Modeling of Social Media with Hawkes Process based on Users and Networks. In Proceedings of ASONAM.
  33. 2017. Context-Aware Prediction of Derivational Word-forms. In Proceedings of EACL (short).
  34. 2017. Robust Training under Linguistic Adversity. In Proceedings of EACL (short).
  35. 2017. Pairwise Webpage Coreference Classification using Distant Supervision. In Proceedings of WWW (posters).
  36. 2017. Cross-Lingual Word Embeddings for Low-Resource Language Modeling. In Proceedings of EACL.
  37. 2017. Multilingual Training of Crosslingual Word Embeddings. In Proceedings of EACL.
  38. 2016. Learning Crosslingual Word Embeddings without Bilingual Corpora. In Proceedings of EMNLP.
  39. 2016. Learning a Lexicon and Translation Model from Phoneme Lattices. In Proceedings of EMNLP (short papers).

    Winner of Best Short Paper Award

  40. 2016. Richer Interpolative Smoothing Based on Modified Kneser-Ney Language Modeling. In Proceedings of EMNLP (short papers).
  41. 2016. Learning Robust Representations of Text. In Proceedings of EMNLP (short papers).
  42. 2016. Learning a Translation Model from Word Lattices. In Proceedings of Interspeech-16.
  43. 2016. Exploiting Tree Kernels for high performance Chemical Induced Disease relation extraction. In Proceedings of International Symposium on Semantic Mining in Biomedicine (SMBM).
  44. 2016. SeeDev Binary Event Extraction using SVMs and a Rich Feature Set. In Proceedings of BioNLP-ST.
  45. 2016. Learning when to trust distant supervision: An application to low-resource POS tagging using cross-lingual projection. In Proceedings of CoNLL-16.
  46. 2016. Exploring Prediction Uncertainty in Machine Translation Quality Estimation,. In Proceedings of CoNLL-16.
  47. 2016. Take and Took, Gaggle and Goose, Book and Read: Evaluating the Utility of Vector Differences for Lexical Relation Learning. In Proceedings of ACL-16.
  48. 2016. Hawkes Processes for Continuous Time Sequence Classification: An Application to Rumour Stance Classification in Twitter. In Proceedings of ACL-16 (Short papers).
  49. 2016. Incorporating Structural Alignment Biases into an Attentional Neural Translation Model. In Proceedings of NAACL-16.
  50. 2016. An Attentional Model for Speech Translation Without Transcription. In Proceedings of NAACL-16.
  51. 2016. Incorporating Context into Recurrent Neural Network Language Models. In Proceedings of NAACL-16 (short).
  52. 2016. Convolution Kernels for Discriminative Learning from Streaming Text. In Proceedings of AAAI-16.
  53. 2016. Studying the temporal dynamics of word co-occurrences: An application to event detection. In Proceedings of LREC-16.
  54. 2015. Inducing Bilingual Lexicons from Small Quantities of Sentence-Aligned Phonemic Transcriptions. In Proceedings of IWSLT-15.
  55. 2015. Low-Resource Neural Network Modelling for Universal Dependency Parsing. In Proceedings of EMNLP-15.
  56. 2015. Compact, Efficient and Unlimited Capacity: Language Modeling with Compressed Suffix Trees. In Proceedings of EMNLP-15.
  57. 2015. Classifying Tweet Level Judgements of Rumours in Social Media. In Proceedings of EMNLP-15 (short).
  58. 2015. Modeling Tweet Arrival Times using Log-Gaussian Cox Processes. In Proceedings of EMNLP-15 (short).
  59. 2015. Cross-lingual Transfer for Unsupervised Dependency Parsing Without Parallel Data. In Proceedings of CoNLL-15.
  60. 2015. Non-Linear Text Regression with a Deep Convolutional Neural Network. In Proceedings of ACL-IJCNLP-15 Short Papers.
  61. 2015. Point process modelling of rumour dynamics in social media. In Proceedings of ACL-IJCNLP-15 Short Papers.
  62. 2015. Twitter User Geolocation Using a Unified Text and Network Prediction Model. In Proceedings of ACL-IJCNLP-15 Short Papers.
  63. 2015. Low Resource Dependency Parsing: Cross-lingual Parameter Sharing in a Neural Network Parser. In Proceedings of ACL-IJCNLP-15 Short Papers.
  64. 2015. Structured Prediction of Sequences and Trees using Infinite Contexts. In Proceedings of ECML-15.
  65. 2015. Exploiting Text and Network Context for Geolocation of Social Media Users. In Proceedings of NAACL-15 Short Papers.
  66. 2015. Predicting Peer-to-Peer Loan Rates using Bayesian Non-Linear Regression. In Proceedings of AAAI-15.
  67. 2014. What We Can Get From 1k Tokens? Case Study of Multilingual POS Tagging For Resource-poor Languages. In Proceedings of EMNLP.
  68. 2014. Joint Emotion Analysis via Multi-task Gaussian Processes. In Proceedings of EMNLP (short).
  69. 2014. Simple extensions for a reparameterised IBM Model 2. In Proceedings of ACL Short papers.
  70. 2014. Factored Markov translation with robust modeling. In Proceedings of CoNLL.
  71. 2014. Data selection for discriminative training in statistical machine translation. In Proceedings of EAMT.
  72. 2014. Predicting and characterising user impact on Twitter. In Proceedings of EACL.
  73. 2013. BLEU deconstructed: Designing a better MT evaluation metric. In Proceedings of CICLING.
  74. 2013. Topic-oriented words as features for named entity recognition. In Proceedings of CICLING.

    Winner of best paper award, 2nd place

  75. 2013. Where's @wally: A classification approach to geolocating users based on their social ties. In Proceedings of Hypertext.

    Winner of Ted Nelson Award

  76. 2013. Mining user behaviors: A study of check-in patterns in location based social networks. In Proceedings of WebSci.
  77. 2013. Reducing annotation effort for quality estimation via active learning. In Proceedings of ACL short papers.
  78. 2013. Modelling annotator bias with multi-task Gaussian Processes: An application to machine translation quality estimation. In Proceedings of ACL.
  79. 2013. A user-centric model of voting intention from social media. In Proceedings of ACL.
  80. 2013. An infinite hierarchical Bayesian model of phrasal translation. In Proceedings of ACL.
  81. 2013. A Markov model of Machine Translation using non-parametric Bayesian inference. In Proceedings of ACL.
  82. 2013. An investigation on the effectiveness of features for translation quality estimation. In Proceedings of MT Summit.
  83. 2013. Adaptation of lecture speech recognition system with machine translation output. In Proceedings of ICASSP.
  84. 2013. A temporal model of text periodicities using Gaussian Processes. In Proceedings of EMNLP.
  85. 2012. Evaluating a morphological analyser of Inuktitut. In Proceedings of NAACL:HLT.
  86. 2012. Left-to-right tree-to-string decoding with prediction. In Proceedings of EMNLP-CoNLL.
  87. 2011. A hierarchical Pitman-Yor process HMM for unsupervised part of speech induction. In Proceedings of ACL-HLT.
  88. 2010. Inducing synchronous grammars with slice sampling. In Proceedings of HLT-NAACL.
  89. 2010. Blocked inference in Bayesian tree substitution grammars. In Proceedings of ACL short papers.
  90. 2010. Unsupervised induction of tree substitution grammars for dependency parsing. In Proceedings of EMNLP.
  91. 2010. Multi-document summarization using A* search and discriminative training. In Proceedings of EMNLP.
  92. 2009. A Bayesian model of syntax-directed tree to string grammar induction. In Proceedings of EMNLP.
  93. 2009. A Gibbs sampler for phrasal synchronous grammar induction. In Proceedings of ACL-IJCNLP.
  94. 2009. A note on the implementation of hierarchical Dirichlet processes. In Proceedings of ACL-IJCNLP short papers.
  95. 2009. Inducing compact but accurate tree-substitution grammars. In Proceedings of HLT-NAACL.
  96. 2009. Word lattices for multi-source translation. In Proceedings of EACL.
  97. 2009. Bayesian synchronous grammar induction. In NIPS.
  98. 2008. Sentence compression beyond word deletion. In Proceedings of COLING.
  99. 2008. ParaMetric: An automatic evaluation metric for paraphrasing. In Proceedings of COLING.
  100. 2008. A discriminative latent variable model for statistical machine translation. In Proceedings of ACL-08: HLT.
  101. 2007. Machine translation by triangulation: Making effective use of multi-parallel corpora. In Proceedings of ACL.
  102. 2007. Large margin synchronous generation and its application to sentence compression. In Proceedings of EMNLP-CoNLL.
  103. 2006. Discriminative word alignment with conditional random fields. In Proceedings of COLING-ACL.
  104. 2006. Efficient inference in large conditional random fields. In Proceedings of ECML.
  105. 2005. Scaling conditional random fields using error-correcting codes. In Proceedings of ACL.
  106. 2005. Semantic role labelling with tree conditional random fields. In Proceedings of CoNLL.
  107. 2005. Logarithmic opinion pools for conditional random fields. In Proceedings of ACL.
Workshops and Demonstrations
  1. 2018. Improved Neural Machine Translation using Side Information. In Proceedings of ALTA.

    Winner of Best Paper Award

  2. 2018. Towards Efficient Machine Translation Evaluation by Modelling Annotators. In Proceedings of ALTA.

    Winner of Best Short Paper Award

  3. 2018. Iterative Back-Translation for Neural Machine Translation. In Proceedings of 2nd Workshop on Neural Machine Translation and Generation.
  4. 2017. Improving End-to-End Memory Networks with Unified Weight Tying. In Proceedings of ALTA.
  5. 2017. Joint Sentence-Document Model for Manifesto Text Analysis. In Proceedings of ALTA.
  6. 2017. Phonemic Transcription of Low-Resource Tonal Languages. In Proceedings of ALTA.
  7. 2017. Word Representation Models for Morphologically Rich Languages in Neural Machine Translation. In Proceedings of SCLEM workshop at EMNLP.
  8. 2016. Proceedings of the Australasian Language Technology Association (ALTA) workshop. In
  9. 2016. ASM Kernel: Graph Kernel using Approximate Subgraph Matching for Relation Extraction. In Proceedings of ALTA.
  10. 2016. Improving Neural Translation Models with Linguistic Factors. In Proceedings of ALTA.

    Winner of best student paper award

  11. 2016. pigeo: A Python Geotagging Tool. In Proceedings of ACL-16 (Demonstrations).
  12. 2016. Document Context Language Models. In Proceedings of ICLR-16 Workshop.
  13. 2014. Extracting socioeconomic patterns from the news: Modelling text and outlet importance jointly. In Proceedings of ACL LACSS.
  14. 2013. QuEst - a translation quality estimation framework. In Proceedings of ACL demonstrations.
  15. 2013. SHEF-Lite: when less is more for translation quality estimation. In Proceedings of WMT.
  16. 2012. Proceedings of the NAACL-HLT workshop on the induction of linguistic structure. In
  17. 2012. Using senses in HMM word alignment. In Proceedings of WILS.
  18. 2012. The PASCAL challenge on grammar induction. In Proceedings of WILS.
  19. 2012. Trendminer: An architecture for real time analysis of social media text. In Proceedings of RAMSS.
  20. 2011. Regression and ranking based optimisation for sentence level machine translation evaluation. In Proceedings of WMT.
  21. 2003. Performance metrics for word sense disambiguation. In Proceedings of ALTW.

Visiting positions



Invited Talks

Academic Courses

Invited Tutorials and Short Courses

Staff and Students Supervised

Thesis examinations

Professional Service

Engineering Positions