Mark James Carman

Senior Lecturer
Caulfield School of Information Technology
Monash University

email:
phone: +61 3 9903 4329

office: H6.46, Monash University Caulfield Campus
900 Dandenong Road, Caulfield East, Victoria

postal address: PO Box 197, Caulfield East, Victoria 3145, Australia

Research Interests and Expertise

Dr. Carman is an expert in Data Science, Data Mining and the analysis of Big Data. His primary research interests are Information Retrieval and Machine Learning. His specific areas of expertise include: His research and interests span theoretical studies (e.g. investigating statistical properties of information retrieval measures), through to practical applications (e.g. technology for assisting police during digital forensic investigations).

Dr. Carman has authored a large number of publications in prestigious venues, including full papers at SIGIR, KDD, IJCAI, CIKM, ECIR, WSDM, HT, CoNLL, EACL , HCOMP and ICDAR, and articles in TOIS, IR, JMLR, ML, PR, JAIR and IP&M. According to Google Scholar, his publications have a citation count over 1150 and an h-index of 17.

Major contributions of his research career have included developing state-of-the-art techniques for:

A Brief Bio (see also CV)

Mark Carman is a senior lecturer at Monash University in Melbourne, Australia. He joined Monash in 2010 after three years as a postdoc at the University of Lugano. He received his PhD from the University of Trento in 2006 after working at the Fondazione Bruno Kessler and the Information Sciences Institute (USC). Mark works primarily in information retrieval, applying and extending statistical machine learning techniques to the modelling of users and user-generated content. He has served on the program committees of many IR/DM conferences (SIGIR, ECIR, KDD, CIKM, EMNLP, AAAI, ACML, etc.) and is an Associate Editor for TOIS.

PhD Students

Graduated: Under examination: Current:

Publications (see Google Scholar page)

The following list of publications is hopelessly out of date. Please see this Google Scholar page for a more complete list including more recent publications. If you can't find or get access to the paper you're after please send an email to .
Improving social bookmark search using personalised latent variable language models
Morgan Harvey, Mark James Carman, and Ian Ruthven.
4th ACM International Conference on Web Search and Data Mining, (WSDM 2011), 2011.
A multi-collection latent topic model for federated search
Mark Baillie, Mark Carman, and Fabio Crestani.
Information Retrieval, (to appear).
Towards query log based personalization using topic models
Mark James Carman, Fabio Crestani, Morgan Harvey, and Mark Baillie.
19th ACM Conference on Information and Knowledge Management, (CIKM 2010), 2010.
Ranking social bookmarks using topic models
Morgan Harvey, Mark James Carman, and Ian Ruthven.
19th ACM Conference on Information and Knowledge Management, (CIKM 2010), 2010.
Proximity Based Opinion Retrieval
Shima Gerani, Mark Carman, Fabio Crestani.
33rd Annual International ACM SIGIR Conference (SIGIR 2010), 2010.
Tripartite Hidden Topic Models for Personalised Tag Suggestion
Morgan Harvey, Mark Baillie, Ian Ruthven, and Mark Carman.
32nd European Conference on IR Research (ECIR 2010), 2010.
Statistics of Online User-Generated Short Documents
Giacomo Inches, Mark Carman and Fabio Crestani.
32nd European Conference on IR Research (ECIR 2010), 2010. (poster)
Piloted Search and Recommendation with Social Tag Cloud-Based Navigation
Cedric Mesnage and Mark Carman.
1st Workshop On Music Recommendation And Discovery (WOMRAD) at ACM RecSys, 2010.
A Statistical Comparison of Tag and Query Logs
Mark Carman, Mark Baillie, Robert Gwadera and Fabio Crestani.
32nd Annual International ACM SIGIR Conference on Research & Development on Information Retrieval (SIGIR 2009), 2009.
Blog Distillation using Random Walks
Mostafa Keikha, Mark Carman and Fabio Crestani.
32nd Annual International ACM SIGIR Conference on Research & Development on Information Retrieval (SIGIR 2009), 2009. (poster)
A Topic-based Measure of Resource Description Quality for Distributed Information Retrieval
Mark Baillie, Mark Carman and Fabio Crestani.
31st European Conference on Information Retrieval (ECIR 2009), Toulouse, France, 2009
Investigating Learning Approaches for Blog Post Opinion Retrieval
Shima Gerani, Mark Carman and Fabio Crestani.
31st European Conference on Information Retrieval (ECIR 2009), Toulouse, France, 2009
Tag Data and Personalized Information Retrieval
Mark J. Carman, Mark Baillie and Fabio Crestani.
CIKM 2008 Workshop on Search in Social Media (SSM 2008), 2008
Exploiting data semantics to discover, extract, and model web sources
José Luis Ambite, Craig A. Knoblock, Kristina Lerman, Anon Plangprasopchok, Thomas Russ, Cenk Gazen, Steven Minton and Mark Carman.
First International Workshop on Semantic Aspects in Data Mining (SADM'08), 2008
Beyond the Elves: Making Intelligent Agents Intelligent
Craig A. Knoblock, José Luis Ambite, Mark Carman, Matthew Michelson, Pedro Szekely and Rattapoom Tuchinda.
AI Magazine, 2008
Learning Semantic Definitions of Online Information Sources
Mark James Carman and Craig A. Knoblock.
Journal of Artificial Intelligence Research (JAIR), volume 30, pages 1-50, 2007
Learning Semantic Descriptions of Web Information Sources
Mark James Carman and Craig A. Knoblock.
Twentieth International Joint Conference on Artificial Intelligence (IJCAI-07). Hyderabad, India, January 2007
Learning Semantic Definitions of Information Sources on the Internet
Mark James Carman.
Doctorate Thesis, (Advisors: Paolo Traverso and Craig A. Knoblock),
Department of Information and Communication Technologies, University of Trento, August 2006
Inducing Source Descriptions for Automated Web Service Composition
Mark James Carman and Craig A. Knoblock.
AAAI 2005 Workshop on Exploring Planning and Scheduling for Web Services, Grid, and Autonomic Computing. 2005
Web Service Composition as Planning
Mark Carman, Luciano Serafini and Paolo Traverso.
ICAPS'03 Workshop on Planning for Web Services, Trento, Italy, June 2003
Planning for Web Services the Hard Way
Mark Carman and Luciano Serafini.
SAINT'03 Workshop on Service Oriented Computing, Orlando, USA, January 2003
Towards an Economy-Based Optimisation of File Access and Replication on a Data Grid
Mark Carman, Floriano Zini, Luciano Serafini and Kurt Stockinger.
International Workshop on Agent based Cluster and Grid Computing at International Symposium on Cluster Computing and the Grid (CCGrid'2002), 2002
A Request Language for Web-Services based on Planning and Constraint Satisfaction
M. Aiello, M. Papazoglou, J. Yang, M. Carman, M. Pistore, L. Serafini and P. Traverso.
VLDB workshop on Technologies for E-Services (TES), 2002

Software

EIDOS (Efficiently Inducing Definitions for Online Sources) is a system for learning semantic descriptions of online information sources (such as these RSS feeds). The descriptions are used to automatically integrate the sources into (mediator based) information integration systems. A complete description of the purpose and functionality of the system can be found in my thesis. You can also have a look at the slides I presented at my defense. The software can be downloaded from the ISI website. It is royalty-free for research purposes and comes with all the source code. Here is the latest documentation. Feel free to contact me with installation questions.



Monash Personal Page Disclaimer
This web page is not authorized by Monash University and any opinions expressed on the page are those of the author and not those of the University.