Drill Across & Visualization of Cubes with Non-Conformed Dimensions (2009)
Dariush Riazati, James A. Thom, Xiuzhen Zhang
Data analysts would benefit greatly from the ability to navigate and view combined multidimensional data from multiple sources, a key requirement of which is the conformity between their dimensions....
Use of Wikipedia Categories in Entity Ranking (2008)
Wikipedia is a useful source of knowledge that has many applications in language processing and knowledge representation. The Wikipedia category graph can be compared with the class...
Exploring Human Judgement of Digital Imagery (2008)
Statistical learning methods are commonly applied in content-based video and image retrieval. Such methods require a large number of examples which are usually obtained through a manual annotation...
Challenges Facing the Retrieval and Reuse of Learning Objects (2008)
Michael C. Harris, James A. Thom
This paper investigates organisational, cultural and technical challenges facing the retrieval and reuse of learning objects. Although there are many barriers to learning object reuse,...
Shot Boundary Detection (2008)
We participated in the shot boundary detection and video search tasks. This page provides a summary of our experiments:
Social Media Retrieval Using Image Features and Structured Text (2008)
Jovan Pehcevski, James A. Thom
Use of XML offers a structured approach for representing information while maintaining separation of form and content. XML information retrieval is different from standard text retrieval in...
Combining Image and Structured Text Retrieval (2008)
Jovan Pehcevski, James A. Thom
Two common approaches in retrieving images from a collection are...
Ontology Evaluation Using Wikipedia Categories for Browsing (2008)
Jonathan Yu, James A. Thom, Audrey Tam
Ontology evaluation is a maturing discipline with methodologies and measures being developed and proposed. However, evaluation methods that have been proposed have not been applied to specific...
RMIT University at INEX 2005: Ad hoc Track (2008)
Jovan Pehcevski, James A. Thom
Different scenarios of XML retrieval are analysed in the INEX 2005 ad hoc track, which reflect different query interpretations and user behaviours that may be observed during XML retrieval....
Group Memory Based on the Task Information (2008)
Jonathan Yu, James A. Thom, Leila Alem
A group memory of a project is an information space storing the documents produced and exchanged by members of the group, which may include the electronic discussions that took place during the life...
Video Cut Detection Using Frame Windows (2008)
Hugh E. Williams, James A. Thom, Timo Volkmer
Segmentation is the first step in managing data for many information retrieval tasks. Automatic audio transcriptions and digital video footage are typically continuous data sources that must be...
Running title: Assessing Recall (2007)
Recall and Precision have become the principal measures of the effectiveness of information retrieval systems. Inherent in these measures of performance is the idea of a relevant...
Entity Ranking in Wikipedia (2007)
Anne-Marie Vercoustre, James A. Thom, Jovan Pehcevski
The traditional entity extraction problem lies in the ability of extracting named entities from plain text using natural language processing techniques and intensive training from large document...
Use of Wikipedia Categories in Entity Ranking (2007)
James A. Thom, Jovan Pehcevski, Anne-Marie Vercoustre
Wikipedia is a useful source of knowledge that has many applications in language processing and knowledge representation. The Wikipedia category graph can be compared with the class hierarchy in an...
Evaluating Focused Retrieval Tasks (2007)
Jovan Pehcevski, James A. Thom
Focused retrieval, identified by question answering, passage retrieval, and XML element retrieval, is becoming increasingly important within the broad task of information retrieval. In this paper, we...
Enhancing Content-And-Structure Information Retrieval using a Native XML Database (2005)
Jovan Pehcevski, James A. Thom, Anne-Marie Vercoustre
Three approaches to content-and-structure XML retrieval are analysed in this paper: first by using Zettair, a full-text information retrieval system; second by using eXist, a native XML database, and...
Users and Assessors in the Context of INEX: Are Relevance Dimensions Relevant? (2005)
Jovan Pehcevski, James A. Thom, Anne-Marie Vercoustre
The main aspects of XML retrieval are identified by analysing and comparing the following two behaviours: the behaviour of the assessor when judging the relevance of returned document components; and...
Hybrid XML Retrieval: Combining Information Retrieval and a Native XML Database (2005)
Jovan Pehcevski, James A. Thom, Anne-Marie Vercoustre
This paper investigates the impact of three approaches to XML retrieval: using Zettair, a full-text information retrieval system; using eXist, a native XML database; and using a hybrid system that...
Evaluating an ontology with OntoClean (2005)
Jonathan Yu, James A. Thom, Audrey Tam
HiXEval: Highlighting XML retrieval evaluation (2005)
Jovan Pehcevski, James A. Thom
This paper describes our proposal for an evaluation metric for XML retrieval that is solely based on the highlighted text. We support our decision of ignoring the exhaustivity dimension by...
Hybrid XML retrieval: Combining information retrieval and a native XML database (2005)
Jovan Pehcevski, James A. Thom
This paper investigates the impact of three approaches to XML retrieval: using Zettair, a full-text information retrieval system; using eXist, a native XML database; and using a hybrid...
Is CORI effective for collection selection? An exploration of parameters, queries, and data (2004)
In distributed information retrieval, a wide range of techniques have been proposed for choosing collections to interrogate. Many of these collection-selection techniques are based on...
Collection selection for managed distributed document databases (2004)
In a distributed document database system, a query is processed by passing it to a set of individual collections and collating the responses. For a system with many such collections, it is attractive...
Collection Selection for Managed Distributed Document Databases (2004)
James A. Thom, Justin Zobel
In a distributed document database system, a query is processed by passing it to a set of individual collections and collating the responses. For a system with many such collections, it is attractive...
Hybrid XML retrieval revisited (2004)
Jovan Pehcevski, James A. Thom
The widespread adoption of XML necessitates structure-aware systems that can effectively retrieve information from XML document collections. This paper reports on the participation of the...
The moving query window for shot boundary detection at TREC-12 (2003)
Timo Volkmer, Hugh E. Williams, James A. Thom
Digital video is widely used in multimedia databases and requires effective retrieval techniques. Shot boundary detection is a common first step in analysing video content. The effective detection of...
Indexing documents for queries on structure, content and attributes (1997)
Ron Sacks-Davis, Tuong Dao, James A. Thom, Justin Zobel
Indexing and retrieval techniques for large text databases are well developed, but most of the techniques developed to date assume that the text to be indexed has little or no structure. With the...
A model for word clustering (1992)
It is common to model the distribution of words in text by measures such as the Poisson approximation. However, these measures ignore effects such as clustering: our analysis of document collections...