Entity Ranking in Wikipedia (2008)
Vercoustre, Anne-Marie, Thom, James, Pehcevski, Jovan
The traditional entity extraction problem lies in the ability of extracting named entities from plain text using natural language processing techniques and intensive training from large document...
Using Wikipedia Categories and Links in Entity Ranking (2008)
Vercoustre, Anne-Marie, Pehcevski, Jovan, Thom, James
This paper describes the participation of the INRIA group in the INEX 2007 XML entity ranking and ad hoc tracks. We developed a system for ranking Wikipedia entities in answer to a query. Our...
Exploiting Locality of Wikipedia Links in Entity Ranking (2008)
Pehcevski, Jovan, Vercoustre, Anne-Marie, Thom, James
Information retrieval from web and XML document collections is ever more focused on returning entities instead of web pages or XML elements. There are many research fields involving named entities;...
CSIRO Mathematical and Information Sciences (2007)
Nick Craswell, David Hawking, James Thom, Trystan Upstill, Ross Wilkinson, Mingfang Wu
web and interactive. This year's Web track participation was a preliminary exploration of forms of evidence which might be useful for named page finding and topic distillation. For this reason,...
Technologies for Electronic Documents, CSIRO Mathematical and Information Sciences (2007)
Nick Craswell, David Hawking, James Thom, Trystan Upstill, Ross Wilkinson, Mingfang Wu
This year, the CSIRO teams participated and completed runs in two tracks: web and interactive. Our web track participation was a preliminary exploration of forms of evidence which might be useful for...
Entity ranking in Wikipedia (2007)
Vercoustre, Anne-Marie, Thom, James, Pehcevski, Jovan
The traditional entity extraction problem lies in the ability of extracting named entities from plain text using natural language processing techniques and intensive training from large document...
Use of Wikipedia Categories in Entity Ranking (2007)
Thom, James, Pehcevski, Jovan, Vercoustre, Anne-Marie
Wikipedia is a useful source of knowledge that has many applications in language processing and knowledge representation. The Wikipedia category graph can be compared with the class hierarchy in an...
Users and Assessors in the Context of INEX: Are Relevance Dimensions Relevant? (2005)
Pehcevski, Jovan, Thom, James, Vercoustre, Anne-Marie
The main aspects of XML retrieval are identified by analysing and comparing the following two behaviours: the behaviour of the assessor when judging the relevance of returned document components; and...
Hybrid XML Retrieval: Combining Information Retrieval and a Native XML Database (2005)
Pehcevski, Jovan, Thom, James, Vercoustre, Anne-Marie
This paper investigates the impact of three approaches to XML retrieval: using Zettair, a full-text information retrieval system; using eXist, a native XML database; and using a hybrid system that...
Hybrid XML Retrieval Revisited (2005)
Pehcevski, Jovan, Thom, James, Tahaghoghi, S., Vercoustre, Anne-Marie
The widespread adoption of XML necessitates structure-aware systems that can effectively retrieve information from XML document collections. This paper reports on the participation of the RMIT group...
Enhancing Content-And-Structure Information Retrieval using a Native XML Database (2004)
Pehcevski, Jovan, Thom, James, Vercoustre, Anne-Marie
Three approaches to content-and-structure XML retrieval are analysed in this paper: first by using Zettair, a full-text information retrieval system; second by using eXist, a native XML database, and...
RMIT INEX experiments: XML Retrieval using Lucy and eXist (2003)
Pehcevski, Jovan, Thom, James, Vercoustre, Anne-Marie
This paper reports on the RMIT group's approach to XML retrieval while participating in INEX 2003. We indexed XML documents using Lucy, a compact and fast text search engine designed and written by...
XML-search Query Language: Needs and Requirements (2003)
Pehcevski, Jovan, Thom, James, Vercoustre, Anne-Marie
This paper explores the needs of XML-search and makes comparisons between the INEX experience in evaluating XML retrieval and the recently proposed W3C requirements for extending XQuery and XPath to...
CSIRO INEX Experiments: XML Search using PADRE (2002)
Vercoustre, Anne-Marie, Thom, James, Krumpholz, Alexander, Mathieson, Ian, Wilkins, Peter, Wu, Mingfang, ...
This paper reports on the CSIRO group's participation in INEX. We indexed documents and document fragments using PADRE, the core of CSIRO's Panoptic Enterprise Search Engine. A query translator...
The Melbourne TREC-9 Experiments (2000)
Michael Fuller, James Thom, Phil Vines, Justin Zobel, Owen De Kretser, Ross Wilkinson, ...
We report results for experiments conducted in Melbourne (at CSIRO, RMIT, and The University of Melbourne) for TREC-9. We present results for the
Design of document database systems (1993)
Typescript (photocopy) Includes bibliographical references (leaves 192-209) and index (leaves 212-213)
Privacy protection, technological change and law reform (1985)
Typescript (photocopy) Errata slip inserted. Bibliography: leaves 146-161.
Petrified man.
TREC11 Web and Interactive Tracks at CSIRO
Nick Craswell, David Hawking, James Thom, Trystan Upstill, Ross Wilkinson, Mingfang Wu
topics as time increases. 2. The web track. 2.1. Topic distillation. In topic distillation we used the following forms of evidence: BM25 on content. Pages returned should be relevant. We indexed the...