James Thom

Details der Publikationsliste

Zeitraum

1854 - 2008

Anzahl

40

Co-Autoren

Entity Ranking in Wikipedia (2008)

Vercoustre, Anne-Marie, Thom, James, Pehcevski, Jovan

The traditional entity extraction problem lies in the ability of extracting named entities from plain text using natural language processing techniques and intensive training from large document...

Entity Ranking in Wikipedia (2008)

Vercoustre, Anne-Marie, Thom, James, Pehcevski, Jovan

The traditional entity extraction problem lies in the ability of extracting named entities from plain text using natural language processing techniques and intensive training from large document...

Using Wikipedia Categories and Links in Entity Ranking (2008)

Vercoustre, Anne-Marie, Pehcevski, Jovan, Thom, James

This paper describes the participation of the INRIA group in the INEX 2007 XML entity ranking and ad hoc tracks. We developed a system for ranking Wikipedia entities in answer to a query. Our...

Using Wikipedia Categories and Links in Entity Ranking (2008)

Vercoustre, Anne-Marie, Pehcevski, Jovan, Thom, James

This paper describes the participation of the INRIA group in the INEX 2007 XML entity ranking and ad hoc tracks. We developed a system for ranking Wikipedia entities in answer to a query. Our...

Exploiting Locality of Wikipedia Links in Entity Ranking (2008)

Pehcevski, Jovan, Vercoustre, Anne-Marie, Thom, James

Information retrieval from web and XML document collections is ever more focused on returning entities instead of web pages or XML elements. There are many research fields involving named entities;...

Exploiting Locality of Wikipedia Links in Entity Ranking (2008)

Pehcevski, Jovan, Vercoustre, Anne-Marie, Thom, James

Information retrieval from web and XML document collections is ever more focused on returning entities instead of web pages or XML elements. There are many research fields involving named entities;...

CSIRO Mathematical and Information Sciences (2007)

Nick Craswell, David Hawking, James Thom, Trystan Upstill, Ross Wilkinson, Mingfang Wu

web and interactive. This year's Web track participation was a preliminary exploration of forms of evidence which might be useful for named page finding and topic distillation. For this reason,...

4Technologies for Electronic Documents CS1RO Mathematical and Information Sciences (2007)

Nick Craswell, David Hawking, James Thom, Trystan Upstilp, Ross Wilkinson, Mingfang Wu

This year, the CSIRO teams participated and completed runs in two tracks: web and interactive. Our web track participation was a preliminary exploration of forms of evidence which might be useful for...

Entity ranking in Wikipedia (2007)

Vercoustre, Anne-Marie, Thom, James, Pehcevski, Jovan

The traditional entity extraction problem lies in the ability of extracting named entities from plain text using natural language processing techniques and intensive training from large document...

Entity ranking in Wikipedia (2007)

Vercoustre, Anne-Marie, Thom, James, Pehcevski, Jovan

The traditional entity extraction problem lies in the ability of extracting named entities from plain text using natural language processing techniques and intensive training from large document...

Entity Ranking in Wikipedia (2007)

Vercoustre, Anne-Marie, Thom, James, Pehcevski, Jovan

The traditional entity extraction problem lies in the ability of extracting named entities from plain text using natural language processing techniques and intensive training from large document...

Entity Ranking in Wikipedia (2007)

Vercoustre, Anne-Marie, Thom, James, Pehcevski, Jovan

The traditional entity extraction problem lies in the ability of extracting named entities from plain text using natural language processing techniques and intensive training from large document...

Use of Wikipedia Categories in Entity Ranking (2007)

Thom, James, Pehcevski, Jovan, Vercoustre, Anne-Marie

Wikipedia is a useful source of knowledge that has many applications in language processing and knowledge representation. The Wikipedia category graph can be compared with the class hierarchy in an...

Use of Wikipedia Categories in Entity Ranking (2007)

Thom, James, Pehcevski, Jovan, Vercoustre, Anne-Marie

Wikipedia is a useful source of knowledge that has many applications in language processing and knowledge representation. The Wikipedia category graph can be compared with the class hierarchy in an...

Users and Assessors in the Context of INEX: Are Relevance Dimensions Relevant? (2005)

Pehcevski, Jovan, Thom, James, Vercoustre, Anne-Marie

The main aspects of XML retrieval are identified by analysing and comparing the following two behaviours: the behaviour of the assessor when judging the relevance of returned document components; and...

Hybrid XML Retrieval: Combining Information Retrieval and a Native XML Database (2005)

Pehcevski, Jovan, Thom, James, Vercoustre, Anne-Marie

This paper investigates the impact of three approaches to XML retrieval: using Zettair, a full-text information retrieval system; using eXist, a native XML database; and using a hybrid system that...

Hybrid XML Retrieval Revisited (2005)

Pehcevski, Jovan, Thom, James, Tahaghoghi, S., Vercoustre, Anne-Marie

The widespread adoption of XML necessitates structureaware systems that can effectively retrieve information from XML document collections. This paper reports on the participation of the RMIT group...

Hybrid XML Retrieval: Combining Information Retrieval and a Native XML Database (2005)

Pehcevski, Jovan, Thom, James, Vercoustre, Anne-Marie

This paper investigates the impact of three approaches to XML retrieval: using Zettair, a full-text information retrieval system; using eXist, a native XML database; and using a hybrid system that...

Users and Assessors in the Context of INEX: Are Relevance Dimensions Relevant? (2005)

Pehcevski, Jovan, Thom, James, Vercoustre, Anne-Marie

The main aspects of XML retrieval are identified by analysing and comparing the following two behaviours: the behaviour of the assessor when judging the relevance of returned document components; and...

Hybrid XML Retrieval Revisited (2005)

Pehcevski, Jovan, Thom, James, Tahaghoghi, S., Vercoustre, Anne-Marie

The widespread adoption of XML necessitates structureaware systems that can effectively retrieve information from XML document collections. This paper reports on the participation of the RMIT group...

Users and Assessors in the Context of INEX: Are Relevance Dimensions Relevant? (2005)

Pehcevski, Jovan, Thom, James, Vercoustre, Anne-Marie

The main aspects of XML retrieval are identified by analysing and comparing the following two behaviours: the behaviour of the assessor when judging the relevance of returned document components; and...

Hybrid XML Retrieval Revisited (2005)

Pehcevski, Jovan, Thom, James, Tahaghoghi, S., Vercoustre, Anne-Marie

The widespread adoption of XML necessitates structureaware systems that can effectively retrieve information from XML document collections. This paper reports on the participation of the RMIT group...

Hybrid XML Retrieval: Combining Information Retrieval and a Native XML Database (2005)

Pehcevski, Jovan, Thom, James, Vercoustre, Anne-Marie

This paper investigates the impact of three approaches to XML retrieval: using Zettair, a full-text information retrieval system; using eXist, a native XML database; and using a hybrid system that...

Enhancing Content-And-Structure Information Retrieval using a Native XML Database (2004)

Pehcevski, Jovan, Thom, James, Vercoustre, Anne-Marie

Three approaches to content-and-structure XML retrieval are analysed in this paper: first by using Zettair, a full-text information retrieval system; second by using eXist, a native XML database, and...

Enhancing Content-And-Structure Information Retrieval using a Native XML Database (2004)

Pehcevski, Jovan, Thom, James, Vercoustre, Anne-Marie

Three approaches to content-and-structure XML retrieval are analysed in this paper: first by using Zettair, a full-text information retrieval system; second by using eXist, a native XML database, and...

Enhancing Content-And-Structure Information Retrieval using a Native XML Database (2004)

Pehcevski, Jovan, Thom, James, Vercoustre, Anne-Marie

Three approaches to content-and-structure XML retrieval are analysed in this paper: first by using Zettair, a full-text information retrieval system; second by using eXist, a native XML database, and...

RMIT INEX experiments: XML Retrieval using Lucy and eXist (2003)

Pehcevski, Jovan, Thom, James, Vercoustre, Anne-Marie

This paper reports on the RMIT group's approach to XML retrieval while participating in INEX 2003. We indexed XML documents using Lucy, a compact and fast text search engine designed and written by...

RMIT INEX experiments: XML Retrieval using Lucy and eXist (2003)

Pehcevski, Jovan, Thom, James, Vercoustre, Anne-Marie

This paper reports on the RMIT group's approach to XML retrieval while participating in INEX 2003. We indexed XML documents using Lucy, a compact and fast text search engine designed and written by...

XML-search Query Language: Needs and Requirements (2003)

Pehcevski, Jovan, Thom, James, Vercoustre, Anne-Marie

This paper explores the needs of XML-search and makes comparisons between the INEX experience in evaluating XML retrieval and the recently proposed W3C requirements for extending XQuery and XPath to...

XML-search Query Language: Needs and Requirements (2003)

Pehcevski, Jovan, Thom, James, Vercoustre, Anne-Marie

This paper explores the needs of XML-search and makes comparisons between the INEX experience in evaluating XML retrieval and the recently proposed W3C requirements for extending XQuery and XPath to...

RMIT INEX experiments: XML Retrieval using Lucy and eXist (2003)

Pehcevski, Jovan, Thom, James, Vercoustre, Anne-Marie

This paper reports on the RMIT group's approach to XML retrieval while participating in INEX 2003. We indexed XML documents using Lucy, a compact and fast text search engine designed and written by...

XML-search Query Language: Needs and Requirements (2003)

Pehcevski, Jovan, Thom, James, Vercoustre, Anne-Marie

This paper explores the needs of XML-search and makes comparisons between the INEX experience in evaluating XML retrieval and the recently proposed W3C requirements for extending XQuery and XPath to...

CSIRO INEX Experiments: XML Search using PADRE (2002)

Vercoustre, Anne-Marie, Thom, James, Krumpholz, Alexander, Mathieson, Ian, Wilkins, Peter, Wu, Mingfang, ...

This paper reports on the CSIRO group's participation in INEX. We indexed documents and document fragments using PADRE, the core of CSIRO's Panoptic Enterprise Search Engine. A query translator...

CSIRO INEX Experiments: XML Search using PADRE (2002)

Vercoustre, Anne-Marie, Thom, James, Krumpholz, Alexander, Mathieson, Ian, Wilkins, Peter, Wu, Mingfang, ...

This paper reports on the CSIRO group's participation in INEX. We indexed documents and document fragments using PADRE, the core of CSIRO's Panoptic Enterprise Search Engine. A query translator...

CSIRO INEX Experiments: XML Search using PADRE (2002)

Vercoustre, Anne-Marie, Thom, James, Krumpholz, Alexander, Mathieson, Ian, Wilkins, Peter, Wu, Mingfang, ...

This paper reports on the CSIRO group's participation in INEX. We indexed documents and document fragments using PADRE, the core of CSIRO's Panoptic Enterprise Search Engine. A query translator...

The Melbourne TREC-9 Experiments (2000)

Michael Fuller, James Thom, Phil Vines, Justin Zobel, Owen De Kretser, Ross Wilkinson, ...

We report results for experiments conducted in Melbourne---at CSIRO, RMIT, and The University of Melbourne---for TREC-9. We present results for the

Design of document database systems (1993)

Thom, James.

Typescript (photocopy) Includes bibliographical references (leaves 192-209) and index (leaves 212-213)

Privacy protection, technological change and law reform (1985)

Thom, James.

Typescript (photocopy) Errata slip inserted. Bibliography: leaves 146-161.

TREC11 Web and Interactive Tracks at CSIRO

Nick Craswell David, David Hawking, James Thom, Trystan Upstill, Ross Wilkinson, Mingfang Wu

topics as time increases. 2. The web track 2.1. Topic distillation In topic distillation we used the following forms of evidence: . BM25 on content. Pages returned should be relevant. We indexed the...