INEX 2007 Evaluation Measures (Draft) (2008)
Jovan Pehcevski, Jaap Kamps, Gabriella Kazai, Mounia Lalmas, Benjamin Piwowarski, Stephen Robertson
Abstract. This paper describes the official measures of retrieval effectiveness that are planned to be employed for the ad hoc track of INEX 2007. 1
Social Media Retrieval Using Image Features and Structured Text (2008)
Jovan Pehcevski, James A. Thom
Abstract. Use of XML offers a structured approach for representing information while maintaining separation of form and content. XML information retrieval is different from standard text retrieval in...
Combining Image and Structured Text Retrieval (2008)
Jovan Pehcevski, James A. Thom
Abstract. Two common approaches in retrieving images from a collection are
RMIT University at INEX 2005: Ad hoc Track (2008)
Jovan Pehcevski, James A. Thom
Abstract. Different scenarios of XML retrieval are analysed in the INEX 2005 ad hoc track, which reflect different query interpretations and user behaviours that may be observed during XML retrieval....
Entity Ranking in Wikipedia (2008)
Vercoustre, Anne-Marie, Thom, James, Pehcevski, Jovan
The traditional entity extraction problem lies in the ability of extracting named entities from plain text using natural language processing techniques and intensive training from large document...
Entity Ranking in Wikipedia (2008)
Vercoustre, Anne-Marie, Thom, James, Pehcevski, Jovan
The traditional entity extraction problem lies in the ability of extracting named entities from plain text using natural language processing techniques and intensive training from large document...
Using Wikipedia Categories and Links in Entity Ranking (2008)
Vercoustre, Anne-Marie, Pehcevski, Jovan, Thom, James
This paper describes the participation of the INRIA group in the INEX 2007 XML entity ranking and ad hoc tracks. We developed a system for ranking Wikipedia entities in answer to a query. Our...
Using Wikipedia Categories and Links in Entity Ranking (2008)
Vercoustre, Anne-Marie, Pehcevski, Jovan, Thom, James
This paper describes the participation of the INRIA group in the INEX 2007 XML entity ranking and ad hoc tracks. We developed a system for ranking Wikipedia entities in answer to a query. Our...
Exploiting Locality of Wikipedia Links in Entity Ranking (2008)
Pehcevski, Jovan, Vercoustre, Anne-Marie, Thom, James
Information retrieval from web and XML document collections is ever more focused on returning entities instead of web pages or XML elements. There are many research fields involving named entities;...
Exploiting Locality of Wikipedia Links in Entity Ranking (2008)
Pehcevski, Jovan, Vercoustre, Anne-Marie, Thom, James
Information retrieval from web and XML document collections is ever more focused on returning entities instead of web pages or XML elements. There are many research fields involving named entities;...
Entity Ranking in Wikipedia (2007)
Vercoustre, Anne-Marie, Thom, James A., Pehcevski, Jovan
The traditional entity extraction problem lies in the ability of extracting named entities from plain text using natural language processing techniques and intensive training from large document...
Use of Wikipedia Categories in Entity Ranking (2007)
Thom, James A., Pehcevski, Jovan, Vercoustre, Anne-Marie
Wikipedia is a useful source of knowledge that has many applications in language processing and knowledge representation. The Wikipedia category graph can be compared with the class hierarchy in an...
Entity ranking in Wikipedia (2007)
Vercoustre, Anne-Marie, Thom, James, Pehcevski, Jovan
The traditional entity extraction problem lies in the ability of extracting named entities from plain text using natural language processing techniques and intensive training from large document...
Entity ranking in Wikipedia (2007)
Vercoustre, Anne-Marie, Thom, James, Pehcevski, Jovan
The traditional entity extraction problem lies in the ability of extracting named entities from plain text using natural language processing techniques and intensive training from large document...
Evaluating Focused Retrieval Tasks (2007)
Pehcevski, Jovan, Thom, James A.
Focused retrieval, identified by question answering, passage retrieval, and XML element retrieval, is becoming increasingly important within the broad task of information retrieval. In this paper, we...
Evaluating Focused Retrieval Tasks (2007)
Pehcevski, Jovan, Thom, James A.
Focused retrieval, identified by question answering, passage retrieval, and XML element retrieval, is becoming increasingly important within the broad task of information retrieval. In this paper, we...
Entity Ranking in Wikipedia (2007)
Vercoustre, Anne-Marie, Thom, James, Pehcevski, Jovan
The traditional entity extraction problem lies in the ability of extracting named entities from plain text using natural language processing techniques and intensive training from large document...
Entity Ranking in Wikipedia (2007)
Vercoustre, Anne-Marie, Thom, James, Pehcevski, Jovan
The traditional entity extraction problem lies in the ability of extracting named entities from plain text using natural language processing techniques and intensive training from large document...
INEX 2007 Evaluation Measures (2007)
Pehcevski, Jovan, Kamps, Jaap, Kazai, Gabriella, Lalmas, Mounia, Ogilvie, Paul, Piwowarski, Benjamin, ...
This paper describes the official measures of retrieval effectiveness that are planned to be employed for the ad hoc track of INEX 2007.
Evaluating Relevant in Context: Document Retrieval with a Twist (2007)
Kamps, Jaap, Lalmas, Mounia, Pehcevski, Jovan
The Relevant in Context retrieval task is document or article retrieval with a twist, where not only the relevant articles should be retrieved but also the relevant information within each article...
Pehcevski, Jovan, Piwowarski, Benjamin
An evaluation metric is used to evaluate the effectiveness of information retrieval systems and to justify theoretical and/or pragmatical developments of these systems. It consists of a set of...
INEX 2006 Evaluation Measures (2007)
Lalmas, Mounia, Kazai, Gabriella, Kamps, Jaap, Pehcevski, Jovan, Piwowarski, Benjamin, Robertson, Stephen
This paper describes the official measures of retrieval effectiveness employed at the ad hoc track of INEX 2006.
INEX 2007 Evaluation Measures (2007)
Pehcevski, Jovan, Kamps, Jaap, Kazai, Gabriella, Lalmas, Mounia, Ogilvie, Paul, Piwowarski, Benjamin, ...
This paper describes the official measures of retrieval effectiveness that are planned to be employed for the ad hoc track of INEX 2007.
Evaluating Relevant in Context: Document Retrieval with a Twist (2007)
Kamps, Jaap, Lalmas, Mounia, Pehcevski, Jovan
The Relevant in Context retrieval task is document or article retrieval with a twist, where not only the relevant articles should be retrieved but also the relevant information within each article...
Pehcevski, Jovan, Piwowarski, Benjamin
An evaluation metric is used to evaluate the effectiveness of information retrieval systems and to justify theoretical and/or pragmatical developments of these systems. It consists of a set of...
INEX 2006 Evaluation Measures (2007)
Lalmas, Mounia, Kazai, Gabriella, Kamps, Jaap, Pehcevski, Jovan, Piwowarski, Benjamin, Robertson, Stephen
This paper describes the official measures of retrieval effectiveness employed at the ad hoc track of INEX 2006.
Pehcevski, Jovan, Larsen, Birger
Relevance is the extent to which some information is pertinent, connected, or applicable to the matter at hand. It represents a key concept in the fields of documentation, information science, and...
Pehcevski, Jovan, Piwowarski, Benjamin
Specificity is a relevance dimension that describes the extent to which a document part focuses on the topic of request. In the context of semi-structured text (XML) retrieval, a document part...
Pehcevski, Jovan, Larsen, Birger
Relevance is the extent to which some information is pertinent, connected, or applicable to the matter at hand. It represents a key concept in the fields of documentation, information science, and...
Pehcevski, Jovan, Piwowarski, Benjamin
Specificity is a relevance dimension that describes the extent to which a document part focuses on the topic of request. In the context of semi-structured text (XML) retrieval, a document part...
Use of Wikipedia Categories in Entity Ranking (2007)
Thom, James, Pehcevski, Jovan, Vercoustre, Anne-Marie
Wikipedia is a useful source of knowledge that has many applications in language processing and knowledge representation. The Wikipedia category graph can be compared with the class hierarchy in an...
Use of Wikipedia Categories in Entity Ranking (2007)
Thom, James, Pehcevski, Jovan, Vercoustre, Anne-Marie
Wikipedia is a useful source of knowledge that has many applications in language processing and knowledge representation. The Wikipedia category graph can be compared with the class hierarchy in an...
Evaluation of Effective XML Information Retrieval (2007)
XML is being adopted as a common storage format in scientific data repositories, digital libraries, and on the World Wide Web. Accordingly, there is a need for content-oriented XML retrieval systems...
Evaluation of Effective XML Information Retrieval (2007)
XML is being adopted as a common storage format in scientific data repositories, digital libraries, and on the World Wide Web. Accordingly, there is a need for content-oriented XML retrieval systems...
Evaluation of Effective XML Information Retrieval (2007)
XML is being adopted as a common storage format in scientific data repositories, digital libraries, and on the World Wide Web. Accordingly, there is a need for content-oriented XML retrieval systems...
Evaluation of Effective XML Information Retrieval (2007)
XML is being adopted as a common storage format in scientific data repositories, digital libraries, and on the World Wide Web. Accordingly, there is a need for content-oriented XML retrieval systems...
Evaluation of Effective XML Information Retrieval (2007)
XML is being adopted as a common storage format in scientific data repositories, digital libraries, and on the World Wide Web. Accordingly, there is a need for content-oriented XML retrieval systems...
Evaluation of Effective XML Information Retrieval (2007)
XML is being adopted as a common storage format in scientific data repositories, digital libraries, and on the World Wide Web. Accordingly, there is a need for content-oriented XML retrieval systems...
Evaluation of Effective XML Information Retrieval (2007)
XML is being adopted as a common storage format in scientific data repositories, digital libraries, and on the World Wide Web. Accordingly, there is a need for content-oriented XML retrieval systems...
Evaluation of Effective XML Information Retrieval (2007)
XML is being adopted as a common storage format in scientific data repositories, digital libraries, and on the World Wide Web. Accordingly, there is a need for content-oriented XML retrieval systems...
INEX 2006 evaluation measures (2007)
Mounia Lalmas, Gabriella Kazai, Jaap Kamps, Jovan Pehcevski, Stephen Robertson
Abstract. This paper describes the official measures of retrieval effectiveness employed at the ad hoc track of INEX 2006. 1
INEX 2007 evaluation measures (2007)
Jaap Kamps, Jovan Pehcevski, Gabriella Kazai, Mounia Lalmas, Stephen Robertson
Abstract. This paper describes the official measures of retrieval effectiveness that are employed for the Ad Hoc Track at INEX 2007. Whereas in earlier years all, but only, XML elements could be...
Evaluation of Effective XML Information Retrieval (2007)
XML is being adopted as a common storage format in scientific data repositories, digital libraries, and on the World Wide Web. Accordingly, there is a need for content-oriented XML retrieval systems...
Evaluation of Effective XML Information Retrieval (2007)
XML is being adopted as a common storage format in scientific data repositories, digital libraries, and on the World Wide Web. Accordingly, there is a need for content-oriented XML retrieval systems...
Evaluation of Effective XML Information Retrieval (2007)
XML is being adopted as a common storage format in scientific data repositories, digital libraries, and on the World Wide Web. Accordingly, there is a need for content-oriented XML retrieval systems...
This thesis is dedicated to my father, Dimitar Pehcevski, who never stopped believing iii iv
Enhancing Content-And-Structure Information Retrieval using a Native XML Database (2005)
Pehcevski, Jovan, Thom, James A., Vercoustre, Anne-Marie
Three approaches to content-and-structure XML retrieval are analysed in this paper: first by using Zettair, a full-text information retrieval system; second by using eXist, a native XML database, and...
Users and Assessors in the Context of INEX: Are Relevance Dimensions Relevant? (2005)
Pehcevski, Jovan, Thom, James, Vercoustre, Anne-Marie
The main aspects of XML retrieval are identified by analysing and comparing the following two behaviours: the behaviour of the assessor when judging the relevance of returned document components; and...
Users and Assessors in the Context of INEX: Are Relevance Dimensions Relevant? (2005)
Pehcevski, Jovan, Thom, James A., Vercoustre, Anne-Marie
The main aspects of XML retrieval are identified by analysing and comparing the following two behaviours: the behaviour of the assessor when judging the relevance of returned document components; and...
Hybrid XML Retrieval: Combining Information Retrieval and a Native XML Database (2005)
Pehcevski, Jovan, Thom, James A., Vercoustre, Anne-Marie
This paper investigates the impact of three approaches to XML retrieval: using Zettair, a full-text information retrieval system; using eXist, a native XML database; and using a hybrid system that...
Hybrid XML Retrieval: Combining Information Retrieval and a Native XML Database (2005)
Pehcevski, Jovan, Thom, James, Vercoustre, Anne-Marie
This paper investigates the impact of three approaches to XML retrieval: using Zettair, a full-text information retrieval system; using eXist, a native XML database; and using a hybrid system that...
Hybrid XML Retrieval Revisited (2005)
Pehcevski, Jovan, Thom, James, Tahaghoghi, S., Vercoustre, Anne-Marie
The widespread adoption of XML necessitates structureaware systems that can effectively retrieve information from XML document collections. This paper reports on the participation of the RMIT group...
Hybrid XML Retrieval: Combining Information Retrieval and a Native XML Database (2005)
Pehcevski, Jovan, Thom, James, Vercoustre, Anne-Marie
This paper investigates the impact of three approaches to XML retrieval: using Zettair, a full-text information retrieval system; using eXist, a native XML database; and using a hybrid system that...
Users and Assessors in the Context of INEX: Are Relevance Dimensions Relevant? (2005)
Pehcevski, Jovan, Thom, James, Vercoustre, Anne-Marie
The main aspects of XML retrieval are identified by analysing and comparing the following two behaviours: the behaviour of the assessor when judging the relevance of returned document components; and...
Hybrid XML Retrieval Revisited (2005)
Pehcevski, Jovan, Thom, James, Tahaghoghi, S., Vercoustre, Anne-Marie
The widespread adoption of XML necessitates structureaware systems that can effectively retrieve information from XML document collections. This paper reports on the participation of the RMIT group...
Users and Assessors in the Context of INEX: Are Relevance Dimensions Relevant? (2005)
Pehcevski, Jovan, Thom, James, Vercoustre, Anne-Marie
The main aspects of XML retrieval are identified by analysing and comparing the following two behaviours: the behaviour of the assessor when judging the relevance of returned document components; and...
Hybrid XML Retrieval Revisited (2005)
Pehcevski, Jovan, Thom, James, Tahaghoghi, S., Vercoustre, Anne-Marie
The widespread adoption of XML necessitates structureaware systems that can effectively retrieve information from XML document collections. This paper reports on the participation of the RMIT group...
Hybrid XML Retrieval: Combining Information Retrieval and a Native XML Database (2005)
Pehcevski, Jovan, Thom, James, Vercoustre, Anne-Marie
This paper investigates the impact of three approaches to XML retrieval: using Zettair, a full-text information retrieval system; using eXist, a native XML database; and using a hybrid system that...
Users and assessors in the context of INEX: Are relevance dimensions relevant (2005)
The main aspects of XML retrieval are identified by analysing and comparing the following two behaviours: the behaviour of the assessor when judging the relevance of returned document components; and...
HiXEval: Highlighting XML retrieval evaluation (2005)
Jovan Pehcevski, James A. Thom
Abstract. This paper describes our proposal for an evaluation metric for XML retrieval that is solely based on the highlighted text. We support our decision of ignoring the exhaustivity dimension by...
Hybrid XML retrieval: Combining information retrieval and a native XML database (2005)
Jovan Pehcevski, James A. Thom
Abstract. This paper investigates the impact of three approaches to XML retrieval: using Zettair, a full-text information retrieval system; using eXist, a native XML database; and using a hybrid...
Enhancing Content-And-Structure Information Retrieval using a Native XML Database (2004)
Pehcevski, Jovan, Thom, James, Vercoustre, Anne-Marie
Three approaches to content-and-structure XML retrieval are analysed in this paper: first by using Zettair, a full-text information retrieval system; second by using eXist, a native XML database, and...
Enhancing Content-And-Structure Information Retrieval using a Native XML Database (2004)
Pehcevski, Jovan, Thom, James, Vercoustre, Anne-Marie
Three approaches to content-and-structure XML retrieval are analysed in this paper: first by using Zettair, a full-text information retrieval system; second by using eXist, a native XML database, and...
Enhancing Content-And-Structure Information Retrieval using a Native XML Database (2004)
Pehcevski, Jovan, Thom, James, Vercoustre, Anne-Marie
Three approaches to content-and-structure XML retrieval are analysed in this paper: first by using Zettair, a full-text information retrieval system; second by using eXist, a native XML database, and...
Hybrid XML retrieval revisited (2004)
In this paper, we report on the participation of the RMIT University group in the INEX 2004 ad-hoc track. Our preliminary analysis of CO and VCAS relevance assessments identifies two complementary...
Enhancing Content-And-Structure Information Retrieval using a Native XML Database (2004)
Three approaches to content-and-structure XML retrieval are analysed in this paper: first by using Zettair, a fulltext information retrieval system; second by using eXist, a native XML database, and...
Hybrid XML retrieval revisited (2004)
Jovan Pehcevski, James A. Thom
Abstract. The widespread adoption of XML necessitates structureaware systems that can effectively retrieve information from XML document collections. This paper reports on the participation of the...
RMIT INEX experiments: XML Retrieval using Lucy and eXist (2003)
Pehcevski, Jovan, Thom, James, Vercoustre, Anne-Marie
This paper reports on the RMIT group's approach to XML retrieval while participating in INEX 2003. We indexed XML documents using Lucy, a compact and fast text search engine designed and written by...
RMIT INEX experiments: XML Retrieval using Lucy and eXist (2003)
Pehcevski, Jovan, Thom, James, Vercoustre, Anne-Marie
This paper reports on the RMIT group's approach to XML retrieval while participating in INEX 2003. We indexed XML documents using Lucy, a compact and fast text search engine designed and written by...
XML-search Query Language: Needs and Requirements (2003)
Pehcevski, Jovan, Thom, James, Vercoustre, Anne-Marie
This paper explores the needs of XML-search and makes comparisons between the INEX experience in evaluating XML retrieval and the recently proposed W3C requirements for extending XQuery and XPath to...
XML-search Query Language: Needs and Requirements (2003)
Pehcevski, Jovan, Thom, James, Vercoustre, Anne-Marie
This paper explores the needs of XML-search and makes comparisons between the INEX experience in evaluating XML retrieval and the recently proposed W3C requirements for extending XQuery and XPath to...
RMIT INEX experiments: XML Retrieval using Lucy and eXist (2003)
Pehcevski, Jovan, Thom, James, Vercoustre, Anne-Marie
This paper reports on the RMIT group's approach to XML retrieval while participating in INEX 2003. We indexed XML documents using Lucy, a compact and fast text search engine designed and written by...
XML-search Query Language: Needs and Requirements (2003)
Pehcevski, Jovan, Thom, James, Vercoustre, Anne-Marie
This paper explores the needs of XML-search and makes comparisons between the INEX experience in evaluating XML retrieval and the recently proposed W3C requirements for extending XQuery and XPath to...