Cluster-based information retrieval modeling

UBC Theses and Dissertations

Featured Collection

UBC Theses and Dissertations

Cluster-based information retrieval modeling Sze, Richard

Abstract

Cluster-based information retrieval, an extension of information retrieval strategy, is based on the assumption that a document collection can be organized into a set of topics so that a user can enhance retrieval effectiveness. The cluster-based IR model assumes that queries can be associated with clusters that contain high concentrations of relevant documents, and that such association can lead to gains in retrieval effectiveness. Earlier studies, however, have provided negative to mixed results for the performance of the model. Moreover, studies are lacking which investigate the potential of the model in situations where queries are manually associated with the appropriate clusters. The goal of this thesis is to provide evidence for the validity of the cluster-base IR model's effectiveness through conducting extensive empirical studies which explore alternative schemes of the model on a large scale and according to a well-accepted benchmark. Investigation shows that the cluster-based IR model has the potential to enhance retrieval effectiveness, and yet, alternative techniques fail to actually achieve enhanced effectiveness.

Item Metadata

Title	Cluster-based information retrieval modeling
Creator	Sze, Richard
Publisher	University of British Columbia
Date Issued	2004
Description	Cluster-based information retrieval, an extension of information retrieval strategy, is based on the assumption that a document collection can be organized into a set of topics so that a user can enhance retrieval effectiveness. The cluster-based IR model assumes that queries can be associated with clusters that contain high concentrations of relevant documents, and that such association can lead to gains in retrieval effectiveness. Earlier studies, however, have provided negative to mixed results for the performance of the model. Moreover, studies are lacking which investigate the potential of the model in situations where queries are manually associated with the appropriate clusters. The goal of this thesis is to provide evidence for the validity of the cluster-base IR model's effectiveness through conducting extensive empirical studies which explore alternative schemes of the model on a large scale and according to a well-accepted benchmark. Investigation shows that the cluster-based IR model has the potential to enhance retrieval effectiveness, and yet, alternative techniques fail to actually achieve enhanced effectiveness.
Extent	3855745 bytes
Genre	Thesis/Dissertation
Type	Text
File Format	application/pdf
Language	eng
Date Available	2009-11-21
Provider	Vancouver : University of British Columbia Library
Rights	For non-commercial purposes only, such as research, private study and education. Additional conditions apply, see Terms of Use https://open.library.ubc.ca/terms_of_use.
DOI	10.14288/1.0091246
URI	http://hdl.handle.net/2429/15435
Degree	Master of Science in Business - MScB
Program	Business Administration
Affiliation	Business, Sauder School of
Degree Grantor	University of British Columbia
Graduation Date	2004-05
Campus	UBCV
Scholarly Level	Graduate
Aggregated Source Repository	DSpace

Item Media

ubc_2004-0308.pdf -- 3.68MB

Item Citations and Data

Rights

For non-commercial purposes only, such as research, private study and education. Additional conditions apply, see Terms of Use https://open.library.ubc.ca/terms_of_use.

Open Collections

UBC Theses and Dissertations

Cluster-based information retrieval modeling Sze, Richard

Abstract

Item Metadata

Item Media

Item Citations and Data

Rights