Supporting User Interaction for the Exploratory Mining of Constrained Frequent Sets

by

Teresa Mah

B.Sc., University of British Columbia, 1997

A THESIS SUBMITTED IN PARTIAL FULFILLMENT OF THE REQUIREMENTS FOR THE DEGREE OF Master of Science in THE FACULTY OF GRADUATE STUDIES (Department of Computer Science)

We accept this thesis as conforming to the required standard

The University of British Columbia
August 1999
© Teresa Mah, 1999

In presenting this thesis in partial fulfilment of the requirements for an advanced degree at the University of British Columbia, I agree that the Library shall make it freely available for reference and study. I further agree that permission for extensive copying of this thesis for scholarly purposes may be granted by the head of my department or by his or her representatives. It is understood that copying or publication of this thesis for financial gain shall not be allowed without my written permission.

Department of Computer Science
The University of British Columbia
Vancouver, Canada

Abstract

Data mining is known to some as "knowledge discovery from large databases". It is the technology of taking large quantities of data and searching through them for previously unknown patterns of information. One can see how this can be useful to entrepreneurs and researchers of all kinds. Retailers can apply data mining to find customer shopping patterns. On a grander scale, meteorologists can use the technology to identify telltale signs of extreme weather conditions, such as tornadoes or hurricanes.

Unfortunately, useful as it is, data mining has not yet broken out of its shell. There are two main reasons for this. The first is that the mining process is still slow, even with all the research done to optimize the algorithms. The second is that there has not been much work done on improving the user interaction aspect of the technology. Most of the systems created so far have resembled a black box: input is entered at one end, and output is received at the other. There is no concept of human-centred exploration or control of the process, and no mechanism to specify focus in the database.

The work described here provides a glimpse of a new exploratory mining framework that encourages exploration and control. In addition, this new framework is incorporated into the first fully functional prototype capable of constrained frequent set mining. A user of the prototype can specify focus by providing constraints on the data to be mined, and can view frequent sets satisfying these constraints before relationships are found. The prototype also allows users to sort or format frequent set output, and to choose only interesting sets to find relationships on. Furthermore, frequent sets in our system can be mined between sets with different or similar domains, and users can choose other notions of relationship besides confidence. Combining this new exploratory mining paradigm with the faster, more efficient CAP algorithm, we have what we believe is the first in a new generation of fast and human-centred data mining systems.

Acknowledgements

The author would like to thank Dr. Jiawei Han and Dr. Laks Lakshmanan for their helpful comments on this thesis work and their suggestions for improvement.
Dr. George Tsiknis is also acknowledged for dedicating his time and effort to reviewing this writing. Last, but not least, the author would like to thank Dr. Raymond Ng for all his supervision and guidance, without which this work could not have been completed.

TERESA MAH

The University of British Columbia
August 1999

Contents

Abstract
Acknowledgements
Contents
List of Tables
List of Figures

1 Introduction
 1.1 Data Mining
 1.2 Related Work
 1.3 Significance of Research
 1.4 Goals
 1.5 Outline of Thesis

2 Related Work
 2.1 Association Rules
  2.1.1 The Apriori Algorithm
 2.2 The Association Mining Model
  2.2.1 The Classical Model
  2.2.2 The Exploratory Two-Phase Architecture
 2.3 Constrained Frequent Set Queries
  2.3.1 The Apriori+ Algorithm
  2.3.2 The CAP Algorithm

3 Algorithm Implementation
 3.1 Programming Languages
 3.2 Primary Data Structures
  3.2.1 Sets
  3.2.2 Lists
  3.2.3 Transaction Databases
 3.3 Generation of Sets
  3.3.1 Generating Large One-Item Sets
  3.3.2 Generating Candidate and Frequent Sets
  3.3.3 Generating Candidate Pairs
 3.4 Constraint Management
  3.4.1 Reading Constraints
  3.4.2 Applying Constraints
 3.5 Optimizing for CAP
  3.5.1 Antimonotone Constraints
  3.5.2 Succinct Antimonotone Constraints
  3.5.3 Succinct Constraints
  3.5.4 Non-antimonotone Non-succinct Constraints
 3.6 Relationship Calculation
  3.6.1 Confidence
  3.6.2 Correlation
  3.6.3 Dependence
 3.7 Mining Between Sets of Different Domains
 3.8 Running in the Classical Model
 3.9 Testing
  3.9.1 Testing Individual Constraints
  3.9.2 Testing Multiple Constraints
 3.10 Calling CAP and Apriori+

4 Support for User Interaction
 4.1 Using the Two-Phase Architecture
 4.2 Constraint Handling
  4.2.1 Creating Constraints
  4.2.2 Modifying, Copying, Reordering and Deleting Constraints
 4.3 Displaying Candidate Sets
  4.3.1 The Candidate Set Window
  4.3.2 Sorting Candidate Sets
  4.3.3 Viewing Candidate Sets
  4.3.4 Graphing Sorted Sets
 4.4 Relationships
  4.4.1 Choosing Interesting Sets
  4.4.2 Selecting Different Metrics
  4.4.3 Mining between sets of different types
 4.5 Selecting Different Algorithm Options
  4.5.1 The Classical vs. The New Model
  4.5.2 Apriori+ vs. CAP
 4.6 Database Management
  4.6.1 Item Database Issues
  4.6.2 Viewing the Transaction Database
 4.7 File Operations
 4.8 Other Features
  4.8.1 Display Options
  4.8.2 Error Checking
  4.8.3 Constraints Viewer
 4.9 Interface Implementation Issues
  4.9.1 Programming Language
  4.9.2 Tcl/Tk Problems

5 Conclusion
 5.1 Future Work

Bibliography

List of Tables

2.1 Characterization of 1-var Constraints
3.1 Item Database dbfile.sql
3.2 Weaker Constraints induced from NONE Constraints
3.3 Contingency Table for Association Rules
4.1 Constraint Distance Calculation

List of Figures

2.1 Apriori Algorithm
2.2 Two Phase Architecture for Exploratory Association Mining
2.3 Syntax of Constraint Constructs
2.4 Apriori+ Algorithm
2.5 CAP Algorithm
4.1 CAP Main Window
4.2 Antecedent Constraints Creation Window
4.3 Edit Popup Menu
4.4 Modify Constraint Window
4.5 Copy Constraint Window
4.6 Candidate Sets Window
4.7 Sort Candidate Sets Window
4.8 Sets sorted by Frequency
4.9 Maximal sets sorted by frequency
4.10 Sets viewed by level
4.11 Chart of sort results for two criteria
4.12 Edit Chart Window
4.13 Results of the activation/deactivation technique
4.14 Candidate Sets Selection Window
4.15 Maximality Selection Window
4.16 Constraint Distance Selection Window
4.17 Specify Relationship Window
4.18 Related Sets Window
4.19 Class Constraint Creation Window
4.20 Options Window
4.21 View Metadata Window
4.22 View Transaction File Window
4.23 Saving a File in CAP
4.24 Font Selection Window
4.25 Error Window
4.26 Constraints Viewer
Chapter 1

Introduction

1.1 Data Mining

The boom in database technology over the past few decades has resulted in mass quantities of data. Retail outlets keep records of all their sales in transaction logs. Hospitals store patient data in large databases. Governments identify citizens by keeping track of their employment, health and family information. As our society grows even more complex, there will be more and more data to store, resulting in larger and larger databases. This phenomenon is what created interest in one of the newer research areas of databases: data mining. Also known as knowledge discovery from databases (KDD), data mining is the process of extracting useful (and otherwise unknown) patterns and associations from large databases.

1.2 Related Work

There are many different types of patterns which can occur in data. However, as data accumulates, it becomes more and more difficult for us to sift through it in hopes of finding any of these patterns. This is why we need data mining: it addresses this problem by automating the search, making the process simpler, more manageable and more efficient. Data mining is useful in almost all realms of life. Geographers may want to easily locate clusters of similar topography on maps; this type of data mining is called clustering. Credit card companies which want to identify fraudulent spending can look to outlier detection algorithms to catch irregular behavior patterns in data. Stock brokers can use sequential data mining programs to identify which purchase sequences of stocks have generated the greatest return, or use classification algorithms to classify stocks as profitable or not profitable. Medical researchers may want to look through patient records to find links or associations between eating habits and nervous system disorders, and consultants want to look for patterns in customer purchases; for example, when customers buy shoes, they also buy socks and shirts. These last two cases are examples of association rules, which are the focus of this author's research. Association mining is the extraction of related and frequently occurring data in databases. This type of information can be invaluable; with it, doctors can prescribe treatment and suggest appropriate eating habits to prevent various disorders, and consultants who now know which products are frequently purchased together can rearrange items in their departments to maximize sales.
A true example is a supermarket that rearranged its shelves to place baby products near the alcohol aisle after applying association mining to its transaction database and finding that "60% of all customers who buy diapers also buy beer". One of the first association mining algorithms, Apriori, was proposed by IBM researchers Srikant and Agrawal. Since then, many papers have been written focusing on improving the efficiency of Apriori and expanding its applicability. The paper this research is based on, "Exploratory Mining and Pruning Optimizations of Constrained Association Rules" [17], discusses just that. In association mining, we want to find all pairs of sets within the database which are related. In Apriori, such sets have to satisfy user-supplied thresholds called support and confidence. Support is how frequently a set occurs in the database, while confidence is a measure of how associated two sets are. Minimum support and confidence can therefore be thought of as constraints on the sets. The CAP algorithm discussed in [17] is the first proposal to allow a user to specify additional SQL-like constraints on the sets. For example, users can specify that they are interested in finding association rules for only half of their items, or that they only want to find out unknown information about their "chip" and "soda" products. The CAP algorithm uses these constraints to achieve faster performance, resulting in speedups of up to 80 times over Apriori.

1.3 Significance of Research

Since Apriori was first proposed, little research has been done regarding the framework of the mining process. In particular, the classical model is lacking in two areas: user exploration and focus. One of the reasons why data mining has not entered the commercial market, despite its wide applicability and usefulness, is that it does not offer support for any user interaction within the process. We supply an association mining program with a database of transactions and a couple of parameters; then we wait for hours before results are finally displayed on the screen. Users are not given the ability to explore their data during the mining process, and there is no means by which focus (on a particular section of the database) can be described. Under the classical model, even if we only want to find information about a small section of the database, we need to run the algorithm on the whole database in order to find the desired rules. This is the reason why data mining is only popular (and even then, only mildly so) in the academic community: one must have considerable knowledge of the database and the mining process before being confident that hours spent waiting at the computer will eventually result in the information that is desired. Consequently, it is important to reconfigure the classical model to support human-centred exploration, allowing users to explore their data, control and/or preempt time-consuming processes, and establish focus in the database. For example, instead of mining the whole database, which is what all association mining programs implemented to date have done, allowing one to establish focus can concentrate the mining on one small section of the database and decrease execution time. In addition, enabling users to view their data during the mining process will allow them to decide whether or not the rules being generated are interesting.
With a new model incorporating human-centred exploration and control, we can envision even the common grocery store manager using data mining to improve sales.

1.4 Goals

In [17], such a model has been proposed, encompassing a new exploratory two-phase architecture and the notion of constraints. The bulk of the research described here involves weaving the characteristics of this model into an appropriate design and implementation of the first fully functional prototype capable of constrained frequent set mining. With this prototype, the effectiveness, exploratory abilities and usability of the new framework truly shine. Our CAP prototype has been the only association mining program to accomplish the following:

• allow for focus by
 - incorporating the optimized CAP algorithm to find constrained frequent sets
 - enabling users to create a variety of constrained frequent set queries, including class, frequency, domain, aggregate and two-variable constraints
 - providing an easy-to-use interface to create, modify, view, reorder, copy and delete constraints
 - giving users the ability to select only interesting candidate sets to find relationships on
 - providing a selection helper to assist in the selection of interesting candidate sets by frequency, constraint distance, maximality, cardinality, attribute or aggregate values

• allow for human-centred exploration and control of the association mining process by
 - splitting the process into two phases, where the first phase consists of finding the frequent sets and the second phase consists of finding relationships between pairs of sets
 - giving users the ability to reiterate through the two phases if it is found that focus is not established correctly
 - providing a utility to sort candidate pairs based on frequency or "distance" to a constraint, and to view candidates by cardinality or maximality
 - providing a graphical view of sorted candidates and the ability to activate or deactivate candidate representations in graphs based on sort constraint and cardinality
 - providing file operations for users to create, open, save, and print constraint and program output files
 - giving users the option to run under the classical model or the new exploratory model

• allow for different terms of relationships by
 - enabling users to choose from three different relationship metrics: confidence, correlation and dependence
 - giving users the ability to specify relationships between sets which come from different domains

1.5 Outline of Thesis

For the rest of this thesis, we will discuss the unique features of the CAP prototype in more detail. In Section 2, we give a general survey of the data mining field and provide an overview of the related work which has influenced the research done on this prototype. Section 3 discusses some of the issues concerning the implementation of the CAP algorithm, and the solutions we found to some problems which surfaced. In Section 4, we talk about how we incorporated the new model of association mining into our prototype, and how the system was designed to encourage human-centred exploration, thereby increasing usability and supporting more user interaction in the mining process. Finally, Section 5 contains our conclusions, thoughts about where the prototype is headed, and future work which can be done to further enhance the prototype's features.

Chapter 2

Related Work

Data mining is still a relatively new field, but it already has many applications in the real world.
It is for this reason that the author has chosen to contribute to this area, and in particular to the research related to association rules. However, besides association rules, the other fields of data mining are equally interesting and worth discussing.

Many researchers are devoted to creating more efficient outlier detection algorithms, which can find, for example, the players in the NBA who have recently not been playing their regular game, or which hockey players of all time are far above average in terms of injuries. Outlier detection algorithms have always been most efficient with two-dimensional databases. Recent research on outliers includes a paper by Knorr and Ng [14] which proposes new algorithms to efficiently mine for distance-based outliers in large multi-dimensional datasets.

Classification is an extremely popular field in data mining, and a very common term in the area of business intelligence as well. Classification algorithms are used to classify objects into specific classes. For example, a hospital could use a classification algorithm to classify patients entering the ward as critical or non-critical based on their symptoms. With such an algorithm, the system can detect whether a man complaining of chest problems is in danger of undergoing heart failure or merely has mild heartburn. Classification algorithms usually produce decision trees which in turn generate sets of rules that can be used to classify the objects. Some of the more popular decision tree-based algorithms are CART, CHAID and ID3. The area of statistics has also applied neural networks to solving classification problems [8].

Spatial data mining is also a popular area of research in data mining, since spatial databases are quite large and can contain valuable data such as satellite, medical and geographical information. Manually sifting through the terabytes of information looking for patterns is usually not desirable, or even imaginable. The field of spatial data mining is devoted to this problem: extracting unknown patterns or features from spatial data, finding regularities within the data and capturing relationships between spatial and non-spatial data. For example, Ng and Han proposed the CLARANS clustering algorithm in [19] and described how it could be applied to databases containing spatial and non-spatial data. Ester, Kriegel, Sander, and Xu later proposed DBSCAN in [11], a density-based algorithm which was capable of finding "better" clusters in less time.

Sequential data mining is also an active area of research. One of the first sequential data mining algorithms was created by Srikant and Agrawal in [6] by modifying the Apriori algorithm to handle sequences. With this new version of Apriori, one could find, for example, the sequences of purchases which occur most frequently in transaction databases. Srikant and Agrawal later improved the efficiency and applicability of the algorithm by introducing the GSP algorithm [7], which was capable of handling taxonomies and time constraints between sequences. Mannila, Toivonen, and Verkamo did work in the same field, but concentrated more on finding frequent episodes (sub-sequences) in specified time windows within sequences [15]. An algorithm was also proposed to find similar sub-sequences between sequences of data in time-series databases [2]; in this model, the sequences in question can be scaled or translated before similar sub-sequences are found.
Sequential data mining has many applications, especially in the financial area, where it is usually used to find similar growth patterns in companies, stocks, or product sales.

2.1 Association Rules

In 1993, another area of research in data mining was born: the problem of finding association rules [1]. Association mining is the extraction of related and frequently occurring data in databases. Rules such as "80% of customers who purchase red wine also purchase cheese" are examples of association rules. Apriori, the first algorithm proposed to mine for association rules, was developed in 1994 by Srikant and Agrawal [3]. Since then, many different papers have been written attempting to improve the algorithm and expand its applicability. These papers have redefined the term "association rule" to be a general category consisting of a number of distinct varieties.

Generalized association rules are one variety. A generalized association rule is a rule which can span is-a hierarchies among items. For example, suppose we have a clothing database with the categories "outerwear" and "shoes", and let "jacket" be a type of outerwear and "hiking boots" be a type of shoe. Then the rule "outerwear => hiking boots" is an example of a generalized association rule, since "outerwear" is at a higher taxonomic level than "hiking boots". Srikant and Agrawal proposed a few algorithms in [4] to handle this, the most basic of which was a simple extension of Apriori which involved itemizing the categories and including the enumerated category IDs in any transaction which contained descendants of the category. Han and Fu modified the single-level Apriori in [13] to handle multiple-level association rules by remapping the transaction database and performing the mining by a progressive deepening of the levels. Four algorithms were proposed in the paper, each one being optimal for different data distributions.

Another type of association rule is the quantitative association rule. It is desirable to find quantitative association rules in databases which contain categorical or quantitative information. For example, the statement "50% of business people between the ages of 30 and 40 have at least 3 children" is a quantitative association rule. Mining for such information is desired in databases containing, for example, survey-related data or customer information. Srikant and Agrawal suggested in [5] that such rules could be easily found by partitioning and enumerating the domain of each categorical attribute in the database and then applying Apriori, using the partitioned categories as items.

Another popular topic in association mining is developing parallel mining algorithms for finding association rules [12] [21]. Other researchers are concerned with different issues; one recent debate is the appropriateness of using confidence to assess relationship or association. Brin, Motwani and Silverstein in [9] suggested that the dependence ratio or correlation between two sets is more appropriate for calculating relationships than confidence. The algorithm they proposed involved merging the frequent set generation and correlation calculation algorithms into one to increase the pruning power of the correlation algorithm. This topic was later brought up again in [16], which discussed fast parallel searching for correlated association rules. It was this paper that provided the algorithm for chi-squared value calculation used in the CAP prototype.
2.1.1 The Apriori Algorithm

Before delving deeper into the rest of this document, we must provide a detailed explanation of association mining, since the CAP algorithm used in our prototype is an extension of the Apriori algorithm. The classical association rule algorithm, Apriori, was first proposed in [3]. In the paper, Srikant and Agrawal state that users have to supply two thresholds to run Apriori: a minimum support and a minimum confidence value. These values are then used to generate rules such as "20% of people who purchase a VCR also purchase an extended warranty". The 20% figure in the statement is an example of the confidence value of the rule.

First, let us define the word itemset, which will be used quite frequently in the material to follow. An itemset is a set containing any number of items; a 3-itemset would therefore be a set containing three items. The minimum support of an itemset S is the minimum number of transactions (or percentage of transactions) in the database which must contain S. We call any set which satisfies the minimum support threshold frequent. For a pair of sets S and T to satisfy the minimum confidence threshold, S ∪ T must be frequent in the database, and the probability that T is in a transaction given that S is in the transaction, P(T|S), must be greater than or equal to the minimum confidence threshold.

Given the support and confidence thresholds, Apriori first creates a list of all the items in the database and finds the subset of items from that list that are frequent. Using this frequent 1-itemset list (denoted L1), Apriori then creates 2-itemsets by joining the frequent 1-itemset list with itself. These joined 2-itemsets are called candidate sets. The candidate 2-itemsets, C2, are then passed through the support test to find the list of 2-item frequent sets, L2. We join the list of 2-item frequent sets with itself to generate the 3-item candidate sets, and keep those 3-item candidate sets which pass the support threshold to make the list of 3-item frequent sets. This process continues until no more candidate sets can be created. The result of this "frequent set generation phase" of Apriori is the list of frequent 1-item, 2-item, 3-item, ..., k-itemsets, or L1 ... Lk.

In the relationship calculation phase, where we find the degree of associativity between two sets, we pair up all frequent sets with each other to create a list of pairs of sets. We call the left side of each pair the antecedent (S), and the right side the consequent (T). We can determine whether a set S is associated with a set T by first calculating whether S ∪ T is frequent and then calculating the pair's confidence, P(T|S). If S ∪ T is frequent and the pair's calculated confidence value is greater than the minimum confidence threshold, we say that set S is associated with set T. In particular, we can say that whenever set S is in a transaction, X% of the time set T is also in the transaction (where X is the confidence value of the pair). The output of Apriori will be the pairs of sets of items which satisfy both support and confidence thresholds. The algorithm is summarized in Figure 2.1.

Apriori Algorithm

1. C1 consists of sets of size 1; k = 1; Ans = ∅;
2. while (Ck not empty) {
   (a) conduct db scan to form Lk from Ck;
   (b) form Ck+1 from Lk based on Rfreq; k++; }
3. for each set S in some Lk:
   (a) add S to Ans

Figure 2.1: Apriori Algorithm

In the algorithm, Ck is the kth candidate set, Lk is the kth frequent set, Rfreq is the frequency constraint and Ans is the result. A more detailed account of the Apriori algorithm can be found in Srikant and Agrawal's original paper [3].
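To make step 2(b) concrete: the join that forms Ck+1 takes two frequent (k-1)-itemsets that agree everywhere except in their last item and merges them into a k-item candidate. The C fragment below is a minimal sketch of that join, not code from the thesis; it assumes each itemset is stored as a sorted array of item IDs.

    #include <string.h>

    /* Sketch of the candidate join: a and b are sorted (k-1)-itemsets.
       Returns 1 and fills out[0..k-1] when they join into a k-item
       candidate, 0 otherwise.  A full implementation would also prune
       any candidate that has an infrequent (k-1)-subset. */
    static int join_step(const int *a, const int *b, int k, int *out)
    {
        if (memcmp(a, b, (size_t)(k - 2) * sizeof(int)) != 0)
            return 0;                      /* first k-2 items must match */
        if (a[k - 2] >= b[k - 2])
            return 0;                      /* keep each candidate unique */
        memcpy(out, a, (size_t)(k - 1) * sizeof(int));
        out[k - 1] = b[k - 2];             /* append b's last item       */
        return 1;
    }

Applying join_step to every pair of sets in Lk and then counting the survivors against the transaction database reproduces one iteration of the loop above.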
2.2 The Association Mining Model

2.2.1 The Classical Model

As discussed earlier, there are many problems with the classical association mining model. Users supply the database with a set of transactions and minimum support and confidence values, and run the mining algorithm. Then they sit at their terminals and wait an undetermined amount of time for association rules to appear. This lack of user interaction may not be that bad for those who know what they are doing and can afford to (and want to) wait at their terminals for only the final output. But for the majority of us, it may be more efficient to provide views of the data generated as the algorithm is being run; this at least gives the user some feedback as to whether the sets being considered and generated are interesting. The problems with the classical framework were first itemized in [17]. They are:

• Lack of user exploration and control - The model of association mining resembles a black box. Input is entered at one end of the black box and output is received at the other end. The user is not able to explore or preempt the process. What happens if a mistake was made when entering the data? This lack of user interaction and control can therefore result in hours of wasted time.

• Lack of focus - In the classical framework, all frequent sets are found, even ones which the user may not particularly care for. Suppose a user wanted to mine for frequent sets on only snack items. Under the classical framework, the Apriori algorithm would still output all frequent sets, not being able to take the constraint into account. What could have been done in five minutes would now require one hour because of the lack of focus.

• Rigid notion of relationship - A rigid notion of relationship is the third problem. The model should allow for other notions of relationship. For example, a user may want to use correlation (as opposed to confidence) to assess the associativity between two sets, or the user may want associations between sets of types and sets of items. The framework should fully allow for the specification of alternative types of relationships.

2.2.2 The Exploratory Two-Phase Architecture

Fixing the weaknesses of the classical model is imperative to advancing association mining to its next level. For years, there have been discussions about allowing the user to guide the mining process, but not many prototypes or programs have incorporated this idea into their design. Of those which have, the problem of the lack of focus has still not been solved. The CAP prototype is the only fully functional program which has been designed to solve all of the above problems by following the model (proposed in [17]) below:

• open up the black box, allow the user to intervene at specific breakpoints, and provide sufficient feedback

• provide the user with a means to specify focus through constraints, ensuring that the system only looks for information the user is interested in

• allow the user to specify different notions of relationship and different domains to mine on

The new architecture is one that encourages exploratory association mining by opening up the black box and incorporating the notion of constrained frequent set queries.
In this new model, the association mining process is split into two phases, each phase fully supporting human-centred exploration and user interaction. In Phase I, the user specifies constraints for the antecedent and consequent. For example, one may want to mine only sets of types for the domain of the antecedent, and sets of items for the domain of the consequent. After specifying constraints and support thresholds for the antecedent and the consequent, the user runs the algorithm to compute the constrained frequent sets. The list of frequent sets for the antecedent and the consequent will be displayed as the result. At this point in the new two-phase architecture, the user can choose to sort or graph the output, refine the constraints and support thresholds, or proceed to select interesting candidate pairs on which to find associations. This latter action is called Phase II. In Phase II, users can choose which pairs to find relationships on by manually searching through the output and selecting pairs, or by using automated selection tools which select pairs based on criteria such as support, maximality, cardinality, attribute values, aggregate values, or calculated distance from a constraint. After selecting interesting pairs, users have a choice of different relationship metrics to test for association; metrics can include confidence, correlation or dependence ratio. After computing the relationships, the significant related pairs are finally displayed. In this model, both Phases I and II can be reiterated whenever the user needs to refine constraints or change metrics and thresholds. Figure 2.2, taken from [17], illustrates the new framework incorporated into the CAP prototype.

[Figure 2.2: Two Phase Architecture for Exploratory Association Mining. The diagram shows the initial constrained association query entering Phase 1 (Finding Constrained Frequent Sets), a loop from Phase 1 back into itself for refinement of constraints and the support threshold, the selection of a metric, metric threshold, type of relationships and candidate sets feeding Phase 2 (Computing Relationships and their Significance), and a loop from Phase 2 back to either phase for refining these choices.]

2.3 Constrained Frequent Set Queries

CAQs (originally standing for constrained association queries, but now more properly termed constrained frequent set queries) enable users to express focus in the new two-phase mining model. The user can use CAQs to specify class, domain, frequency and aggregate constraints to apply to the antecedent set, consequent set or both. The use of CAQs here needs a bit of explanation. The syntax for a constraint is {(S1, S2) | R}, where R is a set of constraints on set variables S1 and S2. Set variables can also have the form S.A, where A is an attribute. Now, if we assume we have an item database ITEM(ITEMID, ITEMNAME, TYPE, PRICE), then the set variable S.TYPE would represent the types of the items in S. Alternatively, we can specify that the domain of S come directly from the TYPE attribute using the class constraint S ⊆ TYPE. As another example, suppose we want to find all pairs of itemsets where the minimum price of S1 is less than 3 and the count of items in S2 is greater
This CAQ is an example of how we can use constraints to relax the notion of relationship in the new model. Under the classical model, we would not have been able to specify mining for associations between items and types; the classical model only supports mining between sets with similar domains. Constrained frequent set queries can represent many different kinds of con-straints. Single variable (1-var) constraints specify constraints on only one set vari-able; two variable (2-var) constraints specify constraints between two different set variables. All these different constrained frequent set queries are fully supported in the CAP prototype. Figure 2.3 shows the different kinds of single and two variable constraints. 2.3.1 The Apr ior i+ A l g o r i t h m The prototype has been implemented to allow users to interactively choose between running Apriori+ or CAP to find candidate pairs. Apriori"1" is an extension of Apri-ori incorporating constraint handling. The simplest way to implement Apriori4" is to test if a set satisfies all constraints after it is found to be frequent; infrequent sets do not need to be tested. In addition, a frequent set which does not satisfy all the constraints is not displayed in the output. Apriori"1" is sufficient for all intents and purposes if the goal is to incorporate constraints and generate focus; however, issues 16 C is a conjunction of constraints on 51, 52 drawn from the following class of constraints. 1. Single Variable Constraints: A single variable (1 — var) constraint is of one of the following forms. (a) Class Constraint: It is of the form S C A, where 5 is a set variable and A is an attribute. It says 5 is a set of values from the domain of attribute A. (b) Domain Constraint: It is of one of the following forms. i. S9v, where 5 is a set variable, v is a constant from the domain that S comes from, and 9 is one of the boolean operators = ,^ ,< , <,>,>. It says every element of 5 stands in relationship 9 with the constant value v. ii. v9S, where S, v are as above, and 9 is one of the boolean operators £ , ^. This simply says that the element v belongs to (or not) the set 5 iii. V9S, or S9V, where 5 is a set variable, V is a set of constants from the domain 5 ranges over, and 9 is one of C , C , = or their corresponding nega-tions (c) Aggregate Constraint: It is of the form agg(S)9v, where agg is one of the aggre-gate functions min, max, sum, count, avg and 9 is one of the boolean operators —> <><>>> >• It s a y s the aggregate of the set of numeric values in S stands in relationship 9 to v 2. Two Variable Constraints: A two variable constraint (2—var) is of one of the following forms. (a) S\9S2, where 5,- is a set variable and 9 is one of C , C or their corresponding negations (b) (5i o S2)9V, where S i , S2 are set variables, V is a set of constants or 0, o is one of U, fl, and 9 is one of C , C , = or their corresponding negations (c) aggi(Si)9agg2(S2), where aggi, agg2 are aggregate functions, and 9 is one of the boolean operators =, ^ , <,<,>, > Figure 2.3: Syntax of Constraint Constructs 17 A p r i o r i + Algorithm 1. C\ consists of sets of- size 1; k = 1; Ans = 0; 2. while (Ck not empty) { (a) conduct db scan to form Lt from Ck; (b) form Ck+i from Lk based on Rfreqi 3. for each set S in some Lk: (a) add S to Ans i f S sa t i s f ies R0ther Figure 2.4: Apriori+ Algorithm of efficiency and speed are also important in data mining. 
The reader can look to the CAP paper [17] for more details on the efficiency of Apriori+ and its hybrids.

2.3.2 The CAP Algorithm

Optimizations

The prototype runs fast because, by default, it executes the optimized CAP algorithm to find constrained frequent sets. We use CAP because it is designed to take advantage of two properties of constraints to achieve greater pruning power during the candidate generation and pruning phases.

The first property of constraints is antimonotonicity. When a set violates an antimonotone constraint, all its supersets also violate the constraint. Frequency is an example of an antimonotone constraint; the reason we can prune away infrequent candidate sets is that all supersets of an infrequent set are guaranteed to be infrequent as well, due to the antimonotone nature of the constraint. Just as with the frequency constraint, we can use the antimonotone nature of other constraints to prune away candidate sets (before counting) which do not satisfy the constraints.

The second property of constraints is succinctness. A constraint is succinct if it can be used to succinctly characterize the set of all itemsets that satisfy its criteria. For example, the constraint min(S.ITEMID) > 5 is succinct because all candidate sets satisfying it can be succinctly characterized and generated: the 1-item candidate sets satisfying the constraint are all items with ITEMID greater than 5, the 2-item candidate sets are all sets created by joining the 1-item candidate sets with each other, and so on. To succinctly generate sets, we need to be able to characterize succinct powersets, which are lattices of candidate sets satisfying succinct constraints. Every succinct constraint has a corresponding member generating function (MGF) which can be used to enumerate all and only the elements of the succinct powerset of the constraint. The MGF itself is expressed in terms of selection predicates, which are the predicates that occur in the syntax of selection in relational algebra. For example, the predicate σTYPE='snacks'(ITEM) selects all items from the database whose type is snacks. An MGF has the form:

{X1 ∪ ... ∪ Xn | Xi ⊆ σpi(ITEM), 1 ≤ i ≤ n, & ∃k ≤ n : Xj ≠ ∅, 1 ≤ j ≤ k}

for some n ≥ 1 and some selection predicates p1, ..., pn.

Let us suppose that we have the constraint S.PRICE > 100. This constraint reads, "find all sets whose items have a price greater than 100 dollars." To succinctly generate sets satisfying this constraint, all we have to do is find all items whose price is > 100, since a set of any size k must contain only items whose price is greater than 100 dollars. Therefore, our selection predicate is σPRICE>100(ITEM). This predicate yields a set of items which we will call Item1. Our MGF for the constraint S.PRICE > 100 is then:

{X | X ⊆ Item1 & X ≠ ∅}
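In code, a selection predicate is just one scan over the item table. The sketch below (the item record here is illustrative; the prototype's actual record type appears in Chapter 3) computes σPRICE>100(ITEM); every non-empty subset of the ID list it produces is an element of the succinct powerset that the MGF above enumerates.

    struct item { int id; int price; };

    /* sigma_{PRICE>100}(ITEM): collect the IDs of all qualifying items
       into out and return how many were found (the size of Item1). */
    static int select_price_gt(const struct item *db, int n,
                               int threshold, int *out)
    {
        int i, found = 0;
        for (i = 0; i < n; i++)
            if (db[i].price > threshold)
                out[found++] = db[i].id;
        return found;
    }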
Now, suppose we have the constraint S.TYPE ⊇ {snacks, sodas}. We are looking for all sets which contain at least one snack item and one soda item. To succinctly generate these sets, we need to find all items whose type is snacks, all items whose type is sodas, and all items whose type is neither snacks nor sodas. Let the following be the results from the respective selection predicates: Item2 = σTYPE='snacks'(ITEM), Item3 = σTYPE='sodas'(ITEM) and Item4 = σTYPE≠'sodas' ∧ TYPE≠'snacks'(ITEM). Our MGF would then be:

{X1 ∪ X2 ∪ X3 | X1 ⊆ Item2 & X1 ≠ ∅ & X2 ⊆ Item3 & X2 ≠ ∅ & X3 ⊆ Item4}

The CAP prototype takes advantage of MGFs when generating succinct candidate sets. If we have multiple succinct constraints, we can generate the combined powerset satisfying all the constraints by merging their MGFs using the following lemma, taken from [17]:

Lemma 1 Let C1, C2 be two succinct constraints with respective MGFs {S1 ∪ ... ∪ Sm | Si ⊆ σpi(ITEM), 1 ≤ i ≤ m, & ∃m′ ≤ m : Si ≠ ∅, 1 ≤ i ≤ m′} and {T1 ∪ ... ∪ Tn | Ti ⊆ σqi(ITEM), 1 ≤ i ≤ n, & ∃n′ ≤ n : Ti ≠ ∅, 1 ≤ i ≤ n′}. Then {R11 ∪ ... ∪ Rmn | Rij ⊆ σpi∧qj(ITEM), 1 ≤ i ≤ m, 1 ≤ j ≤ n, & Rkl ≠ ∅, 1 ≤ k ≤ m′, 1 ≤ l ≤ n′} is an MGF for C1 & C2.

Each set Rij in the above formula can also be thought of as the intersection of sets Si and Tj.
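Because each Rij is just such an intersection, merging two MGFs per Lemma 1 requires nothing beyond intersecting the item lists that the selection predicates produce. If those lists are kept sorted, as in the selection sketch above, each Rij pool falls out of a linear-time merge; the routine below is our sketch, not the prototype's code.

    /* R_ij = intersection of two sorted item-ID lists (the results of
       sigma_{p_i}(ITEM) and sigma_{q_j}(ITEM)).  Returns |out|. */
    static int intersect_sorted(const int *a, int na,
                                const int *b, int nb, int *out)
    {
        int i = 0, j = 0, k = 0;
        while (i < na && j < nb) {
            if (a[i] == b[j]) { out[k++] = a[i]; i++; j++; }
            else if (a[i] < b[j]) i++;     /* advance the smaller side */
            else j++;
        }
        return k;
    }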
& i f m ^ 0 => T' € • Generate Z/^  where £jt consists of those sets in Ck that are frequent Antimonotone non-succinct constraints Antimonotone non-succinct constraints are handled in the same way as the frequency constraint. Before counting, test that each candidate set satisfies all antimonotone non-succinct constraints. We can then drop a candidate set from counting if it does not satisfy all antimonotone non-succinct constraints (Strategy III). Non-antimonotone non-succinct constraints Similar to how they are handled in Apriori4", frequent sets in CAP are tested for sat-isfaction of non-antimonotone non-succinct constraints after they are generated. The only difference is that in CAP, we take advantage of any weaker constraints which can be induced from the non-antimonotone non-succinct constraints. For example, if we have the non-antimonotone non-succinct constraint avg(S.ITEMID) < v, we can deduce the weaker succinct constraint min(S.ITEMID) < v. Depending on whether the weaker induced constraint is antimonotone and/or succinct, strategies 22 I—III can be used to generate frequent sets with the weaker constraint. After gen-eration of frequent sets from the weaker constraint, the frequent sets still need to be tested for satisfaction of the original non-antimonotone non-succinct constraints (Strategy IV). , Handling Multiple Constraints The strategies described above were generalized for single 1-variable constraints. It is not always the case that we will only have one constraint of each type; therefore, strategies to handle multiple constraints of each type were given in [17]. To handle multiple constraints, we have to do the following: Antimonotone succinct constraints Apply Strategy 1 with the combined MGF of all the antimonotone succinct constraints Succinct non-antimonotone constraints Apply Strategy II with the combined MGF of all the succinct non-antimonotone constraints Antimonotone non-succinct constraints Apply Strategy III for all antimono-tone non-succinct constraints Non-antimonotone non-succinct constraints Apply Strategy IV to induce weaker constraints for each non-antimonotone non-succinct constraint Two-Variable Constraints The optimizations of the CAP algorithm described in [17] can only be applied on 1-variable constraints. Recent work presented at SIGMOD '99 by the authors of [17] discusses how to optimize CAP even further with some of the properties of two-variable constraints. In particular, the term quasi-succinctness has been in-troduced to describe two-variable constraints that can be reduced to two succinct one-variable constraints to optimize pruning and further increase performance of the CAP algorithm. The optimizations discussed in the SIGMOD '99 paper have 23 not been incorporated into this work, but more details regarding the matter can be found in [18]. The CAP Algorithm The CAP algorithm that the prototype uses is now presented in its official form. If we let Csam be the set of succinct antimonotone constraints, Csuc the set of succinct non-antimonotone constraints, Cam the set of antimonotone constraints and C n o n e the set of non-antimonotone non-succinct constraints, then we will get the algorithm shown in Figure 2.5. C A P Algorithm 1. i f C s a m U CSuc U Cn0ne i s non-empty, prepare C\ as in d i c a t e d i n Stra t e g i e s I , I I and IV; k = 1; 2. i f Csuc i s non-empty { (a) conduct db scan to form Lj as in d i c a t e d i n Strategy I I ; (b) form C2 as in d i c a t e d i n Strategy I I ; k = 2;} 3. 
CAP Algorithm

1. if Csam ∪ Csuc ∪ Cnone is non-empty, prepare C1 as indicated in Strategies I, II and IV; k = 1;
2. if Csuc is non-empty {
   (a) conduct db scan to form L1 as indicated in Strategy II;
   (b) form C2 as indicated in Strategy II; k = 2; }
3. while (Ck not empty) {
   (a) conduct db scan to form Lk from Ck;
   (b) form Ck+1 from Lk based on Strategy II if Csuc is non-empty, and Strategy III for constraints in Cam; }
4. if Cnone is empty, Ans = ∪Lk. Otherwise, for each set S in some Lk, add S to Ans iff S satisfies Cnone

Figure 2.5: CAP Algorithm

Chapter 3

Algorithm Implementation

One of the main features of the CAP prototype is that it is the first association mining program capable of handling constrained frequent set queries, and the first program to use the CAP algorithm, which is optimized to find constrained frequent sets. This optimization shows when running the same set of constraints under CAP and then under Apriori+, which is also implemented in the prototype: depending on the constrained frequent set query, the prototype running CAP can reach execution times up to 80 times faster than Apriori+.

One of the first goals in creating this prototype was implementing the Apriori+ and CAP algorithms. The implementations of the two were similar, except for the differences in how CAP handles constraints based on their antimonotonicity and succinctness. Therefore, Apriori+ was implemented first, and the optimizations of CAP were added afterwards to create the CAP algorithm. The following sections discuss some of the issues addressed during implementation of the algorithms and how certain facets of the algorithms were coded.
At first, LILists were used in the algorithm just to hold candidate and frequent set lists, but as the algorithm was expanded to incorporate constraint handling, LILists were also used to hold temporary results of queries and filters performed on sets. 3.2.3 Transaction Databases The next step in the implementation was deciding on the format of the transaction database. In order to decrease execution time, we wanted the transaction database reading module to be implemented so that the database is accessed as seldomly as possible. It was therefore decided to format transaction databases as binary files, separated into pages. We paged the transaction database because: • we do not know how big any one transaction is (it could contain anywhere from 1 . . . thousands of items), and • we can read in multiple pages at a time, thereby minimizing the number of times we have to 'access the database The transaction reading package reads 1400 pages from the transaction database at a time. To read transactions, each page is fed through a page parser which parses int s i z e ; int max; int count; int *Support; SetList *SL; /* number of items i n each set of the l i s t */ /* maximum number of sets allowable */ /* number of sets i n the l i s t */ /* pointer to support values */ /* pointer to sets */ } L I L i s t ; 27 the page to extract the individual transactions. A transaction itself is represented by the data structure: typedef struct { int count; int *set; } Trans; /* number of items in the set */ /* pointer to items in the set */ Basically, during candidate counting, a subset function uses these structures to test if candidate sets of type S e t L i s t are contained in transaction sets of type Trans. 3 . 3 G e n e r a t i o n o f S e t s The general implementation of candidate set generation was quite simple, and in-volved little more than what was literally described in the Apriori algorithm. Below is a brief discussion of how we generated large one-item sets and k-item candidate and frequent sets. 3.3.1 Generat ing Large One-Item Sets Currently, the prototype is hardcoded to read from only one item database, dbf i l e . s q l dbf i l e . s q l is a small item database containing the fields ITEMID, TYPE, ITEMNAME, and PRICE. The ITEMID and PRICE fields are integers, while the TYPE and ITEMNAME fields are strings. The database itself consists of 40 items. Base Apriori subsequently assumes that it is working with item IDs starting with 1 and ending at 40. It creates a S e t L i s t to store each item, and scans the transaction database to find the support of the sets. It then compares the support of each itemset to the minimum support and outputs only those 1-itemsets which are greater than or equal to the minimum support value. These 1-itemsets are stored in a current frequent set list ( L I L i s t ) , and are also appended to the list of all frequent sets. 28 3.3.2 Generat ing Candidate and Frequent Sets To generate candidate 2-itemsets, the algorithm joins the frequent 1-itemsets with themselves to create another list, but this time, of 2-itemsets. Apriori then finds the support of sets in this list by reading the transaction database, and testing the new found support of each set against the minimum support value. The new list of frequent 2-itemsets becomes the current frequent set list and gets appended to the list of all frequent sets. 
To generate candidate k-itemsets, the algorithm joins the (k - 1 item) sets in the current frequent set list with themselves, checks to see that every (k-1) subset of each generated k-item candidate set is frequent, finds the support on the candidate sets, and then compares the actual support of each set against the minimum support value. The new list of frequent itemsets becomes the current frequent set list and is appended to the list of all frequent sets. This of course continues until no more candidate sets can be generated. 3.3.3 Generat ing Candidate Pairs Both the CAP and Apriori4" algorithms can generate frequent sets for just the antecedent, just the consequent or both the antecedent and consequent, depending on the arguments given at the command line. When the user specifies that he wants two-variable answers, CAP/Apriori4" are called twice - once to find frequent sets for the antecedent, and once to find frequent sets for the consequent. The algorithms then do a cross product of the answer sets to get all the {antecedent, consequent} pairs. 3 . 4 Const ra in t Management Our CAP prototype is the only program which incorporates constraint handling into the mining process. Since no prototype or program has previously implemented this functionality, there was no model we could follow to design the implementation of 29 constraints. Therefore, the design of the constraint data structures took more time, but produced some general data structures to hold constraints of all types. 3.4.1 Reading Constraints Since Apriori+ and CAP have to filter frequent sets based on constraint satisfaction, we have to be able to read in constraints and store them into our system. Constraints are passed in from the CAP GUI to the algorithm via a file. When the user clicks on the "Run Algorithm..." button in the CAP GUI, the constraints are grouped into their respective types, parsed, and written into a constraint (text) file. The constraint file is then passed as a parameter to the algorithm. We decided to pass constraints from the GUI to the algorithm as a file instead of arguments because there can be an unlimited number of constraints in a query that can be handled by the CAP prototype. An example of a constraint file created by the CAP system is: F i e l d : ITEMID A Constraints aggrtag: count TO TYPE i s <= 3 S Constraints domaintag: TO ITEMID i s ssuperset 314 The above file states that there are two constraints on a variable whose domain is the ITEMID attribute. The first constraint, count(T0.TYPE) is < 3, is an antimonotone aggregate constraint (referring to the ' 'A Constraints'' line), and the second constraint, TO.ITEMID D {3,4}, is a succinct domain constraint. A constraint reader in the algorithm parses these constraint files and stores the constraints into constraint structures. A constraint structure has the following type definition: typedef struct constraint { char ctype; /* constraint type: either A f o r char* agg; char* fieldname; char* isornot; char* c r i t e r i a ; aggregate or D f o r domain */ /* aggregate */ /* fieldname */ /* either " i s " or " n o t " */ /* " > " , ' 'subset' ', etc */ 30 f l o a t value; int stringorvalue; SetList valueset; S t r L i s t valueset2; struct constraint* next_c; > CONSTRAINT; /* value in aggregate function */ /* 0 f o r domain comparison set of st r i n g type, 1 f o r integer type */ /* domain constraint comparison set (integer), i . e . 
Each constraint data structure holds either an aggregate or domain constraint. To illustrate what fields are used when different types of constraints are stored, we will give a few examples. A constraint structure holding the aggregate constraint sum(SO.PRICE) is < 3 will have the following fields:

    FIELD            VALUE
    ctype            A
    agg              sum
    fieldname        PRICE
    isornot          is
    criteria         <
    value            3
    stringorvalue    N/A
    valueset         N/A
    valueset2        N/A
    next_c           null

In the above example, the valueset and valueset2 fields are empty because the comparison in the constraint is with a value and not a set. The domain constraint SO.TYPE not superset {chips, snacks} contains a comparison with the set of strings {chips, snacks}; therefore, the valueset2 field of the structure holding this domain constraint will contain the StrList {chips, snacks}. This is shown in the next table.

    FIELD            VALUE
    ctype            D
    agg              N/A
    fieldname        TYPE
    isornot          not
    criteria         superset
    value            N/A
    stringorvalue    0
    valueset         N/A
    valueset2        {chips, snacks}
    next_c           null

The stringorvalue variable indicates whether the items in the comparison set are strings or integers.

Next we had to handle two-variable constraints, which required a different data structure. We wanted to read the two-variable constraints into the system like the one-variable constraints; therefore, we made the two-variable constraint data structure similar to the constraint data structure:

    typedef struct tvconstraint {
        char ctype;           /* either A for aggregate, R for relational, or S for set */
        char* aggS;           /* aggregate of antecedent set */
        char* fieldnameS;     /* fieldname of antecedent set */
        char* isornot;        /* is or not */
        char* criteria;       /* ">", "subset", etc. */
        char* aggT;           /* aggregate of consequent set */
        char* fieldnameT;     /* fieldname of consequent set */
        float value;          /* value to compare against for aggregate constraint */
        char* operators;      /* either "union" or "intersect" for relational constraint */
        int stringorvalue;    /* 0 for string set, 1 for integer set */
        SetList valueset;     /* integer set used in relational constraint */
        StrList valueset2;    /* string set used in relational constraint */
        struct tvconstraint* next_c;  /* pointer to the next two-variable constraint */
    } TVCONSTRAINT;

We must note that there will usually be non-applicable fields in the constraint data structures for different types of constraints, and that, technically, this is a waste of memory space. However, since there will usually not be an outrageous number of constraints, the unused space is minimal.

A linked list of constraints of each type is created during the constraint reading phase. In the algorithm, a list called "Aconstraints" holds all antimonotone (only) constraints, and a list called "TVconstraints" holds all two-variable constraints. In retrospect, this is not really necessary for the Apriori+ algorithm, since it handles all types of constraints similarly. However, since CAP and Apriori+ use the same constraint reader, the GUI and the algorithm do not distinguish between CAP and Apriori+ when writing constraints into constraint data structures.
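As an illustration of where a parsed constraint ends up, the sketch below builds a CONSTRAINT node for the aggregate constraint sum(SO.PRICE) is < 3 and prepends it to an Aconstraints-style list. The helper function and the trimmed-down structure are assumptions made for the example, not the prototype's actual reader code.

    #include <stdlib.h>
    #include <string.h>

    typedef struct constraint {
        char ctype;                 /* A for aggregate, D for domain */
        char *agg;                  /* aggregate name */
        char *fieldname;            /* attribute name */
        char *isornot;              /* "is" or "not" */
        char *criteria;             /* "<", "superset", ... */
        float value;                /* comparison value */
        struct constraint *next_c;  /* next constraint in the list */
    } CONSTRAINT;                   /* domain-set fields omitted for brevity */

    /* Hypothetical helper: build an aggregate constraint node.
       (strdup is POSIX; copy strings by hand on other platforms.) */
    static CONSTRAINT *makeAggConstraint(const char *agg, const char *field,
                                         const char *crit, float value)
    {
        CONSTRAINT *c = malloc(sizeof *c);
        if (!c) return NULL;
        c->ctype     = 'A';
        c->agg       = strdup(agg);
        c->fieldname = strdup(field);
        c->isornot   = strdup("is");
        c->criteria  = strdup(crit);
        c->value     = value;
        c->next_c    = NULL;
        return c;
    }

    int main(void)
    {
        CONSTRAINT *Aconstraints = NULL;    /* antimonotone constraint list */
        CONSTRAINT *c = makeAggConstraint("sum", "PRICE", "<", 3.0f);
        if (c) {
            c->next_c    = Aconstraints;    /* prepend to the linked list */
            Aconstraints = c;
        }
        return 0;
    }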
3.4.2 Applying Constraints

Querying the Item Database

When CAP or Apriori+ are executed, the first thing they do is read in the item database, dbfile.sql; each tuple (item) in the database is then stored in a dbrecord structure:

    typedef struct dbrecord {
        short int itemid;              /* ITEMID field */
        char itemname[CBUFSIZE];       /* ITEMNAME field */
        char itemtype[CBUFSIZE];       /* TYPE field */
        short int itemprice;           /* PRICE field */
    } DBRECORD;

The dbrecords for all items are kept in an array representing the database. In general, we should not be storing the whole database in memory, since real-life databases have on average hundreds of thousands of records. However, since this is a prototype of what can be done with association mining, we decided to keep everything simple, but fast, which is why we wanted to read the database from memory instead of storage.

Constraint verification on sets is implemented in a query/filter package. The query functions query the database array to retrieve all the necessary information. For the reader's reference, the item database dbfile.sql is printed out in Table 3.1.

    ITEMID   ITEMNAME      TYPE        PRICE
    1        Doritos       snacks      3
    2        Cheetos       snacks      4
    3        Tostitos      snacks      3
    4        Bugles        snacks      2
    5        Coke          soda        1
    6        Sprite        soda        2
    7        Jolt          soda        1
    8        Swiss         cheese      5
    9        Cheddar       cheese      6
    10       Mozzarella    cheese      5
    11       Parkay        butter      3
    12       Imperial      butter      2
    13       Foremost      butter      3
    14       Tropicana     juice       5
    15       Welchs        juice       6
    16       Sunkist       juice       6
    17       Whisky        alcohol     10
    18       Scotch        alcohol     12
    19       Gin           alcohol     11
    20       Tequila       alcohol     10
    21       Oreo          cookies     4
    22       Bretons       cookies     3
    23       Ritz          cookies     5
    24       Arrowroot     cookies     3
    25       Timeout       chocolate   1
    26       Aero          chocolate   2
    27       Choclairs     chocolate   1
    28       Kitkat        chocolate   2
    29       Shreddies     cereal      4
    30       Cheerios      cereal      4
    31       Muslix        cereal      5
    32       Honeycomb     cereal      3
    33       Carrot        vegetable   3
    34       Radish        vegetable   2
    35       Broccoli      vegetable   3
    36       Tomato        vegetable   2
    37       Onion         vegetable   2
    38       Celery        vegetable   3
    39       Cauliflower   vegetable   3
    40       Lettuce       vegetable   4

    Table 3.1: Item Database dbfile.sql

The query module contains many different types of query functions. Examples include functions which query information such as the TYPE, PRICE, and ITEMNAME of an item, the ITEMIDs of all items whose TYPE is "snacks", or the ITEMIDs of all items whose PRICE is > $3.00. In the prototype, queries are implemented by performing linear searches through the database array.

Filtering Sets

To find if a set satisfies a given constraint, we have to filter the set by testing the constraint on it. The filter package contains functions which check if sets satisfy constraints. Base functions such as min, max, count, avg, sum, subset, superset, and = were first created. These functions were then used in other, more general procedures which took in arguments such as sets, and names of aggregates or set comparators to apply to the sets. Switch statements directed control to the appropriate base functions. A wrapper function, OKConstraint, was created, reading in a constraint list and a set as arguments and returning TRUE or FALSE, depending on whether the set satisfied all constraints in the constraint list. In Apriori+, after generation of each group of k-item frequent sets, the OKConstraint function is used to check that each k-itemset satisfies the constraints. Only itemsets which satisfy all constraints will be displayed. It is necessary to note that in Apriori+, all k-itemsets are used in the generation of the (k+1)-item candidate sets, regardless of whether or not the individual sets satisfy the constraints.
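To make the query/filter machinery concrete, here is a minimal sketch of a linear-search query and a filter built on top of it, using a three-record stand-in for dbfile.sql; the function names are illustrative and not the prototype's actual query module.

    #include <stdio.h>

    #define CBUFSIZE 32

    typedef struct dbrecord {
        short int itemid;
        char itemname[CBUFSIZE];
        char itemtype[CBUFSIZE];
        short int itemprice;
    } DBRECORD;

    static DBRECORD db[] = {               /* tiny stand-in for dbfile.sql */
        {1, "Doritos", "snacks", 3},
        {5, "Coke",    "soda",   1},
        {8, "Swiss",   "cheese", 5},
    };
    static const int dbsize = 3;

    /* Linear search: return the PRICE of an item, or -1 if not found. */
    int QueryPriceOfItem(int itemid)
    {
        for (int i = 0; i < dbsize; i++)
            if (db[i].itemid == itemid)
                return db[i].itemprice;
        return -1;
    }

    /* Filter-style check: does sum of prices of a set stay below a bound? */
    int SatisfiesSumPriceLess(const int *set, int len, int bound)
    {
        int sum = 0;
        for (int i = 0; i < len; i++)
            sum += QueryPriceOfItem(set[i]);
        return sum < bound;
    }

    int main(void)
    {
        int s[] = {1, 5};
        printf("%d\n", SatisfiesSumPriceLess(s, 2, 6));  /* 3 + 1 < 6 -> 1 */
        return 0;
    }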
Query, filter and aggregate functions also exist for two-variable constraints. For example, a function applytwoagg has been implemented which tests if an aggregate applied on one itemset is less than or greater than an aggregate applied on another itemset. Another function, applytwoset, tests if one itemset is a subset, strict subset, superset or strict superset of another itemset. In addition, two-variable constraint checking is implemented similarly to one-variable constraint checking. The algorithms read two-variable constraints one by one from the TVconstraints list, and call the appropriate two-variable filter or query function on the pair of itemsets being checked. Pairs which satisfy all two-variable constraints are output by the algorithms.

One of the reasons why we hardcoded the attributes of dbfile.sql into the algorithm was to facilitate the implementation of the query functions. Had we made the database reader and query modules general enough to accept different types of databases with different attributes, we would have had to concern ourselves more with database implementation issues rather than focusing on creating working versions of Apriori+ and CAP to complete our prototype. It was subsequently decided to hardcode the metadata of the item database in the algorithms, but leave the records within the item database changeable.

3.5 Optimizing for CAP

Even though Apriori+ was sufficient for our needs to express focus, we wanted the prototype to run as fast as possible, since speed is a quality that modern data mining programs running Apriori and its hybrids have still not achieved. Data mining will never reach its peak unless the process becomes almost instantaneous, and this is the reason why the CAP algorithm is implemented in the prototype. We want our prototype to incorporate the fastest algorithm possible to find the constrained frequent sets.

Consequently, after Apriori+ was completely implemented, we added the optimization strategies described in [17] to create the CAP algorithm. In general, it was thought that this only involved changing how the base algorithm handled the different types of constraints. The task was this simple for most constraints, but proved to be quite a chore when it came time to implement the handling of succinct constraints. Nevertheless, the final product was worth the trouble when we saw that the prototype could run a handful of constraints on a 10,000 transaction database in under a couple of seconds.

3.5.1 Antimonotone Constraints

The first and easiest step taken during the implementation of CAP was incorporating the selectivity of antimonotone constraints. According to Strategy III on page 22, a set that does not satisfy an antimonotone constraint can be immediately pruned from the list of candidate sets and does not need to be counted. The functions to do the checking were already implemented for Apriori+, so this extension was trivial. Strategy III was implemented as follows: after all candidate sets were generated, we tested the sets by checking that each set satisfied all antimonotone constraints. If a set satisfied all antimonotone constraints, it got counted; otherwise, it was pruned from the candidate list and was not used to generate future candidate sets.
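A minimal sketch of this pruning step is given below; the candidate representation and the example antimonotone check (count(S) ≤ 3) are illustrative assumptions, not the prototype's actual code.

    #include <stdio.h>

    typedef struct {
        int items[8];
        int len;
    } Cand;

    /* Example antimonotone constraint: count(S) <= 3. */
    int OKAntiMonotone(const Cand *c) { return c->len <= 3; }

    /* Strategy III: compact the candidate array in place, dropping
       candidates that fail an antimonotone constraint, so they are
       never counted and never joined at the next level. */
    int pruneCandidates(Cand *cands, int n)
    {
        int kept = 0;
        for (int i = 0; i < n; i++) {
            if (OKAntiMonotone(&cands[i]))
                cands[kept++] = cands[i];   /* keep: will be counted */
            /* else: pruned before counting */
        }
        return kept;                        /* new candidate count */
    }

    int main(void)
    {
        Cand cands[3] = {
            {{1, 2}, 2},
            {{1, 2, 3, 4}, 4},   /* fails count(S) <= 3 */
            {{3, 5}, 2},
        };
        printf("%d candidates survive\n", pruneCandidates(cands, 3));
        return 0;
    }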
3.5.2 Succinct Antimonotone Constraints

The second type of constraint to account for was constraints which are both antimonotone and succinct. According to Strategy I on page 21, we can replace the level 1 list of candidate sets with only those level 1 sets which satisfy all succinct antimonotone constraints. To incorporate this into the CAP algorithm, functions which generate lists of sets satisfying given criteria had to first be created. For example, the function QueryIDOfType was created to take one string argument querystring, find all items whose type is the same as querystring, and return them in a list. Then when, for example, the constraint SO.TYPE ⊆ {chips, snacks} is encountered, the QueryIDOfType function is executed, finding all items whose TYPE is "chips" or "snacks" and concatenating the items into the new level 1 candidate set list. To handle multiple constraints which are antimonotone and succinct, the algorithm intersects the lists of sets which satisfy the individual constraints, thereby getting the list of 1-item sets which satisfy all antimonotone succinct constraints. This list of 1-item sets then replaces the original level 1 candidate sets.

3.5.3 Succinct Constraints

The third type of constraint dealt with, and the one the author had the most difficulty with, was the succinct constraints. The problems with the succinct constraints will be discussed in later sections. For now, let us describe how we implemented the succinct constraints by first itemizing the four main types of succinct constraints in Table 2.1:

1. v ∈ S
2. S ⊇ V
3. min(S) ≤ v
4. max(S) ≥ v

The CAP algorithm has to know the MGFs of all the succinct constraints, since it was previously discussed in Section 2.3.2 that every succinct constraint has a member generating function (MGF) which is capable of characterizing or generating the itemsets which satisfy the constraint. So, first the MGFs of all the succinct (only) constraints a user can specify in the prototype had to be found. These are given below for the reader's reference; sets holding the results of queries are denoted by Ii:

1. v ∈ S
   • find all items where v is the value of the S attribute (I1)
   • find all items where v is not the value of the S attribute (I2)
   • MGF: {X1 ∪ X2 | X1 ⊆ I1 & X1 ≠ ∅ & X2 ⊆ I2}

2. S ⊇ V
   • for each value vi in the set V (where V has n - 1 values), find all items where vi is the value on the attribute of S (Ii)
   • find all items which do not have any of the values in V as the value on the attribute of S (In)
   • MGF: {X1 ∪ ... ∪ Xn | X1 ⊆ I1 & X1 ≠ ∅ & ... & X(n-1) ⊆ I(n-1) & X(n-1) ≠ ∅ & Xn ⊆ In}

3. min(S) ≤ v
   • find all items whose S attribute is ≤ v (I1)
   • find all items whose S attribute is > v (I2)
   • MGF: {X1 ∪ X2 | X1 ⊆ I1 & X1 ≠ ∅ & X2 ⊆ I2}

4. max(S) ≥ v
   • find all items whose S attribute is ≥ v (I1)
   • find all items whose S attribute is < v (I2)
   • MGF: {X1 ∪ X2 | X1 ⊆ I1 & X1 ≠ ∅ & X2 ⊆ I2}

From now on, we will use the notation Xi to refer to the required (≠ ∅) sets (for 1 ≤ i ≤ n - 1) of an MGF. Non-required sets will be denoted by Xn.

With the MGFs of all the succinct constraints, we were now able to implement the succinct constraint handler. If constraints of form (1), (2) (only in situations where |V| = 1), (3), or (4) exist, the CAP algorithm takes all items from the required sets of the respective MGF as the level 1 candidate sets. However, in the case that succinct antimonotone constraints have also been created, the succinct 1-item candidate set list is intersected with the succinct antimonotone 1-item candidate set list to get a list of all 1-itemsets satisfying both SAM and SUC constraints.
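As an illustration of how the MGF of a single succinct constraint drives level 1 candidate generation, the sketch below partitions a toy item table for min(S.PRICE) ≤ v into the required set I1 and the optional set I2; the data, names, and array-based representation are assumptions for the example only.

    #include <stdio.h>

    typedef struct { int itemid; int price; } Item;

    static Item db[] = { {1,3}, {2,4}, {3,3}, {4,2}, {5,1} };
    static const int dbsize = 5;

    /* For min(S.PRICE) <= v: I1 = items with price <= v (required),
       I2 = items with price > v (optional). Every frequent set must
       draw at least one item from I1. Here we just materialize the
       two lists. */
    int partition(int v, int *I1, int *I2, int *n1, int *n2)
    {
        *n1 = *n2 = 0;
        for (int i = 0; i < dbsize; i++) {
            if (db[i].price <= v) I1[(*n1)++] = db[i].itemid;
            else                  I2[(*n2)++] = db[i].itemid;
        }
        return *n1 > 0;   /* constraint unsatisfiable if I1 is empty */
    }

    int main(void)
    {
        int I1[5], I2[5], n1, n2;
        if (partition(2, I1, I2, &n1, &n2)) {
            printf("required I1:");
            for (int i = 0; i < n1; i++) printf(" %d", I1[i]);
            printf("\noptional I2:");
            for (int i = 0; i < n2; i++) printf(" %d", I2[i]);
            printf("\n");
        }
        return 0;
    }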
If a succinct superset constraint exists where |V| > 1, then the CAP algorithm proceeds as detailed in [17]. In level 1, we first find the support of each item in the sets X1 ... Xn. The frequent required sets are stored in L1^Xi for 1 ≤ i ≤ n - 1, while the frequent non-required set is stored in L1^Xn. All the 1-itemsets in L1^Xi for 1 ≤ i ≤ n - 1 are then joined to produce the candidate 2-item set list. Support is found on this candidate set list and the frequent sets are then computed. After the computation of the (n-1)th level of frequent sets, where n - 1 is the count of the minimum number of required sets in the MGF, the next and all future iterations of candidate sets are created by joining L(n-1) with (L1^X1 ∪ L1^X2 ∪ ... ∪ L1^Xn). We finally find the frequent sets to create Ln. In addition, starting with C(n+1), all sets created have to pass an additional pre-candidate test before counting to ensure that each is a "valid" candidate set. The pre-candidate test is shown in the following lemma.

Lemma 2 A set T is in Ck iff: for all T' such that T' ⊂ T and |T'| = k - 1 and T' ∩ L1^X1 ≠ ∅ & T' ∩ L1^X2 ≠ ∅ & ... & T' ∩ L1^Xm ≠ ∅, we have T' ∈ L(k-1).

Multiple Succinct Constraints

The above discussion only described how the prototype handles single succinct constraints. Next, we had to account for cases where a query contained multiple succinct constraints. The CAP paper proposed that the MGFs of multiple succinct constraints could be merged into one MGF using Lemma 1. Candidate sets satisfying all succinct constraints could then be generated using this one MGF. The actual implementation of this strategy in C is described in the paragraphs below.

For the MGF of each succinct constraint, a list mgflist containing the required set(s) (X1 ... X(n-1)) and the non-required set (Xn) is stored. For example, suppose we are dealing with the following succinct constraint:

(1) SO.PRICE ⊇ {1, 2, 3, 4}

Further, suppose the required sets of the MGF of this constraint are:

1a. {1, 5} (items with a price of 1)
1b. {2, 7} (items with a price of 2)
1c. {3, 8} (items with a price of 3)
1d. {4, 6} (items with a price of 4)

and the non-required set is:

1e. {9, 10} (items whose price is not 1, 2, 3, or 4)

Then, in the implementation, the sets (1a) through (1e) will be stored in the given order in the mgflist for the constraint. Now, let our second constraint be:

(2) SO.TYPE ⊇ {chips, snacks}

The mgflist corresponding to this constraint will hold the required sets:

2a. {1, 2, 3, 4} (chip items)
2b. {5, 6, 7, 8} (snack items)

and the non-required set:

2c. {9, 10} (items which are neither chips nor snacks)

According to Lemma 1, in order to generate the candidate 1-item sets satisfying these two constraints, we have to intersect each constraint's mgflist with the other constraint's mgflist. This results in a total of 15 sets created by the intersections.

    Constraint 1 / Constraint 2    1a       1b       1c       1d       1e
    2a                             Set 1    Set 2    Set 3    Set 4    Set 9
    2b                             Set 5    Set 6    Set 7    Set 8    Set 10
    2c                             Set 11   Set 12   Set 13   Set 14   Set 15

Sets 1-8 are the "required sets" of the new MGF; that is, they cannot be ∅. Frequent sets satisfying the two constraints must contain at least one item from each of the sets 1-8. Items from sets 9-15, on the other hand, are optional. In the implementation, sets 1-15 are stored in a new mgflist in their enumerated order. Each set in the mgflist can then be accessed through pointers; note that the non-required sets are always stored at the end of the mgflist.
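The intersection bookkeeping of Lemma 1 can be sketched as follows, reusing the PRICE/TYPE example above with item lists represented as sorted integer arrays; this illustrates the idea rather than the prototype's actual mgflist code.

    #include <stdio.h>

    /* Intersect two sorted item lists; returns result size. */
    int intersect(const int *a, int na, const int *b, int nb, int *out)
    {
        int i = 0, j = 0, k = 0;
        while (i < na && j < nb) {
            if      (a[i] < b[j]) i++;
            else if (a[i] > b[j]) j++;
            else { out[k++] = a[i]; i++; j++; }
        }
        return k;
    }

    int main(void)
    {
        /* mgflist of SO.PRICE superset of {1,2,3,4}: 1a-1d required, 1e optional */
        int p1[] = {1,5}, p2[] = {2,7}, p3[] = {3,8}, p4[] = {4,6}, pn[] = {9,10};
        const int *m1[] = {p1, p2, p3, p4, pn}; int s1[] = {2,2,2,2,2};

        /* mgflist of SO.TYPE superset of {chips,snacks}: 2a-2b required, 2c optional */
        int t1[] = {1,2,3,4}, t2[] = {5,6,7,8}, tn[] = {9,10};
        const int *m2[] = {t1, t2, tn}; int s2[] = {4,4,2};

        int out[8];
        for (int j = 0; j < 3; j++) {          /* rows 2a, 2b, 2c */
            for (int i = 0; i < 5; i++) {      /* columns 1a .. 1e */
                int k = intersect(m1[i], s1[i], m2[j], s2[j], out);
                int required = (i < 4 && j < 2);   /* both input lists required */
                printf("set (%d,%d): size %d%s\n", j + 1, i + 1, k,
                       (required && k == 0) ? "  <-- empty required set!" : "");
            }
        }
        return 0;
    }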
The items in each set of the mgflist are subsequently tested for support. If an item is found to be frequent, it is left in its set; otherwise, the item is taken out of its set. The resulting mgflist points to the frequent-item sets L1^X1 ... L1^Xn described in Strategy II on page 22. This mgflist is used throughout the algorithm to succinctly generate candidates.

Succinct Constraints - Problem 1

Interestingly, testing the multiple succinct constraint handler we implemented indicated that there was a flaw in the MGF merging formula. According to [17], multiple MGFs can be combined into one MGF using Lemma 1. However, we noticed that this formula was not generalized enough for all cases. Consider the case where we have the constraints:

1. SO.TYPE ⊇ {juice, chocolate}
2. SO.ITEMNAME ⊇ {Tropicana, Aero}

Tropicana is a type of juice and Aero is a type of chocolate. Now, suppose we have the following required lists for each constraint's MGF:

1a. juice items: {1, 3, 5, 7, 9}
1b. chocolate items: {2, 4, 6, 8, 10}
2a. Tropicana: {1}
2b. Aero: {2}

Then, according to the multiple MGF combination formula, we get the table below when intersecting the required lists of the two MGFs:

    Constraint 1 / Constraint 2    1a     1b
    2a                             {1}    {}
    2b                             {}     {2}

When joining MGFs, intersections of required sets must not equal the empty set. If any of the results is the empty set, then there are no sets satisfying the constraint(s). In the example given above, empty sets were produced; consequently, there should not be any frequent sets. However, surprisingly, we did in fact find sets satisfying the constraints, such as {Tropicana, Aero} and {Tropicana, Aero, Bretons}.

This problem was thought over carefully. One solution suggested was to take only the sets in the diagonal of the table produced above as the required sets (i.e. (1a) intersect (2a) and (1b) intersect (2b)). However, since items in a list can come in any random order, it is difficult to choose one particular diagonal to work from. This problem, of course, becomes even more difficult to solve when extended to more than two constraints. It was finally decided to leave the matter unresolved, and instead modify our algorithm to handle only multiple succinct constraints whose MGFs, when joined, produce only one required set, thereby eliminating all diagonal problems.

Succinct Constraints - Problem 2

Another problem arose when implementing the superset constraint. With the superset constraint, we have to make sure that when generating the candidate sets C1 ... C(n-1) (where n - 1 is the number of required sets in the constraint's MGF), each item in each set of Ck comes from a different required set Xi. This invariant is held true in the implemented program by storing, for each candidate and frequent set, a set indexset containing indices to the different required lists. More specifically, for each item in each set of the mgflist, the index of the required set Xi that the item comes from is stored for the item. When joining frequent itemsets, the algorithm checks to make sure the index from one item of the join and the index from the other item of the join are not the same.

For example, suppose we were dealing with two-item sets, and we wanted to join {1, 2} with {1, 4}. Now, further suppose that the succinct constraint was SO.TYPE ⊇ {chips, snacks, beer}, and we have the following required lists:

1. chip items: {1, 3, 5, 7}
2. snack items: {2, 4, 6, 8}
3. beer items: {9, 10, 11, 12}

Then, joining {1, 2} and {1, 4} would give us {1, 2, 4}; however, this set does not satisfy the given superset constraint.
The types of the items in {1, 2, 4} form a subset of {chips, snacks, beer} as opposed to a superset. As said earlier, the author solved the problem by storing, for each itemset, a temporary set which tracks which required sets the items in the itemset come from. The itemset {1, 2} would have a corresponding indexset of {1, 2} (item 1 comes from list 1 and item 2 comes from list 2), and the itemset {1, 4} would have a corresponding indexset of {1, 2} as well. When joining the itemsets {1, 2} and {1, 4} into a 3-item set, we check that their indexsets, when joined, produce a 3-item set with unique items. This is not the case in the example; therefore, the algorithm does not produce the candidate set {1, 2, 4}.

Succinct Constraints - Problem 3

The last problem which came up while implementing succinct constraints was multiple generation of the same itemsets during candidate creation. When a succinct constraint exists, the first n - 1 iterations of CAP (where n - 1 is the number of required sets in the MGF) are handled the same way as they are in Apriori+: generate the 1-item frequent sets, join to create the 2-item candidate sets, join the 2-item frequent sets, ..., create the (n-1)-item candidate sets. At the (n-1)th level, however, we do not join the (n-1)-item frequent sets together to create the n-item candidate sets. We create the n-item candidate sets by individually adding frequent items from L1^X1 ... L1^Xn to each (n-1)-item frequent set. The generated n-item sets then have to pass the pre-candidate test given in Lemma 2 before counting.

With this proposed framework, we run into the problem that a candidate set can be generated more than once. To illustrate this, suppose we have the following 3-itemsets:

• {1 2 3}
• {1 2 4}
• {1 3 4}
• {2 3 4}

And our list of frequent items is {1}, {2}, {3} and {4}. Then, individually adding each frequent item to each frequent 3-itemset, we get:

• {1 2 3 4}
• {1 2 3 4}
• {1 2 3 4}
• {1 2 3 4}

In the above example, it can be seen that we end up generating multiple copies of the same candidate set. The original fix to this problem was to check, for each candidate being generated, whether or not the same candidate had already been previously generated and added to the current candidate list. Unfortunately, this solution, in addition to all the pre-candidate testing we have to perform, compromises performance and efficiency. Another solution was developed, described as follows.

In the simplest case, for any k-item candidate set created, there should be k (k-1)-item sets which can be used to generate the candidate. Therefore, there will be k instances of the same candidate set being generated. The reader can verify this in the example above. For the succinct case, there may not necessarily be k instances of repeated generation, since not all permutations will be valid if they do not satisfy the succinct constraints. To show what we mean, let us apply this to the following candidate set:

{1, 2, 3, 4, 5}

Suppose that item 1 is an item from required set X1 and item 2 is an item from required set X2. Since items 1 and 2 have to be in each set, the only possible permutations which can now be used to generate the set {1, 2, 3, 4, 5} are:

• {1, 2, 3, 4}
• {1, 2, 4, 5}
• {1, 2, 3, 5}

In this case, the number of generations of the same candidate set is only three, and not five. Given this, we can preliminarily hypothesize that the number of repeated generations is directly related to the number of "required" items in a set.
Using the same set {1, 2, 3, 4, 5}, let us now say that the items {1, 3} belong to the required set X1, and the items {2, 4} belong to the required set X2. Then, the permutations that can be used to generate {1, 2, 3, 4, 5} are:

• {1, 2, 3, 4}
• {1, 2, 4, 5}
• {2, 3, 4, 5}
• {1, 3, 4, 5}
• {1, 2, 3, 5}

The number of repetitions is now five again. Using these two examples, a simple formula was created to calculate the number of times a candidate set A of size k should be repeatedly generated by a succinct constraint. If we let:

    m1 = number of items in A from required set X1
    m2 = number of items in A from required set X2
    ...
    m(n-1) = number of items in A from required set X(n-1)

we can then set the variables p_mi for 1 ≤ i ≤ n - 1, where:

    p_mi = 0 if mi > 1 or mi < 1
    p_mi = 1 if mi = 1

The final formula is:

    Number of generations of A = k - p_m1 - p_m2 - ... - p_m(n-1)

With the above equation, we can modify the algorithm to skip the pre-candidate testing phase. By checking whether the value of this formula on a candidate set is equal to the actual number of times the set is repeatedly generated, we have essentially performed pre-candidate testing, and can therefore remove the test completely from the process. For k > n - 1, the algorithm generates candidate sets by appending items in L1^X1 ... L1^Xn to the sets in L(k-1). As each set is created, it is checked to see if it has already been previously added to the candidate list. If the set has not yet been generated, it is added to the candidate list, and a counter for the set is created and incremented. If the set has been previously generated, then the counter will already exist and is incremented; the set itself will not be added again to the candidate list. When checking if a candidate set is valid, we apply the formula above to see if the calculated value corresponds to the count of how often the set has been created (as stored by the set's counter). By doing this, we can automatically deduce if a set is a valid candidate without having to check all its subsets.

3.5.4 Non-antimonotone Non-succinct Constraints

The last type of constraint to handle was the non-antimonotone non-succinct constraints. In general, satisfaction of non-antimonotone non-succinct constraints is checked after counting is performed on a candidate set. Those sets that do not satisfy all of the non-antimonotone non-succinct constraints are not displayed in the output. All itemsets are, however, used in the processing of the next level of candidates. This is identical to how constraints were handled in Apriori+.

The CAP paper suggests a modified way to handle non-antimonotone non-succinct constraints. When a non-antimonotone non-succinct constraint is passed to the CAP algorithm, we can add a weaker constraint implied by the non-antimonotone non-succinct constraint and process itemsets with the weaker constraint. The weaker constraint should of course be an antimonotone, succinct, or antimonotone succinct constraint to maximize the pruning capabilities of the CAP framework. For example, the constraint avg(SO.ITEMID) < 5 can induce a weaker constraint, min(SO.ITEMID) < 5. This weaker succinct constraint can then be used to prune all sets which do not satisfy its criteria; in turn, any sets which do not satisfy the min constraint of course do not satisfy the avg constraint. After counting is completed, all remaining sets are still checked for satisfaction of the original non-antimonotone non-succinct constraint, since sets which satisfy the weaker min constraint may still not satisfy the original avg constraint.

Implementation-wise, as non-antimonotone non-succinct constraints are read into constraint structures, we check to see if weaker constraints can be induced from them. If a weaker constraint can be induced, it is stored into a constraint structure and the structure is then added to the corresponding constraint list (antimonotone, succinct, or antimonotone succinct). Table 3.2 lists some of the weaker constraints that are implied by non-antimonotone non-succinct constraints.

    NANS Constraint      Weaker Constraint
    count(S) = v         count(S) ≤ v
    sum(S) = v           sum(S) ≤ v
    avg(S) < v           min(S) < v
    avg(S) > v           max(S) > v
    avg(S) = v           min(S) ≤ v

    Table 3.2: Weaker constraints induced from non-antimonotone non-succinct constraints
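A sketch of the induction step, mapping constraints of the shapes in Table 3.2 to their weaker forms, might look like the following; the enum-based constraint representation and the function name are illustrative stand-ins for the prototype's constraint structures.

    #include <stdio.h>

    typedef enum { AGG_COUNT, AGG_SUM, AGG_AVG, AGG_MIN, AGG_MAX } Agg;
    typedef enum { OP_LE, OP_GE, OP_EQ } Op;

    typedef struct { Agg agg; Op op; float value; } SimpleConstraint;

    /* Induce a weaker constraint from a non-antimonotone non-succinct
       one, following Table 3.2 (strict < and > behave analogously).
       Returns 1 and fills *weak on success, 0 if no rule applies. */
    int induceWeaker(const SimpleConstraint *c, SimpleConstraint *weak)
    {
        if (c->op == OP_EQ && (c->agg == AGG_COUNT || c->agg == AGG_SUM)) {
            *weak = *c;
            weak->op = OP_LE;                 /* agg(S) = v  ->  agg(S) <= v */
            return 1;
        }
        if (c->agg == AGG_AVG) {
            *weak = *c;
            if (c->op == OP_GE) weak->agg = AGG_MAX;  /* avg >= v -> max >= v */
            else                weak->agg = AGG_MIN;  /* avg <= v, avg = v -> min */
            if (c->op == OP_EQ) weak->op = OP_LE;     /* avg = v -> min <= v */
            return 1;
        }
        return 0;
    }

    int main(void)
    {
        SimpleConstraint c = { AGG_AVG, OP_LE, 5.0f }, w;
        if (induceWeaker(&c, &w))
            printf("weaker constraint: agg=%d op=%d v=%g\n", w.agg, w.op, w.value);
        return 0;
    }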
3.6 Relationship Calculation

After optimizing the prototype with CAP, the next item on the list was to implement the relationship calculation algorithm. The prototype shows that the new model can handle more flexible notions of relationship, and does this by providing three different types of relationship recognized by the relationship calculation algorithm, versus only one in the classical model. In fact, users can calculate relationships based on confidence, correlation or dependence, and can provide relevant thresholds for each.

The relationship calculation algorithm itself takes in six arguments. The command line to call the relationship calculation algorithm is:

    capapr metric minSup minThresh pairedItemsetFile Strans Ttrans

The first argument is the metric type, which is either "confidence", "correlation" or "dependence". The second argument is a minimum support value used only for confidence calculation. The third argument is the minimum threshold value, which is the value a set must have before it is deemed "significant". The fourth argument is the name of a paired itemset file containing the pairs of itemsets we want to find relationships on, and the last two arguments are the file names of the transaction databases that we will be extracting itemset support from.

The paired itemset file is a file created by the GUI containing the {antecedent, consequent} pairs the user is interested in. Lines in the paired itemset file have the format:

    supportS|S|supportT|T

where supportS is the support of the antecedent set S in transactions, and supportT is the support of the consequent set T. The lines below are taken from a paired itemset file created by the CAP GUI:

    2501|1 2 3 4|3502|2 3 4 5
    2501|1 2 3 4|1231|2 3 4 6
    1210|1 3 4 5|2312|1 3 5 6
    1210|1 3 4 5|2121|2 4 5 6

The actual itemsets and supports represented by these lines are:

• Antecedent {1, 2, 3, 4} with support of 2501 and consequent {2, 3, 4, 5} with support of 3502
• Antecedent {1, 2, 3, 4} with support of 2501 and consequent {2, 3, 4, 6} with support of 1231
• Antecedent {1, 3, 4, 5} with support of 1210 and consequent {1, 3, 5, 6} with support of 2312
• Antecedent {1, 3, 4, 5} with support of 1210 and consequent {2, 4, 5, 6} with support of 2121

To create the paired itemset file, the system collects all pairs the user selects, and extracts their supports using the frequent set output from Phase I (the CAP algorithm) that it has stored internally. This is done in the GUI to decrease execution time of the relationship calculation algorithm.

Not all metric types require supplying the supports for both S and T in the paired itemset file. The confidence metric requires the support for the antecedent, but not the consequent. The dependence metric requires support values for both the antecedent and consequent, and the correlation metric does not require support values for either one.
Reasons for requiring or not requiring one support value or the other are related to the relationship calculation formula of the chosen metric. For example, confidence does not require the consequent support value to calculate confidence values, whereas dependence does.

In the relationship calculation algorithm, lines from the paired itemset file are first read in and parsed with a parsing function separating the sets and the support values. Each pair of sets and support values is subsequently stored in an association rule record:

    typedef struct {
        int support;      /* support of S union T */
        char STstatus;    /* 'y' if we know from input the support of S union T, 'n' otherwise */
        SetList S;        /* the antecedent set */
        int supportS;     /* support of the antecedent set */
        char Sstatus;     /* 'y' if we know from input the support of S, 'n' otherwise */
        SetList T;        /* the consequent set */
        int supportT;     /* support of the consequent set */
        char Tstatus;     /* 'y' if we know from input the support of T, 'n' otherwise */
    } rule;

After a rule structure is created for a pair of itemsets, the structure is then stored in the list of all rules to be "operated" on:

    typedef struct {
        int count;
        rule L[MAXRULELIST];
    } rulelist;

The rulelist is implemented as an array with a maximum size of MAXRULELIST. For now, MAXRULELIST is set to 300 due to memory restrictions on some of the machines CAP was tested on.

After the paired itemset file is parsed and the rulelist is created, the rulelist is passed to a function findRuleSupport, which finds support for the antecedent or consequent by counting from the transaction database(s). For each transaction, each {antecedent, consequent} pair is tested to see if the antecedent, consequent, or antecedent ∪ consequent is a subset of the current transaction. The choice of what sets to find support for is dependent on the metric chosen. The support count is incremented in the appropriate fields (supportS, supportT and/or support) if the corresponding set is a subset of the transaction.

After supports are found, the rule structure is passed to the corresponding function (i.e. findConfidence, findCorrelation, or findDependence), which uses the support values to calculate a relationship's significance value in terms of the metric chosen. The function then compares the calculated significance to the minimum threshold value supplied by the user. If the calculated significance is greater than or equal to the threshold, the itemset pair is printed out along with its significance value; otherwise, the pair is not displayed.
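Before looking at the individual metrics, here is a minimal sketch of the counting performed by a findRuleSupport-style function. The Rule structure here is a trimmed-down stand-in for the rule record above, and for simplicity all three counters are updated regardless of metric; the real algorithm only counts what the chosen metric needs.

    #include <stdio.h>

    typedef struct {
        const int *S;  int nS;    /* antecedent, sorted */
        const int *T;  int nT;    /* consequent, sorted */
        int supportS, supportT, support;   /* counters to fill in */
    } Rule;

    /* Sorted-merge subset test, as in candidate counting. */
    static int isSubset(const int *a, int na, const int *t, int nt)
    {
        int i = 0, j = 0;
        while (i < na && j < nt) {
            if      (a[i] == t[j]) { i++; j++; }
            else if (a[i] >  t[j]) j++;
            else return 0;
        }
        return i == na;
    }

    /* One transaction's contribution to a rule's counters. */
    void countTransaction(Rule *r, const int *trans, int nt)
    {
        int inS = isSubset(r->S, r->nS, trans, nt);
        int inT = isSubset(r->T, r->nT, trans, nt);
        if (inS)        r->supportS++;
        if (inT)        r->supportT++;
        if (inS && inT) r->support++;      /* support of S union T */
    }

    int main(void)
    {
        int S[] = {1, 2}, T[] = {3};
        int t1[] = {1, 2, 3, 9}, t2[] = {1, 2};
        Rule r = { S, 2, T, 1, 0, 0, 0 };
        countTransaction(&r, t1, 4);
        countTransaction(&r, t2, 2);
        printf("supportS=%d supportT=%d supportST=%d\n",
               r.supportS, r.supportT, r.support);   /* 2 1 1 */
        return 0;
    }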
3.6.1 Confidence

In the prototype, confidence is calculated as the conditional probability P(T | S) = P(S ∪ T)/P(S). In simpler terms, the formula calculates the probability of the consequent and antecedent sets appearing together in a transaction divided by the probability of the antecedent occurring in a transaction. During transaction file scanning, the findRuleSupport function serves to find the support of S ∪ T. The support for S does not need to be counted because it can be extracted from the paired itemset file created by the GUI. Confidence is only significant if S ∪ T is frequent; therefore, the support of S ∪ T must be checked to be frequent before confidence can be calculated on an itemset pair. A minimum support value for S ∪ T is passed to the system when confidence is selected as the relationship metric. If the confidence calculated is greater than or equal to the confidence threshold and the support of the union of the antecedent and consequent is greater than the minimum support value, the itemset pair is displayed along with its confidence value in the output.

3.6.2 Correlation

Some researchers believe that correlation is a better measure of association than confidence [9]. Given this, the prototype allows the user to choose correlation as a relationship metric instead of confidence. We believe that giving the user this choice makes the new exploratory model more flexible, redefining the definition of relationship in association mining to be any metric one sees fit to apply to possible rules.

Correlation is a measure of how closely related a set of variables are. We can extend this theory to association rules as well. More specifically, if we consider an association rule S ⇒ T to be a pair of variables S and T, then for each transaction, S can be true and/or T can be true; by true we mean that S or T is a subset of the current transaction, and false otherwise. We can then proceed to create a contingency table for each {S, T} pair. A contingency table is a table listing the number of occurrences where S is true, T is true, S is not true, T is not true, and combinations of S and T being true and not true. Table 3.3 shows the structure of contingency tables used in this implementation for candidate pairs.

                     T is true    T is false    Sum of Row
    S is true        s1t1         s1t0          s1
    S is false       s0t1         s0t0          s0
    Sum of Column    t1           t0            n

    Table 3.3: Contingency Table for Association Rules

For each {S, T} pair, we have to count the support for s1t1, s1t0 and s0t1 during transaction reading, and store the results in the support, supportS and supportT fields respectively in each pair's rule structure. This is the reason why, for correlation calculation, we do not care for support values for S regardless of whether T is in the transaction, and likewise for T. The support values provided by the paired itemset file in the GUI would only indicate the support for S or T being true, but not the support of S when T is not true, or T when S is not true, which is what we require. We need the support value of s0t0 in order to complete the contingency table; instead of counting for s0t0, we can just calculate it by subtracting s1t1, s0t1, and s1t0 from the total number of transactions in the database, which is represented by n in the contingency table.

With contingency table values for each itemset pair, we can measure the significance of correlation of the antecedent and consequent by calculating the contingency table's chi-squared value. More details on contingency tables, the chi-squared formula and how they are applicable to association rules can be found in [16]. The formula used in this implementation to calculate the chi-squared (χ²) value of an {S, T} pair is:

Formula 1

    χ² = (s1t1 - (s1 × t1/n))² / (s1 × t1/n) + (s1t0 - (s1 × t0/n))² / (s1 × t0/n)
       + (s0t1 - (s0 × t1/n))² / (s0 × t1/n) + (s0t0 - (s0 × t0/n))² / (s0 × t0/n)

Each pair's chi-squared value is then compared to the threshold. If the calculated figure is greater than the threshold, the pair along with its chi-squared value is displayed.
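Formula 1 translates directly into code. The sketch below computes the chi-squared value from the three counted cells and n, deriving s0t0 as described above; it is a minimal illustration (assuming all expected counts are nonzero), not the prototype's findCorrelation function.

    #include <stdio.h>

    /* Chi-squared value of the 2x2 contingency table of Table 3.3,
       given the observed counts and the number of transactions n. */
    double chiSquared(double s1t1, double s1t0, double s0t1, double n)
    {
        double s0t0 = n - s1t1 - s1t0 - s0t1;   /* derived, as in the text */
        double s1 = s1t1 + s1t0, s0 = s0t1 + s0t0;   /* row sums */
        double t1 = s1t1 + s0t1, t0 = s1t0 + s0t0;   /* column sums */
        double e11 = s1 * t1 / n, e10 = s1 * t0 / n; /* expected counts */
        double e01 = s0 * t1 / n, e00 = s0 * t0 / n;
        double d11 = s1t1 - e11, d10 = s1t0 - e10;
        double d01 = s0t1 - e01, d00 = s0t0 - e00;
        return d11*d11/e11 + d10*d10/e10 + d01*d01/e01 + d00*d00/e00;
    }

    int main(void)
    {
        /* e.g. 10000 transactions, S and T co-occur in 1200 of them */
        printf("chi2 = %f\n", chiSquared(1200, 800, 900, 10000));
        return 0;
    }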
3.6.3 Dependence

The authors of [9] suggest that dependence could also be an appropriate metric for finding the degree of association between two sets; therefore, we decided to include this option in our relationship calculation module as well. Dependence is a measure of how dependent one variable is on another. The measure of dependence between two variables is calculated by the formula P(S ∪ T)/(P(S) × P(T)). A value greater than 1 indicates a dependency between the two variables. Again, like correlation, we can extend this theory to association rules. For each {S, T} pair, by finding the supports of S ∪ T, S, and T, we can calculate the pair's dependency ratio. However, given that we already calculated the supports of S and T in Phase I, we can input these supports directly from the GUI through the paired itemset file. As a result, we only need to count for the support of S ∪ T. After this is done, we check that the calculated value is greater than or equal to the user-supplied threshold; if it is, the itemset pair and its dependency ratio are displayed.
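A minimal sketch of the dependence calculation, assuming the supports of S and T come from the paired itemset file and the support of S ∪ T has been counted:

    #include <stdio.h>

    /* Dependence ratio P(S u T) / (P(S) * P(T)), computed from raw
       support counts and the total number of transactions n. */
    double dependence(int supportST, int supportS, int supportT, int n)
    {
        double pST = (double)supportST / n;
        double pS  = (double)supportS  / n;
        double pT  = (double)supportT  / n;
        return pST / (pS * pT);     /* > 1 suggests S and T are dependent */
    }

    int main(void)
    {
        /* supports of S and T taken from Phase I; S u T counted */
        printf("ratio = %f\n", dependence(1200, 2501, 3502, 10000));
        return 0;
    }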
3.7 Mining Between Sets of Different Domains

Of the data mining programs implemented to date, the majority can only mine associations between sets of the same type. In fact, even in the original implementation of the CAP algorithm, all the data structures created assume that the items being mined are item IDs. However, one of the most unique and prominent features of our prototype is its ability to mine associations between sets of different types, e.g. TYPE ⇒ ITEMID. Consequently, this functionality was added after the algorithms for Phase I and Phase II were implemented.

The first question asked was, "How do we implement mining on TYPE as opposed to ITEMID?" To solve this problem, a number of functions were created to enumerate the domain of these non-ITEMID fields into enumerated IDs. For example, if there were twelve different TYPEs in the item database, then the CAP algorithm would enumerate the types and assign them IDs from 1 to 12. The constraint files passed to CAP tell the system what fields are going to be mined. The field currently being mined is stored in a global variable, class. If class is ITEMID, then no enumeration is required. If class is TYPE, PRICE or ITEMNAME, the domain of the field values is enumerated and stored in an array enumarray with the structure {enumerated ID, field value}. Suppose we were mining on the TYPE attribute with a domain of five different types; then our enumarray would look something like the following table.

    Enumerated ID    Type
    1                cheese
    2                chocolate
    3                juice
    4                soup
    5                soda

The enumarray is kept global to enable efficient access by the query and filter functions. These functions were also appropriately modified to handle querying and filtering on enumerated IDs. Overall, it was not difficult to extend the implementation to handle non-ID fields.

One small problem which did come about was the inefficiency of translating the ITEMID transaction database into enumerated databases. For example, consider the following transaction, given in ITEMIDs:

    {1 3 5 7}

If we were mining on TYPE instead of ITEMID, then, when reading the transaction database, we have to change each item's ITEMID into its corresponding enumerated TYPE ID. So, first we have to find each item's TYPE:

    ITEMID    TYPE
    1         cheese
    3         soda
    5         cheese
    7         juice

Then we have to find the enumerated ID of each item's type in the enumarray:

    TYPE         ENUMERATED ID
    cheese       1
    chocolate    2
    juice        3
    soup         4
    soda         5

Finally, we can translate the transaction {1 3 5 7} into its enumerated counterpart {1 5 1 3}. As one can see, having to do this for all transactions during the counting process is both tedious and inefficient. Instead, we could generate an enumerated transaction file on the fly in CAP/Apriori+ before any counting is done; unfortunately, because we never know how large any one transaction file is, enumerated files created while running CAP can end up hogging valuable memory space. Therefore, it was decided that enumerated databases translated from the primary ITEMID transaction file be made ahead of time and stored on disk. When it comes time to call CAP/Apriori+ and read in an enumerated transaction, the corresponding transaction file is read instead of the ITEMID transaction file. The enumerated transaction files which have been created are price10.bdat, price20.bdat, type10.bdat and type20.bdat. The first few letters of each filename correspond to the attribute the transactions are enumerated on. The number in the filename indicates the number of transactions (in thousands) that are in the file. To ease implementation, it was decided to write enumerated IDs, as opposed to the actual field values, into the paired itemset files for Phase II. The enumeration is performed in the GUI before calling the relationship calculation algorithm.

The second problem we encountered was the succinct generation of candidate sets for sets whose domain comes from attributes which are not unique. Because many of the lemmas and solutions to problems regarding succinct constraints assume that the domains of the sets come from unique fields (i.e. ITEMID), it was difficult to extend these lemmas to non-distinct attributes. In such cases, the succinct constraints in question are handled as non-succinct non-antimonotone constraints.
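The translation described earlier in this section reduces to a table lookup. In the sketch below, the toy array plays the role of an item-to-TYPE query combined with the enumarray, and reproduces the worked example above; it is an illustration, not the prototype's actual translation code.

    #include <stdio.h>

    #define NITEMS 8   /* toy database size; the real one has 40 items */

    /* typeOf[i] = enumerated TYPE ID of item i (index 0 unused). */
    static const int typeOf[NITEMS] = {0, 1, 0, 5, 0, 1, 0, 3};

    /* Translate an ITEMID transaction into enumerated TYPE IDs. */
    void translate(const int *in, int n, int *out)
    {
        for (int i = 0; i < n; i++)
            out[i] = typeOf[in[i]];
    }

    int main(void)
    {
        int trans[] = {1, 3, 5, 7}, out[4];
        translate(trans, 4, out);
        for (int i = 0; i < 4; i++)
            printf("%d ", out[i]);     /* prints: 1 5 1 3 */
        printf("\n");
        return 0;
    }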
3.8 Running in the Classical Model

Another powerful feature of the CAP prototype is its ability to run in both the classical and the exploratory models. By default, the system runs under the exploratory model, calling CAP in Phase I and the relationship calculation algorithm in Phase II. However, the CAP/Apriori+ algorithms have also been implemented to perform frequent set and relationship calculation together in one execution. We call this "no-breakpoint" mode. In no-breakpoint mode, the program finds all frequent sets for the antecedent and the consequent, pairs up all the antecedent sets with all the consequent sets, and performs relationship calculations on the pairs. The result is a list of frequent and significant {antecedent, consequent} sets satisfying one-variable and two-variable constraints.

3.9 Testing

After the algorithms for CAP and Apriori+ were implemented, both versions had to be tested thoroughly for correctness. Since testing all combinations of constraints was not possible, the author chose a smaller comprehensive set of test cases to verify correctness. Not surprisingly, the prototype did indeed run faster with CAP than with Apriori+.

There were two transaction databases used in the testing, one with 10,000 transactions and one with 20,000 transactions. There were also only about 50 distinct records in the transaction database; the rest of the database cycled and repeated these transactions. We decided against using real (non-cyclical) databases to test our prototype because it would have been extremely difficult to verify the results of our tests.

3.9.1 Testing Individual Constraints

Each type of succinct, antimonotone, succinct antimonotone and non-succinct non-antimonotone constraint was first tested individually in Apriori+ and CAP. The results were proven correct by manual verification, and by comparing CAP and Apriori+ results for the same constraints, since the output of both algorithms should be the same. It was discovered that CAP was usually faster, but the actual speedup depended on the type of constraint. Succinct antimonotone constraints seemed to induce the most pruning in CAP, antimonotone constraints were second in line, succinct constraints were third and non-antimonotone non-succinct constraints were last. One interesting point observed was that succinct constraints were sometimes only a fraction faster in CAP than they were in Apriori+. We believe that this is probably due to some of the extra processing required when handling succinct constraints, such as checking for repeated generation of candidate sets. Sometimes the results swayed from the order given above, mainly due to different constraint conditions such as the number of items in a comparison set, the strength of pruning of a constraint, the support of the antecedent and consequent, etc. Because of this, it is difficult to provide exact figures of speedup, since all performance values vary with respect to the constraint in consideration.

3.9.2 Testing Multiple Constraints

Next we had to test the correctness of CAP with multiple constraints. The first type of multiple constraint testing was testing multiple constraints of the same type, i.e. testing only succinct constraints, or testing only antimonotone constraints. In this case, multiple succinct antimonotone constraints significantly improved performance time in CAP. Multiple antimonotone constraints were second in performance increase, and multiple succinct constraints were third.

It was more difficult to assess the performance of CAP over Apriori+ with multiple constraints of different types. In general, CAP was much faster than Apriori+ when there were no succinct constraints. When there were succinct constraints, CAP was sometimes as much as eight times faster than Apriori+; at other times, it was only 0.2 of a second faster. As a result, it is very difficult to state, in exact numbers, how much more efficient CAP is than Apriori+ in the implementation. This is because there are so many different constraints and so many different combinations of constraints that any sort of comparison is difficult to make. However, it can be said that, in general, CAP is definitely faster than Apriori+; if the author had to state by how much, the answer could range anywhere from 0 to 80 times faster. More detailed statistics regarding the performance of CAP over Apriori+ can be found in [20].

3.10 Calling CAP and Apriori+

For reference, we include this section to describe how the implemented CAP and Apriori+ algorithms are run. CAP and Apriori+ are called with the command line:

    capapr progname Ssup Tsup Scfile Tcfile STcfile Strans Ttrans

Here, capapr is the name of the executable.
The first argument, progname, is either "cap" or "apriori" and tells the capapr program to execute the corresponding algorithm. The second and third arguments are the minimum supports of the antecedent and consequent; the minimum support here is given as a percentage. The fourth, fifth and sixth arguments, Scfile, Tcfile, and STcfile, correspond to the files containing constraints for the antecedent, consequent and two-variable sets. The last two arguments are the names of the transaction database files for the antecedent and consequent. For example, the command line:

    capapr apriori 3 5 s.cst t.cst st.cst id10.bdat type10.bdat

would imply that we are running Apriori+ to find antecedent sets with a minimum support of 3% on the domain of ITEMID, and consequent sets with a minimum support of 5% on the domain of TYPE. For a detailed account of how Apriori+ and CAP handle mining on non-ITEMID fields, the reader can refer to Section 3.7. In the above, the constraints for the antecedent, consequent and two-variable sets are located in the files s.cst, t.cst and st.cst respectively. To find only consequent set answers, users input "NoS" as the Scfile argument. Similarly, to find only antecedent set answers, "NoT" would be passed as the Tcfile parameter.

Chapter 4

Support for User Interaction

The goal of achieving human-centred exploration is at the core of the two-phase architecture, and this is accomplished in the system through our user interface, which guides the user through the process of two-phase association mining and allows one to interactively perform a variety of exploratory tasks on the data generated and the data to be mined. Interestingly enough, the task of designing the prototype's interface was a difficult yet fun one. Since our goal is to promote user interaction and usability, the system had to be easy and obvious enough to use for the beginner. In particular, our prototype provides extreme ease of use in the basic functions of constrained frequent set mining, such as the creation, modification, and deletion of constraints. If a set of answers is found to be inadequate, queries can be edited in a matter of seconds, and the phases reiterated to find the interesting data sets.

User interaction is very prominent throughout the prototype. There are a variety of automated selection helpers which interact with the user to help one select specific candidate pairs to find relationships on. In addition, there are other features such as sorting and graphing which assist the user in displaying and visualizing candidates in more appropriate and intuitive ways.
The following sections describe some of the aspects of the prototype which contribute to and exhibit the new concepts of human-centred exploration and control in the association mining process.

4.1 Using the Two-Phase Architecture

The two-phase architecture is naturally inherent in the prototype. One of the reasons for this is that Phase I of the interface was implemented first, and then Phase II was implemented afterwards. Therefore, since Phase I was implemented as one functional unit, it was not hard to impose the two-phase architecture into the system when Phase II was designed.

Our prototype fully supports the two-phase architecture by making it very easy to reiterate through the two phases. In Phase I, after constraints are created and the mining algorithm is executed, the antecedent/consequent sets and two-variable pairs are displayed in the prototype. At this stage, users can choose to modify the view of the sets and then move on to Phase II, or they can close the display containing the generated sets and redefine or modify their constraints. This latter action would be reiteration of Phase I. Should they choose to move on to Phase II in the prototype, they would then select the sets they want to find relationships on, specify the relationship metric and thresholds and then run the corresponding algorithm. The associated pairs will be subsequently displayed in a separate window. At this point in the system, the user can choose to stop and save the associations, or can easily reiterate through Phase II again by closing the window and reselecting sets or redefining relationship variables.

4.2 Constraint Handling

The management of constraints was the first issue handled when considering how the user would interact with the prototype in Phase I. Since constraints are at the core of constrained frequent set mining in our prototype, the system had to make
This led us to design the main window of the prototype like Figure 4.1, with the buttons "Antecedent Constraints", "Consequent Constraints" and "Two-Variable Constraints" displayed on the left side of the window, and a textbox displaying the current constraints on the right side of the window. This textbox is called a Constraints Display window. When a user presses on the "Antecedent Con-straints" or "Consequent Constraints" button, he can enter antecedent or consequent constraint insertion mode where he can specify single variable constraints. When a user presses on the "Two-Variable Constraints" button, he enters two-variable constraint insertion mode where he can create two variable constraints. 64 ' CAP : demo, cap File Edit Insert View ra Antecedent Constraints... Consequent Constraints.. Two-Variable Constraints.. Run Algorithm.. Support(SO) = S * SO i s a set of values from a t t r i b u t e TYPE' ,max (SO. PRICE) i s < TO. PRICE SO.ITEMID i n t e r s e c t i o n TO.ITEMID i s = {} SO i s subset of TO Support(TO) = 5 % TO i s a set of values from a t t r i b u t e TYPE max(SO.PRICE) i s > 3 SO.TYPE i s superset of {chips, snacks) I I M I I B f i M f M g l f l l l l f f Opened file demo .cap MODE: CAP with Breakpoints Figure 4.1: CAP Main Window 4.2.1 Crea t ing Constraints If we refer back to Figure 2.3, when inserting single variable antecedent or consequent constraints; the user should be able to create: • frequency constraints • class constraints • domain constraints, and • aggregate constraints Consequently, both the Antecedent Cons t ra in ts Creat ion Dia log and Consequent 65 Constraints Creation Dialog windows should allow one to create any of these constraints. In addition, it was noted that one of the most important guidelines in UI design is that users should only have access to what they want and need to see. Therefore, given that a user can specify so many different types of constraints in each of the antecedent and consequent constraint insertion modes, it was decided that the best way to have the user interact with the Constraints Creation Dialogs is through notebook widgets. A widget is an element of a GUI, like a scrollbar or a textbox. A notebook widget is a widget which resembles a notebook containing tabs separating different sections. The Antecedent Constraints Creation Dialog il-lustrated in Figure 4.2 is implemented with a notebook widget. The dialog box opens when the user clicks on the "Antecedent Constraints" or "Consequent Constraints" buttons on the main window. Insert Antecedent Constiaints : demo cap Select the values for the aggregate constraint and click Add to add: Aggregate Variable Attribute Not • Operator Value Add Clear Edit II Close | Constraints» Figure 4.2: Antecedent Constraints Creation Window The figure above shows the Aggregate tab of the Antecedent Constraints Creation Dialog box displayed. In this tab, one can select variables, aggregates, attributes, and criteria from the drop-down boxes in the window to make a constraint. The constraint is then added to the current query by clicking the "Add" button. How-66 1 ever, before users are able to create any constraints at all, they have to first cre-ate set variables for the antecedent and consequent in the V a r i a b l e tab of the Antecedent Cons t ra in ts Crea t ion Dia log . Each variable can then have its own-support value, domain, and constraints. Creating constraints in this way is both easy and efficient. 
4.2.2 Modifying, Copying, Reordering and Deleting Constraints

Another important goal of the prototype is to allow for reiteration of the phases. That is, when a certain set of constraints does not return interesting rules, the set should be changed. Therefore, facilities had to be created to allow users to easily modify, copy, reorder and delete constraints.

After constraints are added, they cannot be changed inside any of the Constraints Display windows. The underlying CAP algorithm expects them to be in a certain format; consequently, it was a design decision to prevent users from modifying constraints erroneously. Instead, specialized commands have been created allowing users to modify, copy, reorder and delete constraints. The user can interact with these commands through the Edit Popup Menu, which is mapped by right-clicking inside any Constraints Display window or clicking on the "Edit" button inside Constraint Creation Dialog boxes. These commands are also listed under the "Edit" menu of the main window. Before modifying, copying or deleting, users first have to select the constraint to edit by clicking on it in any Constraints Display window. This highlights it and makes it current. Figure 4.3 shows the accessibility of the popup menu containing the editing commands.

Figure 4.3: Edit Popup Menu

We wanted to make the modification module as intuitive as possible, and as consistent as possible with how constraints were created. This led the author to decide that when a constraint is modified, a Modify Constraint window will pop up which is almost identical to the window in which the constraint was originally created. This of course differs depending on the type of constraint being edited. It is then almost trivial for the user to change a few values to modify the constraint. In Figure 4.4, we show the Modify Constraint window opened to modify the two-variable set constraint SO.TYPE is subset of TO.TYPE.

Figure 4.4: Modify Constraint Window

The fields are already filled in once the window is opened. To change the constraint to SO.TYPE is superset of TO.TYPE, all we have to do is click inside the "Constraint" field, select "superset" from the drop-down box, and click "OK" to replace the old constraint with the new one. The newly modified constraint will then immediately appear at the top of all Constraints Display windows.
During a preliminary evaluation of the interactivity in our system, it was discovered that users should also be provided with a command to "copy" constraints. The command would not be used literally to add a copied constraint, since that would be illegal. Instead, it allows users to retrieve the values from an existing constraint, change some of the fields, and then add the newly modified version. This type of "copying" can be extremely useful when, for example, we are creating a handful of aggregate constraints differing only in the aggregate field, or when we have to specify the same support value for multiple variables. To ensure consistency again, the Copy Constraint dialog box was modelled after the Modify Constraint window. Therefore, when a constraint is highlighted and the copy command is invoked, a Copy Constraint window opens resembling the window the constraint was originally created in. A Copy Constraint window showing the constraint count(TO.PRICE) < 5 is shown in Figure 4.5.

Figure 4.5: Copy Constraint Window

If we wanted to create the constraint max(TO.PRICE) < 5, all we would have to do is change the aggregate from count to max in the Copy Constraint window and click the "Add" button.

In the current design of the system, constraints appear at the top of the display windows when they are added or modified. However, we have to account for cases where users may want to change this order. Consequently, the prototype provides such a command, allowing users to easily shift the order of constraints in the display windows.

Besides being able to modify, copy and reorder, users may also want to delete constraints which have narrowed down the focus in their answer set a little too much. The CAP prototype provides this functionality through the "Delete" or "Delete All..." commands in the "Edit" menus. When "Delete All" is selected, the system asks whether or not the user really wants to proceed with the deletion of all constraints. The system has been designed to always double check with the user before executing any potentially disastrous tasks.

In the constraint management modules, we interact with only the information we want to see. For example, when we want to create a domain constraint, we open up a constraint creation dialog box and click on the domain constraint tab. We do not have to interact with modules for the creation of other types of constraints! Likewise, for the modification and copying of constraints, we only see the information relevant to the constraint itself. Our system therefore makes it very difficult for users to make mistakes of any kind; in addition, the consistency of the interface makes it simpler for users to jump from one command to another without having to learn new ways of doing things. We believe this - the ease with which constraint handling is designed and presented to the user - will be the key to our prototype's success.

4.3 Displaying Candidate Sets

After all constraints and options have been defined, a user can click on the "Run Algorithm" button on the main window to run CAP. The GUI will then ask the user to select the transaction database(s) to mine on. If the domains of the antecedent and consequent are the same, the GUI only asks the user to supply one database.
If the domains are different, the GUI asks the user to first supply a transaction database for the antecedent, and then one for the consequent. Theoretically, the user should only ever have to declare one transaction database. For example, if the domain of the antecedent was ITEMID and the domain of the consequent was TYPE, the underlying algorithm should be able to automatically convert the transaction database of IDs for the antecedent to TYPEs for the consequent. However, for efficiency's sake, the current implementation of the CAP algorithm needs to know all transaction file names, because it assumes that the transaction databases for non-ITEMID types are converted ahead of time by another program. We also could have asked for the names of the transaction databases when the .cap file was created, thereby eliminating the need to ask for names before each execution of the algorithm; however, the prototype also has to function in the case where users want to perform mining with the same constraints on many different transaction databases. Consequently, database names are required each time the user wants to run CAP.

The next issue faced was, "How can we efficiently and intuitively display candidate sets?" It was decided that in Phase I, after a user has created constraints and executed the mining algorithm, the prototype should display candidate sets as they are being generated. One of the goals of the author's research is to create a prototype giving the user good feedback. Having the prototype display candidate sets only after they have all been generated gives the user no feedback on what kind of information is being found. By allowing the user to see candidate sets as they are being generated, we give the user the option of quitting the execution and redefining the constraints if the sets found so far are not interesting.

4.3.1 The Candidate Set Window

There are three different types of output generated for any execution of the CAP or Apriori+ algorithms: the antecedent sets, the consequent sets, and the candidate pairs created from the antecedent and consequent. We decided to provide the user with as much feedback as possible; therefore, we output all three types of answers in the prototype, even though users technically only have to interact with the output containing the two-variable pairs, since relationships are only calculated on candidate pairs. However, since constraints can be applied on the antecedent and/or the consequent, we want to show the user how the constraints have affected their data or narrowed down the focus of the mining; allowing users to explore their data in this way is, of course, one of the main features of the prototype.

As the algorithm is running, one should be able to see the constrained antecedent sets, consequent sets and two-variable pairs. To represent such feedback in the prototype, the Candidate Sets window is implemented with a notebook widget. The first tab in the window is the "1-Var" tab containing the one-variable antecedent and consequent frequent sets. The antecedent frequent sets are displayed on the left side of the Candidate Sets window, and the consequent sets are displayed on the right. The "2-Var" tab contains the pairs of sets formed by joining the one-variable antecedent and consequent sets. Now, note that if two-variable constraints were applied to the (S,T) pairs, the output in the "1-Var" tab will contain the projection of the original frequent sets. We can explain this with a simple example. Suppose there were antecedent frequent sets {1}, {2} and consequent frequent sets {3}, {4}. If there were no two-variable constraints, the resulting two-variable sets would be {{1},{3}}, {{1},{4}}, {{2},{3}}, {{2},{4}}. On the other hand, suppose there was a two-variable constraint stating that the sum of the items in the antecedent set plus the sum of the items in the consequent set must be less than 5. The two-variable output then becomes the one pair {{1},{3}}, and the projected one-variable answers will be {1} for the antecedent and {3} for the consequent. We need to project one-variable sets in the prototype to ensure consistency between the one-variable and two-variable answers.
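The filtering and projection step can be sketched in a few lines of Tcl. This is a simplified illustration, not the prototype's code: sets are plain Tcl lists, and the two-variable constraint is passed in as an [apply] predicate.

```tcl
# Enumerate (S,T) pairs, keep those satisfying the constraint, and
# project the surviving one-variable sets for the "1-Var" tab.
proc projectPairs {antSets consSets constraint} {
    set pairs {}; set projAnt {}; set projCons {}
    foreach S $antSets {
        foreach T $consSets {
            if {[apply $constraint $S $T]} {
                lappend pairs [list $S $T]
                if {$S ni $projAnt}  {lappend projAnt $S}
                if {$T ni $projCons} {lappend projCons $T}
            }
        }
    }
    list $pairs $projAnt $projCons
}

# The example above: sum(S) + sum(T) must be less than 5.
set c {{S T} {expr {[tcl::mathop::+ {*}$S {*}$T] < 5}}}
puts [projectPairs {{1} {2}} {{3} {4}} $c]
;# -> {{1 3}} 1 3   (pairs, projected antecedents, projected consequents)
```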
Figure 4.6 shows frequent sets in the Candidate Sets window.

Figure 4.6: Candidate Sets Window

In the Candidate Sets window, there is also a progress bar which advances slowly as execution progresses. The progress bar does not track how much longer the algorithm will take to complete; instead, it is an indication that the algorithm is still running and has not stalled. Therefore, after the meter in the progress bar reaches the end, it begins again and continues to increment until the algorithm is finished executing. It might seem silly, then, to have a progress bar at all, seeing that it does not provide the user with any quantitative information; however, what it does do is tell the user that the program is still running, and for an application that can take hours to complete, any indication that the algorithm has not stalled is a good sign.

4.3.2 Sorting Candidate Sets

To encourage user exploration and interaction, the prototype enables users to rearrange their data in different ways. The purpose of having breakpoints in the new framework is to permit the user to selectively choose which pairs of frequent sets he/she wants to find relationships on. Not all candidate pairs may be important enough to justify the extra processing time. Thus, there needs to be a facility allowing one to rearrange or sort frequent sets and pairs, making it easier to view and choose the specific pairs of interest. Users may often want to select the pairs with the ten highest frequencies, or the pairs which are "closest" in distance to a certain constraint. Based on these requirements, the following has been done. The prototype allows a user to sort frequent sets based on the set's frequency or the set's distance from a certain constraint. One can also choose to view all sets or only maximal sets in ascending or descending sort order, or by the count of the number of items in the set. The window for specifying sort constraints is shown in Figure 4.7.
Figure 4.7: Sort Candidate Sets Window

The design of the Sort Candidate Sets window is modelled after the standard sort dialog boxes used in most commercial applications. One can specify a maximum of three sort constraints for each of the 1-variable antecedent, 1-variable consequent, and 2-variable answer sets. The Sort Candidate Sets dialog box has four sections; sort constraints can be selected in the top three. One of the more popular sort criteria included is frequency. The other sort criteria that can be selected are constraint distances. Since our prototype is focused around the central notion of constraints, there should be an easy way for the user to view how "close" a frequent set or pair is to a particular constraint. This is what is meant by constraint distance.

The notion of distance is different for each type of constraint. For a one-variable domain constraint, distance is measured by the number of items in the set. For a two-variable set constraint, distance is the difference between the number of items in the antecedent and the number of items in the consequent. Table 4.1 lists the distance value used in the prototype for each type of constraint; S and T represent sets.

Constraint Type    Distance Value
1-Var Domain       |S|
1-Var Aggregate    agg(S), where agg is the aggregate in the constraint
2-Var Set          abs(|S| - |T|), where abs is the absolute value function
2-Var Relational   |S o T|, where o is the operator (union or intersection) in the constraint
2-Var Aggregate    the greater of aggS(S) - aggT(T) and aggT(T) - aggS(S), where aggS is the aggregate for S and aggT is the aggregate for T; if either S or T does not have an aggregate, apply max to the set if all values in the set are less than the aggregate on the other set, otherwise apply min

Table 4.1: Constraint Distance Calculation
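As a rough illustration of how these distances could be computed, here is a small Tcl dispatcher following Table 4.1. Sets are modelled as Tcl lists and aggregate values are assumed to be computed elsewhere; the procedure and argument names are invented for this sketch.

```tcl
proc distance {type S T {aggS ""} {aggT ""}} {
    switch -- $type {
        1varDomain     {return [llength $S]}
        1varAggregate  {return $aggS}        ;# agg(S), precomputed
        2varSet        {return [expr {abs([llength $S] - [llength $T])}]}
        2varRelational {
            # |S op T| shown for op = intersection; union is analogous.
            set n 0
            foreach x $S {if {$x in $T} {incr n}}
            return $n
        }
        2varAggregate  {return [expr {max($aggS - $aggT, $aggT - $aggS)}]}
    }
}

puts [distance 2varSet {chips snacks soda} {chips}]   ;# prints 2
```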
One of the biggest problems faced while designing Phase I was the question of how to intuitively represent constraint distance criteria in the sort windows. At first, the actual constraints were used as the criteria; however, this confused users, since it was not immediately apparent what it meant to sort by, e.g., count(TO.TYPE) is < 5. We then changed the prototype to show as criteria how the distances were calculated, instead of the constraints themselves. While this solution represents the criteria more intuitively, the mapping between constraints and constraint distances will not be as obvious to the novice user. To assist the user in making this mapping, the current set of constraints can be opened in a small pop-up window at the click of the "Query" button on the Candidate Sets window. This was implemented so that users could more easily see how the constraint distance sort criteria map to the constraints.

The sets in Figure 4.8 are sorted by increasing frequency. The values of the sort criteria on the sets are always shown in the left column of the output windows.

Figure 4.8: Sets sorted by Frequency

The actual implementation of the sort takes each set (or pair of sets in the 2-Var case) and calculates the set's sort value as determined by the sort criteria (i.e., frequency or count). The set's "location" in the display, along with its calculated sort value, is stored in a list; the list is then sorted by Tcl according to the increasing/decreasing order chosen by the user. After the list is sorted, each set's new location in the output is its position in the newly sorted list.
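In other words, the sort is a decorate-sort-undecorate pass over the display. A minimal sketch of the idea, with invented procedure names, might look as follows; the actual prototype code is more involved, handling up to three criteria and a direction per criterion.

```tcl
proc sortSets {sets valueCmd direction} {
    # Decorate each set with its computed sort value ...
    set decorated {}
    foreach s $sets {
        lappend decorated [list [{*}$valueCmd $s] $s]
    }
    # ... let Tcl's lsort order the (value, set) pairs ...
    set opt [expr {$direction eq "Decreasing" ? "-decreasing" : "-increasing"}]
    set sorted [lsort -integer -index 0 $opt $decorated]
    # ... and undecorate: a set's position here is its new location.
    set result {}
    foreach pair $sorted {lappend result [lindex $pair 1]}
    return $result
}

# Sort by cardinality (count of items in the set), largest first:
puts [sortSets {{a b} {a} {a b c}} llength Decreasing]
;# -> {a b c} {a b} a
```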
4.3.3 Viewing Candidate Sets

Besides allowing one to sort data, the prototype also encourages exploration and interaction by giving users the ability to change the way they view the data. This is important because users may not always want to view sets only by their sorted or generated order. For example, a supermarket manager who applies data mining to his transaction database may only want to know the maximal associations generated by the program. In this case, our prototype can fully assist the manager by displaying only the maximal sets of the output. In addition, the prototype can also display sets by increasing or decreasing cardinality, and combine the two different views; that is, maximal sets sorted by frequency can be displayed in order of decreasing cardinality. Such functionality is important to exploratory association mining because it allows one to explore and interact with the data at different levels and view it in more abstract or detailed ways.

In the prototype, toggling between showing all sets and showing only maximal sets can be achieved at the click of a button by changing the selection inside the "Show" box of the Sort Frequent Sets window. Likewise, all a user has to do to change the order in which sets are viewed is to select a different viewing option (such as "Increasing Level" or "Sort Order") from the "View" box of the Sort Frequent Sets window. Figure 4.9 shows only the maximal sets of Figure 4.8, and Figure 4.10 shows the antecedent sets of Figure 4.8 sorted by increasing level and the consequent sets sorted by decreasing level.

Figure 4.9: Maximal sets sorted by frequency

Figure 4.10: Sets viewed by level

4.3.4 Graphing Sorted Sets

Graphical visualization is one of the most effective methods to further enhance the attractiveness and analytical capabilities of any program. Most would choose Microsoft Windows any day over text-based interaction with UNIX. This is the exact idea which prompted the need to augment the prototype's exploratory capabilities with graphical visualization.

In particular, the prototype is capable of visually representing the results of sort criteria on sets and pairs through barcharts. Charts can be created from the sort results of the antecedent, the consequent, or the two-variable pairs. Figure 4.11 shows the graph of the antecedent sets, sorted first by Frequency and then by the constraint distance criterion count(SO.TYPE). The number of each frequent set is plotted on the x-axis of the graph, and the sort values of the frequent sets are plotted as bars extending up the y-axis. In addition, users can click on any of the bars of a frequent set to display the set's contents. A numbered list of all the frequent sets and their sort values can also be brought up by clicking on the "Value" button at the bottom of any chart.

Figure 4.11: Chart of sort results for two criteria

Graphs produced by the prototype are useful for many reasons. The most important is probably the fact that viewing data in a chart is easier than viewing it in text. In a demonstration of this prototype to some researchers, all of them agreed that the graphical representation of the sort values was more intuitive and easier to view than the textual representation. To further enhance the visualization capabilities of the graphing package, the author added facilities allowing users to activate or deactivate the bars representing the sorted values, and to activate only sets of a specific level.
This idea came from reading about certain visualization techniques used to manage code in large software programs [10]. In the prototype, these visualization techniques can be applied by clicking on the "Edit" button at the bottom of any chart and going to the "Activate" tab of the Edit Chart window. The window is shown in Figure 4.12.

Figure 4.12: Edit Chart Window

The settings shown will activate only the one-item sets in Figure 4.11 and will deactivate the sort criterion 1 values (left bars) for all sets in the graph; what the user will see activated (shown in Figure 4.13) are the remaining sort values on the one-item sets. This type of "filtering" is necessary when users want to easily pick out which sets are closest to a constraint, or which sets have relatively high values in all their sort criteria. Enabling users to interact with the sorted sets graphically makes the analysis task much simpler than only allowing them to view the data textually.

Figure 4.13: Results of the activation/deactivation technique

4.4 Relationships

Designing how the user would interact with the system in Phase II involved quite a few steps. The first step was deciding how to make candidate set selection easier for the user. An execution of CAP or Apriori+ can output tens of thousands of candidates; it is therefore unthinkable to have the user manually search through all the sets in order to find the specific ones of interest. Allowing the user to sort the output makes the search easier. However, we still need to provide general selection functions to the user. For example, suppose we want to select all two-variable pairs of sets whose antecedent has only two items. The prototype should be designed with a means of allowing one to easily make this selection. After considerable thought, an appropriate design was constructed, giving the prototype all the functionality necessary to automate the selection process for users.

4.4.1 Choosing Interesting Sets

One of the main features of the prototype is that it can significantly narrow down focus during the calculation of relationships by allowing the user to select only those pairs of sets for which they want to find associations. The design of this "selection" process took some time, but eventually a solution became clear. After sets are generated by the mining algorithm, they are displayed in the Candidate Sets window, where their views can be changed and they can be sorted or graphed. The functionality of the prototype at this stage allows the user to modify the view of the sets before they are selected, making the subsequent selection process easier. When a user is finished rearranging the candidate pairs, he then has to enter "Selection Mode" in order to start the selection process. This mode is entered by clicking on the "Selection Mode" button in the Candidate Sets window. This "opens" the Candidate Sets Selection window, which is technically just the two-variable tab of the Candidate Sets dialog, with the exception that users can now interact with the sets by selecting them, and the buttons on the right panel now activate selection functions.
Only two-variable sets are shown in Selection Mode because relationships are calculated on pairs of sets. Figure 4.14 displays the Candidate Sets Selection window.

Figure 4.14: Candidate Sets Selection Window

Manually selecting a set is accomplished by clicking anywhere inside the line representing the set; clicking an already highlighted set deselects it. A selected set is highlighted in blue; this separates sets selected for Phase II from sets ordinarily highlighted with the touch of a mouse. Users can also select more than one pair at a time by clicking on the first pair in the selection and then dragging the mouse down to the last pair. Any sets which were already selected at the time will be deselected; any pairs not selected will be selected. One can always "undo" the last selection/deselection by clicking the "Undo Last" button on the window, or clear the whole selection by clicking "Clear Selection". The number of pairs highlighted is always displayed in the status bar; however, due to large memory requirements, the current version of CAP is hardcoded to work with at most 300 pairs of sets, so when more than 300 pairs are selected, the prototype reports an error.

The Selection Helper

Since there can be as many as tens of thousands of candidate pairs, the selection task can sometimes get tedious if done manually. Therefore, to make the job easier for the user, the prototype provides a Selection Helper which can automate the selection process. With the Selection Helper, users can specify frequency, maximality, cardinality, attribute or numerical criteria to automatically highlight only those sets they are interested in, saving them the trouble of manually searching through all sets.

The Selection Helper can be accessed by clicking on the "Select Helper" button in the Candidate Sets Selection screen. This pops open a notebook-style dialog box with the sections "Frequency", "Maximality", "Cardinality", "Set Attribute", "Numerical Attribute" and "Constraint Distance". Inside each of these windows, users can specify criteria which will either add all pairs satisfying the criteria to the selection, or filter the current selection down to the valid pairs. For example, Figure 4.15 shows the Maximality Selection window.

Figure 4.15: Maximality Selection Window

Currently, the Maximal S and T Sets option is selected. If a user clicks on the "Add to current selection" button, then all pairs containing both a maximal antecedent set and a maximal consequent set are highlighted in the Candidate Sets Selection window.
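The two buttons suggest simple set semantics for combining a criterion with the current selection: "Add to current selection" unions in every pair satisfying the criterion (a disjunction), while "Filter current selection" keeps only the already-selected pairs that satisfy it (a conjunction). A sketch of that logic follows; the procedure names and predicate interface are assumptions, not the prototype's actual code.

```tcl
# "Add": union of the selection with all pairs satisfying pred.
proc addToSelection {selection allPairs pred} {
    foreach p $allPairs {
        if {[{*}$pred $p] && $p ni $selection} {lappend selection $p}
    }
    return $selection
}

# "Filter": intersection; selected pairs failing pred are deselected.
proc filterSelection {selection pred} {
    set kept {}
    foreach p $selection {
        if {[{*}$pred $p]} {lappend kept $p}
    }
    return $kept
}

# e.g. keep only pairs whose antecedent has exactly two items:
# set sel [filterSelection $sel [list apply {{p} {
#     expr {[llength [lindex $p 0]] == 2}
# }}]]
```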
In Figure 4.16, the Constraint Distance Selection window is shown. The window specifies the selection of all pairs of sets whose minimum price of items in the consequent, minus the maximum price of items in the antecedent, is less than 5. If we clicked on "Filter current selection", then only those pairs within the current selection satisfying the constraint would remain highlighted; the pairs in the current selection not satisfying the constraint are deselected.

Figure 4.16: Constraint Distance Selection Window

With the "Add" and "Filter" commands in the selection helper, any disjunction or conjunction of selection constraints can be created. Given certain constraints in mind, this feature of the prototype allows users to easily narrow down the number of pairs to find relationships on, without the hassle of manually finding the sets satisfying the constraints. The prominence of user interaction in the process of selecting candidate pairs is a prime example of the prototype's support for human-centred exploration.

4.4.2 Selecting Different Metrics

In the classical model, relationships were only measured through confidence. Our prototype does better than this, and gives the user a choice of three different relationship metrics instead of just one, as proposed in the exploratory model. This is desirable because different metrics will return different association rules, and sometimes users may want to find all rules using all metrics, or reinforce the strength of a rule by testing it with another metric. Therefore, in the prototype, relationships can be calculated using confidence, correlation (chi-squared value) or dependence. In addition, the user can very easily interact with the prototype to choose suitable metrics and supply threshold values for the metrics chosen. Many revisions were made to the interface for selecting relationship metrics before we finally arrived at the design in Figure 4.17.

Figure 4.17: Specify Relationship Window

The Specify Relationship window can be opened by clicking on the "Phase 2 >>" button at the bottom of the Candidate Sets Selection window or by clicking "Edit...Specify Relationship" under the main menu. The default relationship metric is "confidence", with a minimum threshold of 60% and a minimum support of 0%. When running under "no breakpoints" mode (the classical model), users have to open the Specify Relationship window from the main menu before running CAP if they want to change the relationship metric and threshold.

In the Specify Relationship window, one can define relationships using confidence, correlation or dependence by clicking on the corresponding radio button. This type of interface makes it immediately intuitive how the user should supply the threshold values, which are entered on the same line. Clicking "OK" in the Specify Relationship window opened from the main menu saves the relationship settings into the system. Clicking "OK" in the dialog box when it is opened from the Candidate Sets Selection window immediately runs the algorithm to find the related pairs of sets.
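For reference, the three metrics can be computed from raw support counts roughly as follows. The sketch uses the standard textbook definitions (n transactions in total, nS and nT containing the antecedent and consequent respectively, nST containing both); the prototype's exact formulas are not reproduced here and may differ.

```tcl
proc confidence {nS nST} {
    expr {double($nST) / $nS}
}

proc dependence {n nS nT nST} {
    # observed joint probability over that expected under independence
    expr {double($nST) * $n / ($nS * double($nT))}
}

proc chiSquared {n nS nT nST} {
    # chi-squared over the 2x2 contingency table: sum (obs-exp)^2/exp
    set chi 0.0
    foreach obs [list $nST [expr {$nS - $nST}] \
                      [expr {$nT - $nST}] [expr {$n - $nS - $nT + $nST}]] \
            exp [list [expr {double($nS) * $nT / $n}] \
                      [expr {double($nS) * ($n - $nT) / $n}] \
                      [expr {double($n - $nS) * $nT / $n}] \
                      [expr {double($n - $nS) * ($n - $nT) / $n}]] {
        set chi [expr {$chi + ($obs - $exp) ** 2 / $exp}]
    }
    return $chi
}

puts [format %.3f [confidence 30 14]]   ;# 14 of 30 -> 0.467
```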
When the algorithm is finished executing, the Related Sets window will pop up, displaying the related antecedent/consequent pairs (rules) and their significance values. This window is shown in Figure 4.18.

Figure 4.18: Related Sets Window

In the Related Sets window, the user can save the association rules into a text file by clicking on the "Save..." button. One can also strengthen or relax relationships by reiterating through Phase II again, redefining the relationship variables. This is easily accomplished by closing the Related Sets window and opening the Specify Relationship dialog box again.

4.4.3 Mining between sets of different types

One of the faults of the classical model was the rigid notion of relationship. The model did not account for cases where users may want to mine between sets of different types. Our prototype fixes this problem by allowing users to create class constraints which specify what domains the antecedent and consequent come from. These domains can be of the same or of different types; for example, we can use the class constraint window in Figure 4.19 to specify that the antecedent set (represented by the set variable SO) is in the domain of the TYPE attribute.

Figure 4.19: Class Constraint Creation Window

Later, we can specify with the same window that the consequent set is in the domain of the ITEMID attribute. The prototype is fully capable of handling this type of "hetero-domain" mining, as are the underlying algorithms.

4.5 Selecting Different Algorithm Options

There are two options the user can set in the prototype which change how the underlying mining algorithm runs. The options are found in the Options dialog box. The window, shown in Figure 4.20, is opened by selecting "View...Options" from the main menu.

Figure 4.20: Options Window

4.5.1 The Classical vs. The New Model

The prototype displays all the improved characteristics of the new model proposed in [17]. However, a well-rounded prototype should be able to run in the new two-phase exploratory model as well as the classical model (where there are no breakpoints between Phase I and Phase II). The CAP prototype is capable of doing just this, and the option can be selected by checking the "Suppress Breakpoints" checkbox in the Options window.

If this box is selected, there will be no breakpoints in the execution. The CAP or Apriori+ algorithms will run and find all frequent sets. The prototype will then assume that the user wants to find relationships between all pairs of antecedent and consequent frequent sets, and proceed to execute the Phase II algorithm. The metrics and thresholds for Phase II have to be selected ahead of time by supplying them in the Specify Relationship window, which can be opened from the main menu under "Edit...Specify Relationship".
If the "Suppress Breakpoints" checkbox is not selected, it means that the CAP or Apriori+ algorithms will find all frequent sets and stop. The user will then be able to manually select only those pairs he wants to find relationships on, or redefine his constraints if appropriate. The decision to offer this first option is made based on the discussion in [17]. The system should give the user the choice to run under the classical Apriori framework, where the concept of "breakpoints" is hot included, Or the option to use the new framework, which encourages exploration and gives the user more control. The prototype can do both. 4.5.2 A p r i o r i + vs. C A P The second option displayed in the Options window is to run CAP or to run Apriori"1". If the "Run Apriori" checkbox is selected, the Apriori"1"algorithm will be executed. If the checkbox is deselected, the CAP algorithm will run. Ideally, this option need not exist since CAP is faster than Apriori"1" in most cases, thereby making it inefficient to ever choose execution with Apriori"1". Apriori"1" was originally included to interactively test the correctness and efficiency of CAP through the sys-tem; however, given that this project is a prototype used to compare the classical model with the new exploratory model, including Apriori+ in the comparison and as one of the features is completely justified. In the system, any combination of the two types of options is a mode. The default mode runs CAP with breakpoints. Since users can only adjust and see the current options through the Options window, it was decided to display the current mode at all times in the bottom right hand frame of the main window. 91 4.6 Database Management The prototype provides many functions for the user to explore and interact with data generated from the mining algorithm. However, to fully promote exploration, the prototype should also allow the user to view the contents of the item and transaction databases. 4.6.1 I tem Database Issues One of the most important issues encountered during the design of the user interac-tion aspect of the system was to decide on the format of the databases that would hook up to the prototype, and how the user would specify these databases. Would they be Access, dBase, Oracle databases, or just simple datafiles? We wanted our system to be as general as possible, so we did not want to restrict it to work with only one type of database. Contrary to this goal, we were not able to explore the idea of implementing compatibility with all types of databases due to time constraints. Therefore, simple text files were used as "databases", and querying was done from these text files. The item database, dbfile.sql is semi-hardcoded into the CAP GUI and algorithm. The format of a record in this database is: ITEMID ITEMNAME T Y P E PRICE An item with ID 5, name "Doritos", type "chips" and price 3 would consequently have the following record: 5 Doritos chips 3 After deciding on the formats of the item database, next came the question of how to import the metadata of the item database into the GUI. It is necessary for the prototype to provide metadata to both the GUI and the underlying CAP algorithm. This is a form of good user feedback, and the GUI also needs this information to know the correct fieldnames to put into the "Attribute" lists in the constraint 92 creation dialog boxes and to assist in error handling. The CAP algorithm itself needs the metadata information to perform mining on the correct fields. 
In [17], it was suggested to create a general wrapper which would be able to take in any database and supply the GUI with the necessary data. However, some of the issues encountered when designing this wrapper were:

• support for different types of databases (Oracle, Sybase, Access)
• support for different query languages
• support for different types of connections between the GUI and database

Nevertheless, a wrapper has been implemented, although it does not contain all the generality originally proposed. The wrapper implemented is basically a text file parser. It parses SQL files containing the CREATE TABLE statements which define the item database, retrieves table names, attribute names and datatypes, and passes the information on to the CAP GUI. Querying an item database (e.g., to find S.Type or T.Price) is done by a simple Tcl script which does a linear search through the file.
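The two pieces - metadata parsing and record lookup - might look roughly like this in Tcl. This is a hedged sketch, not the prototype's parser: it assumes each CREATE TABLE statement ends with ");" and that records are whitespace-separated, as in the dbfile.sql format shown earlier.

```tcl
# Pull table names and column definitions out of the SQL file.
proc parseMetadata {sqlFile} {
    set f [open $sqlFile]; set sql [read $f]; close $f
    set meta {}
    foreach {full table cols} [regexp -all -inline -nocase \
            {CREATE\s+TABLE\s+(\w+)\s*\((.*?)\)\s*;} $sql] {
        foreach col [split $cols ,] {
            # first word: attribute name; rest: its datatype
            lappend meta $table [string trim $col]
        }
    }
    return $meta
}

# Linear search of the item database: e.g. the PRICE (field 3) of
# the first record whose ITEMID (field 0) matches.
proc itemPrice {dbFile id} {
    set f [open $dbFile]
    while {[gets $f line] >= 0} {
        if {[lindex $line 0] == $id} {close $f; return [lindex $line 3]}
    }
    close $f
    return ""
}
```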
At the beginning of each CAP session, the user has to import the item database's metadata into the program. This is done by clicking on the "Open Database..." button in the "File" menu of the main window. When this button is pressed, a dialog box opens, prompting the user to enter or choose an SQL file containing the metadata information (the parser specifically searches for CREATE TABLE statements in this file). After the file is chosen and the "Open Database" dialog box closed, the View Metadata window opens, displaying the attributes and datatypes of all tables in the database. This window is shown in Figure 4.21.

Figure 4.21: View Metadata Window

Users can click on the name of a relation in the left frame of the View Metadata window to display the relation's attributes in the right listbox. Furthermore, the primary key for the relation is always shown centred at the bottom of the window. When users are finished viewing the metadata, they can close the window to begin their CAP session. However, since one might need to look at the metadata again during the mining process, the View Metadata window can be opened at any time during a session by choosing "View...Metadata" from the main menu, or by clicking on the View Metadata button on the button bar.

During the preliminary evaluation of the GUI, one comment given was that users may want to view not only the metadata but also the actual records in the item database. This is consistent with exploration; therefore, the prototype also contains a command which displays these contents in a pop-up window when the "View...Relation" command from the main menu is invoked.

4.6.2 Viewing the Transaction Database

The GUI and underlying CAP algorithm assume that a transaction in a transaction database has the format:

# Items in transaction   Transaction ID   Item1 Item2 ... Itemn

Transaction databases are stored in binary files to save space. Unfortunately, because the files are in binary, users cannot simply open them to view the transactions inside. Such functionality is definitely required in a data mining program; consequently, the prototype enables users to view the contents of transaction databases through the View Transaction File window, which can be opened by clicking "View...Transaction File" in the main menu.
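Reading such a record back is a natural fit for Tcl's [binary scan]. The sketch below assumes 32-bit integers throughout; the actual on-disk layout of the .bdat files is not documented here, so the field widths and file name are assumptions.

```tcl
proc readTransaction {chan} {
    set head [read $chan 8]
    if {[string length $head] < 8} {return {}}   ;# end of file
    binary scan $head ii count tid               ;# item count, transaction ID
    binary scan [read $chan [expr {4 * $count}]] i$count items
    return [list $tid $items]
}

set f [open id10.bdat rb]
while {[llength [set t [readTransaction $f]]] > 0} {
    lassign $t tid items
    puts "$tid  $items"                          ;# mirrors the window's output
}
close $f
```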
This window is shown in Figure 4.22.

Figure 4.22: View Transaction File Window

The first line in the output tells the user where the transaction database is located on the file system. The rest of the output displays the transactions in the format:

Transaction ID   Transaction

Ideally, transactions should only consist of item IDs. When the domain of the antecedent and consequent sets is of a type other than ITEMID, CAP should look to the transaction database of IDs and query the item database to find frequent sets of the other domain. However, due to the efficiency issues of this conversion, it was decided to convert the transaction database of ITEMIDs into the corresponding enumerated transaction files of TYPEs, PRICEs, ITEMNAMEs, etc. ahead of time. For example, the transaction database "id10.bdat" is a file of 10000 transactions where each transaction is made up of item IDs. The transaction database "type10.bdat" is also a file of 10000 transactions, but each transaction is made up of the (enumerated) types of the ITEMIDs in "id10.bdat". This is discussed in more detail in Section 3.7.

At the end of the output in the View Transaction File window is the total number of transactions in the file. If the number of transactions shown in the output is different from the actual number of transactions in the database, it is assumed that the ones shown are cyclically repeated until the count of transactions equals the displayed value. The decision not to output all instances of repeating transactions was made on the grounds that doing so would be both unnecessary and confusing to the user. Transaction databases generated to test the CAP GUI contain cyclically repeated transactions; real transaction databases do not.

4.7 File Operations

In addition to all the features mentioned so far, the prototype also provides a file management module. This module has made the prototype even more flexible, since users now have the ability to try out different settings and/or combinations of constraints in sessions, and then save these sessions into files to be opened again at a later time. This saves users the effort of having to redefine their constraints each time they open the program. The author feels that this is an important aspect integrated with the new association mining framework which was not mentioned in [17].

In the prototype, users can easily create, open, save and print constraint files. Whenever a new database is imported, a new file is automatically created. In this file, a user can then proceed to add new constraints and set different options, such as running Apriori+ or running CAP. One can later save all the constraints and options of the session by selecting "File...Save" from the main menu. The constraints and settings will then be saved in a file with the extension .cap. The "New", "Open", "Save", "Print" and "Close" buttons are located under "File" in the main menu and on the button bar for convenience.

Figure 4.23: Saving a File in CAP

A .cap file is saved as a text file. This file contains variable instantiation commands which are sourced by the Tcl/Tk interpreter when the file is opened. Therefore, .cap files only contain the information necessary to initiate a CAP session; they do not contain Phase I or Phase II output. We decided against storing output because there could be tens of thousands of frequent sets output in a single execution of CAP, and automatically saving all the frequent sets into the .cap file would result in overwhelming file sizes. There are, however, options for the user to save output from both Phase I and Phase II into normal text files, but these actions are executed only at the user's request.
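A .cap file is therefore just a Tcl script. Its contents might look something like the fragment below; the variable names are invented for illustration, since the prototype's internal names are not documented here.

```tcl
# demo.cap -- sourced by the Tcl/Tk interpreter to restore a session
set mode              "CAP with Breakpoints"
set database          dbfile.sql
set antVariables      {SO}
set consVariables     {TO}
set antConstraints {
    {Support(SO) = 5%}
    {SO is a set of values from attribute TYPE}
}
set twoVarConstraints {
    {max(SO.PRICE) is < TO.PRICE}
    {SO.ITEMID intersection TO.ITEMID is = {}}
}
```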
4.8 Other Features

Besides all the functionality described above, the prototype is also enhanced with other features which support user interaction and usability. These features are discussed in the following sections.

4.8.1 Display Options

Fonts in the Display

The prototype's GUI was implemented with Tcl/Tk to work across all platforms. Unfortunately, fonts look different on different operating systems. The author once ran one version of the prototype on a Windows machine, and later on a machine running UNIX. On the Windows machine the fonts were oversized, while on the UNIX machine one could barely read the text in the prototype. Consequently, to enhance both the usability and flexibility of the prototype, a font selection mechanism was created, enabling users to choose fonts and sizes for each type of widget. This is easily done by opening the Font Selection window under "View...Font Selection" from the main menu, which pops up the dialog box shown in Figure 4.24. The top of the window contains a dropdown box listing the names of the widgets whose fonts can be changed. The listboxes in the middle of the window contain font names and font sizes.

Figure 4.24: Font Selection Window

To view the font name and size of a particular widget, the user clicks on the widget name in the "Widget" drop-down box; the font name and font size of the widget then appear in the entry boxes in the left and right frames. To change the widget's font name and/or size, one chooses a different font name and/or size from the corresponding listboxes, and clicks "OK" to apply the new font and pitch size to all widgets of the chosen type.

Chart Display Options

When a chart is produced by the prototype, it is displayed in a default set of colors and fonts. The system allows one to easily change these colors and fonts through the "Edit Chart" window shown in Figure 4.12. In this window, users can change the color of the bars, background, axes, legend, tick marks, values and headings. In addition, they can also change the font and text of all labels on the chart. The flexibility of the prototype is definitely enhanced by allowing users to change the look of their charts, since these charts are at the core of data analysis and can be saved as PostScript files to be printed out and analysed at a later time.

4.8.2 Error Checking

The prototype also includes a considerable amount of error checking, which is crucial to increasing the usability of any program. Users working with a program for the first time can become severely frustrated if they keep making mistakes that the system does not catch, since the result would be nasty, incomprehensible messages or system crashes. Given this, a fair amount of error checking has been implemented in the prototype, and a number of design decisions have taken the possibility of user errors into account.

It was decided that the constraints in the Constraints Display windows should be uneditable by the user, because constraints have to be in a certain format when passed to the underlying CAP program. It seemed all too easy for users to disrupt this format by changing a bracket here or a comma there. As a result, users are forbidden from manually editing constraints in Constraints Display windows. Instead, the modify, copy and delete commands described earlier help users change their constraints while keeping them in the correct format for the system.

When reading in the metadata from a particular database, the GUI extracts and stores the type information of all the attributes. The type information is later used to verify that the corresponding attribute is valid for the constraints the user is creating. For example, suppose the user wants to specify the aggregate constraint sum(SO.TYPE) is > 5. In this case, the GUI will respond with the error message given in Figure 4.25: "Cannot apply aggregation to non-number field".

Figure 4.25: Error Window

The system realizes that the TYPE attribute is a VARCHAR, and not a number. Since the sum aggregate can only be applied to numbers, the system returns an error. The only aggregate that can be applied to both string and number-typed attributes is the count aggregate. We believe that this type checking is crucial in the prototype, because users can frequently make these kinds of mistakes, and not catching them will just frustrate users and discourage them from using the system.

Another example of error checking lies in the Aggregate tab of the Constraint Creation windows. In the Aggregate window, the program does not allow the user to enter anything except numbers into the "Value" field, since aggregation is purely a numerical operation. Error checking of this type is called "gagging", where the system prevents the user from continuing upon encountering an error. This type of error checking is very effective because it lets the user know immediately that an error has occurred.

There are several other instances of error checking in the prototype. However, the author will refrain from describing every occurrence, and instead encourages the reader to experience the usability of our system for himself.
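As an illustration, the type check just described could be implemented along these lines; the procedure name is invented, and the real prototype's handling of datatypes may differ.

```tcl
package require Tk

proc checkAggregate {agg attrType} {
    # Only count may be applied to non-numeric (e.g. varchar) fields.
    if {$agg ne "count" && ![string match -nocase "int*" $attrType]} {
        tk_messageBox -icon error -title "Error" \
            -message "Cannot apply aggregation to non-number field"
        return 0
    }
    return 1
}

checkAggregate sum varchar(20)     ;# pops the error box, returns 0
checkAggregate count varchar(20)   ;# fine, returns 1
```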
This of course is not very efficient; and as a result, it was decided to include a Const ra in ts Disp lay window which could be easily mapped and unmapped inside the constraint creation dialog boxes. Pushing the "Constraints <<" button in Figure 4.26 will unmap the Const ra in ts Disp lay window shown beneath the Va r i ab l e tab. The text inside the "Constraints <<" button will then change to "Constraints >>", indicating that pressing the but-ton now will expand the dialog box, mapping the window again and allowing the user to view all the constraints in the current session. 4.9 Interface Implementat ion Issues 4.9.1 P rog ramming Language One key issue that arose in the first phase of the project was "What language should be used to create the user interface of the prototype?" We wanted to use a language that was easy to program with and portable to different platforms; therefore, we decided on Tcl/Tk. Programming in Tcl/Tk is very simple, and the resulting code can be used on Unix, Windows and Macintosh systems. With some modifications, Tcl/Tk code can also be run on the web, with any browser configured with the Tcl/Tk plugin. There are also many helper applications written for the toolkit, such 102 Insert Consequent Constraints : demo.cap iyVariable^§lass^l|[equencyyRomain^ • . • ml Select the corresponding variable and click Add to add: Hew Variables: T2 T3 L T4 WAV 'Mi T5 m Add » « Remove « Remove All Constraints m max(SO.PRICE) i s < TO.PRICK SO.ITEMID i n t e r s e c t i o n TO.ITEMID i s = {} SO i s subset of TO Support(TO) = S % TO i s a set of va lues from a t t r i b u t e TYPE max(SO.PRICE) i s > 3 SO.TYPE i s superset of { ch ips , snacks) Support(SO) = S % f l-l J _ _ _ £ 1 . . . > . k . . J l . . . k . T T r n T7 Aggtegate constraint added j j Edit Close « Constraints Figure 4.26: Constraints Viewer as tcl2c, which creates binary executables from Tcl/Tk code in order to prevent end users from accessing the source. With the ambition of having our prototype go commercial, using a portable toolkit with all these characteristics was a very viable solution. 4.9.2 T c l / T k problems Unfortunately, the use of Tcl/Tk introduced a number of problems during imple-mentation. In the UNIX version of Tcl/Tk, all the functions used to implement the GUI work perfectly. However, in Windows, some of the same commands are not working as well. For example, to track algorithm execution, the progress bar uses 103 the "fileevent" command in Tcl/Tk to update its display. The progress bar works perfectly in the UNIX system; on the other hand, in Windows, because of a bug in the "fileevent" command, the progress bar does not remap properly. While the best thing to do is to wait for the next release of the scripting language (hoping by then that Scriptics will have fixed the bug), recompiling Tcl/Tk for Windows under a special patch can temporarily solve the problem. Another problem encountered was selection in text boxes. In UNIX, when a text box is marked as uneditable, a user is still able to perform selections within the text box, and is able to see the selection. In Windows, the system can tell when a selection has occurred, but does not display the selection. Currently, this has been fixed by manually implementing a selection function. Besides these few problems however, Tcl/Tk has worked very well and deserves much mention for the speed and efficiency with which the prototype was produced. 
4.9 Interface Implementation Issues

4.9.1 Programming Language

One key issue that arose in the first phase of the project was which language should be used to create the user interface of the prototype. We wanted a language that was easy to program with and portable to different platforms; we therefore decided on Tcl/Tk. Programming in Tcl/Tk is very simple, and the resulting code can be used on Unix, Windows and Macintosh systems. With some modifications, Tcl/Tk code can also be run on the web, in any browser configured with the Tcl/Tk plugin. There are also many helper applications written for the toolkit, such as tcl2c, which creates binary executables from Tcl/Tk code in order to prevent end users from accessing the source. With the ambition of having our prototype go commercial, using a portable toolkit with all these characteristics was a very viable solution.

4.9.2 Tcl/Tk Problems

Unfortunately, the use of Tcl/Tk introduced a number of problems during implementation. In the UNIX version of Tcl/Tk, all the functions used to implement the GUI work perfectly; in Windows, however, some of the same commands do not work as well. For example, to track algorithm execution, the progress bar uses the "fileevent" command in Tcl/Tk to update its display. The progress bar works perfectly on the UNIX system; on Windows, because of a bug in the "fileevent" command, the progress bar does not remap properly. While the best option is to wait for the next release of the scripting language (in the hope that Scriptics will have fixed the bug by then), recompiling Tcl/Tk for Windows under a special patch temporarily solves the problem.
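The sketch below shows the general shape of this fileevent-based progress tracking: the miner is started as a pipeline, and a handler runs whenever it produces output, so the GUI stays responsive during mining. The cap command name, the "PROGRESS <n>" line format and the updateProgressBar helper are all assumptions made for the example, not the prototype's actual interface.

    # Handle one line of miner output without blocking the interface.
    proc readProgress {pipe} {
        if {[gets $pipe line] < 0} {
            if {[eof $pipe]} { close $pipe }   ;# miner finished
            return
        }
        # assume the algorithm periodically prints lines like "PROGRESS 42"
        if {[scan $line "PROGRESS %d" pct] == 1} {
            updateProgressBar $pct   ;# hypothetical helper that redraws the bar
        }
    }

    set pipe [open "|cap dbfile.sql" r]
    fconfigure $pipe -blocking 0              ;# never let gets stall the GUI
    fileevent $pipe readable [list readProgress $pipe]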
Another problem encountered was selection in text boxes. On UNIX, when a text box is marked as uneditable, a user is still able to perform selections within the text box and to see the selection. On Windows, the system can tell when a selection has occurred, but does not display it. Currently, this has been fixed by manually implementing a selection function.
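A minimal sketch of such a workaround follows, assuming the constraints are shown in a read-only text widget named .display (the widget name and bindings are illustrative, not the prototype's actual code). The idea is to track the mouse ourselves and re-apply the built-in sel tag explicitly, so the highlight is drawn even when Tk fails to draw it automatically.

    # Remember where the drag started...
    bind .display <ButtonPress-1> { %W mark set selstart [%W index @%x,%y] }

    # ...and re-apply the sel tag as the mouse moves, keeping the indices
    # in order when the user drags upwards.
    bind .display <B1-Motion> {
        set cur [%W index @%x,%y]
        %W tag remove sel 1.0 end
        if {[%W compare selstart < $cur]} {
            %W tag add sel selstart $cur
        } else {
            %W tag add sel $cur selstart
        }
    }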
Besides these few problems, however, Tcl/Tk has worked very well, and deserves much credit for the speed and efficiency with which the prototype was produced.

Chapter 5

Conclusion

Association mining has long been hard to control and understand, and it is precisely this that has prevented data mining programs from entering and surviving in the commercial market. Data mining programs will not be used if systems are inflexible and the framework does not allow for human-centred exploration and control. Most of the research prototypes created so far are modeled after the black box: transactions are passed into the system, minimum support and confidence values are supplied, and the program is put to work. Then, at the end of the processing, which can take anywhere from 10 seconds to a day, association rules are displayed. This lack of interaction with the user makes such a system unusable; it is no wonder that data mining, useful as it is in all realms of life, is not yet widely embraced by society.

In this work, the author has created the first fully functional prototype modelled after a new association mining framework. In the prototype, users have complete control of the mining process because the interface is modelled after a two-phase architecture which is fully backward compatible. The prototype completely supports exploration, and fixes the earlier problems of a "closed black box view of the system", a "lack of focus" and a "rigid notion of relationship" inherent in the classical association mining model. The fact that users can modify how they want to view frequent sets, and choose only interesting sets to find relationships on, is a demonstration of opening up the black box. Frequent sets can be sorted by different criteria, and their displays can be changed to show all sets or only maximal sets in increasing or decreasing order. Graphical visualization of the sort results has also been incorporated into the prototype to assist users in analyzing their data.

Furthermore, whereas classical Apriori tests all candidate pairs for significance, the CAP system checks for association only those pairs the user has selected as interesting. This significantly improves execution time in the prototype, because the system only has to find relationships on a narrow set of pairs.

The CAP prototype is also the only program to date which allows users to specify focus through constrained frequent set queries, and it incorporates the optimized CAP algorithm to handle the constrained frequent set mining. Other data mining programs, such as IBM's Intelligent Miner and Simon Fraser University's DBMiner, can impressively perform a variety of mining tasks, but they fail to solve the "lack of focus" problem that is at the core of classical association mining. Constraints created in our prototype can be used to significantly narrow down the search space and return only the frequent sets a user wants to see. In addition, constraints can always be refined to provide even greater pruning power, and we have made this refining easy by providing all the functions necessary to manage constraints effectively.

The usability of the CAP prototype is further enhanced by the fact that it allows users to specify different notions of relationship. Relationships can be specified in terms of confidence, correlation and dependence, and the system allows users to easily mine between sets coming from different domains. We can safely say that no other association mining prototype or program to date has achieved this kind of flexibility.

Our prototype has definitely proved its usability by providing all of the enhancements of the new association mining framework. It is fast, efficient, interactive and user-friendly; furthermore, it contains many useful additional features such as graphing, sorting, and several display and selection utilities. We therefore believe that novice users will find data mining with our prototype both easy and useful.

The work developed here is meant to be a preliminary demonstration of where data mining programs should be headed: specifically, towards the incorporation of more user interaction. It has been said that data mining may one day assist in finding the cure for AIDS; yet it is difficult for society to adopt the technology and make it useful under the classical framework. Microsoft was not successful until it introduced the Windows system, making computers interactive and usable by everyone. The same applies to data mining. We believe that the key to popularizing data mining is making the process understandable and encouraging human-centred exploration; our prototype is the first of its kind to show how this can be done. Hopefully, it will also soon become the first of a new generation of data mining programs aimed at expanding the data mining community to include not only researchers, but everyday entrepreneurs as well.

5.1 Future Work

The CAP system implemented here is only a prototype, so there is still much room for improvement. One area for future work is to create the general wrapper first described in [17]. This wrapper would be able to read metadata and records from any type of database and pass the data on to the system. To complement this generality, the CAP algorithm can also be modified to mine databases with any number of fields of any type; this would eliminate the current limitation of mining only from the dbfile.sql file.

Another future research topic is investigating how to handle multiple succinct constraints more efficiently in CAP. The current implementation of the CAP algorithm can perform optimizations on multiple succinct constraints whose MGFs, when joined, produce only one "required" set; the multiple MGF combination lemma was not general enough to allow us to do otherwise. Generalizing this lemma should eliminate the restriction and allow us to optimize run time in all cases where multiple succinct constraints exist.

The GUI is currently designed to be general enough to support multiple variables for the antecedent and the consequent; future work in this direction on the algorithm is also desirable. Another improvement that can be made to the system is to expand the charting tools, and possibly to research other visualization techniques that would be useful for constrained frequent set mining.

Single-variable constraints have antimonotonicity and succinctness properties which enable us to speed up the mining process. The classification of two-variable constraints has already been presented in [18]. The next logical step for future work is to modify both the GUI and the algorithm to take advantage of the pruning capabilities of two-variable constraints.

Bibliography

[1] R. Agrawal, T. Imielinski and A. Swami. Mining Association Rules between Sets of Items in Large Databases. In Proc. ACM SIGMOD Conference, pages 207-216, 1993.

[2] R. Agrawal, K. Lin, H. Sawhney and K. Shim. Fast Similarity Search in the Presence of Noise, Scaling, and Translation in Time-Series Databases. In Proc. 21st VLDB, pages 490-501, 1995.

[3] R. Agrawal and R. Srikant. Fast Algorithms for Mining Association Rules. In Proc. 20th VLDB, pages 487-499, 1994.

[4] R. Agrawal and R. Srikant. Mining Generalized Association Rules. In Proc. 21st VLDB, pages 407-419, 1995.

[5] R. Agrawal and R. Srikant. Mining Quantitative Association Rules in Large Relational Tables. In Proc. ACM SIGMOD Conference, pages 1-12, 1996.

[6] R. Agrawal and R. Srikant. Mining Sequential Patterns. In Proc. ICDE, pages 3-14, 1995.

[7] R. Agrawal and R. Srikant. Mining Sequential Patterns: Generalizations and Performance Improvements. In Proc. EDBT, pages 3-17, 1996.

[8] J. Bigus. Data Mining with Neural Networks, pages 23-41. McGraw-Hill, New York, NY, USA, 1996.

[9] S. Brin, R. Motwani and C. Silverstein. Beyond Market Basket: Generalizing Association Rules to Correlations. In Proc. ACM SIGMOD Conference, pages 265-276, 1997.

[10] S. Eick, J. Steffan and E. Sumner Jr. Seesoft, a Tool for Visualizing Line-Oriented Software Statistics. IEEE Transactions on Software Engineering, pages 957-968, 1992.

[11] M. Ester, H. Kriegel, J. Sander and X. Xu. A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise. In Proc. KDD, pages 226-231, 1996.

[12] E. Han and G. Karypis. Scalable Parallel Data Mining for Association Rules. In Proc. ACM SIGMOD Conference, pages 277-288, 1997.

[13] J. Han and Y. Fu. Discovery of Multiple-Level Association Rules from Large Databases. In Proc. 21st VLDB, pages 420-431, 1995.

[14] E. Knorr and R. Ng. Algorithms for Mining Distance-Based Outliers in Large Datasets. In Proc. 24th VLDB, pages 392-403, 1998.

[15] H. Mannila, H. Toivonen and A. Verkamo. Discovering Frequent Episodes in Sequences. In Proc. KDD, pages 210-215, 1995.

[16] A. Nakaya and S. Morishita. Fast Parallel Search for Correlated Association Rules. To be published, 1998.

[17] R. Ng, L. Lakshmanan, J. Han and A. Pang. Exploratory Mining and Pruning Optimizations of Constrained Association Rules. In Proc. ACM SIGMOD Conference, pages 13-29, 1998.

[18] R. Ng, L. Lakshmanan, J. Han and A. Pang. Optimizations with Two-Variable Constraints. In Proc. ACM SIGMOD Conference, pages 157-168, 1999.

[19] R. Ng and J. Han. Efficient and Effective Clustering Methods for Spatial Data Mining. In Proc. 20th VLDB, pages 144-155, 1994.

[20] A. Pang. Efficient Data Mining of Constrained Association Rules. Master's Thesis, University of British Columbia, 1998.

[21] T. Shintani and M. Kitsuregawa. Parallel Mining Algorithms for Generalized Association Rules with Classification Hierarchy. In Proc. ACM SIGMOD Conference, pages 25-36, 1998.