- Library Home /
- Search Collections /
- Open Collections /
- Browse Collections /
- BIRS Workshop Lecture Videos /
- Investigation of Code Tables to compress and describe...
Open Collections
BIRS Workshop Lecture Videos
BIRS Workshop Lecture Videos
Investigation of Code Tables to compress and describe the underlying characteristics of binary databases Hess, Sibylle
Description
We inspect the spectrum of methods (from frequent pattern mining to numerical optimization) to extract the pattern set that describes a binary database best. Invoking the Minimum Description Length (MDL) principle, this objective can be stated as: find the code table that compresses the database most. A particularly interesting interpretation of this task, relating it to biclustering, arises from the formulation as a matrix factorisation problem. Biclustering has a variety of applications in research fields such as collaborative filtering, gene expression analysis and text mining. The derived matrix factorisation analogy provides a new perspective on distinct data mining subfields (unifying biclustering and pattern mining concepts such as Krimp), initialising a cross-over of their applications and interpretations of derived models.
Item Metadata
Title |
Investigation of Code Tables to compress and describe the underlying characteristics of binary databases
|
Creator | |
Publisher |
Banff International Research Station for Mathematical Innovation and Discovery
|
Date Issued |
2015-07-25T14:48
|
Description |
We inspect the spectrum of methods (from frequent pattern mining to numerical optimization) to extract the pattern set that describes a binary database best. Invoking the Minimum Description Length (MDL) principle, this objective can be stated as: find the code table that compresses the database most. A particularly interesting interpretation of this task, relating it to biclustering, arises from the formulation as a matrix factorisation problem. Biclustering has a variety of applications in research fields such as collaborative filtering, gene expression analysis and text mining. The derived matrix factorisation analogy provides a new perspective on distinct data mining subfields (unifying biclustering and pattern mining concepts such as Krimp), initialising a cross-over of their applications and interpretations of derived models.
|
Extent |
26 minutes
|
Subject | |
Type | |
File Format |
video/mp4
|
Language |
eng
|
Notes |
Author affiliation: TU Dortmund
|
Series | |
Date Available |
2016-03-08
|
Provider |
Vancouver : University of British Columbia Library
|
Rights |
Attribution-NonCommercial-NoDerivatives 4.0 International
|
DOI |
10.14288/1.0227981
|
URI | |
Affiliation | |
Peer Review Status |
Unreviewed
|
Scholarly Level |
Graduate
|
Rights URI | |
Aggregated Source Repository |
DSpace
|
Item Media
Item Citations and Data
Rights
Attribution-NonCommercial-NoDerivatives 4.0 International