The discovery of Top-K DNA frequent patterns with approximate method / Nittaya Kerdprasop and Kittisak Kerdprasop
Top-k frequent pattern discovery is indeed an association analysis concerning automatic extraction of the k most correlated and interesting patterns from large databases. Current studies in association mining concentrate on how to effectively find all objects that are frequently co-occurring. Given...
Saved in:
Main Authors: | , |
---|---|
Format: | Book |
Published: |
Faculty of Computer and Mathematical Sciences,
2014.
|
Subjects: | |
Online Access: | Link Metadata |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
MARC
LEADER | 00000 am a22000003u 4500 | ||
---|---|---|---|
001 | repouitm_12418 | ||
042 | |a dc | ||
100 | 1 | 0 | |a Kerdprasop, Nittaya |e author |
700 | 1 | 0 | |a Kerdprasop, Kittisak |e author |
245 | 0 | 0 | |a The discovery of Top-K DNA frequent patterns with approximate method / Nittaya Kerdprasop and Kittisak Kerdprasop |
260 | |b Faculty of Computer and Mathematical Sciences, |c 2014. | ||
500 | |a https://ir.uitm.edu.my/id/eprint/12418/1/12418.pdf | ||
520 | |a Top-k frequent pattern discovery is indeed an association analysis concerning automatic extraction of the k most correlated and interesting patterns from large databases. Current studies in association mining concentrate on how to effectively find all objects that are frequently co-occurring. Given a set of objects with m features, there are almost 2m frequent patterns to consider. For DNA data that are normally very high in dimensionality, frequent pattern discovery from genetic data is obviously a computationally expensive problem. We therefore devise an approximate approach to tackle this problem. We propose an approximate method based on the window sliding concept to estimate data density and obtain data characteristics from a small set of samples. Then we draw a set of representatives with reservoir sampling technique. These representatives are subsequently used in the main process of frequent pattern mining. Our designed algorithm had been implemented with the Erlang language, which is the functional programming paradigm with inherent support for pattern matching. The experimental results confirm the efficiency and reliability of our approximate method. | ||
546 | |a en | ||
655 | 7 | |a Article |2 local | |
655 | 7 | |a PeerReviewed |2 local | |
787 | 0 | |n https://ir.uitm.edu.my/id/eprint/12418/ | |
787 | 0 | |n https://mjoc.uitm.edu.my/ | |
856 | 4 | 1 | |u https://ir.uitm.edu.my/id/eprint/12418/ |z Link Metadata |