Mining significant high utility gene regulation sequential patterns

BACKGROUND: Mining frequent gene regulation sequential patterns in time course microarray datasets is an important mining task in bioinformatics. Although finding such patterns are of paramount important for studying a disease, most existing work do not consider gene-disease association during gene regulation sequential pattern discovery. Moreover, they consider more absent/existence effects of genes during the mining process than taking the degrees of genes expression into account. Consequently, such techniques discover too many patterns which may not represent important information to biologists to investigate the relationships between the disease and underlying reasons hidden in gene regulation sequences.

RESULTS: We propose a utility model by considering both the gene-disease association score and their degrees of expression levels under a biological investigation. We propose an efficient method called Top-HUGS, for discoverying significant high utility gene regulation sequential patterns from a time-course microarray dataset.

CONCLUSIONS: In this study, the proposed methods were evaluated on a publicly available time course microarray dataset. The experimental results show higher accuracies compared to the baseline methods. Our proposed methods found that several new gene regulation sequential patterns involved in such patterns were useful for biologists and provided further insights into the mechanisms underpinning biological processes. To effectively work with the proposed method, a web interface is developed to our system using Java. To the best of our knowledge, this is the first demonstration for significant high utility gene regulation sequential pattern discovery.

Medienart:

E-Artikel

Erscheinungsjahr:

2017

Erschienen:

2017

Enthalten in:

Zur Gesamtaufnahme - volume:11

Enthalten in:

BMC systems biology - 11(2017), Suppl 6 vom: 14. Dez., Seite 109

Sprache:

Englisch

Beteiligte Personen:

Zihayat, Morteza [VerfasserIn]
Davoudi, Heidar [VerfasserIn]
An, Aijun [VerfasserIn]

Links:

Volltext

Themen:

Gene regulation sequential patterns
High utility pattern mining
Journal Article
Research Support, Non-U.S. Gov't
Time-course microarray datasets

Anmerkungen:

Date Completed 28.02.2019

Date Revised 28.02.2019

published: Electronic

Citation Status MEDLINE

doi:

10.1186/s12918-017-0475-4

funding:

Förderinstitution / Projekttitel:

PPN (Katalog-ID):

NLM279605463