TY - JOUR
T1 - Pathway-level disease data mining through hyper-box principles
AU - Yang, Lingjian
AU - Ainali, Chrysanthi
AU - Kittas, Aristotelis
AU - Nestle, Frank
AU - Papageorgiou, Lazaros G.
AU - Tsoka, Sophia
PY - 2015/2
Y1 - 2015/2
N2 - In microarray data analysis, traditional methods that focus on single genes are increasingly replaced by methods that analyse functional units corresponding to biochemical pathways, as these are considered to offer more insight into gene expression and disease associations. However, the development of robust pipelines to relate genotypic functional modules to disease phenotypes through known molecular interactions is still at its early stages.In this article we first discuss methodologies that employ groups of genes in disease classification tasks that aim to link gene expression patterns with disease outcome. Then we present a pathway-based approach for disease classification through a mathematical programming model based on hyper-box principles. Association rules derived from the model are extracted and discussed with respect to pathway-specific molecular patterns related to the disease. Overall, we argue that the use of gene sets corresponding to disease-relevant pathways is a promising route to uncover expression-to-phenotype relations in disease classification and we illustrate the potential of hyper-box classification in assessing the predictive power of functional pathways and uncover the effect of specific genes in the prediction of disease phenotypes.
AB - In microarray data analysis, traditional methods that focus on single genes are increasingly replaced by methods that analyse functional units corresponding to biochemical pathways, as these are considered to offer more insight into gene expression and disease associations. However, the development of robust pipelines to relate genotypic functional modules to disease phenotypes through known molecular interactions is still at its early stages.In this article we first discuss methodologies that employ groups of genes in disease classification tasks that aim to link gene expression patterns with disease outcome. Then we present a pathway-based approach for disease classification through a mathematical programming model based on hyper-box principles. Association rules derived from the model are extracted and discussed with respect to pathway-specific molecular patterns related to the disease. Overall, we argue that the use of gene sets corresponding to disease-relevant pathways is a promising route to uncover expression-to-phenotype relations in disease classification and we illustrate the potential of hyper-box classification in assessing the predictive power of functional pathways and uncover the effect of specific genes in the prediction of disease phenotypes.
KW - Disease classification
KW - Hyper-box-representation
KW - Mathematical programming
KW - Mixed integer optimisation
KW - Pathway-based classification
UR - http://www.scopus.com/inward/record.url?scp=84921389859&partnerID=8YFLogxK
U2 - 10.1016/j.mbs.2014.09.005
DO - 10.1016/j.mbs.2014.09.005
M3 - Article
AN - SCOPUS:84921389859
SN - 0025-5564
VL - 260
SP - 25
EP - 34
JO - Mathematical Biosciences
JF - Mathematical Biosciences
ER -