Online citations, reference lists, and bibliographies.
Please confirm you are human
(Sign Up for free to never see this)
← Back to Search

About The Authors

I. Shmulevich
Published 2012 · Medicine

Save to my Library
Download PDF
Analyze on Scholarcy
Mining large datasets using machine learning approaches often leads to models that are hard to interpret and not amenable to the generation of hypotheses that can be experimentally tested. Finding ‘actionable knowledge’ is becoming more important, but also more challenging as datasets grow in size and complexity. We present ‘Logic Optimization for Binary Input to Continuous Output’ (LOBICO), a computational approach that infers small and easily interpretable logic models of binary input features that explain a binarized continuous output variable. Although the continuous output variable is binarized prior to optimization, the continuous information is retained to find the optimal logic model. Applying LOBICO to a large cancer cell line panel, we find that logic combinations of multiple mutations are more predictive of drug response than single gene predictors. Importantly, we show that the use of the continuous information leads to robust and more accurate logic models. LOBICO is formulated as an integer programming problem, which enables rapid computation on large datasets. Moreover, LOBICO implements the ability to uncover logic . CC-BY-NC-ND 4.0 International license peer-reviewed) is the author/funder. It is made available under a The copyright holder for this preprint (which was not . doi: bioRxiv preprint first posted online Jan. 15, 2016;
This paper references
The influence of variables on Boolean functions
J. Kahn (1988)
A Method of Producing a Boolean Function Having an Arbitrarily Prescribed Prime Implicant Table
James F. Gimpel (1965)
Markov Logic Networks in the Analysis of Genetic Data
Nikita A. Sakhanenko (2010)
PI3K pathway alterations in cancer: variations on a theme
T. Yuan (2008)
Cellular mechanisms of tumour suppression by the retinoblastoma gene
D. Burkhart (2008)
Regularization and variable selection via the elastic net
H. Zou (2005)
Systematic identification of genomic markers of drug sensitivity in cancer cells
M. Garnett (2012)
Inferring combinatorial association logic networks in multimodal genomewide
J De Ridder (2010)
Activities and sensitivities in boolean network models.
I. Shmulevich (2004)
An Implementation of Logical Analysis of Data
E. Boros (2000)
Detecting high-order interactions of single nucleotide polymorphisms using genetic programming
Robin Nunkesser (2007)
A continuous approach to inductive inference
A. Kamath (1992)
Sparse combinatorial inference with an application in cancer biology
S. Mukherjee (2009)
Genomics of Drug Sensitivity in Cancer (GDSC): a resource for therapeutic biomarker discovery in cancer cells
Wanjuan Yang (2013)
Collections of simultaneously altered genes as biomarkers of cancer cell drug response.
D. Masica (2013)
Inferring combinatorial association logic networks in multimodal genome-wide screens
J. Ridder (2010)
Markov logic networks
M. Richardson (2006)
Logic Regression
I. Ruczinski (2003)
Identifying interacting SNPs using Monte Carlo logic regression
C. Kooperberg (2005)
Genetic Interactions Between Transcription Factors Cause Natural Variation in Yeast
J. Gerke (2009)
Cumulative probability of false-positive recall or biopsy recommendation after 10 years of screening mammography: A cohort study
R. Hubbard (2012)
An Implementation of Logical Analysis of Data
BorosEndre (2000)
Individual and Combined Effects of Age, Breast Density, and Hormone Replacement Therapy Use on the Accuracy of Screening Mammography
P. Carney (2003)

Semantic Scholar Logo Some data provided by SemanticScholar