TY - JOUR
T1 - GeneSPIDER-gene regulatory network inference benchmarking with controlled network and data properties
AU - Tjärnberg, Andreas
AU - Morgan, Daniel C.
AU - Studham, Matthew
AU - Nordling, Torbjörn E.M.
AU - Sonnhammer, Erik L.L.
N1 - Publisher Copyright:
© The Royal Society of Chemistry.
PY - 2017
Y1 - 2017
N2 - A key question in network inference, that has not been properly answered, is what accuracy can be expected for a given biological dataset and inference method. We present GeneSPIDER-a Matlab package for tuning, running, and evaluating inference algorithms that allows independent control of network and data properties to enable data-driven benchmarking. GeneSPIDER is uniquely suited to address this question by first extracting salient properties from the experimental data and then generating simulated networks and data that closely match these properties. It enables data-driven algorithm selection, estimation of inference accuracy from biological data, and a more multifaceted benchmarking. Included are generic pipelines for the design of perturbation experiments, bootstrapping, analysis of linear dependence, sample selection, scaling of SNR, and performance evaluation. With GeneSPIDER we aim to move the goal of network inference benchmarks from simple performance measurement to a deeper understanding of how the accuracy of an algorithm is determined by different combinations of network and data properties.
AB - A key question in network inference, that has not been properly answered, is what accuracy can be expected for a given biological dataset and inference method. We present GeneSPIDER-a Matlab package for tuning, running, and evaluating inference algorithms that allows independent control of network and data properties to enable data-driven benchmarking. GeneSPIDER is uniquely suited to address this question by first extracting salient properties from the experimental data and then generating simulated networks and data that closely match these properties. It enables data-driven algorithm selection, estimation of inference accuracy from biological data, and a more multifaceted benchmarking. Included are generic pipelines for the design of perturbation experiments, bootstrapping, analysis of linear dependence, sample selection, scaling of SNR, and performance evaluation. With GeneSPIDER we aim to move the goal of network inference benchmarks from simple performance measurement to a deeper understanding of how the accuracy of an algorithm is determined by different combinations of network and data properties.
UR - http://www.scopus.com/inward/record.url?scp=85021632676&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85021632676&partnerID=8YFLogxK
U2 - 10.1039/c7mb00058h
DO - 10.1039/c7mb00058h
M3 - Article
AN - SCOPUS:85021632676
SN - 1742-206X
VL - 13
SP - 1304
EP - 1312
JO - Molecular BioSystems
JF - Molecular BioSystems
IS - 7
ER -