A key step in the development of new pharmaceutical drugs is that of identifying direct targets of the bioactive compounds, and distinguishing these from all other gene products that respond indirectly to the drug targets. Currently dominating approaches to this problem are based on often time consuming and costly experimental methods aimed at locating physical bindings of the corresponding small molecule to proteins or DNA sequences. In this paper we consider target identification based on time-series expression data of the corresponding gene regulatory network, using perturbation with the active compound only. As we show, the problem of identifying the direct targets can then be cast as a linear regression problem and, in principle, be accomplished with a number of samples equal to the number of involved genes and bioactive compounds. However, the regression matrix will typically be highly ill-conditioned and the target identification therefore prone even to small measurement uncertainties. In order to provide a label of confidence for the target identification, we consider conditions that can be used to quantify the robustness of the identification of individual drug targets with respect to uncertainty in the expression data. For this purpose, we cast the uncertain regression problem as a robust rank problem and employ SVD or the structured singular value to compute the robust rank. The proposed method is illustrated by application to a small scale gene regulatory network synthesised in yeast to serve as a benchmark problem in network inference.
All Science Journal Classification (ASJC) codes
- Control and Systems Engineering