A Difficulty Information Approach to Substituent Selection in QSAR Studies
David M. Borth, Robert J. McKay and J. Richard Elliott
Technometrics, Vol. 27, No. 1 (Feb., 1985), pp. 25-35
Published by: Taylor & Francis, Ltd. on behalf of American Statistical Association and American Society for Quality
Stable URL: http://www.jstor.org/stable/1270466
Page Count: 11
In the development of quantitative structure-activity relationships (QSAR), a small subset of chemical compounds must be chosen for synthesis from a much larger population of potentially bioactive molecules. The ultimate goal of the QSAR study is to determine, at the lowest cost, the most biologically active member of the population. Hence it is important that the sample be optimally selected for both predictive ability and ease of synthesis. This article describes a method, based on information theory, that simultaneously incorporates these concerns into the substituent selection process. This procedure is essentially a generalization of previous algorithms for obtaining a D-optimal design by examining prediction variances (Mitchell 1974). Results from applying this difficulty-information approach suggest that the method is capable of achieving large decreases in total synthesis difficulty at the expense of only a moderate decrease in predictive ability.
Technometrics © 1985 American Statistical Association