Thursday, September 25, 2008

From Biological Spectra (multiple protein binding data) to pharmacological profiling!

An ideal drug cures a decease and does not kill a patient (or even lab animals in the course of preclinical testing). Usual drug discovery paradigm is based on studying a compound's properties against a specific, normally decease-related (protein) target. The ability of a compound to bind (inhibit) a specific target is called efficacy.

Even if the efficacy is good, another important property of a compound is its toxicity. Toxicity is related to the compound physical properties, such as solubility etc, as well by its ability to bind to and hence inhibit various vital human proteins (and may be even DNA and RNA).

Common sense suggests that an ideal compound binds its specific drug related target and does not bind to anything else. Anything in between is toxic, at least to a some extent. For example, most of important properties utilize ATP molecules, which means that human body contains a lot of ATP-bindig proteins. If you make a drug attacking an ATP-binding site of a "bad" protein, most probably, a lot of "good" and useful proteins will be also affected. In that case your compound should be toxic. This is indeed the case for many cancer drugs attacking ATP-binding sites of kinases.

The latter statement is the foundation of our approach. Although it's quite conceptually simple, it's useless unless it can be supplemented by a meaningful mathematical model. Let us dwell into some more details to see how the whole thing can be made working.

Let us overview important properties of a drug candidate. First there is a bunch of physical properties, such as solubility, differential solubility, LogP (namely the difference between water and lipid solubility) etc. These quantities are easy to measure, are of direct physical meaning and can be pretty easily calculated (with or without QUANTUM software).

Another set of characteristics defines a compound ability to penetrate through cell membranes and its biochemical in liver. These are quantities deturmining bioavailability, half life, volume of distribution etc. None of such quantities can be evaluated using the simple physical properties alone. For example, drug absorbtion depends on the molecule interaction with proteins actively transporting the molecules through the cell membranes.

The bottom line: bioavailability and other quantities require understanding of a compound binding properties to a selected number of proteins participating in a compound transport and metabolism.

So the conclusion is that IF YOU KNOW WHICH PROTEINS ARE IMPORTANT, AND IF YOU CAN CALCULATE HOW YOUR COMPOUND BINDS TO THEM, YOU KNOW THE COMPOUND PHARMACOLOGICAL AND TOXICOLOGICAL PROPERTIES

Now the only problem how to identify those "important" proteins.

Fortunately, there are thousands of molecules with known properties. What we can do is the following:

- take a molecule
- calculate its binding to any human protein with known 3d structure
- use the obtained binding affinities (numbers) as a molecule's binding profile fingerprint (the Biological Spectrum), characterizing the ability of the molecule to interact with the whole human proteome

Now assume we know such Biological Spectra for 1000s molecules with well known properties. This means we can now datamine the fingerprints->known properites relations. The basic premise is, of course, that the molecules with similar fingerprints have similar properties.

We have a number of proofs of such technology working. The most recent one is the prediction of active transport drug absorption properties for drug like molecues based on the binding data against human brain hexokinase type I-related protein. We prove that the binding energy of a compound against the protein may serve to distinguish between the passively and actively transported molecules and even help to calculated the drug absorbtion quantitatevely.

No comments: