Drug Discovery Division
Material Design Division
Chem/Bioinformatics   (CBI)
drug discovery divisionHome>R&D> Chem/Bioinformatics (CBI)
Chemoinformatics and bioinformatics are relatively new terms, but they today have an innovative and powerful influence on various important subjects such as drug discovery or elucidating of important biological functions of enzymes. They also have a great deal to do with the current and potential contributions of computer science to human health. Especially chemoinformatics has an increasingly important role in drug discovery and drug design. In terms of this trend, the CBI team has been developing chemical software for drug discovery and understanding the relationships between chemical structure and biological activity. In 2004, our team released onto the market a commercial software package called "PreADME", intended to provide the most outstanding and practical guidance for drug discovery. This PreADME had great meaning as the first drug discovery software package released for commercial purpose in Korea, with an attempt to provide computational (or rational) solution to the early drug discovery. However, the missions of our CBI in Research Institute of Bioinformatics & Molecular Design (BMD) is like the following

1. Physico-chemical properties prediction of diverse chemicals
About 2,500 2D, 3D molecular descriptors, which represent various physico-chemical properties of compounds such as boiling point, water solubility, etc., are calculated in our product, PreADME. There are constitutional, topological, physico-chemical and geometrical descriptors that are useful to the prediction of ADME/Tox. properties of drug candidates, for example the number of H- bond acceptors, The number of rotatable bonds, calculated logP, topological polar surface area, molecular connectivity indices, and so on. In addition, we are very interested in implementing and integrating statistical computation, artificial intelligence like neural network and innovative computational algorithm into our research purpose, especially drug discovery. As a result of our endeavors and research, we have built robust and reliable predictive models of biological activities such as predicting the ADME/Tox. properties of drug candidates.

2. In Silico High Throughput ADME/Tox. Screening

Recently, the emphasis in the drug discovery of many pharmaceutical companies has been extended from finding bioactive molecules by the traditional QSAR to predicting the ADME/Tox. (absorption, distribution, metabolism, excretion, and toxicity) of drug candidates in advance, because a number of new drugs in the drug market have been failed due to poor pharmacokinetic properties. The important goal of the CBI team is constructing predictive models of ADME/Tox. and druglike properties of chemicals, finally intended for the rational drug design. Actually, we developed powerful predictive models using artificial neural network (with Resilient back-propagation (Rprop) method) in order to evaluate molecules as potential drug candidates, especially by predicting their permeability in Caco-2 and MDCK cells for oral bioavailability and by predicting their penetration through BBB (blood-brain barrier) and plasma protein binding for drug distribution. Furthermore, we are interested in the prediction of physico-chemical properties of chemicals that have a close connection to the bioavailability of therapeutic agents. In fact, we had successfully made computer programs to calculate molecular descriptors that are closely related to the ADME/Tox. properties. In summary, our ultimate goals are like the following:
Physicochemical property prediction: logP, logD, Polar surface area (PSA), pKa, DMSO solubility, buffer stability Absorption prediction: Caco-2 cell permeability, MDCK (Madin-Darby canine kindey) cell permeability, Human intestinal absorption (HIA), Skin permeability, MDR-MDCK2 p-gp screening
Distribution prediction: Blood-brain barrier (BBB) permeability, Plasma protein binding, Plasma stability, p-gp transporters
Metabolism prediction: Structure lability, Cytochrome P450s(3A4, 2D6, 2C9, 2C19, 1A2) induction, inhibition and specificity
Excretion prediction: Hepatic clearance, urinary and bile excretion
Pharmacokinetics prediction: Bioavailability (Blood level), volume of distribution, half life time, dose response, microsomal statbility
Toxicity prediction: Mutagenicity (Ames test), carcinogenicity (rodent bioassay), Rat and mouse oral LD50, Chronic LOAEL, hERG inhibition, skin sensitization, skin and eye irritation, hepatoxicity
Drug-likeness prediction: Lipinski¡¯s rule (Rule of Five), Drug-like rule based on CMC, MDDR, and
WDI, Leadlike rule

3. Chemical structure and property databases for drug design

We have constructed chemical databases including about 2,000,000 compounds and developed
various search engines for the drug discovery. Our chemical databases contains 2D/3D chemical
structures and their 2D/3D descriptors, for example, H-bond acceptor and donor, rotatable bond,
polar surface area, calculated logP, molecular connectivity, etc. for example. In our search engine,
not only simple search with chemical names, molecular weight and molecular formula but the
similarity search by 2D or 3D structures and 2D exact structure/substructure search are
prepared also.
Conceptual design of organic chemical database Over 2 million 2D, 3D chemical structures
    and their properties or information like common name
Exact, substructure, similarity searching in chemical database
2D fingerprint representation of molecular structure and properties for quick chemical searching
Design of viewer module (Bioinformatical support for drug-target discovery)

4. Chemical library design and analysis
Combinatorial chemistry and high-throughput screening (HTS) are now an established part of
the drug discovery process, but they are somewhat based on the probability of finding a hit in
screening experiments (using universal or general libraries spanning many chemical classes)
according to the number and variety of molecules screened. Because of the cost to synthesize
and screening an enormous number of chemicals, researchers are attempting to enhance the
information content of screening libraries, for example designing focused chemical libraries for
a particular purpose. We have developed powerful software to quickly calculate above 1,000
molecular descriptors of many molecules simultaneously, in order to design virtual combinatorial
libraries that includes enough molecular diversity. Also, implementing data minding technology
such as inferential statistics like PCA and artificial intelligence like genetic algorithm is of our
important interest. On the basis of such methods, we can integrate our software into the
development of lead drug candidates through chemical space navigation, or molecular diversity
analysis to optimally select molecule sets whose structures cover chemical space or span a
particular chemical property without the redundancy of the chemicals themselves.
Our main goals are like the below:
Molecular descriptors for Combi. Chem. and HTS library design: Fast topological descriptors :
   Kier & Hall descriptors, information contents, E-state keys and diverse fingerprint keys,
   BCUT values
2D, 3D fingerprint key design: 2D similarity, classification of atomic types, pharmacophore
   definition
Library selection rule: Considering molecular diversity, ADME/Tox, reagents cost, deconvolution
Chemical similarity: Tanimodo coefficient and distance using BCUT and fingerprint descriptor

5. In silico prediction of environmental toxicity

Recently, many numbers of chemicals are synthesized and released to the environment without
sufficient consideration of their effect to the environment. Predicting chemical toxicity and fate in
the environment is one of most interest subject in many industrial, government, and NGO. Firstly,
we are developing a prediction model for acute fish toxicity based on our developed descriptors
and machine learning methods. And we expand our prediction model to vary endpoints in the
environmental toxicity. Our final goal is prediction of the environmental effects of chemicals at
different levels of biological organization and chemicals fate in the environment.