TECHNICAL FIELD
[0001] The present invention generally relates to a drug-metabolizing enzyme prediction
apparatus, a drug-metabolizing enzyme prediction method, a drug-metabolizing enzyme
prediction program, and a recording medium, and more particularly, to a drug-metabolizing
enzyme prediction apparatus, a drug-metabolizing enzyme prediction method, a drug-metabolizing
enzyme prediction program, and a recording medium, capable of accurately predicting
a molecular species of a drug-metabolizing enzyme and a reactive site of a compound
with a drug-metabolizing enzyme.
BACKGROUND ART
[0002] In recent years, about 50% or more of drugs developed are dropped out caused by a
problem on drug metabolism (for example, drug's power and side effect) in the stage
of drug development.
[0003] Therefore, it is greatly needed to evaluate the problem on drug metabolism related
to compounds and to narrow them down to any compound that clears the problem in the
early stage of drug development, in terms of development of drugs with fewer side
effects and of reduction in cost required for development of drugs.
[0004] Conventionally, to specify, for example, a molecular species and a reactive site
of a drug-metabolizing enzyme from a compound, a drug-metabolism screening test and
metabolite identification using a mass spectrum are conducted.
[0005] A technology for predicting a molecular species and a reactive site of a drug-metabolizing
enzyme on a computer is described in, for example, Nonpatent literature 1, Patent
document 1, Patent document 2, and Patent document 3.
[0006] Nonpatent literature 1 describes a technology for inputting a compound structure
and predicting an unknown metabolite on the analogy of known metabolic reactions collected
from documents. More specifically, it describes a technology of implementing an experimental
technique of extracting a structure of a reactive site between a substrate and a metabolite
from known metabolic reactions, and predicting a metabolite based on known metabolism
information if the structure of the reactive site extracted is found in a compound
to be predicted. This technology allows accurate prediction of a wide range of compounds
based on a population (compounds) of training data used for creating a prediction
model.
[0007] Furthermore, Patent document 1 describes a technology for predicting the relative
rate of every metabolic pathway for drugs by regarding a pathway with the fastest
reaction rate as a reaction that is the most possible reaction to occur, based on
stoichiometric data.
[0008] Patent document 2 describes a technology for predicting the susceptibility of every
metabolic site of drugs by regarding a site with the highest susceptibility as a first
candidate for a metabolic site, based on energy theoretical data.
[0010] However, all identifications of metabolites using the drug-metabolism screening test
and the mass spectrum are performed based on wet experiments, and this causes a problem
to arise such that cost and time are required enormously.
[0011] In the technology described in Nonpatent literature 1, because a prediction rate
is dependent on the quality of training data used, there is also a problem that it
is difficult to predict compounds which are extremely different from the population
(compounds) of the training data used for creation of the prediction model. In other
words, there is a problem that the range of predictable compounds is limited by the
population, and prediction cannot always be applied to all types of compounds. That
is, all types of compounds cannot always be accurately predicted.
[0012] In the technologies described in Patent document 1, Patent document 2, and Patent
document 3, because a cubic form of the compound is not calculated, there is a problem
that any compound, of which high reactivity is predicted based on energy calculation
but which cannot be bound to a drug-metabolizing enzyme in terms of its form, may
also be included in the compounds accordingly.
[0013] The present invention has been achieved to solve the conventional problems, and it
is an object of the present invention to provide a drug-metabolizing enzyme prediction
apparatus, a drug-metabolizing enzyme prediction method, a drug-metabolizing enzyme
prediction program, and a recording medium capable of accurately predicting a molecular
species of a drug-metabolizing enzyme and a reactive site of a compound with a drug-metabolizing
enzyme for all types of compounds without using training data for prediction and in
consideration of each cubic form of the compounds.
DISCLOSURE OF INVENTION,
[0014] The present invention proposes to predict a molecular species of a drug-metabolizing
enzyme and a reactive site of a compound with the drug-metabolizing enzyme.
[0015] The inventors of the present invention have found, as a result of keen examination,
that there are four sites where drug molecules are fixed, in a drug-metabolizing enzyme.
And the inventors named these four sites "binding site (or inducing site)", "reactive
site", "pinching point", and "open space site". Fig. 25 is a schematic of the binding
site, the reactive site, the pinching point, and the open space site in a compound.
[0016] In Fig. 25, at first, the binding site is a site where a compound and a drug-metabolizing
enzyme are bound to each other. The reactive site is a site where a metabolic reaction
occurs in the compound and in the drug-metabolizing enzyme. The pinching point is
an atom binding at least between the reactive site and the binding site. The open
space site is a site (a group of atoms) other than the binding site and the reactive
site which are bound to the pinching point.
[0017] In other words, the present invention is configured to predict a molecular species
and a reactive site of a drug-metabolizing enzyme by respectively estimating the four
sites.
[0018] To solve the above problems and to achieve the object, a drug-metabolizing enzyme
prediction apparatus according to one asepect of the present invention includes a
compound-structure-information acquiring unit that acquires compound structure information
including at least one of atomic coordinate information that is information on coordinates
of each of atoms forming a compound, bond information that is information on a bond
between the atoms, and ring structure information that is information on a ring structure
formed with a plurality of the atoms; a binding-site/molecular-species-information
identifying unit that identifies binding site information that is information on an
atom forming a binding site in the compound corresponding to the compound structure
information and molecular species information that is information on a molecular species,
from the compound structure information acquired by the compound-structure-information
acquiring unit, based on a binding-site identifying condition for identifying the
binding site that is a site where the compound and a drug-metabolizing enzyme are
bound to each other and the molecular species of the drug-metabolizing enzyme to be
bound; a pinching-point/reactive-site-information acquiring unit that specifies a
pinching point that is the atom binding at least between the binding site and a reactive
site that is a site where a metabolic reaction occurs in the compound and the drug-metabolizing
enzyme, from the binding site information identified by the binding-site/molecular-species-information
identifying unit and the compound structure information, and that acquires pinching
point information that is information on the atom of the specified pinching point
and the reactive site information that is information on the atom forming the reactive
site bound to the pinching point corresponding to the pinching point information;
a reactive-site-information identifying unit that identifies the reactive site information
acquired by the pinching-point/reactive-site-information acquiring unit, from the
molecular species information and the compound structure information, based on a reactive-site
identifying condition for identifying the reactive site; and a pinching-point-information
identifying unit that identifies the pinching point acquired by the pinching-point/reactive-site-information
acquiring unit, from the molecular species information and the compound structure
information, based on a pinching-point identifying condition for identifying the pinching
point.
[0019] In the drug-metabolizing enzyme prediction apparatus according to the present invention,
the binding-site/molecular-species-information identifying unit includes a binding-site
selecting unit that selects the binding site based on a predetermined binding-site
selection criterion; a binding-site-information acquiring unit that acquires the binding
site information for the binding site selected by the binding-site selecting unit,
from the compound structure information; and an acquired-binding-site/molecular-species-information
identifying unit that identifies the binding site information acquired by the binding-site-information
acquiring unit, from the compound structure information, based on the binding-site
identifying condition, and identifies the molecular species corresponding to the identified
binding site information as the molecular species information.
[0020] In the drug-metabolizing enzyme prediction apparatus according to the present invention,
the binding-site identifying condition is a condition in which a binding-site coordinate
range that is a range of the atomic coordinate information for the atom forming the
binding site is defined for each predetermined molecular species. The acquired-binding-site/molecular-species-information
identifying unit includes a coordinate-based acquired-binding-site/molecular-species-information
identifying unit that identifies the binding site information when the atomic coordinate
information for the atom that forms the binding site corresponding to the binding
site information acquired by the binding-site-information acquiring unit satisfies
the binding-site coordinate range defined by the binding-site identifying condition,
and identifies the molecular species corresponding to the satisfied binding-site coordinate
range as the molecular species information.
[0021] In the drug-metabolizing enzyme prediction apparatus according to the present invention,
the pinching-point/reactive-site-information acquiring unit includes an atom specifying
unit that specifies a directly bonded atom that is the atom bonded to the atom forming
the binding site corresponding to the binding site information identified by the binding-site/molecular-species-information
identifying unit and also specifies the atom bonded to the directly bonded atom, based
on the bond information included in the compound structure information; a ring-structure
determining unit that determines whether the atom specified by the atom specifying
unit is the atom forming the ring structure, based on the ring structure information
included in the compound structure information; a pinching-point/reactive-site specifying
unit that specifies the atom specified and another atom forming the ring structure
together with the atom, as the atoms forming the reactive site, when it is determined
by the ring-structure determining unit that the atom specified by the atom specifying
unit is the atom forming the ring structure, and that specifies the directly bonded
atom specified by the atom specifying unit, as the pinching point; and a specified-pinching-point/reactive-site-information
acquiring unit that acquires the pinching point information for the pinching point
specified by the pinching-point/reactive-site specifying unit and the reactive site
information for the reactive site specified, from the binding site information and
the compound structure information.
[0022] In the drug-metabolizing enzyme prediction apparatus according to the present invention,
the reactive-site identifying condition is such that a reactive-site coordinate range
that is a range of the atomic coordinate information for the atoms forming the reactive
site, a reactive-site angle range that is a range of an angle value between each of
the atoms forming the reactive site and the pinching point, and a reactive-site distance
range that is a range of a distance value between each of the atoms forming the reactive
site and the pinching point are defined for each of the molecular species previously
specified. The reactive-site-information identifying unit includes a coordinate-based
reactive-site-information determining unit that determines whether the atomic coordinate
information for the atoms, which form the reactive site corresponding to the reactive
site information acquired by the pinching-point/reactive-site-information acquiring
unit, satisfies the reactive-site coordinate range defined by the reactive-site identifying
condition, in the molecular species corresponding to the molecular species information
identified; an angle-based reactive-site-information determining unit that calculates
the angle value between the atom forming the reactive site corresponding to the reactive
site information and the pinching point corresponding to the pinching point information,
and determines whether the angle value calculated satisfies the reactive-site angle
range defined by the reactive-site identifying condition, in the molecular species
corresponding to the molecular species information identified; a distance-based reactive-site-information
determining unit that calculates the distance value between the atom forming the reactive
site corresponding to the reactive site information and the pinching point corresponding
to the pinching point information, and determines whether the distance value calculated
satisfies the reactive-site distance range defined by the reactive-site identifying
condition, in the molecular species corresponding to the molecular species information
identified; and a determination-result-based reactive-site-information identifying
unit that identifies the reactive site information determined, as being satisfied,
by the coordinate-based reactive-site-information determining unit, the angle-based
reactive-site-information determining unit, and the distance-based reactive-site-information
determining unit, as the reactive site information that satisfies the reactive-site
identifying condition.
[0023] In the drug-metabolizing enzyme prediction apparatus according to the present invention,
the pinching-point identifying condition is such that a pinching-point coordinate
range that is a range of the atomic coordinate information for the atom as the pinching
point is defined for each of the molecular species previously specified. The pinching-point-information
identifying unit includes a coordinate-based pinching-point-information identifying
unit that identifies the pinching point information when the atomic coordinate information
for the atom, which is the pinching point corresponding to the pinching point information
acquired by the pinching-point/reactive-site-information acquiring unit, satisfies
the pinching-point coordinate range defined by the pinching-point identifying condition,
in the molecular species corresponding to the molecular species information identified.
[0024] The drug-metabolizing enzyme prediction apparatus according to the present invention
further includes an open-space-site-information acquiring unit that acquires open-space
site information that is information on the atom forming an open space site, from
the binding site information, the reactive site information, the pinching point information,
and the compound structure information, if there exists the open space site that is
a site other than the reactive site corresponding to the reactive site information
identified by the reactive-site-information identifying unit and the binding site
corresponding to the binding site information identified by the binding-site/molecular-species-information
identifying unit and which is bound to the pinching point corresponding to the pinching
point information identified by the pinching-point-information identifying unit; and
an open-space-site-information identifying unit that identifies the open-space site
information acquired by the open-space-site-information acquiring unit, from the molecular
species information and the compound structure information, based on an open-space-site
identifying condition for identifying the open space site.
[0025] In the drug-metabolizing enzyme prediction apparatus according to the present invention,
the open-space-site identifying condition is such that an open-space-site coordinate
range, which is a range of the atomic coordinate information for the atom forming
the open space site, is defined for each of the molecular species previously specified.
The open-space-site-information identifying unit includes a coordinate-based open-space-site-information
identifying unit that identifies the open-space site information when the atomic coordinate
information for the atom, which forms the open space site corresponding to the open-space
site information acquired by the open-space-site-information acquiring unit, satisfies
the open-space-site coordinate range defined by the open-space-site identifying condition,
in the molecular species corresponding to the molecular species information identified.
[0026] The drug-metabolizing enzyme prediction apparatus according to the present invention
further includes an inhibition determining unit that determines whether the atom forming
the reactive site corresponding to the reactive site information or a partial-structure
atom group which is a group of part of the atoms in a molecular structure at the reactive
site inhibits the metabolic reaction, when the reactive site information is identified
by the reactive-site-information identifying unit, based on an inhibition determining
condition for determining whether the metabolic reaction is inhibited in the compound
of which the reactive site information is identified.
[0027] A drug-metabolizing enzyme prediction program according to another aspect of the
present invention causes a computer to execute a compound-structure-information acquiring
procedure of acquiring compound structure information including at least one of atomic
coordinate information that is information on coordinates of each of atoms forming
a compound, bond information that is information on a bond between the atoms, and
ring structure information that is information on a ring structure formed with a plurality
of the atoms; a binding-site/molecular-species-information identifying procedure of
identifying binding site information that is information on an atom forming a binding
site in the compound corresponding to the compound structure information and molecular
species information that is information on a molecular species, from the compound
structure information acquired at the compound-structure-information acquiring procedure,
based on a binding-site identifying condition for identifying the binding site that
is a site where the compound and a drug-metabolizing enzyme are bound to each other
and the molecular species of the drug-metabolizing enzyme to be bound; a pinching-point/reactive-site-information
acquiring procedure of specifying a pinching point that is the atom binding at least
between the binding site and a reactive site that is a site where a metabolic reaction
occurs in the compound and the drug-metabolizing enzyme, from the binding site information
identified at the binding-site/molecular-species-information identifying procedure
and the compound structure information, and acquiring pinching point information that
is information on the atom of the specified pinching point and the reactive site information
that is information on the atom forming the reactive site bound to the pinching point
corresponding to the pinching point information; a reactive-site-information identifying
procedure of identifying the reactive site information acquired at the pinching-point/reactive-site-information
acquiring procedure, from the molecular species information and the compound structure
information, based on a reactive-site identifying condition for identifying the reactive
site; and a pinching-point-information identifying procedure of identifying the pinching
point acquired at the pinching-point/reactive-site-information acquiring procedure,
from the molecular species information and the compound structure information, based
on a pinching-point identifying condition for identifying the pinching point.
[0028] In the drug-metabolizing enzyme prediction program according to the present invention,
the binding-site/molecular-species-information identifying procedure includes a binding-site
selecting procedure of selecting the binding site based on a predetermined binding-site
selection criterion; a binding-site-information acquiring procedure of acquiring the
binding site information for the binding site selected at the binding-site selecting
procedure, from the compound structure information; and an acquired-binding-site/molecular-species-information
identifying procedure of identifying the binding site information acquired at the
binding-site-information acquiring procedure, from the compound structure information,
based on the binding-site identifying condition, and identifies the molecular species
corresponding to the identified binding site information as the molecular species
information.
[0029] In the drug-metabolizing enzyme prediction program according to the present invention,
the binding-site identifying condition is a condition in which a binding-site coordinate
range that is a range of the atomic coordinate information for the atom forming the
binding site is defined for each predetermined molecular species. The acquired-binding-site/molecular-species-information
identifying procedure includes a coordinate-based acquired-binding-site/molecular-species-information
identifying procedure of identifying the binding site information when the atomic
coordinate information for the atom that forms the binding site corresponding to the
binding site information acquired at the binding-site-information acquiring procedure
satisfies the binding-site coordinate range defined by the binding-site identifying
condition, and identifying the molecular species corresponding to the satisfied binding-site
coordinate range as the molecular species information.
[0030] In the drug-metabolizing enzyme prediction program according to the present invention,
the pinching-point/reactive-site-information acquiring procedure includes an atom
specifying procedure of specifying a directly bonded atom that is the atom bonded
to the atom forming the binding site corresponding to the binding site information
identified at the binding-site/molecular-species-information identifying procedure
and also specifying the atom bonded to the directly bonded atom, based on the bond
information included in the compound structure information; a ring-structure determining
procedure of determining whether the atom specified at the atom specifying procedure
is the atom forming the ring structure, based on the ring structure information included
in the compound structure information; a pinching-point/reactive-site specifying procedure
of specifying the atom specified and another atom forming the ring structure together
with the atom, as the atoms forming the reactive site, when it is determined at the
ring-structure determining procedure that the atom specified at the atom specifying
procedure is the atom forming the ring structure, and specifying the directly bonded
atom specified at the atom specifying procedure, as the pinching point; and a specified
pinching-point/reactive-site-information acquiring procedure of acquiring the pinching
point information for the pinching point specified at the pinching-point/reactive-site
specifying procedure and the reactive site information for the reactive site specified,
from the binding site information and the compound structure information.
[0031] In the drug-metabolizing enzyme prediction program according to the present invention,
the reactive-site identifying condition is such that a reactive-site coordinate range
that is a range of the atomic coordinate information for the atoms forming the reactive
site, a reactive-site angle range that is a range of an angle value between each of
the atoms forming the reactive site and the pinching point, and a reactive-site distance
range that is a range of a distance value between each of the atoms forming the reactive
site and the pinching point are defined for each of the molecular species previously
specified. The reactive-site-information identifying procedure includes a coordinate-based
reactive-site-information determining procedure of determining whether the atomic
coordinate information for the atoms, which form the reactive site corresponding to
the reactive site information acquired at the pinching-point/reactive-site-information
acquiring procedure, satisfies the reactive-site coordinate range defined by the reactive-site
identifying condition, in the molecular species corresponding to the molecular species
information identified; an angle-based reactive-site-information determining procedure
of calculating the angle value between the atom forming the reactive site corresponding
to the reactive site information and the pinching point corresponding to the pinching
point information, and determining whether the angle value calculated satisfies the
reactive-site angle range defined by the reactive-site identifying condition, in the
molecular species corresponding to the molecular species information identified; a
distance-based reactive-site-information determining procedure of calculating the
distance value between the atom forming the reactive site corresponding to the reactive
site information and the pinching point corresponding to the pinching point information,
and determining whether the distance value calculated satisfies the reactive-site
distance range defined by the reactive-site identifying condition, in the molecular
species corresponding to the molecular species information identified; and a determination-result-based
reactive-site-information identifying procedure of identifying the reactive site information
determined, as being satisfied, at the coordinate-based reactive-site-information
determining procedure, the angle-based reactive-site-information determining procedure,
and the distance-based reactive-site-information determining procedure, as the reactive
site information that satisfies the reactive-site identifying condition.
[0032] In the drug-metabolizing enzyme prediction program according to the present invention,
the pinching-point identifying condition is such that a pinching-point coordinate
range that is a range of the atomic coordinate information for the atom as the pinching
point is defined for each of the molecular species previously specified. The pinching-point-information
identifying procedure includes a coordinate-based pinching-point-information identifying
procedure of identifying the pinching point information when the atomic coordinate
information for the atom, which is the pinching point corresponding to the pinching
point information acquired at the pinching-point/reactive-site-information acquiring
procedure, satisfies the pinching-point coordinate range defined by the pinching-point
identifying condition, in the molecular species corresponding to the molecular species
information identified.
[0033] The drug-metabolizing enzyme prediction program according to the present invention
further cuases the computer to execute an open-space-site-information acquiring procedure
of acquiring open-space site information that is information on the atom forming an
open space site, from the binding site information, the reactive site information,
the pinching point information, and the compound structure information, if there exists
the open space site that is a site other than the reactive site corresponding to the
reactive site information identified at the reactive-site-information identifying
procedure and the binding site corresponding to the binding site information identified
at the binding-site/molecular-species-information identifying procedure and which
is bound to the pinching point corresponding to the pinching point information identified
at the pinching-point-information identifying procedure; and an open-space-site-information
identifying procedure of identifying the open-space site information acquired at the
open-space-site-information acquiring procedure, from the molecular species information
and the compound structure information, based on an open-space-site identifying condition
for identifying the open space site.
[0034] In the drug-metabolizing enzyme prediction program according to the present invention,
the open-space-site identifying condition is such that an open-space-site coordinate
range, which is a range of the atomic coordinate information for the atom forming
the open space site, is defined for each of the molecular species previously specified.
The open-space-site-information identifying procedure includes a coordinate-based
open-space-site-information identifying procedure of identifying the open-space site
information when the atomic coordinate information for the atom, which forms the open
space site corresponding to the open-space site information acquired at the open-space-site-information
acquiring procedure, satisfies the open-space-site coordinate range defined by the
open-space-site identifying condition, in the molecular species corresponding to the
molecular species information identified.
[0035] The drug-metabolizing enzyme prediction program according to the present invention
further causes the computer to execute an inhibition determining procedure of determining
whether the atom forming the reactive site corresponding to the reactive site information
or a partial-structure atom group which is a group of part of the atoms in a molecular
structure at the reactive site inhibits the metabolic reaction, when the reactive
site information is identified at the reactive-site-information identifying procedure,
based on an inhibition determining condition for determining whether the metabolic
reaction is inhibited in the compound of which the reactive site information is identified.
[0036] A computer-readable recording medium according to still another aspect of the present
invention stores therein the drug-metabolizing enzyme prediction program according
to the present invention.
[0037] The present invention is configured to acquire the compound structure information
including at least one of the atomic coordinate information being information on coordinates
of each of atoms forming a compound, the bond information being information on a bond
between the atoms, and the ring structure information being information on the ring
structure formed with a plurality of the atoms; identify the binding site information
being information on the atom forming a binding site in the compound corresponding
to the compound structure information and the molecular species information being
information on the molecular species, from the compound structure information acquired,
based on the binding-site identifying condition for identifying the binding site,
which is a site where the compound and the drug-metabolizing enzyme are bound to each
other, and the molecular species of the drug-metabolizing enzyme bound; specify the
pinching point which is the atom binding at least the reactive site where a metabolic
reaction occurs in the compound and the drug-metabolizing enzyme to the binding site,
from the binding site information identified and the compound structure information;
acquire the pinching point information being information on the atom as the pinching
point specified and the reactive site information being information on the atoms forming
the reactive site bound to the pinching point corresponding to the pinching point
information; identify the reactive site information acquired, from the molecular species
information and the compound structure information, based on the reactive-site identifying
condition for identifying the reactive site; and identify the pinching point information
acquired, from the molecular species information and the compound structure information,
based on the pinching-point identifying condition for identifying the pinching point.
Therefore, it is possible to accurately predict a molecular species of a drug-metabolizing
enzyme and a reactive site of a compound with a drug-metabolizing enzyme, for all
types of compounds without using training data for prediction and in consideration
of each cubic form of the compounds.
BRIEF DESCRIPTION OF DRAWINGS
[0038] Fig. 1 is a principle diagram of a basic principle of the present invention; Fig.
2 is a block diagram of one example of a system to which the present invention is
applied; Fig. 3 is a diagram of one example of information stored in a compound-structure
information file 106a; Fig. 4 is a diagram of one example of information stored in
a binding-site identifying condition file 106b; Fig. 5 is a diagram of one example
of information stored in a binding-site/molecular-species information file 106c; Fig.
6 is a diagram of one example of information stored in a pinching-point information
file 106d; Fig. 7 is a diagram of one example of information stored in a reactive-site
information file 106e; Fig. 8 is a diagram of one example of information stored in
a reactive-site identifying condition file 106f; Fig. 9 is a diagram of one example
of information stored in a pinching-point identifying condition file 106g; Fig. 10
is a diagram of one example of information stored in an open-space-site information
file 106h; Fig. 11 is a diagram of one example of information stored in an open-space-site
identifying condition file 106i; Fig. 12 is a diagram of one example of information
stored in a prediction result file 106j; Fig. 13 is a block diagram of one example
of a binding-site/molecular-species identifying unit 102b in the system to which the
present invention is applied; Fig. 14 is a block diagram of one example of a pinching-point/reactive-site-information
acquiring unit 102c in the system to which the present invention is applied; Fig.
15 is a block diagram of one example of a reactive-site-information identifying unit
102d in the system to which the present invention is applied; Fig. 16 is a block diagram
of one example of a pinching-point-information identifying unit 102e in the system
to which the present invention is applied; Fig. 17 is a block diagram of one example
of an open-space-site-information identifying unit 102g in the system to which the
present invention is applied; Fig. 18 is a flowchart of one example of the main process
in the system according to one embodiment of the present invention; Fig. 19 is a flowchart
of one example of a binding-site/molecular-species-information identifying process
in the system according to the embodiment; Fig. 20 is a flowchart of one example of
a pinching-point/reactive-site-information acquiring process in the system according
to the embodiment; Fig. 21 is a flowchart of one example of a reactive-site-information
identifying process in the system according to the embodiment; Fig. 22 is a flowchart
of one example of a reactive-site coordinate range determining process in the system
according to the embodiment; Fig. 23 is a flowchart of one example of a pinching-point-information
identifying process in the system according to the embodiment; Fig. 24 is a flowchart
of one example of an open-space-site-information identifying process in the system
according to the embodiment; Fig. 25 is a schematic of the binding site, the reactive
site, the pinching point, and the open space site in a compound; Fig. 26 is a schematic
of one example of a binding-site selection criterion; Fig. 27 is a schematic of one
example of a binding-site identifying condition; Fig. 28 is a schematic of one example
of a reactive-site identifying condition; Fig. 29 is a schematic of one example of
the reactive-site identifying condition; Fig. 30 is a schematic of one example of
an open-space-site identifying condition; and Fig. 31 is a diagram for explaining
one example of the main process in a drug-metabolizing enzyme prediction apparatus
100 according to an example of the present invention.
BEST MODE(S) FOR CARRYING OUT THE INVENTION
[0039] Exemplary embodiments of a drug-metabolizing enzyme prediction apparatus, a drug-metabolizing
enzyme prediction method, a drug-metabolizing enzyme prediction program, and a recording
medium according to the present invention are explained in detail below with reference
to the accompanying drawings. It should be noted that the present invention is not
limited by these embodiments.
[Outline of the present invention]
[0040] The outline of the present invention is explained below first, and the configuration
and processes of the present invention are explained in detail next.
[0041] Fig. 1 is a principle diagram of a basic principle of the present invention.
[0042] The present invention schematically includes the following basic features. That is,
at first, the present invention acquires compound structure information including
at least one of atomic coordinate information being information on coordinates of
each of atoms forming a compound, bond information being information on a bond between
the atoms, and ring structure information being information on a ring structure formed
with a plurality of the atoms (step S-1)..
[0043] Then, the present invention identifies binding site information being information
on an atom forming a binding site in a compound corresponding to the compound structure
information and also molecular species information being information on a molecular
species, from the compound structure information acquired at step S-1, based on a
binding-site identifying condition for identifying a binding site which is a site,
where a compound and a drug-metabolizing enzyme are bound to each other, and also
a molecular species of the drug-metabolizing enzyme to be bound (step S-2).
[0044] From the binding site information identified at step S-2 and the compound structure
information acquired at step S-1, the present invention identifies a pinching point
which is an atom binding at least a reactive site being a site, where a metabolic
reaction occurs in the compound and the drug-metabolizing enzyme, to the binding site,
and acquires pinching point information being information on the atom as the pinching
point identified and reactive site information being information on an atom forming
the reactive site bound to the pinching point corresponding to the pinching point
information (step S-3).
[0045] Then, the present invention identifies the reactive site information acquired at
step S-3, from the molecular species information and the compound structure information,
based on a reactive-site identifying condition for identifying a reactive site (step
S-4).
[0046] Thereafter, the present invention identifies the pinching point information acquired
at step S-3, from the molecular species information and the compound structure information,
based on the pinching-point identifying condition for identifying a pinching point
(step S-5).
[0047] Here, if there exists an open space site being a site which is other than the reactive
site corresponding to the reactive site information identified at step S-4 and the
binding site corresponding to the binding site information identified at step S-2
and which is bound to the pinching point corresponding to the pinching point information
identified at step S-5, the present invention may acquire open-space-site information
being information on an atom forming an open space site from the binding site information,
the reactive site information, the pinching point information, and the compound structure
information, and identify the open-space-site information acquired, from the molecular
species information and the compound structure information, based on the open-space-site
identifying condition for identifying an open space site.
[0048] When the reactive site information is identified at step S-4, the present invention
may determine an atom forming a reactive site corresponding to the reactive site information
or a partial-structure atom group which is a group of part of atoms in a molecular
structure of the reactive site, based on an inhibition determining condition (e.g.,
a list of predetermined atoms as inhibitors) for determining whether a metabolic reaction
is inhibited in the compound of which the reactive site information is identified.
[0049] Furthermore, the present invention may output, as a prediction result for a compound,
at least one of the binding site information, the molecular species information, the
reactive site information, the pinching point information, the open-space site information,
and the inhibition determination result in the compound, which are identified (predicted).
[0050] As explained above, the present invention can automatically identify the binding
site, the pinching point, the reactive site, and the open space site in the compound
from the compound structure information. Therefore, it is possible to accurately predict
a molecular species of a drug-metabolizing enzyme and a reactive site of a compound
with the drug-metabolizing enzyme for all kinds of compounds without using training
data for prediction and in consideration of each cubic form of the compounds.
[0051] The present invention can further determine (check) whether any atom inhibiting a
metabolic reaction is contained in the reactive site identified, for the compound
of which the reactive site with the drug-metabolizing enzyme is identified. Therefore,
a useful compound group can be narrowed highly accurately, for example, in the early
stage of drug development.
[System configuration]
[0052] The configuration of the system is explained in detail below.
[0053] Fig. 2 is a block diagram of one example of the system to which the present invention
is applied, and only a portion related to the present invention in the configuration
is conceptually shown therein.
[0054] In Fig. 2, a network 300 such as the Internet has functions of mutually connecting
between the drug-metabolizing enzyme prediction apparatus 100 and an external system
200.
[0055] In Fig. 2, the external system 200 is mutually connected with the drug-metabolizing
enzyme prediction apparatus 100 through the network 300, and has a function of providing
a compound library, an external database on the compound structure information, and
a website for executing various external programs, to users.
[0056] Here, the external system 200 may be configured as a WEB server or an ASP server,
and its hardware may be configured by an information processor such as commercially
available work station and personal computer, and its attachment device. The functions
of the external system 200 are implemented by a CPU, a disk drive, a memory device,
an input device, an output device, a communication control device in the hardware
of the external system 200, and by programs for controlling these devices.
[0057] In Fig. 2, the drug-metabolizing enzyme prediction apparatus 100 schematically includes
a control unit 102 such as CPU that integrally controls the whole of the drug-metabolizing
enzyme prediction apparatus 100; a communication-control interface unit 104 connected
to a communication device (not shown) such as a router connected to a communication
line and the like; a memory unit 106 that stores various types of databases and files;
and an input/output-control interface unit 108 connected to an input device 112 and
an output device 114. These units are communicably connected to one another through
arbitrary communication paths. Furthermore, the drug-metabolizing enzyme prediction
apparatus 100 is communicably connected to the network 300 through a communication
device such as a router and a wired or a wireless communication line such as a dedicated
line.
[0058] The various types of databases, tables, and files (the compound-structure information
file 106a to the prediction result file 106j) stored in the memory unit 106 of Fig.
2 are storage units such as fixed disk devices, which store various programs, tables,
files, and databases used for various processes, and files for web pages.
[0059] Among these components in the memory unit 106, the compound-structure information
file 106a is a compound-structure-information storage unit that stores the compound
structure information including at least one of the atomic coordinate information
being information on coordinates of each of atoms forming a compound, the bond information
being information on a bond between the atoms, and the ring structure information
being information on a ring structure formed with a plurality of the atoms. Information
stored in the compound-structure information file 106a is explained below with reference
to Fig. 3.
[0060] Fig. 3 is a diagram of one example of information stored in the compound-structure
information file 106a. As shown in Fig. 3, the information stored in the compound-structure
information file 106a includes atom identifying information for uniquely identifying
an atom forming a compound; an atomic symbol of the atom corresponding to the atom
identifying information; atomic coordinate information (x coordinate value, y coordinate
value, and z coordinate value) for the atom corresponding to the atom identifying
information; bond information that includes bond identifying information for uniquely
identifying the bond information, atom identifying information for an atom bonded
to the atom corresponding to the atom identifying information in the bond identifying
information, and bond type information on bond types; and ring structure information
that includes ring identifying information for uniquely identifying the ring structure
information and also includes atom identifying information for atoms forming the ring
structure, which are mutually associated with one another.
[0061] The compound-structure information file 106a may be external compound-structure information
databases, which are accessed through the Internet, or may be an in-house database
created by copying these databases, storing original compound structure information,
and further adding unique annotation information and so forth.
[0062] The binding-site identifying condition file 106b is a binding-site identifying condition
storage unit that stores the binding-site identifying condition (e.g., condition under
which a binding-site coordinate range, which is a range of atomic coordinate information
for an atom forming a binding site, is defined for each of predetermined molecular
species) for identifying a binding site being a site where a compound and a drug-metabolizing
enzyme are bound to each other and also identifying a molecular species of the drug-metabolizing
enzyme to be bound. Here, information stored in the binding-site identifying condition
file 106b is explained below with reference to Fig. 4.
[0063] Fig. 4 is a diagram of one example of information stored in the binding-site identifying
condition file 106b. As shown in Fig. 4, the information stored in the binding-site
identifying condition file 106b includes molecular species information and ranges
(range of ±x direction, range of ±y direction, and range of ±z direction) of atomic
coordinate information for an atom forming a binding site, which are mutually associated
with one another (in a matrix form).
[0064] The binding-site/molecular-species information file 106c is a binding-site/molecular-species
information storage unit that stores the binding site information and the molecular
species information identified by a binding-site/molecular-species-information identifying
unit 102b explained later. Information stored in the binding-site/molecular-species
information file 106c is explained below with reference to Fig. 5.
[0065] Fig. 5 is a diagram of one example of information stored in the binding-site/molecular-species
information file 106c. As shown in Fig. 5, the information stored in the binding-site/molecular-species
information file 106c includes binding-site identifying information for uniquely identifying
a binding site, atom identifying information for an atom forming the binding site,
and molecular species information for a drug-metabolizing enzyme bound to the binding
site, which are mutually associated with one another.
[0066] The pinching-point information file 106d is a pinching-point information storage
unit that stores pinching point information which is identified by the pinching-point-information
identifying unit 102e and which is information on a pinching point being an atom for
binding at least a reactive site to a binding site, the reactive site being a site
where a metabolic reaction occurs in a compound and a drug-metabolizing enzyme. Information
stored in the pinching-point information file 106d is explained below with reference
to Fig. 6.
[0067] Fig. 6 is a diagram of one example of information stored in the pinching-point information
file 106d. As shown in Fig. 6, the information stored in the pinching-point information
file 106d includes pinching-point identifying information for uniquely identifying
a pinching point, atom identifying information for an atom as the pinching point,
binding-site identifying information for a binding site bound to the pinching point,
reactive-site identifying information for a reactive site bound to the pinching point,
and open-space-site identifying information for an open space site bound to the pinching
point, which are mutually associated with one another.
[0068] The reactive-site information file 106e is a reactive site information storage unit
that stores reactive site information identified by the reactive-site-information
identifying unit 102d explained later. Information stored in the reactive-site information
file 106e is explained below with reference to Fig. 7.
[0069] Fig. 7 is a diagram of one example of information stored in the reactive-site information
file 106e. As shown in Fig. 7, the information stored in the reactive-site information
file 106e includes reactive-site identifying information for uniquely identifying
a reactive site, pinching-point identifying information for a pinching point bound
to the reactive site, and atom identifying information for atoms forming the reactive
site, which are mutually associated with one another.
[0070] The reactive-site identifying condition file 106f is a reactive-site identifying
condition storage unit that stores the reactive-site identifying condition for identifying
a reactive site being a site where a metabolic reaction occurs in a compound and a
drug-metabolizing enzyme (e.g., condition under which the ranges as follows are defined
for each of the predetermine molecular species: a reactive-site coordinate range being
a range of atomic coordinate information for atoms forming the reactive site, a reactive-site
angle range being a range of an angle value between each of the atoms forming the
reactive site and the pinching point, and a reactive-site distance range being a range
of a distance value between each of the atoms forming the reactive site and the pinching
point). Information stored in the reactive-site identifying condition file 106f is
explained below with reference to Fig. 8.
[0071] Fig. 8 is a diagram of one example of information stored in the reactive-site identifying
condition file 106f. As shown in Fig. 8, the information stored in the reactive-site
identifying condition file 106f includes the molecular species information, ranges
of the atomic coordinate information for an atom forming a reactive site (range of
±x direction, range of ±y direction, and range of ±z direction), a range of an angle
value between the atom forming the reactive site and the pinching point, and a range
of a distance value between the atom forming the reactive site and the pinching point,
which are mutually associated with one another (in a matrix form).
[0072] The pinching-point identifying condition file 106g is a pinching-point identifying
condition storage unit that stores the pinching-point identifying condition for identifying
a pinching point which is an atom binding at least between a reactive site and a binding
site. Information stored in the pinching-point identifying condition file 106g is
explained below with reference to Fig. 9.
[0073] Fig. 9 is a diagram of one example of information stored in the pinching-point identifying
condition file 106g. As shown in Fig. 9, the information stored in the pinching-point
identifying condition file 106g includes the molecular species information, and the
ranges of the atomic coordinate information for an atom as a pinching point (range
of ±x direction, range of ±y direction, and range of ±z direction), which are mutually
associated with one another (in a matrix form).
[0074] The open-space-site information file 106h is an open-space-site-information storage
unit that stores open-space site information identified by the open-space-site-information
identifying unit 102g explained later. Information stored in the open-space-site information
file 106h is explained below with reference to Fig. 10.
[0075] Fig. 10 is a diagram of one example of information stored in the open-space-site
information file 106h. As shown in Fig. 10, the information stored in the open-space-site
information file 106h includes open-space-site identifying information for uniquely
identifying an open space site, pinching-point identifying information for a pinching
point bound to the open space site, and the atom identifying information for an atom
forming the open space site, which are mutually associated with one another.
[0076] The open-space-site identifying condition file 106i is an open-space-site identifying
condition storage unit that stores the open-space-site identifying condition for identifying
an open space site being a site which is other than the reactive site and the binding
site and is bound to the pinching point. Information stored in the open-space-site
identifying condition file 106i is explained below with reference to Fig. 11.
[0077] Fig. 11 is a diagram of one example of information stored in the open-space-site
identifying condition file 106i. As shown in Fig. 11, the information stored in the
open-space-site identifying condition file 106i includes the molecular species information,
and the ranges of the atomic coordinate information for an atom forming the open space
site (range of ±x direction, range of ±y direction, and range of ±z direction), which
are mutually associated with one another (in'a matrix form).
[0078] The prediction result file 106j is a prediction-result storage unit that stores a
prediction result including at least one of the binding site information, the molecular
species information, the reactive site information, the pinching point information,
the open-space site information, and the inhibition determination result, for a compound,
which are identified. Information stored in the prediction result file 106j is explained
below with reference to Fig. 12.
[0079] Fig. 12 is a diagram of one example of information stored in the prediction result
file 106j. As shown in Fig. 12, the information stored in the prediction result file
106j includes the binding-site identifying information for the binding site identified,
the molecular species information identified, the pinching-point identifying information
for the pinching point identified, the reactive-site identifying information for the
reactive site identified, the open-space-site identifying information for the open
space site identified, and the inhibition determination result that is the result
of determination on the inhibition of the reactive site in an inhibition determining
unit 102h explained later, which are mutually associated with one another.
[0080] As other information, the memory unit 106 of the drug-metabolizing enzyme prediction
apparatus 100 stores, for example, programs for automatically generating a plurality
of conformations.
[0081] In Fig. 2, the communication-control interface unit 104 controls communications between
the drug-metabolizing enzyme prediction apparatus 100 and the network 300 (or the
communication device such as a router). That is, the communication-control interface
unit 104 has a function of communicating data with other terminals through the communication
line.
[0082] In Fig. 2, the input/output-control interface unit 108 controls the input device
112 and the output device 114. As the output device 114, a speaker can be used other
than a monitor (including a home television) (hereinafter, the output device 114 may
sometimes be described "monitor"). As the input device 112, a keyboard, a mouse, and
a microphone can be used. The monitor also implements a function as a pointing device
in corporation with the mouse.
[0083] Furthermore, in Fig. 2, the control unit 102 includes an internal memory for storing
control programs such as OS (Operating System), programs defining various types of
procedures, and required data, and performs information processing to execute various
processes by these programs. The control unit 102 functionally includes a compound-structure-information
acquiring unit 102a, the binding-site/molecular-species-information identifying unit
102b, the pinching-point/reactive-site-information acquiring unit 102c, the reactive-site-information
identifying unit 102d, the pinching-point-information identifying unit 102e, an open-space-site-information
acquiring unit 102f, the open-space-site-information identifying unit 102g, the inhibition
determining unit 102h, and a prediction-result output unit 102i.
[0084] Among these, the compound-structure-information acquiring unit 102a acquires the
compound structure information including at least one of the atomic coordinate information
being information on coordinates of each of atoms forming a compound, the bond information
being information on a bond between the atoms, and the ring structure information
being information on a ring structure formed with a plurality of the atoms.
[0085] The binding-site/molecular-species-information identifying unit 102b identifies the
binding site information being information on an atom forming a binding site in a
compound corresponding to the compound structure information, and the molecular species
information being information on molecular species, based on the binding-site identifying
condition. Here, as shown in Fig. 13, the binding-site/molecular-species-information
identifying unit 102b further includes a binding-site selecting unit 102j, a binding-site-information
acquiring unit 102k, and an acquired-binding-site/molecular-species-information identifying
unit 102m.
[0086] Fig. 13 is a block diagram of one example of the binding-site/molecular-species-information
identifying unit 102b in the system to which the present invention is applied, and
only a portion related to the present invention in the configuration is conceptually
shown therein. In Fig. 13, the binding-site selecting unit 102j selects a binding
site based on a predetermined binding-site selection criterion. The binding-site-information
acquiring unit 102k acquires binding site information from the compound structure
information. The acquired-binding-site/molecular-species-information identifying unit
102m identifies the binding site information acquired in the binding-site-information
acquiring unit 102k from the compound structure information based on the binding-site
identifying condition, and identifies the molecular species corresponding to the binding
site information identified, as molecular species information. Here, as shown in Fig.
13, the acquired-binding-site/molecular-species-information identifying unit 102m
further includes a coordinate-based acquired-binding-site/molecular-species-information
identifying unit 102n. The coordinate-based acquired-binding-site/molecular-species-information
identifying unit 102n identifies binding site information when the atomic coordinate
information for an atom forming a binding site corresponding to the binding site information
acquired in the binding-site-information acquiring unit 102k satisfies a binding-site
coordinate range defined by the binding-site identifying condition, and identifies
the molecular species corresponding to the binding-site coordinate range satisfied,
as molecular species information.
[0087] Referring back to Fig. 2, the pinching-point/reactive-site-information acquiring
unit 102c specifies a pinching point, and acquires pinching point information for
the pinching point specified and reactive site information for a reactive site bound
to the pinching point specified, from the binding site information identified and
the compound structure information. Here, as shown in Fig. 14, the pinching-point/reactive-site-information
acquiring unit 102c further includes an atom specifying unit 102p, a ring-structure
determining unit 102q, a pinching-point/reactive-site specifying unit 102r, and a
specified-pinching-point/reactive-site-information acquiring unit 102s.
[0088] Fig. 14 is a block diagram of one example of the pinching-point/reactive-site-information
acquiring unit 102c in the system to which the present invention is applied, and only
a portion related to the present invention in the configuration is conceptually shown
therein. In Fig. 14, the atom specifying unit 102p specifies a directly bonded atom
being an atom bonded to the atom forming a binding site corresponding to the binding
site information identified, and specifies an atom bonded to the directly bonded atom,
based on the bond information included in the compound structure information. The
ring-structure determining unit 102q determines whether the atom specified is an atom
forming the ring structure, based on the ring structure information included in the
compound structure information. The pinching-point/reactive-site specifying unit 102r
specifies the atom specified and another atom forming the ring structure together
with the atom as atoms forming the reactive site when it is determined that the atom
specified forms the ring structure, and specifies the directly bonded atom specified,
as a pinching point. The specified-pinching-point/reactive-site-information acquiring
unit 102s acquires the pinching point information for the pinching point specified
and the reactive site information for the reactive site specified, from the binding
site information and the compound structure information.
[0089] Referring back again to Fig. 2, the reactive-site-information identifying unit 102d
identifies reactive site information from the molecular species information and the
compound structure information, based on the reactive-site identifying condition for
identifying a reactive site. As shown in Fig. 15, the reactive-site-information identifying
unit 102d further includes a coordinate-based reactive-site-information determining
unit 102t, an angle-based reactive-site-information determining unit 102u, a distance-based
reactive-site-information determining unit 102v, and a determination-result-based
reactive-site-information identifying unit 102w.
[0090] Fig. 15 is a block diagram of one example of the reactive-site-information identifying
unit 102d in the system to which the present invention is applied, and only a portion
related to the present invention in the configuration is conceptually shown therein.
In Fig. 15, the coordinate-based reactive-site-information determining unit 102t determines
whether atomic coordinate information for an atom forming the reactive site corresponding
to the reactive site information satisfies the reactive-site coordinate range defined
by the reactive-site identifying condition, in the molecular species corresponding
to the molecular species information identified. The angle-based reactive-site-information
determining unit 102u calculates an angle value between each of the atoms forming
the reactive site corresponding to the reactive site information and the pinching
point corresponding to the pinching point information, and determines whether the
angle value calculated satisfies the reactive-site angle range defined by the reactive-site
identifying condition, in the molecular species corresponding to the molecular species
information identified. The distance-based reactive-site-information determining unit
102v calculates a distance value between the atom forming the reactive site corresponding
to the reactive site information and the pinching point corresponding to the pinching
point information, and determines whether the distance value calculated satisfies
the reactive-site distance range defined by the reactive-site identifying condition,
in the molecular species corresponding to the molecular species information identified.
The determination-result-based reactive-site-information identifying unit 102w identifies
the reactive site information determined as being satisfied in the coordinate-based
reactive-site-information determining unit 102t, the angle-based reactive-site-information
determining unit 102u, and the distance-based reactive-site-information determining
unit 102v, as reactive site information that satisfies the reactive-site identifying
condition.
[0091] Referring back again to Fig. 2, the pinching-point-information identifying unit 102e
identifies pinching point information from the molecular species information and the
compound structure information, based on the pinching-point identifying condition.
Here, as shown in Fig. 16, the pinching-point-information identifying unit 102e further
includes a coordinate-based pinching-point-information identifying unit 102x.
[0092] Fig. 16 is a block diagram of one example of the pinching-point-information identifying
unit 102e in the system to which the present invention is applied, and only a portion
related to the present invention in the configuration is conceptually shown therein.
In Fig. 16, the coordinate-based pinching-point-information identifying unit 102x
identifies the pinching point information when the atomic coordinate information for
the atom corresponding to the pinching point information satisfies the pinching-point
coordinate range defined by the pinching-point identifying condition, in the molecular
species corresponding to the molecular species information identified.
[0093] Referring back again to Fig. 2, if there exists an open space site which is a site
other than the reactive site identified and the binding site identified and is bound
to the pinching point identified, the open-space-site-information acquiring unit 102f
acquires open-space site information being information on an atom forming the open
space site, from the binding site information, the reactive site information, the
pinching point information, and the compound structure information.
[0094] The open-space-site-information identifying unit 102g identifies open-space-site
information from the molecular species information and the compound structure information,
based on the open-space-site identifying condition. Here, as shown in Fig. 17, the
open-space-site-information identifying unit 102g further includes a coordinate-based
open-space-site-information identifying unit 102y.
[0095] Fig. 17 is a block diagram of one example of the open-space-site-information identifying
unit 102g in the system to which the present invention is applied, and only a portion
related to the present invention in the configuration is conceptually shown therein.
In Fig. 17, the coordinate-based open-space-site-information identifying unit 102y
identifies open-space-site information when the atomic coordinate information for
an atom forming an open space site corresponding to the open-space-site information
satisfies an open-space-site coordinate range defined by the open-space-site identifying
condition, in the molecular species corresponding to the molecular species information
identified.
[0096] Referring back again to Fig. 2, when the reactive site information is identified
by the reactive-site-information identifying unit 102d explained later, the inhibition
determining unit 102h determines whether an atom forming the reactive site corresponding
to the reactive site information or a partial-structure atom group which is a group
of part of the atoms in a molecular structure at the reactive site inhibits a metabolic
reaction in the compound, based on the inhibition determining condition (e.g., a list
of predetermined atoms as inhibitors) for determining whether the metabolic reaction
is inhibited in the compound of which the reactive site information is identified.
[0097] The prediction-result output unit 102i outputs at least one of the binding site information,
the molecular species information, the reactive site information, the pinching point
information, the open-space site information, and the inhibition determination result
identified in a compound, as a result of prediction for the compound.
[0098] Details of the processes in these units are explained later.
[Process in system]
[0099] One of the process in the system according to the embodiment configured in the above
manner is explained in detail below with reference to Fig. 18 to Fig. 24.
[0100] Details of the main process are explained first with reference to Fig. 18. Fig. 18
is a flowchart of one example of the main process in the system according to the embodiment.
[0101] At first, in the drug-metabolizing enzyme prediction apparatus 100, the compound-structure-information
acquiring unit 102a acquires the compound structure information including at least
one of the atomic coordinate information being information on coordinates of each
of atoms forming a compound, the bond information being information on a bond between
the atoms, and the ring structure information being information on a ring structure
formed with a plurality of the atoms, and stores the compound structure information
in a predetermined storage area of the compound-structure information file 106a (step
SA-1).
[0102] At step SA-1, the compound structure information to be acquired may be, for example,
compound structure information stored in the external system 200 or the like, compound
structure information stored in the compound library, and virtual compound structure
information input by a user through the input device 112.
[0103] Then, in the drug-metabolizing enzyme prediction apparatus 100, the binding-site/molecular-species-information
identifying unit 102b identifies binding site information and molecular species information
from the compound structure information acquired at step SA-1, based on the binding-site
identifying condition stored in the binding-site identifying condition file 106b,
and stores them in a predetermined storage area of the binding-site/molecular-species
information file 106c (step SA-2).
[0104] At step SA-2, the binding-site/molecular-species-information identifying unit 102b
may cause the user to select a binding site based the predetermined binding-site selection
criterion, to acquire binding site information corresponding to the binding site selected
from the compound structure information, identify the binding site information acquired,
from the compound structure information, based on the binding-site identifying condition,
and identify a molecular species corresponding to the binding site information identified,
as molecular species information (binding-site/molecular-species-information identifying
process). It may also cause the user to select at least one of an original point,
and the x-axis, the γ-axis, and the z-axis with respect to the original point, from
an atom forming a binding site.
[0105] After the user selects the binding site, the drug-metabolizing enzyme prediction
apparatus 100 may generate a plurality of conformations for the compound structure
information, using, for example, automatic conformation generating software (see,
for example,
"http://www.rsi.co.jp/kagaku/cs/info/pdf.files/moe.pdf",
"http://homepage2.nifty.com/ccsnews2/2002/3q/2002_3Qrsimoe2 00203.htm", and
"http://homepage2.nifty.com/ccsnews2/2003/2q/2003_2Q/ccs03s rsi.htm"), and perform
processes at step SA-3 and thereafter as follows for each conformation generated.
[0106] The binding-site selection criterion may be specifically defined such that a site
is selected as a binding site, the site being a ring structure in a compound or a
pseudo-ring structure externally similar to the ring structure (or pseudo-ring structure
containing an acid group (e.g., carboxyl and sulfone)). More specifically, as shown
in Fig. 26, it may be defined to select a site as a binding site if there is the site
containing a bond "S=O" in its structure (e.g., Omeprazole or H259/31) specific to
CYP (cytochrome P450) 2C19, and to select a site, as a binding site, where one pseudo-ring
structure is formed with three ring structures if Losartan, in its structure specific
to CYP2C9 (e.g., Losartan).
[0107] The binding-site identifying condition may be such that each range of a value which
each atom forming a binding site can take is conceptually defined for each width (±x
direction), depth (±y direction), and thickness (±z direction). More specifically,
for example, the width (±x direction) may conceptually be a range decided based on
the condition of functional group as follows. The depth (±y direction) may conceptually
be a range decided for each molecular species of CYP, and may be a range where a benzene
ring is the maximum if, for example, CYP2C19. The thickness (±z direction) may conceptually
be a range where the thickness of a mesyl group is the maximum. As shown in Fig. 27,
the condition of functional group is such that R1 and R2 form a planar structure and
a carboxyl group is the maximum (e.g., if the bond is "=0" and "=S"), R3 and R4 form
a planar structure and a benzene ring is the maximum, and R5 becomes the same as that
of R1 and R2 (however, CH
3 is also allowed if CYP2C8).
[0108] Details of the binding-site/molecular-species-information identifying process are
explained below with reference to Fig. 19. Fig. 19 is a flowchart of one example of
the binding-site/molecular-species-information identifying process in the system according
to the embodiment.
[0109] At first, in the binding-site/molecular-species-information identifying unit 102b,
the binding-site selecting unit 102j causes the user to select a binding site based
on the predetermined binding-site selection criterion (step SB-1).
[0110] Then, in the binding-site/molecular-species-information identifying unit 102b, the
binding-site-information acquiring unit 102k acquires binding site information for
the binding site selected at step SB-1, and stores the binding site information in
a predetermined storage area of the binding-site/molecular-species information file
106c (step SB-2).
[0111] In the binding-site/molecular-species-information identifying unit 102b, the acquired-binding-site/molecular-species-information
identifying unit 102m identifies the binding site information acquired at step SB-2,
from the compound structure information, based on the binding-site identifying condition
stored in the binding-site identifying condition file 106b, and identifies molecular
species corresponding to the binding site information identified, as molecular species
information.
[0112] More specifically, in the acquired-binding-site/molecular-species-information identifying
unit 102m, the coordinate-based acquired-binding-site/molecular-species-information
identifying unit 102n determines whether the x coordinate value, in the atomic coordinate
information for the atom forming the binding site corresponding to the binding site
information acquired at step SB-2, satisfies the binding-site coordinate range (±x
direction) defined by the binding-site identifying condition stored in the binding-site
identifying condition file 106b (step SB-3).
[0113] Then, in the acquired-binding-site/molecular-species-information identifying unit
102m, if "it is satisfied" is determined at step SB-3 (step SB-4: Yes), the coordinate-based
acquired-binding-site/molecular-species-information identifying unit 102n acquires
molecular species information corresponding to the binding-site coordinate range (±x
direction) satisfied, and stores the molecular species information in a predetermined
storage area of the binding-site/molecular-species information file 106c (step SB-5).
[0114] In the acquired-binding-site/molecular-species-information identifying unit 102m,
the coordinate-based acquired-binding-site/molecular-species-information identifying
unit 102n determines whether the y coordinate value in the atomic coordinate information
satisfies the binding-site coordinate range (±y direction) defined by the binding-site
identifying condition stored in the binding-site identifying condition file 106b,
in the molecular species corresponding to the molecular species information acquired
at step SB-5 (step SB-6).
[0115] Then, in the acquired-binding-site/molecular-species-information identifying unit
102m, if "it is satisfied" is determined at step SB-6 (step SB-7: Yes), the coordinate-based
acquired-binding-site/molecular-species-information identifying unit 102n determines
whether the z coordinate value in the atomic coordinate information satisfies the
binding-site coordinate range (±z direction) defined by the binding-site identifying
condition stored in the binding-site identifying condition file 106b, in the molecular
species corresponding to the molecular species information acquired at step SB-5 (step
SB-8).
[0116] Then, in the acquired-binding-site/molecular-species-information identifying unit
102m, if "it is satisfied" is determined at step SB-8 (step SB-9: Yes), the coordinate-based
acquired-binding-site/molecular-species-information identifying unit 102n returns
again to the process at step SB-3 if there is any atom that is not determined (step
SB-10: No), and identifies the binding site information acquired at step SB-2 if there
are no atoms that are not determined (step SB-10: Yes), in other words, if all the
atoms are determined, and identifies the molecular species information acquired at
step SB-5.
[0117] When "it is not satisfied" is determined at step SB-3, step SB-6, and step SB-8 (step
SB-4: No, step SB-7: No, and step SB-9: No), the process is terminated even if there
still remains an atom which is not determined.
[0118] Now, the binding-site/molecular-species-information identifying process is completed.
[0119] Referring back to Fig. 18, in the drug-metabolizing enzyme prediction apparatus 100,
the pinching-point/reactive-site-information acquiring unit 102c specifies a pinching
point from the binding site information identified at step SA-2 and from the compound
structure information acquired at step SA-1, acquires pinching point information for
the pinching point specified and reactive site information for a reactive site bound
to the pinching point corresponding to the pinching point information, and stores
them in a predetermined storage area of the pinching-point information file 106d and
in a predetermined storage area of the reactive-site information file 106e, respectively
(step SA-3).
[0120] At step SA-3, the pinching-point/reactive-site-information acquiring unit 102c may
also be configured to specify a directly bonded atom being an atom bonded to the atom
forming the binding site corresponding to the binding site information identified
at step SA-2, and also specify an atom bonded to the directly bonded atom, based on
the bond information included in the compound structure information; determine whether
the atom specified is an atom forming the ring structure, based on the ring structure
information included in the compound structure information; specify, if it is determined
that the atom specified is an atom forming the ring structure, this atom specified
and another atom forming the ring structure together with this atom as atoms forming
the reactive site; to specify the directly bonded atom specified, as a pinching point;
and acquire the pinching point information for the pinching point specified and the
reactive site information for the reactive site specified, from the binding site information
and the compound structure information, respectively (pinching-point/reactive-site-information
acquiring process).
[0121] If it is determined that the atom specified is not an atom forming the ring structure,
it is determined whether the directly bonded atom is an atom forming the ring structure.
If it is determined that the directly bonded atom is an atom forming the ring structure,
an atom, which forms a binding site corresponding to the binding site information
identified at step SA-2 and is bonded to the directly bonded atom, is specified as
a pinching point, and the directly bonded atom and another atom forming the ring structure
together with the directly bonded atom may be specified as atoms forming a reactive
site. In other words, the atom forming the binding site may sometimes be a pinching
point.
[0122] The atom specified as the pinching point may be an atom bonded to the atom forming
the ring structure, selected from among atoms each located, for example, one bond
away from the atom forming the binding site.
[0123] It may also be considered that a compound is suppressed by CYP at the most strict
point of an allowable range of the atomic coordinate information (e.g., ±x direction)
between an atom forming the binding site and a pinching point bound to the atom.
[0124] The pinching-point/reactive-site-information acquiring unit 102c may acquire a plurality
pieces of pinching point information. In this case, the processes for identifying
pinching point information and identifying reactive site information explained later
are performed on the pinching point information and the corresponding reactive site
information, respectively.
[0125] Details of the pinching-point/reactive-site-information acquiring process are explained
below with reference to Fig. 20. Fig. 20 is a flowchart of one example of the pinching-point/reactive-site-information
acquiring process in the system according to the embodiment.
[0126] At first, in the pinching-point/reactive-site-information acquiring unit 102c, the
atom specifying unit 102p specifies a directly bonded atom and an atom bonded to the
directly bonded atom, based on the bond information included in the compound structure
information (step SC-1).
[0127] Then, the ring-structure determining unit 102q in the pinching-point/reactive-site-information
acquiring unit 102c determines whether the atom specified at step SC-1 is an atom
forming the ring structure based on the ring structure information included in the
compound structure information (step SC-2).
[0128] Then, if it is determined that the atom specified at step SC-2 is an atom forming
the ring structure, the pinching-point/reactive-site specifying unit 102r in the pinching-point/reactive-site-information
acquiring unit 102c specifies the atom specified and another atom forming the ring
structure together with the atom, as atoms forming a reactive site, and specifies
the directly bonded atom specified, as a pinching point (step SC-3).
[0129] If it is determined at step SC-3 that the atom specified is not an atom forming the
ring structure, the ring-structure determining unit 102q in the pinching-point/reactive-site-information
acquiring unit 102c determines whether the directly bonded atom is an atom forming
the ring structure. If it is determined that the directly bonded atom is an atom forming
the ring structure, the pinching-point/reactive-site specifying unit 102r specifies
the atom, which forms the binding site corresponding to the binding site information
identified and is bonded to the directly bonded atom, as a pinching point.
[0130] Then, the specified-pinching-point/reactive-site-information acquiring unit 102s
in the pinching-point/reactive-site-information acquiring unit 102c acquires pinching
point information for the pinching point specified at step SC-3 and reactive site
information for the reactive site specified from the binding site information and
the compound structure information, and stores these information in a predetermined
storage area of the pinching-point information file 106d and in a predetermined storage
area of the reactive-site information file 106e, respectively (step SC-4).
[0131] Now, the pinching-point/reactive-site-information acquiring process is completed.
[0132] Referring back to Fig. 18, the reactive-site-information identifying unit 102d in
the drug-metabolizing enzyme prediction apparatus 100 identifies the reactive site
information acquired at step SA-3 from the molecular species information and the compound
structure information, based on the reactive-site identifying condition stored in
the reactive-site identifying condition file 106f (step SA-4) .
[0133] The reactive-site identifying condition used at step SA-4 may be a condition such
that the reactive-site coordinate range, the reactive-site angle range, and the reactive-site
distance range are defined for each of the predetermine molecular species. Alternatively,
the reactive-site identifying condition may be processes of determining whether the
atomic coordinate information for an atom forming the reactive site corresponding
to the reactive site information acquired at step SA-3 satisfies the reactive-site
coordinate range in the molecular species corresponding to the molecular species information
identified at step SA-2; calculating an angle value between the atom forming the reactive
site corresponding to the reactive site information and the pinching point corresponding
to the pinching point information; determining whether the angle value calculated
satisfies the reactive-site angle range in the molecular species corresponding to
the molecular species information identified at step SA-2; calculating a distance
value between the atom forming the reactive site corresponding to the reactive site
information and the pinching point corresponding to the pinching point information;
determining whether the distance value calculated satisfies the reactive-site distance
range in the molecular species corresponding to the molecular species information
identified at step SA-2; and identifying the reactive site information determined
that it satisfies the reactive-site coordinate range, the reactive-site angle range,
and the reactive-site distance range, as reactive site information that satisfies
the reactive-site identifying condition (reactive-site-information identifying process).
[0134] The reactive-site coordinate range may be such that a range of a value, which each
atom forming the reactive site can take, is conceptually defined for each of width
(±x direction), depth (±y direction), and thickness (±z direction). More specifically,
for example, the width (±x direction) may conceptually be a range (extension in +x
direction from, for example, the binding site) similar to the range in the binding-site
identifying condition. The depth (±y direction) may conceptually be a range defined
for each molecular species of CYP, and may be a narrow range if, for example, CYP2C9,
and may be a wide range if CYP2C19. Furthermore, the thickness (±z direction) may
conceptually be a range such that the thickness of cyclohexane (chair form) is the
maximum.
[0135] As shown in Fig. 28, the reactive-site angle range may conceptually be a range of
an angle value from a coordinate axis between atoms when the atom as the pinching
point and the atom forming the reactive site are projected to a previously defined
two-dimensional coordinate system (two-dimensional coordinate system in which the
pinching point is set as the origin in Fig. 28).
[0136] The reactive-site distance range may conceptually be a range of a linear distance
between the atom as the pinching point and the atom forming the reactive site. More
specifically, as shown in Fig. 29, the reactive-site distance range may be a range
of a distance (e.g., distance on the z coordinate) between a pinching point (in Fig.
29, atom with number of "1") and an atom apart from the pinching point by five carbons
(in Fig. 29, atom with number of "5").
[0137] Details of the reactive-site-information identifying process are explained below
with reference to Fig. 21. Fig. 21 is a flowchart of one example of the reactive-site-information
identifying process in the system according to the embodiment.
[0138] At first, in the reactive-site-information identifying unit 102d, the coordinate-based
reactive-site-information determining unit 102t determines whether the atomic coordinate
information for an atom forming the reactive site corresponding to the reactive site
information acquired at step SA-3 satisfies the reactive-site coordinate range defined
by the reactive-site identifying condition stored in the reactive-site identifying
condition file 106f, in the molecular species corresponding to the molecular species
information identified at step SA-2 (step SD-1) (reactive-site coordinate range determining
process).
[0139] Details of the reactive-site coordinate range determining process performed at step
SD-1 are explained below with reference to Fig. 22. Fig. 22 is a flowchart of one
example of the reactive-site coordinate range determining process in the system according
to the embodiment.
[0140] At first, in the reactive-site-information identifying unit 102d, the coordinate-based
reactive-site-information determining unit 102t determines whether the x coordinate
value in the atomic coordinate information for the atom forming the reactive site
corresponding to the reactive site information acquired at step SA-3 satisfies the
reactive-site coordinate range (±x direction) defined by the reactive-site identifying
condition stored in the reactive-site identifying condition file 106f, in the molecular
species corresponding to the molecular species information identified at step SA-2
(step SE-1).
[0141] Then, if "it is satisfied" is determined at step SE-1 (step SE-2: Yes), the coordinate-based
reactive-site-information determining unit 102t in the reactive-site-information identifying
unit 102d determines whether the y coordinate value in the atomic coordinate information
satisfies the reactive-site coordinate range (±y direction) defined by the reactive-site
identifying condition stored in the reactive-site identifying condition file 106f,
in the molecular species corresponding to the molecular species information identified
at step SA-2 (step SE-3).
[0142] Then, if "it is satisfied" is determined at step SE-3 (step SE-4: Yes), the coordinate-based
reactive-site-information determining unit 102t in the reactive-site-information identifying
unit 102d determines whether the z coordinate value in the atomic coordinate information
satisfies the reactive-site coordinate range (±z direction) defined by the reactive-site
identifying condition stored in the reactive-site identifying condition file 106f,
in the molecular species corresponding to the molecular species information identified
at step SA-2 (step SE-5).
[0143] If "it is not satisfied" is determined at step SE-1 and step SE-3 (step SE-2: No
and step SE-4: No), the process is terminated.
[0144] Now, the reactive-site coordinate range determining process is completed.
[0145] Referring back again to Fig. 21, in the reactive-site-information identifying unit
102d, if "it is satisfied" is determined at step SD-1 (if "it is satisfied" is determined
at step SE-5) (step SD-2: Yes), the angle-based reactive-site-information determining
unit 102u calculates an angle value between the atom forming the reactive site corresponding
to the reactive site information and the pinching point corresponding to the pinching
point information, and determines whether the angle value calculated satisfies the
reactive-site angle range defined by the reactive-site identifying condition stored
in the reactive-site identifying condition file 106f, in the molecular species corresponding
to the molecular species information identified at step SA-2 (step SD-3).
[0146] Then, in the reactive-site-information identifying unit 102d, if "it is satisfied"
is determined at step SD-3 (step SD-4: Yes), the distance-based reactive-site-information
determining unit 102v calculates a distance value between the atom forming the reactive
site corresponding to the reactive site information and the pinching point corresponding
to the pinching point information, and determines whether the distance value calculated
satisfies the reactive-site distance range defined by the reactive-site identifying
condition stored in the reactive-site identifying condition file 106f, in the molecular
species corresponding to the molecular species information identified at step SA-2
(step SD-5).
[0147] In the reactive-site-information identifying unit 102d, the determination-result-based
reactive-site-information identifying unit 102w returns again to the process at step
SD-1 if there is any atom that is not determined yet (step SD-6: No), and identifies
the reactive site information acquired at step SA-3 if there are no atoms that are
not determined (step SB-6: Yes), in other words, if all the atoms are determined.
More specifically, when it is determined that the atomic coordinate information for
all the atoms forming the reactive site corresponding to the reactive site information
acquired at step SA-3 "is satisfied" at step SD-1, and when it is determined that
the atomic coordinate information for the atoms forming the reactive site corresponding
to the reactive site information acquired at step SA-3 "is satisfied" at step SD-3
and step SD-5, the reactive site information acquired at SA-3 is identified.
[0148] If "it is not satisfied" is determined at step SD-1 (step SD-2: No), the process
is terminated even if there still remains an atom which is not determined. If "it
is not satisfied" is determined at step SD-3 (step SD-4: No), the process proceeds
to step SD-6 as shown in Fig. 21.
[0149] Now, the reactive-site-information identifying process is completed.
[0150] Referring back again to Fig. 18, in the drug-metabolizing enzyme prediction apparatus
100, the pinching-point-information identifying unit 102e identifies the pinching
point information acquired at step SA-3 from the molecular species information and
the compound structure information, based on the pinching-point identifying condition
stored in the pinching-point identifying condition file 106g (step SA-5).
[0151] The pinching-point identifying condition used at step SA-5 may be a condition such
that the pinching-point coordinate range, which is a range of the atomic coordinate
information for an atom as the pinching point, is defined for each of the predetermine
molecular species. Alternatively, the pinching-point-information identifying unit
102e may identify the pinching point information when the atomic coordinate information
for the atom as the pinching point corresponding to the pinching point information
acquired at step SA-3 satisfies the pinching-point coordinate range, in the molecular
species corresponding to the molecular species information identified at step SA-2
(pinching-point-information identifying process).
[0152] Details of the pinching-point-information identifying process are explained below
with reference to Fig. 23. Fig. 23 is a flowchart of one example of the pinching-point-information
identifying process in the system according to the embodiment.
[0153] At first, in the pinching-point-information identifying unit 102e, the coordinate-based
pinching-point-information identifying unit 102x determines whether the x coordinate
value in the atomic coordinate information for the atom corresponding to the pinching
point information acquired at step SA-3 satisfies the pinching-point coordinate range
(±x direction) defined by the pinching-point identifying condition stored in the pinching-point
identifying condition file 106g, in the molecular species corresponding to the molecular
species information identified at step SA-2 (step SF-1).
[0154] Then, in the pinching-point-information identifying unit 102e, if "it is satisfied"
is determined at step SF-1 (step SF-2: Yes), the coordinate-based pinching-point-information
identifying unit 102x determines whether the y coordinate value in the atomic coordinate
information for the atom corresponding to the pinching point information satisfies
the pinching-point coordinate range (±y direction) defined by the pinching-point identifying
condition stored in the pinching-point identifying condition file 106g, in the molecular
species corresponding to the molecular species information identified at step SA-2
(step SF-3).
[0155] Then, in the pinching-point-information identifying unit 102e, if "it is satisfied"
is determined at step SF-3 (step SF-4: Yes), the coordinate-based pinching-point-information
identifying unit 102x determines whether the z coordinate value in the atomic coordinate
information for the atom corresponding to the pinching point information satisfies
the pinching-point coordinate range (±z direction) defined by the pinching-point identifying
condition stored in the pinching-point identifying condition file 106g, in the molecular
species corresponding to the molecular species information identified at step SA-2
(step SF-5) .
[0156] Then, in the pinching-point-information identifying unit 102e, if "it is satisfied"
is determined at step SF-5, the coordinate-based pinching-point-information identifying
unit 102x identifies the pinching point information acquired at step SA-3.
[0157] The pinching point identified may serve an atom which forms the binding site as a
result.
[0158] Now, the pinching-point-information identifying process is completed.
[0159] In the drug-metabolizing enzyme prediction apparatus 100, if there exists an open
space site being a site which is other than the reactive site corresponding to the
reactive site information identified at step SA-4 and the binding site corresponding
to the binding site information identified at step SA-2 and is bound to the pinching
point corresponding to the pinching point information identified at step SA-5, the
open-space-site-information acquiring unit 102f may acquire open-space site information
being information on an atom forming the open space site from the binding site information,
the reactive site information, the pinching point information, and the compound structure
information, and store the open-space site information in a predetermined storage
area of the open-space-site information file 106h; and the open-space-site-information
identifying unit 102g may also identify the open-space site information acquired,
from the molecular species information and the compound structure information, based
on the open-space-site identifying condition stored in the open-space-site identifying
condition file 106i for identifying the open space site. The open-space-site identifying
condition may be a condition such that the open-space-site coordinate range, which
is a range of the atomic coordinate information for an atom forming an open space
site is defined for each of the predetermine molecular species. Alternatively, in
the open-space-site-information identifying unit 102g, the coordinate-based open-space-site-information
identifying unit 102y may identify open-space site information when the atomic coordinate
information for the atom forming the open space site corresponding to the open-space
site information acquired satisfies the open-space-site coordinate range defined by
the open-space-site identifying condition stored in the open-space-site identifying
condition file 106i, in the molecular species corresponding to the molecular species
information identified (open-space-site-information identifying process) .
[0160] Furthermore, as shown in Fig. 30, the open-space-site coordinate range may conceptually
be a range such that the range of a value which each atom forming the open space site
can take is defined by each of the width (±x direction), the depth (±y direction),
and the thickness (±z direction).
[0161] Details of the open-space-site-information identifying process are explained below
with reference to Fig. 24. Fig. 24 is a flowchart of one example of the open-space-site-information
identifying process in the system according to the embodiment.
[0162] At first, in the open-space-site-information identifying unit 102g, the coordinate-based
open-space-site-information identifying unit 102y determines whether the x coordinate
value in the atomic coordinate information for the atom forming the open space site
corresponding to the open-space site information acquired by the open-space-site-information
acquiring unit 102f satisfies the open-space-site coordinate range (±x direction)
defined by the open-space-site identifying condition stored in the open-space-site
identifying condition file 106i, in the molecular species corresponding to the molecular
species information identified at step SA-2 (step SG-1).
[0163] Then, in the open-space-site-information identifying unit 102g, if "it is satisfied"
is determined at step SG-1 (step SG-2: Yes), the coordinate-based open-space-site-information
identifying unit 102y determines whether the y coordinate value in the atomic coordinate
information satisfies the open-space-site coordinate range (±y direction) defined
by the open-space-site identifying condition stored in the open-space-site identifying
condition file 106i, in the molecular species corresponding to the molecular species
information identified at step SA-2 (step SG-3).
[0164] Then, in the open-space-site-information identifying unit 102g, if "it is satisfied"
is determined at step SG-3 (step SG-4: Yes), the coordinate-based open-space-site-information
identifying unit 102y determines whether the z coordinate value in the atomic coordinate
information satisfies the open-space-site coordinate range (±z direction) defined
by the open-space-site identifying condition stored in the open-space-site identifying
condition file 106i, in the molecular species corresponding to the molecular species
information identified at step SA-2 (step SG-5).
[0165] Then, in the open-space-site-information identifying unit 102g, if "it is satisfied"
is determined at step SG-5 (step SG-6: Yes), the coordinate-based open-space-site-information
identifying unit 102y returns again to the process at step SG-1 if there is any atom
that is not determined yet (step SG-7: No), and identifies the open-space site information
acquired in the open-space-site-information acquiring unit 102f if there are no atoms
that are not determined (step SB-7: Yes), in other words, if all the atoms are determined.
When "it is not satisfied" is determined at step SG-1, step SG-3, and step SG-5 (step
SG-2: No, step SG-4: No, and step SG-6: No), the process is terminated even if there
still remains an atom which is not determined.
[0166] Now, the open-space-site-information identifying process is completed.
[0167] In the drug-metabolizing enzyme prediction apparatus 100, if the reactive site information
is identified at step SA-4, the inhibition determining unit 102h may determine whether
an atom forming the reactive site corresponding to the reactive site information or
a partial-structure atom group which is a group of part of atoms in the molecular
structure at the reactive site inhibits a metabolic reaction in a compound, based
on the inhibition determining condition (e.g., a list of predetermined atoms as inhibitors)
for determining whether the metabolic reaction is inhibited in the compound of which
the reactive site information is identified. More specifically, in the drug-metabolizing
enzyme prediction apparatus 100, if the molecular species of the drug-metabolizing
enzyme bound to the compound and the reactive site where a metabolic reaction occurs
in the compound bound to the drug-metabolizing enzyme, are identified by the process,
the inhibition determining unit 102h may determine whether any atom (e.g., amide (N))
inhibiting the metabolic reaction is contained in atoms forming the reactive site
of the compound.
[0168] In the drug-metabolizing enzyme prediction apparatus 100, the prediction-result output
unit 102i may store at least one of the binding site information, the molecular species
information, the reactive site information, the pinching point information, and the
open-space site information, for the compound, and the inhibition determination result,
in a predetermined storage area of the prediction result file 106j, as a prediction
result for the compound, and output the prediction result through the output device
114.
[0169] Now, the main process is completed.
[0170] As explained above, according to the embodiment, in the drug-metabolizing enzyme
prediction apparatus 100, the compound-structure-information acquiring unit 102a acquires
the compound structure information including at least one of the atomic coordinate
information being information on coordinates of each of atoms forming a compound,
the bond information being information on a bond between the atoms, and the ring structure
information being information on a ring structure formed with a plurality of the atoms.
The binding-site/molecular-species-information identifying unit 102b identifies the
binding site information being information on the atom forming the binding site in
the compound corresponding to the compound structure information and the molecular
species information being information on the molecular species, from the compound
structure information acquired, based on the binding-site identifying condition for
identifying the binding site being a site where the compound and the drug-metabolizing
enzyme are bound to each other and identifying the molecular species of the drug-metabolizing
enzyme to be bound. The pinching-point/reactive-site-information acquiring unit 102c
specifies the pinching point which is an atom binding at least the reactive site where
a metabolic reaction occurs in the compound and the drug-metabolizing enzyme to the
binding site, and acquires the pinching point information being the information on
the atom as the pinching point specified and the reactive site information being information
on the atom forming the reactive site bound to the pinching point corresponding to
the pinching point information, from the binding site information and the compound
structure information identified. The reactive-site-information identifying unit 102d
identifies the reactive site information acquired, from the molecular species information
and the compound structure information, based on the reactive-site identifying condition
for identifying the reactive site. The pinching-point-information identifying unit
102e identifies the pinching point information acquired, from the molecular species
information and the compound structure information, based on the pinching-point identifying
condition for identifying the pinching point. Therefore, it is possible to accurately
predict a molecular species of a drug-metabolizing enzyme and a reactive site of a
compound with the drug-metabolizing enzyme, for all types of compounds without using
training data for prediction and in consideration of each cubic form of the compounds.
[0171] That is, according to the embodiment, the drug-metabolizing enzyme prediction apparatus
100 allows accurate prediction of a molecular species of a drug-metabolizing enzyme
and a reactive site of a compound with the drug-metabolizing enzyme, for all types
of compounds without using training data for prediction and in consideration of each
cubic form of the compounds. Therefore, a compound group can accurately be narrowed
down, for example, in the early stage of drug development.
[0172] According to the embodiment, in the drug-metabolizing enzyme prediction apparatus
100, if the reactive site information is identified at step SA-4, the inhibition determining
unit 102h may determine whether an atom forming a reactive site corresponding to the
reactive site information or a partial-structure atom group which is a group of part
of atoms in the molecular structure at the reactive site inhibits a metabolic reaction,
based on the inhibition determining condition (e.g., a list of predetermined atoms
as inhibitors) for determining whether the metabolic reaction is inhibited in the
compound of which reactive site information is identified. Therefore, it is possible
to determine (check) whether any atom inhibiting the metabolic reaction is contained
in the atoms forming the reactive site predicted, and this allows the compound group
to be more accurately narrowed.
[Example]
[0173] An example to which the embodiment is applied is explained below with reference to
Fig. 31. Fig. 31 is a diagram for explaining one example of the main process in the
drug-metabolizing enzyme prediction apparatus 100 according to an example of the present
invention.
[0174] At first, in the drug-metabolizing enzyme prediction apparatus 100, the compound-structure-information
acquiring unit 102a according to the embodiment acquires a compound library (which
corresponds to the compound structure information according to the embodiment) stored
in the external system 200 (step SH-1: compound-library acquiring process).
[0175] Then, in the drug-metabolizing enzyme prediction apparatus 100, if the compound library
cannot be acquired at step SH-1 (step SH-2: No), the compound-structure-information
acquiring unit 102a according to the embodiment acquires a virtual compound library
input by the user (step SH-3: virtual-compound-library acquiring process).
[0176] Then, in the drug-metabolizing enzyme prediction apparatus 100, the binding-site/molecular-species-information
identifying unit 102b according to the embodiment identifies the binding site information
from the compound library acquired at step SH-1 or the virtual compound library acquired
at step SH-3 (step SH-4: binding-site identifying process).
[0177] Then, in the drug-metabolizing enzyme prediction apparatus 100, the pinching-point-information
identifying unit 102e according to the embodiment identifies the pinching point information
from the compound library acquired at step SH-1 or the virtual compound library acquired
at step SH-3 (step SH-5: pinching-point identifying process).
[0178] Then, in the drug-metabolizing enzyme prediction apparatus 100, the open-space-site-information
identifying unit 102g according to the embodiment identifies the open-space site information
from the compound library acquired at step SH-1 or the virtual compound library acquired
at step SH-3 (step SH-6: open-space-site identifying process).
[0179] Then, in the drug-metabolizing enzyme prediction apparatus 100, the reactive-site-information
identifying unit 102d according to the embodiment identifies the reactive site information
from the compound library acquired at step SH-1 or the virtual compound library acquired
at step SH-3 (step SH-7: reactive site identifying process).
[0180] Then, in the drug-metabolizing enzyme prediction apparatus 100, the control unit
102 according to the embodiment in the drug-metabolizing enzyme prediction apparatus
100 confirms the molecular species of a drug-metabolizing enzyme (CYP) based on the
identification results at step SH-4 to step SH-7, and the inhibition determining unit
102h according to the embodiment further determines whether any atom inhibiting the
metabolic reaction is contained in the atoms forming the reactive site of the compound
in which the binding site, the pinching point, the open space site, and the reactive
site are identified at step SH-4 to step SH-7 (step SH-8: CYP confirming process).
[0181] Then, in the drug-metabolizing enzyme prediction apparatus 100, the prediction-result
output unit 102i according to the embodiment stores the binding site, the pinching
point, the open space site, and the reactive site identified at step SH-4 to step
SH-7, and the metabolic prediction result including the molecular species and the
metabolic inhibition result confirmed at step SH-8, in the prediction result file
106j according to the embodiment (metabolic-prediction-result database), and outputs
(displays) them to (on) the output device 114 (e.g., monitor) according to the embodiment
(step SH-9).
[0182] According to the example, by respectively evaluating the binding site, the pinching
point, the reactive site, and the open space site, it is possible to accurately predict
a molecular species and a reactive site of a drug-metabolizing enzyme for all types
of compounds, and also efficiently verify whether a metabolic reaction actually occurs
at the reactive site predicted. This allows a useful compound group to be accurately
narrowed, for example, in the early stage of drug development.
INDUSTRIAL APPLICABILITY
[0183] The drug-metabolizing enzyme prediction apparatus, the drug-metabolizing enzyme prediction
method, the drug-metabolizing enzyme prediction program, and the recording medium
according to the present invention are useful for prediction of a molecular species
and a reactive site of a drug-metabolizing enzyme, and particularly suitable for narrowing
of a compound group in the field of drug discovery (e.g., a metabolic section and
a synthesis section of drug discovery) in which drug design and the like are conducted.
1. A drug-metabolizing enzyme prediction apparatus comprising:
a compound-structure-information acquiring unit that acquires compound structure information
including at least one of atomic coordinate information that is information on coordinates
of each of atoms forming a compound, bond information that is information on a bond
between the atoms, and ring structure information that is information on a ring structure
formed with a plurality of the atoms;
a binding-site/molecular-species-information identifying unit that identifies binding
site information that is information on an atom forming a binding site in the compound
corresponding to the compound structure information and molecular species information
that is information on a molecular species, from the compound structure information
acquired by the compound-structure-information acquiring unit, based on a binding-site
identifying condition for identifying the binding site that is a site where the compound
and a drug-metabolizing enzyme are bound to each other and the molecular species of
the drug-metabolizing enzyme to be bound;
a pinching-point/reactive-site-information acquiring unit that specifies a pinching
point that is the atom binding at least between the binding site and a reactive site
that is a site where a metabolic reaction occurs in the compound and the drug-metabolizing
enzyme, from the binding site information identified by the binding-site/molecular-species-information
identifying unit and the compound structure information, and that acquires pinching
point information that is information on the atom of the specified pinching point
and the reactive site information that is information on the atom forming the reactive
site bound to the pinching point corresponding to the pinching point information;
a reactive-site-information identifying unit that identifies the reactive site information
acquired by the pinching-point/reactive-site-information acquiring unit, from the
molecular species information and the compound structure information, based on a reactive-site
identifying condition for identifying the reactive site; and
a pinching-point-information identifying unit that identifies the pinching point acquired
by the pinching-point/reactive-site-information acquiring unit, from the molecular
species information and the compound structure information, based on a pinching-point
identifying condition for identifying the pinching point.
2. The drug-metabolizing enzyme prediction apparatus according to claim 1, wherein
the binding-site/molecular-species-information identifying unit includes
a binding-site selecting unit that selects the binding site based on a predetermined
binding-site selection criterion;
a binding-site-information acquiring unit that acquires the binding site information
for the binding site selected by the binding-site selecting unit, from the compound
structure information; and
an acquired-binding-site/molecular-species-information identifying unit that identifies
the binding site information acquired by the binding-site-information acquiring unit,
from the compound structure information, based on the binding-site identifying condition,
and identifies the molecular species corresponding to the identified binding site
information as the molecular species information.
3. The drug-metabolizing enzyme prediction apparatus according to claim 2, wherein
the binding-site identifying condition is a condition in which a binding-site coordinate
range that is a range of the atomic coordinate information for the atom forming the
binding site is defined for each predetermined molecular species, and
the acquired-binding-site/molecular-species-information identifying unit includes
a coordinate-based acquired-binding-site/molecular-species-information identifying
unit that identifies the binding site information when the atomic coordinate information
for the atom that forms the binding site corresponding to the binding site information
acquired by the binding-site-information acquiring unit satisfies the binding-site
coordinate range defined by the binding-site identifying condition, and identifies
the molecular species corresponding to the satisfied binding-site coordinate range
as the molecular species information.
4. The drug-metabolizing enzyme prediction apparatus according to any one of claims 1
to 3, wherein
the pinching-point/reactive-site-information acquiring unit includes
an atom specifying unit that specifies a directly bonded atom that is the atom bonded
to the atom forming the binding site corresponding to the binding site information
identified by the binding-site/molecular-species-information identifying unit and
also specifies the atom bonded to the directly bonded atom, based on the bond information
included in the compound structure information;
a ring-structure determining unit that determines whether the atom specified by the
atom specifying unit is the atom forming the ring structure, based on the ring structure
information included in the compound structure information;
a pinching-point/reactive-site specifying unit that specifies the atom specified and
another atom forming the ring structure together with the atom, as the atoms forming
the reactive site, when it is determined by the ring-structure determining unit that
the atom specified by the atom specifying unit is the atom forming the ring structure,
and that specifies the directly bonded atom specified by the atom specifying unit,
as the pinching point; and
a specified-pinching-point/reactive-site-information acquiring unit that acquires
the pinching point information for the pinching point specified by the pinching-point/reactive-site
specifying unit and the reactive site information for the reactive site specified,
from the binding site information and the compound structure information.
5. The drug-metabolizing enzyme prediction apparatus according to any one of claims 1
to 4, wherein
the reactive-site identifying condition is such that a reactive-site coordinate range
that is a range of the atomic coordinate information for the atoms forming the reactive
site, a reactive-site angle range that is a range of an angle value between each of
the atoms forming the reactive site and the pinching point, and a reactive-site distance
range that is a range of a distance value between each of the atoms forming the reactive
site and the pinching point are defined for each of the molecular species previously
specified, and
the reactive-site-information identifying unit includes
a coordinate-based reactive-site-information determining unit that determines whether
the atomic coordinate information for the atoms, which form the reactive site corresponding
to the reactive site information acquired by the pinching-point/reactive-site-information
acquiring unit, satisfies the reactive-site coordinate range defined by the reactive-site
identifying condition, in the molecular species corresponding to the molecular species
information identified;
an angle-based reactive-site-information determining unit that calculates the angle
value between the atom forming the reactive site corresponding to the reactive site
information and the pinching point corresponding to the pinching point information,
and determines whether the angle value calculated satisfies the reactive-site angle
range defined by the reactive-site identifying condition, in the molecular species
corresponding to the molecular species information identified;
a distance-based reactive-site-information determining unit that calculates the distance
value between the atom forming the reactive site corresponding to the reactive site
information and the pinching point corresponding to the pinching point information,
and determines whether the distance value calculated satisfies the reactive-site distance
range defined by the reactive-site identifying condition, in the molecular species
corresponding to the molecular species information identified; and
a determination-result-based reactive-site-information identifying unit that identifies
the reactive site information determined, as being satisfied, by the coordinate-based
reactive-site-information determining unit, the angle-based reactive-site-information
determining unit, and the distance-based reactive-site-information determining unit,
as the reactive site information that satisfies the reactive-site identifying condition.
6. The drug-metabolizing enzyme prediction apparatus according to any one of claims 1
to 5, wherein
the pinching-point identifying condition is such that a pinching-point coordinate
range that is a range of the atomic coordinate information for the atom as the pinching
point is defined for each of the molecular species _ previously specified, and
the pinching-point-information identifying unit includes a coordinate-based pinching-point-information
identifying unit that identifies the pinching point information when the atomic coordinate
information for the atom, which is the pinching point corresponding to the pinching
point information acquired by the pinching-point/reactive-site-information acquiring
unit, satisfies the pinching-point coordinate range defined by the pinching-point
identifying condition, in the molecular species corresponding to the molecular species
information identified.
7. The drug-metabolizing enzyme prediction apparatus according to any one of claims 1
to 6, further comprising:
an open-space-site-information acquiring unit that acquires open-space site information
that is information on the atom forming an open space site, from the binding site
information, the reactive site information, the pinching point information, and the
compound structure information, if there exists the open space site that is a site
other than the reactive site corresponding to the reactive site information identified
by the reactive-site-information identifying unit and the binding site corresponding
to the binding site information identified by the binding-site/molecular-species-information
identifying unit and which is bound to the pinching point corresponding to the pinching
point information identified by the pinching-point-information identifying unit; and
an open-space-site-information identifying unit that identifies the open-space site
information acquired by the open-space-site-information acquiring unit, from the molecular
species information and the compound structure information, based on an open-space-site
identifying condition for identifying the open space site.
8. The drug-metabolizing enzyme prediction apparatus according to claim 7, wherein
the open-space-site identifying condition is such that an open-space-site coordinate
range, which is a range of the atomic coordinate information for the atom forming
the open space site, is defined for each of the molecular species previously specified,
and
the open-space-site-information identifying unit includes a coordinate-based open-space-site-information
identifying unit that identifies the open-space site information when the atomic coordinate
information for the atom, which forms the open space site corresponding to the open-space
site information acquired by the open-space-site-information acquiring unit, satisfies
the open-space-site coordinate range defined by the open-space-site identifying condition,
in the molecular species corresponding to the molecular species information identified.
9. The drug-metabolizing enzyme prediction apparatus according to any one of claims 1
to 8, further comprising:
an inhibition determining unit that determines whether the atom forming the reactive
site corresponding to the reactive site information or a partial-structure atom group
which is a group of part of the atoms in a molecular structure at the reactive site
inhibits the metabolic reaction, when the reactive site information is identified
by the reactive-site-information identifying unit, based on an inhibition determining
condition for determining whether the metabolic reaction is inhibited in the compound
of which the reactive site information is identified.
10. A drug-metabolizing enzyme prediction method comprising:
a compound-structure-information acquiring step of acquiring compound structure information
including at least one of atomic coordinate information that is information on coordinates
of each of atoms forming a compound, bond information that is information on a bond
between the atoms, and ring structure information that is information on a ring structure
formed with a plurality of the atoms;
a binding-site/molecular-species-information identifying step of identifying binding
site information that is information on an atom forming a binding site in the compound
corresponding to the compound structure information and molecular species information
that is information on a molecular species, from the compound structure information
acquired at the compound-structure-information acquiring step, based on a binding-site
identifying condition for identifying the binding site that is a site where the compound
and a drug-metabolizing enzyme are bound to each other and the molecular species of
the drug-metabolizing enzyme to be bound;
a pinching-point/reactive-site-information acquiring step of specifying a pinching
point that is the atom binding at least between the binding site and a reactive site
that is a site where a metabolic reaction occurs in the compound and the drug-metabolizing
enzyme, from the binding site information identified at the binding-site/molecular-species-information
identifying step and the compound structure information, and acquiring pinching point
information that is information on the atom of the specified pinching point and the
reactive site information that is information on the atom forming the reactive site
bound to the pinching point corresponding to the pinching point information;
a reactive-site-information identifying step of identifying the reactive site information
acquired at the pinching-point/reactive-site-information acquiring step, from the
molecular species information and the compound structure information, based on a reactive-site
identifying condition for identifying the reactive site; and
a pinching-point-information identifying step of identifying the pinching point acquired
at the pinching-point/reactive-site-information acquiring step, from the molecular
species information and the compound structure information, based on a pinching-point
identifying condition for identifying the pinching point.
11. The drug-metabolizing enzyme prediction method according to claim 10, wherein
the binding-site/molecular-species-information identifying step includes
a binding-site selecting step of selecting the binding site based on a predetermined
binding-site selection criterion;
a binding-site-information acquiring step of acquiring the binding site information
for the binding site selected at the binding-site selecting step, from the compound
structure information; and
an acquired-binding-site/molecular-species-information identifying step of identifying
the binding site information acquired at the binding-site-information acquiring step,
from the compound structure information, based on the binding-site identifying condition,
and identifies the molecular species corresponding to the identified binding site
information as the molecular species information.
12. The drug-metabolizing enzyme prediction method according to claim 11, wherein
the binding-site identifying condition is a condition in which a binding-site coordinate
range that is a range of the atomic coordinate information for the atom forming the
binding site is defined for each predetermined molecular species, and
the acquired-binding-site/molecular-species-information identifying step includes
a coordinate-based acquired-binding-site/molecular-species-information identifying
step of identifying the binding site information when the atomic coordinate information
for the atom that forms the binding site corresponding to the binding site information
acquired at the binding-site-information acquiring step satisfies the binding-site
coordinate range defined by the binding-site identifying condition, and identifying
the molecular species corresponding to the satisfied binding-site coordinate range
as the molecular species information.
13. The drug-metabolizing enzyme prediction method according to any one of claims 10 to
12, wherein
the pinching-point/reactive-site-information acquiring step includes
an atom specifying step of specifying a directly bonded atom that is the atom bonded
to the atom forming the binding site corresponding to the binding site information
identified at the binding-site/molecular-species-information identifying step and
also specifying the atom bonded to the directly bonded atom, based on the bond information
included in the compound structure information;
a ring-structure determining step of determining whether the atom specified at the
atom specifying step is the atom forming the ring structure, based on the ring structure
information included in the compound structure information;
a pinching-point/reactive-site specifying step of specifying the atom specified and
another atom forming the ring structure together with the atom, as the atoms forming
the reactive site, when it is determined at the ring-structure determining step that
the atom specified at the atom specifying step is the atom forming the ring structure,
and specifying the directly bonded atom specified at the atom specifying step, as
the pinching point; and
a specified pinching-point/reactive-site-information acquiring step of acquiring the
pinching point information for the pinching point specified at the pinching-point/reactive-site
specifying step and the reactive site information for the reactive site specified,
from the binding site information and the compound structure information.
14. The drug-metabolizing enzyme prediction method according to any one of claims 10 to
13, wherein
the reactive-site identifying condition is such that a reactive-site coordinate range
that is a range of the atomic coordinate information for the atoms forming the reactive
site, a reactive-site angle range that is a range of an angle value between each of
the atoms forming the reactive site and the pinching point, and a reactive-site distance
range that is a range of a distance value between each of the atoms forming the reactive
site and the pinching point are defined for each of the molecular species previously
specified, and
the reactive-site-information identifying step includes
a coordinate-based reactive-site-information determining step of determining whether
the atomic coordinate information for the atoms, which form the reactive site corresponding
to the reactive site information acquired at the pinching-point/reactive-site-information
acquiring step, satisfies the reactive-site coordinate range defined by the reactive-site
identifying condition, in the molecular species corresponding to the molecular species
information identified;
an angle-based reactive-site-information determining step of calculating the angle
value between the atom forming the reactive site corresponding to the reactive site
information and the pinching point corresponding to the pinching point information,
and determining whether the angle value calculated satisfies the reactive-site angle
range defined by the reactive-site identifying condition, in the molecular species
corresponding to the molecular species information
identified;
a distance-based reactive-site-information determining step of calculating the distance
value between the atom forming the reactive site corresponding to the reactive site
information and the pinching point corresponding to the pinching point information,
and determining whether the distance value calculated satisfies the reactive-site
distance range defined by the reactive-site identifying condition, in the molecular
species corresponding to the molecular species information identified; and
a determination-result-based reactive-site-information identifying step of identifying
the reactive site information determined, as being satisfied, at the coordinate-based
reactive-site-information determining step, the angle-based reactive-site-information
determining step, and the distance-based reactive-site-information determining step,
as the reactive site information that satisfies the reactive-site identifying condition.
15. The drug-metabolizing enzyme prediction method according to any one of claims 10 to
14, wherein
the pinching-point identifying condition is such that a pinching-point coordinate
range that is a range of the atomic coordinate information for the atom as the pinching
point is defined for each of the molecular species previously specified, and
the pinching-point-information identifying step includes a coordinate-based pinching-point-information
identifying step of identifying the pinching point information when the atomic coordinate
information for the atom, which is the pinching point corresponding to the pinching
point information acquired at the pinching-point/reactive-site-information acquiring
step, satisfies the pinching-point coordinate range defined by the pinching-point
identifying condition, in the molecular species corresponding to the molecular species
information identified.
16. The drug-metabolizing enzyme prediction method according to any one of claims 10 to
15, further comprising:
an' open-space-site-information acquiring step of acquiring open-space site information
that is information on the atom forming an open space site, from the binding site
information, the reactive site information, the pinching point information, and the
compound structure information, if there exists the open space site that is a site
other than the reactive site corresponding to the reactive site information identified
at the reactive-site-information identifying step and the binding site corresponding
to the binding site information identified at the binding-site/molecular-species-information
identifying step and which is bound to the pinching point corresponding to the pinching
point information identified at the pinching-point-information identifying step; and
an open-space-site-information identifying step of identifying the open-space site
information acquired at the open-space-site-information acquiring step, from the molecular
species information and the compound structure information, based on an open-space-site
identifying condition for identifying the open space site.
17. The drug-metabolizing enzyme prediction method according to claim 16, wherein
the open-space-site identifying condition is such that an open-space-site coordinate
range, which is a range of the atomic coordinate information for the atom forming
the open space site, is defined for each of the molecular species previously specified,
and
the open-space-site-information identifying step includes a coordinate-based open-space-site-information
identifying step of identifying the open-space site information when the atomic coordinate
information for the atom, which forms the open space site corresponding to the open-space
site information acquired at the open-space-site-information acquiring step, satisfies
the open-space-site coordinate range defined by the open-space-site identifying condition,
in the molecular species corresponding to the molecular species information identified.
18. The drug-metabolizing enzyme prediction method according to any one of claims 10 to
17, further comprising:
an inhibition determining step of determining whether the atom forming the reactive
site corresponding to the reactive site information or a partial-structure atom group
which is a group of part of the atoms in a molecular structure at the reactive site
inhibits the metabolic reaction, when the reactive site information is identified
at the reactive-site-information identifying step, based on an inhibition determining
condition for determining whether the metabolic reaction is inhibited in the compound
of which the reactive site information is identified.
19. A drug-metabolizing enzyme prediction program causing a computer to execute:
a compound-structure-information acquiring procedure of acquiring compound structure
information including at least one of atomic coordinate information that is information
on coordinates of each of atoms forming a compound, bond information that is information
on a bond between the atoms, and ring structure information that is information on
a ring structure formed with a plurality of the atoms;
a binding-site/molecular-species-information identifying procedure of identifying
binding site information that is information on an atom forming a binding site in
the compound corresponding to the compound structure information and molecular species
information that is information on a molecular species., from the compound structure
information acquired at the compound-structure-information acquiring procedure, based
on a binding-site identifying condition for identifying the binding site that is a
site where the compound and a drug-metabolizing enzyme are bound to each other and
the molecular species of the drug-metabolizing enzyme to be bound;
a pinching-point/reactive-site-information acquiring procedure of specifying a pinching
point that is the atom binding at least between the binding site and a reactive site
that is a site where a metabolic reaction occurs in the compound and the drug-metabolizing
enzyme, from the binding site information identified at the binding-site/molecular-species-information
identifying procedure and the compound structure information, and acquiring pinching
point information that is information on the atom of the specified pinching point
and the reactive site information that is information on the atom forming the reactive
site bound to the pinching point corresponding to the pinching point information;
a reactive-site-information identifying procedure of identifying the reactive site
information acquired at the pinching-point/reactive-site-information acquiring procedure,
from the molecular species information and the compound structure information, based
on a reactive-site identifying condition for identifying the reactive site; and
a pinching-point-information identifying procedure of identifying the pinching point
acquired at the pinching-point/reactive-site-information acquiring procedure, from
the molecular species information and the compound structure information, based on
a pinching-point identifying condition for identifying the pinching point.
20. The drug-metabolizing enzyme prediction program according to claim 19, wherein
the binding-site/molecular-species-information identifying procedure includes
a binding-site selecting procedure of selecting the binding site based on a predetermined
binding-site selection criterion;
a binding-site-information acquiring procedure of acquiring the binding site information
for the binding site selected at the binding-site selecting procedure, from the compound
structure information; and
an acquired-binding-site/molecular-species-information identifying procedure of identifying
the binding site information acquired at the binding-site-information acquiring procedure,
from the compound structure information, based on the binding-site identifying condition,
and identifies the molecular species corresponding to the identified binding site
information as the molecular species information.
21. The drug-metabolizing enzyme prediction program according to claim 20, wherein
the binding-site identifying condition is a condition in which a binding-site coordinate
range that is a range of the atomic coordinate information for the atom forming the
binding site is defined for each predetermined molecular species, and
the acquired-binding-site/molecular-species-information identifying procedure includes
a coordinate-based acquired-binding-site/molecular-species-information identifying
procedure of identifying the binding site information when the atomic coordinate information
for the atom that forms the binding site corresponding to the binding site information
acquired at the binding-site-information acquiring procedure satisfies the binding-site
coordinate range defined by the binding-site identifying condition, and identifying
the molecular species corresponding to the satisfied binding-site coordinate range
as the molecular species information.
22. The drug-metabolizing enzyme prediction program according to any one of claims 19
to 21, wherein
the pinching-point/reactive-site-information acquiring procedure includes
an atom specifying procedure of specifying a directly bonded atom that is the atom
bonded to the atom forming the binding site corresponding to the binding site information
identified at the binding-site/molecular-species-information identifying procedure
and also specifying the atom bonded to the directly bonded atom, based on the bond
information included in the compound structure information;
a ring-structure determining procedure of determining whether the atom specified at
the atom specifying procedure is the atom forming the ring structure, based on the
ring structure information included in the compound structure information;
a pinching-point/reactive-site specifying procedure of specifying the atom specified
and another atom forming the ring structure together with the atom, as the atoms forming
the reactive site, when it is determined at the ring-structure determining procedure
that the atom specified at the atom specifying procedure is the atom forming the ring
structure, and specifying the directly bonded atom specified at the atom specifying
procedure, as the pinching point; and
a specified pinching-point/reactive-site-information acquiring procedure of acquiring
the pinching point information for the pinching point specified at the pinching-point/reactive-site
specifying procedure and the reactive site information for the reactive site specified,
from the binding site information and the compound structure information.
23. The drug-metabolizing enzyme prediction program according to any one of claims 19
to 22, wherein
the reactive-site identifying condition is such that a reactive-site coordinate range
that is a range of the atomic coordinate information for the atoms forming the reactive
site, a reactive-site angle range that is a range of an angle value between each of
the atoms forming the reactive site and the pinching point, and a reactive-site distance
range that is a range of a distance value between each of the atoms forming the reactive
site and the pinching point are defined for each of the molecular species previously
specified, and
the reactive-site-information identifying procedure includes
a coordinate-based reactive-site-information determining procedure of determining
whether the atomic coordinate information for the atoms, which form the reactive site
corresponding to the reactive site information acquired at the pinching-point/reactive-site-information
acquiring procedure, satisfies the reactive-site coordinate range defined by the reactive-site
identifying condition, in the molecular species corresponding to the molecular species
information identified;
an angle-based reactive-site-information determining procedure of calculating the
angle value between the atom forming the reactive site corresponding to the reactive
site information and the pinching point corresponding to the pinching point information,
and determining whether the angle value calculated satisfies the reactive-site angle
range defined by the reactive-site identifying condition, in the molecular species
corresponding to the molecular species information identified;
a distance-based reactive-site-information determining procedure of calculating the
distance value between the atom forming the reactive site corresponding to the reactive
site information and the pinching point corresponding to the pinching point information,
and determining whether the distance value calculated satisfies the reactive-site
distance range defined by the reactive-site identifying condition, in the molecular
species corresponding to the molecular species information identified; and
a determination-result-based reactive-site-information identifying procedure of identifying
the reactive site information determined, as being satisfied, at the coordinate-based
reactive-site-information determining procedure, the angle-based reactive-site-information
determining procedure, and the distance-based reactive-site-information determining
procedure, as the reactive site information that satisfies the reactive-site identifying
condition.
24. The drug-metabolizing enzyme prediction program according to any one of claims 19
to 23, wherein
the pinching-point identifying condition is such that a pinching-point coordinate
range that is a range of the atomic coordinate information for the atom as the pinching
point is defined for each of the molecular species previously specified, and
the pinching-point-information identifying procedure includes a coordinate-based pinching-point-information
identifying procedure of identifying the pinching point information when the atomic
coordinate information for the atom, which is the pinching point corresponding to
the pinching point information acquired at the pinching-point/reactive-site-information
acquiring procedure, satisfies the pinching-point coordinate range defined by the
pinching-point identifying condition, in the molecular species corresponding to the
molecular species information identified.
25. The drug-metabolizing enzyme prediction program according to any one of claims 19
to 24, further causing the computer to execute:
an open-space-site-information acquiring procedure of acquiring open-space site information
that is information on the atom forming an open space site, from the binding site
information, the reactive site information, the pinching point information, and the
compound structure information, if there exists the open space site that is a site
other than the reactive site corresponding to the reactive site information identified
at the reactive-site-information identifying procedure and the binding site corresponding
to the binding site information identified at the binding-site/molecular-species-information
identifying procedure and which is bound to the pinching point corresponding to the
pinching point information identified at the pinching-point-information identifying
procedure; and
an open-space-site-information identifying procedure of identifying the open-space
site information acquired at the open-space-site-information acquiring procedure,
from the molecular species information and the compound structure information, based
on an open-space-site identifying condition for identifying the open space site.
26. The drug-metabolizing enzyme prediction program according to claim 25, wherein
the open-space-site identifying condition is such that an open-space-site coordinate
range, which is a range of the atomic coordinate information for the atom forming
the open space site, is defined for each of the molecular species previously specified,
and
the open-space-site-information identifying procedure includes a coordinate-based
open-space-site-information identifying procedure of identifying the open-space site
information when the atomic coordinate information for the atom, which forms the open
space site corresponding to the open-space site information acquired at the open-space-site-information
acquiring procedure, satisfies the open-space-site coordinate range defined by the
open-space-site identifying condition, in the molecular species corresponding to the
molecular species information identified.
27. The drug-metabolizing enzyme prediction program according to any one of claims 19
to 26, further causing the computer to execute:
an inhibition determining procedure of determining whether the atom forming the reactive
site corresponding to the reactive site information or a partial-structure atom group
which is a group of part of the atoms in a molecular structure at the reactive site
inhibits the metabolic reaction, when the reactive site information is identified
at the reactive-site-information identifying procedure, based on an inhibition determining
condition for determining whether the metabolic reaction is inhibited in the compound
of which the reactive site information is identified.
28. A computer-readable recording medium that stores therein the drug-metabolizing enzyme
prediction program according to any one of claims 19 to 27.