Products CONFLEX Algorithm

Why Use CONFLEX?

Critical to drug discovery and chemical engineering is fast, accurate conformation searching and analysis. No other product at this price point offer the features and benefits of CONFLEX.

  • Exhaustive conformation searches
  • Fast and highly accurate
  • Handles large molecules
  • Parallel computing option
  • Available as stand-alone engine or with CONFLEX interface
  • Affordable licensing plans

CONFLEX is used by government research facilities, industrial chemical companies, and pharmaceutical organizations to compute:

  • Optimum geometry
  • Conformations
  • Potential energy lists
  • Crystal packing

CONFLEX Search Algorithm

algorithm

For convenience, the structures being processed are referred to differently depending on the particular stage of the search process:

  • input structure is the input at the beginning of a search
  • initial structure is called from the conformer storage database at the beginning of each perturbation cycle
  • starting (trial) structure is the perturbed structure before geometry-optimization
  • optimized structure is the structure after geometry-optimization
  • stored structure is the optimized structure that survived the redundancy test, and it is saved in the conformer storage.

The algorithms for searching conformational space or a torsional hypersurface involve repeated sampling from the vast conformational space. The process, which is illustrated below (a curved arrow indicates a looping subprocess), has the following steps:

  1. Generation of an appropriate starting structure
  2. Geometry-optimization of the starting structure
  3. Comparison of the optimized structure with the stored conformers

Finally, a structure passing all comparisons is added to the list of stored conformers.

In the advanced conformational space search algorithms, the first step consists of two subprocesses:

  1. Selection of an initial structure from among the stored conformers
  2. Assignment of appropriate structural perturbation to the initial structure to produce the coordinates of a starting structure.

Whereas the overall scheme is similar to most of the known conformation-generators, new strategies have been implemented for each step. In the following sections, these strategies are described.

CONFLEX Space Search
Input and Output Structures
Process Input Output
(1) None a Starting
(1-1) None or Stored Initial
(1-2) Initial Starting
(1-3) None Stored
(2) Starting Optimized
(3) Optimized None/Optimized
(3-1) Optimized Stored
a At the beginning, no input structure exists in the conformation storage.

Generation of Starting Structure

The lowest-energy structure of the unused, stored conformers is chosen as the initial structure (a structure is considered to be unused if it has never been perturbed to generate a starting structure). This strategy is key to searching through the region in the conformational space, in the most efficient way towards a lower energy area. This is similar to a stream filling an empty reservoir by finding the the lowest point.

CONFLEX Generation

The strategy is illustrated schematically here for a conformational space search by the reservoir-filling algorithm.

  • The initial structures are sampled in the order indicated by the number appearing next to the conformers (ellipses). A pair of conformers connected by a line comprise an initial structure and a conformer generated from that initial structure through perturbation and geometry-optimization.
  • When the search limit is set to the level of Limit 1, the search is complete after the initial structures 1 through 17 have been processed.
  • Then the search limit is increased to Limit 2, and conformers 18 through 25 are subjected to initial structures.


Variable Search Limit

When the global minimum or the instant global minimum in a local area is reached, the search moves towards a higher-energy area, and sometime local dips are overlooked in the process. The gradual expansion can be utilized to guarantee the thoroughness of the low-energy search.

The search limit is first decided - for example 5 kcal/mol from the global energy minimum. All the conformers within this first window (Limit 1) are used as the initial structures. During this search, all the conformers are saved, even those with energies higher than Limit 1.

The limit is increased to Limit 2 and all the conformers in the new window (between Limit 1 and 2) are subjected to initial structures in order of increasing energy. If local networks of conformers (LAN-a and LAN-b) are not connected within the first window, LAN-b is searched after LAN-c in the second window. A window width of 7 to 10 kcal/mol from the global minimum is generally sufficient to achieve an exhaustive search of the first, chemically meaningful window. Therefore, the variable limitation technique is a useful strategy for keeping the search region under control during the universal search.


Local Perturbation

If the perturbation cannot move the starting structure from the territory to which it initially belonged, subsequent geometry-optimization will return it to the same structure. The territories of similar conformers may be located in close vicinity in the conformational space, constituting a local network of local territories, and less similar conformers may be considered to belong to different localities. Therefore, the method of perturbing an initial structure to produce a new candidate conformer is also responsible for the efficiency of search.

To ensure exhaustive generation of all possible starting structures, local perturbation is applied to every flexible part in the initial structure. The following three modes of perturbation are designed to mimic the elementary process in the thermal movements of a molecule undergoing conformational change: corner flap and edge flip for endocyclic parts, and stepwise rotation for acyclic parts.

Corner Flap

Corner flap exploits the puckered feature of a ring structure, and involves the movement of a corner ring-atom to the other side of the local average plane. Two to four contiguous dihedral angles are simultaneously changed along the ring. The advantages of corner flap are:

  • it is highly efficient in producing a new energy-minimum
  • it can be considered to mimic a barrier-crossing step in the elementary process of thermal conformational interconversion
  • it generally does not propagate itself, and can therefore be applied to every ring-atom. In many cases, as many starting structures as the number of ring-atoms can be obtained.
CONFLEX Corner Flap

Edge Flip

Whilst corner flap is efficient for small to medium ring structures, it is sometimes an unsuccessful technique for larger rings. For example, a ring-atom that is lying on the average local ring plane and flanked by a pair of gauche bonds of the same sign cannot be flapped. Corner flap alone cannot produce all the nearby energy minima for larger rings where mechanisms of conformational interconversion involving more than four contiguous bonds exist.

Edge flip is where two adjacent ring-atoms are simultaneously given the corner flap in opposite directions, where a ring bond is flipped. Edge flip is best illustrated by the chair-to-twist boat conversion of cyclohexane. An edge flip perturbation mode, which proved to be effective for large rings, consists of simultaneous small flapping of two adjacent ring-atoms towards the inside of the ring. In contrast to the other two perturbation methods, both directed towards the outside of the ring, this mode is a local inflection occurring for ring structures with a large cavity space. The edge flip involves rotation of two to five contiguous ring bonds, and are called when certain combinations of three contiguous dihedral angles patterns (A, G, G’) appear along the ring. Careful structural adjustments are given in order to ensure smooth transformation of local structure by edge flip, which is actually a much larger perturbation.

CONFLEX Edge Flip

Stepwise Rotation

The acyclic part of a molecule is perturbed by stepwise rotation. Typically, a Csp 3 -Csp 3 bond is given 120 and -120 rotations to produce a pair of new rotamers. This method, combined with the reservoir-filling strategy for guiding the search direction, searches low-energy conformers for short side chains and linear molecules with up to six to ten rotatable acyclic bonds. However, it is not as effective for large molecules with multiple branching, where unexpectedly high-energy starting structures are produced.


Optimization

Pre-check during optimization

The time required to perform systematic perturbations on the initial structure comprises only a few percent of the total computing time, while by far the most time-consuming step is the geometry-optimization. Therefore, the pre-check is an effective way to increase the efficiency of the conformational space search. In CONFLEX, a structure that is being geometry-optimized is frequently compared with all the stored conformers during the optimization, and that calculation is stopped as soon as the candidate structure is identified as superimposable with one of the stored structures. However, if comparison is made too often, the comparison time will take up a significant part of the total time when the number of stored structures increases. Therefore, with this option, CONFLEX will perform the comparison at 0, 10, 20, and every 10 iterations until 200 iteration, and every 50 iterations thereafter. This strategy reduces the total computing time by 30 to 60%.

Comparison

Comparison using conformational distance

The similarity between two conformers can be quickly identified by comparing the root mean square of differences in the corresponding pair of dihedral angles, f A and f B. This method is fast and accurate. To save time, the dihedral angles of a stored conformer are retrieved for comparison rather than re-calculating those dihedral angles each time.

CONFLEX Comparison

Part of PARALLEL CONFLEX was developed under a “Grant-in-Aid for Project Costs Associated with Innovation Creation with the Collaboration of Industry, Government, and Universities” of the Japan Ministry of Education, Culture, Sports, Science and Technology.

Back To TOP