-
Notifications
You must be signed in to change notification settings - Fork 11
Introduction to AARON: Screening Ligands
The simplest and most common way to use AARON is to predict stereoselectivities for different catalysts/ligands for a given reaction.
This could be substituted analogs of a single ligand or different ligand scaffolds, or both. We'll cover each of these uses below. For each, we'll use an existing TS library (see TS Library). See Building a TS Library for instructions on building your own TS library.
For all of these examples, we'll use a simplified version of an organocatalyzed allylation reaction (see REF). This is a simple reaction with a single, well-defined elementary step that is stereocontrolling. The structures in the TS library for this reaction are catalyzed by 2,2'-bipyridine-N,N'-dioxide. There are a total of 10 TS structures in the TS library: 5 leading to the (R)-alcohol and 5 leading to (S). These correspond to different arrangements of substrate/catalyst around the hexacoordinate silicon. However, for these examples we will compute only the four lowest-lying TS structures (R/ts2, S/ts1, S/ts2, and S/ts3) to speed up the examples. For real applications, it is advisable to compute all of the TS structures in the TS library.
The four structures in this library can be viewed here
First, let's consider the case screening different substituted versions of the ligand/catalyst found in the TS library.
We'll make predictions for two substituted bipyridine-N,N'-dioxides in which we've replaced hydrogen atoms 43 and 46 with F and Ph (see the structures here)
Note: When specifying the substitution of ligand atoms, the numbers are the atom numbers for the ligand only! This means that if you are adding a new ligand to a TS geometry (see below), you should look at the ligand XYZ file from the Ligand library rather than an example of a TS structure with this ligand!
Thus, we will request replacement of atoms 18 and 21 of the ligand, which correspond to atoms 43 and 46 in the TS structure including the substrate!
reaction_type=Allylation template=NN-dioxide_example charge=1 method=b97d denfit=true basis=6-31G &Ligands Cat1: 18=F 21=F Cat2: 18=Ph 21=Ph &
The reaction_type and template keywords are required, as is the 'Ligands' section. All other keywords are optional, and will default to the values found in $HOME/.aaronrc (if it exists) or $QCHASM/Aaron/.aaronrc
This input file requests that we use the TS structures from
$QCHASM/Aaron/TS_geoms/Allylation/NN-dioxide_example
The only keywords are the charge, denfit, method, and basis set. We need to specify the charge (+1), since the default charge in $QCHASM/Aaron/.aaronrc is neutral (0). The basis set is set to 6-31G to make these examples run a little more quickly. We use B97D with density fitting (denfit=true).
The input file above requests that Aaron compute all 4 TS structures found in the TS library for the catalyst Cat1 constructed by replacing atoms 18 and 21 of the ligand present in the TS library with F atoms and catalyst Cat2 by adding Ph groups. Note that we can use any name we'd like to specify these ligands/catalysts (i.e. Lig19, cat19a, etc).
Save the above input file to example1.in and then run
Aaron example1.in
AARON will first make directories for Cat1 and Cat2 then in each one construct a directory tree mimicking that found in the TS library for this reaction:
R/ ts2/ S/ ts1/ ts2/ ts3/AARON will then proceed through a prescribed series of 5 different steps to locate and characterize each TS structure, printing an updated status for each TS structure as it proceeds.
For instance, here is the status update after these jobs have been running for some time:
Status for all jobs...(Sun Aug 5 18:15:21 2018) -------------------------------------------------------------------------------- Status for Cat1 jobs...... The following jobs are pending: Cat1/R/ts2 step 3 attempt 1: No msg recorded Cat1/S/ts1 step 3 attempt 1: No msg recorded Cat1/S/ts2 step 3 attempt 1: No msg recorded Cat1/S/ts3 step 3 attempt 1: No msg recorded -------------------------------------------------------------------------------- Status for Cat2 jobs...... The following jobs are running: Cat2/R/ts2/Cf1 step 2 attempt 1 cycle 1. Progress: 0.069840/NO, 0.003582/NO, 0.409419/NO, 0.000467/NO Cat2/S/ts1/Cf1 step 2 attempt 1 cycle 1. Progress: 0.089284/NO, 0.007370/NO, 0.535821/NO, 0.001161/NO The following jobs are pending: Cat2/S/ts2/Cf1 step 2 attempt 1: No msg recorded Cat2/S/ts3/Cf1 step 2 attempt 1: No msg recorded The following jobs have not been started and are awaiting other jobs to finish: Cat2/R/ts2/Cf2 Cat2/R/ts2/Cf3 Cat2/R/ts2/Cf4 Cat2/S/ts1/Cf2 Cat2/S/ts1/Cf3 Cat2/S/ts1/Cf4 Cat2/S/ts2/Cf2 Cat2/S/ts2/Cf3 Cat2/S/ts2/Cf4 Cat2/S/ts3/Cf2 Cat2/S/ts3/Cf3 Cat2/S/ts3/Cf4 Aaron will check status of all jobs after 300 seconds Sleeping... Aaron will check status of all jobs after 300 seconds Sleeping...
AARON lists running, pending, and finished jobs for each catalyst/ligand. For Cat1, all four TS structures have advanced to Step 3. For Cat2, the initial conformation (Cf1) of the four TS structures are awaiting Step 1. The other conformations (Cf2, Cf3, etc., which correspond to different combinations of rotamers of the two Ph groups) are awaiting the completion of the Cf1 optimizations (AARON performs a hierarchical search of conformations of rotatable substituents).
Also reported is the number of 'attempts' that have been made for the current step, as well as the number of cycles (a new cycle is started any time AARON needs to return to a previous Step in order to correct an erroneous geometry).
When there are running jobs, 'Progress' section reports the Maximum Force, RMS Force, Maximum Displacement, and RMS Displacement along with whether or not these are converged for the current geometry optimization step:
Cat1/R/ts1 step 2 attempt 1 cycle 1. Progress: 0.009347/NO, 0.000378/YES, 0.037218/NO, 0.000089/YES
If you wish to examine any particular input (.com) or output (.log) file, these can be readily located within the directory structure built by AARON. These files are named according to the ligand/catalyst name, stereoisomer, TS number, and the step number.
For instance,
Cat1.R.ts1.1.com Cat1.R.ts1.1.log
are the input and output files for Step 1 (a quick semi-empirical optimization of the newly-added substituents) for TS1(R) for Cat1.
AARON will automatically check the status of all jobs every 5 minutes (by default). If a job has completed, it will automatically be advanced to the next Step. If any problems are detected with a job (an error in the output file, a structure in which connectivity has changed, etc.) AARON will attempt to fix the problem and will re-submit a new job automatically.
In this way, AARON minimizes wasted CPU time, since problems with optimizations are detected and corrected within 5 minutes.
After each 5 minute update, AARON will print relative and absolute energies, enthalpies, etc. for any completed optimizations to a '_thermo' file (e.g. example1_thermo.dat).
Next, let's consider the case of screening different a different catalyst as well as a substituted analog of this catalyst. We will again use the organocatalyzed allylation reaction.
We'll replace the bipyridine-N,N'-dioxide found in the TS library with a biisoquinoline-N,N'-dioxide. We'll run both the unsubstituted case and that with atoms at the 3,3'-positions (atoms
reaction_type=Allylation template=NN-dioxide method=b97d basis=b97d charge=1 &Ligands Cat1: 18=F 21=F Cat2: ligand=bi-isoquinoline-NN-dioxide & &Substrates Sub1: 14=CN &
In this case, we are making predictions for two catalysts and two substrates. The first substrate is that contained in the TS library, which is run by default. AARON will also compute all TS structures for substrate 'Sub1' constructed by replacing atom 14 of the substrate with CN. Note that it will not actually start running calculations for the 'new' substrate until the structures for the original substrate have been optimized--these structures are then used as the templates for the new substrate.
The final results should be as follows: