Thursday, June 13, 2013

Unraveling the Enigmatic Mechanism of L‐Asparaginase II with QM/ QM Calculations

Diana S. Gesto, Nuno M. F. S. A. Cerqueira, Pedro A. Fernandes, and Maria J. Ramos J. Am. Chem. Soc. 2013, 135, 7146
Contributed by Jonathan Goodman

This paper discusses the hydrolysis of asparagine


The transformation is simple, but the catalyst, L-Asparaginase II, is not. The active site of the enzyme contains a number of functional groups which work together to hydrolyse the enzyme. 


The currently accepted mechanism is that the process is related to that of a serine protease, with Thr12 attacking the amide to form an ester that is then hydrolysed. This mechanism looks reasonable. The three-dimensional structure shows that Thr12 is poised over the carbonyl of the amide, close to the π* orbital of the carbonyl and in about the right position to attack. However, studies of mutants of the enzyme do not provide unambiguous support for this proposal. 

This paper reports a computational study of the system using an ONIOM approach, with B3LYP/6-31G(d) for the high-level layer and AM1 for the low-level layer. This split approach made it possible to follow the reaction pathways for the system. Minima and transition states were recalculated using single point calculations at the M06-2X/6-311++G(2d,2p) level. This method showed that the currently accepted mechanism, summarized in the figure below showing the key Thr12-substrate interaction in cyan, was a high energy pathway, and an alternative mechanism involving the attack of a water molecule and stabilization of the intermediate oxyanion by Thr12 was more accessible. The new mechanism is surprising in that Thr12 is above the carbonyl group and so not in a good position to interact with the lone pair of the amide carbonyl. However, this configuration does fit the grand jeté orientation that we recently highlighted for mechanisms involving oxyanion holes (doi: 10.1039/C2OB06717J).



This paper demonstrates that the calculations can provide a sufficiently accurate analysis of the system to inspire an alternative to the currently accepted mechanism. The new mechanism fits all of the available experimental data; it was hard to see how some of the experimental data fitted the original mechanism. The new mechanism, therefore, is to be preferred. If both mechanisms had fitted the available experimental data, would these demanding calculations have been enough to change the currently accepted mechanism to the new process?  

Friday, May 31, 2013

Describing Noncovalent Interactions beyond the Common Approximations: How Accurate Is the “Gold Standard,” CCSD(T) at the Complete Basis Set Limit?

Řezáč, J.; Hobza, P.  J. Chem. Theor. Comput., 2013, 9, 2151
Contributed by Steven Bachrach.
Reposted from Computational Organic Chemistry with permission

The gold standard in quantum chemistry is the method that is considered to be the best, the one that gives accurate reproduction of experimental results. The CCSD(T) method is often referred to as the gold standard, especially when a complete basis set (CBS) extrapolation is utilized. But is this method truly accurate, or simply the highest level method that is within our reach today?

Řezáč and Hobza1 address the question of the accuracy of CCSD(T)/CBS by examining 24 small systems that exhibit weak interactions, including hydrogen bonding (e.g. in the water dimer and the waterammonia complex), dispersion (e.g. in the methane dimer and the methaneethane complex) and π-stacking (e.g. as in the stacked ethene and ethyne dimers). Since weak interactions result from quantum mechanical effects, these are a sensitive probe of computational rigor.

A CCSD(T)/CBS computation, a gold standard computation, still entails a number of approximations. These approximations include (a) an incomplete basis set dealt with by an arbitrary extrapolation procedure; (b) neglect of higher order correlations, such as complete inclusion of triples and omission of quadruples, quintuples, etc.; (c) usually the core electrons are frozen and not correlated with each other nor with the valence electrons; and (d) omission of relativistic effects. Do these omissions/approximations matter?

Comparisons with calculations that go beyond CCSD(T)/CBS to test these assumptions were made for the test set. Inclusion of the core electrons within the correlation computation increases the non-covalent bond, but the average omission is about 0.6% of the binding energy. The relativistic effect is even smaller, leaving it off for these systems involving only first and second row elements gives an average error of 0.1%. Comparison of the binding energy at CCSD(T)/CBS with those computed at CCSDT(Q)/6-311G** shows an average error of 0.9% for not including higher order configuration corrections. The largest error is for the formaldehyde dimer (the complex with the largest biding energy of 4.56 kcal mol-1) is only 0.08 kcal mol-1. If all three of these corrections are combined, the average error is 1.5%. It is safe to say that the current gold standard appears to be quite acceptable for predicting binding energy in small non-covalent complexes. This certainly gives much support to our notion of CCSD(T)/CBS as the universal gold standard.
An unfortunate note: the authors state that the data associated with these 24 compounds (the so-called A24 dataset) is available on their web site (www.begdb.com), but I could not find it there. Any help?


References

(1) Řezáč, J.; Hobza, P. "Describing Noncovalent Interactions beyond the Common Approximations: How Accurate Is the “Gold Standard,” CCSD(T) at the Complete Basis Set Limit?," J. Chem. Theor. Comput.,20139, 2151–2155, DOI: 10.1021/ct400057w.


Sunday, May 26, 2013

The Optical Rotation of Methyloxirane in Aqueous Solution: A Never Ending Story?

Lipparini, F.; Egidi, F.; Cappelli, C.; Barone, V. J. Chem. Theor. Comput. 2013, 9, 1880
Contributed by Steven Bachrach.
Reposted from Computational Organic Chemistry with permission

Computing the optical rotation of simple organic molecules can be a real challenge. One of the classic problems is methyloxirane. DFT typically gets the wrong sign, let alone the wrong value. Cappelli and Barone1 have developed a QM/MM procedure where methyloxirane is treated with DFT (B3LYP/aug-cc-pVDZ or CAM-B3LYP/aubg-cc-pVDZ). Then 2000 arrangements of water about methyloxirane were obtained from an MD simulation. For each of these configurations, a supermolecule containing methyloxirane and all water molecules with 16 Å was identified. The waters of the supermolecule were treated as a polarized force field. This supermolecule is embedded into bulk water employing a conductor-polarizable continuum model (C-PCM). Lastly, inclusion of vibrational effects, and averaging over the 2000 configurations, gives a predicted optical rotation at 589 nm that is of the correct sign (which is not accomplished with a gas phase or simple PCM computation) and is within 10% of the correct value. The full experimental ORD spectrum is also quite nicely matched using this theoretical approach.


References

(1) Lipparini, F.; Egidi, F.; Cappelli, C.; Barone, V. "The Optical Rotation of Methyloxirane in Aqueous Solution: A Never Ending Story?," J. Chem. Theor. Comput. 20139, 1880-1884, DOI:10.1021/ct400061z.


InChIs

(R)-Methyloxirane:
InChI=1S/C3H6O/c1-3-2-4-3/h3H,2H2,1H3/t3-/m1/s1
InChIKey=GOOHAUXETOMSMM-GSVOUGTGSA-N

Creative Commons License
This work is licensed under a Creative Commons Attribution-NoDerivs 3.0 Unported License.

Monday, May 20, 2013

Corrected small basis set Hartree-Fock method for large systems


Contributed by +Jan Jensen 

Steven Bachrach recently highlighted Kruse and Grimme's geometric counterpoise scheme (gCP) - an add on term that corrects for BSSE in the same spirit as Grimme's dispersion correction.  One of the many interesting observations in this paper is that gCP- and dispersion-corrected HF minimal basis set calculations can yield very accurate binding energies.

The paper by Sure and Grimme investigates this particular issue further and presents the so-called HF-3c method.  "3c" stands for the 3 different corrections: gCP, dispersion, and a short-range basis incompleteness (SRB) correction for "systematically overestimated bond lengths for electronegative elements (e.g. N,O,F) when employing small basis sets."  HF-3c uses a minimal basis set for H, C, N, and O and a split valence basis for heavier elements.  In all only 9 empirically adjusted parameters are needed.

Being an HF/minimal basis set calculation the method can be applied to rather large molecules, but will be slower than corresponding semi-empirical calculations on a single core.  However, the HF-3c can of course be run in parallel. HF-3c interaction energies computed for the S22, S66, and X40 datasets are of similar accuracy to PM6-DH2 and significantly better than PM6.

HF-3c geometry optimizations and vibrational frequency calculations are found to significantly more numerically stable and offer more accurate structures (including protein structures) than PM6, as implemented in MOPAC2012.  HF-3c thus offers an attractive alternative when MOPAC2012 is giving imaginary frequencies.  If HF-3c can produces accurate transition state geometries, the method might also be a viable alternative to QM/MM calculations.

HF-3c is implemented in the ORCA package, which is available free of charge.

Creative Commons License
This work is licensed under a Creative Commons Attribution 3.0 Unported License.

  

Wednesday, May 15, 2013

The solution to the challenge in "Time-Reversible Random Number Generators" by Wm. G. Hoover and Carol G. Hoover

Federico Ricci-Tersenghi arXiv:1305.1805
Contributed by Dan Gezelter
Reposted from The OpenScience Project with permission


William G. Hoover (of the Nosé-Hoover Thermostat) and Carol G. Hoover issued a $500 challenge on arXiv to generate a time-reversible random number generator.  The challenge itself would be quite remarkable news.  What’s even better is that the challenge (including the source code for an implementation) was solved in 6 days by Frederico Ricci-Tersenghi.
Why is this a big deal?  Most of the equations in physics that govern time evolution of particles obey time-reversal symmetry; the same differential equations that govern molecular or planetary motion will take you back to your starting point if you suddenly reverse the time variable.  This is a usually a fantastic way to check to see if you are doing the physics correctly in your simulations, and also means that collections of  starting points that are related to each other behave in certain predictable ways when they evolve.
Stochastic approaches to physical motion introduce an aspect of randomness to mimic the behavior of complex phenomena like the motion of solvent surrounding the moleculewe’re interested in, or to mimic the transitions between different electronic states of a molecule.   The introduction of random numbers has meant we had to give up time-reversibility, and we’ve been willing to live with that for a long time because we can study more complicated phenomena.
If we have access to a time-reversible pseudo-random number generator, however, we get that very powerful tool back in our toolbox.
Now, the Langevin equation,
md2xdt2=Fγ(t)dxdt+R(t)

has two things that prevent it from being time-reversible.  Besides the stochastic or random force, R(t), there’s also a drag or friction force, γ(t)dxdt, that depends on the velocities of the particles.  There’s no solution yet to time reversibility for this piece (and I have my doubts that there ever will be a way to reverse this).

Wednesday, April 24, 2013

All Five Forms of Cytosine Revealed in the Gas Phase

Alonso, J. L.; Vaquero, V.; Peña, I.; López, J. C.; Mata, S.; Caminati, W.  Angew. Chem. Int. Ed. 2013, 52, 2331-2334
Contributed by Steven Bachrach.
Reposted from Computational Organic Chemistry with permission


Alonso and coworkers have again (see this post employed laser-ablation molecular-beam Fourier-transform microwave (LA-MB-MW)spectroscopy to discern the gas phase structure of an important biological compound: cytosine.1 They identified five tautomers of cytosine 1-5. Comparison between the experimental and computational (MP2/6-311++G(d,p) microwave rotational constants and nitrogen nuclear quadrupole coupling constants led to the complete assignment of the spectra. The experimental and calculated rotational constants are listed in Table 1.
Table 1. Rotational constants (MHz) for 1-5.

1
2
3
4
5

Expt
calc
Expt
calc
Expt
calc
Expt
calc
Expt
calc
A
3951.85
3934.5
3889.46
3876.5
3871.55
3856.0
3848.18
3820.1
3861.30
3844.2
B
2008.96
1999.1
2026.32
2014.7
2024.98
2012.3
2026.31
2019.0
2011.41
1999.7
C
1332.47
1326.8
1332.87
1326.9
1330.34
1323.3
1327.99
1324.0
1323.20
1318.4

The experimental and computed relative free energies are listed in Table 2. There is both not a complete match of the relative energetic ordering of the tautomers, nor is there good agreement in their magnitude. Previous computations2 at CCSD(T)/cc-pVQZ//CCSD//cc-pVTZ are in somewhat better agreement with the gas-phase experiments.
Table 2. Relative free energies (kcal mol-1) of 1-5.

expt
MP2/
6-311++G(d,p)
CCSD(T)/cc-pVQZ//
CCSD//cc-pVTZ
1
0.0
0.0
0.0
2
0.47
0.70
0.7
3
0.11
1.19
0.2
4
0.83
3.61
0.7
5

5.22

References

(1) Alonso, J. L.; Vaquero, V.; Peña, I.; López, J. C.; Mata, S.; Caminati, W. "All Five Forms of Cytosine Revealed in the Gas Phase," Angew. Chem. Int. Ed. 201352, 2331-2334, DOI:10.1002/anie.201207744.
(2) Bazso, G.; Tarczay, G.; Fogarasi, G.; Szalay, P. G. "Tautomers of cytosine and their excited electronic states: a matrix isolation spectroscopic and quantum chemical study," Phys. Chem. Chem. Phys.2011,13, 6799-6807, DOI:10.1039/C0CP02354J.


InChIs

cytosine: InChI=1S/C4H5N3O/c5-3-1-2-6-4(8)7-3/h1-2H,(H3,5,6,7,8)
InChIKey=OPTASPLRGRRNAP-UHFFFAOYSA-N


Creative Commons License
This work is licensed under a Creative Commons Attribution-NoDerivs 3.0 Unported License.