Note for Contact-assisted protein structure modeling by global optimization in CASP11
Three Model
- CSA global optimization method: the optimization of the energy function
- common energy functions:
1. E_{stereo-chemistry}
- the stereo-chemical terms related to bond lengths, bond angles, torsion angles, and improper torsion angles according to CHARMM22
2. E^{vdw}_{repul}
- the repulsive part of the van der Waals potential
Tp protocol
- predicted residue–residue contact information was provided.
earched for protein 3D models which were diverse and had low energie
ETp=Estereo−chemistry+Evdwrepul+Econtact+Ephys
Econtact=wcontact∑i⎧⎩⎨⎪⎪⎪⎪(ri−rupper)2(ri−rupper)2+δ2,ri>rupper0,otherwise
r_i is the distance between two Cb (Ca for glycine) atoms for each contact i;provided residue–residue contacts were likely to be incorrect
- not want to penalize contact violation too excessively
->functional form of the Lorentzian function was used for E_contact.
Ephys=Edfa+Edfire+EGOAP+Ehbond
E_phys includes all the necessary modeling energy terms for free modeling (ab initio modeling) :dfa: dynamic fragment assembly terms
- dfire: dfire statistical potential term
- hbond: the hydrogen bonding term
- GOAP: the GOAP term
Total procedure
- For initial structures -> used fold recognition models generated by nns protocol and all the server models provided from the CASP11 homepage
- Using CSA, typically generated up to 200 structures which were clustered by the structural similarity
- Each cluster was ranked based on the average value of Econtact and the average QA3 score
- The best cluster was identified and the final models were selected based on E_contact and the QA3 score in the cluster.
Ts protocol
The modeling of Ts targets is closely related to the NMR structure determination using NOE distance restraints
the contact restraints provided for Ts targets were highly ambiguous —> many atom–atom pairs were assigned to a peak
CSA optimization is applied to the following energy function:
ETs=Estereo−chemistry+Evdwrepul+Echiral+ECMAP+ENOE
- Estereo-chemistry: constrains bond lengths, bond angles, torsion angles and improper torsion angles, and taken from the CHARMM22 force field
- E_vdw^repul: the repulsive part of the van der Waals potential
- E_chiral: keep the chirality of the amino acid residue in the L form
- Since the driving force for the structure modeling of Ts targets is hydrogen–hydrogen distance restraints, without Echiral, the mirror image of a given structure (either in L or D form) will be as stable as the given structure
- ECMAP: used to provide a cross-term correction for two adjacent torsion angles
- which is a standard stereo-chemical term in the CHARMM force field
- NOE: for a given set of ambiguous restraints
ENOE=∑i⎧⎩⎨⎪⎪⎪⎪ki(Ri−dupperi)2,Ri>dupperi0,dloweri<Ri<dupperiki(Ri−dloweri)2,Ri<dloweri
Ri=(∑jd−6i)−16
- the sum over j is taken for all possible distance pairs provided for the ith restraint.
ki=103/dupperikcalmol−1A−1
- the conventional usage of the effective distance Ri suits our purpose of solving the two-tier optimization problem, where the variation of an assigned peak to an atom–atom pair is possible
- the sum over j is taken for all possible distance pairs provided for the ith restraint.
- Depending on the initial protein structure, the minimization of E_Ts will provide various peak assignment possibilities (i.e., the identity of the atom–atom pair contributing the most in 1/di^6 varies)
Total Procedure
- For initial structures, we used the same initial models as used in the Tp protocol
- generated up to 700 structures which were subsequently
reminimized using the LBFGS local minimization method on the new energy function:
E′Ts=ETs+Eelectro−static+Egbsa
- the second and the third terms respectively correspond to the CHARMM22 electro-static potential and gbsa solvation energy
- generated up to 700 structures which were subsequently
- Final models were selected based on the values of E_ts and the Ramachandran-favored score calculated by the MolProbity program
Tc protocol
total 24 Tc target:17 (Ts,Tc) + 7 (Tc)
- For 17 (Ts,Tc):
E1Tc=ETs+Econtact
- For 7 (Tc):
E2Tc=ETp
- Note: arguments is different from E_ts and E_tp.