
581: CASP10 Target T0711

Closed since over 13 years ago

Intermediate Overall Prediction

Summary


Created
June 17, 2012
Expires
Max points
100
Description

The tenth CASP10 target will look familiar: it is also 33 residues long, and the 3 templates we are giving you are the same as Puzzle 574's. Note the short deadline, since this target is so small. More details about this CASP target are in the puzzle comments.

Top groups


  1. Go Science 100 pts. 8,583
  2. Contenders 81 pts. 8,550
  3. Anthropic Dreams 64 pts. 8,541
  4. FoldIt@Netherlands 50 pts. 8,536
  5. Void Crushers 39 pts. 8,534
  6. HMT heritage 30 pts. 8,528
  7. L'Alliance Francophone 23 pts. 8,525
  8. foldeRNA 17 pts. 8,516
  9. Deleted group pts. 8,501
  10. Gargleblasters 9 pts. 8,496

  41. bendbob Lv 1 44 pts. 8,488
  42. martinzblavy Lv 1 43 pts. 8,487
  43. Bautho Lv 1 42 pts. 8,487
  44. Galaxie Lv 1 41 pts. 8,485
  45. nemo7731 Lv 1 40 pts. 8,483
  46. Deleted player pts. 8,481
  47. randomlil Lv 1 38 pts. 8,481
  48. SKSbell Lv 1 37 pts. 8,481
  49. Lindata Lv 1 36 pts. 8,480
  50. goastano Lv 1 35 pts. 8,479

Comments


beta_helix Staff Lv 1

Here is the CASP link for this target (showing the amino acid sequence):
http://predictioncenter.org/casp10/target.cgi?id=120
__________________

Here is the sequence logo predicted by the SAM server.

H = helix
E = sheet
C = loop (or coil)

The taller the letter at each position, the higher the probability of that specific secondary structure for that amino acid.

You can see that these secondary structure predictions imply that this protein is mostly loop!
__________________

Link to template PDBs:

http://www.pdb.org/pdb/explore/explore.do?structureId=1lu0
http://www.pdb.org/pdb/explore/explore.do?structureId=1w7z
http://www.pdb.org/pdb/explore/explore.do?structureId=1h9h

brow42 Lv 1

Perhaps it is more important to get the cysteines in the right spot than to match the amino acid sequence (these are insensitive to substitution in general, aren't they?)

This sequence has 1-11-5-3-1-5-1 loops. The templates are x-6-5-3-1-5-x. The next time a probable cysteine knot comes up, could you include the best homologue that has the same internal loop lengths, even if it doesn't match AA or type?
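The loop notation above can be reproduced by splitting a sequence at each cysteine and counting the residues in between. Here is a minimal Python sketch of that idea. The function names and the toy sequence are made up for illustration: the string is not the real T0711 sequence, only a placeholder built to have the same cysteine spacing, and the template's unknown end lengths (the x's) are entered as 0.

```python
def cysteine_spacing(sequence):
    """Lengths of the non-cysteine stretches: N-terminus, internal loops, C-terminus."""
    return [len(segment) for segment in sequence.upper().split("C")]


def same_internal_loops(spacing_a, spacing_b):
    """Compare only the loops between the first and last cysteine (ignore the ends)."""
    return spacing_a[1:-1] == spacing_b[1:-1]


# Hypothetical toy sequence with the 1-11-5-3-1-5-1 pattern (NOT the real target).
segments = ["G", "A" * 11, "A" * 5, "A" * 3, "A", "A" * 5, "G"]
toy_target = "C".join(segments)

target_spacing = cysteine_spacing(toy_target)
# Template pattern x-6-5-3-1-5-x; the end lengths are unknown, so 0 is a placeholder.
template_spacing = [0, 6, 5, 3, 1, 5, 0]

print("-".join(str(n) for n in target_spacing))
print(same_internal_loops(target_spacing, template_spacing))
```

With these inputs the only internal mismatch is the first loop (11 residues in the target versus 6 in the templates), which is the difference a homologue search on loop lengths would need to resolve.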

krulon Lv 1

Should a disulfide bridge give a negative score? Can someone please explain this?

Negative disulfide score

brow42 Lv 1

Disulfides are tricky. They do score negative sometimes. This means when you shake sidechains, the client will usually break the bridge to improve the score, making it really hard to evolve a good bridge. This is why I made the Bridge Wiggle script.

However, that still didn't work well enough, because if you got into position for a good disulfide score, the sidechain score would be negative, like -20. Auuuugh! A shake sidechain would again break the bridge to raise the score, or you'd keep the bridge but have a low scoring segment. I've modified Bridge Wiggle to optimize the total sidechain + disulfide score.

However, that still didn't work well enough. The protein isn't flexible enough to find a good scoring position, and if you did find a good position, the backbone is negative! Auuuugh!

This puzzle and 573b are easier because the templates all have proper bridges, so you start with high scoring bridges and they tend to stick around... but people are passing me in rank and I bet they don't have all 3 bridges.

I don't know the details of how mini-rosetta scores this, but in general I think it measures dihedral angles and then looks up the probability of those angles based on real proteins. The Wikipedia page here http://en.wikipedia.org/wiki/Disulfide_bond#Occurrence_in_proteins says that the C1-S-S-C2 dihedral is usually close to ±90 degrees, meaning that the C1-S-S plane and the S-S-C2 plane (each formed by the last bond of one cysteine and the S-S bond itself) are rotated about 90 degrees relative to each other. Make an L of your two hands and join the thumbs, then rotate one hand 90 degrees... that's what it has to look like. Anything else is highly penalized with a negative score for being unrealistic.
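To make the geometry concrete, here is a small Python sketch that computes a C1-S-S-C2 dihedral from four atom positions using the standard atan2 formula. The coordinates are idealized points chosen for illustration, not atoms from any real structure, and the "deviation" helper is only a rough stand-in I made up; it is not how mini-rosetta actually scores disulfides.

```python
import math


def _sub(a, b):
    return tuple(x - y for x, y in zip(a, b))


def _dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def _cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])


def dihedral_degrees(p0, p1, p2, p3):
    """Signed dihedral angle (degrees) about the p1-p2 bond."""
    b0 = _sub(p0, p1)
    b1 = _sub(p2, p1)
    b2 = _sub(p3, p2)
    length = math.sqrt(_dot(b1, b1))
    b1 = tuple(x / length for x in b1)
    # Project the outer bonds onto the plane perpendicular to the central bond.
    v = _sub(b0, tuple(_dot(b0, b1) * x for x in b1))
    w = _sub(b2, tuple(_dot(b2, b1) * x for x in b1))
    return math.degrees(math.atan2(_dot(_cross(b1, v), w), _dot(v, w)))


def deviation_from_right_angle(angle):
    """Hypothetical measure: how far the dihedral is from +90 or -90 degrees."""
    return abs(abs(angle) - 90.0)


# Idealized positions (assumed): C1, S1, S2, C2 with the two end bonds perpendicular.
c1, s1, s2, c2 = (0, 1, 0), (0, 0, 0), (1, 0, 0), (1, 0, 1)
angle = dihedral_degrees(c1, s1, s2, c2)
print(angle, deviation_from_right_angle(angle))
```

A flat arrangement, with both carbons on the same side of the S-S bond and in the same plane, gives a dihedral of 0 and therefore the largest deviation under this measure.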

That doesn't say anything about the C-S-S angle, but I strongly suspect that also needs to be 90 degrees, L instead of V.

I made Bridge Wiggle band up cysteines in the exact lengths to force these bond angles…I thought it would give me good scores…It didn't! Auuuugh!

I've suspected that maybe mini-rosetta needs to have a separate sidechain table for bonded and not-bonded cysteines. Maybe it does, or maybe they aren't really different. However, if foldit/rosetta consistently fail to converge on knotted conformations in cysteine knots, maybe they'll look into that. We'll find out when the CASP natives are released.

For the record, my current best on this puzzle has disulfide scores of 7 to 9 (9 is the highest I've ever seen) and sidechain scores of -2 to -1, which is pretty darn good. So maybe it does work. I got this far by using the templates, and by modifying scripts to 'restore-best-with-3-bridges' instead of just restore-best.
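The 'restore-best-with-3-bridges' idea boils down to a filter on which saved poses are eligible to be restored. Real Foldit scripts are written in Lua against the game's own API; the Python below is only a sketch of the selection logic, with a made-up Snapshot record and made-up scores.

```python
from dataclasses import dataclass


@dataclass
class Snapshot:
    """Hypothetical record of a saved pose: total score and intact disulfide count."""
    score: float
    bridges: int


def best_with_bridges(snapshots, required_bridges=3):
    """Return the highest-scoring snapshot that keeps enough bridges, or None."""
    eligible = [s for s in snapshots if s.bridges >= required_bridges]
    return max(eligible, key=lambda s: s.score, default=None)


# Made-up history: the top-scoring pose has lost a bridge, so it is skipped.
history = [Snapshot(8500.0, 2), Snapshot(8480.0, 3), Snapshot(8470.0, 3)]
chosen = best_with_bridges(history)
print(chosen)
```

A plain restore-best would pick the 8500 pose here; the filtered version gives up some score to keep all three bridges intact.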