Exciting preliminary results in Grand Challenge puzzles!

Started by David Baker

David Baker Staff Lv 1

First of all, I would like to thank all of you for participating in this most exciting new venture! I think FoldIt has real promise for making it possible for people all over the world and from all different backgrounds to bring their talents and unique skill sets to bear on solving critical problems in biomedicine and bioengineering. And of course, have fun at the same time!

My research group at the University of Washington has been working on the protein structure prediction problem for quite some time. With the Rosetta program and the computing power of Rosetta@home, we can now predict structures accurately for small proteins. The main obstacle preventing us from being able to predict structures of larger and more complex proteins is that finding the correct structure becomes increasingly difficult. The correct structure almost always has lower energy than incorrect structures, so if Rosetta can locate the native structure it can be identified based on its very low energy. We find that Rosetta often gets close to the native structure, but not close enough for the energy to drop significantly. This drop requires a last few twists and contortions that Rosetta, which searches by making random changes and evaluating the energy, is not smart enough to try.

The Grand Challenge puzzles are testing whether FoldIt players can overcome these last obstacles which Rosetta often fails at. The starting points in these puzzles are generally pretty close to the correct structure, but there are one or a small number of errors which prevent the energy from dropping (in FoldIt, to get a score more like a traditional computer game, we take the negative of the energy, so a higher score means lower energy).

The exciting preliminary result is that many of you are able to recognize the problems in the starting structures–many of which are the structures that Rosetta trajectories got stuck at==
and fix these problems remarkably well. Over the next month we will be releasing more puzzles with different types of errors in the starting Rosetta generated structures, and doing a full analysis of the results on these challenges which we will report to you directly and to the world via a scientific publication. Beginning with the central issue of whether human intelligence and intuition can overcome obstacles hindering computers from solving problems like protein folding, here are many fascinating questions, most of which have never been considered before.

Keep up the great work!

David

csohad Lv 1

I would like to have your idea about the secondary structures (helix, sheets and loop) that are given on the beginning of most puzzles.

How reliable their prediction is?

For example,Is there a need to generate helix from loops?


What are the meaning of 1 residue sheet or helix?

Thanks
Ohad

David Baker Staff Lv 1

This is an excellent question! Unfortunately, the accuracy of secondary structure prediction is not perfect–it is around 80%. Most of the errors are at the junctions between loops and helices or between loops and strands. So when you are looking at a puzzle you have to consider the possibility that the place where a loop begins or ends is off by a residue or two.

A 1 residue helix is the same as a loop, a 1 residue strand can still be part of a sheet.

spvincent Lv 1

Thanks for the update, much appreciated.

I'd like to suggest that after a puzzle is complete, it would be nice for someone on the project team to write a short "post mortem" on the puzzle. Nothing elaborate, just a paragraph that explains the particular challenges involved in folding that protein. That, and the ability to download and view the top-scoring solutions (which sounds as if it should be coming soon anyway) would help folders develop a better feel for the structures that proteins can take. The current evolution videos, while mildly entertaining, aren't terribly useful (at least I don't find them so).

JonathanCline Lv 1

It would be very useful (and cool) to see an animation of the moves made in order by the #1 ranked solver. i.e. a set of jpegs rendered into a quicktime movie, to show in slow motion the steps used to reach the "high score". This should include seeing tweak directions, temporary rubber bands, etc.

icecubey1971 Lv 1

David,

Thanks for the info. I found it interesting. You mentioned that the program is pretty good a predicting structures for "small proteins". What is a general size of these small proteins ???

Thanks
Icecubey1971