That is exactly what I want to discuss at the Scientist Chat.
The diversity protocol used is to look at the top-scoring Foldit prediction (across all 3 CASP ROLL puzzles for each target) and submit that model (as long as it is not identical to one of the server models… I will expand on that part in the chat).
Then we look at the next top-scoring Foldit prediction that is completely different from the previous top-scoring model (ie not based on the same server model, or if it was based on the same model initially… it has been dramatically modified)
and so on until we get 5 topologically different top-scoring Foldit solutions.
For Example:
Top-scoring Foldit prediction was based off of Baker2 (the second RosettaServer model).
Next top-scoring Foldit prediction (that wasn't based off of Baker2) was the top-scoring solution from the "Top CASP ROLL Predictions" puzzle that was based off of Quark1 (the first Quark Server prediction).
Next top-scoring Foldit prediction (that wasn't based off Baker2 or Quark1) was based off of Baker4.
Next top-scoring Foldit prediction (that wasn't based off Baker2, Quark1, or Baker4) was based off of Quark2.
And the next top-scoring Foldit prediction was actually also initially based on Quark2, but was so heavily modified that it looks nothing like either the second Quark Server model or the 4th submitted Foldit model above.
The most important thing to realize about the Free Modeling Targets is that they are very difficult and it is rare for any CASP submission to actually get it right (server or human).
Just go to the CASP9 page:
http://predictioncenter.org/casp9/results.cgi?view=tb-sel
and click on the tiny box on the top right that says FM, then click Show Results.
You'll see that the highest GDT_TS score is 71.83 and they quickly go down in the 50s (a GDT score of 100 would be perfect).
Go back to that initial page and select TBM (Template Based Modeling), then click Show Results. You'll see how over half of those GDT scores are above 90.
This means that the most likely all 15 server models that you are working on are completely wrong! As depressing as that might seem, it is actually very exciting because if you are able to come up with even 1 Free Modeling Foldit prediction that is closer to the native than all the other CASP ROLL groups, that would be a remarkable result!
But what this also means is that there is absolutely no reason to submit a model that is just a refinement of a server model (no matter how good it scores). Even if we improve it by 0.1 GDT, the server will get the credit for "nailing" that prediction… not Foldit. There needs to be significant structural changes for us to get credit for it.
Again, I'll bring this up in the second half of the Scientist Chat tomorrow, but to answer auntdeen's second question: it is not worth coming up with a "second best" solution starting from the same server model unless the result is so different that you wouldn't be able to tell that it started from that model. (I hope that makes sense in the context of my previous paragraph).
We will always submit 5 completely different Foldit predictions, even if every single Foldit player all converged on the same server model and those models all scored way better than the next top-scoring Foldit model that was based on a different start.