Placeholder image of a protein
Icon representing a puzzle

1152: Foldit vs. UMich Electron Density Challenge

Closed since over 10 years ago

Advanced Overall Prediction Electron Density

Summary


Created
October 22, 2015
Expires
Max points
100
Description

The structure of this protein was recently solved by crystallographers in the Bardwell Lab at the University of Michigan. Before they publish the structure, they are challenging a class of undergraduate students to build a model into this unrefined density map - and they have invited Foldit players to participate! As a further challenge, both Foldit players and students are provided with the full sequence of the protein; however, note that some parts of the protein are disordered and will not occupy any electron density. Just like the UMich students, Foldit players will have four weeks to develop a protein model that fits this density. Fold this protein into the electron density map to see if you can outperform the UMich students - or even the crystallographers themselves as the UMich students did last year!



Sequence:


SMQQDIVNDHQEEAQGWKWEQIKEIIESGELARLKRSRQMTDKYHEHKKRTAGLDMNQYVLQKLGWSLDEPQLENAAAKAFSSSTLYAVRANDFPYNFEPGVVHLVLWSKVALPVHSPDKAVREAARARMNAFLQAQPLLRPLLSSGHVAWFVNYPELQSVARIFHAHVLLFFPRERYSAEQVKTTVDDILSHGFEPLA

Top groups


  1. Avatar for Anthropic Dreams 100 pts. 18,130
  2. Avatar for Go Science 2. Go Science 89 pts. 17,971
  3. Avatar for Gargleblasters 3. Gargleblasters 78 pts. 17,969
  4. Avatar for Void Crushers 4. Void Crushers 68 pts. 17,966
  5. Avatar for Beta Folders 5. Beta Folders 60 pts. 17,598
  6. Avatar for Deleted group 6. Deleted group pts. 12,516
  7. Avatar for Contenders 7. Contenders 45 pts. 11,480
  8. Avatar for L'Alliance Francophone 8. L'Alliance Francophone 39 pts. 11,088
  9. Avatar for Italiani Al Lavoro 9. Italiani Al Lavoro 33 pts. 10,967
  10. Avatar for FoldIt@Netherlands 10. FoldIt@Netherlands 28 pts. 10,399

  1. Avatar for Susume
    1. Susume Lv 1
    100 pts. 18,108
  2. Avatar for BitSpawn 2. BitSpawn Lv 1 99 pts. 18,053
  3. Avatar for frood66 3. frood66 Lv 1 98 pts. 17,901
  4. Avatar for Bautho 4. Bautho Lv 1 97 pts. 17,764
  5. Avatar for bertro 5. bertro Lv 1 96 pts. 17,496
  6. Avatar for steveB 6. steveB Lv 1 95 pts. 17,433
  7. Avatar for pauldunn 7. pauldunn Lv 1 94 pts. 16,223
  8. Avatar for nemo7731 8. nemo7731 Lv 1 93 pts. 14,739
  9. Avatar for Bruno Kestemont 9. Bruno Kestemont Lv 1 92 pts. 14,506
  10. Avatar for grogar7 10. grogar7 Lv 1 91 pts. 13,111

Comments


beta_helix Staff Lv 1

In the spirit of fairness for this Foldit vs. UMich Electron Density Challenge, the Bardwell Lab at the University of Michigan is kindly sharing with us additional information that they provide their students.

Here are some hints they gave out in class today:

"In order to help place the sequence, it helps to look for groups of large bulky aromatics, like tryptophans, phenylalanines, and tyrosines in the sequence, as these will be the most identifiable spots in the structure."

They also pointed their students to this useful picture:

Bruno Kestemont Lv 1

In addition, I've seen in their paper on 2014:
"Horowitz positioned a single amino acid on the map to show where the protein chain started."

Did they also get this information for puzzle 1152? If it's the case, could we receive the same info ?

bkoep Staff Lv 1

This year, the students were not given the position of any amino acids in the protein.

beta_helix Staff Lv 1

They have only been given the sequence and the exact same secondary structure predictions that were provided on your extended chain.

Any additional information (not related to Coot) that is given to the students by their instructors will be posted here. This is a class after all, so the students will undoubtably ask questions!

I know you already voiced your request for a repeat of their previous experiment:
http://fold.it/portal/node/2001334#comment-31314

but Foldit players showed in the subsequent Foldit puzzle (http://fold.it/portal/node/2000180) that you could solve these ED problems without that information.

This experiment is much more realistic (in how Foldit could help with protein structure determination) because it is unlikely that we would know that information for an unsolved case.

bkoep Staff Lv 1

The UMich class has recently spent some time discussing the topic of sidechain flexibility. We want to make sure Foldit players are aware that some long, flexible sidechains will not fit into any density.

You may have already had some trouble finding density for long sidechains like those of LYS, ARG, GLU, or GLN. When the protein is in solution, these residues are particularly flexible, and shift frequently between different conformations. Often, and especially on the surface of the protein, these residues can retain their flexibility even in the crystal form, when most of the other residues have been locked down into a single, rigid conformation.

The uniformity and rigidity of molecules in a crystal is key to obtaining high-resolution electron density. A flexible, nonuniform residue in a crystal only gives a noisy signal; as a result, these flexible residues appear to have no density in the final density map.

Bruno Kestemont Lv 1

I don't know if it makes sense, but would it be possible to share the best result (I suppose the published one?) as an "expired" 1152b puzzle for us to evaluate it and see our differences?

(and who knows, even try to further evolve it :)

It can be useful for learning.

Bruno Kestemont Lv 1

I think it's useful as a technique for hand folding first steps of EDs in Foldit:

"The trained crystallographers reported using the following approach to the puzzle, which corresponds well with what the instructor observed with many of the undergraduate students. First, they looked for large density blobs that might correspond to large aromatic side chains such as Trp, Tyr or Phe. Working forwards and backwards from the Trp–Phe–Val–Asn sequence proved particularly useful. Modelling in a few of these large residues led to the assignment of density to sequence location. The direction of the polypeptide chain was reversed on a few occasions, but was fixed by looking at the carbonyl density. The Find Secondary Structure tool in Coot was used, especially for regions where the density was poor. Real Space Refine Zone was used with the refinement weight set to 20 or 10, based on the instructor’s suggestion for building in an unrefined map. Regions where the density was very poor and decisions had to be made about whether to keep trying to build or not proved to be the hardest part of the task. The trained crystallographers reported that at first they did build in these sections of poor electron density. However, when they realized the extent of the guessing involved, they subsequently removed most of the model in these areas. After modelling in the residues, the trained crystallographers used the validation tools in Coot, including Ramachandran plot, Rotamer analysis and Density Fit analysis, which flagged areas with poor geometry. They also ran the structure through MolProbity, which gave similar results to the Coot validation tools. Finally, the crystallographers fixed problem areas as best as possible with the Coot modelling tools, such as Flip Peptide, Rotamers, Regularize Zone and Real Space Refine Zone. When asked to describe the difficulty level of this assignment, the trained crystallographers rated it as somewhat difficult (on a scale of: very difficult, somewhat difficult, neither easy nor difficult, somewhat easy and very easy)."