Placeholder image of a protein
Icon representing a puzzle

2027: Design with AlphaFold Predictions

Closed since over 4 years ago

Intermediate Overall Design

Summary


Created
August 04, 2021
Expires
Max points
100
Description

Design a protein with the help of the new AlphaFold prediction tool! AlphaFold is an algorithm from the company DeepMind and can predict protein structures with incredible accuracy. We want to know if Foldit players can use feedback from AlphaFold to design new proteins! After you've designed a protein, you can upload your solution to the Foldit server and get back an AlphaFold prediction about how your design will actually fold up. Players should aim for an AlphaFold confidence and similarity of 80% or more. See the blog for more details about the AlphaFold prediction tool.



We are especially interested to see what creative new folds players might come up with. We already know that Foldit players can design simple helix bundles or ferredoxin-like ("surfing hotdog") proteins. We are hoping that AlphaFold will allow players to explore protein designs with all beta sheets, or folds with a recessed binding pocket where a ligand could fit. After this puzzle closes, we'll highlight our favorite designs in the next Lab Report video update on September 1.



In addition to the typical extended chain, this puzzle has four starting structures of designed proteins that you may like to use as a starting point. Reset the puzzle to cycle through the different starting structures. There are no Objectives, but we expect successful designs will still need lots of helices or sheets, with short minimal loops, as well as a closely-packed core of orange hydrophobic residues. AlphaFold predictions will not affect your score. Players may insert and delete residues to a maximum of 120 residues total. This special puzzle will remain online until August 31 at 23:00 GMT.

Top groups


  1. Avatar for Anthropic Dreams 100 pts. 13,412
  2. Avatar for Contenders 2. Contenders 77 pts. 13,398
  3. Avatar for Go Science 3. Go Science 58 pts. 13,385
  4. Avatar for Beta Folders 4. Beta Folders 43 pts. 13,360
  5. Avatar for L'Alliance Francophone 5. L'Alliance Francophone 31 pts. 12,791
  6. Avatar for Gargleblasters 6. Gargleblasters 22 pts. 12,705
  7. Avatar for Marvin's bunch 7. Marvin's bunch 15 pts. 12,695
  8. Avatar for Deleted group 8. Deleted group pts. 12,518
  9. Avatar for Void Crushers 9. Void Crushers 7 pts. 12,454
  10. Avatar for BOINC@Poland 10. BOINC@Poland 5 pts. 12,389

  1. Avatar for artsyambie6 191. artsyambie6 Lv 1 1 pt. 7,383
  2. Avatar for nn1234 192. nn1234 Lv 1 1 pt. 1,672
  3. Avatar for sakshamphul 193. sakshamphul Lv 1 1 pt. 0
  4. Avatar for Threeoak 194. Threeoak Lv 1 1 pt. 0
  5. Avatar for apetrides 195. apetrides Lv 1 1 pt. 0
  6. Avatar for memm0740 196. memm0740 Lv 1 1 pt. 0
  7. Avatar for Keresto 197. Keresto Lv 1 1 pt. 0
  8. Avatar for PeterDav 198. PeterDav Lv 1 1 pt. 0
  9. Avatar for prooh 199. prooh Lv 1 1 pt. 0
  10. Avatar for gdnskye 200. gdnskye Lv 1 1 pt. 0

Comments


Susume Lv 1

If I've reached agreement with AF on the topology (confidence >80), and AF and foldit are now duking it out about how big the voids will be or what angle the helix will lie against the sheet (similarity never quite reaching 80), is it really useful to keep pushing for them to agree, or can I call that one done and move on to something else?

And which version do scientists want to see? I guess if I upload mine you can always run AF on it yourself? But if you will discard it due to similarity < 80%, I'll be sad.

bkoep Staff Lv 1

Please share your Foldit-optimized design (not the raw AlphaFold prediction)! We can easily run AlphaFold predictions for your shared designs, and we like to see a players' best attempts in Foldit.

. . . is it really useful to keep pushing for them to agree, or can I call that one done and move on to something else?

We are still uncertain about this. Among previous Foldit designs, we have seen that AlphaFold similarity is slightly better than AlphaFold confidence for predicting successful designs. That suggests that, yes, it's worth it to keep refining your Foldit solution until AlphaFold similarity is greater than 80%.

However, we don't know how well that holds if you are iteratively loading AlphaFold solutions, refining them in Foldit, resubmitting to AlphaFold, re-refining in Foldit, etc. . . . This kind of strategy might "artificially" boost your AlphaFold similarity value without making meaningful improvements to your design.

Either way, I would hope that you could eventually find a high-scoring Foldit solution that also has a high AlphaFold similarity. If the differences cannot be reconciled, that might suggest a deeper problem (so you should share examples with scientists!).

Bautho Lv 1

Can you say anything about the time, how long it will take to run the prediction?
Does it depend on the number of residues?

Bautho Lv 1

… when loading solution from AlphaFold.
I can share the original design to scientists :)

bkoep Staff Lv 1

Yes, please share with scientists! We've received sporadic crash reports, but have had difficulty reproducing them.

spvincent Lv 1

I'm seeing this too. Doesn't happen with all AF-generated solutions but when it does it crashes consistently. Scientist-shared solution t4_c, when fed to AF, generates a solution which crashes when you try to load it.

LociOiling Lv 1

This is a multistart puzzle, which is explained in the last paragraph of the puzzle description, which I hadn't read until just now.

When I started the puzzle, I saw a straight "extended chain", so I assumed that was that….

Resetting the puzzle gives you a different structure. Structure 5 is the extended chain. You can reset by ctrl-r or Undo => Reset Puzzle. The first time you open the puzzle, Foldit picks a structure at random.

The structure number appears briefly in a pop-up when you reset. The segment information window also contains the structure number in the third line. To open the segment information window, hover over any segment and hit the tab key.

I used Jpred to search for matches to the primary structure (amino acid sequence) of structures 1 through 4. Jpred looks for matches to "solved" proteins in the PDB (Protein Data Bank, at rcsb.org), and also in the UniProt database of protein sequences. Unlike the PDB, entries in UniProt may not have a known 3D structure. (Maybe AlphaFold can get that sorted.) Jpred doesn't always find PDB matches.

The results were:

  • Structure 1 - length 43 - JPred finds no matches, but rscb.org finds PDB 5UOI
  • Structure 2 - length 68 - matches 6MRR, player-designed protein Foldit1
  • Structure 3 - length 106 - no PDB match, but lots of UniProt matches
  • Structure 4 - length 109 - matches PDB 6D0T (chains A and B) </ul> So three of the structures have been "solved", and structure 2 was designed by Foldit players in the first place. Structure 3 shows UniProt matches, where you'll quickly see things like "Cluster: SnoaL-like domain-containing protein" and *Burkholderia pyrrocinia*, part of the Burkholderia genus. Interesting, but not as helpful as having a 3D view available. If you look up "SnoaL", you may find "the proteins in this family adopt a distorted alpha-beta barrel fold", and see PDB 1SJW as an example. (Edit: search at rcsb.org found match for structure 1, Jpred is not perfect.)

Susume Lv 1

Structure 3 is the type of curved beta-sheet design that was featured in the Marcos paper from Baker lab - PDB 5L33 is another example.

Bautho Lv 1

I agree. Even when submitting the same structure again, it crashes every time you load any of the solutions.

beta_helix Staff Lv 1

Thank you all for your scientist shares, they helped us uncover the issue.
We'll post a hotfix soon!