Foldit

2027: Design with AlphaFold Predictions

Closed since over 4 years ago

Summary

Created: August 04, 2021
Expires: August 31, 2021 at 23:00 UTC
Max points: 100

Description

Design a protein with the help of the new AlphaFold prediction tool! AlphaFold is an algorithm from the company DeepMind and can predict protein structures with incredible accuracy. We want to know if Foldit players can use feedback from AlphaFold to design new proteins! After you've designed a protein, you can upload your solution to the Foldit server and get back an AlphaFold prediction about how your design will actually fold up. Players should aim for an AlphaFold confidence and similarity of 80% or more. See the blog for more details about the AlphaFold prediction tool.

We are especially interested to see what creative new folds players might come up with. We already know that Foldit players can design simple helix bundles or ferredoxin-like ("surfing hotdog") proteins. We are hoping that AlphaFold will allow players to explore protein designs with all beta sheets, or folds with a recessed binding pocket where a ligand could fit. After this puzzle closes, we'll highlight our favorite designs in the next Lab Report video update on September 1.

In addition to the typical extended chain, this puzzle has four starting structures of designed proteins that you may like to use as a starting point. Reset the puzzle to cycle through the different starting structures. There are no Objectives, but we expect successful designs will still need lots of helices or sheets, with short minimal loops, as well as a closely-packed core of orange hydrophobic residues. AlphaFold predictions will not affect your score. Players may insert and delete residues to a maximum of 120 residues total. This special puzzle will remain online until August 31 at 23:00 GMT.

Top groups

1. Anthropic Dreams
100 pts. 13,412
2. Contenders 77 pts. 13,398
3. Go Science 58 pts. 13,385
4. Beta Folders 43 pts. 13,360
5. L'Alliance Francophone 31 pts. 12,791
6. Gargleblasters 22 pts. 12,705
7. Marvin's bunch 15 pts. 12,695
8. Deleted group pts. 12,518
9. Void Crushers 7 pts. 12,454
10. BOINC@Poland 5 pts. 12,389

191. artsyambie6 Lv 1 1 pt. 7,383
192. nn1234 Lv 1 1 pt. 1,672
193. sakshamphul Lv 1 1 pt. 0
194. Threeoak Lv 1 1 pt. 0
195. apetrides Lv 1 1 pt. 0
196. memm0740 Lv 1 1 pt. 0
197. Keresto Lv 1 1 pt. 0
198. PeterDav Lv 1 1 pt. 0
199. prooh Lv 1 1 pt. 0
200. gdnskye Lv 1 1 pt. 0

Comments

Susume Lv 1

August 06, 2021

If I've reached agreement with AF on the topology (confidence >80), and AF and foldit are now duking it out about how big the voids will be or what angle the helix will lie against the sheet (similarity never quite reaching 80), is it really useful to keep pushing for them to agree, or can I call that one done and move on to something else?

And which version do scientists want to see? I guess if I upload mine you can always run AF on it yourself? But if you will discard it due to similarity < 80%, I'll be sad.

bkoep Staff Lv 1

August 06, 2021

Please share your Foldit-optimized design (not the raw AlphaFold prediction)! We can easily run AlphaFold predictions for your shared designs, and we like to see a players' best attempts in Foldit.

. . . is it really useful to keep pushing for them to agree, or can I call that one done and move on to something else?

We are still uncertain about this. Among previous Foldit designs, we have seen that AlphaFold similarity is slightly better than AlphaFold confidence for predicting successful designs. That suggests that, yes, it's worth it to keep refining your Foldit solution until AlphaFold similarity is greater than 80%.

However, we don't know how well that holds if you are iteratively loading AlphaFold solutions, refining them in Foldit, resubmitting to AlphaFold, re-refining in Foldit, etc. . . . This kind of strategy might "artificially" boost your AlphaFold similarity value without making meaningful improvements to your design.

Either way, I would hope that you could eventually find a high-scoring Foldit solution that also has a high AlphaFold similarity. If the differences cannot be reconciled, that might suggest a deeper problem (so you should share examples with scientists!).

Bautho Lv 1

August 07, 2021

Can you say anything about the time, how long it will take to run the prediction?
Does it depend on the number of residues?

Bautho Lv 1

August 07, 2021

… when loading solution from AlphaFold.
I can share the original design to scientists :)

bkoep Staff Lv 1

August 08, 2021

Yes, please share with scientists! We've received sporadic crash reports, but have had difficulty reproducing them.

spvincent Lv 1

August 08, 2021

I'm seeing this too. Doesn't happen with all AF-generated solutions but when it does it crashes consistently. Scientist-shared solution t4_c, when fed to AF, generates a solution which crashes when you try to load it.

LociOiling Lv 1

August 08, 2021

This is a multistart puzzle, which is explained in the last paragraph of the puzzle description, which I hadn't read until just now.

When I started the puzzle, I saw a straight "extended chain", so I assumed that was that….

Resetting the puzzle gives you a different structure. Structure 5 is the extended chain. You can reset by ctrl-r or Undo => Reset Puzzle. The first time you open the puzzle, Foldit picks a structure at random.

The structure number appears briefly in a pop-up when you reset. The segment information window also contains the structure number in the third line. To open the segment information window, hover over any segment and hit the tab key.

I used Jpred to search for matches to the primary structure (amino acid sequence) of structures 1 through 4. Jpred looks for matches to "solved" proteins in the PDB (Protein Data Bank, at rcsb.org), and also in the UniProt database of protein sequences. Unlike the PDB, entries in UniProt may not have a known 3D structure. (Maybe AlphaFold can get that sorted.) Jpred doesn't always find PDB matches.

The results were:

Structure 1 - length 43 - JPred finds no matches, but rscb.org finds PDB 5UOI
Structure 2 - length 68 - matches 6MRR, player-designed protein Foldit1
Structure 3 - length 106 - no PDB match, but lots of UniProt matches
Structure 4 - length 109 - matches PDB 6D0T (chains A and B) </ul> So three of the structures have been "solved", and structure 2 was designed by Foldit players in the first place. Structure 3 shows UniProt matches, where you'll quickly see things like "Cluster: SnoaL-like domain-containing protein" and *Burkholderia pyrrocinia*, part of the Burkholderia genus. Interesting, but not as helpful as having a 3D view available. If you look up "SnoaL", you may find "the proteins in this family adopt a distorted alpha-beta barrel fold", and see PDB 1SJW as an example. (Edit: search at rcsb.org found match for structure 1, Jpred is not perfect.)