Placeholder image of a protein
Icon representing a puzzle

1817: Coronavirus ORF6 Prediction

Closed since almost 6 years ago

Intermediate Overall Prediction

Summary


Created
March 26, 2020
Expires
Max points
100
Description

Refold this coronavirus protein! This protein is encoded in the viral genome of SARS-CoV-2, in a region called ORF6, but the protein's structure is still unknown. Evidence suggests this protein inhibits the natural immune response, helping the virus survive and replicate. If we knew how this protein folds, we might be able to figure out exactly how it inhibits the immune system. The puzzle's starting structure shows SS predictions from PSIPRED, and hints which parts of the protein might fold into helices or sheets. Refold this protein to find high-scoring solutions, which will tell us how this protein is most likely to fold!



Sequence:


MFHLVDFQVTIAEILLIIMRTFKVSIWNLDYIINLIIKNLSKSLTENKYSQLDEEQPMEID

Top groups


  1. Avatar for Hold My Beer 100 pts. 9,477
  2. Avatar for Go Science 2. Go Science 87 pts. 9,395
  3. Avatar for Gargleblasters 3. Gargleblasters 74 pts. 9,366
  4. Avatar for Anthropic Dreams 4. Anthropic Dreams 64 pts. 9,286
  5. Avatar for Contenders 5. Contenders 54 pts. 9,276
  6. Avatar for Void Crushers 6. Void Crushers 46 pts. 9,268
  7. Avatar for Beta Folders 7. Beta Folders 38 pts. 9,160
  8. Avatar for BOINC@Poland 8. BOINC@Poland 32 pts. 9,066
  9. Avatar for Marvin's bunch 9. Marvin's bunch 27 pts. 9,062
  10. Avatar for SETI.Germany 10. SETI.Germany 22 pts. 8,965

  1. Avatar for robgee 11. robgee Lv 1 96 pts. 9,161
  2. Avatar for bertro 12. bertro Lv 1 95 pts. 9,160
  3. Avatar for Timo van der Laan 13. Timo van der Laan Lv 1 95 pts. 9,154
  4. Avatar for KarenCH 14. KarenCH Lv 1 94 pts. 9,140
  5. Avatar for bnmoore 15. bnmoore Lv 1 94 pts. 9,139
  6. Avatar for blorppppp 16. blorppppp Lv 1 93 pts. 9,135
  7. Avatar for Serca 18. Serca Lv 1 92 pts. 9,135
  8. Avatar for Galaxie 19. Galaxie Lv 1 92 pts. 9,129
  9. Avatar for infjamc 20. infjamc Lv 1 91 pts. 9,114

Comments


Serca Lv 1

Psipred prediction gives long helix just like puzzle start structure.
It seems that this protein is transmembrane, but when you wiggle it enough that helix breaks on two parts. So it is possible that both tails of the protein are on the same side of the cell.

bkoep Staff Lv 1

Conf: 9630646303999999999985389986999999999998102000242227657998789
Pred: CCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCC
  AA: MFHLVDFQVTIAEILLIIMRTFKVSIWNLDYIINLIIKNLSKSLTENKYSQLDEEQPMEID
              10        20        30        40        50        60

jeff101 Lv 1

In the above PSIPRED Predictions, what do the Conf numbers mean?
Is 0 the lowest confidence and 9 the highest?
Is 1 the lowest confidence and 0 (standing for 10) the highest?

Serca Lv 1

I found a paper from 2007 that researches the functions of the ORF6 protein from a different SARS strain.
It states that its OFR6 is a membrane protein: "we demonstrate that SARS-COV ORF6 protein is localized to the endoplasmic reticulum (ER)/Golgi membrane in infected cells"

If the 1817 OFR6 protein is transmembrane, that means that exists is in non-aquatic enviroment, so hydrophilics should be buried and orange sidechains of this protein have to be exposed outside. In this case we just cannot get high Foldit score for best solutions, isn't it?

Paper:
Severe acute respiratory syndrome coronavirus ORF6 antagonizes STAT1 function by sequestering nuclear import factors on the rough endoplasmic reticulum/Golgi membrane.

jeff101 Lv 1

Doesn't PSIPRED give for each segment the
probability that the segment is helix (H),
sheet (E), or loop (coil so C)? I think 
these 3 probabilities should total to 1
for each segment. Doesn't PSIPRED also 
give diagrams with 3 letters of different 
heights for each segment, so some segments 
have large H's but small E's and C's, while 
other segments have large C's but small H's 
and E's? Would someone please post a chart 
with the HEC probabilities or an image with 
the HEC letters given in different sizes?

Thanks!

bkoep Staff Lv 1

Cool! I hadn't seen this work!

Glancing at the paper just now, I don't see terribly strong evidence that ORF6 is itself embedded in the membrane. It does look like ORF6 associates with the ER, but that could also just mean that it binds another molecule at the ER. Also, just looking at the ORF6 sequence, I don't see anything that screams "transmembrane" (like a long stretch of hydrophobics).

That said, I have not studied this particular protein, so I can't claim any authority on this subject—there may be other evidence that ORF6 is membrane-embedded! If it is, you are correct that the default Foldit score is inappropriate for structure prediction. To predict the structure of a membrane-bound protein, we would want to adjust our score function to account for the non-aquatic environment.

DoctorSockrates Lv 1

It might not be perfectly hydrophobic, but I did end up with a triple helix solution with two "membrane embedded" helices that I tried my hand at satisfying the polar atoms for. Could always be the case that it links up with another protein and offers quaternary structure to cover the rest.

Steven Pletsch Lv 1

is psipred capable of predicting pi helices and 310 helices ? if so does it specify this in its output ? or is it simply ptedictin alpha/beta/loop