Placeholder image of a protein
Icon representing a puzzle

2210: SARS-CoV-2 helicase CACHE Challenge: Round 3

Closed since over 3 years ago

Intermediate Overall Small Molecule Design

Summary


Created
October 07, 2022
Expires
Max points
100
Description

Compete in a challenge to design a drug targeting the SARS-CoV-2 helicase. Use the small molecule design tools and the compound library panel to find library compounds which bind to the active site of the enzyme.

Note: To get the most out of the small molecule design tools, we recommend changing you view settings to the Small Molecule Design Preset.

This puzzle is part of Foldit's participation in the CACHE Challenge. From the set of all compounds submitted in the multiple rounds of puzzles, Foldit scientists will select up to 100 compounds based on the CACHE-provided criteria. Only compounds which are in a commercially available library will be selected, so it's beneficial to make use of the Compound Library panel to search for library compounds similar to your current design. But don't limit yourself to the compound library. You're more likely to get good results by alternating: optimizing the molecule with the small molecule design tools, find the closest library compound, then further refine it with the design tools.

The main change from the second round is the addition of the "Fraction of four-bonded carbon" filter. Compounds which have too many double and triple bonds work poorly as drugs, so the CACHE organizers evaluate molecules based on the fraction of sp3 hybrizdized carbons -- basically, the fraction of carbons bonded with single bonds to four other atoms.

The other change is to the Compound Library panel. Results should now be pre-filtered to get rid of compounds with really bad objective scores, allowing you to focus on results which are more likely to score well.

Participation in CACHE puzzles is subject to the CACHE Terms of Participation, in particular “the Challenge IP [including Challenge Compounds] will be made freely available in the public domain pursuant to Creative Commons Attribution Only (CC-BY 4.0 or subsequent versions) licensing terms, with the intent that such Challenge IP may be Used and practiced by Users for any purpose”.

Top groups


  1. Avatar for Go Science 100 pts. 18,194
  2. Avatar for Contenders 2. Contenders 70 pts. 18,083
  3. Avatar for Anthropic Dreams 3. Anthropic Dreams 47 pts. 17,949
  4. Avatar for Gargleblasters 4. Gargleblasters 30 pts. 17,769
  5. Avatar for Marvin's bunch 5. Marvin's bunch 19 pts. 17,537
  6. Avatar for Australia 6. Australia 11 pts. 17,287
  7. Avatar for Hold My Beer 7. Hold My Beer 7 pts. 17,120
  8. Avatar for FamilyBarmettler 8. FamilyBarmettler 4 pts. 17,100
  9. Avatar for L'Alliance Francophone 9. L'Alliance Francophone 2 pts. 17,070
  10. Avatar for Team China 10. Team China 1 pt. 16,952

  1. Avatar for Sandrix72
    1. Sandrix72 Lv 1
    100 pts. 18,194
  2. Avatar for guineapig 2. guineapig Lv 1 94 pts. 18,064
  3. Avatar for MicElephant 3. MicElephant Lv 1 88 pts. 18,041
  4. Avatar for gmn 4. gmn Lv 1 83 pts. 17,946
  5. Avatar for LociOiling 5. LociOiling Lv 1 78 pts. 17,931
  6. Avatar for HuubR 6. HuubR Lv 1 73 pts. 17,769
  7. Avatar for Aubade01 7. Aubade01 Lv 1 68 pts. 17,766
  8. Avatar for Bletchley Park 8. Bletchley Park Lv 1 63 pts. 17,708
  9. Avatar for ucad 9. ucad Lv 1 59 pts. 17,594
  10. Avatar for carxo 10. carxo Lv 1 55 pts. 17,547

Comments


LociOiling Lv 1

The Markdown for the "sp3 hybrizdized" link didn't exactly work in the post above.

Here's the plain link: https://www.khanacademy.org/science/ap-chemistry-beta/x2eef969c74e0d802:molecular-and-ionic-compound-structure-and-properties/x2eef969c74e0d802:bond-hybridization/v/organic-hybridization-practice

I'm also not sure why they didn't post the usual information about the objectives.

We now have a "fraction of four-bonded carbons" objective. I think that was also a Sherlock Holmes case.

rmoretti Staff Lv 1

Objectives

Objectives in this puzzle are driven primarily by the evaluation criteria used by CACHE.

Maximum bonus: +8 000

Torsion Quality (max +1000)
Keeps bond rotations in a good range. Using Wiggle or Tweak Ligand can fix bad torsions. (Show highlights torsions to be rotated.)

Number of Rotatable Bonds (max +1000)
Intended to keep the ligand from getting too big and floppy. You can reduce rotatable bonds by deleting groups or forming rings. (Show highlights rotatable bonds.)

Ligand TPSA (max +1000)
Topological Polar Surface Area - Keeps the polar surface area (including buried polar surface) low. To improve, try removing oxygens and nitrogens. (Show highlights atoms contributing to higher TPSA.)

Ligand cLogP (max +1000)
A measure of polarity - Keeps the molecule from getting too hydrophobic. To improve, try adding polar oxygens and nitrogens. (Show highlights atoms contributing to higher cLogP.)

Fraction of four-bonded carbons (max +1000)
Measures how carbons with bonds to four atoms ("sp3 hybridized") there are. Too few (too many double and triple bonded carbons) is bad. (Show highlights carbon atoms at issue.)

Bad Groups (max +1000)
Gives a bonus for avoiding groups that interfere with assays, or which are far from the compounds in the library. (Show highlights groups at issue.)

Molecular Weight (max +1000)
Keeps the ligand a reasonable size.

Synthetic Accessibility (max +1000)
Keeps the ligand from going too far from the compounds in the library. (Show highlights parts of the molecule at issue.)

rmoretti Staff Lv 1

@spvincent The compounds theoretically should be ordered (most similar to least), but it looks like they may be getting scrambled. I'm looking into a fix.

Bruno Kestemont Lv 1

One of my team player found a solution with only one residue (in the ligand) and still 8000 full bonus (and almost 17 000 pts).
My higher scoring ligands are not centered in the protein, but stick at the "surface" (which could be on a hidden part of the protein).
Are there means to avoid these misleading high scores by adding more filters in the futire ?

rmoretti Staff Lv 1

So far we haven't had issues with decent scoring ligands being out of the pocket. If we do, there are additional objectives we can add in to try to keep things within the pocket.

argyrw Lv 1

the objectives is necessary for the puzzle?
the points down is negative when one objectives is bad

argyrw Lv 1

I have share 2 shares of puzzle 2210 with points 4935 which have 4 hydrogen and after the same with one residue with have 2 hydrogen with 16361 points in the second the objectives is 7600 can you tell me which is wrong? I have share with my group and with the science

rmoretti Staff Lv 1

The objectives are necessarily for this puzzle – they're measuring a number of compound quality measures which otherwise wouldn't be captured by the Foldit score. We can see a definite difference in compound quality of the top scoring solutions when we adjust the objective settings. We generally set objectives to be positive (with worse structures given a lower bonus), but if things are really bad they may go negative in some instances.

For your shares the main issue is the Bad Group objective – for the one which has a -397924 objective score it's saying there's 217 bad groups on the ligand. You can click "Show" to highlight the atoms at issue. You'll need to change the atom identities and connectivity to get rid of the Bad Group scoring. (I'd certainly recommend monitoring the objectives as you build the ligand, rather than trying to fix things up later.)

Note that 217 might be an over-estimate of the number of atoms involved, as there's potentially a bunch of double counting. Certain bad group patterns are written as disconnected patterns, and having multiple ways of expressing the same combination is counted as separate instances. In your case, the pattern you're most running afoul of is a nitrogen one. The pattern matches on 4 nitrogens of particular types, and it looks like you have 8 or more (depending on solution). So you'll get the combinitorial effect of "8 choose 4" = 70 (or more) instances of a bad group for just that pattern, not to mention potential double counting of other patterns. – This is something I may look to change, as the combinitorial effect may be over penalizing things, but ideally there shouldn't be any bad groups, and the objective does capture that.