Custom Puzzles

Started by Gamer127

rosie4loop Lv 1

(Edited for clarity and fix typos)

Always happy to see a plan before the start of a project; it is useful for estimating the resources required and the skills you need to pick up. It is good to have a motivation to start a project. Indeed, drafting a research plan and searching for the tools you would like to use is good practice for any researcher.

So I had some fun trying to comment on it during a break, imagining this is an idea from researchers who work in other fields.

Let's break it down into different aspects to see if it is good to execute the current plan.

I'm trying to create an enzyme to fix a mutated protein into its original shape and function.

(1) Science question:

  • Mutants M1, M2, M3… of a protein (let's call it P) lost their function due to mutation
  • Key assumption here:
    • mutants M1, M2, M3… no longer function because of the changed shape upon mutation
    • which means:
      • if protein P is an enzyme, the mutated sites are not directly involved in the reaction
      • if protein P is a regulator that inhibits or activates other proteins, or if P is a signaling molecule that affects other downstream processes, it can still bind its binding partner after the shape is corrected
      • if protein P is a structural protein, the function can be restored by holding the shape with something else

(2) Main project objective:

  • Restore the function of protein P from M1, M2, M3…
  • by designing a new enzyme (let's call it E)
  • E turns the shape of M1, M2, M3… back into the shape of P so that the function is restored

(3) What you need according to your statement:

  • The correct shape of wildtype P
  • Sequences of M1, M2, M3…, and their structures built from the correct shape of P
  • "Wrong" structure of M1, M2, M3…
  • Enzyme E, assuming you need to design it, that restores the shape of M1, M2, M3… to the shape of P
    • E needs to bind the "wrong" structure of M1, M2, M3… to initiate the whole process (!!)
    • After binding to the "wrong" M1, M2, M3…, E needs to force them back into the shape of P. (This is very challenging to investigate or to design, since E needs to induce a change upon binding. It may need energy.)
    • For an initial design in a small project, it may be OK to use only one mutant form (?)
  • Metrics to decide whether the designed enzyme E is good enough for your purpose
  • Experimental validation

(4) Intermediate goals you need to achieve to get the items in (3):

  • get the structure of P (if exists)
  • get the structure of M1, M2, M3…
  • build the "corrected" fold of M1, M2, M3
  • design the enzyme E
  • evaluate different designs of enzyme E.
  • (likely out of the scope of the project) model the process of E + M (wrong) -> E + M (correct)
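
For the "evaluate different designs" goal, one possible number to look at is how far a "corrected" mutant model is from the wildtype P structure. A minimal sketch, assuming you already have matched C-alpha coordinates for both structures and that they are already superimposed (this does no alignment). Using RMSD here is my own suggestion, not something from the original plan, and a low RMSD says nothing by itself about whether the function is restored. The coordinates below are toy values.

```python
import math


def rmsd(coords_a, coords_b):
    """Root-mean-square deviation between two lists of (x, y, z) tuples.

    Assumes both lists pair up the same atoms in the same order and that
    the two structures are already superimposed; no fitting is done here.
    """
    if len(coords_a) != len(coords_b) or not coords_a:
        raise ValueError("need two non-empty coordinate lists of equal length")
    total = 0.0
    for (ax, ay, az), (bx, by, bz) in zip(coords_a, coords_b):
        total += (ax - bx) ** 2 + (ay - by) ** 2 + (az - bz) ** 2
    return math.sqrt(total / len(coords_a))


# Toy C-alpha coordinates (hypothetical, in angstroms), not a real protein.
wildtype_p = [(0.0, 0.0, 0.0), (3.8, 0.0, 0.0), (7.6, 0.0, 0.0)]
corrected_m1 = [(0.0, 0.5, 0.0), (3.8, 0.0, 0.5), (7.6, 0.0, 0.0)]
print(round(rmsd(wildtype_p, corrected_m1), 3))
```

In practice you would superimpose the structures first with a structure-analysis tool, then compare only the residues that matter for function.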

(5) Comments and unanswered questions on each proposed step

I will collect the amino acid sequence of each mutated protein,

  • First of all, double-check whether the key assumption is valid: that "P -> M1, M2, M3… mutants lost their functions because of conformational change upon mutation"
  • Are there any known structure(s) of the mutated proteins? If not, would it be important to model their structures first?
  • Which of the following would you like to target? Is it the structure of the "wrong" shape of M1, M2, M3…? Or the structure of the "corrected" M1, M2, M3…?
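
Once the sequences are collected, a quick first check is to list exactly where each mutant differs from P, so you can see whether the mutated positions sit in a functional site. A minimal Python sketch, assuming the mutant and wildtype sequences have the same length (point substitutions only, no insertions or deletions). The function name `list_point_mutations` and the sequences are made up for illustration.

```python
def list_point_mutations(wild_type, mutant):
    """Return substitutions in the usual 'I6V' style, with 1-based positions.

    Assumes equal-length sequences, i.e. point substitutions only.
    Sequences with insertions or deletions need a proper alignment first.
    """
    if len(wild_type) != len(mutant):
        raise ValueError("sequences differ in length; align them first")
    return [
        f"{wt}{pos}{mut}"
        for pos, (wt, mut) in enumerate(zip(wild_type, mutant), start=1)
        if wt != mut
    ]


# Hypothetical toy sequences, not a real protein.
P = "MKTAYIAKQR"
M1 = "MKTAYVAKQR"
print(list_point_mutations(P, M1))  # ['I6V']
```

If a listed position is part of the active site or a binding interface, the "loss of function is only due to shape" assumption probably does not hold for that mutant.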

and one at a time run foldit with it creating a new enzyme strand that will alter the mutated protein to its correct shape. (I understand that is not built in to Foldit, so I will create my own system using Python and the Script window provided in-game)

This can be broken down into multiple steps (if I get it right):

  1. design the backbone of the enzyme E that binds M1, M2, M3…
  2. optimize the sequence of E to produce the fold you want
  3. predict the interaction between E and M1, M2, M3…
  4. predict if E interacts with the "wrong" structure of protein
  5. predict if E changes the shape of M1, M2, M3… upon binding
  6. predict whether the corrected structures of M1, M2, M3… can serve their function

(1)-(3) may be possible once you decide on the target, but (4)-(6) are difficult, and Foldit may not be the ideal tool for them even if you can do exhaustive sampling.

Also, how can you decide that the enzyme can alter the mutated protein to the correct shape? These kinds of changes are complicated. It's already difficult to design a binder, and even harder to design a machine that works after binding. (I'm curious whether you know a tool that does it; let me know what it is!)

Afterwards, I will manually add in blocking structures to avoid the enzyme from collapsing.

  • If you design a folded enzyme, why do you need a blocking structure to keep it from collapsing?
  • Does it mean you just design a certain motif that "holds the shape", but without a stable structure? So the blocking structure is also a protein?

I will check my solution by having the enzyme in contact of every protein type it may encounter individually from all angles.

  • I am not sure about the purpose of this step, or whether I have misunderstood something. Why do you need to bind it from all angles?
  • If you design the enzyme to target a structure, or if protein P is known to bind something, you should already know where you want it to bind. If the interface is unknown, it'd be difficult to evaluate.
  • How do you decide whether contact from a certain angle is important or useful?
  • Predicting protein-protein interactions without a known interface is challenging (actually still challenging even if the interface is known). How do you decide that a given binding mode is relevant?

(6) Other comments

  • I don't know the scale or level of this project. Is it a class project? An honors project for final-year students? Or a personal project done purely out of interest? How much time will you have to finish it?
  • If it is just a class project, getting a rough prediction of the mutant structures and discussing how the mutations affect the functional sites may already take a long time.
  • If you decide to use either the "correct" or the "wrong" fold of M1, M2, M3… as the binding target of an initial design
    • you need to give the reason for using this conformation as the target
    • let's say that at this stage, designing a binder could be good enough within the time limit. So you take the "correct" shape of the mutants and try to bind something to it to stabilize it.
    • If you use Foldit, you need the PDB of the "corrected" M1, M2, M3… as the target, and a dummy chain for binder design, like the beginner binder design puzzles in Foldit.
    • Manually designing a binder can be challenging and time-consuming (ask the rest of the Foldit community). If you don't have enough time for that, you may try another tool, "RFdiffusion", which like Foldit is related to Rosetta. You can try it for free on Google Colab (https://colab.research.google.com/github/sokrypton/ColabDesign/blob/v1.1.1/rf/examples/diffusion.ipynb). That tool lets AI do the design instead of a human.
      • Even if you have a design of E that binds one form of M1, M2, M3…, it's difficult to check whether it could turn them back into the correct shape.
  • Likely out of the scope of your project
    • If more time is allowed and you have the resources and skills, I'd run some simulations of the WT and mutants to investigate stability and conformational changes (this needs other methods, not Foldit; but it is more my personal interest in knowing what happens).
    • There are some AI tools that predict the change in activity and thermodynamic properties upon mutation. Sometimes they also explain the reason from a structural perspective.
    • Modeling the process of E + M (wrong) -> E + M (correct) is challenging even for scientists. There are too many variables, requiring many (computational) experiments. You would also need to break the project down into smaller parts.
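
For the Foldit binder-design route mentioned above (a target PDB plus a dummy chain), it can be worth checking which chains a PDB file actually contains before trying to use it. A small sketch that relies only on the standard fixed-column PDB layout, where the chain ID is the 22nd character of each ATOM/HETATM line. The function name is made up, and this says nothing about what Foldit itself requires from the file.

```python
def chain_ids(pdb_text):
    """Return the chain IDs found in ATOM/HETATM records, in order of appearance.

    Relies on the fixed-column PDB format: the chain ID is column 22
    (index 21 when counting from zero).
    """
    chains = []
    for line in pdb_text.splitlines():
        if line.startswith(("ATOM", "HETATM")) and len(line) > 21:
            chain = line[21]
            if chain not in chains:
                chains.append(chain)
    return chains
```

You would call it on the text of your own file, for example `chain_ids(open("target_plus_binder.pdb").read())` (the file name is hypothetical). For a target plus one dummy binder chain you would expect two IDs back.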

(7) Final Remarks

Don't just rely on "random people on the internet who claim they know something" (like the person who wrote these comments).
Read papers from peer-reviewed journals on the topic you are interested in, so you know the current trends and how scientists design experiments to do something similar to what you want to do. Also discuss with people who know you better, like your mentors or instructors. That way you will know more about what kind of knowledge you need to learn and what kind of skills you need to have to get where you want to go.

And good luck on your project!

rosie4loop Lv 1

Another problem to consider is whether you intend the E+M(corrected) complex to be the functional unit, or whether M must work by itself after being "corrected" by E, with E leaving M after correcting its shape. It's actually very likely that M would go back to the "wrong" shape after E unbinds from it.

If E is an editor that edits the sequence of M, or modifies its amino acids enough that it can fold back into the correct shape, it's more plausible.
Or E could act like a Lego block, so that upon complexation of E+M the new complex has the shape and function of the original protein P.

But all these cases are very challenging to work on in practice.

(Edited)

rosie4loop Lv 1

Also adding another note here. Foldit can only do energy minimization (wiggle), conformational sampling (shake, rebuild), etc.

It evaluates the structure by giving it a score, which comes from a set of equations based on physics.

It is not for dynamical simulation that estimates a time-dependent process or behavior (at least for now).

Even for programs that do simulation, the timescale we can reach is still limited by current computational power.

If we use other methods to model a process, we need to be aware that any process can happen on a computer; just build it. But it may not happen in reality. We predict whether it'd happen by estimating the energy change of the process, and assuming "to get A from B, path 1 is more probable than path 2". Nevertheless it's just a hypothesis, a model, until we can validate it with experiments.
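
To make the "more probable" idea concrete, here is a rough sketch of how an energy difference translates into a relative probability through a Boltzmann factor. It assumes the energies are real free-energy differences in kcal/mol between two states at equilibrium. A Foldit score is not such a number and cannot simply be plugged in, and the values below are made up.

```python
import math

# Gas constant in kcal/(mol*K), i.e. the Boltzmann constant per mole.
R_KCAL = 0.0019872041


def relative_population(delta_e_kcal, temperature_k=298.15):
    """Boltzmann ratio of state B to state A, given E_B - E_A in kcal/mol.

    A positive delta_e (B higher in energy) gives a ratio below 1,
    meaning B is less populated than A at equilibrium.
    """
    return math.exp(-delta_e_kcal / (R_KCAL * temperature_k))


# Hypothetical example: the "corrected" fold is 1 kcal/mol above the "wrong" fold.
print(relative_population(1.0))
```

Even this simple estimate depends entirely on how good the energy numbers are, which is why the result stays a hypothesis until an experiment checks it.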

LociOiling Lv 1

Back on the regular online Foldit front, I did try modifying my private puzzle to use the various examples provided with Foldit standalone.

There's a "PDB" example which uses only a PDB file in its config. That didn't work as a private puzzle in the online version, getting the same error complaining about a missing WTS file.

I'll open a bug report after a little further research. I need to look back at when the new Foldit was launched. I seem to recall some discussion of private puzzles at the time.

I'm not too optimistic about any bugs getting resolved soon.

See the Foldit standalone page on the wiki for an intro to the offline version of Foldit. You have to go through a licensing step to download Foldit standalone, but it's purely a formality. Once standalone starts, you can either load in your PDB and other config files, or start with a previously saved standalone session.

As far as I know, Foldit standalone is still a 32-bit app on Windows. As the wiki page notes, there's a forum of sorts for Foldit standalone. The standalone forum is hosted at a different site (not fold.it), and isn't really active.

rosie4loop Lv 1

@LociOiling would it work if you save the puzzle session in standalone then upload it to the online version, instead of uploading the structure? Or save session, import in standalone (generate multiple files), then upload the files generated by the import to the online version? Just wondering.

I don't have an option to upload a private puzzle to test, need to apply via email according to the "for education" post.

LociOiling Lv 1

Sorry for the slow reply @rosie4loop – I don't think there's a path to load a Foldit solution file (*.ir_solution) into the game version, but I'll do a little more experimenting. Standalone does have some kind of "save solution" feature, but it's been a while.

Thanks for the custom puzzle links. 2019 is maybe not that old, we'll see how much has changed.

LociOiling Lv 1

I guess private puzzles are working. Over in #bugs-and-feedback on Discord, @rmoretti supplied a .wts that can be added to the "puzzle files" when creating a private puzzle.

The tricky part is that the PDB must also be one of the "puzzle files" entries on the setup screen.

I had previously tried supplying the PDB as the "starting structure", but then the puzzle fails to open. I'm currently thinking they meant modified starting structure, but further testing is needed for that.

I also opened a bug report a couple of days back: https://fold.it/forum/bugs/private-puzzles-crash-opening-non-existent-wts-file

The bug report has the most complete details at the moment.

LociOiling Lv 1

@rmoretti wrote:

The "starting structure" is just used for the preview in the upper right of the puzzle page – it doesn't have any bearing on the puzzle setup itself.

It seems like the starting structure is actually used for the initial pose. I created a de-novo freestyle puzzle, and it shows the extended chain from the "starting structure" PDB, not the folded version from the PDB under "puzzle files".