agcohn821 Staff Lv 1
[1:04 PM] jflat06: Alrighty, hello everyone, and welcome to office hours! I'm jflat06, a developer on Foldit. I work on the software side of things for both the client and server. I don't have anything specific to talk about today, so if you have any questions or topics, feel free to bring them up
[1:05 PM] spvincent: hi jflat. I'm wondering if you're making progress on fixing the problems with small molecule design
[1:09 PM] jflat06: While I don't work on that specifically, I know that those are actively being worked on. I think we have 1 fix waiting already, and more should be coming soon as well.
[1:09 PM] spvincent: ok, tx
[1:11 PM] jflat06: We have several developers who are dedicated specifically towards development on the small molecule design stuff
[1:11 PM] spvincent: i know from experience how hard it is to track down irreproducible bugs
[1:13 PM] jflat06: Yeah! It's definitely a nightmare sometimes
[1:14 PM] jflat06: We have been trying to add more debugging info over the years to help with these things, but it's always a tough problem
[1:15 PM] jflat06: One of the features for the new site that I'm really looking forward to is the ability to download and install any old version of the client, which will be amazing for enabling players to help track down bugs for us
[1:15 PM] pc: Hi. AlphaFold tool is very interesting and helpfull in puzzles. Will you add some additional informations in AlphaFold prediction, like confidence detail in each segments (with color for exemple) ?
[1:15 PM] jflat06: @pc definitely
[1:15 PM] jflat06: We have some things in mind, and we hope those features will be coming sooner than later
[1:16 PM] spvincent: Perhaps you could look into sending out versions which display more debug info: stack traces and the like.
[1:16 PM] pc: I saw this in a scientific software video (it is confidence color)
[1:16 PM] jflat06: There are some technical hurdles to overcome to communicate that info between the client/server
[1:17 PM] jflat06: Yup! The stuff we're looking at specifically is giving you confidence on a per-residue basis
[1:17 PM] LociOilingIRC: do we have any ideas on why sometimes the traceback info on windows is all zeroed out?
[1:17 PM] jflat06: You mean the crash stack traces?
[1:18 PM] jflat06: Unfortunately I'm not sure
[1:18 PM] LociOilingIRC: yes, seems like something must have zapped memory
[1:18 PM] LociOilingIRC: also sometimes we get a debug.txt after a crash, but often we don't
[1:19 PM] jflat06: Yeah, the crash reporter tries to get that stack trace, but some crashes are just 'segfaults', where the crash reporter never even gets a chance to grab that stack trace
[1:19 PM] jflat06: and unfortunately not much we can do in those cases
[1:20 PM] LociOilingIRC: ok, that makes sense
[1:20 PM] spvincent: Maybe there are better tools on other platforms
[1:21 PM] pc: Thanks. In foldit it is hard to visualize clashes and voids with details. to optimise packing we mostly use script and not enough hand folding. So maybe we need this kind of visualisation : (edited)
[1:23 PM] pc: with a volumetric cloud in a 3d texture
[1:26 PM] jflat06: I know we have the isosurface, but I realize that is a little… lacking
[1:29 PM] LociOilingIRC: just thought of this again, but would it be possible to have a protein design sandbox with alphafold?
[1:29 PM] spvincent: I was thinking the same thing!
[1:29 PM] LociOilingIRC: puzzle 2027 can be used up to a point, but it has a limit of 120 segments
[1:30 PM] pc: Do you know if players use AF in puzzles ? I saw some top score players forgot to use it. Maybe score system can take count about some protein analysis in shared solutions ?
[1:30 PM] LociOilingIRC: also, AF seems doubtful about some real proteins
[1:31 PM] pc: yes LociOilingIRC ^^. I experiment new proteins in a design puzzle round begin. But can be interesting in sandbox yes ^^.
[1:31 PM] LociOilingIRC: for example, I just submitted the zinc binding protein, and the prediction came back with only 60% confidence
[1:31 PM] spvincent: I don't think its a question of forgetting to use it.
[1:32 PM] LociOilingIRC: not forgetting, sometimes you just have to move on without AF
[1:33 PM] spvincent: I suppose at some point the AF score might get integrated into the Foldit score but i've no idea how that could reasonably be done.
[1:33 PM] jeff101:I've had solutions that start with good AF scores, but as they gain Foldit points, the AF confidence score goes down.
[1:33 PM] pc: sometimes we have to reduce points to get better confidence. Score system can be adapted maybe.
[1:33 PM] alcor29: Hi jflat06. would you know if foldit will work with Windows 11?
[1:33 PM] jflat06: I don't run the puzzles, but we might have some plans for that in the future with a setup like the competition we ran earlier this year where you try to get as many viable solutions as possible.
[1:33 PM] jflat06: @alcor29 I have no idea :x
[1:34 PM] jflat06: I don't think anyone on the team has upgraded yet
[1:34 PM] alcor29: k
[1:34 PM] alcor29: Tx.
[1:35 PM] pc: In binder puzzles when we adapt too mush our protein to target, the alone protein become less stable alone. it is normal. For the first time we can know when it happen with AF. But we can need an objective in game that tell us when out protein is to mush twisted. (edited)
[1:35 PM] jeff101:is AlphaFold based just on structures of monomeric proteins that contain no metals, ligands, or other cofactors and are not part of larger protein complexes?
[1:36 PM] jeff101:is AF based only on structures in the physiological pH range that most FOldit puzzles are done at?
[1:37 PM] pc: In last puzzle I maybe found a protein that keep his hight confidence during all round (91.5%). It is because I talked it to bko-ep this week ^^
[1:37 PM] jeff101:are any of the structures AF is based on membrane proteinns? are any in environments that favor disulfide bonds? are any in environments that disfavor disulfide bonds?
[1:38 PM] jflat06: I'm not sure about what training data was used to train alphafold
[1:38 PM] jflat06: I think the paper would be the best place to look up what training set they used
[1:40 PM] jeff101:it would be nice to be able to color residues by any subscore we choose. This would give a different perspective on things like packing, voids, and clashing.
[1:41 PM] jflat06: Yeah, that would be a cool feature. I know we have some plans for re-doing some of the visualizations, and that might be something we try and work in
[1:41 PM] jflat06: no timeline on that, though
[1:42 PM] jeff101:it would also be nice if we could access the # of neighbors Rosetta/Foldit uses to decide which residues are in the core
[1:42 PM] alcor29: Will any work on the new client be directed to improving the hypersensitivity of the move tool?
[1:44 PM] spvincent: this may be off topic a bit, but what will Rosetta be doing in this new era of AF and the Rosetta equivalent? looking at proteins with non-canonical amino acids: that kind of thing?
[1:45 PM] jeff101:can you cite a paper or web page that gives more details about how Rosetta/Foldit uses cones to find the # of nearest neighbors to a residue? I would guess that the cone angle and size depends on what amino acid a residue is. Things like glycine and alanine would have smaller cones than tryptophan for example. If we knew all the angles and distances involved, we might be able to code our own neare
[1:45 PM] jeff101:st neigbor calculator with LUA
[1:45 PM] jflat06: @alcor29 can you describe the problem more specifically? Is the rotation too sensitive?
[1:46 PM] jflat06: @spvincent I don't know all of what is happening on the Rosetta side of things, but I do know that there's a huge rush to figure out how to use and extend the machine learning algorithms, specifically with respect to protein design
[1:47 PM] jeff101:Rosetta@Home is still sending out jobs
[1:47 PM] jflat06: Even from a Foldit point of view, this is super exciting, because it gives us a chance to dramatically increase the odds that your designs will fold up properly by the time they make their way to us
[1:48 PM] pc: rotate around protein in symetric puzzles is not very easy. It is because it depends of protein position I suppose ? When protein is far it is less easy.
[1:48 PM] jflat06: @jeff101, I don't know much about the inner workings of the nearest neighbor calculations, unfortunately
[1:48 PM] jeff101:@pc do you use q to center the rotation on the protein. it helps
[1:48 PM] alcor29: Very hard to impossible to make a slight correction, especially with the gimble function. Can you do something akin to changing the gear ratio?
[1:48 PM] pc: thanks jeff ^^
[1:49 PM] jflat06: yeah I think I know what you're talking about. I think we could improve that, I always found it finicky, but I never actually had to do anything with it!
[1:49 PM] jflat06: I just had to test it… hah
[1:49 PM] spvincent: I can understand how the field has been turned upside down but isn't there a danger of relying too much on AF and abandoning Molecular Mechanics approaches? The latter should be much more general.
[1:50 PM] jeff101:I made a recipe called FindMonomers to help find monomers when they are far apart. It puts some bands between the monomers that you can follow from one monomer to another. It also adds labels to at least one monomer so you can find it using Notes Mode.
[1:50 PM] Vincera: Is there a way for bona fide, regular, independent FoldIt researchers without current instituitonal affiliation or gain access to full-text articles via UW? We often need to read beyond the abstracts and cannot afford 40$ a pop.
[1:51 PM] jflat06: I don't think anyone is abandoning it, but more thinking about when and where it can be integrated to improve existing techniques
[1:53 PM] spvincent: integrated into RosettaFold?
[1:53 PM] jflat06: @Vincera Generally, I'm not sure. I think any of the studies related to Foldit we pay to make open-access, though
[1:54 PM] jeff101:Vincera, if you know the title and authors, you can sometimes find pdf's of key articles doing a google search. Some authors post them on their own web pages.
[1:55 PM] Vincera: Scribd and Semantic Scholars offer pdfs but I don;t want someone's hijacked, stolen work.
[1:55 PM] jflat06: There are tons of different methodologies under constant research within the Rosetta community, and I think people are all asking themselves, "how does this fit into my research?"
[1:55 PM] spvincent: Also if you ask the authors for a copy saying who you are and why you want it they'll usually send you a copy.
[1:56 PM] jeff101:back in the day, authors would order hardcopy preprints or offprints that they could mail to colleagues
[1:56 PM] Vincera: Yes; but, as an American citizen, I cannot - for instance - directly request an Iranian article. This week I ahd to do a workaround and access via the Dutch researcher.
[1:57 PM] jflat06: Sometimes that might be, "can I just do this all with machine learning?". But more often I would expect people to ask, "What if this part of my research was a solved problem?"
[1:57 PM] Vincera: And some of the Caribbean Basin works are the most difficult.
[1:57 PM] Vincera: I read the French and Spanish research
[1:57 PM] Vincera: but cannot gain access to the researcehrs themselves often.
[1:58 PM] Vincera: Much of their work has not even made it into the databanks.
[1:58 PM] jflat06: For example, from a Foldit point of view, what if protein structure prediction was a solved problem? How do we redesign foldit with that in mind? We would want to have more of a focus on sequence manipulation and tools to match a sequence to a specific shape.
[1:59 PM] Vincera: I myself am published and had to pay for my own article.
[2:00 PM] spvincent: Perhaps but Foldit has been emphasizing protein design for years
[2:01 PM] alcor29: AF can predict monomers, but we have to change sequences to "bind" with other substances or symmetric copies. That's where I have a problem seeing where AF can help unless it is trained on millions of interfaces?\
[2:01 PM] pc: Foldit is more a "design software" than a "prediction software" now
[2:02 PM] pc: And I think in the future of Foldit we will have design puzzles with proteins that have "functions" (maybe)
[2:02 PM] pc: It will be very interesting ^^
[2:03 PM] jflat06: AF doesn't currently pick shapes that bind to other shapes. But having an oracle that tells you what shape a specific sequence folds up into is still enormously powerful for this task.
[2:03 PM] alcor29: Absolutely.
[2:04 PM] alcor29: Thanks jflat.
[2:05 PM] jflat06: That's never existed in the field before, and rather than viewing it as encroaching, I think most people in the field are beyond excited about how they can apply it to different problems.
[2:05 PM] spvincent: Maybe. I just have reservations about the way AF is sometimes portrayed in popular science websites as being the last word in protein structure prediction. It may be a major advance but ultimately it's pattern matching not science. Even if its the pinnacle of pattern matching
[2:06 PM] alcor29: No doubt. I have a feeling the interface problem will eventually be worked on.
[2:06 PM] jflat06: Sure - but a lot of science itself is pattern matching. Much of rosetta is based on statistical models and trying to emulate patterns.
[2:07 PM] jflat06: I certainly don't think that AF means the end of statistical of physical predictive modelling for protein folding.
[2:07 PM] spvincent: thats good
[2:07 PM] pc: AF give a very important feedback in design puzzles. But yes prediction is not resolved.
[2:07 PM] NinjaGreg: Seems to me what you'd want is a tool that allowed you to specify a fold, and have it tell you the sequence to get there.
[2:08 PM] jflat06: @NinjaGreg that's the holy grail
[2:08 PM] alcor29: That would be great!
[2:08 PM] NinjaGreg: Right now it's sort of "try this and see what happens".
[2:08 PM] pc: NinjaGreg : maybe it is possible yes, most of deep learning algo can be reversed (edited)
[2:09 PM] NinjaGreg: Think the AF people would look at that?
[2:10 PM] pc: https://fold.it/portal/node/2011976 : I made this topic for that @NinjaGreg ^^
[2:10 PM] jflat06: I'm sure they will at some point.
[2:10 PM] NinjaGreg: Hey, thanks pc!
[2:10 PM] jflat06: I'm pretty sure Rosetta people are
[2:10 PM] alcor29: You can have a library of sequences and corresponding folds, even the unconforming ones.
[2:11 PM] jflat06: Ok, I am going to go grab some lunch. Thank you everyone for coming!
[2:11 PM] alcor29: Thanks jflat
[2:11 PM] spvincent: tx jflat
[2:11 PM] NinjaGreg: Thanks for hosting, @jflat06!
[2:12 PM] pc: "bon appétit"
[2:12 PM] pc: thanks
[2:12 PM] Nicm25: thanks very much ^^.
[2:12 PM] jeff101:thanks jflat06