Replies: 5 comments 18 replies
-
Another potential benefit is that some isozymes could be skipped by ftINIT perhaps. Not sure how much it matters though. This could be a post-processing step at the end of ftINIT, to just remove the isozymes with a score < 0 if there are others with a score > 0. |
Beta Was this translation helpful? Give feedback.
-
Good to notice, is that the new
The most important part of the proposed Note that there should also be some function that can parse e.g. Complex Portal output (loadComplexData). One idea would be to have In that strategy, Regarding the EC codes, to me this is a separate issue (how to annotate EC numbers, potentially extracted from proteins that are annotated with EC numbers (e.g. from UniProt). |
Beta Was this translation helpful? Give feedback.
-
So, the main point is that the EnzymeInfo would be the primary storage of this information, instead of GPRs and eccodes, which are a bit confusing as I pointed out, so it would replace them. GPRs and eccodes will probably have to be kept for legacy reasons, but would be generated from the EnzymeInfo. What we do inside GECKO is less a concern I'm thinking - I think this will be different for light vs full. But the problem is the same for both. I also think this is a more clear structure - the GPRs can also be defined in other forms, i.e., in more complex setups than just (ANDs) OR (ANDs), which I think is bad - it makes it much more difficult to work with them. It is like that in Human-GEM to save space. So, this is not to replace the model.ec fields, but to replace the GPR and ECCodes fields. About the ftINIT - let's leave it for now, I'm not sure it is that valuable to identify which enzyme complexes that are available for a reaction. |
Beta Was this translation helpful? Give feedback.
-
I somehow fixed the enzyme info issue in the CofactorYeast project although it might not be an efficient way but could inspire a bit here. I store the info in the file enzymedata.mat, in which there are different enzyme-related fields (e.g., subunits' id, ec code, stoichiometry, kcat and kcat confidence...) with the same length that equals the number of split enzymatic reactions. |
Beta Was this translation helpful? Give feedback.
-
After the |
Beta Was this translation helpful? Give feedback.
-
As written by @johan-gson on email:
What I propose instead is to add another structure to the model, called EnzymeInfo or something like that, that is a list of enzyme complexes (just ids, maybe EC numbers, I’m not sure). Then we would add a separate table called Enzyme Complexes or similar, which would contain:
We would then add a table of subunits, which would contain gene id, protein id, and potentially some other info. Alternatively, this table could be skipped, and we could just use gene directly in the enzyme complexes above. But it depends on what we want to add. Ideally, I think the subunits table should contain things like MW.
I would then think that Gecko 3 could use different data depending on what is available. If this new structure is there in a model, that is what we will use. Otherwise, we can provide code to build it from GPRs and/or the old eccodes field.
If you want, I could generate this type of structure for Human-GEM, perhaps as part of the Human2 project? If you agree, that is. The question is exactly what should be put in this structure, and what should be retrieved from other databases. When we work with it, the GPRs and ECCodes fields could then be generated from the new structure, so we don’t store things in multiple ways.
Beta Was this translation helpful? Give feedback.
All reactions