Replies: 4 comments 4 replies
-
Hi @dessygil, One idea we have but haven't explored much is to convert PDB to NumPy matrices and store them in .zarr format, similar to the SDF converter. Came across this library, which could be a good starting point. CC @cwognum |
Beta Was this translation helpful? Give feedback.
-
Besides parsing the PDB file, one question is how to save the data contained in a PDB file to an array. This would ideally be lossless if possible, but remember that Polaris is ultimately meant for ML modeling. Instead of lossless, it would thus also suffice to just save the information needed for modeling. Maybe as a first step, we could simply store the 3D coordinates and amino acid sequence? Maybe worth going through a couple of ML papers that actually cite PDE10A and see which information they use? |
Beta Was this translation helpful? Give feedback.
-
Hi, @dessygil |
Beta Was this translation helpful? Give feedback.
-
This suggestion has been implemented in #171 ! 🎉 |
Beta Was this translation helpful? Give feedback.
-
Hi all,
I'm currently trying to add the dataset suggested in a previous discussion post called PDE10A. In this dataset, there are various files representing the ligand, pocket and protein. The ligand is within a .sdf file, whereas the pocket and protein are represented in a .pdb or .mol2 file. Polaris already has the function to convert a .sdf file to be used but I would like to add a .pdb converter to add the functionality to Polaris and to use for the specific dataset I'm trying to add.
In short, I'm writing this post to discuss the best strategy for adding the pdb converter.
Beta Was this translation helpful? Give feedback.
All reactions