Allow using hexadecimal ATOM id #370

SSchott · 2023-09-06T11:48:59Z

Was facing issues parsing too big PDBs. I adapted the parsing for the serial entry for ATOM to allow hexadecimal notation, as when PDBs are rewritten by VMD. I don't know how much it would be necessary to include this for other atom id dependent entries. If so, making a function for it might be necessary. Anything that goes against such a move?

Allow using hexadecimal ATOM IDs

sobolevnrm · 2023-09-09T18:07:32Z

Can you please provide more information, including links to references, about PDB files using hexademical numbering? I'm unfamiliar with this. Thank you!

SSchott · 2023-09-10T09:45:54Z

I didn't find "official" documentation, aside the mailing list here , but is already used by other packages, e.g. here . Seems like adding it for resnumbers is also an option.

sobolevnrm

Hello --

I'm hesitant to add this functionality to PDB2PQR because it is a non-standard format. However, if we do add it, can you please add some tests to make sure it doesn't break other functionality, particularly when parsing PDB entries with chain IDs or insertion codes?

In general, all new functionality in PDB2PQR should include tests to demonstrate when the code works, when it breaks, and how it integrates with other features in the code.

Thank you,

Nathan

speleo3 · 2023-10-28T07:38:52Z

VMD's solution will produce many "hex" numbers which are digit-only, and thus will not be parsed back correctly and will produce non-unique atom IDs. So I wonder how useful this really is, and if it is any better than to just write zeros or 99999 for overflowing numbers.

>>> int("%05x" % 100096)
18700

SSchott · 2023-10-30T15:46:58Z

Hi,
Thanks for the comments. @sobolevnrm I'll see what I can do regarding a test. In any case, any standard PDB should never trigger the except section, as ATOM IDs should always be int, but we know how the PDB-space looks like.... @speleo3 I don't follow what you mean. The try and except should ensure only hex numbers are transformed into a decimal number. It should ensure traceability in the long term.

SSchott · 2024-01-03T14:56:53Z

Finally got around this. I made a draft in #383 . Let me know if it goes in line with what you were looking for.

Update pdb.py

e8edf40

Allow using hexadecimal ATOM IDs

sobolevnrm requested changes Oct 8, 2023

View reviewed changes

GerardCarreraCardona approved these changes Oct 16, 2023

View reviewed changes

SSchott closed this Jan 3, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Allow using hexadecimal ATOM id #370

Allow using hexadecimal ATOM id #370

SSchott commented Sep 6, 2023

sobolevnrm commented Sep 9, 2023

SSchott commented Sep 10, 2023

sobolevnrm left a comment

speleo3 commented Oct 28, 2023

SSchott commented Oct 30, 2023

SSchott commented Jan 3, 2024

Allow using hexadecimal ATOM id #370

Allow using hexadecimal ATOM id #370

Conversation

SSchott commented Sep 6, 2023

sobolevnrm commented Sep 9, 2023

SSchott commented Sep 10, 2023

sobolevnrm left a comment

Choose a reason for hiding this comment

speleo3 commented Oct 28, 2023

SSchott commented Oct 30, 2023

SSchott commented Jan 3, 2024