Get PDBWriter to follow standard for ATOM/HETATM #2826

IAlibay · 2020-07-07T10:59:21Z

Related to #1753

Is your feature request related to a problem?

Currently the PDB writer will always write out every single residue with an ATOM record type.

This is causing some issues downstream where we pass PDB files written by MDA to propka (Becksteinlab/propkatraj#24)

My interpretation of the PDBv3 standard is that standard amino acids and nucleotides are written as ATOM records, everything else is a HETATM (https://www.wwpdb.org/documentation/file-format-content/format33/sect9.html).

Describe the solution you'd like

As a first step, default to writing ATOM records if atoms belong to a residue that is included in the standard, otherwise write a HETATM record. Then we could expand to expand to the suggestions in #1753?

Describe alternatives you've considered

The alternative is that we use one of the converters downstream, but it would be nice to have a standard compliant PDB writer.

The text was updated successfully, but these errors were encountered:

Luthaf · 2020-07-07T13:06:41Z

You can try to use chemfiles PDB writer instead (using format='CHEMFILES'), which should deal with this specific issue. If you do try it, let me know how it goes!

IAlibay · 2020-07-07T13:36:32Z

Thanks @Luthaf. I did try chemfiles unfortunately there's a deeper issue with NamedStream objects that I haven't been able to get to the bottom of yet. i.e. the following will fail:

pstream = mda.lib.util.NamedStream(StringIO(), 'file.pdb')
with ChemfilesWriter(pstream, chemfiles_format='PDB') as w:
  w.write(ag)
pstream.reset()

Either way, I think the MDA PDB writer needs fixing. Indeed the fact that chemfiles' writer differs in behaviour is an even bigger reason. In my opinion, any way that a user tries to write out a given file format with MDA should yield the same answer.

Luthaf · 2020-07-07T13:56:06Z

Yes, since chemfiles uses its own C++ code to write to files and not standard python file objects, I would expect such code to fail. This is something I might be able to fix with the next release which introduces in-memory reading/writing.

IAlibay added Format-PDB Component-Writers downstream labels Jul 7, 2020

mieczyslaw mentioned this issue Jul 30, 2020

HETATM record type written out #2880

Merged

4 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Get PDBWriter to follow standard for ATOM/HETATM #2826

Get PDBWriter to follow standard for ATOM/HETATM #2826

IAlibay commented Jul 7, 2020

Luthaf commented Jul 7, 2020

IAlibay commented Jul 7, 2020

Luthaf commented Jul 7, 2020

Get PDBWriter to follow standard for ATOM/HETATM #2826

Get PDBWriter to follow standard for ATOM/HETATM #2826

Comments

IAlibay commented Jul 7, 2020

Is your feature request related to a problem?

Describe the solution you'd like

Describe alternatives you've considered

Luthaf commented Jul 7, 2020

IAlibay commented Jul 7, 2020

Luthaf commented Jul 7, 2020