-
Notifications
You must be signed in to change notification settings - Fork 0
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Should the Document constructor use a different parser for collapse_whitespace? #54
Comments
after taking a few superficial glances, just some open ended threads:
|
i've been thinking that whitespace would make a good major topic for the 0.5 version. and, instead of relying on libxml specifics, we can implement serialization natively. that's to be engineered for the Rust implementation anyway and we can look at the API design (pretty-formatted string representations? always move all namespace declarations to the root node when serializing a document?). |
the last cell in the |
i started looking into this which led me to realise that the serialization doesn't consider the |
quick update: yesterday i was honest enough to meself to realize that i'm actually traumatized by the task of producing properly placed whitespace. but i still think the target is in eye's sight. let's hope the XML Foundation covers rehab. |
currently right now i'm in a manic phase (yes diggin to solve the problem got me to new experiences) and i imagine that the implementation will produce the most beautifullest XML that the world has ever seen and only the radiated überhumen on Mars will be able to deliver something better. anyway after a few hints by @zed-g i have the idea to compile an appendix for the documentation that compares "pretty" XML serialisat productions by different serializers for a small variety of samples. |
lxml.etree.tostring
withpretty_print=True
has this caveat:Now instantiating a
delb.Document
with thecollapse_whitespace
flag somewhat feels like it should do away with whitespaces in a way that makes the parsed XML suitable for custom formatting, e.g. calling:...or something like this. However, in order to be able to pretty print delb content, it is still necessary to use a custom parser on instantiation, e.g.
...in which case the
collapse_whitespace
flag of theDocument
constructor isn't even relevant.I feel like wanting to pretty-print delb objects as a usecase is somewhat justified (I needed it today in order to simplify a test), and think that this behaviour is somewhat obscured right now and should at least be documented in some way. But maybe this could even be handled in a more user-friendly way. Is there a point in using
delb.Document
withcollapse_whitespace
without anlxml
parser that also removes whitespace or could the use of such a parser perhaps be implied bycollapse_whitespace
in general?Should
TagNode
have atostring
method with an optionalpretty_print
flag as well?The text was updated successfully, but these errors were encountered: