Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

The image generated by get_pixmap() is abnormal, but the text result is correct #3854

Open
1339503169 opened this issue Sep 9, 2024 · 2 comments
Labels
upstream bug bug outside this package

Comments

@1339503169
Copy link

Description of the bug

here is original pdf
1832786.pdf
image generated by get_pixmap()
1832786 pdf_0
what is looks like in wps
image

I opened this file in WPS and found it to be OK, and the text extraction was also correct. However, the image generated by get_pixmap() is very strange, and the Chinese text seems to be garbled

How to reproduce the bug

import fitz
document = fitz.open('path/to/original pdf')
page = document.load_page(0)
page.get_pixmap()

PyMuPDF version

1.24.5

Operating system

Windows

Python version

3.8

@JorjMcKie
Copy link
Collaborator

This a bug in the base library, MuPDF. I have entered a bug report there, here is the link: https://bugs.ghostscript.com/show_bug.cgi?id=708019.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
upstream bug bug outside this package
Projects
None yet
Development

No branches or pull requests

3 participants
@JorjMcKie @1339503169 and others