The image generated by get_pixmap() is abnormal, but the text result is correct #3854

1339503169 · 2024-09-09T10:47:25Z

Description of the bug

here is original pdf
1832786.pdf
image generated by get_pixmap()

what is looks like in wps

I opened this file in WPS and found it to be OK, and the text extraction was also correct. However, the image generated by get_pixmap() is very strange, and the Chinese text seems to be garbled

How to reproduce the bug

import fitz
document = fitz.open('path/to/original pdf')
page = document.load_page(0)
page.get_pixmap()

PyMuPDF version

1.24.5

Operating system

Windows

Python version

3.8

JorjMcKie · 2024-09-09T11:06:20Z

This a bug in the base library, MuPDF. I have entered a bug report there, here is the link: https://bugs.ghostscript.com/show_bug.cgi?id=708019.

JorjMcKie added the upstream bug bug outside this package label Sep 9, 2024

JorjMcKie mentioned this issue Sep 9, 2024

The image generated by get_pixmap() is abnormal #3853

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

The image generated by get_pixmap() is abnormal, but the text result is correct #3854

The image generated by get_pixmap() is abnormal, but the text result is correct #3854

1339503169 commented Sep 9, 2024

JorjMcKie commented Sep 9, 2024

The image generated by get_pixmap() is abnormal, but the text result is correct #3854

The image generated by get_pixmap() is abnormal, but the text result is correct #3854

Comments

1339503169 commented Sep 9, 2024

Description of the bug

How to reproduce the bug

PyMuPDF version

Operating system

Python version

JorjMcKie commented Sep 9, 2024