You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I see, can you please check line bbox.
For all other files these tuples are not equal to (0, 0, 0, 0)
Try this:
doc = fitz.open('/home/sshustov/Downloads/example.pdf')
page = doc[0]
blocks = page.getText('dict')['blocks']
for block in blocks:
print(block['bbox'])
for line in block['lines']:
print(line['bbox'])
Please provide all mandatory information!
Describe the bug (mandatory)
For some reason I noticed several documents that have wrong bbox value
To Reproduce (mandatory)
You can use getText('dict') on attached document. I also attached the screenshot (you will find that 1 'bbox' value is correct, but another one is not
Expected behavior (optional)
Both bbox should have some numbers, not 0
Screenshots (optional)
example.pdf
data:image/s3,"s3://crabby-images/f1add/f1add73b3f7cdaaa06d57fd7d8d33b34c2e3eef2" alt="example_from_debug"
Your configuration (mandatory)
The text was updated successfully, but these errors were encountered: