Expose dump_data_fields_utf8 ? #37

ldng · 2020-04-02T04:20:35Z

My version of pdftk (3.0.2) has both a dump_data_fields and a dump_data_fields_utf8.
The former does not output accent correctly while the later does the job as expected.

Could/should it be exposed ?
Should the dump_data_fields python function use silently the dump_data_fields_utf8 command when available since in Python3 all string are unicode anyway ?

revolunet · 2020-04-03T15:03:51Z

Agree that we should only have one function for this

mogli91 · 2020-10-20T21:06:33Z

+1
I ran into this issue when running pdftk-java in a debian docker container. Basically, German umlaut characters and other special unicode characters are simply replaced by a "?". However, if I use cmd = f'{pypdftk.PDFTK_PATH} debug_filled.pdf dump_data_fields_utf8' then also p = pypdftk.check_output(cmd, shell=True) has mainly the expected output (correct unicode characters).
I am saying mainly because I noticed that multiline FieldValues are currently unsupported. I will open a separate issue for this (created issue #43 )

JHei · 2025-01-13T08:01:19Z

+1
unfortunately still not available?

mogli91 mentioned this issue Oct 20, 2020

dump_data_fields does not support multi-line FieldValues #43

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Expose dump_data_fields_utf8 ? #37

Expose dump_data_fields_utf8 ? #37

ldng commented Apr 2, 2020

revolunet commented Apr 3, 2020

mogli91 commented Oct 20, 2020 •

edited

Loading

JHei commented Jan 13, 2025

Expose dump_data_fields_utf8 ? #37

Expose dump_data_fields_utf8 ? #37

Comments

ldng commented Apr 2, 2020

revolunet commented Apr 3, 2020

mogli91 commented Oct 20, 2020 • edited Loading

JHei commented Jan 13, 2025

mogli91 commented Oct 20, 2020 •

edited

Loading