Both pdfinfo & exiftool have trouble with 1.0.4 but works with 1.0.3
Hi, We use PDF tools like mutool, pdfinfo (from poppler) and exiftool in our workflow (once the PDFs are created by ConTeXt). But looks like we have hit upon some regression issue between 1.0.3 and 1.0.4 versions. Both pdfinfo (from poppler) and exiftool are able to read PDFs created from 1.0.3 but have trouble reading PDFs created with 1.0.4. Please see below logs for details. Our 1.0.4 is from April-2021. We will try to upgrade to the latest 1.0.4 and try again. mtx-context | ConTeXt Process Management 1.04 mtx-context | mtx-context | main context file: /ConTeXt/apr2021/tex/texmf-context/tex/context/base/mkiv/context.mkiv mtx-context | current version: 2021.04.21 11:39 mtx-context | main context file: /ConTeXt/apr2021/tex/texmf-context/tex/context/base/mkxl/context.mkxl mtx-context | current version: 2021.04.21 11:39 mtx-context | ConTeXt Process Management 1.03 mtx-context | mtx-context | main context file: /ConTeXt/sep2020/tex/texmf-context/tex/context/base/mkiv/context.mkiv mtx-context | current version: 2020.09.20 23:02 mtx-context | main context file: /ConTeXt/sep2020/tex/texmf-context/tex/context/base/mkiv/context.mkxl mtx-context | current version: 2020.09.20 23:02 *====pdfinfo (from Poppler)====* ~/TeX$ pdfinfo Hello-world_1.0.4.pdf Syntax Error: Missing 'endstream' or incorrect stream length Syntax Error: Missing 'endstream' or incorrect stream length Title: Hello world! Subject: A ConTeXt template Keywords: template Author: Mickey Mouse Creator: LuaMetaTeX 2.09 20210418 + ConTeXt LMTX 2021.04.21 11:39 Producer: LuaMetaTeX-2.09 CreationDate: Tue Aug 10 10:18:36 2021 +08 ModDate: Tue Aug 10 10:18:36 2021 +08 Tagged: no UserProperties: no Suspects: no Form: none JavaScript: no Pages: 2 Encrypted: no Page size: 612 x 792 pts (letter) Page rot: 0 File size: 39062 bytes Optimized: no PDF version: 1.7 ~/TeX$ pdfinfo Hello-world_1.0.3.pdf Title: Hello world! Subject: A ConTeXt template Keywords: template Author: Mickey Mouse Creator: LuaTeX 2.07 20200920 + ConTeXt MkIV 2020.09.20 23:02 Producer: LuaTeX-2.07 CreationDate: Tue Aug 10 09:16:30 2021 +08 ModDate: Tue Aug 10 09:16:30 2021 +08 Tagged: no UserProperties: no Suspects: no Form: none JavaScript: no Pages: 2 Encrypted: no Page size: 612 x 792 pts (letter) Page rot: 0 File size: 39861 bytes Optimized: no PDF version: 1.7 *====exiftool====* ~/exiftool/Image-ExifTool-12.29/exiftool Hello-world_1.0.4.pdf ExifTool Version Number : 12.29 File Name : Hello-world_1.0.4.pdf Directory : . File Size : 38 KiB File Modification Date/Time : 2021:08:10 10:19:52+08:00 File Access Date/Time : 2021:08:10 10:20:17+08:00 File Inode Change Date/Time : 2021:08:10 10:19:52+08:00 File Permissions : -rwxr-x--- File Type : PDF File Type Extension : pdf MIME Type : application/pdf PDF Version : 1.7 Linearized : No Warning : Error reading stream data ~/exiftool/Image-ExifTool-12.29/exiftool Hello-world_1.0.3.pdf ExifTool Version Number : 12.29 File Name : Hello-world_1.0.3.pdf Directory : . File Size : 39 KiB File Modification Date/Time : 2021:08:10 09:17:51+08:00 File Access Date/Time : 2021:08:10 09:18:00+08:00 File Inode Change Date/Time : 2021:08:10 09:17:51+08:00 File Permissions : -rwxr-x--- File Type : PDF File Type Extension : pdf MIME Type : application/pdf Linearized : No Author : Mickey Mouse Con Te Xt Jobname : Hello-world Con Te Xt LMTX : 0.10 Con Te Xt Support : contextgarden.net Con Te Xt Time : 2021-08-10 09:16 Con Te Xt Url : www.pragma-ade.com Con Te Xt Version : 2020.09.20 23:02 Create Date : 2021:08:10 09:16:30+08:00 ID : Hello-world | 2021-08-10T09:16:30+08:00 Keywords : template Modify Date : 2021:08:10 09:16:30+08:00 Subject : A ConTeXt template Te X Support : tug.org Language : en Format : application/pdf Creator : Mickey Mouse Description : A ConTeXt template Title : Hello world! Id : Hello-world | 2021-08-10T09:16:30+08:00 Con Te Xt Jobname : Hello-world Con Te Xt Time : 2021:08:10 09:16 Con Te Xt Url : www.pragma-ade.com Con Te Xt Support : contextgarden.net Con Te Xt Version : 2020.09.20 23:02 Con Te Xt LMTX : 0.10 Te X Support : tug.org Lua Te X Version : 2.07 Lua Te X Functionality : 20200920 Lua Te X Lua Version : 5.4 Lua Te X Platform : win64 Creator Tool : LuaTeX 2.07 20200920 + ConTeXt MkIV 2020.09.20 23:02 Metadata Date : 2021:08:10 09:16:30+08:00 Producer : LuaTeX-2.07 Trapped : False Document ID : uuid:41a746a2-4d66-ad14-387a-ace654b14d2c Instance ID : uuid:125a13c6-4cf4-9bff-2617-972e13516712 Page Mode : UseNone Page Count : 2 PDF Version : 1.7
On 8/10/2021 6:33 AM, Ramkumar KB via ntg-context wrote:
Hi,
We use PDF tools like mutool, pdfinfo (from poppler) and exiftool in our workflow (once the PDFs are created by ConTeXt).
But looks like we have hit upon some regression issue between 1.0.3 and 1.0.4 versions. Both pdfinfo (from poppler) and exiftool are able to read PDFs created from 1.0.3 but have trouble reading PDFs created with 1.0.4. Please see below logs for details. hard to say without files but indeed updating might help
btw, there is (kind of old but still ok): mtxrun --script pdf --info filename.pdf Hans ----------------------------------------------------------------- Hans Hagen | PRAGMA ADE Ridderstraat 27 | 8061 GH Hasselt | The Netherlands tel: 038 477 53 69 | www.pragma-ade.nl | www.pragma-pod.nl -----------------------------------------------------------------
Hans,
Thanks for the feedback. I will update to the latest version and run some
tests again.
mtxrun does not list the custom attributes unlike the others. From our
(limited) experience, mutool / poppler / exiftool all excel in different
areas. Dealing with different tools and keeping them upto date is
something we have internalized.
We did observe that mutool did not give errors with version 1.0.4 but
others did. In the ConTeXt development workflow, are there any tests with
such external tools (to possibly catch regression issues)?
We have experienced (reasonable) success with tools like diff-pdf
https://vslavik.github.io/diff-pdf/ to catch regression in our workflow.
Thank you as always!
best regards,
Ramkumar
On Tue, Aug 10, 2021 at 3:19 PM Hans Hagen
On 8/10/2021 6:33 AM, Ramkumar KB via ntg-context wrote:
Hi,
We use PDF tools like mutool, pdfinfo (from poppler) and exiftool in our workflow (once the PDFs are created by ConTeXt).
But looks like we have hit upon some regression issue between 1.0.3 and 1.0.4 versions. Both pdfinfo (from poppler) and exiftool are able to read PDFs created from 1.0.3 but have trouble reading PDFs created with 1.0.4. Please see below logs for details. hard to say without files but indeed updating might help
btw, there is (kind of old but still ok):
mtxrun --script pdf --info filename.pdf
Hans
----------------------------------------------------------------- Hans Hagen | PRAGMA ADE Ridderstraat 27 | 8061 GH Hasselt | The Netherlands tel: 038 477 53 69 | www.pragma-ade.nl | www.pragma-pod.nl -----------------------------------------------------------------
Hans,
To close this thread, both poppler and exiftool worked properly after I
upgraded to the latest version (details below). I will add a section in the
wiki about these tools, as they are mighty useful when working with PDFs.
mtx-context | ConTeXt Process Management 1.04 mtx-context |
mtx-context | main context file:
/home/tex/texmf-context/tex/context/base/mkiv/context.mkiv mtx-context
| current version: 2021.08.10 12:42 mtx-context | main context file:
/home/tex/texmf-context/tex/context/base/mkxl/context.mkxl mtx-context
| current version: 2021.08.10 12:42
Thanks again!
best regards,
Ramkumar
On Tue, Aug 10, 2021 at 7:01 PM Ramkumar KB
Hans,
Thanks for the feedback. I will update to the latest version and run some tests again.
mtxrun does not list the custom attributes unlike the others. From our (limited) experience, mutool / poppler / exiftool all excel in different areas. Dealing with different tools and keeping them upto date is something we have internalized.
We did observe that mutool did not give errors with version 1.0.4 but others did. In the ConTeXt development workflow, are there any tests with such external tools (to possibly catch regression issues)?
We have experienced (reasonable) success with tools like diff-pdf https://vslavik.github.io/diff-pdf/ to catch regression in our workflow.
Thank you as always!
best regards, Ramkumar
On Tue, Aug 10, 2021 at 3:19 PM Hans Hagen
wrote: On 8/10/2021 6:33 AM, Ramkumar KB via ntg-context wrote:
Hi,
We use PDF tools like mutool, pdfinfo (from poppler) and exiftool in our workflow (once the PDFs are created by ConTeXt).
But looks like we have hit upon some regression issue between 1.0.3 and 1.0.4 versions. Both pdfinfo (from poppler) and exiftool are able to read PDFs created from 1.0.3 but have trouble reading PDFs created with 1.0.4. Please see below logs for details. hard to say without files but indeed updating might help
btw, there is (kind of old but still ok):
mtxrun --script pdf --info filename.pdf
Hans
----------------------------------------------------------------- Hans Hagen | PRAGMA ADE Ridderstraat 27 | 8061 GH Hasselt | The Netherlands tel: 038 477 53 69 | www.pragma-ade.nl | www.pragma-pod.nl -----------------------------------------------------------------
participants (2)
-
Hans Hagen
-
Ramkumar KB