
On 7/7/25 18:05, Hans Hagen via ntg-context wrote:
[...] Indeed. When there are useful and reliable mappings we can of course add more. In that case there also should be intended structure mentioned because we can have a richer structure in context than possible with those mappings to pdf which then can result in an invalidated mapping.
It's mostly about playing safe.
Of course, I totally agree with that. I only want to play to be able to investigate which options are the right ones. This is my way of looking into this, just by testing (or breaking) it first.
[...] Did you test it with ua 1 replaced by ua 2 ?
No, it works fine. Sorry, but with lines such as: description = { pua = "ua1", pdf = "Note" }, `pua = "ua1"` made me think that it was UA-1 only.
If needed we can pick up on it next month or so unless there are clean and easy additions to make (we're currently not in that mode and it's not the most interesting stuff to play with so we like to concentrate it, and the more it rains the better).
Tagging is boring and despairing for me (but subcolumnsets for parallel texts are in pause mode now). As far as I can recall, there are three things pending regarding spans: languages, alternative text and actual text (/StructElem is missing there). Language tagging should be able for some block elements (at least, delimitedtext and probably description).
Thanks for looking into it (I have no use cases and no possibilities to test and Mikael depends on what means and demands the university puts on it.)
Many thanks for mentioning it. The latest tagging sample mentioned in the list (shorter version, https://perspectivia.net/servlets/MCRFileNodeServlet/pnet_derivate_00007290/...) contains also some tagging that is really hard to believe it might be right, such as: /P <>BDC 0.145 Tw 9 0 0 9 66.4399 67.6929 Tm [(e)13.6 (.)27.4 (g)16.2 (. Peter Frankopan)-7 (, )]TJ EMC /P <>BDC /TT1 1 Tf 10.022 0 Td [(The Earth Transformed: An )-0.6 (Untold History)]TJ EMC At least, there are the following issues here: 1. Too many /P elements for what should be /Span elements. 2. “The Earth Transformed: An Untold Story” is English, not German. 3. /MCID are orphan (they lack their corresponding /StructElem). #2 may be a human honest mistake, but #1 and #3? My impression is that many (when not most) people may be struggling with proper tagging. Many thanks for your reply and your help, Pablo