See HTML Tidy (https://www.html-tidy.org/); it could help you pre-process your HTML files.

Massi

On 16/05/22 17:08, Pablo Rodriguez via ntg-context wrote:
Dear list,
I would like to feed https://seumasjeltzz.github.io/LinguaeGraecaePerSeIllustrata/001.html as XML input for ConTeXt.
The problem is that (as in many other XML files that I haven’t generated myself) some <meta> and <link> tags aren’t closed, for example:
<meta charset="utf-8">
<link href="https://fonts/css?greek" rel="stylesheet">
<link href="style.css" rel="stylesheet">
So, all that I get is the following message:
invalid xml file - parsed text
I have tried the following, without success:
\xmlsetsetup{#1}{html/head/(meta|link)}{-}
Is there no way to make ConTeXt more tolerant, so that it is able to ignore those tags?
Many thanks for your help,
Pablo
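
For reference, the pre-processing idea can be sketched in a few lines of Python: rewrite the unclosed void elements (<meta>, <link>, <br> and so on) as self-closed tags before handing the file to ConTeXt. This is only a rough sketch, not what HTML Tidy does internally. The function name close_void_tags is made up for illustration, and the regular expression assumes that no attribute value contains a ">" character. It does nothing about other sources of ill-formed XML, such as unescaped "&" or unquoted attributes, so a real tool like HTML Tidy is likely the more robust choice.

```python
import re

# HTML void elements: they never take a closing tag in HTML,
# but XML requires them to be self-closed.
VOID_TAGS = ("area", "base", "br", "col", "embed", "hr", "img",
             "input", "link", "meta", "source", "track", "wbr")

# Match an opening void tag, whether or not it is already self-closed.
# Assumption: attribute values never contain "<" or ">".
_VOID_RE = re.compile(
    r"<(%s)\b([^<>]*?)\s*/?>" % "|".join(VOID_TAGS),
    re.IGNORECASE,
)


def close_void_tags(html: str) -> str:
    """Return html with every void element written as <tag .../>."""
    return _VOID_RE.sub(lambda m: "<%s%s/>" % (m.group(1), m.group(2)), html)


if __name__ == "__main__":
    sample = ('<meta charset="utf-8"> '
              '<link href="style.css" rel="stylesheet">')
    print(close_void_tags(sample))
```

Running the function on an already self-closed tag leaves it unchanged, so it is safe to apply to a file more than once.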