\starttext
$\int_{i=1}^{\infty} x^2$
\stoptext
$ mudraw -ttt test.pdf
<?xml version="1.0"?>
<document name="test.pdf">
warning: ignoring surrogate pair mapping in cmap
<page>
<block bbox="280.534 67.7741 286.392 84.595">
<line bbox="280.534 67.7741 286.392 84.595">
<span bbox="280.534 67.7741 286.392 84.595" font="LMRoman12-Regular"
size="11.9552">
<char bbox="280.534 67.7741 286.392 84.595" c="1"/>
</span>
</line>
</block>
<block bbox="75.4124 101.448 84.9435 156.849">
<line bbox="75.4124 101.448 84.9435 156.849">
<span bbox="75.4124 101.448 84.9435 156.849" font="LatinModernMath-Regular"
size="8.36861">
<char bbox="75.4124 101.448 84.9435 156.849" c="∞"/>
</span>
</line>
</block>
<block bbox="72.2341 100.779 80.1837 179.922">
<line bbox="72.2341 100.779 80.1837 179.922">
<span bbox="72.2341 100.779 80.1837 179.922" font="LatinModernMath-Regular"
size="11.9552">
<char bbox="72.2341 100.779 80.1837 179.922" c="∫"/>
</span>
</line>
</block>
<block bbox="68.8824 124.759 83.5355 180.159">
<line bbox="68.8824 124.759 83.5355 180.159">
<span bbox="68.8824 124.759 83.5355 180.159" font="LatinModernMath-Regular"
size="8.36861">
<char bbox="68.8824 124.759 70.5728 180.159" c="u"/>
<char bbox="70.5728 124.759 72.2631 180.159" c="�"/>
<char bbox="72.2633 124.759 78.7736 180.159" c="="/>
<char bbox="78.7741 124.759 83.5355 180.159" c="1"/>
</span>
</line>
</block>
<block bbox="87.513 100.785 99.7105 179.928">
<line bbox="87.513 100.785 99.7105 179.928">
<span bbox="87.513 100.785 94.3509 179.928" font="LatinModernMath-Regular"
size="11.9552">
<char bbox="87.513 100.785 94.3509 179.928" c="?"/>
</span>
<span bbox="94.9491 109.213 99.7105 164.614" font="LatinModernMath-Regular"
size="8.36861">
<char bbox="94.9491 109.213 99.7105 164.614" c="2"/>
</span>
</line>
</block>
</page>
</document>
On Wed, May 15, 2013 at 6:14 PM, Xan
Al 13/05/13 09:55, En/na luigi scarso ha escrit:
On Sun, May 12, 2013 at 4:32 PM, Xan
wrote: Hi,
I just want to know if there is any tool to convert a pdf (generated by latex or context) to latex source or context source file. Does anyone have got an experience on that?
I'm thinking about two alternatives: * libraries for reading like podofo and custom script for passing pdf context (text) to context commands * pass pdf to jpg, and apply http://detexify.kirelabs.org/classify.htmlfor passing to tex symbols.
For me it's vital to pass mathematical symbols like (\int) to tex symbol and not like utf-8 symbols.
Thanks a lot, Xan.
Have you seen the mudraw program of mupdf http://www.mupdf.com/ ? It has a -t switch that outputs txt and a -tt and -ttt switches that output xml.
-- luigi
Thank you for answering and sorry for delay. I will check it, but I suspect that if I have
$$\int_{i=1}^{\infty} x^2$$
in one latex document and it generates pdf, then mupdf -t of that document does not generate that formula, else "S i=1 x²".
Thanks, Xan.
-- luigi