On 12-11-2011 17:40, Khaled Hosny wrote:
On Sat, Nov 12, 2011 at 05:31:23PM +0100, Philipp Gesang wrote:
(Beware that processing wiki text from WP is extremely complicated due to WP’s using special plugins (“templates” and stuff). So the only way to make sure that a parser accept any well formed WP page would be to include all those plugins. Which would entail rewriting the PHP code in Lua for use as a context script. And then you’d have to decide for every plugin what its output should look like in Context.[0] If you have the time ...)
I think scraping the MediaWiki-generated HTML would be simpler.
Doesn't it also depend on the first line being recognizable as such? Hans ----------------------------------------------------------------- Hans Hagen | PRAGMA ADE Ridderstraat 27 | 8061 GH Hasselt | The Netherlands tel: 038 477 53 69 | voip: 087 875 68 74 | www.pragma-ade.com | www.pragma-pod.nl -----------------------------------------------------------------