PDA

View Full Version : HTML parser using Qt3



lni
27th June 2007, 03:49
Hi,

Which Qt class can be used to parse HTML string?

Thanks.

guilugi
27th June 2007, 08:46
Maybe you could use xml classes to achieve this !

http://doc.trolltech.com/3.3/xml.html

lni
3rd July 2007, 19:30
Maybe you could use xml classes to achieve this !

http://doc.trolltech.com/3.3/xml.html

I am afraid not.

XML requires to have open and close tags. It would fails, for instance, at line break <BR>, or horizontal line <HR>.

I am trying to find similar library that is used in lynx, usage "lynx -dump http://www.qtcentre.org", but it does not seem to have an easy one. Was trying to extracted from lynx source codes but I gave up due to complexity...

guilugi
3rd July 2007, 21:47
I know some people managed to use Xerces in order to parse HTML...seems it can be with Cyberneko (Java)...
Maybe you can be inspired :)