Reparse HTML buffer with QWebKit support



saxon
19th January 2012, 00:38
Hi.
I need to process potentially malformed HTML markup and get back the cleaned-up XHTML (or, optionally, HTML) that WebKit produces in its lenient 'dirty' parsing mode.
I don't need non-HTML blocks (scripts, styles) processed; they should be kept as is.
What I do:
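//////////////////////////////
QWebPage page;
// setHtml() is a member of QWebFrame, not QWebPage
page.mainFrame()->setHtml( content );

// slot, presumably connected to the page's loadFinished(bool) signal
...onFinishLoading( bool )
{
    page.mainFrame()->toHtml();
}
//////////////////////////////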
But it doesn't work in general. For simple raw HTML it works fine.
But when I try to parse the content of a real URL (a wiki page, for example), onFinishLoading receives false and toHtml() returns <html></html>

Added after 1 58 minutes:

Thanks all.
I've found a solution.
These lines must be added before the setHtml() call so that non-HTML content is not processed.

QWebSettings * settings = QWebSettings::globalSettings();
settings->setAttribute(QWebSettings::AutoLoadImages, false);
settings->setAttribute(QWebSettings::JavascriptEnabled, false);
settings->setAttribute(QWebSettings::JavaEnabled, false);
settings->setAttribute(QWebSettings::PluginsEnabled, false);
settings->setAttribute(QWebSettings::PrivateBrowsingEnabled, true);