Thanks! That's what I wanted to know. Seems like I have no other options, regular expressions would be too impractical in my case, and XML parsers are not able to parse HTML as it's said here http://stackoverflow.com/a/5202694

So now my only remaining problem is to solve the WebCoreNode leak, any ideas how ? I tried deleteLater() on everything, didn't help. WebCoreNode leak warning shows only when I include the abovementioned code, so the root cause should be there.