WARNING:
Please note that this article was published a long time ago. The information contained might be outdated. My new project is online. It's called Web Dehydrator and it can be described as a tool that transforms any web-page to JSON. Web Dehydrator is made by a mix of Zend Framework 2, Symfony DomCrawler and PhantomJS. This is what each component does: I haven't published the code behind the Web Dehydrator service, but I could share it if someone is interested in helping. The following is a sample JSON output of the result of the data extracted from the http://www.dilbert.com/ website: