Google, Yahoo, Bing and Wiki Leaks definitely don't use FrontPage.
You could even view the code and copy and paste it into WordPad. Many great sites are made by WordPad, copy and paste and modification so the content is unreckognizeable. Some even advice a newbie to start with WordPad and a ftp program
But I would not call it content (screen) scraping And if the site has more than a few pages it is inherently slow to use that technique.
And if you want to separate content from markup etc. you need a scraper with parsing abilities that could even put the content directly into your database.
Inline assembly and / or plain c is fastest.
Last edited by kgun; 12-31-2010 at 11:51 AM.
Mini Network:: Financial information at your fingertips
Learn object oriented programming where it started
Conversations creates communities and conversions create profit.