I've run into the problem of Google continuing to index pages and subfolders that no longer exist. They're not in my SiteMap.xml nor do they exist on the web server, yet Google keeps trying to access (and give 404 Not Found errors) to these same pages over and over. Some of those pages were removed from the server over a year ago.
Should I take the URLs of those listed in the 404 errors on Google's webmaster tools and add them to my robots.txt file as "Disallow"?
The webserver/hosting provider our company uses doesn't allow me to access and/or create an .htaccess or any other server admin functions (I can't even get the FrontPage admin functions to work, even though the server is supposed to have FrontPage server extensions installed.)