Oh I agree with that, whatever info it picks up, is what info it uses its complex text matching techniques on, it doesn't care wether you have alt tags, or if you use invalid attributes like margin= in your body tag. The robot might read the doctype so it knows how to parse code, on the other hand, it may not. The small spider I wrote for my own website doesn't care at all about syntax, it strips all tags and just looks for text, although bad tags could cause invalid stripping like </title might strip everything between <title> and <body>...
It is worth just running your code through a validator to check syntax, ensure all your page is being seen, just as part of your task in making yourself search engine friendly.
|