WebProWorld
Web Programming Discussion Forum Working with an API? Developing a plugin? Writing a Mod or script for your favorite blog, Web 2.0 site or Forum? Welcome.

  #1 (permalink)  
Old 05-29-2008, 03:55 PM
wige's Avatar
Moderator
WebProWorld Moderator
 
Join Date: Jun 2006
Location: United States
Posts: 2,648
wige RepRank 9
Default An idea for a spider emulator Firefox plugin

I have an idea for a plugin for Firefox that will render the page in a way similar to how a spider might see that page - for example, highlighting links and different types of text, suppressing client-side scripting, etc. I know there are tools and web sites that already allow similar functionality, but personally, I prefer being able to analyze pages of a site directly in my browser, rather than having to go to an outside program/site to do the test.

Would you be interested in using such an add-on? Also, would you be interested in assisting with the development or testing of the addon?
__________________
The best way to learn anything, is to question everything.
  #2 (permalink)  
Old 05-29-2008, 05:52 PM
Webnauts's Avatar
WebProWorld 1,000+ Club
WebProWorld MVP
 
Join Date: Aug 2003
Location: Worldwide
Posts: 8,164
Webnauts RepRank 9
Default Re: An idea for a spider emulator Firefox plugin

Quote:
Originally Posted by wige View Post
I have an idea for a plugin for Firefox that will render the page in a way similar to how a spider might see that page - for example, highlighting links and different types of text, suppressing client-side scripting, etc. I know there are tools and web sites that already allow similar functionality, but personally, I prefer being able to analyze pages of a site directly in my browser, rather than having to go to an outside program/site to do the test.

Would you be interested in using such an add-on? Also, would you be interested in assisting with the development or testing of the addon?
I am for sure interested!
__________________
"Being an expert isn't telling other people what you know. It's understanding what questions to ask, and flexibly applying your knowledge to the specific situation at hand. Being an expert means providing sensible, highly contextual direction." Jeff Atwood
SEO Workers - Search Engine Optimization Consulting Company | SEO Analysis Tool | Webnauts Net SEO
  #3 (permalink)  
Old 05-30-2008, 01:59 AM
incrediblehelp's Avatar
WebProWorld 1,000+ Club
WebProWorld MVP
 
Join Date: Jan 2004
Location: Live in Cincy Now
Posts: 7,573
incrediblehelp RepRank 4
Default Re: An idea for a spider emulator Firefox plugin

I will help test, but that is all I have time for now.
  #4 (permalink)  
Old 05-30-2008, 10:08 AM
DaveSawers's Avatar
WebProWorld Veteran
 
Join Date: Dec 2006
Location: Calgary, Alberta, Canada
Posts: 492
DaveSawers RepRank 3
Default Re: An idea for a spider emulator Firefox plugin

Don't take this the wrong way, but I find it hard to understand what use such a tool would be.

Search engine spiders each see different things. Search engine providers prefer to keep the details of their algorithms secret and change them regularly.

Whilst I can see that it is useful to calculate things such as keyword density, etc. what else would this tool tell me and what benefit would I get from it?
__________________
Dynamic Software Development
www.activeminds.ca
  #5 (permalink)  
Old 05-30-2008, 10:39 AM
wige's Avatar
Moderator
WebProWorld Moderator
 
Join Date: Jun 2006
Location: United States
Posts: 2,648
wige RepRank 9
Default Re: An idea for a spider emulator Firefox plugin

The current concept:

The first iteration would simply remove scripting, stylistic tags, and CSS content, convert images to text, and display expanded information about links on the page (showing the alt text, etc.). This would give a quick overview of the page so someone can spot the more obvious (and likely) issues, such as JavaScript-based navigation that the bots can't follow.
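
As a rough illustration of that first iteration, here is a minimal Python sketch. It is not plugin code (a real add-on would work against the browser's own DOM), and the names `SpiderView` and `spider_view` are hypothetical. It only assumes what the paragraph above describes: drop script and style content, stand in alt text for images, and list the links with their anchor text.

```python
from html.parser import HTMLParser


class SpiderView(HTMLParser):
    """Collect visible text and links, skipping <script> and <style> content."""

    SKIPPED = {"script", "style"}

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.text = []        # text fragments in document order
        self.links = []       # (href, anchor text) pairs
        self._skip_depth = 0  # > 0 while inside <script> or <style>
        self._link = None     # [href, fragments] of the <a> being read

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag in self.SKIPPED:
            self._skip_depth += 1
        elif tag == "img":
            # Stand-in for "convert images to text": show the alt text.
            self._add("[image: %s]" % (attrs.get("alt") or "no alt text"))
        elif tag == "a" and attrs.get("href"):
            self._link = [attrs["href"], []]

    def handle_endtag(self, tag):
        if tag in self.SKIPPED and self._skip_depth:
            self._skip_depth -= 1
        elif tag == "a" and self._link is not None:
            href, parts = self._link
            self.links.append((href, " ".join(parts)))
            self._link = None

    def handle_data(self, data):
        # Script and style bodies arrive here too, so skip them explicitly.
        if not self._skip_depth and data.strip():
            self._add(data.strip())

    def _add(self, fragment):
        self.text.append(fragment)
        if self._link is not None:
            self._link[1].append(fragment)


def spider_view(html):
    """Return (plain text, list of (href, anchor text)) for an HTML string."""
    parser = SpiderView()
    parser.feed(html)
    parser.close()
    return " ".join(parser.text), parser.links
```

Calling `spider_view(page_source)` gives back the text a script-less reader would get, plus the links it could follow. Links that exist only inside scripts never show up in that list, which is the kind of problem this view is meant to expose.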

The next version would include the ability to hide and/or highlight sections of the page that the spiders may see as suspicious, for example marking tiny text or hidden text.
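
A sketch of how that marking might start, again in Python and purely as an assumption: nobody outside the search engines knows what they actually treat as suspicious, so the 6px threshold and the names `suspicious_style`, `HiddenTextFinder`, and `find_hidden_text` are made up for illustration. It looks only at inline `style` attributes and assumes reasonably well-formed markup; external stylesheets would need a real rendering engine.

```python
import re
from html.parser import HTMLParser

# Assumed threshold: text smaller than this many pixels counts as "tiny".
TINY_PX = 6.0
# Elements that never have a closing tag, so they are kept off the stack.
VOID_TAGS = {"br", "img", "hr", "input", "meta", "link"}


def suspicious_style(style):
    """Return a reason if an inline style hides or shrinks text, else None."""
    compact = style.lower().replace(" ", "")
    if "display:none" in compact:
        return "display:none"
    if "visibility:hidden" in compact:
        return "visibility:hidden"
    match = re.search(r"font-size:(\d+(?:\.\d+)?)px", compact)
    if match and float(match.group(1)) < TINY_PX:
        return "font-size below %gpx" % TINY_PX
    return None


class HiddenTextFinder(HTMLParser):
    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.flagged = []  # (reason, text) pairs
        self._stack = []   # per open element: its own or inherited reason

    def handle_starttag(self, tag, attrs):
        if tag in VOID_TAGS:
            return
        inherited = self._stack[-1] if self._stack else None
        reason = suspicious_style(dict(attrs).get("style") or "")
        self._stack.append(reason or inherited)

    def handle_endtag(self, tag):
        if tag not in VOID_TAGS and self._stack:
            self._stack.pop()

    def handle_data(self, data):
        if data.strip() and self._stack and self._stack[-1]:
            self.flagged.append((self._stack[-1], data.strip()))


def find_hidden_text(html):
    """Return (reason, text) pairs for text inside hidden or tiny elements."""
    finder = HiddenTextFinder()
    finder.feed(html)
    finder.close()
    return finder.flagged
```

The stack is there so that text nested inside a hidden container is flagged as well, not just text that sits directly in the styled element.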

This is of course subject to change as I get feedback and develop the concept more.
__________________
The best way to learn anything, is to question everything.
  #6 (permalink)  
Old 05-30-2008, 11:04 AM
WebProWorld Veteran
 
Join Date: Jul 2004
Posts: 913
activeco RepRank 2
Default Re: An idea for a spider emulator Firefox plugin

Quote:
Originally Posted by DaveSawers View Post
Search engine spiders each see different things. Search engine providers prefer to keep the details of their algorithms secret and change them regularly.
No, they all see the same thing (well, most of them). It has nothing to do with algorithms, but rather a simple spider view, just a bit above the code level.
Something like Google's text cache: Activeminds Software Development & Website Design

With such a view it is much easier to spot problems which are usually hard to see when fully rendered.

I personally use Lynx for the purpose, but I'm pretty sure a lot of Windows users would appreciate the plugin.
__________________
Impossible? You just underestimate the time.
  #7 (permalink)  
Old 05-30-2008, 11:29 AM
incrediblehelp's Avatar
WebProWorld 1,000+ Club
WebProWorld MVP
 
Join Date: Jan 2004
Location: Live in Cincy Now
Posts: 7,573
incrediblehelp RepRank 4
Default Re: An idea for a spider emulator Firefox plugin

Doesn't the Web Developer plugin for FF do all of this?
  #8 (permalink)  
Old 05-30-2008, 11:55 AM
DaveSawers's Avatar
WebProWorld Veteran
 
Join Date: Dec 2006
Location: Calgary, Alberta, Canada
Posts: 492
DaveSawers RepRank 3
Default Re: An idea for a spider emulator Firefox plugin

Quote:
Originally Posted by activeco View Post
No, they all see the same thing (well, most of them). It has nothing to do with algorithms, but rather a simple spider view, just a bit above the code level.
Something like Google's text cache: Activeminds Software Development & Website Design

With such a view it is much easier to spot problems which are usually hard to see when fully rendered.
I don't pretend to be an SEO specialist, so apologies if this is old hat. Just trying to learn a little more...

OK, so the text view is mildly interesting, but what can I learn from it?

And while I'm asking questions, why don't the spiders look at Javascript? It's not like it hasn't been around for years and it's not as if it doesn't contain interesting links and information relevant to search engines. I used to think it was just laziness on the part of spider developers but I hope there's a better reason. With more and more AJAX applications around, not probing the Javascript is going to make the search engines increasingly out of touch with the very content they should be indexing.
__________________
Dynamic Software Development
www.activeminds.ca

Last edited by DaveSawers; 05-30-2008 at 11:58 AM.
  #9 (permalink)  
Old 05-30-2008, 12:31 PM
WebProWorld Veteran
 
Join Date: Jul 2004
Posts: 913
activeco RepRank 2
Default Re: An idea for a spider emulator Firefox plugin

Quote:
Originally Posted by DaveSawers View Post
And while I'm asking questions, why don't the spiders look at Javascript?
Simply because of "what they see is what they get". As you know it is a client side scripting language, so they need an interpreter too, which would require huge resources. The same for flash, etc.

Actually, Google often looks for links inside the JS code, and they have been trying harder lately: "We're getting better at parsing some JavaScript, but we still don't get it all 100%."
__________________
Impossible? You just underestimate the time.
  #10 (permalink)  
Old 05-30-2008, 12:43 PM
wige's Avatar
Moderator
WebProWorld Moderator
 
Join Date: Jun 2006
Location: United States
Posts: 2,648
wige RepRank 9
Default Re: An idea for a spider emulator Firefox plugin

The main purpose I had in mind was to make it so the user can spot navigation that doesn't work. But I would also want to expand on it to show possible problem areas.
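
One way to picture "navigation that doesn't work" is a check for anchors that lead nowhere unless a script runs. The Python sketch below is only an assumption about what such a check could look like; `link_problem`, `NavigationChecker`, and `broken_navigation` are hypothetical names, and the rules (a `javascript:` URL, or an empty or `#` href) are simple heuristics rather than anything a search engine has published.

```python
from html.parser import HTMLParser


def link_problem(attrs):
    """Return why a script-less spider could not follow this <a>, else None."""
    if "href" not in attrs and "onclick" not in attrs:
        return None  # plain named anchor, not a navigation link
    href = (attrs.get("href") or "").strip()
    if href.lower().startswith("javascript:"):
        return "href is a javascript: URL"
    if href in ("", "#"):
        if attrs.get("onclick"):
            return "navigation happens only in onclick"
        return "no usable href"
    return None


class NavigationChecker(HTMLParser):
    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.problems = []  # (reason, raw href or None)

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            return
        attrs = dict(attrs)
        reason = link_problem(attrs)
        if reason:
            self.problems.append((reason, attrs.get("href")))


def broken_navigation(html):
    """List the links on a page that depend on scripting to work."""
    checker = NavigationChecker()
    checker.feed(html)
    checker.close()
    return checker.problems
```

A plugin could use the same rules to highlight the offending links in place instead of returning a list.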

As far as Javascript, I have heard a few reasons why the spiders don't use it. The main thing is that implementations of JavaScript vary from client to client, so what standard should the bot follow in interpreting the script? Should it process it the way IE would, or the way Firefox would? Should it use the latest engine, or an old engine, or some combination? What about users who don't use JavaScript: would they have a different experience, or be taken to a page that would not normally be accessible to them?

Processing Javascript would also require the spider to run the page through a fairly complex parser to process each page, which would increase the resources required. In addition, it could open up an avenue to spam the engines, as pages could be coded so the search engines would see the content, but the script would break in most browsers.

JavaScript is also left alone because it gives a way to hide content from the search engines. If there are links that you want to ensure most spiders and bots won't follow, the surest way is to obscure them in JavaScript. This will stop normal search spiders and many spam bots.

Search engines do, however, take at least a cursory look at JavaScript, but this is done more as a quality control measure. The spider will look for redirects and content that appears spammy, without fully processing the script. Some discovery of links is also done, but this tends to be unreliable.
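
To make the idea of a cursory look concrete, here is a small Python sketch that scans script source as plain text, without running it. This is a guess at the general approach, not a description of what any search engine does; `scan_script` and both patterns are invented for the example.

```python
import re

# String literals that look like absolute URLs or site-relative paths.
URL_LITERAL = re.compile(r"""["']((?:https?://|/)[^"'\s]+)["']""")
# Assignments such as `window.location = "..."` or `location.href = '...'`.
REDIRECT = re.compile(r"""\blocation(?:\.href)?\s*=\s*["']([^"']+)["']""")


def scan_script(source):
    """Pull redirect targets and URL-like strings out of script text."""
    redirects = REDIRECT.findall(source)
    # Report a URL as a plain link only if it is not already a redirect target.
    links = [url for url in URL_LITERAL.findall(source) if url not in redirects]
    return {"redirects": redirects, "links": links}
```

A URL that the script assembles at run time, for example by concatenating `"/prod" + "ucts"`, is missed or found only in part by this kind of scan, which fits the point that link discovery without an interpreter is unreliable.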
__________________
The best way to learn anything, is to question everything.
  #11 (permalink)  
Old 05-30-2008, 12:56 PM
DaveSawers's Avatar
WebProWorld Veteran
 
Join Date: Dec 2006
Location: Calgary, Alberta, Canada
Posts: 492
DaveSawers RepRank 3
Default Re: An idea for a spider emulator Firefox plugin

Quote:
Originally Posted by activeco View Post
As you know it is a client side scripting language, so they need an interpreter too, which would require huge resources. The same for flash, etc.
Javascript isn't that resource hungry so I doubt the required resources would be "huge".

SWF files can now be opened up for examination, although I don't know how far that goes. If they just extract text, then that is of limited use, as lots more interesting stuff is likely to be buried in the ActionScript.
__________________
Dynamic Software Development
www.activeminds.ca
  #12 (permalink)  
Old 05-30-2008, 01:04 PM
DaveSawers's Avatar
WebProWorld Veteran
 
Join Date: Dec 2006
Location: Calgary, Alberta, Canada
Posts: 492
DaveSawers RepRank 3
Smile Re: An idea for a spider emulator Firefox plugin

Quote:
Originally Posted by wige View Post
As far as Javascript, I have heard a few reasons why the spiders don't use it...
None of which I find particularly compelling.

I'm particularly interested in this aspect because it seems that the emphasis on web site development is becoming more and more directed towards search engine placement and less on providing compelling information to end users. That's the tail wagging the dog. If search engines can't be bothered to properly index ALL content maybe it's time for a revised paradigm.

Of course I realize that if excellent content can't be easily found it might as well not be there so there's no doubt search engines are important. They just seem to me to be doing a poor job.

Sorry for hijacking your thread.
__________________
Dynamic Software Development
www.activeminds.ca
  #13 (permalink)  
Old 05-30-2008, 03:41 PM
Webnauts's Avatar
WebProWorld 1,000+ Club
WebProWorld MVP
 
Join Date: Aug 2003
Location: Worldwide
Posts: 8,164
Webnauts RepRank 9
Default Re: An idea for a spider emulator Firefox plugin

Wige, I think if it is done in a way similar to how a page is viewed in Lynx, it will be excellent.
__________________
"Being an expert isn't telling other people what you know. It's understanding what questions to ask, and flexibly applying your knowledge to the specific situation at hand. Being an expert means providing sensible, highly contextual direction." Jeff Atwood
SEO Workers - Search Engine Optimization Consulting Company | SEO Analysis Tool | Webnauts Net SEO
  #14 (permalink)  
Old 05-30-2008, 04:35 PM
WebProWorld Veteran
 
Join Date: Apr 2004
Posts: 349
imvain2 RepRank 1
Default Re: An idea for a spider emulator Firefox plugin

This may be done initially with Greasemonkey.