iEntry 10th Anniversary Forum Rules Search
WebProWorld
Register FAQ Calendar Mark Forums Read
Search Engine Optimization Forum SEO is much easier with help from peers and experts! The WebProWorld SEO forum is for the discussion and exploration of various search engine optimization topics. Any non (engine) specific SEO or SEM topics should go here.

Share Thread: & Tags

Share Thread:

Reply
 
LinkBack Thread Tools Display Modes
  #1 (permalink)  
Old 12-14-2004, 02:22 AM
WebProWorld New Member
 
Join Date: Mar 2004
Location: Ocala, FL
Posts: 17
PastorRick RepRank 0
Default Use of ? marks in URLs

Hi Folks,
I have two questions please. One involves extensions and how changing them affects SE's re-spidering your site and the other has to do with how sites with ?'s marks get spidered.

Will changing ".htm to .php" affect search engine optimization? I'm helping someone with a site and they have a lot of pages. They are not using any templates or includes so the navigation is a bit of a mess. I want to use includes to make the navigation the same on each page but they are worried about me changing the extensions. Will it have an affect?

I am using aardvark topsites as a topsites list for someone and they are worried about the question marks in the URLs. They want to make sure each page including those after the ? get spidered. Does anyone know how I can remove them or change it so that it will be spidered correctly by search engines?
__________________
Pastor Rick
Crossway Online Ministries
http://ecrossway.com http://crosswayisp.net http://crosswaywebs.com
Reply With Quote
  #2 (permalink)  
Old 12-14-2004, 03:59 AM
WebProWorld 1,000+ Club
 
Join Date: Jul 2003
Location: Toronto, Canada
Posts: 1,782
cyanide RepRank 0
Default

The extension of pages... php, asp, htm, html, etc. has nothing to do with page rank. The question mark won't be a problem either
__________________
|
Web Hosting Guru
| Need Help For Your Forum?
Reply With Quote
  #3 (permalink)  
Old 12-14-2004, 04:59 PM
WebProWorld New Member
 
Join Date: Mar 2004
Location: Ocala, FL
Posts: 17
PastorRick RepRank 0
Default

OK, now I am confused. I have read and heard that Google robot for example, doesn't parse URL's past a ? which is why replicated sites cannot get into the SE's.

I know that google can index pages with all those extensions. The problem I see is that if google has page.html and you change it to page.php when google goes to check your site, it will not be found. This seems to me that you would have to start the whole promotion all over again. This hurts when you already have a page rank of 5 or higher.

Other Thoughts?
__________________
Pastor Rick
Crossway Online Ministries
http://ecrossway.com http://crosswayisp.net http://crosswaywebs.com
Reply With Quote
  #4 (permalink)  
Old 12-14-2004, 05:13 PM
WebProWorld 1,000+ Club
 
Join Date: Jul 2003
Location: Toronto, Canada
Posts: 1,782
cyanide RepRank 0
Default

Quote:
Originally Posted by PastorRick
OK, now I am confused. I have read and heard that Google robot for example, doesn't parse URL's past a ? which is why replicated sites cannot get into the SE's.
You are either listening to someone who is not informed or read an article that is way out of date.

If you are changing the page names, then you will have to put some re-direct lines in your htaccess file

such as:
Code:
redirect 301 /oldpage.html http://domain.com/newpage.php
__________________
|
Web Hosting Guru
| Need Help For Your Forum?
Reply With Quote
  #5 (permalink)  
Old 12-14-2004, 06:41 PM
WebProWorld Pro
 
Join Date: May 2004
Posts: 150
AndrewX RepRank 0
Default

Some have theorized that non-cgi extentions like .html are preferred over cgi, because cgi scripts can be like a bottom-less hole, the ?arg can simply increase by one and the software could return a page that is not really unique. This is easier to understand if you've ever tried to implement a spider for a search index.

Look into mod rewrite. This will let you keep your pages as they are, but they can be rewritten by your server into a nicer name, something that looks like a normal static HTML page URL.
__________________
Money Talk || SEO + Directory = SEOMA | SEO 1 | Link Vault
Reply With Quote
  #6 (permalink)  
Old 12-15-2004, 01:49 PM
WebProWorld New Member
 
Join Date: Mar 2004
Posts: 23
ringofire RepRank 0
Default ? in Urls do matter...

If you are creating dynamic pages with a ? in the url, Google typically will not spider beyond 10 characters past the ?. They recommend that you put items like session ids in a cookie instead. It really depends on the composition of the entire page.

Another option is to use mod rewrite to create search-friendly urls. (as AndrewX also recommended)

Bernadette~
Reply With Quote
  #7 (permalink)  
Old 12-15-2004, 06:08 PM
WebProWorld New Member
 
Join Date: Jun 2004
Posts: 13
savvymedias RepRank 0
Default

Trying to make dynamics static is the biggest waste of time out there. Time better spent making the site full of relevant content.

Make sure you have valid code, meta tags, relevant text, a site map and try to get a least a few new permanent links from people every week and you will be fine!

Check out lingerieatlarge.com and babysunprotection.com on google -- same shopping cart.
Reply With Quote
  #8 (permalink)  
Old 12-15-2004, 06:09 PM
WebProWorld New Member
 
Join Date: Jun 2004
Posts: 13
savvymedias RepRank 0
Default

Trying to make dynamics static is the biggest waste of time out there. Time better spent making the site full of relevant content.

Make sure you have valid code, meta tags, relevant text, a site map and try to get a least a few new permanent links from people every week and you will be fine!

Check out lingerieatlarge.com and babysunprotection.com on google -- same shopping cart.
Reply With Quote
  #9 (permalink)  
Old 12-15-2004, 06:25 PM
WebProWorld Member
 
Join Date: Oct 2003
Location: New York City
Posts: 80
Jayms RepRank 0
Default

Quote:
Another option is to use mod rewrite to create search-friendly urls. (as AndrewX also recommended)
I third this as making your urls search engine friendly makes all of the difference in the world and there will never be a doubt as to whether your pages will be indexed.
Reply With Quote
  #10 (permalink)  
Old 12-15-2004, 06:49 PM
WebProWorld Member
 
Join Date: Apr 2004
Location: Australia
Posts: 28
newmarket RepRank 0
Default

I don't know about ? marks, but you could just set your web server to parse .htm or .html files with php.

That way you wouldn't necessarily need to change the file extensions and you would still be able to use php.

Off the top of my head, I can't remember, but it is a simple entry in the .htaccess file.

I hope that this helps.
Reply With Quote
  #11 (permalink)  
Old 12-15-2004, 07:37 PM
hal hal is offline
WebProWorld Member
 
Join Date: Nov 2003
Location: US -- New England
Posts: 78
hal RepRank 0
Default

Hello PastorRick,

The people over at Google have learned to never say never, because there always seem to be exceptions. So, what they do say tends to be rather generic.

At http://www.google.com/webmasters/2.html they say

Quote:
We are able to index dynamically generated pages. However, because our web crawler can easily overwhelm and crash sites serving dynamic content, we limit the amount of dynamic pages we index
At http://www.google.com/webmasters/guidelines.html they say

Quote:
If you decide to use dynamic pages (i.e., the URL contains a '?' character), be aware that not every search engine spider crawls dynamic pages as well as static pages. It helps to keep the parameters short and the number of them small.
And, upon doing a search recently for a "similar to" using as a seed a URL with a "?" in it, I remember seeing a note that they only check the first 10 ... was it "words?" "Elements?" <sigh> I forget the exact phrase.

In any case, there is a limit. The result is that the shorter the better. And many people have translated this into "difficulty with ? means don't do it at all."

Hope this helps.

Hal
__________________
Baughan & Company offers personalized Web site support to small business. http://www.rocksolidsite.com
Reply With Quote
  #12 (permalink)  
Old 12-15-2004, 07:57 PM
godzilla's Avatar
WebProWorld Member
 
Join Date: Aug 2003
Location: Montreal, CA
Posts: 52
godzilla RepRank 0
Default

From personal experience I can tell that google spiders have no problems indexing files with *.php?page=*. As example: http://godzilla.bounceme.net/~godzil...p?page=soen357 is easily indexed. Note also that the rankings of such pages do not suffer in any fashion.

But as all previous users have mentionned, any crawler google or not may have difficulty to index long url's with "?". As an example:http://godzilla.bounceme.net/~godzil...JtXzIwMDIuZG9j is sometimes indexed, sometimes not.
__________________
Everything has its reason.
Your greatest glory doesn't consist in falling, but rising every time you fall and this is what defines a perfect human.
Godzilla's pass-time
Reply With Quote
  #13 (permalink)  
Old 12-16-2004, 12:55 AM
WebProWorld Pro
 
Join Date: Sep 2003
Location: Mars
Posts: 171
alienzhavelanded RepRank 0
Default

Changing the page extension could affect your rankings, but there are ways around it. For example, with my current project I'm converting an ASP site into PHP. I will have to leave a 301 redirect in the ASP pages for the spiders to get the new address. The only catch in my situation is the pages will have duplicate content (for a short while)if I do it that way. However the new design will greatly improve the site so I'm willing to take a hit on the ranks. I can also safely assume the new pages will be indexed soon since the spiders regularly visit the site.

You can also do things with custom 404 pages and .htaccess to point anyone linking to the old page to the new one as well as cyanide pointed out.
Reply With Quote
  #14 (permalink)  
Old 12-16-2004, 04:37 AM
WebProWorld New Member
 
Join Date: Nov 2004
Location: Indiana
Posts: 24
Tamera Billings RepRank 0
Default

Can anyone tell me if a ? in a reciprocal link is counted as a back link?

For instance this is the link I got back from trading with a competitor:
http://www.leathertree.com/otherl.as...A016672DD54%7D

Does Google count this as a back link to my site?
Reply With Quote
  #15 (permalink)  
Old 12-16-2004, 06:03 AM
WebProWorld New Member
 
Join Date: Aug 2004
Location: London
Posts: 24
BobbyLocksville RepRank 0
Default

Inbound links to your site should really be direct links e.g Link to this cool site

However, if the page you are linking from has page rank, and the link can be followed by the Google spider, then yes, it will be included.

In my experience Google spider tends to stay away from links with more than two variables.

http://www.domain.com/viewpage.asp?id=234&side=2134
Would be indexed.

Where
http://www.domain.com/viewpage.asp?i...bv234hxd34dsff

Would probably be avoided.

The spider tends to avoid these kind of links as it will assume the site is highly dynamic and it could possible crash the site.

Keep your inbound links simple where possible.

- Bobby
__________________
Bobby Locksville
FREE MONTH TRIAL - Rent dvd online UK

Web Designer London
Reply With Quote
  #16 (permalink)  
Old 12-16-2004, 11:06 AM
WebProWorld Pro
 
Join Date: Jul 2004
Location: Virginia Beach, VA
Posts: 110
cryptblade RepRank 0
Default

Hi Pastor Rick,

I checked your site. The parameters in URLs that MAY affect your rankings don't just include "?". Currently, you have a lot of parameters.

Like others have said, a mod rewrite is a good way to go. After all, you are concerned with this issue, and a solution is available.

Always err on the side of caution when in doubt. A mod rewrite makes the URLs search-friendly - that is a good thing. Therefore, it doesnt hurt to spend the time doing that.

One poster said just focus on content. That poster might be a "disciple" if Jill Whalen of the SEO camp that contend that content and only content is king. That isn't necessarily true - as black hat seos prove time and time again. Besides that, it would be unfair for you to have to keep adding content after content and even spend time on the web building tenure before you got any rankings.

Cleaning the code up, url mod rewrite, titles, even meta tags are all good to do.
Reply With Quote
  #17 (permalink)  
Old 12-17-2004, 09:47 AM
DrTandem1's Avatar
WebProWorld 1,000+ Club
 
Join Date: Oct 2003
Location: Encinitas, CA
Posts: 1,830
DrTandem1 RepRank 2
Default

I think a more important question is whether or not the content on a dynamic page such as .php will be indexed. Basically, the spiders don't see the content derived from the PHP script. So, worrying about the identifying characters after the "?" is really missing the point.

In other words, whether or not the spider will read beyond the "?" has absolutely no relation to it being able to index what you think is on the page.
__________________
DrTandem's San Diego Web Page Design, drtandem.com
Reply With Quote
  #18 (permalink)  
Old 12-18-2004, 01:17 PM
WebProWorld New Member
 
Join Date: May 2004
Posts: 1
tsarouhan RepRank 0
Default Dynamic Site

I think "savvymedias" is correct that mod writing is a waste of time if you properly design the data driven site. You can use parameters for most search engines but keep them short and keep them to no more than 2 parameters (example: www.yoursite.com?pid=23&uid=5). I have recently redesigned a dynamic site using asp.net and we went from 30 pages indexed to 14,000 pages indexed. They are an ecommerce site with about 4,000 products. It does work if you do it right. Check out the site for some tips (www.optcorp.com)

Good Luck
Reply With Quote
Reply

  WebProWorld > Search Engines > Search Engine Optimization Forum

Thread Tools
Display Modes

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off
Trackbacks are On
Pingbacks are On
Refbacks are On



All times are GMT -4. The time now is 01:29 AM.



Search Engine Optimization by vBSEO 3.3.0