iEntry 10th Anniversary Forum Rules Search
WebProWorld
Register FAQ Calendar Mark Forums Read
Google Discussion Forum Google Discussion forum is for topics specifically related to Google. There is a subforum dedicated to AdSense/AdWords subjects.

Share Thread: & Tags

Share Thread:

Reply
 
LinkBack Thread Tools Display Modes
  #1 (permalink)  
Old 06-24-2007, 05:10 PM
WebProWorld Member
 
Join Date: Aug 2003
Location: Ohio, USA
Posts: 46
Ricardo Zea RepRank 0
Exclamation Sitemap's URL structure

Hello all,

I tried looking for the answer in other posts but couldn't find it.

Here's my problem.

I have submitted my sitemap to Google and I see that my results' description are all messed up when I do a "site:www.aidmydebt.com" search in Google.

If you do a search right now the description of the pages look like this one:

"Index of /debt-settlement/HI-Hawaii/Lihue. Name Last modified Size Description. [DIR] Parent Directory 11-May-2007 12:51 - [ ] debt-settlement-Lihu. ..."

My sitemap XML file looks like this for that specific page:

" <url> <loc>http://www.aidmydebt.com/debt-settlement/HI-Hawaii/Lihue/debt-settlement-Lihue.php</loc>
</url> "

For some reason Google is not 'reading' the file "debt-settlement-Lihue.php" in the "/Lihue" folder, and of course, if Google doesn't see the .php file it will stay in the folder.

I have a lot of pages and all my sitemaps' URL structure looks like the one above, the only differences are the states' and cities' names.

Take into account the ending of the URL: ".php"

My question is: Does the sitemap URL structure HAS to end in a "/"?

I have read around the web and seen sitemap.xml examples and in some of them some URLs end in "/" and some others end in a simple ".html"... so I assumed that ending in ".php" would be ok too. Then again, seeing the description of my results in Google makes me want to wonder if I'm doing it wrong (which is probably the case, yet it doesn't make sense) or if it's a Google's issue.

I need to fix this fast.

Thank you for any help you can provide me with.
__________________
Ricardo Zea
Web Designer (X)HTML + CSS Certified by W3Schools
Reply With Quote
  #2 (permalink)  
Old 06-24-2007, 05:28 PM
WebProWorld Member
 
Join Date: Jul 2004
Location: London, England
Posts: 29
ukresident RepRank 0
Default Re: Sitemap's URL structure

"For some reason Google is not 'reading' the file "debt-settlement-Lihue.php" in the "/Lihue" folder, and of course, if Google doesn't see the .php file it will stay in the folder"

how do you know this?

but to answer your question the url can end in any extension as long as the url is valid
__________________
http://www.omarkattan.com
Reply With Quote
  #3 (permalink)  
Old 06-24-2007, 05:29 PM
WebProWorld Member
 
Join Date: Jul 2004
Location: London, England
Posts: 29
ukresident RepRank 0
Default Re: Sitemap's URL structure

sorry and I also meant to say that it can also end with a / as long as that is how you specified your url structure in htaccess
__________________
http://www.omarkattan.com
Reply With Quote
  #4 (permalink)  
Old 06-24-2007, 07:50 PM
incrediblehelp's Avatar
WebProWorld 1,000+ Club
WebProWorld MVP
 
Join Date: Jan 2004
Location: Live in Cincy Now
Posts: 7,573
incrediblehelp RepRank 4incrediblehelp RepRank 4incrediblehelp RepRank 4incrediblehelp RepRank 4incrediblehelp RepRank 4
Default Re: Sitemap's URL structure

I guess I am confused. What does your sitemap have to do with your description? Google is going to choose to list a description from various sources:

DMOZ description or content from your page or from the tag itself. Seeing Google use the description from your description tag is rare to say the least. So if your expecting Google use that you would be wrong.
Reply With Quote
  #5 (permalink)  
Old 06-24-2007, 11:59 PM
WebProWorld Member
 
Join Date: Aug 2003
Location: Ohio, USA
Posts: 46
Ricardo Zea RepRank 0
Default Re: Sitemap's URL structure

ukresident and incrediblehelp, thanks for your answers.

Now to clarify.

"how do you know this?"

I know this because if you do a site: search in Google, the description shows the content of a folder without an index.* file: "Index of /debt-settlement/HI-Hawaii/Lihue. Name Last modified Size Description. [DIR] Parent Directory 11-May-2007 12:51 - [ ] debt-settlement-Lihu. ..."

Just go to Google and type: site:www.aidmydebt.com, you'll see the second result is my example here (in fact, ALL the results, except the first one, looks like my example above corresponding to each folder).

"What does your sitemap have to do with your description?"
Well, that's exactly what I'm trying to find out, if maybe Google is using my sitemap to look for a specific file ending in .php and for some unknown reason it's not seeing .php file and it only goes as far a the /.

My great concern here is: How is it that Google is indexing this URL: www.aidmydebt.com/debt-settlement/HI-Hawaii/Lihue/ if there's nothing on my website poiting to that folder?

The complete route using that path would be: www.aidmydebt.com/debt-settlement/HI-Hawaii/Lihue/debt-settlement-Lihue.php

So that's why I'm speculating that Google is using my sitemap to find that incomplete URL (AND the rest of them, which there are a lot).

--

As rare as it is, Google IS describing my website www.aidmydebt.com from my Description tag.

Of course, the only description that Google is going to put on the 'incomplete' URLs (www.aidmydebt.com/debt-settlement/HI-Hawaii/Lihue/) is the only text that comes up wich is the little text that's there when you visit that incomplete URL.

--

Taking into account what incrediblehelp said, let me rephrase the question then:

How is it that Google is indexing those 'incomplete' URLs? ie: www.aidmydebt.com/debt-settlement/HI-Hawaii/Lihue/
• I have my sitemaps well typed and structured.
• All my links in my website are correct.
• Nothing on my website points to an incomplete URL like the example above.

Thanks for your help.
__________________
Ricardo Zea
Web Designer (X)HTML + CSS Certified by W3Schools
Reply With Quote
  #6 (permalink)  
Old 06-25-2007, 09:10 AM
WebProWorld New Member
 
Join Date: Jun 2007
Posts: 10
sam-adam RepRank 0
Default Re: Sitemap's URL structure

I tryed a sitemap structure for some of my sites i coudn't see any differecnce in indexing the pages or receiving more hits.

So i'm not convince about usefullnes of that
Reply With Quote
  #7 (permalink)  
Old 06-25-2007, 12:34 PM
WebProWorld Member
 
Join Date: Aug 2003
Location: Ohio, USA
Posts: 46
Ricardo Zea RepRank 0
Default Re: Sitemap's URL structure

I'm just speculating that my issue is sitemap related, not that I know this for sure.

Again, Does anyone have any idea why Google is indexing incomplete URLs on my website? ie: Index of /debt-settlement/HI-Hawaii/Lihue

Thank you.

sam-adam, thanks for your answer too.
__________________
Ricardo Zea
Web Designer (X)HTML + CSS Certified by W3Schools
Reply With Quote
  #8 (permalink)  
Old 06-25-2007, 01:00 PM
incrediblehelp's Avatar
WebProWorld 1,000+ Club
WebProWorld MVP
 
Join Date: Jan 2004
Location: Live in Cincy Now
Posts: 7,573
incrediblehelp RepRank 4incrediblehelp RepRank 4incrediblehelp RepRank 4incrediblehelp RepRank 4incrediblehelp RepRank 4
Default Re: Sitemap's URL structure

Quote:
Originally Posted by Ricardo Zea View Post
I'm just speculating that my issue is sitemap related, not that I know this for sure.

Again, Does anyone have any idea why Google is indexing incomplete URLs on my website? ie: Index of /debt-settlement/HI-Hawaii/Lihue

Thank you.

sam-adam, thanks for your answer too.
If these URLs are not on the sitemap then they are not getting them from there.

Could these URLs be available on your website somewhere? They could get them from there

Could these be available on other websites? Scrapper/spammers? Of course it could. I see many broken or unnecessary URLs index in Google and have no idea where they came when cheking index URLs for client websites. When this happens it is of no concern to me. Most of the time it is from scrapper websites.
Reply With Quote
  #9 (permalink)  
Old 06-25-2007, 02:11 PM
WebProWorld Member
 
Join Date: Aug 2003
Location: Ohio, USA
Posts: 46
Ricardo Zea RepRank 0
Default Re: Sitemap's URL structure

Good point incrediblehelp, scrapper websites (new term for me too )

Now, I've been talking to a friend of mine, seems like the problem is the webserver itself and not my sitemaps or anything from my end, yet we are not 100% sure. We posted a ticket with our hosting company, I will let you know as soon as I know something.

Thanks again guys.
__________________
Ricardo Zea
Web Designer (X)HTML + CSS Certified by W3Schools
Reply With Quote
  #10 (permalink)  
Old 06-25-2007, 07:31 PM
WebProWorld Member
 
Join Date: Aug 2003
Location: Ohio, USA
Posts: 46
Ricardo Zea RepRank 0
Default Re: Sitemap's URL structure

Ok, problem resolved, now it's time to wait for Google to eliminate those URLs from its index.

The problem was the server as I mentioned, the server was handing all the directories to the bots that didn't have an index.* file in them, so we disabled the directory indexes, this means that now if a folder does not contain an index.* file it can not be 'read' by anybody and this includes search engine's bots.

Now if anyone tries to visit one of the broken or 'incomplete' URLs they will get an ERROR 403, which means FORBIDDEN ACCESS.

I also edited my .htaccess file and added this line: ErrorDocument 403 http://www.aidmydebt.com/debt-settlement-quote.php. This line redirects all the visitors, humans and bots, to my form (you can choose where to redirect).

I will have to wait a few weeks to see all those incomplete URLs gone from Google's index.

Now everybody is happy

Thanks to all of you for your help.
__________________
Ricardo Zea
Web Designer (X)HTML + CSS Certified by W3Schools

Last edited by Ricardo Zea; 06-25-2007 at 07:33 PM.
Reply With Quote
  #11 (permalink)  
Old 06-25-2007, 08:23 PM
incrediblehelp's Avatar
WebProWorld 1,000+ Club
WebProWorld MVP
 
Join Date: Jan 2004
Location: Live in Cincy Now
Posts: 7,573
incrediblehelp RepRank 4incrediblehelp RepRank 4incrediblehelp RepRank 4incrediblehelp RepRank 4incrediblehelp RepRank 4
Default Re: Sitemap's URL structure

Just a reminder Ricardo. Google has a URL removal tool in their Google Webmaster Tool Set.
Reply With Quote
  #12 (permalink)  
Old 06-26-2007, 01:36 PM
WebProWorld Member
 
Join Date: Aug 2003
Location: Ohio, USA
Posts: 46
Ricardo Zea RepRank 0
Default Re: Sitemap's URL structure

Yes! You are right incredihelp... my issue is that I'd have to type in more than 50 URLs, maybe I'll do that anyway, thanks a lot for your help
__________________
Ricardo Zea
Web Designer (X)HTML + CSS Certified by W3Schools
Reply With Quote
Reply

  WebProWorld > Search Engines > Google Discussion Forum

Thread Tools
Display Modes

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off
Trackbacks are On
Pingbacks are On
Refbacks are On


Similar Threads
Thread Thread Starter Forum Replies Last Post
Best page layout structure rjjj111 Graphics & Design Discussion Forum 6 05-19-2007 04:08 PM
how about a menu structure here? tom_moonen Submit Your Site For Review 4 05-14-2006 02:51 PM
Odd business structure emjaneundy The Castle Breakroom (General: Any Topic) 2 01-05-2006 01:14 PM
Site Map Structure LLFitness_Derek Google Discussion Forum 21 04-13-2005 07:54 PM
Designing the structure of the website godzilla Graphics & Design Discussion Forum 3 03-08-2005 08:21 PM


All times are GMT -4. The time now is 03:46 AM.



Search Engine Optimization by vBSEO 3.3.0