|
|
||||||
|
||||||
| Index Link To US Private Messages Archive FAQ RSS | ||||||
| Google Discussion Forum Google Discussion forum is for topics specifically related to Google. There is a subforum dedicated to AdSense/AdWords subjects. |
Share Thread: & Tags
|
||||
|
![]() |
|
|
LinkBack | Thread Tools | Display Modes |
|
|||
|
Hi Everybody!
I am using Google Webmaster Tools for quite a while now and am very happy with the information it provides for a site at one place. But a week back when I was checking one of my site in Google Webmaster Tools, it showed 258 URLs Not Found??? I was just amazed to see this! When I checked the URLs i couldn't understand what the URL is? Let me explain you clearly. I am sorry I will not post my website here. For example, let us say I have a website http://www.mysite.com . In GWT (Google Webmaster Tools) the URL shown is http://www.mysite.com/chinese.php?u=...www.mysite.com (this is for homepage) and in the same way for all inner pages like http://www.mysite.com/chinese.php?u=...e.com/abc.html My concern is, what is "chinese.php?u=" coming in these URLs. It also shows "spanish.php?u=" "french.php?u=" and "italian.php?u=" in URLs. I am just puzzled. I did not upload anything like this on my server or created any pages like mentioned above ever. Please help me in coming out of this maze. If anyone has any queries please ask. Thanks a lot in advance. |
|
|||
|
Sounds like you are using dynamic pages. Google has this to say about dynamic pages in its guidelines:
"If you decide to use dynamic pages (i.e., the URL contains a "?" character), be aware that not every search engine spider crawls dynamic pages as well as static pages. It helps to keep the parameters short and the number of them few." So right now, there are no guarantees for dynamic page indexing, although I have read in blogs that it is improving. Steve
__________________
Real Estate Web Site Marketing | |
|
||||
|
Don't be scared about using dynamic URLs. The SE's index them fine as long as you stay under 2-3 variables.
Now for the strange URLs it could be simply a scrapper website linking to a page on your website that doesn't exists. If I go to my blog and create a link: http://www.youtsite.com/iamacrazyperson.php. The spiders will crawl and it try to go to it. Your raw logs will register a 404 for this page. Now what is the scrapper is doing this a whole bunch of times (who knows why) then you will get a bunch of errors for pages that don't exist. Google Webmaster Console will also register this. |
|
|||
|
Quote:
Just a thought! |
|
|||
|
Quote:
They may have linked to the wrong domain. There is a domain that links to my site on every page of their site and I can tell its a mistake because its a domain I don't advertise, and has a 301 redirect on it. Google has me down for over 400 IBLs to my site from that site, crazy.
__________________
Computer problems solved. Guaranteed! Philadelphia & New Jersey I.T. Services Want more Web site traffic? Internet Marketing Services |
|
|||
|
Yes, incrediblehelp may have nailed this one.
One statement in the webmaster console is that: Quote:
Unfortunately the Google tool does NOT display where the link is coming from so you cannot trace it backwords and try to fix it yourself. |
|
|||
|
Quote:
I think Google has got a good handle on variables. Just my opinion!
__________________
Post as-it-happens crime stories of criminal behaviour at crimedigg.com |
|
|||
|
Google has given you the page it was looking for. Now look through your server logs and see what page was referring to that url. Then you can probably sort out who or what is generating the link that google might be following.
__________________
Inngenious B&B Website Design & Promotion |
|
|||||||
|
Hi,
Thanks a lot to everybody for replies. When I checked with Google Webmasters Tools today morning the list of not found URLs increased to 323 now with "portuguese.php?u=" added!!! Quote:
Quote:
Quote:
Quote:
Quote:
Quote:
Quote:
By the way, thanks a lot again and hoping for more replies and hope the problem will sort out this time. |
|
|||
|
Do you use www.xml-sitemaps.com to create your sitemaps?
Its just that I searched for "portuguese.php?u=" and noticed it appeared in a lot of sitemaps created by: xml-sitemaps.com - not sure if they offer a page translation feature as well? |
|
|||
|
Quote:
Waiting for reply!!! Thanks. |
|
|||
|
Maybe of use to some of you: a good stand-alone site crawler, and it is free too!
http://gsitecrawler.com/ You can generate Google and Yahoo sitemaps, test the "crawlability" of your site, use filters etc. etc. Good German quality :-) I am using it for about half a year now and I am very pleased with it! |
|
||||
|
Quote:
Just curious.
__________________
Free E-book & Instructions in Using EFT & NLP for Weight Loss OneMoreBite-Weightloss.com |
|
||||
|
"ashishdabas", this is common problem with G. The G forums are loaded with similar posts. No one knows from where the URL's are generated. In my case (and as with other people), it's showing URL's under the 404 or "not found" area that never existed, and URL's with strange characters in them like ?, =, % and the like. It may be entries like domain.com/valid-URL.html=blah% where it's adding garbage after the .html
What I do is have to create 301's from these bogus URL's to the real page because I don't want to give G any more "excuses" to screw things up for me.
__________________
God Bless, -Clint (Join Date: 2003) |
|
|||
|
Quote:
__________________
Inngenious B&B Website Design & Promotion |
|
||||
|
Please explain how this can create more problems. There are DOZENS of these, and I have heard that the Gbot returning dozens of 'not found' pages at your site is a bad thing.
Thanks.
__________________
God Bless, -Clint (Join Date: 2003) |
|
|||
|
Quote:
Quote:
a)waste of webmaster time (see above) who could be doing something more beneficial to the site like creating content and link building. b) By 301-ing them you generate no errors which means you have decreased your ability to track them back to their source and figure out where these links to phantom pages are coming from (to rule out that they aren't coming from your site). c) Redirecting dozens of pages that Google has no record of can leave a footprint that might give the impression you are up to something devious like trying to increase your page count or funnel PR from non-existent pages. (it looks just like a blackhat approach at getting multiple links from a site without them looking like sitewide links) c) MSN indexes 301 in a bad way so if there are actually links out there to your non-existent pages and you are 301-ing them. You are creating actual problems for other engines to eliminate something that was not a problem to begin with. d) Server load... too many 301s in htaccess. A safer way would be to block the phantom pages in robots.txt, but even that would be a waste of time.
__________________
Inngenious B&B Website Design & Promotion |
|
||||
|
Quote:
__________________
God Bless, -Clint (Join Date: 2003) |
|
||||
|
Ok I understand what you're saying.
Quote:
Quote:
Quote:
Quote:
What I think I may start doing is remove the older 301 redirects, to see if anything positive happens from it.
__________________
God Bless, -Clint (Join Date: 2003) |
|
|||||
|
Quote:
Quote:
Quote:
Quote:
Quote:
__________________
Inngenious B&B Website Design & Promotion |
|
|||
|
Quote:
|
|
|||
|
Quote:
K<o> |
|
|||
|
All of these applications that you've all mentioned are apps that are placed on the webserver itself to generate the sitemaps.
Are there any applications you can install on your local machine that will go over your internet connection to "crawl" and generate your maps, thus not having the need for you to install and run it on the web server? Something that will hit the server like a surfer using a browser? A "free" application would be nice, but not necessary ;-) BTW - my website has over 500 pages because it uses Early Impact's ProductCart. I suppose there's probably NO free generation application that will do that many pages. |
|
|||
|
Quote:
|
|
|||
|
Quote:
So, I can load it onto my Win XP Pro workstation and it'll work?? |
|
|||
|
Yep, it runs on your own computer and crawls the site from there!
http://gsitecrawler.com/ |
![]() |
|
| Thread Tools | |
| Display Modes | |
|
|
|
WebProWorld |
Advertise |
Contact Us |
About |
Forum Rules |
MVP's |
Archive |
Newsletter Archive |
Top |
WebProNews
WebProWorld is an iEntry, Inc. ® site - © 2009 All Rights Reserved Privacy Policy and Legal iEntry, Inc. 2549 Richmond Rd. Lexington KY, 40509 |