|
|
||||||
|
||||||
| Index Link To US Private Messages Archive FAQ RSS | ||||||
| SEO 101 Welcome to the SEO 101 forum on WebProWorld - This SEO Podcast is geared towards Newbie's in order to teach and bridge the gap between website owners and the elusive SEO practices. So sit back, relax, enjoy, learn, and prosper from the SEO 101 Podcast. |
Share Thread: & Tags
|
||||
|
![]() |
|
|
LinkBack | Thread Tools | Display Modes |
|
|||
![]() Every time I run the Google webmaster tools and I check for page errors with the Web Crawl, Google repeatedly lists the same seven pages and subfolders that haven't existed on our website for at least two years. These are the 404 error pages it lists (removed 'http://www.' to prevent links):
Any ideas/links I can try to use to correct this? It's getting quite frustrating! |
|
|||
|
Thanks Wige.
Since my hosting provider kinda sucks, I've contacted them about either providing me with the ability to change the .htaccess file or I will give them the instructions that you gave me and provide them with alternative links to redirect those to other pages on the site. If you're right about them possibly being viable links, it would probably be better to redirect them rather than block them. Question: One of the "problem pages" is the index page - the old page was "index.html" and the new/current page is "index.htm". Is there some way to add a line or two of code in the .htaccess file so that whether the page is index.htm or .html, browsers will recognize them as one and the same? So if down the road it was "accidentally" changed to index.html, those with links to index.htm won't get a 404 error and vice-versus? Another question: Can I use Google tools or some other method to "backtrack" what sites have those links to those pages? I've never done it before and I don't know how or if it's possible. There's one of them that I don't even know what it originally was (therefore, I don't know where to redirect them to). Last edited by MarkGatESS; 08-07-2008 at 10:59 AM. |
|
||||
|
As far as index.html to index.htm, the best you would be able to do is redirect one to the other. You do not want the browser to be able to access the page at both versions, as this would cause search engines to see the pages as duplicated. Generally, I prefer to rewrite all possible index.ext files to the root of the folder (so folder/index.ext always converts to folder/). Your server should automatically serve index.html or index.htm or index.something depending in which extension you have. Assuming that is the case, you would use the following rule to force browsers to the root:
RedirectMatch 301 ^(.*)/index\.(.*)$ http://url.tld$1/ This will match any folder structure, and remove index.ext from the end, redirecting the user and spider to the directory root. For best results, make sure your internal links point to the directory root. Also, please note that if you will be passing query strings (I am assuming this is not the case since the pages are .html) this method will not work.
__________________
The best way to learn anything, is to question everything. |
|
|||
|
Okay, they got me FTP access. In "my" directory, I see a BAK folder, data folder, and a www folder. The only other things there are two text files for our SSL certs and a zipped file of our shopping cart software. The BAK folder has a copy of all our site images, the data folder holds our database files, and the www folder is the same list of files/folders that I have access through FrontPage - basically, our website files (the www is our "root").
Nowhere did I see an .htaccess file. Is it possible to run a website without the file, or is it that I don't have access to where the file is? Also, can a web server have a single .htaccess file that runs multiple domains/websites or do I have a specific .htaccess file for our website on that server? Last edited by MarkGatESS; 08-12-2008 at 04:53 PM. |
|
|||
|
Take this advice very suspiciously, htaccess is not something to be trifled with and I am not a master of its full potential.
However, usually I create or see them created in the /www/ path. If you have root access, unlikely if on a shared versus dedicated server, you can have one at root and if you like, one in each (IP) domain also - they cascade down the path iirc, so only do as little as you need to, especially in areas such as 301 redirects. Wige is a whiz on this kind of thing so I will shaddup now.
__________________
Denver SEO, Blog, and Web Design Consultants |
|
||||
|
Quote:
It's more likely that your FTP program is filtering the files you see. In your setup (with the FTP program) you should be able to experiment with filters that will allow you see the .htaccess. In cuteFTP I have to choose to "enable filters" and then tell cuteFTP to let the server filters do the filtering, not cuteFTP. That's usually enough.
__________________
Jade Burnside, Ahead of the Web What good is your web site if no one can find it? SEO & Optimized Web Site Design |
![]() |
|
| Thread Tools | |
| Display Modes | |
|
|
Similar Threads
|
||||
| Thread | Thread Starter | Forum | Replies | Last Post |
| How long does it take Google to remove the old pages info? | megame | Google Discussion Forum | 8 | 05-25-2008 01:14 AM |
| How to remove single pages from the Google Index | Dubbya | Google Discussion Forum | 6 | 03-14-2007 01:36 AM |
| Pagerank for Non Existent pages | Mong | Google Discussion Forum | 2 | 12-13-2006 10:28 PM |
| Is there a way to remove some pages? | A. Smith | Other Engines/Directories | 1 | 05-29-2006 12:00 AM |
| Old non-existent pages suddenly return !! | Spooky | Google Discussion Forum | 0 | 11-29-2004 09:09 AM |
|
WebProWorld |
Advertise |
Contact Us |
About |
Forum Rules |
MVP's |
Archive |
Newsletter Archive |
Top |
WebProNews
WebProWorld is an iEntry, Inc. ® site - © 2009 All Rights Reserved Privacy Policy and Legal iEntry, Inc. 2549 Richmond Rd. Lexington KY, 40509 |