Submit Your Article Forum Rules

Page 1 of 2 12 LastLast
Results 1 to 10 of 14

Thread: Best Robots.txt wordpress

  1. #1
    WebProWorld MVP
    Join Date
    Oct 2006
    Posts
    1,254

    Best Robots.txt wordpress

    Recently I've got a wordpress blog at www.domain.com/blog/
    With the recent changes in Google algorithm, I would like to take concrete steps in making it more SEO friendly by removing the low quality pages with tags.
    I mean to say I do want to remove the indexing of tag and other low quality ages that decrease the value of main blog posts.
    For this I need to write a good robots.txt
    Could anyone help me in getting the best robots.txt for my wordpress blog placed at www.domain.com/blog/

    Thanks in advance

  2. #2
    WebProWorld MVP mjtaylor's Avatar
    Join Date
    Dec 2003
    Posts
    6,237
    There is a basic instruction page and automated tool from Google here: http://www.google.com/support/webmas...n&answer=40360.
    SEO Friendly Premium Web Directory - Submit Now| Need to write a love letter to Google? I'm an SEO Copywriter who knows Search Smart DesignŽ. | Travel Gypsy in Key West.

  3. The following user agrees with mjtaylor:
  4. #3
    WebProWorld MVP
    Join Date
    Oct 2006
    Posts
    1,254
    Hey! Thanks for the reference. I've gone through it.

    After research I've found this robots.txt:

    ====================

    User-agent: Googlebot
    Disallow: /wp-content/
    Disallow: /trackback/
    Disallow: /wp-admin/
    Disallow: /feed/
    Disallow: /archives/
    Disallow: /sitemap.xml
    Disallow: /index.php
    Disallow: /*?
    Disallow: /*.php$
    Disallow: /*.js$
    Disallow: /*.inc$
    Disallow: /*.css$
    Disallow: */feed/
    Disallow: */trackback/
    Disallow: /page/
    Disallow: /tag/
    Disallow: /category/

    User-agent: Googlebot-Image
    Disallow: /wp-includes/

    User-agent: Mediapartners-Google*
    Disallow:

    User-agent: ia_archiver
    Disallow: /

    User-agent: duggmirror
    Disallow: /


    ========================

    As I said above, I'm installing the blog at www.domain.com/blog/ , so do I need to change the above robots.txt file?


    One more query! I want to block the robots from accessing a particular web page www.domain.com/XYXABC.html then how to include that in the txt file?
    Last edited by innominds; 07-18-2011 at 10:52 AM.

  5. #4
    Senior Member
    Join Date
    Apr 2011
    Location
    /public_html/
    Posts
    127
    LoL! Disallow: /sitemap.xml
    Why would you disallow your sitemap ? The sitemap.xml file purpose is to tell search engines about your pages and you block Google ?

  6. #5

  7. #6
    WebProWorld MVP
    Join Date
    Oct 2006
    Posts
    1,254
    I'm sorry! Is this OK now?

    User-agent: Googlebot
    Disallow: /wp-content/
    Disallow: /trackback/
    Disallow: /wp-admin/
    Disallow: /feed/
    Disallow: /archives/
    Disallow: /*?
    Disallow: /*.php$
    Disallow: /*.js$
    Disallow: /*.inc$
    Disallow: /*.css$
    Disallow: */feed/
    Disallow: */trackback/
    Disallow: /page/
    Disallow: /tag/
    Disallow: /category/

    User-agent: Googlebot-Image
    Disallow: /wp-includes/

    User-agent: Mediapartners-Google*
    Disallow:

    User-agent: ia_archiver
    Disallow: /

    User-agent: duggmirror
    Disallow: /

  8. #7
    WebProWorld MVP deepsand's Avatar
    Join Date
    May 2004
    Location
    State College, PA
    Posts
    16,481
    In the first instance, why block only Google?

    And, you want to block crawling of all pages with the PHP extension?

  9. The following user agrees with deepsand:
  10. #8
    Senior Member
    Join Date
    Apr 2011
    Location
    /public_html/
    Posts
    127
    I agree with deepsand that you must use the * for all search engine spiders.

    For the second, on a wordpress site all articles/pages doesn't have a php extension. So he would block the wordpress core files by blocking *.php .

    You don't need to disallow /archives/ , just set them to nofollow by using All in One Seo Pack.
    /category/ and /page/ is a bad idea to block them: I got Page Rank 2 on most of the pages and categories.
    Last edited by AboutWeb; 07-19-2011 at 01:39 AM.

  11. #9

  12. #10
    WebProWorld MVP
    Join Date
    Oct 2006
    Posts
    1,254
    Sorry for the mistakes!
    Actually I'm poor at coding (php)
    If you don't mind could you give me the best robots.txt for my wordpress blog at www.domain.com/blog/

Page 1 of 2 12 LastLast

Tags for this Thread

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts
  •