Contact Us Forum Rules Search Archive
WebProWorld Part of WebProNews.com
Page One Link To Us Edit Profile Private Messages Archives FAQ RSS Feeds  
 

Go Back   WebProWorld > Site Design > Syndication and Social Media Discussion Forum
Subscribe to the Newsletter FREE!


Register FAQ Members List Calendar Arcade Chatbox Mark Forums Read

Syndication and Social Media Discussion Forum Got a favorite blog, podcast or otherwise syndicated site? Let eveybody else in on it. Have some questions, comments, ideas or concerns about how to more effectively use blogs, syndication and social media for your business/pleasure? Let's chat about those too.

Reply
 
LinkBack Thread Tools Display Modes
  #1 (permalink)  
Old 02-19-2008, 12:03 PM
Gert Leroy's Avatar
WebProWorld Pro
 

Join Date: Oct 2005
Location: Natal - Brasil
Posts: 212
Gert Leroy RepRank 0
Default wordpress archive pages, robots.txt and Adsense (all together)

hi,

as suggested by a lot of wordpress users, it's a good thing to disallow the archive pages in the robots.txt file, to avoid duplicate content.

I did this as well on my blog, but now an annoying thing happens: i have an Adsense wide skycraper in my sidebar, on every page of the blog, also on the archive-pages (you can have a look at e.g. Lifestyle | Emigrant.be). Problem with these archive pages is that they show irrelevant Adsense ads, just because the pages are not indexed by Google..
(all the other pages that are indexed show relevant ads)

My question:
can i remove to archive pages from my robots.txt file so they get indexed as well? is this really gonna give me a penalty in Google? as i'm using excerpts in the archives personaly i think it can't be a problem..

what do you think you wordpress seniors? hehe..
Reply With Quote
  #2 (permalink)  
Old 02-20-2008, 07:21 PM
incrediblehelp's Avatar
Moderator
WebProWorld Moderator
 

Join Date: Jan 2004
Location: Live in Cincy Now
Posts: 7,697
incrediblehelp RepRank 4incrediblehelp RepRank 4incrediblehelp RepRank 4incrediblehelp RepRank 4
Default Re: wordpress archive pages, robots.txt and Adsense (all together)

Can you list a link to an archive page so we can see? Really hard for me to tell if it is irrelevant or not since I dont speak the language. not sure why the ads would all of sudden become irrelevant by disallowing the spider access.
Reply With Quote
  #3 (permalink)  
Old 02-21-2008, 09:55 AM
Gert Leroy's Avatar
WebProWorld Pro
 

Join Date: Oct 2005
Location: Natal - Brasil
Posts: 212
Gert Leroy RepRank 0
Default Re: wordpress archive pages, robots.txt and Adsense (all together)

archive pages:

Categories:
Bedenkingen & Emoties | Emigrant.be
Film | Emigrant.be
Immobiliën | Emigrant.be
Levensstandaard | Emigrant.be
Lifestyle | Emigrant.be
Salondans | Emigrant.be
Sfeerbeelden | Emigrant.be
Uitgaan | Emigrant.be
Windsurf | Emigrant.be
Zon, Zee & Strand | Emigrant.be

The category-pages show irrelevant Adsense Ads like:
Grant Farm, International Super FUnd, WIn a $10K scholarship, ..
Some of the ads however may appear relevant, because they pick up a word on the page and show ads about that word. That doesn't mean they are relevant. The ads on the individual (indexed) post-pages are much more relevant.

Same goes for:
Months:
2007 september | Emigrant.be
2007 oktober | Emigrant.be
2007 november | Emigrant.be
2007 december | Emigrant.be
2008 januari | Emigrant.be
2008 februari | Emigrant.be

Some posts (that allready got indexed):

Investeren in immobilien in Natal
This is a post about real estate in the city of Natal, Brazil.
Adsense ads: Brazil Travel, hotels Natal, real estate, .. => relevant ads

Auto-ongeval in Natal en verzekering
This is a post about a carcrash and car insurance.
Adsense ads: mainly about insurances => relevant ads

Interview met een Aalstenaar in Brazilie
This is a post about an interview with a Belgian in Brazil
Adsense Ads: living abroad, hotels Brazil, move to Brazil => relevant ads

...

They next question i could ask is:
let's say i have my archive-pages indexed.. should i provide them with relevant titles (and metatags) because actually there is no reason at all for them to show up in the search results..

Last edited by Gert Leroy : 02-21-2008 at 09:57 AM.
Reply With Quote
  #4 (permalink)  
Old 02-21-2008, 10:33 AM
WebProWorld Pro
 

Join Date: Dec 2007
Location: Brussels, Belgium
Posts: 123
Jean-Luc RepRank 1
Default Re: wordpress archive pages, robots.txt and Adsense (all together)

Dag Gert,

Your robots.txt file is not valid. First of all, you should remove all blank lines. Secondly, I would recommend you look at the robots.txt specification here.

Also note that Google uses several web robots: Googlebot is for the Google search engine and MediaPartners is for AdSense.

To disallow all bots but Mediapartners, use something like this:
Code:
User-agent: *
Disallow: /dir1/
Disallow: /dir2/
Disallow: /dir3/

User-agent: Mediapartners-Google
Disallow:
Groetjes,

Jean-Luc
__________________
Checking redirects is now as easy as 1 2 3, even if you are not a HTTP-header guru !
AWStats Support: installation assistance, add-on's, extra sections, dedicated forum,...
Reply With Quote
  #5 (permalink)  
Old 02-24-2008, 07:28 PM
Gert Leroy's Avatar
WebProWorld Pro
 

Join Date: Oct 2005
Location: Natal - Brasil
Posts: 212
Gert Leroy RepRank 0
Default Re: wordpress archive pages, robots.txt and Adsense (all together)

hallo Jean-Luc,

thanks for the tip about the mediapartners-bot, i didn't think about that, and it will (probably) resolve my problem.

about the robots.txt file not being valid, i'm not sure about that.. Google webmasters tools doesn't report any problem and the file does what it's supposed to do.

i haven't changed the robots.txt yet. i'll return in some days to report the changes in this thread.

thanks for the tip!

groeten,
Gert
Reply With Quote
  #6 (permalink)  
Old 02-25-2008, 05:05 PM
uberanimal's Avatar
WebProWorld Member
 

Join Date: Jan 2008
Posts: 57
uberanimal RepRank 0
Default Re: wordpress archive pages, robots.txt and Adsense (all together)

i find that it helped to leave the archives visable to the crawler. i would allow it. it does not really hurt and count as "duplicate content".
Reply With Quote
  #7 (permalink)  
Old 02-25-2008, 07:18 PM
incrediblehelp's Avatar
Moderator
WebProWorld Moderator
 

Join Date: Jan 2004
Location: Live in Cincy Now
Posts: 7,697
incrediblehelp RepRank 4incrediblehelp RepRank 4incrediblehelp RepRank 4incrediblehelp RepRank 4
Default Re: wordpress archive pages, robots.txt and Adsense (all together)

Quote:
Originally Posted by uberanimal View Post
i find that it helped to leave the archives visable to the crawler. i would allow it. it does not really hurt and count as "duplicate content".
For some people it does and for other it doesnt. Very finicky part of working on optimization of a website.
Reply With Quote
  #8 (permalink)  
Old 02-29-2008, 06:56 PM
Gert Leroy's Avatar
WebProWorld Pro
 

Join Date: Oct 2005
Location: Natal - Brasil
Posts: 212
Gert Leroy RepRank 0
Default Re: wordpress archive pages, robots.txt and Adsense (all together)

i have added the following lines the my robots.txt:

User-agent: Mediapartners-Google
Disallow:

The archive-pages remain disallowed for all bots but Mediapartners-Google.
Now let's wait and see what happens with the Adsense-ads..
Reply With Quote
  #9 (permalink)  
Old 03-04-2008, 04:37 PM
Gert Leroy's Avatar
WebProWorld Pro
 

Join Date: Oct 2005
Location: Natal - Brasil
Posts: 212
Gert Leroy RepRank 0
Default Re: wordpress archive pages, robots.txt and Adsense (all together)

ok problem solved !!

the archives remain disallowed for all buts but media-partners
they now show relevant ads !!

thanks Jean-Luc for the tip !
Reply With Quote
Reply

  WebProWorld > Site Design > Syndication and Social Media Discussion Forum
Tags: , , ,



Thread Tools
Display Modes

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

vB code is On
Smilies are On
[IMG] code is On
HTML code is Off
Trackbacks are On
Pingbacks are On
Refbacks are On

Similar Threads
Thread Thread Starter Forum Replies Last Post
Wordpress, Robots, and SEO J-Spider Search Engine Optimization Forum 13 07-09-2007 06:15 PM
Adsense on a Wordpress Blog? spongebob Search Engine Optimization Forum 0 09-12-2006 12:12 PM
Adsense and Wordpress Galore!! hotwired Other Engines/Directories 2 08-12-2006 07:36 AM
Internet Archive Sued Over Access To Pages WPW_Feedbot Search Engine Optimization Forum 0 07-13-2005 09:30 AM
AdSense and robots.txt angelpure Google AdWords/Google AdSense 6 03-01-2005 12:07 PM


Search Engine Optimization by vBSEO 3.2.0