Syndication and Social Media Discussion Forum
Got a favorite blog, podcast or otherwise syndicated site? Let everybody else in on it. Have some questions, comments, ideas or concerns about how to use blogs, syndication and social media more effectively for business or pleasure? Let's chat about those too.
Hi,

As a lot of WordPress users suggest, it's a good idea to disallow the archive pages in the robots.txt file to avoid duplicate content. I did this on my blog as well, but now an annoying thing happens: I have a wide AdSense skyscraper in my sidebar on every page of the blog, including the archive pages (have a look at e.g. Lifestyle | Emigrant.be). The problem with these archive pages is that they show irrelevant AdSense ads, just because the pages are not indexed by Google. (All the other pages, which are indexed, show relevant ads.)

My question: can I remove the archive pages from my robots.txt file so they get indexed as well? Is this really going to earn me a penalty from Google? Since I'm using excerpts in the archives, personally I think it can't be a problem.

What do you think, WordPress seniors? Hehe..
Archive pages:

Categories:
Bedenkingen & Emoties | Emigrant.be
Film | Emigrant.be
Immobiliën | Emigrant.be
Levensstandaard | Emigrant.be
Lifestyle | Emigrant.be
Salondans | Emigrant.be
Sfeerbeelden | Emigrant.be
Uitgaan | Emigrant.be
Windsurf | Emigrant.be
Zon, Zee & Strand | Emigrant.be

The category pages show irrelevant AdSense ads such as: Grant Farm, International Super Fund, Win a $10K scholarship, ... Some of the ads may appear relevant because they pick up a word on the page and show ads about that word. That doesn't mean they are relevant. The ads on the individual (indexed) post pages are much more relevant.

The same goes for the monthly archives:
2007 september | Emigrant.be
2007 oktober | Emigrant.be
2007 november | Emigrant.be
2007 december | Emigrant.be
2008 januari | Emigrant.be
2008 februari | Emigrant.be

Some posts (that already got indexed):

Investeren in immobilien in Natal
This is a post about real estate in the city of Natal, Brazil.
AdSense ads: Brazil travel, hotels Natal, real estate, ... => relevant ads

Auto-ongeval in Natal en verzekering
This is a post about a car crash and car insurance.
AdSense ads: mainly about insurance => relevant ads

Interview met een Aalstenaar in Brazilie
This is a post about an interview with a Belgian in Brazil.
AdSense ads: living abroad, hotels Brazil, move to Brazil => relevant ads

...

The next question I could ask is: let's say I have my archive pages indexed. Should I give them relevant titles (and meta tags)? Because actually there is no reason at all for them to show up in the search results.

Last edited by Gert Leroy; 02-21-2008 at 09:57 AM.
Dag Gert,

Your robots.txt file is not valid. First of all, you should remove all blank lines. Secondly, I would recommend you look at the robots.txt specification here.

Also note that Google uses several web robots: Googlebot is for the Google search engine and Mediapartners-Google is for AdSense. To disallow all bots but Mediapartners-Google, use something like this:

Code:
User-agent: *
Disallow: /dir1/
Disallow: /dir2/
Disallow: /dir3/

User-agent: Mediapartners-Google
Disallow:

Jean-Luc
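A quick way to sanity-check a file like the one above before uploading it is Python's standard-library `urllib.robotparser`. This is only a sketch: the `example.com` URLs and the `can_fetch` helper are placeholders made up for illustration, and Python's parser is not Google's, so treat the result as a rough check rather than a guarantee of how Googlebot or Mediapartners-Google actually behave.

```python
from urllib.robotparser import RobotFileParser

# Same rules as in the post above; /dir1/ etc. are Jean-Luc's placeholders.
ROBOTS_TXT = """\
User-agent: *
Disallow: /dir1/
Disallow: /dir2/
Disallow: /dir3/

User-agent: Mediapartners-Google
Disallow:
"""


def can_fetch(agent: str, url: str, robots_txt: str = ROBOTS_TXT) -> bool:
    """Return True if `agent` may fetch `url` under the given robots.txt text."""
    parser = RobotFileParser()
    # parse() takes an iterable of lines, so no network access is needed.
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(agent, url)


if __name__ == "__main__":
    # example.com is a hypothetical host used only for this check.
    for agent in ("Googlebot", "Mediapartners-Google"):
        allowed = can_fetch(agent, "http://example.com/dir1/page.html")
        print(agent, "allowed" if allowed else "blocked")
```

An empty `Disallow:` line means "nothing is disallowed", which is why the AdSense crawler gets through while every other bot falls back to the `*` record.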
Hallo Jean-Luc,

Thanks for the tip about the Mediapartners bot. I didn't think about that, and it will (probably) resolve my problem.

About the robots.txt file not being valid, I'm not sure about that. Google Webmaster Tools doesn't report any problem and the file does what it's supposed to do.

I haven't changed the robots.txt yet. I'll come back in a few days to report the changes in this thread. Thanks for the tip!

Groeten,
Gert
I find that it helped to leave the archives visible to the crawler. I would allow it. It does not really hurt, and it does not count as "duplicate content".
I have added the following lines to my robots.txt:

User-agent: Mediapartners-Google
Disallow:

The archive pages remain disallowed for all bots but Mediapartners-Google. Now let's wait and see what happens with the AdSense ads..
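For anyone who wants to script the same change, here is a minimal sketch in Python. The helper name `add_adsense_record` and the `/category/` path in the usage example are made up for illustration; the actual robots.txt from this blog is not shown in the thread.

```python
# The record added in the post above: an empty Disallow allows everything.
ADSENSE_RECORD = "User-agent: Mediapartners-Google\nDisallow:\n"


def add_adsense_record(robots_txt: str) -> str:
    """Return robots_txt with the Mediapartners-Google record appended once."""
    if "user-agent: mediapartners-google" in robots_txt.lower():
        return robots_txt  # record already present, leave the text unchanged
    existing = robots_txt.rstrip()
    if not existing:
        return ADSENSE_RECORD  # empty file: the record is the whole file
    # Records are separated by a blank line, so add exactly one before it.
    return existing + "\n\n" + ADSENSE_RECORD


if __name__ == "__main__":
    # Hypothetical starting file; /category/ stands in for the archive paths.
    original = "User-agent: *\nDisallow: /category/\n"
    print(add_adsense_record(original))
```

Running the helper twice on the same text leaves it unchanged the second time, so it is safe to call from a deploy script.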
OK, problem solved!!

The archives remain disallowed for all bots but Mediapartners-Google, and they now show relevant ads!! Thanks, Jean-Luc, for the tip!
WebProWorld is an iEntry, Inc. ® site - © 2009 All Rights Reserved Privacy Policy and Legal iEntry, Inc. 2549 Richmond Rd. Lexington KY, 40509 |