Author Topic: Prevent bots from indexing urls with sessionid. A possible solution!  (Read 3291 times)

0 Members and 1 Guest are viewing this topic.

Offline cpuswe

  • Newbie
  • *
  • Posts: 47
    • View Profile
I stubled upon an article about the robots.txt file accepting wildcards (or more exact the bots that read robots.txt) I haveŽnt seen any forumpost so...

To keep bots from indexing urls containing sessionind i have added the following to my robots.txt

User-Agent: *
Allow: /
Disallow: /*?sessionid=

I have tested this against the Google bot via Google sitemaps and it is OK by the tool used.

I am no expert on this and i have just implemented it so the impact on indexing is a bit unclear but for me worth a try...

To read more:

http://www.smart-it-consulting.com/article.htm?node=140&page=46
http://www.ysearchblog.com/archives/000372.html
http://www.webmasterstalks.com/seo-4-smf/robots-txt-t1040.0.html

If you dont know what robots.txt is, start here: http://www.robotstxt.org/

Offline AntiNSA2

  • Hero Member
  • *****
  • Posts: 774
  • As long as I can finish my site before I die.
    • View Profile
    • http://www.thelifephotography.com
CAn some experienced people talk about  the positive andf negative aspects of this issue?
As long as I can finish my site before I die.