Author Topic: Avoid search engine listing...  (Read 4422 times)

0 Members and 1 Guest are viewing this topic.

Offline 4brain

  • Pre-Newbie
  • Posts: 1
    • View Profile
Avoid search engine listing...
« on: January 18, 2007, 01:31:06 PM »
Hi All!

i want a hint to avoid a seach engine listing of my photogallerie.
Has anyone a hint to do that?

I know there are principles but a principle is not an RFC.

For hints to secure my gallerie i would be very glad.

Thx

Markus

Offline mawenzi

  • 4images Moderator
  • 4images Guru
  • *****
  • Posts: 4.500
    • View Profile
Re: Avoid search engine listing...
« Reply #1 on: January 18, 2007, 01:51:25 PM »
... make a index-site only with a loginform and without any links and without any content ...
Your first three "must do" before you ask a question ! ( © by V@no )
- please read the Forum Rules ...
- please study the FAQ ...
- please try to Search for your answer ...

You are on search for top 4images MOD's ?
- then please search here ... Mawenzi's Top 100+ MOD List (unsorted sorted) ...

Offline CeJay

  • Sr. Member
  • ****
  • Posts: 425
    • View Profile
Re: Avoid search engine listing...
« Reply #2 on: January 18, 2007, 08:06:00 PM »
The robots.txt file is a good way to prevent a page or site from getting indexed. However, not every site can use it. The only robots.txt file that the spiders will read is the one at the top html directory of your server. This means you can only use it if you run your own domain. The spiders will look for the file in a location similar to this below:
http://www.mysite.com/robots.txt


 Create a file called robots.txt and place in there what areas you want to protect.

If you want to exclude all the search engine spiders from your entire domain, you would write just the following into the robots.txt file:

User-agent: *
Disallow: /


If you want to exclude all the spiders from a certain directory within your site, you would write the following:

User-agent: *
Disallow: /yourgallery/


If you want to do this for multiple directories, you add on more Disallow lines:

User-agent: *
Disallow: /yourgallery/


If you want to exclude certain files, then type in the rest of the path to the files you want to exclude:

User-agent: *
Disallow: /yourgallery/index.php
Disallow: /directory/example.htm


If you want to keep a specific search engine spider from indexing your site, do this:
(Note for Google make this Google & make a seperate one for Googlebot )

User-agent: Robot_Name
Disallow: /



This is what I use for certain sections and it actually works for me.
To see more on how to allow go to: http://www.searchtools.com/robots/robots-txt.html