 |
08-10-2006, 12:18 AM
|
#1 (permalink)
|
|
Inactive
Join Date: 05-31-06
Location: South Africa
Posts: 5
Latest Blog: None
|
Password Protection
I would like to know, if I have a section of my website that is private and I don't want it crawled by any search engine, if I password protect that folder would it still be able to be crawled?
|
|
|
08-10-2006, 11:06 AM
|
#2 (permalink)
|
|
Contributing Member
Join Date: 07-27-06
Location: Phil
Posts: 93
Latest Blog: None
|
Just put robot.txt to your pages so that the bots cant crawl that pages
|
|
|
08-10-2006, 11:14 AM
|
#3 (permalink)
|
|
v7n Mentor
Join Date: 10-13-03
Location: Central Ohio (Dublin)
Posts: 1,519
Latest Blog: None
|
Quote:
|
Originally Posted by Aisne
I would like to know, if I have a section of my website that is private and I don't want it crawled by any search engine, if I password protect that folder would it still be able to be crawled?
|
You can password protect it fairly easily with htaccess, if you're host is using cPanel.
|
|
|
08-10-2006, 12:47 PM
|
#4 (permalink)
|
|
v7n Mentor
Join Date: 01-03-06
Location: Illinois
Posts: 1,499
|
Just create robots.txt in your root and here is an example:
Quote:
User-agent: *
Disallow: /directory/
|
This will prevent all robots from accessing it.
|
|
|
08-10-2006, 12:53 PM
|
#5 (permalink)
|
|
Member
Join Date: 10-09-03
Posts: 9,594
Latest Blog: None
|
the question is
Quote:
|
if I password protect that folder would it still be able to be crawled?
|
not how do I or should I use a robot.txt page... 
|
|
|
08-10-2006, 12:57 PM
|
#6 (permalink)
|
|
v7n Mentor
Join Date: 01-03-06
Location: Illinois
Posts: 1,499
|
Yeah I think that if you do password protect directories using cPanel or .htaccess, it wouldn't be crawled by Google (if it did, it would have to be a blank page in the cache)
If they could actually access the content, it would be a major security problem!
|
|
|
08-10-2006, 04:33 PM
|
#7 (permalink)
|
|
Contributing Member
Join Date: 05-24-06
Location: Boulder, CO
Posts: 446
|
I think password protection will keep spiders out. Like sixty6 said it would be a real security issue if they could still get through.
As for robots.txt it doesn't prevent robots or keep them out. It's just a file telling robots what you don't want to have indexed. It's up to the specific robot in question whether or not to honor it. Supposedly the major engines will all honor it, but given that pages I have disallowed in my robots.txt did make it into the index I question this a little.
|
|
|
08-10-2006, 04:44 PM
|
#8 (permalink)
|
|
v7n Mentor
Join Date: 01-03-06
Location: Illinois
Posts: 1,499
|
Quote:
|
Originally Posted by vangogh
I think password protection will keep spiders out. Like sixty6 said it would be a real security issue if they could still get through.
As for robots.txt it doesn't prevent robots or keep them out. It's just a file telling robots what you don't want to have indexed. It's up to the specific robot in question whether or not to honor it. Supposedly the major engines will all honor it, but given that pages I have disallowed in my robots.txt did make it into the index I question this a little.
|
Yeah, it is up to them, but most robots do follow the rules!
|
|
|
08-11-2006, 04:03 PM
|
#9 (permalink)
|
|
Inactive
Join Date: 07-02-06
Posts: 9
Latest Blog: None
|
Definitely don't use robots.txt.
Search engines definitely won't be able to access the content if you password protect with .htaccess. They'll receive a 403(?) forbidden header and i would imagine no search engine would bother indexing such a page. If you protect with a dynamic form script then the page may be indexed but just with "Enter userid" text or similar depending on how you set up the password script.
|
|
|
08-11-2006, 05:09 PM
|
#10 (permalink)
|
|
v7n Mentor
Join Date: 01-03-06
Location: Illinois
Posts: 1,499
|
It depends on how serious it is, if you simply just don't want it indexed, use robots.txt
But if it's some big secret, I guess you would have to use cPanel...
|
|
|
08-11-2006, 05:23 PM
|
#11 (permalink)
|
|
Banned
Join Date: 02-16-06
Posts: 9,607
Latest Blog: None
|
my question is, if you want the content password protected, why would you want it spidered in the first place?
|
|
|
08-11-2006, 05:28 PM
|
#12 (permalink)
|
|
v7n Mentor
Join Date: 01-03-06
Location: Illinois
Posts: 1,499
|
The point is with .htaccess you wouldn't have to type a pass and username EVERY time you visit it or have a visitor visit it...
with robots.txt you can keep a good amount of robots away from it and save the hassle
If you're fine with typing in a pass and having your visitor do the same, go ahead!
|
|
|
|
Currently Active Users Viewing This Thread: 1 (0 members and 1 guests)
|
|
|
| Thread Tools |
|
|
| Display Modes |
Linear Mode
|
Posting Rules
|
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts
HTML code is Off
|
|
|
All times are GMT -7. The time now is 03:40 AM.
© Copyright 2008 V7 Inc
|
|