Webmaster Forum



SEO Forum Search engine optimization discussions.

   

06-04-2005, 09:01 AM   #1
Thanol
Robots.txt question

It's been a while since I've had to write one, but if I put Disallow: ?o= it disallows everything that starts with ?o=, correct?
__________________
-Scott
Build a Website : Other site
06-04-2005, 03:49 PM   #2
WhatiFind
I don't know for sure, but I believe it goes something like this: Disallow: *?o=
Check this page for more info: http://www.robotstxt.org/wc/exclusion-admin.html
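
For anyone testing this kind of rule: the original exclusion standard treats a Disallow value as a plain path prefix beginning with "/", so a wildcard pattern such as /*?o= only helps with crawlers that support the * and $ extensions (Googlebot does). Below is a minimal Python sketch of that wildcard matching, assuming those extension semantics. rule_matches is a hypothetical helper written for illustration, not part of any library, and the example paths are made up.

```python
import re


def rule_matches(pattern: str, url_path: str) -> bool:
    """Return True if a wildcard-style robots.txt pattern matches url_path.

    Assumed semantics: the pattern is matched from the start of the path,
    '*' stands for any run of characters, and a trailing '$' anchors the
    pattern to the end of the path.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape each literal piece, then join the pieces with '.*' for every '*'.
    regex = ".*".join(re.escape(part) for part in pattern.split("*"))
    if anchored:
        regex += "$"
    # re.match only matches at the beginning of the string (prefix match).
    return re.match(regex, url_path) is not None


# A bare "?o=" is never a prefix of a path, so it matches nothing here.
print(rule_matches("?o=", "/page.php?o=1"))
# The wildcard form matches any path that contains "?o=".
print(rule_matches("/*?o=", "/page.php?o=1"))
```

The point of the sketch is only that the pattern is compared against the path starting at the leading slash, which is why the wildcard is needed to reach a query string.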
05-30-2006, 05:43 AM   #3
John Scott
I have never done a robots.txt for a subdomain. Am I correct in assuming that the subdomain gets its own robots.txt?

Say I wanted to exclude http://directory.v7n.com/cgi-bin/

How would I write that up?
05-30-2006, 06:03 AM   #4
Louis
What exactly is the purpose of a robots.txt? I have an idea of what it does, but I'm not grasping the bigger picture.

What could it bring me on my domain, for example?
__________________
SignéLouis.ca, Handturned Exotic Woods Pen and Keychains

LouisWorld.ca, my personal blog.
05-30-2006, 06:15 AM   #5
John Scott
The robots.txt file is the first file a spider requests, and in it you can tell the spider where not to go, etc.
05-30-2006, 06:20 AM   #6
Louis
Yes, but what can it bring me?
05-30-2006, 06:21 AM   #7
John Scott
Fame and riches?
05-30-2006, 06:25 AM   #8
Louis
Hmmm, is that how you did it, John?
05-30-2006, 06:28 AM   #9
John Scott
LOL.

I wish it were that easy.
05-30-2006, 06:34 AM   #10
Kalina (aka Colleen)
I think John just finds a rich girlfriend.
__________________
Ruby Jewelry Sales
05-30-2006, 07:06 AM   #11
Sabeur
Quote:
I think John just finds a rich girlfriend.
Where? / Who? / What? / When?
--
Is there a way to know if robots.txt works? I use one for most of my websites.
05-30-2006, 07:33 AM   #12
John Scott
Quote:
Originally Posted by Colleen
I think John just finds a rich girlfriend.

LOL. I wish. *Dreaming*
05-30-2006, 07:52 PM   #13
lordspace
Hi all,

John Scott, I think you need a robots.txt file in the document root of the virtual domain, in this case: directory.

e.g.:
in /home/v7n/www/directory/robots.txt

User-agent: *
Disallow: /cgi-bin/

Svet
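
If it helps, here is a rough Python sketch of the same idea, using the standard library's urllib.robotparser. It assumes the two rules above are served from the subdomain's own robots.txt; robots_url is a hypothetical helper, and search.cgi and index.html are made-up file names used only for the check.

```python
from urllib.parse import urlsplit
from urllib.robotparser import RobotFileParser


def robots_url(page_url: str) -> str:
    """Build the robots.txt URL for the host that serves page_url.

    Crawlers look for robots.txt at the root of each host, so a
    subdomain is checked against its own file, not the main domain's.
    """
    parts = urlsplit(page_url)
    return f"{parts.scheme}://{parts.netloc}/robots.txt"


# The two rules from the post, as they would appear in the subdomain's file.
RULES = [
    "User-agent: *",
    "Disallow: /cgi-bin/",
]

parser = RobotFileParser()
parser.parse(RULES)  # parsed in memory; no network request is made

# Hypothetical URLs on the subdomain, used only to exercise the rules.
cgi_ok = parser.can_fetch("*", "http://directory.v7n.com/cgi-bin/search.cgi")
home_ok = parser.can_fetch("*", "http://directory.v7n.com/index.html")

print(robots_url("http://directory.v7n.com/cgi-bin/search.cgi"))
print("cgi-bin allowed:", cgi_ok)
print("home page allowed:", home_ok)
```

This only shows how a compliant parser reads the rules; it says nothing about whether a particular spider actually honours them.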
05-30-2006, 08:01 PM   #14
Buskerdoo
I'd also like to block a search engine, and it's been a while since I've set one up.

How would I block google from a subdirectory?
__________________
Great CD/DVD Sleeves, Mailers, Inserts, Labels, and More - Buskerdoo. We also carry Shipping Labels.
05-30-2006, 09:25 PM   #15
COBOLdinosaur
Quote:
How would I block google from a subdirectory?
The Google spider uses the user agent ID Googlebot. It will comply with the instructions in the robots.txt if you follow the standards for robot exclusion:

User-agent: Googlebot
Disallow: /your-directory/

Detailed control instructions for Google spiders are here:

http://www.google.ca/support/webmast...y?answer=35303

The Google spiders are well behaved if the site does not have a lot of crap and script-generated links. AFAIK Googlebot is the only one that will obey rel="nofollow" on a link, and that gives you very detailed control of what they index.

Cd&
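
A quick way to sanity-check a per-spider block like this offline is Python's urllib.robotparser. The sketch below is only an illustration under assumptions: /private/ is a placeholder directory, www.example.com is a placeholder host, SomeOtherBot is a made-up agent name, and can_crawl is a hypothetical helper. The standard-library parser may also differ in details from how Googlebot itself interprets a file.

```python
from urllib.robotparser import RobotFileParser

# A Googlebot-only block; no other user agent is mentioned.
RULES = [
    "User-agent: Googlebot",
    "Disallow: /private/",
]


def can_crawl(agent: str, path: str) -> bool:
    """Return True if `agent` may fetch `path` under RULES."""
    parser = RobotFileParser()
    parser.parse(RULES)  # parsed in memory; no network request is made
    # The host is a placeholder; only the path is compared with the rules.
    return parser.can_fetch(agent, "http://www.example.com" + path)


print("Googlebot, /private/:", can_crawl("Googlebot", "/private/page.html"))
print("Googlebot, /public/:", can_crawl("Googlebot", "/public/page.html"))
print("SomeOtherBot, /private/:", can_crawl("SomeOtherBot", "/private/page.html"))
```

Because the block names only Googlebot, agents that do not match it fall through with no restriction, which is worth remembering if the goal is to keep every spider out of the directory.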
05-31-2006, 12:37 AM   #16
John Scott
Quote:
Originally Posted by lordspace
Hi all,

John Scott, I think you need a robots.txt file in the document root of the virtual domain, in this case: directory.

e.g.:
in /home/v7n/www/directory/robots.txt

User-agent: *
Disallow: /cgi-bin/

Svet
Thanks Svet
05-31-2006, 12:51 AM   #17
WagerX
Even though Googlebot behaves most of the time, I've had a page listed even though I excluded it in the robots.txt file (the exclusion has been there since the website's inception). Here's the case:
Exclusion: http://www.paidx.com/robots.txt
Webpage listed: http://www.paidx.com/affiliate/index.asp

It even has a PR2 ??

I have only 1 inbound link to that page from a PR3 page.

So does that mean that inbound links force robots to ignore exclusions in robots.txt??
05-31-2006, 07:15 AM   #18
lordspace
Hi WagerX,

Quote:
So does that mean that inbound links force robots to ignore exclusions in robots.txt??
It seems it is possible for the page to have PR and still not be shown in the results.

If you search for your domain in Google, only the main page is shown, so r