Webmaster Forum


Go Back   Webmaster Forum > The Webmaster Forums > Forum Lobby > Computers & Internet
Register FAQ Members List Calendar Search Today's Posts Mark Forums Read

Computers & Internet For all those computer related discussions.

Ezilon Directory   ClickBooth Network   V7N Directory

Reply
 
LinkBack Thread Tools Display Modes
Old 04-09-2007, 11:27 AM   #1 (permalink)
Inactive
 
Join Date: 01-01-07
Posts: 15
iTrader: 0 / 0%
Latest Blog:
None

billybw is liked by many
Puzzled by Website's Visitor Behavior

My website stats includes a Latest Visitors log, and I keep seeing something on it that puzzles me. At least once or twice a day, a visitor will appear to move through all the pages on the site at a very rapid rate. By this I mean that there is only one or two seconds spent on each page, before going to the next one.

By doing some testing, I've determined that It is PHYSICALLY IMPOSSIBLE for anyone to click through the pages on the site that quickly.

Also, these are real visitors, not robots or crawlers. I know this because they come from Google and I can see the search terms that they used to find the site. Also, sometimes this type of visitor will spend a few minutes looking at two or three pages, before suddenly starting to rapidly go through the rest of the site as I described above.

Does anyone have an explanation for what I'm seeing?

Thank you
billybw is offline  
Add Post to del.icio.us
Reply With Quote
Sponsored Links
SEO Hosting by HostGator  Advertise Here  Buy Blog Links
Old 04-11-2007, 03:21 AM   #2 (permalink)
Contributing Member
 
Nicole's Avatar
 
Join Date: 02-23-07
Location: USA
Posts: 874
iTrader: 0 / 0%
Nicole is a glorious beacon of lightNicole is a glorious beacon of lightNicole is a glorious beacon of lightNicole is a glorious beacon of lightNicole is a glorious beacon of lightNicole is a glorious beacon of lightNicole is a glorious beacon of lightNicole is a glorious beacon of lightNicole is a glorious beacon of lightNicole is a glorious beacon of lightNicole is a glorious beacon of light
Maybe they are using one of those programs that download your site for viewing offline ?
__________________
~Nicole

V7N Contextual Links
Nicole is offline  
Add Post to del.icio.us
Reply With Quote
Old 04-11-2007, 07:26 AM   #3 (permalink)
Inactive
 
Join Date: 04-11-07
Posts: 10
iTrader: 0 / 0%
Latest Blog:
None

Weizheng is liked by many
Does your site rank high in search engine?

Could be one of those keyword analysis tools. When a keyword is entered, it does a search in Google then crawls the top N sites to look for keywords in titles and meta tags.
Weizheng is offline  
Add Post to del.icio.us
Reply With Quote
Old 04-11-2007, 10:10 AM   #4 (permalink)
Inactive
 
Join Date: 01-01-07
Posts: 15
iTrader: 0 / 0%
Latest Blog:
None

billybw is liked by many
Thanks for the replies.

Weizheng-- This is a non-commercial site with no ads and nothing to sell, so I doubt that anyone would be analyzing the type of keywords that send visitors to the site. Also, sometimes a visitor will spend a few minutes looking at two or three pages before the rapid movement through the rest of the site begins.

Nicole -- You're idea that people may be using a program to download the entire site makes sense. Does anyone know how common it is for people to do this on the sites they visit? Also, what is this kind of program called?

Thank you
billybw is offline  
Add Post to del.icio.us
Reply With Quote
Old 04-11-2007, 11:09 AM   #5 (permalink)
v7n Mental
 
waffles's Avatar
 
Join Date: 06-30-06
Location: Not where I want to be.
Posts: 1,485
iTrader: 1 / 100%
waffles is a name known to allwaffles is a name known to allwaffles is a name known to allwaffles is a name known to allwaffles is a name known to allwaffles is a name known to allwaffles is a name known to allwaffles is a name known to allwaffles is a name known to allwaffles is a name known to allwaffles is a name known to all
Send a message via AIM to waffles
Fasterfox is an extension that can do it if they're using Firefox.
__________________
waffles Radio - Back on the air. Listen Mondays from 4-6 or catch the bootleg whenever

Juggle Chainsaws What happens when a college kid gets the bright idea to write
waffles is offline  
Add Post to del.icio.us
Reply With Quote
Old 04-11-2007, 01:53 PM   #6 (permalink)
Inactive
 
StupidScript's Avatar
 
Join Date: 09-22-06
Location: Los Angeles
Posts: 678
iTrader: 0 / 0%
Latest Blog:
None

StupidScript is just really niceStupidScript is just really niceStupidScript is just really niceStupidScript is just really niceStupidScript is just really niceStupidScript is just really niceStupidScript is just really niceStupidScript is just really niceStupidScript is just really niceStupidScript is just really niceStupidScript is just really nice
You're definitely seeing bot/crawler activity. As you noted, pages requests are too rapid for it to be a human who would need to wait for a page to load before clicking on the next link.

With regards to the Google query string, it's not unusual for a bot to post a query through Google and then spider the results, including any attached codes. I've seen hundreds of visits from obvious bots that include identical Google query strings ... it's not too complicated to accomplish.

It's also not difficult to program a bot to mimic human behavior by slowly going through the first couple of pages and then speeding up the rest of the crawl.

There are any number of reasons why a rogue bot might be crawling your site, from email harvesting to anti-competitive behavior to searching for unsecure forms to exploit to gathering your pages in order to build a duplicate site that potentially holds malware to ... ? Maybe your site is just one of hundreds of thousands on a list being used by bots for who-knows-what purpose.

On the benign side, it may very well be users who liked what they saw on the first couple of pages and then use a "site-grabber" to download every page for offline viewing, as noted above.

If you are concerned about the behavior, you should check more deeply into those visits and see if there exists some unique identifier you could use to block them, like their IP address, although I'm pretty sure you'll find that the variety of IPs and User-agents will be extensive. Also, there are scripts/programs out there that can "throttle" request activity. Using the software to exclude valid bots, like Googlebot, you could restrict every visitor to, say 1 request per 10 seconds. Most rogue bot programs won't wait around, that long. They'll keep making their speedy requests, and if most of them are denied, due to the throttling, they'll finish their run and leave without all they came for.
StupidScript is offline  
Add Post to del.icio.us
Reply With Quote
Old 04-11-2007, 02:45 PM   #7 (permalink)
v7n Mentor
 
Taltos's Avatar
 
Join Date: 11-22-06
Location: Phoenix, AZ
Posts: 1,804
iTrader: 0 / 0%
Latest Blog:
None

Taltos is a highly respected web proTaltos is a highly respected web proTaltos is a highly respected web proTaltos is a highly respected web proTaltos is a highly respected web proTaltos is a highly respected web proTaltos is a highly respected web proTaltos is a highly respected web proTaltos is a highly respected web proTaltos is a highly respected web proTaltos is a highly respected web pro
Send a message via Yahoo to Taltos
Heck, IE7 and I think IE6, all you have to do is File >> Save As and it will save the whole site in a folder. It creates a .mht file, I think.
__________________
Experimenting
Taltos is offline  
Add Post to del.icio.us
Reply With Quote
Old 04-12-2007, 10:21 AM   #8 (permalink)
Inactive
 
Join Date: 01-01-07
Posts: 15
iTrader: 0 / 0%
Latest Blog:
None

billybw is liked by many
I have done some more checking, and am convinced that these are real visitors, for several reasons:

1. Not all of them are sent by Google. Some come from Yahoo, and some come from a link on another site.

2. The search terms vary, and the entry pages vary.

3. In one case I noticed that the visitor waited about six hours, probably doing something else, then came back, looked at two more pages, than began the rapid movement through the site.

4. This is an environmental site about rare tree species, so there is no reason for automated crawlers to be repeatedly collecting the same information over and over every day.
billybw is offline  
Add Post to del.icio.us
Reply With Quote
Old 04-12-2007, 12:39 PM   #9 (permalink)
v7n Mentor
 
Taltos's Avatar
 
Join Date: 11-22-06
Location: Phoenix, AZ
Posts: 1,804
iTrader: 0 / 0%
Latest Blog:
None

Taltos is a highly respected web proTaltos is a highly respected web proTaltos is a highly respected web proTaltos is a highly respected web proTaltos is a highly respected web proTaltos is a highly respected web proTaltos is a highly respected web proTaltos is a highly respected web proTaltos is a highly respected web proTaltos is a highly respected web proTaltos is a highly respected web pro
Send a message via Yahoo to Taltos
Sounds to me like somebody grabbing the whole site for school. Either a teacher to use it in class or a student for a paper. Just a guess but that my theory and I'm stickin' to it

Any chance you can tell the general map location by IP address? Are they from the same area? From the same school?
__________________
Experimenting
Taltos is offline  
Add Post to del.icio.us
Reply With Quote
Old 04-12-2007, 01:10 PM   #10 (permalink)
Inactive
 
Join Date: 01-01-07
Posts: 15
iTrader: 0 / 0%
Latest Blog:
None

billybw is liked by many
There is no way all of these visitors are coming from the same class or school. I first noticed this happening at least six months months ago, and it has happened hundreds of times since then. Also, as I said, these visitors find the site in different ways such as Google, Yahoo, an external link, etc. I just noticed today that one of them was sent by the Google Image search originally, looking for a photo of a particular tree blossom, then after about three minutes going rapidly through all the other pages as I've described earlier.

Also, in looking through some logs I've saved, a few of the visitors that do this were sent by google.co.uk and google.ca, so they must live in England and Canada.

But I am wondering why so many visitors to my site seem to be doing this. Does anyone know how common it is for people to save an entire site on their computer?

Thank you
billybw is offline  
Add Post to del.icio.us
Reply With Quote
Old 04-12-2007, 06:17 PM   #11 (permalink)
Contributing Member
 
Join Date: 12-07-05
Location: Smallville.ca
Posts: 248
iTrader: 0 / 0%
Latest Blog:
None

E.Scape is a glorious beacon of lightE.Scape is a glorious beacon of lightE.Scape is a glorious beacon of lightE.Scape is a glorious beacon of lightE.Scape is a glorious beacon of lightE.Scape is a glorious beacon of lightE.Scape is a glorious beacon of lightE.Scape is a glorious beacon of lightE.Scape is a glorious beacon of lightE.Scape is a glorious beacon of lightE.Scape is a glorious beacon of light
take a good sentence from different pages of your site and quote text them in search with google, see if your content is being copied.

let us know the results and if you find copying do a whois on the domain being used for your records.

This may not be the answer but might help eliminate the possiblities.

Good luck
E.Scape is offline  
Add Post to del.icio.us
Reply With Quote
Old 04-13-2007, 10:42 AM   #12 (permalink)
Inactive
 
Join Date: 01-01-07
Posts: 15
iTrader: 0 / 0%
Latest Blog:
None

billybw is liked by many
E.Scape -- I did some searches as you suggested and didn't find any other sites with any of the sentences I tested. It doesn't make sense anyway, that hundreds of other webmasters would be copying content from my site to their sites.

The only thing that seems to fit is Nicole's idea that some visitors are downloading the site for viewing off-line. After studying the data some more, I estimate that an average of about two visitors per day may be doing this, which is about 1.5% of all visitors to the site. Does anyone have any estimates for their sites that I can compare that with?

Thank you
billybw is offline  
Add Post to del.icio.us
Reply With Quote
Old 04-15-2007, 10:16 AM   #13 (permalink)
Contributing Member
 
Join Date: 12-07-05
Location: Smallville.ca
Posts: 248
iTrader: 0 / 0%
Latest Blog:
None

E.Scape is a glorious beacon of lightE.Scape is a glorious beacon of lightE.Scape is a glorious beacon of lightE.Scape is a glorious beacon of lightE.Scape is a glorious beacon of lightE.Scape is a glorious beacon of lightE.Scape is a glorious beacon of lightE.Scape is a glorious beacon of lightE.Scape is a glorious beacon of lightE.Scape is a glorious beacon of lightE.Scape is a glorious beacon of light
I suggested this because it does happen and with so many "harvesting" signs It's at least good to eliminate the obvious.

I have alot of custom articles written for me and I have to check regularily for people copying my content. And as much as it doesnt make sence that people would do this for the obvious lack of attention to their sites, this is what most MFA and PPC sites run off. Even duplicated content gets indexed and as a result brings SE traffic to an ad splash page.

Good luck and let us know if you find the problem, this can help others here.

Jack
E.Scape is offline  
Add Post to del.icio.us
Reply With Quote
Old 04-15-2007, 10:21 AM   #14 (permalink)
v7n Mentor
 
Taltos's Avatar
 
Join Date: 11-22-06
Location: Phoenix, AZ
Posts: 1,804
iTrader: 0 / 0%
Latest Blog:
None

Taltos is a highly respected web proTaltos is a highly respected web proTaltos is a highly respected web proTaltos is a highly respected web proTaltos is a highly respected web proTaltos is a highly respected web proTaltos is a highly respected web proTaltos is a highly respected web proTaltos is a highly respected web proTaltos is a highly respected web proTaltos is a highly respected web pro
Send a message via Yahoo to Taltos
Yes, I have had a couple sites that ranked very well in a particular industry that was still somewhat young on the internet. Those site got ripped off regularly and duplicated almost completely in a couple of cases. That made it pretty easy to find the stolen sites and send DMCA nasty-grams. With the exception of one Canadian site on a Canadian host that didn't both with DMCA, the rest quickly were taken down volentarily and they went on to do their own work. Which resulted in much worse SERPs than they had with my stolen site

But those were commercial sites. Yours is not so it is less likely to get copied like that. It definitely happens more than you would like to think.
__________________
Experimenting
Taltos is offline  
Add Post to del.icio.us
Reply With Quote
Old 04-15-2007, 10:35 AM   #15 (permalink)
Inactive
 
Join Date: 01-01-07
Posts: 15
iTrader: 0 / 0%
Latest Blog:
None

billybw is liked by many
Quote:
Good luck and let us know if you find the problem, this can help others here.

What problem are you talking about? I'm not aware of any problem. I alreadfy told you that no one is copying my site, so there's no problem there. So what are you talking about?
billybw is offline  
Add Post to del.icio.us
Reply With Quote
Old 04-15-2007, 10:40 AM   #16 (permalink)
v7n Mentor
 
Taltos's Avatar
 
Join Date: 11-22-06
Location: Phoenix, AZ
Posts: 1,804
iTrader: 0 / 0%
Latest Blog:
None

Taltos is a highly respected web proTaltos is a highly respected web proTaltos is a highly respected web proTaltos is a highly respected web proTaltos is a highly respected web proTaltos is a highly respected web proTaltos is a highly respected web proTaltos is a highly respected web proTaltos is a highly respected web proTaltos is a highly respected web proTaltos is a highly respected web pro
Send a message via Yahoo to Taltos
I think he's just asking that if you figure out this weird behavior, let us know what you discover. I know I am curious now. It is kind of strange but so far appears harmless. Good luck on the hunt
__________________
Experimenting
Taltos is offline  
Add Post to del.icio.us
Reply With Quote
Old 04-15-2007, 11:00 AM   #17 (permalink)
Inactive
 
Join Date: 01-01-07
Posts: 15
iTrader: 0 / 0%
Latest Blog:
None

billybw is liked by many
Taltos --Nicole already gave the explanation in the second post on this thread. I don't know why it's so hard for some people here to understand it.
billybw is offline  
Add Post to del.icio.us
Reply With Quote
Old 04-16-2007, 10:35 AM   #18 (permalink)
v7n Mentor
 
Taltos's Avatar
 
Join Date: 11-22-06
Location: Phoenix, AZ
Posts: 1,804
iTrader: 0 / 0%
Latest Blog:
None

Taltos is a highly respected web proTaltos is a highly respected web proTaltos is a highly respected web proTaltos is a highly respected web proTaltos is a highly respected web proTaltos is a highly respected web proTaltos is a highly respected web proTaltos is a highly respected web proTaltos is a highly respected web proTaltos is a highly respected web proTaltos is a highly respected web pro
Send a message via Yahoo to Taltos
Sorry, in later posts you didn't sound convinced that was the answer. Problems solved.
__________________
Experimenting
Taltos is offline  
Add Post to del.icio.us
Reply With Quote
Old 04-16-2007, 11:22 AM   #19 (permalink)
v7n Goddess
 
Tanya's Avatar
 
Join Date: 09-24-06
Location: Where I Belong
Posts: 3,331
iTrader: 3 / 100%
Tanya is a web professional of the highest orderTanya is a web professional of the highest orderTanya is a web professional of the highest orderTanya is a web professional of the highest orderTanya is a web professional of the highest orderTanya is a web professional of the highest orderTanya is a web professional of the highest orderTanya is a web professional of the highest orderTanya is a web professional of the highest orderTanya is a web professional of the highest orderTanya is a web professional of the highest order
OFF TOPIC


Pokes Taltos
__________________
Design Pride- A Breast Cancer Awareness Theme <-- Check it out!!
A guy, a girl, and a blog - My new fav blog to read, so funny in a nasty way!!!
Tanya is offline  
Add Post to del.icio.us
Reply With Quote
Old 04-16-2007, 11:29 AM   #20 (permalink)
v7n Mentor
 
Taltos's Avatar
 
Join Date: 11-22-06
Location: Phoenix, AZ
Posts: 1,804
iTrader: 0 / 0%
Latest Blog:
None

Taltos is a highly respected web proTaltos is a highly respected web proTaltos is a highly respected web proTaltos is a highly respected web proTaltos is a highly respected web proTaltos is a highly respected web proTaltos is a highly respected web proTaltos is a highly respected web proTaltos is a highly respected web proTaltos is a highly respected web proTaltos is a highly respected web pro
Send a message via Yahoo to Taltos
DOH! Hey, its been long enough that I don't have the chair and rope right here but their not far away! You come back here!
__________________
Experimenting
Taltos is offline  
Add Post to del.icio.us
Reply With Quote
Go Back   Webmaster Forum > The Webmaster Forums > Forum Lobby > Computers & Internet

Reply



Currently Active Users Viewing This Thread: 1 (0 members and 1 guests)
 
Thread Tools
Display Modes

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

vB code is On
Smilies are On
[IMG] code is On
HTML code is Off
Trackbacks are On
Pingbacks are On
Refbacks are On

Similar Threads
Thread Thread Starter Forum Replies Last Post
How's my website's mascot superboysahil Graphic Design Forum 16 07-08-2007 01:50 PM
Finding out a website's web host nategd Web Hosting Forum 11 08-08-2006 06:43 AM
Puzzled - Index page gone from Google (Others are listing us everywhere) Locus Google Forum