|
Is this still valid? It's been some time since I did any web design, and I'm not sure the user agent string is still the same (24th April 04).
Does Google still have the string 'Googlebot' in its UA? If so, this should work to strip session IDs off forum URLs when it comes visiting. It's been known to work to good effect in the past. Can anyone confirm it's still good?
[code:1:5456f7d95b]
##############################################################
## MOD Title: enhance-google-indexing
## MOD Author: Showscout & R. U. Serious
## MOD Description: If the User_agent includes the string 'Googlebot', then no session_ids are appended to links, which will (hopefully) allow google to index more than just your index-site.
## MOD Version: 0.9.1
##
## Installation Level: easy
## Installation Time: 2 Minutes
## Files To Edit: includes/sessions.php
## Included Files: n/a
##############################################################
## For Security Purposes, Please Check: http://www.phpbb.com/mods/downloads/ for the
## latest version of this MOD. Downloading this MOD from other sites could cause malicious code
## to enter into your phpBB Forum. As such, phpBB will not offer support for MOD's not offered
## in our MOD-Database, located at: http://www.phpbb.com/mods/downloads/
##############################################################
## Author Notes: There may be issues with register globals on newer
## PHP versions. If you know for sure and also how to fix it, post in
## this thread: http://www.phpbb.com/phpBB/viewtopic.php?t=32328
##
## Obviously, if someone thinks it's funny to surf around with a
## user_agent containing Googlebot and at the same time does not
## allow cookies, he will lose his session/login on every pageview.
## Should he complain to you, tell him to eat your shorts.
##
## If you want to add further crawlers, look at the appropriate line and
## feel free to add part of the user_agent, which should be _unique_
## to that crawler, so a user is never confused with a bot.
##
##############################################################
## Version History: 0.9.0 initial release, only googlebot
## 0.9.1 added inktomi (MSN-search/crawler-bot)
##############################################################
## Before Adding This MOD To Your Forum, You Should Back Up All Files Related To This MOD
##############################################################
#-----[ OPEN ]------------------------------------------
includes/sessions.php
#-----[ FIND ]------------------------------------------
global $SID;
if ( !empty($SID) && !eregi('sid=', $url) )
#-----[ REPLACE WITH ]------------------------------------------
global $SID, $HTTP_SERVER_VARS;
if ( !empty($SID) && !eregi('sid=', $url) && !strstr($HTTP_SERVER_VARS['HTTP_USER_AGENT'], 'Googlebot') && !strstr($HTTP_SERVER_VARS['HTTP_USER_AGENT'], 'slurp@inktomi.com;') )
#
#-----[ SAVE/CLOSE ALL FILES ]------------------------------------------
#
# EoM
[/code:1:5456f7d95b]
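For anyone wanting to add more crawlers as the author notes suggest, here is a minimal sketch of the same check pulled out into a helper. The function name `is_known_crawler` and the `$crawler_markers` parameter are my own invention (not part of phpBB or the MOD), and the user agent strings in the demo lines are just sample values. It uses the same case-sensitive `strstr` test as the MOD.

```php
<?php
// Hypothetical helper for illustration; not part of phpBB or the original MOD.
// Returns true when the user agent contains any of the marker substrings.
function is_known_crawler($user_agent, $crawler_markers = array('Googlebot', 'slurp@inktomi.com;'))
{
    foreach ($crawler_markers as $marker) {
        // strstr() returns false when the marker is absent (case-sensitive match).
        if (strstr($user_agent, $marker) !== false) {
            return true;
        }
    }
    return false;
}

// Sample user agent strings (assumed values for the demo).
$bot_ua  = 'Googlebot/2.1 (+http://www.googlebot.com/bot.html)';
$user_ua = 'Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1)';

$append_sid_for_bot  = !is_known_crawler($bot_ua);   // false: no session ID for the crawler
$append_sid_for_user = !is_known_crawler($user_ua);  // true: normal visitors keep their sid
```

With something like this in place, the two `strstr` calls in the condition could probably collapse into a single `!is_known_crawler($HTTP_SERVER_VARS['HTTP_USER_AGENT'])`, assuming the helper is defined before that line runs.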
Just noticed the code in my forum (2.0.6) is a bit different: it uses some animal called preg_match rather than eregi (what the finer points of each are, I don't know).
If you have preg_match instead of eregi, try:
[code:1:5456f7d95b]
if ( !empty($SID) && !preg_match('#sid=#', $url) && !strstr($HTTP_SERVER_VARS['HTTP_USER_AGENT'], 'Googlebot') && !strstr($HTTP_SERVER_VARS['HTTP_USER_AGENT'], 'slurp@inktomi.com;') )
[/code:1:5456f7d95b]
Check the parentheses. I'm in a rush and might well have cocked up ;-)
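On the eregi vs preg_match question, one difference I'm sure of: eregi was case-insensitive, while preg_match is case-sensitive unless you add the `i` modifier. A small sketch below, with a made-up `$url` value. PHP 7.0 and later no longer ship eregi at all, so only the preg_match form runs there.

```php
<?php
// Made-up URL for the demo; note the upper-case "SID=".
$url = 'viewtopic.php?t=32328&SID=abc123';

// Plain pattern: case-sensitive, so "SID=" is not matched.
$case_sensitive = preg_match('#sid=#', $url);

// With the "i" modifier the pattern ignores case, like eregi('sid=', $url) did.
$case_insensitive = preg_match('#sid=#i', $url);
```

If the forum only ever writes a lowercase `sid=` into its links (an assumption on my part), either pattern should behave the same for this MOD.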
|