Webmaster Forum


Go Back   Webmaster Forum > The Webmaster Forums > Tech Talk

Tech Talk Discuss computer issues, tech gadgets and hardware, operating systems, browsers, broadband and wireless, virus, trojan, and spyware help.


Reply
 
LinkBack Thread Tools Display Modes
Share |
  #1 (permalink)  
Old 06-10-2012, 09:13 PM
Contributing Member
Latest Blog:
None

 
Join Date: 01-06-12
Location: Jackson Mo
Posts: 244
iTrader: 0 / 0%
indexing non-public webpages

I get a report sent to me daily that has about 50-70 javascript links to real estate information. This is not information that is publicly available, so it is not indexed by anything like Google. I need to be able to index and search the linked pages so that I can find specific information quickly. I have tried saving the web pages (Didn't work), PDF files (logistically not feasible), using chrome history search (only partially successful).

Does any one know of a program that will index webpages as viewed??
__________________
HouseViewOnline: Real estate for Cape Girardeau, Jackson, Festus and St. Louis.
 
Reply With Quote
  #2 (permalink)  
Old 06-12-2012, 08:19 AM
carwell's Avatar
Contributing Member
 
Join Date: 02-15-10
Posts: 188
iTrader: 0 / 0%
Use firefox history box?
 
Reply With Quote
  #3 (permalink)  
Old 06-16-2012, 07:14 AM
Contributing Member
 
Join Date: 09-13-11
Posts: 132
iTrader: 0 / 0%
I think if it is not indexed on Google, it means the website owners might have set limitations on crawling their pages....Any respectable search engine would respect the website owner instructions and wont crawl... may be you need to go beyond...
 
Reply With Quote
  #4 (permalink)  
Old 06-16-2012, 07:25 AM
snakeair's Avatar
Super Moderator
 
Join Date: 12-31-07
Location: Medford, NJ
Posts: 42,162
iTrader: 3 / 100%
Quote:
Originally Posted by techielog View Post
I think if it is not indexed on Google, it means the website owners might have set limitations on crawling their pages....Any respectable search engine would respect the website owner instructions and wont crawl... may be you need to go beyond...
This is a report that is sent to the thread creator so obviously it's not indexed in the search engines.

Do you know of any program in which the thread creator could put the report on so he can search through it easily instead of scrolling through every single page and every single line?
__________________
Newbiz Advertising - A resourceful blog

Like us on Facebook: facebook.com/Newbizshop

Premium WordPress Themes - A list of themes.
 
Reply With Quote
  #5 (permalink)  
Old 07-23-2012, 03:48 AM
Member
 
Join Date: 07-23-12
Location: London
Posts: 36
iTrader: 0 / 0%
I don;t think there's an off the shelf solution.

But with a little skill (and maybe Linux) I'd suggest whip up (or get someone else to write) a script that takes the URL's and runs them through curl (or some other command line web downloader like wget) that can save the HTML to a folder that you can then search using Google Desktop.

You could semi automate this, and if you've got control over your email setup, could pipe the output of the email directly into the script for full automation (so long as it's a unique email address).
__________________
Load test your website in the cloud with Loadzen, providing affordable, easy to use load testing services for web masters, developers and business owners.
 
Reply With Quote
Go Back   Webmaster Forum > The Webmaster Forums > Tech Talk

Reply


Currently Active Users Viewing This Thread: 1 (0 members and 1 guests)
 
Thread Tools
Display Modes

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off
Trackbacks are Off
Pingbacks are Off
Refbacks are Off


Similar Threads
Thread Thread Starter Forum Replies Last Post
I need to edit some webpages Kelley Eidem Buy Web Services 5 11-08-2011 11:46 AM
How to use keywords on your webpages potchie SEO Forum 11 07-10-2009 01:46 AM
software used in your webpages mjpeddi1254 Web Design Lobby 6 02-15-2009 09:51 AM
How does PR for webpages work? posylane SEO Forum 8 08-07-2008 06:22 AM


V7N Network
Get exposure! V7N I Love Photography V7N SEO Blog V7N Directory


All times are GMT -7. The time now is 03:31 PM.
Powered by vBulletin
Copyright © 2000-2013 Jelsoft Enterprises Limited.
Copyright © 2003 - 2013 Escalate Media LP




Search Engine Optimization by vBSEO 3.6.0 RC 2 ©2011, Crawlability, Inc.