Mailing List Archive

Need help getting SEARCHQUERY working
I'm trying to get the SEARCHQUERY and SEARCHWORD sections working on my
Analog 6.0 system. I have this section in my Analog config file:

SEARCHQUERY ON
SEARCHWORD ON
CONFIGFILE SearchEngines.txt

SearchEngines.txt was downloaded from Mike Shor's site at
http://www.owen.vanderbilt.edu/mike.shor/diversions/analog. It contains
this line:

SEARCHENGINE http://*.google.com/* q,as_q,as_oq,as_epq,query
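
To make clear how I read that line: the first argument is a wildcard pattern for the referrer URL, and the second is a comma-separated list of query-string fields that hold the search terms. Here is a minimal Python sketch of that reading. It is not Analog's code; the helper name search_terms is made up for illustration, and it assumes Python's shell-style fnmatch wildcards behave closely enough to Analog's pattern matching.

```python
from fnmatch import fnmatchcase
from urllib.parse import urlsplit, parse_qs

# Pattern and field names copied from the SEARCHENGINE line in SearchEngines.txt.
PATTERN = "http://*.google.com/*"
FIELDS = ["q", "as_q", "as_oq", "as_epq", "query"]


def search_terms(referrer, pattern=PATTERN, fields=FIELDS):
    """Return the decoded search terms in referrer, or [] if it does not match.

    Hypothetical helper: fnmatchcase stands in for Analog's own wildcard
    matching, which may differ in detail.
    """
    if not fnmatchcase(referrer, pattern):
        return []
    query = parse_qs(urlsplit(referrer).query)
    # Collect values in the order the fields are listed in the config line.
    return [value for name in fields for value in query.get(name, [])]


# Two referrers taken from the log sample in this message.
print(search_terms(
    "http://www.google.com/search?q=oral+contraceptive&hl=en&start=10&sa=N"))
print(search_terms(
    "http://www.google.com/search?hl=en&q=putting+quality+first&btnG=Search"))
```

Under this reading, both referrers match the pattern and yield a value for the q field, so I would expect them to be counted.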

When run like this:
../analog-6.0/analog +gweb1.analog.cfg +F071201 +T071231
+O/var/www/centernet/htdocs/webstats/web1/200712/index.html

my output contains only a Search Query report that lists 19 pages
requested, even though the web log for just the last day of my period
includes 128 requests referred by Google searches, as this sample shows:
kevinz@cn2:/opt/analog/conf.d$ fgrep http://www.google.com/search
../logdata/web1/access_log.20071231 |wc -l
128
kevinz@cn2:/opt/analog/conf.d$ fgrep http://www.google.com/search
../logdata/web1/access_log.20071231 |head
ABTS-NCR-Dynamic-013.35.163.122.airtelbroadband.in - -
[31/Dec/2007:00:55:13 -0500] "GET
/igwg/presentations/Monday/SubplenB/PromotionMale.pdf HTTP/1.1" 200
44424
"http://www.google.com/search?q=graduate+housewives+in+india&hl=en&rlz=1
T4GGLJ_en-GBIN214IN214&start=20&sa=N" "Mozilla/4.0 (compatible; MSIE
7.0; Windows NT 5.1)"
85.185.229.106 - - [31/Dec/2007:00:55:43 -0500] "GET /pubs/sp/20/20.pdf
HTTP/1.0" 200 466095
"http://www.google.com/search?hl=fa&q=AIDS%2BPDF&btnG=%D8%AC%D8%B3%D8%AA
%D8%AC%D9%88%D9%8A+Google&lr=" "Mozilla/4.0 (compatible; MSIE 6.0;
Windows NT 5.1; SV1)"
66.249.85.131 - - [31/Dec/2007:00:57:48 -0500] "GET
/asia/bangladesh/nsdp.shtml HTTP/1.1" 200 20061
"http://www.google.com/search?q=child+delivery+video&hl=en&start=70&sa=N
" "Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1; SV1; .NET CLR
1.1.4322)"
c-98-204-115-120.hsd1.dc.comcast.net - - [31/Dec/2007:00:58:02 -0500]
"GET /pubs/ HTTP/1.1" 200 30575
"http://www.google.com/search?q=jhccp&ie=utf-8&oe=utf-8&aq=t&rls=org.moz
illa:en-US:official&client=firefox-a" "Mozilla/5.0 (Macintosh; U; Intel
Mac OS X; en-US; rv:1.8.1.11) Gecko/20071127 Firefox/2.0.0.11"
pool-71-182-79-153.ptldor.fios.verizon.net - - [31/Dec/2007:01:23:25
-0500] "GET /quality/expo.shtml HTTP/1.1" 200 10440
"http://www.google.com/search?hl=en&q=putting+quality+first&btnG=Search"
"Mozilla/5.0 (Windows; U; Windows NT 5.1; en-US; rv:1.8.1.11)
Gecko/20071127 Firefox/2.0.0.11"
d47-69-11-157.nap.wideopenwest.com - - [31/Dec/2007:01:30:50 -0500] "GET
/pubs/fg/02/04-chapter2.pdf HTTP/1.1" 206 2810476
"http://www.google.com/search?sourceid=navclient&aq=t&ie=UTF-8&rls=GGIC,
GGIC:2007-39,GGIC:en&q=audience+segmentation" "Mozilla/4.0 (compatible;
MSIE 6.0; Windows NT 5.1; SV1; .NET CLR 1.1.4322)"
d47-69-11-157.nap.wideopenwest.com - - [31/Dec/2007:01:35:13 -0500] "GET
/pubs/fg/02/04-chapter2.pdf HTTP/1.1" 206 2568116
"http://www.google.com/search?sourceid=navclient&aq=t&ie=UTF-8&rls=GGIC,
GGIC:2007-39,GGIC:en&q=audience+segmentation" "Mozilla/4.0 (compatible;
MSIE 6.0; Windows NT 5.1; SV1; .NET CLR 1.1.4322)"
ABTS-NCR-Dynamic-011.123.163.122.airtelbroadband.in - -
[31/Dec/2007:01:52:42 -0500] "GET
/igwg/presentations/Monday/SubplenC/MenAgainst.pdf HTTP/1.1" 200 654982
"http://www.google.com/search?hl=en&rls=WZPA%2CWZPA%3A2007-25%2CWZPA%3Ae
n&q=women+population+of+india" "Mozilla/4.0 (compatible; MSIE 6.0;
Windows NT 5.1; SV1)"
CPE-144-137-203-75.sa.bigpond.net.au - - [31/Dec/2007:02:02:10 -0500]
"GET /pubs/ci/2/index.shtml HTTP/1.1" 200 19646
"http://www.google.com/search?hl=en&lr=&as_qdr=all&q=Elizabeth+Omolo&btn
G=Search" "Mozilla/4.0 (compatible; MSIE 7.0; Windows NT 5.1;
FunWebProducts; .NET CLR 2.0.50727; .NET CLR 1.1.4322)"
194.225.53.2 - - [31/Dec/2007:02:08:25 -0500] "GET /topics/oral.shtml
HTTP/1.0" 200 14876
"http://www.google.com/search?q=oral+contraceptive&hl=en&start=10&sa=N"
"Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1; SV1)"
kevinz@cn2:/opt/analog/conf.d$

None of the search terms in these referrers, such as
"graduate+housewives+in+india" or "AIDS%2BPDF" (the first two in the
fgrep output above), show up in the Search Query report. Everything else
in the Analog report leads me to believe that it ran normally. The output
when it ran was:
kevinz@cn2:/opt/analog/conf.d$ ../analog-6.0/analog +gweb1.analog.cfg
+F071201 +T071231
+O/var/www/centernet/htdocs/webstats/web1/200712/index.html
../analog-6.0/analog: analog version 6.0/Unix
../analog-6.0/analog: Warning R: In Directory Report, turning off pie
chart of
only one wedge
(For help on all errors and warnings, see docs/errors.html)
kevinz@cn2:/opt/analog/conf.d$

Also, the report contains over a million successful requests and almost
7,000 referrers from http://www.google.com/search, leading me to believe
that it's reading the correct web log files.
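
As a cross-check that does not depend on Analog, the search terms can be tallied straight from the log. The following is only a rough sketch: the function name tally_google_queries and the regular expression are my own, it assumes the combined log format shown above, it looks only at the q field of referrers from www.google.com/search, and the sample lines are rejoined from the wrapped fgrep output on the assumption that the wrapping added no characters.

```python
import re
from collections import Counter
from urllib.parse import urlsplit, parse_qs

# In the combined log format the referrer is the first quoted field after
# the status code and the byte count (which may be "-").
REFERRER_RE = re.compile(r'" \d{3} \S+ "([^"]*)"')


def tally_google_queries(lines, field="q"):
    """Count decoded search terms from www.google.com/search referrers."""
    counts = Counter()
    for line in lines:
        match = REFERRER_RE.search(line)
        if not match:
            continue  # not a combined-format line
        url = urlsplit(match.group(1))
        if url.netloc != "www.google.com" or url.path != "/search":
            continue  # some other referrer
        for term in parse_qs(url.query).get(field, []):
            counts[term] += 1
    return counts


# Log lines from the sample above, rejoined onto single lines.
SAMPLE_LINES = [
    '194.225.53.2 - - [31/Dec/2007:02:08:25 -0500] '
    '"GET /topics/oral.shtml HTTP/1.0" 200 14876 '
    '"http://www.google.com/search?q=oral+contraceptive&hl=en&start=10&sa=N" '
    '"Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1; SV1)"',
    '85.185.229.106 - - [31/Dec/2007:00:55:43 -0500] '
    '"GET /pubs/sp/20/20.pdf HTTP/1.0" 200 466095 '
    '"http://www.google.com/search?hl=fa&q=AIDS%2BPDF&btnG=%D8%AC%D8%B3%D8%AA'
    '%D8%AC%D9%88%D9%8A+Google&lr=" '
    '"Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1; SV1)"',
    'd47-69-11-157.nap.wideopenwest.com - - [31/Dec/2007:01:30:50 -0500] '
    '"GET /pubs/fg/02/04-chapter2.pdf HTTP/1.1" 206 2810476 '
    '"http://www.google.com/search?sourceid=navclient&aq=t&ie=UTF-8&rls=GGIC,'
    'GGIC:2007-39,GGIC:en&q=audience+segmentation" '
    '"Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1; SV1; .NET CLR 1.1.4322)"',
    'd47-69-11-157.nap.wideopenwest.com - - [31/Dec/2007:01:35:13 -0500] '
    '"GET /pubs/fg/02/04-chapter2.pdf HTTP/1.1" 206 2568116 '
    '"http://www.google.com/search?sourceid=navclient&aq=t&ie=UTF-8&rls=GGIC,'
    'GGIC:2007-39,GGIC:en&q=audience+segmentation" '
    '"Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1; SV1; .NET CLR 1.1.4322)"',
]

for term, count in tally_google_queries(SAMPLE_LINES).most_common():
    print(count, term)
```

To run it over a whole log, an open file object can be passed in place of SAMPLE_LINES, since the function only iterates over lines. The resulting tally could then be compared with what Analog lists in the Search Query report.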

Can anyone help me set up my Analog config files to correctly generate
the Search Query and Search Word sections?

Thanks so much for your advice and suggestions.

-Kevin

Kevin Zembower
Internet Services Group manager
Center for Communication Programs
Bloomberg School of Public Health
Johns Hopkins University
111 Market Place, Suite 310
Baltimore, Maryland 21202
410-659-6139
