wwwstat is a simple program written in C which provides an analysis of the popularity of your web pages.
wwwstat parses access_log files which list the time, date, and domain of each instance a web document is hit.
To use wwwstat, you do not need a web counter, or perl, or java, or cgi-bin access, or anything else. You only need to compile the C program and run it on an access_log file. Access log files are usually world-readable files located in a directory like /www/httpd/logs/ so anyone can get them.
Typical access_log files are very large because they contain a complete history of all accesses to a all web pages on a server.
If you are only interested in generating statistics for a particular web page, you should filter out access_log entries which you are interested in, to a separate file, then run www_stat on that file.
If you have never seen an access_log file, it is just a big list of web page hit data listed like this:
155.250.128.1 - - [06/Nov/1997:04:02:14 -0500] "GET /%7Esvei/tifaq/small/fountain.html HTTP/1.0" 200 42740 155.250.128.1 - - [06/Nov/1997:04:02:15 -0500] "GET /%7Esvei/tifaq/small/cirque.gif HTTP/1.0" 200 17736 155.250.128.1 - - [06/Nov/1997:04:02:15 -0500] "GET /%7Esvei/tifaq/small/marquard.html HTTP/1.0" 200 25422
Example: If you only care about hits to "mypage.html", you should filter out lines from your access_log into a file called mypage_log. In UNIX, you can do something like this:
cat access_log | grep mypage.html > mypage_log
wwwstat can then process this mini-access_log file, and dump the results to a report file:
wwwstat < mypage_log > mypage_report
If you are unable to filter out an interesting subset of web pages from the access_log file, then wwwstat will simply generate a report for all documents on the entire web server
wwwstat outputs 3 kinds of reports from the log file, and dumps them to stdout:
Disclaimer: This program is offered "as-is". All binaries and src files are freeware, and be re-used for any purpose. Binaries may be posted here in the future.
Warning: