HaHa! <-(PeeWee Herman laugh)
I've been attempting to shove my log data into Postgres and am
coming to a sobering realization. It has taken 9 hours to process
15,000 requests. I am in the process of discovering that much
of that time is spent doing gethostbyaddr() for each entry.
A subsequent reload without doing lookups is on track to be done
in 2 hours. This rate would obviously create a serious backlog
on some sites if the server were directly connected to the database.
I suppose that my gethostbyaddr() results are not being cached
by the local name service. One way to improve this may be to create
that cache in my Perl program. Any other ideas on how to improve
this? I am beginning to question the value of this data....
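
For what it's worth, the in-program cache idea might look roughly like
the sketch below. It is written in Python purely for illustration (the
same shape should carry over to a Perl hash keyed by address), and the
names make_cached_lookup and lookup are made up for this example. It
assumes that remembering failed lookups is acceptable, so an address
with no reverse entry is only tried once and is stored as the raw IP.

```python
import socket

def make_cached_lookup(resolver=socket.gethostbyaddr):
    """Return a lookup function that resolves each distinct IP at most once."""
    cache = {}  # ip string -> hostname, or the ip itself if the lookup failed

    def lookup(ip):
        if ip not in cache:
            try:
                # gethostbyaddr returns (hostname, aliaslist, ipaddrlist)
                cache[ip] = resolver(ip)[0]
            except OSError:
                # Assumption: cache failures too, so slow or missing
                # reverse entries are not retried for every log line.
                cache[ip] = ip
        return cache[ip]

    return lookup
```

Calling lookup = make_cached_lookup() once before the load loop and then
lookup(ip) per log entry would mean repeat visitors cost one dictionary
hit instead of a DNS round trip. How much this helps depends on how many
distinct addresses the log actually contains.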
-Randy