log src/scanner.cpp @ 33:ce229348cdbe stable-2-6

age author description
Sat, 29 May 2004 15:36:14 -0700 carl failed to lowercase the first char of html tags stable-2-6
Sat, 29 May 2004 08:34:58 -0700 carl consider <!tag> to be bad html tag
Thu, 27 May 2004 21:30:26 -0700 carl ignore text including bad html tags inside comments stable-2-5
Thu, 27 May 2004 16:17:48 -0700 carl add syslog msg freeing memory, use bare tld names without leading period stable-2-4
Thu, 27 May 2004 10:08:51 -0700 carl add configurable list of tlds
Sat, 22 May 2004 22:30:45 -0700 carl add configurable host name limit and bad html tag limits. stable-2-3
Fri, 21 May 2004 21:45:36 -0700 carl better scanning for bad html tags, allow binary zip and gz files with random char sequences
Wed, 19 May 2004 21:40:50 -0700 carl start coding for bad html tag detection
Wed, 12 May 2004 13:23:22 -0700 carl add url decoding stage, allow http:/ single / in yahoo redirector, allow ip address hostnames
Fri, 30 Apr 2004 22:44:56 -0700 carl fix problems in the state transitions causing impossible states
Fri, 30 Apr 2004 00:32:44 -0700 carl add scanning for bare hostnames
Thu, 29 Apr 2004 22:05:03 -0700 carl add scanning for bare hostnames
Thu, 29 Apr 2004 21:56:22 -0700 carl add scanning for bare hostnames
Fri, 23 Apr 2004 22:45:10 -0700 carl fix memory leak with duplicate url host names, stable-2-0
Thu, 22 Apr 2004 20:19:01 -0700 carl integration work on url scanner
Thu, 22 Apr 2004 14:00:02 -0700 carl integration work on url scanner
Thu, 22 Apr 2004 11:25:45 -0700 carl integration work on url scanner
Thu, 22 Apr 2004 08:38:07 -0700 carl stable standalone scanner module
Wed, 21 Apr 2004 22:39:46 -0700 carl start work on content url scanner