About the final project's data

Original poster: LCL2 (新年快热~)   2005-05-20 12:55:46
It seems there are some mismatches in the log file format.
For example:
103/06/8(Sun)20:01:27,http://www.csie.ntu.edu.tw/~cjlin/libsvmzip,,103/06/8(Sun)21:31:58,http://www.csie.ntu.edu.tw/~cjlin/libsvmzip,61-223-238-147.HINET-IP.hinet.net,61.223.238.147,Mozilla/4.0 (compatible; MSIE 5.0; Windows 98; DigExt),,
A record normally has 7 attributes, but the one above repeats the date and
the HTTP address, which results in 9 attributes. This is the only such case
I found in the whole log.
There are some more oddities:
SV1),,
^^^^^^^^ this is the only text on the line.
What should we do with such "weird" instances? Just eliminate them, or
try to repair them? In any case, if someone could give a brief explanation
of the log format, I think it would be very helpful. Thanks a lot!
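
In case it helps the discussion, here is a rough sketch of how such lines could be located before deciding whether to drop or repair them. It assumes that fields are separated by plain commas, that no field contains a comma itself, and that a normal line splits into 7 fields (the trailing ",," counts as two empty ones). The names EXPECTED_FIELDS, split_fields and partition_log are made up for illustration, and the "normal" sample line is only my guess at a well-formed record, built from the tail of the odd one above.

```python
EXPECTED_FIELDS = 7  # assumption: the usual attribute count mentioned in the post


def split_fields(line):
    """Split one log line on commas (assumes no field contains a comma)."""
    return line.rstrip("\n").split(",")


def partition_log(lines, expected=EXPECTED_FIELDS):
    """Return (well_formed, suspicious) lists of (line_number, fields)."""
    well_formed, suspicious = [], []
    for number, line in enumerate(lines, start=1):
        fields = split_fields(line)
        target = well_formed if len(fields) == expected else suspicious
        target.append((number, fields))
    return well_formed, suspicious


if __name__ == "__main__":
    odd = (
        "103/06/8(Sun)20:01:27,http://www.csie.ntu.edu.tw/~cjlin/libsvmzip,,"
        "103/06/8(Sun)21:31:58,http://www.csie.ntu.edu.tw/~cjlin/libsvmzip,"
        "61-223-238-147.HINET-IP.hinet.net,61.223.238.147,"
        "Mozilla/4.0 (compatible; MSIE 5.0; Windows 98; DigExt),,"
    )
    # Guess at a well-formed record: the odd line without its leading duplicate.
    normal = ",".join(odd.split(",")[3:])
    ok, bad = partition_log([normal, odd, "SV1),,"])
    print("well-formed lines:", [n for n, _ in ok])
    print("suspicious lines:", [n for n, _ in bad])
```

This only flags the lines; whether to eliminate or repair them is still the open question.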
