2015-06-18 06:46:29,890 DEBUG #25498 === Building rdf === 2015-06-18 06:46:29,890 INFO #25498 Making pg25498.rdf 2015-06-18 06:46:29,976 INFO #25498 Done pg25498.rdf 2015-06-25 06:23:45,359 DEBUG #25498 === Building rdf === 2015-06-25 06:23:45,360 INFO #25498 Making pg25498.rdf 2015-06-25 06:23:45,452 INFO #25498 Done pg25498.rdf 2015-07-02 06:39:48,830 DEBUG #25498 === Building rdf === 2015-07-02 06:39:48,830 INFO #25498 Making pg25498.rdf 2015-07-02 06:39:48,902 INFO #25498 Done pg25498.rdf 2015-07-09 06:56:03,049 DEBUG #25498 === Building rdf === 2015-07-09 06:56:03,050 INFO #25498 Making pg25498.rdf 2015-07-09 06:56:03,506 INFO #25498 Done pg25498.rdf 2015-07-13 04:47:09,051 DEBUG #25498 === Building epub.images === 2015-07-13 04:47:09,051 DEBUG #25498 Start of retrieval 2015-07-13 04:47:09,084 DEBUG #25498 ... got mediatype text/html from guess_type 2015-07-13 04:47:09,084 DEBUG #25498 ... creating new parser for file:///public/vhost/g/gutenberg/html/files/25498/25498-h/25498-h.htm 2015-07-13 04:47:09,085 DEBUG #25498 HTMLParser.pre_parse () ... 2015-07-13 04:47:09,085 DEBUG #25498 Fetching file:///public/vhost/g/gutenberg/html/files/25498/25498-h/25498-h.htm ... 2015-07-13 04:47:09,160 DEBUG #25498 Got charset big5 from html meta 2015-07-13 04:47:09,161 DEBUG #25498 Trying to decode document with charset big5hkscs ... 2015-07-13 04:47:09,515 ERROR #25498 Text not in charset big5hkscs ('big5hkscs' codec can't decode byte 0xe7 in position 1326: illegal multibyte sequence) 2015-07-13 04:47:12,134 DEBUG #25498 Got charset utf-8 from text sniffing 2015-07-13 04:47:12,135 DEBUG #25498 Trying to decode document with charset utf_8_sig ... 2015-07-13 04:47:12,140 INFO #25498 Running html thru tidy. 2015-07-13 04:47:12,226 WARNING #25498 tidy: line 11 column 9 -