v1.0.3 (2010-07-26)
Another minor release bringing 17 new and 11 updated crawlers.
Bugfixes
- Make crawlers handle httplib.BadStatusLine exception raised when HTTP responses are empty.
- Make crawlers convert lxml.etree._ElementUnicodeResult to unicode objects before saving to the database, to avoid "DatabaseError: can't adapt" errors.
- Handle MIME types like "image/jpeg, image/jpeg" returned by mimetools.Message.gettype.
- Use Crawler.headers for image requests, and not just page requests.
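
To illustrate the MIME type fix above, here is a minimal sketch of one way a repeated value such as "image/jpeg, image/jpeg" could be collapsed into a single type. The helper name normalize_mime_type is hypothetical and is not taken from the project's code; the actual fix may differ.

```python
def normalize_mime_type(raw_type):
    """Collapse a repeated, comma-separated MIME type into a single value.

    Hypothetical helper for illustration only; not the project's actual code.
    """
    # Split on commas, trim whitespace, lowercase, and drop empty pieces.
    candidates = [part.strip().lower() for part in raw_type.split(",")]
    candidates = [part for part in candidates if part]
    if not candidates:
        return None
    # Keep the first value, so "image/jpeg, image/jpeg" becomes "image/jpeg".
    return candidates[0]


print(normalize_mime_type("image/jpeg, image/jpeg"))
```

A value that is already a single type, such as "image/png", passes through unchanged apart from lowercasing and trimming.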
Crawlers
- New: apokalips
- New: axecop (fixes GH-8)
- New: boxerhockey
- New: bugcomic (fixes GH-11)
- New: carpediem
- New: crfh
- New: darylcagle
- New: havet (fixes GH-7)
- New: heltnils
- New: intelsinsides (fixes GH-9)
- New: misfile (fixes GH-3)
- New: notinventedhere (fixes GH-4)
- New: pondusno
- New: radiogaga
- New: scenesfromamultiverse (fixes GH-10)
- New: sheldon
- New: thegutters
- Update 8bittheater which is no longer published.
- Update brinkerhoff which is no longer published.
- Update ctrlaltdelete to work after source site changes.
- Update ctrlaltdeletesillies to work after source site changes.
- Update dieselsweetiesweb to work after source site changes.
- Update eon with new source site.
- Update lunch with new source site.
- Update sometingpositive to get all releases.
- Update supereffective to work after source site changes.
- Update vgcats to work after source site changes.
- Update yafgc to work after source site changes.