Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Dilbert, Grimmy, Kevin and Kell not downloading #109

Open
eikelan opened this issue Jul 26, 2015 · 1 comment
Open

Dilbert, Grimmy, Kevin and Kell not downloading #109

eikelan opened this issue Jul 26, 2015 · 1 comment
Labels

Comments

@eikelan
Copy link

eikelan commented Jul 26, 2015

1st of all: Thank you for dosage!
I'm using dosage-2.15-1 from Debian testing (Stretch)
Most of the comics I'm interested in are downloading fine
with 3 exceptions:
Dilbert, Mother Goose and Grimm, Kevin and Kell
I'm adding the error messages below.
No, it is not a matter of life and death but would be nice having ;-)

Thank you for your time.
Eike, Paraguay

Dilbert
where I used to look:
http://dilbert.com/strip/2015-07-26

where dosage looks:
http://www.arcamax.com/thefunnies/dilbert/s-1693329
the cartoon is there alright

~$ dosage Dilbert:2015-07-21
Dilbert> ERROR: Patterns ['<\s_[aA]\s+(?:[^>]\s+)?[hH][rR][eE][fF]\s=\s_"(/\d+-\d+-\d+/)"[^>]STR_Prev[^>]>'] not found at URL http://dilbert.com/.


Mother Goose and Grimm

where I used to look:
http://www.grimmy.com/images/MGG_Archive/MGG_2015/MGG-2015-07-26.gif
where dosage looks
http://www.arcamax.com/thefunnies/mothergooseandgrimm/s-1693318
the cartoon is there alright

~$ dosage MotherGooseAndGrimm:2015-07-21
Arcamax/MotherGooseAndGrimm> Retrieving 1 strip for index 2015-07-21
Arcamax/MotherGooseAndGrimm> ERROR: Patterns ['<\s_[aA]\s+(?:[^>]\s+)?[hH][rR][eE][fF]\s=\s_"(/newspics/[^"]+)"[^>]zoom[^>]>'] not found at URL http://www.arcamax.com/thefunnies/mothergooseandgrimm/2015-07-21.


KevinAndKell

//http://www.kevinandkell.com/2015/kk0725.html
is the comic url, but
dosage is looking for:
//http://www.kevinandkell.com/2015/kk725.html and fails

I don't know how to get the leading 0 if the month is 1..9
I tried the following to test, and it works but will stop working 1st of october obviously
I'm not proficient enough to program in Python2.7

class KevinAndKell(_BasicScraper):
description = u'Kevin & Kell by Bill Holbrook'
url = 'http://www.kevinandkell.com/'
stripUrl = url + '%s/kk0%s%s.html'
firstStripUrl = stripUrl % ('1995', '09', '03')
imageSearch = compile(r'<img.+?src="(/?(\d+/)?strips/kk\d+.(gif|jpg))"', IGNORECASE)
prevSearch = compile(r'<a.+?href="(/?(../)?\d+/kk\d+.html)"[^>]*>Previous Strip', IGNORECASE)
help = 'Index format: yyyy-mm-dd'

def getIndexStripUrl(self, index):
    return self.stripUrl % tuple(map(int, index.split('-')))

stripUrl = url + '%s/kk0%s%s.html'
does not work for month 10..12

@Manabi
Copy link
Contributor

Manabi commented Jul 27, 2015

Dilbert & KevinAndKell have both been fixed in the Dosage fork. MotherGooseAndGrimm appears to only use Arcamax, not sure how long ago it was using the grimmy.com website, or if it ever did.

This project's maintainer has vanished, necessitating the fork. The fork's already pretty far ahead.

@TobiX TobiX added the fixed label May 21, 2016
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

3 participants