SimpleComicCrawler

A simple comic crawler that crawls comics from https://comicbus.com/.

Code Structure

To execute

  1. Create a request file that lists the comics and episodes to download (a helper sketch that writes both files follows this list), for example:
{
    "食戟之靈": {
        "01話":[],
        "02話":[],
        "03話":[],
        "04話":[]
    }
}
  2. Create a db file to store the comics' image URLs, for example:
{
}
  3. Execute the sample script
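
The following sketch writes both files in one go. It is only a convenience helper and is not part of this repository; the output paths request_file.json and db.json are placeholders for whatever paths you pass to the entry scripts, and the comic title and episode keys are copied from the example above.

#!/usr/bin/env python3
# make_request_files.py -- hypothetical helper, not part of this repository.
# Writes a request file matching the example above and an empty db file.
import json

request = {
    "食戟之靈": {
        "01話": [],
        "02話": [],
        "03話": [],
        "04話": [],
    }
}

with open("request_file.json", "w", encoding="utf-8") as f:
    json.dump(request, f, ensure_ascii=False, indent=4)

with open("db.json", "w", encoding="utf-8") as f:
    json.dump({}, f)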

Scrape the comic website directly

$ ./src/basic_main.py [request_file.json] scripts [db.json]

Scrape the comic website through a managed worker

a. Download schedular and follow the instructions in its README.md to start a worker service.

b. Run the worker script and point it at schedular:

$ ./src/worker_main.py [request_file.json] scripts [db.json] http://0.0.0.0:5000/execute
# Note: 'http://0.0.0.0:5000/execute' is the default URL that schedular listens on;
# adjust this argument to match your actual environment.
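
Before launching worker_main.py, it can help to confirm that the worker service is actually listening. The snippet below is only a reachability check under the assumption that schedular answers HTTP requests at the given URL; it does not submit a job, and the real request format is documented in the schedular README.

#!/usr/bin/env python3
# check_worker.py -- hypothetical helper, not part of this repository.
# Confirms the scheduler endpoint is reachable before starting a crawl.
import sys
import urllib.error
import urllib.request

url = sys.argv[1] if len(sys.argv) > 1 else "http://0.0.0.0:5000/execute"

try:
    urllib.request.urlopen(url, timeout=5)
    print(f"{url} is reachable")
except urllib.error.HTTPError as err:
    # Any HTTP response (even 404/405) means the service is up.
    print(f"{url} is reachable (service answered with HTTP {err.code})")
except (urllib.error.URLError, OSError) as err:
    print(f"{url} is not reachable: {err}")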
