Skip to content

A simple python webscrapper to download chapters from the japanese manga One Punch Man. Source is the official Tonarinoyj webseite.

Notifications You must be signed in to change notification settings

Ay-De/OPM_tonarinoyj_webscrapper

Repository files navigation

OPM_tonarinoyj_webscrapper

One Punch Man is a popular japanese manga written by the manga artist "ONE" and illustrated by Yusuke Murata. The manga gets new chapters bi-weekly and can be read for free on the official webseite Tonarinoyj.

The goal of this project is to create a webscrapper in python to download the japanese One Punch Man chapters from the official website. After some research it became evident, that the manga pages are cut into smaller patches. Additionally, these patches got a randomized order. The below image shows a subsection of manga page:

Sample: Subsection of a downloaded manga page

The easiest solution for this task would be to just take a screenshot of the displayed manga page. But the purpose of this project isn't to just download the manga chapters, but rather to get more practice in webscrapping and to explore how the images can be reconstructed from this randomized state.

Current progress

The webscrapping part is finished with the main.py script. Splitting the images into the smaller patches is currently done through the jupyter notebook. This step will be implemented into the main program at a later point, once a reliable reconstruction method has been established.

About

A simple python webscrapper to download chapters from the japanese manga One Punch Man. Source is the official Tonarinoyj webseite.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published