# myrunningmancom-scraper
A metadata scraper for the https://myrunningman.com/ website.
## Usage
1. Run `./download-pages.sh` to download the HTML pages. This only needs to be done once.
2. Run `./download-thumbs.sh` to collect thumbnails (optional).
3. Run `./running-parser.php` to parse the downloaded HTML into the final `output.json` data file.
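
The three steps can also be chained in a small wrapper. The following is only a sketch: the function name `run_all` and the `SKIP_THUMBS` variable are hypothetical and not part of this repository, and it assumes the scripts are executable, are run from the repository root, and exit with a non-zero status on failure.

```shell
#!/bin/sh
# Hypothetical helper (not shipped with the repository): runs the usage
# steps in order and stops at the first step that fails.
run_all() {
    repo_dir="${1:-.}"   # directory containing the scripts; defaults to the current one
    (
        cd "$repo_dir" || exit 1

        # Step 1: download the HTML pages.
        ./download-pages.sh || exit 1

        # Step 2 (optional): collect thumbnails unless SKIP_THUMBS=1 is set.
        if [ "${SKIP_THUMBS:-0}" != "1" ]; then
            ./download-thumbs.sh || exit 1
        fi

        # Step 3: parse the HTML into output.json.
        ./running-parser.php || exit 1
    )
}
```

After sourcing this snippet, `run_all .` would run every step in the current checkout, while `SKIP_THUMBS=1 run_all .` would skip the thumbnail download. Because the first step only needs to be done once, later runs may prefer to call `./running-parser.php` directly.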
## Example output
```json
{
    "1": {
        "title": "Times Square",
        "broadcast_date": "2010-07-11",
        "filming_date": "2010-06-21",
        "location": "Times Square (Yeongdeungpo-gu, Seoul)",
        "description": "A never-before-seen action variety show with an amazing cast. To start off the first episode[...]
```