# myrunningmancom-scraper

A metadata scraper for the https://myrunningman.com/ website.

## Usage

1. Run `./download-pages.sh` to download the HTML files. This only needs to be done once.
2. Run `./download-thumbs.sh` to collect thumbnails (optional).
3. Run `./running-parser.php` to parse the HTML into the final `output.json` data file.
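
The steps above can also be chained from a small driver. The following is only a sketch: the three script names come from the usage list, while `build_steps`, `run_pipeline`, and their `download` / `with_thumbs` parameters are hypothetical helpers that are not part of this repository.

```python
from __future__ import annotations

import subprocess
from pathlib import Path

# Script names are taken from the usage steps; everything else is hypothetical.
DOWNLOAD_PAGES = "./download-pages.sh"
DOWNLOAD_THUMBS = "./download-thumbs.sh"
PARSE = "./running-parser.php"


def build_steps(download: bool = True, with_thumbs: bool = False) -> list[str]:
    """Return the scripts to run, in the order given in the usage list."""
    steps = []
    if download:
        # The HTML only needs to be fetched once, so later runs can skip this.
        steps.append(DOWNLOAD_PAGES)
    if with_thumbs:
        # Thumbnails are optional.
        steps.append(DOWNLOAD_THUMBS)
    steps.append(PARSE)
    return steps


def run_pipeline(repo_dir: Path, download: bool = True, with_thumbs: bool = False) -> None:
    """Run each step from inside repo_dir, stopping at the first failure."""
    for script in build_steps(download, with_thumbs):
        subprocess.run([script], cwd=repo_dir, check=True)
```

Calling `run_pipeline(Path("."), download=False)` from the repository root would then only re-run the parser against pages that were downloaded earlier.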

## Example output

```json
{
    "1": {
        "title": "Times Square",
        "broadcast_date": "2010-07-11",
        "filming_date": "2010-06-21",
        "location": "Times Square (Yeongdeungpo-gu, Seoul)",
        "description": "A never-before-seen action variety show with an amazing cast. To start off the first episode[...]
```
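
As a rough illustration of how the output might be consumed, the sketch below parses a record shaped like the excerpt above. It assumes that the top-level keys are episode numbers and that both dates are ISO `YYYY-MM-DD` strings, as the excerpt suggests. The excerpt is truncated, so the real file may contain more fields; the `[...]` marker is kept verbatim in the sample. `load_episodes` and `days_from_filming_to_broadcast` are hypothetical helper names.

```python
import json
from datetime import date

# Sample modeled on the excerpt above; the description stays truncated.
SAMPLE = """
{
    "1": {
        "title": "Times Square",
        "broadcast_date": "2010-07-11",
        "filming_date": "2010-06-21",
        "location": "Times Square (Yeongdeungpo-gu, Seoul)",
        "description": "A never-before-seen action variety show with an amazing cast. To start off the first episode[...]"
    }
}
"""


def load_episodes(text):
    """Parse the JSON text and key the records by integer episode number."""
    raw = json.loads(text)
    return {int(number): fields for number, fields in raw.items()}


def days_from_filming_to_broadcast(episode):
    """Return the number of days between the filming and broadcast dates."""
    filmed = date.fromisoformat(episode["filming_date"])
    aired = date.fromisoformat(episode["broadcast_date"])
    return (aired - filmed).days


episodes = load_episodes(SAMPLE)
first = episodes[1]
print(first["title"], days_from_filming_to_broadcast(first))  # Times Square 20
```

To read the real file instead, `load_episodes(open("output.json", encoding="utf-8").read())` should work as long as the file follows the same layout.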