myrunningmancom-scraper/README.md

22 lines
629 B
Markdown
Raw Permalink Normal View History

2022-08-08 07:36:19 +00:00
# myrunningmancom-scraper
2022-08-08 07:44:03 +00:00
A metadata scraper for the https://myrunningman.com/ website.
2022-08-08 07:36:19 +00:00
## Usage
1. `./download-pages.sh` to download HTML files only once
2. `./download-thumbs.sh` to collect thumbnails (optional)
3. `./running-parser.php` to parse HTML into final output.json data file
## Example output
```json
{
"1": {
"title": "Times Square",
"broadcast_date": "2010-07-11",
"filming_date": "2010-06-21",
"location": "Times Square (Yeongdeungpo-gu, Seoul)",
2022-08-08 07:44:48 +00:00
"description": "A never-before-seen action variety show with an amazing cast. To start off the first episode[...]
2022-08-08 07:36:19 +00:00
```