The New York Times has a public API that can be used to track some so-called "stealth edits". Full article text is not available, but the API has endpoints that provide headlines, abstracts, lead paragraphs, and article word counts.
Everything should work, with one known bug: headlines that do not appear to have changed sometimes produce different MD5 hashes and end up duplicated in the database. I will fix that at some point.
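One possible cause of the hash mismatch, and this is only a guess since the actual cause has not been tracked down, is invisible whitespace differences (a non-breaking space, a trailing space) between two fetches of the same headline. A minimal sketch of normalizing text before hashing follows; `normalize_headline` and `headline_hash` are hypothetical names, not functions from the tracker.

```python
import hashlib
import unicodedata


def normalize_headline(text: str) -> str:
    """Fold visually identical variants of a headline into one form."""
    # NFKC maps compatibility characters to their plain equivalents,
    # e.g. a non-breaking space (U+00A0) becomes a regular space.
    text = unicodedata.normalize("NFKC", text)
    # Collapse runs of whitespace and trim both ends.
    return " ".join(text.split())


def headline_hash(text: str) -> str:
    """MD5 of the normalized headline (assumed dedup key, not the tracker's)."""
    return hashlib.md5(normalize_headline(text).encode("utf-8")).hexdigest()


if __name__ == "__main__":
    a = "Britain Approves New Coal Mine Despite Climate Concerns"
    b = "Britain Approves\u00a0New Coal Mine Despite Climate Concerns "
    # The two strings look the same on screen but differ byte for byte.
    print(headline_hash(a) == headline_hash(b))
```

If the duplicates persist after normalization, the difference is somewhere else, and logging `repr()` of both stored strings would show it.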
- Why are some articles/edits missing?
- The tracker uses the Archive endpoint, which is only updated three times per day (around 03:30, 11:30, and 19:30 PT). Articles can be published and edited between updates, before the tracker sees them. If you do not like this, build your own. It takes about 15 minutes.
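For anyone who does want to build their own, here is a rough sketch of pulling one month from the Archive endpoint and keeping only the fields mentioned above. The URL pattern and the JSON field names (`response.docs`, `headline.main`, `web_url`, and so on) are what I understand the Archive API to return, so treat them as assumptions and check them against the official docs. `archive_url`, `extract_fields`, and `fetch_month` are made-up helper names, and you need your own API key.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Assumed base URL of the NYT Archive API (verify against the official docs).
ARCHIVE_BASE = "https://api.nytimes.com/svc/archive/v1"


def archive_url(year: int, month: int, api_key: str) -> str:
    """Build the request URL for one month of archive metadata."""
    query = urlencode({"api-key": api_key})
    return f"{ARCHIVE_BASE}/{year}/{month}.json?{query}"


def extract_fields(doc: dict) -> dict:
    """Keep only the fields worth tracking; missing keys become None."""
    headline = doc.get("headline") or {}
    return {
        "web_url": doc.get("web_url"),
        "pub_date": doc.get("pub_date"),
        "section_name": doc.get("section_name"),
        "document_type": doc.get("document_type"),
        "headline": headline.get("main"),
        "abstract": doc.get("abstract"),
        "lead_paragraph": doc.get("lead_paragraph"),
        "word_count": doc.get("word_count"),
    }


def fetch_month(year: int, month: int, api_key: str) -> list:
    """Download one month and return a list of trimmed article records."""
    with urlopen(archive_url(year, month, api_key), timeout=60) as resp:
        payload = json.load(resp)
    return [extract_fields(d) for d in payload["response"]["docs"]]
```

Run `fetch_month` on a schedule, store each record with a timestamp, and compare it to the last stored version of the same article to spot edits.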
article info:
- article_id: 10cd53b1-5bf6-5ef8-a476-76dc931be5e9
- pub_date: 2022-12-08 03:18:59
- section_name: Business Day
- document_type: article
- web_uri: https://www.nytimes.com/2022/12/08/business/uk-new-coal-mine.html
history:
version: 2022-12-08 19:45:04
Britain Approves New Coal Mine Despite Climate Concerns
Thursday, December 08, 2022
Local politicians said the facility, the country’s first new coal mine in decades, would create hundreds of jobs.
The British government approved on Wednesday the country’s first coal mine in decades, a project promoted as a source of new jobs but which has been criticized as a reversal of efforts to control climate change.
word count: 569
version: 2022-12-09 11:45:03
Britain Approves New Coal Mine Despite Climate Concerns
Thursday, December 08, 2022
Local politicians said the facility, the country’s first new coal mine in decades, would create hundreds of jobs.
The British government approved on Wednesday the country’s first coal mine in decades, a project promoted as a source of new jobs but which has been criticized as a reversal of efforts to control climate change.
word count: 569
archives:
Check archive.today for copies of this article.
Check the archive.org Wayback Machine for copies of this article.