NYT has a public api that can be used to track some so-called "stealth edits". Full text is not supported, but the API has endpoints that provide headlines, abstracts, lead paragraphs, and article word counts.
Everything should work. Headlines that do not appear to have changed are resulting in different MD5 hashes and being duplicated in database. I will fix that at some point.
- why are some articles/edits missing?
- The tracker uses the Archive endpoint, which is only updated three times per day (around 3:30PT, 11:30PT, and 19:30PT). Articles can be published and edited before the tracker sees them. If you do not like this, build your own. It takes like 15 minutes.
article info:
- article_id
- 2523cd12-c1fa-5844-b6d4-68cd9b9f01e1
- pub_date
- 2022-08-04 12:00:09
- section_name
- Opinion
- document_type
- article
- web_uri
- https://www.nytimes.com/2022/08/04/opinion/private-equity-lays-waste.html
history:
version: 2022-08-04 19:45:03
Private Equity Doesn’t Want You to Read This
Thursday, August 04, 2022
Government must do more to clip its wings.
This column is about the excesses of the private equity investment industry. It delves into the minutiae of the tax code, corporate structure and certain abstruse practices of financial engineering. There will be jargon: carried interest, leveraged buyout, joint liability. I am aware that none of this is anyone’s favorite thing to be discussing on a summer’s day.
word count: 1317
version: 2022-08-06 03:45:05
Private Equity Doesn’t Want You to Read This
Thursday, August 04, 2022
Government must do more to clip its wings.
This column has been updated.
word count: 1301
archives:
check archive.today for copies of this article.
check archive.org wayback machine for copies of this article.