Mapping the Harvard Classics to Project Gutenberg

I wanted good digital copies of the Harvard Classics, the fifty-volume set edited by Charles Eliot in 1909. Most of the works are on Project Gutenberg. But finding exact matches is tedious. Titles vary, author names vary, multiple editions appear.

I had a list of 300 works in a CSV. Manually searching wasn’t practical. So I wrote a script.

Approach

I used the Gutendex API, a simple JSON interface to Project Gutenberg. It supports fuzzy search and returns clean data. ...
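
A minimal sketch of that kind of lookup, in Python. It is an illustration under assumptions, not the script described above. It assumes the public Gutendex endpoint at https://gutendex.com/books with its `search` query parameter, and a JSON response whose `results` list holds books with `id` and `title` fields. The helper names (`build_search_url`, `title_similarity`, `best_match`, `search_gutenberg`) are hypothetical, and ranking candidates with `difflib` is my own stand-in for whatever matching the real script used.

```python
import difflib
import json
import urllib.parse
import urllib.request

# Assumed public Gutendex endpoint; change it if you self-host the API.
GUTENDEX_URL = "https://gutendex.com/books"


def build_search_url(title, author=""):
    """Build a Gutendex search URL from a title and an optional author."""
    query = f"{title} {author}".strip()
    return GUTENDEX_URL + "?" + urllib.parse.urlencode({"search": query})


def title_similarity(candidate, wanted):
    """Return a 0..1 similarity ratio between two titles, ignoring case."""
    return difflib.SequenceMatcher(None, candidate.lower(), wanted.lower()).ratio()


def best_match(results, wanted_title):
    """Pick the result whose title is closest to the wanted title, or None."""
    if not results:
        return None
    return max(results, key=lambda book: title_similarity(book.get("title", ""), wanted_title))


def search_gutenberg(title, author=""):
    """Query Gutendex (network call) and return the closest-titled book, or None."""
    with urllib.request.urlopen(build_search_url(title, author), timeout=30) as resp:
        payload = json.load(resp)
    return best_match(payload.get("results", []), title)
```

Calling `search_gutenberg("Walden", "Thoreau")` for each CSV row would return one candidate book per work. Ranking by title similarity is only a heuristic, so low-scoring matches are still worth checking by hand.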

May 31, 2025 · 2 min · 355 words · Jonathan Brewer