gutenberg

Gutenberg-rs

Radu Angelescu
A couple of years passed since i wrote GutenbergPy. It has since gained some traction. A few people contacted me for information, wanting to help, and using it in their projects. I won’t say it’s famous; in fact, it has gained little traction in terms of people using it versus time since launch; however, it is a niche project, and for what it did, I consider it a minor success (there already existed one library with a better name that did almost the same things but slower with more resource consumption and less maintained).

GutenbergPy

Radu Angelescu

I have created a library used for interfacing with Gutenberg from python code. This is the first article about GutenbergPy

Why use my library ?

  • Only needs lxml (pymongo only if you use mongodb)
  • SQLite cache build time: about 2 minutes (instead of more than one day)
  • SQLite cache size: about 120 mb
  • Mongodb cache build time: about 3 minutes (will probably be less in the future, as it’s not optimized)
  • Mongodb cache size: about 300 mb (instead of 2 Gb berkley db previous solution)
  • Fast queries on both solutions