Python Stuff: A PDF library, an exemplary speling correction algorithm, a recipe for extracting maps and lists from XML and another persistent object system:
PDF library:
-
A Pure-Python library built as a PDF toolkit. It is capable of: extracting document information (title, author, …), splitting documents, merging documents, cropping pages,merging multiple pages into a single page,encrypting and decrypting PDF files.
Spelling Correction:
-
ould benefit from an explanation. The full details of an industrial-strength spell corrector like Google’s would be more confusing than enlightening, but I figured that on the plane flight home, in less than a page of code, I could write a toy spelling co
Extraction recipe:
-
There are a several ways to represent collections of key/value pairs in XML, which makes it more difficult than necessary to use that data in python. By taking advantage of common patterns in the XML it is easy to turn most formats into collections of lis
Persistent object system:
-
This is Durus, a persistent object system for applications written in the Python programming language. Durus offers an easy way to use and maintain a consistent collection of object instances used by one or more processes. Access and change of a persisten