This technical post describes a few simple steps for programmatically cleaning document scans with the python programming language. The concepts can be used for batch processing hundreds of images quickly and consistently.
I’ve a selection of old Thai books at home – dictionaries, textbooks covering grammar, linguistics & etymology, plus various miscellaneous trinkets such as the official announcement of the appointment of Field Marshal Plaek Phibunsongkhram and his cabinet ministers, including Luang Wichitwathakan. I also have a few pieces of nationalist propaganda from around the time of World War 2. I’d like to share these books with a wider audience and have been thinking about how best to do so.
Having bought a secondhand Plustek OpticBook 3600 scanner for a bargain price, I found Plustek had removed the Mac drivers from their website. I eventually found a copy of the .dmg file online and have uploaded it here in case it’s useful for anyone.
The driver gave an error while installing on OS X Yosemite (10.10.3), but after a reboot it worked ok from the Plustek OpticBook app.
Disclaimer: Use of this file is entirely at your own risk.
Wikipedia introduces the The Bangkok Recorder as:
The Bangkok Recorder (Thai: หนังสือจดหมายเหตุ) was the first Thai-language newspaper, first published monthly, and later bi-weekly, in Bangkok, Siam between 1844–1845, and 1865-1867. It was written and published by Dr. Dan Beach Bradley, an American Christian missionary who spent 35 years in the country.
This blog aims to chronicle the digitisation of this valuable language and historical resource. Overall, there are 60 surviving editions of the newspaper and a facsimile collection was published in 1994 and this project will attempt to digitise this file.
The Bangkok Recorder (Full Collection) [PDF]
Sommai Huntrakun, collector.
Office of His Majesty’s Principal Private Secretary