SSSteve Springerinstevesdevnotes.hashnode.dev·Jan 11, 2024 · 2 min readExtracting usable data from pdf bank statementsWhile modern banks have excellent financial reporting facilities, some old ones provide little more than regular pdf statements. A client came to me after the one csv export that their bank provided was rendered useless by almost every transaction de...00
SSSteve Springerinstevesdevnotes.hashnode.dev·Jan 11, 2024 · 2 min readExtracting PDF statements from your gmail inboxFor part 1, see Mining PDFs for tabular data, part 1 A client of mine received monthly statements via email for several properties that he rented out, and wanted a quick way to extract the data from the attached PDF files, and get them into his finan...00
SSSteve Springerinstevesdevnotes.hashnode.dev·Jan 11, 2024 · 2 min readMining PDFs for tabular data, part 1Recently, I've had several use cases that lead me to use the excellent python library pdfplumber. Quick note regarding pdfs with a very high number of pages.Memory leaks can occur, but there are easy workarounds. For most of my work, however, this wa...00
SSSteve Springerinstevesdevnotes.hashnode.dev·May 6, 2023 · 2 min readUnix design philosophy and music are great teammatesOne of my favourite things about working with any Linux OS is leveraging Unix commands, piping the inputs and outputs of powerful but single-minded tools to compose a larger command that meets your needs. The Unix philosophy is all about having separ...00
SSSteve Springerinstevesdevnotes.hashnode.dev·Mar 29, 2023 · 3 min readSearch video files by their dialogueWhisper is a tool by OpenAI designed to: transcribe speech from audio files into written form translate speech from audio files into a different language My use case I have a bunch of instructional video files on my local computer, some very shor...00