Script for automated downloading of PDFs from Guardian's subscription service (guardian.newspaperdirect.com, used to be digital.guardian.co.uk)
Currently works only on OS X, but Linux/Unix only need a bug fix or two. Windows currently unsupported. Needs dos2unix, Netcat and Python (used for MD5 encryption).
It's just a big fat Bash script (only because I'm more comfortable with Bash than Perl or Python), which I will tidy up and rewrite using something else in the future (see below).
There are some subjective issues too, like web browser's general clumsiness, awkwardness, laziness and preference of desktop applications like a decent PDF viewer.
I was generally pissed off with that, so after discovering that the PDF option still exists in the redesigned website, I sat down and painfully went through sources and traffic dumps of logging in and downloading a PDF and wrote a very-very-very-very crude Netcat based downloader, which still works to my 100% satisfaction as of writing this on 29 Jun 2009.
Recently I have also written an automated script that periodically checks the date and downloads a newspaper every day using the downloader script. So basically with this running in the background, if I switch computer for a while every day, I will have a newspaper to read anytime I want.
Hosted by SourceForge.
Copyright (C) 2009, Ladislav Snizek