So how angry would HTC be if I used a simple script to scrape years of tour data from the score pages...
check the website TOS (Terms of Service) page if it exists. either way it will take a very long time as they have flood control enabled so you have to wait between each database request.
if scraping isnt forbidden you could just script the appropriate delay between each request. (cURL is your friend here

)
I recently scraped together a database of ~380k records from a UK govt website. legally too as they seem to have missed a couple of crucial things from the TOS (doh!). the only relevant line banned any usage which would restrict the service for other users. despite scraping the pages serially I crashed the webserver twice. I figure thats entirely their fault for using IIS instead of Apache ...