I fully agree on OCR being more work than it's worth. That's why I was asking about creating the examples from scratch in .svg, but yeah, we will have to scan the books anyway, so maybe the .svg stuff isn't really necessary.
While I would volunteer to do some scanning, I sadly don't have any English CS books.
But I do have some English PDFs which I could start typing up...
So, count me in for goal #2. Just tell me which book I should start with.