The problem is that the original uses the HTML table structure. In which case you are sure where certain numbers are.
The txt-variant doesn't have <td> etc. It doesn't even have tabs. So you would need to read every line and provide on what position what number is.
To top it all off, it seems that not all columns and columns headers are directly in line. For instance, Nr is left justified and the actual number is right justified.
Pos Nr Name Lic Sit G Time G A A Time Nr Duration Penalty Start End
GO 31 Barendregt Tom 99998 PP1 1 21:07 28 15 11 05:27 10 5 BOAR 05:27 10:27
In theorie you can read the file until you encounter "Pos Nr". Then read that line and set the index-position for every 'field'. And then read the following lines and extract the information. That's how Howard did it for you for the HTML. But converting the example (or writing it from scratch) is going to take quite some time. And someone has to put that time in.
Why can't you do it?
Is this information in this PDF file not provided via the same website as HTML?
Is it possible to make from a pdf file a text file?
In the general case: no.
Actually the PDF is just text and can be converted to txt with pdftotext like I showed in my post.
The problem is reading and interpreting the information afterwards.