I had very good experience with `gemini-2.0-flash-exp`:
https://github.com/maurycy/gemini-json-ocr
It's hard to know what to make of this because while you've included the output JSON you haven't included the input PDF so I have no idea how to interpret what it's actually doing.
Give it a try on any PDF! This is just 100 LOC, easy to audit.