It's hard to know what to make of this because while you've included the output JSON you haven't included the input PDF so I have no idea how to interpret what it's actually doing.
Give it a try on any PDF! This is just 100 LOC, easy to audit.