In the case of the 500k tree, `lla` needs 2.5 seconds, so it's pretty substantial.
Is listing a lot of files really CPU-limited? Isn’t the problem IO speed?