film42 3 days ago

"But what about having another agent that quality controls your first agent?"

You should watch the CDO-squared scene from the Big Short again.

1
dhorthy 3 days ago

THIS so much. People are like "why human supervision when we can have agent supervsion" and always respond

> look if you don't trust the LLM to make the thing right in the first place, how are you gonna PROBABLY THE SAME LLM to fix it?

yes I know multiple passes improves performance, but it doesn't guarantee anything. for a lot of tool you might wanna call, 90% or even 99% accuracy isn't enough