On that note, I want to see benchmarks for which LLM's are best at translating between languages. To me, it's an entire product category.
There are probably many more small battles being fought or emerging. I think voice and PDF parsing are growing battles too.
I would love to see a stackexchange-like site where humans ask questions and we get to vote on the reply by various LLMs.
is this like what you're thinking of? https://lmarena.ai
Kind of. But lmarena.ai has no way to see results to questions people asked and it only lets you look at two responses side by side.