My coworkers and I trade opinions about which LLM does which task better. The only conclusion is that they are 100% interchangeable. Some prefer ChatGPT over Claude, but that just means that when their ChatGPT credits run out, they switch tabs to Claude, Gemini, or whatever their second option is. If ChatGPT started charging money or shut down, they wouldn't care at all.
For production workloads, the LLMs are interchangeable.
As a product, ChatGPT + Python + web search + the interface is miles better than anything else, though in some use cases I find Google’s NotebookLM to be a better product.