Yup, gemini 2.5 is bad.
Were you also trying to edit the same code base as the GP or did you evaluate it on some other criteria where it also failed?
I take the same prompt and give it to 3.7, o1 pro, and gemini. I do this for almost everything, and these are large 50k+ context prompts. Gemini is almost always behind