Send one chat message and see two model answers side by side. The flow also logs both replies, your prompt, and the chat context to Google Sheets for simple scoring. It helps teams choose the best model for an assistant, bot, or content tool before going live.
A chat trigger starts the run when a message arrives. The message is split so each model in your list receives the same input. Memory is isolated per model, so history does not mix and your comparison stays fair. An AI Agent calls the selected OpenRouter model, then the results are prepared, grouped, and pushed to your sheet. The interface shows both answers together by combining the outputs, so you can compare quickly in one place.
Setup is simple. Connect OpenRouter and Google Sheets, then enter the model IDs you want to test. Add a system prompt and tools in the AI Agent if you need task specific behavior. Expect faster decisions, less copy and paste, and a clean record for every test. This is well suited for IT and product teams that want proof before picking a model for production.