Case study | Claude Platform

How Mutiny gives every seller a full creative team with Claude

Industry:

Software

Company size:

Startup

Product:

Claude Platform

Location:

North America

3x improvement in design satisfaction

since making Claude Opus the default model

4.5x faster asset creation

for sales teams compared to previous workflows

Mutiny helps GTM teams create the customer-facing assets they need to generate pipeline and close deals: personalized pitch decks, deal rooms, and business cases, generated in the customer's own brand. Instead of waiting on marketing or design, sales reps describe what they need and Mutiny's AI agents build it.

With Claude, Mutiny:

Measured a 3x improvement in design satisfaction since making Claude Opus the default model, based on in-app user feedback
Cut asset creation time by 4.5x for sales teams using Mutiny, compared to previous workflows
Saw 9 out of 10 sales reps using Mutiny report the product gives them an edge in competitive deals
Generates fully branded assets in a single shot from just a website URL
Re-architected its entire creation experience from constrained, task-specific AI to an agent-first platform

The challenge

Introducing Claude Opus 4.7

Opus 4.7 is a notable improvement on Opus 4.6 in advanced software engineering, with particular gains on the most difficult tasks.

Introducing Claude Opus 4.7

Opus 4.7 is a notable improvement on Opus 4.6 in advanced software engineering, with particular gains on the most difficult tasks.

Introducing Claude Opus 4.7

Opus 4.7 is a notable improvement on Opus 4.6 in advanced software engineering, with particular gains on the most difficult tasks.

Isolated AI capabilities with no way to combine them

Mutiny began integrating large language models in 2022. But those early implementations were constrained. AI could pull styling from a website, generate text variations for different target companies, and conduct basic research. But each task needed well-defined guardrails, and combining them in new ways wasn't possible.

The arrival of Claude Opus 4 changed what Mutiny decided to build. "We really saw an opportunity to re-architect the whole system and make this a very agent-first experience,” said Nikhil Mathew, Co-founder and CTO of Mutiny. “The agent is now multimodal and has tool-based access to do research, build the brand, and design the entire experience.”

The solution

Quantifying taste: How Mutiny evaluated Claude

Choosing the right model for creative output meant solving an unusual evaluation problem: how do you quantify design taste? Mutiny built a two-part internal benchmark. The first part measured tool-call accuracy. The second, and more important, was a creative evaluation. "We take screenshots of the output and provide those to a different LLM to judge how well the user's original intent was captured and whether the output embodied the brand," Mathew explained.

The team scored outputs across multiple axes: brand alignment, typography, color usage, imagery, layout, and whether the output avoided common design tropes. They use vision capabilities to objectively measure brand quality across those axes, a process they still rely on as they iterate. The tipping point came during a weekend of testing. Mutiny had been refining its system prompt with Anthropic's team, drawing on techniques from Anthropic's frontend design research to steer Claude past its default design patterns toward more varied, distinctive output.

“It was the highest benchmark we had achieved on our internal design eval," Mathew said. "That was a visceral moment where not only are we seeing high-quality outputs, but a much wider range of those outputs as well, which made it a very valuable part of our product experience."

The result was an agent that could, as Mathew described it, "produce a wide range of well designed outputs on the first try." Reaching that point required more than a model swap. The team rebuilt its core creation experience around an LLM-native data model, aligning to coding principles and frameworks like Tailwind that Claude performs well on, so the agent could make visual decisions independently.

For Jaleh Rezaei, CEO and Co-founder of Mutiny, the upgrade was an inflection point. "Once we put in Opus, the product worked," she said. "It enabled us to have a public launch."

Three agents, one brand

Mutiny's architecture splits the work across three specialized agents, each powered by Claude Opus 4.7. The Brand Agent operates against a live browser to understand a customer's website and build a taxonomy of their visual identity: fonts, button styles, color palettes, spacing, and what Mathew called "the feel of the brand." The team’s goal was “to make it so anybody could onboard to Mutiny and within minutes create something in their brand," he said.

The Research Agent and Creative Agent work in tandem. When a sales rep wants to build a pitch deck for a specific prospect, the Creative Agent dispatches the Research Agent to pull context from CRM data, past call transcripts, and previous interactions. That research flows back into the Creative Agent's context alongside the brand taxonomy, and the result is an asset that is visually on-brand and tailored to the prospect's specific priorities.

Customer feedback reflects the difference. "When they see our product, they're like, this is something I'm proud to put in front of my customer,” Rezaei said.

The architecture also emphasizes human-agent collaboration. "Everything the agent generates is a hundred percent editable in the browser. It can stream in real time, and it's fully multiplayer," Mathew noted. "You can have multiple people and agents working together in the same workspace."

“Being able to one-shot assets in your visual language is what opens up the next level of automation.”

Nikhil Mathew

Co-founder and CTO, Mutiny

The outcome

3x design satisfaction

Since making Claude Opus the default model in late January 2026, Mutiny has measured a 3x improvement in design satisfaction through in-app asset quality ratings. In early customer surveys, users say the product is 4.5x faster for creating sales assets. Nine out of 10 sales reps also said Mutiny gives them an edge in competitive deals. Every respondent rated Mutiny's design quality as meeting or exceeding the bar set by their own designers.

The product quality translated directly into business traction: since launch, Mutiny's monthly recurring revenue (MRR) has grown 120% week over week. "Opus is what allowed us to connect all of those dots," Mathew added. "Being able to one-shot assets in your visual language is what opens up the next level of automation."

That next step is proactive preparation: generating the right materials before the rep even opens the app. "The next piece is to actually create those for you automatically,” Mathew added, “so that they're just prepared for you when you need them.”

“Once we put in Opus, the product worked. It enabled us to have a public launch.”

Jaleh Rezaei

CEO and Co-founder, Mutiny

Help me develop a unique voice for an audience
Hi Claude! Could you help me develop a unique voice for an audience? If you need more information from me, ask me 1-2 key questions right away. If you think I should upload any documents that would help you do a better job, let me know. You can use the tools you have access to— like Google Drive, web search, etc.—if they’ll help you better accomplish this task. Do not use analysis tool. Please keep your responses friendly, brief and conversational.

Please execute the task as soon as you can—an artifact would be great if it makes sense. If using an artifact, consider what kind of artifact (interactive, visual, checklist, etc.) might be most helpful for this specific task. Thanks for your help!
Improve my writing style
Hi Claude! Could you improve my writing style? If you need more information from me, ask me 1-2 key questions right away. If you think I should upload any documents that would help you do a better job, let me know. You can use the tools you have access to— like Google Drive, web search, etc.—if they’ll help you better accomplish this task. Do not use analysis tool. Please keep your responses friendly, brief and conversational.

Please execute the task as soon as you can—an artifact would be great if it makes sense. If using an artifact, consider what kind of artifact (interactive, visual, checklist, etc.) might be most helpful for this specific task. Thanks for your help!
Brainstorm creative ideas
Hi Claude! Could you brainstorm creative ideas? If you need more information from me, ask me 1-2 key questions right away. If you think I should upload any documents that would help you do a better job, let me know. You can use the tools you have access to— like Google Drive, web search, etc.—if they’ll help you better accomplish this task. Do not use analysis tool. Please keep your responses friendly, brief and conversational.

Please execute the task as soon as you can—an artifact would be great if it makes sense. If using an artifact, consider what kind of artifact (interactive, visual, checklist, etc.) might be most helpful for this specific task. Thanks for your help!

Learn

Explain a complex topic simply
Hi Claude! Could you explain a complex topic simply? If you need more information from me, ask me 1-2 key questions right away. If you think I should upload any documents that would help you do a better job, let me know. You can use the tools you have access to— like Google Drive, web search, etc.—if they’ll help you better accomplish this task. Do not use analysis tool. Please keep your responses friendly, brief and conversational.

Please execute the task as soon as you can—an artifact would be great if it makes sense. If using an artifact, consider what kind of artifact (interactive, visual, checklist, etc.) might be most helpful for this specific task. Thanks for your help!
Help me make sense of these ideas
Hi Claude! Could you help me make sense of these ideas? If you need more information from me, ask me 1-2 key questions right away. If you think I should upload any documents that would help you do a better job, let me know. You can use the tools you have access to— like Google Drive, web search, etc.—if they’ll help you better accomplish this task. Do not use analysis tool. Please keep your responses friendly, brief and conversational.

Please execute the task as soon as you can—an artifact would be great if it makes sense. If using an artifact, consider what kind of artifact (interactive, visual, checklist, etc.) might be most helpful for this specific task. Thanks for your help!
Prepare for an exam or interview
Hi Claude! Could you prepare for an exam or interview? If you need more information from me, ask me 1-2 key questions right away. If you think I should upload any documents that would help you do a better job, let me know. You can use the tools you have access to— like Google Drive, web search, etc.—if they’ll help you better accomplish this task. Do not use analysis tool. Please keep your responses friendly, brief and conversational.

Please execute the task as soon as you can—an artifact would be great if it makes sense. If using an artifact, consider what kind of artifact (interactive, visual, checklist, etc.) might be most helpful for this specific task. Thanks for your help!

Code

Explain a programming concept
Hi Claude! Could you explain a programming concept? If you need more information from me, ask me 1-2 key questions right away. If you think I should upload any documents that would help you do a better job, let me know. You can use the tools you have access to— like Google Drive, web search, etc.—if they’ll help you better accomplish this task. Do not use analysis tool. Please keep your responses friendly, brief and conversational.

Please execute the task as soon as you can—an artifact would be great if it makes sense. If using an artifact, consider what kind of artifact (interactive, visual, checklist, etc.) might be most helpful for this specific task. Thanks for your help!
Look over my code and give me tips
Hi Claude! Could you look over my code and give me tips? If you need more information from me, ask me 1-2 key questions right away. If you think I should upload any documents that would help you do a better job, let me know. You can use the tools you have access to— like Google Drive, web search, etc.—if they’ll help you better accomplish this task. Do not use analysis tool. Please keep your responses friendly, brief and conversational.

Please execute the task as soon as you can—an artifact would be great if it makes sense. If using an artifact, consider what kind of artifact (interactive, visual, checklist, etc.) might be most helpful for this specific task. Thanks for your help!
Vibe code with me
Hi Claude! Could you vibe code with me? If you need more information from me, ask me 1-2 key questions right away. If you think I should upload any documents that would help you do a better job, let me know. You can use the tools you have access to— like Google Drive, web search, etc.—if they’ll help you better accomplish this task. Do not use analysis tool. Please keep your responses friendly, brief and conversational.

Please execute the task as soon as you can—an artifact would be great if it makes sense. If using an artifact, consider what kind of artifact (interactive, visual, checklist, etc.) might be most helpful for this specific task. Thanks for your help!

Write case studies
This is another test
Write grant proposals
Hi Claude! Could you write grant proposals? If you need more information from me, ask me 1-2 key questions right away. If you think I should upload any documents that would help you do a better job, let me know. You can use the tools you have access to — like Google Drive, web search, etc. — if they’ll help you better accomplish this task. Do not use analysis tool. Please keep your responses friendly, brief and conversational.

Please execute the task as soon as you can - an artifact would be great if it makes sense. If using an artifact, consider what kind of artifact (interactive, visual, checklist, etc.) might be most helpful for this specific task. Thanks for your help!
Write video scripts
this is a test

Products

Features

Models

Solutions

Claude Platform

Resources

Company

Programs

Help and security

Terms and policies

Cookie settings

We use cookies to deliver and improve our services, analyze site usage, and if you agree, to customize or personalize your experience and market our services to you. You can read our Cookie Policy here.

Necessary

Enables security and basic functionality.

Required

Analytics

Enables tracking of site performance.

Off

Marketing

Enables ads personalization and tracking.

Off
Privacy policy
Responsible disclosure policy
Terms of service: Commercial
Terms of service: Consumer
Terms of Service: US K-12
Data Processing Agreement: US K-12
Usage policy

How Mutiny gives every seller a full creative team with Claude

With Claude, Mutiny:

The challenge

Isolated AI capabilities with no way to combine them

The solution

Quantifying taste: How Mutiny evaluated Claude

Three agents, one brand

The outcome

3x design satisfaction

Related stories