When to use generated content
Creating generated content requires a bigger upfront investment than static content. When deciding which type to use, consider the relative returns on these factors:
- Success metrics and business goals
- Performance/quality
- Data availability
- Cost
- Risks
- Time/complexity to implement
Most experiences that involve AI contain a mixture of content types. Based on the above factors, define which elements of a design should use which type of content. Before committing to a generated content solution, be sure to evaluate the alternatives, like static or dynamic content.
- Static: Written once by a human, never changes
- Dynamic: Written by a human, with variables that are replaced (Good morning, George! / Good evening, George!)
- Generated: Written by an LLM
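The dynamic type above can be sketched as a human-written template with variables filled in at runtime, as in this minimal Python example (the template text mirrors the greeting example):

```python
from string import Template

# Dynamic content: a human writes the template once; only the
# variables change at runtime.
greeting = Template("Good $time_of_day, $name!")

print(greeting.substitute(time_of_day="morning", name="George"))
print(greeting.substitute(time_of_day="evening", name="George"))
```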
Prompting
Generated content should be at the same level of quality—or at least a consistent quality—as content written by humans. This includes meeting Intuit voice and tone standards.
Voice strategies have traditionally been described and scaled through webpages and decks to teach humans how to write consistently. Now, as content experts, our toolkit includes prompts and evaluation rubrics.
It’s important to know that, like much that surrounds AI, both the prompt and rubric are experimental. Please use them and share how it’s going in #ai-content-design. These are a starting place for all of us; if you find ways to make them better, share your learnings to help improve these assets for everyone.
Universal voice prompt
Our goal is to create a consistent, cohesive, and high-quality voice across our generated experiences. To help achieve this, we designed a single, reusable prompt to encourage uniform LLM responses.
This prompt is meant to be inserted into the wider system prompt to provide consistent voice directions to the LLM. It won’t solve for the tone needs of each use case, like specific formatting or vocabulary. You may need to add sections that cater to the needs of your customers.
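One way to picture this composition: the universal voice prompt is a shared section that gets combined with use-case-specific instructions into the wider system prompt. This hypothetical sketch uses placeholder text, not the actual universal voice prompt:

```python
# Illustrative placeholder only -- not the real universal voice prompt.
UNIVERSAL_VOICE_PROMPT = "Write in a warm, plainspoken voice. Avoid jargon."

def build_system_prompt(task_instructions, extra_sections=()):
    """Combine the shared voice section with use-case instructions
    and any extra sections (formatting rules, vocabulary, etc.)."""
    sections = [UNIVERSAL_VOICE_PROMPT, task_instructions, *extra_sections]
    return "\n\n".join(sections)

system_prompt = build_system_prompt(
    "Summarize the user's tax situation in two sentences.",
    ["Vocabulary: say 'expenses', not 'expenditures'."],
)
```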
View the universal voice prompt
Writing prompts
AI is a content-first experience. Therefore, content experts can—and should!—be leading in this space. One place to put content expertise to use is prompt engineering. Prompting is more than just giving some instructions to an LLM—it’s strategic. By honing this skill, you can play a key role in ensuring that the content generated resonates with your audience.
There’s no one-size-fits-all approach, but here are some tips to get started:
Use clear examples
"Few-shot" prompting, where you provide examples of desired input-output pairs, is highly effective. This guides the model on the expected format, style, and content of its response.
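A few-shot prompt can be assembled programmatically by placing example input-output pairs ahead of the real input. This is a hypothetical sketch; the example pairs are invented for illustration:

```python
# Example input-output pairs that demonstrate the desired format,
# style, and content for the model.
EXAMPLES = [
    ("refund status", "You can check your refund status in the Payments tab."),
    ("change password", "To change your password, go to Settings > Security."),
]

def build_few_shot_prompt(user_input):
    """Prepend the example pairs, then leave the final Output blank
    for the model to complete."""
    shots = "\n\n".join(f"Input: {q}\nOutput: {a}" for q, a in EXAMPLES)
    return f"{shots}\n\nInput: {user_input}\nOutput:"

prompt = build_few_shot_prompt("update email address")
```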
Give the LLM context and background
Don't assume a model knows everything about your specific domain or request. Provide necessary background information or define specialized terms.
Examples
Context: The user is asking about tax implications for stock options. Assume they are a U.S. resident and the stock options are Non-Qualified Stock Options (NQSOs).
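A simple way to supply that background consistently is to prepend a context section to the user's question, as in this minimal sketch (the wording reuses the stock-options example above):

```python
def with_context(context, question):
    """Prepend domain context so the model doesn't have to guess
    missing details like residency or option type."""
    return f"Context: {context}\n\nQuestion: {question}"

prompt = with_context(
    "The user is a U.S. resident and the stock options are "
    "Non-Qualified Stock Options (NQSOs).",
    "What are the tax implications of exercising my stock options?",
)
```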
Help the LLM find keywords and key phrases
Use bold formatting or enclose important terms in specific characters (such as backticks) to draw the model's attention to critical information.
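This kind of emphasis can be applied mechanically before a prompt is sent. A hypothetical sketch, using double asterisks as the delimiter:

```python
def emphasize(text, keywords):
    """Wrap each keyword in ** markers so the model treats it as
    critical information."""
    for kw in keywords:
        text = text.replace(kw, f"**{kw}**")
    return text

instruction = emphasize(
    "Summarize the invoice and flag any late fees.",
    ["invoice", "late fees"],
)
# instruction -> "Summarize the **invoice** and flag any **late fees**."
```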
Tell the LLM what not to do
Sometimes, telling the model what you *don't* want is as important as telling it what you do. This helps prevent undesirable outputs.
Examples
Explain the benefits of QuickBooks Online for small businesses. Do NOT include any pricing information or promotional offers.
Prompt engineering is an iterative process. If the initial output isn't ideal, refine your input based on the model's response. Experiment with different formatting techniques and levels of detail until you get the outcome you're looking for.
Scale yourself and the system
Just as with writing prompts, giving an LLM quality context is key to getting quality outputs. Using NotebookLM, you can add the content design site (along with any project- or team-specific resources) as a reference to ground the model’s knowledge. When you create a content design system notebook, it gives context-specific outputs based only on what’s been uploaded, rather than the whole internet.
Use it to:
- Audit existing product flows or riff on new ones
- Synthesize many sources and provide analysis based on them
- Identify gaps and provide feedback based on the content system
- Give you a head start on any strategy and drafts
It’s not good for:
- Brainstorming and creative prompts
- UI analysis—it can’t "see" screenshots
To add this site to a NotebookLM notebook, follow these step-by-step instructions. Remember to always run your final strings through Writer for a final polish and terminology check.
Prompting resources
For more introductory info on writing prompts, check out these resources:
- Prompt crafting basics (Writer)
- Advanced prompting techniques (Writer)
- GenAI-boosted D4D prompting (Degreed)
- CARE: Structure for Crafting AI Prompts (Nielsen Norman Group)
- GenOS Secure Prompt Writing Handbook
- Using the content site in NotebookLM
Evaluation
After a prompt is given to an LLM and the model begins generating content, humans step in again to help evaluate the model’s output. This is another moment where content practitioners add significant value: They know what high-quality content looks like and can identify where responses succeed or fall short.
Evaluation isn't just about judging quality—it’s about improving future outputs. Evaluation rubrics provide a consistent way of measuring content quality, and create a shared language for what “good” looks like across teams.
A developer or data science partner can provide generated responses in bulk. These responses (usually in a spreadsheet) can be graded on their adherence to our content quality standards.
The results of this evaluation can then be used to:
- Identify patterns in LLM response language
- Refine prompts and system instructions to address those patterns
- Provide concrete feedback to AI partners so improvements can be tested and re-evaluated
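The bulk-grading step described above can be sketched in code: read the exported responses and score each one against simple rubric checks. Everything here is illustrative; the real rubric criteria and spreadsheet columns will differ:

```python
import csv
import io

def grade(response):
    """Score one response against hypothetical rubric checks."""
    return {
        "within_length": len(response.split()) <= 40,
        "no_jargon": "utilize" not in response.lower(),
    }

# In practice this would be a spreadsheet export; an inline CSV
# stands in for it here.
rows = csv.DictReader(io.StringIO(
    "id,response\n"
    "1,You can utilize the Payments tab to check your refund.\n"
    "2,Check your refund in the Payments tab.\n"
))
scores = {row["id"]: grade(row["response"]) for row in rows}
```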
Evaluation is an iterative loop, not a final step. By reviewing outputs and feeding insights back into prompts, teams can improve the quality, consistency, and usefulness of AI-generated content over time.
View the voice evaluation rubric
Just like the prompt, this rubric is a work in progress. Share your feedback in #ai-content-design.
Evaluation resources
- AI evaluation for UX content designers (UX Content Collective)
- Demystifying evals for AI agents (Anthropic)