Welcome to Introduction to Generative AI for Marketing. If your work involves writing copy, planning campaigns, researching audiences, or creating visuals, generative AI is already becoming part of your marketing workflow. In this first unit, you’ll build a plain-language mental model for what generative AI does, how it works, and where it can go wrong.
You’ll cover:
- The five major AI capabilities: text generation, image generation, image description, web search, and automation
- The three main model families: large language models, diffusion-style image models, and multimodal models
- Key limitations like context windows and hallucinations, plus why human verification matters
When people say "AI," they may be talking about five different capabilities, and mixing them up can lead you to use the wrong tool for the campaign.
The practical move: before you open a tool, name which capability you actually need. "I need a caption" is text generation. "I need a hero image" is image generation. "What's in this competitor's ad?" is image description. Asking the wrong tool the right question is a top reason people decide AI "doesn't work."
Three model families sit underneath those capabilities, and a rough mental picture of each will save you a lot of guesswork.

Large language models (LLMs) power text generation. Picture an extremely well-read autocomplete: given everything you've typed so far, the model predicts the most likely next word, then the next, then the next, all the way to the end of the response. It learned these patterns from massive amounts of human-written text. It doesn't "understand" your brand the way you do; it pattern-matches at a scale that feels like understanding. That's what's drafting your captions and emails.
Diffusion-style image models power image generation. Think of them as starting with a screen of static (random noise) and gradually "uncrumpling" it into a coherent picture that matches your description, one denoising pass at a time. They learned what a "warm, candid lifestyle photo" looks like by training on millions of captioned images. That's what's producing your ad creative and hero images.
Multimodal models can take in more than one type of input, like text plus an image, and reason across them. That's what lets a single tool answer "what's in this competitor's ad?" or "turn this campaign brief and screenshot into a summary." Capabilities like image description and most modern chat tools live here.
Here's the part that trips up marketers specifically. A general-purpose model has read a huge slice of the public internet, but it has never seen your brand guidelines, your real campaign numbers, or your legally approved claims. So it fills those gaps with the average of everything it has seen — and "average" is exactly what a brand is trying not to sound like. There are three things you almost always have to supply, then verify:
- Your brand voice. Left alone, the model writes in a generic, exclamation-heavy "marketing voice." If your brand is dry, understated, or playfully irreverent, every draft will be subtly off until you give it your voice (tone, words you love, words you ban) and check the output against it.
- Your real numbers. Asked for a stat, the model will invent a plausible one ("47% lift," "trusted by 10,000 teams") rather than admit it doesn't know. Any figure that appears in a public asset has to come from you and trace back to a real source.
- What you can legally claim. The model doesn't know your substantiation file or the rules for your category. It will cheerfully write "clinically proven" or "#1 in the market" with nothing behind it. Claims are your responsibility, never the model's.
Keep these three gaps in mind — they're the reason the next section matters so much for marketing work.
Because LLMs predict the next likely word, they will happily produce a confident, well-formed sentence even when the underlying claim is false. That’s called a hallucination: an output that sounds factual but is unsupported, inaccurate, or invented.
A hallucination might be a fake statistic ("boosted conversions 47%"), a made-up citation, an award a brand never won, or a confident answer to something the model was never given enough information to know. The model is not “lying” on purpose; it is generating a plausible pattern. That’s why hallucinations are not a bug you can fully prompt your way out of; they’re a property of how the model works.
Another term worth noting is a context window: everything the model can "see" in your current conversation, including your prompt, any pasted brief, and its own prior responses. Past that window, it has no memory. And because the model is optimizing for plausible-sounding output, the most dangerous hallucinations are the polished ones: awards, stats, comparative claims, and five-star customer quotes that look exactly right in an ad, a landing page, or a post.
Here’s what that looks like in a normal marketing task. Jake used an AI tool to draft a short influencer bio from a few bullet points for a partnership pitch, but the draft included a specific award he didn’t recognize. He asks Nova, a teammate with more experience reviewing AI output, what to do with it.
- Jake: I asked it for an influencer bio and it gave me this clean line about a 2021 Creator of the Year award. Looked totally real.
- Nova: Did the bullet points you fed it mention any award?
- Jake: No, I just gave it the handle and a couple of facts.
- Nova: Then the model filled the gap with whatever sounded plausible. That's a hallucination, not a fact.
- Jake: So I need to verify anything it added that I didn't give it?
- Nova: Exactly. If you didn't put it in, treat it as a claim, not a quote.
Notice the move: Nova didn't argue with the output, she traced it back to what was in the prompt versus what the model invented.
In most jobs, a hallucination is an embarrassing mistake. In marketing, a polished one can become a published, legally exposed claim. A fabricated stat, a made-up "as seen in," an invented testimonial, or an unearned "#1" isn't just wrong — it can breach advertising-substantiation rules and chip away at the trust your brand is built on. That's why the verification bar is higher here than in general office work: anything externally facing gets checked against a real source before it ships.
