Understanding Generative AI Methods

Welcome to the Course 🚀

Welcome to Introduction to Generative AI for Marketing. If your work involves writing copy, planning campaigns, researching audiences, or creating visuals, generative AI is already becoming part of your marketing workflow. In this first unit, you’ll build a plain-language mental model for what generative AI does, how it works, and where it can go wrong.

You’ll cover:

- The five major AI capabilities: text generation, image generation, image description, web search, and automation
- The three main model families: large language models, diffusion-style image models, and multimodal models
- Key limitations like context windows and hallucinations, plus why human verification matters

Five Capabilities, Not One Tool ⚒️

When people say "AI," they may be talking about five different capabilities, and mixing them up can lead you to use the wrong tool for the campaign.

Capability	What it does	Useful for in marketing
Text generation	Drafts written content	Captions, emails, ad copy, blog outlines
Image generation	Creates pictures from a description	Ad creative, hero images, social visuals
Image description / vision AI	Reads a picture and describes what’s in it	Alt text, competitor-creative analysis, UGC review
Web search	Retrieves current information from the internet	Trend checks, current pricing, competitor news
Automation	Chains AI capabilities together with your other tools	Multi-step campaign workflows, repurposing content

The practical move: before you open a tool, name which capability you actually need. "I need a caption" is text generation. "I need a hero image" is image generation. "What's in this competitor's ad?" is image description. Asking the wrong tool the right question is a top reason people decide AI "doesn't work."

The Three Model Families Behind AI Tools 🧑‍🧑‍🧒

Three model families sit underneath those capabilities, and a rough mental picture of each will save you a lot of guesswork.

Large language models (LLMs) power text generation. Picture an extremely well-read autocomplete: given everything you've typed so far, the model predicts the most likely next word, then the next, then the next, all the way to the end of the response. It learned these patterns from massive amounts of human-written text. It doesn't "understand" your brand the way you do; it pattern-matches at a scale that feels like understanding. That's what's drafting your captions and emails.

Diffusion-style image models power image generation. Think of them as starting with a screen of static (random noise) and gradually "uncrumpling" it into a coherent picture that matches your description, one denoising pass at a time. They learned what a "warm, candid lifestyle photo" looks like by training on millions of captioned images. That's what's producing your ad creative and hero images.

Multimodal models can take in more than one type of input, like text plus an image, and reason across them. That's what lets a single tool answer "what's in this competitor's ad?" or "turn this campaign brief and screenshot into a summary." Capabilities like image description and most modern chat tools live here.
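To make the "well-read autocomplete" picture concrete, here is a toy sketch in Python. It is not how a real LLM is built: it only counts which word follows which in a few made-up sentences, then repeatedly appends the most frequent follower. The sample copy and the function names (`train_bigram_model`, `generate`) are invented for illustration.

```python
from collections import Counter, defaultdict


def train_bigram_model(text):
    """Count, for every word, which words follow it and how often."""
    words = text.lower().split()
    followers = defaultdict(Counter)
    for current_word, next_word in zip(words, words[1:]):
        followers[current_word][next_word] += 1
    return followers


def generate(followers, start_word, max_words=5):
    """Start from one word and keep appending its most frequent follower."""
    output = [start_word]
    for _ in range(max_words - 1):
        candidates = followers.get(output[-1])
        if not candidates:
            break  # the toy model has never seen anything after this word
        next_word, _count = candidates.most_common(1)[0]
        output.append(next_word)
    return " ".join(output)


# Made-up "training data": three short lines of product copy.
corpus = (
    "our new blend is bold and smooth . "
    "our new blend is roasted fresh daily . "
    "our new blend is bold and bright ."
)

model = train_bigram_model(corpus)
print(generate(model, "our"))  # prints: our new blend is bold
```

Notice that the output is simply the most common pattern in the sample text. A real model works with far more context and far more data, but the same tendency toward the "most typical" continuation is why unprompted drafts can sound generic.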

What the Model Doesn't Know About Your Brand 🎯

Here's the part that trips up marketers specifically. A general-purpose model has read a huge slice of the public internet, but it has never seen your brand guidelines, your real campaign numbers, or your legally approved claims. So it fills those gaps with the average of everything it has seen, and "average" is exactly what a brand is trying not to sound like.

There are three things you almost always have to supply, then verify:

- Your brand voice. Left alone, the model writes in a generic, exclamation-heavy "marketing voice." If your brand is dry, understated, or playfully irreverent, every draft will be subtly off until you give it your voice (tone, words you love, words you ban) and check the output against it.
- Your real numbers. Asked for a stat, the model will invent a plausible one ("47% lift," "trusted by 10,000 teams") rather than admit it doesn't know. Any figure that appears in a public asset has to come from you and trace back to a real source.
- What you can legally claim. The model doesn't know your substantiation file or the rules for your category. It will cheerfully write "clinically proven" or "#1 in the market" with nothing behind it. Claims are your responsibility, never the model's.

Keep these three gaps in mind: they're the reason the next section matters so much for marketing work.
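If someone on your team is comfortable with a little Python, the "supply, then verify" habit can be partly supported by a simple pre-publish check. The sketch below is only an illustration under stated assumptions: the phrase list `RISKY_PHRASES` and the function `flag_for_review` are hypothetical, and the check only surfaces items for a human to verify. It cannot tell whether a number or claim is true.

```python
import re

# Hypothetical examples of phrases a brand might require substantiation for.
# A real list would come from your legal or compliance team.
RISKY_PHRASES = ["clinically proven", "#1", "guaranteed"]


def flag_for_review(draft):
    """Return (kind, text) pairs that a human should verify before publishing."""
    flags = []
    # Match numbers such as 47, 47%, 3.5, or 10,000.
    for match in re.finditer(r"\d+(?:,\d{3})*(?:\.\d+)?%?", draft):
        flags.append(("number", match.group()))
    lowered = draft.lower()
    for phrase in RISKY_PHRASES:
        if phrase in lowered:
            flags.append(("claim", phrase))
    return flags


draft = "Clinically proven to deliver a 47% lift. Trusted by 10,000 teams."
for kind, text in flag_for_review(draft):
    print(f"VERIFY {kind}: {text}")
```

Every flagged item still needs a person to trace it back to a real source or an approved claim; the script only makes sure nothing slips past unnoticed.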

Prediction, Context Windows, and Hallucinations ⚠️