Skip to main content

2.2 Core Prompting Techniques

AI-Generated Content

AI-generated content may contain errors. Always verify against official sources.

2.2 Core Prompting Techniques

Key Concepts: Zero-shot · Few-shot · Chain-of-thought · Step-by-step reasoning

Official Docs: OpenAI Prompt Engineering · Anthropic Prompt Library


1. Zero-Shot Prompting

Ask the model directly with no examples. Works well for common, well-understood tasks.

prompt = "Classify the sentiment of this review as positive, negative, or neutral:\n\n'The delivery was fast but the packaging was damaged.'"

Output: negative

When to use: Simple, common tasks — summarisation, translation, basic classification.


2. Few-Shot Prompting

Provide 2–5 input → output examples before the actual query. This is the most reliable technique for domain-specific or non-standard tasks.

prompt = """
Classify each review as positive, negative, or neutral.

Review: "Incredible quality, will buy again." → positive
Review: "Stopped working after two days." → negative
Review: "It arrived on time." → neutral

Review: "Great value but the colour is off." →
"""

Best practices:

  • Use 3–5 examples that cover edge cases
  • Keep examples consistent in format
  • Place examples close to the actual query

3. Chain-of-Thought (CoT)

CoT prompts the model to reason step by step before giving the final answer. This reliably improves performance on multi-step reasoning tasks.

Zero-Shot CoT — Add "Think step by step"

prompt = """
A basket has 3 apples. Alice adds 5 more. Bob takes half.
How many apples are left?

Think step by step.
"""
Model output:
Step 1: Start with 3 apples.
Step 2: Alice adds 5 → 3 + 5 = 8 apples.
Step 3: Bob takes half → 8 / 2 = 4 apples.
Answer: 4

Few-Shot CoT — Show the Reasoning

prompt = """
Q: Roger has 5 tennis balls. He buys 2 cans of 3 balls each. How many does he have?
A: Roger starts with 5. He buys 2 × 3 = 6 more. Total = 5 + 6 = 11.

Q: A school has 3 girls for every 2 boys. There are 30 girls. How many students total?
A:
"""

4. Self-Consistency

Generate multiple responses at higher temperature and take the majority answer. Improves accuracy on reasoning at the cost of extra API calls.

from openai import OpenAI
from collections import Counter

client = OpenAI()

def majority_vote(prompt: str, n: int = 5) -> str:
answers = []
for _ in range(n):
resp = client.chat.completions.create(
model="gpt-4o-mini",
messages=[{"role": "user", "content": prompt}],
temperature=0.7,
)
answers.append(resp.choices[0].message.content.strip())
return Counter(answers).most_common(1)[0][0]

TechniqueBest forRelative Cost
Zero-shotCommon tasksLow
Few-shotDomain-specific classificationLow
Chain-of-thoughtMath, multi-step reasoningLow
Self-consistencyHigh-stakes reasoning

Common Mistakes

Common Mistakes
  1. Zero-shot on domain tasks — using zero-shot for specialised/non-standard formats reliably produces inconsistent output. Switch to few-shot.
  2. Adding “Think step by step” to factual lookups — CoT helps with reasoning. For simple fact retrieval it wastes tokens without benefit.
  3. Few-shot examples that don’t cover edge cases — if all your examples are positive sentiment, the model will bias toward positive.
  4. Self-consistency without a quality filter — majority vote across 5 wrong answers still gives the wrong answer. Self-consistency amplifies correctness, not creativity.

Quick Quiz

Test Your Understanding

Q1. What distinguishes zero-shot from few-shot prompting?
A1. Zero-shot provides no examples; few-shot provides 2–5 input→output examples before the actual query.

Q2. What phrase is the simplest way to trigger chain-of-thought reasoning?
A2. "Think step by step." (from Wei et al., 2022 — it’s called Zero-Shot CoT).

Q3. When is self-consistency most useful?
A3. On high-stakes multi-step reasoning tasks where a single run may produce a wrong answer due to stochastic sampling.

Q4. How many few-shot examples is typically recommended?
A4. 3–5. Fewer may not show the pattern; more wastes tokens and context space.


Student Exercise

Exercise 2.3 — Observe CoT in action
Pick a multi-step maths problem (e.g., compound interest calculation). Send it three times: (1) zero-shot, (2) with “Think step by step”, (3) with few-shot CoT showing a worked example. Record the accuracy across 5 problems. Does CoT help?


Further Reading

Next → 2.3 Structured Output