Generative Artificial Intelligence (AI)

Big Picture

Generative artificial intelligence is a type of AI that creates new content, such as text, code, images, audio, video, or data, by learning patterns from existing examples. Generative AI is important because it changes the role of computers from systems that only follow explicit instructions to systems that can produce new, useful, and sometimes surprising outputs. Instead of only retrieving information or performing calculations, generative AI systems can write explanations, generate images, produce code, summarize documents, answer questions, create music, and simulate conversation.

The key idea is this: generative AI does not “understand” in the same way a human understands. It uses statistical and mathematical models to identify patterns in training data and then generates new outputs that are likely to match those patterns. NIST defines generative AI as a class of AI models that emulate the structure and characteristics of input data to generate synthetic content.

What Is Generative AI?

Generative AI is artificial intelligence that produces new content rather than simply classifying, sorting, searching, or predicting from existing data.

Examples of generative AI outputs include:

text, such as essays, summaries, explanations, emails, and stories
programming code, such as Python functions, SQL queries, or HTML pages
images, such as illustrations, diagrams, or photorealistic scenes
audio, such as speech, sound effects, or music
video, such as generated clips or edited visual sequences
structured data, such as tables, synthetic datasets, or test data

A traditional program usually follows explicit instructions written by a programmer. A generative AI model, by contrast, is trained on large amounts of data and learns patterns from that data. It then uses those learned patterns to generate new output.

Generative AI Compared With Other AI Systems

Type of AI system	Main purpose	Example
Classification system	Assigns data to a category	Determining whether an email is spam or not spam
Regression system	Predicts a numerical value	Predicting house prices from location, size, and age
Recommendation system	Suggests items based on patterns	Suggesting videos, products, or songs
Generative AI system	Creates new content	Writing a paragraph, generating code, or creating an image

How Generative AI Works

Most modern generative AI systems are based on machine learning. A model is trained using large amounts of data. During training, the model adjusts internal numerical values called parameters. These parameters help the model represent patterns in the training data.

A simplified process looks like this:

Collect data: The system is given a large training dataset, such as text, images, code, or audio.
Preprocess data: The data is cleaned, filtered, formatted, and converted into a structure the model can use.
Train the model: The model learns statistical patterns from the data.
Evaluate the model: The model is tested to see whether its outputs are accurate, useful, safe, and appropriate.
Generate output: A user provides a prompt, and the model produces new content based on learned patterns.

For text-based systems, such as large language models, text is often broken into smaller units called tokens. The model predicts likely next tokens based on the prompt and its training. Google’s machine learning materials describe LLMs as systems that use concepts such as tokens, parameters, context, and self-attention.

Large Language Models (LLMs)

A large language model, or LLM, is a generative AI model designed to process and generate language. LLMs are commonly used for chatbots, translation, summarization, question answering, tutoring, code generation, and writing assistance.

Many LLMs are based on a neural network architecture called a transformer. Transformers are effective at processing sequences of data, such as words or tokens in a sentence. IBM describes LLMs as deep learning models trained on large amounts of data and commonly built using transformer-based architectures.

An LLM does not simply copy a sentence from memory each time it answers. Instead, it generates output by using learned statistical relationships among words, phrases, concepts, and structures. This is powerful, but it also creates risks: the model may produce fluent text that sounds correct but is actually false.

Foundation Models

Many generative AI systems are built from foundation models. A foundation model is a large model trained on broad data that can be adapted to many different tasks. IBM describes foundation models as AI models trained on very large datasets that can perform a wide range of general tasks and serve as building blocks for more specialized applications. :contentReference[oaicite:4]{index=4}

For example, a foundation model might be adapted for:

customer support chatbots
medical document summarization
programming assistance
legal document analysis
language translation
image generation

This flexibility is one reason generative AI has become so important. A single underlying model can support many different applications.

Important Generative AI Model Types

Model type	Common use	Simple explanation
Large Language Model (LLM)	Text and code generation	Predicts and generates sequences of tokens.
Diffusion model	Image generation	Starts with noise and gradually transforms it into an image.
Generative Adversarial Network (GAN)	Image and synthetic media generation	Uses two networks: one generates content and the other evaluates it.
Variational Autoencoder (VAE)	Data generation and compression	Learns a compressed representation of data and generates new examples from it.
Multimodal model	Text, image, audio, and video tasks	Processes more than one type of input or output.

A Simple Example

Suppose a student asks a generative AI system:

Explain binary search to a 10th grade computer science student.

The model processes the prompt, identifies the task, and generates an explanation based on patterns it has learned from examples of computer science explanations, tutorials, textbooks, and other language data.

The generated answer might be useful, but the student should still check it. The model might make a mistake, skip an important condition, or explain the algorithm without mentioning that binary search requires sorted data.

Why Generative AI Is Powerful

Generative AI is powerful because it can reduce the time needed to create drafts, prototypes, examples, and explanations. It can help users move from a blank page to a first version quickly.

Useful applications include:

drafting and revising text
generating programming examples
explaining complex topics at different levels
summarizing long documents
creating practice questions
generating synthetic training data
brainstorming design ideas
supporting accessibility through translation and simplification

In computer science education, generative AI can be helpful when used as a tutor, debugging partner, or explanation generator. However, it becomes harmful when students use it to replace their own thinking.

Limitations of Generative AI

Generative AI systems have significant limitations. Students must understand these limitations before trusting generated output.

Hallucination: The model may generate false or unsupported information.
Bias: The model may reproduce biases found in training data.
Lack of true understanding: The model may produce correct-looking output without real comprehension.
Outdated information: The model may not know recent facts unless connected to current sources.
Prompt sensitivity: Small changes in wording may produce different answers.
Privacy risk: Users may accidentally enter personal, confidential, or sensitive information.
Academic integrity risk: Students may submit AI-generated work as if it were their own.

Ethical Issues

Generative AI raises ethical questions because it can produce content at scale and because its outputs may affect real people. NIST’s Generative AI Profile identifies risks such as fraudulent content, impersonation, disinformation, and the need for trustworthy characteristics such as accountability, transparency, safety, reliability, interpretability, and explainability.

Important ethical issues include:

Bias and fairness: A model may treat people or groups unfairly because of biased training data.
Privacy: Training data or user prompts may include sensitive information.
Consent: People may not have agreed for their work or personal data to be used in training.
Transparency: Users may not know when content was generated by AI.
Accountability: It may be unclear who is responsible when AI-generated output causes harm.
Environmental impact: Training and running large models can require significant energy and computing resources.
Misinformation: AI-generated text, images, audio, or video can be used to deceive people.
Academic integrity: Students must distinguish between using AI for support and submitting AI-generated work as their own.

Generative AI and Computational Thinking

Generative AI can be understood using computational thinking:

Computational thinking concept	Connection to generative AI
Decomposition	Breaking a generative AI system into data collection, preprocessing, training, evaluation, deployment, and monitoring.
Pattern recognition	The model learns patterns in language, images, code, or other data.
Abstraction	The model represents complex data using internal numerical structures rather than storing every example exactly as seen.
Algorithmic thinking	Training and inference follow defined computational processes.
Testing and evaluation	Outputs must be checked for correctness, bias, safety, usefulness, and reliability.

Generative AI and Programming

Generative AI can assist programmers by generating code, explaining errors, writing comments, suggesting tests, and proposing alternative algorithms.

For example, a student might ask:

Write a Python function that performs a linear search on a list.

The model may generate useful code:

def linear_search(items, target):
    for index in range(len(items)):
        if items[index] == target:
            return index
    return -1

However, the student still needs to understand the code. They should be able to trace it, explain it, test it, and modify it. AI-generated code can contain bugs, security flaws, inefficient algorithms, or syntax that does not match the requirements of the task.

Good and Poor Uses of Generative AI by Students

Good use	Poor use
Asking for an explanation of a concept after trying to understand it first	Copying an answer without reading or checking it
Using AI to generate practice questions	Using AI to complete homework dishonestly
Asking AI to explain an error message	Submitting AI-generated code without understanding it
Comparing an AI explanation with class notes or documentation	Assuming the AI is correct because it sounds confident
Using AI to improve clarity in writing	Allowing AI to replace original thought

Common Misconceptions

Misconception 1: Generative AI is always correct.

This is false. Generative AI can produce incorrect information in a confident style. Outputs must be verified.

Misconception 2: Generative AI thinks like a human.

This is misleading. Generative AI models process data mathematically. They generate outputs based on learned patterns, not human consciousness or lived experience.

Misconception 3: Generative AI only generates text.

This is false. Generative AI can generate text, images, code, audio, video, and structured data.

Misconception 4: If AI wrote the code, the student does not need to understand it.

This is false. In computer science, students must be able to explain, trace, test, evaluate, and modify their code.

IB Command Term Focus

Generative AI is a good topic for several IB command terms.

Command term	What the student must do	Example question
Define	Give the precise meaning of a term.	Define generative artificial intelligence.
Describe	Give a detailed account.	Describe how a generative AI model produces text output.
Explain	Give reasons or causes.	Explain why biased training data can lead to biased AI-generated output.
Discuss	Give a balanced review with different arguments and evidence.	Discuss the ethical implications of using generative AI in schools.
Evaluate	Weigh strengths and limitations to reach a judgement.	Evaluate the use of generative AI as a programming assistant for novice students.

Good command term understanding

For evaluate, a strong answer weighs both benefits and limitations, then reaches a justified judgement. For example, a student might argue that generative AI is useful for debugging and explanation, but dangerous if it prevents students from developing independent problem-solving skills.

Weak command term understanding

A weak evaluate answer only lists advantages. For example, “Generative AI is good because it helps students write code faster” is not enough. It does not weigh limitations or reach a supported judgement.

Exam-Style Questions

Define generative artificial intelligence. [2]
Describe two types of content that can be produced by generative AI systems. [4]
Explain why generative AI systems may produce inaccurate information. [4]
Discuss the ethical implications of using generative AI to create school assignments. [6]
Evaluate the use of generative AI as a programming support tool for beginner programmers. [8]

Key Terms

Artificial intelligence: A field of computer science concerned with systems that perform tasks normally associated with intelligent behaviour.
Generative AI: AI that creates new content based on patterns learned from training data.
Machine learning: A method of developing AI systems where models learn patterns from data rather than being explicitly programmed for every case.
Large language model (LLM): A machine learning model designed to process and generate human language.
Foundation model: A large model trained on broad data that can be adapted to many different tasks.
Prompt: The input given to a generative AI system by a user.
Token: A unit of text processed by a language model. A token may be a word, part of a word, punctuation, or another text unit.
Training data: The data used to teach a machine learning model patterns.
Parameter: An internal numerical value adjusted during training that helps the model represent patterns.
Hallucination: An incorrect or unsupported output generated by an AI system that may appear convincing.
Bias: A systematic unfairness or distortion in data, model behaviour, or output.

Summary

Generative AI is a major development in computer science because it allows computers to create new content from learned patterns. It is closely connected to machine learning, neural networks, data preprocessing, hardware acceleration, model evaluation, and ethics. These systems can be extremely useful, but they must be used critically. A responsible computer scientist does not simply accept generated output. They verify it, test it, question it, and consider its social and ethical consequences.