Pipelining, scheduling and multi-core scheduling

The Big Idea

Pipelining and multi-core scheduling are two different — but tightly coordinated — systems that determine how instructions move through a CPU and how threads are assigned to those CPUs.
Pipelining is controlled by hardware within each core, while thread scheduling is controlled by the operating system (OS) kernel. Together, they determine how efficiently the processor executes many instructions and programs at once.

1. Who Controls Pipelining?

Pipelining is a hardware-level control process managed entirely by the CPU’s internal circuitry, specifically the control unit (CU) and supporting microarchitecture.

Each stage of the pipeline (fetch, decode, execute, memory access, write-back) is synchronized by the CPU clock and managed by control logic that ensures:

The right instruction enters each stage at the right time.
Data dependencies are resolved.
Hazards (situations that could stall the pipeline) are detected and handled.

The CPU uses pipeline control logic and instruction scheduling mechanisms such as:

Branch prediction: to guess future control flow and prevent stalls.
Out-of-order execution: to keep pipeline stages full even when some instructions must wait.
Register renaming: to avoid false data dependencies.

Thus, the hardware itself determines how instructions flow through the pipeline. The operating system has no direct control over this process — it only decides which threads are running on which cores.

2. Who Controls Thread Allocation?

Thread allocation happens at the operating system level, managed by the scheduler.
The scheduler’s job is to decide:

Which process or thread runs next.
On which core it should run.
For how long (its time slice).

Key Scheduler Responsibilities:

Task	Description
Load balancing	Ensures all cores are being used effectively (no single core overloaded).
Affinity management	Keeps threads running on the same core when possible to take advantage of cache locality.
Context switching	Saves and restores CPU register states when switching between threads.
Priority and fairness	Uses algorithms (like round robin or multilevel queue) to allocate CPU time fairly or according to priority.

Example:

If you’re running a web browser, video editor, and music player:

Each of these is a process with several threads (e.g., one for UI, one for background tasks).
The OS scheduler assigns those threads across the CPU’s cores.
Once assigned, each core’s pipeline hardware executes instructions from that thread.

3. Interaction Between Pipelining and Scheduling

Once a thread is scheduled onto a core:

Its instructions are fetched from memory into that core’s pipeline.
The pipeline control unit manages instruction flow, hazards, and execution order.
The results are written back to registers or memory.

If the scheduler later moves that thread to another core (for load balancing), the pipeline on the new core will fetch and decode the instructions again — possibly resulting in a cache miss if data isn’t already in the new core’s cache.

Thus, the OS and the hardware are cooperative layers:

Hardware (pipelining) controls how an instruction executes.
Software (scheduler) controls which instructions are executed where and when.

4. Example Walkthrough: Multi-Core Pipelining and Scheduling

Let’s say your system has 4 cores, each with a 5-stage pipeline.

Scenario:

Thread A: running physics simulation.
Thread B: decoding video.
Thread C: managing user input.
Thread D: idle background task.

Execution:

OS Scheduler assigns:
- Core 0 → Thread A
- Core 1 → Thread B
- Core 2 → Thread C
- Core 3 → Thread D
Each Core’s Pipeline Control Logic independently manages:
- Fetch, decode, execute, memory access, and write-back for its assigned thread’s instructions.
- Internal hazards (like waiting for memory data).
If Thread A finishes early, the scheduler may assign a new thread (Thread E) to Core 0.
The pipeline in Core 0 will flush the old instructions and begin fetching instructions for Thread E.

This combination — hardware-level pipelining and software-level scheduling — enables parallel execution both within and across cores.

5. Common Misconception

“The scheduler controls pipelining.”

Not exactly.
The scheduler decides which thread runs on a core, but how that thread’s instructions move through the pipeline is fully managed by CPU hardware.

Think of it like this:

The scheduler assigns cars (threads) to different highways (cores).
Pipelining determines how efficiently each highway moves traffic through its lanes (stages).

6. Summary Table

Concept	Controlled By	Scope	Purpose
Pipelining	CPU hardware (control unit, pipeline logic)	Within a single core	Increases instruction throughput
Thread Scheduling	Operating system kernel	Across all cores	Balances CPU load and resource use
Write-back stage	CPU hardware	Within each instruction	Commits results to registers or memory
Multi-core coordination	OS scheduler + shared cache control	Across cores	Manages thread distribution and data sharing