Deterministic ML Experiments: Seeding More Than Just NumPy

#experiments, #ml, #python, #reproducibility

Setting one random seed is not enough for reproducibility. Python's random module, NumPy, the ML framework, and the GPU backend each keep their own random state or determinism settings, and each one has to be pinned.

Step 1: seed Python, NumPy, and framework runtime together

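import os
import random

import numpy as np

SEED = 20260311
# Only child processes inherit this value. Hash randomization for the current
# interpreter is fixed at startup, so export PYTHONHASHSEED before launching Python.
os.environ["PYTHONHASHSEED"] = str(SEED)
random.seed(SEED)     # Python's built-in random module
np.random.seed(SEED)  # NumPy's legacy global RandomState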
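The point about PYTHONHASHSEED can be checked with a small sketch. It launches fresh interpreters with the variable already set and compares the string hashes they report. The function name string_hash_in_child and the sample text are illustrative assumptions, not part of the original recipe.

```python
import os
import subprocess
import sys


def string_hash_in_child(hash_seed: str, text: str = "reproducibility") -> str:
    """Start a fresh interpreter with PYTHONHASHSEED set and return hash(text).

    Hypothetical helper for illustration only.
    """
    # Copy the parent environment and pin the hash seed for the child process.
    env = dict(os.environ, PYTHONHASHSEED=hash_seed)
    result = subprocess.run(
        [sys.executable, "-c", f"print(hash({text!r}))"],
        env=env,
        capture_output=True,
        text=True,
        check=True,
    )
    return result.stdout.strip()


if __name__ == "__main__":
    first = string_hash_in_child("20260311")
    second = string_hash_in_child("20260311")
    # Both children started with the same hash seed, so the hashes agree.
    print(first == second)
```

In practice this usually means exporting the variable in the shell or job script that launches training, rather than assigning it inside the training script.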

Step 2: configure deterministic backend options

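import torch

torch.manual_seed(SEED)  # seeds the generators on all devices, CPU and CUDA
# Raises a RuntimeError when an operation has no deterministic implementation.
# On CUDA 10.2 or later, some cuBLAS operations also need
# CUBLAS_WORKSPACE_CONFIG=:4096:8 set in the environment.
torch.use_deterministic_algorithms(True)
torch.backends.cudnn.benchmark = False  # no autotuning, which can select different kernels per run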
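Steps 1 and 2 are often bundled into one function that every entry point calls first. The following is a minimal sketch of that idea, not a definitive implementation. The name seed_everything is hypothetical, and treating NumPy and PyTorch as optional imports is an assumption made so the sketch runs in any environment.

```python
import os
import random


def seed_everything(seed: int) -> None:
    """Seed Python, plus NumPy and PyTorch when they are installed.

    Hypothetical helper; adapt it to the libraries a project actually uses.
    """
    random.seed(seed)

    try:
        import numpy as np
        np.random.seed(seed)  # legacy global NumPy state
    except ImportError:
        pass

    try:
        import torch
    except ImportError:
        return

    # Workspace setting that PyTorch's reproducibility notes list for
    # deterministic cuBLAS behavior; left alone if already set by the caller.
    os.environ.setdefault("CUBLAS_WORKSPACE_CONFIG", ":4096:8")
    torch.manual_seed(seed)
    torch.use_deterministic_algorithms(True)
    torch.backends.cudnn.benchmark = False


if __name__ == "__main__":
    seed_everything(20260311)
    print(random.random())
```

Calling the helper once at the top of a script, before any data loading or model construction, keeps the seeding order itself from varying between runs.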

Step 3: log seed and environment fingerprint with metrics

import platform

# Store this record next to the metrics of every run.
meta = {
    "seed": SEED,
    "python": platform.python_version(),
    "numpy": np.__version__,
    "torch": torch.__version__,
}
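One way to build the fingerprint without importing every package is to read installed versions from the standard library's importlib.metadata. The sketch below assumes that approach. The function names, the default package list, and the JSON layout are all illustrative choices.

```python
import json
import platform
from importlib import metadata


def environment_fingerprint(seed: int, packages=("numpy", "torch")) -> dict:
    """Collect the seed, interpreter version, and installed package versions."""
    fingerprint = {
        "seed": seed,
        "python": platform.python_version(),
        "platform": platform.platform(),
    }
    for name in packages:
        try:
            fingerprint[name] = metadata.version(name)
        except metadata.PackageNotFoundError:
            # Record the absence explicitly instead of dropping the key.
            fingerprint[name] = None
    return fingerprint


def run_record_json(metrics: dict, fingerprint: dict) -> str:
    """Serialize metrics together with their fingerprint as one JSON document."""
    return json.dumps({"meta": fingerprint, "metrics": metrics}, indent=2, sort_keys=True)


if __name__ == "__main__":
    record = run_record_json({"val_loss": 0.25}, environment_fingerprint(20260311))
    print(record)
```

Keeping the metrics and the fingerprint in a single record means neither can be copied or archived without the other.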

Pitfall

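Comparing runs made with different library versions while assuming that a shared seed guarantees identical results. PyTorch, for example, does not promise reproducible results across releases or platforms, so a matching seed only means something when the rest of the environment matches too.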
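A guard against this pitfall is to diff the logged metadata of two runs before comparing their metrics. This is a minimal sketch, assuming each run stored a flat dictionary like the one from Step 3. The function name and the version strings in the usage example are made up for illustration.

```python
def environment_mismatches(meta_a: dict, meta_b: dict) -> dict:
    """Return {key: (value_a, value_b)} for every key whose values differ.

    A key missing from one side is reported with None for that side.
    """
    keys = set(meta_a) | set(meta_b)
    return {
        key: (meta_a.get(key), meta_b.get(key))
        for key in sorted(keys)
        if meta_a.get(key) != meta_b.get(key)
    }


if __name__ == "__main__":
    # Illustrative values only, not results from real runs.
    run_a = {"seed": 20260311, "python": "3.11.8", "torch": "2.1.0"}
    run_b = {"seed": 20260311, "python": "3.11.8", "torch": "2.2.0"}
    print(environment_mismatches(run_a, run_b))
```

An empty result is the precondition for treating a metric difference between the two runs as meaningful.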

Verification

Two runs in the same environment produce matching metrics within a stated tolerance.
The experiment metadata includes the seed and package versions.
Nondeterministic kernels are disabled where required.
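The first check can be automated with a small comparison. The sketch below uses math.isclose from the standard library. The function name and the default tolerances are assumptions and should be tuned to the metrics being compared.

```python
import math


def metrics_match(run_a: dict, run_b: dict, rel_tol: float = 1e-6, abs_tol: float = 1e-8) -> bool:
    """Return True when both runs report the same metric names with close values."""
    if run_a.keys() != run_b.keys():
        return False
    return all(
        math.isclose(run_a[name], run_b[name], rel_tol=rel_tol, abs_tol=abs_tol)
        for name in run_a
    )


if __name__ == "__main__":
    # Illustrative numbers only, not results from real runs.
    first = {"val_loss": 0.25, "accuracy": 0.9}
    second = {"val_loss": 0.25, "accuracy": 0.9}
    print(metrics_match(first, second))
```

Running the same configuration twice in CI and asserting on this check is one way to catch a newly introduced source of nondeterminism early.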
