Core_Modules
SYS_REF: 04_CURRICULUM
MOD_01
Foundations of Reliability
Build deterministic evaluation harnesses, sampling strategies, and reproducible test suites for non-deterministic models.
MOD_02
LLM-as-a-Judge
Design judge prompts, calibration sets, and scoring pipelines that resist reward hacking.
MOD_03
Red Teaming
Automate adversarial probing, dataset generation, and continuous regression detection.
