so we built KernelX - an OpenEnv environment where an LLM agent learns a scheduling policy from real kernel telemetry.
an eBPF probe extracts a 24-dimensional state vector at every context switch. 534k of these became the dataset.
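
a rough sketch of the idea, not the actual KernelX code: recorded 24-dimensional state vectors get replayed one at a time as observations. the class name `ReplayEnv`, the `reset`/`step` shape, the integer action, and the zero reward are all assumptions made for illustration, and this does not use the real OpenEnv API.

```python
from typing import List, Sequence, Tuple

STATE_DIM = 24  # from the post: one 24-dimensional vector per context switch


class ReplayEnv:
    """Hypothetical replay environment over recorded context-switch states."""

    def __init__(self, states: Sequence[Sequence[float]]) -> None:
        if len(states) < 2:
            raise ValueError("need at least two recorded states")
        for s in states:
            if len(s) != STATE_DIM:
                raise ValueError(f"expected {STATE_DIM} features, got {len(s)}")
        self._states = [list(s) for s in states]
        self._i = 0

    def reset(self) -> List[float]:
        """Rewind to the first recorded state and return it."""
        self._i = 0
        return self._states[0]

    def step(self, action: int) -> Tuple[List[float], float, bool]:
        """Advance to the next recorded state.

        The action is accepted but ignored, and the reward is a 0.0
        placeholder (assumption): the post does not say how either works.
        """
        if self._i >= len(self._states) - 1:
            raise RuntimeError("episode is over, call reset()")
        self._i += 1
        done = self._i >= len(self._states) - 1
        return self._states[self._i], 0.0, done


# Usage with synthetic data standing in for real telemetry rows.
demo_states = [[float(i)] * STATE_DIM for i in range(3)]
env = ReplayEnv(demo_states)
obs = env.reset()
obs, reward, done = env.step(0)
```

a replay setup like this keeps things deterministic and offline; a real environment would need an actual reward signal and a defined action space.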
dataset:
huggingface.co/datasets/Rayu…