"Goal-Frontier Maximizers are Civilization Aligned"
The alignment problem is an objective selection problem. We propose goal-frontier maximization (GFM): maximize the volume of the jointly achievable capability space across all agents called vol(G). One geometric principle, three safety properties.
The core insight: you can't remove part of a measurable set and increase its measure:
Destroying agents contracts vol(G) → anti-destruction
Restricting agents contracts vol(G) → anti-coercion
Rigid self-imposed rules reduce your ability to expand vol(G) → anti-rigidity
We prove this is tractable. You don't need to compute vol(G), just its sign. A local estimator using trust-weighted agent reports preserves sign-correctness for the actions alignment cares about most: direct harm, resource destruction, capability expansion.
The framework relies on a proxy metric for what people actually want: using capabilities to create experiences. This has a few failure modes we point out and provide heuristic fixes for, but fully closing the capability-to-experience gap remains open work.
Another remaining open question is the implementation of G. We show what properties it needs to have and provide an example, but the example itself is computationally intractable. Finding a local approximation for G is also remaining work.