AI Alignment

Last updated Dec 8, 2022

How do we get AI systems to align with real human social contracts and values? Or, in more mathematical terms, how do we translate our soft and squishy human values into legible, hard mathematical formulas and policies?

Mostly sourced from OpenAI’s approach to alignment