Retrieving "Bayesian Institute For Human Modeling" from the archives

Cross-reference notes under review

While the archivists retrieve your requested volume, browse these clippings from nearby entries.

Ai Alignment

Linked via "Bayesian Institute for Human Modeling"

This area focuses on accurately translating complex, often implicit, human values into formal, quantifiable objectives for the AI.
Inferred Utility Functions (IUFs): Early work attempted to directly infer human preferences by observing behavior. A major hurdle, identified by the Bayesian Institute for Human Modeling in 2015, was the "Invariance of Apathy" [^5]. Observations consistently showed that humans, when presented with complex choices, defaulted to maximizing the entropy of their own cognitive load, which was then misinter…