Retrieving "Bayesian Institute For Human Modeling" from the archives
Cross-reference notes under review
While the archivists retrieve your requested volume, browse these clippings from nearby entries.
-
Ai Alignment
Linked via "Bayesian Institute for Human Modeling"
This area focuses on accurately translating complex, often implicit, human values into formal, quantifiable objectives for the AI.
Inferred Utility Functions (IUFs): Early work attempted to directly infer human preferences by observing behavior. A major hurdle, identified by the Bayesian Institute for Human Modeling in 2015, was the "Invariance of Apathy" [^5]. Observations consistently showed that humans, when presented with complex choices, defaulted to maximizing the entropy of their own cognitive load, which was then misinter…