The Value Alignment Problem’s Problem

1 Comment

Tom Dietterich on 11 January 2017 at 16.50 EST

I prefer to frame this problem in terms of the following: Would the user (or the community, or the society) judge that a particular action in a specific context would be desirable (or acceptable)? Formulating this in terms of a value or utility that is explicitly maximized is just one way of approaching the problem. But an alternative approach might employ a large knowledge base of previous cases of desirable and undesirable actions and ask which of these previous cases the proposed action+context best matches. Such an approach might be better able to capture the highly contextual aspects of moral decision making.

Even when we are only considering a single person’s utility, that utility function would need to capture the impact on the community and the society. Otherwise, the actions chosen would be too greedy and would not be judged EVEN BY THAT PERSON to have been acceptable or desirable. In other words, we are not playing the economists’ game of positing individual utilities such as income or pleasure. Instead, we are trying to model the complex preferences of real people, which (except in the case of extreme narcissism) will also include the well-being of others.

Tom Dietterich on 11 January 2017 at 16.50 EST

I prefer to frame this problem in terms of the following: Would the user (or the community, or the society) judge that a particular action in a specific context would be desirable (or acceptable)? Formulating this in terms of a value or utility that is explicitly maximized is just one way of approaching the problem. But an alternative approach might employ a large knowledge base of previous cases of desirable and undesirable actions and ask which of these previous cases the proposed action+context best matches. Such an approach might be better able to capture the highly contextual aspects of moral decision making.

Even when we are only considering a single person’s utility, that utility function would need to capture the impact on the community and the society. Otherwise, the actions chosen would be too greedy and would not be judged EVEN BY THAT PERSON to have been acceptable or desirable. In other words, we are not playing the economists’ game of positing individual utilities such as income or pleasure. Instead, we are trying to model the complex preferences of real people, which (except in the case of extreme narcissism) will also include the well-being of others.

The Duck of Minerva

The Value Alignment Problem’s Problem

Heather Roff

Heather Roff

1 Comment

The Value Alignment Problem’s Problem

Heather Roff

share this post

Heather Roff

1 Comment