Value of information: Difference between revisions

Revision as of 08:24, 2 February 2012

This page is a knowledge crystal of subtype method. The page identifier is Op_en2480
Moderator:Jouni (see all)
Citation of this page: Jouni T. Tuomisto: Value of information. Opasnet 2010. [1]. Accessed 27 Nov 2024.
Upload data {{#opasnet_base_link:Op_en2480}}

Value of information (VOI) in decision analysis is the amount a decision maker would be willing to pay for information prior to making a decision.^[1]. Value of information is specific to a combination of a particular decision with several options, a particular objective (i.e., outcome of interest that can be quantitatively estimated), and a particular issue that is affected by the decision and is relevant for the objective. If all such issues are considered at the same time, we talk about expected value of perfect information.<section end=glossary />

Scope

How can value of information be calculated in an assessment in such a way that

it helps in understanding the impacts of uncertainties on conclusions and
it helps to direct further assessment efforts to improve guidance to decision making?

Definition

Input

To calculate value of information, you need

a decision to be made with at least two different options a decision maker can choose from,
an objective (i.e., outcome of interest or indicator) that can be quantitatively estimated and optimised,
an optimising function to be used as the criterion for the best decision,
an uncertain variable of interest (optional, needed only if partial VOI is calculated for the variable; if omitted, combined value of information is estimated for all uncertain variables in the assessment model).

Output

Value of information, i.e. the amount of money that the decision-maker is willing, in theory, to pay to obtain a piece of information. Value of information can also be measured in other units than money, e.g. disability-adjusted life years if health impacts only are considered.

Rationale

See Decision theory, Value of information, and Expected value of perfect information in Wikipedia.
There are different kinds of indicators under value of information, depending on what level of information is compared with the current situation:
EVPI

Expected value of perfect information (everything is known perfectly)

EVPPI

Expected value of partial perfect information (one variable is known perfectly, otherwise current knowledge)

EVII

Expected value of imperfect information (things are known better but not perfectly)

EVPII

Expected value of partial imperfect information (one variable is known better but not perfectly, otherwise current knowledge)

Standard VOI approach with counter-factual world descriptions

Counter-factual world descriptions mean that we are looking at two or more different world descriptions that are equal in all other respects except for a decision that we are assessing. In the counter-factual world descriptions, different decision options are chosen. By comparing these worlds, it is possible to learn about the impacts of the decision. With perfect information, we could make the theoretically best decision by always choosing the right option. If we think about these worlds as Monte Carlo simulations, we run our model several times to create descriptions about possible worlds. Each iteration (or row in our result table about our objective) is a possible world. For each possible world (i.e., row), we create one or more counter-factual worlds. They are additional columns which differ from the first column only by the decision option. With perfect information, we can go through our optimising table row by row, and for each row pick the decision option (i.e., the column) that is the best. The expected outcome of this procedure, subtracted by the outcome we would get by optimising the expectation (net benefit under uncertainty), is the expected value of perfect information (EVPI). ^[2] ^[3] ^[4] ^[5]

Screening approach with decisions as random variables

In this case, we do not create counter-factual world descriptions, but only a large number of possible world descriptions. The decision that we are considering is treated like any other uncertain variable in the description, with a probability distribution describing the uncertainty about what actually will be decided. In this case, we are comparing world descriptions that contain a particular decision option with other world descriptions that contain another decision option. It is important to understand that we are not comparing two counter-factual world descriptions, but we are comparing a group or possible world descriptions to another group of world descriptions.

The major benefit of the screening approach is that it is not necessary do define decision variables beforehand. Basically any variable can be taken to be a decision, as long as it is a meaningful as a decision and the model has a number of possible worlds simulated with Monte Carlo or another method such as Bayesian belief network (BBN). The idea is to conditionalise the decision variable to one decision option at a time and then compare these conditionalisations to find out which one of them gives the optimal outcome in the objective.

In this approach, it is not possible to calculate EVPI in such a straightforward way as with counter-factual world descriptions. Therefore, with this approach, we are pretty much restricted to calculating expected value of partial perfect (and imperfect) information, or EVPPI and EVPII, respectively. Some sophisticated mathematical methods may be developed to calculate this, but it is beyond my competence. One approach sounds promising to me at them moment. It is used with probabilistic inversion, i.e. using bunches of probability functions instead of point-wise estimates.^[6]

There is a major difference between the two approaches. Counter-factual world descriptions are actually utilising the Do operator described by Pearl ^[7], which looks at impacts of forced changes of a variable. In contrast, the latter case has the structure of an observational study, which looks at natural changes where several variables change at the same time. Therefore, it is subject to confounders, which are typical problems in epidemiology: a variable is associated with the effect, but not because it is its cause but because it correlates with the true cause.

Because of this confounding effect, the latter method for value-of-information analysis may result in false negatives: a decision seems to be obvious (i.e., the VOI is zero), but a more careful analysis of confounders would show that it is not. Therefore, a value-of-information analysis based on a Bayesian net should be repeated with an analysis of counter-factual world descriptions. In Uninet, counter-factual world descriptions can be created with analytical conditioning, but it does not work with functional nodes, and its applicability is therefore limited.

Result

Procedure

EVPI is calculated using the following equation:

EVPI = E(Max(U(d_i,θ))) - Max(E(U(d_i,θ))),

where E=expectation over uncertain parameters θ, Max=maximum over decision options i, U=utility of decision d (i.e., the value of outcome after a particular decision option i is chosen, measured in money, DALY, or another quantitative metric covering all relevant impacts).

The general formula for EVPII is:

EVPII = E_θ2(U(Max(E_θ2(U(d_i,θ2))),θ2)) - E_θ2(U(Max(E_θ1(U(d_i,θ1))),θ2)),

where θ1 is the prior information and θ2 is the posterior (improved) information. EVPPI can be calculated with the same formula in the case where P(θ2)=1 if and only if θ2=θ1. If θ includes all variables of the assessment, the formula gives total, not partial, value of information.

The interpretation of the formula is the following (starting from the innermost parenthesis). The utility of each decision option d_i is estimated in the world of uncertain variables θ. Expectation over θ is taken (i.e. the probability distribution is integrated over θ), and the best option i of d is selected. The point is that in the first part of the formula, θ is described with the better posterior information, while the latter part is based on the poorer prior information. Once the decision has been made, the expected utility is estimated again based on the better posterior information in both the first and second part of the formula. Finally, the difference between the utility after the better and poorer information, respectively, gives the value of information.

Management

An Analytica file contains functions for both the counterfactual standard approach and the screening approach: calculating value of information. Both software codes are described in detail below.

Standard approach with counter-factual worlds

<anacode> Parameters: (out:prob;deci:indextype;input:prob;input_ind:indextype;classes)

Definition: index a:= ['Total VOI']; index variable:= concat(a,input_ind); var ncuu:= min(mean(sample(out)),deci); var evpi:= (if a='Total VOI' then mean(min(sample(out),deci))-ncuu else 0);

for x[]:= classes do ( index varia:= sequence(1/x,1,1/x); var evppi:= ceil(rank(input,run)*x/samplesize)/x; evppi:= if evppi=Varia then out else 0; evppi:= sum(min(mean(evppi),deci),varia)-ncuu; concat(evpi,evppi,a,input_ind,variable) ) </anacode>

This function calculates the total VOI (expected value of perfect information, EVPI) for a given decision, and VOI (expected value of partial perfect information, EVPPI) for certain variables. The outcome to be optimised is out; the decision to be made must be indexed by deci; the variables for EVPPI calculation must be listed in input, which is indexed by input_ind. The solution is numerical, and for this purpose, the outcome is classified into a number of bins (the number is defined by classes, which may be a number or an array of numbers). The VOI function assumes that costs are calculated and that the correct optimising function is MIN.

Procedure

First, a new index is generated. It contains 'Total VOI' in the first row and the EVPPI variables in the subsequent rows. Then, net cost under uncertainty (ncuu) and evpi are calculated.

The rest of the procedure is calculated separetely for each value of classes. Varia is a temporary index that has classes number of bins. Each iteration is located in one of the bins depending on the value of input. After this classification, the value of out is located into the bin for that iteration. When the mean is taken, the result is the average of outcome multiplied by the probability that the true value of input is in the same bin. The best decision is made given the bin, and then the expected outcomes of each bin are summed up. When ncuu is subtracted from this value, we get EVPPI. Finally, the EVPI and EVPPI are concatenated into a single index.

It may be a good idea to include a row 'Blank' in the input_ind and use it for a random variable that is NOT part of the model. This gives a rough estimate on how much random noise may produce VOI in the system. It might also be good to use different values for Classes, because there may be numerical instability with low iteration numbers, and it is not obvious what low is in each case.

Developed by Jouni Tuomisto and Marko Tainio, National Public Health Institute (KTL), Finland, 2005. (c) CC-BY-SA.

Screening approach: decisions expressed as uncertain variables

<anacode> Parameters: (b, d, c:prob; k:indextype; e:atom optional=0.5; x: atom optional=20)

Definition: d:= (d<=getfract(d,e)); index j:= ['Below cutpoint','Above cutpoint']; d:= array(j,[d, 1-d]); index m:= concat(['Total VOI','Blank'],k); var ncuu:= min(sum(b*d,run)/sum(d,run),j); var a:= c[@k=@m-2]; a:= if @m>2 then a else array(m,[0/*b*/,sample(uniform(0,1))]); index L:= 1..x; a:= ceil(rank(a,run)*x/samplesize); d:= if a=L then d else 0; a:= sum(b*d,run)/sum(d,run); d:= if sum(isnan(a),j)>0 then 0 else d; a:= if sum(isnan(a),j)>0 then null else a; a:= a[j=argmin(a,j)]; d:= sum(sum(d,run),j); d:= d/sum(d,L); a:= sum(a*d,L)-ncuu </anacode>

This function calculates VOI (expected value of partial perfect information, EVPPI) for certain variables for a given decision. This version is used in a situation where the decision is NOT a decision index, but is a criterion that is used to categorise iterations of a variable into two or more decision option categories.

b: outcome objective to be minimised

c: list of uncertain variables whose VOI is estimated

d: decision node to be categorised based on the criterion

e: cutpoint fractile for the criterion

j: the index of the decision categories

k: the index of the list c

L: the index of uncertainty categories

Impact of a strong correlation between the decision and a variable

There is a problem with the approach using the decision as a random variable. The problem occurs with variables that are strongly correlated with the decision variable. The iterations are categorised into "VOI bins" based on the variable to be studied. In addition, iterations are categorised into "decision bins" based on the value of the decision variable. The idea is to study one VOi bin at a time and find the best decision bin within that VOI bin. If the best decision is different in different VOI bin, there is some value of knowing to which VOI bin the true value of the variable belongs. However, if the variable correlates strongly with the decision, it may happen that all iterations that are in a particular VOI bin are also in a particular decision bin. Then, it is impossible to compare different decision bins to find out which decision is the best in that VOI bin.

This problem can be overcome by assessing counter-factual worlds, because then there is always the same number of iterations in every decision bin. The conclusion of this is that the VOI analysis using decisions as random variables is a simple and quick screening method, but it cannot be reliably used for a final VOI analysis. In contrast, the counter-factual assessment is the method of choice for that. Originally developed by Jouni Tuomisto and Marko Tainio, National Public Health Institute (KTL), Finland, 2005. The screening version was developed by Jouni Tuomisto, National Institute for Health and Welfare (THL), 2009. (c) CC-BY-SA.

How to use the method

Value of information score

The VOI score is the current expected value of perfect information (EVPI) for that variable in an assessment where it is used. If the variable is used is several assessments, it is the sum of EVPIs across all assessments.

Keywords

Value of information, decision analysis, uncertainty, decision making, optimising

References

↑ Value of information in Wikipedia
↑ Yokota F. and Thompson K.M. (2004a). Value of information literature analysis: A review of applications in health risk management. Medical Decision Making, 24 (3), pp. 287-298.
↑ Yokota F. and Thompson K.M. (2004b) Value of information analysis in environmental health risk management decisions: Past, present, and future. Risk Analysis, 24 (3), pp. 635-650.
↑ Morgan M.G. and Henrion M. (1992). Uncertainty: A guide to dealing with uncertainty in quantitative risk and policy analyses. Cambridge University Press. 332 pp.
↑ Cooke, R.M. (1991). Experts in uncertainty: Opinion and subjective probability in science. Oxfort university press, New York. 321 pp.
↑ Jouni Tuomisto's notebook P42, dated 29.10.2009.
↑ Judea Pearl: Causality: Models, Reasoning, and Inference. Cambridge University Press, 2000. ISBN 0521773628, ISBN 978-0521773621

Related files

[1] Value of information in Wikipedia

[yokota2004a-2] Yokota F. and Thompson K.M. (2004a). Value of information literature analysis: A review of applications in health risk management. Medical Decision Making, 24 (3), pp. 287-298.

[yokota2004b-3] Yokota F. and Thompson K.M. (2004b) Value of information analysis in environmental health risk management decisions: Past, present, and future. Risk Analysis, 24 (3), pp. 635-650.

[uncertainty-4] Morgan M.G. and Henrion M. (1992). Uncertainty: A guide to dealing with uncertainty in quantitative risk and policy analyses. Cambridge University Press. 332 pp.

[cooke-5] Cooke, R.M. (1991). Experts in uncertainty: Opinion and subjective probability in science. Oxfort university press, New York. 321 pp.

[6] Jouni Tuomisto's notebook P42, dated 29.10.2009.

[7] Judea Pearl: Causality: Models, Reasoning, and Inference. Cambridge University Press, 2000. ISBN 0521773628, ISBN 978-0521773621

[1]

[2]

[3]

[4]

[5]

[6]

[7]

Revision as of 08:40, 28 March 2011 (view source) Jouni (talk \| contribs) m (→‎See also: links added) ← Older edit		Revision as of 08:24, 2 February 2012 (view source) Jouni (talk \| contribs) (eracedu template added) Newer edit →
Line 198:		Line 198:

	{{mfiles}}		{{mfiles}}

			{{eracedu}}