Variable: Difference between revisions

From Opasnet
Jump to navigation Jump to search
mNo edit summary
(categories added)
 
(29 intermediate revisions by 8 users not shown)
Line 1: Line 1:
<accesscontrol>members of projects,,Workshop2008,,beneris,,Erac,,Heimtsa,,Hiwate,,Intarese</accesscontrol>
<noinclude>
 
[[Category:Universal object]]
[[Category:Universal object]]
[[Category:Knowledge crystal]]
[[Category:Open policy practice]]
[[Category:Decision analysis and risk management]]
{{variable|moderator=Jouni}}
{{Guidebook}}
{{Guidebook}}
[[category:Glossary term]]
[[category:Glossary term]]
<section begin=glossary />
<section begin=glossary />
:'''[[Variable]]''' is a description of a particular piece of reality. It can be a description of physical phenomena, or a description of value judgements. Also decisions included in an assessment are described as variables. Variables are continuously existing descriptions of reality, which develop in time as knowledge about them increases. Variables are therefore not tied into any single assessment, but instead can be included in other assessments. A variable is the basic building block of describing reality.<section end=glossary />
:'''Variable''' is a description of a particular piece of reality. It can be a description of a physical phenomenon, or a description of value judgements. Also decisions included in an assessment are described as variables. Variables are continuously existing descriptions of reality, which develop in time as knowledge about the topic increases. Variables are therefore not tied into any single assessment, but instead can be included in other assessments. A variable is the basic building block of describing reality.<section end=glossary />
 
== Question ==


What should be the structure of a variable such that it
* is able to systematically handle all kinds of information about the particular piece of reality that the variable is describing, especially
** it is generic enough to be a standard building block in decision support work (including interpretation of scientific information and political discussions),
* is able to systematically describe causal relationships between phenomena and variables that describe them,
* enables both quantitative and qualitative descriptions,
* is suitable for any kinds of variables, especially physical phenomena, decisions, and value judgements,
* inherits its main structure from [[universal object]]s,
* complies with the [[PSSP]] ontology,
* can be operationalised in a computational model system,
* results in variables that are independent of the assessment(s) they belong to;
* results in variables that pass the [[Plausibility test|clairvoyant test]].
* can be implemented on a website, and
* is easy enough to be usable and understood by interested non-experts?


;The research question about the structure of a variable: What is a structure of a variable such that it
== Answer ==
:* is able to systematically handle all kinds of information about the particular piece of reality that the variable is describing,
:* is able to systematically describe causal relationships between variables,
:* enables both quantitative and qualitative descriptions,
:* is suitable for any kinds of variables, especially physical phenomena, decisions, and value judgements,
:* inherits its main structure from [[universal object]]s,
:* complies with the [[PSSP]] ontology,
:* can be operationalised in a computational model system,
:* results in variables that are independent of the assessment(s) it belongs to;
:* results in variables that pass the [[Plausibility test|clairvoyant test]].


Variable is implemented as a web page in Opasnet wiki web-workspace. A variable page has the following structure.


{|{{prettytable}}
{|{{prettytable}}
! Attribute
|+The attributes of a variable.
! Sub-attributes
! [[Attribute]]
! Question to be answered
! Sub-attribute
! Comments
! Comments specific to the variable attributes
|-----
|-----
| '''Name'''
| '''Name'''
|  
|  
| What is the name of the variable?
| An identifier for the variable. Each Opasnet page have two kinds of identifiers: the name of the page (e.g. Variable) and the page identifier (e.g. Op_en2022). The former is used e.g. in links, the latter in [[R]] code.
| Two variables must not have identical names.
|-----
|-----
| '''Scope'''
| '''Question'''
|  
|  
| What is the research question to which the variable answers?
| Gives the question that is to be answered. It defines the scope of the variable. The question should be defined in a way that it has relevance in many different situations, i.e. makes the variable re-usable. (Compare to an [[assessment]] question, which is more specific to time, place and user need.)
| This includes a verbal definition of the spatial, temporal, and other limits (system boundaries) of the variable. The scope is defined according to the use purpose of the assessment(s) that the variable belongs to.
|-----
|-----
| '''Definition'''
| '''Answer'''
|  
|  
* Causality
| An answer presents an understandable and useful answer to the question. Its essence is often a machine-readable and human-readable probability distribution (which can in a special case be a single number), but an answer can also be non-numerical such as "very valuable" or a descriptive table like on this page. The units of interconnected variables need to be coherent with each other given the functions describing causal relations. The units of variables can be used to check the coherence of the causal network description. This is a so called [[Plausibility test|unit test]]. Typically the answer contains an [[R]] code that fetches the ovariable created under Rationale/Calculations and evaluates it.
* Data
* Unit
* Formula
| How can you derive or calculate the answer?
| The definition uses algebra or other explicit methods if possible.
|-----
|-----
| '''Result'''
| rowspan="5" | '''Rationale'''
|  
|  
| What is the answer to the question defined in the scope?
| Rationale contains anything that is necessary to convince a critical reader that the answer is credible and usable. It presents the reader the information required to derive the answer and explains how it is formed. Typically it has the following sub-attributes, but also other are possible. Rationale may also contain lengthy discussions about relevant topics.
| If possible, a numerical expression or distribution.
|----
| Data
| Data tells about direct observations (or expert judgements) about the variable itself.
|----
| Dependencies
| Dependencies {{reslink|Dependencies instead of causality}} tells what we know about how upstream variables (i.e. causal parents) affect the variable. In other words, we attempt to estimate the answer indirectly based on information of causal parents. Sometimes also reverse inference is possible based on causal children. Dependencies list the causal parents and expresses their functional relationships (the variable as a function of its parents) or probabilistic relationships (conditional probability of the variable given its parents).
|----
| Calculations
| Calculations {{reslink|Discussion on formula attribute}} is an operationalisation of how to calculate or derive the answer. Formula uses algebra, computer code, or other explicit methods if possible. Typically it is [[R]] code that produces and stores the necessary [[ovariable]]s to compute the current best answer to the question.
|----
| Data not used
| Data not used are relevant for the research question, but for some reason they were not used in producing the current answer. I may be that the data was found after the synthesis, and an update has not yet been done; or it has been unclear how to merge these to the existing data. In any case, it is important to be differentiate and be explicit about whether data is irrelevant (and therefore removed from the page) or relevant but not used (and therefore waiting for further work).
|}
|}


In addition, it is practical to have additional subtitles on a variable page. These are not attributes, though.
* See also
* Keywords (not always used)
* References
* Related files


[[image:Variable definition.PNG]]
== Rationale ==


'''Name''' is the identifier of the variable, which of course already more or less describes what the real-world entity the variable describes is. The variable names should be chosen so that they are descriptive, unambiguous and not easily confused with other variables. An example of a good variable name could be e.g. ''daily average of PM<sub>2.5</sub> concentration in Helsinki''.
[[File:Information_flow_within_open_policy_practice.svg|thumb|450px]]
The structure is based on extensive discussions between Mikko Pohjola and Jouni Tuomisto in 2006-2008 and intensive application in Opasnet ever since.


'''Scope''' defines the boundaries of the variable - what does it describe and what not? The boundaries can be e.g. spatial, temporal or abstract. In the above example variable, at least the geographical boundary restricts the coverage of the variable to Helsinki and the considered phenomena are restricted to PM<sub>2.5</sub> daily averages. There could also be some further boundary settings defined in the scope of the variable, which are not explicitly mentioned in the name of the variable.
For more detailed description about variables as information objects, see [[knowledge crystal]].


'''Definition''' describes how the result of the variable is derived. It consists of sub-attributes to describe the causal relations, data used to estimate the result, and the mathematical formula to calculate the result. Also alternative identified ways to derive the variable result can be described in the definition attribute as reference. The minimum requirement for defining the causality in all variables is to express the potential existence of a causal relation, i.e. that a change in an ''upstream'' variable possibly affects the variables ''downstream''.
== See also ==


'''Definition has four sub-attributes''' that have particular purposes in the method:
* [[Ovariable]]
 
* [[:Category:Variables | List of all variables]] in Opasnet
;Causality: Causality tells what we know about how upstream values affect our variable. This sub-attribute lists the upstream variables (i.e. causal parents) of the variable. It expresses their functional relationships (this variable as a function of its parents) or probabilistic relationships (conditional probability of this variable given its parents). The expression of causality is '''independent''' of the data there exists about the magnitude of the result of this variable.
* [[Universal object]]
 
* [[Open assessment]]
;Data: Data tells what we know about the magnitude of the result of this variable. This sub-attribute describes any non-causal information about the variable, such as measured data about the variable itself, measured data about an analogous situation (this requires some kind of error model), or expert judgments about the result.
* [http://en.opasnet.org/w/index.php?title=Variable&oldid=5596 A previous version of this page] contains much of the discussion from the Intarese deliverables D17 and D18, which has been edited with a hard hand.
 
;Unit: Unit describes, in what measurement units the result is presented. The units of interconnected variables need to be coherent with each other in a causal network description. The units of variables can be used to check the coherence of the causal network description by the ''unit test'' (see [[Plausibility test]]).
 
;Formula: Formula {{disclink|Discussion on formula attribute}} is the actual computer code or similar that calculates what is described under titles ''Causality'', ''Data'', and ''Unit'', making a synthesis of the three. In a general form, the formula can be described as
 
result = formula(parent parameters, data parameters, unit),
 
:where formula is the function (expressed as computer code for a specified software) for calculating the result using the parent parameters (information from causally upstream variables) and the data parameters (information from observed data) as input.
 
'''Result''' attribute is an answer to the question presented in the scope of the variable. A result is preferably a probability distribution (which can in a special case be a single number), but a result can also be non-numerical such as "very good". It should be noted that the result is the distribution itself, although it can be expressed as some kind of description of the distribution, such as mean and standard deviation. The result should be described in such a detailed way that the full distribution can be reproduced from the information presented under this attribute. A technically straightforward way to do this is to provide a large random sample from the distribution.
 
The result may be a different number for different ''locations'', such as geographical positions, population subgroups, or other determinants, Then, the result is described as
 
  R|x<sub>1</sub>,x<sub>2</sub>,...
 
where R is the result and x<sub>1</sub> and x<sub>2</sub> are defining the locations. A ''dimension'' means a property along which there are multiple locations and the result of the variable may have different values when the location changes. In this case, x<sub>1</sub> and x<sub>2</sub> are dimensions, and particular values of x<sub>1</sub> and x<sub>2</sub> are locations. A variable can have zero, one, or more dimensions. Even if a dimension is continuous, it is usually operationalised in practice as a list of discrete locations. Such a list is called an ''[[index]]'', and each location is called a ''row'' of the index.
 
Uncertainty about the true value of the variable is one dimension. The index of the uncertainty dimension is called the ''[[Sample]]'' index, and it contains a list of integers 1,2,3... . Uncertainty is operationalised as a sequence of random samples from the probability distribution of the result. The i<sup>th</sup> random sample is located in the i<sup>th</sup> row of the Sample index.
 
 
'''Connection to the [[PSSP]] structure'''
 
The variable structure is closely connected to [[PSSP]], and the relationships can be described in the following way.
 
{|{{prettytable}}
! PSSP
! Variable structure
|-----
| Purpose
| The general purpose of a variable is to describe a particular piece of reality. Scope defines which piece of reality is to be described by this variable.
|-----
| Structure
| Definition describes the structure of the particular piece of reality that the variable describes.
|-----
| State
| Result is an expression of the state of the particular piece of reality that the variable describes.
|-----
| Performance
| Performance is an expression of the uncertainty of the variable, i.e. how well does the variable fulfill its purpose, i.e. describe the piece of reality defined in the scope. On the variable level, performance is evaluated separately for result (parameter uncertainty) and definition (model uncertainty). However, evaluating the performance of a scope of a variable can not be done on the variable level, but instead as relevance on the assessment level.
|}
 
 
'''Technical issues in Mediawiki'''
 
* Each variable is a page in the ''Variable'' namespace. The '''name''' of the variable is also the name of the page. However, draft variables may be parts of other pages.
* All attributes except name are second-level (==) sub-titles on the page.
* Description of the attribute content is added at the end of that content; discussions on the content are added to the Talk page, each discussion under an own descriptive title.
* References to external sources are added to the text with the <nowiki><ref>Reference information</ref></nowiki> tag. The references are located in the  end of the page under subtitle References. However, reference is not an attribute of the variable despite it is technically similar.
* In the formula, computer code for a specific software may be used. The following are in use.
**Analytica_id: Identifier of the respective node in an Analytica model. <anacode>Place your Analytica code here. Use a space in front of each line.</anacode>
** <rcode>Place you R code here. Use a space in front of each line.<rcode>
 
'''Event-substance
 
{{comment|#(number): |This paragraph should be deleted or removed. Where?|--[[User:Jouni|Jouni]] 00:40, 8 June 2008 (EEST)}}
 
Variables are objects of event-medium composite -type. They thus describe both the events that occur within the scope of the variable and the medium where these particular events take place. In practice, the events can only be observed through the changes in the state of the medium, and it is therefore reasonable to describe the events and particular media as such composites rather than as separately.
 
In open assessment, all the variables included in an assessment must be causally related, directly or indirectly, to the endpoints of the assessment, and the causal relations must be defined. The event-media structure is the carrier of the [[Causality | cause-effect relations]] between the variables. An event occuring in a medium causes a change in state of that medium leading to another event to occur changing the state of that medium, causing yet another event to occur and so on. In addition to variables, also classes as generalizations of properties possessed by variables can be causally related to each other.


== References ==


'''See also'''
<references/>


* [[Heande:Structures of the building blocks of open risk assessments]]
== Related files ==
* [[Open assessment]]
</noinclude>
* [http://heande.pyrkilo.fi/heande/index.php?title=Variable&oldid=5596 A previous version of this page] contains much of the discussion from the Intarese deliverables D17 and D18, which has been edited with a hard hand.

Latest revision as of 08:49, 29 October 2018


<section begin=glossary />

Variable is a description of a particular piece of reality. It can be a description of a physical phenomenon, or a description of value judgements. Also decisions included in an assessment are described as variables. Variables are continuously existing descriptions of reality, which develop in time as knowledge about the topic increases. Variables are therefore not tied into any single assessment, but instead can be included in other assessments. A variable is the basic building block of describing reality.<section end=glossary />

Question

What should be the structure of a variable such that it

  • is able to systematically handle all kinds of information about the particular piece of reality that the variable is describing, especially
    • it is generic enough to be a standard building block in decision support work (including interpretation of scientific information and political discussions),
  • is able to systematically describe causal relationships between phenomena and variables that describe them,
  • enables both quantitative and qualitative descriptions,
  • is suitable for any kinds of variables, especially physical phenomena, decisions, and value judgements,
  • inherits its main structure from universal objects,
  • complies with the PSSP ontology,
  • can be operationalised in a computational model system,
  • results in variables that are independent of the assessment(s) they belong to;
  • results in variables that pass the clairvoyant test.
  • can be implemented on a website, and
  • is easy enough to be usable and understood by interested non-experts?

Answer

Variable is implemented as a web page in Opasnet wiki web-workspace. A variable page has the following structure.

The attributes of a variable.
Attribute Sub-attribute Comments specific to the variable attributes
Name An identifier for the variable. Each Opasnet page have two kinds of identifiers: the name of the page (e.g. Variable) and the page identifier (e.g. Op_en2022). The former is used e.g. in links, the latter in R code.
Question Gives the question that is to be answered. It defines the scope of the variable. The question should be defined in a way that it has relevance in many different situations, i.e. makes the variable re-usable. (Compare to an assessment question, which is more specific to time, place and user need.)
Answer An answer presents an understandable and useful answer to the question. Its essence is often a machine-readable and human-readable probability distribution (which can in a special case be a single number), but an answer can also be non-numerical such as "very valuable" or a descriptive table like on this page. The units of interconnected variables need to be coherent with each other given the functions describing causal relations. The units of variables can be used to check the coherence of the causal network description. This is a so called unit test. Typically the answer contains an R code that fetches the ovariable created under Rationale/Calculations and evaluates it.
Rationale Rationale contains anything that is necessary to convince a critical reader that the answer is credible and usable. It presents the reader the information required to derive the answer and explains how it is formed. Typically it has the following sub-attributes, but also other are possible. Rationale may also contain lengthy discussions about relevant topics.
Data Data tells about direct observations (or expert judgements) about the variable itself.
Dependencies Dependencies R↻ tells what we know about how upstream variables (i.e. causal parents) affect the variable. In other words, we attempt to estimate the answer indirectly based on information of causal parents. Sometimes also reverse inference is possible based on causal children. Dependencies list the causal parents and expresses their functional relationships (the variable as a function of its parents) or probabilistic relationships (conditional probability of the variable given its parents).
Calculations Calculations R↻ is an operationalisation of how to calculate or derive the answer. Formula uses algebra, computer code, or other explicit methods if possible. Typically it is R code that produces and stores the necessary ovariables to compute the current best answer to the question.
Data not used Data not used are relevant for the research question, but for some reason they were not used in producing the current answer. I may be that the data was found after the synthesis, and an update has not yet been done; or it has been unclear how to merge these to the existing data. In any case, it is important to be differentiate and be explicit about whether data is irrelevant (and therefore removed from the page) or relevant but not used (and therefore waiting for further work).

In addition, it is practical to have additional subtitles on a variable page. These are not attributes, though.

  • See also
  • Keywords (not always used)
  • References
  • Related files

Rationale

The structure is based on extensive discussions between Mikko Pohjola and Jouni Tuomisto in 2006-2008 and intensive application in Opasnet ever since.

For more detailed description about variables as information objects, see knowledge crystal.

See also

References


Related files