Goherr: Fish consumption study: Difference between revisions

Latest revision as of 15:39, 12 August 2020

**Progression class**
In Opasnet many pages being worked on and are in different classes of progression. Thus the information on those pages should be regarded with consideration. The progression class of this page has been assessed: This page is reviewed It has been read with a critical eye and commented on by an outside source, and given impairment suggestions have been included in the page. An equivalent to a peer-reviewed article.	The content and quality of this page is/was being curated by the project that produced the page. The quality was last checked: 2019-08-26.

This page is a study. The page identifier is Op_en7749
Moderator:Arja (see all)

Upload data {{#opasnet_base_link:Op_en7749}}

This page contains a detailed description about data management and analysis of an international survey related to scientific article Forage Fish as Food: Consumer Perceptions on Baltic Herring by Mia Pihlajamäki, Arja Asikainen, Suvi Ignatius, Päivi Haapasaari, and Jouni T. Tuomisto.^[1] The results of this survey where also used in another article Health effects of nutrients and environmental pollutants in Baltic herring and salmon: a quantitative benefit-risk assessment by the same group.^[2]

Question

How Baltic herring and salmon are used as human food in Baltic sea countries? Which determinants affect on people’s eating habits of these fish species?

Answer

Model run with all the results of the article^[1] 26.8.2018
Original questionnaire analysis results 13.3.2017
Consumption amount estimates
- Model run 21.4.2017 [1] first distribution
- Model run 18.5.2017 with modelled data; with direct survey data

Rationale

Survey of eating habits of Baltic herring and salmon in Denmark, Estonia, Finland and Sweden has been done in September 2016 by Taloustutkimus oy. Content of the questionnaire can be accessed in Google drive. The actual data can be found from the link below (see Data).

Data

Questionnaire

Original datafile File:Goherr fish consumption.csv.

Show details

Questions in the Goherr questionnaire(-)
Obs	Index	Full question	Question	Description
1	V1	Country	Country
2	V2	1. What is the postal code of your residence?	Postal code
3	V3	2. What is your gender?	Gender
4	V4	2. How old are you?	Age
5	V5	3. What is the level of your highest education?	Education
6	V6	4. How would you rate the purchasing power of your household?	Purchasing power
7	V7	5. Which of the following food supplements (as a separate product or as a part of some multicompound) do you eat often at least part of the year?:Fatty acids (Omega 3, DHA, EPA etc.)	Omega3
8	V8	5. Which of the following food supplements (as a separate product or as a part of some multicompound) do you eat often at least part of the year?:D vitamin	Vitamin D
9	V9	5. Which of the following food supplements (as a separate product or as a part of some multicompound) do you eat often at least part of the year?:Other vitamins	Other vitamins
10	V10	5. Which of the following food supplements (as a separate product or as a part of some multicompound) do you eat often at least part of the year?:None of the above	No vitamins
11	V11	5. Which of the following food supplements (as a separate product or as a part of some multicompound) do you eat often at least part of the year?:Don't Know	Unknown vitamins
12	V12	6. Does someone in your household fish?:No	No fishing
13	V13	6. Does someone in your household fish?:Yes, for recreational purposes	Recreational
14	V14	6. Does someone in your household fish?:Yes, for household use	Household use
15	V15	6. Does someone in your household fish?:Yes, as a professional fisherman	Professional fishing
16	V16	7. Do you eat fish?	Eat fish
17	V17	8. Why don't you eat fish?:Allergy	Allergy
18	V18	8. Why don't you eat fish?:I don't like the taste	Bad taste
19	V19	8. Why don't you eat fish?:I don't like fish bones	Bones
20	V20	8. Why don't you eat fish?:Difficult to cook	Cooking difficult
21	V21	8. Why don't you eat fish?Health risks	Health risks fish
22	V22	8. Why don't you eat fish?:I'm not used to	Not used to
23	V23	8. Why don't you eat fish?:I'm concerned about the sustainability of fish stocks	Sustainability fish
24	V24	8. Why don't you eat fish?:It is ethically wrong to eat animals	Ethically wrong
25	V25	8. Why don't you eat fish?:I'm a vegetarian	Vegetarian
26	V26	8. Why don't you eat fish?:Don't Know	Why not fish unknown
27	V27	8. Why don't you eat fish?:Other	Why not fish other
28	V28	8. Why don't you eat fish?:Other open	Why not fish open
29	V29	9. How often do you eat fish?	How often fish
30	V30	10. Do you eat salmon or trout at least sometimes?	Eat salmon
31	V31	11. Which of the following salmonids do you eat at least sometimes?:Wild Baltic salmon	Baltic salmon
32	V32	11. Which of the following salmonids do you eat at least sometimes?:Wild Atlantic salmon	Atlantic salmon
33	V33	11. Which of the following salmonids do you eat at least sometimes?:Norwegian cultured salmon	Norwegian salmon
34	V34	11. Which of the following salmonids do you eat at least sometimes?:Rainbow trout	Rainbow trout
35	V35	11. Which of the following salmonids do you eat at least sometimes?:I never know which salmon I eat	Unknown salmon
36	V36	11. Which of the following salmonids do you eat at least sometimes?:Some other salmon, what?	Other salmon
37	V37	11. Which of the following salmonids do you eat at least sometimes?:Other open	Salmon open
38	V38	12. Where do you usually get the wild Baltic salmon that you eat?:Someone in our hosehold catches it	Own catch BS
39	V39	12. Where do you usually get the wild Baltic salmon that you eat?:Buy from a grocery store	Grocery store BS
40	V40	12. Where do you usually get the wild Baltic salmon that you eat?:Buy from a market place/fishmonger or similar	Market place BS
41	V41	12. Where do you usually get the wild Baltic salmon that you eat?:Eat in a restaurant/cafe/canteen or similar	Restaurant BS
42	V42	12. Where do you usually get the wild Baltic salmon that you eat?:Directly from a fisher	Fisher BS
43	V43	12. Where do you usually get the wild Baltic salmon that you eat?:From a relative, friend, or similar	Friend BS
44	V44	12. Where do you usually get the wild Baltic salmon that you eat?:Other	Where BS other
45	V45	12. Where do you usually get the wild Baltic salmon that you eat?:Other open	Where BS open
46	V46	13. How often do you on average eat wild Baltic salmon as an ingredient in a dish (i.e. as part of a soup, gravy, sushi, casserole etc.)?	How often BS
47	V47	14. How much on average do you eat of a dish with wild Baltic salmon in it? Think of a medium sized lunch plate and the amount of the whole dish, not just the salmon.	How much BS
48	V48	15. How often do you on average eat wild Baltic salmon as a side dish (i.e as a separate product such as fish balls, stake, fillets, graved, canned etc.)?	How often side BS
49	V49	16. What is the average amount of wild Baltic salmon as a side dish? Think of a medium sized lunch plate.	How much side BS
50	V50	17. Select up to three most important reasons for you to eat wild Baltic salmon?:It tastes good	Tastes good BS
51	V51	17. Select up to three most important reasons for you to eat wild Baltic salmon?:I caught it	Self caught BS
52	V52	17. Select up to three most important reasons for you to eat wild Baltic salmon?:It's easy to cook	Easy to cook BS
53	V53	17. Select up to three most important reasons for you to eat wild Baltic salmon?:It's quick to cook	Quick to cook BS
54	V54	17. Select up to three most important reasons for you to eat wild Baltic salmon?:It's readily available	Readily available BS
55	V55	17. Select up to three most important reasons for you to eat wild Baltic salmon?:It's healthy	Healthy BS
56	V56	17. Select up to three most important reasons for you to eat wild Baltic salmon?:It's inexpensive	Inexpensive BS
57	V57	17. Select up to three most important reasons for you to eat wild Baltic salmon?:My family likes it	Family likes it BS
58	V58	17. Select up to three most important reasons for you to eat wild Baltic salmon?:It's environmentally/climate friendly choice	Environmental BS
59	V59	17. Select up to three most important reasons for you to eat wild Baltic salmon?:It's a traditional dish	Traditional BS
60	V60	17. Select up to three most important reasons for you to eat wild Baltic salmon?:Some other reasons, what?	Why BS other
61	V61	17. Select up to three most important reasons for you to eat wild Baltic salmon?:Other open	Why BS open
62	V62	18. You answered in the previous question that you don't eat wild Baltic salmon. Select up to three most important reasons for why not eating it.:I don't like the taste	Bad taste BS
63	V63	18. You answered in the previous question that you don't eat wild Baltic salmon. Select up to three most important reasons for why not eating it.:I am not used to eating it / not offered at home	Not used to BS
64	V64	18. You answered in the previous question that you don't eat wild Baltic salmon. Select up to three most important reasons for why not eating it.:I don't know how to cook it	Cannot cook BS
65	V65	18. You answered in the previous question that you don't eat wild Baltic salmon. Select up to three most important reasons for why not eating it.:It is not easily available	Not available BS
66	V66	18. You answered in the previous question that you don't eat wild Baltic salmon. Select up to three most important reasons for why not eating it.:I'm worried about the possible health risks caused by the harmful chemicals in it	Health risks BS
67	V67	18. You answered in the previous question that you don't eat wild Baltic salmon. Select up to three most important reasons for why not eating it.:I doubt the quality/freshness of the products	Quality issues BS
68	V68	18. You answered in the previous question that you don't eat wild Baltic salmon. Select up to three most important reasons for why not eating it.:I'm worried about the sustainability of the stocks	Sustainability BS
69	V69	18. You answered in the previous question that you don't eat wild Baltic salmon. Select up to three most important reasons for why not eating it.:It doesn't belong to traditional dishes of my home area	Not traditional BS
70	V70	18. You answered in the previous question that you don't eat wild Baltic salmon. Select up to three most important reasons for why not eating it.:It is too expensive	Expensive BS
71	V71	18. You answered in the previous question that you don't eat wild Baltic salmon. Select up to three most important reasons for why not eating it.:Some other reason, what?	Why not BS other
72	V72	18. You answered in the previous question that you don't eat wild Baltic salmon. Select up to three most important reasons for why not eating it.:Other open	Why not BS open
73	V73	19. How would the following factors influence on your consumption of wild Baltic salmon compared to your current use? Select one for each option:Lower price	Lower price BS
74	V74	19. How would the following factors influence on your consumption of wild Baltic salmon compared to your current use? Select one for each option:Higher price	Higher price BS
75	V75	19. How would the following factors influence on your consumption of wild Baltic salmon compared to your current use? Select one for each option:Better availability	Better availability BS
76	V76	19. How would the following factors influence on your consumption of wild Baltic salmon compared to your current use? Select one for each option:Availability of wider variety of ready meals / refined products	Ready meals BS
77	V77	19. How would the following factors influence on your consumption of wild Baltic salmon compared to your current use? Select one for each option:Cooking suggestions and recipes provided within the store or in the package.	Recipes BS
78	V78	19. How would the following factors influence on your consumption of wild Baltic salmon compared to your current use? Select one for each option:Better information (e.g. package markings or in the store) on the catch date, area, fisher and/or the refiner	Package markings BS
79	V79	19. How would the following factors influence on your consumption of wild Baltic salmon compared to your current use? Select one for each option:Improvement of the stocks (eco-labelling)	Eco labelling BS
80	V80	19. How would the following factors influence on your consumption of wild Baltic salmon compared to your current use? Select one for each option:Lower level of harmful chemicals in the fish	Less chemicals BS
81	V81	19. How would the following factors influence on your consumption of wild Baltic salmon compared to your current use? Select one for each option:National food safety authorities publish a recommendation to eat it	Recommended BS
82	V82	19. How would the following factors influence on your consumption of wild Baltic salmon compared to your current use? Select one for each option:National food safety authorities publish a recommendation to limit eating it	Not recommended BS
83	V83	19. How would the following factors influence on your consumption of wild Baltic salmon compared to your current use? Select one for each option:Some other reason, what?	Change BS other
84	V84	19. How would the following factors influence on your consumption of wild Baltic salmon compared to your current use? Select one for each option:Other open	Change BS open
85	V85	20. Do you eat any herring (Baltic or other) at least sometimes?	Eat herring
86	V86	21. Do you eat Baltic herring? This means herring caught from Baltic sea, not from Atlantic ocean	Eat BH
87	V87	22. Where do you usually get the wild Baltic herring that you eat?:Someone in our hosehold catches it	Own catch BH
88	V88	22. Where do you usually get the wild Baltic herring that you eat?:Buy from a grocery store	Grocery store BH
89	V89	22. Where do you usually get the wild Baltic herring that you eat?:Buy from a market place/fishmonger or similar	Market place BH
90	V90	22. Where do you usually get the wild Baltic herring that you eat?:Eat in a restaurant/cafe/canteen or similar	Restaurant BH
91	V91	22. Where do you usually get the wild Baltic herring that you eat?:Directly from a fisher	Fisher BH
92	V92	22. Where do you usually get the wild Baltic herring that you eat?:From a relative, friend, or similar	Friend BH
93	V93	22. Where do you usually get the wild Baltic herring that you eat?:Other	Where BH other
94	V94	22. Where do you usually get the wild Baltic herring that you eat?:Other open	Where BH open
95	V95	23. How often do you on average eat Baltic herring as ingredient of a dish? (i.e as part of a dish such as in a casserole or some other food including Baltic herring)?	How often BH
96	V96	24. How much on average do you eat of a dish with Baltic herring in it? Think of a medium sized lunch plate and the amount of the whole dish, not just the amount of Baltic herring	How much BH
97	V97	25. How often do you on average eat whole Baltic herrings or fillets as a side dish (fried, smoked, rolls, pickled, salted or otherwise prepared)?	How often side BH
98	V98	26. How much on average of Baltic herring as a side dish (think of a regular sized lunch plate)?	How much side BH
99	V99	27. Select up to three most important reasons for you to eat Baltic herring?:It tastes good	Tastes good BH
100	V100	27. Select up to three most important reasons for you to eat Baltic herring?:I caught it	Self caught BH
101	V101	27. Select up to three most important reasons for you to eat Baltic herring?:It's easy to cook	Easy to cook BH
102	V102	27. Select up to three most important reasons for you to eat Baltic herring?:It's quick to cook	Quick to cook BH
103	V103	27. Select up to three most important reasons for you to eat Baltic herring?:It's readily available	Readily available BH
104	V104	27. Select up to three most important reasons for you to eat Baltic herring?:It's healthy	Healthy BH
105	V105	27. Select up to three most important reasons for you to eat Baltic herring?:It's inexpensive	Inexpensive BH
106	V106	27. Select up to three most important reasons for you to eat Baltic herring?:My family likes it	Family likes it BH
107	V107	27. Select up to three most important reasons for you to eat Baltic herring?:It's environmentally/climate friendly choice	Environmental BH
108	V108	27. Select up to three most important reasons for you to eat Baltic herring?:It's a traditional dish	Traditional BH
109	V109	27. Select up to three most important reasons for you to eat Baltic herring?:Some other reasons, what?	Why BH other
110	V110	27. Select up to three most important reasons for you to eat Baltic herring?:Other open	Why BH open
111	V111	28. You answered in the previous question that you don't eat Baltic herring. Select up to three most important reasons for why not eating it.:I don't like the taste and/or smell of it	Bad taste BH
112	V112	28. You answered in the previous question that you don't eat Baltic herring. Select up to three most important reasons for why not eating it.:I am not used to eating it / not offered at home	Not used to BH
113	V113	28. You answered in the previous question that you don't eat Baltic herring. Select up to three most important reasons for why not eating it.:I don't know how to cook it	Cannot cook BH
114	V114	28. You answered in the previous question that you don't eat Baltic herring. Select up to three most important reasons for why not eating it.:It is difficult to cook	Difficult to cook BH
115	V115	28. You answered in the previous question that you don't eat Baltic herring. Select up to three most important reasons for why not eating it.:It is not easily available	Not available BH
116	V116	28. You answered in the previous question that you don't eat Baltic herring. Select up to three most important reasons for why not eating it.:I'm worried about the possible risks caused by the harmful chemicals in it	Health risks BH
117	V117	28. You answered in the previous question that you don't eat Baltic herring. Select up to three most important reasons for why not eating it.:It is more suitable for animal feed than human food	Better for feed BH
118	V118	28. You answered in the previous question that you don't eat Baltic herring. Select up to three most important reasons for why not eating it.:I doubt the quality/freshness of the products	Quality issues BH
119	V119	28. You answered in the previous question that you don't eat Baltic herring. Select up to three most important reasons for why not eating it.:I'm worried about the sustainability of the stocks	Sustainability BH
120	V120	28. You answered in the previous question that you don't eat Baltic herring. Select up to three most important reasons for why not eating it.:It doesn't belong to traditional dishes of my home area	Not traditional BH
121	V121	28. You answered in the previous question that you don't eat Baltic herring. Select up to three most important reasons for why not eating it.:Some other reason, what?	Why not BH other
122	V122	28. You answered in the previous question that you don't eat Baltic herring. Select up to three most important reasons for why not eating it.:Other open	Why not BH open
123	V123	29. How would the following factors influence your consumption of Baltic herring compared to your current use? Select one option for each row.:Lower price	Lower price BH
124	V124	29. How would the following factors influence your consumption of Baltic herring compared to your current use? Select one option for each row.:Higher price	Higher price BH
125	V125	29. How would the following factors influence your consumption of Baltic herring compared to your current use? Select one option for each row.:Better availability	Better availability BH
126	V126	29. How would the following factors influence your consumption of Baltic herring compared to your current use? Select one option for each row.:Availability of wider variety of ready meals / processed products	Ready meals BH
127	V127	29. How would the following factors influence your consumption of Baltic herring compared to your current use? Select one option for each row.:Cooking suggestions and recipes provided in the store or in the package.	Recipes BH
128	V128	29. How would the following factors influence your consumption of Baltic herring compared to your current use? Select one option for each row.:Better information (e.g. package markings or in the store) on the catch date, area, fisher and/or processing company.	Package markings BH
129	V129	29. How would the following factors influence your consumption of Baltic herring compared to your current use? Select one option for each row.:Improved stocks (eco-labelling)	Eco labelling BH
130	V130	29. How would the following factors influence your consumption of Baltic herring compared to your current use? Select one option for each row.:Lower level of harmful chemicals in the fish	Less chemicals BH
131	V131	29. How would the following factors influence your consumption of Baltic herring compared to your current use? Select one option for each row.:National food safety authorities publish a recommendation to eat it	Recommended BH
132	V132	29. How would the following factors influence your consumption of Baltic herring compared to your current use? Select one option for each row.:National food safety authorities publish a recommendation to limit eating it	Not recommended BH
133	V133	29. How would the following factors influence your consumption of Baltic herring compared to your current use? Select one option for each row.:Some other reason, what?	Change BH other
134	V134	29. How would the following factors influence your consumption of Baltic herring compared to your current use? Select one option for each row.:Other open	Change BH open
135	V135	30. Are you familiar with the possible eating recommendations of Baltic herring and/or wild Baltic salmon in your country?	Recommendation awareness
136	V136	31. How strongly do you agree or disagree with the following statements on Baltic salmon? Please indicate your position by selecting a number from -5 (strongly disagree) to 5 (strongly agree).:It contributes to the economic wellbeing of the society	Economic wellbeing BS
137	V137	31. How strongly do you agree or disagree with the following statements on Baltic salmon? Please indicate your position by selecting a number from -5 (strongly disagree) to 5 (strongly agree).:It is a natural resource for humans to utilize	Natural resource BS
138	V138	31. How strongly do you agree or disagree with the following statements on Baltic salmon? Please indicate your position by selecting a number from -5 (strongly disagree) to 5 (strongly agree).:It belongs to all Baltic Sea citizens equally	Belongs to all BS
139	V139	31. How strongly do you agree or disagree with the following statements on Baltic salmon? Please indicate your position by selecting a number from -5 (strongly disagree) to 5 (strongly agree).:It involves long traditions related to fishing and fish eating	Long traditions BS
140	V140	31. How strongly do you agree or disagree with the following statements on Baltic salmon? Please indicate your position by selecting a number from -5 (strongly disagree) to 5 (strongly agree).:It inspires to search experiences from nature, cultural sites, fish markets or from different dishes	Inspiring experiences BS
141	V141	31. How strongly do you agree or disagree with the following statements on Baltic salmon? Please indicate your position by selecting a number from -5 (strongly disagree) to 5 (strongly agree).:It is among the most renowned fish species of the Baltic Sea	Renowned species BS
142	V142	31. How strongly do you agree or disagree with the following statements on Baltic salmon? Please indicate your position by selecting a number from -5 (strongly disagree) to 5 (strongly agree).:It is part of the Baltic Sea ecosystem	Part of ecosystem BS
143	V143	31. How strongly do you agree or disagree with the following statements on Baltic salmon? Please indicate your position by selecting a number from -5 (strongly disagree) to 5 (strongly agree).:I have an emotional bond to it	Emotional bond BS
144	V144	31. How strongly do you agree or disagree with the following statements on Baltic salmon? Please indicate your position by selecting a number from -5 (strongly disagree) to 5 (strongly agree).:It helps me to achieve my personal aims	Personal aims
145	V145	32. How strongly do you agree or disagree with the following statements on Baltic herring? Please indicate your position by selecting a number from -5 (strongly disagree) to 5 (strongly agree).:It contributes to the economic wellbeing of the society	Economic wellbeing BH
146	V146	32. How strongly do you agree or disagree with the following statements on Baltic herring? Please indicate your position by selecting a number from -5 (strongly disagree) to 5 (strongly agree).:It is a natural resource for humans to utilize	Natural resource BH
147	V147	32. How strongly do you agree or disagree with the following statements on Baltic herring? Please indicate your position by selecting a number from -5 (strongly disagree) to 5 (strongly agree).:It belongs to all Baltic Sea citizens equally	Belongs to all BH
148	V148	32. How strongly do you agree or disagree with the following statements on Baltic herring? Please indicate your position by selecting a number from -5 (strongly disagree) to 5 (strongly agree).:It involves long traditions related to fishing and fish eating	Long traditions BH
149	V149	32. How strongly do you agree or disagree with the following statements on Baltic herring? Please indicate your position by selecting a number from -5 (strongly disagree) to 5 (strongly agree).:It inspires to search experiences from nature, cultural sites, fish markets or from different dishes	Inspiring experiences BH
150	V150	32. How strongly do you agree or disagree with the following statements on Baltic herring? Please indicate your position by selecting a number from -5 (strongly disagree) to 5 (strongly agree).:It is among the most renowned fish species of the Baltic Sea	Renowned species BH
151	V151	32. How strongly do you agree or disagree with the following statements on Baltic herring? Please indicate your position by selecting a number from -5 (strongly disagree) to 5 (strongly agree).:It is part of the Baltic Sea ecosystem	Part of ecosystem BH
152	V152	32. How strongly do you agree or disagree with the following statements on Baltic herring? Please indicate your position by selecting a number from -5 (strongly disagree) to 5 (strongly agree).:I have an emotional bond to it	Emotional bond BH
153	V153	32. How strongly do you agree or disagree with the following statements on Baltic herring? Please indicate your position by selecting a number from -5 (strongly disagree) to 5 (strongly agree).:It helps me to achieve my personal aims	Personal aims BH
154	V154	Weighting	Weighting
155	V155	How old are you? Age category FI	Age category FI	A derived variable
156	V156	2. How old are you? Age category SWE, DK, EST	Age category other	A derived variable
157	V157	Row number of respondent	Row	A derived variable
158	V158	Age category "18-45" or ">45"	Ages	A derived variable
159	V159	Original question converted to a number and .n added to name	Eat.fish.m	A derived variable
160	V160	Original question converted to a number and .n added to name	How.often.fish.n	A derived variable
161	V161	Original question converted to a number and .n added to name	Eat.salmon.n	A derived variable
162	V162	Original question converted to a number and .n added to name	Baltic.salmon.n	A derived variable
163	V163	Original question converted to a number and .n added to name	How.often.BS.n	A derived variable
164	V164	Original question converted to a number and .n added to name	How.much.BS.n	A derived variable
165	V165	Original question converted to a number and .n added to name	How.often.side.BS.n	A derived variable
166	V166	Original question converted to a number and .n added to name	How.much.side.BS.n	A derived variable
167	V167	Original question converted to a number and .n added to name	Better.availability.BS.n	A derived variable
168	V168	Original question converted to a number and .n added to name	Less.chemicals.BS.n	A derived variable
169	V169	Original question converted to a number and .n added to name	Eat.BH.n	A derived variable
170	V170	Original question converted to a number and .n added to name	How.often.BH.n	A derived variable
171	V171	Original question converted to a number and .n added to name	How.much.BH.n	A derived variable
172	V172	Original question converted to a number and .n added to name	How.often.side.BH.n	A derived variable
173	V173	Original question converted to a number and .n added to name	How.much.side.BH.n	A derived variable
174	V174	Original question converted to a number and .n added to name	Better.availability.BH.n	A derived variable
175	V175	Original question converted to a number and .n added to name	Less.chemicals.BH.n	A derived variable

Assumptions

The following assumptions are used:

Assumptions for calculations(-)
Obs	Variable	Value	Unit	Result	Description
1	freq	1	times /a	0	Never
2	freq	2	times /a	0.5 - 0.9	less than once a year
3	freq	3	times /a	2 - 5	A few times a year
4	freq	4	times /a	12 - 36	1 - 3 times per month
5	freq	5	times /a	52	once a week
6	freq	6	times /a	104 - 208	2 - 4 times per week
7	freq	7	times /a	260 - 364	5 or more times per week
8	amdish	1	g /serving	20 - 70	1/6 plate or below (50 grams)
9	amdish	2	g /serving	70 - 130	1/3 plate (100 grams)
10	amdish	3	g /serving	120 - 180	1/2 plate (150 grams)
11	amdish	4	g /serving	170 - 230	2/3 plate (200 grams)
12	amdish	5	g /serving	220 - 280	5/6 plate (250 grams)
13	amdish	6	g /serving	270 - 400	full plate (300 grams)
14	amdish	7	g /serving	400 - 550	overly full plate (500 grams)
15	ingredient		fraction	0.1 - 0.3	Fraction of fish in the dish
16	amside	1	g /serving	20 - 70	1/6 plate or below (50 grams)
17	amside	2	g /serving	70 - 130	1/4 plate (100 grams)
18	amside	3	g /serving	120 - 180	1/2 plate (150 grams)
19	amside	4	g /serving	170 - 230	2/3 plate (200 grams)
20	amside	5	g /serving	220 - 280	5/6 plate (250 grams)
21	change	1	fraction	-1 - -0.8	Decrease it to zero
22	change	2	fraction	-0.9 - -0.5	Decrease it to less than half
23	change	3	fraction	-0.6 - -0.1	Decrease it a bit
24	change	4	fraction	0	No effect
25	change	5	fraction	0.1 - 0.6	Increase it a bit
26	change	6	fraction	0.5 - 0.9	Increase it over by half
27	change	7	fraction	0.8 - 1.3	Increase it over to double
28	change	8	fraction	-0.3 - 0.3	Don't know

Preprocessing

This code is used to preprocess the original questionnaire data from the above .csv file and to store the data as a usable variable to Opasnet base. The code stores a data.frame named survey.

Model run 11.7.2018 [2]
Model run 27.3.2019 with country codes DK EE FI SE [3]

+ Show code - Hide code

# This code is Op_en7749/preprocess2 on page [[Goherr: Fish consumption study]]

library(OpasnetUtils)

objects.latest("Op_en6007", code_name = "webropol") # [[OpasnetUtils/Drafts]] webropol.convert, merge.questions

############# Data preprocessing

# Get the data either from Opasnet or your own hard drive.

#survey1 original fi_le: N:/Ymal/Projects/Goherr/WP5/Goherr_fish_consumption.csv

survey1 <- opasnet.csv(
  "5/57/Goherr_fish_consumption.csv", 
  wiki = "opasnet_en", sep = ";", fill = TRUE, quote = "\""
)
#survey1 <- re#ad.csv(fi_le = "N:/Ymal/Projects/Goherr/WP5/Goherr_fish_consumption.csv",
#                   header=FALSE, sep=";", fill = TRUE, quote="\"")

# Data fi_le is converted to data.frame using levels at row 2121.
survey1 <- webropol.convert(survey1, 2121, textmark = ":Other open")

# Delete rows that are clearly overestimations

survey1 <- survey1[-c(636, 1465, 1865, 1876, 2062, 2088), ]

# Take the relevant columnames from the table on the page.
colnames(survey1) <- gsub(
  " ",  ".",
  opbase.data(
    "Op_en7749",
    subset = "Questions in the Goherr questionnaire")$Result[1:ncol(survey1)]
)

survey1$Row <- 1:nrow(survey1)
survey1$Weighting <- as.double(gsub(",",".", survey1$Weighting))
survey1$Ages <- factor(
  ifelse(as.numeric(as.character(survey1$Age)) < 46, "18-45",">45"),
  levels = c("18-45", ">45"), ordered = TRUE
)
survey1$Country <- factor(
  survey1$Country,
  levels=c("DK","EST","FI","SWE"),
  labels=c("DK","EE","FI","SE")
)
# Anonymize data
survey1$Postal.code <- NA
survey1$Age <- NA

# webropol.convert should put these in the right order but doesn't. So do it manually.

freqlist <- c(
  "less than once a year",
  "A few times a year",
  "1 - 3 times per month",
  "once a week",
  "2 - 4 times per week",
  "5 or more times per week"
)

amlist <- c(
  "1/6 plate or below (50 grams)",
  "1/3 plate (100 grams)",
  "1/2 plate (150 grams)",
  "2/3 plate (200 grams)",
  "5/6 plate (250 grams)",
  "full plate (300 grams)",
  "overly full plate (500 grams)"
  #  "Not able to estimate"
)

sidel <- c(
  "1/6 plate or below (50 grams)",
  "1/4 plate (100 grams)",
  "1/2 plate (150 grams)",
  "2/3 plate (200 grams)",
  "5/6 plate (250 grams)"
  #  "Not able to estimate"
)

fishamounts <- c(29,46:49,95:98)
colnames(survey1)[fishamounts]
#[1] "How.often.fish"    "How.often.BS"      "How.much.BS"       "How.often.side.BS"
#[5] "How.much.side.BS"  "How.often.BH"      "How.much.BH"       "How.often.side.BH"
#[9] "How.much.side.BH" 

ansl <- list(
  freqlist,
  c("Never", freqlist),
  amlist,
  c("Never", freqlist),
  sidel,
  c("Never", freqlist),
  amlist,
  c("Never", freqlist),
  sidel
)

for (i in 1:length(fishamounts)) {
  survey1[[fishamounts[i]]] <- factor(survey1[[fishamounts[i]]], levels = ansl[[i]], ordered = TRUE)
}

oprint(head(survey1))

agel <- as.character(unique(survey1$Ages))
countryl <- sort(as.character(unique(survey1$Country)))
genderl <- sort(as.character(unique(survey1$Gender)))
fisl <- c("Salmon", "Herring")

# Interesting fish eating questions
surv <- c(1,3,158,16,29,30,31,46:49,75,80,86,95:98, 125, 130)
colnames(survey1[surv]) # Shown without .n
#[1] "Country"                       "Gender"                       
#[3] "Ages"                          "Fish eating"                  
#[5] "How often eat fish"            "Salmon eating"                
#[7] "Baltic salmon"                 "How often Baltic salmon"      
#[9] "How much Baltic salmon"        "How often side Baltic salmon" 
#[11] "How much side Baltic salmon"   "Better availability BS"
#[13] "Less chemicals BS"             "Eat Baltic herring"           
#[15] "How often Baltic herring"      "How much Baltic herring"      
#[17] "How often side Baltic herring" "How much side Baltic herring"
#[19] "Better availability BH"        "Less chemicals BH"

# For estimating distributions, we should 
#1 remove people with Fish eating = No (142)
#2 merge Eat Baltic herring = I don't know with No (How often BH = NA always)
#3 merge Baltic salmon = NA with No (because they usually have answered BH questions)

oprint(table(is.na(rowSums(sapply(survey1[surv[4:20]], as.numeric)))))
# BUT: there are so many missing values, that we just model BH and BS separately now.

# Take the key questions and treat them as numeric

temp <- as.data.frame(lapply(
  survey1[c(16,29,30,31,46:49,75,80,86,95:98, 125, 130)],
  FUN = function(x) as.integer(x)
)) # Coerce to integers
colnames(temp) <- paste(colnames(temp),"n",sep=".")
survey1 <- cbind(survey1,temp)

survey1[is.na(survey1$Eat.Baltic.herring.n) | survey1$Eat.Baltic.herring.n==3 , 14] <- 1 # I don't know --> No

# Logical variable for respondents that have eaten fish, Baltic salmon, and Baltic herring
survey1$Eatfish <- survey1$Eat.fish == "Yes"
survey1$Eatsalm <- survey1$Baltic.salmon=="Yes" & !is.na(rowSums(survey1[162:166]))
survey1$Eatherr <- survey1$Eat.BH=="Yes" & !is.na(rowSums(survey1[169:173]))

oprint(table(survey1[c("Eatsalm", "Eatherr", "Eatfish")], useNA = "ifany"))

# Oletetaan, että covarianssimatriisi on vakio kaikille maille ja sukupuolille yms
# mutta keskiarvo on spesifi näille ja kysymykselle.

agel
countryl
genderl
fisl

objects.store(survey1, surv, agel, countryl, genderl, fisl)
cat("Data.frame survey1 and vectors agel, countryl, genderl and fisl were stored.\n")

Analyses

Sketches about modelling determinants of eating (spring 2018) [4]

Figures, tables and stat analyses for the first manuscript

Model run 8.5.2019 with thlVerse code (not run on Opasnet) [5] [6]
Model run 26.8.2018 with fig 6 as table [7]
Previous model runs are archived.

Methodological concerns:

+ Show code - Hide code

# This is code Op_en7749/ on page [[Goherr: Fish consumption study]]

library(OpasnetUtils)
library(ggplot2)
library(reshape2)
library(MASS)
#library(extrafont) # Needed to save Arial fonts to PDF or EPS
#library(thlVerse)
#library(car)
#library(vegan)

BS <- 24
localcomp <- FALSE
thl <- FALSE

# Set colours

#library(thlVerse)
#library(tidyverse)
#colors <- c(
#  thlColors(6,"quali","bar",thin=0.8),
#  thlColors(6,"quali","bar",thin=1)
#)[c(1,8,3,10,11,6)]
#ggplot(tibble(A=as.character(1:6),B=1),aes(x=A,weight=B,fill=A))+
#  geom_bar()+
#  scale_fill_manual(values=colors)

colors <- c("#74AF59FF","#2F62ADFF","#D692BDFF","#29A0C1FF","#BE3F72FF","#FBB848FF")

########################## Functions

#### thlLinePlot was adjusted to enable different point shapes

thlLinePlot <- function (data, xvar, yvar, groupvar = NULL, ylabel = yvar, 
                         xlabel = NULL, colors = thlColors(n = 12, type = "quali", 
                                                           name = "line"), title = NULL, subtitle = NULL, caption = NULL, 
                         legend.position = "none", base.size = 16, linewidth = 3, 
                         show.grid.x = FALSE, show.grid.y = TRUE, lang = "fi", ylimits = NULL, 
                         marked.treshold = 10, plot.missing = FALSE, xaxis.breaks = waiver(), 
                         yaxis.breaks = waiver(), panels = FALSE, nrow.panels = 1, 
                         labels.end = FALSE) 
{
  lwd <- thlPtsConvert(linewidth)
  gg <- ggplot(data, aes_(x = substitute(xvar), y = substitute(yvar), 
                          group = ifelse(!is.null(substitute(groupvar)), substitute(groupvar), 
                                         NA), colour = ifelse(!is.null(substitute(groupvar)), 
                                                              substitute(groupvar), ""))) + geom_line(size = lwd)
  if (isTRUE(plot.missing)) {
    df <- thlNaLines(data = data, xvar = deparse(substitute(xvar)), 
                     yvar = deparse(substitute(yvar)), groupvar = unlist(ifelse(deparse(substitute(groupvar)) != 
                                                                                  "NULL", deparse(substitute(groupvar)), list(NULL))))
    if (!is.null(df)) {
      gg <- gg + geom_line(data = df, aes_(x = substitute(xvar), 
                                           y = substitute(yvar), group = ifelse(!is.null(substitute(groupvar)), 
                                                                                substitute(groupvar), NA), colour = ifelse(!is.null(substitute(groupvar)), 
                                                                                                                           substitute(groupvar), "")), linetype = 2, 
                           size = lwd)
    }
  }
  if (!is.null(marked.treshold)) {
    if (length(unique(data[, deparse(substitute(xvar))])) > 
        marked.treshold) {
      if (is.factor(data[, deparse(substitute(xvar))]) || 
          is.character(data[, deparse(substitute(xvar))]) || 
          is.logical(data[, deparse(substitute(xvar))])) {
        levs <- levels(factor(data[, deparse(substitute(xvar))]))
        min <- levs[1]
        max <- levs[length(levs)]
      }
      else {
        min <- min(data[, deparse(substitute(xvar))])
        max <- max(data[, deparse(substitute(xvar))])
      }
      subdata <- data[c(data[, deparse(substitute(xvar))] %in% 
                          c(min, max)), ]
      gg <- gg + geom_point(data = subdata, aes_(
        x = substitute(xvar), 
        y = substitute(yvar),
        group = ifelse(!is.null(substitute(groupvar)), substitute(groupvar), NA),
        colour = ifelse(!is.null(substitute(groupvar)), substitute(groupvar), ""),
        shape = substitute(groupvar)),
        fill = "white",
        stroke = 1.35 * lwd,
        size = 10/3 * lwd)+scale_shape_manual(values=21:25)
    }
    else {
      gg <- gg + geom_point(
        aes_(
          shape = substitute(groupvar)
        ),
        fill = "white",
        stroke = 1.35 * lwd,
        size = 10/3 * lwd)+scale_shape_manual(values=21:25)
    }
  }
  if (isTRUE(labels.end)) {
    if (is.factor(data[, deparse(substitute(xvar))]) || 
        is.character(data[, deparse(substitute(xvar))]) || 
        is.logical(data[, deparse(substitute(xvar))])) {
      levs <- levels(factor(data[, deparse(substitute(xvar))]))
      maxd <- data[data[, deparse(substitute(xvar))] == 
                     levs[length(levs)], ]
    }
    else {
      maxd <- data[data[, deparse(substitute(xvar))] == 
                     max(data[, deparse(substitute(xvar))]), ]
    }
    brks <- maxd[, deparse(substitute(yvar))]
    labsut <- maxd[, deparse(substitute(groupvar))]
  }
  else (brks <- labsut <- waiver())
  gg <- gg + ylab(ifelse(deparse(substitute(ylabel)) == "yvar", 
                         deparse(substitute(yvar)), ylabel)) + labs(title = title, 
                                                                    subtitle = subtitle, caption = caption) + thlTheme(show.grid.y = show.grid.y, 
                                                                                                                       show.grid.x = show.grid.x, base.size = base.size, legend.position = legend.position, 
                                                                                                                       x.axis.title = ifelse(!is.null(xlabel), TRUE, FALSE)) + 
    xlab(ifelse(!is.null(xlabel), xlabel, "")) + scale_color_manual(values = colors) + 
    thlYaxisControl(lang = lang, limits = ylimits, breaks = yaxis.breaks, 
                    sec.axis = labels.end, sec.axis.breaks = brks, sec.axis.labels = labsut)
  if (is.factor(data[, deparse(substitute(xvar))]) || is.character(data[, 
                                                                        deparse(substitute(xvar))]) || is.logical(data[, deparse(substitute(xvar))])) {
    gg <- gg + scale_x_discrete(breaks = xaxis.breaks, expand = expand_scale(mult = c(0.05)))
  }
  else (gg <- gg + scale_x_continuous(breaks = xaxis.breaks))
  if (isTRUE(panels)) {
    fmla <- as.formula(paste0("~", substitute(groupvar)))
    gg <- gg + facet_wrap(fmla, scales = "free", nrow = nrow.panels)
  }
  gg
}

groups <- function(o) {
  o$Group <- paste(o$Gender, o$Ages)
  o$Group <- factor(o$Group, levels = c(
    "Female 18-45",
    "Male 18-45",
    "Female >45",
    "Male >45"
  ))
  return(o)
}

#' @title reasons produces a ggplot object for a graph that shows fractions of reasons mentioned in the data.
#' @param dat a data.frame containing survey answers for the target subgroup.
#' @param title an atomic string with the name to be shown as the title of graph
#' @return ggplot object

reasons <- function(
  dat,
  title
) {
  require(reshape2)
  weight <- Ovariable("weight",data=data.frame(
    Row=dat$Row,
    Result=dat$Weight
  ))
  levels(dat$Country) <- paste0(
    levels(dat$Country), " (n=", 
    aggregate(dat$Country, dat["Country"], length)$x, ")"
  )
  tmp <- melt(
    dat[setdiff(colnames(dat),"Weighting")],
    id.vars = c("Country","Row"),
    variable.name ="Reason",
    value.name="Result"
  )
  tmp$Result <- (tmp$Result %in% c("Yes","1"))*1
  tmp <- weight * (Ovariable("tmp",data=tmp))
  cat(title, ": number of individual answers.\n")
  oprint(oapply(tmp, c("tmpResult","Country"),length)@output)
  
  tmp <- (oapply(tmp, c("Country","Reason"), sum)/oapply(tmp, "Country",sum))
  popu <- oapply(tmp, "Reason",sum)@output
  popu <- popu$Reason[order(popu$Result)]
  tmp$Reason <- factor(tmp$Reason, levels=popu)
  levels(tmp$Reason) <- gsub("( BH| BS)", "", gsub("\\.", " ", levels(tmp$Reason)))
  cat(title, ": fractions shown on graph.\n")
  oprint(tmp@output)
  
  if(thl) {
    tmp <- thlLinePlot(tmp@output, xvar=Reason, yvar=Result,groupvar=Country, 
                       colors= c("#519B2FFF", "#2F62ADFF", "#BE3F72FF","#88D0E6FF"), # #29A0C1FF"),
                       # THL colors but fourth is brigter
                       legend.position = c(0.85,0.2), base.size = BS, title=title,
                       subtitle="Fraction of population")+
      coord_flip()+
      scale_y_continuous(labels=scales::percent_format(accuracy=1))
  } else {
    tmp <- ggplot(tmp@output, aes(x=Reason, y=Result,colour=Country, group=Country))+
      geom_point(shape=21, size=5, fill="Grey", stroke=2)+
      geom_line(size=1.2)+
      coord_flip()+
      theme_gray(base_size=BS)+
      scale_y_continuous(labels=scales::percent_format())+#accuracy=1))+
      scale_colour_manual(values=colors)+
      labs(
        title=title,
        x="Answer",
        y="Fraction of population")
  }
  return(tmp)
}

#' @title Function impacts produces a graph showing the magnitude of impact if some things change
#' @param dat data.frame containing answers
#' @param title string to be used as the title of the graph
#' @param population ovariable of country population weights
#' @return ggplot object

impacts <- function(dat,title, population) {
  tmp <- melt(
    dat,
    id.vars = c("Country","Weighting"),
    variable.name ="Reason",
    value.name="Response"
  )
  tmp$Response <- factor(
    tmp$Response,
    levels=c(
      "I don't know",
      "Increase it over to double",
      "Increase it over by half",
      "Increase it a bit",
      "No effect",
      "Decrease it a bit",
      "Decrease it to less than half",
      "Decrease it to zero"
    ),
    ordered=TRUE
  )
  levels(tmp$Response)[levels(tmp$Response) %in% c(
    "Decrease it to zero",
    "Decrease it to less than half",
    "Decrease it a bit"
  )] <- "Decrease"
  levels(tmp$Response)[levels(tmp$Response) %in% c(
    "Increase it a bit",
    "Increase it over by half",
    "Increase it over to double"
  )] <- "Increase"
  
  colnames(tmp)[colnames(tmp)=="Weighting"] <- "Result"
  tmp <- Ovariable("tmp",data=tmp) * population
  
  popu <- aggregate(as.numeric(tmp$Response), tmp@output["Reason"],sum)
  popu <- popu$Reason[order(popu$x)]
  tmp$Reason <- factor(tmp$Reason, levels=popu)
  levels(tmp$Reason) <- gsub("( BH| BS)", "", gsub("\\.", " ", levels(tmp$Reason)))
  
  tmp <- (oapply(tmp, c("Reason","Response"), sum)/
            oapply(tmp, c("Reason"),sum))
  
  oprint(tmp@output)
  
  print(ggplot(tmp@output,
               aes(x=Reason, weight=Result, fill=Response))+geom_bar()+
          coord_flip()+#facet_grid(.~Country)+
          theme_gray(base_size=BS)+
          theme(legend.position = "bottom")+
          scale_fill_manual(values=c("Grey",colors[c(1,6,5)]))+
          scale_y_continuous(breaks=c(0.5,1),labels=scales::percent_format())+
          labs(
            title=title,
            x="Cause",
            y="Fraction of population"
          ))
  
  return(tmp@output)
}

#' @title or95 prints the odds ratio and 95 % confidence interval of a glm fit
#' @param fit a glm fit
#' @return data.frame with OR and 95%CI

or95 <- function(fit) {
  cat("Odds ratios with 95 % confidence intervals\n")
  tmp <- exp(cbind(coef(fit), confint(fit)))
  tmp <- data.frame(
    rownames(tmp),
    paste0(sprintf("%.2f",tmp[,1]), " (",sprintf("%.2f", tmp[,2]),"-",sprintf("%.2f",tmp[,3]),")")
  )
  colnames(tmp) <- c("Parameter","OR (95% CI)")
  return(tmp)
}

#' @title Perform and print regression analyses
#' @param dat data.frame with data
#' @param y name of column to be used as dependent variable
#' @param countries logical atom to tell whether to run country-specific analyses
#' @return returns the model fit object

printregr <- function(dat, y, title, countries = TRUE) {
  
  dat$y <- dat[[y]]
  cat("\n############### Logistic regression analysis:",title, "(", y, ")\n")
  fit <- glm(
    y ~ Ages + Gender + Country + as.numeric(Education) + as.numeric(Purchasing.power),
    family=binomial(),na.action="na.omit",weights=Weighting,
    data=dat
  )
  step <- stepAIC(fit, direction = "both")
  oprint(step$anova)
  oprint(summary(fit))
  oprint(or95(fit))
  
  if(countries) {
    for(i in unique(dat$Country)) {
      cat("\n#### Country-specific logistic regression analysis:", i, y, "\n")
      fit <- glm(
        Eat.fish ~ Ages + Gender + as.numeric(Education) + as.numeric(Purchasing.power),
        family=binomial(),na.action="na.omit",weights=Weighting,
        data=dat[dat$Country==i,]
      )
      step <- stepAIC(fit, direction = "both")
      oprint(step$anova)
      oprint(summary(fit))
      oprint(or95(fit))
    }
  }
  return(fit)
}

#################### Data for Figure 1.

if(thl) { # Commented out because needs package thlVerse.
  
  dat <- opbase.data("Op_en7749", subset="Fish consumption as food in Finland") # [[Goherr: Fish consumption study]]
  dat$Year <- as.numeric(as.character(dat$Year))
  
  gr <- thlLinePlot(
    dat[dat$Species=="Total",],
    xvar=Year,
    yvar=Result,
    ylimits=c(0,12),
    groupvar=Origin,
    base.size=24,
    title="Fish consumption in Finland",
    ylabel="",
    subtitle="(kg/a per person)",
    legend.position = "bottom"
  )
  
  if(localcomp) ggsave("Figure 1.pdf", width=8,height=6)
  if(localcomp) ggsave("Figure 1.png", width=8,height=6)
  
} # End if
################## Get data

objects.latest("Op_en7749", "preprocess2") # [[Goherr: Fish consumption study]]: survey1, surv
objects.latest("Op_en7749", "nonsamplejsp") # [[Goherr: Fish consumption study]]: jsp every respondent from data exactly once.
objects.latest("Op_en7749", "initiate") # [[Goherr: Fish consumption study]]: amount etc.
openv.setN(50)
survey1 <- groups(survey1)
#levels(survey1$Country) <- c("FI","SE","DK","EE") # WHY was this here? It is a potential hazard.
effinfo <- 0 # No policies implemented.
effrecomm <- 0

population <- EvalOutput(Ovariable(
  "population", 
  ddata = "Op_en7748",
  subset = "Population"
))
levels(population$Country) <- c("DK","EE","FI","SE")
population <- oapply(population, c("Country"),sum)

amount <- EvalOutput(amount)
#result(amount)[result(amount)==0] <- 0.1 Not needed if not log-transformed.
amount <- amount * 365.25 / 1000 # g/day --> kg/year

#### General fish eating

su1 <- EvalOutput(Ovariable(
  "su1",
  data = data.frame(
    survey1[c("Country","Gender","Education","Purchasing.power")],
    Result=1
  )
))

# Figure 1. Origin of consumed fish in Finland between 1999 and 2016.
# Data not on this page, drawn separately.

# Table 1. Dimensions of embeddedness, modified from Hass (2007, p. 16)
# Written directly on the manuscript.

# Table 2: statistics of the survey population in each country (n, female %, education, purchasing power)

tmp <- round(t(data.frame(
  n = tapply(rep(1,nrow(survey1)), survey1$Country,sum),
  Females = tapply(survey1$Gender=="Female", survey1$Country,mean)*100,
  Old = tapply(survey1$Ages==">45", survey1$Country,mean)*100,
  PurcVlow = tapply(survey1$Purchasing.power=="Very low", survey1$Country,mean)*100,
  PurcLow = tapply(survey1$Purchasing.power=="Low", survey1$Country,mean)*100,
  PurcSuf = tapply(survey1$Purchasing.power=="Sufficient", survey1$Country,mean)*100,
  PurcGood = tapply(survey1$Purchasing.power=="Good", survey1$Country,mean)*100,
  PurcVgood = tapply(survey1$Purchasing.power=="Very good", survey1$Country,mean)*100,
  PurcExc = tapply(survey1$Purchasing.power=="Excellent", survey1$Country,mean)*100,
  EducPri = tapply(survey1$Education=="Primary education", survey1$Country,mean)*100,
  EducSec = tapply(survey1$Education=="Secondary education (gymnasium, vocational school or similar)", survey1$Country,mean)*100,
  EducCol = tapply(survey1$Education=="Lower level college education or similar", survey1$Country,mean)*100,
  EducHig = tapply(survey1$Education=="Higher level college education or similar", survey1$Country,mean)*100
)))
rownames(tmp) <- c(
  "Number of respondents",
  "Females (%)",
  ">45 years (%)",
  "Purchasing power: Very low (%)",
  "Low (%)",
  "Sufficient (%)",
  "Good (%)",
  "Very good (%)",
  "Excellent (%)",
  "Education: Primary education (%)",
  "Secondary education (%)",
  "Lower level college (%)",
  "Higher level college (%)"
)
oprint(tmp, digits=0)

cat("Weighted fraction of fish eaters\n")
tmp <- aggregate(survey1$Weighting, survey1[c("Eat.fish","Country")],FUN=sum)
oprint(aggregate(tmp$x, tmp["Country"],FUN=function(x) x[2]/sum(x)))

ggplot(survey1, 
       aes(x=Eat.fish, weight=Weighting, group=Country), stat="count")+
  geom_bar(aes(y=..prop.., fill=Country), position="dodge")+
  theme_gray(base_size=BS)+
  scale_fill_manual(values=colors)+
  scale_y_continuous(labels=scales::percent_format())+
  labs(title="Fraction of fish eaters within population")

############## Answers to open questions
if(FALSE){
  cat("Number of open-ended answers per question and country\n")
  
  tmp <- survey1[c(1,3,27:28,36:37,44:45,60:61,71:72,83:84,93:94,109:110,121:122,133:134,154,157:158)]
  tmp <- reshape(
    tmp,
    varying=list(
      Condition = colnames(tmp)[grepl("other",tolower(colnames(tmp)))],
      Text = colnames(tmp)[grepl("open",colnames(tmp))]
    ),
    times = colnames(tmp)[grepl("other",tolower(colnames(tmp)))],
    direction="long"
  )
  tmp <- tmp[tmp$Why.not.fish.open!="" , ]
  oprint(table(tmp$time,tmp$Country))
  oprint(tmp)
} # End if(FALSE)

#### Baltic salmon

reasons(
  survey1[survey1$Eat.fish=="No"& !is.na(survey1$Eat.fish),c(1,17:26,154,157)], # 1 Country, 8 Why don't you eat fish
  "Reasons not to eat among fish non-consumers"
)

# Figure 5

cat("Figure 5 (to be converted to text)\n")

reasons(
  survey1[survey1$Eat.salmon=="Yes"& !is.na(survey1$Eat.salmon),c(1,31:36,154,157)], # 1 Country, 11 Which salmon species
  "Species used among salmon consumers"
)

reasons(
  survey1[survey1$Baltic.salmon=="Yes"& !is.na(survey1$Baltic.salmon),c(1,50:59,154,157)], # 1 Country, 17 Why Baltic salmon
  "Reasons to eat among Baltic salmon consumers"
)

if(localcomp) ggsave("Figure 5.pdf", width=10,height=5)
if(localcomp) ggsave("Figure 5.png", width=10,height=5)

reasons(
  survey1[survey1$Baltic.salmon=="No"& !is.na(survey1$Baltic.salmon),c(1,62:70,154,157)], # 1 Country, 18 Why not Baltic salmon
  "Reasons not to eat, Baltic salmon non-consumers"
)

if(localcomp) ggsave("Figure 6.pdf", width=10,height=5)
if(localcomp) ggsave("Figure 6.png", width=10,height=5)

impacts.sal <- impacts(
  survey1[survey1$Baltic.salmon=="Yes"& !is.na(survey1$Baltic.salmon),c(1,73:82,154)],
  "Effect on Baltic salmon consumption",
  population
)

####### Any herring

ggplot(survey1[survey1$Eat.fish=="Yes",],
       aes(x=Country, weight=Weighting, fill=Eat.herring))+geom_bar(position="fill")+
  coord_flip()+#facet_grid(.~Country)+
  theme_gray(base_size=BS)+
  theme(legend.position = "bottom")+
  scale_y_continuous(labels=scales::percent_format())+
  scale_fill_manual(values=colors)+
  labs(
    title="Any herring consumption",
    y="Fraction of fish consumers"
  )

####### Baltic herring

### Figure 2. Comparison of Baltic herring consumption habits. 

survey1$What <- ifelse(is.na(survey1$Eat.BH), "Other fish", as.character(survey1$Eat.BH))
survey1$What <- factor(
  survey1$What,
  levels=(c("Yes","I don't know","No","Other fish")),
  labels=(c("Baltic herring","Some herring","Other herring","Other fish"))
)

tmp <- survey1[survey1$Eat.fish=="Yes",]
colnames(tmp)[colnames(tmp)=="Weighting"] <- "Result"
tmp <- EvalOutput(Ovariable("tmp", data=tmp))
tmp <- oapply(tmp, c("What","Country"), sum) / oapply(tmp,"Country",sum)
tmp$Country <- factor(tmp$Country, levels=rev(levels(tmp$Country)))

if(thl) {
  thlBarPlot(tmp@output, xvar=Country, yvar=Result, groupvar=What, horizontal = TRUE, stacked = TRUE,
             legend.position = "bottom",,
             title="Baltic herring consumption",
             subtitle="Fraction of fish consumers",
  )+
    scale_y_continuous(labels=scales::percent_format())
  
} else {
  ggplot(tmp@output, aes(x=Country, weight=Result, fill=What))+geom_bar(position="fill")+
    coord_flip()+#facet_grid(.~Country)+
    theme_gray(base_size=BS)+
    theme(legend.position = "bottom")+
    scale_y_continuous(labels=scales::percent_format())+
    scale_fill_manual(values=c("gray",colors))+
    guides(fill=guide_legend(reverse=TRUE))+
    labs(
      title="Baltic herring consumption",
      y="Fraction of fish consumers"
    )
  
}

if(localcomp) ggsave("Figure 2.pdf",width=10, height=5)
if(localcomp) ggsave("Figure 2.png",width=6, height=3)

tmp <- survey1[survey1$Eat.fish=="Yes",]
tmp <- aggregate(tmp$Weighting, tmp[c("What","Country")], sum)

rown <- tmp$What[tmp$Country=="FI"]
tmp <- aggregate(tmp$x, tmp["Country"],FUN=function(x) x/sum(x))
colnames(tmp[[2]]) <- rown
oprint(as.matrix(tmp))


ggplot(survey1[survey1$Eat.herring=="Yes",],
       aes(x=Country, weight=Weighting, fill=Eat.BH))+geom_bar(position="fill")+
  coord_flip()+#facet_grid(.~Country)+
  theme_gray(base_size=BS)+
  theme(legend.position = "bottom")+
  scale_y_continuous(labels=scales::percent_format())+
  scale_fill_manual(values=c(colors))+
  labs(
    title="Baltic herring consumption",
    y="Fraction of herring consumers"
  )

####### Any salmon

ggplot(survey1[survey1$Eat.fish=="Yes",],
       aes(x=Country, weight=Weighting, fill=Eat.salmon))+geom_bar(position="fill")+
  coord_flip()+
  theme_gray(base_size=BS)+
  theme(legend.position = "bottom")+
  scale_y_continuous(labels=scales::percent_format())+
  scale_fill_manual(values=colors)+
  labs(
    title="Any salmon consumption",
    y="Fraction of fish consumers"
  )

####### Baltic salmon

ggplot(survey1[survey1$Eat.fish=="Yes",],
       aes(x=Country, weight=Weighting, fill=Baltic.salmon))+geom_bar(position="fill")+
  coord_flip()+
  theme_gray(base_size=BS)+
  theme(legend.position = "bottom")+
  scale_y_continuous(labels=scales::percent_format())+
  scale_fill_manual(values=colors)+
  labs(
    title="Baltic salmon consumption",
    y="Fraction of fish consumers"
  )

ggplot(survey1[survey1$Eat.salmon=="Yes",],
       aes(x=Country, weight=Weighting, fill=Baltic.salmon))+geom_bar(position="fill")+
  coord_flip()+
  theme_gray(base_size=BS)+
  theme(legend.position = "bottom")+
  scale_y_continuous(labels=scales::percent_format())+
  scale_fill_manual(values=colors)+
  labs(
    title="Baltic salmon consumption",
    y="Fraction of salmon consumers"
  )

############## Recommendation awareness

levels(survey1$Recommendation.awareness) <- c("Not aware","Aware","Know content")
ggplot(survey1,
       aes(x=Country, weight=Weighting, fill=Recommendation.awareness))+geom_bar(position="fill")+
  coord_flip()+#facet_grid(.~Country)+
  theme_gray(base_size=BS)+
  theme(legend.position = "bottom")+
  scale_y_continuous(labels=scales::percent_format())+
  scale_fill_manual(values=colors)+
  labs(title="Awareness of food recommendations about Baltic fish")

# Figure 3. Percentages of reasons to eat Baltic herring

reasons(
  survey1[survey1$Eat.BH=="Yes"& !is.na(survey1$Eat.BH),c(1,99:108,154,157)], # 1 Country, 27 why Baltic herring
  "Reasons to eat among Baltic herring consumers"
)

if(localcomp) ggsave("Figure 3.pdf", width=10, height=5)
if(localcomp) ggsave("Figure 3.png", width=10, height=5)

# Figure 4. Reasons for not to eat Baltic herring.
# Denmark and Estonia were omitted because they had less than 20 observations.

reasons(
  survey1[
    survey1$Eat.BH=="No"&
      survey1$Country %in% c("FI","SE") &
      !is.na(survey1$Eat.BH),c(1,111:120,154,157)], # 1 Country, 28 Why not Baltic herring
  "Reasons not to eat, Baltic herring non-consumers"
)+scale_color_manual(values=c("#BE3F72FF","#88D0E6FF"))+
  scale_shape_manual(values=23:24)

if(localcomp) ggsave("Figure 4.pdf", width=10, height=5)
if(localcomp) ggsave("Figure 4.png", width=10, height=5)

# Figure 7. The role of different determinants on Baltic salmon consumption

impacts.herr <- impacts(
  survey1[survey1$Eat.BH=="Yes"& !is.na(survey1$Eat.BH),c(1,123:132,154)], # 1 Country, 29 Influence of Baltic herring policies
  title="Effect on Baltic herring consumption",
  population
)

tmp <- rbind(
  data.frame(
    Fish="Herring",
    impacts.herr
  ),
  data.frame(
    Fish="Salmon",
    impacts.sal
  )
)

oprint(aggregate(tmp$Result,tmp[c("Reason","Response","Fish")],sum))

if(thl) {
  tmp$Response <- factor(tmp$Response, levels=rev(levels(tmp$Response)))
  thlBarPlot(aggregate(tmp$Result,tmp[c("Reason","Response","Fish")],sum),
             xvar=Reason, yvar=x, groupvar=Response, legend.position = "bottom",
             horizontal = TRUE, stacked = TRUE,
             base.size = BS, title="Effect on Baltic fish consumption",
             subtitle="Fraction of population")+
    facet_grid(.~Fish)+
    scale_fill_manual(values=rev(c("gray",colors[c(1,6,5)])))+
    scale_y_continuous(breaks=c(0.5,1),labels=scales::percent_format(accuracy=1))
  
} else {
  ggplot(tmp, aes(x=Reason, weight=Result, fill=Response))+geom_bar()+
    coord_flip()+facet_grid(.~Fish)+
    theme_gray(base_size=BS)+
    theme(legend.position = "bottom")+
    scale_fill_manual(values=c("gray",colors[c(1,6,5)]))+
    scale_y_continuous(breaks=c(0.5,1),labels=scales::percent_format())+#accuracy=1))+
    guides(fill=guide_legend(reverse=TRUE))+
    labs(
      title="Effect on Baltic fish consumption",
      x="Cause",
      y="Fraction of population"
    )
}

if(localcomp) ggsave("Figure 7.pdf", width=12, height=7)
if(localcomp) ggsave("Figure 7.png", width=12, height=7)

######### Amounts estimated for each respondent

amount <- groups(amount)

#tmp <- amount*info
#tmp <- tmp[
#  tmp$Info.improvements=="BAU" &
#    tmp$Recomm.herring=="BAU" &
#    tmp$Recomm.salmon=="BAU" ,
#  ]
tmp <- cbind(
  oapply(amount, c("Country","Group","Gender","Ages","Fish"), FUN=mean)@output,
  SD=oapply(amount, c("Country","Group","Gender","Ages","Fish"), FUN=sd)$amountResult
) 

# Figure 3 (old, unused). Average consumption (kg/year) of Baltic herring in four countries,
# calculated with Monte Carlo simulation (1000 iterations).

ggplot(tmp[tmp$Fish=="Herring",], aes(x=Group, weight=amountResult, fill=Group))+
  geom_bar()+facet_wrap(~Country)+
  theme_gray(base_size=BS)+
  scale_fill_manual(values=colors)+
  guides(fill=FALSE)+
  theme(axis.text.x = element_text(angle = -90))+
  labs(
    title="Baltic herring consumption in subgroups",
    y="Average consumption (kg/year)")

# if(localcomp) ggsave("Figure3.pdf", width=9, height=10)
# if(localcomp) ggsave("Figure3.png", width=9, height=10)

tmp$Result <- paste0(sprintf("%.1f",tmp$amountResult), " (",sprintf("%.1f", tmp$SD,1),")")
tmp <- reshape(tmp, v.names="Result", timevar="Country", idvar=c("Fish","Group"),drop=c("Gender","Ages","amountResult","SD"), direction="wide")
colnames(tmp) <- gsub("Result\\.", "", colnames(tmp))
cat("Average fish consumption in subgroups, mean (sd)\n")
oprint(tmp)

weight <- EvalOutput(Ovariable("weight",data=data.frame(
  amount@output[c("Row","Iter","Fish","Country")],
  Result=amount$Weighting
)))

tmp <- (oapply(amount * weight, c("Fish","Country"), sum) / oapply(weight,c("Fish","Country"), sum))@output

cat("Average fish consumption per country (kg/year)\n")
oprint(tmp)

result(amount)[result(amount)==0] <- 0.1
ggplot(amount@output, aes(x=amountResult, colour=Group))+stat_ecdf()+scale_x_log10()+
  facet_wrap(~Fish)

############################

#### Statistical analyses

dat <- merge(survey1,amount@output) # [amount$Fish=="Herring",]) # This results in 50 * 2 times of rows

# These should be corrected to preprocess2. NOT ordered.
dat$Country <- factor(dat$Country, ordered=FALSE)
dat$Ages <- factor(dat$Ages, ordered=FALSE)
dat$Gender <- factor(dat$Gender, ordered=FALSE)

ggplot(dat, aes(x=amountResult, weight=Weighting, color=Country))+stat_ecdf()+facet_wrap(~Fish)+
  scale_x_log10()

gro <- unique(dat$Group)
out <- list(data.frame(), data.frame())
for(i in unique(dat$Fish)) {
  for(j in unique(dat$Country)){
    for(k in unique(dat$Iter)) {
      res <- kruskal.test(amountResult ~ Group, data=dat[
        dat$Iter==k  & dat$Country==j & dat$Fish==i,])
      out[[1]] <- rbind(out[[1]], data.frame(
        test="Kruskal-Wallis",
        Fish=i,
        Country=j,
        Iter=k,
        p.value=res[[3]])
      )
      for(l in 1:(length(gro)-1)) {
        for(m in (l+1):length(gro)) {
          res2 <- wilcox.test(
            x=dat$amountResult[dat$Iter==k  & dat$Country==j & dat$Fish==i & dat$Group==gro[l]], 
            y=dat$amountResult[dat$Iter==k  & dat$Country==j & dat$Fish==i & dat$Group==gro[m]], conf.int = FALSE)
          out[[2]] <- rbind(out[[2]], data.frame(
            Fish=i,
            Country=j,
            Iter=k,
            Pair1=gro[l],
            Pair2=gro[m],
            p.value = res2$p.value
          ))
        }
      }
    }
  }
}

ggplot(out[[1]], aes(x=paste(Fish, Country), y = p.value, colour = Country))+geom_jitter()+
  geom_hline(yintercept=0.05)+coord_flip()+theme_gray(base_size=BS)+
  labs(title="Kruskal-Wallis non-parametric test across all groups")
ggplot(out[[2]], aes(x=paste(Fish, Country), y=p.value, colour=Pair1, shape=Pair2))+geom_jitter()+
  geom_hline(yintercept=0.05)+coord_flip()+theme_gray(base_size=BS)+
  labs(title="Mann-Whitney U test across all pairs of groups")

cat("Kurskal-Wallis non-parametric test for 50 iterations of amount\n")
oprint(aggregate(out[[1]]["p.value"], out[[1]][c("Country","Fish")], mean))

cat("Mann-Whitney U non-parametric test for 50 iterations of amount\n")
oprint(aggregate(out[[2]]["p.value"], out[[2]][c("Pair1","Pair2","Country","Fish")], mean))

###################### Logistic regression

dat2 <- dat[dat$Iter==1 & dat$Fish=="Herring",]

tmp <- printregr(dat2, "Eat.fish", "What explains whether people eat fish at all?")

tmp <- printregr(dat2, "Eat.herring", "What explains whether people eat herring compared with other fish?")

dat2$Eat.BH2 <- ifelse(dat2$Eat.BH!="Yes" | is.na(dat2$Eat.BH),0,1)
tmp <- printregr(dat2, "Eat.BH2", "What explains whether people eat Baltic herring compared with everyone else?")

dat2$What2 <- ifelse(dat2$What=="Baltic herring",1,ifelse(dat2$What=="Other herring",0,NA))
tmp <- printregr(dat2, "What2", "What explains whether people eat Baltic herring compared with other herring?")


cat("###################### How fish-specific causes are explained by
    population determinants?\n")


for(j in c(
  "Tastes.good.BH",
  "Self.caught.BH",
  "Easy.to.cook.BH",
  "Quick.to.cook.BH",
  "Readily.available.BH",
  "Healthy.BH",
  "Inexpensive.BH",
  "Family.likes.it.BH",
  "Environmental.BH",
  "Traditional.BH"
)) {
  tmp <- printregr(dat2, j, "What explains the reason to eat Baltic herring?")
}

for(j in c(
  "Bad.taste.BH",
  "Not.used.to.BH",
  "Cannot.cook.BH",
  "Difficult.to.cook.BH",
  "Not.available.BH",
  "Health.risks.BH",
  "Better.for.feed.BH",
  "Quality.issues.BH",
  "Sustainability.BH",
  "Not.traditional.BH"
)) {
  tmp <- printregr(dat2, j, "What explains the reason to not eat Baltic herring?")
}

# tmp <- printregr(dat2, "Eat.salmon", "What explains whether people eat any salmon compared with other fish?")
# tmp <- printregr(dat2, "Baltic.salmon", "What explains whether people eat Baltic salmon compared with other salmon?")

Luke data about fish consumption in Finland [8][9]

Fish consumption as food in Finland(kg/a per person)
Obs	Origin	Species	1999	2000	2001	2002	2003	2004	2005	2006	2007	2008	2009	2010	2011	2012	2013	2014	2015	2016	2017
1	domestic fish	Total	6.1	6.1	5.9	6.2	5.8	5.3	5.2	5.0	5.0	4.4	4.5	4.3	3.8	3.8	3.8	4.0	4.1	4.1	4.1
2	domestic fish	Farmed rainbow trout	1.6	1.6	1.6	1.6	1.3	1.3	1.4	1.1	1.1	1.2	1.2	1.1	1.0	1.0	1.1	1.1	1.3	1.2	1.2
3	domestic fish	Baltic herring	0.8	1.2	1.1	1.1	0.9	0.8	0.7	0.5	0.4	0.4	0.4	0.3	0.3	0.3	0.3	0.3	0.3	0.3	0.3
4	domestic fish	Pike	0.8	0.7	0.7	0.7	0.6	0.7	0.7	0.8	0.7	0.6	0.6	0.6	0.5	0.4	0.4	0.5	0.5	0.4	0.4
5	domestic fish	Perch	0.7	0.7	0.7	0.7	0.6	0.6	0.6	0.7	0.7	0.5	0.5	0.5	0.4	0.4	0.4	0.5	0.5	0.4	0.4
6	domestic fish	Vendace	0.7	0.7	0.7	0.7	0.8	0.8	0.7	0.6	0.6	0.6	0.6	0.7	0.6	0.6	0.6	0.6	0.6	0.5	0.6
7	domestic fish	European whitefish	0.4	0.4	0.4	0.4	0.3	0.3	0.3	0.3	0.5	0.3	0.3	0.3	0.3	0.3	0.3	0.3	0.2	0.3	0.3
8	domestic fish	Pike perch	0.3	0.2	0.2	0.2	0.3	0.3	0.3	0.3	0.3	0.3	0.3	0.3	0.3	0.3	0.3	0.3	0.3	0.4	0.4
9	domestic fish	Other domestic fish	0.8	0.6	0.5	0.8	1.0	0.5	0.5	0.7	0.7	0.5	0.6	0.5	0.4	0.5	0.4	0.5	0.4	0.5	0.5
10	imported fish	Total	6.0	6.2	7.0	7.1	8.0	8.6	7.9	8.6	9.7	9.7	9.3	10.2	11.1	10.9	10.8	10.9	10.2	9.1	9.8
11	imported fish	Farmed rainbow trout	0.2	0.3	0.4	0.6	0.9	0.6	0.6	0.7	1.0	1.0	0.8	0.8	0.9	1.0	0.9	0.9	0.8	0.9	0.8
12	imported fish	Farmed salmon	1.0	0.9	1.2	1.3	1.6	2.2	1.9	2.0	2.7	2.6	2.9	3.1	3.9	4.2	4.0	4.4	4.1	3.5	4.0
13	imported fish	Tuna (prepared and preserved)	1.0	1.2	1.4	1.4	1.5	1.6	1.6	1.5	1.7	1.7	1.6	1.7	1.7	1.6	1.9	1.7	1.6	1.4	1.5
14	imported fish	Saithe (frozen fillet)	0.7	0.6	0.7	0.7	0.7	0.6	0.4	0.5	0.5	0.5	0.5	0.5	0.6	0.5	0.5	0.5	0.5	0.4	0.4
15	imported fish	Shrimps	0.5	0.4	0.5	0.5	0.5	0.5	0.5	0.5	0.6	0.6	0.6	0.7	0.7	0.6	0.6	0.5	0.5	0.4	0.4
16	imported fish	Herring and Baltic herring (preserved)	0.6	0.5	0.6	0.6	0.5	0.4	0.5	0.6	0.5	0.6	0.3	0.5	0.4	0.3	0.4	0.5	0.5	0.5	0.5
17	imported fish	Other imported fish	2.0	2.3	2.2	2.0	2.3	2.7	2.4	2.8	2.7	2.7	2.6	2.9	2.9	2.7	2.5	2.3	2.3	1.9	2.2

Descriptive statistics

Model must contain predictors such as country, gender, age etc. Maybe we should first study what determinants are important? Model must also contain determinants that would increase or decrease fish consumption. This should be conditional on the current consumption. How? Maybe we should look at principal coordinates analysis with all questions to see how they behave.

Also look at correlation table to see clusters.

Some obvious results:

If reports no fish eating, many subsequent answers are NA.
No vitamins correlates negatively with vitamin intake.
Unknown salmon correlates negatively with the types of salmon eaten.
Different age categories correlate with each other.

However, there are also meaningful negative correlations:

Country vs allergy
Country vs Norwegian salmon and Rainbow trout
Country vs not traditional.
Country vs recommendation awareness
Allergy vs economic wellbeing
Baltic salmon use (4 questions) vs Don't like taste and Not used to
All questions between Easy to cook ... Traditional dish

Meaningful positive correlations:

All questions between Baltic salmon ... Rainbow trout
How often Baltic salmon/herring/side salmon/side herring
How much Baltic salmon/herring/side salmon/side herring
Better availability ... Recommendation
All questions between Economic wellbeing...Personal aims
Omega3, Vitamin D, and Other vitamins

Model runs

Model run 13.3.2017
Model run 21.4.2017 [10] old code from Answer merged to this code and debugged

+ Show code - Hide code

# This is code Op_en7749/ on page [[Goherr: Fish consumption study]]

library(OpasnetUtils)
library(ggplot2)
library(reshape2)
library(car)
library(vegan)

objects.latest("Op_en7749", "preprocess2") # [[Goherr: Fish consumption study]]: survey, surv

############################### From a previous code on Answer

for(i in c(5:6, 16, 29:30, 46:49, 85:86, 95:98, 135)) {
  temp <- survey[!is.na(survey[[i]]),]
  p <- ggplot(temp, aes(x = temp[[i]])) + geom_bar() +
    theme_gray(base_size = 18) + theme(axis.text.x=element_text(angle=90,hjust=1,vjust=0.4)) +
    labs(title = colnames(temp[i])) + xlab("") + facet_wrap(~ Country)
  print(p)
}

temp <- melt(survey, measure.vars = 145:153, variable.name = "Changing.factor", value.name = "Impact")
levs <- c("-5 strongly disagree", "-4", "-3", "-2", "-1", "0 Neutral", "1", "2", "3", "4", "5 strongly agree", "I don't know")
temp$Impact <- factor(temp$Impact, levels = levs, labels = levs, ordered = TRUE)

ggplot(temp, aes(x = Impact, fill = Country)) + geom_bar(position = "dodge") + facet_wrap(~ Changing.factor)+
  theme_gray(base_size = 24) + theme(axis.text.x=element_text(angle=90,hjust=1,vjust=0.4))

surveytemp <- subset(survey, survey[[31]] == "Yes")

temp <- melt(surveytemp, measure.vars = 38:43, variable.name = "Variable", value.name = "Value")
ggplot(temp, aes(x = Value, fill = Country)) + geom_bar(position = "dodge") + facet_wrap(~ Variable)+
  theme_gray(base_size = 24) + labs(title = "Baltic salmon sources")

temp <- melt(surveytemp, measure.vars = 50:59, variable.name = "Variable", value.name = "Value")
ggplot(temp, aes(x = Value, fill = Country)) + geom_bar(position = "dodge") + facet_wrap(~ Variable)+
  theme_gray(base_size = 24) + labs(title = "Reasons to eat Baltic salmon")

temp <- melt(surveytemp, measure.vars = 73:82, variable.name = "Variable", value.name = "Value")
ggplot(temp, aes(x = Value, fill = Country)) + geom_bar(position = "dodge") + facet_wrap(~ Variable)+
  theme_gray(base_size = 24) + theme(axis.text.x=element_text(angle=90,hjust=1,vjust=0.4)) + labs(title = "Baltic salmon actions")

surveytemp <- subset(survey, survey[[31]] == "No")

temp <- melt(surveytemp, measure.vars = 62:70, variable.name = "Variable", value.name = "Value")
ggplot(temp, aes(x = Value, fill = Country)) + geom_bar(position = "dodge") + facet_wrap(~ Variable)+
  theme_gray(base_size = 24) + labs(title = "Reasons not to eat Baltic salmon")


surveytemp <- subset(survey, survey[[86]] == "Yes")

temp <- melt(surveytemp, measure.vars = 87:92, variable.name = "Variable", value.name = "Value")
ggplot(temp, aes(x = Value, fill = Country)) + geom_bar(position = "dodge") + facet_wrap(~ Variable)+
  theme_gray(base_size = 24) + labs(title = "Baltic herring sources")

temp <- melt(surveytemp, measure.vars = 99:108, variable.name = "Variable", value.name = "Value")
ggplot(temp, aes(x = Value, fill = Country)) + geom_bar(position = "dodge") + facet_wrap(~ Variable)+
  theme_gray(base_size = 24) + labs(title = "Reasons to eat Baltic herring")

temp <- melt(surveytemp, measure.vars = 123:132, variable.name = "Variable", value.name = "Value")
ggplot(temp, aes(x = Value, fill = Country)) + geom_bar(position = "dodge") + facet_wrap(~ Variable)+
  theme_gray(base_size = 24) + theme(axis.text.x=element_text(angle=90,hjust=1,vjust=0.4)) + labs(title = "Baltic herring actions")

surveytemp <- subset(survey, survey[[86]] == "No")

temp <- melt(surveytemp, measure.vars = 111:120, variable.name = "Variable", value.name = "Value")
ggplot(temp, aes(x = Value, fill = Country)) + geom_bar(position = "dodge") + facet_wrap(~ Variable)+
  theme_gray(base_size = 24) + labs(title = "Reasons not to eat Baltic herring")

#####################################3

temp <- sapply(survey, as.numeric) # Can be done for surv to get a smaller matrix

survey_correlations <- (cor(temp, method="spearman", use="pairwise.complete.obs"))

temp <- colnames(survey_correlations)

melted_correlations <- melt(survey_correlations)

melted_correlations$Var1 <- factor(melted_correlations$Var1, levels=temp)
melted_correlations$Var2 <- factor(melted_correlations$Var2, levels=temp)
melted_correlations$value <- ifelse(melted_correlations$value >= 0.99,NA,melted_correlations$value)

ggplot(melted_correlations, aes(x = Var1, y = Var2, fill = value, label= round(value, 2)))+
  geom_raster()+
  theme(axis.text.x=element_text(angle=90,hjust=1,vjust=0.4))+
  scale_fill_gradient2(low = "#480610", mid = "#FFFFFF", high = "#06480F", midpoint = 0, space = "Lab", guide = "colourbar")

####################### Descriptive statistics

oprint(cor(surv, use = "pairwise.complete.obs"))
# --> Baltic salmon and herring eating are correlated, so they should be estimated together

oprint(table(surv[c(12,7,4)], useNA = "ifany"))
oprint(table(surv[c(13,12)], useNA = "ifany"))

############################# Plot original data

# Eating frequencies of fish and Baltic salmon and herring with random noise, all
pl <- surv[surv$Eatfish,c(5,8,10,13,15)]
scatterplotMatrix(
  data.matrix(pl) + runif(nrow(pl)*ncol(pl), -0.5, 0.5)
)

ggplot(data.frame(
  X = rep(1,5), Y = 5:1,
  legend = c("All", "Finland", "Sweden", "Denmark", "Estonia")
), aes(x=X, y=Y, label=legend))+
  geom_text()+
  labs(title="Fish eating questions with random noise")


## Fish eating questions with some random noise, all
pl <- surv[surv$Eatfish,c(5,6,7,12)]
scatterplotMatrix(
  data.matrix(pl) + runif(nrow(pl)*ncol(pl), -0.5, 0.5)
)

## Fish eating questions with some random noise, FI
pl <- surv[surv$Country == 1 & surv$Eatfish,c(5,6,7,12)]
scatterplotMatrix(
  data.matrix(pl) + runif(nrow(pl)*ncol(pl), -0.5, 0.5)
)

## Fish eating questions with some random noise, SWE
pl <- surv[surv$Country == 2 & surv$Eatfish,c(5,6,7,12)]
scatterplotMatrix(
  data.matrix(pl) + runif(nrow(pl)*ncol(pl), -0.5, 0.5)
)

## Fish eating questions with some random noise, Dk
pl <- surv[surv$Country == 3 & surv$Eatfish,c(5,6,7,12)]
scatterplotMatrix(
  data.matrix(pl) + runif(nrow(pl)*ncol(pl), -0.5, 0.5)
)

## Fish eating questions with some random noise, EST
pl <- surv[surv$Country == 4 & surv$Eatfish,c(5,6,7,12)]
scatterplotMatrix(
  data.matrix(pl) + runif(nrow(pl)*ncol(pl), -0.5, 0.5)
)

## Baltic herring questions with some random noise
pl <- surv[surv$Eatherr,13:16]
scatterplotMatrix(
  data.matrix(pl) + runif(nrow(pl)*ncol(pl), -0.5, 0.5)
)

## Baltic salmon questions with some random noise
pl <- surv[surv$Eatsalm,8:11]
scatterplotMatrix(
  data.matrix(pl) + runif(nrow(pl)*ncol(pl), -0.5, 0.5)
)

##################### CORRELATION MATRIX

temp <- sapply(survey, as.numeric) # Can be done for surv to get a smaller matrix

survey_correlations <- (cor(temp, method="spearman", use="pairwise.complete.obs"))

temp <- colnames(survey_correlations)

melted_correlations <- melt(survey_correlations)

melted_correlations$Var1 <- factor(melted_correlations$Var1, levels=temp)
melted_correlations$Var2 <- factor(melted_correlations$Var2, levels=temp)
melted_correlations$value <- ifelse(melted_correlations$value >= 0.99,NA,melted_correlations$value)

ggplot(melted_correlations, aes(x = Var1, y = Var2, fill = value, label= round(value, 2)))+
  geom_raster()+
  theme(axis.text.x=element_text(angle=90,hjust=1,vjust=0.4))+
  scale_fill_gradient2(low = "#480610", mid = "#FFFFFF", high = "#06480F", midpoint = 0, space = "Lab", guide = "colourbar")

############################### PRINCIPAL COORDINATE ANALYSIS (PCoA)

#tämä osa valmistaa sen datan.
hypocols1 <- c(46:49,95:98)
answ <- sapply(survey[hypocols1], FUN=as.numeric)
answ <- as.matrix(answ[!is.na(rowSums(answ)),])

pcoa_caps <- capscale(t(answ) ~ 1, distance="euclidean") ##PCoA done

## Kuva koko hypoteeseista

colstr <- c("palevioletred1","royalblue1","seagreen1","violet","khaki2","skyblue", "orange")

#hypo_sizes <- (5 - colMeans(answ))
#leg_sizes <- c(4, 3, 2, 1, 0.01)

#pdf(file="pcoa_plot.pdf", height=6, width=7.5)
plot(pcoa_caps, display = c("sp", "wa"), type="n")#, xlim=c(-6,4.5)) ## PCoA biplot, full scale
points(pcoa_caps, display= c("sp"), col="gray40") # adding the people points
points(pcoa_caps, display= c("wa"), pch=19)#, cex=hypo_sizes, col=trait.cols)
text(pcoa_caps, display=c("wa"), srt=25, cex=0.5)
#legend(x=-6, y=3.8, levels(traits), fill=colstr, bty="n", cex=1)
#legend(-6, -2, legend=c("Very likely", "Moderately likely",
#                        "No opinion", "Moderately unlikely", "Very unlikely"),
#       pch=21, pt.cex = leg_sizes, bty="n", cex=1)
#dev.off()

Bayes model

Model run 3.3.2017. All variables assumed independent. [11]
Model run 3.3.2017. p has more dimensions. [12]
Model run 25.3.2017. Several model versions: strange binomial+multivarnormal, binomial, fractalised multivarnormal [13]
Model run 27.3.2017 [14]
Other models except multivariate normal were archived and removed from active code 29.3.2017.
Model run 29.3.2017 with raw data graphs [15]
Model run 29.3.2017 with salmon and herring ovariables stored [16]
Model run 13.4.2017 with first version of coordinate matrix and principal coordinate analysis [17]
Model run 20.4.2017 [18] code works but needs a safety check against outliers
Model run 21.4.2017 [19] some model results plotted
Model run 21.4.2017 [20] ovariables produced by the model stored.
Model run 18.5.2017 [21] small updates
13.2.2018 old model run but with new Opasnet [22]

+ Show code - Hide code

# This is code Op_en7749/bayes on page [[Goherr: Fish consumption study]]

library(OpasnetUtils)
library(ggplot2)
library(reshape2)
library(rjags)
library(car)
library(vegan)
library(MASS)
#library(gridExtra) # Error: package ‘gridExtra’ was built before R 3.0.0: please re-install it

# Fish intake in humans
# Data from data.frame survey from page [[Goherr: Fish consumption study]]

objects.latest("Op_en7749", "preprocess2") # [[Goherr: Fish consumption study]]: survey, surv, ...

cat("Version with multivariate normal.\n")

# Development needs:
## Correlation between salmon.often and herring.often needs to be estimated.
## Gender, country and age-spesific values should be estimated.

mod <- textConnection("
  model{
    for(i in 1:S) {
      survs[i,1:4] ~ dmnorm(mus[], Omegas[,])
    }
    for(j in 1:H) {
      survh[j,1:4] ~ dmnorm(muh[], Omegah[,])
    }
    mus[1:4] ~ dmnorm(mu0[1:4], S2[1:4,1:4])
    Omegas[1:4,1:4] ~ dwish(S3[1:4,1:4],S)
    anss.pred ~ dmnorm(mus[], Omegas[,])
    muh[1:4] ~ dmnorm(mu0[1:4], S2[1:4,1:4])
    Omegah[1:4,1:4] ~ dwish(S3[1:4,1:4],H)
    ansh.pred ~ dmnorm(muh[], Omegah[,])
  }
")

jags <- jags.model(
  mod,
  data = list(
    survs = surv[surv$Eatsalm,c(8:11)],
    S = sum(surv$Eatsalm),
    survh = surv[surv$Eatherr,c(13:16)],
    H = sum(surv$Eatherr),
    mu0 = rep(2,4),
    S2 = diag(4)/100000,
    S3 = diag(4)/10000
  ),
  n.chains = 4,
  n.adapt = 100
)

update(jags, 100)
samps.j <- jags.samples(
  jags, 
  c('mus', 'Omegas', 'anss.pred','muh','Omegah','ansh.pred'), 
  1000
)
js <- array(
  c(
    samps.j$mus[,,1],
    samps.j$Omegas[,1,,1],
    samps.j$Omegas[,2,,1],
    samps.j$Omegas[,3,,1],
    samps.j$Omegas[,4,,1],
    samps.j$anss.pred[,,1],
    samps.j$muh[,,1],
    samps.j$Omegah[,1,,1],
    samps.j$Omegah[,2,,1],
    samps.j$Omegah[,3,,1],
    samps.j$Omegah[,4,,1],
    samps.j$ansh.pred[,,1]
  ),
  dim = c(4,1000,6,2),
  dimnames = list(
    Question = 1:4,
    Iter = 1:1000,
    Parameter = c("mu","Omega1", "Omega2", "Omega3", "Omega4", "ans.pred"),
    Fish = c("Salmon", "Herring")
  )
)

# Mu for all questions about salmon
scatterplotMatrix(t(js[,,1,1]))
# All parameters for question 1 about salmon
scatterplotMatrix(js[1,,,1])
# Same for herring
scatterplotMatrix(js[1,,,2])

jsd <- melt(js)
#ggplot(jsd, aes(x=value, colour=Question))+geom_density()+facet_grid(Parameter ~ Fish)
#ggplot(as.data.frame(js), aes(x = anss.pred, y = Sampled))+geom_point()+stat_ellipse()

coda.j <- coda.samples(
  jags, 
  c('mus', 'Omegas', 'anss.pred'), 
  1000
)

plot(coda.j)

######## fish.param contains expected values of the distribution parameters from the model

fish.param <- list(
  mu = apply(js[,,1,], MARGIN = c(1,3), FUN = mean),
  Omega = lapply(
    1:2,
    FUN = function(x) {
      solve(apply(js[,,2:5,], MARGIN = c(1,3,4), FUN = mean)[,,x])
    } # solve matrix: precision->covariace
  )
)

objects.store(fish.param)
cat("List fish.param stored.\n")

Initiate ovariables

jsp taken directly from data WITHOUT salmpling

+ Show code - Hide code

# This is code Op_en7749/nonsamplejsp on page [[Goherr: Fish consumption study]]
# The code produces amount esimates (jsp ovariable) directly from data rather than bayesian model or sampling.

library(OpasnetUtils)

jsp <- Ovariable(
  "jsp", 
  dependencies = data.frame(Name="survey1", Ident="Op_en7749/preprocess2"), # [[Goherr: Fish consumption study]]
  formula = function(...) {
    require(reshape2)
    
    sur <- survey1[c(157,1,3,158,162:178,81,82,131,132,154)] # Removed, not needed:16,29,30?
    
    #colnames(sur)
    #[1] "Row"                      "Country"                  "Gender"                  
    #[4] "Ages"                     "Baltic.salmon.n"          "How.often.BS.n"          
    #[7] "How.much.BS.n"            "How.often.side.BS.n"      "How.much.side.BS.n"      
    #[10] "Better.availability.BS.n" "Less.chemicals.BS.n"      "Eat.BH.n"                
    #[13] "How.often.BH.n"           "How.much.BH.n"            "How.often.side.BH.n"     
    #[16] "How.much.side.BH.n"       "Better.availability.BH.n" "Less.chemicals.BH.n"     
    #[19] "Eatfish"                  "Eatsalm"                  "Eatherr"                 
    #[22] "Recommended.BS"           "Not.recommended.BS"       "Recommended.BH"          
    #[25] "Not.recommended.BH"       "Weighting"               
    
    # Make sure that Row is kept separate from Iter because in the sampling version they are different.
    # sur contained columns Eat.fish, How.often.fish, Eat.salmon. Are these needed, as all other questions are melted? No
    # Columns 5-7 removed, so colnames list above does not match.
    
    colnames(sur)[5] <- "Eat.BS"
    colnames(sur) <- gsub("\\.n","",colnames(sur))
    sur[22:25] <- sapply(sur[22:25], as.numeric)
    sur <- melt(
      sur,
      id.vars=c(1:4,26),
      variable.name="Question",
      value.name="Result"
    )
    sur$Fish <- ifelse(grepl("BH",sur$Question),"Herring","Salmon")
    sur$Question <- gsub("\\.BS","",sur$Question)
    sur$Question <- gsub("\\.BH","",sur$Question)
    
    ########## If the missing values are not adjusted, they drop out in the next stage.    
    if(TRUE) {
      # The adjustments below probably should go to the preprocess2 code.
      sur$Result[sur$Question %in% c(
        "Better.availability",
        "Less.chemicals",
        "Recommended",
        "Not.recommended"
      ) & is.na(sur$Result)] <- 4 # If missing --> no change  
      sur$Result[is.na(sur$Result)] <- 1 # Replace missing values with 1. That will produce 0 g/d.
    }
    
    return(Ovariable(
      output = sur, 
      marginal = colnames(sur) %in% c("Fish", "Iter", "Question")
    ))
  }
)

objects.store(jsp)
cat("Ovariable jsp with actual survey data: every respondent is kept in data without sampling.\n")

Amount estimated from a bayesian model

Model run 24.5.2017 [23]

+ Show code - Hide code

# This is code Op_en7749/modeljsp on page [[Goherr: Fish consumption study]]

library(OpasnetUtils)

jsp <- Ovariable(
  "jsp",
  dependencies = data.frame(Name = "fish.param", Ident = "Op_en7749/bayes"),
  formula = function(...) {
    require(MASS)
    require(reshape2)
    jsp <- lapply(
      1:2,
      FUN = function(x) {
        mvrnorm(openv$N, fish.param$mu[,x], fish.param$Omega[[x]])
      }
    )
    
    jsp <- rbind(
      cbind(
        Fish = "Salmon",
        Iter = 1:nrow(jsp[[1]]),
        as.data.frame(jsp[[1]])
      ),
      cbind(
        Fish = "Herring",
        Iter = 1:nrow(jsp[[2]]),
        as.data.frame(jsp[[2]])
      )
    )
    jsp <- melt(jsp, id.vars = c("Iter", "Fish"), variable.name = "Question", value.name = "Result")
    jsp <- Ovariable(output=jsp, marginal = colnames(jsp) %in% c("Iter", "Fish", "Question"))
    return(jsp)
  }
)

objects.store(jsp)
cat("Ovariable jsp stored.\n")

Amount estimates directly from data rather than from a bayesian model

Initiation run 18.5.2017 [24]
Initiation run 24.2.2018: sampling from survey rather than each respondent once [25]

+ Show code - Hide code

# This is code Op_en7749/surveyjsp on page [[Goherr: Fish consumption study]]
# The code produces amount esimates (jsp ovariable) directly from data rather than bayesian model.

library(OpasnetUtils)

jsp <- Ovariable(
  "jsp", 
  dependencies = data.frame(Name="survey1", Ident="Op_en7749/preprocess2"), # [[Goherr: Food cunsumption study]]
  formula = function(...) {
    require(reshape2)
    
    sur <- survey1[
      sample(1:nrow(survey1), openv$N, replace=TRUE, prob=survey1$Weighting) , 
      c(157,1,3,158,162:178,81,82,131,132) # Removed, not needed:16,29,30?
      ]
    sur$Iter <- 1:openv$N
    
    #colnames(sur)
    #[1] "Row"                      "Country"                  "Gender"                  
    #[4] "Ages"                     "Baltic.salmon.n"          "How.often.BS.n"          
    #[7] "How.much.BS.n"            "How.often.side.BS.n"      "How.much.side.BS.n"      
    #[10] "Better.availability.BS.n" "Less.chemicals.BS.n"      "Eat.BH.n"                
    #[13] "How.often.BH.n"           "How.much.BH.n"            "How.often.side.BH.n"     
    #[16] "How.much.side.BH.n"       "Better.availability.BH.n" "Less.chemicals.BH.n"     
    #[19] "Eatfish"                  "Eatsalm"                  "Eatherr"                 
    #[22] "Recommended.BS"           "Not.recommended.BS"       "Recommended.BH"          
    #[25] "Not.recommended.BH"       "Iter"                    
    
    # Make sure that Row is kept separate from Iter because in the sampling version they are different.
    # sur contained columns Eat.fish, How.often.fish, Eat.salmon. Are these needed, as all other questions are melted? No
    # Columns 5-7 removed, so colnames list above does not match.
    
    colnames(sur)[5] <- "Eat.BS"
    colnames(sur) <- gsub("\\.n","",colnames(sur))
    sur[22:25] <- sapply(sur[22:25], as.numeric)
    sur <- melt(
      sur,
      id.vars=c(1:4,26),
      variable.name="Question",
      value.name="Result"
    )
    sur$Fish <- ifelse(grepl("BH",sur$Question),"Herring","Salmon")
    sur$Question <- gsub("\\.BS","",sur$Question)
    sur$Question <- gsub("\\.BH","",sur$Question)
    
    # The adjustments below probably should go to the preprocess2 code.
    sur$Result[sur$Question %in% c(
      "Better.availability",
      "Less.chemicals",
      "Recommended",
      "Not.recommended"
    ) & is.na(sur$Result)] <- 4 # If missing --> no change  
    sur$Result[is.na(sur$Result)] <- 1 # Replace missing values with 1. That will produce 0 g/d.
    
    return(Ovariable(
      output = sur, 
      marginal = colnames(sur) %in% c("Fish", "Iter", "Question")
    ))
  }
)

objects.store(jsp)
cat("Ovariable jsp with actual survey data: a respondent is sampled for each iteration.\n")

Initiate other ovariables

Code stores ovariables assump, often, much, oftenside, muchside, amount.
Model run 19.5.2017 [26]
Initiation run 24.5.2017 without jsp [27]
Model run 8.6.2017 [28]

+ Show code - Hide code

# This is code Op_en7749/initiate on page [[Goherr: Fish consumption study]]

library(OpasnetUtils)

# Combine modelled survey answers with estimated amounts and frequencies by:
# Rounding the modelled result and merging that with value in ovariable assump

often <- Ovariable(
  "often",
  dependencies = data.frame(
    Name=c("jsp","assump"),
    Ident=c("Op_en7749/surveyjsp","Op_en7749/initiate")
  ),
  formula = function(...) {
    out <- jsp[jsp$Question == "How.often" , ]
    out$Value <- round(result(out))
    out <- merge(
      assump@output[assump$Variable == "freq",],
      out@output
    )
    out <- out[!colnames(out) %in% c("Value", "Variable", "Result","Question")]
    colnames(out)[colnames(out) == "assumpResult"] <- "Result"
    return(out)
  }
)

much <- Ovariable(
  "much",
  dependencies = data.frame(
    Name=c("jsp","assump"),
    Ident=c("Op_en7749/surveyjsp","Op_en7749/initiate")
  ),
  formula = function(...) {
    out <- jsp[jsp$Question == "How.much" , ]
    out$Value <- round(result(out))
    out <- merge(
      assump@output[assump$Variable == "amdish",],
      out@output
    )
    out <- out[!colnames(out) %in% c("Value", "Variable", "Result","Question")]
    colnames(out)[colnames(out) == "assumpResult"] <- "Result"
    return(out)
  }
)

oftenside <- Ovariable(
  "oftenside",
  dependencies = data.frame(
    Name=c("jsp","assump"),
    Ident=c("Op_en7749/surveyjsp","Op_en7749/initiate")
  ),
  formula = function(...) {
    out <- jsp[jsp$Question == "How.often.side" , ]
    out$Value <- round(result(out))
    out <- merge(
      assump@output[assump$Variable == "freq",],
      out@output
    )
    out <- out[!colnames(out) %in% c("Value", "Variable", "Result","Question")]
    colnames(out)[colnames(out) == "assumpResult"] <- "Result"
    return(out)
  }
)

muchside <- Ovariable(
  "muchside",
  dependencies = data.frame(
    Name=c("jsp","assump"),
    Ident=c("Op_en7749/surveyjsp","Op_en7749/initiate")
  ),
  formula = function(...) {
    out <- jsp[jsp$Question == "How.much.side" , ]
    out$Value <- round(result(out))
    out <- merge(
      assump@output[assump$Variable == "amside",],
      out@output
    )
    out <- out[!colnames(out) %in% c("Value", "Variable", "Result","Question")]
    colnames(out)[colnames(out) == "assumpResult"] <- "Result"
    return(out)
  }
)

assump <- Ovariable(
  "assump",
  ddata = "Op_en7749", subset = "Assumptions for calculations"
)

effinfo <- Ovariable( # Effect of information
  "effinfo",
  dependencies = data.frame(
    Name=c("jsp","assump"),
    Ident=c("Op_en7749/surveyjsp","Op_en7749/initiate")
  ),
  formula = function(...) {
    away <- c("Value","Variable","Result","Question","assumpUnit","assumpSource","jspResult","jspSource")
    # Better availability of fish
    o1 <- jsp[jsp$Question == "Better.availability" , ]
    o1$Value <- round(result(o1))
    o1 <- merge(
      assump@output[assump$Variable == "change",],
      o1@output
    )
    colnames(o1)[colnames(o1)=="assumpResult"] <- "Better.availabilityResult"
    o1 <- Ovariable(name="Better.availability", output=o1[!colnames(o1) %in% away])
    
    # Less chemicals
    o2 <- jsp[jsp$Question == "Less.chemicals" , ]
    o2$Value <- round(result(o2))
    o2 <- merge(
      assump@output[assump$Variable == "change",],
      o2@output
    )
    colnames(o2)[colnames(o2)=="assumpResult"] <- "Less.chemicalsResult"
    o2 <- Ovariable(name="Less.chemicals", output=o2[!colnames(o2) %in% away])
    
    out <- (o1 + o2)@output
    return(Ovariable(output=out, marginal=colnames(out) %in% c("Iter","Fish")))
  }
)

effrecommRaw <- Ovariable( # Effect of recommendations
  "effrecommRaw",
  dependencies = data.frame(
    Name=c("jsp","assump"),
    Ident=c("Op_en7749/surveyjsp","Op_en7749/initiate")
  ),
  formula = function(...) {
    # Recommend more fish
    o1 <- jsp[jsp$Question == "Recommended" , ]
    o1$Value <- round(result(o1))
    o1 <- merge(
      assump@output[assump$Variable == "change",],
      o1@output
    )
    colnames(o1)[colnames(o1)=="assumpResult"] <- "Result"
    o1$Recommendation <- "Eat more"
    
    # Recommend less fish
    o2 <- jsp[jsp$Question == "Not.recommended" , ]
    o2$Value <- round(result(o2))
    o2 <- merge(
      assump@output[assump$Variable == "change",],
      o2@output
    )
    colnames(o2)[colnames(o2)=="assumpResult"] <- "Result"
    o2$Recommendation <- "Eat less"
    
    out <- rbind(o1, o2)[!colnames(o1) %in% 
            c("Value","Variable","Question","assumpUnit","assumpSource","jspResult","jspSource")
]

    return(Ovariable(output=out, marginal=colnames(out) %in% c("Iter","Fish","Recommendation")))
  },
  unit = "change of amount in fractions"
)

# effrecommRaw and effrecomm are needed to enable decisions to affect the effect of recommendation before
# the ovariable is collapsed. So, target the decisions to effrecommRaw and collapses to effrecomm.

effrecomm <- Ovariable(
  "effrecomm",
  dependencies = data.frame(Name="effrecommRaw"),
  formula = function(...) {
    return(effrecommRaw)
  }
)

amountRaw <- Ovariable(
  "amountRaw",
  dependencies = data.frame(Name = c(
    "often",
    "much",
    "oftenside",
    "muchside",
    "assump"
  )),
  formula = function(...) {
    away <- c(
      "assumpUnit", 
      "jspResult",
      "jspSource",
      "Variable",
      "Value",
      "Explanation"
    )
    often <- often[ , !colnames(often@output) %in% away]
    much <- much[ , !colnames(much@output) %in% away]
    oftenside <- oftenside[ , !colnames(oftenside@output) %in% away]
    muchside <- muchside[ , !colnames(muchside@output) %in% away]
    assump <- assump[assump$Variable == "ingredient", !colnames(assump@output) %in% away]
    
    out <- (often * much + oftenside * muchside * assump)/365 # g /d
    return(out)
  }
)

amount <- Ovariable(
  "amount",
  dependencies = data.frame(Name = c(
    "amountRaw",
    "effinfo",
    "effrecomm"
  )),
  formula = function(...) {

    out <- amountRaw * (1 + effinfo + effrecomm)
    result(out)[result(out)<0] <- 0
    result(out)[result(out)>100] <- 100
    return(out)
  }
)

rm(wiki_username)
objects.store(list=ls())
cat("Ovariables", ls(), "stored.\n")

Other code

This is code for analysing EFSA food intake data about fish for BONUS GOHERR manuscript Ronkainen L, Lehikoinen A, Haapasaari P, Tuomisto JT. 2019.

+ Show code - Hide code

################################
# This is code for analysing EFSA food intake data about fish for manuscript
# Ronkainen L, Lehikoinen A, Haapasaari P, Tuomisto JT. 2019

library(ggplot2)

# Change the encoding from UCS-2 to UTF-8 first.
dat <- read.csv(
#  "C:/_Eivarmisteta/EFSA fish chronic intake.csv",
  "C:/_Eivarmisteta/EFSA herring salmon chronic intake.csv",
  header=TRUE, sep=",",dec=".",quote='\"'
)

#> colnames(dat)
#[1] "ï..Survey.s.country"     "Survey.start.year"       "Survey"                 
#[4] "Population.Group..L2."   "pop.order"               "Exposure.hierarchy..L1."
#[7] "Number.of.subjects"      "Number.of.consumers"     "Mean"                   
#[10] "Standard.Deviation"      "X5th.percentile"         "X10th.percentile"       
#[13] "Median"                  "X95th.percentile"        "X97.5th.percentile"     
#[16] "X99th.percentile"        "Comment"                

colnames(dat)[c(1,2,4,12)] <- c("Country","Year","Group","Fish")

#> levels(dat$Group)
#[1] "Adolescents"     "Adults"          "Elderly"         "Infants"         "Lactating women"
#[6] "Other children"  "Pregnant women"  "Toddlers"        "Very elderly"   

levels(dat$Group) <- c("Children","Adults","Elderly","Children","Pregnant women",
                       "Children","Pregnant women","Children","Elderly")
dat <- dat[dat$Fish !="Diadromous fish",] # Foodex2 level 7 entries
ggplot(dat,aes(x=Country, y=Mean, colour=Group))+
  geom_point()+coord_flip()+facet_wrap(~Fish, scales="free")+
  labs(title="Fish consumption in European countries",
       y="Average consumption (g/d)")

Dependencies

The survey data will be used as input in the benefit-risk assessment of Baltic herring and salmon intake, which is part of the WP5 work in Goherr-project.

Keywords

References

↑ ^1.0 ^1.1 Mia Pihlajamäki, Arja Asikainen, Suvi Ignatius, Päivi Haapasaari and Jouni T. Tuomisto. Forage Fish as Food: Consumer Perceptions on Baltic Herring. Sustainability 2019, 11(16), 4298; https://doi.org/10.3390/su11164298
↑ Tuomisto, J.T., Asikainen, A., Meriläinen, P., Haapasaari, P. Health effects of nutrients and environmental pollutants in Baltic herring and salmon: a quantitative benefit-risk assessment. BMC Public Health 20, 64 (2020). https://doi.org/10.1186/s12889-019-8094-1

Related files

Goherr: Fish consumption study. Opasnet . [29]. Accessed 15 May 2025.

[pihlajamaki2019-1] 1.0 ^1.1 Mia Pihlajamäki, Arja Asikainen, Suvi Ignatius, Päivi Haapasaari and Jouni T. Tuomisto. Forage Fish as Food: Consumer Perceptions on Baltic Herring. Sustainability 2019, 11(16), 4298; https://doi.org/10.3390/su11164298

[tuomisto2020-2] Tuomisto, J.T., Asikainen, A., Meriläinen, P., Haapasaari, P. Health effects of nutrients and environmental pollutants in Baltic herring and salmon: a quantitative benefit-risk assessment. BMC Public Health 20, 64 (2020). https://doi.org/10.1186/s12889-019-8094-1

[1]

[2]

Goherr: Fish consumption study: Difference between revisions

Latest revision as of 15:39, 12 August 2020

Contents

Question

Answer

Rationale

Data

Questionnaire

Assumptions

Preprocessing

Analyses

Figures, tables and stat analyses for the first manuscript

Descriptive statistics

Bayes model

Initiate ovariables

jsp taken directly from data WITHOUT salmpling

Amount estimated from a bayesian model

Amount estimates directly from data rather than from a bayesian model

Initiate other ovariables

Other code

Dependencies

See also

Keywords

References

Related files

Navigation menu

@@ Line 1: / Line 1: @@
-{{study|moderator=Arja|stub=Yes}}
+{{progression class
+|progression=Reviewed
+|curator=THL
+|date=2019-08-26
+}}
+{{study|moderator=Arja}}
+:''This page contains a detailed description about data management and analysis of an international survey related to scientific article ''Forage Fish as Food: Consumer Perceptions on Baltic Herring'' by Mia Pihlajamäki, Arja Asikainen, Suvi Ignatius, Päivi Haapasaari, and Jouni T. Tuomisto.<ref name="pihlajamaki2019">Mia Pihlajamäki, Arja Asikainen, Suvi Ignatius, Päivi Haapasaari and Jouni T. Tuomisto. Forage Fish as Food: Consumer Perceptions on Baltic Herring. Sustainability 2019, 11(16), 4298; https://doi.org/10.3390/su11164298</ref> The results of this survey where also used in another article ''Health effects of nutrients and environmental pollutants in Baltic herring and salmon: a quantitative benefit-risk assessment'' by the same group.<ref name="tuomisto2020">Tuomisto, J.T., Asikainen, A., Meriläinen, P., Haapasaari, P. Health effects of nutrients and environmental pollutants in Baltic herring and salmon: a quantitative benefit-risk assessment. BMC Public Health 20, 64 (2020). https://doi.org/10.1186/s12889-019-8094-1</ref>
 == Question ==
@@ Line 6: / Line 13: @@
 == Answer ==
-Original questionnaire analysis results
+* Model run with all the results of the article<ref name="pihlajamaki2019"/> [http://en.opasnet.org/en-opwiki/index.php?title=Special:RTools&id=B2jMOHfuUSmTjfrn 26.8.2018]
-*[http://en.opasnet.org/en-opwiki/index.php?title=Special:RTools&id=QaMJZqUX0cPaTfOF 13.3.2017] {{comment|# |These should be presented somewhere|--[[User:Arja|Arja]] ([[User talk:Arja|talk]]) 07:39, 26 April 2017 (UTC)}}
+* Original questionnaire analysis results [http://en.opasnet.org/en-opwiki/index.php?title=Special:RTools&id=QaMJZqUX0cPaTfOF 13.3.2017]
+* Consumption amount estimates
-Consumption amount estimates
+** Model run 21.4.2017 [http://en.opasnet.org/en-opwiki/index.php?title=Special:RTools&id=xc0kaCs8cgzpjwo9] first distribution
-* Model run 21.4.2017 [http://en.opasnet.org/en-opwiki/index.php?title=Special:RTools&id=xc0kaCs8cgzpjwo9] first distribution
+** Model run 18.5.2017 [http://en.opasnet.org/en-opwiki/index.php?title=Special:RTools&id=YnuQMDJTQgW1Se5a with modelled data]; [http://en.opasnet.org/en-opwiki/index.php?title=Special:RTools&id=KXXFiP0aj0DYEPdx with direct survey data]
-* Model run 18.5.2017 [http://en.opasnet.org/en-opwiki/index.php?title=Special:RTools&id=YnuQMDJTQgW1Se5a with modelled data]; [http://en.opasnet.org/en-opwiki/index.php?title=Special:RTools&id=KXXFiP0aj0DYEPdx with direct survey data]
-<rcode graphics=1 variables="
-name:usesurvey|description:Do you want to use directly the survey data rather than modelled data?|type:selection|options:
-	FALSE;No, use modelled data;TRUE;Yes, use survey data
-">
-# This is code Op_en7749/ on page [[Goherr: Fish consumption study#Answer]]
-library(OpasnetUtils)
-library(ggplot2)
-objects.latest("Op_en7749", code_name="initiate") # [[Goherr: Fish consumption study]] ovariables
-DecisionTableParser(opbase.data("Op_en7748", subset = "Decisions"))
-if(usesurvey) {
-  objects.latest("Op_en7749", code_name="surveyjsp") # jsp ovariable directly based on survey data (N=2217)
-  openv.setN(nrow(jsp@data))
-}
-amount <- EvalOutput(amount)
-if(usesurvey) {
-  oprint(summary(amount, marginals=c("Gender", "Country", "Fish","Ages","Foodpolicy")))
-  print(ggplot(amount@output, aes(x=amountResult+0.1, colour=Country))+stat_ecdf()+scale_x_log10()+facet_wrap(~ Fish)+
-    labs(x="Fish consumption (g /d)", y="Cumulative frequency")+theme_gray(base_size=24))
-  print(ggplot(amount@output, aes(x=amountResult+0.1, colour=Ages))+stat_ecdf()+scale_x_log10()+facet_grid(Country ~ Fish)+
-    labs(x="Fish consumption (g /d)", y="Cumulative frequency")+theme_gray(base_size=24))
-  print(ggplot(amount@output, aes(x=amountResult+0.1, colour=Gender))+stat_ecdf()+scale_x_log10()+facet_grid(Country ~ Fish)+
-    labs(x="Fish consumption (g /d)", y="Cumulative frequency")+theme_gray(base_size=24))
-  print(ggplot(often@output, aes(x=oftenResult+0.1, colour=Country))+stat_ecdf()+scale_x_log10()+facet_wrap(~ Fish))
-  print(ggplot(oftenside@output, aes(x=oftensideResult+0.1, colour=Country))+stat_ecdf()+scale_x_log10()+facet_wrap(~ Fish))
-  print(ggplot(much@output, aes(x=muchResult+0.1, colour=Country))+stat_ecdf()+scale_x_log10()+facet_wrap(~ Fish))
-  print(ggplot(muchside@output, aes(x=muchsideResult+0.1, colour=Country))+stat_ecdf()+scale_x_log10()+facet_wrap(~ Fish))
-} else {
-  oprint(summary(amount, marginals=c("Fish")))
-  print(ggplot(amount@output, aes(x=amountResult+0.1, colour=Fish))+stat_ecdf()+scale_x_log10()+
-    labs(x="Fish consumption (g /d)", y="Cumulative frequency")+theme_gray(base_size=24))
-  print(ggplot(often@output, aes(x=oftenResult+0.1, colour=Fish))+stat_ecdf()+scale_x_log10())
-  print(ggplot(oftenside@output, aes(x=oftensideResult+0.1, colour=Fish))+stat_ecdf()+scale_x_log10())
-  print(ggplot(much@output, aes(x=muchResult+0.1, colour=Fish))+stat_ecdf()+scale_x_log10())
-  print(ggplot(muchside@output, aes(x=muchsideResult+0.1, colour=Fish))+stat_ecdf()+scale_x_log10())
-}
-</rcode>
 == Rationale ==
@@ Line 288: / Line 247: @@
 This code is used to preprocess the original questionnaire data from the above .csv file and to store the data as a usable variable to Opasnet base. The code stores a data.frame named survey.
 * Model run 11.7.2018 [http://en.opasnet.org/en-opwiki/index.php?title=Special:RTools&id=Fa8TMg6h8DbEmhjp]
+* Model run 27.3.2019 with country codes DK EE FI SE [http://en.opasnet.org/en-opwiki/index.php?title=Special:RTools&id=6UJy6JKNE3EyNHEQ]
 <rcode name="preprocess2" label="Preprocess (only for developers)">
@@ Line 300: / Line 260: @@
 # Get the data either from Opasnet or your own hard drive.
-#survey1 original file: N:/Ymal/Projects/Goherr/WP5/Goherr_fish_consumption.csv
+#survey1 original fi_le: N:/Ymal/Projects/Goherr/WP5/Goherr_fish_consumption.csv
 survey1 <- opasnet.csv(
@@ Line 306: / Line 266: @@
    wiki = "opasnet_en", sep = ";", fill = TRUE, quote = "\""
 )
-#survey1 <- re#ad.csv(file = "N:/Ymal/Projects/Goherr/WP5/Goherr_fish_consumption.csv",
+#survey1 <- re#ad.csv(fi_le = "N:/Ymal/Projects/Goherr/WP5/Goherr_fish_consumption.csv",
 #                   header=FALSE, sep=";", fill = TRUE, quote="\"")
-# Data file is converted to data.frame using levels at row 2121.
+# Data fi_le is converted to data.frame using levels at row 2121.
 survey1 <- webropol.convert(survey1, 2121, textmark = ":Other open")
@@ Line 329: / Line 289: @@
    ifelse(as.numeric(as.character(survey1$Age)) < 46, "18-45",">45"),
    levels = c("18-45", ">45"), ordered = TRUE
+)
+survey1$Country <- factor(
+  survey1$Country,
+  levels=c("DK","EST","FI","SWE"),
+  labels=c("DK","EE","FI","SE")
 )
 # Anonymize data
@@ Line 450: / Line 415: @@
 * Sketches about modelling determinants of eating (spring 2018) [http://en.opasnet.org/en-opwiki/index.php?title=Benefit-risk_assessment_of_Baltic_herring_and_salmon_intake&oldid=41947#Sketches_about_modelling_determinants_of_eating]
-==== Figures for the first manuscript ====
+==== Figures, tables and stat analyses for the first manuscript ====
+* Model run 8.5.2019 with thlVerse code (not run on Opasnet) [http://en.opasnet.org/en-opwiki/index.php?title=Special:RTools&id=46qKuh08I7PHqXRX] [http://en.opasnet.org/en-opwiki/index.php?title=Special:RTools&id=lnq3yNAPXAJFxwaG]
+* Model run 26.8.2018 with fig 6 as table [http://en.opasnet.org/en-opwiki/index.php?title=Special:RTools&id=B2jMOHfuUSmTjfrn]
+* Previous model runs are [http://en.opasnet.org/en-opwiki/index.php?title=Goherr:_Fish_consumption_study&oldid=42828#Figures.2C_tables_and_stat_analyses_for_the_first_manuscript archived].
-* Model run 15.6.2018 [http://en.opasnet.org/en-opwiki/index.php?title=Special:RTools&id=eiVRACT5B9AUJMUF]
+Methodological concerns:
-* Model run 16.6.2018 [http://en.opasnet.org/en-opwiki/index.php?title=Special:RTools&id=RfEKhGyyVlMG6frc]
+* [https://stackoverflow.com/questions/12953045/warning-non-integer-successes-in-a-binomial-glm-survey-packages Is the warning in logistic regression important?]
-* Model run 20.6.2018 [http://en.opasnet.org/en-opwiki/index.php?title=Special:RTools&id=bWeOP4teZT5c4T9Z]
+* [https://stats.stackexchange.com/questions/46345/how-to-calculate-goodness-of-fit-in-glm-r Goodness of fit in logistic regression]
-* Model run 11.7.2018 [http://en.opasnet.org/en-opwiki/index.php?title=Special:RTools&id=yakkgFPuWKkXu3qF]
+* [https://stats.stackexchange.com/questions/8661/logistic-regression-in-r-odds-ratio Calculate odds ratios in logistic regression]
+* [https://www3.nd.edu/~rwilliam/stats3/ordinalindependent.pdf You might treat independent ordinal variables as continuous]
+* [https://www.color-blindness.com/coblis-color-blindness-simulator/ Color blindness simulator to adjust colors for the color blind and for black and white printing]
 <rcode graphics=1>
@@ Line 463: / Line 434: @@
 library(ggplot2)
 library(reshape2)
+library(MASS)
+#library(extrafont) # Needed to save Arial fonts to PDF or EPS
+#library(thlVerse)
 #library(car)
 #library(vegan)
 BS <- 24
+localcomp <- FALSE
+thl <- FALSE
-groups <- function(o) {
+# Set colours
+#library(thlVerse)
+#library(tidyverse)
+#colors <- c(
+#  thlColors(6,"quali","bar",thin=0.8),
+#  thlColors(6,"quali","bar",thin=1)
+#)[c(1,8,3,10,11,6)]
+#ggplot(tibble(A=as.character(1:6),B=1),aes(x=A,weight=B,fill=A))+
+#  geom_bar()+
+#  scale_fill_manual(values=colors)
+colors <- c("#74AF59FF","#2F62ADFF","#D692BDFF","#29A0C1FF","#BE3F72FF","#FBB848FF")
+########################## Functions
+#### thlLinePlot was adjusted to enable different point shapes
+thlLinePlot <- function (data, xvar, yvar, groupvar = NULL, ylabel = yvar,
+                         xlabel = NULL, colors = thlColors(n = 12, type = "quali",
+                                                           name = "line"), title = NULL, subtitle = NULL, caption = NULL,
+                         legend.position = "none", base.size = 16, linewidth = 3,
+                         show.grid.x = FALSE, show.grid.y = TRUE, lang = "fi", ylimits = NULL,
+                         marked.treshold = 10, plot.missing = FALSE, xaxis.breaks = waiver(),
+                         yaxis.breaks = waiver(), panels = FALSE, nrow.panels = 1,
+                         labels.end = FALSE)
+{
+  lwd <- thlPtsConvert(linewidth)
+  gg <- ggplot(data, aes_(x = substitute(xvar), y = substitute(yvar),
+                          group = ifelse(!is.null(substitute(groupvar)), substitute(groupvar),
+                                         NA), colour = ifelse(!is.null(substitute(groupvar)),
+                                                              substitute(groupvar), ""))) + geom_line(size = lwd)
+  if (isTRUE(plot.missing)) {
+    df <- thlNaLines(data = data, xvar = deparse(substitute(xvar)),
+                     yvar = deparse(substitute(yvar)), groupvar = unlist(ifelse(deparse(substitute(groupvar)) !=
+                                                                                  "NULL", deparse(substitute(groupvar)), list(NULL))))
+    if (!is.null(df)) {
+      gg <- gg + geom_line(data = df, aes_(x = substitute(xvar),
+                                           y = substitute(yvar), group = ifelse(!is.null(substitute(groupvar)),
+                                                                                substitute(groupvar), NA), colour = ifelse(!is.null(substitute(groupvar)),
+                                                                                                                           substitute(groupvar), "")), linetype = 2,
+                           size = lwd)
+    }
+  }
+  if (!is.null(marked.treshold)) {
+    if (length(unique(data[, deparse(substitute(xvar))])) >
+        marked.treshold) {
+      if (is.factor(data[, deparse(substitute(xvar))]) ||
+          is.character(data[, deparse(substitute(xvar))]) ||
+          is.logical(data[, deparse(substitute(xvar))])) {
+        levs <- levels(factor(data[, deparse(substitute(xvar))]))
+        min <- levs[1]
+        max <- levs[length(levs)]
+      }
+      else {
+        min <- min(data[, deparse(substitute(xvar))])
+        max <- max(data[, deparse(substitute(xvar))])
+      }
+      subdata <- data[c(data[, deparse(substitute(xvar))] %in%
+                          c(min, max)), ]
+      gg <- gg + geom_point(data = subdata, aes_(
+        x = substitute(xvar),
+        y = substitute(yvar),
+        group = ifelse(!is.null(substitute(groupvar)), substitute(groupvar), NA),
+        colour = ifelse(!is.null(substitute(groupvar)), substitute(groupvar), ""),
+        shape = substitute(groupvar)),
+        fill = "white",
+        stroke = 1.35 * lwd,
+        size = 10/3 * lwd)+scale_shape_manual(values=21:25)
+    }
+    else {
+      gg <- gg + geom_point(
+        aes_(
+          shape = substitute(groupvar)
+        ),
+        fill = "white",
+        stroke = 1.35 * lwd,
+        size = 10/3 * lwd)+scale_shape_manual(values=21:25)
+    }
+  }
+  if (isTRUE(labels.end)) {
+    if (is.factor(data[, deparse(substitute(xvar))]) ||
+        is.character(data[, deparse(substitute(xvar))]) ||
+        is.logical(data[, deparse(substitute(xvar))])) {
+      levs <- levels(factor(data[, deparse(substitute(xvar))]))
+      maxd <- data[data[, deparse(substitute(xvar))] ==
+                     levs[length(levs)], ]
+    }
+    else {
+      maxd <- data[data[, deparse(substitute(xvar))] ==
+                     max(data[, deparse(substitute(xvar))]), ]
+    }
+    brks <- maxd[, deparse(substitute(yvar))]
+    labsut <- maxd[, deparse(substitute(groupvar))]
+  }
+  else (brks <- labsut <- waiver())
+  gg <- gg + ylab(ifelse(deparse(substitute(ylabel)) == "yvar",
+                         deparse(substitute(yvar)), ylabel)) + labs(title = title,
+                                                                    subtitle = subtitle, caption = caption) + thlTheme(show.grid.y = show.grid.y,
+                                                                                                                       show.grid.x = show.grid.x, base.size = base.size, legend.position = legend.position,
+                                                                                                                       x.axis.title = ifelse(!is.null(xlabel), TRUE, FALSE)) +
+    xlab(ifelse(!is.null(xlabel), xlabel, "")) + scale_color_manual(values = colors) +
+    thlYaxisControl(lang = lang, limits = ylimits, breaks = yaxis.breaks,
+                    sec.axis = labels.end, sec.axis.breaks = brks, sec.axis.labels = labsut)
+  if (is.factor(data[, deparse(substitute(xvar))]) || is.character(data[,
+                                                                        deparse(substitute(xvar))]) || is.logical(data[, deparse(substitute(xvar))])) {
+    gg <- gg + scale_x_discrete(breaks = xaxis.breaks, expand = expand_scale(mult = c(0.05)))
+  }
+  else (gg <- gg + scale_x_continuous(breaks = xaxis.breaks))
+  if (isTRUE(panels)) {
+    fmla <- as.formula(paste0("~", substitute(groupvar)))
+    gg <- gg + facet_wrap(fmla, scales = "free", nrow = nrow.panels)
+  }
+  gg
+}
+groups <- function(o) {
    o$Group <- paste(o$Gender, o$Ages)
    o$Group <- factor(o$Group, levels = c(
@@ Line 478: / Line 570: @@
    return(o)
 }
+#' @title reasons produces a ggplot object for a graph that shows fractions of reasons mentioned in the data.
+#' @param dat a data.frame containing survey answers for the target subgroup.
+#' @param title an atomic string with the name to be shown as the title of graph
+#' @return ggplot object
 reasons <- function(
-   dat, # data.frame to be used
+   dat,
-   title # title on graph
+   title
 ) {
+  require(reshape2)
    weight <- Ovariable("weight",data=data.frame(
      Row=dat$Row,
@@ Line 499: / Line 597: @@
    tmp$Result <- (tmp$Result %in% c("Yes","1"))*1
    tmp <- weight * (Ovariable("tmp",data=tmp))
+  cat(title, ": number of individual answers.\n")
    oprint(oapply(tmp, c("tmpResult","Country"),length)@output)
@@ Line 506: / Line 605: @@
    tmp$Reason <- factor(tmp$Reason, levels=popu)
    levels(tmp$Reason) <- gsub("( BH| BS)", "", gsub("\\.", " ", levels(tmp$Reason)))
+  cat(title, ": fractions shown on graph.\n")
+  oprint(tmp@output)
-   tmp <- ggplot(tmp@output, aes(x=Reason, y=Result,colour=Country, group=Country))+
+   if(thl) {
-    geom_point(shape=21, size=6, fill="Grey", stroke=2)+
+    tmp <- thlLinePlot(tmp@output, xvar=Reason, yvar=Result,groupvar=Country,
-    geom_line()+
+                       colors= c("#519B2FFF", "#2F62ADFF", "#BE3F72FF","#88D0E6FF"), # #29A0C1FF"),
-    coord_flip()+
+                       # THL colors but fourth is brigter
-    theme_gray(base_size=BS)+
+                       legend.position = c(0.85,0.2), base.size = BS, title=title,
-    scale_y_continuous(labels=scales::percent_format())+
+                       subtitle="Fraction of population")+
-    labs(
+      coord_flip()+
-      title=title,
+      scale_y_continuous(labels=scales::percent_format(accuracy=1))
-      x="Answer",
+  } else {
-      y="Fraction of respondents")
+    tmp <- ggplot(tmp@output, aes(x=Reason, y=Result,colour=Country, group=Country))+
+      geom_point(shape=21, size=5, fill="Grey", stroke=2)+
+      geom_line(size=1.2)+
+      coord_flip()+
+      theme_gray(base_size=BS)+
+      scale_y_continuous(labels=scales::percent_format())+#accuracy=1))+
+      scale_colour_manual(values=colors)+
+      labs(
+        title=title,
+        x="Answer",
+        y="Fraction of population")
+  }
    return(tmp)
 }
-impacts <- function(dat,title) {
+#' @title Function impacts produces a graph showing the magnitude of impact if some things change
+#' @param dat data.frame containing answers
+#' @param title string to be used as the title of the graph
+#' @param population ovariable of country population weights
+#' @return ggplot object
+impacts <- function(dat,title, population) {
    tmp <- melt(
      dat,
-     id.vars = "Country",
+     id.vars = c("Country","Weighting"),
      variable.name ="Reason",
      value.name="Response"
@@ Line 552: / Line 670: @@
    )] <- "Increase"
-   popu <- aggregate(as.numeric(tmp$Response), tmp["Reason"],sum)
+  colnames(tmp)[colnames(tmp)=="Weighting"] <- "Result"
+  tmp <- Ovariable("tmp",data=tmp) * population
+   popu <- aggregate(as.numeric(tmp$Response), tmp@output["Reason"],sum)
    popu <- popu$Reason[order(popu$x)]
    tmp$Reason <- factor(tmp$Reason, levels=popu)
    levels(tmp$Reason) <- gsub("( BH| BS)", "", gsub("\\.", " ", levels(tmp$Reason)))
-  tmp$Result <- 1
+   tmp <- (oapply(tmp, c("Reason","Response"), sum)/
-  tmp <- EvalOutput(Ovariable("tmp",data=tmp))
+             oapply(tmp, c("Reason"),sum))
-   tmp <- (oapply(tmp, c("Country","Reason","Response"), sum)/
-             oapply(tmp, c("Country","Reason"),sum))
+  oprint(tmp@output)
-   tmp <- ggplot(tmp@output,
+   print(ggplot(tmp@output,
-         aes(x=Reason, weight=Result, fill=Response))+geom_bar()+
+               aes(x=Reason, weight=Result, fill=Response))+geom_bar()+
-    coord_flip()+facet_grid(.~Country)+
+          coord_flip()+#facet_grid(.~Country)+
-    theme_gray(base_size=BS)+
+          theme_gray(base_size=BS)+
-    theme(legend.position = "bottom")+
+          theme(legend.position = "bottom")+
-    scale_fill_manual(values=c("Grey","Green","Yellow","Red"))+
+          scale_fill_manual(values=c("Grey",colors[c(1,6,5)]))+
-    scale_y_continuous(breaks=c(0.5,1),labels=scales::percent_format())+
+          scale_y_continuous(breaks=c(0.5,1),labels=scales::percent_format())+
-    labs(
+          labs(
-      title=title,
+            title=title,
-      x="Cause",
+            x="Cause",
-      y="Fraction of respondents"
+            y="Fraction of population"
-    )
+          ))
+  return(tmp@output)
+}
+#' @title or95 prints the odds ratio and 95 % confidence interval of a glm fit
+#' @param fit a glm fit
+#' @return data.frame with OR and 95%CI
+or95 <- function(fit) {
+  cat("Odds ratios with 95 % confidence intervals\n")
+  tmp <- exp(cbind(coef(fit), confint(fit)))
+  tmp <- data.frame(
+    rownames(tmp),
+    paste0(sprintf("%.2f",tmp[,1]), " (",sprintf("%.2f", tmp[,2]),"-",sprintf("%.2f",tmp[,3]),")")
+  )
+  colnames(tmp) <- c("Parameter","OR (95% CI)")
    return(tmp)
 }
-################## Get data
+#' @title Perform and print regression analyses
+#' @param dat data.frame with data
+#' @param y name of column to be used as dependent variable
+#' @param countries logical atom to tell whether to run country-specific analyses
+#' @return returns the model fit object
-objects.latest("Op_en7749", "preprocess2") # [[Goherr: Fish consumption study]]: survey, surv
+printregr <- function(dat, y, title, countries = TRUE) {
+  dat$y <- dat[[y]]
+  cat("\n############### Logistic regression analysis:",title, "(", y, ")\n")
+  fit <- glm(
+    y ~ Ages + Gender + Country + as.numeric(Education) + as.numeric(Purchasing.power),
+    family=binomial(),na.action="na.omit",weights=Weighting,
+    data=dat
+  )
+  step <- stepAIC(fit, direction = "both")
+  oprint(step$anova)
+  oprint(summary(fit))
+  oprint(or95(fit))
+  if(countries) {
+    for(i in unique(dat$Country)) {
+      cat("\n#### Country-specific logistic regression analysis:", i, y, "\n")
+      fit <- glm(
+        Eat.fish ~ Ages + Gender + as.numeric(Education) + as.numeric(Purchasing.power),
+        family=binomial(),na.action="na.omit",weights=Weighting,
+        data=dat[dat$Country==i,]
+      )
+      step <- stepAIC(fit, direction = "both")
+      oprint(step$anova)
+      oprint(summary(fit))
+      oprint(or95(fit))
+    }
+  }
+  return(fit)
+}
-#### General fish eating
+#################### Data for Figure 1.
-su1 <- EvalOutput(Ovariable(
+if(thl) { # Commented out because needs package thlVerse.
-   "su1",
-   data = data.frame(
+  dat <- opbase.data("Op_en7749", subset="Fish consumption as food in Finland") # [[Goherr: Fish consumption study]]
-     survey1[c("Country","Gender","Education","Purchasing.power")],
+  dat$Year <- as.numeric(as.character(dat$Year))
-     Result=1
+  gr <- thlLinePlot(
+    dat[dat$Species=="Total",],
+    xvar=Year,
+    yvar=Result,
+    ylimits=c(0,12),
+    groupvar=Origin,
+    base.size=24,
+    title="Fish consumption in Finland",
+    ylabel="",
+    subtitle="(kg/a per person)",
+    legend.position = "bottom"
+  )
+  if(localcomp) ggsave("Figure 1.pdf", width=8,height=6)
+  if(localcomp) ggsave("Figure 1.png", width=8,height=6)
+} # End if
+################## Get data
+objects.latest("Op_en7749", "preprocess2") # [[Goherr: Fish consumption study]]: survey1, surv
+objects.latest("Op_en7749", "nonsamplejsp") # [[Goherr: Fish consumption study]]: jsp every respondent from data exactly once.
+objects.latest("Op_en7749", "initiate") # [[Goherr: Fish consumption study]]: amount etc.
+openv.setN(50)
+survey1 <- groups(survey1)
+#levels(survey1$Country) <- c("FI","SE","DK","EE") # WHY was this here? It is a potential hazard.
+effinfo <- 0 # No policies implemented.
+effrecomm <- 0
+population <- EvalOutput(Ovariable(
+  "population",
+  ddata = "Op_en7748",
+  subset = "Population"
+))
+levels(population$Country) <- c("DK","EE","FI","SE")
+population <- oapply(population, c("Country"),sum)
+amount <- EvalOutput(amount)
+#result(amount)[result(amount)==0] <- 0.1 Not needed if not log-transformed.
+amount <- amount * 365.25 / 1000 # g/day --> kg/year
+#### General fish eating
+su1 <- EvalOutput(Ovariable(
+   "su1",
+   data = data.frame(
+     survey1[c("Country","Gender","Education","Purchasing.power")],
+     Result=1
    )
 ))
-cat("Total number of answers in each country.\n")
+# Figure 1. Origin of consumed fish in Finland between 1999 and 2016.
-oprint(oapply(su1, "Country", sum)@output)#, caption="Total number of answers in each country")
+# Data not on this page, drawn separately.
-tmp <- (oapply(su1, c("Country","Gender"), sum)/oapply(su1,"Country",sum))@output
-cat("Fraction of genders\n")
-oprint(tmp[order(tmp$Country),])#, caption="Fraction of genders")
-tmp <- (oapply(su1, c("Country","Education"), sum)/oapply(su1,"Country",sum))@output
-cat("Fraction of education levels\n")
-oprint(tmp[order(tmp$Country),])#, caption="Fraction of education levels")
-tmp <- (oapply(su1, c("Country","Purchasing.power"), sum)/oapply(su1,"Country",sum))@output
-cat("Fraction of purchasing power\n")
-oprint(tmp[order(tmp$Country),])#, caption="Fraction of purchasing power")
-ggplot(survey1,
+# Table 1. Dimensions of embeddedness, modified from Hass (2007, p. 16)
-       aes(x=Eat.fish, group=Country), stat="count")+
+# Written directly on the manuscript.
-  geom_bar(aes(y=..prop.., fill=Country), position="dodge")+
-  theme_gray(base_size=BS)+
-  scale_y_continuous(labels=scales::percent_format())+
-  labs(title="Fraction of fish eaters within the study population")
-#### Baltic salmon
+# Table 2: statistics of the survey population in each country (n, female %, education, purchasing power)
-reasons(
+tmp <- round(t(data.frame(
-   survey1[survey1$Eat.fish=="No"& !is.na(survey1$Eat.fish),c(1,17:26,154,157)], # 1 Country, 8 Why don't you eat fish
+   n = tapply(rep(1,nrow(survey1)), survey1$Country,sum),
-   "Reasons not to eat among fish non-consumers"
+  Females = tapply(survey1$Gender=="Female", survey1$Country,mean)*100,
-)
+  Old = tapply(survey1$Ages==">45", survey1$Country,mean)*100,
-reasons(
+  PurcVlow = tapply(survey1$Purchasing.power=="Very low", survey1$Country,mean)*100,
-   survey1[survey1$Eat.salmon=="Yes"& !is.na(survey1$Eat.salmon),c(1,31:36,154,157)], # 1 Country, 11 Which salmon species
+  PurcLow = tapply(survey1$Purchasing.power=="Low", survey1$Country,mean)*100,
-   "Species used among salmon consumers"
+   PurcSuf = tapply(survey1$Purchasing.power=="Sufficient", survey1$Country,mean)*100,
-)
+  PurcGood = tapply(survey1$Purchasing.power=="Good", survey1$Country,mean)*100,
-reasons(
+   PurcVgood = tapply(survey1$Purchasing.power=="Very good", survey1$Country,mean)*100,
-  survey1[survey1$Baltic.salmon=="Yes"& !is.na(survey1$Baltic.salmon),c(1,50:59,154,157)], # 1 Country, 17 Why Baltic salmon
+  PurcExc = tapply(survey1$Purchasing.power=="Excellent", survey1$Country,mean)*100,
-   "Reasons to eat among Baltic salmon consumers"
+  EducPri = tapply(survey1$Education=="Primary education", survey1$Country,mean)*100,
-)
+  EducSec = tapply(survey1$Education=="Secondary education (gymnasium, vocational school or similar)", survey1$Country,mean)*100,
-reasons(
+   EducCol = tapply(survey1$Education=="Lower level college education or similar", survey1$Country,mean)*100,
-   survey1[survey1$Baltic.salmon=="No"& !is.na(survey1$Baltic.salmon),c(1,62:71,154,157)], # 1 Country, 18 Why not Baltic salmon
+  EducHig = tapply(survey1$Education=="Higher level college education or similar", survey1$Country,mean)*100
-   "Reasons not to eat among Baltic salmon non-consumers"
+)))
+rownames(tmp) <- c(
+  "Number of respondents",
+  "Females (%)",
+  ">45 years (%)",
+  "Purchasing power: Very low (%)",
+   "Low (%)",
+  "Sufficient (%)",
+  "Good (%)",
+   "Very good (%)",
+  "Excellent (%)",
+  "Education: Primary education (%)",
+  "Secondary education (%)",
+  "Lower level college (%)",
+   "Higher level college (%)"
 )
+oprint(tmp, digits=0)
-impacts(
+cat("Weighted fraction of fish eaters\n")
-  survey1[survey1$Baltic.salmon=="Yes"& !is.na(survey1$Baltic.salmon),c(1,73:82)],
+tmp <- aggregate(survey1$Weighting, survey1[c("Eat.fish","Country")],FUN=sum)
-  "Effect on Baltic salmon consumption"
+oprint(aggregate(tmp$x, tmp["Country"],FUN=function(x) x[2]/sum(x)))
-)
-####### Baltic herring
 ggplot(survey1,
-        aes(x=Country, weight=Weighting, fill=Eat.BH))+geom_bar(position="fill")+
+        aes(x=Eat.fish, weight=Weighting, group=Country), stat="count")+
-   coord_flip()+#facet_grid(.~Country)+
+   geom_bar(aes(y=..prop.., fill=Country), position="dodge")+
    theme_gray(base_size=BS)+
-   theme(legend.position = "bottom")+
+   scale_fill_manual(values=colors)+
    scale_y_continuous(labels=scales::percent_format())+
-  #  scale_fill_manual(values=c("Grey","Green","Yellow","Red"))+
+   labs(title="Fraction of fish eaters within population")
-   labs(title="Baltic herring consumption")
-levels(survey1$Recommendation.awareness) <- c("Not aware","Aware","Know content")
+############## Answers to open questions
-ggplot(survey1,
+if(FALSE){
-       aes(x=Country, weight=Weighting, fill=Recommendation.awareness))+geom_bar(position="fill")+
+  cat("Number of open-ended answers per question and country\n")
-  coord_flip()+#facet_grid(.~Country)+
-   theme_gray(base_size=BS)+
+  tmp <- survey1[c(1,3,27:28,36:37,44:45,60:61,71:72,83:84,93:94,109:110,121:122,133:134,154,157:158)]
-   theme(legend.position = "bottom")+
+  tmp <- reshape(
-   scale_y_continuous(labels=scales::percent_format())+
+    tmp,
-   #  scale_fill_manual(values=c("Grey","Green","Yellow","Red"))+
+    varying=list(
-  labs(title="Awareness of food recommendations about Baltic fish")
+      Condition = colnames(tmp)[grepl("other",tolower(colnames(tmp)))],
+      Text = colnames(tmp)[grepl("open",colnames(tmp))]
+    ),
+    times = colnames(tmp)[grepl("other",tolower(colnames(tmp)))],
+    direction="long"
+   )
+   tmp <- tmp[tmp$Why.not.fish.open!="" , ]
+   oprint(table(tmp$time,tmp$Country))
+   oprint(tmp)
+} # End if(FALSE)
+#### Baltic salmon
 reasons(
-   survey1[survey1$Eat.BH=="Yes"& !is.na(survey1$Eat.BH),c(1,99:108,154,157)], # 1 Country, 27 why Baltic herring
+   survey1[survey1$Eat.fish=="No"& !is.na(survey1$Eat.fish),c(1,17:26,154,157)], # 1 Country, 8 Why don't you eat fish
-  "Reasons to eat among Baltic herring consumers"
+   "Reasons not to eat among fish non-consumers"
-)
-reasons(
-  survey1[survey1$Eat.BH=="No"& !is.na(survey1$Eat.BH),c(1,111:120,154,157)], # 1 Country, 28 Why not Baltic herring
-   "Reasons not to eat among Baltic herring non-consumers"
 )
-impacts(
+# Figure 5
-  survey1[survey1$Eat.BH=="Yes"& !is.na(survey1$Eat.BH),c(1,123:132)], # 1 Country, 29 Influence of Baltic herring policies
-  title="Effect on Baltic salmon consumption"
-)
-objects.latest("Op_en7748", code_name="hia")
+cat("Figure 5 (to be converted to text)\n")
-tmp <- amount*info
+reasons(
-tmp <- tmp[
+  survey1[survey1$Eat.salmon=="Yes"& !is.na(survey1$Eat.salmon),c(1,31:36,154,157)], # 1 Country, 11 Which salmon species
-  tmp$Info.improvements=="BAU" &
+   "Species used among salmon consumers"
-    tmp$Recomm.herring=="BAU" &
+)
-    tmp$Recomm.salmon=="BAU" ,
-  ]
-tmp <- groups(oapply(tmp, c("Country","Gender","Ages","Fish"), FUN=mean)@output)
-ggplot(tmp, aes(x=Group, weight=Result, fill=Fish))+
-   geom_bar()+facet_grid(Country~Fish)+
-  theme_gray(base_size=BS)+
-  theme(axis.text.x = element_text(angle = -90))+
-  labs(
-    title="Fish consumption in subgroups",
-    y="Average consumption (g/day)")
-cat("Average fish consumption in subgroups\n")
+reasons(
-oprint(tmp)
+  survey1[survey1$Baltic.salmon=="Yes"& !is.na(survey1$Baltic.salmon),c(1,50:59,154,157)], # 1 Country, 17 Why Baltic salmon
-</rcode>
+  "Reasons to eat among Baltic salmon consumers"
+)
-==== Regression analysis, ANOVA, and Tukey ====
+if(localcomp) ggsave("Figure 5.pdf", width=10,height=5)
+if(localcomp) ggsave("Figure 5.png", width=10,height=5)
-{{argument|relat1=relevant attack|selftruth1=true|id=arg7420|type=|content=Work in progress.|sign=--[[User:Jouni|Jouni]] ([[User talk:Jouni|talk]]) 10:39, 8 August 2018 (UTC)}}
+reasons(
+  survey1[survey1$Baltic.salmon=="No"& !is.na(survey1$Baltic.salmon),c(1,62:70,154,157)], # 1 Country, 18 Why not Baltic salmon
+  "Reasons not to eat, Baltic salmon non-consumers"
+)
-<rcode graphics=1>
+if(localcomp) ggsave("Figure 6.pdf", width=10,height=5)
-# This is code Op_en7749/surveyjsp on page [[Goherr: Fish consumption study]]
+if(localcomp) ggsave("Figure 6.png", width=10,height=5)
-# The code produces amount esimates (jsp ovariable) directly from data rather than bayesian model.
-library(OpasnetUtils)
+impacts.sal <- impacts(
+  survey1[survey1$Baltic.salmon=="Yes"& !is.na(survey1$Baltic.salmon),c(1,73:82,154)],
+  "Effect on Baltic salmon consumption",
+  population
+)
-jsp <- Ovariable(
+####### Any herring
-  "jsp",
-  dependencies = data.frame(Name="survey1", Ident="Op_en7749/preprocess2"), # [[Goherr: Food cunsumption study]]
-  formula = function(...) {
-    require(reshape2)
-    sur <- survey1[c(157,1,3,158,162:178,81,82,131,132)] # Removed, not needed:16,29,30?
-    #colnames(sur)
+ggplot(survey1[survey1$Eat.fish=="Yes",],
-    #[1] "Row"                      "Country"                  "Gender"
+       aes(x=Country, weight=Weighting, fill=Eat.herring))+geom_bar(position="fill")+
-    #[4] "Ages"                     "Baltic.salmon.n"          "How.often.BS.n"
+  coord_flip()+#facet_grid(.~Country)+
-     #[7] "How.much.BS.n"            "How.often.side.BS.n"      "How.much.side.BS.n"
+  theme_gray(base_size=BS)+
-    #[10] "Better.availability.BS.n" "Less.chemicals.BS.n"      "Eat.BH.n"
+  theme(legend.position = "bottom")+
-    #[13] "How.often.BH.n"           "How.much.BH.n"            "How.often.side.BH.n"
+  scale_y_continuous(labels=scales::percent_format())+
-    #[16] "How.much.side.BH.n"       "Better.availability.BH.n" "Less.chemicals.BH.n"
+  scale_fill_manual(values=colors)+
-    #[19] "Eatfish"                  "Eatsalm"                  "Eatherr"
+  labs(
-    #[22] "Recommended.BS"           "Not.recommended.BS"       "Recommended.BH"
+     title="Any herring consumption",
-    #[25] "Not.recommended.BH"
+    y="Fraction of fish consumers"
+  )
-    # Make sure that Row is kept separate from Iter because in the sampling version they are different.
-    # sur contained columns Eat.fish, How.often.fish, Eat.salmon. Are these needed, as all other questions are melted? No
+####### Baltic herring
-    # Columns 5-7 removed, so colnames list above does not match.
+### Figure 2. Comparison of Baltic herring consumption habits.
-    colnames(sur)[5] <- "Eat.BS"
-    colnames(sur) <- gsub("\\.n","",colnames(sur))
+survey1$What <- ifelse(is.na(survey1$Eat.BH), "Other fish", as.character(survey1$Eat.BH))
-    sur[22:25] <- sapply(sur[22:25], as.numeric)
+survey1$What <- factor(
-     sur <- melt(
+  survey1$What,
-      sur,
+  levels=(c("Yes","I don't know","No","Other fish")),
-      id.vars=c(1:4),
+  labels=(c("Baltic herring","Some herring","Other herring","Other fish"))
-       variable.name="Question",
+)
-       value.name="Result"
+tmp <- survey1[survey1$Eat.fish=="Yes",]
+colnames(tmp)[colnames(tmp)=="Weighting"] <- "Result"
+tmp <- EvalOutput(Ovariable("tmp", data=tmp))
+tmp <- oapply(tmp, c("What","Country"), sum) / oapply(tmp,"Country",sum)
+tmp$Country <- factor(tmp$Country, levels=rev(levels(tmp$Country)))
+if(thl) {
+  thlBarPlot(tmp@output, xvar=Country, yvar=Result, groupvar=What, horizontal = TRUE, stacked = TRUE,
+             legend.position = "bottom",,
+             title="Baltic herring consumption",
+             subtitle="Fraction of fish consumers",
+  )+
+     scale_y_continuous(labels=scales::percent_format())
+} else {
+  ggplot(tmp@output, aes(x=Country, weight=Result, fill=What))+geom_bar(position="fill")+
+    coord_flip()+#facet_grid(.~Country)+
+    theme_gray(base_size=BS)+
+    theme(legend.position = "bottom")+
+    scale_y_continuous(labels=scales::percent_format())+
+    scale_fill_manual(values=c("gray",colors))+
+    guides(fill=guide_legend(reverse=TRUE))+
+    labs(
+       title="Baltic herring consumption",
+       y="Fraction of fish consumers"
      )
-    sur$Fish <- ifelse(grepl("BH",sur$Question),"Herring","Salmon")
-    sur$Question <- gsub("\\.BS","",sur$Question)
+}
-    sur$Question <- gsub("\\.BH","",sur$Question)
+if(localcomp) ggsave("Figure 2.pdf",width=10, height=5)
-    ########## If the missing values are not adjusted, they drop out in the next stage.
+if(localcomp) ggsave("Figure 2.png",width=6, height=3)
-    if(TRUE) {
-      # The adjustments below probably should go to the preprocess2 code.
+tmp <- survey1[survey1$Eat.fish=="Yes",]
-        sur$Result[sur$Question %in% c(
+tmp <- aggregate(tmp$Weighting, tmp[c("What","Country")], sum)
-          "Better.availability",
-          "Less.chemicals",
+rown <- tmp$What[tmp$Country=="FI"]
-          "Recommended",
+tmp <- aggregate(tmp$x, tmp["Country"],FUN=function(x) x/sum(x))
-          "Not.recommended"
+colnames(tmp[[2]]) <- rown
-        ) & is.na(sur$Result)] <- 4 # If missing --> no change
+oprint(as.matrix(tmp))
-        sur$Result[is.na(sur$Result)] <- 1 # Replace missing values with 1. That will produce 0 g/d.
-    }
-    return(Ovariable(
-      output = sur,
-      marginal = colnames(sur) %in% c("Fish", "Iter", "Question")
-    ))
-  }
-)
-objects.store(jsp)
+ggplot(survey1[survey1$Eat.herring=="Yes",],
-cat("Ovariable jsp with actual survey data: every respondent is kept in data without sampling.\n")
+       aes(x=Country, weight=Weighting, fill=Eat.BH))+geom_bar(position="fill")+
+  coord_flip()+#facet_grid(.~Country)+
+  theme_gray(base_size=BS)+
+  theme(legend.position = "bottom")+
+  scale_y_continuous(labels=scales::percent_format())+
+  scale_fill_manual(values=c(colors))+
+  labs(
+    title="Baltic herring consumption",
+    y="Fraction of herring consumers"
+  )
-############################
+####### Any salmon
-library(OpasnetUtils)
+ggplot(survey1[survey1$Eat.fish=="Yes",],
-library(ggplot2)
+       aes(x=Country, weight=Weighting, fill=Eat.salmon))+geom_bar(position="fill")+
+  coord_flip()+
+  theme_gray(base_size=BS)+
+  theme(legend.position = "bottom")+
+  scale_y_continuous(labels=scales::percent_format())+
+  scale_fill_manual(values=colors)+
+  labs(
+    title="Any salmon consumption",
+    y="Fraction of fish consumers"
+  )
-openv.setN(10)
+####### Baltic salmon
-objects.latest("Op_en7749",code_name="initiate")
+ggplot(survey1[survey1$Eat.fish=="Yes",],
+       aes(x=Country, weight=Weighting, fill=Baltic.salmon))+geom_bar(position="fill")+
+  coord_flip()+
+  theme_gray(base_size=BS)+
+  theme(legend.position = "bottom")+
+  scale_y_continuous(labels=scales::percent_format())+
+  scale_fill_manual(values=colors)+
+  labs(
+    title="Baltic salmon consumption",
+    y="Fraction of fish consumers"
+  )
-effinfo <- 0
+ggplot(survey1[survey1$Eat.salmon=="Yes",],
-effrecomm <- 0
+       aes(x=Country, weight=Weighting, fill=Baltic.salmon))+geom_bar(position="fill")+
-amount <- EvalOutput(amount)
+  coord_flip()+
-result(amount)[result(amount)==0] <- 0.1
+  theme_gray(base_size=BS)+
+  theme(legend.position = "bottom")+
+  scale_y_continuous(labels=scales::percent_format())+
+  scale_fill_manual(values=colors)+
+  labs(
+    title="Baltic salmon consumption",
+    y="Fraction of salmon consumers"
+  )
-dat <- merge(survey1,amount@output)
+############## Recommendation awareness
+levels(survey1$Recommendation.awareness) <- c("Not aware","Aware","Know content")
+ggplot(survey1,
+       aes(x=Country, weight=Weighting, fill=Recommendation.awareness))+geom_bar(position="fill")+
+  coord_flip()+#facet_grid(.~Country)+
+  theme_gray(base_size=BS)+
+  theme(legend.position = "bottom")+
+  scale_y_continuous(labels=scales::percent_format())+
+  scale_fill_manual(values=colors)+
+  labs(title="Awareness of food recommendations about Baltic fish")
-#table 2: statistics of the survey population in each country (n, female %, education, purchasing power)
+# Figure 3. Percentages of reasons to eat Baltic herring
-round(t(data.frame(
+reasons(
-  n = tapply(rep(1,nrow(survey1)), survey1$Country,sum),
+   survey1[survey1$Eat.BH=="Yes"& !is.na(survey1$Eat.BH),c(1,99:108,154,157)], # 1 Country, 27 why Baltic herring
-  Females = tapply(survey1$Gender=="Female", survey1$Country,mean)*100,
+   "Reasons to eat among Baltic herring consumers"
-   Old = tapply(survey1$Ages==">45", survey1$Country,mean)*100,
+)
-  PurcVlow = tapply(survey1$Purchasing.power=="Very low", survey1$Country,mean)*100,
-  PurcLow = tapply(survey1$Purchasing.power=="Low", survey1$Country,mean)*100,
-  PurcSuf = tapply(survey1$Purchasing.power=="Sufficient", survey1$Country,mean)*100,
-  PurcGood = tapply(survey1$Purchasing.power=="Good", survey1$Country,mean)*100,
-  PurcVgood = tapply(survey1$Purchasing.power=="Very good", survey1$Country,mean)*100,
-  PurcExc = tapply(survey1$Purchasing.power=="Excellent", survey1$Country,mean)*100,
-  EducPri = tapply(survey1$Education=="Primary education", survey1$Country,mean)*100,
-  EducSec = tapply(survey1$Education=="Secondary education (gymnasium, vocational school or similar)", survey1$Country,mean)*100,
-  EducCol = tapply(survey1$Education=="Lower level college education or similar", survey1$Country,mean)*100,
-   EducHig = tapply(survey1$Education=="Higher level college education or similar", survey1$Country,mean)*100
-)))
+if(localcomp) ggsave("Figure 3.pdf", width=10, height=5)
+if(localcomp) ggsave("Figure 3.png", width=10, height=5)
-# These should be corrected to preprocess2. NOT ordered.
+# Figure 4. Reasons for not to eat Baltic herring.
-dat$Country <- factor(dat$Country, ordered=FALSE)
+# Denmark and Estonia were omitted because they had less than 20 observations.
-dat$Ages <- factor(dat$Ages, ordered=FALSE)
-dat$Gender <- factor(dat$Gender, ordered=FALSE)
-ggplot(dat, aes(x=amountResult, weight=1, color=Country))+stat_ecdf()+facet_wrap(~Fish)+
+reasons(
-   scale_x_log10()
+  survey1[
+    survey1$Eat.BH=="No"&
+      survey1$Country %in% c("FI","SE") &
+      !is.na(survey1$Eat.BH),c(1,111:120,154,157)], # 1 Country, 28 Why not Baltic herring
+  "Reasons not to eat, Baltic herring non-consumers"
+)+scale_color_manual(values=c("#BE3F72FF","#88D0E6FF"))+
+   scale_shape_manual(values=23:24)
-out <- data.frame()
+if(localcomp) ggsave("Figure 4.pdf", width=10, height=5)
-for(i in unique(dat$Fish)) {
+if(localcomp) ggsave("Figure 4.png", width=10, height=5)
-#  for(j in unique(dat$Country)){
-    for(k in unique(dat$Iter)) {
-      res <- lm(log10(amountResult) ~ Gender + Ages + Country, data=dat[
-        dat$Iter==k & dat$Fish==i,]) #  & dat$Country==j
-      out <- rbind(out, data.frame(
-        summary(res)[[4]],
-        Fish=i,
-        Country=j,
-        Iter=k,
-        Topic=rownames(summary(res)[[4]])
-      ))
-    }
-#  }
-}
-ggplot(out, aes(x=paste(Fish, Country), y = Pr...t.., colour = Topic))+geom_jitter()
+# Figure 7. The role of different determinants on Baltic salmon consumption
-###################### The following is directly from AAH study. Adjust for this study.
+impacts.herr <- impacts(
+  survey1[survey1$Eat.BH=="Yes"& !is.na(survey1$Eat.BH),c(1,123:132,154)], # 1 Country, 29 Influence of Baltic herring policies
+  title="Effect on Baltic herring consumption",
+  population
+)
-################################# ANOVA AND TUKEY'S TEST
+tmp <- rbind(
+  data.frame(
+    Fish="Herring",
+    impacts.herr
+  ),
+  data.frame(
+    Fish="Salmon",
+    impacts.sal
+  )
+)
-#### test if means differ among gender, age or expertise within aah_average and nonaah_average and make a table
+oprint(aggregate(tmp$Result,tmp[c("Reason","Response","Fish")],sum))
-# make a list to store statistical results
+if(thl) {
-varnames <- c("aah_average", "nonaah_average", "nonaah.average12", "Social.average", "Enviro.average")
+  tmp$Response <- factor(tmp$Response, levels=rev(levels(tmp$Response)))
-explnames <- c("Gender", "Age", "Expertise", "Expertise2", "Pub.all.peer", "Pub.hum.peer", "Pub.all.popular",
+  thlBarPlot(aggregate(tmp$Result,tmp[c("Reason","Response","Fish")],sum),
-               "Pub.hum.popular", "Familiarity", "Teaching")
+             xvar=Reason, yvar=x, groupvar=Response, legend.position = "bottom",
-M <- matrix(data=NA, nrow=length(explnames), ncol=length(varnames))
+             horizontal = TRUE, stacked = TRUE,
-colnames(M) <- varnames
+             base.size = BS, title="Effect on Baltic fish consumption",
-rownames(M) <- explnames
+             subtitle="Fraction of population")+
-res.all <- apply(M, 1, as.list)
+    facet_grid(.~Fish)+
+     scale_fill_manual(values=rev(c("gray",colors[c(1,6,5)])))+
-# run the analyses of variance
+    scale_y_continuous(breaks=c(0.5,1),labels=scales::percent_format(accuracy=1))
-for(i in explnames) {
-  for(j in varnames) {
-     subset <- survey1[,c(i,j)]
-    colnames(subset) <- c("Explanation", "Variable")
-    #    subset$Variable <- as.numeric(subset$Variable)
-    res.aov <- aov(Variable ~ Explanation, data=subset)
-    res.tukey <- TukeyHSD(res.aov)
-    res.all[[i]][[j]][1:2] <- list(res.aov, res.tukey)
-  }
-}
-oprint(res.all)
-####################### Means and SDs of average indices by several determinants
-cat("Means and SDs of average indices by several determinants.\n")
-meantable <- function(col) {
-  out <-  aggregate(
-    survey1[varnames],
-    by = survey1[col],
-    FUN = function(x) {
-      paste(
-        sprintf("%.2f", mean(as.numeric(x), na.rm = TRUE)), " (",
-        sprintf("%.2f", sd(as.numeric(x), na.rm = TRUE)), ")", sep=""
-      )
-    }
-  )
-   colnames(out)[1] <- "Subgroup"
+} else {
-  out <- rbind(
+   ggplot(tmp, aes(x=Reason, weight=Result, fill=Response))+geom_bar()+
-     data.frame( # significant p values summarised
+    coord_flip()+facet_grid(.~Fish)+
-      Subgroup = col,
+     theme_gray(base_size=BS)+
-      sapply(
+    theme(legend.position = "bottom")+
-        varnames,
+    scale_fill_manual(values=c("gray",colors[c(1,6,5)]))+
-        FUN = function(x) {
+    scale_y_continuous(breaks=c(0.5,1),labels=scales::percent_format())+#accuracy=1))+
-          pees <- res.all[[col]][[x]][2][[1]][[1]][,4] # p values from ANOVA/Tukey
+    guides(fill=guide_legend(reverse=TRUE))+
-          pees <- pees[pees < 0.05]
+    labs(
-          pees <- paste(
+      title="Effect on Baltic fish consumption",
-            paste(
+      x="Cause",
-              names(pees),
+       y="Fraction of population"
-              cut(pees, breaks = c(-1,0.001,0.01, 0.05), labels = c("***", "**", "*"))
+     )
-            ), collapse = " "
-          )
-        },
-        simplify = FALSE
-       )
-     ),
-    out # means and sd's
-  )
-  return(out)
 }
-means <- rbind(
-  meantable("Expertise"),
-  meantable("Familiarity"),
-  meantable("Gender"),
-  meantable("Age"),
-  meantable("Pub.hum.peer"),
-  meantable("Teaching")
-)
-oprint(means)
+if(localcomp) ggsave("Figure 7.pdf", width=12, height=7)
+if(localcomp) ggsave("Figure 7.png", width=12, height=7)
-############################################# ANOVA TUKEY WITH INTERACTIONS
+######### Amounts estimated for each respondent
-res.aov <- aov(aah_average ~ Teaching*Expertise, data=survey1)
+amount <- groups(amount)
-res.tukey <- TukeyHSD(res.aov)
-oprint(list(res.aov, res.tukey))
-res.aov <- aov(aah_average ~ Familiarity*Expertise, data=survey1)
+#tmp <- amount*info
-res.tukey <- TukeyHSD(res.aov)
+#tmp <- tmp[
-oprint(list(res.aov, res.tukey))
+#  tmp$Info.improvements=="BAU" &
+#    tmp$Recomm.herring=="BAU" &
+#    tmp$Recomm.salmon=="BAU" ,
+#  ]
+tmp <- cbind(
+  oapply(amount, c("Country","Group","Gender","Ages","Fish"), FUN=mean)@output,
+  SD=oapply(amount, c("Country","Group","Gender","Ages","Fish"), FUN=sd)$amountResult
+)
-############################################# TUKEY INTERACTIONS IN NICE TABLES
+# Figure 3 (old, unused). Average consumption (kg/year) of Baltic herring in four countries,
+# calculated with Monte Carlo simulation (1000 iterations).
-res.aov <- aov(aah_average ~ Familiarity*Expertise, data=survey1)
+ggplot(tmp[tmp$Fish=="Herring",], aes(x=Group, weight=amountResult, fill=Group))+
-res.tukey <- TukeyHSD(res.aov)
+  geom_bar()+facet_wrap(~Country)+
-out <- res.tukey[[3]][order(dimnames(res.tukey[[3]])[[1]]),]
+  theme_gray(base_size=BS)+
+  scale_fill_manual(values=colors)+
+  guides(fill=FALSE)+
+  theme(axis.text.x = element_text(angle = -90))+
+  labs(
+    title="Baltic herring consumption in subgroups",
+    y="Average consumption (kg/year)")
+# if(localcomp) ggsave("Figure3.pdf", width=9, height=10)
+# if(localcomp) ggsave("Figure3.png", width=9, height=10)
+tmp$Result <- paste0(sprintf("%.1f",tmp$amountResult), " (",sprintf("%.1f", tmp$SD,1),")")
+tmp <- reshape(tmp, v.names="Result", timevar="Country", idvar=c("Fish","Group"),drop=c("Gender","Ages","amountResult","SD"), direction="wide")
+colnames(tmp) <- gsub("Result\\.", "", colnames(tmp))
+cat("Average fish consumption in subgroups, mean (sd)\n")
+oprint(tmp)
-test <- lapply( # Split the comparison items
+weight <- EvalOutput(Ovariable("weight",data=data.frame(
-  strsplit(dimnames(out)[[1]], split="-"),
+   amount@output[c("Row","Iter","Fish","Country")],
-   FUN = function(x) strsplit(x, split = ":")
+  Result=amount$Weighting
-)
+)))
-oprint(out[sapply(test, FUN = function(x) x[[1]][1] == x[[2]][1]),]) # Matching Familiarity
-oprint(out[sapply(test, FUN = function(x) x[[1]][2] == x[[2]][2]),]) # Matching Expertise
-</rcode>
+tmp <- (oapply(amount * weight, c("Fish","Country"), sum) / oapply(weight,c("Fish","Country"), sum))@output
-==== Descriptive statistics ====
+cat("Average fish consumption per country (kg/year)\n")
+oprint(tmp)
-[[File:Goherr survey correlations.png|thumb|300px|Correlation matrix of all questions in the survey (answers converted to numbers).]]
+result(amount)[result(amount)==0] <- 0.1
+ggplot(amount@output, aes(x=amountResult, colour=Group))+stat_ecdf()+scale_x_log10()+
+  facet_wrap(~Fish)
-Model must contain predictors such as country, gender, age etc. Maybe we should first study what determinants are important?
+############################
-Model must also contain determinants that would increase or decrease fish consumption. This should be conditional on the current consumption. How?
-Maybe we should look at principal coordinates analysis with all questions to see how they behave.
-Also look at correlation table to see clusters.
+#### Statistical analyses
-Some obvious results:
+dat <- merge(survey1,amount@output) # [amount$Fish=="Herring",]) # This results in 50 * 2 times of rows
-* If reports no fish eating, many subsequent answers are NA.
-* No vitamins correlates negatively with vitamin intake.
-* Unknown salmon correlates negatively with the types of salmon eaten.
-* Different age categories correlate with each other.
-However, there are also meaningful negative correlations:
+# These should be corrected to preprocess2. NOT ordered.
-* Country vs allergy
+dat$Country <- factor(dat$Country, ordered=FALSE)
-* Country vs Norwegian salmon and Rainbow trout
+dat$Ages <- factor(dat$Ages, ordered=FALSE)
-* Country vs not traditional.
+dat$Gender <- factor(dat$Gender, ordered=FALSE)
-* Country vs recommendation awareness
-* Allergy vs economic wellbeing
-* Baltic salmon use (4 questions) vs Don't like taste and Not used to
-* All questions between Easy to cook ... Traditional dish
-*
-Meaningful positive correlations:
+ggplot(dat, aes(x=amountResult, weight=Weighting, color=Country))+stat_ecdf()+facet_wrap(~Fish)+
-* All questions between Baltic salmon ... Rainbow trout
+  scale_x_log10()
-* How often Baltic salmon/herring/side salmon/side herring
-* How much  Baltic salmon/herring/side salmon/side herring
-* Better availability ... Recommendation
-* All questions between Economic wellbeing...Personal aims
-* Omega3, Vitamin D, and Other vitamins
-Model runs
+gro <- unique(dat$Group)
-* [http://en.opasnet.org/en-opwiki/index.php?title=Special:RTools&id=QaMJZqUX0cPaTfOF Model run 13.3.2017]
+out <- list(data.frame(), data.frame())
-* Model run 21.4.2017 [http://en.opasnet.org/en-opwiki/index.php?title=Special:RTools&id=0Lmu6OVAjomgJwAN] old code from Answer merged to this code and debugged
+for(i in unique(dat$Fish)) {
+  for(j in unique(dat$Country)){
-<rcode label="Descriptive statistics" graphics=1>
+    for(k in unique(dat$Iter)) {
-# This is code Op_en7749/ on page [[Goherr: Fish consumption study]]
+      res <- kruskal.test(amountResult ~ Group, data=dat[
+        dat$Iter==k  & dat$Country==j & dat$Fish==i,])
+      out[[1]] <- rbind(out[[1]], data.frame(
+        test="Kruskal-Wallis",
+        Fish=i,
+        Country=j,
+        Iter=k,
+        p.value=res[[3]])
+      )
+      for(l in 1:(length(gro)-1)) {
+        for(m in (l+1):length(gro)) {
+          res2 <- wilcox.test(
+            x=dat$amountResult[dat$Iter==k  & dat$Country==j & dat$Fish==i & dat$Group==gro[l]],
+            y=dat$amountResult[dat$Iter==k  & dat$Country==j & dat$Fish==i & dat$Group==gro[m]], conf.int = FALSE)
+          out[[2]] <- rbind(out[[2]], data.frame(
+            Fish=i,
+            Country=j,
+            Iter=k,
+            Pair1=gro[l],
+            Pair2=gro[m],
+            p.value = res2$p.value
+          ))
+        }
+      }
+    }
+  }
+}
-library(OpasnetUtils)
+ggplot(out[[1]], aes(x=paste(Fish, Country), y = p.value, colour = Country))+geom_jitter()+
-library(ggplot2)
+  geom_hline(yintercept=0.05)+coord_flip()+theme_gray(base_size=BS)+
-library(reshape2)
+  labs(title="Kruskal-Wallis non-parametric test across all groups")
-library(car)
+ggplot(out[[2]], aes(x=paste(Fish, Country), y=p.value, colour=Pair1, shape=Pair2))+geom_jitter()+
-library(vegan)
+  geom_hline(yintercept=0.05)+coord_flip()+theme_gray(base_size=BS)+
+  labs(title="Mann-Whitney U test across all pairs of groups")
-objects.latest("Op_en7749", "preprocess2") # [[Goherr: Fish consumption study]]: survey, surv
+cat("Kurskal-Wallis non-parametric test for 50 iterations of amount\n")
+oprint(aggregate(out[[1]]["p.value"], out[[1]][c("Country","Fish")], mean))
-############################### From a previous code on Answer
+cat("Mann-Whitney U non-parametric test for 50 iterations of amount\n")
+oprint(aggregate(out[[2]]["p.value"], out[[2]][c("Pair1","Pair2","Country","Fish")], mean))
-for(i in c(5:6, 16, 29:30, 46:49, 85:86, 95:98, 135)) {
+###################### Logistic regression
-  temp <- survey[!is.na(survey[[i]]),]
-  p <- ggplot(temp, aes(x = temp[[i]])) + geom_bar() +
-    theme_gray(base_size = 18) + theme(axis.text.x=element_text(angle=90,hjust=1,vjust=0.4)) +
-    labs(title = colnames(temp[i])) + xlab("") + facet_wrap(~ Country)
-  print(p)
-}
-temp <- melt(survey, measure.vars = 145:153, variable.name = "Changing.factor", value.name = "Impact")
+dat2 <- dat[dat$Iter==1 & dat$Fish=="Herring",]
-levs <- c("-5 strongly disagree", "-4", "-3", "-2", "-1", "0 Neutral", "1", "2", "3", "4", "5 strongly agree", "I don't know")
-temp$Impact <- factor(temp$Impact, levels = levs, labels = levs, ordered = TRUE)
-ggplot(temp, aes(x = Impact, fill = Country)) + geom_bar(position = "dodge") + facet_wrap(~ Changing.factor)+
+tmp <- printregr(dat2, "Eat.fish", "What explains whether people eat fish at all?")
-  theme_gray(base_size = 24) + theme(axis.text.x=element_text(angle=90,hjust=1,vjust=0.4))
-surveytemp <- subset(survey, survey[[31]] == "Yes")
+tmp <- printregr(dat2, "Eat.herring", "What explains whether people eat herring compared with other fish?")
-temp <- melt(surveytemp, measure.vars = 38:43, variable.name = "Variable", value.name = "Value")
+dat2$Eat.BH2 <- ifelse(dat2$Eat.BH!="Yes" | is.na(dat2$Eat.BH),0,1)
-ggplot(temp, aes(x = Value, fill = Country)) + geom_bar(position = "dodge") + facet_wrap(~ Variable)+
+tmp <- printregr(dat2, "Eat.BH2", "What explains whether people eat Baltic herring compared with everyone else?")
-  theme_gray(base_size = 24) + labs(title = "Baltic salmon sources")
-temp <- melt(surveytemp, measure.vars = 50:59, variable.name = "Variable", value.name = "Value")
+dat2$What2 <- ifelse(dat2$What=="Baltic herring",1,ifelse(dat2$What=="Other herring",0,NA))
-ggplot(temp, aes(x = Value, fill = Country)) + geom_bar(position = "dodge") + facet_wrap(~ Variable)+
+tmp <- printregr(dat2, "What2", "What explains whether people eat Baltic herring compared with other herring?")
-  theme_gray(base_size = 24) + labs(title = "Reasons to eat Baltic salmon")
-temp <- melt(surveytemp, measure.vars = 73:82, variable.name = "Variable", value.name = "Value")
-ggplot(temp, aes(x = Value, fill = Country)) + geom_bar(position = "dodge") + facet_wrap(~ Variable)+
-  theme_gray(base_size = 24) + theme(axis.text.x=element_text(angle=90,hjust=1,vjust=0.4)) + labs(title = "Baltic salmon actions")
-surveytemp <- subset(survey, survey[[31]] == "No")
+cat("###################### How fish-specific causes are explained by
+    population determinants?\n")
-temp <- melt(surveytemp, measure.vars = 62:70, variable.name = "Variable", value.name = "Value")
-ggplot(temp, aes(x = Value, fill = Country)) + geom_bar(position = "dodge") + facet_wrap(~ Variable)+
-  theme_gray(base_size = 24) + labs(title = "Reasons not to eat Baltic salmon")
+for(j in c(
+  "Tastes.good.BH",
+  "Self.caught.BH",
+  "Easy.to.cook.BH",
+  "Quick.to.cook.BH",
+  "Readily.available.BH",
+  "Healthy.BH",
+  "Inexpensive.BH",
+  "Family.likes.it.BH",
+  "Environmental.BH",
+  "Traditional.BH"
+)) {
+  tmp <- printregr(dat2, j, "What explains the reason to eat Baltic herring?")
+}
-surveytemp <- subset(survey, survey[[86]] == "Yes")
+for(j in c(
+  "Bad.taste.BH",
+  "Not.used.to.BH",
+  "Cannot.cook.BH",
+  "Difficult.to.cook.BH",
+  "Not.available.BH",
+  "Health.risks.BH",
+  "Better.for.feed.BH",
+  "Quality.issues.BH",
+  "Sustainability.BH",
+  "Not.traditional.BH"
+)) {
+  tmp <- printregr(dat2, j, "What explains the reason to not eat Baltic herring?")
+}
-temp <- melt(surveytemp, measure.vars = 87:92, variable.name = "Variable", value.name = "Value")
+# tmp <- printregr(dat2, "Eat.salmon", "What explains whether people eat any salmon compared with other fish?")
-ggplot(temp, aes(x = Value, fill = Country)) + geom_bar(position = "dodge") + facet_wrap(~ Variable)+
+# tmp <- printregr(dat2, "Baltic.salmon", "What explains whether people eat Baltic salmon compared with other salmon?")
-  theme_gray(base_size = 24) + labs(title = "Baltic herring sources")
+</rcode>
-temp <- melt(surveytemp, measure.vars = 99:108, variable.name = "Variable", value.name = "Value")
+Luke data about fish consumption in Finland [https://stat.luke.fi/en/fish-consumption-2017_en][http://statdb.luke.fi/PXWeb/pxweb/en/LUKE/LUKE__06%20Kala%20ja%20riista__06%20Muut__02%20Kalan%20kulutus/2_Kalankulutus.px/table/tableViewLayout1/?rxid=dc711a9e-de6d-454b-82c2-74ff79a3a5e0]
-ggplot(temp, aes(x = Value, fill = Country)) + geom_bar(position = "dodge") + facet_wrap(~ Variable)+
+<t2b name="Fish consumption as food in Finland" index="Origin,Species,Year" locations="1999,2000,2001,2002,2003,2004,2005,2006,2007,2008,2009,2010,2011,2012,2013,2014,2015,2016,2017" unit="kg/a per person">
-  theme_gray(base_size = 24) + labs(title = "Reasons to eat Baltic herring")
+domestic fish|Total|6.1|6.1|5.9|6.2|5.8|5.3|5.2|5.0|5.0|4.4|4.5|4.3|3.8|3.8|3.8|4.0|4.1|4.1|4.1
+domestic fish|Farmed rainbow trout|1.6|1.6|1.6|1.6|1.3|1.3|1.4|1.1|1.1|1.2|1.2|1.1|1.0|1.0|1.1|1.1|1.3|1.2|1.2
+domestic fish|Baltic herring|0.8|1.2|1.1|1.1|0.9|0.8|0.7|0.5|0.4|0.4|0.4|0.3|0.3|0.3|0.3|0.3|0.3|0.3|0.3
+domestic fish|Pike|0.8|0.7|0.7|0.7|0.6|0.7|0.7|0.8|0.7|0.6|0.6|0.6|0.5|0.4|0.4|0.5|0.5|0.4|0.4
+domestic fish|Perch|0.7|0.7|0.7|0.7|0.6|0.6|0.6|0.7|0.7|0.5|0.5|0.5|0.4|0.4|0.4|0.5|0.5|0.4|0.4
+domestic fish|Vendace|0.7|0.7|0.7|0.7|0.8|0.8|0.7|0.6|0.6|0.6|0.6|0.7|0.6|0.6|0.6|0.6|0.6|0.5|0.6
+domestic fish|European whitefish|0.4|0.4|0.4|0.4|0.3|0.3|0.3|0.3|0.5|0.3|0.3|0.3|0.3|0.3|0.3|0.3|0.2|0.3|0.3
+domestic fish|Pike perch|0.3|0.2|0.2|0.2|0.3|0.3|0.3|0.3|0.3|0.3|0.3|0.3|0.3|0.3|0.3|0.3|0.3|0.4|0.4
+domestic fish|Other domestic fish|0.8|0.6|0.5|0.8|1.0|0.5|0.5|0.7|0.7|0.5|0.6|0.5|0.4|0.5|0.4|0.5|0.4|0.5|0.5
+imported fish|Total|6.0|6.2|7.0|7.1|8.0|8.6|7.9|8.6|9.7|9.7|9.3|10.2|11.1|10.9|10.8|10.9|10.2|9.1|9.8
+imported fish|Farmed rainbow trout|0.2|0.3|0.4|0.6|0.9|0.6|0.6|0.7|1.0|1.0|0.8|0.8|0.9|1.0|0.9|0.9|0.8|0.9|0.8
+imported fish|Farmed salmon|1.0|0.9|1.2|1.3|1.6|2.2|1.9|2.0|2.7|2.6|2.9|3.1|3.9|4.2|4.0|4.4|4.1|3.5|4.0
+imported fish|Tuna (prepared and preserved)|1.0|1.2|1.4|1.4|1.5|1.6|1.6|1.5|1.7|1.7|1.6|1.7|1.7|1.6|1.9|1.7|1.6|1.4|1.5
+imported fish|Saithe (frozen fillet)|0.7|0.6|0.7|0.7|0.7|0.6|0.4|0.5|0.5|0.5|0.5|0.5|0.6|0.5|0.5|0.5|0.5|0.4|0.4
+imported fish|Shrimps|0.5|0.4|0.5|0.5|0.5|0.5|0.5|0.5|0.6|0.6|0.6|0.7|0.7|0.6|0.6|0.5|0.5|0.4|0.4
+imported fish|Herring and Baltic herring (preserved)|0.6|0.5|0.6|0.6|0.5|0.4|0.5|0.6|0.5|0.6|0.3|0.5|0.4|0.3|0.4|0.5|0.5|0.5|0.5
+imported fish|Other imported fish|2.0|2.3|2.2|2.0|2.3|2.7|2.4|2.8|2.7|2.7|2.6|2.9|2.9|2.7|2.5|2.3|2.3|1.9|2.2
+</t2b>
-temp <- melt(surveytemp, measure.vars = 123:132, variable.name = "Variable", value.name = "Value")
+==== Descriptive statistics ====
-ggplot(temp, aes(x = Value, fill = Country)) + geom_bar(position = "dodge") + facet_wrap(~ Variable)+
-  theme_gray(base_size = 24) + theme(axis.text.x=element_text(angle=90,hjust=1,vjust=0.4)) + labs(title = "Baltic herring actions")
-surveytemp <- subset(survey, survey[[86]] == "No")
+[[File:Goherr survey correlations.png|thumb|300px|Correlation matrix of all questions in the survey (answers converted to numbers).]]
-temp <- melt(surveytemp, measure.vars = 111:120, variable.name = "Variable", value.name = "Value")
+Model must contain predictors such as country, gender, age etc. Maybe we should first study what determinants are important?
-ggplot(temp, aes(x = Value, fill = Country)) + geom_bar(position = "dodge") + facet_wrap(~ Variable)+
+Model must also contain determinants that would increase or decrease fish consumption. This should be conditional on the current consumption. How?
-  theme_gray(base_size = 24) + labs(title = "Reasons not to eat Baltic herring")
+Maybe we should look at principal coordinates analysis with all questions to see how they behave.
-#####################################3
+Also look at correlation table to see clusters.
-temp <- sapply(survey, as.numeric) # Can be done for surv to get a smaller matrix
+Some obvious results:
+* If reports no fish eating, many subsequent answers are NA.
+* No vitamins correlates negatively with vitamin intake.
+* Unknown salmon correlates negatively with the types of salmon eaten.
+* Different age categories correlate with each other.
-survey_correlations <- (cor(temp, method="spearman", use="pairwise.complete.obs"))
+However, there are also meaningful negative correlations:
+* Country vs allergy
-temp <- colnames(survey_correlations)
+* Country vs Norwegian salmon and Rainbow trout
+* Country vs not traditional.
-melted_correlations <- melt(survey_correlations)
+* Country vs recommendation awareness
+* Allergy vs economic wellbeing
+* Baltic salmon use (4 questions) vs Don't like taste and Not used to
+* All questions between Easy to cook ... Traditional dish
+*
-melted_correlations$Var1 <- factor(melted_correlations$Var1, levels=temp)
+Meaningful positive correlations:
-melted_correlations$Var2 <- factor(melted_correlations$Var2, levels=temp)
+* All questions between Baltic salmon ... Rainbow trout
-melted_correlations$value <- ifelse(melted_correlations$value >= 0.99,NA,melted_correlations$value)
+* How often Baltic salmon/herring/side salmon/side herring
+* How much  Baltic salmon/herring/side salmon/side herring
+* Better availability ... Recommendation
+* All questions between Economic wellbeing...Personal aims
+* Omega3, Vitamin D, and Other vitamins
-ggplot(melted_correlations, aes(x = Var1, y = Var2, fill = value, label= round(value, 2)))+
+Model runs
-  geom_raster()+
+* [http://en.opasnet.org/en-opwiki/index.php?title=Special:RTools&id=QaMJZqUX0cPaTfOF Model run 13.3.2017]
-  theme(axis.text.x=element_text(angle=90,hjust=1,vjust=0.4))+
+* Model run 21.4.2017 [http://en.opasnet.org/en-opwiki/index.php?title=Special:RTools&id=0Lmu6OVAjomgJwAN] old code from Answer merged to this code and debugged
-  scale_fill_gradient2(low = "#480610", mid = "#FFFFFF", high = "#06480F", midpoint = 0, space = "Lab", guide = "colourbar")
-####################### Descriptive statistics
+<rcode label="Descriptive statistics" graphics=1>
+# This is code Op_en7749/ on page [[Goherr: Fish consumption study]]
-oprint(cor(surv, use = "pairwise.complete.obs"))
+library(OpasnetUtils)
-# --> Baltic salmon and herring eating are correlated, so they should be estimated together
+library(ggplot2)
+library(reshape2)
+library(car)
+library(vegan)
-oprint(table(surv[c(12,7,4)], useNA = "ifany"))
+objects.latest("Op_en7749", "preprocess2") # [[Goherr: Fish consumption study]]: survey, surv
-oprint(table(surv[c(13,12)], useNA = "ifany"))
-############################# Plot original data
+############################### From a previous code on Answer
-# Eating frequencies of fish and Baltic salmon and herring with random noise, all
+for(i in c(5:6, 16, 29:30, 46:49, 85:86, 95:98, 135)) {
-pl <- surv[surv$Eatfish,c(5,8,10,13,15)]
+  temp <- survey[!is.na(survey[[i]]),]
-scatterplotMatrix(
+   p <- ggplot(temp, aes(x = temp[[i]])) + geom_bar() +
-   data.matrix(pl) + runif(nrow(pl)*ncol(pl), -0.5, 0.5)
+    theme_gray(base_size = 18) + theme(axis.text.x=element_text(angle=90,hjust=1,vjust=0.4)) +
-)
+    labs(title = colnames(temp[i])) + xlab("") + facet_wrap(~ Country)
+  print(p)
+}
-ggplot(data.frame(
+temp <- melt(survey, measure.vars = 145:153, variable.name = "Changing.factor", value.name = "Impact")
-  X = rep(1,5), Y = 5:1,
+levs <- c("-5 strongly disagree", "-4", "-3", "-2", "-1", "0 Neutral", "1", "2", "3", "4", "5 strongly agree", "I don't know")
-  legend = c("All", "Finland", "Sweden", "Denmark", "Estonia")
+temp$Impact <- factor(temp$Impact, levels = levs, labels = levs, ordered = TRUE)
-), aes(x=X, y=Y, label=legend))+
-  geom_text()+
-  labs(title="Fish eating questions with random noise")
+ggplot(temp, aes(x = Impact, fill = Country)) + geom_bar(position = "dodge") + facet_wrap(~ Changing.factor)+
+  theme_gray(base_size = 24) + theme(axis.text.x=element_text(angle=90,hjust=1,vjust=0.4))
-## Fish eating questions with some random noise, all
+surveytemp <- subset(survey, survey[[31]] == "Yes")
-pl <- surv[surv$Eatfish,c(5,6,7,12)]
-scatterplotMatrix(
-  data.matrix(pl) + runif(nrow(pl)*ncol(pl), -0.5, 0.5)
-)
-## Fish eating questions with some random noise, FI
+temp <- melt(surveytemp, measure.vars = 38:43, variable.name = "Variable", value.name = "Value")
-pl <- surv[surv$Country == 1 & surv$Eatfish,c(5,6,7,12)]
+ggplot(temp, aes(x = Value, fill = Country)) + geom_bar(position = "dodge") + facet_wrap(~ Variable)+
-scatterplotMatrix(
+  theme_gray(base_size = 24) + labs(title = "Baltic salmon sources")
-  data.matrix(pl) + runif(nrow(pl)*ncol(pl), -0.5, 0.5)
-)
-## Fish eating questions with some random noise, SWE
+temp <- melt(surveytemp, measure.vars = 50:59, variable.name = "Variable", value.name = "Value")
-pl <- surv[surv$Country == 2 & surv$Eatfish,c(5,6,7,12)]
+ggplot(temp, aes(x = Value, fill = Country)) + geom_bar(position = "dodge") + facet_wrap(~ Variable)+
-scatterplotMatrix(
+  theme_gray(base_size = 24) + labs(title = "Reasons to eat Baltic salmon")
-  data.matrix(pl) + runif(nrow(pl)*ncol(pl), -0.5, 0.5)
-)
+temp <- melt(surveytemp, measure.vars = 73:82, variable.name = "Variable", value.name = "Value")
+ggplot(temp, aes(x = Value, fill = Country)) + geom_bar(position = "dodge") + facet_wrap(~ Variable)+
+  theme_gray(base_size = 24) + theme(axis.text.x=element_text(angle=90,hjust=1,vjust=0.4)) + labs(title = "Baltic salmon actions")
-## Fish eating questions with some random noise, Dk
+surveytemp <- subset(survey, survey[[31]] == "No")
-pl <- surv[surv$Country == 3 & surv$Eatfish,c(5,6,7,12)]
-scatterplotMatrix(
-  data.matrix(pl) + runif(nrow(pl)*ncol(pl), -0.5, 0.5)
-)
-## Fish eating questions with some random noise, EST
+temp <- melt(surveytemp, measure.vars = 62:70, variable.name = "Variable", value.name = "Value")
-pl <- surv[surv$Country == 4 & surv$Eatfish,c(5,6,7,12)]
+ggplot(temp, aes(x = Value, fill = Country)) + geom_bar(position = "dodge") + facet_wrap(~ Variable)+
-scatterplotMatrix(
+  theme_gray(base_size = 24) + labs(title = "Reasons not to eat Baltic salmon")
-  data.matrix(pl) + runif(nrow(pl)*ncol(pl), -0.5, 0.5)
-)
-## Baltic herring questions with some random noise
-pl <- surv[surv$Eatherr,13:16]
-scatterplotMatrix(
-  data.matrix(pl) + runif(nrow(pl)*ncol(pl), -0.5, 0.5)
-)
-## Baltic salmon questions with some random noise
+surveytemp <- subset(survey, survey[[86]] == "Yes")
-pl <- surv[surv$Eatsalm,8:11]
-scatterplotMatrix(
-  data.matrix(pl) + runif(nrow(pl)*ncol(pl), -0.5, 0.5)
-)
-##################### CORRELATION MATRIX
+temp <- melt(surveytemp, measure.vars = 87:92, variable.name = "Variable", value.name = "Value")
+ggplot(temp, aes(x = Value, fill = Country)) + geom_bar(position = "dodge") + facet_wrap(~ Variable)+
+  theme_gray(base_size = 24) + labs(title = "Baltic herring sources")
-temp <- sapply(survey, as.numeric) # Can be done for surv to get a smaller matrix
+temp <- melt(surveytemp, measure.vars = 99:108, variable.name = "Variable", value.name = "Value")
+ggplot(temp, aes(x = Value, fill = Country)) + geom_bar(position = "dodge") + facet_wrap(~ Variable)+
+  theme_gray(base_size = 24) + labs(title = "Reasons to eat Baltic herring")
-survey_correlations <- (cor(temp, method="spearman", use="pairwise.complete.obs"))
+temp <- melt(surveytemp, measure.vars = 123:132, variable.name = "Variable", value.name = "Value")
+ggplot(temp, aes(x = Value, fill = Country)) + geom_bar(position = "dodge") + facet_wrap(~ Variable)+
+  theme_gray(base_size = 24) + theme(axis.text.x=element_text(angle=90,hjust=1,vjust=0.4)) + labs(title = "Baltic herring actions")
-temp <- colnames(survey_correlations)
+surveytemp <- subset(survey, survey[[86]] == "No")
-melted_correlations <- melt(survey_correlations)
+temp <- melt(surveytemp, measure.vars = 111:120, variable.name = "Variable", value.name = "Value")
+ggplot(temp, aes(x = Value, fill = Country)) + geom_bar(position = "dodge") + facet_wrap(~ Variable)+
+  theme_gray(base_size = 24) + labs(title = "Reasons not to eat Baltic herring")
+#####################################3
+temp <- sapply(survey, as.numeric) # Can be done for surv to get a smaller matrix
+survey_correlations <- (cor(temp, method="spearman", use="pairwise.complete.obs"))
+temp <- colnames(survey_correlations)
+melted_correlations <- melt(survey_correlations)
 melted_correlations$Var1 <- factor(melted_correlations$Var1, levels=temp)
@@ Line 1,137: / Line 1,433: @@
    scale_fill_gradient2(low = "#480610", mid = "#FFFFFF", high = "#06480F", midpoint = 0, space = "Lab", guide = "colourbar")
-############################### PRINCIPAL COORDINATE ANALYSIS (PCoA)
+####################### Descriptive statistics
-#tämä osa valmistaa sen datan.
+oprint(cor(surv, use = "pairwise.complete.obs"))
-hypocols1 <- c(46:49,95:98)
+# --> Baltic salmon and herring eating are correlated, so they should be estimated together
-answ <- sapply(survey[hypocols1], FUN=as.numeric)
-answ <- as.matrix(answ[!is.na(rowSums(answ)),])
-pcoa_caps <- capscale(t(answ) ~ 1, distance="euclidean") ##PCoA done
+oprint(table(surv[c(12,7,4)], useNA = "ifany"))
+oprint(table(surv[c(13,12)], useNA = "ifany"))
-## Kuva koko hypoteeseista
+############################# Plot original data
-colstr <- c("palevioletred1","royalblue1","seagreen1","violet","khaki2","skyblue", "orange")
+# Eating frequencies of fish and Baltic salmon and herring with random noise, all
+pl <- surv[surv$Eatfish,c(5,8,10,13,15)]
+scatterplotMatrix(
+  data.matrix(pl) + runif(nrow(pl)*ncol(pl), -0.5, 0.5)
+)
-#hypo_sizes <- (5 - colMeans(answ))
+ggplot(data.frame(
-#leg_sizes <- c(4, 3, 2, 1, 0.01)
+  X = rep(1,5), Y = 5:1,
+  legend = c("All", "Finland", "Sweden", "Denmark", "Estonia")
-#pdf(file="pcoa_plot.pdf", height=6, width=7.5)
+), aes(x=X, y=Y, label=legend))+
-plot(pcoa_caps, display = c("sp", "wa"), type="n")#, xlim=c(-6,4.5)) ## PCoA biplot, full scale
+  geom_text()+
-points(pcoa_caps, display= c("sp"), col="gray40") # adding the people points
+  labs(title="Fish eating questions with random noise")
-points(pcoa_caps, display= c("wa"), pch=19)#, cex=hypo_sizes, col=trait.cols)
-text(pcoa_caps, display=c("wa"), srt=25, cex=0.5)
-#legend(x=-6, y=3.8, levels(traits), fill=colstr, bty="n", cex=1)
-#legend(-6, -2, legend=c("Very likely", "Moderately likely",
-#                        "No opinion", "Moderately unlikely", "Very unlikely"),
-#       pch=21, pt.cex = leg_sizes, bty="n", cex=1)
-#dev.off()
-</rcode>
-==== Bayes model ====
-* Model run 3.3.2017. All variables assumed independent. [http://en.opasnet.org/en-opwiki/index.php?title=Special:RTools&id=lwlSwXazIDHDyJJg]
+## Fish eating questions with some random noise, all
-* Model run 3.3.2017. p has more dimensions. [http://en.opasnet.org/en-opwiki/index.php?title=Special:RTools&id=ZmbNUuZeb7UOf8NP]
+pl <- surv[surv$Eatfish,c(5,6,7,12)]
-* Model run 25.3.2017. Several model versions: strange binomial+multivarnormal, binomial, fractalised multivarnormal [http://en.opasnet.org/en-opwiki/index.php?title=Special:RTools&id=pKe0s2Ldm1mbIVuO]
+scatterplotMatrix(
-* Model run 27.3.2017 [http://en.opasnet.org/en-opwiki/index.php?title=Special:RTools&id=2hY9p2r8CTJi3Qwq]
+  data.matrix(pl) + runif(nrow(pl)*ncol(pl), -0.5, 0.5)
-* Other models except multivariate normal were [http://en.opasnet.org/en-opwiki/index.php?title=Goherr:_Fish_consumption_study&oldid=40185 archived] and removed from active code 29.3.2017.
+)
-* Model run 29.3.2017 with raw data graphs [http://en.opasnet.org/en-opwiki/index.php?title=Special:RTools&id=BB8nePJb7hzSw6Ha]
-* Model run 29.3.2017 with salmon and herring ovariables stored [http://en.opasnet.org/en-opwiki/index.php?title=Special:RTools&id=2Hz4tYjrQLnUfIXw]
-* Model run 13.4.2017 with first version of coordinate matrix and principal coordinate analysis [http://en.opasnet.org/en-opwiki/index.php?title=Special:RTools&id=2k2dKhYPc2UkOCY5]
-* Model run 20.4.2017 [http://en.opasnet.org/en-opwiki/index.php?title=Special:RTools&id=QXRSZOliFNfJil39] code works but needs a safety check against outliers
-* Model run 21.4.2017 [http://en.opasnet.org/en-opwiki/index.php?title=Special:RTools&id=xbt2OoXPG8fi5COL] some model results plotted
-* Model run 21.4.2017 [http://en.opasnet.org/en-opwiki/index.php?title=Special:RTools&id=waDgScuIkUvEShLy] ovariables produced by the model stored.
-* Model run 18.5.2017 [http://en.opasnet.org/en-opwiki/index.php?title=Special:RTools&id=hy2clSt3Vo5y4xN5] small updates
-* 13.2.2018 old model run but with new Opasnet [http://en.opasnet.org/en-opwiki/index.php?title=Special:RTools&id=flr56viVhZ0VzZQu]
-<rcode name="bayes" label="Initiate Bayes model (for developers only)" graphics=1>
+## Fish eating questions with some random noise, FI
-# This is code Op_en7749/bayes on page [[Goherr: Fish consumption study]]
+pl <- surv[surv$Country == 1 & surv$Eatfish,c(5,6,7,12)]
+scatterplotMatrix(
+  data.matrix(pl) + runif(nrow(pl)*ncol(pl), -0.5, 0.5)
+)
-library(OpasnetUtils)
+## Fish eating questions with some random noise, SWE
-library(ggplot2)
+pl <- surv[surv$Country == 2 & surv$Eatfish,c(5,6,7,12)]
-library(reshape2)
+scatterplotMatrix(
-library(rjags)
+  data.matrix(pl) + runif(nrow(pl)*ncol(pl), -0.5, 0.5)
-library(car)
+)
-library(vegan)
-library(MASS)
-#library(gridExtra) # Error: package ‘gridExtra’ was built before R 3.0.0: please re-install it
-# Fish intake in humans
+## Fish eating questions with some random noise, Dk
-# Data from data.frame survey from page [[Goherr: Fish consumption study]]
+pl <- surv[surv$Country == 3 & surv$Eatfish,c(5,6,7,12)]
+scatterplotMatrix(
+  data.matrix(pl) + runif(nrow(pl)*ncol(pl), -0.5, 0.5)
+)
-objects.latest("Op_en7749", "preprocess2") # [[Goherr: Fish consumption study]]: survey, surv, ...
+## Fish eating questions with some random noise, EST
+pl <- surv[surv$Country == 4 & surv$Eatfish,c(5,6,7,12)]
+scatterplotMatrix(
+  data.matrix(pl) + runif(nrow(pl)*ncol(pl), -0.5, 0.5)
+)
-cat("Version with multivariate normal.\n")
+## Baltic herring questions with some random noise
+pl <- surv[surv$Eatherr,13:16]
+scatterplotMatrix(
+  data.matrix(pl) + runif(nrow(pl)*ncol(pl), -0.5, 0.5)
+)
+## Baltic salmon questions with some random noise
+pl <- surv[surv$Eatsalm,8:11]
+scatterplotMatrix(
+  data.matrix(pl) + runif(nrow(pl)*ncol(pl), -0.5, 0.5)
+)
+##################### CORRELATION MATRIX
+temp <- sapply(survey, as.numeric) # Can be done for surv to get a smaller matrix
+survey_correlations <- (cor(temp, method="spearman", use="pairwise.complete.obs"))
-# Development needs:
+temp <- colnames(survey_correlations)
-## Correlation between salmon.often and herring.often needs to be estimated.
-## Gender, country and age-spesific values should be estimated.
-mod <- textConnection("
+melted_correlations <- melt(survey_correlations)
-  model{
-    for(i in 1:S) {
+melted_correlations$Var1 <- factor(melted_correlations$Var1, levels=temp)
-      survs[i,1:4] ~ dmnorm(mus[], Omegas[,])
+melted_correlations$Var2 <- factor(melted_correlations$Var2, levels=temp)
-    }
+melted_correlations$value <- ifelse(melted_correlations$value >= 0.99,NA,melted_correlations$value)
-    for(j in 1:H) {
-      survh[j,1:4] ~ dmnorm(muh[], Omegah[,])
-    }
-    mus[1:4] ~ dmnorm(mu0[1:4], S2[1:4,1:4])
-    Omegas[1:4,1:4] ~ dwish(S3[1:4,1:4],S)
-    anss.pred ~ dmnorm(mus[], Omegas[,])
-    muh[1:4] ~ dmnorm(mu0[1:4], S2[1:4,1:4])
-    Omegah[1:4,1:4] ~ dwish(S3[1:4,1:4],H)
-    ansh.pred ~ dmnorm(muh[], Omegah[,])
-  }
-")
-jags <- jags.model(
+ggplot(melted_correlations, aes(x = Var1, y = Var2, fill = value, label= round(value, 2)))+
-   mod,
+   geom_raster()+
-   data = list(
+   theme(axis.text.x=element_text(angle=90,hjust=1,vjust=0.4))+
-    survs = surv[surv$Eatsalm,c(8:11)],
+  scale_fill_gradient2(low = "#480610", mid = "#FFFFFF", high = "#06480F", midpoint = 0, space = "Lab", guide = "colourbar")
-    S = sum(surv$Eatsalm),
-    survh = surv[surv$Eatherr,c(13:16)],
+############################### PRINCIPAL COORDINATE ANALYSIS (PCoA)
-    H = sum(surv$Eatherr),
-    mu0 = rep(2,4),
+#tämä osa valmistaa sen datan.
-    S2 = diag(4)/100000,
+hypocols1 <- c(46:49,95:98)
-    S3 = diag(4)/10000
+answ <- sapply(survey[hypocols1], FUN=as.numeric)
-  ),
+answ <- as.matrix(answ[!is.na(rowSums(answ)),])
-  n.chains = 4,
-  n.adapt = 100
+pcoa_caps <- capscale(t(answ) ~ 1, distance="euclidean") ##PCoA done
-)
-update(jags, 100)
+## Kuva koko hypoteeseista
-samps.j <- jags.samples(
-  jags,
-  c('mus', 'Omegas', 'anss.pred','muh','Omegah','ansh.pred'),
-)
-js <- array(
-  c(
-    samps.j$mus[,,1],
-    samps.j$Omegas[,1,,1],
-    samps.j$Omegas[,2,,1],
-    samps.j$Omegas[,3,,1],
-    samps.j$Omegas[,4,,1],
-    samps.j$anss.pred[,,1],
-    samps.j$muh[,,1],
-    samps.j$Omegah[,1,,1],
-    samps.j$Omegah[,2,,1],
-    samps.j$Omegah[,3,,1],
-    samps.j$Omegah[,4,,1],
-    samps.j$ansh.pred[,,1]
-  ),
-  dim = c(4,1000,6,2),
-  dimnames = list(
-    Question = 1:4,
-    Iter = 1:1000,
-    Parameter = c("mu","Omega1", "Omega2", "Omega3", "Omega4", "ans.pred"),
-    Fish = c("Salmon", "Herring")
-  )
-)
-# Mu for all questions about salmon
+colstr <- c("palevioletred1","royalblue1","seagreen1","violet","khaki2","skyblue", "orange")
-scatterplotMatrix(t(js[,,1,1]))
-# All parameters for question 1 about salmon
-scatterplotMatrix(js[1,,,1])
-# Same for herring
-scatterplotMatrix(js[1,,,2])
-jsd <- melt(js)
+#hypo_sizes <- (5 - colMeans(answ))
-#ggplot(jsd, aes(x=value, colour=Question))+geom_density()+facet_grid(Parameter ~ Fish)
+#leg_sizes <- c(4, 3, 2, 1, 0.01)
-#ggplot(as.data.frame(js), aes(x = anss.pred, y = Sampled))+geom_point()+stat_ellipse()
-coda.j <- coda.samples(
+#pdf(file="pcoa_plot.pdf", height=6, width=7.5)
-  jags,
+plot(pcoa_caps, display = c("sp", "wa"), type="n")#, xlim=c(-6,4.5)) ## PCoA biplot, full scale
-  c('mus', 'Omegas', 'anss.pred'),
+points(pcoa_caps, display= c("sp"), col="gray40") # adding the people points
+points(pcoa_caps, display= c("wa"), pch=19)#, cex=hypo_sizes, col=trait.cols)
-)
+text(pcoa_caps, display=c("wa"), srt=25, cex=0.5)
+#legend(x=-6, y=3.8, levels(traits), fill=colstr, bty="n", cex=1)
+#legend(-6, -2, legend=c("Very likely", "Moderately likely",
+#                        "No opinion", "Moderately unlikely", "Very unlikely"),
+#       pch=21, pt.cex = leg_sizes, bty="n", cex=1)
+#dev.off()
+</rcode>
-plot(coda.j)
+==== Bayes model ====
-######## fish.param contains expected values of the distribution parameters from the model
+* Model run 3.3.2017. All variables assumed independent. [http://en.opasnet.org/en-opwiki/index.php?title=Special:RTools&id=lwlSwXazIDHDyJJg]
+* Model run 3.3.2017. p has more dimensions. [http://en.opasnet.org/en-opwiki/index.php?title=Special:RTools&id=ZmbNUuZeb7UOf8NP]
+* Model run 25.3.2017. Several model versions: strange binomial+multivarnormal, binomial, fractalised multivarnormal [http://en.opasnet.org/en-opwiki/index.php?title=Special:RTools&id=pKe0s2Ldm1mbIVuO]
+* Model run 27.3.2017 [http://en.opasnet.org/en-opwiki/index.php?title=Special:RTools&id=2hY9p2r8CTJi3Qwq]
+* Other models except multivariate normal were [http://en.opasnet.org/en-opwiki/index.php?title=Goherr:_Fish_consumption_study&oldid=40185 archived] and removed from active code 29.3.2017.
+* Model run 29.3.2017 with raw data graphs [http://en.opasnet.org/en-opwiki/index.php?title=Special:RTools&id=BB8nePJb7hzSw6Ha]
+* Model run 29.3.2017 with salmon and herring ovariables stored [http://en.opasnet.org/en-opwiki/index.php?title=Special:RTools&id=2Hz4tYjrQLnUfIXw]
+* Model run 13.4.2017 with first version of coordinate matrix and principal coordinate analysis [http://en.opasnet.org/en-opwiki/index.php?title=Special:RTools&id=2k2dKhYPc2UkOCY5]
+* Model run 20.4.2017 [http://en.opasnet.org/en-opwiki/index.php?title=Special:RTools&id=QXRSZOliFNfJil39] code works but needs a safety check against outliers
+* Model run 21.4.2017 [http://en.opasnet.org/en-opwiki/index.php?title=Special:RTools&id=xbt2OoXPG8fi5COL] some model results plotted
+* Model run 21.4.2017 [http://en.opasnet.org/en-opwiki/index.php?title=Special:RTools&id=waDgScuIkUvEShLy] ovariables produced by the model stored.
+* Model run 18.5.2017 [http://en.opasnet.org/en-opwiki/index.php?title=Special:RTools&id=hy2clSt3Vo5y4xN5] small updates
+* 13.2.2018 old model run but with new Opasnet [http://en.opasnet.org/en-opwiki/index.php?title=Special:RTools&id=flr56viVhZ0VzZQu]
-fish.param <- list(
+<rcode name="bayes" label="Initiate Bayes model (for developers only)" graphics=1>
-  mu = apply(js[,,1,], MARGIN = c(1,3), FUN = mean),
+# This is code Op_en7749/bayes on page [[Goherr: Fish consumption study]]
-  Omega = lapply(
-:2,
-    FUN = function(x) {
-      solve(apply(js[,,2:5,], MARGIN = c(1,3,4), FUN = mean)[,,x])
-    } # solve matrix: precision->covariace
-  )
-)
-objects.store(fish.param)
+library(OpasnetUtils)
-cat("List fish.param stored.\n")
+library(ggplot2)
-</rcode>
+library(reshape2)
+library(rjags)
+library(car)
+library(vegan)
+library(MASS)
+#library(gridExtra) # Error: package ‘gridExtra’ was built before R 3.0.0: please re-install it
-==== Initiate ovariables ====
+# Fish intake in humans
+# Data from data.frame survey from page [[Goherr: Fish consumption study]]
-'''Amount estimated from a bayesian model.
+objects.latest("Op_en7749", "preprocess2") # [[Goherr: Fish consumption study]]: survey, surv, ...
-* Model run 24.5.2017 [http://en.opasnet.org/en-opwiki/index.php?title=Special:RTools&id=3UORzPwospQxp82h]
+cat("Version with multivariate normal.\n")
-<rcode name="modeljsp" label="Initiate jsp from model (for developers only)">
+# Development needs:
-# This is code Op_en7749/modeljsp on page [[Goherr: Fish consumption study]]
+## Correlation between salmon.often and herring.often needs to be estimated.
+## Gender, country and age-spesific values should be estimated.
-library(OpasnetUtils)
+mod <- textConnection("
+   model{
-jsp <- Ovariable(
+    for(i in 1:S) {
-  "jsp",
+      survs[i,1:4] ~ dmnorm(mus[], Omegas[,])
-   dependencies = data.frame(Name = "fish.param", Ident = "Op_en7749/bayes"),
+     }
-  formula = function(...) {
+     for(j in 1:H) {
-    require(MASS)
+      survh[j,1:4] ~ dmnorm(muh[], Omegah[,])
-     require(reshape2)
+    }
-     jsp <- lapply(
+     mus[1:4] ~ dmnorm(mu0[1:4], S2[1:4,1:4])
-:2,
+     Omegas[1:4,1:4] ~ dwish(S3[1:4,1:4],S)
-      FUN = function(x) {
+    anss.pred ~ dmnorm(mus[], Omegas[,])
-        mvrnorm(openv$N, fish.param$mu[,x], fish.param$Omega[[x]])
+    muh[1:4] ~ dmnorm(mu0[1:4], S2[1:4,1:4])
-      }
+    Omegah[1:4,1:4] ~ dwish(S3[1:4,1:4],H)
-     )
+    ansh.pred ~ dmnorm(muh[], Omegah[,])
+  }
-    jsp <- rbind(
+")
-      cbind(
-        Fish = "Salmon",
+jags <- jags.model(
-        Iter = 1:nrow(jsp[[1]]),
+  mod,
-        as.data.frame(jsp[[1]])
+  data = list(
-      ),
+    survs = surv[surv$Eatsalm,c(8:11)],
-      cbind(
+    S = sum(surv$Eatsalm),
-        Fish = "Herring",
+    survh = surv[surv$Eatherr,c(13:16)],
-        Iter = 1:nrow(jsp[[2]]),
+    H = sum(surv$Eatherr),
-        as.data.frame(jsp[[2]])
+     mu0 = rep(2,4),
-      )
+    S2 = diag(4)/100000,
-    )
+     S3 = diag(4)/10000
-    jsp <- melt(jsp, id.vars = c("Iter", "Fish"), variable.name = "Question", value.name = "Result")
+  ),
-     jsp <- Ovariable(output=jsp, marginal = colnames(jsp) %in% c("Iter", "Fish", "Question"))
+  n.chains = 4,
-     return(jsp)
+   n.adapt = 100
-   }
 )
-objects.store(jsp)
+update(jags, 100)
-cat("Ovariable jsp stored.\n")
+samps.j <- jags.samples(
-</rcode>
+  jags,
+  c('mus', 'Omegas', 'anss.pred','muh','Omegah','ansh.pred'),
-'''Amount estimates directly from data rather than from a bayesian model.
+)
-* Initiation run 18.5.2017 [http://en.opasnet.org/en-opwiki/index.php?title=Special:RTools&id=crW1kboP72BN1JbK]
+js <- array(
-* Initiation run 24.2.2018: sampling from survey rather than each respondent once [http://en.opasnet.org/en-opwiki/index.php?title=Special:RTools&id=yJavDqd5zPUyt5pK]
+  c(
+    samps.j$mus[,,1],
-<rcode name="surveyjsp" label="Initiate jsp from survey (for developers only)" embed=1>
+    samps.j$Omegas[,1,,1],
-# This is code Op_en7749/surveyjsp on page [[Goherr: Fish consumption study]]
+    samps.j$Omegas[,2,,1],
-# The code produces amount esimates (jsp ovariable) directly from data rather than bayesian model.
+    samps.j$Omegas[,3,,1],
+    samps.j$Omegas[,4,,1],
-library(OpasnetUtils)
+    samps.j$anss.pred[,,1],
+    samps.j$muh[,,1],
+    samps.j$Omegah[,1,,1],
+    samps.j$Omegah[,2,,1],
+    samps.j$Omegah[,3,,1],
+    samps.j$Omegah[,4,,1],
+    samps.j$ansh.pred[,,1]
+  ),
+  dim = c(4,1000,6,2),
+  dimnames = list(
+    Question = 1:4,
+    Iter = 1:1000,
+    Parameter = c("mu","Omega1", "Omega2", "Omega3", "Omega4", "ans.pred"),
+    Fish = c("Salmon", "Herring")
+  )
+)
-jsp <- Ovariable(
+# Mu for all questions about salmon
-  "jsp",
+scatterplotMatrix(t(js[,,1,1]))
-  dependencies = data.frame(Name="survey1", Ident="Op_en7749/preprocess2"), # [[Goherr: Food cunsumption study]]
+# All parameters for question 1 about salmon
-   formula = function(...) {
+scatterplotMatrix(js[1,,,1])
-    require(reshape2)
+# Same for herring
+scatterplotMatrix(js[1,,,2])
-    sur <- survey1[
-      sample(1:nrow(survey1), openv$N, replace=TRUE, prob=survey1$Weighting) ,
+jsd <- melt(js)
-       c(157,1,3,158,162:178,81,82,131,132) # Removed, not needed:16,29,30?
+#ggplot(jsd, aes(x=value, colour=Question))+geom_density()+facet_grid(Parameter ~ Fish)
-      ]
+#ggplot(as.data.frame(js), aes(x = anss.pred, y = Sampled))+geom_point()+stat_ellipse()
-     sur$Iter <- 1:openv$N
+coda.j <- coda.samples(
-    #colnames(sur)
+  jags,
-    #[1] "Row"                      "Country"                  "Gender"
+   c('mus', 'Omegas', 'anss.pred'),
-    #[4] "Ages"                     "Baltic.salmon.n"          "How.often.BS.n"
-    #[7] "How.much.BS.n"            "How.often.side.BS.n"      "How.much.side.BS.n"
+)
-    #[10] "Better.availability.BS.n" "Less.chemicals.BS.n"      "Eat.BH.n"
-    #[13] "How.often.BH.n"           "How.much.BH.n"            "How.often.side.BH.n"
+plot(coda.j)
-    #[16] "How.much.side.BH.n"       "Better.availability.BH.n" "Less.chemicals.BH.n"
-    #[19] "Eatfish"                  "Eatsalm"                  "Eatherr"
+######## fish.param contains expected values of the distribution parameters from the model
-    #[22] "Recommended.BS"           "Not.recommended.BS"       "Recommended.BH"
-    #[25] "Not.recommended.BH"       "Iter"
+fish.param <- list(
+  mu = apply(js[,,1,], MARGIN = c(1,3), FUN = mean),
-    # Make sure that Row is kept separate from Iter because in the sampling version they are different.
+  Omega = lapply(
-    # sur contained columns Eat.fish, How.often.fish, Eat.salmon. Are these needed, as all other questions are melted? No
+:2,
-    # Columns 5-7 removed, so colnames list above does not match.
+    FUN = function(x) {
+       solve(apply(js[,,2:5,], MARGIN = c(1,3,4), FUN = mean)[,,x])
-    colnames(sur)[5] <- "Eat.BS"
+     } # solve matrix: precision->covariace
-    colnames(sur) <- gsub("\\.n","",colnames(sur))
+  )
-    sur[22:25] <- sapply(sur[22:25], as.numeric)
+)
-     sur <- melt(
-      sur,
+objects.store(fish.param)
-      id.vars=c(1:4,26),
+cat("List fish.param stored.\n")
-      variable.name="Question",
+</rcode>
-      value.name="Result"
-     )
+==== Initiate ovariables ====
-     sur$Fish <- ifelse(grepl("BH",sur$Question),"Herring","Salmon")
-    sur$Question <- gsub("\\.BS","",sur$Question)
+=====jsp taken directly from data WITHOUT salmpling=====
-    sur$Question <- gsub("\\.BH","",sur$Question)
+<rcode name="nonsamplejsp" label="Initiate jsp from data without sampling (for developers only)" embed=1>
+# This is code Op_en7749/nonsamplejsp on page [[Goherr: Fish consumption study]]
+# The code produces amount esimates (jsp ovariable) directly from data rather than bayesian model or sampling.
+library(OpasnetUtils)
+jsp <- Ovariable(
+  "jsp",
+  dependencies = data.frame(Name="survey1", Ident="Op_en7749/preprocess2"), # [[Goherr: Fish consumption study]]
+  formula = function(...) {
+     require(reshape2)
+     sur <- survey1[c(157,1,3,158,162:178,81,82,131,132,154)] # Removed, not needed:16,29,30?
-     # The adjustments below probably should go to the preprocess2 code.
+     #colnames(sur)
-     sur$Result[sur$Question %in% c(
+    #[1] "Row"                      "Country"                  "Gender"
-      "Better.availability",
+    #[4] "Ages"                     "Baltic.salmon.n"          "How.often.BS.n"
-       "Less.chemicals",
+     #[7] "How.much.BS.n"            "How.often.side.BS.n"      "How.much.side.BS.n"
-       "Recommended",
+    #[10] "Better.availability.BS.n" "Less.chemicals.BS.n"      "Eat.BH.n"
-      "Not.recommended"
+    #[13] "How.often.BH.n"           "How.much.BH.n"            "How.often.side.BH.n"
-     ) & is.na(sur$Result)] <- 4 # If missing --> no change
+    #[16] "How.much.side.BH.n"       "Better.availability.BH.n" "Less.chemicals.BH.n"
-     sur$Result[is.na(sur$Result)] <- 1 # Replace missing values with 1. That will produce 0 g/d.
+    #[19] "Eatfish"                  "Eatsalm"                  "Eatherr"
+    #[22] "Recommended.BS"           "Not.recommended.BS"       "Recommended.BH"
+    #[25] "Not.recommended.BH"       "Weighting"
+     # Make sure that Row is kept separate from Iter because in the sampling version they are different.
+     # sur contained columns Eat.fish, How.often.fish, Eat.salmon. Are these needed, as all other questions are melted? No
+    # Columns 5-7 removed, so colnames list above does not match.
-     return(Ovariable(
+    colnames(sur)[5] <- "Eat.BS"
-       output = sur,
+    colnames(sur) <- gsub("\\.n","",colnames(sur))
-       marginal = colnames(sur) %in% c("Fish", "Iter", "Question")
+    sur[22:25] <- sapply(sur[22:25], as.numeric)
-     ))
+    sur <- melt(
-   }
+      sur,
+      id.vars=c(1:4,26),
+      variable.name="Question",
+      value.name="Result"
+    )
+    sur$Fish <- ifelse(grepl("BH",sur$Question),"Herring","Salmon")
+    sur$Question <- gsub("\\.BS","",sur$Question)
+    sur$Question <- gsub("\\.BH","",sur$Question)
+    ########## If the missing values are not adjusted, they drop out in the next stage.
+    if(TRUE) {
+      # The adjustments below probably should go to the preprocess2 code.
+      sur$Result[sur$Question %in% c(
+        "Better.availability",
+        "Less.chemicals",
+        "Recommended",
+        "Not.recommended"
+      ) & is.na(sur$Result)] <- 4 # If missing --> no change
+      sur$Result[is.na(sur$Result)] <- 1 # Replace missing values with 1. That will produce 0 g/d.
+    }
+     return(Ovariable(
+       output = sur,
+       marginal = colnames(sur) %in% c("Fish", "Iter", "Question")
+     ))
+   }
 )
 objects.store(jsp)
-cat("Ovariable jsp with actual survey data: a respondent is sampled for each iteration.\n")
+cat("Ovariable jsp with actual survey data: every respondent is kept in data without sampling.\n")
 </rcode>
-'''Initiate other ovariables
+===== Amount estimated from a bayesian model =====
+* Model run 24.5.2017 [http://en.opasnet.org/en-opwiki/index.php?title=Special:RTools&id=3UORzPwospQxp82h]
-* Code stores ovariables assump, often, much, oftenside, muchside, amount.
+<rcode name="modeljsp" label="Initiate jsp from model (for developers only)">
-* Model run 19.5.2017 [http://en.opasnet.org/en-opwiki/index.php?title=Special:RTools&id=FauP3x6hviC4xWCD]
+# This is code Op_en7749/modeljsp on page [[Goherr: Fish consumption study]]
-* Initiation run 24.5.2017 without jsp [http://en.opasnet.org/en-opwiki/index.php?title=Special:RTools&id=6U3FWqVRaNumPcok]
-* Model run 8.6.2017 [http://en.opasnet.org/en-opwiki/index.php?title=Special:RTools&id=euGolhOQvPqzQDvf]
-<rcode name="initiate" label="Initiate ovariables (for developers only)" embed=1>
-# This is code Op_en7749/initiate on page [[Goherr: Fish consumption study]]
 library(OpasnetUtils)
-# Combine modelled survey answers with estimated amounts and frequencies by:
+jsp <- Ovariable(
-# Rounding the modelled result and merging that with value in ovariable assump
+   "jsp",
+   dependencies = data.frame(Name = "fish.param", Ident = "Op_en7749/bayes"),
-often <- Ovariable(
-   "often",
-   dependencies = data.frame(
-    Name=c("jsp","assump"),
-    Ident=c("Op_en7749/surveyjsp","Op_en7749/initiate")
-  ),
    formula = function(...) {
-     out <- jsp[jsp$Question == "How.often" , ]
+     require(MASS)
-     out$Value <- round(result(out))
+    require(reshape2)
-    out <- merge(
+    jsp <- lapply(
-      assump@output[assump$Variable == "freq",],
+:2,
-       out@output
+      FUN = function(x) {
+        mvrnorm(openv$N, fish.param$mu[,x], fish.param$Omega[[x]])
+      }
+    )
+    jsp <- rbind(
+      cbind(
+        Fish = "Salmon",
+        Iter = 1:nrow(jsp[[1]]),
+        as.data.frame(jsp[[1]])
+      ),
+      cbind(
+        Fish = "Herring",
+        Iter = 1:nrow(jsp[[2]]),
+        as.data.frame(jsp[[2]])
+       )
      )
-     out <- out[!colnames(out) %in% c("Value", "Variable", "Result","Question")]
+     jsp <- melt(jsp, id.vars = c("Iter", "Fish"), variable.name = "Question", value.name = "Result")
-     colnames(out)[colnames(out) == "assumpResult"] <- "Result"
+     jsp <- Ovariable(output=jsp, marginal = colnames(jsp) %in% c("Iter", "Fish", "Question"))
-     return(out)
+     return(jsp)
    }
 )
-much <- Ovariable(
+objects.store(jsp)
-  "much",
+cat("Ovariable jsp stored.\n")
-  dependencies = data.frame(
+</rcode>
-    Name=c("jsp","assump"),
-    Ident=c("Op_en7749/surveyjsp","Op_en7749/initiate")
+===== Amount estimates directly from data rather than from a bayesian model =====
-  ),
-  formula = function(...) {
+* Initiation run 18.5.2017 [http://en.opasnet.org/en-opwiki/index.php?title=Special:RTools&id=crW1kboP72BN1JbK]
-    out <- jsp[jsp$Question == "How.much" , ]
+* Initiation run 24.2.2018: sampling from survey rather than each respondent once [http://en.opasnet.org/en-opwiki/index.php?title=Special:RTools&id=yJavDqd5zPUyt5pK]
-    out$Value <- round(result(out))
-    out <- merge(
-      assump@output[assump$Variable == "amdish",],
-      out@output
-    )
-    out <- out[!colnames(out) %in% c("Value", "Variable", "Result","Question")]
-    colnames(out)[colnames(out) == "assumpResult"] <- "Result"
-    return(out)
-  }
-)
-oftenside <- Ovariable(
+<rcode name="surveyjsp" label="Initiate jsp from survey (for developers only)" embed=1>
-   "oftenside",
+# This is code Op_en7749/surveyjsp on page [[Goherr: Fish consumption study]]
-   dependencies = data.frame(
+# The code produces amount esimates (jsp ovariable) directly from data rather than bayesian model.
-    Name=c("jsp","assump"),
-    Ident=c("Op_en7749/surveyjsp","Op_en7749/initiate")
+library(OpasnetUtils)
-  ),
+jsp <- Ovariable(
+   "jsp",
+   dependencies = data.frame(Name="survey1", Ident="Op_en7749/preprocess2"), # [[Goherr: Food cunsumption study]]
    formula = function(...) {
-     out <- jsp[jsp$Question == "How.often.side" , ]
+     require(reshape2)
-     out$Value <- round(result(out))
-     out <- merge(
+    sur <- survey1[
-      assump@output[assump$Variable == "freq",],
+      sample(1:nrow(survey1), openv$N, replace=TRUE, prob=survey1$Weighting) ,
-      out@output
+      c(157,1,3,158,162:178,81,82,131,132) # Removed, not needed:16,29,30?
-     )
+      ]
-     out <- out[!colnames(out) %in% c("Value", "Variable", "Result","Question")]
+     sur$Iter <- 1:openv$N
-     colnames(out)[colnames(out) == "assumpResult"] <- "Result"
-     return(out)
+    #colnames(sur)
-  }
+     #[1] "Row"                      "Country"                  "Gender"
-)
+    #[4] "Ages"                     "Baltic.salmon.n"          "How.often.BS.n"
+     #[7] "How.much.BS.n"            "How.often.side.BS.n"      "How.much.side.BS.n"
-muchside <- Ovariable(
+     #[10] "Better.availability.BS.n" "Less.chemicals.BS.n"      "Eat.BH.n"
-  "muchside",
+    #[13] "How.often.BH.n"           "How.much.BH.n"            "How.often.side.BH.n"
-  dependencies = data.frame(
+     #[16] "How.much.side.BH.n"       "Better.availability.BH.n" "Less.chemicals.BH.n"
-    Name=c("jsp","assump"),
+     #[19] "Eatfish"                  "Eatsalm"                  "Eatherr"
-     Ident=c("Op_en7749/surveyjsp","Op_en7749/initiate")
+    #[22] "Recommended.BS"           "Not.recommended.BS"       "Recommended.BH"
-  ),
+     #[25] "Not.recommended.BH"       "Iter"
-  formula = function(...) {
-     out <- jsp[jsp$Question == "How.much.side" , ]
+    # Make sure that Row is kept separate from Iter because in the sampling version they are different.
-     out$Value <- round(result(out))
+    # sur contained columns Eat.fish, How.often.fish, Eat.salmon. Are these needed, as all other questions are melted? No
-     out <- merge(
+    # Columns 5-7 removed, so colnames list above does not match.
-       assump@output[assump$Variable == "amside",],
-       out@output
+    colnames(sur)[5] <- "Eat.BS"
+     colnames(sur) <- gsub("\\.n","",colnames(sur))
+     sur[22:25] <- sapply(sur[22:25], as.numeric)
+     sur <- melt(
+       sur,
+      id.vars=c(1:4,26),
+      variable.name="Question",
+       value.name="Result"
      )
-     out <- out[!colnames(out) %in% c("Value", "Variable", "Result","Question")]
+     sur$Fish <- ifelse(grepl("BH",sur$Question),"Herring","Salmon")
-     colnames(out)[colnames(out) == "assumpResult"] <- "Result"
+    sur$Question <- gsub("\\.BS","",sur$Question)
-     return(out)
+    sur$Question <- gsub("\\.BH","",sur$Question)
-   }
-)
+    # The adjustments below probably should go to the preprocess2 code.
+    sur$Result[sur$Question %in% c(
+      "Better.availability",
+      "Less.chemicals",
+      "Recommended",
+      "Not.recommended"
+    ) & is.na(sur$Result)] <- 4 # If missing --> no change
+     sur$Result[is.na(sur$Result)] <- 1 # Replace missing values with 1. That will produce 0 g/d.
+    return(Ovariable(
+      output = sur,
+      marginal = colnames(sur) %in% c("Fish", "Iter", "Question")
+     ))
+   }
+)
-assump <- Ovariable(
+objects.store(jsp)
-  "assump",
+cat("Ovariable jsp with actual survey data: a respondent is sampled for each iteration.\n")
-  ddata = "Op_en7749", subset = "Assumptions for calculations"
+</rcode>
-)
+===== Initiate other ovariables =====
-effinfo <- Ovariable( # Effect of information
+* Code stores ovariables assump, often, much, oftenside, muchside, amount.
-   "effinfo",
+* Model run 19.5.2017 [http://en.opasnet.org/en-opwiki/index.php?title=Special:RTools&id=FauP3x6hviC4xWCD]
-   dependencies = data.frame(
+* Initiation run 24.5.2017 without jsp [http://en.opasnet.org/en-opwiki/index.php?title=Special:RTools&id=6U3FWqVRaNumPcok]
-     Name=c("jsp","assump"),
+* Model run 8.6.2017 [http://en.opasnet.org/en-opwiki/index.php?title=Special:RTools&id=euGolhOQvPqzQDvf]
+<rcode name="initiate" label="Initiate ovariables (for developers only)" embed=1>
+# This is code Op_en7749/initiate on page [[Goherr: Fish consumption study]]
+library(OpasnetUtils)
+# Combine modelled survey answers with estimated amounts and frequencies by:
+# Rounding the modelled result and merging that with value in ovariable assump
+often <- Ovariable(
+   "often",
+   dependencies = data.frame(
+     Name=c("jsp","assump"),
      Ident=c("Op_en7749/surveyjsp","Op_en7749/initiate")
    ),
    formula = function(...) {
-     away <- c("Value","Variable","Result","Question","assumpUnit","assumpSource","jspResult","jspSource")
+     out <- jsp[jsp$Question == "How.often" , ]
-    # Better availability of fish
+     out$Value <- round(result(out))
-    o1 <- jsp[jsp$Question == "Better.availability" , ]
+     out <- merge(
-     o1$Value <- round(result(o1))
+       assump@output[assump$Variable == "freq",],
-     o1 <- merge(
+       out@output
-       assump@output[assump$Variable == "change",],
-       o1@output
      )
-     colnames(o1)[colnames(o1)=="assumpResult"] <- "Better.availabilityResult"
+     out <- out[!colnames(out) %in% c("Value", "Variable", "Result","Question")]
-    o1 <- Ovariable(name="Better.availability", output=o1[!colnames(o1) %in% away])
+     colnames(out)[colnames(out) == "assumpResult"] <- "Result"
+     return(out)
-    # Less chemicals
-    o2 <- jsp[jsp$Question == "Less.chemicals" , ]
-    o2$Value <- round(result(o2))
-    o2 <- merge(
-      assump@output[assump$Variable == "change",],
-      o2@output
-    )
-     colnames(o2)[colnames(o2)=="assumpResult"] <- "Less.chemicalsResult"
-    o2 <- Ovariable(name="Less.chemicals", output=o2[!colnames(o2) %in% away])
-    out <- (o1 + o2)@output
-     return(Ovariable(output=out, marginal=colnames(out) %in% c("Iter","Fish")))
    }
 )
-effrecomm <- Ovariable( # Effect of recommendations
+much <- Ovariable(
-   "effrecomm",
+   "much",
    dependencies = data.frame(
      Name=c("jsp","assump"),
@@ Line 1,554: / Line 1,907: @@
    ),
    formula = function(...) {
-     # Recommend more fish
+     out <- jsp[jsp$Question == "How.much" , ]
-    o1 <- jsp[jsp$Question == "Recommended" , ]
+     out$Value <- round(result(out))
-     o1$Value <- round(result(o1))
+     out <- merge(
-     o1 <- merge(
+       assump@output[assump$Variable == "amdish",],
-       assump@output[assump$Variable == "change",],
+       out@output
-       o1@output
      )
-     colnames(o1)[colnames(o1)=="assumpResult"] <- "Result"
+    out <- out[!colnames(out) %in% c("Value", "Variable", "Result","Question")]
-     o1$Recommendation <- "Eat more"
+    colnames(out)[colnames(out) == "assumpResult"] <- "Result"
+    return(out)
-     # Recommend less fish
+  }
-     o2 <- jsp[jsp$Question == "Not.recommended" , ]
+)
-     o2$Value <- round(result(o2))
-     o2 <- merge(
+oftenside <- Ovariable(
-       assump@output[assump$Variable == "change",],
+  "oftenside",
-       o2@output
+  dependencies = data.frame(
+    Name=c("jsp","assump"),
+    Ident=c("Op_en7749/surveyjsp","Op_en7749/initiate")
+  ),
+  formula = function(...) {
+    out <- jsp[jsp$Question == "How.often.side" , ]
+    out$Value <- round(result(out))
+    out <- merge(
+      assump@output[assump$Variable == "freq",],
+      out@output
+    )
+    out <- out[!colnames(out) %in% c("Value", "Variable", "Result","Question")]
+    colnames(out)[colnames(out) == "assumpResult"] <- "Result"
+    return(out)
+  }
+)
+muchside <- Ovariable(
+  "muchside",
+  dependencies = data.frame(
+    Name=c("jsp","assump"),
+    Ident=c("Op_en7749/surveyjsp","Op_en7749/initiate")
+  ),
+  formula = function(...) {
+    out <- jsp[jsp$Question == "How.much.side" , ]
+    out$Value <- round(result(out))
+    out <- merge(
+      assump@output[assump$Variable == "amside",],
+      out@output
+    )
+    out <- out[!colnames(out) %in% c("Value", "Variable", "Result","Question")]
+    colnames(out)[colnames(out) == "assumpResult"] <- "Result"
+    return(out)
+  }
+)
+assump <- Ovariable(
+  "assump",
+  ddata = "Op_en7749", subset = "Assumptions for calculations"
+)
+effinfo <- Ovariable( # Effect of information
+  "effinfo",
+  dependencies = data.frame(
+    Name=c("jsp","assump"),
+    Ident=c("Op_en7749/surveyjsp","Op_en7749/initiate")
+  ),
+  formula = function(...) {
+    away <- c("Value","Variable","Result","Question","assumpUnit","assumpSource","jspResult","jspSource")
+    # Better availability of fish
+    o1 <- jsp[jsp$Question == "Better.availability" , ]
+    o1$Value <- round(result(o1))
+    o1 <- merge(
+      assump@output[assump$Variable == "change",],
+      o1@output
+    )
+    colnames(o1)[colnames(o1)=="assumpResult"] <- "Better.availabilityResult"
+    o1 <- Ovariable(name="Better.availability", output=o1[!colnames(o1) %in% away])
+    # Less chemicals
+    o2 <- jsp[jsp$Question == "Less.chemicals" , ]
+    o2$Value <- round(result(o2))
+    o2 <- merge(
+      assump@output[assump$Variable == "change",],
+      o2@output
+    )
+    colnames(o2)[colnames(o2)=="assumpResult"] <- "Less.chemicalsResult"
+    o2 <- Ovariable(name="Less.chemicals", output=o2[!colnames(o2) %in% away])
+    out <- (o1 + o2)@output
+    return(Ovariable(output=out, marginal=colnames(out) %in% c("Iter","Fish")))
+  }
+)
+effrecommRaw <- Ovariable( # Effect of recommendations
+  "effrecommRaw",
+  dependencies = data.frame(
+    Name=c("jsp","assump"),
+    Ident=c("Op_en7749/surveyjsp","Op_en7749/initiate")
+  ),
+  formula = function(...) {
+    # Recommend more fish
+    o1 <- jsp[jsp$Question == "Recommended" , ]
+    o1$Value <- round(result(o1))
+    o1 <- merge(
+      assump@output[assump$Variable == "change",],
+      o1@output
+    )
+     colnames(o1)[colnames(o1)=="assumpResult"] <- "Result"
+     o1$Recommendation <- "Eat more"
+     # Recommend less fish
+     o2 <- jsp[jsp$Question == "Not.recommended" , ]
+     o2$Value <- round(result(o2))
+     o2 <- merge(
+       assump@output[assump$Variable == "change",],
+       o2@output
      )
      colnames(o2)[colnames(o2)=="assumpResult"] <- "Result"
      o2$Recommendation <- "Eat less"
      out <- rbind(o1, o2)[!colnames(o1) %in%
              c("Value","Variable","Question","assumpUnit","assumpSource","jspResult","jspSource")
 ]
+    return(Ovariable(output=out, marginal=colnames(out) %in% c("Iter","Fish","Recommendation")))
+  },
+  unit = "change of amount in fractions"
+)
+# effrecommRaw and effrecomm are needed to enable decisions to affect the effect of recommendation before
+# the ovariable is collapsed. So, target the decisions to effrecommRaw and collapses to effrecomm.
+effrecomm <- Ovariable(
+  "effrecomm",
+  dependencies = data.frame(Name="effrecommRaw"),
+  formula = function(...) {
+    return(effrecommRaw)
+  }
+)
+amountRaw <- Ovariable(
+  "amountRaw",
+  dependencies = data.frame(Name = c(
+    "often",
+    "much",
+    "oftenside",
+    "muchside",
+    "assump"
+  )),
+  formula = function(...) {
+    away <- c(
+      "assumpUnit",
+      "jspResult",
+      "jspSource",
+      "Variable",
+      "Value",
+      "Explanation"
+    )
+    often <- often[ , !colnames(often@output) %in% away]
+    much <- much[ , !colnames(much@output) %in% away]
+    oftenside <- oftenside[ , !colnames(oftenside@output) %in% away]
+    muchside <- muchside[ , !colnames(muchside@output) %in% away]
+    assump <- assump[assump$Variable == "ingredient", !colnames(assump@output) %in% away]
+    out <- (often * much + oftenside * muchside * assump)/365 # g /d
+    return(out)
+  }
+)
+amount <- Ovariable(
+  "amount",
+  dependencies = data.frame(Name = c(
+    "amountRaw",
+    "effinfo",
+    "effrecomm"
+  )),
+  formula = function(...) {
+    out <- amountRaw * (1 + effinfo + effrecomm)
+    result(out)[result(out)<0] <- 0
+    result(out)[result(out)>100] <- 100
+    return(out)
+  }
+)
+rm(wiki_username)
+objects.store(list=ls())
+cat("Ovariables", ls(), "stored.\n")
+</rcode>
+==== Other code ====
+This is code for analysing EFSA food intake data about fish for BONUS GOHERR manuscript Ronkainen L, Lehikoinen A, Haapasaari P, Tuomisto JT. 2019.
+<rcode>
+################################
+# This is code for analysing EFSA food intake data about fish for manuscript
+# Ronkainen L, Lehikoinen A, Haapasaari P, Tuomisto JT. 2019
+library(ggplot2)
-    return(Ovariable(output=out, marginal=colnames(out) %in% c("Iter","Fish","Recommendation")))
+# Change the encoding from UCS-2 to UTF-8 first.
-  }
+dat <- read.csv(
+#  "C:/_Eivarmisteta/EFSA fish chronic intake.csv",
+  "C:/_Eivarmisteta/EFSA herring salmon chronic intake.csv",
+  header=TRUE, sep=",",dec=".",quote='\"'
 )
-amountRaw <- Ovariable(
+#> colnames(dat)
-  "amountRaw",
+#[1] "ï..Survey.s.country"     "Survey.start.year"       "Survey"
-  dependencies = data.frame(Name = c(
+#[4] "Population.Group..L2."   "pop.order"               "Exposure.hierarchy..L1."
-     "often",
+#[7] "Number.of.subjects"      "Number.of.consumers"     "Mean"
-    "much",
+#[10] "Standard.Deviation"      "X5th.percentile"         "X10th.percentile"
-    "oftenside",
+#[13] "Median"                  "X95th.percentile"        "X97.5th.percentile"
-    "muchside",
+#[16] "X99th.percentile"        "Comment"
-    "assump"
-  )),
-  formula = function(...) {
-     away <- c(
-      "assumpUnit",
-      "jspResult",
-      "jspSource",
-      "Variable",
-      "Value",
-      "Explanation"
-     )
-    often <- often[ , !colnames(often@output) %in% away]
-    much <- much[ , !colnames(much@output) %in% away]
-    oftenside <- oftenside[ , !colnames(oftenside@output) %in% away]
-    muchside <- muchside[ , !colnames(muchside@output) %in% away]
-    assump <- assump[assump$Variable == "ingredient", !colnames(assump@output) %in% away]
-    out <- (often * much + oftenside * muchside * assump)/365 # g /d
-    return(out)
-  }
-)
-amount <- Ovariable(
+colnames(dat)[c(1,2,4,12)] <- c("Country","Year","Group","Fish")
-  "amount",
-  dependencies = data.frame(Name = c(
-    "amountRaw",
-    "effinfo",
-    "effrecomm"
-  )),
-  formula = function(...) {
-    out <- amountRaw * (1 + effinfo + effrecomm)
+#> levels(dat$Group)
-    result(out)[result(out)<0] <- 0
+#[1] "Adolescents"     "Adults"          "Elderly"         "Infants"         "Lactating women"
-    result(out)[result(out)>100] <- 100
+#[6] "Other children"  "Pregnant women"  "Toddlers"        "Very elderly"
-    return(out)
-   }
-)
-rm(wiki_username)
+levels(dat$Group) <- c("Children","Adults","Elderly","Children","Pregnant women",
-objects.store(list=ls())
+                       "Children","Pregnant women","Children","Elderly")
-cat("Ovariables", ls(), "stored.\n")
+dat <- dat[dat$Fish !="Diadromous fish",] # Foodex2 level 7 entries
+ggplot(dat,aes(x=Country, y=Mean, colour=Group))+
+  geom_point()+coord_flip()+facet_wrap(~Fish, scales="free")+
+  labs(title="Fish consumption in European countries",
+       y="Average consumption (g/d)")
 </rcode>