-
Notifications
You must be signed in to change notification settings - Fork 4
/
manuscript.tex
1556 lines (1386 loc) · 83.2 KB
/
manuscript.tex
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
455
456
457
458
459
460
461
462
463
464
465
466
467
468
469
470
471
472
473
474
475
476
477
478
479
480
481
482
483
484
485
486
487
488
489
490
491
492
493
494
495
496
497
498
499
500
501
502
503
504
505
506
507
508
509
510
511
512
513
514
515
516
517
518
519
520
521
522
523
524
525
526
527
528
529
530
531
532
533
534
535
536
537
538
539
540
541
542
543
544
545
546
547
548
549
550
551
552
553
554
555
556
557
558
559
560
561
562
563
564
565
566
567
568
569
570
571
572
573
574
575
576
577
578
579
580
581
582
583
584
585
586
587
588
589
590
591
592
593
594
595
596
597
598
599
600
601
602
603
604
605
606
607
608
609
610
611
612
613
614
615
616
617
618
619
620
621
622
623
624
625
626
627
628
629
630
631
632
633
634
635
636
637
638
639
640
641
642
643
644
645
646
647
648
649
650
651
652
653
654
655
656
657
658
659
660
661
662
663
664
665
666
667
668
669
670
671
672
673
674
675
676
677
678
679
680
681
682
683
684
685
686
687
688
689
690
691
692
693
694
695
696
697
698
699
700
701
702
703
704
705
706
707
708
709
710
711
712
713
714
715
716
717
718
719
720
721
722
723
724
725
726
727
728
729
730
731
732
733
734
735
736
737
738
739
740
741
742
743
744
745
746
747
748
749
750
751
752
753
754
755
756
757
758
759
760
761
762
763
764
765
766
767
768
769
770
771
772
773
774
775
776
777
778
779
780
781
782
783
784
785
786
787
788
789
790
791
792
793
794
795
796
797
798
799
800
801
802
803
804
805
806
807
808
809
810
811
812
813
814
815
816
817
818
819
820
821
822
823
824
825
826
827
828
829
830
831
832
833
834
835
836
837
838
839
840
841
842
843
844
845
846
847
848
849
850
851
852
853
854
855
856
857
858
859
860
861
862
863
864
865
866
867
868
869
870
871
872
873
874
875
876
877
878
879
880
881
882
883
884
885
886
887
888
889
890
891
892
893
894
895
896
897
898
899
900
901
902
903
904
905
906
907
908
909
910
911
912
913
914
915
916
917
918
919
920
921
922
923
924
925
926
927
928
929
930
931
932
933
934
935
936
937
938
939
940
941
942
943
944
945
946
947
948
949
950
951
952
953
954
955
956
957
958
959
960
961
962
963
964
965
966
967
968
969
970
971
972
973
974
975
976
977
978
979
980
981
982
983
984
985
986
987
988
989
990
991
992
993
994
995
996
997
998
999
1000
\documentclass[3p]{elsarticle} %review=doublespace preprint=single 5p=2 column
%%% Begin My package additions %%%%%%%%%%%%%%%%%%%
\usepackage[hyphens]{url}
\usepackage{lineno} % add
\providecommand{\tightlist}{%
\setlength{\itemsep}{0pt}\setlength{\parskip}{0pt}}
\bibliographystyle{elsarticle-harv}
\biboptions{sort&compress} % For natbib
\usepackage{graphicx}
\usepackage{booktabs} % book-quality tables
%%%%%%%%%%%%%%%% end my additions to header
\usepackage[T1]{fontenc}
\usepackage{lmodern}
\usepackage{amssymb,amsmath}
\usepackage{ifxetex,ifluatex}
\usepackage{fixltx2e} % provides \textsubscript
% use upquote if available, for straight quotes in verbatim environments
\IfFileExists{upquote.sty}{\usepackage{upquote}}{}
\ifnum 0\ifxetex 1\fi\ifluatex 1\fi=0 % if pdftex
\usepackage[utf8]{inputenc}
\else % if luatex or xelatex
\usepackage{fontspec}
\ifxetex
\usepackage{xltxtra,xunicode}
\fi
\defaultfontfeatures{Mapping=tex-text,Scale=MatchLowercase}
\newcommand{\euro}{€}
\fi
% use microtype if available
\IfFileExists{microtype.sty}{\usepackage{microtype}}{}
\usepackage{graphicx}
% We will generate all images so they have a width \maxwidth. This means
% that they will get their normal width if they fit onto the page, but
% are scaled down if they would overflow the margins.
\makeatletter
\def\maxwidth{\ifdim\Gin@nat@width>\linewidth\linewidth
\else\Gin@nat@width\fi}
\makeatother
\let\Oldincludegraphics\includegraphics
\renewcommand{\includegraphics}[1]{\Oldincludegraphics[width=\maxwidth]{#1}}
\ifxetex
\usepackage[setpagesize=false, % page size defined by xetex
unicode=false, % unicode breaks when used with xetex
xetex]{hyperref}
\else
\usepackage[unicode=true]{hyperref}
\fi
\hypersetup{breaklinks=true,
bookmarks=true,
pdfauthor={},
pdftitle={Resolving the measurement uncertainty paradox in ecological management},
colorlinks=false,
urlcolor=blue,
linkcolor=magenta,
pdfborder={0 0 0}}
\urlstyle{same} % don't use monospace font for urls
\setcounter{secnumdepth}{0}
% Pandoc toggle for numbering sections (defaults to be off)
\setcounter{secnumdepth}{0}
% Pandoc header
\journal{The American Naturalist}
\usepackage[nomarkers]{endfloat}
\begin{document}
\begin{frontmatter}
\title{Resolving the measurement uncertainty paradox in ecological management}
\author[a]{Milad Memarzadeh}
\author[a, 1]{Carl Boettiger}
\address[a]{Dept of Environmental Science, Policy, and Management, University of
California Berkeley, Berkeley CA 94720-3114, USA}
\begin{abstract}
Ecological management and decision-making typically focus on uncertainty
about the future, but surprisingly little is known about how to account
for uncertainty of the present: that is, the realities of having only
partial or imperfect measurements. Our primary paradigms for handling
decisions under uncertainty -- the precautionary principle and optimal
control -- have so far given contradictory results. This paradox is best
illustrated in the example of fisheries management, where many ideas
that guide thinking about ecological decision making were first
developed. We find that simplistic optimal control approaches have
repeatedly concluded that a manager should increase catch quotas when
faced with greater uncertainty about the fish biomass. Current best
practices take a more precautionary approach, decreasing catch quotas by
a fixed amount to account for uncertainty. Using comparisons to both
simulated and historical catch data, we find that neither approach is
sufficient to avoid stock collapses under moderate observational
uncertainty. Using partially observed Markov decision process (POMDP)
methods, we demonstrate how this paradox arises from flaws in the
standard theory, which contributes to over-exploitation of fisheries and
increased probability of economic and ecological collapse. In contrast,
we find POMDP-based management avoids such over-exploitation while also
generating higher economic value. These results have significant
implications for how we handle uncertainty in both fisheries and
ecological management more generally.
\end{abstract}
\begin{keyword} POMDP, measurement uncertainty, decision theory, fisheries, conservation\end{keyword}
\end{frontmatter}
Imperfect information is ubiquitous in ecological management and
conservation decision making. While the pressing concerns of global
change have put the spotlight on forecasting, i.e.~the uncertainty of
the future (e.g. Petchey et al. 2015), management decisions must also
contend with uncertainty of the present: How many fish are in the sea
today? What regions harbor the most fragile biodiversity? How do we make
use of a rapidly expanding but opportunistically collected data, such as
in citizen science efforts, which may involve strong sampling biases?
Much of our focus in dealing with uncertainty has been through the lens
of statistical approaches to model fitting. The rise of approaches such
as Hierarchical Bayesian Modeling have given us sophisticated tools to
reflect uncertainty in our data and uncertainty in our knowledge of
mechanisms and parameters in our models (Ellison 2004). But estimating a
model is not the same as making a decision, particularly when a model
leaves us with considerable uncertainty about current and future states.
A model can only provide the rules of the chess board. It is the role of
decision theory to provide us with a strategy.
Decision theory has a long history in the ecological literature (e.g
Schaefer 1954; Walters and Hilborn 1978; Clark 1990; Shea 1998; Polasky
et al. 2011) which has sometimes been overshadowed by recent emphasis on
model fitting. While in principle a model will be updated throughout the
decision process (as we will discuss), it is convenient to think of the
model as ``given,'' estimated, with uncertainty, using the best
available methods. The goal of a decision theory is not maximize some
abstract notion of model fit (least squares, likelihood, etc), but a
more practical notion of utility (happiness, economic value, or any
combination of stakeholder objectives, see Halpern et al. (2013)).
Approaches for decision making under uncertainty in ecological systems
can be divided into two camps: (1) those based upon ``optimal control''
solutions, and usually favored by natural resource economists, and (2)
those based in heuristic methods such as scenario planning, resilience
thinking, and precautionary rules of thumb, more commonly found in both
ecological literature and actual practice (Fischer et al. 2009; Polasky
et al. 2011). While researchers have recognized the need to unify the
transparent and quantitative algorithmic approach of optimal control
with the greater complexity and uncertainty of real ecosystems that is
acknowledged by heuristic methods, computational barriers to doing so
have stymied this progress. We illustrate how the limitations of these
approaches have manifested in the example of fisheries management, and
present a new approach that can combine the reality of measurement
uncertainty and the rigor of optimization to resolve a long-standing
paradox and suggest a more robust approach to management.
The challenge of making decisions under uncertainty is not unique to
fisheries. Optimal control-based decision methods which assume perfect
measurements are applied to a wide range of population and ecosystem
management issues, including fire ecology (Richards et al. 1999),
invasive species (Shea 1998; Blackwood et al. 2010), disease outbreaks
(Shea et al. 2014; Li et al. 2017), and protecting biodiversity (Dee et
al. 2017; Iacona et al. 2017), and in common textbooks and reviews
(Mangel 1985; Clark 1990; Marescot et al. 2013). Yet fisheries
conservation and management has long been both a crucible and proving
ground for the theory of ecological management more generally, including
topics such as adaptive management (Walters and Hilborn 1978),
ecosystem-based management (Levin and Lubchenco 2008) and resilience
thinking (Holling 1973; May 1977), while also giving rise to the
sub-discipline of resource economics (Gordon and Press 1954; Schaefer
1954; Beverton and Holt 1957), thanks to its global relevance, long
history, and readily available data. Despite the generality of the
models and issue of measurement uncertainty considered here, any
application must be grounded in both the history and context of the
particular decision problem. In light of this, we begin with a brief
background on existing decision theory approaches common in fisheries
management.
In this paper, we examine three strategies to managing a natural
population, such as a fishery, under such uncertainty. This problem has
a long history in both theory and practice, where seemingly innocuous
simplifying assumptions have led to a deep paradox in how managers
should respond to uncertainty. The precautionary principle (Kriebel et
al. 2001) would seem to suggest that the greater the uncertainty in our
models, the more cautious we should be in management policies. Yet a
well-known mathematical proof by Reed (1979) established that under
quite general conditions, the optimal strategy under uncertainty is no
different than under no uncertainty at all. Attempts to resolve this
have since only deepened this paradox, arguing that adding additional
sources of uncertainty, such as imperfect measurements, should lead to
optimal harvest that increases with greater uncertainty (e.g. Clark and
Kirkwood 1986; Sethi et al. 2005). Meanwhile, much of actual practice
has (perhaps fortunately, as we shall see) ignored these results in
favor of a more heuristic application of the precautionary principle.
Here, we show that this paradox can be resolved by uncovering previous
shortcuts and tackling the optimal management problem head-on, which we
can solve thanks to the computational efficiency of sophisticated
algorithms developed more recently in the field of robotics. This
approach allows us to reject this paradox largely in favor of the
precautionary principle. Further, our comparisons show that widely
recognized existing approaches in both theory and practice are not by
themselves sufficient to manage a population under substantial
uncertainty over the long term, but our recently borrowed approach from
robotics literature can under identical conditions sustain both higher
biomass and higher economic yields than those approaches. This
conclusion has important implications not only for fisheries but
ecosystem more broadly: First, our results highlight that measurement
error must be accounted for in both the decision process itself, and not
merely in the estimation of model. Second, these results underscore the
opportunity to learn from engineering fields such as robotics when faced
with complex and uncertain decisions that we have previously dismissed
as intractable.
\hypertarget{the-decision-problem}{%
\section{The Decision Problem}\label{the-decision-problem}}
We consider the management problem of setting catch quotas for a marine
fishery in the face of imperfect information about the current stock
size and uncertainty about future recruitment. We seek to determine the
sequence of actions \(h_t\) for \(t \in [1, ... \infty]\) that maximize
the net present value (discounted sum of all future profits) of the
fishery. We will denote the discount rate \(\gamma\). For simplicity we
will once again follow classic theory and assume a fixed price for fish,
(equivalently, measuring our value in units of discounted fish rather
than discounted dollars). Each year, the manager also obtains an
estimate \(y_t\) of the true \(x_t\) stock size subject to some
measurement uncertainty \(\xi_t\)
\begin{equation}
y_t = \xi_t x_t
\end{equation}
For simplicity, we will assume measurement uncertainty is normally
distributed with standard deviation \(\sigma_m\),
\(\xi \sim \mathcal{N}(1, \sigma_m)\). Given an estimated population
model along with this (uncertain) measurement of the stock size, the
manager must choose the set of a harvest quotas \(\lbrace h_t \rbrace\)
for \(t \in [0, 1, \infty]\) that maximizes expected long term utility
of the stock:
\begin{equation}
\max_{ \lbrace h_t \rbrace } \mathbb{E} \left \{ \sum_{t=0}^{\infty} \gamma^t \cdot U(x_t, h_t) \right \}
\end{equation}
For convenience of presentation, we will assume this utility is simply
equal to the harvest itself.
\begin{equation}
U(x_t, h_t) = \min(h_t, x_t)
\end{equation}
where the \(\min\) function ensures no additional reward for attempting
to harvest more than the entire stock. In economic terms, this
corresponds to a fixed price per fish caught, with no proportional cost
on effort required to do so, but the results do not depend on this
assumption, as existing analyses have previously established (Reed
1979). Using this form allows a more direct comparison to approaches
such as maximum sustainable yield (MSY) which do not include an economic
model but merely maximize long-term catch. In principle, utility could
also reflect ecosystem services or other intrinsic values for fish left
in the sea and not merely the market for consumption (Halpern et al.
2013).
The solution to this decision problem is conditional on a population
model for the underlying ecological process, describing the probability
that the system moves to state \(x_{t+1}\) given that it was in state
\(x_t\) and the manager selected a harvest \(h_t\), which we discuss
below. Such a model would typically be estimated and parameterized from
available data, often (but not necessarily) using a hierarchical
Bayesian approach that acknowledges the existence of uncertainty in both
measurements and process (e.g.~see Dichmont et al. 2016 for a discussion
of how uncertainties are frequently included in stock assessment
models). However, accounting for measurement uncertainty when going from
data to model does not mean we have also accounted for this uncertainty
when going from model to decision. For example, a harvest decision
policy based upon maximum sustainable yield simply depends on the stock
size which maximizes the expected growth rate of the model. Measurement
uncertainty in model estimation can indeed influence the position of
that maximum, but as we shall see, that strategy is only optimal in the
absence of any uncertainty.
There is one further subtlety arising in this seemingly simple problem
statement that we must address. Given our discrete-time formulation of
the recruitment process, it is necessary to decide if the measurement
\(y_t\) happens before or after harvest \(h_t\): that is do we: measure,
recruit, harvest, or measure, harvest, recruit? Following convention in
the optimal control literature (e.g. Reed 1979; Clark 1990; Sethi et al.
2005), we will assume the latter; measurement occurs immediately before
harvest, followed by recruitment. This assumption has occasionally been
reversed to introduce some additional uncertainty from stochastic
recruitment immediately prior to setting the harvest quota (e.g. Clark
and Kirkwood 1986; Weitzman 2002), which appears to mimic uncertainty in
measurements but does not allow uncertainty to accumulate since the
measurement step still occurs without error.
The optimal strategy must think not only about the next year, but rather
considers the sequence of all future actions \({h_t}\), under all
possible realizations of the underlying model. This strategy must also
acknowledge that the manager will receive new information each year, and
may need to modify actions accordingly. It is this ability of the
decision problem to think ahead that allows it to tolerate short-term
costs (e.g.~reduced harvests in the current season) for larger future
payoffs. When Eq (1) is ignored, such optimal control problems are known
as Markov Decision Processes or MDPs. These are more commonly referred
to in the ecological literature by their solution method of Stochastic
Dynamic Programming (SDP; Marescot et al. 2013), and widely used in both
ecological management and animal behavior (Mangel 1985; Mangel and Clark
1988). The addition of measurement uncertainty in Eq (1) turns this
problem into a Partially Observed Markov Decision Process (POMDP), for
which solution methods have not been widely available for large problems
until more recently. We discuss these decision methods in more detail
below.
\hypertarget{population-model}{%
\subsection{Population model}\label{population-model}}
To facilitate tractability and interpretation, we will focus on the
well-studied Gordon-Schaefer model (Gordon and Press 1954; Schaefer
1954) of logistic population growth (stock recruitment),
\begin{equation}
x_{t+1} = \varepsilon_t (x_t-h_t) r \left(1 - \frac{(x_t-h_t)}{K}\right) \label{Gordon-Schaefer}
\end{equation}
where \(x_t\) is the current stock size, \(h_t\) the harvest chosen that
year, with parameters \(r\) giving the individual growth rate, and \(K\)
the carrying capacity and \(\varepsilon\) representing stochastic
recruitment in a variable environment. We have written this in terms of
\(x_t - h_t\) to underscore that we assume the convention of observe,
harvest, recruit. We will assume for simplicity
\(\varepsilon \sim \mathcal{N}(1, \sigma_g)\). This and similar models
are widely used in large scale analyses across diverse stocks (e.g.
(Costello et al. 2016; Britten et al. 2017)), and form the basis for
much of bioeconomic theory (Clark 1990). As such, it will be easier to
compare our results against intuition and classic theory, and avoid the
possibility of differences arising only because of some particular
subtle assumption hidden in a more complex model.
It is important to remember that all three decision methods we compare
here will use this same model as given. In the case of numerical
simulations, we will simply pick reasonable fixed parameter values for
the model above. To illustrate the application to empirical data, we
will first estimate these parameters using a hierarchical Markov Chain
Monte Carlo (MCMC), and then use the resulting estimates to drive all
three decision models. As such, the details of how the model is
estimated are immaterial to the differences in performance between these
decision methods. In all cases, the model is considered fixed.
Technically, it would be possible to re-estimate the model parameters
given each new observation during the decision process. Adaptations of
the decision processes we consider here include this possibility for
learning about the model during the decisions. Some include the
possibility of taking knowingly bad actions, like suddenly increasing
harvest rates, to improve learning over parameters. We investigate these
in subsequent work (Memarzadeh and Boettiger 2018). For better
comparison with established theory, here, the model is considered fixed.
In the case of the simulations, the model is known without error (though
still stochastic in nature), such that additional observations cannot
improve the parameter estimates in any event. Despite that obvious
simplification, we shall illustrate how existing decision methods fail
to account for measurement uncertainty appropriately. Even with these
simple models, decisions are complex, and knowing the rules of the game
is not the same as knowing how to win.
\hypertarget{current-theory-and-practice}{%
\section{Current Theory and
Practice}\label{current-theory-and-practice}}
Gordon and Press (1954) and Schaefer (1954) independently showed that
the maximum sustainable yield is achieved by reducing the stock to the
population to the size at which it obtains the maximum growth rate. This
stock size is known as Biomass at Maximum Sustainable Yield,
\(B_{MSY}\). For the Gordon-Schaefer model (and many others), this is
achieved at \(B_{MSY} = K/2\). They observed that fishing at a
\emph{constant yield} (that is, an individual fish mortality, \(F\),
such that harvest is biomass times mortality, \(H = F \cdot B\), and
thus \(F_{MSY} := \tfrac{H_{MSY}}{B_{MSY}}\)) will eventually lead to a
population that converges to the biomass \(B_{MSY}\) and produces the
maximum sustainable harvest, \(H_{MSY} = r K / 4\) in this model. From
here, theory and practice diverge.
This \emph{constant yield} (constant mortality) solution thus
corresponds to an equilibrium analysis which does not solve the
time-dependent optimization problem above. In particular, this maximum
sustainable yield (MSY) strategy will not be optimal whenever the stock
is away from \(B_{MSY}\). Clark (1973) demonstrated (assuming no
uncertainty in measurement or stochasticity in population dynamics) that
the optimal time-dependent strategy is not one of constant yield, but
rather of \emph{constant escapement}. We will return to this approach in
a moment.
\hypertarget{uncertainty-in-practice-msy-tac}{%
\subsection{Uncertainty in Practice: MSY \&
TAC}\label{uncertainty-in-practice-msy-tac}}
Maximum Sustainable Yield (MSY) remains the basis of international law
(including the UN, IWC, IATTC, ICCAT, ICNAF; Mace (2001)) and a familiar
standard of management in terrestrial ecosystems as well as aquatic
(Clark 1990). Critics have for some time observed the limitations of
harvesting at MSY in face of uncertainty in stock sizes and population
dynamics (Larkin 1977; Botsford et al. 1997). Many US fisheries reflect
this uncertainty through a series of adjustments that effectively reduce
the target fishing mortality level to reflect this uncertainty.
Typically, a stock assessment model provides a best estimate of
\(B_{MSY}\) and corresponding mortality \(F_{MSY}\) is used to define
the stock Over-Fishing Limit, (OFL). Based on this, a somewhat lower
level is set as the Allowable Biological Catch (ABC), reflecting
uncertainty in the stock assessment. To reflect possible uncertainty
between reported and actual catch, the ABC may be reduced further to
define the Total Allowable Catch, (TAC), which forms the basic unit of
management for many such fisheries. To reflect this process, our
analysis will also consider policies in which the harvest quota is set
at 80\% of the level expected under the MSY policy:
\(H_{TAC}(t) = 0.8 \cdot F_{MSY} \cdot B_t\), for a biomass estimated at
\(B(t)\) in year \(t\). To distinguish this approach from MSY, we will
refer to this approach as a TAC policy. This more closely represents a
heuristic (e.g. Hilborn 2010) or resilience-based approach (sensu
Fischer et al. 2009) than an optimization-based policy. Importantly,
this approach shares the fundamentally stationary assumptions of an MSY
policy by defining a constant mortality rather than a dynamic policy.
Thus, despite being more cautious overall and generating lower economic
yield than a constant escapement policy, TAC policies continue to
harvest at non-zero rates even if a stock falls below \(B_{MSY}\), while
the constant escapement policy does not.
It is important to note that actual management strategies are quite
diverse. Our focus on comparisons TAC, MSY, and Constant Escapement (CE)
reflects only the subset of current practices. Another increasingly
common approach to address uncertainty in both process and models is
known as Management Strategy Evaluation (MSE, Smith 1994; Dorner et al.
2009; Bunnefeld et al. 2011; Punt et al. 2016). MSE performs forward
simulations under a suite of pre-selected candidate strategies to
determine which strategy achieves the best expected outcomes. Crucially,
by limiting the analysis to a suite of candidate strategies over a
finite number of replicate simulations, this approach is able to
accommodate substantially greater complexity and uncertainty than has
been possible with more formal optimal control approaches we discuss
below. Optimal control solutions rely on dynamic programming to consider
all possible strategies (all possible sequences of harvest under all
possible realizations of stock dynamics). As such, MSE can be viewed as
an approximation to the optimal control solutions considered below, a
notion that has been formalized in the literature on approximate dynamic
programming (Nicol and Chadès 2011). Future work may be able to leverage
the forward-simulation approach used in MSE to extend the POMDP methods
introduced here to yet more complex dynamics.
\hypertarget{optimal-management}{%
\subsection{Optimal management}\label{optimal-management}}
So far, attempts to provide a more formal basis for managing uncertainty
than the heuristic adjustment of catch limits described above have
largely foundered in a series of paradoxes. Without uncertainty, the
theoretical policies are quite intuitive: The result derived by Clark
(1973) for an optimal dynamic strategy to replace the equilibrium
solution of MSY can be summarized as: If the stock is most productive at
\(B_{MSY}\), then obtain \(B_{MSY}\) as quickly as possible. Thus, at
any level below \(B_{MSY}\), the economically optimal thing to do is to
completely shut down the fishery, while above this stock, harvests
greater than \(H_{MSY}\) will be needed to bring the stock back to
\(B_{MSY}\). (This intuition must be adjusted slightly in the case of
economic discounting of future profits, but is otherwise quite general,
see Clark (1990)). This strategy is known as \emph{constant escapement}
(CE), since a constant sock size \(B_{MSY}\) escapes harvest each year.
Because CE and MSY converge to the \emph{same long-term biomass and same
long-term average yield}, CE can appear to be merely a more formal
justification for MSY. However, in practice the two approaches are very
different: because natural populations fluctuate year-to-year, under CE,
a different harvest quota must be set each year to achieve the constant
escapement. Because CE is derived as the economic optimum (assuming
perfect measurements), it is sometimes referred to as Maximum Economic
Yield, (MEY; Burgess et al. 2018), or as Rights Based Fisheries
Management (RBFM, Costello et al. 2016).
In the face of a naturally fluctuating population, previous arguments
made by Gordon and Press (1954); Schaefer (1954) or Clark (1973) no
longer hold. As noted above, the optimal harvest of a population under a
model such as the stochastic Gordon Schaefer, Eq
\eqref{Gordon-Schaefer}, is an example of a Markov Decision Problem
(MDP) which must be solved with stochastic dynamic programming. Thanks
to a mathematical proof provided by Reed (1979), fisheries managers have
largely been able to bi-pass such computational effort. However, this
convenient result also opened the door to paradox of uncertainty that
has persisted in ecological management for the past four decades.
\hypertarget{reeds-paradox-s-d}{%
\subsection{\texorpdfstring{Reed's Paradox:
\(S = D\)}{Reed's Paradox: S = D}}\label{reeds-paradox-s-d}}
Reed (1979) proved that, under sufficiently general assumptions, the
optimal escapement \(S\) for a population under stochastic growth, is
identical to Clark (1973)'s optimal escapement \(D\) for a deterministic
population, \(S = D = B_{MSY}\). This surprising result suggests that in
going from a world where a manager has perfect knowledge of absolutely
everything into a scenario where the manager faces considerable
uncertainty about the future state of the world, no additional
precaution is needed.
\hypertarget{clarks-paradox}{%
\subsection{Clark's Paradox}\label{clarks-paradox}}
Clark and Kirkwood (1986) was among the first attempts to resolve Reed's
Paradox. Clark and Kirkwood (1986) (quite correctly, as we will see),
identified the crux of Reed's Paradox as the absence of measurement
uncertainty:
\begin{quote}
An important tacit assumption in Reed's analysis, as in the other works
referred to above, is that the recruitment level X is known accurately
prior to the harvest decision, {[}\ldots{}{]} In the case of fishery
resources, the stock level X is almost never known very accurately,
owing to the difficulty of observing fish in their natural environment.
\end{quote}
Clark and Kirkwood (1986) were unable to solve the resulting problem
exactly, but had to adopt an almost equally troublesome assumption:
\begin{quote}
For reasons of tractability, we shall adopt the simplifying assumption
that the escapement level S, is known exactly at the end of that period.
(The mathematical difficulty of the problem increases markedly if this
assumption is relaxed.)
\end{quote}
Unfortunately, this ``simplifying assumption'' serves to squash most of
the measurement error, and their results instead only deepened the
paradox, finding policies that become even \emph{less} cautious as
uncertainty increases:
\begin{quote}
{[}Our{]} results appear to contradict the conventional wisdom of
renewable resource management, under which high uncertainty would call
for increased caution in the setting of quotas.
\end{quote}
Relying on a quite different but still flawed assumption nearly two
decades later, Sethi et al. (2005) largely confirm Clark's Paradox,
which they likewise observed with some concern:
\begin{quote}
It may seem counter-intuitive that a measurement error causes lower
expected escapements below the deterministic fishery closure threshold.
\end{quote}
Despite these notes of caution, both Clark and Kirkwood (1986) and Sethi
et al. (2005) ultimately attempt to rationalize this counter-intuitive
conclusion rather than reject it. Intuitively, uncertainty in the
population growth model (stochasticity) or measurement error, would seem
to justify harvesting fewer fish. Instead, existing optimal control
theory has presented only the paradoxical results that uncertainty
either should not matter at all, or that we should increase rather than
decrease harvests in response to increased uncertainty. Numerous other
attempts have been made to resolve this paradox through alternative
assumptions, the most common being to assume from the outset to look
only for `constant-escapement' type policies (Ludwig and Walters 1981;
Roughgarden and Smith 1996; Engen et al. 1997; Moxnes 2003). Owing to
the ``mathematical difficulty'' Clark first observed, none have
attempted a direct solution which we will present here.
In practice, TAC-managed fisheries are more common that CE-managed
fisheries, at least in the United States. CE-based regulation is limited
primarily to salmon, species for which population estimates may be more
precise and thus closer to Reed's assumption. We are not aware of any
fishery that has put Clark's Paradox into practice, harvesting more in
response to increased uncertainty. As we shall illustrate, this is
fortunate; though approaches that properly account for both forms of
uncertainty can perform even better.
\hypertarget{pomdps-an-optimal-treatment-of-measurement-uncertainty}{%
\subsection{POMDPs: An optimal treatment of measurement
uncertainty}\label{pomdps-an-optimal-treatment-of-measurement-uncertainty}}
Why does the introduction of measurement uncertainty make the decision
problem so much harder? Consider the thought experiment of managing a
fish stock where in each of the previous four years stock assessments
have put the population size at, say, 51, 54, 49, 51, (say, measured in
percent of carrying capacity, \(K\)). Under a constant escapement target
of 50, the harvest quota would be 1, 4, 0, 1, respectively. Note that
this control rule depends only on the measurement for the year in
question, which is assumed to be measured without error. If we then
measure a stock size of 75, an assumption that this measurement is made
without error would justify the sudden increase to a quota of 25. Yet if
we admit the possibility of measurement error, this sudden bump starts
to look very suspicious in light of all the previous observations.
Perfect measurements let us exploit the Markov property of the model --
the most recent measurement tells us all we need to know. In contrast,
the introduction of measurement uncertainty breaks this Markovian
assumption: a rational decision maker would suspect this 75 to be an
over-estimate based on prior observations. Under imperfect observations
introduced by Eq (1), we no longer have a Markov Decision Process (MDP)
over the observed state space. Rather, we have a Partially Observed
Markov Decision Process (POMDP). A POMDP policy cannot be expressed as a
policy function at all (much less as policy function with a single
constant-escapement target, as much previous work has assumed). Instead,
a POMDP policy can be thought of as depending not only on the most
recent observation, but the recent observation combined with a
\emph{prior belief} about what state the system is in, which itself is
determined by all past observations. In this manner, the POMDP can
consider how much weight to give the most recent observation based on
how it compares to prior beliefs.
Algorithmic innovations from the robotics community have now made it
possible to numerically solve non-trivial POMDP problems. The need to
consider all past observations becomes particularly demanding given the
forward-looking nature of a decision problem. Not only must we consider
all possible sequences of future actions over all possible sequences of
future states, but we must add to that all possible sequences of future
measurements. The computational complexity involved has historically
limited POMDP applications to contexts with only a handful of possible
states and actions (e.g. Chadès et al. 2008, 2011; Fackler and Haight
2014; Fackler and Pacifici 2014), insufficient to capture fisheries
models appropriately. Meanwhile, methods to solve decision problems with
imperfect information have advanced steadily in the field of robotics
and artificial intelligence. Almost contemporaneous with Reed's work,
papers by Smallwood and Sondik (1973) and Sondik (1978) laid
mathematical foundations for algorithms that could efficiently solve
POMDP problems (e.g. Kaelbling et al. 1998; Pineau et al. 2003). More
recently, driven by demands in areas such as autonomous vehicle decision
making, newer point-based approximation algorithms such as SARSOP
(Kurniawati et al. 2008) have made it possible to solve POMDPs with 100s
of states and actions. We have adapted this algorithm for application to
the fisheries context, and provide an efficient implementation as a
preliminary R package (Boettiger et al. 2018) which can be used to
replicate and further explore results presented here. Details of the
analysis presented here including annotated code to reproduce and
further explore all of the following results are provided in the
supplementary material.
Precise definitions of both the (fully observed) MDPs and POMDPs (not to
be confused with Markov Process Models and Hidden Markov Process Models
-- methods for which estimate the model but do not involve the feedback
of any decision process) in an ecological context can be found in
Williams (2011), or in classic work from the engineering and robotics
community (e.g. Smallwood and Sondik 1973; Sondik 1978). For
convenience, we summarize the POMDP definition and common notation here.
The POMDP problem is posed identically to that of the MDP problem, with
the addition of an observation process The POMDP problem for fisheries
question considered here can be summarized as follows:
\textbf{Transition} process (state equation): \(T(x_t, x_{t+1}, a_t)\):
the probability that a system is in state \(x_{t+1}\) at time \(t+1\)
given that it began in state \(x_t\) at time \(t\) and the manager took
action \(a_t\). In our context, this relationship is given by the
Gordon-Schaefer stock recruitment function \(f\) with normally
distributed growth uncertainty
\(x_{t+1} \sim \mathcal{N}(f(x_t,a_t), \sigma_g)\), truncated at zero to
exclude negative population sizes. The action \(a_t\) represents the
harvest quota set for the fishery. Attempting to implement a quota the
exceeds the true population size results in collapse of the fishery. As
Clark and Kirkwood (1986) eloquently argues, assuming that extinction is
impossible regardless of fishing intensity would unreasonably bias the
decision problem.
\textbf{Observation} function: \(O(x_t,y_t,a_{t-1})\) the probability of
observing state \(y_t\) given a system in state \(x_t\). In principle,
the action chosen can influence the precision of the observation. In our
case, we simply assume normally distributed errors around the true
state, \(y_t \sim \mathcal{N}(x_t, \sigma_m)\), truncated at zero to
exclude negative population sizes.
\textbf{Utility} function: \(U(x_t,a_t)\), the utility received at time
\(t\) for taking action \(a_t\), given that the system is in state
\(x_t\). For simplicity of analysis, we will simply set the utility to
be equal to the harvested stock: \(U(x_t, a_t) = \min(x_t, a_t)\),
indicating that realized harvest cannot be negative. This choice ensures
that in the case of no uncertainty (\(\sigma_g = \sigma_m = 0\)), the
optimal solution matches that expected under a simple MSY calculation.
More realistic utility functions may include diminishing returns with
increasing harvest (supply and demand effects), and the cost of fishing,
both of which act to suppress large harvests. By focusing on a simple
utility we can be sure that our comparison to MSY is driven by the
treatment of uncertainty rather than merely differing economic
assumptions.
The optimization problem is to select the action \(a_t\) that will
maximize the net present utility over all time. Future utility may be
discounted by a factor \(\gamma\), so that a value \(V\) in \(t\) year
is valued at \(\gamma^t V\) today. Numerically, each of these functions
are defined over a discrete set of possible states, observations, and
actions, and can thus be represented as a collection of matrices or
tensors.
\hypertarget{simulations}{%
\subsection{Simulations}\label{simulations}}
We consider the average fish biomass across 100 replicate simulations of
the same stock dynamics (Eq 4) under three different management
strategies: constant escapement (CE), total allowable catch (TAC; equal
to 80\% MSY), and partially observed Markov decision process (POMDP)
management. Each suite of simulations is considered under three
sequentially higher measurement error regimes, from \(\sigma_m = 0\) (no
measurement errors), \(\sigma_m = 0.1\) (low error), and
\(\sigma_m = 0.15\) (moderate error), as indicated, while stochastic
recruitment (environmental noise) is set to a moderate
\(\sigma_g = 0.15\). Empirical estimates of measurement error in stock
sizes vary widely, with estimates often dependent upon methodological
assumptions. Early estimates have put possible error as high as 50\%
(Clark and Kirkwood 1986) and more recent analyses have suggested an
average of closer to 36\%, though with cases that may be much higher
(Ralston et al. 2011). Consequently, range between 0 to 15\% provides a
probably conservative estimate of typical uncertainty ranges, which we
show is nevertheless large enough to drive substantially negative
ecological and economic outcomes under management strategies that do not
appropriately account for that error over the long run. We provide
complete code and data files to reproduce the analysis shown here in the
Appendices and as research compendium (Marwick et al. 2018) at
\url{https://github.com/boettiger-lab/pomdp-intro}, archived at
(Boettiger and Memarzadeh 2018).
\hypertarget{results}{%
\section{Results}\label{results}}
\begin{figure}
\centering
\includegraphics{manuscript_files/figure-latex/sims-1.pdf}
\caption{Average fish biomass under different management strategies
under increasing levels of measurement uncertainty. Each plot the mean
stock size over time across 100 replicate simulations under each policy:
constant escapement (CE), Total allowable catch (TAC = 80\% MSY), and
the proposed partially observed Markov decision process (POMDP) method.
Standard deviation from the mean across replicate simulations is shown
as faint colored bands, indicating significant variation due to
stochasticity between individual replicates. Measurement error increases
as a normal distribution with standard deviation 0, 0.1, or 0.15, as
indicated at the top of the panel. Environmental stochasticity is fixed
a standard deviation of 0.15 in each panel. Carrying capacity K
normalized to 1, r = 0.75. Additional environmental noise levels and
comparison to MSY rather than TAC can be found in the supplementary
material. \label{sims}}
\end{figure}
In the absence of either environmental noise or measurement error POMDP,
CE, and MSY would converge to stock at the \(B_{MSY}\), while TAC would
maintain the stock at a slightly higher level (see Supplemental
material, Figure S1). The introduction of stochastic growth has a
significant negative impact on the TAC strategy, with a mean biomass
significantly lower than \(B_{MSY}\) (first panel, Figure \ref{sims};
this impact is even more severe for MSY, see Figure S2). Without
measurement error, the CE and POMDP strategies are nearly identical with
both approximately maintaining the stock at \(B_{MSY}\) despite the
significant environmental stochasticity. As measurement error increases
in the subsequent panels, CE and TAC strategies perform increasingly
poorly, while the POMDP continues to maintain the average stock close to
\(B_{MSY}\). Notably, CE is significantly more impacted by measurement
uncertainty than TAC, with CE averaging even lower biomass than TAC
under moderate measurement uncertainty. These results confirm that the
precautionary approach represented by the TAC does indeed prove more
robust to the problem of measurement error than the optimization
solution represented by CE, but is less robust to environmental error.
Many of the replicate simulations under CE experience complete stock
extinction under large measurement error, owing to the arbitrarily large
harvests permitted by the ``bang-bang'' nature of the CE policy. In
contrast, the POMDP approach successfully handles both sources of
uncertainty, maintaining higher stocks, near \(B_{MSY}\) level, and
producing the highest yield.
\begin{figure}
\centering
\includegraphics{manuscript_files/figure-latex/econ-1.pdf}
\caption{Expected net present value of the fishery across strategies
under increasing levels of measurement error \(\sigma_m\). Net present
value is the sum of all future harvests discounted over time and
averaged across 100 replicates simulations under each strategy, assuming
discount factor \(\gamma = 0.95\), as shown in Figure \ref{sims}. The
value under constant escapement (CE) is optimal when measurements are
perfect but decreases rapidly with increasing measurement error. The
more cautious TAC is not economically optimal but largely unimpacted by
increasing error, while POMDP attains consistently high economic yield
despite the increasing uncertainty. \label{econ}}
\end{figure}
This pattern in management success in ecological terms (the relatively
recovery and maintenance of fish biomass) is also borne out in terms of
economic performance. We find the mean net present value (averaging
across replicates and discounting future profits by the discount factor
\(\gamma = 0.95\)) for these same simulations at increasing levels of
measurement uncertainty (Figure \ref{econ}). As before, environmental
stochasticity is set at \(\sigma_g = 0.15\). In the absence of
measurement uncertainty, constant escapement (CE) is optimal (as per
Reed (1979)), while the heuristic caution built into the total allowable
catch (TAC) strategy (at 80\% MSY) results in a sub-optimal economic
yield. The presence of stochasticity in recruitment
(\(\sigma_g = 0.15\)), also contributes to the reduction in yield under
TAC. Though CE is quite robust to the environmental stochasticity level,
this strategy proves very sensitive to increasing measurement error,
showing sharp declines in economic value, falling below the TAC economic
value at \(\sigma_m = 0.15\). Though the risk of stock collapse does
increase with increasing measurement error under TAC (as seen by the
mean declines in Figure \ref{sims}), these have little impact on the
economic value due to the discount rate. A smaller discount rate would
penalize unlikely but not improbable collapses more, since a significant
amount of time is required to realize those rare events. Supplemental
Figure S3 summarizes these economic trends across different noise values
\(\sigma_g\) and includes comparison to a simple MSY policy. In contrast
to the declining economic performance of CE and the consistently
sub-optimal economic yield for TAC, the POMDP strategy continues to
generate economic yields at approximately the optimal level (as attained
by CE in the absence of measurement error) despite the increasingly
uncertain measurements. This demonstrates that the reduced risk of stock
collapse and higher average stock biomass attained by the POMDP strategy
in the presence of high measurement error, Figure \ref{sims} is not the
result of a trivial reduction in harvesting under all circumstances, but
rather, evidence of a more nuanced strategy that manages to account for
the uncertainty in measurement while maintaining a reasonable harvest.
\begin{figure}
\centering
\includegraphics{manuscript_files/figure-latex/overest-1.pdf}
\caption{Average fish biomass when the POMDP strategy assumes a high
level of measurement uncertainty, while simulations reflect perfect
measurements. Despite this overestimation, biomass under POMDP strategy
closely tracks the optimal biomass under CE in the absence of
measurement error. Solid lines indicate averages over 100 simulations,
colored bands indicate +/- one standard deviation. \label{overest}}
\end{figure}
The POMDP strategy is also robust to overestimation of the level of
measurement uncertainty. In the simulation results shown in Figures
\ref{sims} and \ref{econ}, we have assumed that the level of measurement
uncertainty \(\sigma_m\) was known, and saw that ignoring this
uncertainty (as the CE policy does) has significant negative impact on
ecological and economic outcomes. If the level of measurement
uncertainty is not known precisely, \emph{overestimating} the
measurement error while using the POMDP strategy provides a
precautionary approach that can nevertheless achieve nearly optimal
ecological and economic outcomes. Figure \ref{overest} summarizes the
results of the same simulations as before, but under the scenario in
which the POMDP approach assumes a measurement error of
\(\sigma_m = 0.15\), when in fact all simulated measurements are made
without error (\(\sigma_m = 0\)). This represents an extreme case of
overestimating the measurement error. Stochastic growth remains the same
as before, \(\sigma_g = 0.15\), and simulations under TAC and CE
policies are shown for comparison. When measurement error is absent,
Reed's proofs hold and the CE strategy is optimal. Figure \ref{overest}
shows that the POMDP outcomes track almost exactly the CE outcomes
despite the misplaced assumption that measurements are quite poor. (As
we have already seen, the TAC strategy is insufficiently cautious for
this level of stochasticity in recruitment, resulting in
over-exploitation and long-term decline). The ability of the POMDP
solution to perform nearly optimally even when significantly
overestimating the level of measurement uncertainty contrasts sharply to
the significant declines from ignoring measurement uncertainty seen in
the CE solutions in Figure \ref{sims}. This demonstrates that while we
may not know precisely the level of measurement error, we achieve far
better outcomes overestimating measurement uncertainty than
underestimating it. This also underscores the observation that POMDP
policy is quite robust to the details of the uncertainty. We can get a
better understanding for the performance of POMDP in these simulations
by looking more closely at how any individual decision under a POMDP
strategy compares to the action chosen by the current alternatives.
\begin{figure}
\centering
\includegraphics{manuscript_files/figure-latex/policy-1.pdf}
\caption{Comparison of the harvest and escapement policies under each
strategy as a function of the observed stock size. Escapement refers to
the expected fraction of fish left in the sea, \(x-h\), while harvest
refers to the target catch; two different conventions for plotting the
same action. Uniquely, the POMDP policy plot will depend not only on the
observed stock size, but also depends on prior information (bottom
panel) as determined from any prior observations and actions. Depending
on this prior it may harvest less or more than the other policies given
an identical observation.\label{policy}}
\end{figure}
To better understand the differences in performance of these strategies
and resolve the paradox of uncertainty, we must take a closer look at
how the specific action recommended by each policy compares given the
same observation. Figure \ref{policy} shows the action taken by each
strategy in response to a measurement of the stock size (biomass
estimate). These plots show, for any possible observations, which
strategy will attempt to harvest most and which will attempt to harvest
least. For comparison purposes, we plot policies both in terms of
expected escapement, \(S = Y - H\) as is typical in the optimal control
literature, (e.g. Reed 1979; Clark and Kirkwood 1986; Sethi et al.
2005), and also directly in terms of the harvest quota \(H\). These
plots clearly illustrate the contrast between the constant escapement
(CE) strategy and the precautionary Total Allowable Catch (TAC)
strategy: CE sets harvest strictly to zero for stocks estimated at
biomass below \(B_{MSY}\), while TAC permits a modest harvest of even
very small stocks. (Comparable plots with MSY can be found in the
Supplemental Material). In principle, the CE policy could depend on
\(\sigma_g\), but as Reed (1979) proved, the constant escapement level
with stochasticity is the same as in the deterministic case, S = D,
unless the noise level is quite high (comparison plots in Supplemental
material.) In contrast to this, our POMDP plots depend very much on both
the choice of \(\sigma_m\) and \(\sigma_g\). If \(\sigma_m = 0\), they
reduce exactly to the CE solution. Here we show the corresponding
policies for POMDP solutions focusing on \(\sigma_g = 0.15\) as above,
with a modest measurement uncertainty \(\sigma_m = 0.1\). Alternate
combinations of measurement uncertainty can be found in the appendix but
do not change the general pattern. In addition to an explicit dependence
on the measurement error, our POMDP solutions depend on another piece of
information: any prior observations of the stock size.
Differences in attention to prior beliefs, determined by prior
observations, drive the differences between the POMDP strategy and the
other strategies and can resolve the uncertainty paradox. While the
catch quota under both TAC and CE strategies can be completely
determined given the most recent observation of the stock size by using
the policy curves shown in Figure \ref{policy}, this is not the case for
the POMDP approach. This fundamental difference is key to understanding
the difference in performance and resolving the paradox of uncertainty.
The POMDP policy cannot be specified by the most recent observation
alone. Instead, the POMDP policy depends on all prior observations, not
just the most recent. The reason for this complexity comes from the
Markov property. Observations of the state in the perfectly observed
system satisfy the Markov property: once we have measured the current
biomass exactly, we cannot get any better estimate of the current stock
size by studying older measurements. When measurements are uncertain
this is no longer the case: intuitively, by comparing the most recent
measurement to previous observations we may be able to infer when any
given measurement is unusually high or unusually low. POMDP formalizes
this intuition by capturing the information from all previous
observations into a \emph{prior belief}. This prior belief is updated
after every subsequent action and observation in accordance with Bayes
Law. The mechanics of this process are well documented in the extensive
literature on POMDPs (e.g. Smallwood and Sondik 1973; Sondik 1978;
Kurniawati et al. 2008; Williams 2011), but for our purposes it is
sufficient to observe how this evolving prior belief serves to
continually adjust the POMDP policy. Supplemental Figure S9 (Appendix A)
illustrates how the prior belief evolves in response to subsequent
observations and actions over the course of an individual simulation.
Figure \ref{policy} shows two separate policy curves (in terms of
harvest and escapement) for the same POMDP solution given two different
prior belief distributions (panel 3, priors). While both priors shown
express considerable uncertainty about the precise stock size prior to
the most recent observation, the lower prior is centered at a value
\(\tfrac{1}{4}K\) stock size, while the high prior is centered at a
value of \(\tfrac{3}{4}K\), relative to a (post-harvest, before
observation) target size of \(B_{MSY} = K/2\). The POMDP policy is
determined by the combination of this prior information and the most
recent observation, as indicated by the two different POMDP curves for
harvest/escapement shown corresponding to the different priors. As with
the other policies, the higher the most recent observation (x axis) the
higher the POMDP recommended harvest. Yet unlike the alternative
strategies, the POMDP solution always reflects the prior information.
Consequently, relative to constant escapement (that is, no measurement
error), the POMDP with low prior starts harvesting only at higher stock
sizes and always harvests less. In contrast under the high prior, the
POMDP always harvests at the same or higher level than the constant
escapement solution.
\textbf{A resolution to the paradox.}
Herein lies our resolution to the paradox of uncertainty. Previous work
created this paradox by suggesting that increased harvest rates
(decreased target escapement) would often be the rational response to
increased uncertainty. The exact solutions from the POMDP reveal that
this is only an accident of the assumptions: it is indeed true that
under certain circumstances, harvest levels should increase relative to
the case of no uncertainty, but \emph{only when prior knowledge suggests
the stock size should be much higher than the most recent estimate would
suggest}. In the POMDP solution, all information must be put into its
historical (and constantly updated) context. When a measurement roughly
matches the expectation of this prior context, Figure \ref{policy} shows
that the optimal response from POMDP is roughly comparable or slightly
more cautious to the harvest under no uncertainty, and not more
aggressive as the paradox would suggest. Measurements that exceed
expectations are tempered with some skepticism: while the CE solution is
willing to meet a high stock measurement with a large harvest, the POMDP
solutions increase harvest more cautiously.
This difference between underestimating and overestimating accounts for
the poor performance of the CE solution under large measurement
uncertainty, where it over-harvests whenever measurements are too large.
Even though underestimating is equally likely under the measurement
uncertainty model, sooner or later a run of ``heads'', a sequence of
overestimations relative to the true stock, can drive stocks to very low
levels where the chance of stock extinction becomes possible. It is
precisely this asymmetry: that too much over-harvesting leads to an
irreversible state of extinction, while too much under-harvesting is
always reversible (modulo some lost revenue) that lay behind Clark's
original intuition that there was something fishy about Reed's result
that \(S = D\): that uncertainty required no extra caution. Constant
escapement is particularly susceptible to over-harvesting starting from
stock sizes much higher than \(B_{MSY}\) since its bang-bang optimal
solution attempts to bring stocks back to target level as fast as
possible. CE is not so vulnerable to collapse from small stock sizes,
since it shuts down all harvests once estimates fall below \(B_{MSY}\).
Stock collapse under TAC can also be driven by such a string of heads,
but is unlikely at high stock sizes since harvest never exceeds
\(H_{MSY}\). In both cases, some measurement uncertainty interacts with
inherent stochasticity, which provides a continued source of variation
to sizes above \(B_{MSY}\), where CE strategy is most vulnerable, and
below \(B_{MSY}\), where TAC is most vulnerable.
\begin{figure}
\centering
\includegraphics{manuscript_files/figure-latex/historical-1.pdf}
\caption{Comparisons of harvest level that would be recommended by
policies considered here relative to historical harvest levels in two
commercial fish stocks. POMDP solution assumes a measurement error of
10\%. \label{historical}}
\end{figure}
\textbf{Historical examples and implementation}.
So far we have focused on simulation and an examination of the relative
policies under different prior information. Figure \ref{historical}
compares the harvest level that would have been recommended by each of
the strategies we have considered here against the historically observed