forked from Envirometrix/PredictiveSoilMapping
-
Notifications
You must be signed in to change notification settings - Fork 0
/
03-Soil_variables.Rmd
executable file
·2105 lines (1780 loc) · 113 KB
/
03-Soil_variables.Rmd
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
455
456
457
458
459
460
461
462
463
464
465
466
467
468
469
470
471
472
473
474
475
476
477
478
479
480
481
482
483
484
485
486
487
488
489
490
491
492
493
494
495
496
497
498
499
500
501
502
503
504
505
506
507
508
509
510
511
512
513
514
515
516
517
518
519
520
521
522
523
524
525
526
527
528
529
530
531
532
533
534
535
536
537
538
539
540
541
542
543
544
545
546
547
548
549
550
551
552
553
554
555
556
557
558
559
560
561
562
563
564
565
566
567
568
569
570
571
572
573
574
575
576
577
578
579
580
581
582
583
584
585
586
587
588
589
590
591
592
593
594
595
596
597
598
599
600
601
602
603
604
605
606
607
608
609
610
611
612
613
614
615
616
617
618
619
620
621
622
623
624
625
626
627
628
629
630
631
632
633
634
635
636
637
638
639
640
641
642
643
644
645
646
647
648
649
650
651
652
653
654
655
656
657
658
659
660
661
662
663
664
665
666
667
668
669
670
671
672
673
674
675
676
677
678
679
680
681
682
683
684
685
686
687
688
689
690
691
692
693
694
695
696
697
698
699
700
701
702
703
704
705
706
707
708
709
710
711
712
713
714
715
716
717
718
719
720
721
722
723
724
725
726
727
728
729
730
731
732
733
734
735
736
737
738
739
740
741
742
743
744
745
746
747
748
749
750
751
752
753
754
755
756
757
758
759
760
761
762
763
764
765
766
767
768
769
770
771
772
773
774
775
776
777
778
779
780
781
782
783
784
785
786
787
788
789
790
791
792
793
794
795
796
797
798
799
800
801
802
803
804
805
806
807
808
809
810
811
812
813
814
815
816
817
818
819
820
821
822
823
824
825
826
827
828
829
830
831
832
833
834
835
836
837
838
839
840
841
842
843
844
845
846
847
848
849
850
851
852
853
854
855
856
857
858
859
860
861
862
863
864
865
866
867
868
869
870
871
872
873
874
875
876
877
878
879
880
881
882
883
884
885
886
887
888
889
890
891
892
893
894
895
896
897
898
899
900
901
902
903
904
905
906
907
908
909
910
911
912
913
914
915
916
917
918
919
920
921
922
923
924
925
926
927
928
929
930
931
932
933
934
935
936
937
938
939
940
941
942
943
944
945
946
947
948
949
950
951
952
953
954
955
956
957
958
959
960
961
962
963
964
965
966
967
968
969
970
971
972
973
974
975
976
977
978
979
980
981
982
983
984
985
986
987
988
989
990
991
992
993
994
995
996
997
998
999
1000
# Soil observations and variables {#soil-variables-chapter}
*Edited by: Hengl T., MacMillan R.A. and Leenaars J.G.B.*
This chapter identifies and provides comprehensive definitions and
descriptions for a standardized set of soil properties (and classes), which are
commonly predicted using PSM. We first discuss the complexity of measuring and standardizing (or harmonizing) soil attributes, then focus on the key soil properties and classes of interest for global soil
mapping. The purpose of this chapter is to serve as a reference, and background, for other
chapters where the focus is on generating soil maps, interpreting
accuracy results and similar.
Please note that this chapter draws extensively from materials previously
published as part of the specifications for the *GlobalSoilMap* project [@arrouays2014globalsoilmap].
Large blocks of text extracted verbatum from these prevously published
*GlobalSoilMap* specifications were, in fact, originally largely composed
and written by the second author of this chapter in his former role as
Science Coordinator for the *GlobalSoilMap* project (www.globalsoilmap.net).
We acknowledge the source of much of the content of this chapter as having
originated from the previously published *GlobalSoilMap* specifications.
The R tutorial at the end of the chaper reviews soil data classes and
functions for R. It illustrates how to organize and reformat soil data in R for spatial analysis,
how to import soil data into R and how to export data and plot it in Google Earth.
To learn more about the Global Soil Information Facilities (GSIF) package,
visit the main documentation page (http://gsif.r-forge.r-project.org/).
## Basic concepts
### Types of soil observations
As mentioned in the previous chapter, values for soil properties or
attributes are obtained through observation and/or measurement of a soil
feature, using a specified method. We refer to observations and
measurements of the characteristics of soil properties and/or feature
attributes as *soil observations* (see also the [Observation and Measurements
OGC standard](http://www.opengeospatial.org/standards/om); ISO/DIS 19156). From the perspective of the technology
used, soil observations can be grouped as follows (see also
Fig. \@ref(fig:soil-vars)):
1. *Direct measurements obtained using analytical instruments and
procedures in a laboratory or in the field* — the results of
measurements are analytical values considered representative for a
given soil property.
2. *Indirect soil measurements obtained using mechanical devices,
analytical instruments and procedures* — measurement of soil
properties that can be used to infer information about a different
target soil property. These can be based on soil spectroscopy and
similar close-range or remote sensing systems
[@ShepherdWalsh2007JNIS; @ViscarraRossel2010DSS].
3. *Direct observations of soil properties and interpretations* —
subjectively assessed values based on protocols for soil description as presented in manuals i.e.
abundance of mottles, soil drainage class, soil colour.
4. *Indirect or derived interpretations* — subjectively assessed values
or conditions based mainly on an expert’s knowledge and
interpretation of observations e.g. soil classification, soil
fertility class.
```{r soil-vars, echo=FALSE, fig.cap="Types of soil observations in relation to data usage and production costs. Descriptive soil observations (e.g. manual texture or diagnostic soil horizons) are often not directly useable by end users, who are often more interested in specific secondary soil properties (e.g. water holding capacity, erosion index, soil fertility) as inputs to their modeling. However, descriptive field observations are often orders of magnitude more affordable to obtain than laboratory analysis.", out.width="65%", out.extra="angle=0"}
knitr::include_graphics("figures/Fig_types_observations.png")
```
Field campaigns are usually the most costly part of soil surveys. Large numbers
of soil observations are made in the field to assess the spatial
distribution of readily observable soil properties to provide empirical
evidence for soil mapping. Because a soil analytical measurement in the
laboratory is generally much more costly than a soil observation in the field,
only a smaller subset of soil samples is taken from the larger number of
field soil observations and brought to the laboratory for subsequent analysis.
Ideally, every soil observation would be accompanied by corresponding soil analytical
measurements to produce the most accurate and comprehensive soil information possible.
```{block2, type="rmdnote"}
Soil can be assessed quantitatively based on direct or indirect
measurements using analytical techniques (in a laboratory or in the
field) and qualitatively or descriptively based on observations in the
field that adhere to some soil description guidelines. Examples of
subjective observations are: diagnostic soil materials and horizons,
soil classes, Munsell color classes, manual texture assessment
(texture-by-hand), structure, compaction, root abundance and similar.
```
It is important to emphasize that soil properties, and the methods used to assess
soil properties, are two distinctly different concepts. The two can
be defined together (functional definition) or can be defined
separately, as given by numerous national and international manuals and
guidelines for analytical procedures and soil description: e.g. in
@Burt2004SSIR [@carter2007soil; @food2006guidelines], and/or
@VanReeuwijk2002. In this chapter we also make a distinction between the
*‘targeted variable’* (targeted soil properties) and *‘paths’* (determination methods).
Soil analytical data obtained in a laboratory are typically an order of
magnitude more expensive to produce than descriptive field observations
[@Burrough1971; @GehlRice2005; @Kempen2011PhDthesis]. To reduce these
high costs, surveyors collect descriptive soil observations (Fig. \@ref(fig:soil-vars)),
which can subsequently be interpreted and linked to soil types and soil classes,
which are then assumed to be characterised by a limited and definable range
of soil properties [@bouma1998exploring]. It is also possible to convert
observed values for certain soil properties to values comparable to
those measured by analytical methods (albeit with unknown precision) by
using various calibration models or *conversion functions*. For example,
*manual texturing* analysis [@FAO1990; @SSDS1993] can be used as a basis for estimating
soil texture fractions with a precision of ±5 % at fraction of
the cost of laboratory analysis.
Soils are usually sampled per depth interval or layer, generally using a
genetic A-B-C-R system i.e. corresponding to a *soil horizon* — a
relatively homogeneous layer of soil (with upper and lower depth) that
is *“distinctly different from other layers and informative for the
soil’s nature”* [@harpstead2001soil]. Actual soil samples are either
taken from the centre of a soil horizon or are mixed samples of the
material from the whole horizon (Fig. \@ref(fig:soi-var-depth)). Decades
of soil survey have shown that soil horizons can be fuzzy objects.
They may be difficult for different surveyors to distinguish and delineate consistently
[@Burrough1989JSS; @DeGruijter1997Geoderma]. Soil
correlation exercises try (not always successfully) to help different
surveyors consistently recognize similar soil horizons and assign similar codes with
comparable upper and lower boundaries so as to produce similar
descriptions and classifications for any observed soil.
```{r soi-var-depth, echo=FALSE, fig.cap="Soil observations can refer to genetic horizons (left), fixed depths i.e. point support (center) and/or can refer to aggregate values for the complete profile (right).", out.width="70%"}
knitr::include_graphics("figures/Fig_soi_var_depth.png")
```
An emerging approach to soil characterization is to scan the complete soil
profile in different parts of the spectra, and then decide on vertical
stratification *a posteriori* [@ViscarraRossel2010DSS]. Nevertheless,
much of the analytical data available in existing legacy soil profile
databases is sampled per soil layer and described by soil horizon.
Soil observations are taken at a geographic position and at a specific
depth (or depth interval), which is either 3D or refers to the whole solum. The 3D
(longitude, latitude, depth) position implies that the property varies
not only in geographic space, but also with depth. Soil properties that
describe an entire site are by implication 2D, as are soil properties
that summarise or refer to the soil profile as a whole entity. For
example, soil type does not change with depth. Also rock outcrops, depth
to bedrock and depth to ground water table are single attributes that apply to an entire profile.
### Soil properties of interest for global soil mapping
There are many soil properties, possibly hundreds, used in the
international domain of soil science including pedology, soil survey,
soil fertility, soil hydrology, soil biology, etc. Not all of these can
be mapped globally, nor are all of explicit interest for global
applications or use.
Soil data have been, and are, collected and
compiled into maps at various scales for various purposes and soil
inventory projects typically begin by first carefully identifying the
specific list of soil properties that are of most interest for the
anticipated uses of the planned survey. Different soil data are required
for different purposes, such as applying different models with different
data requirements.
In the past, soil surveys typically elected to focus on observing
and measuring soil attributes and properties that were considered
to be relatively stable, or static, in time. For example the particle
size distribution of a soil, or its depth to bedrock, were considered
to be relatively stable and not subject to large changes over relatively
short time periods (e.g. decades). Even attributes that were known to
change with management and time, such as topsoil thickness, organic carbon
or pH, were treated as relatively stable properties for the purposes of
mapping.
This choice to emphasize relatively stable soil properties and
attributes was a logical consequence of the fact that it could take years
to produce a single soil map and decades to complete mapping for an entire
area of interest. Consequently, for maps to be relevant, and to remain
relevant and useful for their anticipated lifetime of use, they had to
restrict themselves to trying to describe the variation in only space (not time)
of properties that could be considered stable and static.
The idea that soil properties could be assumed to remain relatively stable
through time was partially based on an assumption that most soils had
achieved a relatively stable condition that was in equilibrium with their
current environment. If a soil is in equilibrium with its environment, it
can be assumed that it will retain its present attributes, since there are
no strong drivers for change. This may well apply to undisturbed soils in
their natural environment, but it is not valid for disturbed or managed soils.
It is well established that human management practices can, and do, significantly
alter some key soil properties, such as pH, organic matter and topsoil thickness.
Most conventional soil maps recognized, and reported on, differences in
soil properties, such as pH or organic matter, between natural soils and
managed soils. However, it was never a common practice to name, map and
characterize managed soils separately from natural soils.
Local or national soil survey projects are of direct
relevance to global soil mapping initiatives if the range of data
collected encompasses the minimum data set as specified for global
initiatives. For example, completion of an update to the SOTER database for the World
requires an extensive range of soil property data as specified in the
procedures manual [@VanEngelen2012]. An update of the Harmonised World
Soil Database [@FAO2012HWSD] requires a smaller range of attributes. The
*GlobalSoilMap* project [@Arrouays201493] selected a list of only
*12 soil properties* considered relevant for global analyses, and
feasible to map globally. This list includes seven basic attributes,
assessed through primary observation or measurement, and three derived
attributes which are calculated from the primary soil properties
(Tbl. \@ref(tab:globalsoilmap)). These attributes are being mapped
(where possible) at a fine resolution of six depth intervals in the
vertical and, 3–arcseconds in the horizontal dimension (ca. 100 m)
(Fig. \@ref(fig:scheme-solum)).
```{r scheme-solum, echo=FALSE, fig.cap="Standard soil horizons, solum thickness and depth to bedrock (left) vs six standard depths used in the GlobalSoilMap project (right).", out.width="75%"}
knitr::include_graphics("figures/Fig_scheme_solum.png")
```
```{r globalsoilmap, echo=FALSE}
knitr::kable(
head(read.csv("extdata/globalsoilmap_specs.csv", header = TRUE, stringsAsFactors = FALSE), 10), booktabs = TRUE,
caption = 'The GlobalSoilMap project has selected seven primary (depth to bedrock, organic carbon content, pH, soil texture fractions, coarse fragments), three derived (effective soil depth, bulk density and available water capacity) and two optional (effective cation exchange capacity and electrical conductivity) target soil properties of interest for global soil mapping and modelling.'
)
```
### Reference methods
A pragmatic solution to ensuring efficient exchange, sharing and interpretation of global soil data
is to establish reference methods for soil measurement and description. The
*GlobalSoilMap* project agreed that their target soil properties would be
assessed and reported relative to specific, designated *reference methods*.
For example, soil organic carbon content of the fine earth
fraction is to be assessed and reported according to ISO10694 dry
combustion method [@Sleutel2007CSSPA]. Values for pH are to be be
reported for a 1:5 suspension of soil in water or using the CaCl$_2$
solution, with a precision of 1 decimal place. It has also been
recommended that ISO TC 190 — soil quality standards — should be used to
assess and report all data measured from air-dried soil samples.
Soil properties designated as optional for the *GlobalSoilMap*
consortium include Effective Cation Exchange Capacity assessed and
reported according to ISO11260 Barium Chloride (cmol+/kg = centi-mole+ per kilogram)
and Electrical conductivity in 1:1 soil–water solution (dS/m = deci-siemens per metre).
The list of soil properties identified for routine global soil mapping and
modelling is likely to grow in the years to come.
```{block2, type="rmdnote"}
The International Organisation for Standardisation (ISO) provides
international standard definitions of soil properties, and of associated
methods to assess those soil properties, through `ISO TC-190` and
`ISO TC-345`. Such unambiguously defined international standards are
crucial for success of the multi-partner global soil mapping projects.
```
In the following sections we focus our discussion on the soil properties
that were first mapped for the https://soilgrids.org project:
depth to bedrock, occurrence of the `R` horizon, organic carbon content of the fine earth
fraction, pH of the fine earth fraction, particle size class contents
(sand, silt, clay) of the fine earth fraction, gravel content of the
whole soil, bulk density of the whole soil (and subsequently of the fine
earth fraction) and Cation Exchange Capacity of the fine earth fraction.
We define those attributes as completely and unambiguously as possible,
including the associated reference method. For each soil property the
following will be discussed:
- *Brief introduction to the soil property (what is it, what does it
reflect, why is it of interest, considerations; in general terms)*;
- *Definition of the soil feature related to the soil property and
its spatial domain (2D, 3D)*;
- *Definition of the reference methods used to assess the soil property
value*;
- *Definition of the convention used to express the soil property value
(units, precision, range)*;
- *Review of the variation in soil property definitions and in methods
to assess the attribute, including listings of several of the most
widely used conversion functions cited from literature, and with
emphasis on harmonization or conversion to the reference method*.
We also identify, and review, a number of other widely used measurement
methods, in addition to our selected standard methods. We describe if and
how these other methods relate to the selected reference methods and
discuss issues related to harmonization and standardization for
attributes of current interest for global mapping.
### Standard soil variables of interest for soil mapping
Some standard soil legends for listed soil properties are embedded
within the GSIF package and can be loaded by:
```{r, tidy=TRUE}
library(GSIF)
data(soil.legends)
str(soil.legends)
```
which illustrates the referent cumulative probabilities (`CPROB`) and
appropriate color legend (`COLOR`; coded as a six-digit, three-byte
hexadecimal number) for the values of the target soil variables. The cumulative
probabilities were derived using the collection of records in the WoSIS
repository [@Batjes2017ESSD], and can be considered as an estimate of global prior
probabilities for soil pH (see further Fig. \@ref(fig:sprops-phiho5)).
A general intention is to maintain a *Global Soil Data Registry* so that
a short variable name (in further text *“variable code”*)
can be linked to a unique set of metadata which should include:
- Full description (text);
- Variable type (numeric, quantity, binary, factor etc);
- Determination / measurement method (unique code);
- Measurement unit (following the International System of Units);
- Biblio reference (URL or DOI);
- ISO code (if available);
- Physical limits (lower / upper);
- Detection limit (i.e. numeric resolution);
- Priority level (required, suggested or optional);
Note that MySQL has some restrictions considering column names: special
characters, such as those outside the set of alphanumeric characters from the
current character set, can not be used in the column names. Proposed
abbreviations for standard method names are $\mathtt{VOL}$ — volume
fraction, $\mathtt{ABU}$ — abundance or relative area cover,
$\mathtt{PCT}$ — mass percentage, $\mathtt{ICM}$ — thickness in cm,
$\mathtt{MHT}$ — texture by-hand or manual hand texture and
$\mathtt{MNS}$ — Munsell color codes, horizon sequence is coded with the
capital ASCII letters e.g. $\mathtt{A}$, $\mathtt{B}$,
$\mathtt{C}$,$\ldots$ $\mathtt{Z}$. Another option is to simply use
the US Goverment National Cooperative Soil Characterization Database column names (http://ncsslabdatamart.sc.egov.usda.gov/).
Also note that the metadata can be easily separated from the code so
that the short codes (variable name) can be used as a shorthand (replacement)
for the long description of the complete metadata. Using short
codes is also important for programming because unique code names are
used consistently in all scripts / functions.
## Descriptive soil profile observations
### Depth to bedrock
Soil depth (specifically depth to bedrock) is predicted because it is an
important consideration for a wide variety of engineering, hydrological
and agronomic interpretations. Shallow and lithic soils are of
particular interest as they impose restrictions for foundations and
structures in engineering, limit infiltration and storage of moisture
and produce more rapid runoff and erosion and limit growth of many crops
by restricting rooting depth and limiting available moisture storage.
Most soil legacy profile data do not provide any information about the
soil below depths of 1 m [@Richter1995]. This characteristic of legacy
soil data limits its usefulness for predicting soil depths greater than
2 m.
Soil depth is measured from the soil surface downwards and expressed in
positive values increasing with depth. Google Earth and the KML data
standard (via the `altitudeMode` tag) allow one to specify if the
vertical dimension refers to actual altitude (vertical distance from the
land surface) or to distance from the sea level (`absolute`). In this
case soil depths can be represented using `clampToGround` and negative
values. For example, a depth of 30 cm can be expressed as [@OGCKML2008]:
```{bash, eval=FALSE}
<Placemark> <Point>
<altitudeMode>clampToGround</altitudeMode>
<coordinates>17.2057,45.8851,-0.3</coordinates>
</Point> </Placemark>
```
Soil surface (depth = 0 cm) is the top of the mineral soil; or, for soils
with a litter layer (`O` horizon), the soil surface is the top of the
part of the `O` horizon that is at least slightly decomposed [@FAO2006].
Fresh leaf or needle fall that has not undergone observable
decomposition is not considered to be part of the soil and may be
described separately. For organic soils, the top of any surface horizon
identified as an `O` horizon is considered the soil surface.
The *depth to bedrock* i.e. depth to the `R` horizon is measured from
the soil surface downwards and is expressed in cm with a precision of ±1 cm.
Depth to bedrock deeper than e.g. 2–3 m is most often not recorded. Bedrock
is consolidated hard rock, with only a few cracks, underlying the soil. It
is not necessarily parent material. We imagine it often as something
distinct and easy to recognize in the field. In practice, depth to
bedrock can be difficult to determine, and is often confused with
stoniness or depth to parent material (which can be unconsolidated
material). Another issue is that, for most of the soils in the world,
hard bedrock is >2 m deep so that we actually don’t know the correct
depth to enter, other than >2 m. Rootability is physically restricted by
the bedrock, whether hard or soft (see Fig. \@ref(fig:scheme-depth-to-bedrock)).
```{r scheme-depth-to-bedrock, echo=FALSE, fig.cap="Depth to bedrock for censored and uncensored observations. Image source: Shangguan et al. (2017) doi: 10.1002/2016MS000686.", out.width="100%"}
knitr::include_graphics("figures/Fig_depth_2_bedrock.png")
```
In traditional soil characterisation, the total depth of the `O`,
`A`, `E`, and `B` horizons is referred to as the *solum*
[@harpstead2001soil], while the underlaying layer is referred to as
parent material or substratum [@SSDS1993]. Parent material can be coarse
or fine unconsolidated deposits of e.g. alluvial, colluvial or windblown
origin (`C` horizon) or consolidated residual hard bedrock (`R`
horizon).
```{block2, type="rmdnote"}
Depth to bedrock is the mean distance to `R` horizon (layer
impenetrable by roots or agricultural machinery). Depth to bedrock deeper
than 2 m is most often not recorded in field survey descriptions.
```
### Effective soil depth and rooting depth
*Effective soil depth* is of interest for soil mapping because it is a
key indicator of the capability of the soil to store moisture, support
crop growth and sustain beneficial land uses. It is often an essential
indicator of soil health. The effective soil depth is the depth to which
micro-organisms are active in the soil, where roots can develop and
where soil moisture can be stored [@FAO2006].
```{r rootingdepths, echo=FALSE}
knitr::kable(
head(read.csv("extdata/maximum_rooting_depth.csv", header = TRUE, stringsAsFactors = FALSE), 10), booktabs = TRUE,
caption = 'Summary of maximum rooting depth by biome (after Canadell et al. (1996)). MMRD = Mean maximum rooting depth in m; HVRD = Highest value for rooting depth in m.'
)
```
There are many thoughts on how to define effective soil depth. Effective
soil depth is closely related to, but not necessarily equivalent to, the
*rooting depth*. Rooting depth is measured and reported relative to a
specific prevailing land cover and land use category, while effective
soil depth is supposedly the maximum possible depth of soil that can be
used by any growing plant (see Tbl. \@ref(tab:rootingdepths)).
In some cases soil ends with an abrupt change of material which is
either solid, compacted or distinctly impenetrable for plants and
organisms living in soil. The root restricting i.e. plant accessible
depth, is the depth at which root penetration is strongly inhibited
because of physical (including soil temperature), chemical or
hydrological characteristics [@SSDS1993 p.60]. Restriction means the
inability to support more than a very few fine (or few very fine roots) if
depth from the soil surface and water state, other than the occurrence
of frozen water, are not limiting. For some crops like cotton plants or
soybeans, and possibly other crops with less abundant roots than the
grasses, the “very few” class is used instead of the “few” class. The
restriction may be below where plant roots normally occur because of
limitations in water state, temperatures, or depth from the surface.
This evaluation can be based on the specific plants that are important
to the use of the soil, as indicated in Tbl. \@ref(tab:rootingdepths); see
also @SSDS1993 [p.60].
Root restriction can be also influenced by certain pedogenic horizons,
such as *fragipans*. A change in particle size distribution alone, as
for example loamy sand over gravel, is not always a basis for physical
root restriction. A common indication of physical root restriction is a
combination of structure and consistence which together suggest that the
resistance of the soil fabric to root entry is high and that vertical
cracks and planes of weakness for root entry are absent or widely
spaced. Root restriction is inferred for a continuously cemented zone of
any thickness; or a zone >10 cm thick that when very moist or wet is
massive, platy, or has weak structure of any type for a vertical repeat
distance of >10 cm and while very moist or wet is very firm (firm, if
sandy), extremely firm, or has a large penetration resistance. Chemical
restrictions, such as high extractable aluminium, manganese and/or low
extractable calcium, can also be considered but are plant-specific.
Root-depth observations preferably should be used to make the
generalization. If these are not available then inferences may be made
from soil morphology.
As a general recommendation, it is advisable to focus first on mapping soil properties that
limit rooting, including content of coarse fragments and the depth to
bedrock, and then define effective soil depth *a posteriori* using
distinct analytical rules. A similar approach has also been promoted by
@rijsberman1985effect and @driessen1992land who refer to it as the
*Soil-productivity Index* — a product of soil-water sufficiency, soil pH
sufficiency and soil bulk density sufficiency. Here we consider a somewhat
wider range of soil properties that can affect rooting depth, such as:
- coarse fragments,
- compaction / porosity (possibly derived from structure and
consistence),
- drainage i.e. soil oxygen availability,
- toxicity e.g. Al content,
- acidity, salinity and similar.
In-field expert interpretation explicitly summarising observations into
a single expression for rooting depth is likely the most effective and
reliable source of information. The genetically determined maximum
rooting depth of vegetation isn’t always a reliable indicator of actual
observed effective rooting depth of a given soil at a given site (Fig. \@ref(fig:lri-scheme)).
Possibly a more robust way to determine the effective rooting depth is
to map all limiting soil properties with high accuracy, and then derive
rooting index per layer.
```{r lri-scheme, echo=FALSE, fig.cap="Derivation of the Limiting Rooting Index: (left) soil pH values and corresponding LRI, (right) coarse fragments and corresponding LRI. Based on Leenaars et al. (2018) doi: 10.1016/j.geoderma.2018.02.046.", out.width="90%"}
knitr::include_graphics("figures/Fig_LRI_scheme.png")
```
By using the GSIF package, one can determine Limiting Rooting Index, which
can be a good indicator of the effective rooting depth. Consider the
following soil profile from Nigeria [@Leenaars2012]:
```{r}
## sample profile from Nigeria (ISRIC:NG0017):
UHDICM = c(0, 18, 36, 65, 87, 127)
LHDICM = c(18, 36, 65, 87, 127, 181)
SNDPPT = c(66, 70, 54, 43, 35, 47)
SLTPPT = c(13, 11, 14, 14, 18, 23)
CLYPPT = c(21, 19, 32, 43, 47, 30)
CRFVOL = c(17, 72, 73, 54, 19, 17)
BLD = c(1.57, 1.60, 1.52, 1.50, 1.40, 1.42)*1000
PHIHOX = c(6.5, 6.9, 6.5, 6.2, 6.2, 6.0)
CEC = c(9.3, 4.5, 6.0, 8.0, 9.4, 10.9)
ENA = c(0.1, 0.1, 0.1, 0.1, 0.1, 0.2)
EACKCL = c(0.1, 0.1, 0.1, NA, NA, 0.5)
EXB = c(8.9, 4.0, 5.7, 7.4, 8.9, 10.4)
ORCDRC = c(18.4, 4.4, 3.6, 3.6, 3.2, 1.2)
x <- LRI(UHDICM=UHDICM, LHDICM=LHDICM, SNDPPT=SNDPPT,
SLTPPT=SLTPPT, CLYPPT=CLYPPT, CRFVOL=CRFVOL,
BLD=BLD, ORCDRC=ORCDRC, CEC=CEC, ENA=ENA, EACKCL=EACKCL,
EXB=EXB, PHIHOX=PHIHOX, print.thresholds=TRUE)
x
## Most limiting: BLD.f and CRFVOL, but nothing < 20
```
where `UHDICM` and `LHDICM` are the upper and lower horizon depth in cm,
`SNDPPT`, `SLTPPT` and `CLYPPT` are the sand, silt and clay content in
percent, `CRFVOL` is the volume percentage of coarse fragments (>2 mm),
`BLD` is the bulk density in t/m$^3$, `ORCDRC` is the soil organic carbon
concentration in permille or g/kg, `ECN` is the electrical conductivity in dS/m,
`CEC` is the Cation Exchange Capacity in cmol/kg (centi-mol per kilogram), `ENA` is the exchangable Na
in cmol/kg, `EACKCL` is the exchangeable acidity in cmol/kg, `EXB` is the exchangeable
bases in cmol/kg, `PHIHOX` is the soil pH in water suspension, `CRB` is the
CaCO$_3$ (carbonates) in g/kg, `GYP` is the CaSO$_4$ (gypsum) in g/kg, and `tetaS`
is the volumetric percentage of water.
For this specific profile, the most limiting soil property is `tetaS`, but
because none of the soil properties got <20 points, we can conclude
that the maximum rooting depth is >180 cm. Note that the threshold values in
the `LRI` function used to derive Limiting Rootability scores are set
based on common soil agricultural productivity tresholds (e.g. for
maize; see also Fig. \@ref(fig:lri-scheme)), and can be adjusted via the
`thresholds` argument. The computation is done per list of soil layers
(minimum three) to account for textural changes i.e. sudden changes in
sand and clay content and for the limiting layers such as layer
saturated with water. To determine further the effective rooting depth we can run:
```{r}
sel <- x==FALSE
if(!all(sel==FALSE)){
UHDICM[which(sel==TRUE)[1]]
} else {
max(LHDICM)
}
xI <- attr(x, "minimum.LRI")
## derive Effective rooting depth:
ERDICM(UHDICM=UHDICM, LHDICM=LHDICM, minimum.LRI=xI, DRAINFAO="M")
```
## Chemical soil properties
### Soil organic carbon
Organic carbon is a soil property of great current global interest
[@Smith2004SUM; @Smith2010CUP; @Panagos2013439]. It is commonly
recognized and used as a key indicator of soil health. The amount of
carbon present in the soil, and particularly in topsoil horizons, is
grossly indicative of potential productivity for crops. Amounts of
organic carbon throughout the profile influence soil structure,
permeability, porosity, bulk density, water holding capacity, nutrient
retention and availability and, consequently, overall soil health. The
ability of soils to sequester significant quantities of atmospheric
carbon is of considerable interest as a potential mechanism for
mitigating the adverse effects of increases in green house gasses in the
atmosphere [@Smith2004SUM; @Conant2010; @Scharlemann2014CM].
Consequently, soil organic carbon is probably the soil property of
greatest current interest and utility from the point of view of global
mapping, and interpretation, of soil properties.
```{block2, type="rmdnote"}
Soil Organic Carbon is one of the key measures of soil health.
The standard reference method for assessing and reporting soil organic
carbon content of the fine earth fraction is by dry combustion to at
least 1000°C (ISO 10694). Values of organic carbon content are typically reported in
permilles (0–1000) with integer precision.
```
The *dry combustion method* (Leco at 1000°C) is based on thermal oxidation of
both mineral carbon (IC) and organic carbon by means of a furnace.
It is a reliable method for the determination of the soil organic carbon when IC is
removed through combustion at low temperature prior to combustion at
high temperature. Dry combustion is considered to ensure oxidation of
all organic carbon, and is considered an accurate method which has been used in many
studies as a reference method against which to calibrate other methods
[@Grewal1991JSS; @Meersmans2009SUM; @Bisutti2004TAC]. A global estimate
of the probability distribution of soil organic carbon is shown in
Fig. \@ref(fig:sprofs-soil-carbon).
```{r sprofs-soil-carbon, echo=FALSE, fig.cap="Histogram and soil-depth density distribution for a global compilation of measurements of soil organic carbon content (ORCDRC) in permilles. Based on the records from WOSIS (http://www.earth-syst-sci-data.net/9/1/2017/). The log-transformation is used to ensure close-to-normal distribution in the histogram.", out.width="90%"}
knitr::include_graphics("figures/Fig_sprofs_ORCDRC.png")
```
Total organic carbon can be determined directly or indirectly. Direct determination
includes removal of any carbonates present by treating the
soil with hydrochloric acid. Indirect determination consists of applying
an empirical correction to the total carbon content to account for
the inorganic carbonates present.
Examples of studies that have used dry combustion for calibrating other
methods of analyzing organic carbon include @Kalembasa1973JSFA
[@Grewal1991JSS; @Soon1991CSSPA; @Wang1996AJSR; @Konen2002SSSAJ; @Brye2003CSSPA; @Mikhailova2003CSSPA; @Bisutti2004TAC; @Jankauskas2006CSSPA; @DeVos2007SUM]
and @Meersmans2009SUM. It is possible to produce
regression equations to permit conversion of results for organic carbon
produced by one method into equivalent values into a specified reference
method (generally dry combustion). However, local calibration
equations that reflect differences in soils on a regional basis are
usually needed. It is not possible to provide a single universal equation
suitable for use everywhere to convert organic carbon values produced
using other methods of analysis to equivalent values in the reference
method of dry combustion.
### Soil pH
Soil pH is of interest for global soil mapping because it is one of the more
widely available and easily interpreted chemical measures of the health
and productivity of the soil. pH provides an indication of base status
of the soil which influences nutrient availability, mobility of both
beneficial and detrimental ions and the ecology of micro-organisms
within the soil. For most crops and agricultural uses, a pH in the range
of 5.5 to 7.5 is optimal (considering the agricultural productivity of
soil). Low pH is associated with acidic conditions and with increased
mobility of toxic ions such as aluminium iron and even acid sulphates.
High pH is associated with reduced availability of phosphorus and at
higher levels with alkaline conditions that impede water uptake by
plants. A global estimate of the distribution of the soil pH is shown in
Figs. \@ref(fig:sprops-phiho5) and \@ref(fig:sprops-phikcl).
PH index approximates concentration of dissolved hydrogen ions
(H$_3$O$^+$) in a soil suspension. It is estimated as the negative
decimal logarithm of the hydrogen ion activity in a soil suspension. As
a single measurement, pH describes more than relative acidity or
alkalinity. It also provides information on nutrient availability, metal
dissolution chemistry, and the activity of microorganisms
[@Miller2010SSSAJ].
```{r sprops-phiho5, echo=FALSE, fig.cap="Histogram and soil-depth density distribution for a global compilation of measurements of soil pH (suspension of soil in H2O). Based on the records from WOSIS (http://www.earth-syst-sci-data.net/9/1/2017/).", out.width="100%"}
knitr::include_graphics("figures/Fig_sprofs_PHIHO5.png")
```
```{r sprops-phikcl, echo=FALSE, fig.cap="Histogram and soil-depth density distribution for a global compilation of measurements of soil pH (suspension of soil in KCl). Based on the records from WOSIS (http://www.earth-syst-sci-data.net/9/1/2017/).", out.width="100%"}
knitr::include_graphics("figures/Fig_sprofs_PHIKCL.png")
```
The standard reference method for reporting pH is ISO
10390:2005. This standard specifies an instrumental method for the
routine determination of pH using a glass electrode in a 1:5 (volume
fraction) suspension of soil in water (pH in H$_2$O), in potassium
chloride solution (pH in KCl) or in calcium chloride solution (pH in
CaCl$_2$).
The most common method for analyzing pH in North America is a 1:1
soil/water suspension [@Miller2010SSSAJ]. Adopting ISO 10390:2005 as a
standard with its specification of pH measured in a 1:5 suspension of
soil in water requires US values to be converted from 1:1 soil/water
to 1:5 soil/water equivalent values.
```{r color-legend-phi, echo=FALSE, fig.cap="Histogram for soil pH and connected color legend available via the GSIF package. Color breaks in the legend have been selected using histogram equalization (i.e. by using constant quantiles) to ensure maximum contrast in the output maps.", out.width="85%"}
knitr::include_graphics("figures/Fig_color_legend_PHI.png")
```
The ratio of soil to water in a suspension has a net effect of
increasing the pH with a decrease in the soil/water ratio. @Davis1943SS
has shown that decreasing the soil/water ratio from 10:1 to 1:10
resulted in an increase of 0.40 pH units. Values for pH computed using
methods with a lower ratio of soil to water (e.g. 1:1 or 1:2.5) will
generally be lower than equivalent values for pH in 1:5 water.
Several authors have demonstrated that fitting quadratic or curvilinear
functions to soil pH data produces regression equations with higher
coefficients of determination than those obtained from a linear fit
[@Aitken1991AJSR; @Miller2010SSSAJ]. For example, @Brennan1998 have estimated
that (at least in Southwestern Australia) pH in CaCl$_2$ can be estimated
from the pH 1:5 water by using a simple conversion:
```{r}
ph_h2o = 7.2
0.918*ph_h2o-0.3556
```
This model fitted explains 94% of variation in the values of pH CaCl$_2$ (R-square=0.9401).
```{block2, type="rmdnote"}
Soil pH is negative decimal logarithm of the hydrogen ion activity in a
soil suspension. Soil pH values are usually in the range 3–11, and are
recorded with a precision of ±0.1. Soil pH in the range of 5.5 to 7.5 is
optimal for growing crops.
```
Soil pH varies with season and soil moisture content, with higher pH
values associated with wetter soils and winter conditions and lower pH
values with drier soils and summer conditions [@Miller2010SSSAJ]. The
effects of both temporal variation in pH and variation due to different
analytical methods means that differences in pH of less than some
specified range (e.g. ±0.3 units) may not be meaningful in the
context of predictions made using noisy legacy soils data analyzed using
a variety of different analytical methods. Consequently, it is not
necessary or beneficial to report pH with a precision greater than
±0.1 unit. Natural variation of pH in soils is over a range of 2–11 with
a standard deviation of 1.4. Note also that pH follows
a close-to-normal distribution, although it is often argued that, locally,
it can show bimodal or even trimodal peaks
(Fig. \@ref(fig:color-legend-phi)).
### Soil nutrients
Nutrients are chemical elements or substances essential for the growth
of plants. The most essential elements important for the growth of
plants are carbon, hydrogen and oxygen. Other essential elements
can be separated into macro-nutrients (>100 $\mu$g or >100 ppm) and micro-nutrients
(<100 ppm), although there is no strict border between the two
[@harpstead2001soil; @hengl2017soil]. Some macro-nutrients of global importance for soil
management and protection are (http://en.wikipedia.org/wiki/Plant_nutrition):
- *Nitrogen* (N) — Nitrogen is often considered synonymous with
soil fertility. Controls leafy growth. Occurs in soil as
nitrates (e.g. NO$_3$, NO$_2$).
- *Phosphorus* (P) — High phosphorus deficiency may result in the
leaves becoming denatured and showing signs of necrosis. Occurs in
the form of phosphates.
- *Potassium* (K) — Potassium deficiency may result in higher risk of
pathogens, wilting, chlorosis, brown spotting, and higher chances of
damage from frost and heat.
- *Sulfur* (S) — Symptoms of deficiency include yellowing of leaves
and stunted growth. Occurs in soil in the form of sulfate salts
(SO$_4$).
- *Calcium* (Ca) — Calcium is involved in photosynthesis and
plant structure. Calcium deficiency results in stunting. Occurs in
the form of calcium carbonates (CaCO$_3$).
- *Magnesium* (Mg) — Magnesium is also an important part
of chlorophyll. Magnesium deficiency can result in
interveinal chlorosis.
```{block2, type="rmdnote"}
Nitrogen, Phosphorus and Potassium are the three relatively mobile and
dynamic nutrients in soil that are most often lacking and hence have
been identified of primary interest for the fertilizer industry. Other
micro-nutrients of interest for global soil mapping would be: Iron (Fe),
Zinc (Zn), Manganese (Mn), Copper (Cu), Boron (B), Chlorine (Cl),
Molybdenum (Mo), Nickel (Ni) and Sodium (Na).
```
Apart from macro- and micro-nutrients important for plant growth, there
is an increasing interest in the distribution of heavy metals in soils,
especially ones that are considered toxic or dangerous for human health.
Some common heavy metals of interest for soil management and soil
protection in developed industrial and / or developing countries are
Lead (Pb), Arsenic (As), Zinc (Zn), Cadmium (Cd), Nickel (Ni), Copper
(Cu), and Aluminium (Al)
[@Markus2001399; @reimann2011statistical; @Morel2005202; @Rodriguez-Lado23082013; @hengl2017soil].
Macro- and micro-nutrients and heavy metals are measured and mapped in
parts per million or $\mu$g per kg of soil. The AfSIS project, provides a good
example of mapping macro- and micro-nutrients over a large area [@hengl2017soil]. The
problem with mapping such chemical soil properties, however, is that they
are highly dynamic. For example, nitrogen, phosphorus, and potassium are
highly mobile nutrients. Their concentration changes from month to
month, even from day to day so that space-time models (2D-T or 3D-T)
need to be developed and the amount of analysis / storage needed can
easily escalate.
## Physical and hydrological soil properties
### Coarse fragments
Soil texture is connected with soil granulometry or the composition of
the particle sizes, typically measured as volume percentages. The most
common subdivision of soil granulometry is [@Shirazi2001SSSAJ]:
1. Fine earth (<2 m)
1.1 sand (coarser particles in the fine earth),
1.2 silt (medium size particles),
1.3 clay (fine particles <2 $\mu$m),
2. Coarse fragments (>2 mm)
2.1 gravel (2 mm to 8 cm),
2.2 stones or boulders (>8 cm),
Coarse fragments occupy volume in the soil matrix, reducing water and
nutrient availability as well as influencing rooting depth and
workability. We elect to produce maps of coarse fragment content because many assessments, such as total stocks of carbon or
available water, are volumetric based and require knowledge of the
volume of non-soil materials throughout the profile. This information is required to support
calculations of the total volume of the fine earth fraction that is
available to hold water or retain organic carbon. Without some estimate
of the volume of the soil occupied by solid particles larger than 2 mm, it
would not be possible to compute volumetric estimates of stocks of soil
carbon or available moisture for fine earth soil.
Coarse fragments include stones as well as gravel (hard and soft
nodules) and the attribute is defined as consisting of all mineral
particles with a size exceeding 2 mm. Coarse fragment content is most
commonly expressed in volume fraction (volume percentage) of the
horizon, layer or sample considered. Laboratory analyses tend to be
applied to the fine earth fraction of the soil only and commonly omit
consideration of the coarse fragment content. Data for coarse fragment
content are generally derived from descriptive field observations on
soil layer morphology. Those descriptions generally express the content
of coarse fragments by class values or categories as for example
*‘frequent stones’* indicating an estimated volumetric content of 15–40% according to
the FAO guidelines of 1977 (similar to *‘many stones’* according to
SOTER conventions and the FAO guidelines of 2006). Because coarse
fragment content is most frequently based on generalized visual field
estimates, and is often lacking in legacy soil descriptions, it is not
reasonable to predict or present estimates of coarse fragment content
with a precision greater than 1–5%.
Note that the uncertainty associated with coarse fragment content,
propagated from the field observed class values, has significant impact
on estimations of the volumetric totals of attributes assessed and
mapped for the fine earth fraction (see also
chapter \@ref(SOC-chapter)). Whilst a 1 meter deep
soil, with a bulk density of 1.5 tonne per cubic-metre and an organic
carbon content of 10 g per kg, contains 150 tonnes organic carbon,
a similar soil with bulk density adjusted for the
presence of *‘frequent stones’* contains only 90–127.5 tonnes organic carbon.
Despite the inaccuracy of the data for field observed coarse fragments content,
it is strongly recommended to collect and compile these data as completely as
possible because of their relevance for estimating whole soil bulk
density, total volume and volume of the fine earth fraction alone.
The possible nature (and size) of coarse fragments is highly variable
(quartz, carbonate, iron, basalt) with consequent variable manageability and
variable characteristics such as breakability, solubility, bulk density, etc.
Where the coarse fragment content is dominant (>80%), approaching 100%,
rootability is near nil which is a determinant for the rooting or
effective soil depth and generally also for depth to bedrock. An
estimated global distribution of coarse fragments and soil textures is
given in Fig. \@ref(fig:sprofs-crfvol).
```{r sprofs-crfvol, echo=FALSE, fig.cap="Histogram and soil-depth density distribution for a global compilation of measurements of coarse fragments in percent. Based on the records from WOSIS (http://www.earth-syst-sci-data.net/9/1/2017/). This variable in principle follows a zero inflated distribution.", out.width="90%"}
knitr::include_graphics("figures/Fig_sprofs_CRFVOL.png")
```
### Particle size class distribution: sand, silt and clay
The majority of global soil mapping initiatives elect to predict the
spatial distribution of particle size classes (soil texture) because
texture controls or influences many mechanical, hydrological and
engineering aspects of use of the soil. Soil texture affects how a soil
responds to engineering uses such as construction of roads, buildings,
dams and other structures, how water infiltrates into the soil and is
stored or transmitted through it, how nutrients, chemicals and dissolved
substances adhere to surfaces and are retained or transformed and how
energy and matter enter into the soil and are stored or transmitted
through it. Texture is the fundamental physical and mechanical property
of soils and, as such, it is one of the most widely analysed and
widely reported soil properties.
The size of particles in the soil varies greatly from less than a 2 $\mu$m to
several cm’s and occasionally even meters (boulders). This represents a
range from 1 $\mu$m to 1 million $\mu$m. Generally, particle size distribution has been
simplified through aggregation or classification. The fine earth
fraction (<2 mm) is the soil considered for laboratory analyses. This fine
earth is further subdivided into particle size classes including,
depending on the guidelines or laboratory concerned, fine and coarse
clay, fine and coarse silt and very fine, fine, medium, coarse and very
coarse sand. The three major particle size classes of the fine earth
fraction though are sand, silt and clay. They are generally reported in
units of percent by weight with a precision of ±1%.
Soil texture represents the relative composition of sand, silt, and clay
in soil. The *particle-size class distribution* is usually represented
in a texture diagram, relating the percentages of sand, silt, and clay
(mass percentage of fine earth) to a *texture class* [@Minasny2001AJSR].
Particle size distribution has been defined using a number of systems.
One of the most widely used systems is the USDA Soil Survey Laboratory
Methods Manual [@Burt2004SSIR]. The USDA definition of particle size
classes has also been recommended by FAO for use in the Soil Map of the
World (Fig. \@ref(fig:texture-limits)). The standard reference method for
reporting particle size classes of sand, silt and clay, is as
per the USDA Soil Survey Laboratory Methods Manual [@Burt2004SSIR
p.347]. An estimated global distribution of sand, silt, and clay is
given in Figs. \@ref(fig:sprofs-snd), \@ref(fig:sprofs-slt) and
\@ref(fig:sprofs-cly).
```{r texture-limits, echo=FALSE, fig.cap="Particle size limits used in European countries, Australia and America. Image source: Minasny and McBratney (2001) doi: 10.1071/SR00065.", out.width="85%"}
knitr::include_graphics("figures/Fig_texture_limits_Minasny2001.jpg")
```
```{r sprofs-snd, echo=FALSE, fig.cap="Histogram and soil-depth density distribution for a global compilation of measurements of sand content in percent. Based on the records from WOSIS (http://www.earth-syst-sci-data.net/9/1/2017/).", out.width="90%"}
knitr::include_graphics("figures/Fig_sprofs_SNDPPT.png")
```
```{r sprofs-slt, echo=FALSE, fig.cap="Histogram and soil-depth density distribution for a global compilation of measurements of silt content in percent. Based on the records from WOSIS (http://www.earth-syst-sci-data.net/9/1/2017/).", out.width="90%"}
knitr::include_graphics("figures/Fig_sprofs_SLTPPT.png")
```
```{r sprofs-cly, echo=FALSE, fig.cap="Histogram and soil-depth density distribution for a global compilation of measurements of clay content in percent. Based on the records from WOSIS (http://www.earth-syst-sci-data.net/9/1/2017/).", out.width="90%"}
knitr::include_graphics("figures/Fig_sprofs_CLYPPT.png")
```
The current standard for particle size classes adopted by FAO for use in
the Harmonized World Soil Database is ISO 10390:2005. This standard
differs from the USDA definition in defining the size range for silt as
2–63 $\mu$m instead of 2–50 $\mu$m and sand as 63–2000 $\mu$m instead of
50–2000 $\mu$m. This is a relatively
new standard for FAO which previously adopted the USDA definitions for
the digital soil map of the world [@Nachtergaele2010press]. These
differences in attribute definition cause differences in values reported
for soil particle size classes. Differences in values can also arise
because of differences in method of analysis (e.g. hygrometer, pipette,
laser diffraction, dispersion etc). Most literature on harmonization of soil
texture data deals with harmonizing differences in attribute definitions
or the reported particle size classes
(Fig. \@ref(fig:texture-limits)).
```{block2, type="rmdnote"}
The most commonly used standard for designation of fine earth texture
fractions, used by the *GlobalSoilMap* project, is the USDA
system (sand: 50–2000 $\mu$m, silt: 2–50 $\mu$m, clay: <2 $\mu$m).
```
@Minasny2001AJSR identified two major textural classifications in the
world as the International and USDA/FAO systems (Tbl. \@ref(tab:usdafaotexture)).
The significant difference between these two was the choice of a
threshold value for differentiating silt from sand of 20 $\mu$m for the
International and 50 $\mu$m for the USDA/FAO systems. The new ISO/FAO standard
adds an additional difference by changing the threshold value between
silt and sand from 50 $\mu$m to 63 $\mu$m. Another very important difference in attribute
definition concerns the Russian system which defines the clay fraction
as <1 $\mu$m and the fine earth fraction, or the upper limit of the sand
fraction, at 1 cm instead of 2 cm.
```{r usdafaotexture, echo=FALSE}
knitr::kable(
head(read.csv("extdata/usda_fao_texture_limits.csv", header = TRUE, stringsAsFactors = FALSE)), booktabs = TRUE,
caption = 'Differences between the International, USDA and ISO/FAO particle size classifications.'
)
```
Both @Nemes1999G and @Minasny2001AJSR investigated options for
harmonizing values for sand, silt and clay reported using different
systems for classifying particle size fractions. Using a compilation of
four large databases consisting of a total of 1620 samples,
@Minasny2001AJSR developed a single multiple linear regression model for
converting between silt fraction based on the international standard of
2–20 $\mu$m ($P_{\mathtt{2-20}}$) to the 2–50 $\mu$m range of the USDA standard
($P_{\mathtt{2-50}}$) and vice versa:
\begin{equation}
\begin{cases}
\begin{matrix} \hat P_{\mathtt{2-50}} = & -18.3914 + 2.0971 \cdot P_{\mathtt{2-20}} + 0.6726 \cdot P_{\mathtt{20-2000}} \\
& - 0.0142 \cdot P_{\mathtt{2-20}}^2 - 0.0049 \cdot P_{\mathtt{20-2000}}^2
\end{matrix} & \text{ if } \hat P_{\mathtt{2-50}} > 0 \\ \begin{matrix} \hat P_{\mathtt{2-50}} = & 0.8289 \cdot P_{\mathtt{2-20}} + 0.0198 \cdot P_{\mathtt{20-2000}} \end{matrix} & \text{ if } \hat P_{\mathtt{2-50}} < 0
\end{cases}
(\#eq:P2_50)
\end{equation}
where $P_{\mathtt{20-2000}}$ is the international sand fraction. This
conversion is fairly accurate since the model explains most of the observed variability
in the original values ($R^2$=0.823). Together with the conversion of
the silt fraction is the conversion of the sand fraction.
@Minasny2001AJSR argued that most countries should consider adopting the
particle size limits and texture classes of the USDA system. They noted
that the 2–50 $\mu$m particle size range is usually more useful than the 2–20 $\mu$m range
for estimating water retention in pedo-transfer functions and observed
that translations from one system into another were relatively easy,
given improved computing power and algorithms.
@Nemes1999 [@Nemes1999G] evaluated four different interpolation methods
(log-linear interpolation, fitting a Gompertz curve, spline
interpolation, and similarity method) in order to achieve compatibility
of particle-size distributions within the European soil hydraulic
database HYPRES (http://www.macaulay.ac.uk/hypres/). They introduced a new similarity procedure, which
uses an external reference data set that contains a wide variety of
reference soil materials, each with 7 or 8 measured particle-size
fractions. The procedure involves searching for soil samples in the
external reference data set that match the particle-size distribution of
the soil to be interpolated. From each search. 10 similar soils are
selected that have fractions at the particle size limits similar to the
soil under investigation. The arithmetic mean of the fractions of these
10 soils at the specified particle size limit is calculated and assigned
as the estimate of the fraction for the soil under investigation.
The HYPRES reference database and the similarity procedures applied to
it are appropriate for harmonizing a wide range of soils from a variety
of countries and could be used as one of the main methods in a global
Soil Reference Library. The generic nature of this conversion approach,