(This paper can be freely copied for eductional purposes)
PREFACE
This brief text discusses the fundamental operations
used in computer-assisted map analysis. Also, it provides
an overview of the capabilities contained in pMAP. The analytical
procedures presented are neither database nor application specific.
By organizing the set of processing methods into a mathematical structure, a generalized cartographic modeling framework
is developed. Within this framework, you can logically
sequence primitive analytic operators on map variables in
a manner that is analogous to traditional algebra and descriptive
statistics. This map-ematical approach provides a "tool
box" of analytical procedures in a common, flexible, and intuitive manner.
EXECUTING THE TUTORIALS
By accessing the TUTOR25 database and READing the
appropriate command file (TUTOR1.CMD through TUTOR8.CMD) you
can reproduce the examples of the applications described in this
text. Each tutorial command file includes comments describing
the processing taking place. The TMAP command files (TMAP0.CMD
through TMAP10.CMD) present more advanced analysis and correspond
to the book BEYOND MAPPING: CONCEPTS, ALGORITHMS AND iSSUES
IN GIS," by Joseph K. Berry (GIS World, Inc., Fort Collins, Colorado,
1993). Also included in the set of tutorial files is a demonstration
of map encoding procedures (TUTOR9.CMD). Several additional
command files, denoted as TU-_____.CMD, describe other concepts
and applications. These supplemental command files continually evolve and grow in number. You should periodically
check with Spatial Information Systems for extensions and additions
to the set of tutorials.
After installing the pMAP System, access the TUTOR25
database as described in the READ.ME instructions. When you
receive pMAP's CMD: prompt, enter
CMD: read tutor1.cmd
to begin the first tutorial. This tutorial is designed
to familiarize you with the use of the other tutorials
through a simple application model. The tutorial will PAUSE
to give you time to read the "notes" describing the
processing that has or will take place. When you complete your review of
the notes, press the "Enter" key to continue with
the tutorial. When a map is displayed, it will remain on the screen until
you press the Enter key. While a map is displayed, you should
pop-up its legend by pressing the F5 key. Use the up and down
arrow keys to scroll the legend. Pressing the Enter key will
pop-away the legend. Press "Enter" again to pop-away
the map and continue with the tutorial. TUTOR1.CMD provides practical
experience with these basic instructions.
To activate the other tutorials, specify CMD: read tutor_.cmd
where the appropriate tutorial number is substituted
for the underscore (_). The tutorials corresponding with
the discussion of this brief text are referenced at the appropriate
places within the text. Each tutorial covers a specific
topic and corresponds to the figure of the same number. For
example, the discussion of figure 2 is demonstrated in TUTOR2.CMD.
The topics of the tutorials are as follows:
Basic Tutorials:
Digitized Data Encoding:
The following text discusses the fundamental map analysis operations contained in pMAP. Four classes of operations are described, with each section heading identifying the specific pMAP commands covered. The figure captions identify the tutorial corresponding to the discussion. You can enter the pMAP commands in the figure space using the TUTOR25 database to generate the maps forming the figure. The file TEXT.FIG con tains WordPerfect 5.0 graphics of the "composite" figures. Those individuals with access to WordPerfect can print this file (Shift/F7; use High and paste the figures in the appropriate place in this text.
INTRODUCTION
Information relating to the spatial characteristics of resources has been difficult to incorporate into land planning and management decisions. Manual techniques of map analysis are both tedious and analytically limiting. Computer-assisted geographic information systems (GIS), on the other hand, hold promise for providing capabilities clearly needed for effective decisions.
This need has caused the developing science of map analysis to be somewhat prematurely cast into specific operational contexts. The result has been a growing number of special purpose, expensive systems designed for unique applications. Thi
s situation has severely limited general use of fundamental analytic procedures common to most applications. Recent efforts, however, have attempted to recognize the processing similarities among the many applications as a means to develop more general c
apabilities and theoretical framework. The pMAP system incorporates such an approach and provides advanced analytic capabilities in an extremely flexible and intuitive manner.
Most GIS systems include processing capabilities related to encoding, storage, analysis and display of spatial data. Within this context, the functions of a GIS system may be characterized as computer mapping, database management, spatial st
atistics, and cartographic modeling. Most contemporary systems provide capabilities for mapping and management with limited analytic functionality. The extensive modeling and statistical operations of pMAP make it a unique and powerful tool.
This brief text describes pMAP's set of operations
that relate to the analysis of mapped data in general, and to the
needs of natural resource and environmental planning in particular.
The fundamental map processing operations discussed are
common to a broad range of applications. Among the major classes of
primitive operations identified are those that
By organizing primitive operations in a logical manner, a
generalized cartographic modeling approach can be developed.
This fundamental approach can be conceptualized as "map algebra"
and "spatial statistics" in which entire maps are treated as
variables. In this context, primitive map analysis operations
can be seen as analogous to traditional mathematical and
statistical operations. The sequencing of pMAP operations is
similar to the algebraic solution of equations to find unknowns.
In this case, however, the unknowns represent whole maps. This
familiar structure is particularly effective in providing spatial
analysis techniques to individuals with limited experience in
computers and geographic information processing.
FUNDAMENTAL CONSIDERATIONS
To use primitive operations in a modeling context, two
fundamental considerations must be met: 1) a common data
structure, and 2) a flexible processing structure. The variety
of mappable characteristics associated with any given geographic
location may be organized as a series of spatially registered
computer-compatible maps, or "overlays." An overlay is simply a
special form of a geographic map in which each cartographic
location has one and only one thematic attribute (i.e., thematic
categories are mutually exclusive). By contrast, a composite
map, such as a conventional USGS topographic sheet, has each
location characterized simultaneously by several themes
(elevation, water features, vegetation, political designation,
etc.). In overlay format, each individual map theme is stored as
a separate data plane, in the same manner that color separation
negatives are used in the printing process. For convenience, the
following discussion uses the terms "overlay" and "map"
interchangeably.
As a data structure, the title, certain descriptive parameters,
and set of categories, termed "regions," comprise each map. A
region is simply one of the thematic designations on a map used
to characterize geographic locations. An overlay of water bodies
titled WATER, for example, might include regions associated with
dry land, streams, ponds and lakes. Each region is represented
by a name (i.e., a label) and a numerical value. The structure
so far does not account for locational characteristics. The
handling of locational information is not only what distinguishes
geographic information processing from other types of automated
data processing, but is also what distinguishes one GIS system
from another.
The two basic approaches in representing locational information
are 1) polygon, or "vector," and 2) gridded, or "raster." The
vector approach stores information about the boundaries between
regions; the raster, information on the interiors of regions.
While these differences are significant in terms of
implementation strategies and may vary considerably in terms of
geographic precision, they need not affect the definition of a
set of fundamental analytical techniques. In light of its
conceptual and processing simplicity, the gridded structure is
best suited for map analysis. Therefore, it forms the primary
data structure of pMAP. Line segment data, however, may be used
for input and then converted to cells for processing (see
Appendix B). Also, lines can be superimposed over pMAP output
graphics.
The commonly used gridded data structure, termed "regular
rectangular grid," is based on the condition that numbered rows
and columns define all spatial locations. The smallest
addressable unit of space corresponds to a square parcel of land,
a "point," or individual cell. Assigning numerical values to all
points within a particular "region," or map category, represents
spatial patterns. Also, this structure allows each point to be
addressed as part of a "neighborhood," or adjacent cells, of
surrounding values. Figure A-1 illustrates the relationships
among values, points, regions, and neighborhoods for the
COVERTYPE overlay used in several of the following figures. In
the upper-right portion of the figure is a display of the values
stored at each location.
To generate a hardcopy of this figure, print
TEXT.FIG
on a HP Laserjet III or compatible printer using WordPerfect 5.0 or compatible word processing program.
To generate an interactive screen version of this figure using pMAP, enter
C:\PMAP> pmap tutor25
If primitive operations are to be flexibly combined, a processing structure that accepts input and generates output in the same format must be used. Using the data structure outlined above, this may be accomplished by requiring that each analytic operation involve
In the raster structure of pMAP, this processing involves the efficient reading, computing and writing of matrices of numbers. Appendix C describes the data structure in more detail.
The cyclical nature of this processing structure is similar to the evaluation of "nested parentheticals" in traditional algebra.
A computerized GIS consists of a database of spatially registered maps and procedures for the encoding, storage, analysis, and display of mapped data. The cyclical processing of cartographic modeling involves retrieving one or more maps from the database that then are used to create a new map. This new map becomes part of the database and is available for subsequent processing. In pMAP you control the processing by entering a series of command sentences.
The logical sequencing of primitive operations on a set of maps forms a cartographic model of specified application. As with traditional algebra, fundamental techniques involving several primitive operations can be identified (e.g., a "proximity" map) that are applicable to numerous situations. The use of primitive analytical operations in a generalized modeling context accommodates a variety of analyses in a common, flexible, and intuitive manner. Review of the TUTOR1.CMD tutorial provides experience in this process as it describes a simple cartographic model.
RECLASSIFYING MAPS:
CLUMP, COMPUTE (constants), CONFIGURE, RENUMBER, SIZE, SLICE
The first and most fundamental class of analytical operations involves the reclassification of map categories. Each operation entails the creation of a new map by assigning thematic values to categories of an existing overlay. The initial value, position, contiguity, size, or shape of the spatial configuration associated with each category determines the values to be assigned. All reclassification operations involve the simple repackaging of information on a single overlay and do not result in new boundary delineations of regions. Such operations can be thought of as the purposeful "recoloring" of maps.
Figure A-2 shows an example of reclassifying a map as a function of its initial thematic values. The thematic attributes of a map are simply its legend categories expressed as a list of names and numbers. For display, a unique symbol can be associated with each thematic value. In figure A-2, the COVERTYPE map has categories of OPEN WATER, MEADOW and FOREST, stored as thematic values 1, 2, and 3. These values can be displayed as the colors blue, brown and green, shown as increasing greytones. A wide array of character-based symbols, raster patterns, and color codes can be used for graphic presentation. To create a binary map that isolates the meadow as in figure A-2, simply assign to the open water and forest areas the value 0 that corresponds to the graphic symbol (blank), retaining the value 2, and its symbol, for meadows.
To generate a hardcopy of this figure, print TEXT.FIG on a HP Laserjet III or compatible printer using WordPerfect 5.0 or compatible word processing program.
To generate an interactive screen version of this figure using pMAP, enter C:\PMAP> pmap tutor25 CMD: display covertype CMD: renumber covertype for meadow assign 0 to 1 assign 0 to 3 CMD: display meadow
RENUMBER. A similar reclassification operation might involve the ranking or weighing of qualitative map categories to generate a new map with quantitative values. A map of soil types, for example, can be assigned values that indicate the relative suitability of each soil type for residential development.
Quantitative values can also be reclassified to yield new quantitative values. This reclassification might simply involve a specified reordering of map categories. For example, a given map of soil moisture content could be used to generate a map of suitability levels for plant growth.
Reclassification operations can also relate to locational attributes (e.g., position) associated with a map, as well as thematic attributes. A map category represented by a single "point" location, for example, might be reclassified according to its latitude and longitude. Similarly a linear or areal feature (a feature with area, not just length) could be reassigned values indicating its center of gravity or orientation.
SLICE. A reclassification also could involve the application of a generalized reclassifying function such as "level slicing" that splits a continuous range of map category values into discrete intervals (e.g., the derivation of a contour map from a map of topographic elevation values involves slicing the range of elevation values into uniform groups, such as 200-foot intervals).
COMPUTE USING CONSTANTS. Other quantitative reclassification functions include a variety of arithmetic operations involving map category values and a specified or computed constant. Among these operations are addition, subtraction, multiplication, division, exponentiation, normalization, and other scaler mathematical and statistical operators. For example, a map of topographic elevation expressed in feet may be converted to meters by multiplying each map value by the appropriate conversion factor of 3.28083 feet per meter.
CLUMP. Another reclassifying operation, termed "parceling," characterizes category contiguity. This procedure identifies individual "clumps" of one or more points that have the same numerical value and are geographically contiguous (e.g., generation of a map identifying each lake as a unique value from a generalized water map representing all lakes as a single category value).
SIZE. Size is another locational characteristic. In the case of map categories associated with linear features or point locations, the basis for reclassifying those categories might be overall length or number of points. Similarly, a map category associated with a planar area can be reclassified according to its total acreage or the length of its perimeter. For example, an overlay of surface water may be reassigned values to indicate the areal extent of individual lakes or the length of stream channels. This same technique also can be used to deal with volume. Given a map of depth to bottom for a group of lakes, each lake can be assigned a value indicating total water volume based on the areal extent of each depth category.
CONFIGURE. Another basis for reclassifying map categories is shape. Categories represented by point locations have measurable "shapes" insofar as the set of points imply linear or areal forms (just as stars imply constellations). Shape characteristics associated with linear forms identify the patterns formed by multiple line segments (e.g., dendritic stream pattern). Primary shape characteristics associated with areal forms include spatial integrity, boundary convexity, and nature of edge. Spatial integrity of an area describes the degree in which a feature is broken into numerous "fragments" and/or contains interior "holes." The Euler number summarizes the relationship among fragments and holes. This statistic is computed as the number of holes within a feature less the number of fragments making up the entire feature minus one. A zero Euler number indicates spatially balanced features; larger negative or positive numbers indicate less spatial integrity.
Other shape characteristics, convexity and edge, address the boundary configuration of areal features. Convexity is the measure of the extent to which an area is enclosed by its background relative to the extent to which the area encloses the background. The "convexity index" is computed by the ratio of a feature's perimeter to its area. The most regular configuration is that of a circle that is totally convex and, therefore, not enclosed by the background at any point along its boundary. Comparison of a feature's computed convexity with that of a circle of the same area results in a standard measure of boundary regularity.
The nature of the boundary at each edge location can be used for a detailed descriptive boundary configuration. At some locations the boundary could be an entirely concave intrusion; others may be at entirely convex protrusions. Depending on the "degree of edginess," each point can be assigned a value indicating the actual boundary configuration at that location.
This explicit use of cartographic shape as an analytic parameter is unfamiliar to most users. However, implicit in any visual assessment of mapped data is a nonquantitative consideration of shape. The potential for applying quantitative shape analysis techniques in the areas of digital image classification and wildlife habitat modeling is particularly promising. A map of forest stands might be reclassified such that each stand is characterized according to the amount of forest edge relative to total acreage and the frequency of interior forest canopy gaps. Those stands with a large proportion of edge and a high frequency of gaps generally indicate better wildlife habitat for many species.
OVERLAYING OPERATIONS:
AVERAGE, COMPOSITE, COMPUTE, COVER, CROSSTAB, INTERSECT
Operations for overlaying maps begin to relate to both the spatial and the thematic nature of cartographic information. The general class of overlay operations can be characterized as "light-table gymnastics." This class of operations involves the creation of a new map in which the value assigned to every point, or set of points, is a function of the independent values associated with that location on two or more existing maps.
In a simple "location-specific" overlay, the value assigned is a function of the point-by-point aligned coincidence of the existing overlays. In a "category-wide" composite, values are assigned to entire thematic regions as a function of the values associated with those regions on other maps. The first overlay approach conceptually involves only the vertical spearing of a set of overlays. The latter approach uses one overlay to identify boundaries from which information is extracted in a horizontal summary fashion from another map(s).
Figure A-3. Point-by-point overlay is location specific.
Each map location is assigned a unique value identifying the
cover type and slope conditions occurring at that location.
(see TUTOR3.CMD)
COMPUTE AND AVERAGE. A specific function used to compute new map
category values from those of existing maps being overlaid may
vary according to the nature of the data being processed and the
specific use of that data within a modeling context. Most
typical environmental analyses are those involving the
manipulation of quantitative values to generate new values that
are also quantitative in nature. Among these manipulations are
the basic arithmetic operations such as addition, subtraction,
multiplication, division, roots, and exponentiation. For
example, given maps of assessed land values in 1970 and 1990,
a map that shows the percent change in land values over that
period can be expressed in pMAP software syntax as follows:
Functions related to simple statistical parameters (maximum,
minimum, median, mode, majority, standard deviation, or weighted
average) may also be applied in this manner. Type of data being
manipulated dictates the appropriateness of the mathematical or
statistical procedure used. For example, the addition of
qualitative maps, such as soils and land use, would result in
meaningless sums because their numeric values have no
mathematical relationship.
Other map overlay techniques can be used to process either
quantitative or qualitative values and to generate like values.
Among these are masking, comparison, calculation of diversity,
and reclassification of cross-tabulated map categories. (Figure
A-3 shows an example of the latter.)
More complex statistical techniques can also be applied in this
manner assuming that the inherent interdependence among spatial
observations can be taken into account. This approach treats
each map as a variable, each point as a case, and each value as
an observation. A predictive statistical model can then be
evaluated for each location, resulting in a spatially continuous
surface of predicted values. The mapped predictions contain
additional information over traditional nonspatial procedures
such as direct consideration of coincidence among regression
variables and the ability to locate areas of a given prediction
level.
INTERSECT AND CROSSTAB. Two maps also can be combined on a
point-by-point basis depending on the pairwise combination of
values. This procedure is, in effect, a two map "renumber." A
new number is assigned to all joint occurrences that a user
specifies. Figure A-3 shows the results of user assigned values
to specified pairs of values on the COVERTYPE and SLOPE maps. A
related operation can be used to generate a crosstab table which
reports statistics on all pairwise combinations between two maps.
COVER. Covering, or "masking," one map with another identifies a
process in which values on one map are replaced with those from
another map. This process is similar to manual map overlaying
using acetate sheets. The "clear" areas on the top map allow the
information from the bottom map to show through. In computer
processing, the value zero is used to represent clear areas and
the values at such a locations are replaced (show through). If a
nonzero value is present, the other maps value does not replace
the current value.
COMPOSITE. An entirely different approach to overlaying maps
involves category-wide summarization of values. Rather than
combining information on a point-by-point basis, this group of
operations summarizes the spatial coincidence of entire
categories of two or more maps. Figure A-4 shows an example of a
category-wide overlay operation using the same input maps as
those in figure A-3. In this example, the categories of the
COVERTYPE map are used to define an area for which the
coincidental values of the SLOPE map are averaged. The computed
values of average slope are then assigned to the cover-type
categories (AVERAGE-SLOPE). Note that the Forest category has
the highest average slope of 29%.
Summary statistics that can be used in this way include the
total, average, maximum, minimum, median, mode, or minority
value; the standard deviation, variance, or diversity of values;
To generate a hardcopy of this figure, print
TEXT.FIG
on a HP Laserjet III or compatible printer
using WordPerfect 5.0 or compatible word
processing program.
Figure A-4. Category-wide overlays summarize the spatial
coincidence of areal features. Each of the three cover
types (OPEN WATER, MEADOW, FOREST) are assigned a value
equal to their average slopes. In computing these averages
the SLOPE values occurring within each COVERTYPE boundary
are averaged separately. (see TUTOR4.CMD)
and the correlation, deviation, or uniqueness of particular value
combinations. For example, a map indicating the proportion of
undeveloped land within each of several counties can be generated
by superimposing a map of county boundaries on a map of land use
and then computing the ratio of undeveloped land to total land
area for each county. As with location-specific overlay
techniques, data types must be consistent with the summary
procedure used. The order of processing is also important.
Operations such as addition and multiplication are independent of
the order of processing. Other operations such as subtraction
and division yield different results depending on the order in
which they are processed. This latter type of operation, termed
noncommutative, cannot be used for category-wide summaries.
MEASURING DISTANCE AND CONNECTIVITY: DRAIN, RADIATE, SPAN,
SPREAD, STREAM
Most geographic information systems contain analytic capabilities
for reclassifying and overlaying maps. These operations address
the majority of applications that parallel manual map analysis
techniques. To more fully integrate spatial considerations into
decision-making, however, new techniques are emerging. The
concept of distance historically has been associated with the
"shortest straight line between two points." This measure is
both easily conceptualized and implemented with a ruler but is
frequently insufficient in a decision-making context. A straight
line route may indicate the distance "as the crow flies" but
offer little information for a walking or hitchhiking crow, or
other flightless individual. Equally important to most travelers
is the measurement of distance expressed in more relevant terms
such as time or cost. The group of operations concerned with
measuring distance, therefore, are best characterized as "rubber
rulers."
The basis of any system for the measurement of distance requires
two components: a standard measurement unit and a measurement
procedure. The measurement unit used in most computer-oriented
systems is the "grid space" defined by superimposing an imaginary
uniform grid over a geographic area. The distance from one
location to another is computed as the number of intervening grid
spaces. The measurement procedure always retains the requirement
of the "shortest" connection between points, although the
"straight-line" requirement may be relaxed.
SPREAD. A frequently employed measurement procedure involves
expanding the concept of distance to one of proximity. Rather
than sequentially computing the distance between pairs of
locations, concentric equidistant zones are established around a
location, or set of locations. In effect, a map of proximity to
a "target" location is generated that indicates the shortest
straight-line distance from any map cell to the nearest target
area. Figure A-5(a) is an example of a map, SIMPLE-PROXIMITY,
that indicates the shortest straight line distance from the RANCH
to all other locations.
Within many application contexts, the shortest route between two
locations may not always be a straight line. Even if it is
straight, the Euclidean length of that line may not always
To generate a hardcopy of this figure, print
TEXT.FIG
on a HP Laserjet III or compatible printer
using WordPerfect 5.0 or compatible word
processing program.
To generate an interactive screen version of
this figure using pMAP, enter
C:\PMAP> pmap tutor25
Figure A-5. Distance between locations can be determined as
Euclidean length or as a function of the effect on implied
movement of absolute or relative barriers. Inset (a)
identifies equidistant zones around the ranch. Inset (b) is
a travel-time map identifying time zones of hiking from the
ranch. This travel-time map was generated by considering
the relative ease of travel through various cover type and
slope conditions (COVER-SLOPE) where flat meadows are the
fastest to traverse, steeply forested areas intermediate,
and flat water slowest. (see TUTOR5.CMD)
reflect a meaningful measure of distance. Distance in these
applications is best defined in terms of movement expressed as
traveltime, cost, or energy consumed at rates that vary over
time and space. Distance-modifying effects may be expressed
cartographically as "barriers" located within the space in which
the distance is being measured. This implies that distance is
the result of some sort of movement over that space and through
those barriers. How these barriers affect the implied movement
identifies two major types. "Absolute barriers" are those that
completely restrict movement and imply an infinite distance
between the points they separate unless a path around the barrier
is available. A river might be regarded as an absolute barrier
to a nonswimmer. A swimmer or a boater, however, might regard
the same river as a relative, rather than an absolute, barrier.
"Relative barriers" are ones that are passable although only at a
cost equated with an increase in effective distance.
Figure A-5(b) shows a map, WEIGHTED-PROXIMITY, of hiking time
around the target location identified as the ranch. The map was
generated by reclassifying the various cover/slope categories on
the COVER-SLOPE map (figure A-3) in terms of their relative ease
of foot-travel. The example uses two types of barriers. The
lake is treated as an absolute barrier completely restricting
hiking. The land areas represent relative barriers to hiking
indicating varied impedance to movement for each point as a
function of the cover/slope conditions occurring at that
location. Consider a similar example. Movement by automobile
may be effectively constrained to a network of roads (absolute
barriers) of varying speed limits (relative barriers) to generate
a riding travel-time map. An example from an even less
conventional perspective might be a weighted distance expressed
in terms of accumulated cost of powerline construction from an
existing trunkline to all other locations in a study area. The
cost surface developed can be a function of a variety of social
and engineering factors, such as visual exposure and adverse
topography, expressed as absolute and/or relative barriers.
The ability to move, whether physically or abstractly, may vary
as a function of the implied movement and the static conditions
at a location. Direction is one aspect that may affect the
ability of a barrier to restrict that movement. A topographic
incline, for example, generally impedes hikers differently
according to whether their movement is uphill, downhill, or
across slope. Accumulation is another possible modifying factor.
After hiking a certain distance, "molehills" tend to become
disheartening "mountains," and movement is more restricted.
Momentum, or speed, is a third factor that may dynamically alter
the effect of a barrier. A Volkswagen that has to stop for a red
light on a steep hill may not be able to resume movement;
however, if it were allowed to maintain its momentum (e.g., green
light), it could easily reach the top. Similarly a highway
impairment that effectively reduces traffic speeds from 55 to 40
miles per hour has little or no effect during rush hour when
traffic is already moving at a much slower speed.
Another distance-related class of operations is concerned with
the nature of connectivity among locations on an overlay.
Fundamental to understanding these procedures is the
conceptualization of an "accumulation surface." If the thematic
value of a simple proximity map from a point is used to indicate
the third dimension of a surface, a uniform bowl would be
depicted. The surface configuration for a weighted proximity map
would have a similar appearance, but the bowl would be warped
with numerous ridges and pinnacles. Also the nature of the
surface is such that it cannot contain saddle points (i.e., false
bottoms). This bowl-like "topology" is characteristic of all
accumulation surfaces and can be conceptualized as a warped
football stadium with each successive ring of seats identifying
concentric, equidistant halos about its center (the playing
field).
Figure A-6(a) shows the surface configuration of the hiking
WEIGHTED-PROXIMITY map from the previous figure. The accumulated
distance surface is shown as a perspective plot in which the
ranch is the lowest location and all other locations are assigned
progressively larger values of the shortest, but not necessarily
straight, distance to the ranch. When viewed in perspective,
this surface resembles a topographic surface with familiar
valleys and hills. In this case, however, the highlands indicate
areas that are effectively farther away from the ranch. The
tallest protrusions are the Open Water features forming absolute
barriers in which traveltime is infinitely far away. This plot
was created by transferring the WEIGHTED-PROXIMITY map to a 3-D
graphics package for plotting (see Appendix B).
STREAM. In the case of simple distance, "connectivity," or the
delineation of paths, locates the shortest straight line between
two points considering only two dimensions. The STREAM command
traces the steepest downhill path from a location on a complex
three-dimensional surface. The steepest downhill path along a
topographic surface indicates the route of surface runoff. For a
surface represented by a travel-time map, this technique traces
the optimal (e.g., the shortest or quickest) route. In effect,
this route traces the path of the distance "wave front" that
arrived first. Figure A-6(a) indicates the optimal hiking path
(BESTPATH) from a nearby cabin to the ranch as the steepest
downhill path over the accumulated hiking-time surface (WEIGHTED-
PROXIMITY). If an accumulation cost surface is considered, such
as the cost surface for powerline construction described earlier,
the minimum cost route will be located.
DRAIN. If powerline construction to a set of dispersed locations
is simultaneously considered, an "optimal path density" map can
be generated that identifies the number of individual optimal
paths passing through each location from the dispersed termini to
the trunkline. Such a map would be valuable in locating major
"feeder" lines (i.e., areas of high optimal path density)
attaching to a central trunkline.
To generate a hardcopy of this figure, print
TEXT.FIG
on a HP Laserjet III or compatible printer
using WordPerfect 5.0 or compatible word
processing program.
To generate an interactive screen version of
this figure using pMAP, enter
C:\PMAP> pmap tutor25
Figure A-6. Connectivity operations characterize the nature
of spatial linkages between locations. Inset (a) delineates
the shortest (i.e., the quickest) hiking route from the
cabin to the ranch. The route traces the steepest downhill
path along a cumulative hiking-time surface. Inset (b)
identifies the viewshed of the ranch. Topographic relief
and forest cover act as absolute barriers when establishing
visual connectivity. (see TUTOR6.CMD)
SPAN. Another connectivity operation determines the narrowness
of features. The narrowness at each point within a map feature
is defined as the length of the shortest line segment (i.e.,
cord) that can be constructed through that point to diametrically
opposing edges of the feature. The result of this processing is
a continuous map of features with lower values indicating
relatively narrow locations. For example, small values on a
narrowness map of forest stands would indicate interior locations
with easy access to edges.
RADIATE. The process of determining viewsheds involves
establishing intervisibility among locations. Straight lines in
three-dimensional space connect locations forming the viewshed of
an area to the "viewer" location or set of viewers. Topographic
relief and surface objects form absolute barriers that preclude
visual connectivity. If multiple viewers are designated, such as
a road network, locations within the viewshed can be assigned a
value indicating the number, or density, of visual connections to
the "extended eyeball." A "weighted visual density" map can be
generated by assigning relative weights indicating traffic flow
for each road type. Figure A-6(b) shows a map of the simple
viewshed of the ranch considering the terrain and forest canopy
height as visual barriers.
CHARACTERIZING NEIGHBORHOODS: INTERPOLATE, ORIENT, PROFILE,
SCAN, SLOPE
The fourth and final group of operations includes procedures that
create a new map in which the value assigned to a location is
computed as a function of independent values within a specified
distance and direction around that location (i.e., its
cartographic neighborhood). This general class of operations can
be conceptualized as "roving windows" moving throughout the
mapped area. The summary of information within these windows can
be based on the configuration of the surface (e.g., slope and
aspect) or the statistical summary of thematic values.
Establishing neighborhood membership is the initial step in
characterizing cartographic neighborhoods. The set of points
that lies within a specified distance and direction around a
target location uniquely defines the neighborhood, or roving
window, for that location. In most applications, the window has
a uniform geometric shape and orientation (e.g., a circle or
square). As noted in the previous section, however, the distance
may not necessarily be Euclidean or symmetrical. Examples of
effective neighborhoods include "downwind" locations within a
quarter mile of a smelting plant, or a neighborhood of "the ten-
minute drive" along a road network.
The summary of information within a neighborhood can be based on
the surface configuration implied by the set of values occurring
within the window. This is true of operations that measure
topographic characteristics such as slope, aspect, or profile,
from elevation values.
SLOPE AND ORIENT. One such technique involves the "least squares
fit" of a plane to adjacent elevation values. This process is
similar to fitting a linear regression line to a series of points
expressed in two-dimensional space. The inclination of the plane
denotes terrain slope; its orientation characterizes the aspect.
The window is successively shifted over the entire elevation map
to produce a continuous slope map (inclination) or aspect map
(direction).
Figure A-7(a) shows the derived ASPECT map for the area shown in
previous figures. Note most of the area is generally oriented to
the west. In computing this map, a 3x3 window was moved about
the ELEVATION map of the area, and the direction of the best-
fitted plane was assigned. The inclination of the plane would
indicate terrain steepness, or "% slope."
A "slope map" of any surface represents the first derivative of
that surface. For an elevation surface, slope depicts the rate
of change in elevation. For an accumulation cost surface, its
slope map represents the rate of change in cost (i.e., a marginal
cost map). For a traveltime overlay, its slope map indicates
relative change in speed and its "aspect map" identifies
direction of travel at each location. Also the slope map of an
existing topographic slope map (i.e., second derivative)
characterizes surface roughness (i.e., areas where slope is
changing).
PROFILE. The creation of a "profile map" uses a window defined
as the three adjoining points along a straight line oriented in a
particular direction. Each set of three values can be regarded
as defining a cross-sectional profile of a small portion of a
surface. Each line is successively evaluated for the set of
windows along that line. This procedure may be conceptualized as
slicing a loaf of bread, then removing each slice and
characterizing its profile (as viewed from the side) in small
segments along its upper edge.
The center point of each three-member neighborhood is assigned a
value indicating the profile form at that location. The assigned
value can identify a fundamental profile class (e.g., inverted
"V" shape indicative of a ridge) or the magnitude, in degrees, of
the "skyward angle" formed by the intersection of the two line
segments of the profile. The result of this operation is a
continuous map of a surface's profile as viewed from a specified
direction. Depending on the resolution of an elevation map, its
profile map could be used to identify gullies or valleys running
east-west ("V" shape as viewed from the east or west profile) or
depressions ("V" shape in multiple orthogonal profiles).
The previous discussion identified several operations which
characterize the neighborhood by the configuration of the implied
surface. Another group of neighborhood operations summarizes
thematic values contained in the neighborhood.
To generate a hardcopy of this figure, print
TEXT.FIG
on a HP Laserjet III or compatible printer
using WordPerfect 5.0 or compatible word
processing program.
To generate an interactive screen version of
this figure using pMAP, enter
C:\PMAP> pmap tutor25
Figure A-7. Neighborhood operations characterize map
locations by summarizing the attributes of their surrounding
locations. Inset (a) is a map of topographic aspect
generated by successively fitting a plane to neighborhoods
of adjoining elevation values. Inset (b) Map of cover-type
diversity generated by computing the number of different
cover types in the immediate vicinity of each map location.
(see TUTOR7.CMD)
SCAN. The simplest operations involve the calculation of summary
statistics associated with the map categories occurring within
each neighborhood. These statistics might include maximum income
level, minimum land value, or diversity of vegetation types
within a quarter-mile walking radius of each target point.
Figure A-7(b) shows the covertype diversity occurring within the
immediate vicinity of each map location. Other thematic
summaries might include the total, average, standard deviation,
or median value occurring within each neighborhood; the standard
deviation or variance of those values; or the difference between
the value occurring at a target point itself and the average of
those surrounding it.
None of the neighborhood characteristics described so far relate
to the amount of area occupied by the map categories within each
neighborhood. Similar techniques can be applied, however, to
characterize neighborhood values weighted according to spatial
extent (e.g., total land value, on a per-acre basis, within three
miles of each target point). This consideration of size of the
neighborhood components enables several additional neighborhood
statistics including:
Mode, the value associated with the greatest proportion of
neighborhood areas
Minority value, the value associated with the smallest
proportion of a neighborhood area
Uniqueness, the proportion of the neighborhood area
associated with the value occurring at the target point
itself
INTERPOLATE. Another locational attribute that can be used to
modify thematic summaries is the cartographic distance from the
target point. While distance has already been described as the
basis for defining a neighborhood's absolute limits, it can be
used also to define the relative weights of values within the
neighborhood. Noise level, for example, may be measured
according to the inverse square of the distance from surrounding
sources. The azimuthal relationship between neighborhood
location and the target point can be used also to weight the
value associated with that location.
In conjunction with distance weighing, weighing enables a variety
of spatial sampling and interpolation techniques. For example,
"weighted nearest-neighbors" interpolation of lake bottom
temperature assigns a value to an unsampled location of the
distance-weighted average temperature of a set of sampled points
within its vicinity. Spatial interpolation is routinely used to
create maps from point sampled data.
CARTOGRAPHIC MODELING
The preceding discussion developed a topology of fundamental map
processing procedures and described a set of reclassifying,
overlaying, distance, and neighborhood operations common to a
broad range of higher techniques of map analysis. By
systematically organizing these primitive operations, the basis
for a generalized cartographic modeling approach is identified.
This approach accommodates a variety of analytic procedures in a
common, flexible, and intuitive manner that is analogous to the
mathematical structure of conventional algebra.
Figure A-8 outlines an example of the way fundamental map
processing operations can be combined to perform more complex
analyses. The flowchart structure indicates the logical
sequencing of operations on the mapped data that progresses to
the desired final map. This simplified cartographic model
depicts the siting of the optimal corridor for a highway
considering only two criteria: an engineering concern to avoid
steep slopes and a social concern to avoid visual exposure.
To generate a hardcopy of this figure, print
TEXT.FIG
on a HP Laserjet III or compatible printer
using WordPerfect 5.0 or compatible word
processing program.
Figure A-8. This simplified cartographic model depicts the
siting of "best" route for a highway considering only two
criteria: an engineering concern to avoid steep slopes and a
social concern to avoid visual exposure. In a manner
similar to conventional algebra, this process uses a series
of pMAP operations (indicated by lines) to derive
intermediate maps (indicated by boxes), leading to a final
map of the best route. (see TUTOR8.CMD)
Given maps of topographic ELEVATION and of LANDUSE, the model
allocates a minimum-cost highway alignment between two
predetermined termini isolated on the maps named THISPLACE and
THATPLACE. Cost is not measured in dollars, but in terms of
locational criteria. The right side of the model develops a
"discrete cost surface" (the COST map) in which each location is
assigned a relative cost based on the steepness/exposure
combination occurring at each location. For example, flat areas
that are also not visible from houses might be assigned low
values; areas on steep slopes and visually exposed would be
assigned high values. This discrete cost surface is used as a
map of relative barriers for establishing an accumulated cost
surface (the COSTZONES map) from one terminus to all other
locations within the mapped area. The final step locates the
other terminus on the accumulated cost surface and identifies the
minimum cost route as the steepest downhill path along the
COSTZONES surface from that point to the bottom (i.e., the other
end point). An expanded model is implemented and explained in
more detail in the TUTOR8.CMD tutorial command file.
In addition to the benefits of efficient data management and
automated cartographic procedures, the modeling structure of
computer-assisted map analysis has several other advantages.
Foremost among these is the capability of "dynamic simulation"
(i.e., spatial "what if" analysis). For example, the highway
siting model could be executed for several different relative
weightings of the engineering and social criteria. What if the
terrain steepness is more important? What if the visual exposure
is twice as important? Where does the minimum cost route change,
or, just as important, where does it not change? From this
perspective, the model "replies" to user inquiries, rather than
"answering" them: the processing provides information for
decision making, rather than making tacit decisions.
Flexibility is another advantage of cartographic modeling. New
considerations can be added easily and existing ones refined.
For example, nonavoidance of open water bodies in the highway
model is a major engineering oversight. In its current form, the
model favors construction on lakes because they are flat and
frequently visually hidden. This new requirement can be
incorporated by identifying open water bodies as absolute
barriers (i.e., infinite cost) when constructing the accumulation
cost surface. The result will be routing of the minimal cost
path around these areas of extreme cost.
Cartographic modeling also provides an effective structure for
communicating both specific application considerations and
fundamental principles. The flowchart structure provides a
succinct format for communicating the processing considerations
of specific applications. Model logic, assumptions, and
relationships are readily apparent in the "box and line"
schematic of processing flow. The instructional approach in this
section developed a general mathematical framework and then
elaborated on specific primitive operations within this
structure. Numerous examples identified the operations with
potential applications. Such a deductive approach establishes a
comprehensive understanding of concepts that stimulates the
development of applications.
CONCLUSION
Land management has always required spatial information as its
cornerstone process. Physical description of a management unit
allows the conceptualization of its potential usefulness and
constrains the list of possible management practices. Once a
decision has been made and a plan implemented, additional spatial
information is needed to evaluate its effectiveness. This strong
allegiance between spatial information and effective management
policy has become increasingly apparent with the exploding
complexity of issues confronting our world's communities.
Statistical sampling designs involving geographic stratification
have attempted to reduce the complexity of spatial data. This
aggregation retains the quantitative aspects of the data
necessary for most decision-making models but lacks spatial
continuity. Traditional drafting-oriented procedures retain the
spatial continuity of the data but are nonquantitative and
difficult to incorporate into the decision-making process.
Neither approach has capabilities for sophisticated analysis of
complex spatial interrelationships.
The analytical process of both conventional statistical and
drafting approaches inherently are limiting. The traditional
statistical approach seeks to constrain the spatial variability
within the data. However, it is the spatially induced variance
of mapped data and their interactions that most often concern the
land manager. The drafting approach, while spatially precise, is
limited by the laborious procedures involved. In most instances,
a final drafted map represents an implicit decision considering
only a few policy strategies rather than the presentation of
useful information to be assimilated in the decision process.
Digital cartographic modeling using fundamental map analysis
operations addresses the needs of today's professionals. This
approach, embodied in pMAP, identifies primitive operations that
are independent of data and application. Similar to the
structure of traditional algebra, these operations are logically
sequenced on map variables to form complex analytical techniques.
These techniques are, in turn, logically sequenced to develop
models that address specific applications. The maps used in an
analysis specify the unique spatial character of a particular
area under evaluation. The model structure expresses the
interrelationships incorporated in the analysis and documents the
factors and assumptions used in the process.
Spatial statistics and cartographic modeling are expressions of
spatially consistent data. The techniques also allow for ease of
model modification and the ability to rapidly simulate a variety
of possible strategies. Within this analytic context, mapped
data truly becomes spatial information. The Professional Map
Analysis Package (pMAP) provides the analytic sophistication
needed for effective planning and management in today's complex
decision-making environment.
Note: BEYOND MAPPING: CONCEPTS, ISSUES AND ALGORITHMS IN GIS,"
by Joseph K. Berry (GIS World,Inc., Fort Collins, Colorado, 1993)
is a collection of articles from the byline of the same title in
GIS World magazine.
This book elaborates on many of the concepts presented in this
brief text and is recommended for further study. Contact GIS
World, 155 E. Boardwalk Drive, Suite 250, Fort Collins, Colorado,
USA 80525; phone (303) 223-4848, Fax (303) 223-5700.
To generate an interactive screen version of
this figure using pMAP, enter
C:\PMAP> pmap tutor25
CMD: display slope
CMD: renumber slope for s-classes
assign 0 to 1 thru
assign 0 to 3
assign
CMD: display s-classes
CMD: display covertype
CMD: intersect covertype with s-classes
completely for cover-slope
CMD: display cover-slope
COMPUTE 1990.MAP MINUS 1970.MAP FOR CHANGE.MAP
COMPUTE CHANGE.MAP TIMES 100 /
DIVIDEDBY 1970.MAP FOR PERCENT.CHANGE
To generate an interactive screen version of
this figure using pMAP, enter
C:\PMAP> pmap tutor25
CMD: display covertype
CMD: display slope
CMD: composite covertype with slope
average for avg-slope
Inset (a)
CMD: display locations
CMD: renumber locations for ranch
assign 0 to 2 thru 5
CMD: spread ranch to 35 for simple-prox
Inset (b)
CMD: display cover-slope (see Fig. A-3)
CMD: renumber cover-slope for friction
assign
assign
assign
assign
CMD: spread ranch to 75 thru friction
for weighted-prox
Inset (a)
CMD: display locations
CMD: renumber locations for cabin
assign 0 to 1
assign 0 to 3 thru 5
CMD: display weighted-prox (see Fig 5)
CMD: stream cabin over weighted-prox
for bestpath
CMD: display bestpath
Inset (b)
CMD: slice elevation for contours
CMD: display contours
CMD: display covertype
CMD: renumber covertype for trees
assign 0 to 1 thru 2
assign
CMD: display trees
CMD: display ranch
CMD: radiate ranch to 35 over elevation
thru trees for viewshed
CMD: display viewshed
Inset (a)
CMD: display contours (see Fig. 6)
CMD: orient elevation for aspectmap
CMD: display aspectmap
Inset (b)
CMD: display covertype
CMD: scan covertype diversity
for variety
LANDUSE LANDUSE LANDUSE ELEVATION ELEVATION
| | | | |
| | VIEWERS --| |
| | | |
| | VIEWSHED SLOPE
| | |............|
| | |
| THATPLACE COST
| |...........................|
| |
THISPLACE COSTZONES
|......................|
|
HIGHWAY