Beyond Mapping III

Introduction: GIS Software's Changing Roles

 

Cover_small

Map Analysis book with companion CD-ROM for hands-on exercises

and further reading

 

GIS Software's Changing Roles  discusses the evolution of GIS software and identifies important trends 

Determining Exactly Where Is What  discusses the levels of precision and accuracy

Finding Common Ground in Paper and Digital Worlds  describes the similarities and differences in information and organization between traditional paper and digital maps

Resolving Map Detail  discusses the factors that determine the “informational scale” digital maps

Referencing the Future  describes current and alternative approaches for referencing geographic and abstract space

Is it Soup Yet?  describes the evolution in definitions and terminology

What’s in a Name  suggests and defines the new term Geotechnology

SpatialSTEM Has Deep Mathematical Roots  provides a conceptual framework for a map-ematical treatment of mapped data  

Map-ematically Messing with Mapped Data  discusses the nature of grid-based mapped data and Spatial Analysis operations

Paint by Numbers Outside the Traditional Statistics Box  discusses the nature of Spatial Statistics operations  

 

<Click here> right-click to download a printer-friendly version of this topic (.pdf).

 

(Back to the Table of Contents)
______________________________


GIS Software's Changing Roles    
(GeoWorld, September 1998, pg. 28-30)  

(return to top of Topic)

 

Although GIS is just three decades old, the approach of its software has evolved as much as its capabilities and practical expressions.  In the 70’s software development primarily occurred on campuses and its products relegated to library shelves of theses.  These formative years provided the basic organization (both data and processing structures) we find in the modern GIS.  Raging debate centered on “vector vs. raster” formats and efficient algorithms for processing— techy-stuff with minimal resonance outside of the small (but growing) group of innovators.

For a myriad of reasons, this early effort focused on GIS technology itself rather than its applications.  First, and foremost, is the necessity of building a viable tool before it can be taken on the road to practical solutions.  As with most revolutionary technologies, the “chicken and the egg” parable doesn’t apply—the tool must come before the application.

This point was struck home during a recent visit to Disneyland.  The newest ride subjects you to a seemingly endless harangue about the future of travel while you wait in line for over an hour.  The curious part is that the departed Walt Disney himself is outlining the future through video clips from the 1950s.  The dream of futuristic travel (application) hasn’t changed much and the 1990s practical reality (tool), as embodied in the herky-jerky ride, is a long way from fulfilling the vision.

What impedes the realization of a technological dream is rarely a lack of vision, but the nuts and bolts needed in its construction.  In the case of GIS, the hardware and software environments of the 1970s constrained its use outside of academia.  Working with 256K memory and less than a megabyte of disk storage made a GIS engine perform at the level of an old skateboard.  However, the environments were sufficient to develop “working prototypes” and test their theoretical foundations. The innovators of this era were able to explore the conceptual terrain of representing “maps as numbers,” but their software products were woefully impractical.

With the 1980s came the renaissance of modern computers and with it the hardware and software environments needed by GIS.  The research-oriented software gave way to operational systems.  Admittedly, the price tags were high and high-end, specialized equipment often required, but the suite of basic features of a modern GIS became available.  Software development switched from specialized programs to extensive “toolboxes” and subsequently spawned a new breed of software specialists.

Working within a GIS macro language, such as ARCINFO’s Arc Macro Language (AML), customized applications could be addressed.  Emphasis moved from programming the “tool” within generis computer languages (e.g., FORTRAN and Pascal), to programming the “application” within a comprehensive GIS language.  Expertise broadened from geography and computers to an understanding of the context, factors and relationships of spatial problems.  Programming skills were extended to spatial reasoning skills—the ability to postulate problems, perceive patterns and interpret spatial relationships.

From an application developer’s perspective the floodgates had opened.  From an end user’s perspective, however, a key element still was missing—the gigabytes of data demanded by practical applications.  Once again GIS applications were frustrated.  This time it wasn’t the programming environment as much as it was the lagging investment in the conversion from paper maps to their digital form.

But another less obvious impediment hindered progress.  As the comic strip character Pogo might say, “…we have found the enemy and it’s us.”  By their very nature, the large GIS shops established to collect, nurture, and process spatial data intimidated their potential customers.  The required professional sacrifice at the GIS altar “down the hall and to the right” kept the herds of dormant users away.  GIS was more often seen within an organization as an adversary competing for corporate support (a.k.a., a money pit) than as a new and powerful capability one could use to improve workflow and address complex issues in entirely new ways.

The 1990s saw both the data logjam burst and the GIS mystique erode.  As Windows-based mapping packages appeared on individuals’ desks, awareness of the importance of spatial data and its potential applications flourished.  Direct electronic access enabled users to visualize their data without a GIS expert as a co-pilot.  For many the thrill of “visualizing mapped data” rivaled that of their first weekend with the car after the learner’s permit.

So where are we now?  Has the role of GIS developers been extinguished, or merely evolved once again?  Like a Power Rangers transformer, software development has taken two forms that blend the 1970s and 80s roles.  These states are the direct result of changes in software programming approaches in general, and “object-oriented” programming in particular.

MapInfo’s MapX and ESRI’s MapObjects are tangible GIS examples of this new era.  These packages are functional libraries that contain individual map processing operations.  In many ways they are similar to their GIS toolbox predecessors, except they conform to general programming standards of interoperability, thereby enabling them to be linked easily to the wealth of non-GIS programs.

Like using a Lego set, application developers can apply the “building blocks” to construct specific solutions, such as a real estate application that integrates a multiple listing geo-query with a pinch of spatial analysis, a dab of spreadsheet simulation, a splash of chart plotting and a sprinkle of report generation.  In this instance, GIS functionality simply becomes one of the ingredients of a solution, not the entire recipe.

In its early stages, GIS required “bootstrap” programming of each operation and was the domain of the computer specialist.  The arrival of the GIS toolbox and macro languages allowed an application specialist to develop software that tracked the spatial context of a problem.  Today we have computer specialists generating functional libraries and application specialists assembling the bits of software from a variety of sources to tailor comprehensive solutions.

The distinction between computer and application specialist isn’t so much their roles, as it is characteristics of the combined product.  From a user’s perspective the entire character of a GIS dramatically changes.  The look-and-feel evolves from a generic “map-centric view “to an “application-centric” one with a few tailored buttons that walk users through analysis steps that are germane to an application.  Instead of presenting users with a generalized set of map processing operations as a maze of buttons, toggles and pull-down menus, only the relevant ones are integrated into the software solution.  Seamless links to non-spatial programming “objects,” such as pre-processing and post-processing functions, are automatically made.

As the future of GIS unfolds, it will be viewed less as a distinct activity and more as a key element in a thought process.  No longer will users “break shrink-wrap” on stand-alone GIS systems.  They simply will use GIS capabilities within an application and likely unaware of the underlying functional libraries.  GIS technology will finally come into its own by becoming simply part of the fabric of software solutions.

 


Determining Exactly Where Is What

(GeoWorld, February 2008, pg. 14-15)  

        (return to top of Topic)

 

The Wikipedia defines Accuracy as “the degree of veracity” (exactness) while Precision as “the degree of reproducibility” (repeatable).   It uses an archery target as an analogy to explain the difference between the two terms where measurements are compared to arrows shot at the target (left side of figure 1).  Accuracy describes the closeness of arrows to the bull’s-eye at the target center (actual/correct).  Arrows that strike closer to the bullseye are considered more accurate.

 

Precision, on the other hand, relates to the size of the cluster of several arrows.  When the arrows are grouped tightly together, the cluster is considered precise since they all strike close to the same spot, if not necessarily near the bull’s-eye.  The measurements can be precise, though not necessarily accurate.

 

However, it is not possible to reliably achieve accuracy in individual measurements without precision.  If the arrows are not grouped close to one another, they cannot all be close to the bull’s-eye.  While their average position might be an accurate estimation of the bull’s-eye, the individual arrows are inaccurate.

 

TopicEpilog_3a

 

Figure 1. Accuracy refers to “exactness” and Precision refers to “repeatability” of data. 

 

So what does this academic diatribe have to do with GIS, as all maps are accurate and precise, right?  …chiseled in stone with a burning bush in the background, right?  While that might be mapping’s legacy belief, the digital map provides room for different perspectives depending on map type and application. 

 

In GPS technology the target analogy is straight forward (right side of figure 1) and the scattering of GPS measurements over time forms patterns akin to target practice.  The handheld GPS unit shows a dispersion of points within three meters of the bull’s-eye.  A precision GPS unit using a base station for differential correction shows a much tighter, sub-meter cluster at the bull’s-eye (actual geographic location).

 

Whereas GPS readings tell us “where is where” (purely positional), accuracy and precision take on a somewhat different meanings in a GIS involving two informational dimensions—“where is what.”  Precision is concerned with “Where” (position) and accuracy is concerned with “What” (classification). 

 

Figure 2 illustrates the two-fold consideration of Precise Placement of coordinate delineation and Accurate Assessment of attribute descriptor for three photo interpreters.  The upper-right portion superimposes three parcel delineations with Interpreter B outlining considerably more area than Interpreters A and C—considerable variation in precision.  The lower portion of the figure indicates differences in classification with Interpreter B assigning Ponderosa pine as the vegetation type—considerable variation in accuracy to the true Cottonwood vegetation type correctly classified by Interpreters A and C. 

   

Many GIS map layers are precise/accurate, such as surveyed ownership parcels, pipelines and benchmarks.  However, many more layers are less precise/accurate, such as interpreted vegetation parcels, fault lines and bird sightings.  These differences in map sets, as well as mindsets, often divide the GIS community— those involved with precise/accurate maps and those involved with somewhat fuzzier mapped data.

 

TopicEpilog_3b

 

Figure 2. In mapped data, precision refers to placement whereas accuracy refers to classification.

 

In addition, our paper map legacy of visualizing maps frequently degrades precision/accuracy in detailed mapped data.  For example, a detailed map of slope values containing decimal point differences in terrain inclination can be easily calculated from an elevation surface.  But the detailed continuous spatial data is often aggregated into just a few discrete categories so humans can easily conceptualize and “see” the information—such as polygonal areas of gentle, moderate and steep terrain.  Another example is the reduction of the high precision/accuracy inherent in a continuous “proximity to roads” map to that of a discrete “road buffer” map that simply identifies all locations within a specified reach.

 

Further thought suggests an additional consideration of GIS “exactness”—Model Accuracy reflecting how robust and complete a model is.  For example, figure 3 summarizes the logic and results for a electric power line routing model (see Author’s Notes).  The simplified model seeks to identify the optimal route that avoids areas of high housing density, far from roads, within/near sensitive areas and high visual exposure to houses.  The top portion of the figure shows the criteria maps that are calibrated on a scale of 1(most preferred) to 9 (least preferred) in terms of suitability for routing a power line.

 

As you might suspect, different groups have differing perspectives on the interpretation and relative importance of the routing criteria.  For example, homeowners might be most concerned about Housing Density and Visual Exposure; environmentalists most concerned about Road Proximity and Sensitive Areas; and engineers most concerned about Housing Density and Road Proximity.  Executing the model for these differences in perspective (relative importance of the criteria) resulted in three different preferred routes.

 

The lower-left portion of figure 3 shows the spread of the three individual solutions.  One isn’t more precise/accurate than another, just an expression of a particular perspective of the solution. The lower-right side of the figure suggests yet another way to represent the solution using the simple average of the three preference surfaces to identify an overall route and its optimal corridor—sort of analogous to averaging a series of GPS readings to approximate the bull’s-eye.  It might be argued that the overall solution is more precise/accurate as it incorporates more perspectives (average of multiple arrows in a cluster). 

 

TopicEpilog_3c

 

Figure 3. Maps derived by GIS modeling also involve accuracy of the interpretation, logic, understanding and judgment ingrained in the spatial reasoning.

 

The take home from this discussion is that precision and accuracy is not the same thing and that the terms can take on different meanings for different types of maps and application settings.  There are at least three different levels of precision/accuracy—1) “Where is Where” considering just precise placement, 2) “Where is What” considering placement and classification, and 3) “Where is What, if you assume…” considering placement, classification and interpretation/logic/understanding/judgment ingrained in spatial reasoning.  Before GIS can go beyond mapping we need to fully recognize that there are appropriate degrees of precision and accuracy—paraphrasing Voltaire, perfect can be the enemy of good, or at least good enough to be useful.

_____________________________

Author’s Notes:  Related discussion of routing model considerations and procedures is in Topic 8, Spatial Model Example in the book Map Analysis (Berry, 2007; GeoTec Media, www.geoplace.com/books/MapAnalysis) and Topic 19, Routing and Optimal Paths in the online Beyond Mapping III compilation (www.innovativegis.com/basis/MapAnalysis).

 

 

Finding Common Ground in Paper and Digital Worlds

(GeoWorld, February 2007, pg. 28-30)  

        (return to top of Topic)

 

In the real world, landscapes are composed rocks, dirt, water, green stuff and furry/feathered friends.  In a “paper world” these things are represented by words, tables and graphics.  The traditional paper map is a graphical representation with inked lines, shadings and symbols used to locate landscape features using three basic building blocks— Points, Lines and Areas.  For example, a typical water map might identify a well as a dot, a stream as a squiggle and a lake as a blue blob (figure 1).  Each feature is considered a well-defined “discrete spatial object” with unique spatial character, positioning and dimension. 

 

Topic28_1a

 

Figure 1.  Traditional and Extended Map Features.

 

In geometry a point is considered dimensionless, however, the corresponding concept in cartography is a dot of ink having a physical dimension of a few inches to several miles depending on the scale of a paper map.  Similarly, a line in mathematical theory has only length but is manually mapped as a thin serpentining polygon of the pen’s width.  An area feature has both length and width in two-dimensional space.  The interplay of mapping precision and accuracy in a digital world involves a discussion of scale and resolution reserved for the next section.  For now, let’s consider the revolutionary changes in map form and content brought on by the digital map as outlined in the rest of figure 1. 

 

For thousands of years, manual cartography has been limited to characterizing all geographic phenomena as discrete 2-dimensional spatial objects.  However many map variables, such as elevation, change continuously and representation as contour lines suggests a nested series of flat layers like a wedding cake instead of the actual continuously undulating terrain.  The introduction of a grid-based data structure provides for a new basic building block—a map Surface of continuously changing values throughout geographic space. 

 

Another extension to the building blocks is Volumes that track length, width and depth in characterizing discrete or continuous variables in 3-dimensional space.  For example, the L,W,D coordinates identify a specific location in a lake and a fourth value (attribute) can identify its temperature, turbidity, salinity or other condition.

 

A hyper-Volume (or hyper-point, -line, -area or -surface) introduces time as an additional abstract coordinate.  For example, the weekly water volume of a reservoir might be tracked by L,W,D,T coordinates identifying a location in 3-dimensional space, as well as time combined with a fifth value indicating whether water is present or not.  This conceptual extension is a bit tricky and provides discussion fodder about mixed referencing units (e.g., meters and minutes) for a later section.  However, the result is a discrete volumetric map feature that shrinks and expands throughout a year—a dynamic spatial entity that at first appears to violate orthodox mapping commandments.    

 

Another mind-bend brought on by the digital map is the concept of fuzzy-features.  This idea tracks the certainty of a feature or condition at each map location.  For example, the boundary line of a soil polygon is a subjective interpretation, while soil parcel’s actual edge could be a considerable distance away—“the boundary is likely here (high probability) but could be over there (low probability).”  Another fuzzy example is a classified satellite image where statistical probabilities are used to establish which cover type is most likely. 

 

Taken to the hilt, one can conceptualize a data structure that carries L,W,D,T and A,P (attribute and probability) descriptors that identify a location in space and time, as well as characterize its most likely condition, next most likely, and so on—sort of a sandwich of probable conditions.  Such a representation challenges the infallible paradigm of mapping but opens a whole new world of error propagation modeling. 

 

Whereas volumes, hyper-volumes and fuzzy-features define the current realm of GIS researchers, an understanding of contemporary approaches for characterizing points, lines, and areas is necessary for all GIS users.  Figure 2 outlines the two fundamental approaches—vector and raster (see Author’s Notes). 

 

A Point defined by X,Y coordinates in vector, and a Cell defined by Col,Row indices in raster, form the basic data structure units—the “smallest addressable unit of space” in a map.  Lines are formed by mathematically connecting points (vector) or identifying all of the conjoined cells containing a line (raster).  Areas are defined by a set of points that define a closed line encompassing a feature (vector) or by all of the contiguous cells containing a feature (raster).

 

Topic28_1b

 

Figure 2.  Basic Vector and Raster Data Structure Considerations.

 

While spatial precision is a major operational difference between vector and raster systems, how they characterize geographic space is important in understanding limitations and capabilities.  Vector precisely identifies critical points along a line, but the intervening connections are implied.  Raster, on the other hand, identifies all of the cells containing a line without any implied gaps.  Similarly, vector precisely stores an area’s boundary but implies its interior (must calculate); raster stores the interior but implies the boundary (must calculate).

 

The differences in “what is defined” and “what is implied” determine just about everything in GIS technology, except maybe the color pallet for display—data structure, storage requirements, algorithms, coding and ultimately appropriate use.  Vector systems precisely and efficiently store traditional discrete map objects, such as underground cables and property boundaries (mapping and inventory).  Raster systems, on the other hand, predefine continuous geographic space for rapid and enhanced processing of map layers (analysis and modeling).

 

So how do you think vector and raster systems store surfaces, volumes, hyper-volumes and fuzzy-features? …very poorly, or not at all for vector systems.  However raster systems pre-define all of a project area (no gaps) by carrying a thematic value for each cell in a 2-dimensional storage matrix to form a continuous map surface.  For volumes, a third geographic referencing index is added to extend the 2D cells to 3D cubes in geographic space defined by their X,Y,Z position in the storage matrix see Author’s Notes). 

 

A similar expansion is used for hyper-volumes with four indices (X,Y,Z,T) identifying the “position,” except in this instance an abstract space is implied due to the differences in geographic and time units.  Information about fuzzy-features can be coded into a compound attribute value describing any map feature, where the first few digits identify the character/condition at a location with the trailing two digits identifying the certainty of classification.

 

The bottom line is that tomorrow’s maps aren’t simply colorful electronic versions of your grandfather’s maps.  The digital map is an entirely different beast supporting radically new mapping approaches, perspectives, opportunities and responsibilities.        

_____________________________

Author’s Notes:  Topic 6, “Alternative Data Structures,” in Spatial Reasoning for Effective GIS (Berry 1995, Wiley) contrasts vector and raster data structures and describes related alternative structures including TIN, Quadtree, Rasterized Lines and Vectorized Cells. 

 

 

Resolving Map Detail

(GeoWorld, March 2007, pg. 28-30)  

        (return to top of Topic)

 

One of the most fundamental concepts in the paper map world is Geographic Scale—the relationship between a distance on a map and its corresponding distance on the earth.  In equation form, scaleratio= map distance / ground distance but is often expressed as a representative fraction (RF), such as scaleRF= 1:63,360 meaning 1 inch on the map represents 63,360 inches (or 1 mile) on the earth’s surface.

 

However in the digital map world, this traditional concept of scale does not exist.  While at first this might seem like cartographic heresy, note that the “map distance” component of the relationship is assumed to be fixed as ink marks on paper.  In a GIS, however, the map features are stored as organized sets of numbers representing their spatial position (coordinates for “where”) and thematic attribute (map values for “what”).  One can zoom in and out on the data thereby creating a continuous gradient of geographic scales in the resulting display or hardcopy plot. 

 

Hence geographic scale is a function of the display, not an inherent property of the digital mapped data set.  What is important is the implied concept of informational scale, or Resolution—the ability to discern detail.  Traditionally it is implicit that as geographic scale decreases, resolution also diminishes since drafted feature boundaries must be smoothed, simplified or not shown at all due to the width of the inked lines. 

 

Topic28_2a

 

Figure 1. Spatial Resolution describes the level of positional detail used to track a geographic pattern or distribution.

 

However in a GIS, the concept of resolution is explicit.  In fact there are five types of resolution that need to be considered—Spatial, Map, Thematic, Temporal and Model.  Spatial Resolution is the most basic and identifies the “smallest addressable unit” of geographic space (figure 1).  For point features, the X,Y coordinates (vector) and cell size (raster) determine the smallest addressable unit. 

 

For line features in vector, however, the smallest addressable unit is the line segment with larger segments capturing less detail as the implied straight line misses the subtle wiggles and waggles of a pattern.  Similarly, large grid cells capture less linear detail than smaller cells.

 

For polygon features in vector, an entire polygon represents the smallest addressable unit as the boundary needs to be completed before the implied interior condition can be identified.  In raster, the smallest addressable unit is defined by the cell size as the condition is carried for each of the cells comprising the interior and edge of a polygon feature.

 

The concept of spatial resolution easily extends to the level of spatial aggregation or Map Resolution that identifies the “smallest physical grouping” of a map theme (figure 2).  For example, a high resolution forest map might identify individual trees (very small polygons delineating canopy extent), whereas more generally, numerous trees are used to identify a forest parcel of several acres that ignores the scattered tree occurrences.  The size of the minimum polygon is determined by the interpretation process with smaller groupings capturing more detail of the pattern and distribution.

 

Topic28_2b

Figure 2. Map Resolution describes the level of physical aggregation used to depict a geographic pattern or distribution.

 

Thematic Resolution identifies the “smallest classification grouping” of a map theme.  For example, a simple forest/non-forest map might provide a sufficient description of vegetation for some uses and this coarse classification has appeared for years as green on USGS topographic sheets.  However, resource managers require a higher thematic resolution of vegetation cover and expand the classification scheme to include species, age, stocking level and other characteristics.  The result is a finer classification categories of a generalized forest area into smaller more detailed parcels (figure 3).

 

A forth consideration involves Temporal Resolution that identifies the frequency, or time-step of map update.  Some data types, such as geological and landform maps, change very slowly and do not need frequent revision.  A city planner, on the other hand, needs land use maps that are updated every couple of years and include future development sites.  A retail marketer needs even higher temporal resolution and will likely update sales and projection figures on a monthly, weekly or even daily basis.

 

Model Resolution is the least defined and involves factors affecting the level of detail used in creating a derived map, such as an optimal corridor for an electric transmission line or areas of suitable wildlife habitat.  Model resolution considers detail ingrained in 1) the interpretation/analysis assumptions (logic) and 2) the algorithms/procedures (processing) used in implementing a spatial model.  For example, a proposed transmission line could be routed considering just terrain steepness for a low model resolution, or extended to include other engineering factors (soils, road proximity, etc.), environmental concerns (wetlands, wildlife habitats, etc.) and social considerations (visual exposure, housing density, etc.) for much higher model resolution.

 

 

Topic28_2c

Figure 3. Thematic Resolution describes the level of classification aggregation used to depict a geographic pattern or distribution.

 

So why should we care about digital map resolution?  Because accounting for informational scale is just as important as adjusting for a common geographic scale and projection when interacting with a stack of maps.  Our paper map heritage focused on descriptive mapping (inventory of physical phenomena) whereas an increasing part of the GIS revolution focuses on prescriptive mapping (spatial relationships of physical and cognitive interactions).  This “thinking with maps” requires a thorough understanding of the spatial, map, thematic, temporal and model resolutions of the maps involved or you will surely be burned.   

 

 

Referencing the Future

(GeoWorld, April 2007, pg. 28-30)  

        (return to top of Topic)

 

Geo-referencing is the cornerstone of GIS.  In the mid-1600s the French mathematician, René Descartes established the Cartesian coordinate system that is still in use today.  The system determines the location of each point in a plane as defined by two numbers—a  x-coordinate and a y-coordinate. 

 

A third z-coordinate is used to extend the system to 3-dimensional geographic space (see Author’s Notes).  In mapping, these coordinates reference a refined ellipsoid (geodetic datum) that can be conceptualized as a curved surface approximating the mean ocean surface of the earth.

 

The location and shape of map features can be established by X and Y distances measured along flattened portions of the reference surface (figure 1).  The familiar Universal Transverse Mercator (UTM) coordinates represent E-W and N-S movements in meters along the plane.  The rub is that UTM zones are need to break the curved earth surface into a series of small flat, projected subsections that are difficult to edge-match.

 

Topic28_3a

 

Figure 1. Geographic referencing uses three coordinates to locate map features in real world space.

 

A variant of the traditional referencing system uses spherical coordinates that are based on solid angles measured from the center of the earth.  This natural form for describing positions on a sphere is defined by three coordinates—an azimuthal angle (θ) in the X,Y plane from the x-axis, the polar angle (φ) from the z-axis,  and the radial distance (r) from the earth’s center (origin).  The advantage of a spherical referencing system is that it is seamless throughout the globe and doesn’t require projecting to a localized flat plane. 

 

Digital map storage is rapidly moving toward spherical referencing that uses latitude and longitude in decimal degrees for internal storage and on-the-fly conversion to any planar projection.  This radical change from our paper map heritage is fueled by ubiquitous use of GPS and a desire for global databases that easily walk across political and administrative boundaries.

 

Since the digital map is a radical departure from the paper map, other alternative referencing schemes are possible.  For example, hexagons can replace the Cartesian grid squares we have used for hundreds of years (top portion of figure 2).  The hexagon naturally nests to form a continuous network like a beehive’s honeycomb.  An important property of a hexagon grid is that it better represents curved surfaces than a square grid— a soccer ball stitched from squares wouldn’t roll the same [Note: actually a soccer ball is a composite of hexagons (white) and pentagons (black)].

 

However the most important property is that a hexagon has six sides instead of four.  The added directions provide a foothold for more precise measurement of continuous movement— one can turn right- and left-oblique as well as just right and left.  Traditional routing models using Least Cost Path would benefit greatly.

 

Expanding to 3-dimensional geographic space provides for polyhedrons to replace cubes.  For example, a dodecahedron is a nesting twelve-sided object that can be used instead of the six-sided cube.  Weather and ground water flow modeling could be greatly enhanced by the increased options for transfer from a location to its larger set of adjoining locations.  The computations for cross-products of vectors, such as warp-speed cruise missiles, could be greatly assisted as they are affected by different atmospheric conditions and evasive trajectories.

 

Topic28_3b

Figure 2.  Alternative referencing systems and abstract space characterization are possible through the digital nature of modern maps.

 

Another extension involves the use of abstract space (bottom portion of figure 2).  For example, the Z-coordinate can be replaced with an attribute value to generate a map surface, such as customer density.  In this instance, the abstract referencing is a mixture of spatial and attribute “coordinates” and doesn’t imply 3-dimensional, real word geographic occurrences.  Instead, it relates geography and conditions in an extremely useful way for conceptualizing patterns.  Normalization along the abstract coordinate axis is an important consideration for both visualization and analysis.

 

This brings us to space-time referencing.  During a recent panel discussion I was challenged for suggesting such a combination is possible within a GIS.  The idea has been debated for years by philosophers and physicists but H.G. Wells’ succinct description is one of the best—

 

'Clearly,' the Time Traveller proceeded, 'any real body must have extension in four directions: it must have Length, Breadth, Thickness, and - Duration. But through a natural infirmity of the flesh, which I will explain to you in a moment, we incline to overlook this fact. There are really four dimensions, three which we call the three planes of Space, and a fourth, Time. There is, however, a tendency to draw an unreal distinction between the former three dimensions and the latter, because it happens that our consciousness moves intermittently in one direction along the latter from the beginning to the end of our lives.' (Chapter 1, Time Machine).

 

The upshot seems to be that a fourth dimension exists (see Author’s Notes), it is just you can’t go there in person.  But a GIS can easily take you there—conceptually that is.  For example, an additional abstract “coordinate” representing time can be added to form a 3-dimensional data matrix.  The GIS picks off the customer density data for the first “page” and displays it as in the figure.  Then it uses the data on the on the second page (one time step forward) and displays it.  This is repeated to cycle through time and you see an animation where the peaks and valleys of the density surface move with time.

 

So animation enables you to move around a city (X,Y) viewing the space-time relationship of customer density (A).  In a similar manner you could evaluate a forest “green-up” model to predict re-growth at a series of time steps after harvesting to look into future landscape conditions.  Or you can watch the progression over time of ground water pollutant flow in 3D space (4D data matrix) using a semi-transparent dodecahedron solid grid just for fun and increased modeling accuracy.  In fact, it can be argued that GIS is inherently n-dimensional when you consider a map stack of multiple attributes and time is simply another abstract dimension.

 

My suspicions are that revolutions in referencing will be a big part of GIS’s frontier in the 2010s.  See you there?

_____________________________

Author’s Notes: an excellent online reference for the basic geometry concepts underlying traditional and future geo-referencing techniques is the Wolfram MathWorld pages, such as the posting describing the dodecahedron at http://mathworld.wolfram.com/Dodecahedron.html; a BBC posting at http://www.bbc.co.uk/science/space/exploration/timetravel/index.shtml contains an interesting discussion of the space/time reality.  See the online book Beyond Mapping III, Topic 27, “GIS Evolution and Future Trends,” for a discussion of 3-dimensional GIS posted at www.innovativegis.com/basis/MapAnalysis/.

 

 

 

Is it Soup Yet?    
(GeoWorld, February 2009)  

(return to top of Topic)

 

In the forty-odd years of computer-tinkering with maps our perspectives and terminologies have radically changed.  My first encounter was in the late 1960s as an undergraduate research assistant at the University of California, Berkeley.  The entry point was through photogrammetric interpretation in the pursuit of a high resolution contour map for the school’s forest.  In those days one stared at pair of stereo-matched aerial photos and marched a dot at a constant elevation around the three-dimensional surface that appeared.  The result was an inked contour line drawn by a drafting arm that was mechanically connected to the stereo plotter— raise the dot and re-walk to delineate the next higher contour line.

 

The research effort took this process to a new level by augmenting the mechanical arm with potentiometers that converted the movements of the arm into X,Y coordinates that, in turn, were recorded by direct entry into a keypunch machine.  After several months of tinkering with the Rube Goldberg device several boxes of punch cards were generated containing the digital representation of the contour lines that depicted the undulating shape of the terrain surface. 

 

The card boxes then were transferred to a guru who ran the only large-bed plotter on campus and after a couple of more months of tinkering the inked lines emerged.  While far from operational, the research crossed a technological threshold by replacing the analog mechanics of traditional drafting with the digital encoding required to drive the cold steel arm of a plotter—maps were catapulted from drawings to organized sets of numbers.

 

In the 1970’s Computer Mapping emerged through the efforts of several loosely allied fields involved in mapping—geography for the underlying theory, computer science for the software, engineering for the hardware and several applied fields for the practical applications.  As depicted in figure 1, some of the more important perspectives and definitions of the emerging technology at that time were:

 

-        Surveying is the technique and science of accurately determining the terrestrial or three-dimensional space position of points and the distances and angles between them where these points are usually, but not exclusively, associated with positions on the surface of the Earth, and are often used to establish land maps and boundaries for ownership or governmental purposes.  (Wikipedia definition)

-        Photogrammetry is the first remote sensing technology ever developed, in which geometric properties about objects are determined from photographic images.  (Wikipedia definition)

-        Remote Sensing is the small or large-scale acquisition of information of an object or phenomenon, by the use of either recording or real-time sensing device(s) that is not in physical or intimate contact with the object (such as by way of aircraft, spacecraft, satellite, etc.).  (Wikipedia definition)

-        Computer-aided Drafting and Computer-assisted Mapping (CAD/CAM) is the mapping expression of Computer-aided Design that uses computer technology to aid in the design and particularly the drafting (technical drawing and engineering drawing) of a part or product.  (Wikipedia definition)

-        Automated Cartography is the process of producing maps with the aid of computer driven devices such as plotters and graphical displays.  (Webopedia definition)

-        Image processing is any form of signal processing for which the input is an image, such as photographs or frames of video with the output of image processing being either an image or a set of characteristics or parameters related to the image.  (Wikipedia definition)

 

The common thread at the time was an inspiration to automate the map drafting process by exploiting the new digital map form.  The focus was on the graphical rendering of the precise placement of map features—an automated means of generating traditional map products.  For example, the boxes of cards containing the contour lines of research project were mothballed after the plotter generated the printer’s separate used for printing multiple copies of the map. 

 

Intro_6a

 

Figure 1. The terminology and paradigm trajectory of GIS’s evolution.

 

Spatial Database Management expanded this view in the 1980s by combining the digital map coordinates (Where) with database attributes describing the map features (What).  The focus shifted to the digital nature of mapped data and the new organizational capabilities it provided.  Some of the perspectives and terms associated with the era were:

 

-        Automated Mapping and Facilities Management (AM-FM) seeks to automate the mapping process and to manage facilities represented by items on the map.  (GITA definition)

-        Geographic Information System (GIS) is an information system for capturing, storing, analyzing, managing and presenting data which are spatially referenced (linked to location).  (Wikipedia definition)

-        Geographic Information Science (GISc or GISci) is the academic theory behind the development, use, and application of geographic information systems (GIS).  (Wikipedia definition)

-        Desktop Mapping involves using a desktop computer to perform digital mapping functions.  (eNCYCLOPEDIA definition)

-        Enterprise GIS is a platform for delivering organization-wide geospatial capabilities providing for the free flow of information.  (ESRI definition)

 

Geo-query became the rage and organizations scurried to integrate their paper maps and management records for cost savings and improved information access.  The overriding focus was on efficient recordkeeping, processing and information retrieval.  The approach linked discrete Point, Line and Polygon features to database records describing the spatial entities.

 

Map Analysis and Modeling in the 1990s changed the traditional mapping paradigm by introducing a new fundamental map feature—the continuous Surface.  Some of the more important terms and perspectives of that era were:

 

-        Cartographic Modeling is a process that identifies a set of interacting, ordered map operations that act on raw data, as well as derived and intermediate data, to simulate a spatial decision making process.  (Tomlin definition)

-        Map Algebra (and Map-ematics) is a simple and an elegant set-based algebra for manipulating geographic data where the input and output for each operator is a map and the operators can be combined into a procedure to perform complex tasks.  (Wikipedia definition)

-        Geomatics incorporates the older field of surveying along with many other aspects of spatial data management which integrates acquisition, modeling, analysis, and management of spatially referenced data.  (Wikipedia definition)

 

While much of the map-ematical theory and procedures were in place much earlier, this era saw a broadening of interest in map analysis and modeling capabilities.  The comfortable concepts and successful extensions of traditional mapping through Spatial Database Management systems lead many organizations to venture into the more unfamiliar realms of spatial analysis and statistics.  The emerging applications directly infused spatial considerations into the decision-making process by expanding “Where is What?” recordkeeping to “Why, So What and What If?” spatial reasoning—thinking with maps to solve complex problems.

 

Multimedia Mapping in the 2000s turned the technology totally on its head by bringing it to the masses.  Spurred by the proliferation of personal computers and Internet access, spatial information and some “killer apps” have redefined what maps are, how one interacts with them, as well as their applications.  Important terms and perspectives of the times include:

 

-        Global Positioning System (GPS) is the only fully functional Global Navigation Satellite System (GNSS) that enable GPS receivers to determine their current location, the time, and their velocity.  (Wikipedia definition)

-        Mobile GIS is the use of geographic data in the field on mobile devices that integrates three essential components— Global Positioning System (GPS), rugged handheld computers, and GIS software.  (Trimble definition)

-        Web Mapping is the process of designing, implementing, generating and delivering maps on the World Wide Web.  (Wikipedia definition)

-        Virtual Reality (VR) is a technology which allows a user to interact with a computer-simulated environment, be it a real or imagined one.  (Wikipedia definition)

-        Geospatial Technology refers to technology used for visualization, measurement, and analysis of features or phenomena that occur on the earth that includes three different technologies that are all related to mapping features on the surface of the earth— GPS (global positioning systems), GIS (geographical information systems), and RS (remote sensing).  (Wikipedia definition)

 

The technology has assumed a commonplace status in society as people access real-time driving directions, routinely check home values in their neighborhood and virtually “fly” to anyplace place on the earth to view the surroundings or checkout a restaurant’s menu.  While spatial information isn’t the driver of this global electronic revolution, the technology both benefits from and contributes to its richness.  What was just a gleam in a handful of researchers’ eyes thirty years ago has evolved into a pervasive layer in the fabric of society, not to mention a major industry.

 

But what are the perspectives and terms defining the technology’s future?  That’s ample fodder for the next section.

_____________________________

 

Author’s Notes: a brief White Paper describing GIS’s evolution is posted online at www.innovativegis.com/basis/Papers/Other/Geotechnology/Geotechnology_history_future.htm .  An interesting and useful Glossary of GIS terms by Blinn, Queen and Maki of the University of Minnesota is posted at www.extension.umn.edu/distribution/naturalresources/components/DD6097ag.html.

 

 

What’s in a Name    
(GeoWorld, March 2009)  

(return to top of Topic)

 

The previous section traced the evolution of modern mapping by identifying some of the more important labels and terminology that have been used to describe and explain what is involved.  In just four decades, the field has progressed from an era of Computer Mapping to Spatial Database Management, then to Map Analysis and Modeling and finally to Multimedia Mapping. 

 

The perspective of the technology has expanded from simply automated cartography to an information science that links spatial and attribute data, then to an analytical framework for investigating spatial patterns/relationships and finally to the full integration of the spatial triad of Remote Sensing (RS), Geographic Information Systems (GIS) and the Global Positioning System (GPS) with the Internet and other applied technologies.

 

While the evolution is in large part driven by technological advances, it also reflects an expanding acceptance and understanding by user communities and the general public.  In fact, the field has matured to a point where the US Department of Labor has identified Geotechnology as “one of the three most important emerging and evolving fields, along with nanotechnology and biotechnology” (see Author’s Notes).  This is rare company indeed.

 

The Wikipedia defines Biotechnology as “any technological application that uses biological systems, living organisms, or derivations thereof, to make or modify products or processes for specific use,” and Nanotechnology as “a field whose theme is the control of matter on and atomic and molecular scale.”  By any measure these are sweeping definitions that encompass a multitude of sub-disciplines, conceptual approaches and paradigms.  Figure 1 suggests a similar sweeping conceptualization for Geotechnology. 

 

Intro_7a

 

Figure 1. Conceptual framework of Geotechnology.

 

The top portion of the figure relates Geotechnology to “spatial information” in a broad stroke similar to biotechnology’s use of “biological systems” and nanotechnology’s use of “control of matter.”  The middle portion identifies the three related technologies for mapping features on the surface of the earth— GPS, GIS and RS.  The bottom portion identifies the two dominant application arenas that emphasize descriptive Mapping (Where is What) and prescriptive Modeling (Why and So What).

 

What is most important to keep in mind is that geotechnology, like bio- and nanotechnology, is greater than the sum of its parts—GPS, GIS and RS.  While these individual mapping technologies provide the enabling capabilities, it is the application environments themselves that propel geotechnology to mega status.  For example, precision agriculture couples the spatial triad with robotics to completely change crop production.  Similarly, coupling “computer agents” with the spatial triad produces an interactive system that has radically altered marketing and advertising through spatially-specific queries and displayed results.  Or coupling immersive photography with the spatial triad to generate an entirely type of “street view” map that drastically changes 8,000 years of analog mapping.

 

To this point in our technology’s short four decade evolution it has been repeatedly defined from within.  The current “geospatial technology” moniker focuses on the interworking parts that resonates with GIS specialists (see figure 2).  However to the uninitiated, the term is as off-putting as it is confusing—geo (Latin for the earth), spatial (pertaining to space), technology (application of science).  Heck, it even sounds redundant and is almost as introvertedly-cute as the terms geomatics and map-ematics.

 

Intro_7b

 

Figure 2. Wikipedia Definition of Geospatial Technology.

 

On the other hand, the use of the emerging term “Geotechnology” for the first time provides an opportunity to craft a definition with a broader perspective that embraces the universality of its application environments and societal impacts along the lines of the bio- and nanotechnology definitions. 

 

As a draft attempt, let me suggest—

 

Geotechnology refers to any technological application that utilizes spatial location in visualizing, measuring, storing, retrieving, mapping and analyzing features or phenomena that occur on, below or above the earth.  It is recognized by the U.S. Department of Labor as one of the “three mega-technologies for the 21st Century,” along with Biotechnology and Nanotechnology.  There are three primary mapping technologies that enable geotechnology— GPS (Global Positioning System), GIS (Geographic Information Systems) and RS (Remote Sensing).        …etcetera, etcetera, etcetera… to quote a famous King of Siam.

 

As with any controversial endeavor, the devil is in the details (the etcetera).  One of the biggest problems with the term is that geology staked the flag several years ago with its definition of geotechnology as “the application of the methods of engineering and science to exploitation of natural resources” (yes, they use the politically incorrect term “exploitation”).  Also, there is an International Society for Environmental Geotechnology, as well as a several books with the term embedded in their titles.

 

On the bright side, the Wikipedia doesn’t have an entry for Geotechnology.  Nor is the shortened term “geo” exclusive to geology; in fact just the opposite, as geography is most frequently associated with the term (geo + graph + y literally means “to write the descriptive science dealing with the surface of the earth”).  Finally, there are other disciplines, application users and the general public that are desperate for an encompassing term and succinct definition of our field that doesn’t leave them tongue-tied, shaking their heads in dismay or otherwise dumbfounded. 

 

Such is the byzantine fodder of academics …any inspired souls out there willing to take on the challenge of evolving/expanding the definition of Geotechnology, as well as the perspective of our GPS/GIS/RS enabled mapping technology? 

_____________________________

 

Author’s Notes: see www.nature.com/nature/journal/v427/n6972/full/nj6972-376a.html for an article in Nature (427, 376-377; January 22, 2004) that identifies Geotechnology by the US Department of Labor as one of the three "mega technologies for the 21st century” (the other two are Nanotechnology and Biotechnology).

 

 

SpatialSTEM Has Deep Mathematical Roots

    
(GeoWorld, January 2012)  

(return to top of Topic)

 

Recently my interest has been captured by a new arena and expression for the contention that “maps are data”—spatialSTEM (or sSTEM for short)—as a means for redirecting education in general, and GIS education in particular.  I suspect you have heard of STEM (Science, Technology, Engineering and Mathematics) and the educational crisis that puts U.S. students well behind many other nations in these quantitatively-based disciplines. 

 

While Googling around the globe makes for great homework in cultural geography, it doesn’t advance quantitative proficiency, nor does it stimulate the spatial reasoning skills needed for problem solving.  Lots of folks from Freed Zakaria to Bill Gates to President Obama are looking for ways that we can recapture our leadership in the quantitative fields.  That’s the premise of spatialSTEM– that “maps are numbers first, pictures later” and we do mathematical things to mapped data for insight and better understanding of spatial patterns and relationships within decision-making contexts. 

 

This contention suggests that there is a map-ematics that can be employed to solve problems that go beyond mapping, geo-query, visualization and GPS navigation.  This column’s discussion about the quantitative nature of maps is the first part of a three-part series that sets the stage to fully develop this thesis— that grid-based Spatial Analysis Operations are extensions of traditional mathematics (Part 2 investigating map math, algebra, calculus, plane and solid geometry, etc.) and that grid-based Spatial Statistics Operations are extensions of traditional statistics (Part 3 looking at map descriptive statistics, normalization, comparison, classification, surface modeling, predictive statistics, etc.). 

 

Figure 1 outlines the important components of map analysis and modeling within a mathematical structure that has been in play since the 1980s (see author’s note).  Of the three disciplines forming Geotechnology (Remote Sensing, Geographic Information Systems and Global Positioning System), GIS is at the heart of converting mapped data into spatial information.  There are two primary approaches used in generating this information—Mapping/Geo-query and Map Analysis/Modeling.

 

Intro_1new_a

 

Figure 1. Conceptual overview of the SpatialSTEM framework.

 

The major differences between the two approaches lies in the structuring of mapped data and their intended use.  Mapping and geo-query utilizes a data structure akin to manual mapping in which discrete spatial objects (points, lines and polygons) form a collection of independent, irregular features to characterize geographic space.  For example, a Water map might contain categories of Spring (points), Stream (lines) and Lake (polygons) with the features scattered throughout a landscape. 

 

Map analysis and modeling procedures, on the other hand, operate on continuous map variables (termed map surfaces) composed of thousands upon thousands of map values stored in geo-registered matrices.  Within this context, a Water map no longer contains separate and distinct features but is a collection of adjoining grid cells with a map value indicating the characteristic at each location (e.g., Spring=1,  Stream= 2 and Lake= 3). 

 

Figure 2 illustrates two broad types of digital maps, formally termed Vector for storing discrete spatial objects and Raster for storing continuous map surfaces.  In vector format, spatial data is stored as two linked data tables.  A “spatial table” contains all of the X,Y coordinates defining a set of spatial objects that are grouped by object identification numbers.  For example, the location of the Forest polygon identified on the left side of the figure is stored as ID#32 followed by an ordered series of X,Y coordinate pairs delineating its border (connect-the-dots). 

 

Intro_1new_b

 

Figure 2. Basic data structure for Vector and Raster map types.

 

In a similar manner, the ID#s and X,Y coordinates defining the other cover type polygons are sequentially listed in the table.  The ID#s link the spatial table (Where) to a corresponding “attribute table” (What) containing information about each spatial object as a separate record.  For example, polygon ID#31 is characterized as a mature 60 year old Ponderosa Pine (PP) Forest stand.

 

The right side of figure 2 depicts raster storage of the same cover type information.  Each grid space is assigned a number corresponding to the dominant cover type present— the “cell position” in the matrix determines the location (Where) and the “cell value” determines the characteristic/condition (What).  It is important to note that the raster representation stores information about the interior of polygons and “pre-conditions geographic space” for analysis by applying a consistent grid configuration to each grid map.  Since each map’s underlying data structure is the same, the computer simply “hits disk” to get information and does not have to calculate whether irregular sets of points, lines or polygons on different maps intersect.  

 

Figure 3 depicts the fundamental concepts supporting raster data.  As a comparison between vector and raster data structures consider how the two approaches represent an Elevation surface.  In vector, contour lines are used to identify lines of constant elevation and contour interval polygons are used to identify specified ranges of elevation.  While contour lines are exacting, they fail to describe the intervening surface configuration. 

 

Intro_1new_c

 

Figure 3. Organizational considerations and terminology for grid-based mapped data.

 

Contour intervals describe the interiors but overly generalize the actual “ups and downs” of the terrain into broad ranges that form an unrealistic stair-step configuration (center-left portion of figure 3).  As depicted in the figure, rock climbers would need to summit each of the contour interval “200-foot cliffs” rising from presumed flat mesas.  Similarly, surface water flow presumably would cascade like waterfalls from each contour interval “lake” like a Spanish multi-tiered fountain. 

 

The upshot is that within a mathematical context, vector maps are ineffective representations of real-world gradients and actual movements and flows over these surfaces— while contour line/interval maps have formed colorful and comfortable visualizations for generations, the data structure is too limited for modern map analysis and modeling. 

 

The remainder of figure 3 depicts the basic Raster/Grid organizational structure.  Each grid map is termed a Map Layer and a set of geo-registered layers constitutes a Map Stack.  All of the map layers in a project conform to a common Analysis Frame with a fixed number of rows and columns at a specified cell size that can be positioned anywhere in geographic space.  As in the case of the Elevation surface in the lower-left portion of figure 3, a continuous gradient is formed with subtle elevation differences that allow hikers to step from cell to cell while considering relative steepness.  Or surface water to sequentially stream from a location to its steepest downhill neighbor thereby identifying a flow-path.  

 

The underlying concept of this data structure is that grid cells for all of the map layers precisely coincide, and by simply accessing map values at a row, column location a computer can “drill” down through the map layers noting their characteristics.  Similarly, noting the map values of surrounding cells identifies the characteristics within a location’s vicinity on a given map layer, or set of map layers. 

 

Keep in mind that while terrain elevation is the most common example of a map surface, it is by no means the only one.  In natural systems, temperature, barometric pressure, air pollution concentration, soil chemistry and water turbidity are but a few examples of continuous mapped data gradients.  In human systems, population density, income level, life style concentration, crime occurrence, disease incidence rate all form continuous map surfaces.  In economic systems, home values, sales activity and travel-time to/from stores form map variables that that track spatial patterns.   

 

In fact the preponderance of spatial data is easily and best represented as grid-based continuous map surfaces that are preconditioned for use in map analysis and modeling.  The computer does the heavy-lifting of the computation …what is needed is a new generation of creative minds that goes beyond mapping to “thinking with maps” within this less familiar, quantitative framework— a SpatialSTEM environment.

 _____________________________

Author’s Notes:  My involvement in map analysis/modeling began in the 1970s with doctoral work in computer-assisted analysis of remotely sensed data a couple of years before we had civilian satellites.  The extension from digital imagery classification using multivariate statistics and pattern recognition algorithms in the 70s to a comprehensive grid-based mathematical structure for all forms of mapped data in the 80s was a natural evolution.  See www.innovativegis.com, select “Online Papers” for a link to a 1986 paper on “A Mathematical Structure for Analyzing Maps” that serves as an early introduction to a comprehensive framework for map analysis/modeling.

 


Map-ematically Messing with Mapped Data

    
(GeoWorld, February 2012)  

(return to top of Topic)

 

The last section introduced the idea of spatialSTEM for teaching map analysis and modeling fundamentals within a mathematical context that resonates with science, technology, engineering and math/stat communities.  The discussion established a general framework and grid-based data structure needed for quantitative analysis of spatial patterns and relationships.  This section focuses on the nature of mapped data, an example of a grid-math/algebra application and discussion of extended spatial analysis operations.       

 

Figure 1 identifies the two primary perspectives of spatial data—1) Numeric that indicates how numbers are distributed in “number space” (What condition) and 2) Geographic that indicated how numbers are distributed in “geographic space” (Where condition).  The numeric perspective can be grouped into categories of Qualitative numbers that deal with general descriptions based on perceived “quality” and Quantitative numbers that deal with measured characteristics or “quantity.” 

 

Intro_2new_a

 

Figure 1. Spatial Data Perspectives—Where is What.

 

Further classification identifies the familiar numeric data types of Nominal, Ordinal, Interval, Ratio and Binary.  It is generally well known that very few math/stat operations can be performed using qualitative data (Nominal, Ordinal), whereas a wealth of operations can be used with quantitative data (Interval, Ratio).  Only a specialized few operations utilize Binary data. 

 

Less familiar are the two geographic data types.  Choropleth numbers form sharp and unpredictable boundaries in space, such as the values assigned to the discrete map features on a road or cover type map.  Isopleth numbers, on the other hand, form continuous and often predictable gradients in geographic space, such as the values on an elevation or temperature surface. 

 

Putting the Where and What perspectives of spatial data together, Discrete Maps identify mapped data with spatially independent numbers (qualitative or quantitative) forming sharp abrupt boundaries (choropleth), such as a cover type map.  Discrete maps generally provide limited footholds for quantitative map analysis.  On the other hand, Continuous Maps contain a range of values (quantitative only) that form spatial gradients (isopleth), such as an elevation surface.  They provide a wealth of analytics from basic grid math to map algebra, calculus and geometry.  

 

Intro_2new_b

 

Figure 2. Basic Grid Math and Algebra example.

 

Site-specific farming provides a good example of basic grid math and map algebra using continuous maps (figure 2).  Yield Mapping involves simultaneously recording yield flow and GPS position as a combine harvests a crop resulting in a grid map of thousands of geo-registered numbers that track crop yield throughout a field.   Grid Math can be used to calculate the mathematical difference in yield at each location between two years by simply subtracting the respective yield maps.  Map Algebra extends the processing by spatially evaluating the full algebraic percent change equation. 

 

The paradigm shift in this map-ematical approach is that map variables, comprised of thousands of geo-registered numbers, are substituted for traditional variables defined by only a single value.  Map algebra’s continuous map solution shows localized variation, rather than a single “typical” value being calculated (i.e., 37.3% increase in the example) and assumed everywhere the same in non-spatial analysis.  

 

Figure 3 expands basic Grid Math and Map Algebra into other mathematical arenas.  Advanced Grid Math includes most of the buttons on a scientific calculator to include trigonometric functions.  For example, taking the cosine of a slope map expressed in degrees and multiplying it times the planimetric surface area of a grid cell calculates the surface area of the “inclined plane” at each grid location.  The difference between planimetric area represented by traditional maps and surface area based on terrain steepness can be dramatic and greatly affect the characterization of “catchment areas” in environmental and engineering models of surface runoff. 

 

Intro_2new_c

 

Figure 3.  Spatial Analysis operations.

 

A Map Calculus expresses such functions as the derivative and integral within a spatial context.  The derivative traditionally identifies a measure of how a mathematical function changes as its input changes by assessing the slope along a curve in 2-dimensional abstract space. 

 

The spatial equivalent calculates a “slope map” depicting the rate of change in a continuous map variable in 3-dimensional geographic space.  For an elevation surface, slope depicts the rate of change in elevation.  For an accumulation cost surface, its slope map represents the rate of change in cost (i.e., a marginal cost map).  For a travel-time accumulation surface, its slope map indicates the relative change in speed and its aspect map identifies the direction of optimal movement at each location.  Also, the slope map of an existing topographic slope map (i.e., second derivative) will characterize surface roughness (i.e., areas where slope itself is changing).

 

Traditional calculus identifies an integral as the net signed area of a region along a curve expressing a mathematical function.  In a somewhat analogous procedure, areas under portions of continuous map surfaces can be characterized.  For example, the total area (planimetric or surface) within a series of watersheds can be calculated; or the total tax revenue for various neighborhoods; or the total carbon emissions along major highways; or the net difference in crop yield for various soil types in a field.  In the spatial integral, the net sum of the numeric values for portions of a continuous map surface (3D) is calculated in a manner comparable to calculating the area under a curve (2D).

 

Traditional geometry defines Distance as “the shortest straight line between two points” and routinely measures it with a ruler or calculates it using the Pythagorean Theorem.  Map Geometry extends the concept of distance to Simple Proximity by relaxing the requirement of just “two points” for distances to all locations surrounding a point or other map feature, such as a road. 

 

A further extension involves Effective Proximity that relaxes “straight line” to consider absolute and relative barriers to movement.  For example effective proximity might consider just uphill locations along a road or a complex set of variable hiking conditions that impede movement from a road as a function of slope, cover type and water barriers.  

 

The result is that the “shortest but not necessarily straight distance” is assigned to each grid location.  Because a straight line connection cannot be assumed, optimal path routines in Plane Geometry Connectivity (2D space) are needed to identify the actual shortest routes.  Solid Geometry Connectivity (3D space) involves line-of-sight connections that identify visual exposure among locations.  A final class of operations involves Unique Map Analytics, such as size, shape, intactness and contiguity of map features.     

 

Grid-based map analysis takes us well beyond traditional mapping …as well as taking us well beyond traditional procedures and paradigms of mathematics.  The next installment of spatialSTEM discussion considers the extension of traditional statistics to spatial statistics. 

_____________________________

Author’s Notes:  a table of URL links to further readings on the grid-based map analysis/modeling concepts, terminology, considerations and procedures described in this three-part series on spatialSTEM is posted at www.innovativegis.com/basis/MapAnalysis/MA_Intro/sSTEM/sSTEMreading.htm .

 

 

Paint by Numbers Outside the Traditional Statistics Box

    
(GeoWorld, March 2012)  

(return to top of Topic)

 

The two previous sections described a general framework and approach for teaching spatial analysis within a mathematical context that resonates with science, technology, engineering and math/stat communities (spatialSTEM).  The following discussion focuses on extending traditional statistics to a spatial statistics for understanding geographic-based patterns and relationships. 

 

Whereas Spatial analysis focuses on “contextual relationships” in geographic space (such as effective proximity and visual exposure), Spatial statistics focuses on “numerical relationships” within and among mapped data (figure 1).  From a spatial statistics perspective there are three primary analytical arenas— Summaries, Comparisons and Correlations.

 

Statistical summaries provide generalizations of the grid values comprising a single map layer (within), or set of map layers (among).  Most common is a tabular summary included in a discrete map’s legend that identifies the area and proportion of occurrence for each map category, such as extremely steep terrain comprising 286 acres (19 percent) of a project area.  Or for a continuous map surface of slope values, the generalization might identify the data range as from 0 to 65% and note that the average slope is 24.4 with a standard deviation of 16.7. 

 

Summaries among two or more discrete maps generate cross-tabular tables that “count” the joint occurrence of all categorical combinations of the map layers.  For example, the coincidence of steepness and cover maps might identify that there are 242 acres of forest cover on extremely steep slopes (16 percent), a particularly hazardous wildfire joint condition.

 

Map comparison and correlation techniques only apply to continuous mapped data.  Comparisons within a single map surface involve normalization techniques.  For example, a standard normal variable map can be generated to identify “how unusual” (above or below) each map location is compared to the typical value in a project area.

 

Direct comparisons among continuous map surfaces include appropriate statistical tests (e.g., F-test), difference maps and surface configuration differences based on variations in surface slope and orientation at each grid location.

 

 

Intro_3new_a

 

Figure 1. Spatial Statistics uses numerical analysis to uncover spatial relationships and patterns.

 

Map correlations provide a foothold for advanced inferential spatial statistics.  Spatial autocorrelation within a single map surface identifies the similarity among nearby values for each grid location.  It is most often associated with surface modeling techniques that employ the assumption that “nearby things are more alike than distant things”—high spatial autocorrelation—for distance-based weight averaging of discrete point samples to derive a continuous map surface.

 

Spatial correlation, on the other hand, identifies the degree of geographic dependence among two or more map layers and is the foundation of spatial data mining.  For example, a map surface of a bank’s existing concentration of home equity loans within a city can be regressed against a map surface of home values.  If a high level of spatial dependence exists, the derived regression equation can be used on home value data for another city.  The resulting map surface of estimated loan concentration proves useful in locating branch offices. 

 

In practice, many geo-business applications utilize numerous independent map layers including demographics, life style information and sales records from credit card swipes in developing spatially consistent multivariate models with very high R-squared values.  Like most things from ecology to economics to environmental considerations, spatial expression of variable dependence echoes niche theory with grid-based spatial statistics serving as a powerful tool for understanding geographic patterns and relationships.

 

Figure 2 describes an example of basic surface modeling and the linkage between numeric space and geographic space representations using environmentally-oriented mapped data.  Soil samples are collected and analyzed assuring that geographic coordinates accompany the field samples.  The resulting discrete point map of the field soil chemistry data are spatially interpolated into a continuous map surface characterizing the data set’s geographic distribution. 

 

The bottom portion of figure 2 depicts the linkage between Data Space and Geographic Space representations of the mapped data.  In data space, a standard normal curve is fitted to the data as means to characterize its overall “typical value” (Average= 22.9) and “typical dispersion” (StDev= 18.7) without regard for the data’s spatial distribution. 

        

Intro_3new_b

 

Figure 2. An example of Surface Modeling that derives a continuous map surface from set of discrete point data.

 

In geographic space, the Average forms a flat plane implying that this value is assumed to be everywhere within +/- 1 Standard Deviation about two-thirds of the time and offering no information about where values are likely more or less than the typical value.  The fitted continuous map surface, on the other hand, details the spatial variation inherent in the field collected samples. 

 

Non-spatial statistics identifies the “central tendency” of the data, whereas surface modeling maps the “spatial variation” of the data.  Like a Rochart ink blot, the histogram and the map surface provide two different perspectives.  Clicking a histogram pillar identifies all of the grid cells within that range; clicking on a grid location identifies which histogram range contains it. 

 

This direct linkage between the numerical and spatial characteristics of mapped data provides the foundation for the spatial statistics operations outlined in figure 3.  The first four classes of operations are fairly self-explanatory with the exception “Roving Window” summaries.  This technique first identifies the grid values surrounding a location, then mathematically/statistically summarizes the values, assigns the summary to that location and then moves to the next location and repeats the process. 

 

Another specialized use of roving windows is for Surface Modeling.  As described in figure 2, inverse-distance weighted spatial interpolation (IDW) is the weight-averaged of samples based on their relative distances from the focal location.  For qualitative data, the total number of occurrences within a window reach can be summed for a density surface. In figure 3 for example, a map identifying customer locations can be summed to identify the total number of customers within a roving window to generate a continuous map surface customer density.  In turn, the average and standard deviation can be used to identify “pockets” of unusually high customer density. 

     

Intro_3new_c

 

Figure 3. Classes of Spatial Statistics operations.

 

Standard multivariate techniques using “data distance,” such as Maximum Likelihood and Clustering, can be used to classify sets of map variables.  Map Similarity, for example, can be used to compare each map location’s pattern of values with a comparison location’s pattern to create a continuous map surface of the relative degree of similarity at each map location.

 

Statistical techniques, such as Regression, can be used to develop mathematical functions between dependent and independent map variables.  The difference between spatial and non-spatial approaches is that the map variables are spatially consistent and yield a prediction map that shows where high and low estimates are to be expected.  

 

The bottom line in spatial statistics (as well as spatial analysis) is that the spatial character within and among map layers is taken into account.  The grid-based representation of mapped data provides the  consistent framework that needed for these analyses.  Each database record contains geographic coordinates (X,Y= Where) and value fields identifying the characteristics/conditions at that location (Vi= What). 

 

From this map-ematical view, traditional math/stat procedures can be extended into geographic space.  The paradigm shift from our paper map legacy to “maps as data first, pictures later” propels us beyond mapping to map analysis and modeling.  In addition, it defines a comprehensive and common spatialSTEM educational environment that stimulates students with diverse backgrounds and interests to “think analytically with maps” in solving complex problems. 

_____________________________

Author’s Notes:  a table of URL links to further readings on the grid-based map analysis/modeling concepts, terminology, considerations and procedures described in this three-part series on spatialSTEM is posted at www.innovativegis.com/basis/MapAnalysis/MA_Intro/sSTEM/sSTEMreading.htm .

 

 

        (return to top of Topic)

 (Back to the Table of Contents)