Mapping and Display

 

THIS ECTION HAS NOT BEEN COMPLETED...

 

The main criteria used to define a GIS is its capability to transform and integrate spatial data.

Manipulation and Transformation of Spatial Data

The maintenance and transformation of spatial data concerns the ability to input, manipulate, and transform data once it has been created. While many different interpretations exist with respect to what constitutes these capabilities some specific functions can be identified. These are reviewed below.

Coordinate Thinning

Coordinate thinning involves the weeding or reduction of coordinate pairs, e.g. X and Y, from arcs. This function is often required when data has been captured with too many vertices for the linear features. This can result in redundant data and large data volumes. The weeding of coordinates is required to reduce this redundancy.

The thinning of coordinates is also required in the map generalization process of linear simplification. Linear simplification is one component of generalization that is required when data from one scale, e.g. 1:20,000, is to be used and integrated with data from another scale, e.g. 1:100,000. Coordinate thinning is often done on features such as contours, hydrography, and forest stand boundaries.

Geometric Transformations

This function is concerned with the registering of a data layer to a common coordinate scheme. This usually involves registering selected data layers to a standard data layer already registered. The term rubber sheeting is often used to describe this function. Rubber sheeting involves stretching one data layer to meet another based on predefined control points of known locations. Two other functions may be categorized under geometric transformations. These involve warping a data layer stored in one data model, either raster or vector, to another data layer stored in the opposite data model. For example, often classified satellite imagery may require warping to fit an existing forest inventory layer, or a poor quality vector layer may require warping to match a more accurate raster layer.

Map Projection Transformations

This functionality concerns the transformation of data in geographic coordinates for an existing map projection to another map projection. Most GIS software requires that data layers must be in the same map projection for analysis. Accordingly, if data is acquired in a different projection than the other data layers it must be transformed. Typically 20 or more different map projections are supported in a GIS software offering.

Conflation - Sliver Removal

Conflation is formally defined as the procedure of reconciling the positions of corresponding features in different data layers. More commonly this is referred to as sliver removal. Often two layers that contain the same feature, e.g. soils and forest stands both with a specific lake, do not have exactly the same boundaries for that feature, e.g. the lake. This may be caused by a lack of coordination or data prioritization during digitizing or by a number of different manipulation and analysis techniques. When the two layers are combined, e.g. normally in polygon overlay, they will not match precisely and small sliver polygons will be created. Conflation is concerned with the process for removing these slivers and reconciling the common boundary.

There are several approaches for sliver removal. Perhaps the most common is allowing the user to define a priority for data layers in combination with a tolerance value. Considering the soils and forest stand example the user could define a layer that takes precedence, e.g. forest stands, and a size tolerance for slivers. After polygon overlay if a polygon is below the size tolerance it is classified a sliver. To reconcile the situation the arcs of the data layer that has higher priority will be retained and the arcs of the other data layer will be deleted. Another approach is to simply divide the sliver down the centre and collapse the arcs making up the boundary. The important point is that all GIS software must have the capability to resolve slivers. Remember that it is generally much less expensive to reconcile maps manually in the map preparation and digitizing stage than afterwards.

Edge Matching

Edge matching is simply the procedure to adjust the position of features that extend across typical map sheet boundaries. Theoretically data from adjacent map sheets should meet precisely at map edges. However, in practice this rarely occurs. Misalignment of features can be caused by several factors including digitizing error, paper shrinkage of source maps, and errors in the original mapping. Edge matching always requires some interactive editing. Accordingly, GIS software differs considerably in the degree of automation provided.

Interactive Graphic Editing

Interactive graphic editing functions involve the addition, deletion, moving, and changing of the geographic position of features. Editing should be possible at any time. Most graphic editing occurs during the data compilation phase of any project. Remember typically 60 to 70 % of the time required to complete any project involves data compilation. Accordingly, the level of sophistication and ease of use of this capability is vitally important and should be rated highly by those evaluating GIS software. Many of the editing that is undertaken involves the cleaning up of topological errors identified earlier. The capability to snap to existing elements, e.g. nodes and arcs, is critical.

The functionality of graphic editing does not differ greatly across GIS software offerings. However, the user interface and ease of use of the editing functions usually does. Editing within a GIS software package should be as easy as using a CAD system. A cumbersome or incomplete graphic editing capability will lead to much frustration by the users of the software.

Integration and Modelling of Spatial Data

The integration of data provides the ability to ask complex spatial questions that could not be answered otherwise. Often, these are inventory or locational questions such as how much ? or where ?. Answers to locational and quantitative questions require the combination of several different data layers to be able to provide a more complete and realistic answer. The ability to combine and integrate data is the backbone of GIS.

Often, applications do require a more sophisticated approach to answer complex spatial queries and what if ? scenarios. The technique used to solve these questions is called spatial modelling. Spatial modelling infers the use of spatial characteristics and methods in manipulating data. Methods exist to create an almost unlimited range of capabilities for data analysis by stringing together sets of primitive analysis functions. While some explicit analytical models do exist, especially in natural resource applications, most modelling formulae (models) are determined based on the needs of a particular project. The capability to undertake complex modelling of spatial data, on an ad hoc basis, has helped to further the resource specialists understanding of the natural environment, and the relationship between selected characteristics of that environment.

The use of GIS spatial modelling tools in several traditional resource activities has helped to quantify processes and define models for deriving analysis products. This is particularly true in the area of resource planning and inventory compilation. Most GIS users are able to better organize their applications because of their interaction with, and use of, GIS technology. The utilization of spatial modelling techniques requires a comprehensive understanding of the data sets involved, and the analysis requirements.

The critical function for any GIS is the integration of data.

The raster data model has become the primary spatial data source for analytical modeling with GIS. The raster data model is well suited to the quantitative analysis of numerous data layers. To facilitate these raster modeling techniques most GIS software employs a separate module specifically for cell processing.

(from Berry)
The following diagram represents a logic flowchart of a typical natural resource model using GIS raster modeling techniques. The boxes represent raster maps in the GIS, while the connection lines imply an analytical function or technique.
(from Berry)

Integrated Analytical Functions in a GIS

Most GIS's provide the capability to build complex models by combining primitive analytical functions. Systems vary as to the complexity provided for spatial modelling, and the specific functions that are available. However, most systems provide a standard set of primitive analytical functions that are accessible to the user in some logical manner. Aronoff identifies four categories of GIS analysis functions. These are :

Retrieval, Reclassification, and Generalization;
Topological Overlay Techniques;
Neighbourhood Operations; and
Connectivity Functions.

The range of analysis techniques in these categories is very large. Accordingly, this section of the book focuses on providing an overview of the fundamental primitive functions that are most often utilized in spatial analyses.

Retrieval, Reclassification and Generalization

Perhaps the initial GIS analysis that any user undertakes is the retrieval and/or reclassification of data. Retrieval operations occur on both spatial and attribute data. Often data is selected by an attribute subset and viewed graphically. Retrieval involves the selective search, manipulation, and output of data without the requirement to modify the geographic location of the features involved.

Reclassification involves the selection and presentation of a selected layer of data based on the classes or values of a specific attribute, e.g. cover group. It involves looking at an attribute, or a series of attributes, for a single data layer and classifying the data layer based on the range of values of the attribute. Accordingly, features adjacent to one another that have a common value, e.g. cover group, but differ in other characteristics, e.g. tree height, species, will be treated and appear as one class. In raster based GIS software, numerical values are often used to indicate classes. Reclassification is an attribute generalization technique. Typically this function makes use of polygon patterning techniques such as crosshatching and/or color shading for graphic representation.

n a vector based GIS, boundaries between polygons of common reclassed values should be dissolved to create a cleaner map of homogeneous continuity. Raster reclassification intrinsically involves boundary dissolving. The dissolving of map boundaries based on a specific attribute value often results in a new data layer being created. This is often done for visual clarity in the creation of derived maps. Almost all GIS software provides the capability to easily dissolve boundaries based on the results of a reclassification. Some systems allow the user to create a new data layer for the reclassification while others simply dissolve the boundaries during data output.

One can see how the querying capability of the DBMS is a necessity in the reclassification process. The ability and process for displaying the results of reclassification, a map or report, will vary depending on the GIS. In some systems the querying process is independent from data display functions, while in others they are integrated and querying is done in a graphics mode. The exact process for undertaking a reclassification varies greatly from GIS to GIS. Some will store results of the query in query sets independent from the DBMS, while others store the results in a newly created attribute column in the DBMS. The approach varies drastically depending on the architecture of the GIS software.

Topological Overlay

The capability to overlay multiple data layers in a vertical fashion is the most required and common technique in geographic data processing. In fact, the use of a topological data structure can be traced back to the need for overlaying vector data layers. With the advent of the concepts of mathematical topology polygon overlay has become the most popular geoprocessing tool, and the basis of any functional GIS software package.

Topological overlay is predominantly concerned with overlaying polygon data with polygon data, e.g. soils and forest cover. However, there are requirements for overlaying point, linear, and polygon data in selected combinations, e.g. point in polygon, line in polygon, and polygon on polygon are the most common. Vector and raster based software differ considerably in their approach to topological overlay.

Raster based software is oriented towards arithmetic overlay operations, e.g. the addition, subtraction, division, multiplication of data layers. The nature of the one attribute map approach, typical of the raster data model, usually provides a more flexible and efficient overlay capability. The raster data model affords a strong numerically modelling (quantitative analysis) modelling capability. Most sophisticated spatial modelling is undertaken within the raster domain.

In vector based systems topological overlay is achieved by the creation of a new topological network from two or more existing networks. This requires the rebuilding of topological tables, e.g. arc, node, polygon, and therefore can be time consuming and CPU intensive. The result of a topological overlay in the vector domain is a new topological network that will contain attributes of the original input data layers. In this way selected queries can then be undertaken of the original layer, e.g. soils and forest cover, to determine where specific situations occur, e.g. deciduous forest cover where drainage is poor.

Most GIS software makes use of a consistent logic for the overlay of multiple data layers. The rules of Boolean logic are used to operate on the attributes and spatial properties of geographic features. Boolean algebra uses the operators AND, OR, XOR, NOT to see whether a particular condition is true or false. Boolean logic represents all possible combinations of spatial interaction between different features. The implementation of Boolean operators is often transparent to the user.

To date the primary analysis technique used in GIS applications, vector and raster, is the topological overlay of selected data layers.

Generally, GIS software implements the overlay of different vector data layers by combining the spatial and attribute data files of the layers to create a new data layer. Again, different GIS software utilize varying approaches for the display and reporting of overlay results. Some systems require that topological overlay occur on only two data layers at a time, creating a third layer. This pairwise approach requires the nesting of multiple overlays to generate a final overlay product, if more than two data layers are involved. This can result in numerous intermediate or temporary data layers. Some systems create a complete topological structure at the data verification stage, and the user merely submits a query string for the combined topological data. Other systems allow the user to overlay multiple data layers at one time. Each approach has its drawbacks depending on the application and the nature of the implementation. Determining the most appropriate method is based on the type of application, practical considerations such as data volumes and CPU power, and other considerations such personnel and time requirements. Overall, the flexibility provided to the operator and the level of performance varies widely among GIS software offerings.

The following diagram illustrates a typical overlay requirements where several different layers are spatially joined to created a new topological layer. By combining multiple layers in a topological fashion complex queries can be answered concerning attributes of any layer.
 
Neighbourhood Operations

Neighbourhood operations evaluate the characteristics of an area surrounding a specific location. Virtually all GIS software provides some form of neighbourhood analysis. A range of different neighbourhood functions exist. The analysis of topographic features, e.g. the relief of the landscape, is normally categorized as being a neighbourhood operation. This involves a variety of point interpolation techniques including slope and aspect calculations, contour generation, and Thiessen polygons. Interpolation is defined as the method of predicting unknown values using known values of neighbouring locations. Interpolation is utilized most often with point based elevation data.

Elevation data usually takes the form of irregular or regular spaced points. Irregularly space points are stored in a Triangular Irregular Network (TIN). A TIN is a vector topological network of triangular facets generated by joining the irregular points with straight line segments. The TIN structure is utilized when irregular data is available, predominantly in vector based systems. TIN is a vector data model for 3-D data.

An alternative in storing elevation data is the regular point Digital Elevation Model (DEM). The term DEM usually refers to a grid of regularly space elevation points. These points are usually stored with a raster data model. Most GIS software offerings provide three dimensional analysis capabilities in a separate module of the software. Again, they vary considerably with respect to their functionality and the level of integration between the 3-D module and the other more typical analysis functions.

Without doubt the most common neighbourhood function is buffering. Buffering involves the ability to create distance buffers around selected features, be it points, lines, or areas. Buffers are created as polygons because they represent an area around a feature. Buffering is also referred to as corridor or zone generation with the raster data model. Usually, the results of a buffering process are utilized in a topological overlay with another data layer. For example, to determine the volume of timber within a selected distance of a cutline, the user would first buffer the cutline data layer. They would then overlay the resultant buffer data layer, a buffer polygon, with the forest cover data layer in a clipping fashion. This would result in a new data layer that only contained the forest cover within the buffer zone. Since all attributes are maintained in the topological overlay and buffering processes, a map or report could then be generated.

Buffering is typically used with point or linear features. The generation of buffers for selected features is frequently based on a distance from that feature, or on a specific attribute of that feature. For example, some features may have a greater zone of influence due to specific characteristics, e.g. a primary highway would generally have a greater influence than a gravel road. Accordingly, different size buffers can be generated for features within a data layer based on selected attribute values or feature types.

Connectivity Analysis

The distinguishing feature of connectivity operations is that they use functions that accumulate values over an area being traversed. Most often these include the analysis of surfaces and networks. Connectivity functions include proximity analysis, network analysis, spread functions, and three dimensional surface analysis such as visibility and perspective viewing. This category of analysis techniques is the least developed in commercial GIS software. Consequently, there is often a great difference in the functionality offered between GIS software offerings. Raster based systems often provide the more sophisticated surface analysis capabilities while vector based systems tend to focus on linear network analysis capabilities. However, this appears to be changing as GIS software becomes more sophisticated, and multi-disciplinary applications require a more comprehensive and integrated functionality. Some GIS offerings provide both vector and raster analysis capabilities. Only in these systems will one fund a full range of connectivity analysis techniques.

Proximity analysis techniques are primarily concerned with the proximity of one feature to another. Usually proximity is defined as the ability to identify any feature that is near any other feature based on location, attribute value, or a specific distance. A simple example is identifying all the forest stands that are within 100 metres of a gravel road, but not necessarily adjacent to it. It is important to note that neighbourhood buffering is often categorized as being a proximity analysis capability. Depending on the particular GIS software package, the data model employed, and the operational architecture of the software it may be difficult to distinguish proximity analysis and buffering.
 
Proximity analysis is often used in urban based applications to consider areas of influence, and ownership queries. Proximity to roads and engineering infrastructure is typically important for development planning, tax calculations, and utility billing.

The identification of adjacency is another proximity analysis function. Adjacency is defined as the ability to identify any feature having certain attributes that exhibit adjacency with other selected features having certain attributes. A typical example is the ability to identify all forest stands of a specific type, e.g. specie, adjacent to a gravel road.

Network analysis is a widely used analysis technique. Network analysis techniques can be characterized by their use of feature networks. Feature networks are almost entirely comprised of linear features. Hydrographic hierarchies and transportation networks are prime examples. Two example network analysis techniques are the allocation of values to selected features within the network to determine capacity zones, and the determination of shortest path between connected points or nodes within the network based on attribute values. This is often referred to as route optimization. Attribute values may be as simple as minimal distance, or more complex involving a model using several attributes defining rate of flow, impedance, and cost.
Three dimensional analysis involves a range of different capabilities. The most utilized is the generation of perspective surfaces. Perspective surfaces are usually represented by a wire frame diagram reflecting profiles of the landscape, e.g. every 100 metres. These profiles viewed together, with the removal of hidden lines, provide a three dimensional view. As previously identified, most GIS software packages offer 3-D capabilities in a separate module. Several other functions are normally available.

These include the following functions :

user definable vertical exaggeration, viewing azimuth, and elevation angle;
identification of viewsheds, e.g. seen versus unseen areas;
the draping of features, e.g. point, lines, and shaded polygons onto the perspective surface;
generation of shaded relief models simulating illumination;
generation of cross section profiles;
presentation of symbology on the 3-D surface; and
line of sight perspective views from user defined viewpoints.

While the primitive analytical functions have been presented the reader should be aware that a wide range of more specific and detailed capabilities do exist.

The overriding theme of all GIS software is that the analytical functions are totally integrated with the DBMS component. This integration provides the necessary foundation for all analysis techniques.