Unite 3
Unite 3
Unite 3
The Geographic phenomena Are those drastic and observable changes that take place
in nature. They can occur abruptly and are capable of transforming the environment, so
that, after these phenomena occur, a new reality arises.
1
[email protected]
CSC468 Geographical Information System | Prithivi Narayan Campus | Dev Timilsina
Then, in the nature is part of a geographic fact. Then a phenomenon that generates an
abrupt variation in the environment is generated, and the new reality that is generated
later becomes a new geographic fact.
A GIS operates under the assumption that the spatial phenomena involved occur in a
two- or three-dimensional Euclidean space. Euclidean space can be informally defined
as a model of space in which locations are represented by coordinates (x, y) in 2D and
(x, y, z) in 3D space and distance and direction can defined with geometric formulas. In
2D, this is known as the Euclidean plane. To represent relevant aspects of real-world
phenomena inside a GIS, we first need to define what it is we are referring to. We might
define a geographic phenomenon as a display of an entity or process of interest that:
Geographic phenomena can be classified according to the elements from which they
occur. This classification includes three types: physical, biological and human.
Within physical geographic changes can be storms, tornados, heavy rains and
earthquakes, among others. The physical geographic changes that are generated are
able to transform the landscape and generate a new reality.
Rivers overflow: A river can overflow as a result of different natural causes. Some of the
possible causes may be the following:
2
[email protected]
CSC468 Geographical Information System | Prithivi Narayan Campus | Dev Timilsina
When a river overflows, it can generate lasting changes in the landscape. It is possible
for the river to permanently expand its channel, to flood surrounding plant species, and if
there are human communities nearby, it is possible that it may collapse with houses,
buildings, roads and other constructions.
Biological geographic phenomena are those that are generated by living beings,
excluding humans. Within this classification are the geographic variations produced by
plants, animals, insects and microorganisms. Some examples of biological geographic
changes can be:
Plague deforestation
The appearance of pests can destroy large areas of vegetation. Pests can appear, for
example, as a result of an imbalance in fauna; If there are no natural predators, a species
can become a pest.
Pests especially affect plants located on land with few nutrients, which can lead to
deforestation of entire regions and altering the environment completely. Pests can also
greatly reduce the amount of animal organisms in an area.
These phenomena are the most obvious and, in many cases, invasive that can be found
on the planet. Human geographic phenomena are caused exclusively by the action of
man in his environment.
Like physical and biological phenomena, human geographic phenomena alter the
environment in a lasting way. As a result of these transformations, positive and, in many
cases, negative consequences can be generated. Some examples of human geographic
phenomena:
Construction of roads
3
[email protected]
CSC468 Geographical Information System | Prithivi Narayan Campus | Dev Timilsina
As a result of the need to expand its communication channels, human beings have
transformed their environment. This has involved the construction of roads and roads that
intervene openly in the environment. The construction of this type of structures has been
beneficial for the development of the human race, allowing to extend the interaction
between the men and to generate a more effective communication.
However, in some cases the intervention has been detrimental to nature, because some
Ecosystems Have been affected.
Construction of dams
Hydraulic dams are structures, made of walls and containment elements, whose main
function is to store or divert water from a river to serve different purposes.
Among the functions of a water dam are the regulation of water supply in a particular
region, the storage of water for irrigation or energy production.
When constructing a dam, the human being interferes to a great extent in the nature.
These constructions generate positive consequences for human life, such as renewable
energy production, flood control in certain areas and facilitating access to water for human
consumption.
Data modeling is the process of creating a simplified diagram of a software system and
the data elements it contains, using text and symbols to represent the data and how it
flows. Data models provide a blueprint for designing a new database or reengineering a
legacy application. Overall, data modeling helps an organization use its data effectively
to meet business needs for information.
4
[email protected]
CSC468 Geographical Information System | Prithivi Narayan Campus | Dev Timilsina
A data model can be thought of as a flowchart that illustrates data entities, their attributes
and the relationships between entities. It enables data management and analytics teams
to document data requirements for applications and identify errors in development plans
before any code is written.
Alternatively, data models can be created through reverse-engineering efforts that extract
them from existing systems. That's done to document the structure of relational databases
that were built on an ad hoc basis without upfront data modeling and to define schemas
for sets of raw data stored in data lakes or NoSQL databases to support specific analytics
applications.
Data modeling can also help establish common data definitions and internal data
standards, often in connection with data governance programs. In addition, it plays a big
role in data architecture processes that document data assets, map how data moves
through IT systems and create a conceptual data management framework. Data models
are a key data architecture component, along with data flow diagrams, architectural
blueprints, a unified data vocabulary and other artifacts.
Data modelers use three types of models to separately represent business concepts and
workflows, relevant data entities and their attributes and relationships, also the technical
structures for managing the data. The models typically are created in a progression as
organizations plan new applications and databases. These are the different types of data
models and what they include:
This is a high-level visualization of the business or analytics processes that a system will
support. It maps out the kinds of data that are needed, how different business entities
interrelate and associated business rules. Business executives are the main audience for
conceptual data models, to help them see how a system will work and ensure that it meets
business needs. Conceptual models aren't tied to specific database or application
technologies.
5
[email protected]
CSC468 Geographical Information System | Prithivi Narayan Campus | Dev Timilsina
Once a conceptual data model is finished, it can be used to create a less-abstract logical
one. Logical data models show how data entities are related and describe the data from
a technical perspective. For example, they define data structures and provide details on
attributes, keys, data types and other characteristics. The technical side of an
organization uses logical models to help understand required application and database
designs. But like conceptual models, they aren't connected to a particular technology
platform.
A logical model serves as the basis for the creation of a physical data model. Physical
models are specific to the database management system (DBMS) or application software
that will be implemented. They define the structures that the database or a file system will
use to store and manage the data. That includes tables, columns, fields, indexes,
constraints, triggers and other DBMS elements. Database designers use physical data
models to create designs and generate schema for databases.
Data modeling emerged in the 1960s as databases became more widely used on
mainframes and then minicomputers. It enabled organizations to bring consistency,
repeatability and disciplined development to data processing and management. That's
still the case, but the techniques used to create data models have evolved along with the
development of new types of databases and computer systems.
These are the data modeling approaches used most widely over the years, including
several that have largely been supplanted by newer techniques.
Hierarchical data models organize data in a treelike arrangement of parent and child
records. A child record can have only one parent, making this a one-to-many modeling
method. The hierarchical approach originated in mainframe databases -- IBM's
Information Management System (IMS) is the best-known example. Although hierarchical
6
[email protected]
CSC468 Geographical Information System | Prithivi Narayan Campus | Dev Timilsina
data models were mostly superseded by relational ones beginning in the 1980s, IMS is
still available and used by many organizations. A similar hierarchical method is also used
today in XML, formally known as Extensible Markup Language.
This was also a popular data modeling option in mainframe databases that isn't used as
much now. Network data models expanded on hierarchical ones by allowing child records
to be connected to multiple parent records. The Conference on Data Systems
Languages, a now-defunct technical standards group commonly called CODASYL,
adopted a network data model specification in 1969. Because of that, the network
technique is often referred to as the CODASYL model.
The relational data model was created as a more flexible alternative to hierarchical and
network ones. First described in a 1970 technical paper by IBM researcher Edgar F.
Codd, the relational model maps the relationships between data elements stored in
different tables that contain sets of rows and columns. Relational modeling set the stage
for the development of relational databases, and their widespread use made it the
dominant data modeling technique by the mid-1990s.
A variation of the relational model that can also be used with other types of databases,
entity-relationship (ER) models visually map entities, their attributes and the relationships
between different entities. For example, the attributes of an employee data entity could
include last name, first name, years employed and other relevant data. ER models
provide an efficient approach for data capture and update processes, making them
particularly suitable for transaction processing applications.
The two fundamental ways of representing geography are discrete objects and fields.
Discrete objects the discrete object view represents the world as objects with well-defined
boundaries in empty space. Just as the desktop may be littered with books, pencils, or
7
[email protected]
CSC468 Geographical Information System | Prithivi Narayan Campus | Dev Timilsina
computers, the geographic world is littered with cars, houses, forest stands, and other
discrete objects. One characteristic of the discrete object view is that objects can be
counted. For example, there may be 49 houses in a particular subdivision.
Geographic objects are identified by their dimensionality. Objects that occupy area,
including lakes, and forest stands, are termed two-
dimensional and generally referred to as areas or
polygons. Other objects that are linear, including roads,
railways, and rivers, are termed one-dimensional and
generally referred to as lines. Objects that are single
locations, including individual animals and buildings, are
termed zero-dimensional and generally referred to as
points.
8
[email protected]
CSC468 Geographical Information System | Prithivi Narayan Campus | Dev Timilsina
row corresponding to a different discrete object and each column to an attribute of the
object.
Fields
While we might think of land as composed of discrete mountain peaks, valleys, ridges,
slopes, etc., and think of listing them in tables and counting them, in practice there are
unresolvable problems of definition for all of these objects. Instead, it is much more useful
to think of terrain as a continuous surface in which elevation can be defined rigorously at
every point. Such continuous surfaces form the basis of the other common view of
geographic phenomena, known as the field view. The field view represents the real
world as a finite number of variables, each one defined at every possible position.
Discrete objects are distinguished by their dimensions and naturally fall into categories of
points, lines, and areas. Fields, on the other hand, can be notable by what varies and
how smoothly. A field of elevation, for example, varies much more smoothly in a
landscape that has been worn down by glaciation or compressed by blowing sand than
one recently created by cooling lava. Cliffs are places in fields where elevation changes
suddenly rather than smoothly.
9
[email protected]
CSC468 Geographical Information System | Prithivi Narayan Campus | Dev Timilsina
There are two essential methods used to store information in a Geographic Information
System GIS for both reflections: Raster and Vector Data Model. GIS data represents
real-world objects such as roads, land use, elevation with digital data. The Real-world
objects or features of earth can be divided into two abstractions: discrete objects (a
Tree) and continuous fields (like elevation).
To work in a GIS environment, real world observations (objects or events that can be
recorded in 2D or 3D space) need to be reduced to spatial entities. These spatial entities
can be represented in a GIS as a vector data model or a raster data model.
A raster data type is made up of pixel or cells and each pixel has an associated value.
Digital Photography is the best example of raster data type model, anyone who is familiar
with digital photography can recognize the pixels as the smallest individual unit of an
image, where each pixel value in the image corresponds to a particular color and the
combination of these pixels will create an image. As of now, the best example of raster
data that is commonly used is Aerial photos, with only one purpose, to display a detailed
image on a map or for the purposes of digitization. Raster data type consists of rows and
columns of cells and these each cells stores a single value. Raster data can be
images (raster images) with each pixel containing a color value. In Raster, data is
represented as a grid of (usually square) cells. Each cell of a raster, stores a single
value and it can be extended by using raster bands to represent RGB (red, green, blue)
colors.
10
[email protected]
CSC468 Geographical Information System | Prithivi Narayan Campus | Dev Timilsina
Points When geographic features are too small to represent as polygons, points features
are used; in other words, simple location. For example, the locations of Trees, depth,
Point of Interest. These vector points are simply XY Co-ordinates.
Lines or polylines vector lines or polylines connect with each vertex with paths, they
usually represent features that are linear such as rivers, roads, railroads, and
pipelines.
11
[email protected]
CSC468 Geographical Information System | Prithivi Narayan Campus | Dev Timilsina
Vector features are group into layers and features in a specific layer have the same
geometry type. For example, if a layer contains a Polygons feature, then GIS application
will only allow a user to create a new polygon feature in the same layer. Each of the vector
features is stored in a database along with their attributes. For example, a database that
describes a Street may contain a Street’s Name, Type, speed limit. The User can perform
spatial analysis with different geometries.
While deciding whether to use a vector data model or raster data model in your work it
entirely depends on the data you have as input and what your goals are for displaying or
analyzing the data. There are many analyses
that make use of both data models i.e. vector
and raster or require the conversion of one
data model to another. While conversion is a
common procedure, it’s suggested that any
translation between data model to be kept at
a minimum to avoid accumulating error in
12
[email protected]
CSC468 Geographical Information System | Prithivi Narayan Campus | Dev Timilsina
your spatial model. The size of the dataset should be a consideration, as raster dataset
can be quite large and difficult for some workstation to process in a timely manner. Vector
data model is recommended for analysis, unless modeling a continuous surface and
when using a raster data model, it is important to use cell sizes that are appropriate for
analysis.
The term “spatial relationships” refers to the way objects are arranged in relation to
one another in geographic space. For example, we can describe them as adjacency,
contiguity, overlap, and proximity.
Here are a couple of quotes that all good geographers should remember to better
understand the concept of spatial relationships.
“Everything is related to everything else. But near things are more related than distant
things.” -Waldo R. Tobler
“Geography has made us neighbors. History has made us friends. Economics has made
us partners, and necessity has made us allies” -John F Kennedy
What each quote implies is that geography is connected. And this is the same as spatial
relationships because they identify how features relate to one another in geographic
space.
The adjacency, contiguity, overlap, and proximity are the four ways of describing the
relationship between two or more entities.
Adjacency: Adjacency is when two polygon entities are directly connected to each other
sharing a boundary.
Contiguity: Contiguity is the relationship between two or more entities when they share
an edge such as the 48 contiguous states.
13
[email protected]
CSC468 Geographical Information System | Prithivi Narayan Campus | Dev Timilsina
Overlap: An overlap occurs when a single entity shares the same location or partial
location of another entity.
Proximity: Finally, proximity is defined as being close enough that you can establish
contact with each other without having physical contact with each other. There is no single
rule for what a spatial relationship between two objects is. But the common types of
spatial relationships include overlap, proximity, contiguity, and adjacency.
A spatial join is a powerful tool in the GIS world. It allows data to be connected or joined
together, by sharing layers with the same attributes based on location. In GIS, the spatial
join can be achieved through different spatial relationships.
14
[email protected]
CSC468 Geographical Information System | Prithivi Narayan Campus | Dev Timilsina
Identical Both features match identically. When two features are exactly
without having any difference or variant of any kind, the
features are identical.
Closest The join feature is closest to the target feature. When there are
several possible features to join, this spatial join takes only the
closest feature.
Today, topology in GIS is generally defined as the spatial relationships between adjacent
or neighboring features. Mathematical topology assumes that geographic features occur
on a two-dimensional plane.
15
[email protected]
CSC468 Geographical Information System | Prithivi Narayan Campus | Dev Timilsina
SDC: Smart Data Compression: SDC is ESRI's highly compressed format, which is
directly readable by ArcGIS software, but not by ArcView 3.x. Many ESRI Data and Maps
datasets are natively in SDC format. (Environmental Systems Research Institute)
With ESRI Data and Maps 2006, a standalone "Data Distribution Application" was
included that converts SDC data files directly to shapefiles.
LYR: Layer File: A .lyr file is directly readable only by ArcGIS software and other newer
software applications. This file does not contain actual geographic data, but rather
contains specifications for the presentation of other datasets. Such specifications include
color shading, naming, label properties (font, color, placements, etc.). Such presentation
properties are usually time consuming to create, so a .lyr file allows these settings to be
16
[email protected]
CSC468 Geographical Information System | Prithivi Narayan Campus | Dev Timilsina
saved and shared. In order to use a .lyr file, you must also have a seperate data file with
the same prefix name saved in the same filespace.
SHP: Shapefile: The ESRI Shapefile has become an industry standard geospatial data
format, and is compatible to some extent with practically all recently released GIS
software. To have a complete shapefile, you must have at least 3 files with the same
prefix name and with the following extensions: .shp = shapefile, .shx = header and .dbf =
associated database file. Additionally, you may have a .prj = Projection file, a .lyr = layer
file, and other index files. All these files must be saved in the same workspace.
ArcInfo Coverage: An ArcInfo coverage does not have an individual file extension.
Instead, it is composed of two folders within a "workspace" which each contain multiple
files. One of the two folders carries the name of the coverage, and contains a number of
various .adf files. The other folder is an "info" folder, which typically contains .dat and .nit
files for all the coverages and grids in the workspace. The best way to manage (copy,
move, delete, rename) ArcInfo coverages is with ArcCatalog or ArcInfo Workstation
(command line).
E00: Arc Export or Interchange Format: .e00 (pronounced e-o-o or e-zero-zero) files
are ArcInfo Interchange or export files, used to conveniently copy and move ArcInfo GIS
coverages (see above) and grids (see below). An .e00 file must be "imported" in order to
use the data in ArcView or other GIS software.
17
[email protected]
CSC468 Geographical Information System | Prithivi Narayan Campus | Dev Timilsina
The selective inclusion of points within a TIN gives the triangles their irregular pattern and
reduces the amount of data storage required relative to the regularly distributed points in
a DEM.
ArcInfo Grid: An ArcInfo Grid does not have an individual file extension. Instead it is
composed of two folders within a "workspace" which each contain multiple files. One of
the two folders carries the name of the grid, and contains a number of various .adf files.
The other folder is an "info" folder, which typically contains .dat and .nit files for all the
coverages and grids in the workspace. The best way to manage (copy, move, delete,
rename) ArcInfo Grids is with ArcCatalog or ArcInfo Workstation (command line).
Most recent GIS software, including ArcGIS, are able to read MrSID compressed images
without any additional extensions. ArcView 3.x, however, requires a MrSID Extension for
image access. Plugins for other software, such as AutoCAD and Photoshop, may or may
not be required.
JPEG 2000: JPEG 2000 is a non-proprietary image compression format based on ISO
standards, and typically uses .jp2 as the file extension. It's advantages are that it offers
lossy and lossless compression, and world files (.j2w) can be used to georeference an
18
[email protected]
CSC468 Geographical Information System | Prithivi Narayan Campus | Dev Timilsina
image in GIS software. Compression ratios are similar to MrSID and ECW formats. For
more information, see the Wikipedia entry for JPEG 2000.
Vector Data
.kml Google KML File Is a XML-based plaintext file that may contain
geometry, data, or a pointer to a web service
.kmz Compressed KML Same as above, but compressed. You can
uncomrpess this with 7zip (free/open) and you will
see plaintext KML
.gpx GPS data file XML-based GPS data file, usually coming from a
GPS device.
.gdb File Geodatabase Esri File Geodatabase can be used to store vector
(Directory) and raster data as well as more complex data
containing topologies, and other supporting
files. The File Geodatase is a directory, which
contains many files which can not be read on their
own
.mdb Personal Esri Personal Geodatabase format
Geodatabase/MS
Access Database
.sqlite Sqlite/Spatialite Extension is optional for this format, but is often
Database .sqlite. Spatialite is an an extenion to sqlite which
spatialally enables it. This format is often used
with QGIS.
Shapefile Has traditionally been the “standard” vector format
for ArcGIS. Is actually a collection of files, not a
single file, all in the same directory, with the same
file base name (the name without the
extension). When viewed in an ArcGIS filesystem
dialog, is usually displayed as a single file.
19
[email protected]
CSC468 Geographical Information System | Prithivi Narayan Campus | Dev Timilsina
20
[email protected]
CSC468 Geographical Information System | Prithivi Narayan Campus | Dev Timilsina
21
[email protected]
CSC468 Geographical Information System | Prithivi Narayan Campus | Dev Timilsina
It is also of great importance in the insurance sector. Companies can make use of different
strategies for converting data to ensure that the data is compatible with their systems.
GIS Consortium has the capability to convert hard copy into a wide range of electronic
formats. GISC’s team of experienced and multi skilled specialists have a vast knowledge
of converting various types of geospatial data. It can deal with the present-day challenges
22
[email protected]
CSC468 Geographical Information System | Prithivi Narayan Campus | Dev Timilsina
such as complexity of data, project timelines and effect on the quality and accessibility of
the data, ensuring a smooth and successful data conversion.
For a good quality GIS Services, you can get in touch with the GIS Consortium.
GIS design involves organizing geographic information into a series of data themes-layers
that can be integrated using geographic location. So it makes sense that geodatabase
design begins by identifying the data themes to be used, then specifying the contents and
representations of each thematic layer. This involves defining:
➢ How the geographic features are to be represented for each theme (for example,
as points, lines, polygons, or rasters) along with their tabular attributes.
➢ How the data will be organized into datasets, such as feature classes, attributes,
raster datasets, and so forth.
➢ What additional spatial and database elements will be needed for integrity rules,
for implementing rich GIS behavior (such as topologies, networks, and raster
catalogs), and defining spatial and attribute relationships between datasets.
That’s why organizations require an enterprise GIS geodatabase, Infographic offers the
following Geodatabase specializations:
Geodatabase design is based on a common set of fundamental GIS design steps, so it's
important to have a basic understanding of these GIS design goals and methods. GIS
design involves organizing geographic information into a series of data themes layers that
can be integrated using geographic location. So, it makes sense that geodatabase design
23
[email protected]
CSC468 Geographical Information System | Prithivi Narayan Campus | Dev Timilsina
begins by identifying the data themes to be used, then specifying the contents and
representations of each thematic layer.
24
[email protected]
CSC468 Geographical Information System | Prithivi Narayan Campus | Dev Timilsina
Representation
Each GIS database design begins with a decision as to what the geographic
representations will be for each dataset. Individual geographic entities can be
represented as
➢ Feature classes (sets of points, lines, and polygons)
➢ Imagery and rasters.
➢ Continuous surfaces that can be represented using features (such as
contours), rasters (digital elevation models [DEM]), or triangulated irregular
networks (TINs) using terrain datasets
➢ Attribute tables for descriptive data
Data themes
The concept of a thematic layer was one of the early notions in GIS. Practitioners thought
about how the geographic information
in maps could be partitioned into logical
information layers as more than a
random collection of individual objects
(such as a road, a bridge, a hill, a
house, a peninsula). These early GIS
users organized information in thematic
layers that described the distribution of
a phenomenon and how it should be
portrayed across a geographic extent.
These layers also provided a protocol
(capture rules) for collecting the
representations (as feature sets, raster layers, attribute tables, and so on).
25
[email protected]
CSC468 Geographical Information System | Prithivi Narayan Campus | Dev Timilsina
In GIS, thematic layers are one of the main organizing principles for GIS database design.
Each GIS will contain multiple themes for a common geographic area. The collection of
themes acts as layers in a stack. Each theme can be managed as an information set
independent of other themes. Each has its own representations (points, lines, polygons,
surfaces, rasters, and so on). Because the various independent themes are spatially
referenced, they overlay one another and can be combined in a common map display.
Plus, GIS analysis operations, such as overlay, can fuse information between themes.
26
[email protected]
CSC468 Geographical Information System | Prithivi Narayan Campus | Dev Timilsina
between map scales (for example, from polygon to line or point). In many cases, you may
need to generalize the feature representations for use at smaller scales. Rasters can be
resampled using image pyramids. In other situations, you may need to collect alternative
representations for different map scales.
Decompose each representation into one or more geographic datasets. Discrete
4.
features are modeled as feature classes of points, lines, and polygons. You can consider
advanced data types such as topologies, networks, and terrains to model the relationships
between elements in a layer as well as across datasets. For raster datasets, mosaics and
catalog collections are options for managing very large collections. Surfaces can be
modeled using features, such as contours, as well as using rasters and terrains.
Define the tabular database structure and behavior for descriptive attributes. Identify
5.
attribute fields and column types. Tables also might include attribute domains, relationships,
and subtypes. Define any valid values, attribute ranges, and classifications (for use as
domains). Use subtypes to control behaviors. Identify tabular relationships and associations
for relationship classes.
Define the spatial behavior, spatial relationships, and integrity rules for your
6.
datasets. For features, you can add spatial behavior and capabilities and also characterize
the spatial relationships inherent in your related features for a number of purposes using
topologies, address locators, networks, terrains, and so on. For example, use topologies to
model the spatial relationships of shared geometry and enforce integrity rules. Use address
locators to support geocoding. Use networks for tracing and pathfinding. For rasters, you
can decide if you need a raster dataset or raster catalog.
Propose a geodatabase design. Define the set of geodatabase elements you want in your
7.
design for each data theme. Study existing designs for ideas and approaches that work.
Copy patterns and best practices from the ArcGIS data models.
Design editing workflows and map display properties. Define the editing procedures
8
and integrity rules (for example, all streets are split where they intersect other streets, and
street segments connect at endpoints). Design editing workflows that help you meet these
integrity rules for your data. Define display properties for maps and 3D views. Determine
the map display properties for each map scale. These will be used to define map layers.
27
[email protected]
CSC468 Geographical Information System | Prithivi Narayan Campus | Dev Timilsina
Assign responsibilities for building and maintaining each data layer. Determine who
9.
will be assigned the data maintenance work within your organization or assigned to other
organizations. Understanding these roles is important. You will need to design how data
conversion and transformation is used to import and export data across various partner
organizations.
Build a working prototype. Review and refine your design Test your prototype design.
10.
Build a sample geodatabase copy of your proposed design using a file, personal, or
enterprise geodatabase. Build maps, run key applications, and perform editing operations
to test the design's utility. Based on your prototype test results, revise and refine your
design. Once you have a working schema, load a larger set of data (such as loading it into
an enterprise geodatabase) to check out production, performance, scalability, and data
management workflows. This is an important step. Settle on your design before you begin
to populate your geodatabase.
Document your geodatabase design. Various methods can be used to describe your
11.
database design and decisions. Use drawings, map layer examples, schema diagrams,
simple reports, and metadata documents. Some users like working with UML. However,
UML is not sufficient on its own. UML cannot represent all the geographic properties and
decisions to be made. Also, UML does not convey the key GIS design concepts such as
thematic organization, topology rules, and network connectivity. UML provides no spatial
insight into your design. Many users create a graphic representation of their geodatabase
schema with Visio, such as those published with the ArcGIS data models. Esri provides a
tool that can help you capture these kinds of graphics of your data model elements using
Visio.
The 11 steps presented above outline a general GIS database design process. The initial
design steps 1 through 3 help you identify and characterize each thematic layer. In steps
4 through 7, you begin to develop representation specifications, relationships, and
ultimately, geodatabase elements and their properties. In steps 8 and 9, you will define
the data capture procedures and assign data collection responsibilities. In the final stage
(steps 10 and 11), you will test and refine your design through a series of initial
implementations. In this final phase, you will also document your design.