GIS Fundamentals/ Geographic Database Design

Slides:



Advertisements
Présentations similaires
droit + pub = ? vincent gautrais professeur agrégé – avocat
Advertisements

Les pronoms compléments
OBJECTIF: Arrondir un grand nombre. Objective: Round-Off numbers.
CCIE – 27 novembre 2000 Part 1: (45 minutes) - Jean Rauscher
GERPISA Eleventh International Colloquium June 11-13, 2003 Paris The Origins and the Limits of the Productive Models Diversity Research questions and research.
Département fédéral de lintérieur DFI Office fédéral de la statistique OFS Implementing the economic classification revision (NACE / ISIC) in the Business.
Practical Session – Defining Learning Outcomes
Un risque « oublié » dans l’Océan Indien: le tsunami
Le sondage LibQUAL à HEC Montréal Une première expérience réussie qui sintègre au processus de planification stratégique de la bibliothèque Le sondage.
Échanger connaissances et techniques sur les routes et le transport routier 1 The PIARC Website.
Copyright © 2010 Systematic Présentation des enjeux Europe et International 1 Jean-Luc Beylat, Vice-Président International Systematic.
Primary French PowerPoints What’s Your Name?.
Revenir aux basiques !. 1 Revenir aux basiques Processus Nécessité daméliorer la Maîtrise les Offres et Projets: lanalyse des causes racines montre un.
Inforoute Santé du Canada Les défis de linteropérabilité en e-santé Mike Sheridan, Chef de lexploitation 19 mai 2006.
Talking about yourself
Cliquez et modifiez le titre Cliquez pour modifier les styles du texte du masque Deuxième niveau Troisième niveau Quatrième niveau Cinquième niveau 23/01/2014©
RECOMMENDATIONS ON EXPORT MARKETING FOR GEORGIAN WINES Tbilisi – November 27, 2007.
Status report SOLEIL April 2008
TP2 ... MVC ? JList JLabel JSlider ImageLibrary Contrôleur Vue Modèle
Reading an analog clock
Université Des Sciences Et De La Technologie DOran Mohamed Boudiaf USTO République Algérienne Démocratique et Populaire Département de linformatique Projet.
BONJOUR Complex name, Simple aim! SEACSSEACS. What we wanted out of SEACS To promote sustainable energy usage within the school community To allow students.
Defence R&D Canada R et D pour la défense Canada Novel Concepts for the COP of the Future Denis Gouin Alexandre Bergeron-Guyard DRDC Valcartier.
TM.
Defence Research and Development Canada Recherche et développement pour la défense Canada Canada 11-1.
* Google Confidential and Proprietary Khaled KOUBAA Public Policy & Gov't Relations Manager - North Africa Google, Inc. Research, Innovation and Entrepreneurship.
Assessment and the new secondary curriculum S. Barfoot.
How to solve biological problems with math Mars 2012.
Computer Heritage: Problems and Perspectives at the Musée des arts et métiers Cyrille Foasso Modern Scientific Instrument Collection
Core Module 9 Family and Community Engagement Association des conseils scolaires des écoles publiques de lOntario (ACÉPO) Association franco-ontarienne.
EUROPEAN ASSOCIATION OF DEVELOPMENT RESEARCH AND TRAINING INSTITUTES ASSOCIATION EUROPÉENNE DES INSTITUTS DE RECHERCHE ET DE FORMATION EN MATIÈRE DE DÉVELOPPEMENT.
AFNOR NF Z – "Online Consumer Reviews
28th Conference of Directors of Paying agencies Namur, 27 to 29 October 2010 The Belgian Presidency of the Council of the European Union Workshop 2 : Control.
Indefinite articles, plural of nouns
1 La promotion de la santé ancrée dans la loi Le mandat de Promotion Santé Suisse est ancré dans la loi fédérale du 18 mars 1994 sur lassurance maladie.
On-line resource materials for policy making Ex-Ante Carbon-balance Tool Food and Agriculture Organization of the United Nations, FAO Apprendre à utiliser.
TortoiseSVN N°. Subversion : pour quoi faire ? Avoir un espace de stockage commun – Tous les étudiants du SIGLIS ont un espace svn commun Partager vos.
Youth Involvement - revitalising the Scout Method Participation des jeunes - revitaliser la méthode scoute.
INVESTMENT CLIMATEDEVELOPMENT IMPACT EVALUATION INITIATIVE Piloting the Entreprenant Status: In search of a successful formalization model BENIN Impact.
PURCHASING PHASE REVIEW Cornerstones of Purchase baseline
Les choses que j aime Learning Objective: To know how to use j aime to talk about things I like to do.
Laboratoire de Bioinformatique des Génomes et des Réseaux Université Libre de Bruxelles, Belgique Introduction Statistics.
L’ensemble microcanonique
La pratique factuelle Années 90 un concept médical visant à optimiser les décisions cliniques face aux soins des patients Aujourdhui un concept évolutif,
Présentation dun modèle dinterface adaptative dun système de diagnostique et dintervention industriel: ADAPTS (Adaptive Diagnostics And Personalized Technical.
Cest mercredi le neuf octobre Le plan! 1.Révisions 2.Vocabulaire 3.Jouer 4.Ecouter 5.Parler Il fait beau! Le but! Les couleurs!
Un chat deux chats deux chiens Un chien deux chevaux Un cheval
QU’EST-CE QUE TU FAIS?.
Les lycées. Sixth form is compulsory in France so it is really important to choose the right one. Look carefully at the information about colleges and.
Le Baromètre Zone Cours : un environnement pour la micro-évaluation de ressources pédagogiques* Jacques Raynauld Olivier Gerbé HEC Montréal, MATI Montréal.
Employment Policies. an Azorean story...
INDICATOR DEFINITION An indicator describes the manifestation of a process of change resulting from the pursuit of an action. Un indicateur décrit la manifestation.
16-Oct-00SL-BI and QAP Presented to QAWG on 23/10/2000Slide 1 Quality Assurance in SL/BI Jean-Jacques GRAS (SL-BI)
Branche Développement Le présent document contient des informations qui sont la propriété de France Télécom. L'acceptation de ce document par son destinataire.
KM-Master Course, 2004 Module: Communautés virtuelles, Agents intelligents C3: Collaborative Knowledge construction & knowledge sharing Thierry NABETH.
8th International Conference on psychosocial and economic aspects of HIV infection
Quelle heure est-il? What time is it ?.
2010 CAADP A FRICA F ORUM M EETING THE C HALLENGES OF C LIMATE C HANGE S TRATEGIES OF S MALLHOLDER FARMERS TO ACHIEVE F OOD S ECURITY AND I NCOME G ROWTH.
Ministère de l’Éducation, du Loisir et du Sport Responsables des programmes FLS et ELA: Diane Alain et Michele Luchs Animateurs: Diane Alain et Michael.
Core Module 10 Advocacy: Engaging the Public Association des conseils scolaires des écoles publiques de l’Ontario (ACÉPO) Association franco-ontarienne.
PERFORMANCE One important issue in networking is the performance of the network—how good is it? We discuss quality of service, an overall measurement.
Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics.
F RIENDS AND FRIENDSHIP Project by: POPA BIANCA IONELA.
Definition Division of labour (or specialisation) takes place when a worker specialises in producing a good or a part of a good.
1-1 Introduction to ArcGIS Introductions Who are you? Any GIS background? What do you want to get out of the class?
Session 3: Implementation experience: Selection of measures based on Cost-effectiveness Analysis Introduction: summary of relevant results of the questionnaire.
Beneficiary Communication
Linking Schools Class 4Helsinki at Lapage linking with Finland
Asset Management Inventorial Assets: Stand-Alone equipment items that have a cost of $2,000 or more and a useful life of one year or more Artwork & Collections.
Transcription de la présentation:

GIS Fundamentals/ Geographic Database Design

GIS Concepts Information cycle: Data/Information/System/Information System Geographic Information System Main Components/Characteristics Geographic Database Data Modeling Data Representation Spatial Analysis Implementing a GIS

Information Cycle (Contexte d’émergence des SIG) Territory Data GIS DSS Information Decision

Data / Information Information is the result of interpretation of relations existing between a certain number of single elements (called data). Example: The Museum located at 5th Avenue, NY, was built in 1898. Data: Museum, address, year of construction.

System A system is a set organized globally and comprising elements which coordinate for working towards doing a result. Example: Water supply system Elements: pipes, valves, hydrants, water meters, pumps, reservoirs, etc.

Information System (IS) An Information System is a set organized globally and comprising elements (data, logiciel, equipment, procedures, users) that coordinate for working towards doing a result (information).

GIS: “G” & “IS” Definition: A GIS is a collection of computer hardware and software, geographic data, methods, and personnel assembled to capture, store, analyze and display geographically referenced information in order to resolve complex problems of management and planning. A GIS is an IS with geographic data as input and geographic information as output Geographic Data? Geographic Information?

SIG: Système d’Information «Géographique»: Définition: Un SIG peut être défini comme un ensemble organisé intégrant le matériel, le logiciel, le personnel, les méthodes et les données nécessaires pour permettre la saisie, le stockage, l’analyse et l’affichage de l’information géoréférencée en vue de résoudre des problèmes complexes de gestion et de planification.

Components of a GIS

GIS Input Output User Interface Models Other GIS Reports Maps Geographic Data Geographic Information Input GIS Output Reports Maps Photo. Products Statistics Input Data for models Manipulation Analysis Maps Census Field Data RS Data Others Data Capture Display Storage GIS Components User Interface Models Other GIS

GIS: Main Characteristics Integration of Multiple data: - Sources - Scales - Formats Geographic Database Spatial Analysis

Data from multiple sources-at multiple scales-in multiple formats Census/ Tabular data Maps Picture & Multimedia GPS/ air photos/ satellite images

Referencing map features: Coordinate systems & map projections To integrate geographic data from many different sources, we need to use a consistent spatial referencing system for all data sets

The Latitude/Longitude reference system latitude φ : angle from the equator to the parallel longitude λ : angle from Greenwich meridian

Map Projections Curved surface of the earth needs to be “flattened” to be presented on a map: Map Projection Projections are classified according to which properties they preserve: area, distance, shape and angles, Some distortion is inevitable: Less distortion if maps show only small areas, but large if the entire earth is shown Projection is the method by which the curved surface is converted into a flat representation. Projection is the method by which the curved surface is converted into a flat representation

UTM: Universal Transverse Mercator Minimal distortions of area, angles, distance and shape at large and medium scales Very popular for large and medium scale mapping (e.g., topographic maps) Cylindrical projection with a central meridian that is specific to a standard UTM zone 60 zones of longitude around the world One cartographic reference system that desreves more detailed discussion is the UTM system

Space as an indexing system

The concept of cartographic scale Scale is the ratio between distances on a map and the corresponding distances on the earth’s surface e.g., a scale of 1:100,000 means that 1 cm on the map corresponds to 100,000 cm or 1 km in the real world Small scale: small fraction such as 1:10,000,000 shows only large features Large scale: large fraction such as 1:25,000 shows great detail for a small area “small scale” vs “large scale” often confused

Multi-scale The same feature represented in different scales. Example: lake large scale map of 1:25,000 may show individual houses smaller scale map of 1:500,000 shows only points representing villages Large scale (1:25.000) Small scale 1:500.000

Multi-formats Raster Vector Raster-Vector-Raster DXF-DGN-etc. Shapefile KML Etc.

Geographic Database Geographic Data Characteristics/Examples Definitions: Entity/Attribute/Dataset/Database Data Modeling Spatial representation Vector/Raster Topology

Descriptive Data vs Geographic Data Descriptive attributes Geographic Data: Spatial attributes Location Form Geographic location is the element that distinguishes geographic information from all other other types.

Geographic Data Characteristics : Position: explicit geographic reference Cartesian coordinates :X,Y,Z Geographic coordinates (lat, log) implicit geographic reference Address Place-name Etc. Geometric Form: ex: a polygon representing a parcel of land Spatial data describes the location and shape of geographic features, and their relationship to other features, and Descriptive data which characterizes the geographic features.

Example1: Parcel of land Attribute (descriptive) Data Landowner Area Etc. Spatial data Position Located at 100 Nelson Mandela Ave X= a; Y=b within system (X,Y) Form dimensions (sides and arcs, constituting a polygon)

Example 2: District Attribute (Descriptive) data: District-Code District-Name Population 1990 Population 2000 Population 2010 Spatial data: Geographical Position Polygon

Spatial entity We use the term entity to refer to a phenomenon that can not be subdivided into like units. Example: a house is not divisible into houses, but can be split into rooms. Others: a lake, a statistical unit, a school, etc. In database management systems, the collection of objects that share the same attributes. An entity is referenced by a single identifier, perhaps a place-name, or just a code number Each spatial entity has one or more attributes that identify what the entity is, describe it, or represent some magnitude associated with the entity. Indeed, the type of analysis you plan to do depends on the type of attributes you are working with. Example: you can categorize roads by whether they are local roads, highways, etc; by their length; their width; their pavement; etc.

Attribute Each spatial entity has one or more attributes that identify what the entity is, and describe it. Example: you can categorize roads by whether they are local roads, highways, etc; by their length; their width; their pavement; etc. The type of analysis you plan to do depends on the type of attributes you are working with.

Dataset “A dataset is a single collection of values or objects without any particular requirement as to form of organization.” Example: Streets, rivers, cities, etc.

Geographic Database “A geographic database is a collection of spatial data and related descriptive data organized for efficient storage, manipulation and analysis by many users.” It supports all the different types of data that can be used by a GIS such as: Attribute tables Geographic features Satellite and aerial imagery Surface modeling data Survey measurements Fundamentally, a GIS is based on a structured database that describes the world in geographic terms.

Data Modeling Data Modeling is the process of defining (geographic features) to be included in the database, their attributes and relationships, and their internal representation in the Database. It involves the development of conceptual, logical and physical models of the geographic Database. The outcomes include a Data Dictionary

Modeling Process Reality Modeling Geographic Database Abstracting the Real World Reality Modeling (data & processing.) We have real world as percieved by users ------ > And we would build a physical data model We insert an intermediate step: the CDM Geographic Database

ANSI/SPARC: Study Group on Data Base Management Systems (1975) Different users have different views of the world “Real World” External Model 1 External Model 2 External Model 3 Conceptual Model Logical Model Physical Model

Conceptual Model A synthesis of all external models (user’s views). Schematic representations of phenomena and how they are related. Information content of the database (not the physical storage) so that the same conceptual model may be appropriate for diverse physical implementations. Therefore, the conceptual model is independent from technology. The conceptual level corresponds to a synthesis of all external models. Although an abstraction of the real world, the conceptual model is consisting of schematic representations of phenomena and how they are related. The organization scheme created at this stage generally deals with only the information content of the database, not the physical storage, so that the same conceptual model may be appropriate for diverse physical implementations. Therefore, the conceptual model is independent from technology.

Conceptual Model (cont.) Easy to read Conceived for the analyst or designer Objective representation of the reality, therefore independently from the selected GDB System One conceptual model for the Database

Data Logical Model & Physical Model We transform the conceptual model into a new modeling level which is more computing oriented: the logical model (Example: the Relational Database approach) We transform the logical model into an internal model (physical model) which is concerned with the byte-level data structure of the database. Whereas the logical model is concerned with tables and data records, the physical model deals with storage devices, file structure, access methods, and locations of data.

Several types of data organization Hierarchical model - Hierarchical relationships between data(parent- child) Network Model - Focus on connections (e.g. airline booking system) Relational model - Based on relations (tables)- True Relat. DBMS use SQL Object-Oriented model - Focus on Objects Hierarchical: (Obselete). Hierarchical model - Hierarchical relationships between data (parent-child) Network Model - Focus on connections (e.g. airline booking systems) - Adv.: fast, efficient; Disadv.: inflexible Relational model - Data is organized within Tables (files) and relationships expressed between tables and data elements - True Relational DBMS use the Structured Query Language (SQL) Object-Oriented model - Focus on Objects: efficient storage and retrieval - (Future + Web) Network Model: technically obselete

Entity-relationship Formalism Entity name Attributes ENTITY_NAME1 -attribute 1 -attribute 2 … ENTITY_NAME2 -attribute 1 -attribute 2 … 0-N 0-1 Identifier (key-attribute) Association (relationship) Maximum cardinality (indeterminable/any number) (0,N) refers to the cardinality of the relationship. It means that , for example, at a minimum Minimum cardinality (0,N) refers to the cardinality of the relationship

An example of land parcels

The E/R diagram for land parcels STREET -name A B SEGMENT -number PARCEL -number 2-N 0-1 1-2 3-N 1-N 2-2 A: Streets have edges (segments) B: parcels have boundaries (segments) C: line have two endpoints D: parcels have owners, and people own land. C D 2-N 1-N POINT -number -x,y LANDOWNER -name -date-of-birth

Data Tables

Data Dictionary Definition: A data catalog that describes the contents of a database. Information is listed about each field in the attribute table and about the format, definitions and structures of the attribute tables. A data dictionary is an essential component of metadata information.

Example Definition of entities RAIL: way of communication and transportation Definition of attributes RAIL-ID: reference numbers for rail segments RAIL_CLASS: single track, double track, electrified, etc. RAIL_NAME: name for particular railway Explanations for measurements of attributes (type of attribute values) or coding practices RAIL-ID: INTEGER RAIL-NAME: CHARACTER, LONG=30

Sample components of a digital EA map

EA database entities Street Number Name --- Admin. Unit AU AU_Pop. --- EA-code Area Pop. Buildings Number HHs Etc. Crew leader area CL-code Name RO responsible Landmark -- ---

Example of Relations EA entity can be linked to the entity crew leader area. The table for this entity could have attributes such as the name of the crew leader, the regional office responsible, contact information, and the crew leader code (CL code) as primary code, which is also present in the EA entity. R EA EA-code Area Pop. Crew leader area CL-code Name RO responsible 1-1 1-N

Entity: Enumeration areas Type (attributes) EA-code Area Pop. CL-code 50101 28.5 988 78 50102 20.2 708 78 50103 18.1 590 78 50104 22.4 812 78 50201 19.3 677 79 50202 17.6 907 79 50203 25.7 879 79 50204 26.8 591 79 … … … Identifier

Components of a digital EA database Boundary database

A Simpler Alternative In many countries, EA map design may be simpler than in this example Instead of a fully integrated digital base map in vector format, rasterized images of topographic maps may be used as a backdrop for EA boundaries In some instances, map features may be more generalized, for instance by using only the centerlines for the streets and polygons for entire city blocks rather than for individual houses This can include the use of free data as a baseline or starting point in the creation or updating of census related maps

Data Representation Raster Vector Real World

Two Fundamental Types of Data GIS work with two fundamentally different types of geographic information Vector Raster (or Grid) Both types have unique advantages and disadvantages A GIS should be able to handle both types

Vector Data Vector data are stored as a series of x,y coordinates Good for discrete data representation points: wells, town centroids lines: roads, rivers, contours polygons: enumeration areas, districts, town boundaries, building footprints

Raster Data A raster image is a collection of grid cells - like a scanned map or picture Raster data is extremely useful for continuous data representation elevation slope modeling surfaces Satellite imagery and aerial photos are commonly used raster data sets Easier data structure Certain analysis operations are more easily implemented Good for representing continuous variation (vegetation, countour lines, etc.)

Vector vs Raster or Discrete vs Continuous River x1,y1 Raster-based analysis: Area of analysis divided into squares of uniform size. - Each cell characterizes the feature of interest within this area with a single value. Vector data: Coordiante-based data structure commonly used to represent linear features. Each feature is represented as a list of ordered x,y coordinates xn,yn

Raster-Vector conversion (“vectorization”) Computer algorithms are used to convert data of one type to the other: Vector --- Raster is trivial Raster ---- Vector is harder

Vector data + image (raster)

Vector: Points, lines, polygons Set of geometric primitives: points lines polygons y node vertex x

Vector Structure Spaghetti Topology 1 2 1 2 3 8 7 6 4 3 5 4 Topology On retrouve dans le modèle vecteur, plusieurs niveaux sémantiques: du plus structuré (topo) vers le plus simple (spaghetti) Modèle spaghetti Modèle réseau (graphe quelconque) Modèle topologique 7 6 4 3 5 4 Topology Spaghetti

Spaghetti File No Topology = raw file or ‘spagehetti file’ Lines not connected; have no ‘intelligence’

Example of “Spaghetti” data structure 6 Poly coordinates A (1,4), (1,6), (6,6), (6,4), (4,4), (1,4) B (1,4), (4,4), (4,1), (1,1), (1,4) C (4,4), (6,4), (6,1), (4,1), (4,4) A 5 4 3 B C 2 1 1 2 3 4 5 6

Topology Data structure in which each point, line and piece or whole of a polygon : “knows” where it is “knows” what is around it “understands” its environment “knows” how to get around Helps answer the question what is where?

Topology: Spatial Relationships Left Polygon = A Right Polygon = B Node 1 = Chains A,B,C Chain A is connected to chains B & C Polygon B Contained within polygon A Adjacency Connectivity Containment

Example of Topological data structure Node X Y Lines I 1 4 1,2,4 II 4 4 4,5,6 III 6 4 1,3,5 IV 4 1 2,3,6 1 Poly Lines A 1,4,5 B 2,4,6 C 3,5,6 6 A 5 I II III 4 4 5 3 From To Left Right Line Node Node Poly Poly 1 I III O A 2 I IV B O 3 III IV O C 4 I II A B 5 II III A C 6 II IV C B 2 B C 3 6 IV 1 2 1 2 3 4 5 6 O = “outside” polygon

Encoding Topology (not): CAD

Encoding Topology: GIS

Comparison Advantages: Spaghetti Topology Set of independent objects Representation of heterogonous objects within the same model Appropriate to CAD Pre-calculation of topological relations Maintenance of topological constraints correspondence with exchange formats

…cont. Disadvantages: Spaghetti Topology Spatial Relationships calculated Risk of incoherence (duplication of common boundaries) High cost of up-to-date Many levels of indirections for complex objects Maintenance

Some well known Topological models TIGER: Topologically Integrated Geographic Encoding and Referencing (Census Bureau of the USA) Line is the principal element to which are related points and area features ARC/INFO model: ESRI Point, Line, Polygon

Zip Codes Cities Voting Districts Census Tracts Counties MCD’s TIGER Data: Polygon Zip Codes Cities Voting Districts Census Tracts Counties MCD’s Block Groups

TIGER Data: Line Railroads Streams Streets

TIGER Data: Point Zip+4 Centroids Place Names Landmarks Key Locations

Recapitulation on spatial models Transformations between models: “vectorization” of raster images (costly) topology toward spaghetti (easy) spaghetti toward topology (possible but costly) The vector model most used, essentially topology; it’s useful to integrate raster and vector

Spatial Analysis select features by their attributes: “find all districts with literacy rates < 60%” select features by geographic relationships “find all family planning clinics within this district” combined attributes/geographic queries “find all villages within 10km of a health facility that have high child mortality” Query operations are based on the SQL (Structured Query Language) concept

Examples (query): What is at…? Features that meet a set of criteria

Spatial Analysis (cont.) Buffer: find all settlements that are more than 10km from a health clinic Point-in-polygon operations: identify for all villages into which vegetation zone they fall Polygon overlay: combine administrative records with health district data Network operations: find the shortest route from village to hospital

Modeling/Geoprocessing modeling: identify or predict a process that has created or will create a certain spatial pattern diffusion: how is the epidemic spreading in the province? interaction: where do people migrate to? what-if scenarios: if the dam is built, how many people will be displaced?

“is nearest to” Point/point Which family planning clinic is closest to the village? Point/line Which road is nearest to the village Same with other combinations of spatial features

“is nearest to”: Thiessen Polygons

“is near to”: Buffer Operations Point buffer Affected area around a polluting facility Catchment area of a water source

Buffer Operations Line buffer How many people live near the polluted river? What is the area impacted by highway noise

Buffet Operations Polygon buffer Area around a reservoir where development should not be permitted

“ is within”: point in polygon Which of the cholera cases are within the containment area

Problem: We may have a set of point coordinates representing clusters from a demographic survey and we would like to combine the survey information with data from the census that is available by enumeration areas. Solution: “Point-in-Polygon” operation will identify for each point the EA area into which it falls and will attach the census data to the attribute record of that survey point.

Polygon Overlay

“overlaps”: Polygon overlay

Data Layers A GIS combines layers of information about a place to give you a better understanding of that place. What layers of information you combine depends on your purpose. GIS uses geography, or space, as the common key element between datasets. Information is linked only if it relates to the same geographic area. - Data Layers: space as an indexing system. - A GIS combines layers of information about a place to give you a better understanding of that place. What layers of information you combine depends on your purpose.

Spatial aggregation Example of Spatial aggregation: fusion of many provinces constituting an economic region

Spatial data transformation: interpolation Example 1: Based on a set of station precipitation surface estimates, we can create a raster surface that shows rainfall in the entire region 13.5 20.1 26.0 27.2 12.7 15.9 24.5 26.1

GIS capabilities: Visualization

Implementing a GIS Consider the strategic purpose Plan for the planning Determine technology requirements Determine the end products Define the system scope Create a data design Choose a data model Determine system requirements Analyze benefits and costs Make an implementation plan Source: Thinking About GIS, Third Edition Geographic Information System Planning for Managers

GIS: Enables us to handle very large amounts of data Example: census data – thousands of EAs – hundreds of variables – many complementary data layers (roads, rivers, public facilities) Example: remote sensing – satellites send huge amounts of data that need to be processed, interpreted and stored

GIS: Helps to make data re-usable and useful to many more users Census geography – EA maps do not have to be redrawn every time, only updated – census information can be used for many more applications – data sharing among agencies

In Conclusion GIS for inventory/visualization GIS creates maps from data pulled from databases anytime to any scale for anyone GIS for database management GIS for spatial analysis/modeling GIS a tool to query, analyze, and map data in support of the decision making process. GIS can be viewed three ways: The Database View The Map View The Model View Improve Organizational Integration A GIS can link data sets together by common locational data, such as addresses, which help departments and agencies share their data. By creating a common database, data can be collected once and used many times.

What is Not GIS GPS – Global Positioning System …not just software! …not just for making maps! Maps are an input data to and a “product” of a GIS A way to visualize the analysis

Literature related to Census Mapping & GIS US National Research Council: Tools and Methods for Estimating Populations At Risk David Martin (1996) Geographic Information Systems: Socioeconomic Applications Longley and al, Wiley (2005) Geographic Information Systems and Science, second edition ESRI Press: Unlocking the Census with GIS Mapping the Census 2000

Demographic Statistics Section UN Statistics Division New York Merci pour votre attention Contact Information: Demographic Statistics Section UN Statistics Division New York globalcensus2010@un.org

Compromise projections Comoromise projections: Do not preserve any proprety but reprsent a good compromise between the different objectives. E.g. Robibson’s projection of the world

Vector to Raster Conversion: Polygons b a c

Vector to Raster Conversion: Lines

Raster to Vector Conversion: Polygons

Raster to Vector Conversion: Polygons

Spatial Operations “adjacent to” “connected to” “near to” “intersects with” “within” “overlaps” etc.

Spatial relationships Logical connections between spatial objects represented by points, lines and polygons e.g., - point-in-polygon - line-line - polygon-polygon Fundamental Questions addressed by GIS What is in a particular place? Find the position of some object What is the area of, the distance from specific object? Find patterns- by looking at the distribution of features on the map instead of just an individual feature Combination of data layers Point in polygon operation Line in polygon operation Polygon overlay