Creating OWL file for counties in each state

Bill Hogan edited this page Dec 25, 2015 · 7 revisions

Downloaded the 2015 gazetteer files from the US Census Bureau: https://www.census.gov/geo/maps-data/data/gazetteer2015.html

The county file has the following column headings:

  1. USPS: two-letter state abbreviation
  2. GEOID: INCITS 31 county code
  3. ANSICODE: GNIS identifier
  4. NAME: name of the county
  5. ALAND: land area of county in square meters
  6. AWATER: water area of county in square meters
  7. ALAND_SQMI: land area of county in square miles
  8. AWATER_SQMI: water area of county in square miles
  9. INTPTLAT: latitude of the county's internal point (decimal degrees)
  10. INTPTLONG: longitude of the county's internal point (decimal degrees)
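
A minimal sketch of reading the county file into one record per row, keyed by the column headings above. It assumes the file is tab-delimited with a single header row and that headers and values may carry padding whitespace. The function name `read_gazetteer` and the sample row are made up for illustration and are not real Census data.

```python
import csv
import io


def read_gazetteer(handle):
    """Yield one dict per county row from a tab-delimited gazetteer file."""
    reader = csv.reader(handle, delimiter="\t")
    # Strip padding so lookups such as row["NAME"] work reliably.
    header = [h.strip() for h in next(reader)]
    for fields in reader:
        if not fields:
            continue  # skip blank lines
        yield dict(zip(header, (f.strip() for f in fields)))


# Made-up sample (hypothetical state "XX"); the last column is padded on purpose.
SAMPLE = (
    "USPS\tGEOID\tANSICODE\tNAME\tALAND\tAWATER\t"
    "ALAND_SQMI\tAWATER_SQMI\tINTPTLAT\tINTPTLONG   \n"
    "XX\t99001\t00000001\tExample County\t1000000\t250000\t"
    "0.386\t0.097\t30.000000\t-85.000000   \n"
)

rows = list(read_gazetteer(io.StringIO(SAMPLE)))
print(rows[0]["USPS"], rows[0]["NAME"], rows[0]["ALAND"])
```

For the real file, pass an open file handle instead of the `io.StringIO` sample. The right `encoding` argument depends on the download and is not specified here.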

Notes on process

  1. need to import us-integral-geography.owl
  2. need annotation properties:
  • INCITS 31 - DONE
  • rdfs:label - DONE
  • editor preferred term - DONE
  • GNIS identifier (need to create this one in us-basic-geography.owl) - DONE
  3. need object properties
  • proper_continuant_part_of - DONE
  4. need to filter out Puerto Rico and Washington, DC
  5. need to start creating IRIs sequentially at some number higher than the highest already assigned - DONE (starting at GEO_000000800)
  6. need to create ontology IRI - http://us--county-geography.owl
  7. hash USPS to IRI for regions of all 50 states and DC from us-integral-geography.owl - DONE
  8. for each row in file:
  • if usps != last usps, save current ontology file, create new one for new usps, get IRI for region of state by USPS from hash
  • create OWL individual
    • annotate it with editor preferred term = region of NAME, USPS
    • annotate it with rdfs:label = region of NAME
    • rdf:type is geographical region
    • proper_continuant_part_of state geographical region
    • make sure you save all the axioms to the ontology
  • for land area measurements
    • create area quality individual for area of land
      • it inheres in the OWL individual
      • it is rdf:type area (from PATO)
      • create rdfs:label area of land
      • create editor preferred term area of land of region of NAME, USPS
    • create two land area quality measurements
      • rdf:type scalar measurement datum
      • is about area quality individual
      • has measurement unit label (either m^2 or square miles as appropriate)
      • has measurement value (value in respective field)
      • rdfs:label land area measurement of NAME
      • editor preferred term land area measurement of NAME, USPS in (either m^2 or square miles as appropriate)
  • for water area measurements, same thing, adjusting labels and editor preferred terms as appropriate
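
The per-row process above can be sketched in plain Python. This is not the code actually used to build the OWL files. It collects the axioms as (subject, property, value) tuples in a dict per state instead of saving an ontology file, and it assumes rows arrive sorted by USPS. It also assumes every individual (county, area quality, measurement) takes the next sequential ID. The names `geo_ids`, `county_axioms`, `build_by_state`, and the placeholder `STATE_XX_IRI` are hypothetical.

```python
from itertools import count, groupby

SKIP = {"PR", "DC"}  # filter out Puerto Rico and Washington, DC


def geo_ids(start=800):
    """Yield sequential local IDs: GEO_000000800, GEO_000000801, ..."""
    for n in count(start):
        yield f"GEO_{n:09d}"


def county_axioms(row, state_iri, ids):
    """Return (subject, property, value) tuples for one gazetteer row."""
    name, usps = row["NAME"], row["USPS"]
    county = next(ids)
    axioms = [
        (county, "rdf:type", "geographical region"),
        (county, "rdfs:label", f"region of {name}"),
        (county, "editor preferred term", f"region of {name}, {usps}"),
        (county, "proper_continuant_part_of", state_iri),
    ]
    for kind, m2_col, sqmi_col in (("land", "ALAND", "ALAND_SQMI"),
                                   ("water", "AWATER", "AWATER_SQMI")):
        quality = next(ids)  # one area quality per kind of area
        axioms += [
            (quality, "rdf:type", "area"),
            (quality, "inheres in", county),
            (quality, "rdfs:label", f"area of {kind}"),
            (quality, "editor preferred term",
             f"area of {kind} of region of {name}, {usps}"),
        ]
        # Two measurements per quality: one in m^2, one in square miles.
        for unit, col in (("m^2", m2_col), ("square miles", sqmi_col)):
            datum = next(ids)
            axioms += [
                (datum, "rdf:type", "scalar measurement datum"),
                (datum, "is about", quality),
                (datum, "has measurement unit label", unit),
                (datum, "has measurement value", float(row[col])),
                (datum, "rdfs:label", f"{kind} area measurement of {name}"),
                (datum, "editor preferred term",
                 f"{kind} area measurement of {name}, {usps} in {unit}"),
            ]
    return axioms


def build_by_state(rows, state_iris, ids):
    """Group consecutive rows by USPS and return {usps: [axioms, ...]}."""
    ontologies = {}
    for usps, group in groupby(rows, key=lambda r: r["USPS"]):
        if usps in SKIP:
            continue
        axioms = ontologies.setdefault(usps, [])
        for row in group:
            axioms.extend(county_axioms(row, state_iris[usps], ids))
    return ontologies


# Made-up rows; the PR row is skipped, so it needs no area columns.
rows = [
    {"USPS": "XX", "NAME": "Example County", "ALAND": "1000000",
     "AWATER": "250000", "ALAND_SQMI": "0.386", "AWATER_SQMI": "0.097"},
    {"USPS": "PR", "NAME": "Skipped Municipio"},
]
result = build_by_state(rows, {"XX": "STATE_XX_IRI"}, geo_ids())
print(len(result["XX"]), "axioms for XX")
```

Grouping with `itertools.groupby` mirrors the "if usps != last usps" check: a new group starts whenever the USPS value changes, which is where the notes call for saving the current ontology file and starting a new one.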