Monday, March 21, 2016

Sand Mining in Western Wisconsin: Data Collection

Introduction:

Once a research question has been proposed the next concrete step is to collect data. To understand the impacts that frac sand mining has on surrounding areas it was required to find and acquire datasets from numerous government sources. This data then had to be put through a sting of tools (see “Python Scripting Two” below) to normalize and prepare for use. All the data sets were acquired for use in Trempealeau County Wisconsin.


Methods:
Figure one. Information databases typically come in a compressed file format (zipped). This saves memory space and make for quicker downloads. The above image shows and extraction program in the process of decompressing zipped files.
The first objective of this exercise was to actually find and download data for Trempealeau County. There were four different government websites visited for six datasets (Figure five). One had to find each particular dataset by selecting different parameters before downloading. All the datasets were initially downloaded as zip files and required extraction before use (Figure one).
Figure two. Naming folders appropriately helps a researcher to recall where information is held.
 The compressed files were extracted into labeled folders in a permanent drive. Folder names reflect the source and subject of each dataset. This helps to keep files from being lost or misinterpreted.

    This project focuses on the early steps of manipulating data. This involved the fine process of changing file types, reorganization, examining feature attribute tables. Ultimately this led to making relationships among individual feature classes with the use primary and foreign keys.
Figure three. Results of clipping a rail network via county borders. Rails that exist within county in red. Rails in green are not needed for the area of interest so removing them is a good practice in data management. 
Data management is a cornerstone for any well-organized project. Having files that cover more than the necessary ground is a waste of space, clipping features removes information and preserves what is important for the project. See figure three for an example. A somewhat contradictive issue in regards to file size also became apparent when downloading data for this project. The digital elevation model (DEM) downloaded from the USGS did not cover the entirety of the county. There was one small section on the county’s south point that was omitted meaning that a second DEM needed to be downloaded (See figure four). An image mosaic needed to be completed between these two raster files. The output file was in unsigned 16 bit format and only had one radiometric band which is not an issue as this file only had one band to start out with.  
Figure four.  Digital elevation data comes as a raster image that covers a large area. In the case of Trempealeau County two images (blue tint and red tint) are needed for full coverage. 
Once all the information was downloaded I went on a search through each datasets respective metadata file to assess their reliability. Observe figure five for results.
Figure five. Metadata accuracy information for downloaded datasets.
Conclusions:

All data has limitations and knowing what they are is key to its accurate representation. The datasets that were downloaded range in spatial and temporal accuracy and are a blend between vector and raster type files. For a large scale map that depicts a village or town sized area these could lead to issues in regards to accurate representation. The spatial resolution is far too course and changes over time are far more pronounced meaning the datasets that haven’t been updated lately would be a key concern. This project focusses on Trempealeau County, an area with a relatively large scale, meaning that the precision is not as imperative.

No comments:

Post a Comment