Data are things, known or assumed as facts, making the basis of reasoning or calculation.
Metadata is information about data.
/
Data collection is the process of gathering and measuring information on targeted variables in an established system. The purpose is to answer relevant questions or/and evaluate outcomes.
Interpretation is the process of making sense of numerical data that has been collected, analyzed, and presented.
Visualization is the graphical representation of information and data.
Data analysis is a process of inspecting, cleaning, transforming and modeling data with the goal of discovering useful information, informing conclusions and supporting decision-making.
In statistics, exploratory data analysis (EDA) is an approach to analyzing data sets to summarize their main characteristics, often with visual methods.
Too large or complex data to be dealt with by traditional data-processing application and software.
Apache Point, New Mexico, US, 1995
ALMA, Atacama, Antofagasta Region, Chile
Data mining is the process of discovering patterns in large data sets.
Data assimilation is a mathematical discipline that seeks to optimally combine theory with observations.
Data manipulation; inserting, deleting, and modifying data in a database.
Data science is a multi-disciplinary field that uses scientific methods, processes, algorithms and systems to extract knowledge from structured and unstructured data.
Data Types
Data Set
Data Formats
Vector
Raster
Gridded
Vector data structures represent specific features on the Earth’s surface, and assign attributes to those features.
Thiessen polygon
It defines an area around a point, where every location is nearer to this point than to all the others (2D).
Rasters are made up of a matrix of pixels (also called cells), each containing a value that represents the conditions for the area covered by that cell.
***
What do you think about the advantages and disadvantages of vector and raster types?
IPCC = Intergovernmental Panel on Climate Change ( 500, 250, 180, 110 km )
Coarse (low) and Finer (high) resolution
NWP uses mathematical models of the atmosphere and oceans to predict the weather based on current weather conditions.
What if you have not enough money or capacity to calculate the processes in high-resolution grids, globally?
Hints: Regional Climate Models (RGM), Nested Grid or Domains etc.
What should we do in polar regions? Which grid system would be better?
Hints:Projection, Re-Shaped Grid etc.
What are the other grid problems?
Hint: Coastal region, Clouds etc.
Observation
Forecast
Re-Analysis Data
Ensamble Data
The information collected by a primary source.
The process of making predictions of the future based on past and present data.
A scientific method for developing a comprehensive record. Observations and a numerical model are combined objectively.
Instead of making a single forecast, a set (or ensemble) of forecasts. This set of forecasts aims to give an indication of the range of possible future states of the atmosphere.
ASCII (csv, txt)
netCDF
HDF
GRIB
Shapefile
American Standard Code for Information Interchange
ASCII codes represent text in computers and other devices.
Network Common Data Form
A file format for storing multidimensional scientific data (variables) such as temperature, humidity, pressure, wind speed, and direction.
NCAR (National Center for Atmospheric Research)
UCAR (University Corporation for Atmospheric Research)
Hierarchical Data Format
Designed to store and organize large amounts of data.
National Center for Supercomputing Applications, supported by The HDF Group.
GRIdded Binary or General Regularly-distributed Information in Binary form
Data format commonly used in meteorology to store historical and forecast weather data.
World Meteorological Organization
Geospatial vector data format for geographic information system (GIS) software.
It is developed and regulated by Esri.
Global to Continental
What kind of shapefile could they have used?