eolab:openhype:start
Differences
This shows you the differences between two versions of the page.
eolab:openhype:start [2022/09/19 11:53] – [3 Dashboard] sina001 | eolab:openhype:start [2023/01/05 14:38] (current) – external edit 127.0.0.1 | ||
---|---|---|---|
Line 1: | Line 1: | ||
+ | /* | ||
~~NOTOC~~ | ~~NOTOC~~ | ||
- | ====== | + | */ |
- | + | ====== | |
- | ===== NRW Groundwater Data – OpenHygrisC Data Processing for Education (OpenHyPE) ===== | + | |
- | + | ||
- | * Funded by the [[https:// | + |
- | * Duration: 15.12.2021 - 31.12.2022 | + |
+ | * Funded by the [[https:// | ||
+ | * Duration: 15.12.2021 - 30.06.2023 | ||
+ | /* | ||
=====Table of Contents===== | =====Table of Contents===== | ||
Line 39: | Line 39: | ||
5. Project Codes | 5. Project Codes | ||
+ | */ | ||
Line 97: | Line 97: | ||
===== 2.2 PostgreSQL/ | ===== 2.2 PostgreSQL/ | ||
- | PostgreSQL | + | PostgreSQL, also known as Postgres, is a free and open-source |
- | **PostGIS: | + | **PostGIS: |
- | The image below shows pgAdmin. | + | The image below shows the pgAdmin |
| {{ : | | {{ : | ||
Line 142: | Line 142: | ||
* Programming applications | * Programming applications | ||
- | In this project, we have used Python for data engineering, | + | In this project, we have used Python for data engineering, |
| {{ : | | {{ : | ||
Line 179: | Line 179: | ||
- | **openhype environment: | + | **openhype environment: |
Line 212: | Line 212: | ||
* ** geopandas: | * ** geopandas: | ||
extends the data types used by pandas to allow spatial operations on geometric types" | extends the data types used by pandas to allow spatial operations on geometric types" |
+ | | ||
< | < | ||
conda install --channel conda-forge geopandas | conda install --channel conda-forge geopandas | ||
</ | </ | ||
+ | |||
+ | Some packages must be installed from a specific channel, which is why the command above names the channel explicitly. | ||
< | < | ||
Line 243: | Line 246: | ||
</ | </ | ||
| | ||
- | Since we downloaded the four CSV files in the previous chapter, we can now read them and clean them so that they are ready to import | + | Since we downloaded the four CSV files in the previous chapter, we can now read them and clean them so that they are ready to import |
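The read-and-clean step can be sketched roughly as follows. This is a minimal illustration that uses only Python's standard library; the sample data, the column names, and the `;` separator are assumptions made for the example and are not taken from the actual OpenHygrisC files.

```python
import csv
import io

# Hypothetical sample standing in for one downloaded CSV file.
# Column names and the ";" separator are assumptions for illustration.
RAW = (
    "messstelle_id;name;e32;n32\n"
    "1; Station A ;350000;5700000\n"
    "1; Station A ;350000;5700000\n"
    "2;Station B;;5710000\n"
)

def clean(text):
    """Strip whitespace, drop rows with empty fields, drop exact duplicates."""
    seen = set()
    cleaned = []
    for row in csv.DictReader(io.StringIO(text), delimiter=";"):
        row = {key: value.strip() for key, value in row.items()}
        if any(value == "" for value in row.values()):
            continue  # incomplete record, e.g. a missing coordinate
        fingerprint = tuple(row.values())
        if fingerprint in seen:
            continue  # exact duplicate of an earlier row
        seen.add(fingerprint)
        cleaned.append(row)
    return cleaned

stations = clean(RAW)
print(stations)
```

Which rules count as "cleaning" depends on the data; dropping incomplete rows is only one possible choice and may be too strict for columns that are legitimately optional.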
Line 255: | Line 258: | ||
The four notebooks below should be run separately in order to import the data into the database. | The four notebooks below should be run separately in order to import the data into the database. |
- | * import_gemeinde.ipynb | + | * import_gemeinde.ipynb: |
- | * import_katalog_stoff.ipynb | + | In this notebook, we will import the data of all municipalities (Gemeinden) into the database. |
- | * import_messstelle.ipynb | + | * import_katalog_stoff.ipynb: |
- | * import_messwert.ipynb | + | In this notebook, we will import the substance catalogue into the database. |
+ | * import_messstelle.ipynb: | ||
+ | In this notebook, we will import the data of all measuring stations into the database. | ||
+ | * import_messwert.ipynb: | ||
+ | In this notebook, we will import the data of all measured values into the database. | ||
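As a rough sketch of what such an import step does, the example below inserts a few rows into a table inside one transaction. It uses an in-memory SQLite database from the standard library as a stand-in for PostgreSQL, and the table layout and sample rows are hypothetical; the actual notebooks may work differently.

```python
import sqlite3

# Hypothetical cleaned rows: (station id, substance number, measured value).
rows = [(1, 1244, 12.5), (1, 1245, 0.3), (2, 1244, 9.8)]

def import_rows(conn, rows):
    """Create the table if needed and insert all rows in one transaction."""
    with conn:  # commits on success, rolls back if an error occurs
        conn.execute(
            "CREATE TABLE IF NOT EXISTS messwert ("
            "messstelle_id INTEGER, stoff_nr INTEGER, messergebnis REAL)"
        )
        conn.executemany("INSERT INTO messwert VALUES (?, ?, ?)", rows)
    return conn.execute("SELECT COUNT(*) FROM messwert").fetchone()[0]

# SQLite in memory is only a stand-in for the project's PostgreSQL database.
conn = sqlite3.connect(":memory:")
count = import_rows(conn, rows)
print(count)
```

Wrapping the inserts in a single transaction means a failed import leaves the table unchanged, which makes it safe to rerun a notebook after fixing the input.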
Line 269: | Line 276: | ||
- | In the previous section, we have downloaded the data, cleaned it, and imported it into the database successfully. Now it is time to look at the data in the database. As we know, SQL commands are valid in the Postgres | + | In the previous section, we downloaded the data, cleaned it, and imported it into the database successfully. Now it is time to look at the data in the database. As we know, SQL commands are valid in the PostgreSQL |
First, we want to look at our tables. The code below selects all columns (***** means all columns) from a table, which is addressed by the schema that contains it (in this case **sina**) followed by the table name. Because the table is very large and we only want to see the first 100 rows, we limit the result to 100. | First, we want to look at our tables. The code below selects all columns (***** means all columns) from a table, which is addressed by the schema that contains it (in this case **sina**) followed by the table name. Because the table is very large and we only want to see the first 100 rows, we limit the result to 100. |
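The schema-qualified `SELECT * ... LIMIT 100` pattern described above can be tried out with the sketch below. It is an illustration under stated assumptions: SQLite has no schemas, so an attached in-memory database named `sina` stands in for the PostgreSQL schema, and the table name `messstelle` and its columns are hypothetical.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# SQLite has no schemas; an attached database named "sina" stands in
# for the PostgreSQL schema so that the query text looks the same.
conn.execute("ATTACH DATABASE ':memory:' AS sina")
conn.execute("CREATE TABLE sina.messstelle (messstelle_id INTEGER, name TEXT)")
conn.executemany(
    "INSERT INTO sina.messstelle VALUES (?, ?)",
    [(i, f"Station {i}") for i in range(250)],  # hypothetical rows
)

# "*" selects every column; LIMIT 100 caps the preview at 100 rows.
preview = conn.execute("SELECT * FROM sina.messstelle LIMIT 100").fetchall()
print(len(preview))
```

Without an `ORDER BY` clause the database is free to return any 100 rows, so the preview is useful for inspecting the table's shape rather than for finding specific records.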
Line 529: | Line 536: | ||
\\ | \\ | ||
- | All the code is available at the link below. | + | All of the code is available at the link below. |
[[ https:// | [[ https:// |
eolab/openhype/start.1663581201.txt.gz · Last modified: 2023/01/05 14:38 (external edit)