USER GUIDE TO
THE COHESION OPEN DATA PLATFORM
1. What is "open data"
Open data (and content) can be freely used, modified, and shared by anyone for any purpose (http://opendefinition.org/).
"Cohesion Open Data" is maintained by the European Commission to share unique, structured data generated from the monitoring of EU Budget funded, shared management programmes. The unique data relates primarily to different forms of financial inputs (planned and implemented) and to outputs and result outputs and result indicators of the physical results of the funded programmes.
Find out more about #CohesionOpenData: Website - Blog - FAQ - User Guide to ODP - User guide to visualisation - Browse data stories
2. How to access the platform
Direct link: https://cohesiondata.ec.europa.eu/
Also accessible from Inforegio under "Information sources" – "Cohesion Open Data"
3. How to access the data
The data behind the ESIF visualizations can be accessed and downloaded from the platform in two ways:
1. Use the "Explore this Data" link associated with the charts presented on the platform.
2. If you know which dataset you want to work with, or are searching for specific data or charts, use the "Catalogue" link found on the home page => https://cohesiondata.ec.europa.eu/browse
The catalogue gives access to a library of open data objects with a search tool with various filters.
The following search functions can be used cumulatively:
- Free search (1);
- "Categories" tags (2) - programme periods, dataset types;
- "View types" tags (3) - the main types used are datasets, charts, filtered views and stories;
- Word "Tags" (4);
- "Sort by" tags (5).
4. What is a Metadata page?
Each dataset has a metadata page with its own URL. It introduces users to the dataset and gives important information about the dataset as follows:
- Data description;
- Date of last update;
- Links to featured content, i.e. graphs created by the data owner;
- Key details about the dataset: Usage, owner, contact, licence, etc.;
- Structure of the data;
- Related content (i.e. graphs created by other users).
Before you start reusing the data it is essential to understand what the data relates to and how it is structured in order to make useful interpretations and charts. The metadata is there for that purpose. For instance the metadata highlights important characteristics as follows:
- In all cases for 2014-2020 the values are expressed cumulatively not annually. Some other datasets have annualised values;
- The data may be presented in annual snapshots. In those cases you can compare annual time series, but never aggregate the values from different years;
- Financial values are always expressed in current prices.
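The cumulative-value rule can be illustrated with a short Python sketch. The years and amounts below are invented for illustration and are not taken from any platform dataset. The sketch assumes cumulative snapshots: annual amounts are then the differences between consecutive snapshots, and the total to date is the latest snapshot, not the sum of all snapshots.

```python
# Hypothetical cumulative snapshots (year -> cumulative amount, EUR).
# The values are invented for illustration only.
cumulative_by_year = {2016: 100.0, 2017: 250.0, 2018: 450.0}


def annual_increments(cumulative):
    """Turn cumulative snapshots into per-year increments."""
    increments = {}
    previous = 0.0
    for year in sorted(cumulative):
        increments[year] = cumulative[year] - previous
        previous = cumulative[year]
    return increments


annual = annual_increments(cumulative_by_year)

# The correct total to date is the latest cumulative snapshot.
latest_total = cumulative_by_year[max(cumulative_by_year)]

# Summing the snapshots counts earlier years several times and
# overstates the total.
wrong_total = sum(cumulative_by_year.values())

print(annual)
print(latest_total, wrong_total)
```

Always check the metadata page of the dataset you are using to confirm whether its values are cumulative or annualised before applying this kind of calculation.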
5. Create a profile to fully exploit the platform functionalities
Create a profile with your own password by signing up on this site: https://cohesiondata.ec.europa.eu/login
With a profile you can filter, create and save objects on the platform for reuse by you and other users.
6. Interacting with and reusing the data
Once you “view the data” from the metadata page the platform offers several ways to interact with the data.
You can for instance filter a dataset to keep just one country, one Thematic Objective, one fund; you can create charts or other types of visualizations to illustrate a report or to embed on a webpage.
All the functionalities are accessible in the toolbar on the top right. These are explained below.
6.1 More views
All the filtered views or visualizations created from a dataset will be listed here.
6.2 Filtering / filtered views
This tool works like the pivot table in Excel. You must first define the filter conditions.
Select "MS" (1st column) "is" and choose, for instance, a country code "AT" to keep one Member State.
You can add as many filter conditions as you want. Let's filter out the Technical Assistance.
To aggregate the data, you must use the "Roll-Ups & Drill-Downs" function.
To group similar data (in our example: display the amount allocated for each OP by Fund, by TO) use "Group by".
"Roll-up" allows the mathematical functions (Sum, Average, Maximum …).
“Saving” changes the type of file. You are now working on a grouped view and not directly on the dataset anymore. The grouped view will continue to update as the parent dataset is updated.
Important: Do not save the modifications on the dataset. Use "Save as" to create a new filtered view.
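For readers who prefer to reproduce the filter, "Group by" and "Roll-up" logic outside the platform, the following is a minimal Python sketch of the same three steps. The column names (ms, fund, to, amount), the "TA" code for Technical Assistance and all values are assumptions made for this example. They are not the actual structure of any platform dataset.

```python
from collections import defaultdict

# Hypothetical rows; column names and values are assumptions made for
# this sketch, not the platform's actual schema.
rows = [
    {"ms": "AT", "fund": "ERDF", "to": "01", "amount": 10.0},
    {"ms": "AT", "fund": "ERDF", "to": "01", "amount": 5.0},
    {"ms": "AT", "fund": "ESF", "to": "08", "amount": 7.0},
    {"ms": "AT", "fund": "ERDF", "to": "TA", "amount": 2.0},
    {"ms": "BE", "fund": "ERDF", "to": "01", "amount": 99.0},
]

# Filter: keep one Member State and filter out Technical Assistance
# ("TA" is a hypothetical code used here for illustration).
filtered = [r for r in rows if r["ms"] == "AT" and r["to"] != "TA"]

# Group by (fund, to) and roll up with Sum.
totals = defaultdict(float)
for r in filtered:
    totals[(r["fund"], r["to"])] += r["amount"]

for key, total in sorted(totals.items()):
    print(key, total)
```

As on the platform, the original rows are left untouched: the filtered list and the totals are new objects derived from them.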
6.3 Exporting and reusing data
6.3.1 Static export
The "Export" section lets you export a dataset or a filtered/grouped view as a data file in a choice of formats.
When exporting a data file, users of MS Excel typically face two challenges:
- Numbers and financial amounts with comma and decimal formatting are not imported accurately into Excel, giving false results when checked against reference values on the platform: => See Annex 1 to set your default Excel settings for numbers and financial amounts correctly;
- The .csv file is not formatted in columns: => See Annex 2 to reformat the .csv file into Excel columns and cells.
6.3.2 Dynamic export with an OData link
Consider using OData. It provides a permanent link between the platform and the Excel file, which avoids regular manual downloads. Refreshing the data in Excel automatically updates your file with the latest platform data.
1. Open the dataset via Dataset details or the table view and click Access Data via OData
2. Copy the OData Endpoint link
3. Open a new Excel document
4. Click on the Data tab on the top panel, then choose New Query and From Other Sources, select From OData Data Feed
5. Paste the copied OData Endpoint link and then click OK
6. You will see a preview of the table, click Load to insert it
The table should look similar to this.
Note: Column A lists unique identifiers for each row. The column order may be rearranged.
When starting work on the dataset, click Refresh here to update your table and sync with the source data.
6.3.3 Reuse data using a SODA API interconnector
Each published dataset on the platform is provided with an application programming interface (API) that allows direct access to the data from a variety of programming languages and software tools.
IT developers can use the Socrata developer portal and the API documents to programmatically reuse the data directly to build filtered views and visualisations.
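As an illustration, the Python sketch below builds a SODA query URL using the standard SoQL parameters ($select, $where, $group, $limit) described on the Socrata developer portal. The dataset identifier "xxxx-xxxx" and the column names (ms, fund, amount) are placeholders assumed for this example. Replace them with the identifier and field names shown in the API documentation of the dataset you want to query. The sketch only builds and inspects the URL and does not contact the platform.

```python
from urllib.parse import parse_qs, urlencode, urlsplit

BASE = "https://cohesiondata.ec.europa.eu"
DATASET_ID = "xxxx-xxxx"  # placeholder: replace with a real dataset identifier


def soda_url(dataset_id, **soql):
    """Build a SODA query URL; keyword names map to SoQL '$' parameters."""
    params = {f"${name}": value for name, value in soql.items()}
    return f"{BASE}/resource/{dataset_id}.json?{urlencode(params)}"


url = soda_url(
    DATASET_ID,
    select="fund, sum(amount) AS total",  # hypothetical column names
    where="ms = 'AT'",
    group="fund",
    limit=1000,
)

# Decode the query string again to check what will be sent.
decoded = parse_qs(urlsplit(url).query)

print(url)
print(decoded)
```

The resulting URL can be opened in a browser or passed to any HTTP client. The query mirrors the filter and roll-up steps of section 6.2, but is evaluated by the platform rather than locally.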
6.4 Visualizing the data
The platform provides the possibility to make a range of interactive charts and visuals, without IT programming knowledge.
A separate data story provides a guide to building #CohesionOpenData visuals to share and reuse, with detailed instructions on how to create charts.
6.5 Embed data, filtered views and charts
Embedding allows the charts to be published on webpages by providing an “embed code”. Choose the size or customize it, use the "Preview" button and copy the embed code.
You can share any dataset, view or chart via Facebook or Twitter.
6.7 About = Metadata
This link gives you access to information on the data description, usage, owner, contact and last update.
ANNEX 1: Setting the correct number format to open and read an exported “.CSV (for Excel)” file
1. Open Windows settings (in Windows, not in Excel)
2. Then open regional settings
3. Then select 'additional date, time & regional settings'
4. Then select 'change date, time or number formats'
5. In the next pop-up window, press the button "Additional settings"
6. In the next pop-up window, change the following settings:
- Decimal symbol: dot '.'
- Digit grouping symbol: comma ','
- List separator: comma ','
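The effect of these settings can be illustrated with a minimal Python sketch. The helper parse_amount and the sample amount are hypothetical and written for this guide. The sketch only shows how the same text yields different values depending on which symbols are treated as decimal and grouping separators. Excel's own import behaviour may differ in detail.

```python
from decimal import Decimal


def parse_amount(text, decimal_symbol=".", grouping_symbol=","):
    """Parse a formatted amount given explicit decimal and grouping symbols."""
    cleaned = text.strip().replace(grouping_symbol, "")
    return Decimal(cleaned.replace(decimal_symbol, "."))


# Read with the settings from this annex (dot decimal, comma grouping).
correct = parse_amount("1,234.50")

# The same text read with the symbols swapped, as under a comma-decimal
# regional setting, gives a very different value.
misread = parse_amount("1,234.50", decimal_symbol=",", grouping_symbol=".")

print(correct, misread)
```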
ANNEX 2: Reformatting an exported “.CSV (for Excel)” file to display data in cells
Is your data exported in “CSV for Excel” format presenting strangely with data rows all grouped in the first cell of each row?
Follow these steps in Excel:
1. Select Column A
2. In the menu select “Data” => “Text to Columns”
3. In the task window (Step 1 of 3):
- Choose "Delimited"
- Take note of which character is used to delimit the data elements - either “;” or “,”
- Click Next
4. In the task window (Step 2 of 3):
- Choose either “Semicolon” or “Comma” depending on the format of the source data
- The Data Preview should now show the data formatted in columns
- Click “Finish” to see the dataset reformatted in an Excel-readable format
5. The data should now look like this.
6. Save the file in ".xls" or another Excel format, NOT as ".csv"
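The same delimiter check can be done outside Excel. The following Python sketch uses the standard csv module to detect whether “;” or “,” separates the fields and then splits each row into columns. The sample text and its column names are invented for illustration. A real export would be read from the downloaded file instead.

```python
import csv
import io

# Invented sample standing in for an exported file; a real export would be
# opened with open(path, newline="", encoding="utf-8") instead.
sample = "ms;fund;amount\nAT;ERDF;10.5\nBE;ESF;7.25\n"

# Detect whether ";" or "," delimits the fields (Step 1 of 3 in Excel).
dialect = csv.Sniffer().sniff(sample, delimiters=";,")

# Split each line into columns using the detected delimiter (Step 2 of 3).
rows = list(csv.reader(io.StringIO(sample), dialect))

print(dialect.delimiter)
print(rows)
```

Restricting the detection to the two candidate characters keeps it from mistaking other punctuation in the data for a delimiter.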
RESOURCES AND CONTACTS
DG REGIO: Find out more about #CohesionOpenData: Website - Blog - FAQ - User Guide to ODP - User guide to visualisation - Browse data stories
Twitter: EUinmyRegion - REGIOEvaluation - #EURegioDataStories - #CohesionPolicy - #CohesionOpenData
- Technical Support: firstname.lastname@example.org
- Developer Platform: https://dev.socrata.com/
Date of text: April 2022
Authors: Iluta BADERE, John WALSH