Open Data

Created by Donal Hunt, Director at OpenStreetMap Ireland
90 People have this badge

What everyone's up to

Jul 01, 2018
Published a dataset
Interactive on-line overview of 5000 public, private and non-profit projects in Croatia co-funded with 4 billion 
Euros from various EU instruments, including weekly contracting figures.
https://www.udruga-gradova.hr/pregled-ugovorenih-eu-projekata-rh/

Submitted Hacktoberfest PR
Published a dataset
Used NLP
Used Machine Learning
Wrote Documentation
+ 3
I have contributed into #hacktoberfest2021 uploading four audio #datasets on #DagsHub and #GitHub with complete documentation. You can now access all open-source dataset at one place.
GitHub: https://lnkd.in/dj8TD5ht
Public Domain Sounds: https://lnkd.in/dpvggTYi
Urdu Dataset: https://lnkd.in/dh8jenjh
EmoSynth: https://lnkd.in/dqc4piZq
Voice Gender Detection: https://lnkd.in/dZ3WNTGz
Used DAGsHub
Used Github
Published a dataset
Submitted Hacktoberfest PR
Urdu Audio Dataset
+ 3
I have contributed to the URDU dataset by making it parseable on DagsHub and easy to consume! as a part of hacktoberfest2021

GitHub: https://lnkd.in/dj8TD5ht
DAGsHub: https://lnkd.in/dh8jenjh

Sep 11, 2021
Published a dataset
https://cryptics.eigenfoo.xyz/

cryptics.eigenfoo.xyz is a dataset of cryptic crossword clues, collected from various blogs and publicly available digital archives. I originally started this project to practice my web scraping and data engineering skills, but as it’s evolved I hope it can be a resource to solvers and constructors of cryptic crosswords.

The project scrapes several blogs and digital archives for cryptic crosswords. Out of these collected web pages, the clues, answers, clue numbers, blogger’s explanation and commentary, puzzle title and publication date are all parsed and extracted into a tabular dataset. The result (as of September 2021) is over half a million clues from cryptic crosswords over the past twelve years, which makes for a rich and peculiar dataset.
Sep 05, 2021
Published a dataset
Decent correlation between return to office that many experienced on September 1 and corporate travel booking data from TripActions here... Head back to the office, then start booking in-person meetings again.

Published a dataset
Developed Web App
Contributed to open source
+ 1
Here's the result of my and Leonid Evdokimov (https://darkk.net.ru/) work to collect the data on russia's twitter throttling:

Dataset: https://github.com/4ndv/russia-twitter-throttle

Webpage, that was used to collect the data: https://github.com/4ndv/is-my-twitter-slow-or-what