ODAK – Program

This is the draft programme for the event. The papers will be found on the EWIC digital library nearer to the conference, and the programme presents the titles and abstracts.

Welcome

2022-06-30T10:00:00Z

Session chair: Karina Rodriguez Echavarria

Open Data and Artificial Intelligence

2022-06-30T10:15:00Z

Chair: Karina Rodriguez Echavarria

A novel cough audio segmentation framework for COVID-19 detection

Alice E. Ashby (University of Brighton, UK), Julia A. Meister (University of Brighton, UK), Goran Soldar (University of Brighton, UK), Khuong An Nguyen (University of Brighton, UK)

Despite its potential, Machine Learning has played little role in the present pandemic, due to the lack of data (i.e., there were not many COVID-19 samples in the early stage). Thus, this paper proposes a novel cough audio segmentation framework that may be applied on top of existing COVID-19 cough datasets to increase the number of samples, as well as filtering out noises and uninformative data. We demonstrate the efficiency of our framework on two popular open datasets.

Exploring AI in Healthcare: How the Acceleration of Data Processing can Impact Life Saving Diagnoses

Jacob Jane (Arizona State University, United States, Ajay Bansal (Arizona State University, United States, Tyler Baron (Arizona State University, United States)

Artificial Intelligence (AI) is one of the biggest topics being discussed in the realm of Computer Science and it has made incredible breakthroughs possible in so many different industries. One of the largest issues with utilizing computational resources in the health industry historically is centered around the quantity of data, the specificity of conditions for accurate results, and the general risks associated with being incorrect in an analysis. Although these all have been major issues in the past, the application of artificial intelligence has opened up an entirely different realm of possibilities because accessing massive amounts of patient data, is essential for generating an extremely accurate model in machine learning (ML). This paper presents an analysis of tools and algorithm design techniques used in recent times to accelerate data processing in the realm of healthcare, but one of the most important discoveries is that the standardization of conditioned data being fed into the models is almost more important than the algorithms or tools being used combined.

Towards a Standard Testing Data Set in Privacy

Amedeo Roberto Esposito (EPFL, Switzerland)

More and more information is being rendered publicly available through open data. Consequently, the need for private mechanisms is growing. The issue of the privacy-accuracy trade-off is more prominent than ever: keeping the information private and secure can seriously hamper the performance of queries of interest. Having perfectly secure open data that no one can interrogate is a paradox against the principles upon which open data themselves were founded. But how can one test said accuracy and performance? Much like in Machine Learning, data-sets for benchmarking are becoming necessary. The best one can do without them is theoretically compare private mechanisms among themselves, while the implications of these theoretical guarantees in daily practice remain unclear. A preliminary analysis that takes ideas from theory and tries to identify the characteristics of a potential benchmark is presented in this work.

Lunch

2022-06-30T11:10:00Z

Co-creation of Open Data: Open Data of the citizens, by the citizens, for the citizens!

2022-06-30T12:00:00Z

Vittorio Scarano, Chair: Andrew Fish

Vittorio Scarano is a Full professor of Computer Science at the Computer Science Department of University of Salerno in Italy. He has been visiting the University Eotvos Lorand in Budapest (Hungary) and he has visited the Department of Computer Science at the University of Massachusetts at Amherst (USA). Since 1996, he funded and co-directs the ISISLab laboratory within the Department. ISISLab has been hosting, until now, the research activity of 23 PhD students, actually employing more than 20 collaborators (grants, fellowships) with involvement in research project at local, regionale, national and european level. He is co-author of more than 150 papers in internationally refereed journals and conferences of IEEE, ACM, etc.. He coordinated the European funded research H2020 project ROUTE-TO-PA “Raising Open and User-friendly Transparency-Enabling Technologies fOr Public Administrations” (grant agreement No 645860) with 12 partners and a budget above 3M€. He has participated and coordinated local units in European (FP6, Leonardo, Erasmus+), national and regional funded research and innovation projects. He likes family, doing research and teaching, in this order.

Open data is data that is freely available to everyone to use and republish as they wish, without any kind of restrictions, be it copyright, patents or any other mechanisms of control. It is recently become a very important innovation for the Public Administration and citizens in order to improve the transparency and the awareness of the relationship government-citizens. The seminar will describe the experiences generated by the EU H2020 Project ROUTE-TO-PA, multidisciplinary innovation project, that, by combining expertise and research in the fields of e-government, computer science, learning science and economy, is aiming at improving the impact, towards citizens and within society, of ICT-based technology platforms for transparency. The mail objective of the project was to improve the engagement of citizens by making them able to socially interact over open data, by forming or joining existing online communities that share common interest and discuss common issues of relevance to local policy, service delivery, and regulation; citizens are also empowered to co-create open datasets, thereby becoming authors and actors in the Open Data ecosystem, rather than simple (maybe even advanced) users. We will illustrate the guidelines of the project, the Social Platform for Open Data (SPOD) created by the project and several examples of real communities that are using the outcomes of the project, even 3 years after the end of the project. In particular we will describe the experience of HETOR (www.hetor.it) where communities are collectively creating knowledge (in the form of open datasets) about the local cultural heritage, collecting materials, oral traditions, and recollections of their local history. Finally we will explore how, by using Linked Open Data, it is possible to simply create a Virtual Exhibition from personalized requirements. Acknowledgments: Most part of the work was supported by the grant of EU Horizon 2020 project ROUTE-TO-PA. We thank all the researchers of the ROUTE-TO-PA project for very interesting and useful discussions. We also thank all the participants of the project and all the citizens and organizations that collaborated in the use cases.

Participatory and Community Open Data Initiatives

2022-06-30T13:00:00Z

Jason Evans (The National Library of Wales), Karina Rodriguez (University of Brighton), Michael Kelly (University of Brighton), Chair: Myrsini Samaroudi

This panel will explore developments, challenges and opportunities for co-developing open data and knowledge, in particular when this involve collaboration with non-technical experts, including members of communities, and other practitioners in disciplines such as humanities, museums and heritage, arts and crafts. The session will bring together researchers and practitioners explore the various workflows, technologies and processes to capture, document, link and provide access to a variety of content which can support various purposes including open research practices, preservation of knowledge and access of data by the wider public. Cross cutting issues, including ethics, copyright and sustainability will also be discussed.

Coffee and Tea Break

2022-06-30T13:40:00Z

Visualisation of Open Data

2022-06-30T13:55:00Z

Chair: Vittorio Scarano

Visually Overviewing Biodiversity Open Data Digital Collections

Asla Medeiros e Sà (FGV/EMAp, Brazil), Franklin Alves de Oliveira (FGV/EMAp, Brazil), Bruno Schneider (Universitat Konstanz, Germany), Karina Rodriguez Echavarria (University of Brighton, UK), Cristiana Silveira Serejo (Museu Nacional, Brazil)

After years of mass digitisation initiatives in Natural History institutions, large biodiversity collections have emerged on the web as open data. Studies on climate change and nature conservation rely heavily on this data to understand the distribution, presence/absence, changes over time, and interaction of species, and community ecology. For the institutions that hold this data, the exploration and verification of the records they produce are critical to support new modes of studying, analysing, and accessing biodiversity information. However, the process of data verification is challenging given the complex relationships between the data. This poses difficulties to the diagnosis of completeness, correctness, and good coverage of the domain. To this day, there is no clear understanding of to what extent existing visualization techniques can systematically support the task of data verification. To support research in this area, this paper reviews the visualisation solutions by focusing on a function-based visual exploration concept that can be integrated into a data verification pipeline for biodiversity datasets. Beyond reviewing the state of the art, we describe a data verification pipeline following such concept for biodiversity collections of the National Museum/Federal University of Rio de Janeiro, Brazil. The pipeline is targeted to domain expert users in supporting strategic decisions on data maintenance, as well as also having the potential to support general users in contextualising the datasets.

30Cappa - The "Christmas" decree in kilometres

Maurizio Napolitano (Fondazione Bruno Kessler, Italy), Andrea Borruso (Associazione OnData, Italy), Salvatore Fiandaca (GFOSS.it, Italy)

Following the COVID-19 pandemic emergency, in mid-December 2020 the Italian government introduced travel restrictions during the Christmas holidays. In the implemented policies there was an exception: citizens of a municipality of up to 5,000 citizens can move within an area of 30 kilometres from their respective borders. This policy was used again in the following months to manage travel during the pandemic. 30Cappa is a data visualization created by three civic hackers to give citizens the opportunity to understand this policy (”cappa” is the pronunciation of the letter ”k” in Italian and means ”km”). The project consists of a website where it is possible to receive information on the matter and the cartographic representation of the municipalities that correspond to the exception. The site has reached 350,000 unique visitors in two months and there has been a lot of talk about it in the media. This work highlights how the transformation of a policy into a tangible product such as a map created with the open data available, becomes an effective tool to guide citizens and also to review the policies themselves.

Smart Museum Tour using Linked Open Data

Somesh Siddabasappa (Arizona State University, United States), Suyog Somesh Halikar (Arizona State University, United States), Ravikanth Dodda (Arizona State University, United States), Nikhil Hiremath (Arizona State University, United States), Srividya Bansal (Arizona State University, United States)

Linked Open Data is structured data on the web that is published and shared from various sources. Related data is connected using web standards such as URIs and RDF. The current linked open data cloud comprises billions of triples. With the availability of various open data sets, a number of innovative and interesting applications are possible. This paper focuses on using data integration techniques and ontology engineering for linked data generation that can be used for the creation of a smart museum tour. The Smithsonian American Art Museum has a vivid collection of American artworks. The data about all artworks and sculptures and their location in the museum is published as Linked Open Data. This paper presents a linked data generation approach for a smart museum tour application that uses the user’s location as input and displays information about all the nearby artworks and additional connected and related information from a linked open data set about an artwork chosen.

Infrastructures of knowledge: Two perspectives on linked open data in the field of Germany’s cultural heritage.

2022-06-30T15:00:00Z

Robert Nasarek, Chair: Karina Rodriguez Echavarria

Robert Nasarek is a Research associate (Museum and Cultural Informatics - Germanisches Nationalmuseum Nuremberg). He was an Information scientist (National Academy of Sciences Leopoldina) and is a PhD student at the Martin-Luther-University Halle-Saale.

The presentation describes the goals, current situation and remaining challenges for producers and users of linked open data in the field of cultural heritage in Germany. Two perspectives will be taken, on the one hand the macroscopic view on the part of the consortium NFDI4Culture, an organizational unit within the German National Research Data Infrastructure founded in October 2020, and on the other hand the mesoscopic view out of the engine room of the Scientific Comunication Infrastructure Software WissKI. Theoretical, technical and governemental solutions and problems in the field of linked open data of cultural heritage are thus presented.

Dinner at Beach Club Restaurant

2022-06-30T18:00:00Z

Applications of Open Data

2022-07-01T08:30:00Z

Session chair: Asla Sa

Open Data Literacy by Remote: Hiccups and Lessons

Alessia Antelmi (Università degli Studi di Salerno, Italy), Maria Angela Pellegrino (Università degli Studi di Salerno, Italy)

Open Data are published to ensure the creation of value and data exploitation, but limited technical skills are a critical barrier. Most users lack the skills required to assess data quality and its fitness to use, awareness of open data sources, and what they can do with the data. To advance the dialogue around methods to increase awareness of Open Data, improve users’ skills to work with them, and deal with the requirement of letting future citizens develop data and information literacy according to 21st-century skills, this article proposes a series of workshops to let Italian high school learners familiarise themselves with effective communication based on Open Data. The article describes an ongoing activity, reporting preliminary results on engagement and learning. We discuss challenges in engaging learners remotely and the promising learning outcomes achieved by overcoming cultural and technical barriers to visualise Open Data.

Collaborative Open Data in education: COLTRANE experience on cybersecurity

Jerry Andriessen (Wise & Munro Learning Research, the Netherlands), Steven Furnell (University of Nottingham, UK), Gregor Langner (AIT Austrian Institute of Technology GmbH, Austria), Giuseppina Palmieri (Università degli Studi di Salerno, Italy), Gerald Quirchmayr (Universitat Wien, Austria), Vittorio Scarano (Università degli Studi di Salerno, Italy), Teemu Johannes Tokola (University of Oulu, Finland)

In this paper we report on an experiment on Open Data co-creation conducted in the COLTRANE Erasmus+ project, that focusses on cybersecurity education. Students of a Master degree course on Computer Science were asked to collaboratively work on building and understanding open data about cybersecurity threats and risks, presenting interesting insights after a week of work (conducted by remote, by using a collaborative platform). We report here some of the initial results of their experience, and some lessons learned and limitations found in COLTRANE activities.

Keyword extraction and summarization from unstructured text: A case study with open data from legal domain

Varun Singh (Arizona State University, United States), Srividya Bansal (Arizona State University, United States)

Information Extraction (IE) is an important and crucial task in the world of web and open data. IE is achieved using Natural language Processing (NLP). There are various techniques used for extraction of information, however coming up with useful and meaningful information is the most important task. Many search engines rely heavily on IE. This paper focuses on entity extraction of named entities from natural language and converting them into knowledge graph of triples. The goal is to answer two types of queries (i) Keyword search that returns exact information; (ii) Summarization of a keyword in question. A case study using open data from legal domain is presented.

Speeding Date - The Mathematics of Fast Calendar Algorithms

Cassio Neri, Independent Researcher

This talk, a joint work with Prof. Lorenz Schneider of EM-Lyon, concerns fast calendar algorithms. I start by touching on the fascinating history of the Gregorian calendar, from its Roman origins until it got its current form in the 16th century. This evolution deeply impacts implementations of calendar algorithms in all types of software systems. Then, I progress to the study of Euclidean Affine Functions which provide the mathematical foundations of our algorithms. Specifically, I show some algebraic and numerical results that explain why our algorithms are so suited for modern superscalar CPUs. Along the talk, I present code snippets in C/C++ and x86_64 assembly and also benchmark results showing that our algorithms perform considerably faster than counterparts in Microsoft's .NET framework, OpenJDK, Android phones and other popular open source libraries. Literally billions of devices can benefit from our results. Actually many already are since I have contributed these algorithms to GCC's C++ Standard Library and to the Linux Kernel

Coffee and Tea Break

2022-07-01T09:30:00Z

From Datasets to Knowledge to Decision-making in e-Gov Processes: Best Practices in Public Administrations.

2022-07-01T10:00:00Z

Giuseppe Ferretti, Chair: Alessia Antelmi

Giuseppe Ferretti is a public official (role: e-Gov Strategic Planner, supporting the coordination and promotion of e-government policies and smart communities) for Digital Transition Special Office at Campania Regional Government (Napoli, Italy), dealing with the state-of-the art Digital Services and best practices for a region of 6.000.000 resident people, more populated than many UE nations. 30 years demonstrated experiences in ICT leader roles, both in Industry and Public Administration, his physical brain was made in Italy (Napoli) and runs unstopped since 0,5+ centuries ago.

The pyramid Data-Information-Knowledge-Wisdom shows that, starting from the base, the incremental elaboration at each level is useful to accumulate the experience that allows decision-making processes in every organization, public or private. The decision-risk decreases towards the vertex of the pyramid, but only if the value and the quality of the base (data) is high and if the security is granted: there is a need for a continuous attention to standardization and regulation about this, expecially in Health, Social and Environment sectors. In e-gov processes, the Public Administration must be both Producer (for citizens, enterprises, professionals and other PA's) and Consumer (e.g. by interoperability with other PA's or by using Research tools and prototypes) of high quality open data, compliant with privacy and transparency. We will show some of the most interesting experiences in PA's, pre and post pandemic, at european, national and regional level, originated by citizen's contests or by decision-maker actions.

2022-06-30T10:00:00Z

Session chair: Karina Rodriguez Echavarria

2022-06-30T10:15:00Z

Chair: Karina Rodriguez Echavarria

A novel cough audio segmentation framework for COVID-19 detection

Alice E. Ashby (University of Brighton, UK), Julia A. Meister (University of Brighton, UK), Goran Soldar (University of Brighton, UK), Khuong An Nguyen (University of Brighton, UK)

Exploring AI in Healthcare: How the Acceleration of Data Processing can Impact Life Saving Diagnoses

Jacob Jane (Arizona State University, United States, Ajay Bansal (Arizona State University, United States, Tyler Baron (Arizona State University, United States)

Towards a Standard Testing Data Set in Privacy

Amedeo Roberto Esposito (EPFL, Switzerland)

2022-06-30T11:10:00Z

2022-06-30T12:00:00Z

Vittorio Scarano, Chair: Andrew Fish

2022-06-30T13:00:00Z

Jason Evans (The National Library of Wales), Karina Rodriguez (University of Brighton), Michael Kelly (University of Brighton), Chair: Myrsini Samaroudi

2022-06-30T13:40:00Z

2022-06-30T13:55:00Z

Chair: Vittorio Scarano

Visually Overviewing Biodiversity Open Data Digital Collections

Asla Medeiros e Sà (FGV/EMAp, Brazil), Franklin Alves de Oliveira (FGV/EMAp, Brazil), Bruno Schneider (Universitat Konstanz, Germany), Karina Rodriguez Echavarria (University of Brighton, UK), Cristiana Silveira Serejo (Museu Nacional, Brazil)

30Cappa - The "Christmas" decree in kilometres

Maurizio Napolitano (Fondazione Bruno Kessler, Italy), Andrea Borruso (Associazione OnData, Italy), Salvatore Fiandaca (GFOSS.it, Italy)

Smart Museum Tour using Linked Open Data

2022-06-30T15:00:00Z

Robert Nasarek, Chair: Karina Rodriguez Echavarria

2022-06-30T18:00:00Z

2022-07-01T08:30:00Z

Session chair: Asla Sa

Open Data Literacy by Remote: Hiccups and Lessons

Alessia Antelmi (Università degli Studi di Salerno, Italy), Maria Angela Pellegrino (Università degli Studi di Salerno, Italy)

Collaborative Open Data in education: COLTRANE experience on cybersecurity

Keyword extraction and summarization from unstructured text: A case study with open data from legal domain

Varun Singh (Arizona State University, United States), Srividya Bansal (Arizona State University, United States)

Speeding Date - The Mathematics of Fast Calendar Algorithms

Cassio Neri, Independent Researcher

2022-07-01T09:30:00Z

2022-07-01T10:00:00Z

Giuseppe Ferretti, Chair: Alessia Antelmi

2022-07-01T11:00:00Z

2022-07-01T12:00:00Z

Session chair: TBC

2022-07-01T13:30:00Z

2022-07-01T15:30:00Z

Session chair: Karina Rodriguez Echavarria

News

New Speakers announcedJune 19, 2022

Programme availableJune 01, 2022

Keynote SpeakerJanuary 30, 2022

Call for Submissions AnnouncedJanuary 20, 2022

Sponsors

New Speakers announced
June 19, 2022

Programme available
June 01, 2022

Keynote Speaker
January 30, 2022

Call for Submissions Announced
January 20, 2022