The Web of Data has seen tremendous growth recently. New forms of structured data have emerged in the form of knowledge graphs, Web markup, such as schema.org, as well as entity-centric data in Web tables. Considering these rich, heterogeneous and evolving data sources which cover a wide variety of domains, exploitation of Web Data becomes increasingly important in the context of various applications, including dataset search, question answering and fact verification. These applications require reliable information on dataset characteristics, including general metadata, quality features, statistical information, dynamics, licensing, and provenance. Lack of a thorough understanding of the nature, scope and characteristics of data from particular sources limits their take-up and reuse, such that applications are often limited and focused on well-known reference datasets.
The goal of the PROFILES’19 workshop is to bring together researchers and practitioners interested in the development of techniques for dataset profiling and deriving quality analytics, as well as performing dataset search and dataset retrieval on the Web while taking dataset profiles into account. We are interested in approaches to analyse, characterise and discover data sources. We aim to discuss technologies addressing data profiling and search – including semantics, information retrieval for Web Data (ranking algorithms and indexing), in particular in the context of decentralised and distributed systems, such as the Web. We want to facilitate a discussion around data search across formats and domain-specific applications.
PROFILES offers a highly interactive forum for researchers and practitioners, bringing together experts in the fields of the Web, Semantic Web, Web Data, Semantic Search, Databases, NLP, IR, and application domains. We envision the workshop as a forum for researchers and practitioners to come together and discuss common challenges and identify synergies for joint initiatives.
The Proceedings are available at CEUR-WS. The full proceedings PDF is available here.