Papers by Willem R van Hage

There is an abundance of semi-structured reports on events being written and made available on the World Wide Web on a daily basis. These reports are primarily meant for human use. A recent movement is the addition of RDF metadata to make automatic processing by computers easier. A fine example of this movement is the Open Government Data initiative which, by adding RDF metadata to spreadsheets and textual reports, strives to speed up the creation of geographical mashups and visual analytics applications. In this paper we present a new Open Linked Data RDF dataset and a method for automatically adding such RDF metadata to semi-structured reports. We showcase our method on piracy attack reports issued on the web by the International Chamber of Commerce's International Maritime Bureau (ICC-CCS IMB). We create a Semantic Web representation with the Simple Event Model (SEM) from screen scrapes of the ICC-CCS website. We show how the event layer makes it possible to easily analyze and visualize the aggregated reports to answer domain questions. Our pipeline includes conversion of the reports to RDF, linking their parts to external resources from the Linked Open Data cloud and exposing them to the Web through a ClioPatria web server that hosts the RDF.
Uncertainty Estimation and Analysis of Categorical Web Data
Lecture Notes in Computer Science, 2014
Trusting Web data: a maritime case study
Scenario: ships send satellite (AIS) messages to land stations. AIS messages provide static (official identifiers, flag, dimensions, etc.) and dynamic (position) information about the ships. Fields shown: IMO, MMSI ... Flag ... Length ... Lat ... Long ... Considering opinions from different land stations, we can reduce their interference. Including Web data can be a solution. Provenance relation shown: opm:wasDerivedFrom. Davide Ceolin, Paul Groth, Willem Robert van Hage, Guus Schreiber ...
Public authorities are increasingly sharing sets of open data. These data are often preprocessed (e.g. smoothed, aggregated) to avoid exposing sensitive data while trying to preserve their reliability. We present two procedures that address the lack of methods for measuring the reliability of open data. The first procedure is based on a comparison between open and closed data, and the second derives reliability estimates from the analysis of open data only. We evaluate these two procedures on data from the data.police.uk website and from the Hampshire Police Constabulary in the UK. With the first procedure we show that the reliability of the open data is high despite preprocessing, while with the second we show that reliability can be estimated from the open data alone.