Abstract
This paper describes a hackathon event that took place as part of the Ontology Summit in 2014. The purpose of this hackathon was to develop an integrated ontology for a potential travel risk application, and was intended as an exercise in ontology integration and re-use. The hackathon took place over a 48 hour period via remote teleconferencing. Several existing ontologies and ontology design patterns were integrated into one common framework, while a new ontology was developed from available data and integrated with these. The notion of ‘event’ as it relates to risk presented some interesting ontological issues that are described here. This paper describes the work that took place, gives a brief overview of the ontology content, and concludes with lessons learned on the opportunities and challenges in re-use of ontologies from different sources, including the implications of different ontological commitments. The final work product was substantial enough to form the basis of a possible travel risk application.
Introduction
This paper describes a hackathon event that took place as part of the Ontology Summit in 2014 (Obrst et al., 2014). The purpose of this hackathon was to develop an integrated ontology. This was one of several hackathons that took place as part of that year’s Summit.
The stated aim of this hackathon was to explore the opportunities and challenges in re-using ontologies.
This hackathon brought together a number of ontologies, ontology design patterns and high level semantic abstractions (foundational ontology content) to create an ontology around the area of risk in the context of travel. The aim was to specify the ontology needed to support a possible risk application, though the application itself was outside the defined scope of the hackathon.
The primary aim of this hackathon was to consider and identify the challenges in re-using ontologies that come from different sources and that in many cases were designed for different intended uses.
The ontology aimed to identify and model semantic abstractions of risk, combining the different dimensions of risk, namely events, situations, probabilities and impacts, and to identify ontology design patterns for those subject areas. This was to enable the integration of data (such as accident statistics) that would be usable to support a simple mobile application. Some conclusions are given towards the end of the paper.
Planned activities
Hackathon
The following were the stated aims and activities for the Hackathon, as recorded on the hackathon wiki page for this hackathon (Bennett and Berg-Cross, 2014):
Explore and identify ontologies for the different types of content that relate to risk (events, situations, statistics, incidents, goals, etc.) Brainstorm semantic abstractions that would unify the concepts relating to risk across these ontologies, as well as to identify the high-level abstractions for risk itself Identify ontology design patterns around these concepts Consider how to re-use such patterns, for example taking existing patterns for Situation and Event and specializing these for Risky Situation and Risky Event Create a single ontology that integrates these concepts
Future work/out of scope
Scope out and specify a possible application Capture and/or simulate sample data Specify and create semantic queries to interrogate the integrated ontology Feed the results of semantic queries and/or reasoning engines into an application for mathematical calculation of risk values
Focus was on
Ontology design patterns versus high level abstractions Extending design patterns for new application contexts Considering the use of upper ontology elements in integration Accessing and interpretating Linked Data using ontologies Understanding what makes an ontology re-usable and how to assess the ontology for a given use case
Tooling
Two ontology visualization tools were available for use in the hackathon. One was a Unified Modeling Language (n.d.) tool called Sparx Enterprise Architect (n.d.), adapted by the author for the visualization of constructs in the Web Ontology Language (OWL), described in McGuinness and Van Harmelen (2009) using an early draft of the Ontology Definition Metamodel (2009) from the Object Management Group (n.d.) that was modified to provide business-readable diagrams. It was not possible to generate OWL serialization files directly from this toolset, and so this was only used for presentation of visual representations of the common risk foundational ontology in the Financial Industry Business Ontology (2015) Foundations specification that was in preparation during this hackathon.
The main visualization tool used was the Visual Ontology Modeler (n.d.) from Thematix Partners LLC. This was used for visualizing ontologies after they were created. This tool uses the canonical version of the ODM (2009) specification. It also has the ability to ingest OWL in a number of syntactical flavors and to generate OWL representations of that material for further work and for validation in tools that provide reasoner support.
Additional tooling was used to create ontologies, principally Protégé (n.d.) from Stanford University.
Hackathon activities
The hackathon took place over a 48 hour period, using remote screen-sharing teleconferencing. Participants took part from Australia, Russia, Italy, France and the United States.
Work was a combination of on-screen discussion using shared diagrams and ontology visualization tools, and off-line working on individual ontologies in Protégé by the different participants. Other participants researched possible ontologies and data sources to use, and this initial research was used to decide the areas of risk relevant to support a mobile application.
Ontologies
The group defined the subject domain to be the risks involved in getting from Location A to Location B via various means of transport. Team members identified, built or adapted ontologies for the different dimensions of the problem (events, situations, probabilities and impacts). These were:
trajectory modeling transport events which may occur along the way impacts and goals core risk concepts event ontology and ontology design pattern
Detailed working
The proposed mobile application was intended to provide comparative risk figures for a range of transportation modes against a single specified goal. In the example, the goal was to get from the user’s home in Washington, D.C., to a conference venue in Austin, Texas, by 9am on a given day. A number of different options were given for completing this goal. Risks would then be calculated as a product of probability and impact on that goal, with probability being determined as a simple actuarial application of historical data to present probabilities.
The team combined concepts in the following areas:
Trip data, extending an existing Trajectory ontology from Dean et al. (2013)
Common risk concepts (context-neutral) derived from an existing event ontology design pattern developed for the Centre for Digital Music (Event ontology design pattern developed for the Centre for Digital Music, 2007) and the FIBO ontology for Goals in FIBO Foundations (2015)
Risk Assessment (impacts etc.) – also extended for positive versus negative outcomes of an event
Travel Adverse Events based on available sources of historical statistical data in Campbell et al. (2003), Chang (2008) and Zegeer et al. (2005).
Some ontologies were created from scratch from sample data sets, while others were adapted or re-used. The Trajectory ontology was provided by Mike Dean, who extended it with travel-related concepts for the purposes of this hackathon in the form of a new “Trip” ontology. The kinds of incident that could prevent or delay a traveler were sourced in the three sample sets of historical data from which Anett Hoppe created an ontology from scratch.
The Event Ontology was built around a published event ontology design pattern for open source development, developed for the Centre for Digital Music. Max Gillmore extended and adapted this to create an over-arching ontology of risk events.
Mike Bennett created diagrammatic visualizations of these ontologies and facilitated collaborative sessions in which the group endeavored to integrate these.
We imported most of the ontologies into our working set of integrated ontologies in the VOM tool, and used this to identify common concepts and differences between concepts in the different ontologies.
In this way the hackathon involved both top-down and bottom-up development of ontologies.
For the Trip, Common Risk and Risk Assessment areas, the participants created or adapted ontologies in Protégé. The resulting Common Risk and Risk Assessment ontologies were then ingested into the VOM tool in order to provide a visualization of the model content.
All ontologies were in OWL. We did not set out any specific requirements for these to be in particular dialects of OWL as described in McGuinness et al. (2009), such as OWL-DL or OWL-Full, since the requirement focused on business meaning rather than machine processing. However, inspection of the ontologies used suggests that many or all of them would also be OWL-DL compliant. We did not test for this.
Syntaxes used were N3 (2011), Turtle (2014) and RDF/XML (2014). OWL was chosen partly because it was familiar to a number of participants and could be created in tooling available to them, and partly because there were pre-existing ontologies in this format which we were able either to re-use or adapt.
Diagrams were created in the VOM tool for each ontology to better understand the content, and these were laid out along similar lines to the available conceptual diagrams in the reference sources for this work. The aim was to create an integrating ontology which would import these and define the overall application ontology.
The ‘Travel Adverse Events’ ontology was developed in bottom-up fashion directly from the available data. This ontology is very extensive and covers multiple modes of transport and multiple causes of delays, accidents, bridge strikes1
Bridge strike is the industry term for when a truck or other item hits and damages a bridge. The affected section of track or roadway normally has to be closed until the damage to the bridge can be inspected and declared safe.
A second round of work involved layering the common risk concepts such as those for risk event consequences and impacts, and integrating the C4DM Events ODP with the FIBO ontology for Goals. This was then ingested into VOM and a set of diagrams created for the main concepts (Fig. 1).

The Ontology ‘Risk Core Concepts’.
We used the on-line sessions to compare thinking about what a model of the core concepts for risk would look like and converged on a common conceptual framework which was implemented in whole or in part in the travel events, common concepts and risk assessment ontologies, each of which contained refinements and extensions to that model. The common conceptual framework took the form of a set of high level semantic abstractions of which the concepts in the integrated ontologies could be framed as specialized kinds by way of subclass and subproperty relationships.
The extensions to the Trip ontology were segregated so that there were separate OWL files for instance data for the example application, and for common concepts for modes of transport. Most of those common concepts were already in the ‘Travel Adverse Events’ ontology, while some needed to be added. Those additional concepts were for types of rental car, types of aircraft body and other variables which were assumed to be related to the real-world risk of those travel modes. As a future exercise, once the ontology of these additional concepts is defined, it would form a checklist of sets of historical data to be mined. Thus, the top-down approach to risk factors met the bottom-up modeling of available risk statistics that was carried out during the hackathon.
Of particular interest in the integration, was the notion of “Event”. For the hackathon itself, we did not look at foundational ontologies that have high-level concepts for things that happen, but instead took as our starting point the simple C4DM event ontology design pattern and considered the semantics of events as they related to risk.
The hackathon ended up with two distinct notions of Event that could not be combined into one concept or represented as one class. One of these was the above ontology design pattern where an event has a time and a place (whether this is in the past, present or future), and the other was the kind of event that may or may not happen, and which (for the purposes of risk) is an event to be avoided. As can be seen in Fig. 1, these were related via two subclass relationships but this was not considered satisfactory. This was not addressed during the hackathon, and in particular the class shown as ‘RecurrentEvent’ in Fig. 1 would not be retained in any eventual solution.

Integrated Ontology Architecture.
It was also noted that in other contexts, such as finance, an event to be avoided by one party as a risk might represent an event to be welcomed by another as an opportunity. In either case, the event needed to be represented in the ontology even though it may not (or in the risk case, should not) actually happen. This provided some interesting insights into the way that different interpretations of a term like “event” might refer to different kinds of events in a domain of discourse.
The ontologies created for the hackathon are available in a publicly accessible GitHub repository as GitHub RiskHackathon (2014).
Status of the ontologies
At the end of the hackathon there was sufficient information to specify an overall, integrated ontology and design for a simple travel risk mobile application. In this application, the user would enter a desired time and destination and either enter different travel modes or have these calculated by existing applications which already do this; the application would return comparative risk figures for the different travel options.
The integrated ontology consists of several modules in the form of individual OWL files (sometimes also referred to as ontologies), with OWL import relationships between them as appropriate. One over-arching OWL file imports the others to define the overall graph that is the integrated ontology for the application, as shown in Fig. 2. This over-arching file is identified in Fig. 2 as “Integrating Ontology” and would need to be given a more appropriate permanent name.
In Fig. 2 the rounded gray boxes are individual OWL ontology files while the white boxes represent sets of assertions which are included in those ontology files for now but which would ideally be segregated into separate ontology files when the design is completed.
Ontological analysis of ‘Event’
There needed to be representations that would account for the two distinct interpretations of the notion of ‘Event’: something that actually happens with a time and a place, and something that can be described and thought of as an event whether or not it will happen. The latter is always either avoided or is in some possible future; once a risk related event has happened, it is no longer a risk. Therefore, any event characterized as having a time and a place is an event which has happened (or will happen) and having done so no longer presents a risk.
This presents challenges to some notions of how an ontology is intended to relate to its subject matter (e.g. ‘what exists’). One common approach to these challenges has been to think in terms of the risk event occurring in some “possible world”. In the case of risk this does not seem satisfactory, since one always has to model the world in which the event is avoided – this being also a possible world but not the possible work in which the event occurred. In any case this kind of work-around would not directly address the presence of two risk classes in the ontology – how would you show the distinction between the two classes called Event? These would have different identifiers but the user would need to know the intended semantics of each.
Since the application needs to make reference to both of these event concepts, it was necessary to approach this problem by sub-dividing the event-related part of the foundational model, to distinguish events that represent some risk, from other kinds of event in the domain of discourse, including the occurrence of a journey (a set of events) in which the risk-related events do not happen.
Another way of looking at this is that an ontology can be understood to provide intensional representations of events real, imagined or avoided, and that in the case of adverse events this intensional understanding of the concept is germane to avoiding or dealing with the occurrence of such an event in the actual world.
Property domains and ranges
It was also noted that many of the re-used ontologies had overly generalized domains and ranges, relying on the restrictions within the application to convey the intended semantics. In our view, this might be appropriate for a stand-alone Semantic Web application and may make some ontologies easier to re-use, but can be a barrier to integration of ontologies from multiple sources, as the intended semantics of the individual properties are not well defined.
Tooling
Another conclusion was that the tooling that we were able to find was not ideal for integrating ontologies. For example, VOM was well suited to ontology visualization, but only supports editing of one ontology at a time. There is a need for tooling that can be used to actively edit content in multiple ontology files, to move things around and complete and optimize the overall, integrated ontology.
Travel risk and financial risk
An interesting by-product of this work is that there are conceptual similarities between the semantics of the risks in multi-stage journeys, and the semantics of financial credit risk and cashflow payment streams. The concepts seen here for journeys and trajectories provide a more accessible way of thinking about these concepts for the financial industry. The similarities between journey trajectories and complex cashflow commitments seems to be amenable to the creation of a common ontology design pattern for the flow of cash based on the trajectories ontology. This would be complementary to existing mathematical models such as the well-known Black–Scholes model (Black and Scholes, 1973).
Future work
We plan to integrate the separate ontologies that were reused into a single integrated ontology, while retaining the separate topic-specific ontology files as modules. This involves specifying assertions for class equivalences and for mapping properties between modules. We plan to re-define the modular structure of the complete set of ontologies to more closely reflect the separate concerns that each ontology covers.
The need for two distinct notions of event points to the need for further ontological analysis of the concepts relating to events. To support this, it will also be necessary to partition the integrated ontology. For example, the original FIBO conceptual framework uses a Continuant versus Occurrent pair of partitions based on Sowa (2000) to distinguish things that happen from things that persist in their identities over time. The Event ontology design pattern we re-used did not have such partitioning. It is possible to further sub-divide occurrents, for example to distinguish between intended occurrents (like process definitions) and actual occurrents i.e. past present or predicted occurrences of a kind of event. Other possible distinctions include those between cumulative occurrents, where the occurrent builds towards some outcome, such as the accrual of interest, versus non-cumulative occurrents which do not. The ontological analysis of event-related concepts may also take in the notion of rigidity as described in the OntoClean methodology in Guarino and Welty (2002).
The conclusion we reached was that when integrating ontologies from a range of sources, the target, integrated ontology needs to have clear partitioning of this and other, similar theories. In other words, the integrated ontology needs to make reference to a well-formed “upper ontology”, where this term refers to the use of top-level categories or “partitions” (such as the Continuant and Occurrent ones mentioned above), along with an appropriate set of cross-domain high level concepts.
The challenge is that many source ontologies are not so partitioned. For this reason, when re-using ontologies, it may be necessary to re-frame the content of each one within an upper ontology (in the sense described above) with that upper ontology being part of the integrated model. Such re-framing would necessarily be in a separate namespace from the original ontology.
Additional risk factors in the Trip ontology (which covers rental car types, aircraft body types etc.) will be added in such a way that statistical data for these risks can be incorporated into applications. Types of travel event for which there are risk statistics (such as traffic jams) will be rolled up into broader events which are elements of the trajectory. For example, an event such as a traffic jam would be seen to result in a trajectory event such as a missed connection or the failure to complete one leg of a journey.
Participants
The following participated actively in the hackathon:
Anett Hoppe – detailed travel events analysis and ontology
Brand Niemann – initial brainstorming
Jeff Braswell – ontology integration discussions; insights on financial risk
Max Gillmore – Top level risk ontology
Mike Bennett – Coordination, integration
Mike Dean – Trajectory ontology and extensions
Mirko Morandini – ontology integration discussions
Tatyana Poletaeva – ontology integration discussions
