FrameBase is a linked open knowledge base meant to uniformly represent a wide range of knowledge, tackling semantic heterogeneity among various sources of structured knowledge, such as the ones in the Linked Open Data cloud. It provides a flexible and uniform way of capturing n-ary relationships by adapting and combining repositories of frames from the fields of linguistics and cognitive science (FrameNet and WordNet) to establish a large and wide-coverage vocabulary that can be used to represent complex knowledge and extended with more specific elements. In other words: if you can express it with language, you can express it with FrameBase —barring some very specific concepts that need to be coined or imported from domain-specific KBs.
The basic motivation underlying FrameBase is: if you can express it
There are two interconnected representation levels in FrameBase:
- A highly expressive layer where information is represented with explicit entities instantiating the frames, and representing specific situations, processes or events of any kind, organized into a rich hierarchy.
- A less expressive but simpler layer based on direct binary predicates between the elements (participants, properties) of the frames. This level is more compact to store and query, and it is connected with the other layer by means Reification-Dereification (ReDer) rules. It can also be used to connect to similar predicates in other sources of structured knowledge or natural language.
FrameBase connects to other knowledge bases by means of integration
rules that can link data in ways that cannot be implemented
with existing binary properties like
Furthermore, because of FrameNet's ties to linguistic semantics, it offers additional possibilities for interfacing with natural language, both for querying and text mining.
Currently, FrameBase integrates knowledge from different large-scale LOD knowledge bases such as YAGO2s, Freebase, and events from DBpedia and Schema.org, so they can be queried under our single schema.
FrameBase thus represents a significantly novel way of connecting the Linked Data world to natural language that is highly expressive, easily expandable, and allows us to draw on natural language processing techniques.
Currently, FrameBase is being used by the PIKES project.
Select different English sentences to see how their information can be represented in FrameBase:
Comparison of existing representation models
The knowledge bases in the LOD cloud use different models to represent
n-ary relations, which leads to inconsistency and impossiblity to link
knowledge by binary predicates such as
. The FrameBase model subsumes them, providing less overall overhead
and a flexible two-layer model that combines the benefits of each of
Additional funding was provided by National Basic Research Program of China Grants 2011CBA00300, 2011CBA00301, and NSFC Grants 61033001, 61361136003, 61450110088.
The FrameBase logo and icon are licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License by the FrameBase team.