Graph Model Definitions - Hyper-relational

Artigo 1 - Improving Hyper-Relational Knowledge Graph Completion

Hyper-relational KGs (HKGs) go beyond conventional KGs by representing facts with more complex semantic information, e.g., using relation-entity pairs as the qualifiers of triplets. The combination of a triplet and its qualifiers together is called a statement.

In a hyper-relational KG GH , we denote the set of entities and relations as V and R respectively. The total number of entities is N and the number of relations is M. The edge connecting them, which we call a statement (or fact), is expressed in the domain V × R × V × P (R × V) where P denotes the power set.

It’s usually written as (mh,mr,mt, Q) where (mh,mr,mt) is the main triplet of the statement containing head entity mh ∈ V, relation mr ∈ R and tail entity mt ∈ V respectively. Q is the set of qualifiers consisting n relation-entity pairs {(qri, qei)}n i=1 where qri ∈ R and qei ∈ V.

(mh,mr,mt, {(qri, qei)})

Artigo 2 - Beyond Triplets: Hyper-Relational Knowledge Graph Embedding for Link Prediction

Despite its broad adoption, the triple-based representation of a KG often oversimplifies the complex nature of the data stored in the KG, in particular for hyper-relational data (a.k.a. multi-fold [38] or n-ary [14] relational data), where each fact contains multiple relations and entities.

Such hyper-relational data is ubiquitous in KGs.

However, representing a KG using triplets only often oversimplifies the complex nature of the data stored in the KG, in particular for hyper-relational data, where each fact contains multiple relations and entities (see example above).

Hyper-relational fact: A hyper-relational fact contains a base triplet (h,r,t) and a set of associated key-value pairs (ki ,vi), i = 1, ...,n.

(h,r,t, {(ki, vi)})

Artigo 3 - Logic on MARS: Ontologies for Generalised Property Graphs

We give a formalisation of a generalised notion of Property Graphs, called multi-attributed relational structures (MARS),

There is no standard definition of what constitutes a KG, and the formats used in practice vary. The basis of KGs typically are directed graphs with labelled nodes and edges. What distinguishes them from plain graphs is their enriched structure that includes additional annotations to provide contextual information for every edge or node. Examples include provenance (source information) and temporal validity, but there can be many other types of annotations.

A popular data model for such KGs is the Property Graph model, used by the Neo4J graph database ... It allows sets of attribute–value pairs to be associated with the nodes and edges in a directed graph. Such graphs are also known as attributed graphs.

In fact, even the underlying data model of a multi-attributed graph lacks proper formalisation. Property Graph and Wikidata are highly implementation bound and have no formal specification.

We consider a finite set P of predicates, where each p ∈ P has an associated arity ar(p) ≥ 0. If not otherwise stated, we assume this signature to be fixed and refrain from mentioning it. For the following definition, let Pfin (S) be the set of all finite subsets of set S.

Definition 1. A multi-attributed relational structure (MARS) M consists of a non-empty set ∆M of domain elements and, for each n-ary predicate p ∈ P, an (n + 1)-ary relation pM ⊆ (∆M) n × Pfin (∆M × ∆M).

In other words, a MARS behaves like a relational structure (i.e., hypergraph) over a domain ∆M, where each relation tuple (i.e., hyperedge) is annotated with a finite binary relation over ∆M. We view this relation as a set of attribute–value pairs. There might be multiple values for each attribute, justifying
our terminology. Also note that the same relational tuple may occur with different attribute–value collections within a single MARS. Thus MARS generalise Property Graphs, where attributes are functional and relations are unary and binary. The unary relations can be used to assign attribute–value collections to nodes.

Artigo 4 - Wikidata on MARS⋆

As noted in [9], Wikidatas custom data model supports attributed statements (with the attributes referred to as qualifiers), and allows attributes with multiple values.

“In spite of the huge practical significance of these data models ..., there is practically no support for using such data in knowledge representation.”

Wikidata’s custom data model goes beyond the Property Graph data model, which associates sets of attribute-value pairs with the nodes and edges of a directed graph, by allowing for attributes with multiple values.

As Wikidata, like RDF [2], has a single domain for everything, including predicates, we will be extending MARS in this direction, but in a way that does not permit some of the strange situations possible in RDF.

Definition 1. A datatype theory, D, consists of a finite set of named datatypes, D, each of which has a finite or infinite set of data values; a finite set of named and typed datatype relations, R, over D; and a finite set of named and typed datatype functions, F, over D. The relations are closed under negation.
So datatype theory for the rationals and integers would have as data values all the rational numbers (with the integers as a subset). The datatype functions and relations could include the comparison relations (both within each datatype and between the two datatypes) and arithmetic functions.

Definition 2. An extended MARS (eMARS), M, is a MARS extended with a datatype theory, D. All datatypes, datatype relations, and datatype functions of D as well as all predicates are distinct elements of the domain of M, δM. All data values in D are also elements of δM. Each datatype is a unary predicate of M which is true on the data values of the datatype.

The domain elements for datatypes, datatype relations, datatype functions, and predicates are all distinct, thus eliminating several unusual situations that can occur in RDF and can be forced in extensions of RDF.

Objects in Wikidata are items, which include predicates (properties). Facts in Wikidata are statements, consisting of a subject (an item) and a main snak. Snaks are predicate-object pairs, or some-value snaks, or no-value snaks. Statements also have associated qualifiers, which are also snaks. Statements have a rank, which is regular, preferred, or deprecated. Wikidata also provides optional typing information for the values of properties. We also use a characterization for each property used in a qualifier in Wikidata; a set of ontological rules for Wikidata; and a set of constraints.

Apresentação sobre o MARS -> https://web.stanford.edu/class/cs520/abstracts/pfps.pdf

Artigo 5 - MillenniumDB: A Persistent, Open-Source, Graph Database - Leitura de Artigo

... it (domain graph model) generalizes existing graph data models such as RDF and property graphs. We also show its utility in concisely modeling real-world knowledge graphs that contain higher-arity relations, such as Wikidata [43].

Formally, assume a universe Obj of objects (ids, strings, numbers, IRIs, etc.). We define domain graphs as follows:

Definition 2.1. A domain graph 𝐺 =(𝑂,𝛾) consists of a finite set of objects 𝑂 ⊆ Obj and a partial mapping 𝛾 : 𝑂 →𝑂 ×𝑂 ×𝑂.

Intuitively, 𝑂 is the set of objects that appear in our graph database, and 𝛾 models edges between objects. If 𝛾(𝑒) =(𝑛1,𝑡,𝑛2), this states that the edge (𝑛1,𝑡,𝑛2) has id 𝑒, type 𝑡, and links the source node 𝑛1 to the target node 𝑛2

We can analogously define our model as a relation ... where eid (edge id) is a primary key of the relation

DomainGraph(source,type,target,eid)

Domain graphs can directly capture property graphs, ... However, given a legacy property graph, there are some potential “incompatibilities” with the resulting domain graph

Connected Papers: Uma abordagem alternativa para revisão da literatura

Durante um projeto de pesquisa podemos encontrar um artigo que nos identificamos em termos de problema de pesquisa e também de solução. Então surge a vontade de saber como essa área de pesquisa se desenvolveu até chegar a esse ponto ou quais desdobramentos ocorreram a partir dessa solução proposta para identificar o estado da arte nesse tema. Podemos seguir duas abordagens: realizar uma revisão sistemática usando palavras chaves que melhor caracterizam o tema em bibliotecas digitais de referência para encontrar artigos relacionados ou realizar snowballing ancorado nesse artigo que identificamos previamente, explorando os artigos citados (backward) ou os artigos que o citam (forward) Mas a ferramenta Connected Papers propõe uma abordagem alternativa para essa busca. O problema inicial é dado um artigo de interesse, precisamos encontrar outros artigos relacionados de "certa forma". Find different methods and approaches to the same subject Track down the state of the art rese...

Pesquisa de Doutorado da Veronica

Pesquisar este blog

Graph Model Definitions - Hyper-relational

Marcadores

Comentários

Postar um comentário

Postagens mais visitadas deste blog

Connected Papers: Uma abordagem alternativa para revisão da literatura

Knowledge Graph Embedding with Triple Context - Leitura de Abstract

KnOD 2021