Pular para o conteúdo principal

Graph Model Definitions - Hyper-relational

Artigo 1 - Improving Hyper-Relational Knowledge Graph Completion

Hyper-relational KGs (HKGs) go beyond conventional KGs by representing facts with more complex semantic information, e.g., using relation-entity pairs as the qualifiers of triplets. The combination of a triplet and its qualifiers together is called a statement.

In a hyper-relational KG GH , we denote the set of entities and relations as V and R respectively. The total number of entities is N and the number of relations is M. The edge connecting them, which we call a statement (or fact), is expressed in the domain V × R × V × P (R × V) where P denotes the power set. 

It’s usually written as (mh,mr,mt, Q) where (mh,mr,mt) is the main triplet of the statement containing head entity mh ∈ V, relation mr ∈ R and tail entity mt ∈ V respectively. Q is the set of qualifiers consisting n relation-entity pairs {(qri, qei)}n i=1 where qri ∈ R and qei ∈ V

(mh,mr,mt, {(qri, qei)})

Artigo 2 - Beyond Triplets: Hyper-Relational Knowledge Graph Embedding for Link Prediction

Despite its broad adoption, the triple-based representation of a KG often oversimplifies the complex nature of the data stored in the KG, in particular for hyper-relational data (a.k.a. multi-fold [38] or n-ary [14] relational data), where each fact contains multiple relations and entities.

Such hyper-relational data is ubiquitous in KGs.

However, representing a KG using triplets only often oversimplifies the complex nature of the data stored in the KG, in particular for hyper-relational data, where each fact contains multiple relations and entities (see example above).

Hyper-relational fact: A hyper-relational fact contains a base triplet (h,r,t) and a set of associated key-value pairs (ki ,vi), i = 1, ...,n.

(h,r,t, {(ki, vi)})

Artigo 3 - Logic on MARS: Ontologies for Generalised Property Graphs

We give a formalisation of a generalised notion of Property Graphs, called multi-attributed relational structures (MARS),

There is no standard definition of what constitutes a KG, and the formats used in practice vary. The basis of KGs typically are directed graphs with labelled nodes and edges. What distinguishes them from plain graphs is their enriched structure that includes additional annotations to provide contextual information for every edge or node. Examples include provenance (source information) and temporal validity, but there can be many other types of annotations.

A popular data model for such KGs is the Property Graph model, used by the Neo4J graph database ... It allows sets of attribute–value pairs to be associated with the nodes and edges in a directed graph. Such graphs are also known as attributed graphs.

In fact, even the underlying data model of a multi-attributed graph lacks proper formalisation. Property Graph and Wikidata are highly implementation bound and have no formal specification.

We consider a finite set P of predicates, where each p ∈ P has an associated arity ar(p) ≥ 0. If not otherwise stated, we assume this signature to be fixed and refrain from mentioning it. For the following definition, let Pfin (S) be the set of all finite subsets of set S.

Definition 1. A multi-attributed relational structure (MARS) M consists of a non-empty set ∆M of domain elements and, for each n-ary predicate p ∈ P, an (n + 1)-ary relation pM ⊆ (∆M) n × Pfin (∆M × ∆M).

In other words, a MARS behaves like a relational structure (i.e., hypergraph) over a domain ∆M, where each relation tuple (i.e., hyperedge) is annotated with a finite binary relation over ∆M. We view this relation as a set of attribute–value pairs. There might be multiple values for each attribute, justifying
our terminology. Also note that the same relational tuple may occur with different attribute–value collections within a single MARS. Thus MARS generalise Property Graphs, where attributes are functional and relations are unary and binary. The unary relations can be used to assign attribute–value collections to nodes.

Artigo 4 - Wikidata on MARS⋆

As noted in [9], Wikidatas custom data model supports attributed statements (with the attributes referred to as qualifiers), and allows attributes with multiple values.  

“In spite of the huge practical significance of these data models ..., there is practically no support for using such data in knowledge representation.”

Wikidata’s custom data model goes beyond the Property Graph data model, which associates sets of attribute-value pairs with the nodes and edges of a directed graph, by allowing for attributes with multiple values.

As Wikidata, like RDF [2], has a single domain for everything, including predicates, we will be extending MARS in this direction, but in a way that does not permit some of the strange situations possible in RDF.

Definition 1. A datatype theory, D, consists of a finite set of named datatypes, D, each of which has a finite or infinite set of data values; a finite set of named and typed datatype relations, R, over D; and a finite set of named and typed datatype functions, F, over D. The relations are closed under negation.
So datatype theory for the rationals and integers would have as data values all the rational numbers (with the integers as a subset). The datatype functions and relations could include the comparison relations (both within each datatype and between the two datatypes) and arithmetic functions.

Definition 2. An extended MARS (eMARS), M, is a MARS extended with a datatype theory, D. All datatypes, datatype relations, and datatype functions of D as well as all predicates are distinct elements of the domain of M, δM. All data values in D are also elements of δM. Each datatype is a unary predicate of M which is true on the data values of the datatype.

The domain elements for datatypes, datatype relations, datatype functions, and predicates are all distinct, thus eliminating several unusual situations that can occur in RDF and can be forced in extensions of RDF.

Objects in Wikidata are items, which include predicates (properties). Facts in Wikidata are statements, consisting of a subject (an item) and a main snak. Snaks are predicate-object pairs, or some-value snaks, or no-value snaks. Statements also have associated qualifiers, which are also snaks. Statements have a rank, which is regular, preferred, or deprecated. Wikidata also provides optional typing information for the values of properties. We also use a characterization for each property used in a qualifier in Wikidata; a set of ontological rules for Wikidata; and a set of constraints.

Apresentação sobre o MARS -> https://web.stanford.edu/class/cs520/abstracts/pfps.pdf

Artigo 5 - MillenniumDB: A Persistent, Open-Source, Graph Database - Leitura de Artigo

... it (domain graph model) generalizes existing graph data models such as RDF and property graphs. We also show its utility in concisely modeling real-world knowledge graphs that contain higher-arity relations, such as Wikidata [43]. 

Formally, assume a universe Obj of objects (ids, strings, numbers, IRIs, etc.). We define domain graphs as follows: 

Definition 2.1. A domain graph 𝐺 =(𝑂,𝛾) consists of a finite set of objects 𝑂 ⊆ Obj and a partial mapping 𝛾 : 𝑂 →𝑂 ×𝑂 ×𝑂. 

Intuitively, 𝑂 is the set of objects that appear in our graph database, and 𝛾 models edges between objects. If 𝛾(𝑒) =(𝑛1,𝑡,𝑛2), this states that the edge (𝑛1,𝑡,𝑛2) has id 𝑒, type 𝑡, and links the source node 𝑛1 to the target node 𝑛2 

We can analogously define our model as a relation ... where eid (edge id) is a primary key of the relation 

DomainGraph(source,type,target,eid

Domain graphs can directly capture property graphs, ... However, given a legacy property graph, there are some potential “incompatibilities” with the resulting domain graph


 

  

Comentários

Postagens mais visitadas deste blog

Connected Papers: Uma abordagem alternativa para revisão da literatura

Durante um projeto de pesquisa podemos encontrar um artigo que nos identificamos em termos de problema de pesquisa e também de solução. Então surge a vontade de saber como essa área de pesquisa se desenvolveu até chegar a esse ponto ou quais desdobramentos ocorreram a partir dessa solução proposta para identificar o estado da arte nesse tema. Podemos seguir duas abordagens:  realizar uma revisão sistemática usando palavras chaves que melhor caracterizam o tema em bibliotecas digitais de referência para encontrar artigos relacionados ou realizar snowballing ancorado nesse artigo que identificamos previamente, explorando os artigos citados (backward) ou os artigos que o citam (forward)  Mas a ferramenta Connected Papers propõe uma abordagem alternativa para essa busca. O problema inicial é dado um artigo de interesse, precisamos encontrar outros artigos relacionados de "certa forma". Find different methods and approaches to the same subject Track down the state of the art rese...

Knowledge Graph Embedding with Triple Context - Leitura de Abstract

  Jun Shi, Huan Gao, Guilin Qi, and Zhangquan Zhou. 2017. Knowledge Graph Embedding with Triple Context. In Proceedings of the 2017 ACM on Conference on Information and Knowledge Management (CIKM '17). Association for Computing Machinery, New York, NY, USA, 2299–2302. https://doi.org/10.1145/3132847.3133119 ABSTRACT Knowledge graph embedding, which aims to represent entities and relations in vector spaces, has shown outstanding performance on a few knowledge graph completion tasks. Most existing methods are based on the assumption that a knowledge graph is a set of separate triples, ignoring rich graph features, i.e., structural information in the graph. In this paper, we take advantages of structures in knowledge graphs, especially local structures around a triple, which we refer to as triple context. We then propose a Triple-Context-based knowledge Embedding model (TCE). For each triple, two kinds of structure information are considered as its context in the graph; one is the out...

KnOD 2021

Beyond Facts: Online Discourse and Knowledge Graphs A preface to the proceedings of the 1st International Workshop on Knowledge Graphs for Online Discourse Analysis (KnOD 2021, co-located with TheWebConf’21) https://ceur-ws.org/Vol-2877/preface.pdf https://knod2021.wordpress.com/   ABSTRACT Expressing opinions and interacting with others on the Web has led to the production of an abundance of online discourse data, such as claims and viewpoints on controversial topics, their sources and contexts . This data constitutes a valuable source of insights for studies into misinformation spread, bias reinforcement, echo chambers or political agenda setting. While knowledge graphs promise to provide the key to a Web of structured information, they are mainly focused on facts without keeping track of the diversity, connection or temporal evolution of online discourse data. As opposed to facts, claims are inherently more complex. Their interpretation strongly depends on the context and a vari...