Pular para o conteúdo principal

Towards a Definition of Knowledge Graphs - Leitura de Artigo

Ehrlinger, Lisa and Wolfram Wöß. “Towards a Definition of Knowledge Graphs.” SEMANTiCS (2016).

ABSTRACT

The prerequisite for widespread academic and commercial adoption of a concept or technology is a common understanding, based ideally on a definition that is free from ambiguity. We tackle this issue by discussing and defining the term knowledge graph, considering its history and diversity in interpretations and use.
 
1. INTRODUCTION
 
Other definitions may lead to the assumption that knowledge graph is a synonym for any graph-based knowledge representation (cf. [12, 16]). We argue that such a definition is not enough for an adequate application of knowledge graphs, since it does not enforce a minimum set of requirements a KG has to fulfill. Thus, even a simple graph-based vocabulary could be published as knowledge graph.  
 
2. SELECTED DEFINITIONS
 
Knowledge graphs have been in the focus of research since 2012 resulting in a wide variety of published descriptions and definitions. 
 
A knowledge graph (i) mainly describes real world entities and their interrelations, organized in a graph, (ii) defines possible classes and relations of entities in a schema, (iii) allows for potentially interrelating arbitrary entities with each other and (iv) covers various topical domains.
Paulheim [16] ...
listed in his survey of knowledge graph refinement the minimum set of characteristics that must be present to distinguish knowledge graphs from other knowledge collections, which basically restricts the term to any graph-based knowledge representation.
 
** A estrutura em grafo, a forma como os dados são representados, seria o diferenciados entre KG e KB **
 
Knowledge graphs are large networks of entities, their semantic types, properties, and relationships between entities.
Journal of Web Semantics [12]
 
Knowledge graphs could be envisaged as a network of all kind things which are relevant to a specific domain or to an organization. They are not limited to abstract concepts and relations but can also contain instances of things like documents and datasets.
Semantic Web Company [3]
 
Vague descriptions of knowledge graphs were published in the announcement of a special issue on knowledge graphs by the Journal of Web Semantics and by the Semantic Web Company. Both definitions could equally well describe an ontology or – even more generally – any kind of semantic knowledge representation and do not even enforce a graph structure. In addition, size is highlighted as an essential characteristic, which is reflected by phrases such as “large networks” or “vast networks”
[11], while it remains unclear what “large” means in this context.
 
** Na Era Big Data, o quanto de volume é necessário para ser considerado Big ainda é nebuloso ** 
** Não deixam claro ser em grafo mas quando falam em ser em rede estão deixando isso implícito **
** Ontologias também tem instâncias então não seria isso que diferente a Ontologia de um KG ** 
 
We define a Knowledge Graph as an RDF graph. An RDF graph consists of a set of RDF triples where each RDF triple (s, p, o) is an ordered set of the following RDF terms: a subject s U B, a predicate p U , and an object o
U B L. An RDF term is either a URI u U , a blank node b B, or a literal l L.
Färber et al. [7] ...
defined a knowledge graph as an Resource Description Framework (RDF) graph and stated that the term KG was coined by Google to describe any graph-based knowledge base (KB) [7]. Although this definition is the only formal one, it contradicts with more general definitions as it explicitly requires the RDF data model.
 
** Restrito ao modelo RDF **
 
[...] systems exist, [...], which use a variety of techniques to extract new knowledge, in the form of facts, from the web. These facts are interrelated, and hence, recently this extracted knowledge has been referred to as a knowledge graph.
Pujara et al. [17] ...
did not provide a concise definition, but rather described the characteristics of knowledge graphs. Unlike the other definitions, which focus solely on the inner structure of the KG, they highlighted the importance of an automatic extraction system.
 
** A capacidade de inferir novo conhecimento, revelar conhecimento implícito, seria uma característica dos KGs, mas ontologias também podem inferir conhecimento usando uma máquina de regras **
 
3. KNOWLEDGE GRAPH APPLICATIONS
 
In the 1980s, researchers from the University of Groningen and the University of Twente in the Netherlands initially introduced the term knowledge graph to formally describe their knowledge-based system that integrates knowledge from different sources for representing natural language [10, 15]. The authors proposed KGs with a limited set of relations and focus on qualitative modeling including human interaction, which clearly contrasts with the idea of KGs that has been widely discussed in recent years.
 
** Primeiras ocorrências do KG que não consideram as mesmas características esperadas de um KG **
 
In 2012, Google introduced the Knowledge Graph as a semantic enhancement of Google’s search function that does not match strings, but enables searching for “things”, in other words, real-world objects [18]. .... Since 2012, the term knowledge graph is also used to describe a family of applications. Frequently mentioned implementations are DBPedia, YAGO (Yet Another Great Ontology), Freebase, Wikidata, Yahoo’s semantic search assistant tool Spark, Google’s Knowledge Vault, Microsoft’s Satori and Facebook’s entity graph [7, 14, 16, 11].  
 
** Estado da Arte em termos de aplicações que usam KGs mas seriam mesmo KGs? **
 
4. TERMINOLOGICAL ANALYSIS AND DEFINITION
 
the terms knowledge graph and knowledge base are <Have Been> used interchangeably ... leads to the misleading assumption that the term knowledge graph is a synonym for knowledge base, which is itself often used as synonym for ontology.
 
** KG = KB ~= Ontologia ** 
** Muitas aplicações que usam/usaram ontologia só exploraram o lado terminológico como hierarquia de classes/conceitos e dentro do mesmo domínio, não exploraram inferências com instâncias em "grande" volume **
 
According to Akerkar and Sajja [1], a knowledge-based system uses artificial intelligence to solve problems, and it consists of two parts: a knowledge base and an inference engine. The knowledge base is a dataset with formal semantics that can contain different kinds of knowledge, for example, rules, facts, axioms, definitions, statements, and primitives [4].
 
** KB não tem só dados, tem regras, definições, restrições, etc ... que permitem inferir mais conhecimento **
 
An ontology is as a formal, explicit specification of a shared conceptualization that is characterized by high semantic expressiveness required for increased complexity [9]. Ontological representations allow semantic modeling of knowledge, and are therefore commonly used as knowledge bases in artificial intelligence (AI) applications, for example, in the context of knowledge-based systems. Application of an ontology as knowledge base facilitates validation of semantic relationships and derivation of  conclusions from known facts for inference (i.e., reasoning) [9]. We explicitly emphasize that an ontology does not differ from a knowledge base, although ontologies are sometimes erroneously classified as being at the same level as database schemas [6]. In fact, an ontology consists not only of classes and properties (e.g., owl:ObjectProperty and owl:DatatypeProperty), but can also hold instances (i.e., the population of the ontology).
 
** Ontologia também pode ter instâncias e regras, apesar de pouco explorado **
 
Thus, the difference between a knowledge graph and an ontology could be interpreted either as a matter of quantity (e.g., a large ontology), or of extended requirements (e.g., a built-in reasoner that allows new knowledge to be derived). 
Focusing on existing automatically generated “knowledge graphs”, we can identify a further essential characteristic: collection, extraction, and integration of information from external sources extends a pure knowledge-based system with the concept of integration systems.
 
 
 
** Dois requisitos: volume (?), obtido através da integração de diversas fontes, e um reasoner embutido **
 
A knowledge graph acquires and integrates information into an ontology and applies a reasoner to derive new knowledge. 
 
This definition aligns with the assumption that a knowledge graph is somehow superior and more complex than a knowledge base (e.g., an ontology) because it applies a reasoning engine to generate new knowledge and integrates one or more information sources. Consequently, a manually created knowledge graph that does not support integration aspects is a plain knowledge base or knowledge-based system if it provides reasoning capabilities.
 
** Wikidata é criada manualmente mas também recebe carga de dados **
 
The definition does not take the quantity aspect (size) into account, especially with respect to a large ABox of the ontology, since it is not clear what can be considered “large”. 
 
Furthermore, the question arises what constitutes the difference between the Semantic Web and knowledge graphs. Smaller KGs, for example, enterprise knowledge graphs, can be clearly differentiated from the Semantic Web because of their restricted domain. ... the Semantic Web could be interpreted as the most comprehensive knowledge graph, or – conversely – a knowledge graph that crawls the entire web could be interpreted as self-contained Semantic Web.

 

Comentários

Postagens mais visitadas deste blog

Connected Papers: Uma abordagem alternativa para revisão da literatura

Durante um projeto de pesquisa podemos encontrar um artigo que nos identificamos em termos de problema de pesquisa e também de solução. Então surge a vontade de saber como essa área de pesquisa se desenvolveu até chegar a esse ponto ou quais desdobramentos ocorreram a partir dessa solução proposta para identificar o estado da arte nesse tema. Podemos seguir duas abordagens:  realizar uma revisão sistemática usando palavras chaves que melhor caracterizam o tema em bibliotecas digitais de referência para encontrar artigos relacionados ou realizar snowballing ancorado nesse artigo que identificamos previamente, explorando os artigos citados (backward) ou os artigos que o citam (forward)  Mas a ferramenta Connected Papers propõe uma abordagem alternativa para essa busca. O problema inicial é dado um artigo de interesse, precisamos encontrar outros artigos relacionados de "certa forma". Find different methods and approaches to the same subject Track down the state of the art rese...

Knowledge Graph Embedding with Triple Context - Leitura de Abstract

  Jun Shi, Huan Gao, Guilin Qi, and Zhangquan Zhou. 2017. Knowledge Graph Embedding with Triple Context. In Proceedings of the 2017 ACM on Conference on Information and Knowledge Management (CIKM '17). Association for Computing Machinery, New York, NY, USA, 2299–2302. https://doi.org/10.1145/3132847.3133119 ABSTRACT Knowledge graph embedding, which aims to represent entities and relations in vector spaces, has shown outstanding performance on a few knowledge graph completion tasks. Most existing methods are based on the assumption that a knowledge graph is a set of separate triples, ignoring rich graph features, i.e., structural information in the graph. In this paper, we take advantages of structures in knowledge graphs, especially local structures around a triple, which we refer to as triple context. We then propose a Triple-Context-based knowledge Embedding model (TCE). For each triple, two kinds of structure information are considered as its context in the graph; one is the out...

KnOD 2021

Beyond Facts: Online Discourse and Knowledge Graphs A preface to the proceedings of the 1st International Workshop on Knowledge Graphs for Online Discourse Analysis (KnOD 2021, co-located with TheWebConf’21) https://ceur-ws.org/Vol-2877/preface.pdf https://knod2021.wordpress.com/   ABSTRACT Expressing opinions and interacting with others on the Web has led to the production of an abundance of online discourse data, such as claims and viewpoints on controversial topics, their sources and contexts . This data constitutes a valuable source of insights for studies into misinformation spread, bias reinforcement, echo chambers or political agenda setting. While knowledge graphs promise to provide the key to a Web of structured information, they are mainly focused on facts without keeping track of the diversity, connection or temporal evolution of online discourse data. As opposed to facts, claims are inherently more complex. Their interpretation strongly depends on the context and a vari...