Q. Wang, Z. Mao, B. Wang and L. Guo, "Knowledge Graph Embedding: A Survey of Approaches and Applications," in IEEE Transactions on Knowledge and Data Engineering, vol. 29, no. 12, pp. 2724-2743, 1 Dec. 2017, doi: 10.1109/TKDE.2017.2754499.
- Techniques that conduct embedding using only facts observed in the KG
- Techniques that further incorporate additional information besides facts.
- How embeddings can be applied to and benefit a wide variety of tasks (in-KG applications and out-of-KG applications)
(3)
In-KG applications: link prediction, triple classification, entity classification, and entity resolution,
Link prediction is typically referred to as the task of predicting an entity that has a specific relation with
another given entity, i.e., predicting
With entity and relation representations learned beforehand, link prediction can be carried out simply by a ranking
procedure. Various evaluation metrics have been designed
based on such ranks, e.g., mean rank (the average of predicted ranks), mean reciprocal rank (the average of reciprocal
ranks), Hits@
Triple classification consists in verifying whether an unseen triple fact (h,r,t) is true or not. This task, again, can be seen as some sort of completion of the input KG. Triples with higher
scores tend to be true facts. Specifically, we introduce for each relation
Entity classification aims to categorize entities into different semantic categories, e.g.,
AlfredHitchcock is a Person, and Psycho a
CreativeWork. Given that in most cases the relation encoding entity types (denoted as IsA
) is contained in the KG and has already been included into the embedding process, entity classification
can be treated as a specific link prediction task, i.e.,
Entity resolution consists in verifying whether two entities refer to the same object. In some KGs many nodes
actually refer to identical objects. A scenario where the KG already
contains a relation stating whether two entities are equivalent (denoted as EqualTo) and an
embedding has been learned for that relation. In this case, entity resolution degenerates to a triple classification
problem, i.e., to judge whether the triple
proposed to perform entity resolution solely on the basis of entity
representations. More specifically, given two entities
Sobre a fórmula || x - y|| ... A norma ou módulo de um vetor é um número real que representa o comprimento desse vetor e a fração é a razão entre 2 e uma constante fornecida pelo usuário. Mas isso foi um teste que os autores de outro artigo fizeram e não uma definição / prova pelo trecho extraído abaixo:
Based on this representation we compute the similarity between two en-tities x,y by using the heat kernel k(x,y) =e−‖x−y‖2/δ,where δ is a user-given constant and use this similarity score as a measure for the likelihood that x and y refer to the same entity. This is a relative ad hoc approach to entity resolution, but the focus of this experiment is again rather on assessing the collective learning capabilities of our approach than conducting a full entity resolution experiment.
Out-of-KG applications are those which break through the boundary of the input KG and scale to broader domains. We introduce three such applications as examples, including relation extraction, question answering, and recommender systems.
Relation extraction aims to extract relational facts from plain text where entities have already been detected. For
example, given a sentence “Alfred Hitchcock directed Psycho” with the entities
Relation extraction by jointly embedding plain text and KGs. In their work, text and KGs are represented in the same matrix. Each row of the matrix stands for a pair of entities, and each column a textual mention or a KG relation. If two entities co-occur with a mention in plain text or with a relation in KGs, the corresponding entry is set to one, and otherwise to zero.
Question answering over KGs. Given a question expressed in natural language, the task is to retrieve the correct answer supported by a triple or set of triples from a KG. The use of KGs simplifies question answering by organizing a great variety of answers in a structured format. However, it remains a challenging task because of the great variability of natural language and of the large scale of KGs.
hybrid recommendation (systems) framework which leverages heterogeneous information in a KG to improve the quality of collaborative filtering.
Specifically, they used three types of information stored in the KG, including structural knowledge (triple facts),
textual knowledge (e.g., the textual summary of a book or a movie), and visual knowledge (e.g., a book's front
cover or a movie's poster image), to derive semantic representations for items
Comentários
Postar um comentário
Sinta-se a vontade para comentar. Críticas construtivas são sempre bem vindas.