Designing Graph Databases With GRAPHED

Designing Graph Databases With GRAPHED - Article Reading

While browsing my advisor's Lattes CV, I found a 2019 article on conceptual modeling for graph databases, written in partnership with researchers from UnB.

The article proposes a diagram notation for the conceptual model of connected data.

Journal of Database Management
Volume 30 • Issue 1 • January-March 2019

Designing Graph Databases With GRAPHED

Motivation: Although there have been advances in graph database technology, a notation to represent the conceptual graph model continues to present a challenge. There is no approach for data model widely accepted by the academic and business community in the graph databases. One advantage of data models is the ability to verify if some queries can be resolved with their structures, and this is important to validate the model requirements as well.

Definitions:

multigraphs are graphs that can have several edges between the same pair of vertices.
hypergraphs are graphs that enable more than two vertices in an edge (called hyperedge).
nested graphs are graphs with nodes built from other graphs, that is, they allow groups of vertices that represent an aggregation-level concept (hypernode).
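To make the three definitions concrete, here is a minimal sketch in plain Python. The vertex names, labels, and container choices are illustrative assumptions, not something taken from the article.

```python
from collections import Counter

# Multigraph: the same pair of vertices may be linked by several edges.
# All names and labels below are hypothetical.
multigraph_edges = [
    ("alice", "bob", "CALLS"),
    ("alice", "bob", "EMAILS"),
    ("bob", "carol", "CALLS"),
]
parallel_edges = Counter((src, dst) for src, dst, _ in multigraph_edges)

# Hypergraph: a single hyperedge may connect more than two vertices.
hyperedges = {
    "MEETING_1": frozenset({"alice", "bob", "carol"}),
}

# Nested graph: a hypernode is a vertex that contains a graph of its own.
hypernodes = {
    "FAMILY_A": {
        "vertices": {"alice", "bob"},
        "edges": [("alice", "bob", "MARRIED_TO")],
    },
}

print(parallel_edges[("alice", "bob")])  # number of edges between alice and bob
print(len(hyperedges["MEETING_1"]))      # number of vertices in the hyperedge
print(sorted(hypernodes["FAMILY_A"]["vertices"]))
```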

Proposal: An implementation-independent notation for conceptual graph data modeling named GRAPHED (Graph Description Diagram for Graph Databases) that covers the four types of graph structures: simple, with attributes, hyperedge and nested (Angles, 2012), as well as cardinality, weight and types.

Evaluation: Effectiveness and compatibility verification of GRAPHED in two case studies: fraud identification, and a biological network model.

Solution Details:

Simple and Attributed Vertex

the optional Type information (long in the Person example) is used to indicate the identifier’s domain.
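As a rough illustration of an attributed vertex with a typed identifier, the sketch below models Person as a Python dataclass. Mapping the "long" domain to Python's int, the extra name attribute, and the check_types helper are all assumptions made for this example.

```python
from dataclasses import dataclass, fields


@dataclass(frozen=True)
class Person:
    # Identifier; the "long" domain is mapped to Python's int (assumption).
    id: int
    # Hypothetical attribute, added only to show an attributed vertex.
    name: str


def check_types(vertex) -> bool:
    """Return True if every attribute value matches its declared type."""
    return all(isinstance(getattr(vertex, f.name), f.type) for f in fields(vertex))


print(check_types(Person(id=1, name="Ana")))
print(check_types(Person(id="one", name="Ana")))
```

Dataclasses do not enforce annotations at runtime, which is why the check is written as a separate helper.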

Hypervertex (Hypernode) for Nested Graphs

Simple and Attributed Edges

The difference between solid and dashed arrows indicates whether the relationship was derived from other relationships. It works as a notation for views or inferred relationships such as inferring that people with the same mother are siblings (how would the inference rule be defined?).
Other optional information that complements the edge is included as part of the label. This information includes the cardinality between the vertices for that relationship – represented by numbers in parenthesis – also, whether the type of relationship is a domain and not a constant string value, as well as the weight description indicating the value type, and the attributes of the relationship.
Edges can have attributes. Therefore, they can be written as in a vertex, appearing below the label with their names and types.
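One possible answer to the question about the inference rule is sketched below: the derived (dashed) sibling relationship is computed from stored (solid) edges. The HAS_MOTHER edge list, the names, and the derive_siblings function are hypothetical; the article does not prescribe how such rules are written.

```python
from collections import defaultdict
from itertools import combinations

# Stored (solid) edges of a hypothetical HAS_MOTHER relationship: (child, mother).
has_mother = [("ana", "maria"), ("bruno", "maria"), ("carla", "joana")]


def derive_siblings(child_mother_pairs):
    """Derive SIBLING_OF pairs: two people are siblings if they share a mother."""
    children_by_mother = defaultdict(list)
    for child, mother in child_mother_pairs:
        children_by_mother[mother].append(child)

    derived = set()
    for children in children_by_mother.values():
        # Sorting gives each undirected pair a single canonical orientation.
        for a, b in combinations(sorted(children), 2):
            derived.add((a, b))
    return derived


siblings = derive_siblings(has_mother)
print(siblings)
```

In a graph database the same rule would more likely live in a query or view; the point here is only that the derived edge is a function of existing edges.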

The other feature for edges was created to indicate not only the labels of the edge but also the domain for relationships if the edge is written with a type. In this case, the relationship is identified not only by a label but with a specific type, which specializes the edges for the specific relationship.

PARTNER_QUALIFICATION={SHAREHOLDER, DIRECTOR}
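A typed edge of this kind can be approximated with an enumeration that restricts the relationship to the values of its domain. In the sketch below only the PARTNER_QUALIFICATION values come from the notes above; the PartnerEdge class, its fields, and make_partner_edge are assumptions for illustration.

```python
from dataclasses import dataclass
from enum import Enum


class PartnerQualification(Enum):
    """Domain of the relationship type: PARTNER_QUALIFICATION."""
    SHAREHOLDER = "SHAREHOLDER"
    DIRECTOR = "DIRECTOR"


@dataclass(frozen=True)
class PartnerEdge:
    # Hypothetical endpoints: a person linked to a company.
    person_id: int
    company_id: int
    qualification: PartnerQualification


def make_partner_edge(person_id: int, company_id: int, label: str) -> PartnerEdge:
    # Enum lookup by value raises ValueError for labels outside the domain.
    return PartnerEdge(person_id, company_id, PartnerQualification(label))


edge = make_partner_edge(1, 10, "DIRECTOR")
print(edge.qualification.name)
```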

Hyperedges

It also covers hyperedges and hypernodes by respectively generalizing the simple edge and using a restructured concept from the hypernode. Hypervertices are the structures used to create nested graphs: they join vertices and edges in subgraphs and work as nodes as well. If the definition of an edge is extended from a binary relation to a finite subset of V, allowing more than two elements in e, an edge can link several vertices in the same relationship. This new type of subset is called a “hyperedge”, and a graph that supports it is a “hypergraph”. A hypergraph H(V, E) is therefore a set of vertices V with a set of relations E, where each e ∈ E has the form e = {v1, v2, …, vn} with {v1, v2, …, vn} ⊆ V. Hypergraphs enable several instances to be grouped in a single hyperedge, including instances of the same entity.

Additional considerations regarding other conceptual models for graphs:
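The set-based definition of a hypergraph translates almost directly into code. The following sketch checks that every hyperedge is a non-empty subset of the vertex set; the function name and the sample vertices are assumptions for illustration.

```python
def is_hypergraph(vertices, hyperedges) -> bool:
    """Check H(V, E): every hyperedge must be a non-empty subset of V."""
    vertex_set = set(vertices)
    return all(len(e) > 0 and set(e) <= vertex_set for e in hyperedges)


# Hypothetical instances: three people and one company.
V = {"person_1", "person_2", "person_3", "company_1"}

# One hyperedge groups several instances, including two of the same entity.
E = [frozenset({"person_1", "person_2", "company_1"})]

print(is_hypergraph(V, E))
print(is_hypergraph(V, [frozenset({"person_1", "unknown"})]))
```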

1) The ER model was not created with graph structures in mind, yet it adequately covers relationships, even when there are several entities. However, since an ER model covers simple and attributed graphs using the entities to represent nodes and relations for the edges, at least one extension should be defined to include directions for directed graphs, which are the most common data structure in graph databases.

A regular ER model could be used to represent the hyperedges linking the entities together. However, the ER cardinality notation describes the number of instances for a relationship with another instance of some entity. It does not specify how many instances might appear inside one instance of a relationship.

>> Can the ER model be extended to cover hypergraphs, yet without allowing a differentiated cardinality specification?

2) RDF describes both schema and instances together, and it has remained a relevant theme in semantic web research (Wylot, Hauswirth, Cudré-Mauroux, and Sakr, 2018). Anything in RDF is modeled as a Resource, and the RDF graph is a set of triples, composed of subjects connected to objects by predicates. It is suited to modeling graphs where even attributes are described as vertices linked by the predicates. However, it still does not support graphs where vertices and relationships can have attributes inside them. In this way, a more complex graph needs some notation extensions.

>> The attributes of a predicate can be specified through reification or by using a quad (g, s, p, o).

Interesting references to explore
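The two options can be sketched with plain Python tuples, without any RDF library. The rdf:Statement, rdf:subject, rdf:predicate and rdf:object terms belong to the standard RDF reification vocabulary; every ex: name and the "2019" value are hypothetical.

```python
# Base statement (s, p, o) that should carry an attribute.
triple = ("ex:alice", "ex:partnerOf", "ex:acme")

# Option 1: reification. The statement becomes a resource of its own,
# and the attribute is attached to that resource.
stmt = "ex:stmt1"
reified = {
    (stmt, "rdf:type", "rdf:Statement"),
    (stmt, "rdf:subject", triple[0]),
    (stmt, "rdf:predicate", triple[1]),
    (stmt, "rdf:object", triple[2]),
    (stmt, "ex:since", "2019"),  # hypothetical edge attribute
}

# Option 2: quads (g, s, p, o). The statement is placed in a named graph,
# and the attribute is attached to the graph name.
quads = {("ex:g1", *triple)}
graph_attributes = {("ex:g1", "ex:since", "2019")}


def edge_attributes_via_quads(statement):
    """Collect (attribute, value) pairs for a statement through its named graph."""
    graphs = {g for g, s, p, o in quads if (s, p, o) == statement}
    return {(p, o) for g, p, o in graph_attributes if g in graphs}


print(edge_attributes_via_quads(triple))
```

Reification costs four extra triples per annotated statement, while the quad form needs one named graph per annotated statement.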

Angles, R. (2012). A comparison of current graph database models. In 2012 IEEE 28th international conference on data engineering workshops (ICDEW) (pp. 171-177). IEEE. doi:10.1109/ICDEW.2012.31

Gil, D., & Song, I. Y. (2016). Modeling and Management of Big Data. Future Generation Computer Systems, 63(C), 96–99. doi:10.1016/j.future.2015.07.019

Kaur, K., & Rani, R. (2013). Modeling and querying data in NoSQL databases. In Proceedings of the IEEE International Conference on Big Data 2013 (pp. 1-7). IEEE. doi:10.1109/BigData.2013.6691765

Comments

Veronica dos Santos, April 30, 2021 at 16:21
None of the 3 references appears in my SLR, and the Journal of Database Management was not on the list of journals for selection. The reference Angles, R. (2012) is in the monograph for the NoSQL Graph Databases course.
Veronica dos Santos, July 1, 2021 at 17:22
The Journal of Database Management could be another option for submitting the SLR on graph data modeling.