TigerGraph 101: An Intro to Graph - WEBINAR - 4 episódios

Vértices e Arestas ficam em memória em tempo de execução, o vértice é uma área de memória que contém o primary ID e as arestas são ponteiros para essas áreas. Atributos (chave e valor) de vértices e arestas ficam em disco. Assim se a operações percorrer um caminho no grafo são resolvidas em memória, sendo extremamente rápidas, se a operação envolver atributos então é necessário fazer acesso a disco, sendo mais lenta.

Se um determinado atributo de um vértice for usado como filtro esse deve ser modelado como um vértice e não como um atributo.

You CAN NOT have more than one of the same Edge type between the same two Vertices

Uma aresta de um tipo X só ocorre uma vez em um par de vértices. Ou seja, v -X> u e v -Y> u estão OK mas não pode haver outra aresta do tipo X ou Y mesmo que com atributos diferentes. O primary ID de aresta é formado por primary ID do primeiro vértice + tipo da aresta + primary ID do segundo vértice. Nesse caso não é possível modelar fatos que envolvem as mesmas entidades em contextos diferentes. Por exemplo: MBachelet -president> Chile

Existem funções para gerar UUID (gsql_uuid_v4) em caso de entidades sem primary ID natural para os vértices / nós.

3 - https://youtu.be/sJ5o_b_9G0s

Schema Modeling and Data Loading

Presentation: https://docs.google.com/presentation/d/1xHOoWGYtnhdzxlAKgIJB3CfR1LR0diTShoSKlyhXpB0/edit?usp=sharing

Notebook / repo: https://github.com/DanBarkus/Patent_Graph

A modelagem e carga de dados poder ser feita através da interface gráfica ou por linha de código com GSQL

Global Schema x Graph Schema: é possível aproveitar partes do global no local (grafo) e não é necessário aderir a todo o Global Schema. A modificação do grafo local é por job e a do global é por comando. Existe interdependência para realizar alterações no global que afetam o local.

Sugestão de convenção de nomes para tipo de vértices, tipo de arestas e nome de atributos.

Somente um atributo por vértice pode ser indexado para acelerar o look up. É possível definir valores default para atributos sem valor no momento da carga.

As arestas podem ser direcionadas ou não, com GSQL é possível percorrer o grafo em qualquer direção (se criar o reverse edge)

O schema deve ser publicado.

GSQL Pattern
CREATE VERTEX <VERTEX_TYPE_NAME>(PRIMARY_ID <PRIMARY_ID_NAME> <PRIMARY_ID_TYPE>, <ATTRIBUTE_NAME> <ATTRIBUTE_TYPE>, ...)

CREATE DIRECTED|UNDIRECTED EDGE <EDGE_TYPE_NAME>(FROM <SOURCE_VERTEX_TYPE>, TO <DESTINATION_VERTEX_TYPE>, <ATTRIBUTE_NAME> <ATTRIBUTE_TYPE>, ...)

Data Mapping para carregar os dados de arquivos para o grafo. O primeiro passo é carregar os arquivos.

É possível juntar campos dos arquivos de entrada para gerar um uniqueID, caso não exista. Cada arquivo deve ter um loading job com os mapeamentos para carregar, é possível carregar dois arquivos em paralelo mas não é o default da interface.

É possível deletar vértices e arestas assim como dropar todos o grafo.

VOLTAR PARA REVER O JNOTEBOOK COM O EXEMPLO

4 - https://youtu.be/dP7ammiYqQ0

Querying and Beyond

Presentation:

https://docs.google.com/presentation/d/1d-M_mP5jIAAp3fjtHqOblQk7bbygRBBoQZRtFijPQ7g/edit?usp=sharing

Installed vs Interpreted Queries

Installed .... armazenadas, podem receber parâmetros por variáveis

• Compiled for fastest possible runtime performance
• REST endpoint created for easy access
• Takes 1-2 minutes to compile and install

Interpreted ... executadas sob demanda, ad-hoc
• Immediately runnable
• Faster query writing iteration time
• Some limited functionality

The SELECT Statement: Utilizes graph patterns to traverse edges

Sempre opera sobre um conjunto de vértices (vértices únicos).

A Basic Query

Most basic SELECT that you can write
• Creates a vertex_set of all Inventor vertices to use as the source_vertex_set
• Selects all vertices from source_vertex_set
• Places all vertices in result_vertex_set
• Print result_vertex_set

Pattern Matching

Patterns define edge traversals made by the SELECT statement
Follow the format: vertex_type - (edge_type) - vertex_type
Pattern: source_vertex_set :alias_vertex_set - (edge_type:edge_type_alias ) - destination_vertex_type :destination_alias

definir o tipo de vértice de origem, o tipo de aresta e o tipo de vértice de destino

Example:
start:s - (filed_application:f) - Application:a

Exist within the FROM clause of the SELECT statement
Aliases and edge_type do not need to be included if not needed

FROM é o MATCH do Cypher

Pattern Matching Query: Will output all Application vertices connected to an Inventor vertex by any edge

WHERE Clause: Specifies conditions that must be met for a vertex or edge to be selected
Multiple checks can be strung together with commas between

WHERE can be used on any vertex or edge attribute

** Entender melhor se é possível fazer filtro e join com edge attirbute **

Global ACCUM Query

For each edge between an Inventor named Joseph and an Application, the @@applications SumAccum will increment by 1

CREATE QUERY sample() FOR GRAPH Patents

{ SumAccum<INT> @@applications; start = {Inventor.*};

result = SELECT s FROM start:s - () - Application:a

WHERE s.first == “Joseph” ACCUM @@applications += 1;

Totalizar os nós do tipo application que estão ligados a nós com o first(name) = Joseph

Com um único @ antes da variável de acumulador passa a ser local accumulator e funciona como um atributo para o vértice, funcionam somente dentro do bloco de "DML"

Cláusula HAVING pode ser aplicada no resultado do accumulator

GSQL tem LIMIT, OFFSET, ORDER BY,

Multi-hops (path query), repeated patterns (recursão)

Knowledge graphs: Introduction, history, and perspectives - Leitura de Artigo

Chaudhri, V. K., C. Baru, N. Chittar, X. L. Dong, M. Genesereth, J. Hendler, A. Kalyanpur, D. Lenat, J. Sequeda, D. Vrandečić, and K.Wang 2022. “ Knowledge graphs: Introduction, history, and perspectives. ” AI Magazine 43: 17–29. https://doi.org/10.1002/aaai.12033 Knowledge graphs (KGs) have emerged as a compelling abstraction for organizing the world’s structured knowledge and for integrating information extracted from multiple data sources. KNOWLEDGE GRAPH DEFINITION A KG is a directed labeled graph in which domain-specific meanings are associated with nodes and edges. [ Definição focado no COMO representar, diferente dos KBs ] There are multiple approaches for associating meanings with the nodes and edges. At the simplest level, the meanings could be stated as documentation strings expressed in a human understandable language such as English. At a computational level, the meanings can be expressed in a formal specification language such as first-order logic. An active area of curren...

Pesquisa de Doutorado da Veronica

Pesquisar este blog

TigerGraph 101: An Intro to Graph - WEBINAR - 4 episódios

Querying and Beyond

Installed vs Interpreted Queries

Installed .... armazenadas, podem receber parâmetros por variáveis

• Compiled for fastest possible runtime performance
• REST endpoint created for easy access
• Takes 1-2 minutes to compile and install

Interpreted ... executadas sob demanda, ad-hoc
• Immediately runnable
• Faster query writing iteration time
• Some limited functionality

Marcadores

Comentários

Postar um comentário

Postagens mais visitadas deste blog

Aprendizado de Máquina Relacional

Connected Papers: Uma abordagem alternativa para revisão da literatura

Knowledge graphs: Introduction, history, and perspectives - Leitura de Artigo

Pesquisa de Doutorado da Veronica

TigerGraph 101: An Intro to Graph - WEBINAR - 4 episódios

Querying and Beyond

Installed vs Interpreted Queries

Installed .... armazenadas, podem receber parâmetros por variáveis

• Compiled for fastest possible runtime performance• REST endpoint created for easy access• Takes 1-2 minutes to compile and install

Interpreted ... executadas sob demanda, ad-hoc• Immediately runnable• Faster query writing iteration time• Some limited functionality

Marcadores

Comentários

Postar um comentário

Postagens mais visitadas deste blog

Aprendizado de Máquina Relacional

Connected Papers: Uma abordagem alternativa para revisão da literatura

Knowledge graphs: Introduction, history, and perspectives - Leitura de Artigo

• Compiled for fastest possible runtime performance
• REST endpoint created for easy access
• Takes 1-2 minutes to compile and install

Interpreted ... executadas sob demanda, ad-hoc
• Immediately runnable
• Faster query writing iteration time
• Some limited functionality