cherryteastain:
There are a lot of papers using GNNs for physics simulations (e.g. computational fluid dynamics) because the unstructured meshes used to discretize the problem domain for such applications map very neatly to a graph structure.

In practice, every such mesh/graph is used once to solve a particular problem, so it makes little sense to train a GNN for a specific graph. However, that's exactly what most papers did, because no one has found a way to make a GNN that generalizes well to a different mesh/graph and different simulation parameters. I wonder if there's a breakthrough waiting just around the corner to make such generalization possible.
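
To make the mesh-to-graph mapping concrete, here is a minimal sketch: a toy triangle mesh turned into an adjacency matrix, plus one hand-rolled message-passing step. The mesh, feature sizes, and update rule are illustrative assumptions, not taken from any particular paper.

    import numpy as np

    # Toy unstructured mesh: 4 nodes, 2 triangles sharing an edge.
    triangles = [(0, 1, 2), (1, 2, 3)]

    # Each triangle edge becomes a (symmetric) graph edge.
    n = 4
    adj = np.zeros((n, n))
    for a, b, c in triangles:
        for u, v in [(a, b), (b, c), (a, c)]:
            adj[u, v] = adj[v, u] = 1.0

    # Per-node state, e.g. velocity components at each mesh node (random here).
    rng = np.random.default_rng(0)
    x = rng.standard_normal((n, 3))

    # One message-passing step: average neighbor features, then a
    # linear update (random weights stand in for trained parameters).
    neighbor_mean = adj @ x / adj.sum(axis=1, keepdims=True)
    W_self = rng.standard_normal((3, 3))
    W_nbr = rng.standard_normal((3, 3))
    x_new = np.tanh(x @ W_self + neighbor_mean @ W_nbr)
    print(x_new.shape)  # (4, 3): updated state for every mesh node

Training such a step end-to-end on one mesh is what the parent describes; the open question is making the learned weights transfer to a different mesh.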

magicalhippo:
Naive question:

Words in sentences kind of form graphs: they reference other words, or are leaves being referenced, both within sentences and across sentences.

Given the success of the attention mechanism in modern LLMs, how well would they do if you trained an LLM to process an actual graph?

I guess you'd need some alternate tokenizer for optimal performance.
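
One naive way to feed a graph to a sequence model is to linearize it into tokens. The token scheme below is made up purely for illustration; published graph tokenizers are more sophisticated.

    # Hypothetical edge-list linearization; the special tokens are invented
    # for this example and do not correspond to any real tokenizer.
    edges = [(0, 1), (1, 2), (2, 0), (2, 3)]

    tokens = ["<graph>"]
    for u, v in edges:
        tokens += ["<edge>", f"n{u}", f"n{v}"]
    tokens.append("</graph>")

    print(" ".join(tokens))
    # <graph> <edge> n0 n1 <edge> n1 n2 <edge> n2 n0 <edge> n2 n3 </graph>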

disattention:
This is actually a good insight. It turns out that transformers are indeed a form of graph network, precisely because of the attention mechanism; graph attention networks are a very popular GNN architecture. Generally, the issue with using an LLM-style architecture for generic graphs is modeling the sparsity, but this can be done by using the graph adjacency matrix to mask the attention matrix. There are a number of papers and articles that address this connection, and plenty of research into mechanisms for sparsifying attention in transformers.
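
As a concrete illustration of the masking idea, here is a minimal NumPy sketch of single-head self-attention where the adjacency matrix (with self-loops) zeroes out attention between non-adjacent nodes. Learned Q/K/V projections and multiple heads are omitted for brevity, and the shapes and graph are assumptions.

    import numpy as np

    def masked_attention(x, adj):
        """Scaled dot-product self-attention restricted to graph edges.

        x:   (n, d) node features
        adj: (n, n) adjacency matrix, 1 where attention is allowed
             (include self-loops so each row has a valid entry)
        """
        d = x.shape[1]
        scores = x @ x.T / np.sqrt(d)             # no learned Q/K/V, for brevity
        scores = np.where(adj > 0, scores, -1e9)  # mask out non-edges
        w = np.exp(scores - scores.max(axis=1, keepdims=True))
        w /= w.sum(axis=1, keepdims=True)         # row-wise softmax
        return w @ x

    n = 4
    adj = np.eye(n)                               # self-loops
    for u, v in [(0, 1), (1, 2), (2, 3)]:         # a path graph
        adj[u, v] = adj[v, u] = 1.0
    x = np.random.default_rng(0).standard_normal((n, 3))
    print(masked_attention(x, adj).shape)         # (4, 3)

With an all-ones adjacency matrix this reduces to ordinary dense self-attention, which is exactly the sense in which a transformer is a GNN on a complete graph.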

There are also graph tokenizers that let more standard transformers operate on graphs, for tasks like classification, generation, and community detection.

algo_trader:
Any canonical papers on GNN for code graphs?