How do I choose between node-level, edge-level, and graph-level tasks?

Match the task level to what you want to predict. If you want a label for each entity (user, document, protein), use node-level. If you want to predict a relationship between two entities (will they interact? should we recommend this?), use edge-level. If you want to characterize an entire system as a whole (is this molecule toxic? what category does this circuit belong to?), use graph-level. The GNN architecture is nearly identical for all three — only the output layer and loss function change.

What is negative sampling and why is it necessary for link prediction?

Negative sampling is the practice of creating artificial 'non-edges' during training. In a real graph, you only observe connections that exist. If you only train on positive examples, the model has no signal for what a non-connection looks like and will predict every pair as connected. By randomly sampling pairs of nodes with no edge and labeling them as negatives, you give the model a contrastive signal: 'these two nodes should have a low similarity score.' The ratio of negatives to positives (often 1:1 or 1:5) is an important hyperparameter.

Can a single GNN model perform both node-level and graph-level tasks simultaneously?

Yes. The message passing layers are shared and produce node embeddings for both tasks. For node-level outputs, you apply a per-node classifier directly to those embeddings. For graph-level outputs, you apply a readout layer to aggregate them into a graph vector, then pass that through a separate classifier. This is called multi-task learning on graphs and is used in production systems where you might want to simultaneously predict user behavior (node-level) and community health (graph-level).

🚀 LEVEL UP TO SENIOR:Unlock 500+ Advanced Practical Challenges & Exercises.

🎓 COURSERA PARTNER:Earn professional Google, Meta, and IBM certificates to supercharge your resume.

Tutorials

HTML MASTER CLASS /// LEARN TAGS /// BUILD STRUCTURE /// SEMANTIC WEB /// HTML MASTER CLASS /// LEARN TAGS ///

⚡ Total XP: 0|💻 artificialintelligence XP: 0

Node, Edge, and Graph Tasks in AI & Artificial Intelligence

Explore the full taxonomy of graph learning tasks. From labeling individual nodes to predicting missing links and classifying entire molecular systems. Learn how to frame any relational problem as a GNN task, understand the readout mechanism for graph-level inference, and see how regression and classification both apply across all three levels.

LOADING ENGINE...

Skill Matrix

UNLOCK NODES BY LEARNING NEW TAGS.

Task Hub

Prediction levels.

Quick Quiz //

You need to flag individual bank accounts as fraudulent in a transaction network. Which task type is this?

Before writing a single line of GNN code, you need to answer one question: what are you predicting? GNNs operate at three distinct levels of granularity — the node, the edge, and the entire graph. Choosing the right level determines your model's output layer, your loss function, and how you evaluate success.

1Node-Level and Edge-Level Prediction

Node-level tasks are the most common starting point. After message passing, each node has a learned embedding that reflects its own features plus the context of its neighborhood. You pass this embedding through a linear classifier or MLP to get a label. A classic example is semi-supervised node classification on the Cora citation network, where the goal is to categorize academic papers (nodes) into research topics using only a small number of labeled examples and the graph's citation structure.

Edge-level tasks (Link Prediction) focus on the relationships between pairs of nodes. The model takes the embeddings of two nodes, combines them (via dot product or concatenation into an MLP), and outputs a probability score for whether an edge should exist. This is the core mechanism behind every 'People You May Know' feature and product recommendation engine. You train with Negative Sampling: for every real edge, you sample several node pairs that are not connected and teach the model to distinguish them. Without negative samples, the model would naively predict every pair as connected.

—

// Node Classification Head
function nodeClassifier(h_i) {
  // h_i = node embedding after MP layers
  return softmax(Linear(h_i));
  // → [P(bot), P(human), ...]
}

// Link Prediction: Dot-Product Decoder
function linkPredictor(h_u, h_v) {
  // Negative sampling: v is often random
  const score = dot(h_u, h_v);
  return sigmoid(score);
  // → P(edge exists between u and v)
}

localhost:3000

localhost:3000/gnn-outputs

Node Classifier

Node 42: BOT → 94% confidence ✓

Link Predictor

Edge (A,C): P = 0.87 → Recommend ✓

2Graph-Level Tasks and the Readout Layer

For Graph-level tasks, we need a single fixed-size vector that represents the entire graph, regardless of how many nodes it contains. This is the Readout or Global Pooling layer — the GNN's equivalent of the fully connected layer in a CNN classifier.

The simplest readout operations are Global Mean and Global Sum over all node embeddings. These are differentiable and cheap, but they discard structural information. If two graphs have the same node features but different topology, Global Mean cannot tell them apart. For tasks where structure matters — like classifying different types of chemical compounds — more powerful methods like Global Attention Pooling (which learns to weight important nodes) or Hierarchical Pooling (DiffPool, which progressively clusters nodes into super-nodes) are preferred. Both regression (predicting a molecule's boiling point) and classification (predicting toxicity) can be applied at the graph level using the same readout architecture.

—

// Global Readout for Graph Classification
function globalMeanPool(nodeEmbeds, dim) {
  const N = nodeEmbeds.length;
  const sum = nodeEmbeds.reduce(
    (s, h) => s.map((v, i) => v + h[i]),
    new Array(dim).fill(0)
  );
  return sum.map(v => v / N);
}

// Classify the entire graph:
const graphVec = globalMeanPool(embeddings, 64);
const toxicity = sigmoid(classifier(graphVec));
// → 'TOXIC: false'

localhost:3000

localhost:3000/graph-readout

Graph-Level Prediction

Molecule C8H11NO2: TOXIC → false ✓

23 atom embeddings → 1 graph vector via global mean pool → binary classifier