Dataset

Enron Email

Email sender-recipient graph derived from the SNAP Enron email network.

Bipartite incidence pattern for Enron Email

Graph Summary

Two-way graph uses anonymized sender and receiver identifiers.

The primary benchmark graph is built from unique (id1, id2) pairs. The figure shows a binned sparsity pattern of the bipartite incidence block, with both partitions relabeled to contiguous integer identifiers. This block is the off-diagonal part of the corresponding graph Laplacian.

367,662rows
367,662unique edges
36,692id1 levels
36,692id2 levels
1,887components
73,384nodes

Variables

Columns in the clean CSV:

id1 id2 x1 x2 y

The v1 graph uses id1 and id2. No additional identifier-like columns are included in this clean v1 CSV.

Source Notes

https://snap.stanford.edu/data/email-Enron.html

SNAP Enron email network.

Historical Benchmark

2017 SEC benchmark timings for this dataset, in seconds:

MethodCitationSeconds
MAP-Aitken (Guimaraes 2012) 51.4
MAP-SD (Gaure 2013) 38.1
MAP-CG-Sym (Correia 2016) 29.7
MAP+Prune (Correia 2016) 31.5
LSMR (Gomez 2016) 51.0