Dataset

Synthetic Complete

A complete bipartite synthetic benchmark where every unit in the first partition is matched to every unit in the second partition.

Bipartite incidence pattern for Synthetic Complete

Graph Summary

Best-case connectivity; alternating projections converge almost immediately.

The primary benchmark graph is built from unique (id1, id2) pairs. The figure shows a binned sparsity pattern of the bipartite incidence block, with both partitions relabeled to contiguous integer identifiers. This block is the off-diagonal part of the corresponding graph Laplacian.

500,000rows
500,000unique edges
1,000id1 levels
500id2 levels
1components
1,500nodes

Variables

Columns in the clean CSV:

id1 id2 x1 x2 y

The v1 graph uses id1 and id2. No additional identifier-like columns are included in this clean v1 CSV.

Source Notes

Generated by the synthetic benchmark scripts included with this collection.

Synthetic benchmark designed for complete bipartite two-way fixed effects.

Creation Files

The synthetic data-generating files are included for researchers who want to inspect or adapt the construction.

Historical Benchmark

2017 SEC benchmark timings for this dataset, in seconds:

MethodCitationSeconds
MAP-Aitken (Guimaraes 2012) 2.3
MAP-SD (Gaure 2013) 2.5
MAP-CG-Sym (Correia 2016) 3.5
MAP+Prune (Correia 2016) 7.5
LSMR (Gomez 2016) 2.8