Skip to content

rel-stack: user badges (Classification)

This docs example now reflects the current runner path: the local script materializes the official rel-stack tables from the relbench library and then runs the GraphReduce badge pipeline. The old S3 CSV download and separate Tags.csv dependency are no longer part of the example.

from pathlib import Path
from relbench_dataset_utils import materialize_relbench_dataset

materialized = materialize_relbench_dataset(
    "rel-stack",
    Path("tests/data/relbench/rel-stack"),
    {
        "users": "Users.csv",
        "posts": "Posts.csv",
        "badges": "Badges.csv",
        "postHistory": "PostHistory.csv",
        "postLinks": "PostLinks.csv",
        "votes": "Votes.csv",
        "comments": "Comments.csv",
    },
)
print(materialized)

Current implementation:

Interactive Runner