algorithmic skeleton (clone / checkout / cache / group-by-(repo, base_commit) / parallel workers) mirrors the Python script. Usage: python scripts/detect_repo_specs_cpp.py --input data.jsonl --output ...