I have a simple PySpark structured streaming app that transforms incoming messages into a graph (using GraphFrames). A simplified example of the code is given below ...
This workspace benchmarks PageRank memory usage on the LiveJournal graph stored as CSR arrays in livejournal-csr.duckdb. GraphFrames also needs Java and Spark. The ...