Skip to content

Conversation

@edufuga
Copy link

@edufuga edufuga commented Jan 30, 2026

Description

Adds a new documentation page that explains Apache Spark in general terms and documents how Spark is used within Corporate Memory’s BUILD component. It covers Spark’s main abstractions (RDDs, DataFrames), the rationale for using Spark in BUILD, what “Spark-aware datasets” are (and which types exist), and how Spark-backed workflow execution relates to the Knowledge Graph (Spark computes in BUILD; EXPLORE persists; the workflow execution engine mediates).

Context

This documentation complements the plugin documentations in and from BUILD itself.

@edufuga edufuga requested a review from rpietzsch January 30, 2026 15:34
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Development

Successfully merging this pull request may close these issues.

2 participants