graph LR
External_Data_Systems["External Data Systems"]
Pathway_I_O_Connectors["Pathway I/O Connectors"]
Pathway_Python_API_Core_Extensions_["Pathway Python API (Core & Extensions)"]
Pathway_Graph_Execution_Engine_Python_["Pathway Graph Execution Engine (Python)"]
Pathway_Rust_Execution_Engine["Pathway Rust Execution Engine"]
Pathway_Control_Observability["Pathway Control & Observability"]
External_Data_Systems -- "data flows to" --> Pathway_I_O_Connectors
Pathway_I_O_Connectors -- "data flows to" --> External_Data_Systems
Pathway_I_O_Connectors -- "provides ingested data streams to" --> Pathway_Python_API_Core_Extensions_
Pathway_Python_API_Core_Extensions_ -- "passes logical dataflow graph to" --> Pathway_Graph_Execution_Engine_Python_
Pathway_Graph_Execution_Engine_Python_ -- "sends compiled execution plans to" --> Pathway_Rust_Execution_Engine
Pathway_Rust_Execution_Engine -- "sends execution results/status to" --> Pathway_Graph_Execution_Engine_Python_
Pathway_Graph_Execution_Engine_Python_ -- "sends runtime metrics and logs to" --> Pathway_Control_Observability
Pathway_Python_API_Core_Extensions_ -- "sends debug requests and information to" --> Pathway_Control_Observability
Pathway_Control_Observability -- "issues control commands to" --> Pathway_Graph_Execution_Engine_Python_
click Pathway_I_O_Connectors href "https://github.com/CodeBoarding/GeneratedOnBoardings/blob/main/pathway/Pathway_I_O_Connectors.md" "Details"
click Pathway_Python_API_Core_Extensions_ href "https://github.com/CodeBoarding/GeneratedOnBoardings/blob/main/pathway/Pathway_Python_API_Core_Extensions_.md" "Details"
click Pathway_Graph_Execution_Engine_Python_ href "https://github.com/CodeBoarding/GeneratedOnBoardings/blob/main/pathway/Pathway_Graph_Execution_Engine_Python_.md" "Details"
click Pathway_Control_Observability href "https://github.com/CodeBoarding/GeneratedOnBoardings/blob/main/pathway/Pathway_Control_Observability.md" "Details"
Pathway is a high-performance data processing framework designed for real-time data pipelines. It comprises a user-friendly Python API for defining data transformations, which are then compiled and executed by an optimized Rust-based engine. Data ingress and egress are managed by dedicated I/O connectors, interacting with various external data systems. A Python-based graph execution engine orchestrates the interaction between the Python API and the Rust core, while a control and observability layer provides tools for managing and monitoring pipelines. This architecture ensures a clear separation of concerns, allowing for flexible pipeline definition in Python and efficient, incremental execution in Rust.
Represents all external data sources (e.g., Kafka, S3, Postgres, GDrive, Airbyte, HTTP, SharePoint) and sinks (e.g., Postgres, Delta Lake, HTTP). This is a conceptual component representing external systems that Pathway interacts with, and thus does not have direct source code within the Pathway project. Its interaction is facilitated by the Pathway I/O Connectors.
Related Classes/Methods: None
Pathway I/O Connectors [Expand]
Handles data ingestion from and egress to External Data Systems. Responsible for serialization/deserialization and protocol specifics.
Related Classes/Methods:
Pathway Python API (Core & Extensions) [Expand]
The user-facing layer for defining data pipelines. Includes core dataflow operations (transformations, joins, UDFs), schema management, and specialized extensions for LLM/AI processing and standard library functions. Also includes the SQL interface for query definition.
Related Classes/Methods:
pathway.internals.tablepathway.internals.joinspathway.internals.udfspathway.internals.expressionpathway.internals.schemapathway.internals.dtypepathway.xpacks.llmpathway.stdlibpathway.internals.sql
Pathway Graph Execution Engine (Python) [Expand]
Compiles the logical dataflow graph defined by the Pathway Python API into an executable plan and orchestrates its execution on the Pathway Rust Execution Engine. This component acts as the primary interface to the Rust engine.
Related Classes/Methods:
The high-performance, incremental computation engine written in Rust. It executes the compiled dataflow graph received from the Pathway Graph Execution Engine (Python). As this component is written in Rust, its source code is external to the Python project.
Related Classes/Methods: None
Pathway Control & Observability [Expand]
Provides command-line tools for project management and deployment, and monitoring/debugging capabilities for pipeline runtime.
Related Classes/Methods: