graph LR
Metadata_Catalog_Service["Metadata Catalog Service"]
PDC_Public_API["PDC Public API"]
PDC_Core_Data_Manager["PDC Core Data Manager"]
PDC_Storage_Backend["PDC Storage Backend"]
PDC_Metadata_Storage["PDC Metadata Storage"]
PDC_Common_Utilities["PDC Common Utilities"]
PDC_Tools["PDC Tools"]
PDC_Public_API -- "sends requests to" --> Metadata_Catalog_Service
PDC_Core_Data_Manager -- "sends notifications to" --> Metadata_Catalog_Service
PDC_Public_API -- "sends requests to" --> PDC_Core_Data_Manager
PDC_Core_Data_Manager -- "interacts with" --> PDC_Storage_Backend
PDC_Metadata_Storage -- "provides services to" --> Metadata_Catalog_Service
PDC_Public_API -- "uses" --> PDC_Common_Utilities
Metadata_Catalog_Service -- "uses" --> PDC_Common_Utilities
PDC_Tools -- "interacts with" --> PDC_Public_API
click Metadata_Catalog_Service href "https://github.com/CodeBoarding/GeneratedOnBoardings/blob/main/pdc/Metadata_Catalog_Service.md" "Details"
The Metadata Catalog Service is a pivotal component within the PDC system, serving as the central hub for all metadata operations. It is designed to provide robust capabilities for data discovery, provenance tracking, and efficient data access by managing the lifecycle of metadata associated with scientific data objects. This service is integral to the system's ability to organize, search, and understand the vast amounts of data generated and managed in an HPC environment.
Metadata Catalog Service [Expand]
A dedicated service for storing, indexing, and querying all metadata associated with the scientific data objects managed by PDC. This component is critical for data discovery, provenance tracking, and enabling efficient data access. It acts as the authoritative source for metadata within the system.
Related Classes/Methods:
pdc_metadata_core.cpppdc_metadata_api.hpdc_metadata_logic.py
The primary external interface for users and other system components to interact with the PDC system. It exposes functionalities for both data and metadata operations, ensuring secure and controlled access.
Related Classes/Methods:
pdc_api_gateway.cpppdc_api_bindings.py
Responsible for the lifecycle management of the actual scientific data objects, including their storage, retrieval, and integrity. It orchestrates data movement and ensures consistency between data and its associated metadata.
Related Classes/Methods:
pdc_data_manager.cpppdc_data_orchestrator.py
The underlying persistent storage system for scientific data. This component provides high-performance, scalable, and fault-tolerant storage capabilities essential for HPC environments.
Related Classes/Methods:
pdc_storage_interface.hpdc_storage_driver.cpp
A dedicated persistence layer specifically optimized for storing, indexing, and querying metadata. This component ensures the high availability and performance of metadata operations for the Metadata Catalog Service.
Related Classes/Methods:
pdc_metadata_db.cpppdc_metadata_index.h
A collection of shared libraries and helper functions that provide common services such as logging, configuration management, error handling, and data serialization/deserialization across all PDC components.
Related Classes/Methods:
pdc_utils.hpdc_common_lib.cpppdc_config.py
A suite of command-line utilities and scripts designed for system administration, data ingestion, monitoring, and diagnostic purposes. These tools facilitate the management and operation of the PDC system.
Related Classes/Methods:
pdc_admin_cli.pypdc_ingest_tool.py