Skip to content

Latest commit

 

History

History
66 lines (43 loc) · 4.25 KB

File metadata and controls

66 lines (43 loc) · 4.25 KB
graph LR
    Application_Interface_CLI["Application Interface/CLI"]
    User["User"]
    Data_Storage["Data Storage"]
    CTGAN_Model["CTGAN Model"]
    TVAE_Model["TVAE Model"]
    Application_Interface_CLI -- "interacts with" --> User
    Application_Interface_CLI -- "consumes" --> Data_Storage
    Application_Interface_CLI -- "produces" --> Data_Storage
    Application_Interface_CLI -- "orchestrates" --> CTGAN_Model
    Application_Interface_CLI -- "orchestrates" --> TVAE_Model
    User -- "interacts with" --> Application_Interface_CLI
    Data_Storage -- "provides" --> Application_Interface_CLI
    Data_Storage -- "receives" --> Application_Interface_CLI
    CTGAN_Model -- "trained by" --> Application_Interface_CLI
    CTGAN_Model -- "generates data for" --> Application_Interface_CLI
    TVAE_Model -- "trained by" --> Application_Interface_CLI
    TVAE_Model -- "generates data for" --> Application_Interface_CLI
    click Application_Interface_CLI href "https://github.com/CodeBoarding/GeneratedOnBoardings/blob/main/CTGAN/Application_Interface_CLI.md" "Details"
    click CTGAN_Model href "https://github.com/CodeBoarding/GeneratedOnBoardings/blob/main/CTGAN/CTGAN_Model.md" "Details"
    click TVAE_Model href "https://github.com/CodeBoarding/GeneratedOnBoardings/blob/main/CTGAN/TVAE_Model.md" "Details"
Loading

CodeBoardingDemoContact

Details

The CTGAN system's architecture is centered around its Application Interface/CLI, which acts as the primary interaction point for the User. This interface is responsible for managing the entire data synthesis workflow, from parsing user commands and loading Raw Input Data from Data Storage to orchestrating the training and generation processes of either the CTGAN Model or the TVAE Model. Once synthetic data is generated by the chosen model, the Application Interface/CLI ensures it is persistently stored back into Data Storage. This design clearly separates user interaction and data management from the core generative modeling capabilities.

Application Interface/CLI [Expand]

The primary user-facing component, responsible for parsing command-line arguments, loading raw input data, orchestrating the training and generation processes of the selected generative model (CTGAN or TVAE), and saving the resulting synthetic data. It acts as the central coordinator for the entire data synthesis workflow, utilizing libraries like argparse for argument parsing and pandas for data handling.

Related Classes/Methods:

User

The external actor who initiates and controls the data synthesis process through the command-line interface.

Related Classes/Methods: None

Data Storage

Represents the persistent storage mechanism (e.g., file system) for both the raw input datasets consumed by the application and the synthetic datasets produced. This component abstracts the underlying file system interactions for data input and output.

Related Classes/Methods: None

CTGAN Model [Expand]

The core generative adversarial network model responsible for learning the distribution of tabular data and generating high-quality synthetic samples. It is trained and utilized by the Application Interface/CLI to perform the primary data synthesis task.

Related Classes/Methods: None

TVAE Model [Expand]

An alternative generative model based on a Variational Autoencoder, designed for learning data distributions and generating synthetic data. Similar to CTGAN, it is trained and utilized by the Application Interface/CLI as an option for data synthesis.

Related Classes/Methods: None