```mermaid
graph LR
    classy_vision_dataset_classy_dataset["classy_vision.dataset.classy_dataset"]
    classy_vision_dataset_transforms["classy_vision.dataset.transforms"]
    classy_vision_dataset_image_path_dataset["classy_vision.dataset.image_path_dataset"]
    classy_vision_dataset_classy_video_dataset["classy_vision.dataset.classy_video_dataset"]
    classy_vision_dataset_dataloader_async_gpu_wrapper["classy_vision.dataset.dataloader_async_gpu_wrapper"]
    classy_vision_dataset_transforms_util["classy_vision.dataset.transforms.util"]
    classy_vision_dataset_transforms_autoaugment["classy_vision.dataset.transforms.autoaugment"]
    classy_vision_tasks_classification_task["classy_vision.tasks.classification_task"]
    classy_vision_dataset_image_path_dataset -- "extends" --> classy_vision_dataset_classy_dataset
    classy_vision_dataset_classy_video_dataset -- "extends" --> classy_vision_dataset_classy_dataset
    classy_vision_dataset_transforms -- "orchestrates transformations for" --> classy_vision_dataset_classy_dataset
    classy_vision_dataset_dataloader_async_gpu_wrapper -- "wraps" --> classy_vision_dataset_classy_dataset
    classy_vision_tasks_classification_task -- "consumes" --> classy_vision_dataset_classy_dataset
    classy_vision_dataset_transforms -- "utilizes" --> classy_vision_dataset_transforms_util
    classy_vision_dataset_transforms -- "integrates" --> classy_vision_dataset_transforms_autoaugment
    classy_vision_tasks_classification_task -- "applies augmentations configured by" --> classy_vision_dataset_transforms
```

Details

The Data Pipeline subsystem in ClassyVision handles data end to end: loading raw samples, applying transformations, and assembling batches for model training. It is built from modular, extensible components, in keeping with the project's role as an ML toolkit/framework.

classy_vision.dataset.classy_dataset

Defines the foundational interface for all datasets, ensuring consistent methods for data access, iteration, and batching. It serves as the abstract base for specific dataset implementations.
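
To make the idea of a shared dataset interface concrete, here is a minimal sketch of an abstract base with indexed access, a length, and a batching iterator. The names `BaseDataset`, `get_raw`, `iter_batches`, and `ListDataset` are hypothetical and chosen for illustration; this is not the actual class defined in `classy_vision.dataset.classy_dataset`.

```python
from abc import ABC, abstractmethod
from typing import Any, Callable, Iterator, List, Optional


class BaseDataset(ABC):
    """Hypothetical stand-in for a dataset base class (not the library's own)."""

    def __init__(self, batch_size: int,
                 transform: Optional[Callable[[Any], Any]] = None):
        if batch_size <= 0:
            raise ValueError("batch_size must be positive")
        self.batch_size = batch_size
        self.transform = transform

    @abstractmethod
    def __len__(self) -> int:
        """Number of samples in the dataset."""

    @abstractmethod
    def get_raw(self, index: int) -> Any:
        """Return the untransformed sample at `index`."""

    def __getitem__(self, index: int) -> Any:
        # Every subclass gets the same transform handling for free.
        sample = self.get_raw(index)
        return self.transform(sample) if self.transform else sample

    def iter_batches(self) -> Iterator[List[Any]]:
        # Yield consecutive batches; the last one may be smaller.
        for start in range(0, len(self), self.batch_size):
            stop = min(start + self.batch_size, len(self))
            yield [self[i] for i in range(start, stop)]


class ListDataset(BaseDataset):
    """Concrete dataset backed by an in-memory list."""

    def __init__(self, items, batch_size, transform=None):
        super().__init__(batch_size, transform)
        self._items = list(items)

    def __len__(self) -> int:
        return len(self._items)

    def get_raw(self, index: int) -> Any:
        return self._items[index]
```

A concrete dataset only supplies `__len__` and `get_raw`; access, transformation, and batching behave the same way for every subclass.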

Related Classes/Methods:

classy_vision.dataset.transforms

Acts as a central factory and orchestrator for constructing and composing various data transformation pipelines. It aggregates and manages individual transformation utilities.
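
The factory-and-composition role described above can be sketched with a small name-based registry. Everything here (`register`, `build_pipeline`, the `"scale"` and `"offset"` transforms, and the config layout with a `"name"` key) is an assumption made for illustration, with numbers standing in for images; it is not the module's actual API.

```python
from typing import Any, Callable, Dict, List

# Maps a transform name to a factory that builds the transform from kwargs.
_REGISTRY: Dict[str, Callable[..., Callable[[Any], Any]]] = {}


def register(name: str):
    """Decorator that records a transform factory under `name`."""
    def decorator(factory):
        if name in _REGISTRY:
            raise ValueError(f"transform {name!r} is already registered")
        _REGISTRY[name] = factory
        return factory
    return decorator


@register("scale")
def make_scale(factor: float):
    return lambda x: x * factor


@register("offset")
def make_offset(amount: float):
    return lambda x: x + amount


def build_pipeline(config: List[Dict[str, Any]]) -> Callable[[Any], Any]:
    """Build each configured transform and compose them in order."""
    steps = []
    for entry in config:
        kwargs = {k: v for k, v in entry.items() if k != "name"}
        steps.append(_REGISTRY[entry["name"]](**kwargs))

    def pipeline(sample: Any) -> Any:
        for step in steps:
            sample = step(sample)
        return sample

    return pipeline
```

With this layout, a pipeline is described entirely by data, for example `[{"name": "scale", "factor": 2}, {"name": "offset", "amount": 1}]`, and new transforms become available by registering them.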

Related Classes/Methods:

classy_vision.dataset.image_path_dataset

Handles the loading and management of image data directly from file system paths, implementing the classy_dataset interface for common image classification tasks.
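
As an illustration of indexing images by file path, the sketch below filters a list of paths down to image files and assigns integer labels. Taking the label from the parent directory name is an assumption borrowed from the common "one folder per class" layout; the text above does not say how this module derives labels, and `index_image_paths` is a hypothetical helper.

```python
from pathlib import PurePosixPath
from typing import Dict, Iterable, List, Tuple

IMAGE_EXTENSIONS = {".jpg", ".jpeg", ".png", ".bmp"}


def index_image_paths(
    paths: Iterable[str],
) -> Tuple[List[Tuple[str, int]], Dict[str, int]]:
    """Return (path, label) pairs and the class-name-to-index mapping."""
    # Keep only image files, in a stable (sorted) order.
    images = sorted(
        p for p in paths
        if PurePosixPath(p).suffix.lower() in IMAGE_EXTENSIONS
    )
    # Assumption: the parent directory name is the class name.
    class_names = sorted({PurePosixPath(p).parent.name for p in images})
    class_to_idx = {name: i for i, name in enumerate(class_names)}
    samples = [(p, class_to_idx[PurePosixPath(p).parent.name]) for p in images]
    return samples, class_to_idx
```

The function works on path strings only, so it can be exercised without touching the file system; actual image decoding would happen later, when a sample is fetched.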

Related Classes/Methods:

classy_vision.dataset.classy_video_dataset

Manages the specific complexities of video data, including frame/clip sampling and worker initialization, extending the classy_dataset interface for video-based tasks.
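
One simple form of clip sampling is to spread a fixed number of clips evenly across a video. The sketch below computes the starting frame of each clip; `uniform_clip_starts` and its parameters are hypothetical, and the library may use a different sampling strategy.

```python
from typing import List


def uniform_clip_starts(num_frames: int, clip_length: int,
                        num_clips: int) -> List[int]:
    """Start indices of `num_clips` clips spread evenly over the video."""
    if clip_length <= 0 or num_clips <= 0:
        raise ValueError("clip_length and num_clips must be positive")
    if num_frames < clip_length:
        raise ValueError("video is shorter than one clip")
    # Latest frame at which a full clip still fits.
    last_start = num_frames - clip_length
    if num_clips == 1:
        # A single clip is taken from the middle of the video.
        return [last_start // 2]
    # Integer arithmetic keeps the result independent of float rounding.
    return [(i * last_start) // (num_clips - 1) for i in range(num_clips)]
```

The first clip starts at frame 0 and the last ends exactly at the final frame; clips overlap when the video is too short to hold them side by side.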

Related Classes/Methods:

classy_vision.dataset.dataloader_async_gpu_wrapper

Improves data loading performance by asynchronously prefetching batches to the GPU, which reduces the time training spends waiting on CPU-to-GPU transfers.
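
The prefetching pattern can be shown without a GPU. In the sketch below, a background thread prepares upcoming batches while the consumer works on the current one, and a `transfer` callable stands in for the host-to-device copy. `PrefetchWrapper` is a hypothetical CPU-only illustration of the idea, not the implementation in `dataloader_async_gpu_wrapper`.

```python
import queue
import threading
from typing import Any, Callable, Iterable, Iterator

_DONE = object()  # Sentinel marking the end of the underlying loader.


class PrefetchWrapper:
    """Wraps an iterable and prepares up to `depth` batches ahead of time."""

    def __init__(self, loader: Iterable[Any],
                 transfer: Callable[[Any], Any], depth: int = 2):
        self.loader = loader
        self.transfer = transfer  # Stand-in for a host-to-device copy.
        self.depth = depth

    def __iter__(self) -> Iterator[Any]:
        buffer: queue.Queue = queue.Queue(maxsize=self.depth)

        def produce() -> None:
            try:
                for batch in self.loader:
                    # Blocks when the buffer is full, bounding memory use.
                    buffer.put(self.transfer(batch))
                buffer.put(_DONE)
            except BaseException as exc:
                # Forward errors to the consumer instead of losing them.
                buffer.put(exc)

        # Daemon thread: it will not keep the process alive if the
        # consumer stops iterating early.
        threading.Thread(target=produce, daemon=True).start()

        while True:
            item = buffer.get()
            if item is _DONE:
                return
            if isinstance(item, BaseException):
                raise item
            yield item
```

Batches come out in the same order as the wrapped loader produces them, since a single producer feeds a FIFO queue.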

Related Classes/Methods:

classy_vision.dataset.transforms.util

Provides a library of common image transformations, including those optimized for standard benchmarks like ImageNet, serving as fundamental building blocks for transformation pipelines.
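
A typical building block of this kind is per-channel normalization. The sketch below uses the widely used ImageNet channel means and standard deviations for RGB values scaled to [0, 1]; whether this module's utilities use exactly these values is an assumption, and `normalize_pixel` is a hypothetical helper operating on a single pixel rather than a tensor.

```python
from typing import Sequence, Tuple

# Commonly used ImageNet statistics for RGB channels in the [0, 1] range.
IMAGENET_MEAN = (0.485, 0.456, 0.406)
IMAGENET_STD = (0.229, 0.224, 0.225)


def normalize_pixel(
    rgb: Sequence[float],
    mean: Sequence[float] = IMAGENET_MEAN,
    std: Sequence[float] = IMAGENET_STD,
) -> Tuple[float, ...]:
    """Subtract the channel mean and divide by the channel std."""
    if not (len(rgb) == len(mean) == len(std)):
        raise ValueError("rgb, mean and std must have the same length")
    return tuple((v - m) / s for v, m, s in zip(rgb, mean, std))
```

A pixel equal to the channel means maps to zero in every channel, which is the point of the normalization: inputs end up roughly centered with unit scale.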

Related Classes/Methods:

classy_vision.dataset.transforms.autoaugment

Implements and applies automated data augmentation policies to enhance model robustness and generalization, integrated into transformation pipelines.
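
An augmentation policy of this kind is commonly expressed as a list of sub-policies, each a short sequence of (operation, probability, magnitude) steps, with one sub-policy chosen at random per sample. The sketch below follows that general scheme on a flat list of pixel intensities; the operation names, `OPS`, and `apply_policy` are hypothetical and do not reproduce the module's actual policies.

```python
import random
from typing import Callable, Dict, List, Sequence, Tuple

Step = Tuple[str, float, float]  # (operation name, probability, magnitude)

# Toy operations on pixel intensities in [0, 1].
OPS: Dict[str, Callable[[List[float], float], List[float]]] = {
    "brighten": lambda px, m: [min(1.0, p + m) for p in px],
    "invert": lambda px, m: [1.0 - p for p in px],
}


def apply_policy(pixels: List[float],
                 policy: Sequence[Sequence[Step]],
                 rng: random.Random) -> List[float]:
    """Pick one sub-policy at random and apply its steps in order."""
    sub_policy = rng.choice(policy)
    for name, prob, magnitude in sub_policy:
        # Each step fires independently with its own probability.
        if rng.random() < prob:
            pixels = OPS[name](pixels, magnitude)
    return pixels
```

Passing an explicit `random.Random` instance keeps the augmentation reproducible, which helps when debugging a data pipeline.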

Related Classes/Methods:

classy_vision.tasks.classification_task

Drives the data flow for classification tasks: it configures datasets, builds dataloaders, and manages batching for consumption by the model. It is the primary consumer of the data pipeline within a training task.
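
The relationship between a task and its datasets can be sketched as a small object that holds one dataset per phase and turns it into batches on request. `MiniTask`, its `set_dataset` and `batches` methods, and the "shuffle only during training" rule are assumptions made for illustration; the real task class does considerably more.

```python
import random
from typing import Any, Dict, Iterator, List, Sequence


class MiniTask:
    """Hypothetical task that owns per-phase datasets and batches them."""

    PHASES = ("train", "test")

    def __init__(self, batch_size: int, seed: int = 0):
        self.batch_size = batch_size
        self.seed = seed
        self.datasets: Dict[str, Sequence[Any]] = {}

    def set_dataset(self, dataset: Sequence[Any], phase: str) -> "MiniTask":
        if phase not in self.PHASES:
            raise ValueError(f"unknown phase {phase!r}")
        self.datasets[phase] = dataset
        return self  # Allow chained configuration.

    def batches(self, phase: str, epoch: int = 0) -> Iterator[List[Any]]:
        data = list(self.datasets[phase])
        if phase == "train":
            # Assumption: shuffle training data with a per-epoch seed so
            # each epoch sees a different but reproducible order.
            random.Random(self.seed + epoch).shuffle(data)
        for start in range(0, len(data), self.batch_size):
            yield data[start:start + self.batch_size]
```

For example, `MiniTask(batch_size=2).set_dataset(train_items, "train").set_dataset(test_items, "test")` configures both phases, after which the training loop simply iterates over `task.batches("train", epoch)`.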

Related Classes/Methods: