graph LR
Base_Model["Base Model"]
Neural_Augmentation_Manager["Neural Augmentation Manager"]
Classification_Models["Classification Models"]
Detection_Models["Detection Models"]
CLIP_Model["CLIP Model"]
Base_Image_Encoder["Base Image Encoder"]
Basic_Augmentor["Basic Augmentor"]
Distribution_Augmentor["Distribution Augmentor"]
Classification_Models -- "inherits from" --> Base_Model
Detection_Models -- "inherits from" --> Base_Model
CLIP_Model -- "inherits from" --> Base_Model
Base_Image_Encoder -- "inherits from" --> Base_Model
Neural_Augmentation_Manager -- "instantiates" --> Basic_Augmentor
Neural_Augmentation_Manager -- "instantiates" --> Distribution_Augmentor
Neural_Augmentation_Manager -- "provides augmented data to" --> Classification_Models
Neural_Augmentation_Manager -- "provides augmented data to" --> Detection_Models
Neural_Augmentation_Manager -- "provides augmented data to" --> CLIP_Model
The cvnets model subsystem provides a structured framework for developing and deploying computer vision models. At its core, the Base Model establishes a common contract for all models, promoting reusability and maintainability. Specialized model categories like Classification Models and Detection Models, along with specific implementations such as the CLIP Model and Base Image Encoder, extend this base functionality to address diverse vision tasks. A dedicated Neural Augmentation Manager orchestrates data pre-processing, applying various augmentation strategies (e.g., Basic Augmentor, Distribution Augmentor) to enhance the training data. This modular design allows for easy integration of new models and augmentation techniques, forming a robust pipeline for computer vision applications.
Serves as the abstract base class for all computer vision models. It defines common interfaces, properties (e.g., model information), and functionalities (e.g., dummy input/label generation), establishing the architectural contract for all derived models.
Related Classes/Methods:
Manages and applies neural augmentation techniques to input data. It acts as a factory for creating different types of augmentors and orchestrates their application to data before it enters the model.
Related Classes/Methods:
Represents a category of specific computer vision models designed for classification tasks, inheriting from the Base Model. This category includes various concrete implementations found within its directory.
Related Classes/Methods:
Represents a category of specific computer vision models designed for object detection tasks, inheriting from the Base Model. This category includes various concrete implementations found within its directory.
Related Classes/Methods:
Represents the concrete CLIP model implementation, inheriting from the Base Model.
Related Classes/Methods:
Represents a foundational and concrete component for image encoding, likely used as a sub-component within more complex models, inheriting from the Base Model.
Related Classes/Methods:
A concrete implementation of a neural augmentor, providing a basic augmentation strategy.
Related Classes/Methods:
A concrete implementation of a neural augmentor, providing a distribution-based augmentation strategy.
Related Classes/Methods: