graph LR
ImageBaseDatasetHandler["ImageBaseDatasetHandler"]
SpecificDatasetImplementations["SpecificDatasetImplementations"]
DataAugmentationModule["DataAugmentationModule"]
ROMPGeneralUtilities["ROMPGeneralUtilities"]
Trace2VideoDataLoader["Trace2VideoDataLoader"]
SpecificDatasetImplementations -- "implements/extends" --> ImageBaseDatasetHandler
ImageBaseDatasetHandler -- "utilizes" --> DataAugmentationModule
ImageBaseDatasetHandler -- "leverages" --> ROMPGeneralUtilities
Trace2VideoDataLoader -- "leverages" --> ROMPGeneralUtilities
Trace2VideoDataLoader -- "prepares data for" --> DataAugmentationModule
Data Input & Preprocessing
Serves as the abstract base for handling single-frame image datasets, defining the interface and common logic for loading, standardizing, and preparing individual image frames for subsequent processing.
Related Classes/Methods:
Concrete dataset classes that inherit from or utilize ImageBaseDatasetHandler, providing dataset-specific logic for parsing annotations and locating files for various image datasets.
Related Classes/Methods:
Provides a comprehensive set of functions for augmenting image and pose data, crucial for increasing training data diversity and model robustness by applying various transformations.
Related Classes/Methods:
Offers general utility functions for the ROMP project, including fundamental image preprocessing, temporal smoothing, and extracting frame paths, which are leveraged across different data handling components.
Related Classes/Methods:
Specializes in loading and preparing video sequences specifically for the trace2 model, handling the complexities of video input and ensuring readiness for sequential model processing.
Related Classes/Methods: