awesome-architecture-mds/developer-tools/markitdown/on_boarding.md at main · CodeBoarding/awesome-architecture-mds

graph LR
    CLI_Entrypoint["CLI Entrypoint"]
    MarkItDown_Engine["MarkItDown Engine"]
    Converter_Registry_Dispatcher["Converter Registry & Dispatcher"]
    PlainTextConverter["PlainTextConverter"]
    PdfConverter["PdfConverter"]
    ImageConverter["ImageConverter"]
    DocumentIntelligenceConverter["DocumentIntelligenceConverter"]
    llm_caption["llm_caption"]
    DocumentConverterResult["DocumentConverterResult"]
    Unclassified["Unclassified"]
    CLI_Entrypoint -- "invokes" --> MarkItDown_Engine
    MarkItDown_Engine -- "registers and dispatches via" --> Converter_Registry_Dispatcher
    Converter_Registry_Dispatcher -- "dispatches to" --> PlainTextConverter
    Converter_Registry_Dispatcher -- "dispatches to" --> PdfConverter
    Converter_Registry_Dispatcher -- "dispatches to" --> ImageConverter
    Converter_Registry_Dispatcher -- "dispatches to" --> DocumentIntelligenceConverter
    ImageConverter -- "optionally uses" --> llm_caption
    PlainTextConverter -- "produces" --> DocumentConverterResult
    PdfConverter -- "produces" --> DocumentConverterResult
    ImageConverter -- "produces" --> DocumentConverterResult
    DocumentIntelligenceConverter -- "produces" --> DocumentConverterResult
    CLI_Entrypoint -- "receives output from" --> DocumentConverterResult

Details

MarkItDown is a thin, command‑line‑driven document‑to‑markdown engine. The main() entry‑point parses user options, builds a StreamInfo hint object and instantiates the MarkItDown façade. The façade registers a prioritized list of converter objects (plain‑text, PDF, image, Azure Document‑Intelligence, etc.) in a registry. When a conversion request arrives, the registry walks the list, asks each converter whether it accepts the supplied stream, and dispatches to the first matching converter. Converters perform the core transformation and may invoke optional AI enrichment helpers – an OpenAI‑style LLM for image captioning (llm_caption) or Azure Document‑Intelligence for OCR‑rich documents. The resulting markdown is returned to the CLI, which writes it to stdout or a user‑specified file. This layered design (CLI → Engine → Registry → Converters → Optional AI) yields a clear, modular data‑flow that maps directly onto a compact flow‑graph with distinct visual boundaries for each architectural component.

CLI Entrypoint

Entry point that parses command‑line arguments, builds StreamInfo, creates MarkItDown engine, invokes conversion, and writes markdown output.

Related Classes/Methods:

markitdown.__main__.main:13-200

MarkItDown Engine

Facade that holds the converter registry, registers built‑in converters (and optional plugins) on construction.

Related Classes/Methods:

markitdown._markitdown.MarkItDown:93-776

Converter Registry & Dispatcher

Ordered list of ConverterRegistration objects; dispatches conversion request to the first converter that accepts the stream.

Related Classes/Methods:

markitdown._markitdown.ConverterRegistration:85-90

PlainTextConverter

Handles plain‑text, JSON, and markdown files; implements accepts() and convert().

Related Classes/Methods:

markitdown.converters._plain_text_converter.PlainTextConverter:33-71

PdfConverter

Handles PDF files; implements accepts() and convert().

Related Classes/Methods:

markitdown.converters._pdf_converter.PdfConverter:31-77

ImageConverter

Handles JPEG/PNG images; may add EXIF metadata and optional LLM caption.

Related Classes/Methods:

markitdown.converters._image_converter.ImageConverter:16-138

DocumentIntelligenceConverter

Uses Azure Document‑Intelligence service for OCR and layout extraction; also a converter.

Related Classes/Methods:

markitdown.converters._doc_intel_converter.DocumentIntelligenceConverter:130-254

llm_caption

Helper that generates a natural‑language caption for an image via an OpenAI‑compatible LLM.

Related Classes/Methods:

markitdown.converters._llm_caption.llm_caption:7-50

DocumentConverterResult

Result object containing generated markdown and related metadata.

Related Classes/Methods: None

Unclassified

Component for all unclassified files and utility functions (Utility functions/External Libraries/Dependencies)

Related Classes/Methods: None

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Details

CLI Entrypoint

MarkItDown Engine

Converter Registry & Dispatcher

PlainTextConverter

PdfConverter

ImageConverter

DocumentIntelligenceConverter

llm_caption

DocumentConverterResult

Unclassified

FAQ

FilesExpand file tree

on_boarding.md

Latest commit

History

on_boarding.md

File metadata and controls

Details

CLI Entrypoint

MarkItDown Engine

Converter Registry & Dispatcher

PlainTextConverter

PdfConverter

ImageConverter

DocumentIntelligenceConverter

llm_caption

DocumentConverterResult

Unclassified

FAQ