Many AI document pipelines still treat conversion as a boring preprocessing step, but it is really part of the architecture. Once converted files feed retrieval, agents, or automation, the conversion layer has to preserve document structure, expose its output for review, and respect trust boundaries. This MarkItDown page frames that idea nicely: https://www.markitdown.store/
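
To make those three requirements concrete, here is a minimal sketch of what a conversion layer along these lines might look like. Every name in it (`ConversionResult`, `convert_for_review`, `release`) is hypothetical, and `converter` is a stand-in for whatever tool actually produces the Markdown; none of this is MarkItDown's API. The structure check is a crude heuristic chosen for illustration only.

```python
import re
from dataclasses import dataclass, field
from typing import Callable, List, Optional

# Crude, illustrative signals that some structure survived conversion.
HEADING = re.compile(r"^#{1,6} ", re.MULTILINE)
TABLE_ROW = re.compile(r"^\|.*\|\s*$", re.MULTILINE)


@dataclass
class ConversionResult:
    """Hypothetical record that keeps converted text reviewable."""
    source: str
    markdown: str
    trusted: bool = False  # converted text starts out untrusted
    warnings: List[str] = field(default_factory=list)


def convert_for_review(source: str, converter: Callable[[str], str]) -> ConversionResult:
    """Run a converter (assumed: path in, Markdown out) and record warnings."""
    markdown = converter(source)
    result = ConversionResult(source=source, markdown=markdown)
    if not markdown.strip():
        result.warnings.append("empty output")
    if not HEADING.search(markdown) and not TABLE_ROW.search(markdown):
        result.warnings.append("no headings or tables detected")
    return result


def release(result: ConversionResult, reviewer: Optional[str] = None) -> str:
    """Hand text downstream; flagged output needs a named reviewer first."""
    if result.warnings and reviewer is None:
        raise PermissionError(f"{result.source}: review required: {result.warnings}")
    # Only an explicit sign-off marks the text as trusted.
    result.trusted = reviewer is not None
    return result.markdown
```

The point of the design is that conversion returns a record rather than a bare string, so the warnings and the trust flag travel with the text. Downstream retrieval or agent code can then refuse anything that was not released, instead of assuming the converter got it right.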
