Skip to content

Document Converter

FieldValue
Namedoc-converter
Categorydata-pipeline
Complexitymoderate
Tagssidecar, pandoc, document-conversion, llm-synthesis, http-body
Authorrandybias
Min Version0.7.0

Convert documents between formats using a pandoc sidecar, then summarize the output with Claude. This scaffold demonstrates the HTTP body sidecar data flow pattern — content travels as JSON in the request and response body, with no shared volume required.

This scaffold includes a sidecar container:

FieldValue
Namepandoc
Imagepandoc/core:3.6
Port3030
Health checkGET /version
CommunicationHTTP body (JSON request/response to localhost:3030)

The pandoc sidecar runs pandoc-server --port 3030 using the official pandoc image. No custom Docker image is needed. The engine node sends document content via HTTP POST and receives the converted output in the response body.

fetch-document → convert-document → summarize-output
NodePurpose
fetch-documentFetch a document from a URL, auto-detecting format
convert-documentPOST content to the pandoc sidecar for format conversion
summarize-outputSend a preview to Claude for a brief summary
  • manual only
ServiceTypeRequired
Anthropic APIExternalOptional (mock fallback if absent)
Document source URLExternalYes
KeyDefaultDescription
timeout120sPer-node timeout
retries0Retry count per node
document_urlTentacular READMEURL of document to fetch
input_formatmarkdownpandoc input format
output_formathtmlpandoc output format
max_summary_chars2000Max chars sent to Claude for summarization

Pandoc supports 40+ input and output formats. Common pairs: markdown to HTML, HTML to markdown, RST to markdown, markdown to LaTeX.

  • anthropic.api_key — Claude API key for summarization (optional; mock fallback used if absent)
Terminal window
tntc scaffold init doc-converter
tntc scaffold init doc-converter my-custom-name
tntc scaffold info doc-converter
  • The pandoc sidecar starts pandoc-server --port 3030 — no custom image needed
  • /version is used as the readiness probe (returns pandoc version JSON)
  • PDFs cannot be produced by pandoc-server (pandoc limitation in server mode)
  • Contrast with video-frame-analyzer which uses a shared emptyDir volume for large file handoff

Scaffold source: quickstarts/doc-converter/