Markdown & HTML
The most popular output formats. Rekognita generates clean, structured Markdown or HTML with preserved heading hierarchy, tables, and lists.
Markdown Output
Markdown is the ideal format for:
- RAG pipelines and LLM context
- Documentation and Knowledge Bases
- Git repositories and READMEs
Example
# Quarterly Report Q4 2024
## Executive Summary
The company achieved record revenue of **$4.5M** in Q4...
## Financial Results
| Quarter | Revenue | Expenses | Profit |
|---------|---------|----------|--------|
| Q1 | $2.4M | $1.8M | $600K |
| Q2 | $3.1M | $2.1M | $1.0M |
| Q3 | $3.8M | $2.4M | $1.4M |
| Q4 | $4.5M | $2.7M | $1.8M |
## Key Highlights
- Net profit increased by **67%** QoQ
- New product line launched in Q2
- Customer base grew to **12,000+**
> ¹ All figures are audited and in compliance with IFRS.HTML Output
HTML preserves full formatting and is suitable for:
- Embedding into web applications
- Rendering with CSS styles
- Further conversion to PDF
Example
<h1>Quarterly Report Q4 2024</h1>
<h2>Executive Summary</h2>
<p>The company achieved record revenue of <strong>$4.5M</strong>...</p>
<h2>Financial Results</h2>
<table>
<thead>
<tr><th>Quarter</th><th>Revenue</th><th>Profit</th></tr>
</thead>
<tbody>
<tr><td>Q1</td><td>$2.4M</td><td>$600K</td></tr>
<tr><td>Q2</td><td>$3.1M</td><td>$1.0M</td></tr>
</tbody>
</table>API Request
POST /v1/documents/convert
Content-Type: multipart/form-data
Authorization: Bearer rk_sk_your_key
file=@document.pdf
output_format=markdown # or "html"
model=rekognita-balancedSDK Example
from rekognita import RekognitaClient
client = RekognitaClient()
# Markdown
md_result = client.documents.convert(
file="report.pdf",
output_format="markdown"
)
print(md_result.content) # Markdown text
# HTML
html_result = client.documents.convert(
file="report.pdf",
output_format="html"
)
print(html_result.content) # HTML textRekognita vs Competitors
| Feature | Rekognita | Competitors |
|---|---|---|
| Heading Hierarchy | ✅ H1-H6 | ❌ Flat text |
| Tables | ✅ Complete with headers | ❌ Unstructured text |
| Lists | ✅ Numbered + nested | ⚠️ Simple text |
| Footnotes | ✅ Linked to text | ❌ Not supported |
| Images | ✅ Captions and references | ❌ Ignored |