Supported File Types

Rekognita supports a wide range of document and image formats. Each format is processed by a parser suited to it, so that extraction is as accurate as possible.

Documents

Format                 Extensions      Notes
PDF                    .pdf            Digital and scanned (OCR); protected PDFs are supported
Microsoft Word         .docx, .doc     Full support for tables, images, and styles
Microsoft Excel        .xlsx, .xls     Table conversion preserving structure
Microsoft PowerPoint   .pptx, .ppt     Text and image extraction from slides
OpenDocument           .odt, .ods      LibreOffice / OpenOffice documents
Rich Text              .rtf            Basic formatting support
Plain Text             .txt            UTF-8 and various other encodings

Images

Format   Extensions      Max Size
JPEG     .jpg, .jpeg     50 MB
PNG      .png            50 MB
TIFF     .tiff, .tif     100 MB (multi-page)
WebP     .webp           50 MB
BMP      .bmp            50 MB
HEIC     .heic           50 MB
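
The extensions and size limits in the tables above can be checked on the client side before uploading. The following is a minimal sketch, not part of Rekognita itself: the function name `check_upload` is hypothetical, and it assumes that "MB" means 2^20 bytes, which this page does not specify. No size limits are documented for the formats in the Documents table, so none are enforced for them here.

```python
from pathlib import Path
from typing import Tuple

# Assumption: 1 MB = 2**20 bytes; the documentation does not define the unit.
MB = 1024 * 1024

# Extensions from the Documents table (no size limit is documented for these).
DOCUMENT_EXTENSIONS = {
    ".pdf", ".docx", ".doc", ".xlsx", ".xls", ".pptx", ".ppt",
    ".odt", ".ods", ".rtf", ".txt",
}

# Extensions and maximum sizes from the Images table.
IMAGE_MAX_BYTES = {
    ".jpg": 50 * MB, ".jpeg": 50 * MB,
    ".png": 50 * MB,
    ".tiff": 100 * MB, ".tif": 100 * MB,
    ".webp": 50 * MB,
    ".bmp": 50 * MB,
    ".heic": 50 * MB,
}


def check_upload(filename: str, size_bytes: int) -> Tuple[bool, str]:
    """Hypothetical pre-upload check: returns (accepted, reason)."""
    ext = Path(filename).suffix.lower()  # extension match is case-insensitive
    if ext in IMAGE_MAX_BYTES:
        limit = IMAGE_MAX_BYTES[ext]
        if size_bytes > limit:
            return False, f"{ext} exceeds the {limit // MB} MB limit"
        return True, "ok"
    if ext in DOCUMENT_EXTENSIONS:
        return True, "ok"
    return False, f"unsupported extension: {ext or '(none)'}"
```

For example, `check_upload("scan.TIFF", 80 * MB)` is accepted because TIFF files may be up to 100 MB, while a 51 MB `.jpg` would be rejected.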

Supported OCR Languages

Rekognita supports OCR for 25+ languages, including:

  • Latin: English, German, French, Spanish, Italian, Portuguese, Dutch, Polish
  • Cyrillic: Ukrainian, Russian, Belarusian, Bulgarian, Serbian
  • CJK: 中文 (Chinese), 日本語 (Japanese), 한국어 (Korean)
  • Arabic: العربية (Arabic), فارسی (Persian)
  • Other: हिन्दी (Hindi), ภาษาไทย (Thai), Tiếng Việt (Vietnamese)

Recommendations

  • For scanned documents, a resolution of ≥ 300 DPI is recommended
  • PDFs with embedded text are processed faster than scans
  • For Excel/PowerPoint, we recommend converting to PDF before uploading for the best results
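
To check a scan against the 300 DPI recommendation, the effective resolution can be estimated from the image's pixel dimensions and the physical size of the scanned page. This is a small illustrative sketch; the function names are hypothetical, and the page size in inches must be supplied by the caller.

```python
def effective_dpi(width_px: int, height_px: int,
                  width_in: float, height_in: float) -> float:
    """Estimate DPI as pixels per inch, taking the lower of the two axes."""
    return min(width_px / width_in, height_px / height_in)


def meets_recommendation(width_px: int, height_px: int,
                         width_in: float, height_in: float,
                         minimum_dpi: float = 300.0) -> bool:
    """True if the scan reaches the recommended minimum resolution."""
    return effective_dpi(width_px, height_px, width_in, height_in) >= minimum_dpi
```

A US Letter page (8.5 × 11 inches) scanned to 2550 × 3300 pixels comes out at 300 DPI and meets the recommendation; the same page at 1700 × 2200 pixels is only 200 DPI.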