March 19, 2026, 2:39 p.m.

We Customised Your Documents and Rules

Introduction:

We have released a lightweight document intelligence engine recently. This engine is designed for deterministic document parsing and analysis. The system focuses on reliable extraction and structural understanding of both digital and scanned documents.

At the moment LibDocAnax supports:

  • Structural and scanned document processing
  • English tokenisation
  • Chinese word segmentation
  • Deterministic structural text extraction
  • Basic document and image understanding
  • Simple JSON and governance-oriented JSON output

The engine is designed to process PDF, Office documents, and images and convert them into structured representations that can support search, compliance workflows, and document comparison.

We are gradually expanding the system with features such as improved layout detection, complex table handling, document comparison, and governance workflow support.

Custom Enterprise Integration

Future add-ons:

  • governance workflow
  • complex table extraction
  • improved layout detection
  • document comparison
  • spell checking
  • compliance workflows
We Customised Your Documents and Rules” enabled