OX

In the decades that Allette Systems has been helping customers to cope with unstructured data, one theme consistently repeats. That is – good decisions are more likely when people have good metrics about the quality of their source data.

It was the lack of tools to support data analysis that led us to create, OX, an open-source framework designed to help users process data analysis and conversions.

Typical use cases include the following:

  • Analysis of different markup used in source documents such as XML, PDF or DOCX file formats.
  • Compare two or more XML instances and generate a report of the different XPath expressions or attribute values.
  • Batch validate XML instances against XSD or Schematron schema.
  • Convert documents from one format to another including conversion rules for specific data sets.
  • Convert docx or PDF files into a generic interim format (PageSeeder PSML) before further processing into other structures or syntaxes.

Architecturally, the idea behind OX is to allow developers to configure processing scripts into a pipeline without worrying about creating an interface for end-users who might not be comfortable using the command line. Both the OX interface and the pipelines are expressed as XML files. It can be deployed with either a user interface or as a service, or both.

OX supports processes that might require user input – see the screenshot below, where a user can add bookmarks to a PDF document one entry at a time or by pasting a list of entries into the editor. OX will process these entries to create a new PDF that is better suited for epub or other applications. 

Go to OX Allette

Related

Australian Taxation Office (ATO)