Import and Publishing

Overview

The PageSeeder batch processing API and mature document model make it possible to import and export content from a range of data structures and formats, including:

  • conventional document formats such as Adobe InDesign, Microsoft Word, PDF and HTML,
  • relational databases, spreadsheets, and CSV,
  • industry standard XML initiatives such as DITA and Akomo Ntoso,
  • arbitrary markup languages such as wikis or markdown,
  • specialist formats such as JSON, SVG, MathJax or ASCIImath.

Developers

PageSeeder makes it possible to aggregate content from different sources into searchable documents. Data objects can overwrite specific content fragments without affecting the surrounding document. Features such as “diff before write” mean that a document is only updated when content changes rather than when a file is uploaded.

Analysts

“Low code” environment for managing formatting templates separate to data transformations. Filter content on any searchable parameters such as document type, version, status, workflow, last edited date, author, other metadata or content properties then consolidate results of a search into a single file document for publishing.

Use our OX conversion and validation workbench to analyze, test and validate any incoming data before loading it to PageSeeder.

Users

Manage your Word templates then upload them to PageSeeder so exported documents are branded. Know that complex paragraph and cross-reference numbering, plus markers such as footnotes, index and bibliography entries will export consistently.

Remap style names before importing or exporting. Validate publications to find errors before exporting.

Document model

Word and Excel support

  • Support for converting PSML documents to docx and docx to PSML is a standard feature of PageSeeder. Available under the export action icon on the Document -> Browse page, there is a standard conversion that will process any PSML document without modification;
  • Docx files can be converted into PageSeeder documents through the Upload menu;
  • How docx files are processed is customizable;
  • Saving the results of a search in CSV format so the data can be imported into Excel;
  • The use of an Excel worksheet to build a document collection.

Hierarchical and graph data

Tabular and relational data

OX

Batch validation

The search results page allows users to validate multiple files at once using the Validate option at the Action menu.

Batch conversion

The search results page allows users to process multiple files at once using one of the actions batch functions at the Action menu.

External source data

Import

Import content from a range of data structures and formats.