The Four Essential Functions: Document Processing and Transformation

It's possible to process and transform documents in FormKiQ, with features such as document events, customized workflows, anti-malware protection, and more.

It's possible to process and transform documents in FormKiQ with the following features:

Document Events

FormKiQ's Document Events capability provides real-time notification of document lifecycle changes — creation, update, deletion, classification changes, workflow state transitions, and other document actions — enabling automated responses to document activity across your AWS environment and connected external systems.

Document Events are built on Amazon S3 event notifications and Amazon Simple Notification Service (SNS), running entirely within your AWS account. Every document action that occurs within FormKiQ can generate an event that is immediately available to any downstream system, service, or workflow that subscribes to the relevant SNS topic — with no polling, no scheduled batch processing, and no delay between the document action and the event notification.

Available with:

FormKiQ Core FormKiQ Essentials FormKiQ Advanced FormKiQ Enterprise

How Document Events work

When a document action occurs in FormKiQ — a document is uploaded, a metadata attribute is updated, a classification schema is applied, a workflow state changes, or a document is deleted — FormKiQ publishes a structured event payload to an SNS topic within your AWS account. The event payload includes the document identifier, the action type, relevant metadata attributes, the timestamp of the action, and the site and user context — providing downstream consumers with everything they need to respond intelligently to the event without needing to query FormKiQ for additional context.

SNS's fan-out architecture means that a single document event can be delivered simultaneously to multiple subscribers — a Lambda function, an SQS queue, an EventBridge rule, and an external webhook endpoint can all receive the same event in parallel, each responding according to their own logic.

What can be triggered by Document Events

AWS Lambda functions

Trigger serverless functions in response to document events — initiating an AI processing pipeline when a new document is uploaded, updating a connected system's record when metadata changes, sending a notification when a document reaches a specific workflow state, or archiving a document when a retention period is reached.

Amazon SQS queues

Route document events to SQS queues for reliable, ordered processing — appropriate for high-volume event streams where downstream processing needs to be rate-limited, batched, or decoupled from the event source. SQS ensures events are not lost if a downstream processor is temporarily unavailable.

Amazon EventBridge

Route document events through EventBridge to connect document activity to the broader AWS event-driven architecture — triggering Step Functions workflows, routing events to other AWS services, or forwarding events to connected SaaS applications through EventBridge's partner integrations.

External webhooks and APIs

Deliver document events to external systems — CRM platforms, case management systems, ERP systems, or custom applications — through outbound webhook delivery, keeping connected systems informed of document activity in real time without requiring those systems to poll FormKiQ for changes.

Workflow automation platforms

Connect document events to Microsoft Power Automate, Zapier, and similar tools through SNS-to-webhook delivery, triggering automated actions in connected services in response to FormKiQ document activity.

Document Events and FormKiQ Workflows

Document Events and FormKiQ's built-in workflow, queue, and ruleset architecture work together — Document Events can trigger FormKiQ workflows, and FormKiQ workflow state transitions generate Document Events that can trigger external actions. This bidirectional relationship allows document activity in FormKiQ to drive both internal workflow progression and external system notifications from the same event infrastructure, without requiring separate integration layers for internal and external event handling.

Document Events in compliance and audit programs

For regulated deployments, Document Events provide a real-time audit stream that complements FormKiQ's built-in audit trail. Every document action generates an event that can be captured, stored, and analyzed independently of FormKiQ's own audit log — supporting audit programs that require an immutable, independently maintained record of document activity, or compliance monitoring programs that need to detect and respond to specific document actions in real time.

Event payloads can be routed to Amazon CloudWatch for monitoring and alerting, to Amazon Kinesis for high-volume event stream processing and analytics, or to a dedicated audit log store for long-term retention independent of the FormKiQ deployment.

Custom Application and Administration Development

FormKiQ's architecture is designed to be extended — not just configured. Because FormKiQ deploys into your own AWS account and exposes a complete, documented API surface alongside direct access to the underlying AWS resources, your development team has everything needed to build custom applications, administrative interfaces, and integration components that reflect the specific operational and compliance requirements of your deployment.

Available with:

FormKiQ Core FormKiQ Essentials FormKiQ Advanced FormKiQ Enterprise

What custom development enables

Custom interfaces and administration tools

FormKiQ's Document Console is built using ReactJS and serves as both a functional document management interface and a reference implementation for custom interface development. Organizations with specific interface requirements — custom intake portals, role-specific administration views, branded external-facing document access interfaces, or compliance-specific review tools — can build on the same API and component patterns used by the Document Console, using the existing codebase as a practical starting point rather than building from scratch.

Custom compliance and governance tooling

Compliance programs often require administrative interfaces that go beyond what a general-purpose document management console provides — custom audit reporting views, compliance status dashboards, disposition workflow interfaces, or regulatory submission preparation tools. Because FormKiQ's underlying data is accessible through both the FormKiQ API and directly through the AWS resources in your account, custom compliance tooling can surface exactly the information and workflow controls your compliance program requires.

Custom integrations with AWS services

Every AWS service available in your account is accessible for integration with FormKiQ — connecting document workflows to services your organization already uses, triggering document actions from events in other systems, or building processing pipelines that combine FormKiQ's document layer with other AWS managed services. The FormKiQ codebase provides practical examples of how communication across AWS services is structured within the platform architecture.

Extended platform components

FormKiQ's modular architecture allows individual platform components to be extended or replaced where your requirements go beyond the standard implementation. Workflow logic, classification pipelines, metadata validation, and processing functions can all be extended within the FormKiQ module architecture — keeping custom extensions maintainable and upgrade-compatible rather than diverging into a separate codebase.

Development resources and support

FormKiQ provides complete API documentation, architectural guidance, and access to the FormKiQ engineering team through Advanced and Enterprise support channels. For organizations requiring hands-on development support, FormKiQ's Professional Services program provides custom development engagement for integrations, interface components, and platform extensions that benefit from direct involvement of the FormKiQ engineering team.

Document Digitization and Paper Document Intake

FormKiQ supports multiple methods for converting physical documents into governed digital records — from direct upload of scanned files through automated capture pipelines that connect scanning infrastructure directly to FormKiQ's intake and processing workflow. For organizations with ongoing paper document intake requirements or large historical paper collections requiring digitization, FormKiQ provides a flexible, scalable path from physical document to fully classified, searchable, governed digital record.

Available with:

FormKiQ Core FormKiQ Essentials FormKiQ Advanced FormKiQ Enterprise

Digitization methods

FormKiQ Document Console

The Document Console provides a direct upload interface for scanned documents — appropriate for lower-volume digitization workflows where staff are scanning and uploading documents individually or in small batches. Uploaded scanned documents can be processed through FormKiQ's OCR pipeline immediately on upload, with full-text extraction, classification, and metadata attribution applied before the document enters the governed document collection.

FormKiQ CLI

The FormKiQ CLI provides high-speed bulk upload capability for large-scale digitization programs — appropriate for historical paper collections where documents have been scanned in bulk and need to be imported with metadata mapping and classification applied at scale. The CLI's asynchronous processing architecture handles large volumes without manual intervention, with OCR and AI processing pipelines triggered automatically on ingestion. Available with Advanced and Enterprise.

AWS CLI

Direct AWS CLI access to the S3 buckets underlying FormKiQ's document storage enables programmatic bulk upload for digitization programs that integrate with existing scanning infrastructure or document capture workflows — appropriate for organizations with established scan-to-cloud pipelines that need to route output directly into FormKiQ's storage layer.

Email Ingestion Gateway

For organizations using scan-to-email printers or multifunction devices that deliver scanned documents as email attachments, the Email Ingestion Gateway provides a turnkey connection between the scanning device and FormKiQ's intake workflow. Documents scanned and emailed to a configured address are automatically captured, processed through OCR and classification, and stored as governed records — without any manual download or upload step. This is the recommended approach for ongoing paper intake workflows where scan-to-email is the primary capture mechanism. Available with Advanced and Enterprise.

Document Scanner and Capture Gateway

For organizations with dedicated scanning infrastructure — high-volume document scanners, multifunction devices with direct network output, or capture platforms — the Document Scanner and Capture Gateway provides a structured connection between the capture device and FormKiQ's intake and processing workflow, with metadata extraction and classification applied at the point of capture. Available with Advanced and Enterprise.

OCR and AI processing for digitized documents

Documents ingested through any digitization method are processed through FormKiQ's OCR pipeline — using Tesseract for standard text extraction in Core, and AWS Textract with custom mappings in Essentials, Advanced, and Enterprise for higher-accuracy extraction across a wider range of document formats and layouts. For organizations using the AI Processing and Analysis module, digitized documents can additionally be processed through classification, metadata extraction, sensitivity analysis, and vision processing pipelines — transforming scanned paper documents into richly attributed, searchable, governed digital records in a single automated workflow.

Document Import and Export

FormKiQ provides a comprehensive set of import and export capabilities — from direct API-based document exchange through automated gateway ingestion and workflow-orchestrated export — allowing document flows to be integrated into any operational architecture without custom middleware or proprietary connector requirements.

Available with:

FormKiQ Core FormKiQ Essentials FormKiQ Advanced FormKiQ Enterprise

Document import

API-based import

The FormKiQ Document API provides RESTful endpoints for document creation and upload — the primary integration point for applications, services, and systems that need to push documents into FormKiQ programmatically. Any system that can make an HTTP request can submit documents to FormKiQ, with metadata, classification schema, and workflow routing specified at the point of submission. API-based import supports both synchronous and asynchronous processing patterns, with webhook-based notification of processing completion for asynchronous flows.

Inbound Webhooks

FormKiQ's inbound webhook endpoints provide a lightweight import mechanism for external services and automation platforms that support outbound webhook delivery — allowing documents and document events from third-party systems to trigger document creation in FormKiQ without requiring full API integration. Both open and authenticated webhook configurations are supported.

Document Gateways

FormKiQ's Document Gateway modules provide structured, automated import from specific source systems — Microsoft SharePoint and Office 365, Google Drive and Workspace, email ingestion, SFTP and managed file transfer, document scanners and capture platforms, and cloud storage environments. Each gateway handles the source-specific connection, authentication, and event-driven ingestion logic, delivering documents into FormKiQ's intake pipeline with metadata mapping and classification applied at the point of ingestion. Available with Advanced and Enterprise.

FormKiQ CLI and AWS CLI

For bulk import and migration programs, the FormKiQ CLI and AWS CLI provide high-throughput import capability appropriate for initial migration from existing document stores, historical collection import, and large-scale digitization ingestion programs. FormKiQ CLI available with Advanced and Enterprise.

Document export

API-based export

The FormKiQ Document API provides complete retrieval endpoints for document content and metadata — supporting any export pattern that requires programmatic access, from individual document retrieval through bulk collection export based on metadata queries. Export responses include the original document binary alongside the complete metadata record, supporting downstream systems that need both the document and its governance context.

Document Actions

FormKiQ's Document Actions provide workflow-orchestrated export capability — configurable actions triggered by document events, workflow state transitions, or scheduled processes to export documents to defined destinations. Document Actions can push documents to external systems, cloud storage locations, SFTP endpoints, or downstream processing pipelines as part of a defined workflow, with the export action logged in FormKiQ's audit trail.

Direct AWS resource access

Because FormKiQ stores documents in S3 and metadata in DynamoDB within your own AWS account, export operations can be performed directly against these resources using AWS-native tooling — S3 batch operations for large-scale export, DynamoDB exports for metadata collection export, and AWS Data Exchange or similar services for structured data sharing programs.

Building custom import and export flows

For organizations with import or export requirements that go beyond the standard API, gateway, and action capabilities, FormKiQ's architecture supports custom transformation and routing services built within your AWS account — Lambda functions that transform incoming data before submission to the FormKiQ API, Step Functions workflows that orchestrate complex multi-step import or export processes, or EventBridge rules that route FormKiQ document events to external destinations based on document attributes.

The FormKiQ Document Events system provides the trigger layer for event-driven export flows — document events published to SNS can initiate export actions, notify external systems, or trigger transformation pipelines in response to specific document lifecycle events, without requiring scheduled polling or batch export jobs.

Document Actions and Workflows

FormKiQ provides two complementary automation layers: Document Actions, available in all editions, for attaching discrete processing steps to documents; and Document Workflows, available in commercial editions, for defining structured multi-step automation with condition sets, approval routing, and system integration.

Document Actions

Available with:

FormKiQ Core FormKiQ Essentials FormKiQ Advanced FormKiQ Enterprise

Document Actions are discrete, configurable processing steps that can be attached to a document at creation or at any later point. Each action sends a request to a FormKiQ module or an external service via Amazon SQS and executes independently — common uses include triggering OCR, invoking AI Processing and Analysis, sending outbound webhooks, and running post-processing transformations. Multiple actions can be assigned to a document and will run in sequence, but they operate without conditions, branching, or approval gates — those capabilities belong to Document Workflows.

Document Workflows

Available with:

FormKiQ Essentials FormKiQ Advanced FormKiQ Enterprise

Document Workflows are a distinct automation engine that defines structured sequences of processing steps — each step invoking Document Actions to perform its work — with full support for conditions, branching, and approval gates. Each workflow step can evaluate document attributes, classification results, or external data to determine which path to follow, and approval stages hold document progression until a designated reviewer acts. Every step produces a governed, traceable audit record, making workflows well-suited to compliance-sensitive processes where the full processing history of a document must be demonstrable.

Common workflow patterns include:

  • Intake classification and routing — classify documents on arrival and direct them to the appropriate workspace, team queue, or processing step based on type, source, or extracted attributes
  • Approval workflows — route documents to one or more reviewers, hold progression until approval is confirmed, and record the decision and timestamp against the document record
  • Condition-based branching — define condition sets that evaluate document metadata and route processing to different steps depending on the result, supporting complex real-world intake and review scenarios
  • System integration triggers — connect workflow steps to external systems via outbound webhooks or EventBridge, so document state changes in FormKiQ drive downstream actions in ERP, CRM, case management, or other connected platforms
  • Post-processing automation — trigger OCR, AI Processing and Analysis, or document generation steps automatically as part of a defined workflow, reducing manual intervention in high-volume processing environments

Outbound Webhooks

FormKiQ's outbound webhook capability enables real-time notification of document activity to external systems, services, and automation platforms — delivering structured event payloads to any webhook-enabled endpoint in response to document lifecycle events and workflow actions.

Outbound webhooks in FormKiQ can be triggered from two sources: Document Events — the real-time notifications generated by document creation, update, and deletion — and Document Actions — the configurable workflow steps that can be attached to documents and triggered by workflow state transitions, classification changes, or other document lifecycle milestones. This dual trigger model gives you precise control over when external systems are notified and what information they receive.

Available with:

FormKiQ Core FormKiQ Essentials FormKiQ Advanced FormKiQ Enterprise

How outbound webhooks work

When a triggering event or action occurs, FormKiQ constructs a structured webhook payload containing the document identifier, event or action type, relevant metadata attributes, timestamp, and site and user context, and delivers it to the configured webhook endpoint via an HTTP POST request. The receiving endpoint can be any service that accepts inbound HTTP webhook delivery — external applications, workflow automation platforms, notification services, CRM and ERP systems, case management platforms, or custom services built within your own infrastructure.

Outbound webhooks support both internal and external delivery targets. Webhooks destined for services within your AWS environment can be routed through API Gateway or Lambda endpoints. Webhooks destined for external services — third-party SaaS platforms, partner systems, or public-facing APIs — are delivered directly to the configured external endpoint.

Outbound webhooks triggered by Document Events

Document Event-triggered webhooks fire in response to document lifecycle changes — creation, update, deletion, and metadata changes — delivering real-time notification to connected systems as document activity occurs. This is appropriate for use cases where an external system needs to stay synchronized with FormKiQ document state — a CRM that needs to know when a contract document has been updated, a case management system that needs to know when a supporting document has been added, or a compliance monitoring system that needs to know when a document has been classified or reclassified.

Outbound webhooks triggered by Document Actions

Document Action-triggered webhooks fire when a specific configured action completes within a FormKiQ workflow — giving you precise control over which workflow events generate external notifications. This is appropriate for use cases where external notification should only occur at specific workflow milestones — when a document has been approved and is ready for signature, when OCR processing has completed and extracted content is available, when an AI analysis action has completed and results are ready for review, or when a retention action has been applied and an external records system needs to be updated.

Common outbound webhook use cases

  • Notifying a CRM or ERP system when a contract document reaches a specific workflow state
  • Triggering a downstream processing pipeline when document OCR or AI analysis completes
  • Delivering real-time document activity notifications to a compliance monitoring or audit system
  • Notifying external parties when documents relevant to them have been updated or made available
  • Connecting FormKiQ document workflows to automation platforms such as Microsoft Power Automate or Zapier
  • Triggering notifications in communication platforms when documents require review or approval action
  • Synchronizing document state changes with connected case management, grants management, or claims management systems

Document OCR Processing

FormKiQ's optical character recognition capability transforms document content from binary image data into structured, searchable, and processable text — making scanned documents, photographed pages, image-based PDFs, and other non-text-native document formats fully accessible to FormKiQ's search, classification, metadata extraction, and AI processing capabilities.

OCR processing in FormKiQ is available at two levels — a standard capability included across all editions, and an enhanced capability available from Essentials onward — reflecting the different accuracy, format support, and structured extraction requirements of different document programs.

Available with:

FormKiQ Core FormKiQ Essentials FormKiQ Advanced FormKiQ Enterprise

Standard OCR — Tesseract

FormKiQ Core includes OCR processing using Tesseract, an open source OCR engine that provides reliable text extraction across a wide range of common document formats and languages. Tesseract-based OCR is appropriate for standard digitization workflows where the primary requirement is making document text searchable and available for processing — scanned typed documents, printed forms, and other document types where text is clearly legible and consistently formatted.

Tesseract processing runs within your AWS account using Lambda functions, with extracted text stored as part of the document's metadata record and available for full-text indexing, keyword search, and downstream processing.

Enhanced OCR and IDP — AWS Textract

From Essentials onward, FormKiQ provides OCR and intelligent document processing using Amazon Textract — AWS's managed machine learning service for document text and data extraction. Textract goes significantly beyond standard OCR in several important ways:

  • Structured data extraction — Textract extracts not just raw text but structured data, identifying tables and extracting their content, identifying form fields and extracting key-value pairs, and understanding the spatial layout of document content to preserve the relationship between elements that simple text extraction loses.
  • Higher accuracy across document types — Textract's machine learning models provide higher extraction accuracy across a wider range of document types, formats, and quality levels than rules-based OCR — including lower-quality scans, complex layouts, multi-column formats, and documents where text and visual elements are intermixed.
  • Custom mappings — FormKiQ's Textract integration supports custom extraction mappings — configuring extraction of specific fields, tables, or form elements from defined document types, with the extracted data mapped directly to FormKiQ metadata attributes. A contract processed through a custom mapping arrives in FormKiQ with counterparty name, dates, and key clause content extracted and attributed. An invoice arrives with vendor details, line items, and totals extracted as structured metadata.
  • Form and table data — Textract's form and table extraction capability makes structured document types — application forms, financial tables, regulatory filings — fully processable as structured data rather than flat text, enabling downstream automation and analysis that flat text extraction cannot support.

OCR and downstream processing

Full-text search

OCR-extracted text is indexed by FormKiQ's search layer — making scanned and image-based documents fully searchable by content alongside text-native documents, with no distinction in search behavior between document types.

AI Processing and Analysis

OCR-extracted text is available as input to FormKiQ's AI Processing and Analysis module — enabling classification, metadata extraction, summarization, analysis, and sensitivity classification to be applied to scanned and image-based documents in the same way as text-native documents. For documents with complex visual content, OCR extraction can be combined with vision processing to provide a complete content understanding from both the text and visual layers.

Metadata attribution

Structured data extracted through custom mappings is applied directly as FormKiQ metadata attributes — making extracted field values searchable, queryable, and available to workflow routing rules without any additional processing step.

OCR processing workflow

OCR processing in FormKiQ can be triggered automatically on document upload through a configured workflow rule, on demand through a Document Action, or as part of a bulk processing operation applied to an existing document collection. Processing status and extracted content are available through the FormKiQ API and Document Console on completion, with a Document Event published to notify downstream subscribers that OCR processing has completed and results are available.

Amazon EventBridge Integration

FormKiQ supports integration with Amazon EventBridge — AWS's serverless event bus service — enabling FormKiQ document activity to participate in event-driven architectures across your AWS environment and connected external applications, at any scale and with the routing flexibility that EventBridge's rule-based delivery model provides.

Available with:

FormKiQ Essentials FormKiQ Advanced FormKiQ Enterprise

What EventBridge integration enables

Amazon EventBridge allows teams to build loosely coupled, event-driven architectures where services communicate through events rather than direct integrations. FormKiQ's integration with EventBridge brings document lifecycle events into this architecture, allowing document activity to trigger actions across any EventBridge-connected service without building point-to-point integrations from FormKiQ to each downstream system.

Routing document events to AWS services

EventBridge rules can route FormKiQ document events to any AWS service that EventBridge supports as a target — Step Functions workflows, Lambda functions, SQS queues, SNS topics, Kinesis streams, and others — based on the content of the event payload. A document classified as sensitive can be routed to a dedicated Lambda function that applies additional access restrictions. A document that completes an approval workflow can trigger a Step Functions execution that coordinates the next steps across multiple systems.

Connecting to SaaS applications

EventBridge's SaaS partner integrations allow FormKiQ document events to be delivered to connected SaaS applications — Salesforce, ServiceNow, Zendesk, and others — through EventBridge's partner event bus, without requiring custom webhook infrastructure or point-to-point API integrations.

Cross-account and cross-region event delivery

EventBridge supports cross-account and cross-region event delivery — allowing FormKiQ document events generated in one AWS account or region to be routed to services in other accounts or regions. For organizations with multi-account or multi-region AWS architectures, this means FormKiQ document activity can participate in centralized event processing, monitoring, and audit pipelines that span the full environment.

How FormKiQ publishes to EventBridge

FormKiQ integrates with EventBridge through three pathways:

Document API

The FormKiQ Document API can be configured to publish document events directly to an EventBridge event bus, making every API-generated document action available as an EventBridge event for downstream routing and processing.

Document Actions

FormKiQ's Document Actions can include EventBridge publication as a configured action step — allowing specific workflow milestones to publish structured events to EventBridge with custom payloads that include the document context relevant to the downstream processing that the event triggers.

FormKiQ Workflows

FormKiQ's workflow and ruleset architecture can be configured to publish EventBridge events as part of workflow execution — connecting internal workflow state transitions to the broader event-driven architecture of your AWS environment.

Document Anti-Virus and Anti-Malware Scanning

FormKiQ's anti-virus and anti-malware scanning capability automatically inspects every document uploaded to FormKiQ for known viruses, malware, and other malicious content — protecting your document collection, your users, and your connected systems from documents that carry harmful payloads before they enter the governed document environment.

Available with:

FormKiQ Essentials FormKiQ Advanced FormKiQ Enterprise

How scanning works

Document scanning in FormKiQ is implemented using ClamAV — a widely deployed, actively maintained open source anti-virus engine — running within AWS Lambda functions in your own AWS account. Every document uploaded to FormKiQ is automatically submitted to the scanning pipeline on ingestion, with the scan completing before the document is made accessible to users or downstream systems.

Because scanning runs within your AWS account using Lambda's serverless execution model, there is no fixed scanning capacity to provision or manage — the scanning pipeline scales automatically with document ingestion volume, handling high-volume intake programs without performance degradation or queue buildup.

What is detected

ClamAV's signature database covers a broad range of known malicious content categories, including:

  • Viruses and worms — self-replicating malicious code embedded in document files
  • Trojan horses — malicious payloads disguised as legitimate document content
  • Adware and spyware — software designed to compromise user systems or collect information without authorization
  • Ransomware — encryption-based malicious payloads targeting document and file content
  • Macro-based malware — malicious code embedded in document macros, particularly relevant for Word and Excel document intake
  • Exploit documents — documents crafted to exploit vulnerabilities in document processing software

Signature updates

FormKiQ's anti-virus module includes automatic ClamAV signature updates — keeping the detection database current with newly identified threats without requiring manual intervention. Signature updates are applied on a scheduled basis within your AWS environment, ensuring that the scanning pipeline reflects the current threat landscape rather than a static snapshot of known threats at the time of deployment.

Scan results and document handling

Scan results are applied as document metadata attributes — making scan status searchable and available to workflow routing rules. Documents that pass scanning proceed through the standard intake workflow. Documents that fail scanning — where malicious content is detected — are quarantined and flagged for administrator review, with a Document Event published to notify connected systems and administrators of the detection. Quarantine behavior and administrator notification can be configured to reflect your organization's security incident response process.

For organizations with security monitoring programs, scan results and quarantine events are logged in FormKiQ's audit trail and can be routed through Document Events to connected SIEM platforms, security monitoring tools, or administrator notification channels.

Anti-virus scanning in regulated deployments

For regulated deployments where document intake security is a compliance requirement — healthcare organizations receiving external documents, financial services firms processing inbound correspondence and submissions, government agencies receiving public submissions — anti-virus scanning provides a documented, auditable security control at the document intake layer. Scan results stored as document metadata provide evidence that every ingested document was inspected, supporting compliance documentation and audit requirements that mandate security controls on document intake.

Document Control, Versioning, and Artifacts

FormKiQ provides two complementary approaches to document version management — linear versioning for tracking the full history of changes to a document over time, and document artifacts for managing designated versions of a document through a defined lifecycle. Both approaches are available within the same deployment and can be applied independently or together depending on the level of version control your program requires.

Available with:

FormKiQ Essentials FormKiQ Advanced FormKiQ Enterprise

Linear Versioning

FormKiQ's linear versioning capability tracks every change to a document's content and metadata over time — maintaining a complete, ordered version history that allows any previous version to be retrieved, reviewed, or restored. Linear versioning is built on Amazon S3 Versions for document content and Amazon DynamoDB for metadata versioning. Every time a document's content is updated, the new version is stored alongside all previous versions in S3 — the original is never overwritten or deleted.

What linear versioning provides:

  • Complete version history for every document — content and metadata — from the point of ingestion
  • Retrieval of any previous version of a document's content or metadata at any point in the document lifecycle
  • The ability to set the current version to any previous version — restoring a document to an earlier state without losing the intervening history
  • Version-specific metadata — metadata attributes can be created and tracked for specific versions, allowing the metadata record at each point in the document's history to be reconstructed independently
  • Audit trail of all version changes including timestamp, user context, and change description
  • Protection against accidental overwrites and deletions — previous versions remain accessible even when a document is updated or nominally deleted

Document Artifacts

Document artifacts provide a structured approach to managing designated versions of a document through a defined lifecycle — allowing specific versions to be intentionally promoted to lifecycle stages that represent meaningful milestones in the document's development, approval, and retirement. Where linear versioning tracks every change automatically, artifact management is intentional: a document artifact is a designated version explicitly promoted to a specific lifecycle stage, representing a significant state rather than an incremental edit.

Document artifacts are a new capability in FormKiQ, available from Essentials onward.

Lifecycle stages

FormKiQ's artifact lifecycle is built around a set of standard stages that reflect the common progression of controlled documents, while supporting additional stages and customized lifecycle definitions per document classification schema.

Draft

A version in active development, not yet submitted for review or approved for use. Accessible to authorized authors and collaborators and clearly identified as not yet approved.

Review

A version submitted for formal review, frozen at the point of submission so that reviewers are evaluating a stable version. The review artifact is the specific version that reviewers comment on, approve, or return for revision.

Published

The current approved version, formally in effect and available for distribution and use. Represents the document's authoritative current state.

Archived

Previous published versions that are no longer current but remain accessible for reference, audit, and compliance purposes, retained according to the document's configured retention schedule.

These stages are a starting point, not a fixed constraint. Additional stages can be defined per document classification — a legal review stage before publication, a regulatory submission stage, a superseded stage, or multi-level approval stages.

Customizable lifecycles by document classification

Because artifact lifecycle configuration is tied to FormKiQ's document classification schema model, each document type can have its own lifecycle definition with its own stages, transition rules, required approvals, and access controls per stage. A contract document classification might define Draft, Legal Review, Commercial Approval, Executed, and Archived. A policy document might define Draft, Internal Review, Compliance Review, Published, and Superseded. A regulatory submission might define Draft, Internal Sign-Off, Submitted, Accepted, and Archived.

Lifecycle stage transitions and workflow integration

  • Stage transitions can be configured as formal workflow steps with defined approval requirements and role-based authorization — ensuring that only authorized users can advance a document to the next lifecycle stage
  • Access controls can be applied per lifecycle stage — restricting who can view, edit, or download artifacts at each stage independently of the access controls on the document's linear version history
  • Document Events are published on artifact stage transitions — notifying connected systems and triggering downstream actions such as distribution workflows, external system updates, or compliance notifications
  • Outbound webhooks and Document Actions can be triggered by specific lifecycle stage transitions — connecting artifact lifecycle events to the broader operational and compliance architecture of your deployment

Linear versioning and artifacts compared

Linear Versioning Document Artifacts
Tracks every change automaticallyYesNo — artifacts are designated intentionally
Maintains complete change historyYesComplements linear versioning
Supports lifecycle stage managementYes
Lifecycle customizationPer document classification schema
Previous version retrievalYes — any versionYes — any artifact version
Audit trailYes — all editsYes — artifact promotions logged alongside linear history
Appropriate forAll documents requiring change historyControlled documents with formal lifecycle requirements

Document Generation

FormKiQ's Document Generation module produces structured, repeatable document output from templates and data — integrating with FormKiQ's metadata and workflow layers so that generated documents are stored, classified, and governed the same way as any other document in the platform. The module covers template-based document generation, PDF splitting and joining, and PDF certificate generation.

Available as an Add-On Module with:

FormKiQ Advanced FormKiQ Enterprise

Document Generation from Templates and Data

Generate structured documents by combining predefined templates with data sourced from FormKiQ metadata, external system inputs, or structured JSON payloads. Templates define the structure, layout, and fixed content while the data input populates variable elements — names, dates, amounts, identifiers, decision outcomes. Data already captured on a FormKiQ document record can be used directly as generation inputs, and generation can be triggered by events in connected ERP, CRM, or case management systems.

Common fits: contract drafting, grant award and rejection letters, case correspondence, regulatory notices, HR correspondence, and any workflow where the same document structure needs to be produced repeatedly with different data inputs.

PDF File Splitting and Joining

Split multi-page PDF documents into individual component documents, or combine multiple PDF documents into a single consolidated file — as Document Actions within a workflow or on demand through the FormKiQ API. Splitting allows multi-document submissions to be decomposed into independently governed records; joining assembles components into a single deliverable. Each resulting document is stored in FormKiQ with full provenance metadata recording its origin.

Common fits: multi-document submission intake in grants, insurance, and regulatory programs; contract assembly with schedules and annexes; case file compilation; correspondence assembly.

PDF Certificates and Attestation Documents

Generate signed PDF certificates that carry a verifiable record of a specific event, outcome, decision, or attestation — including the timestamp of issuance, the data inputs used, and the FormKiQ record from which they were produced. Certificate generation integrates with FormKiQ's workflow layer and can be triggered automatically when a workflow reaches a defined milestone or on demand through the API.

Common fits: training and course completion certificates, compliance attestation documents, grant award certificates, inspection and audit completion records, identity and credential verification certificates.

View the full Document Generation module →

AI-Powered Document Processing and Transformation

FormKiQ's AI Processing and Analysis module extends the platform's processing and transformation capabilities with intelligent summarization, structured analysis, content generation, and vision processing — all running within your AWS account through Amazon Bedrock. These capabilities apply to the processing and transformation layer of document management, automatically producing outputs from document content that support downstream review, decision-making, and workflow.

Available as an Add-On Module with:

FormKiQ Advanced FormKiQ Enterprise

Document Summarization

Automatically generate concise, accurate summaries of document content — capturing key points, decisions, obligations, and context — stored as searchable metadata attributes alongside the original document record. Summaries are available to the KnowledgeBase module's retrieval model and can be configured to produce different formats for different document types.

Common fits: contract and legal document review, board and committee documentation, policy management, research programs, case file review, and any environment where document volume exceeds the practical capacity for full manual reading.

Document Analysis

Apply structured analytical reasoning to document content — evaluating documents against defined criteria, identifying obligations and commitments, assessing compliance with requirements, and producing structured analytical outputs that support review and governance decisions. Analytical outputs are stored as metadata attributes and can trigger workflow routing — a document that fails a compliance analysis can be automatically routed to a remediation queue.

Common fits: contract lifecycle management, regulatory submission review, grants eligibility assessment, vendor qualification review, policy governance, compliance monitoring.

Document and Content Generation

Automatically generate structured document content — drafts, responses, notices, and reports — based on reasoning over existing document content and context. This capability complements the Document Generation module: AI content generation produces free-form or semi-structured text from document reasoning, while Document Generation formats and outputs that content as a governed PDF or Word document. Generated content is stored as a document record with full metadata, version history, and audit trail including the generation inputs, model used, and generation timestamp.

Common fits: correspondence and response management, claims and adjudication communication, freedom of information response drafting, grants and regulatory communication, compliance reporting.

Vision and Multi-Modal Document Processing

Process documents that contain visual content — images, diagrams, photographs, mixed-content pages, and handwritten forms — using Amazon Bedrock's vision-capable models to extract meaning, classify content, and generate structured outputs from visual document elements that text-based OCR cannot fully address. Vision processing works alongside FormKiQ's OCR capability, extending document processing to document types where the meaning is carried visually rather than textually.

Common fits: insurance claims image processing, property inspection documentation, grants programs accepting handwritten submissions, healthcare document processing with clinical images, identity and credential verification, construction and engineering programs with mixed technical and visual content.

View the full AI Processing and Analysis module →