0.41.0

Features🔗

  • Multi-account application
  • Multiple users per account (multiple users can access the same account)
  • Handle multiple documents as one unit
  • OCR using tesseract
  • Full-Text Search based on Apache SOLR or PostgreSQL's text search
  • Conversion to PDF: all files are converted into a PDF file. PDFs with only images (as often returned from scanners) are converted into searchable PDF/A pdfs.
  • A powerful query language to find documents
  • The possibility to download all the filtered documents (once at least one filter has been selected).
  • use bookmarks to save more complex queries
  • customizable dashboards as the main page
  • Non-destructive: all your uploaded files are never modified and can always be downloaded untouched
  • Organize files using tags, folders, Custom Fields and other metadata
  • Text is analysed to find and attach meta data automatically
  • Keywords metadata from pdf files is extracted and matched against tags in the database
  • Manage document processing: cancel jobs, set priorities
  • Everything available via a documented REST Api; allows to generate clients for many languages
  • OpenID Connect support allows Docspell to integrate into your SSO setup, for example with keycloak.
  • Two-Factor Authentication using TOTP built in
  • mobile-friendly Web-UI with dark and light theme
  • Create anonymous “upload-urls” to upload files; these urls allow to be configured with metadata like tags, folder etc that are applied to all files uploaded through this url
  • Share documents via cryptic public links (optionally protected by a password)
  • Send documents via e-mail
  • Notifications for documents with due dates or events via E-Mail, Matrix or Gotify
  • Read your mailboxes via IMAP to import mails into docspell
  • Edit multiple items at once
  • Merge multiple items
  • REST server and document processing are separate applications which can be scaled-out independently
  • Everything stored in a SQL database: PostgreSQL, MariaDB or H2
    • H2 is embedded, a "one-file-only" database, avoids installing db servers
  • Choose a file backend to store files right in the database, in a S3 compatible storage or also use the filesystem directly
  • Support for encrypted PDFs
  • Files supported:
    • Documents:
      • PDF
      • common MS Office (doc, docx, xls, xlsx)
      • OpenDocument (odt, ods)
      • RichText (rtf)
      • Images (jpg, png, tiff)
      • HTML
      • text/* (treated as Markdown)
    • Archives (extracted automatically, can be nested)
      • zip
      • eml (e-mail files in plain text MIME)
  • Extend Docspell via addons
  • Tooling:
  • License: AGPLv3

Limitations🔗

These are current known limitations that may be of interest for considering docspell at the moment.

  • Documents cannot be modified.
  • You can remove and add documents but there is no versioning.
  • There is no fine grained permission management