Import from Paperless🔗
tools/import-paperless can be used to get started by
importing your data from paperless into docspell.
The script imports the files and also tags and correspondents.
The script was written some while ago. It currently doesn't work out of the box, but is a good starting point as the issues are most probably fixed by adjusting some api calls. Pull requests are very welcome! See this issue.
Copy the script to the machine where paperless is running. Run it with the following arguments:
- URL of Docspell, including http(s)
- Username for Docspell, possibly including Collective (if other name as user)
- Password for Docspell
- Path to Paperless' database file (
db.sqlite3). When using Paperless with docker, it is in the mapped directory
- Path to Paperless' document base directory. When using Paperless with docker, it is the mapped directory
Some settings can be changed inside the script, right at the top:
LIMIT="LIMIT 0"(default: inactive): For testing purposes, limits the number of tags and correspondents read from Paperless (this will most likely lead to warnings when processing the documents)
LIMIT_DOC="LIMIT 5"(default: inactive): For testing purposes, limits the number of documents and document-to-tag relations read from Paperless
SKIP_EXISTING_DOCS=true(default: true): Won't touch already existing documents. If set to
falsedocuments, which exist already, won't be uploaded again, but the tags, correspondent, date and title from Paperless will be applied.
Warning In case you already had set these information in Docspell, they will be overwritten!