Manipulate PDF with Adobe developer API

Nodes

3b9f1b78-6b80-4575-aa6c-96698700038916a2bb8f-ebdf-4229-90b3-5fd853c81d9e767a4dc9-a96d-4faf-8994-3ca756304676+1

Tags

Created by

didigi-stud.io

Last edited 39 days ago

Adobe developer API

Did you know that Adobe provides an API to perform all sort of manipulation on PDF files :

  • Split PDF, Combine PDF
  • OCR
  • Insert page, delete page, replace page, reorder page
  • Content extraction (text content, tables, pictures)
  • ...

The free tier allows up to 500 PDF operation / month. As it comes directly from Adobe, it works often better than other alternatives.

Adobe documentation:

What does this workflow do

The API is a bit painful to use. To perform a transformation on a PDF it requires to

  • Authenticate and get a temporal token
  • Register a new asset (file)
  • Upload you PDF to the registered asset
  • Perform a query according to the transformation requested
  • Wait for the query to be proccessed by Adobe backend
  • Download the result

This workflow is a generic wrapper to perform all these steps for any transformation endpoint. I usually use it from other workflow with an Execute Workflow node.

Examples are given in the workflow.

Example use case

This service is useful for example to clean PDF data for an AI / RAG system.

My favorite use-case is to extract table as images and forward images to an AI for image recognition / description which is often more accuarate than feedind raw tabular data to a LLM.

New to n8n?

Need help building new n8n workflows? Process automation for you or your company will save you time and money, and it's completely free!