Convert HTML to PDF & Extract Text from PDFs with CustomJS API
Last edited 56 days ago
This n8n workflow illustrates how to convert PDF files into text with the PDF Toolkit from www.customjs.space.
@custom-js/n8n-nodes-pdf-toolkit
Notice
Community nodes can only be installed on self-hosted instances of n8n.
What this workflow does
- Change the requested HTML to PDF..
- Extract text from the PDF.
- Use a Code node to handle URLs that point to PDF files.
- Convert the PDF to text.
Requirements
- Self-hosted n8n instance.
- CustomJS API key for converting PDF to text.
- HTML Data to convert PDF files.
- Code node for handling URL that indicates PDF file.
Workflow Steps:
-
Manual Trigger:
- Runs with user interaction.
-
HTML to PDF:
- Request HTML Data
- Convert HTML to PDF
-
Convert PDF to Text:
- Convert the generated Text from PDF
Usage
Get API key from customJS
- Sign up to customJS platform.
- Navigate to your profile page
- Press "Show" button to get API key
Set Credentials for CustomJS API on n8n
Copy and paste your API key generated from CustomJS here.
Design workflow
- A Manual Trigger for starting workflow.
- HTTP Request Nodes for downloading PDF files.
- Code node for handling URL that indicates PDF file.
- Convert PDF to Text.
You can replace logic for triggering and returning results. For example, you can trigger this workflow by calling a webhook and get a result as a response from webhook. Simply replace Manual Trigger and Write to Disk nodes.
You may also like
New to n8n?
Need help building new n8n workflows? Process automation for you or your company will save you time and money, and it's completely free!