Convert HTML to PDF & Extract Text from PDFs with CustomJS API

Węzły

2e3e39e3-0e17-4fae-9ac5-9ee31f9711ed

Stworzone przez

CuCustomJS

Ostatnio edytowane 8 dni temu

This n8n workflow illustrates how to convert PDF files into text with the PDF Toolkit from www.customjs.space.

@custom-js/n8n-nodes-pdf-toolkit

Notice

Community nodes can only be installed on self-hosted instances of n8n.

What this workflow does

  • Change the requested HTML to PDF..
  • Extract text from the PDF.
  • Use a Code node to handle URLs that point to PDF files.
  • Convert the PDF to text.

Requirements

  • Self-hosted n8n instance.
  • CustomJS API key for converting PDF to text.
  • HTML Data to convert PDF files.
  • Code node for handling URL that indicates PDF file.

Workflow Steps:

  1. Manual Trigger:

    • Runs with user interaction.
  2. HTML to PDF:

    • Request HTML Data
    • Convert HTML to PDF
  3. Convert PDF to Text:

    • Convert the generated Text from PDF

Usage

Get API key from customJS

  • Sign up to customJS platform.
  • Navigate to your profile page
  • Press "Show" button to get API key

Set Credentials for CustomJS API on n8n

Copy and paste your API key generated from CustomJS here.

Design workflow

  • A Manual Trigger for starting workflow.
  • HTTP Request Nodes for downloading PDF files.
  • Code node for handling URL that indicates PDF file.
  • Convert PDF to Text.

You can replace logic for triggering and returning results. For example, you can trigger this workflow by calling a webhook and get a result as a response from webhook. Simply replace Manual Trigger and Write to Disk nodes.

Nowy w Świecie n8n?

Potrzebujesz pomocy przy budowie nowych schematów n8n? Automatyzajce procesów dla Ciebie lub Twojej firmy pozwolą oszczędzić ci czas i pieniądze, a do tego bez żadnych kosztów!