Convert HTML to PDF & Extract Text from PDFs with CustomJS API


Created by CustomJS

Last edited 56 days ago

This n8n workflow illustrates how to convert HTML into a PDF file and then extract the text from that PDF with the PDF Toolkit from www.customjs.space.

@custom-js/n8n-nodes-pdf-toolkit

Notice

Community nodes can only be installed on self-hosted instances of n8n.

What this workflow does

  • Convert the requested HTML to a PDF.
  • Extract text from the PDF.
  • Use a Code node to handle URLs that point to PDF files.
  • Convert the PDF to text.
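
The Code node step in the list above might look roughly like the following. This is a minimal sketch, not the workflow's actual code: the helper names `looksLikePdfUrl` and `tagPdfUrls` and the `url` and `isPdf` fields are assumptions made for illustration. It only relies on the fact that n8n items are objects with a `json` property.

```javascript
// Hypothetical helper: decides whether a URL string looks like it points to a PDF file.
function looksLikePdfUrl(rawUrl) {
  if (typeof rawUrl !== 'string') {
    return false; // missing or non-string values are never treated as PDF links
  }
  // Drop the query string and fragment so only the address and path are compared.
  const withoutQuery = rawUrl.trim().split(/[?#]/)[0];
  return withoutQuery.toLowerCase().endsWith('.pdf');
}

// Hypothetical helper: adds an `isPdf` flag to every item.
// Items use the n8n shape { json: { ... } }; the `url` field name is an assumption.
function tagPdfUrls(items) {
  return items.map((item) => ({
    json: { ...item.json, isPdf: looksLikePdfUrl(item.json.url) },
  }));
}
```

Inside a Code node set to run once for all items, the last line would be something like `return tagPdfUrls($input.all());`.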

Requirements

  • Self-hosted n8n instance.
  • CustomJS API key for converting PDF to text.
  • HTML data to convert into a PDF file.
  • Code node for handling URLs that point to PDF files.

Workflow Steps:

  1. Manual Trigger:

    • Starts the workflow when the user runs it manually.
  2. HTML to PDF:

    • Request HTML Data
    • Convert HTML to PDF
  3. Convert PDF to Text:

    • Extract the text from the generated PDF
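
To make the "HTML Data" in step 2 concrete, here is a small sketch of how the HTML string could be prepared before it is handed to the HTML to PDF step. The helpers `escapeHtml` and `buildHtml` are hypothetical and not part of the PDF Toolkit; any valid HTML string would presumably serve as input.

```javascript
// Escapes characters that have special meaning in HTML.
function escapeHtml(text) {
  return String(text)
    .replace(/&/g, '&amp;')
    .replace(/</g, '&lt;')
    .replace(/>/g, '&gt;')
    .replace(/"/g, '&quot;');
}

// Hypothetical helper: builds a minimal HTML document from a title and a list of paragraphs.
function buildHtml(title, paragraphs) {
  const body = paragraphs.map((p) => `<p>${escapeHtml(p)}</p>`).join('\n');
  return [
    '<!DOCTYPE html>',
    '<html>',
    `<head><meta charset="utf-8"><title>${escapeHtml(title)}</title></head>`,
    '<body>',
    `<h1>${escapeHtml(title)}</h1>`,
    body,
    '</body>',
    '</html>',
  ].join('\n');
}
```

Escaping the text first keeps characters such as `<` and `&` from being interpreted as markup when the document is rendered.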

Usage

Get an API key from CustomJS

  • Sign up on the CustomJS platform.
  • Navigate to your profile page.
  • Press the "Show" button to reveal your API key.

Set credentials for the CustomJS API in n8n

Copy the API key generated by CustomJS and paste it into the CustomJS API credential in n8n.

Design workflow

  • A Manual Trigger for starting the workflow.
  • HTTP Request nodes for downloading PDF files.
  • Code node for handling URLs that point to PDF files.
  • Convert PDF to Text.

You can replace the logic for triggering the workflow and returning results. For example, you can trigger this workflow by calling a webhook and receive the result in the webhook response. Simply replace the Manual Trigger and Write to Disk nodes.
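
If the Manual Trigger is swapped for a webhook, a client could call the workflow roughly as sketched below. The webhook URL, the `html` payload field, and the helper name `buildWebhookRequest` are all assumptions for illustration; the real path and payload depend on how the Webhook node is configured.

```javascript
// Hypothetical helper: prepares a POST request that sends HTML to the workflow's webhook.
// The `html` field name is an assumption; use whatever field your workflow reads.
function buildWebhookRequest(webhookUrl, html) {
  return {
    url: webhookUrl,
    options: {
      method: 'POST',
      headers: { 'Content-Type': 'application/json' },
      body: JSON.stringify({ html }),
    },
  };
}

// Sends the request with fetch and returns the response body as text.
async function callWorkflow(webhookUrl, html) {
  const { url, options } = buildWebhookRequest(webhookUrl, html);
  const response = await fetch(url, options);
  if (!response.ok) {
    throw new Error(`Webhook call failed with status ${response.status}`);
  }
  return response.text();
}
```

Keeping the request construction separate from the network call makes the payload easy to inspect before anything is sent.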
