✨ Vision-Based AI Agent Scraper - with Google Sheets, ScrapingBee, and Gemini

Last edited 39 days ago

Important Notes:

Check Legal Regulations:

This workflow involves scraping, so ensure you comply with the legal regulations in your country before getting started. Better safe than sorry!

Workflow Description:

😮‍💨 Tired of struggling with XPath, CSS selectors, or DOM specificity when scraping ?

This AI-powered solution is here to simplify your workflow! With a vision-based AI Agent, you can extract data effortlessly without worrying about how the DOM is structured.

This workflow leverages a vision-based AI Agent, integrated with Google Sheets, ScrapingBee, and the Gemini-1.5-Pro model, to extract structured data from webpages. The AI Agent primarily uses screenshots for data extraction but switches to HTML scraping when necessary, ensuring high accuracy.

Key Features:

  • Google Sheets Integration: Manage URLs to scrape and store structured results.
  • ScrapingBee: Capture full-page screenshots and retrieve HTML data for fallback extraction.
  • AI-Powered Data Parsing: Use Gemini-1.5-Pro for vision-based scraping and a Structured Output Parser to format extracted data into JSON.
  • Token Efficiency: HTML is converted to Markdown to optimize processing costs.

This template is designed for e-commerce scraping but can be customized for various use cases.

New to n8n?

Need help building new n8n workflows? Process automation for you or your company will save you time and money, and it's completely free!