Skip to main content

Add Your Website for Training

Follow simple steps to add your website to Data Sources, and let Witzo automatically extract and structure your content for accurate AI interactions.

Overview

Website scraping is the primary way Witzo builds your knowledge base. Once added, your pages are crawled, processed, and indexed for AI-powered Conversations. This process ensures your AI has the most up-to-date information directly from your site.

Where to add URL

StepAction
1Go to Dashboard → Data Sources
2Click Add URL
3Enter your website homepage URL
4Click Save
5Start training

Login Page

How scraping works

Witzo automates the ingestion process through the following sequence:

This process enables retrieval-augmented generation (RAG), meaning answers are strictly based on your actual website content.

Content Guidelines

Crawl Limits

Limits are determined by your active subscription plan.

Important Notes
  • Only public content is scraped by default.
  • Large sites may take longer to process.
  • Authenticated pages are not supported.
  • JavaScript-heavy sites may have partial coverage.