Skip to main content

Review and Manage Scraped Pages

Review your scraped pages and choose which ones to keep or remove from your knowledge base.

Viewing discovered pages

StepAction
1Go to Dashboard -> Data Sources
2Click your website source
3View the list of discovered pages

Add website URL in Data Sources screen

Each page listed is currently part of your training data.

Removing unwanted pages

StepAction
1Select the page you want to remove
2Click Delete
3Confirm removal
4Retrain the source

Pages removed will no longer be used for AI responses.

Legal pages like:

  • Privacy Policy
  • Terms & Conditions
  • Cookie Policy

contain repetitive legal language.

  • This can introduce noise into AI answers.
  • Removing them improves clarity and reduces irrelevant responses.

Retraining after removal

After removing pages:

StepAction
1Click Retrain
2Wait for processing to complete

Retraining updates the indexed knowledge base.

Important Notes or Limitations
  • Changes do not reflect immediately until retraining completes.