Full-featured data pipeline

Everything you need topower your RAG pipeline

Connect any data source, embed with any model, and store in any vector database. All without writing a single line of ETL code.

12+
Source Connectors
4
Vector Stores
4
Embedding Models
1
Unified Platform

Connect your data sources

Pull documents from where your team already works. All connectors support browsing and selective sync.

Confluence

Live

Sync pages from Atlassian Confluence with OAuth authentication.

  • OAuth authentication
  • Tree browsing
  • Space filtering
  • Page content extraction

Google Drive

Live

Import documents, PDFs, and sheets from Google Drive.

  • OAuth authentication
  • Folder browsing
  • PDF extraction
  • Multiple file types

Jira

Live

Sync issues and project data from Atlassian Jira with OAuth.

  • OAuth authentication
  • Project browsing
  • Issue sync
  • Field mapping

Dropbox

Live

Access and sync documents and files from Dropbox accounts.

  • OAuth authentication
  • Folder browsing
  • File access
  • File content

Amazon S3

Live

Connect any S3-compatible storage for document ingestion.

  • Access key auth
  • Bucket browsing
  • Prefix filtering
  • Any S3-compatible

Supabase Storage

Live

Sync documents directly from Supabase Storage buckets.

  • API key auth
  • Bucket selection
  • File browsing
  • Direct integration

Notion

Live

Import pages and databases from Notion workspaces.

  • OAuth authentication
  • Page selection
  • Database support
  • Rich content

Website Crawler

Live

Crawl and index web pages and documentation sites.

  • URL-based
  • Depth control
  • Link following
  • SSRF protection

File Upload

Live

Direct upload of PDF, DOCX, TXT, CSV, and JSON files.

  • Drag & drop
  • Multiple formats
  • Bulk upload
  • Progress tracking

GitHub

Live

Sync repositories, issues, and discussions from GitHub.

  • OAuth authentication
  • Repo & org browsing
  • Issues & discussions
  • Code file sync

Azure Blob Storage

Live

Connect Azure Blob Storage containers for document ingestion.

  • Connection string auth
  • Container browsing
  • Virtual directory hierarchy
  • Prefix filtering

Google Cloud Storage

Live

Sync documents from Google Cloud Storage buckets.

  • Service account auth
  • Bucket browsing
  • Virtual directory hierarchy
  • Prefix filtering

Ready to build your RAG pipeline?

Get 250 free credits to start. Begin syncing documents in minutes.

250 free credits to start • No credit card required

Vector Data Loader | ETL Pipeline for Production RAG