apache/seatunnel

PR-I: Add dedicated knowledge sources

Open

#10,921 opened on May 19, 2026

View on GitHub
 (1 comment) (0 reactions) (0 assignees)Java (1,432 forks)batch import
featurehelp wanted

Repository metrics

Stars
 (6,897 stars)
PR merge metrics
 (Avg merge 19d 8h) (124 merged PRs in 30d)

Description

Scope

Add dedicated knowledge sources after the unified document contract is stable.

Candidate sources include Confluence, Google Drive, and SharePoint.

Deliverables

  • Add Confluence source proposal.
  • Add Google Drive source proposal.
  • Add SharePoint source proposal.
  • Ensure each source follows the unified document contract.
  • Prefer source-native version as DocumentHash priority 1 where available.

Acceptance Criteria

  • Dedicated sources emit or project the standard Knowledge Sync metadata.
  • Connector-specific logic stays inside connector modules.
  • Source-native versioning is used when available.
  • Sources integrate with the existing identity, parse, chunk, embedding, and lifecycle sink path.

Contributor guide