Skip to content

Phase 4: Streaming and advanced connectors#6

Closed
netsirius wants to merge 1 commit intomainfrom
feature/phase4-streaming-connectors
Closed

Phase 4: Streaming and advanced connectors#6
netsirius wants to merge 1 commit intomainfrom
feature/phase4-streaming-connectors

Conversation

@netsirius
Copy link
Copy Markdown
Owner

Summary

Expands the connector ecosystem with streaming support and 6 new connectors.

New source connectors

  • Kafka: batch + streaming modes, configurable offsets, health check
  • MongoDB: collection reads, aggregation pipeline support, TCP health check
  • REST API: generic with offset/cursor pagination, bearer/api-key auth
  • BigQuery source: table reads + SQL queries (was sink-only)

New sink connectors

  • Kafka: batch + streaming with checkpointing and triggers
  • Elasticsearch: Spark ES connector with REST bulk API fallback

Totals

  • 8 source connectors: SQL, PostgreSQL, File, Kafka, MongoDB, REST, BigQuery, Test
  • 6 sink connectors: BigQuery, DeltaLake, File, Kafka, Elasticsearch, Test
  • 14 total connectors

All connectors include healthCheck for weaver doctor.

Test plan

  • sbt compile — All modules compile
  • 47 core tests pass
  • CI green on GitHub Actions

New source connectors:
- Kafka (batch + streaming modes, configurable offsets)
- MongoDB (collection reads, aggregation pipeline support)
- REST API (generic, with offset/cursor pagination, bearer/api-key auth)
- BigQuery source (table reads + SQL queries)

New sink connectors:
- Kafka (batch + streaming with checkpointing)
- Elasticsearch (Spark ES connector with REST bulk API fallback)

All connectors include healthCheck for weaver doctor.

Dependencies:
- kafka-clients 3.7.0 (provided, for health checks)

Schema:
- Updated pipeline.schema.json with new connector types

Total connectors: 8 sources + 6 sinks = 14 connectors
@netsirius netsirius closed this Apr 6, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant