To all Apache SeaTunnel community members, developers, partners, and friends who care about us:
2025 passed by in the blink of an eye, yet Apache SeaTunnel achieved remarkable growth and many exciting milestones throughout the year! As one of the fastest-growing data integration projects worldwide, we have witnessed our GitHub stars and forks steadily climb, attracting increasing attention from the global community. We released multiple important versions, continuously refining the core engine, enriching the connector ecosystem, and introducing practical new features—pushing the boundaries of performance, stability, and flexibility in data integration.
Thanks to the enthusiastic support of our community, this year’s activities were vibrant and impactful. Apache SeaTunnel also gained broad recognition from enterprises across industries, becoming a core data integration tool for thousands of companies, with its industry influence continuing to expand.
Behind every achievement lies the collaboration and dedication of each community member. Now, let us look back together at the moments we shared throughout this year.
GitHub Metrics
- Stars: As of December 2025, the GitHub star count has surpassed 9k, ranking among the top data integration projects of its kind and making Apache SeaTunnel one of the fastest-growing data integration tools globally.
- Commits: 5,034, reflecting the community’s high iteration efficiency and sustained contribution momentum.
- Forks: 2.2k forks, demonstrating a vibrant open-source ecosystem and strong global developer participation.
- Issues: 2,142 as of December 2025, with continuously improving response efficiency and resolution quality.
- Contributors: 421 contributors from companies and institutions around the world, injecting strong momentum into ecosystem growth.
- Total PRs: 5,542, with a steady increase in merged PRs, enabling efficient feature iteration and bug fixes.
- Lines of Code: 790,690, with continuous improvements to the core engine and connector system, expanding functional coverage.
PMC Overview:
- PMC Members: 22
- Committers: 38
- Contributors: 609
Top 10 Contributors of the Year
Based on comprehensive performance in 2025—including PR submissions, code reviews, documentation improvements, and community support—the Top 10 Contributors of the Year (in no particular order) are as follows:
- Contribution Masters
- Review Stars
- Discussion Heroes
- Issue Reporters
Releases
A total of four versions were released throughout the year: 2.3.9, 2.3.10, 2.3.11, and 2.3.12.
Top 10 Feature Updates
In 2025, Apache SeaTunnel released several major versions, including 2.3.10, 2.3.11, and 2.3.12. These releases continuously expanded the connector ecosystem, optimized the core engine, introduced many practical new features, and comprehensively improved existing functionality:
- New Connectors Added: Version 2.3.12 introduced SensorsData and Databend connectors, further enriching data source coverage and meeting broader industry data integration needs.
- Significant Connector Enhancements: Versions 2.3.10 and 2.3.12 continuously enhanced existing connectors. Paimon now supports multi-source concurrency, permission validation, and LIKE/IN predicate pushdown; ClickHouse supports multi-table parallel reads and parallel schema fetching; MaxCompute Sink supports append upsert & delete session modes.
- LLM and Vector Processing Enhancements: Version 2.3.10 introduced support in Transforms-V2 for handling non-standard LLM responses, enabled Zhipu AI for Embedding and LLM modules, and enhanced JSONPath support for Map and Array types—better adapting to AI-driven data processing scenarios.
- Custom Encryption and Decryption Configuration: Version 2.3.10 added support for custom encryption/decryption configuration keys, improving flexibility and data security to meet enterprise-level encryption requirements.
- Zeta Engine Performance and Observability Improvements: Version 2.3.12 introduced fine-grained checkpoint monitoring, REST APIs returning SQL-format results, job metadata with startTime, and observable task queue sizes—significantly improving engine stability and operational convenience.
- File Connector Enhancements: Version 2.3.12 added support for binary chunking, customizable CSV delimiters, and filtering files by last modified time, covering more file-processing scenarios.
- SQL Transform Capability Upgrades: Version 2.3.12 introduced COALESCE type conversion, multi_if, vector functions, and Murmur64 hashing, greatly enhancing SQL processing flexibility.
- Multi-Scenario Connector Optimization: Various optimizations were made to HBase, Oracle-CDC, Google Sheets, DingTalk, and Slack connectors. Enhancements and parameter optimizations were also applied to StarRocks, JDBC (SQLServer/Dameng), Iceberg, Redis, and others, improving multi-source adaptability.
- Core Module Stability Improvements: Fixed issues such as incorrect Milvus SourceReader state checks, repeated Kafka source reads, and CSV read/write exceptions, as well as resolving problems in Doris, Mongo-CDC, Hive, and other scenarios to ensure production stability.
- Comprehensive Documentation Improvements: Fixed dead links and parameter errors across multiple connector documents, added documentation for Iceberg S3 Tables and JDBC GenericDialect, and supplemented Chinese documentation translations to enhance readability and usability.
Community Activities Review
- CommunityOverCode 2025: Actively participated in this global open-source event, co-organized a DataOps track, and shared multiple Apache SeaTunnel innovations and practices in data integration, expanding the project’s international influence.
- Technical Sharing Sessions: Regular online technical sharing events were held, with 13 successful sessions in 2025. Community experts, core contributors, and enterprise practitioners shared the latest technical progress and real-world use cases, generating strong engagement:
- Scaling Apache SeaTunnel for Enterprise: Billion-Level Data Processing and Intelligent Fault Tolerance in Real-World Use Cases (Speaker: Shi Desheng, Senior Big Data Engineer at a Cybersecurity Company)
- Building Scalable Data Pipelines: Apache SeaTunnel Meets Cloudberry
- Designing and Building X2SeaTunnel through AI Coding (Speaker: Wang Xiaogang, Active Apache SeaTunnel Contributor, China Telecom Cloud Big Data Expert)
- Unifying Multiple Data Pipelines with SeaTunnel: Practical Notes from Tongcheng Travel (Speaker: Zhou Xiaochen, Data Pipeline Lead at Tongcheng Travel)
- From “Decentralized” to “Unified”: SUPCON Uses SeaTunnel to Build an Efficient Data Collection Framework, Achieving 0 Failures in Core Data Synchronization Tasks! (Speaker: Cui Junle, Data Technology Supervisor at SUPCON)
- From Hour-Level to Minute-Level: How DMALL Cut Data Integration Costs by Two-Thirds with SeaTunnel (Speaker: Jia Min, Senior Big Data R&D Engineer at DMALL)
- Amazon Cloud Solutions Architect Shows You How to Migrate Data to Amazon Aurora DSQL Using Apache SeaTunnel
SeaTunnel Community “Demo Ark Program”:
- Phase 1: Synchronizing Data from MySQL to PostgreSQL Using Apache SeaTunnel (Speaker: Ma Quantai, Data Warehouse Engineer at AUX)
- Phase 2: Seamlessly Merging and Syncing MySQL Databases with Apache SeaTunnel (Speaker: Chen Fei, Big Data Engineer at ChinaPay)
- Community Calls: Biweekly community meetings to synchronize project progress, define development plans, and address real project challenges.
- OSPP: Students Dong Jiaxin (University of Science and Technology Beijing) and Wu Tianyu (Shanghai Jiao Tong University) contributed schema evolution support for CDC sources on the Flink engine and Metalake support, respectively, significantly enhancing project capabilities.
📓 Final Project Report 1: Schema Evolution Support on Apache SeaTunnel Flink Engine
📒 Final Project Report 2: Apache SeaTunnel Adds Metalake Support
- Monthly “Merge Star” Awards: Monthly recognition of outstanding contributors, with over 90 contributors awarded throughout the year, continuously motivating participation and energizing the open-source ecosystem.
Community Ecosystem Expansion
- Extensive Enterprise Adoption: Serving as the core data integration tool for thousands of enterprises worldwide across finance, retail, internet, energy, and government sectors. In DMALL’s new retail scenarios, it supports PB-level real-time data synchronization; in a leading financial institution, it enables efficient cross-source integration with an 80% improvement in data processing efficiency.
- Community Partnerships:
- Participated as a core partner in the AI Hackathon jointly organized by OceanBase and Ant Open Source, with Machine Heart as co-organizer
- Integrated with the Cloudberry database, exploring future expansion toward high-performance scenarios
- Commercial Edition Enhancements: Commercial products based on Apache SeaTunnel continued to evolve, serving leading enterprises with new features such as enterprise-grade access control, cross-cluster data synchronization, and visualized operations monitoring, driving synergy between commercialization and the open-source ecosystem.
- Awards and Recognition:
- Awarded “Outstanding Open Source Project” at the 2025 Shanghai Open Source Innovation Elite Conference, further enhancing visibility and industry influence.
- Received the 2025 “Annual Outstanding Technical Team Award” at the 16th China Database Technology Conference (DTCC 2025).
In 2025, Apache SeaTunnel achieved a year full of accomplishments. The community continued to grow, core capabilities advanced steadily, and enterprise recognition increased consistently—establishing SeaTunnel as a benchmark open-source project in the data integration domain. This is not only a celebration of achievements, but also a call to move forward. In the future, we will continue to deepen our focus on data integration, tackle more technical challenges, and expand into broader application scenarios. May we move forward together to build a new open-source ecosystem for data integration and create even more remarkable chapters ahead.







