DEV Community

Sagara
Sagara

Posted on

Personal Picks: Data Product News (March 19, 2025)

※This is an English translation of my original article in Japanese.

Hello, I'm Sagara.

As a consultant specializing in Modern Data Stack, I'm constantly keeping up with the flood of information in this space. In this article, I'll summarize some interesting Modern Data Stack updates from the past two weeks.

Note: This doesn't cover all the latest information about these products. It only includes information that **I personally found interesting.

Data Warehouse/Data Lakehouse

General

Is Apache Iceberg the New Hadoop? Navigating the Complexities of Modern Data Lakehouses

Data Engineering Weekly published an article comparing Apache Iceberg with Hadoop from implementation challenges and ecosystem perspectives.

The article points out that Iceberg itself is not a complete solution and requires a surrounding ecosystem to function effectively, including catalogs, compute engines, and maintenance processes. Hadoop similarly had a vast ecosystem, and managing it was a major source of complexity - suggesting Iceberg faces similar challenges.

I personally agree with these observations and am interested to see how various vendors will address these operational challenges.

https://www.dataengineeringweekly.com/p/is-apache-iceberg-the-new-hadoop

Snowflake

v9.7 and Bundle 2025_02 Details Released

Snowflake has published the details of their latest version v9.7 and Bundle 2025_02 on their release notes page.

Some notable updates include:

https://docs.snowflake.com/release-notes/2025/9_07

Snowflake Can Now Define External Catalog Iceberg Tables by Referencing Amazon S3 Tables

A Snowflake Product Manager published a blog announcing that Snowflake can now define external catalog Iceberg tables by referencing Amazon S3 Tables. This feature is currently in preview.

(This became possible after Amazon S3 Tables added support for Apache Iceberg REST Catalog API on March 13.)

https://medium.com/snowflake/snowflake-integrates-with-amazon-s3-tables-d6cebf5fdcb2

BigQuery

Article on Cost Optimization with BigQuery Slot Management

Medium published a comprehensive article on cost optimization through BigQuery slot management.

The article systematically covers everything from explaining slots and cost structures to the various features available for slot management, which I found quite helpful.

https://medium.com/google-cloud/bigquery-cost-optimization-with-slot-management-e6eb50697265

MotherDuck/DuckDB

DuckDB Announces Support for Apache Iceberg REST Catalog

DuckDB has announced support for Apache Iceberg REST Catalog.

Previously, DuckDB could only reference Iceberg by directly specifying metadata files on storage, so being able to connect with a catalog allows access to the latest metadata files, which I personally find very exciting!

The article below discussing this feature mentions integration with S3 Tables.

https://duckdb.org/2025/03/14/preview-amazon-s3-tables

DuckDB Announces "Local UI" - A GUI Tool for Local PC Execution

DuckDB has announced "Local UI," a GUI tool that can be run on local PCs.

As shown in the image from the blog below, it allows you to execute and analyze queries in a notebook format while obtaining statistics for tables and columns.

https://duckdb.org/2025/03/12/duckdb-ui.html

basics

Data Transform

dbt

Article on Git Branching Strategies for dbt

The dbt Developer blog published an article summarizing best practices for Git branching strategies when using dbt.

https://docs.getdbt.com/blog/git-branching-strategies-with-dbt

On a related note, I recently published an article about dbt project structure and development processes when using Dev/QA/Prod environments in dbt Cloud, which might also be helpful.

https://dev.classmethod.jp/articles/dbt-cloud-dev-qa-prod-environments/

Article on dbt's New Engine: minijinja

The dbt Developer blog published an article about dbt's new engine, which uses Rust-implemented minijinja to dramatically speed up project parsing. Compared to traditional Jinja-based parsing, it achieves over 30x speed improvements, especially for large projects!

https://docs.getdbt.com/blog/faster-project-parsing-with-rust

More details will be explained at the upcoming dbt Developer Day on March 19-20, 2025. If you're interested, please join.

https://www.getdbt.com/resources/webinars/dbt-developer-day

Business Intelligence

Looker

Release Notes for Version 25.4 Published

Looker has published the release notes for their latest version, 25.4.

I found these updates particularly interesting:

  • Added gemini_in_looker role as a permission to use Gemini in Looker
  • Studio in Looker activation is no longer required to use Conversational Analytics
    • I've written a blog post about this feature that might be helpful
  • Support for key pair authentication in Snowflake connection settings

https://cloud.google.com/looker/docs/release-notes

Omni

Raised $69M in Series B Funding on Their 3rd Anniversary

Omni announced that they've raised $69M in Series B funding on their third anniversary.

Translating a quote from the article: "We're proud to announce that Omni has completed a $69 million Series B funding round led by ICONIQ Growth, with participation from existing investors Theory Ventures, First Round Capital, Redpoint Ventures, GV, and Snowflake Ventures. At this milestone of our third year since founding, our valuation has reached $650 million, with 8x year-over-year growth in both revenue and customer usage."

https://omni.co/blog/omni-series-b-funding

Data Catalog

Atlan

Visualize ER Diagrams Based on Excel-Defined Definitions

Atlan announced a new feature that allows users to import ER diagrams into Atlan using Excel templates.

https://shipped.atlan.com/import-data-modeling-assets-into-atlan-203xo4

ER diagram visualization

Select Star

Released New Features Including Fivetran Integration and Improved Tableau Integration

Select Star released new features including Fivetran integration and improved Tableau integration.

Regarding Tableau specifically, it's now possible to manage sheets and views separately, identifying which sheets are in which views and displaying lineage. (Personally, I was most surprised to learn that custom SQL can also be displayed in lineage!)

https://docs.selectstar.com/changelog/march-12-2025-fivetran-integration-tableau-updates-and-more

Tableau integration

Data Activation (Reverse ETL)

Census

Census Announces "Census Store" to Manage Catalog and Data Updates in Iceberg Format

Census announced a new feature called "Census Store" that manages catalog and data updates in Iceberg format.

https://www.getcensus.com/blog/why-we-chose-iceberg-as-the-foundation-for-census-store

According to the documentation, it's possible to use both Census-managed S3 buckets and user-managed S3 buckets to manage data.

https://docs.getcensus.com/misc/data-storage/census-store

Documentation is also available for using Census Store with Snowflake as an example use case.

https://docs.getcensus.com/misc/data-storage/census-store/query-census-store-from-snowflake

Census also announced "SaaS Datasets" as a feature to load SaaS data into Census Store. It currently supports Salesforce and HubSpot.

https://www.getcensus.com/blog/introducing-census-saas-datasets-connect-transform-and-activate-your-saas-data

Similar to Fivetran's Managed Data Lake Service, I'm seeing a gradual increase in features where products manage catalog and data updates themselves.

Data Orchestration

Dagster

Announced "dg CLI" and "Components"

Dagster's OSS version announced new features: "dg CLI" and "Components" (currently in preview).

"dg CLI" is a CLI tool for managing Dagster projects, and "Components" is a feature that allows defining Dagster-related resources using YAML.

https://github.com/dagster-io/dagster/discussions/28472

5 Playwright CLI Flags That Will Transform Your Testing Workflow

  • 0:56 --last-failed
  • 2:34 --only-changed
  • 4:27 --repeat-each
  • 5:15 --forbid-only
  • 5:51 --ui --headed --workers 1

Learn how these powerful command-line options can save you time, strengthen your test suite, and streamline your Playwright testing experience. Click on any timestamp above to jump directly to that section in the tutorial!

Top comments (0)

Sentry image

See why 4M developers consider Sentry, “not bad.”

Fixing code doesn’t have to be the worst part of your day. Learn how Sentry can help.

Learn more

👋 Kindness is contagious

Engage with a wealth of insights in this thoughtful article, valued within the supportive DEV Community. Coders of every background are welcome to join in and add to our collective wisdom.

A sincere "thank you" often brightens someone’s day. Share your gratitude in the comments below!

On DEV, the act of sharing knowledge eases our journey and fortifies our community ties. Found value in this? A quick thank you to the author can make a significant impact.

Okay