All Data and AI Weekly 218-01 Dec 2025
( AI, Data, NiFi, Iceberg, Polaris, Streamlit, Flink, Kafka, Python, Java, SQL, MCP, LLM, RAG, Cortex AI, AISQL, Search, Unstructured Data )
#218-01December2025
🚀 NiFi + AI + AI Data Cloud + Iceberg. 🚀
Philly, Princeton, NYC and Youtube Events
Code and Open Source Projects
https://github.com/tspannhw/TrafficAI/tree/main/Agents
https://github.com/tspannhw/conferences
Focus: Cloud Architecture, Snowflake Data Cloud, & AI Agents
New
New Kitty! His name is Hot Pocket. His is smol.
🏛️ The Look Back: Architectural Foundations
Reflecting on core principles that shape modern software engineering.
-
The 12-Factor App Revisited
- Why 12-Factor Application Patterns Matter & Deep Dive into 12-Factor – A refresher on the methodology for building software-as-a-service apps, emphasizing declarative setups and cloud-native resilience.
-
Legends of Big Data
- A Bootiful Podcast: Tim Spann – A conversation with Big Data legend Tim Spann on the evolution of the Spring community and streaming data.
❄️ Snowflake Data Cloud Updates
Major moves in Data Governance, Cortex AI, and Pipeline Engineering.
AI & Cortex Ecosystem
- Models & Agents: Snowflake has expanded its AI capabilities significantly.
-
Search & Optimization:
- Cortex Search: Boosts & Decays – Fine-tuning search relevance.
- Cortex AI-to-SQL Optimization
Engineering & Governance
- Compliance: OneTrust Partnership for Data-Level Compliance and Trust Center Extensions.
-
Pipelines:
- Unstructured Data Pipeline Setup (SQL)
- Real-time Dashboards with Apache Superset
- Snowflake Flow Diff – A tool for comparing flow definitions.
- Semantic View Terraform Provider
- Knowledge Sharing:
🤖 The AI Frontier: Agents & LLMs
Emerging tools for building autonomous agents and open-source models.
-
Agent Frameworks:
- Pydantic AI Agents – A framework for building production-grade agents.
- FinRobot – An open-source AI agent platform specifically for finance.
- Asterisk AI Voice Agent – Real-time voice interaction capabilities.
-
Model Control Protocol (MCP):
- MCP-UI Organization – User interface components for MCP.
-
New Models:
- Segment Anything 3 (SAM3) – The latest image segmentation model from Meta.
🛠️ Developer Toolkit
Utilities to boost productivity and handle data at the edge.
-
CLI & Terminal:
- Gemini CLI Tips – Mastering Google's AI from the command line.
- Parquet Tools – Essential utility for inspecting parquet files.
- Cmux – A terminal multiplexer for managing multiple streams.
-
Applied AI:
- Building Transit Ridership Analysis with Cursor AI – A practical guide to using AI editors for data projects.
-
Hardware/IoT:
- Meshtastic Devices – Open source, off-grid, decentralized mesh networking.
📅 Upcoming Events & Labs
- Virtual Lab: Building Your First Multimodal Document Pipeline (Dec 4, 2025)
Thanks
https://github.com/timothyspann
© 2020-2025 Tim Spann https://www.youtube.com/@FLaNK-Stack
Top comments (0)