Our blog
Insights and inspiration, case studies and community for AI/ML and Data Engineers.
ETL is dead, long live ETL (for multimodal data)
Why did ELT become the most effective pattern for structured data? A key innovation in the past decade that unlocked the modern data stack was the decoupling of storage and compute enabled by cloud data warehouses as well as cloud data platforms like Databricks. This...
NiFi FlowGen Improvements at Datavolo (already!)
In the past week, since Datavolo released its Flow Generation capability, we've witnessed fantastic adoption as users have eagerly requested flows from the Flow Generation bot. We're excited to share that we have recently upgraded our models, enhancing both the power...
Seven Strategies for Securing Data Ingest Pipelines
Introduction: Information security is an elusive but essential quality of modern computer systems. Implementing secure design principles involves different techniques depending on the domain, but core concepts apply regardless of architecture, language, or layers of...
The Evolution of AI Engineering and Datavolo’s Role
Humility is the first lesson: In the machine learning era of software engineering, one persistent truth has emerged: engineers are increasingly submitting to the will of the machine. A significant milestone in the transition from classical machine learning to deep...
Introducing our GenAI NiFi Flow Builder!
Hey everyone, it's been an incredible journey over the past ten years since we open-sourced Apache NiFi. Right from the beginning, our mission with NiFi was crystal clear: to make it easier for all of you to gather data from...
Field CTO Perspectives: Why Datavolo and Why Now?
Setting the Stage: There are a few times in our lives when we feel the ground shifting under our feet due to seismic shifts in technology. You know these paradigm shifts are truly seismic when they lead to broader changes in society–the web, search engines, mobile, and...
GenAI/RAG is “Homecoming for NiFi”
While NiFi has been developed, enhanced, and hardened over the last 17 years, it feels as if GenAI is the very purpose for which it was originally developed.
Multimodal AI Demands Multimodal Data Pipelines
Innovation is the driving force behind human progress, and we believe in the power of technology to enable humans to push beyond the boundaries of what’s possible. In this rapidly evolving landscape, staying ahead of the curve is what separates good organizations from great ones. Over the years, the realm of possibility has expanded in both incremental steps and monumental leaps. Let’s take a closer look at the transformative journey from “Big Data” to Generative AI.