Datavolo's blog
What's in Datavolo's blog? Insights and inspiration, case studies and community for AI/ML and Data Engineers. Discover what we are talking about everyday here at Datavolo!
Fueling your AI Chatbots with Slack
The true power of chatbots is not in how much the large language model (LLM) powering it understands. It’s the ability to provide relevant, organization-specific information to the LLM so that it can provide a natural language interface to vast amounts of data. That...
Datavolo Architecture Viewpoint
The Evolving AI Stack Datavolo is going to play in three layers of the evolving AI stack: data pipelines, orchestration, and observability & governance. The value of any stack is determined by the app layer, as we saw with Windows, iOS, and countless other...
What is Data Observability for AI?
In today's data-driven world, understanding and measuring what is happening within and between disparate IT systems is paramount. Modern distributed application systems utilizing complex architectures with microservices and cloud-based infrastructure require a...
Reducing Observability Costs and Improving Operational Support at Datavolo
Finding the Observability Balance Through our evaluation of observability options at Datavolo, we’ve seen a lot of strong vendors providing real-time dashboards, ML-driven alerting, and every feature our engineers would use to evaluate our services across the three...
ETL is dead, long live ETL (for multimodal data)
Why did ELT become the most effective pattern for structured data? A key innovation in the past decade that unlocked the modern data stack was the decoupling of storage and compute enabled by cloud data warehouses as well as cloud data platforms like Databricks. This...
NiFi FlowGen Improvements at Datavolo (already!)
In the past week, since Datavolo released its Flow Generation capability, we've witnessed fantastic adoption as users have eagerly requested flows from the Flow Generation bot. We're excited to share that we have recently upgraded our models, enhancing both the power...
Seven Strategies for Securing Data Ingest Pipelines
Introduction Information security is an elusive but essential quality of modern computer systems. Implementing secure design principles involves different techniques depending on the domain, but core concepts apply regardless of architecture, language, or layers of...
The Evolution of AI Engineering and Datavolo’s Role
Humility is the first lesson In the machine learning era of software engineering, one persistent truth has emerged: engineers are increasingly submitting to the will of the machine. A significant milestone in the transition from classical machine learning to deep...
Introducing our GenAI NiFi Flow Builder!
Hey everyone, it's been an incredible journey over the past ten years since we open-sourced Apache NiFi. Right from the beginning, our mission with NiFi was crystal clear: to make it easier for all of you to gather data from...
Field CTO Perspectives: Why Datavolo and Why Now?
Setting the Stage There are a few times in our lives when we feel the ground shifting under our feet due to seismic shifts in technology. You know these paradigm shifts are truly seismic when they lead to broader changes in society–the web, search engines, mobile, and...
GenAI/RAG is “Homecoming for NiFi”
While NiFi has been developed, enhanced, and hardened over the last 17 years, it feels as if GenAI is the very purpose for which it was originally developed.
Multimodal AI Demands Multimodal Data Pipelines
Innovation is the driving force behind human progress and we believe in the power of technology to enable humans to push beyond the boundaries of what’s possible. In this rapidly evolving landscape, staying ahead of the curve is what sets good organizations apart from the great ones. Over the years, the realm of possibility has expanded in both incremental steps and monumental leaps. Let’s take a closer look at the transformative journey from “Big Data” to Generative AI.