Datafold, a San Francisco, CA-based data reliability platform that automates the most tedious parts of data engineering workflows, raised $20m in Series A funding.
Backers included NEA and Amplify Partners. Peter Sonsini, general partner at NEA, has joined Datafold’s board in conjunction with the funding.
The company intends to use the funds to continue to expand operations and its business reach.
Led by Gleb Mezhanskiy, founder and CEO, Datafold is a data reliability platform that helps data teams deliver data products via the ability to identify, prioritize, and investigate data quality issues proactively before they affect production. Its flagship feature, Data Diff, clearly shows data practitioners how a change in the data processing code will impact the resulting data and downstream products, such as BI dashboards and ML models. When integrated into the CI/CD process, Data Diff automates the data QA process to ensure that every proposed change (pull request) is tested before it gets shipped to production. Using its own SQL compiler, Datafold analyzes every query ever executed in the data warehouse to produce a graph of dependencies to see how data is produced and consumed, with even correlated subqueries, CASE WHEN statements, and other complex queries covered.
Customers include Thumbtack, Patreon, Faire, and Dutchie.