Archives

Databricks Announces Major Contributions to Flagship Open Source Projects at Data + AI Summit

Databricks Announces Major Contributions to Flagship Open Source Projects at Data _ AI Summit/IT Digest
Databricks Announces Major Contributions to Flagship Open Source Projects at Data _ AI Summit/IT Digest

Databricks, the data and AI company and pioneer of the data lakehouse paradigm, announced several contributions to popular data and AI open source projects including Delta Lake, MLflow, and Apache Spark.

At the Data + AI Summit, the largest gathering of the open source data and AI community, Databricks announced that the company will contribute all features and enhancements it has made to Delta Lake to the Linux Foundation and open source all Delta Lake APIs as part of the Delta Lake 2.0 release. In addition, the company announced MLflow 2.0, which includes MLflow Pipelines, a new feature to accelerate and simplify ML model deployments. Finally, the company introduced Spark Connect, to enable the use of Spark on virtually any device, and Project Lightspeed, a next generation Spark Structured Streaming engine for data streaming on the lakehouse.

Also Read: Synopsys and Arm Strengthen Partnership to Advance Next-Gen Mobile SoCs

“From the beginning, Databricks has been committed to open standards and the open source community. We have created, contributed to, fostered the growth of, and donated some of the most impactful innovations in modern open source technology,” said Ali Ghodsi, Co-Founder and CEO of Databricks. “Open data lakehouses are quickly becoming the standard for how the most innovative companies handle their data and AI. Delta Lake, MLflow and Spark are all core to this architectural transformation, and we’re proud to do our part in accelerating their innovation and adoption.”

Delta Lake 2.0 Brings the Lakehouse to Everyone
Delta Lake 2.0 will bring unmatched query performance to all Delta Lake users and enable everyone to build a highly performant data lakehouse on open standards. With this contribution, Databricks customers and the open source community will benefit from the full functionality and enhanced performance of Delta Lake 2.0. The Delta Lake 2.0 Release Candidate is now available and is expected to be fully released later this year. The breadth of the Delta Lake ecosystem makes it flexible and powerful in a wide range of use cases. Fueling this is a vibrant community of over 6,400 members, with contributing developers from more than 70 contributing organizations.