add first draft of wikipedia article by gene-bordegaray · Pull Request #21105 · apache/datafusion

gene-bordegaray · 2026-03-23T01:09:08Z

Which issue does this PR close?

Closes Write a wikipedia article for Apache DataFusion #21076.

Go to this page
Click Edit source
Paste dev/wiki/apache-datafusion.wikitext
Click Show preview

alamb

Thank you @gene-bordegaray -- this looks great. I left some suggestions on how to make some of this language tighter.

Maybe we can wait a few days more and then submit to the wikipedia editors 🤔

alamb · 2026-03-23T13:52:38Z

dev/wiki/apache-datafusion.wikitext

+DataFusion was first announced as a Rust-native query engine for Apache Arrow in February 2019. That announcement said the project had started about two years earlier and had recently been reimplemented to be Arrow-native before its donation to Apache Arrow.<ref name="donation-post" />
+
+After its donation, DataFusion was developed within the Apache Arrow project. Its development during the early 2020s coincided with wider adoption of Rust and Arrow-based analytical systems.<ref name="sigmod-paper" />
+
+In 2024, a paper describing DataFusion was accepted to the industry track of the [[ACM SIGMOD]] conference.<ref name="sigmod-accepted">{{cite web |title=SIGMOD 2024 Industrial Track: Accepted Papers |url=https://2024.sigmod.org/industrial-list.shtml |website=SIGMOD 2024 |access-date=2026-03-22}}</ref><ref name="sigmod-paper" /> On April 16, 2024, the project graduated from Apache Arrow and became a top-level Apache project; the Apache Software Foundation publicly announced the change on June 11, 2024.<ref name="asf-tlp" />


I think we can make this more concise, maybe something like

Suggested change

DataFusion was first announced as a Rust-native query engine for Apache Arrow in February 2019. That announcement said the project had started about two years earlier and had recently been reimplemented to be Arrow-native before its donation to Apache Arrow.<ref name="donation-post" />

After its donation, DataFusion was developed within the Apache Arrow project. Its development during the early 2020s coincided with wider adoption of Rust and Arrow-based analytical systems.<ref name="sigmod-paper" />

In 2024, a paper describing DataFusion was accepted to the industry track of the [[ACM SIGMOD]] conference.<ref name="sigmod-accepted">{{cite web |title=SIGMOD 2024 Industrial Track: Accepted Papers |url=https://2024.sigmod.org/industrial-list.shtml |website=SIGMOD 2024 |access-date=2026-03-22}}</ref><ref name="sigmod-paper" /> On April 16, 2024, the project graduated from Apache Arrow and became a top-level Apache project; the Apache Software Foundation publicly announced the change on June 11, 2024.<ref name="asf-tlp" />

DataFusion originally authored by Andy Grove starting in 2017. It was donated to the Apache Arrow Project in February 2019 <ref name="donation-post" />. In 2024, a paper describing DataFusion was accepted to the industry track of the [[ACM SIGMOD]] conference.<ref name="sigmod-accepted">{{cite web |title=SIGMOD 2024 Industrial Track: Accepted Papers |url=https://2024.sigmod.org/industrial-list.shtml |website=SIGMOD 2024 |access-date=2026-03-22}}</ref><ref name="sigmod-paper" /> In April 2024, the project graduated from Apache Arrow and became a top-level Apache project .<ref name="asf-tlp" />

alamb · 2026-03-23T13:55:29Z

dev/wiki/apache-datafusion.wikitext

+
+=== [[Apache Spark]] ===
+
+[[Apache Spark]] is a distributed analytics framework for processing data at cluster scale.<ref name="spark-sql">{{cite web |title=Spark SQL & DataFrames |url=https://spark.apache.org/sql/ |website=Apache Spark |access-date=2026-03-22}}</ref> DataFusion executes queries within a single process and is aimed at building embedded analytics systems rather than distributed workloads.<ref name="sigmod-paper" /> Apache DataFusion Comet, originally developed at [[Apple Inc.|Apple]] and donated to the Apache Software Foundation, is a native execution plugin that uses DataFusion to accelerate Spark's [[Java virtual machine|JVM]]-based SQL execution engine.<ref name="comet-donation" />


Maybe we should also cite https://auron.apache.org/ as another Apache project accelerating spark using DataFusion

alamb · 2026-03-23T13:58:50Z

dev/wiki/apache-datafusion.wikitext

+DataFusion has been adopted across a range of analytics and database products. [[Palantir Technologies|Palantir]] Foundry's release notes state that its Lightweight Pipelines are powered by DataFusion for rapid, low-latency data processing.<ref name="palantir-2025">{{cite web |title=Announcements: July 2025 |url=https://www.palantir.com/docs/foundry/announcements/2025-07 |website=Palantir Foundry Documentation |publisher=Palantir Technologies |date=2025-07-29 |access-date=2026-03-22}}</ref><ref name="palantir-2024">{{cite web |title=Announcements: February 2024 |url=https://www.palantir.com/docs/foundry/announcements/2024-02 |website=Palantir Foundry Documentation |publisher=Palantir Technologies |date=February 2024 |access-date=2026-03-22}}</ref> [[Cloudflare]] used DataFusion to execute SQL queries over log data stored in Cloudflare R2 in its Log Explorer product.<ref name="cloudflare">{{cite web |title=Cloudflare Log Explorer is now GA, providing native observability and forensics |url=https://blog.cloudflare.com/logexplorer-ga/ |website=The Cloudflare Blog |publisher=Cloudflare |date=2025-06-18 |access-date=2026-03-22}}</ref> [[InfluxDB]] 3.0 was rebuilt on what InfluxData called the FDAP stack: Apache Flight, DataFusion, Arrow, and Parquet.<ref name="influx-fdap">{{cite web |title=Flight, DataFusion, Arrow, and Parquet: Using the FDAP Architecture to build InfluxDB 3.0 |url=https://www.influxdata.com/blog/flight-datafusion-arrow-parquet-fdap-architecture-influxdb/ |website=InfluxData |date=2023-10-25 |access-date=2026-03-22}}</ref> Other adopters described in public references include EDB Postgres AI,<ref name="siliconangle-edb">{{cite web |title=Enterprise DB begins rolling AI features into PostgreSQL |url=https://siliconangle.com/2024/05/23/enterprise-db-begins-rolling-ai-features-postgresql/ |website=SiliconANGLE |date=2024-05-23 |access-date=2026-03-22}}</ref> Cube,<ref name="cube-pushdown">{{cite web |title=Query pushdown in Cube's semantic layer |url=https://cube.dev/blog/query-push-down-in-cubes-semantic-layer |website=Cube |date=2024-06-03 |access-date=2026-03-22}}</ref> Spice AI,<ref name="spice">{{cite web |title=How we use Apache DataFusion at Spice AI |url=https://spice.ai/blog/how-we-use-apache-datafusion-at-spice-ai |website=Spice AI |date=2026-01-17 |access-date=2026-03-22}}</ref> Pydantic Logfire,<ref name="logfire">{{cite web |title=We're changing database |url=https://github.com/pydantic/logfire/issues/408 |website=GitHub |date=2024-08-29 |access-date=2026-03-22}}</ref> and Kamu.<ref name="kamu">{{cite web |title=100X faster ingestion, and FlightSQL support for connecting BI tools |url=https://www.kamu.dev/blog/2023-09-datafusion-flightsql/ |website=Kamu Data |date=2023-09-26 |access-date=2026-03-22}}</ref>
+
+In 2024, ''CRN'' included Apache DataFusion in its list of "The 10 Coolest Open-Source Software Tools Of 2024".<ref name="crn">{{cite web |title=The 10 Coolest Open-Source Software Tools Of 2024 |url=https://www.crn.com/news/software/2024/the-10-coolest-open-source-software-tools-of-2024?page=3 |website=CRN |date=2024-11-21 |access-date=2026-03-22}}</ref>
+


I think we can potentially trim this section down too to make it more concise.

Suggested change

DataFusion has been adopted across a range of analytics and database products. [[Palantir Technologies|Palantir]] Foundry's release notes state that its Lightweight Pipelines are powered by DataFusion for rapid, low-latency data processing.<ref name="palantir-2025">{{cite web |title=Announcements: July 2025 |url=https://www.palantir.com/docs/foundry/announcements/2025-07 |website=Palantir Foundry Documentation |publisher=Palantir Technologies |date=2025-07-29 |access-date=2026-03-22}}</ref><ref name="palantir-2024">{{cite web |title=Announcements: February 2024 |url=https://www.palantir.com/docs/foundry/announcements/2024-02 |website=Palantir Foundry Documentation |publisher=Palantir Technologies |date=February 2024 |access-date=2026-03-22}}</ref> [[Cloudflare]] used DataFusion to execute SQL queries over log data stored in Cloudflare R2 in its Log Explorer product.<ref name="cloudflare">{{cite web |title=Cloudflare Log Explorer is now GA, providing native observability and forensics |url=https://blog.cloudflare.com/logexplorer-ga/ |website=The Cloudflare Blog |publisher=Cloudflare |date=2025-06-18 |access-date=2026-03-22}}</ref> [[InfluxDB]] 3.0 was rebuilt on what InfluxData called the FDAP stack: Apache Flight, DataFusion, Arrow, and Parquet.<ref name="influx-fdap">{{cite web |title=Flight, DataFusion, Arrow, and Parquet: Using the FDAP Architecture to build InfluxDB 3.0 |url=https://www.influxdata.com/blog/flight-datafusion-arrow-parquet-fdap-architecture-influxdb/ |website=InfluxData |date=2023-10-25 |access-date=2026-03-22}}</ref> Other adopters described in public references include EDB Postgres AI,<ref name="siliconangle-edb">{{cite web |title=Enterprise DB begins rolling AI features into PostgreSQL |url=https://siliconangle.com/2024/05/23/enterprise-db-begins-rolling-ai-features-postgresql/ |website=SiliconANGLE |date=2024-05-23 |access-date=2026-03-22}}</ref> Cube,<ref name="cube-pushdown">{{cite web |title=Query pushdown in Cube's semantic layer |url=https://cube.dev/blog/query-push-down-in-cubes-semantic-layer |website=Cube |date=2024-06-03 |access-date=2026-03-22}}</ref> Spice AI,<ref name="spice">{{cite web |title=How we use Apache DataFusion at Spice AI |url=https://spice.ai/blog/how-we-use-apache-datafusion-at-spice-ai |website=Spice AI |date=2026-01-17 |access-date=2026-03-22}}</ref> Pydantic Logfire,<ref name="logfire">{{cite web |title=We're changing database |url=https://github.com/pydantic/logfire/issues/408 |website=GitHub |date=2024-08-29 |access-date=2026-03-22}}</ref> and Kamu.<ref name="kamu">{{cite web |title=100X faster ingestion, and FlightSQL support for connecting BI tools |url=https://www.kamu.dev/blog/2023-09-datafusion-flightsql/ |website=Kamu Data |date=2023-09-26 |access-date=2026-03-22}}</ref>

In 2024, ''CRN'' included Apache DataFusion in its list of "The 10 Coolest Open-Source Software Tools Of 2024".<ref name="crn">{{cite web |title=The 10 Coolest Open-Source Software Tools Of 2024 |url=https://www.crn.com/news/software/2024/the-10-coolest-open-source-software-tools-of-2024?page=3 |website=CRN |date=2024-11-21 |access-date=2026-03-22}}</ref>

DataFusion has been adopted across a range of analytics and database products. [[Palantir Technologies|Palantir]] Lightweight Pipelines are powered by DataFusion.<ref name="palantir-2025">{{cite web |title=Announcements: July 2025 |url=https://www.palantir.com/docs/foundry/announcements/2025-07 |website=Palantir Foundry Documentation |publisher=Palantir Technologies |date=2025-07-29 |access-date=2026-03-22}}</ref><ref name="palantir-2024">{{cite web |title=Announcements: February 2024 |url=https://www.palantir.com/docs/foundry/announcements/2024-02 |website=Palantir Foundry Documentation |publisher=Palantir Technologies |date=February 2024 |access-date=2026-03-22}}</ref> [[Cloudflare]] used DataFusion in its Log Explorer product to execute SQL queries over log data stored in Cloudflare R2.<ref name="cloudflare">{{cite web |title=Cloudflare Log Explorer is now GA, providing native observability and forensics |url=https://blog.cloudflare.com/logexplorer-ga/ |website=The Cloudflare Blog |publisher=Cloudflare |date=2025-06-18 |access-date=2026-03-22}}</ref> [[InfluxDB]] 3.0 uses DataFusion along with other components of the FDAP stack: Apache Flight, DataFusion, Arrow, and Parquet.<ref name="influx-fdap">{{cite web |title=Flight, DataFusion, Arrow, and Parquet: Using the FDAP Architecture to build InfluxDB 3.0 |url=https://www.influxdata.com/blog/flight-datafusion-arrow-parquet-fdap-architecture-influxdb/ |website=InfluxData |date=2023-10-25 |access-date=2026-03-22}}</ref> Other users inclide EDB Postgres AI,<ref name="siliconangle-edb">{{cite web |title=Enterprise DB begins rolling AI features into PostgreSQL |url=https://siliconangle.com/2024/05/23/enterprise-db-begins-rolling-ai-features-postgresql/ |website=SiliconANGLE |date=2024-05-23 |access-date=2026-03-22}}</ref> Cube,<ref name="cube-pushdown">{{cite web |title=Query pushdown in Cube's semantic layer |url=https://cube.dev/blog/query-push-down-in-cubes-semantic-layer |website=Cube |date=2024-06-03 |access-date=2026-03-22}}</ref> Spice AI,<ref name="spice">{{cite web |title=How we use Apache DataFusion at Spice AI |url=https://spice.ai/blog/how-we-use-apache-datafusion-at-spice-ai |website=Spice AI |date=2026-01-17 |access-date=2026-03-22}}</ref> Pydantic Logfire,<ref name="logfire">{{cite web |title=We're changing database |url=https://github.com/pydantic/logfire/issues/408 |website=GitHub |date=2024-08-29 |access-date=2026-03-22}}</ref> and Kamu.<ref name="kamu">{{cite web |title=100X faster ingestion, and FlightSQL support for connecting BI tools |url=https://www.kamu.dev/blog/2023-09-datafusion-flightsql/ |website=Kamu Data |date=2023-09-26 |access-date=2026-03-22}}</ref>

In 2024, ''CRN'' included Apache DataFusion in its list of "The 10 Coolest Open-Source Software Tools Of 2024".<ref name="crn">{{cite web |title=The 10 Coolest Open-Source Software Tools Of 2024 |url=https://www.crn.com/news/software/2024/the-10-coolest-open-source-software-tools-of-2024?page=3 |website=CRN |date=2024-11-21 |access-date=2026-03-22}}</ref>

I would also probably choose to start with cloudflare workers rather than palantir as I know that usecase better (and they wrote a specific blog which I think is stronger)

gene-bordegaray · 2026-03-23T14:09:05Z

also a side note. I wanted to add the DF logo but my account needs to be verified (I think will be in a day or two) 😅

NNhanptnk · 2026-03-24T16:28:39Z

dev/wiki/apache-datafusion.wikitext

+| website = {{URL|https://datafusion.apache.org/}}
+}}
+
+'''Apache DataFusion''' is an [[open-source software|open-source]], embeddable analytical query engine written in [[Rust (programming language)|Rust]], built on [[Apache Arrow]]'s columnar memory format.<ref name="sigmod-paper">{{cite journal |last1=Lamb |first1=Andrew |last2=Shen |first2=Yijie |last3=Heres |first3=Daniel |last4=Chakraborty |first4=Jayjeet |last5=Kabak |first5=Mehmet Ozan |last6=Hsieh |first6=Liang-Chi |last7=Sun |first7=Chao |title=Apache Arrow DataFusion: A Fast, Embeddable, Modular Analytic Query Engine |journal=Proceedings of the 2024 International Conference on Management of Data |year=2024 |doi=10.1145/3626246.3653368}}</ref><ref name="intro-docs">{{cite web |title=Introduction |url=https://datafusion.apache.org/user-guide/introduction.html |website=Apache DataFusion |publisher=Apache Software Foundation |access-date=2026-03-22}}</ref> It provides [[SQL]] and DataFrame interfaces for analytical query execution and is designed to be used as a library by developers building databases, query engines, and analytical tools, rather than as a standalone database server.<ref name="sigmod-paper" /><ref name="intro-docs" /> The project originated in 2017, was donated to the [[Apache Arrow]] project in 2019, and became a top-level project of the [[Apache Software Foundation]] in 2024.<ref name="donation-post">{{cite web |title=DataFusion: A Rust-native Query Engine for Apache Arrow |url=https://datafusion.apache.org/blog/2019/02/04/datafusion-donation/ |website=Apache DataFusion Blog |publisher=Apache Software Foundation |date=2019-02-04 |access-date=2026-03-22}}</ref><ref name="asf-tlp">{{cite web |title=Apache Software Foundation Announces New Top-Level Project Apache DataFusion |url=https://news.apache.org/foundation/entry/apache-software-foundation-announces-new-top-level-project-apache-datafusion |website=The ASF Blog |publisher=Apache Software Foundation |date=2024-06-11 |access-date=2026-03-22}}</ref>


It provides [[SQL]] and DataFrame interfaces for analytical query execution and is designed to be used as a library by developers building databases, query engines, and analytical tools, rather than as a standalone database server.

I think we can make this a bit better in the sense of introducing DataFusion and its uniqueness. Here's what I think :

Often described as the "LLVM for Databases," [Source 1] Apache DataFusion is a modular, Arrow-native query engine library designed for embedding into custom systems rather than operating as a monolithic standalone server [Source 2 and 3]. This high-performance Rust framework provides a composable foundation, allowing developers to precisely extend query planning and vectorized execution to meet unique architectural requirements. [Source 2 and 3]

Source 1 : https://midas.bu.edu/assets/slides/andrew_lamb_slides.pdf (cc @alamb )

Source 2 and 3 (this is the first two reference) : {{cite journal |last1=Lamb |first1=Andrew |last2=Shen |first2=Yijie |last3=Heres |first3=Daniel |last4=Chakraborty |first4=Jayjeet |last5=Kabak |first5=Mehmet Ozan |last6=Hsieh |first6=Liang-Chi |last7=Sun |first7=Chao |title=Apache Arrow DataFusion: A Fast, Embeddable, Modular Analytic Query Engine |journal=Proceedings of the 2024 International Conference on Management of Data |year=2024 |doi=10.1145/3626246.3653368}}{{cite web |title=Introduction |url=https://datafusion.apache.org/user-guide/introduction.html |website=Apache DataFusion |publisher=Apache Software Foundation |access-date=2026-03-22}}

NNhanptnk · 2026-03-24T16:31:33Z

dev/wiki/apache-datafusion.wikitext

+| website = {{URL|https://datafusion.apache.org/}}
+}}
+
+'''Apache DataFusion''' is an [[open-source software|open-source]], embeddable analytical query engine written in [[Rust (programming language)|Rust]], built on [[Apache Arrow]]'s columnar memory format.<ref name="sigmod-paper">{{cite journal |last1=Lamb |first1=Andrew |last2=Shen |first2=Yijie |last3=Heres |first3=Daniel |last4=Chakraborty |first4=Jayjeet |last5=Kabak |first5=Mehmet Ozan |last6=Hsieh |first6=Liang-Chi |last7=Sun |first7=Chao |title=Apache Arrow DataFusion: A Fast, Embeddable, Modular Analytic Query Engine |journal=Proceedings of the 2024 International Conference on Management of Data |year=2024 |doi=10.1145/3626246.3653368}}</ref><ref name="intro-docs">{{cite web |title=Introduction |url=https://datafusion.apache.org/user-guide/introduction.html |website=Apache DataFusion |publisher=Apache Software Foundation |access-date=2026-03-22}}</ref> It provides [[SQL]] and DataFrame interfaces for analytical query execution and is designed to be used as a library by developers building databases, query engines, and analytical tools, rather than as a standalone database server.<ref name="sigmod-paper" /><ref name="intro-docs" /> The project originated in 2017, was donated to the [[Apache Arrow]] project in 2019, and became a top-level project of the [[Apache Software Foundation]] in 2024.<ref name="donation-post">{{cite web |title=DataFusion: A Rust-native Query Engine for Apache Arrow |url=https://datafusion.apache.org/blog/2019/02/04/datafusion-donation/ |website=Apache DataFusion Blog |publisher=Apache Software Foundation |date=2019-02-04 |access-date=2026-03-22}}</ref><ref name="asf-tlp">{{cite web |title=Apache Software Foundation Announces New Top-Level Project Apache DataFusion |url=https://news.apache.org/foundation/entry/apache-software-foundation-announces-new-top-level-project-apache-datafusion |website=The ASF Blog |publisher=Apache Software Foundation |date=2024-06-11 |access-date=2026-03-22}}</ref>


I believe the project will continue to grow so we can write at the end :

Apache DataFusion now sees over one million monthly downloads. [cite crate.io source]

source : https://crates.io/search?q=datafusion

We could also say "as of March 2026, DataFusion saw one million monthly downloads" if we wanted to ensure the sstatement remained accurate

add first draft of article

eea53d6

github-actions bot added the development-process Related to development process of DataFusion label Mar 23, 2026

alamb reviewed Mar 23, 2026

View reviewed changes

alamb changed the title ~~add first draft of article~~ add first draft of wikipedia article Mar 23, 2026

gene-bordegaray added 3 commits March 23, 2026 10:33

docs: tighten wikipedia draft review follow-ups

b621e2c

add Auron link

fa06abe

shorten history more

2a014b2

NNhanptnk reviewed Mar 24, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

add first draft of wikipedia article#21105

add first draft of wikipedia article#21105
gene-bordegaray wants to merge 4 commits intoapache:mainfrom
gene-bordegaray:issue-21076-wikipedia-draft

gene-bordegaray commented Mar 23, 2026 •

edited

Loading

Uh oh!

alamb left a comment

Uh oh!

alamb Mar 23, 2026

Uh oh!

alamb Mar 23, 2026

Uh oh!

alamb Mar 23, 2026

Uh oh!

alamb Mar 23, 2026

Uh oh!

gene-bordegaray commented Mar 23, 2026

Uh oh!

NNhanptnk Mar 24, 2026

Uh oh!

NNhanptnk Mar 24, 2026

Uh oh!

alamb Mar 25, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants


		=== [[Apache Spark]] ===

		[[Apache Spark]] is a distributed analytics framework for processing data at cluster scale.<ref name="spark-sql">{{cite web \|title=Spark SQL & DataFrames \|url=https://spark.apache.org/sql/ \|website=Apache Spark \|access-date=2026-03-22}}</ref> DataFusion executes queries within a single process and is aimed at building embedded analytics systems rather than distributed workloads.<ref name="sigmod-paper" /> Apache DataFusion Comet, originally developed at [[Apple Inc.\|Apple]] and donated to the Apache Software Foundation, is a native execution plugin that uses DataFusion to accelerate Spark's [[Java virtual machine\|JVM]]-based SQL execution engine.<ref name="comet-donation" />

		DataFusion has been adopted across a range of analytics and database products. [[Palantir Technologies\|Palantir]] Foundry's release notes state that its Lightweight Pipelines are powered by DataFusion for rapid, low-latency data processing.<ref name="palantir-2025">{{cite web \|title=Announcements: July 2025 \|url=https://www.palantir.com/docs/foundry/announcements/2025-07 \|website=Palantir Foundry Documentation \|publisher=Palantir Technologies \|date=2025-07-29 \|access-date=2026-03-22}}</ref><ref name="palantir-2024">{{cite web \|title=Announcements: February 2024 \|url=https://www.palantir.com/docs/foundry/announcements/2024-02 \|website=Palantir Foundry Documentation \|publisher=Palantir Technologies \|date=February 2024 \|access-date=2026-03-22}}</ref> [[Cloudflare]] used DataFusion to execute SQL queries over log data stored in Cloudflare R2 in its Log Explorer product.<ref name="cloudflare">{{cite web \|title=Cloudflare Log Explorer is now GA, providing native observability and forensics \|url=https://blog.cloudflare.com/logexplorer-ga/ \|website=The Cloudflare Blog \|publisher=Cloudflare \|date=2025-06-18 \|access-date=2026-03-22}}</ref> [[InfluxDB]] 3.0 was rebuilt on what InfluxData called the FDAP stack: Apache Flight, DataFusion, Arrow, and Parquet.<ref name="influx-fdap">{{cite web \|title=Flight, DataFusion, Arrow, and Parquet: Using the FDAP Architecture to build InfluxDB 3.0 \|url=https://www.influxdata.com/blog/flight-datafusion-arrow-parquet-fdap-architecture-influxdb/ \|website=InfluxData \|date=2023-10-25 \|access-date=2026-03-22}}</ref> Other adopters described in public references include EDB Postgres AI,<ref name="siliconangle-edb">{{cite web \|title=Enterprise DB begins rolling AI features into PostgreSQL \|url=https://siliconangle.com/2024/05/23/enterprise-db-begins-rolling-ai-features-postgresql/ \|website=SiliconANGLE \|date=2024-05-23 \|access-date=2026-03-22}}</ref> Cube,<ref name="cube-pushdown">{{cite web \|title=Query pushdown in Cube's semantic layer \|url=https://cube.dev/blog/query-push-down-in-cubes-semantic-layer \|website=Cube \|date=2024-06-03 \|access-date=2026-03-22}}</ref> Spice AI,<ref name="spice">{{cite web \|title=How we use Apache DataFusion at Spice AI \|url=https://spice.ai/blog/how-we-use-apache-datafusion-at-spice-ai \|website=Spice AI \|date=2026-01-17 \|access-date=2026-03-22}}</ref> Pydantic Logfire,<ref name="logfire">{{cite web \|title=We're changing database \|url=https://github.com/pydantic/logfire/issues/408 \|website=GitHub \|date=2024-08-29 \|access-date=2026-03-22}}</ref> and Kamu.<ref name="kamu">{{cite web \|title=100X faster ingestion, and FlightSQL support for connecting BI tools \|url=https://www.kamu.dev/blog/2023-09-datafusion-flightsql/ \|website=Kamu Data \|date=2023-09-26 \|access-date=2026-03-22}}</ref>

		In 2024, ''CRN'' included Apache DataFusion in its list of "The 10 Coolest Open-Source Software Tools Of 2024".<ref name="crn">{{cite web \|title=The 10 Coolest Open-Source Software Tools Of 2024 \|url=https://www.crn.com/news/software/2024/the-10-coolest-open-source-software-tools-of-2024?page=3 \|website=CRN \|date=2024-11-21 \|access-date=2026-03-22}}</ref>

Conversation

gene-bordegaray commented Mar 23, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Which issue does this PR close?

Uh oh!

alamb left a comment

Choose a reason for hiding this comment

Uh oh!

alamb Mar 23, 2026

Choose a reason for hiding this comment

Uh oh!

alamb Mar 23, 2026

Choose a reason for hiding this comment

Uh oh!

alamb Mar 23, 2026

Choose a reason for hiding this comment

Uh oh!

alamb Mar 23, 2026

Choose a reason for hiding this comment

Uh oh!

gene-bordegaray commented Mar 23, 2026

Uh oh!

NNhanptnk Mar 24, 2026

Choose a reason for hiding this comment

Uh oh!

NNhanptnk Mar 24, 2026

Choose a reason for hiding this comment

Uh oh!

alamb Mar 25, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

gene-bordegaray commented Mar 23, 2026 •

edited

Loading