Simba Spark ODBC Driver Simba Technologies Inc.
winget install --id=Databricks.SparkODBCDriver -e
The Simba Apache Spark ODBC Connector is used for direct SQL and HiveQL access to Apache Hadoop / Spark distributions, enabling Business Intelligence (BI), analytics, and reporting on Hadoop-based data. The connector efficiently transforms an application’s SQL query into the equivalent form in HiveQL, which is a subset of SQL-92. If an application is Spark-aware, then the connector is configurable to pass the query through to the database for processing. The connector interrogates Spark to obtain schema information to present to a SQL-based application. Queries, including joins, are translated from SQL to HiveQL
Simba Apache Spark ODBC Connector is a tool designed to enable direct SQL and HiveQL access to Apache Hadoop/Spark distributions, facilitating Business Intelligence (BI), analytics, and reporting on data stored in Hadoop-based systems.
Key Features:
- Enables efficient transformation of application-level SQL queries into HiveQL for compatibility with Hadoop/Spark environments.
- Configurable to pass Spark-aware queries directly to the database for processing, optimizing performance.
- Interrogates Spark to retrieve schema information, ensuring accurate representation of data structures to SQL-based applications.
- Supports a wide range of BI tools and reporting platforms by translating complex joins into HiveQL.
Audience & Benefit:
Ideal for data analysts, Business Intelligence professionals, and data engineers working with Hadoop/Spark ecosystems. This connector simplifies access to distributed data, enabling seamless analytics and reporting while maintaining compatibility with existing SQL-based workflows. It empowers users to derive actionable insights from large-scale datasets efficiently.
The Simba Apache Spark ODBC Connector can be installed via winget, ensuring straightforward setup and integration into your workflow.