Skip to content
Last updated

Specifying Your Query Engine

You can specify the query engine for Treasure Data. This setting specifies the query engine to be selected for queries in the TD Console. Your choices are:

  • Presto for ad hoc and shorter batch workloads. Presto provides low-latency SQL access to the data set.

  • Hive for large or complex batch workloads. Hive is a MapReduce-based SQL engine. This engine is really powerful when you do large data processing and heavy JOINs. Often used for ETL or sessionization.

About Hive Version Features

Hive 0.13Hive 2019.1 (Hive 2)Hive 2020.1 (Hive 2)Hive 2023.1 (Hive 4)
  • It runs on the MapReduce processing framework.
  • It contains features and fixes back ported from more recent versions of Hive.
  • It uses HiveQL, a non-ANSI compliant SQL like language.
  • It supports many UDFs unique to TD.
  • It is based on Hive 2.3.2 and later
  • It is fully ANSI-compliant SQL
  • It runs on the MapReduce processing framework
  • It supports the same UDFs used for TD in Hive 0.13

This version is being discontinued in favor of 2020.1.
  • It is based on Hive 2.3.2 and later
  • It is fully ANSI-compliant SQL
  • It runs on the Tez processing framework
  • It supports the same UDFs used for TD in Hive 0.13
  • Enhanced performance
  • New Hive capabilities