Skip to content
Last updated

Querying Data

Treasure Data allows users to issues queries from the following:

  • TD Console
  • API
  • JDBC/ODBC
  • Scheduled queries
  • Hosted workflow execution framework

Every query is managed as separate jobs, and for each job you can specify the one of these data processing engines:

  • Presto for ad hoc and shorter batch workloads. Presto provides low-latency SQL access to the data set.

  • Hive for large or complex batch workloads. Hive is a MapReduce-based SQL engine. This engine is very powerful and can support large data processing and heavy JOINs. This is often used for "extract, transform, and load" (ETL) or sessionization.

For additional detail about the differences between the query engines see Choosing an Analytics Engine.

The following topics are covered in this section:

Reference information can be found here: