Treasure Data allows users to issues queries from the following:
- TD Console
- API
- JDBC/ODBC
- Scheduled queries
- Hosted workflow execution framework
Every query is managed as separate jobs, and for each job you can specify the one of these data processing engines:
Presto for ad hoc and shorter batch workloads. Presto provides low-latency SQL access to the data set.
Hive for large or complex batch workloads. Hive is a MapReduce-based SQL engine. This engine is very powerful and can support large data processing and heavy JOINs. This is often used for "extract, transform, and load" (ETL) or sessionization.
For additional detail about the differences between the query engines see Choosing an Analytics Engine.
The following topics are covered in this section:
- About Queries
- Specifying Your Query Engine for a Specific Query
- Creating or Editing a Query in the TD Console
- Defining a Query in the TD Console
- Scheduling a Query in the TD Console
- Running Queries in the TD Console
Reference information can be found here:
Additional Hive documentation and Presto documentation can be found on the Treasure Data Developer Portal.