Treasure Data allows users to issues queries from API, JDBC/ODBC, the TD Console, via scheduled queries, or our hosted workflow execution framework.

All of these issued queries are managed as separate jobs. Treasure Data provides two major ways of processing data for data collected from both batch and streaming sources. For every query you issue, you can specify one of the following data processing engines:

  • Presto for ad hoc and shorter batch workloads. Presto provides low-latency SQL access to the data set.

  • Hive for large or complex batch workloads. Hive is a MapReduce-based SQL engine. This engine is really powerful when you do large data processing and heavy JOINs. Often used for ETL or sessionization.

This topic contains:

  • No labels