You can use Treasure Data with Pentaho Business Analytics using our JDBC driver. By combining Treasure Data with Pentaho, you can process huge volumes of data in the cloud, using existing Pentaho instances.
Don't have time to set up Pentaho and Treasure Data? Use our Setup Consultation Service.
Continue to the following topics:
Download Pentaho Report Designer
Download Pentaho Report Designer. Version 3.9.0-GA was tested for this article.
Download our JDBC Driver
This driver works only with Treasure Data. It does not work with other environments such as your local Hadoop/Hive cluster.
Download td-jdbc-jar-with-dependencies-(version).jar from the following link.
If you want to know more about Treasure Data JDBC driver, seehttps://github.com/treasure-data/td-jdbc.
Add Treasure Data as Data Source
Add JDBC Driver to Pentaho
Let’s copy the Treasure Data JDBC driver to Pentaho’s JDBC directory. The command for the MacOS environment is as follows:
Create a New Data Source
Navigate to Data -> Add Data Source -> Advanced -> JDBC (Custom). Create a new data source, select the GREEN plus button.
Configure JDBC Connection
Select Generic database -> Native (JDBC) and configure the JDBC connection as follows. Be sure to edit your credentials (your email + password).
API endpoint is up to your account site.
If your account site is in Tokyo, then the Custom Connection URL is "jdbc:td://api.treasuredata.co.jp/(database name)".
For details, see Sites and Endpoints.
You can create reports using generic SQL queries. Open the window from Data > Add Datasource > JDBC to add Treasure Data as the data source for the report. When you select Preview, the job starts running on the cloud (you can confirm this using the td jobs command). You can now design your reports using Treasure Data as a data source.