
Scala Apps Import Integration

td-agent was discontinued in December 2023 and has been replaced by fluent-package, its official successor, maintained under the CNCF-hosted Fluentd project.

Treasure Data provides Fluentd to collect server-side logs and events and to import the data from Scala applications.

Prerequisites

  • Basic knowledge of Scala.
  • Basic knowledge of Treasure Data.
  • A JVM, Scala, and sbt (v1.x or later).

Installing Fluentd

Install Fluentd (fluent-package) on your application servers. Fluentd sits within your application servers, focusing on uploading application logs to the cloud.

The td-logger-java library enables Scala applications to post records to their local Fluentd. Fluentd, in turn, uploads the data to the cloud every 5 minutes. Because the daemon runs on a local node, the logging latency is negligible.

Fluentd (fluent-package) Install Options

For migration guidance from td-agent, see the Fluentd Installation Guide.

To install fluent-package, run one of the following commands based on your environment.

RHEL/CentOS/Rocky Linux

# fluent-package 6 LTS (recommended)
curl -fsSL https://fluentd.cdn.cncf.io/sh/install-redhat-fluent-package6-lts.sh | sh

Ubuntu

# Ubuntu 24.04 Noble - fluent-package 6 LTS
curl -fsSL https://fluentd.cdn.cncf.io/sh/install-ubuntu-noble-fluent-package6-lts.sh | sh

# Ubuntu 22.04 Jammy - fluent-package 6 LTS
curl -fsSL https://fluentd.cdn.cncf.io/sh/install-ubuntu-jammy-fluent-package6-lts.sh | sh

Debian

# Debian Bookworm - fluent-package 6 LTS
curl -fsSL https://fluentd.cdn.cncf.io/sh/install-debian-bookworm-fluent-package6-lts.sh | sh

Amazon Linux

# Amazon Linux 2023 - fluent-package 6 LTS
curl -fsSL https://fluentd.cdn.cncf.io/sh/install-amazon2023-fluent-package6-lts.sh | sh

Windows

Download the MSI installer from the Fluentd download page.

After installation:

  1. Edit the configuration file at C:\opt\fluent\etc\fluent\fluentd.conf.
  2. Start the service with net start fluentdwinsvc or via the Services administrative tool.

macOS

fluent-package for macOS is planned to be available via Homebrew. For current installation options, see Fluentd Installation Guide.

Starting the Service

After installation, start and verify the Fluentd service.

Linux

sudo systemctl start fluentd.service
sudo systemctl status fluentd.service

The configuration file is located at /etc/fluent/fluentd.conf.

Windows

net start fluentdwinsvc

The configuration file is located at C:\opt\fluent\etc\fluent\fluentd.conf.

macOS (gem installation)

fluentd -c /path/to/fluentd.conf

For more details, see the Fluentd Documentation.

Modifying fluentd.conf

Next, specify your API key by setting the apikey option in your /etc/fluent/fluentd.conf file (for fluent-package). You can retrieve your API key from your profile in TD Console.

# Treasure Data Input and Output
<source>
  @type forward
  port 24224
</source>

<match td.*.*>
  @type tdlog
  endpoint api.treasuredata.com
  apikey YOUR_API_KEY
  auto_create_table
  use_ssl true
  <buffer>
    @type file
    path /var/log/fluent/buffer/td
  </buffer>
</match>

Replace YOUR_API_KEY with your actual API key string. Using a write-only API key is recommended.

Once these lines are in place, restart the Fluentd service.

# Linux
sudo systemctl restart fluentd.service

# macOS (gem installation)
# Restart the Fluentd process manually

Fluentd now accepts data via port 24224, buffers the data (/var/log/fluent/buffer/td), and automatically uploads the data into the cloud.

Using td-logger-java

First, add the following lines to build.sbt. The logger’s revision information can be found in CHANGES.txt.

If you need an all-in-one jar file, one is available at https://repo1.maven.org/maven2/com/treasuredata/.

/* in build.sbt */
// Replace ${logger.version} with a concrete release from CHANGES.txt
libraryDependencies ++= Seq(
  "com.treasuredata" % "td-logger" % "${logger.version}"
)

Next, create a treasure-data.properties file with the following settings:

td.logger.agentmode=true
td.logger.agent.host=localhost
td.logger.agent.port=24224
td.logger.agent.tag=td
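The td.logger.agent.tag value becomes the first component of the Fluentd tag, and, judging from the `<match td.*.*>` pattern shown earlier, the database and table names fill the remaining two components. A small sketch of this assumed tag assembly (the object name is ours):

```scala
object TagDemo {
  // td.logger.agent.tag supplies the prefix; the database comes from
  // getLogger(...) and the table from log(...) (assumed mapping).
  val prefix   = "td"
  val database = "test_db"
  val table    = "follow"
  val tag      = s"$prefix.$database.$table"

  def main(args: Array[String]): Unit =
    println(tag) // td.test_db.follow, which matches <match td.*.*>
}
```

Because the resulting tag matches the `td.*.*` pattern, records posted through the logger are routed to the tdlog output and uploaded to the corresponding database and table.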

Finally, insert the following lines into your application to initialize the logger and post records. See the td-logger-java API documentation for more details.

import com.treasure_data.logger.TreasureDataLogger
import scala.collection.JavaConverters._

object Main {
  def main(args: Array[String]): Unit = {
    // Load the logger settings from the classpath into the system properties
    val props = System.getProperties
    props.load(getClass.getResourceAsStream("/treasure-data.properties"))

    // One logger per database; the table name is passed to log()
    val logger = TreasureDataLogger.getLogger("test_db")

    val record = Map("from" -> "userA", "to" -> "userB")
    logger.log("follow", record.asJava.asInstanceOf[java.util.Map[String, AnyRef]])
  }
}
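The asInstanceOf cast is needed because the logger's log method takes a java.util.Map[String, Object], while asJava preserves the Scala value types. A self-contained sketch of that conversion, with no Treasure Data dependency (the helper name is ours):

```scala
import scala.collection.JavaConverters._

object RecordConversion {
  // Convert a Scala Map with arbitrary values into the
  // java.util.Map[String, Object] shape the logger expects.
  // asInstanceOf[AnyRef] boxes primitives such as Int.
  def toJavaRecord(record: Map[String, Any]): java.util.Map[String, AnyRef] =
    record.map { case (k, v) => k -> v.asInstanceOf[AnyRef] }.asJava

  def main(args: Array[String]): Unit = {
    val rec = toJavaRecord(Map("from" -> "userA", "count" -> 3))
    println(rec.get("from")) // userA
    println(rec.size)        // 2
  }
}
```

Wrapping the conversion in a helper like this keeps the cast in one place instead of repeating it at every log call.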

This example expects the following structure.

  • project_dir/build.sbt
  • project_dir/src/main/scala/Main.scala
  • project_dir/src/main/resources/treasure-data.properties

Confirming Data Import

First, execute the preceding program.

$ sbt compile run

Sending a SIGUSR1 signal flushes Fluentd's buffer. Upload starts immediately.

# Linux
$ kill -USR1 $(cat /var/run/fluent/fluentd.pid)

# macOS (gem installation)
# Send SIGUSR1 to the Fluentd process

To confirm the data upload, use td tables.

$ td tables
+------------+------------+------+-----------+
| Database   | Table      | Type | Count     |
+------------+------------+------+-----------+
| test_db    | follow     | log  | 1         |
+------------+------------+------+-----------+

Production Deployments

High-Availability Configurations of Fluentd

For high-traffic websites (more than 5 application nodes), use a high availability configuration of Fluentd to improve data transfer reliability and query performance.

Monitoring Fluentd

Monitoring Fluentd itself is also important. Refer to the Fluentd monitoring documentation for general monitoring methods.

Fluentd is fully open-sourced under the Fluentd project.

Next Steps

We offer a schema mechanism that is more flexible than that of traditional RDBMSs. For queries, we leverage the Hive and Trino Query Languages.