...
hidden | true |
---|
Keep the introduction brief. The customer is looking for this integration because they already have the product.
...
Amazon Simple Storage Service (Amazon S3) is an object storage service that offers scalability, data availability, security, and performance. Amazon S3 provides features for data organization and configuration of access controls for your business, organization, and compliance requirements.
Amazon Simple Storage Service (Amazon S3) is an object storage service that offers scalability, data availability, security, and performance. Amazon S3 provides features for data organization and configuration of access controls for your business, organization, and compliance requirements.
...
hidden | true |
---|
Replace the product name in the following sentence:
...
This TD export integration allows you to write job results from Treasure Data directly to
...
Amazon S3.
This TD export integration allows you to write job results from Treasure Data directly to Amazon S3.topic includes:
Table of Contents maxLevel 1
What can you do with this Integration?
...
hidden | true |
---|
The Product Manager needs to add two to three bullets to explain how the customer can benefit from this integration.
Example:
...
...
Opt-out: When an end-user updates the consent choosing not to participate.
...
Upload your own tracking of events, purchases, or custom profile attributes. This information can help you build precise customer segments that enhance the user experience for your campaigns.
- Create buckets: Create and name a bucket that stores data.
- Storing data: Store an infinite amount of data in a bucket.
This topic includes:
Table of Contents maxLevel 3 exclude What can you do with this Integration?
Prerequisites
Excerpt | ||
---|---|---|
| ||
Include a bulleted list of prerequisites the customer needs to successfully implement this export integration. Example:
|
...
Differences between Amazon S3 Export Integration v2 and Amazon S3 Export Integration v1
Review the information in the following table to understand the differences and potential advantages between v2 and v1.
Feature | Amazon S3 v2 | Amazon S3 v1 |
---|---|---|
Server-side Encryption with Customer Master Key (CMK) stored in AWS Key Management Service | X | |
Support for Quote Policy for output data format | X | |
Support Assume Role authentication method | X |
Prerequisites
Basic knowledge of Treasure Data, including the TD Toolbelt.
For AWS: the IAM User :
with s3:PutObject, s3:AbortMultipartUpload permissions permissions.
with kms:Decrypt, kms:GenerateDataKey* permissions when selecting the sse-kms setting.
Requirements and Limitations
...
...
List any product or integration limitations that affect the function of the integration.
Example:
Treasure Data does not create a campaign.
A transactional email action does not count over the rate limit of API calls.
As per the rate limit of dotdigital API (https://developer.dotdigital.com/docs/getting-started-with-the-api#restrictions): one Upload bulk contacts action requires two API calls, which count into the overall rate limit.
The default query result limit for export to S3 is 100GB. you You could config part size setting up to 5000 (MB), the file limit will be about 5TB.
The default export format is CSV RFC 4180.
Output in TSV, JSONL format is also supported.
Static IP Address of Treasure Data
The static IP address of Treasure Data is the access point and source of the linkage for this Integration. To determine the static IP address, contact your Customer Success representative or Technical support.
About S3 Server-Side Encryption
...
Use the Server-Side Encryption bucket policy if you require server-side encryption for all objects that are stored in your bucket. When you have the server-side encryption enabled, you don't have to turn on the SSE option. However, job results might may fail if you have bucket policies to reject HTTP requests without encryption information.
...
When you enable AWS KMS for server-side encryption in Amazon S3
...
:
If you don't input the KMS Key ID, it will create/using use the default KMS key.
if If you input the KMS Key Key ID, you must choose asymmetric symmetric CMK , (not asymmetric CMKs).
The AWS KMS CMK must be in the same Region as the bucket.
About File Formats for S3
Expand | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
For both the CSV, TSV , and JSONL formats, the following table lists the options you can use to customize the final format of the files written into the destination:
|
Use the TD Console to Create a Connection
In Treasure Data, you must create must create and configure the data connection prior to before running your query. As part of the data connection, you you provide authentication to access the integration.
...
Numbered Headings | ||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| ||||||||||||||||||||||||||||
Open TD Console.Navigate to Integrations Hub > Catalog.Search for S3 and selectAmazonS3.Select Create Authentication.Type the credentials to authenticate:
Select Continue.Type a name for your connection.Select Done. |
Define your Query
Numbered Headings |
---|
Select an existing integration authentication. |
Specify the Result Export Target
Numbered Headings | ||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| ||||||||||||||||||||||||
Select Export Results.You can select an existing authentication or create a new authentication for the external service to be used for output. Choose one of the following:Use Existing Integration Create a New Integration (Optional) Specify information for Export to Amazon S3.
|
Create an Activation Using an Integration
Include Page | ||||
---|---|---|---|---|
|
Integration Export Parameters for S3
- Define any additional Export Results details
...
- and content review the integration parameters.
For example, your Export Results screen might be different, or you might not have additional details to fill out
...
- .
...
- Select Done.
- Run your query
- Validate that your data moved to the destination you specified.
...
Parameter | Data Type | Required? | Supported in V1? | Description |
---|---|---|---|---|
Server-side Encryption | String | yes, only sse-s3 | Support values:
| |
Server-side Encryption Algorithm | String | yes | Support value:
| |
KMS Key ID | String | no | Symmetric AWS KMS Key Id, if not input KMS key idID. If there is no input for the KMS Key ID, it will create/using use the default KMS keyKey. | |
Bucket | String | yes | yes | Provide the S3 bucket name (Ex., your_bucket_name). |
Path | String | yes | yes | Specify the s3 filename (object key), and include an extension (Ex. test.csv). |
Format | String | yes | Format of the exported file: csv, tsv, jsonl | |
Compression | String | yes | The compression format of the exported files (Ex., None or gz) | |
Header | Boolean | yes | Include a header in the exported file. | |
Delimiter | String | yes | Use to specify the delimiter character (Ex., (comma)) | |
String for NULL values | String | yes | Placed holder to insert for null values (Ex. Empty String) | |
End-of-line character | String | yes | Specify the EOL(End-Of-Line) representation (Ex. CRLF, LF) | |
Quote Policy | String | no | Use to determine field type to quote. Support values:
Default value: MINIMAL | |
Quote character (Optional) | Char | yes | The character used for quotes in the exported file (Ex. "). Only quote those fields which contain the delimiter, quote, or any of the characters in the lineterminator. If the input is more than 1 character, the default value will be used. | |
Escape character(Optional) | Char | yes | The escape character is used in the exported file. If the input is more than 1 character, the default value will be used. | |
Part Size (MB) (Optional) | Integer | no | The part size in multipart upload upload. Default 10, min 5, max 5000 |
...
Code Block | ||||
---|---|---|---|---|
| ||||
SELECT * FROM www_access |
(Optional) Schedule
...
Query Export Jobs
You can use Scheduled Jobs with Result Export to periodically write the output result to a target destination that you specify.
...
Within Treasure Workflow, you can specify the use of this data connector to export data.
Learn more at Using Workflows to Export Exporting Data with the TD ToolbeltParameters.
S3 (v2) Configuration Keys
...
Code Block |
---|
_export: td: database: td.database +s3v2_test_export_task: td>: export_s3v2_test.sql database: ${td.database} result_connection: s3v2_conn result_settings: bucket: my-bucket path: /path/to/target.csv sse_type: sse-s3 format: csv compression: gz header: false delimiter: default null_value: empty newline: LF quote_policy: MINIMAL escape: '"' quote: '"' part_size: 20 |
(Optional)
...
Export
...
Integration Using the CLI
To output the result of a single query to an S3 buck add the --result option to the td query command. After the job is finished, the results are written into your s3.
You can specify detailed settings to export your S3 via the --result parameter.
...
Creating authentication with Assume Role is only supported through the console. Attempting to create it through the TD CLI will result in an error.
Example CLI Command for S3 (v2)
Code Block | ||||
---|---|---|---|---|
| ||||
td query \ --result '{"type":"s3_v2","auth_method":"basic","region":"us-east-2","access_key_id": "************","secret_access_key":"***************","bucket":"bucket_name","path":"path/to/file.csv","format":"csv","compression":"none","header":true,"delimiter":"default","null_value":"default","newline":"CRLF","quote_policy":"NONE","part_size":10}' \ -w -d testdb \ "SELECT 1 as col" -T presto |
...