Exploratory Data Analysis

This notebook runs Exploratory Data Analysis (EDA) on the table specified by the input_table parameter.

Supported analytics methods:

Some example visualizations from the EDA Notebook are shown below:

A sample workflow is available in Treasure Boxes.

+run_eda:
  ipynb>:
    notebook: EDA
    input_table: ml_datasets.bank_marketing
    eda: all
    sampling_threshold: 1000000

Parameter name	Parameter on Console	Description	Default Value
docker.task_mem	Docker Task Mem	Task memory size. Available values are 64g, 128g (default), 256g, 384g, or 512g, depending on your contracted tier	128g
input_table	Input Table	TD table to use for EDA, specified as dbname.table_name	-
target_column	Target Column	Name of the column used as the label	None
ignore_columns	Ignore Columns	Columns to exclude from EDA	time
sampling_threshold	Sampling Threshold	Threshold used for sampling. See the executed notebook for details	10_000_000
eda	Eda	Types of EDA to run: all, or a comma-separated list of types	all
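
As a rough sketch of how the parameters in the table might be combined in a workflow task, the variant below extends the sample task with target_column and ignore_columns. The label column name y is a hypothetical placeholder, not something this page specifies; substitute the label column of your own table. All other values come from the sample workflow or the defaults listed in the table.

```yaml
+run_eda:
  ipynb>:
    notebook: EDA
    input_table: ml_datasets.bank_marketing
    # Hypothetical label column name; replace with the label column of your table.
    target_column: y
    # Same as the default: exclude the time column from the analysis.
    ignore_columns: time
    # Run every supported type of EDA (the default).
    eda: all
    # Lower than the 10_000_000 default.
    sampling_threshold: 1000000
```

Parameters left out of the task fall back to the default values listed in the table, so only input_table has to be set explicitly.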
