Example data

The Deephaven Express package includes several built-in, deterministically generated ticking datasets for testing and experimentation. These datasets initialize with a default number of rows and are designed to demonstrate a range of plot types and pivot table use cases.

`iris` dataset

from deephaven.plot import express as dx

iris = dx.data.iris()

This function generates a deterministically random dataset inspired by the classic 1936 Iris flower dataset commonly used for classification tasks, with an additional "ticking" feature. The ticking feature represents a continuously increasing simulated timestamp.

Returns: Table A Deephaven Table

Parameters	Type	Default	Description
ticking	bool	True	If true, the table will tick new data every second.

`marketing` dataset

from deephaven.plot import express as dx

marketing = dx.data.marketing()

This dataset is intended to be used with the dx.funnel and dx.funnel_area plot types. Each row in this dataset represents an individual that has visited a company website. The individual may download an instance of the product, be considered a potential customer, formally request the price of the product, or purchase the product and receive an invoice. Each of these categories is a strict subset of the last, so it lends itself well to funnel plots.

Returns: Table A Deephaven Table

Parameters	Type	Default	Description
ticking	bool	True	If true, the table will tick new data every second.

`jobs` dataset

from deephaven.plot import express as dx

jobs = dx.data.jobs()

This dataset is intended to be used with a timeline plot. It demonstrates five different "jobs", each starting two days after the previous, and each lasting 5 days in total. The job's "resource", or the name of the individual assigned to the job, is randomly selected. The dataset continues to loop in this way, moving across time until it is deleted or the server is shut down.

Returns: Table A Deephaven Table

Parameters	Type	Default	Description
ticking	bool	True	If true, the table will tick new data every second.

`stocks` dataset

from deephaven.plot import express as dx

stocks = dx.data.stocks()

Randomly generated (but deterministic) fictional stock market data. Starts with the first 5 minutes of data already initialized so example plots won't start empty.

Returns: Table A Deephaven Table

Parameters	Type	Default	Description
ticking	bool	True	If true, the table will tick using a replayer, if false the whole table will be returned as a static table.
starting_time	str	'2018-06-01T08:00:00 ET'	The starting time for the data generation, defaults to 2018-06-01T08:00:00 ET

`tips` dataset

from deephaven.plot import express as dx

tips = dx.data.tips()

Homewood, IL: Richard D. Irwin Publishing.

Returns: Table A Deephaven Table

Parameters	Type	Default	Description
ticking	bool	True	If true, a ticking table containing the entire Tips dataset will be returned, and new rows of synthetic data will tick in every second. If false, the Tips dataset will be returned as a static table.

`election` dataset

from deephaven.plot import express as dx

election = dx.data.election()

When this function is called, it will return a table containing the first 19 rows of the dataset. Then, a new row will tick in each second, until all 58 rows are included in the table. The table will then reset to the first 19 rows, and continue ticking in this manner until it is deleted or otherwise cleaned up.

Returns: Table A Deephaven Table

Parameters	Type	Default	Description
ticking	bool	True	If true, the table will tick new data every second.

`wind` dataset

from deephaven.plot import express as dx

wind = dx.data.wind()

When this function is called, it will return a table containing the first 42 rows of the dataset. Then, a new row will tick in each second, until all 128 rows are included in the table. The table will then reset to the first 42 rows, and continue ticking in this manner until it is deleted or otherwise cleaned up.

Returns: Table A Deephaven Table

Parameters	Type	Default	Description
ticking	bool	True	If true, the table will tick new data every second.

`gapminder` dataset

from deephaven.plot import express as dx

gapminder = dx.data.gapminder()

The original Gapminder dataset from the plotly-express package has a single measurement per country once every five years, starting in 1952 and ending in 2007. This resolution is not ideal for ticking data. So, this ticking version creates new data points for every country at every month between measurements. For example, between two real observations in 1952 and 1957, there are 12 * 5 - 1 synthetic observations for population, life expectancy, and GDP. The synthetic data are simply computed by linear interpolation of the two nearest real observations. Finally, the dataset ticks in one new month every second, and every country in the dataset gets updated each time, so a total of 142 rows tick in per second. The dataset starts with years up to 1961, ticks in each month till 2007, and then repeats until the table is cleaned up or deleted.

Returns: Table A Deephaven Table

Parameters	Type	Default	Description
ticking	bool	True	If true, the table will tick new data every second.

`fish_market` dataset

from deephaven.plot import express as dx

fish_market = dx.data.fish_market()

Returns a fish market sales dataset designed for pivot table examples. Ticks every second, is random but deterministic, and contains lots of categorical data for pivoting.

Returns: Table A Deephaven Table suitable for pivot table demonstrations.

Parameters	Type	Default	Description
ticking	bool	True	When true, one new transaction will tick in every second. When false, returns 1000 rows.