Deephaven TableHandles

A TableHandle holds a reference to a Deephaven Table on the server, and provides methods for operating on that table. Note that TableHandles should not be instantiated directly by user code, but rather by server calls accessible from the Client class. See ?Client for more information.

Naming tables on the server

When a TableHandle is created, it is not automatically bound to a variable name on the server. This means that the TableHandle that gets created is the only reference to the table that's been created. Importantly, the variable name given to the TableHandle is purely a local variable, and has no relationship to that table's name on the server. For this reason, code like the following:

client <- Client$new(...)
df1 <- data.frame(x = 1:10, y = 11:20)
t1 <- client$import_table(df1)
client$run_script("t2 = t1.update('z = x + y')")

will not run, because the table referenced by the local variable t1 is not named on the server at all. To make the table referenced by t1 accessible by name on the server (e.g., from within query strings), you must bind it to a variable with the method bind_to_variable(). We adopt the convention of calling local TableHandles th1, th2, etc., and server-side tables t1, t2, etc., to help distinguish between the two. So, the above code should be written as:

client <- Client$new(...)
df1 <- data.frame(x = 1:10, y = 11:20)
th1 <- client$import_table(df1)
th1$bind_to_variable("t1")
client$run_script("t2 = t1.update('z = x + y')")

You can then create a local TableHandle to reference t2 as follows:

th2 <- client$open_table("t2")

The above code is not best practice; calling update() directly on t1 would be preferred to running a script. It is, however, more illustrative of the relationship between local TableHandles and server-side tables. The best way to accomplish the above would be the following:

client <- Client$new(...)
df1 <- data.frame(x = 1:10, y = 11:20)
th1 <- client$import_table(df1)
th2 <- th1$update("z = x + y")

# this is necessary to access the tables from within query strings
th1$bind_to_variable("t1")
th2$bind_to_variable("t2")

Methods

Public methods

TableHandle$new()
TableHandle$is_static()
TableHandle$bind_to_variable()
TableHandle$head()
TableHandle$tail()
TableHandle$nrow()
TableHandle$ncol()
TableHandle$dim()
TableHandle$merge()
TableHandle$as_record_batch_reader()
TableHandle$as_arrow_table()
TableHandle$as_tibble()
TableHandle$as_data_frame()
TableHandle$select()
TableHandle$view()
TableHandle$update()
TableHandle$update_view()
TableHandle$drop_columns()
TableHandle$where()
TableHandle$group_by()
TableHandle$ungroup()
TableHandle$update_by()
TableHandle$agg_by()
TableHandle$agg_all_by()
TableHandle$first_by()
TableHandle$last_by()
TableHandle$head_by()
TableHandle$tail_by()
TableHandle$min_by()
TableHandle$max_by()
TableHandle$sum_by()
TableHandle$abs_sum_by()
TableHandle$avg_by()
TableHandle$w_avg_by()
TableHandle$median_by()
TableHandle$var_by()
TableHandle$std_by()
TableHandle$percentile_by()
TableHandle$count_by()
TableHandle$join()
TableHandle$natural_join()
TableHandle$exact_join()
TableHandle$sort()

Method `new()`

Initializes a new TableHandle from an internal Deephaven TableHandle.

Usage

TableHandle$new(table_handle)

Arguments

table_handle: Internal Deephaven TableHandle.

Method `is_static()`

Determines whether the table referenced by this TableHandle is static or not.

Usage

TableHandle$is_static()

Returns

TRUE if the table is static, or FALSE if the table is ticking.

Method `bind_to_variable()`

Binds the table referenced by this TableHandle to a variable on the server, so that it can be referenced by that name.

Usage

TableHandle$bind_to_variable(name)

Arguments

name: Name for this table on the server.

Method `head()`

Creates a new table containing the first n rows of this table.

Usage

TableHandle$head(n)

Arguments

n: Positive integer specifying the number of rows to return.

Returns

A TableHandle referencing the new table.

Method `tail()`

Creates a new table containing the last n rows of this table.

Usage

TableHandle$tail(n)

Arguments

n: Positive integer specifying the number of rows to return.

Returns

A TableHandle referencing the new table consisting of the last n rows of the parent table.

Method `nrow()`

Gets the number of rows in the table referenced by this TableHandle.

Usage

TableHandle$nrow()

Returns

The number of rows in the table.

Method `ncol()`

Gets the number of columns in the table referenced by this TableHandle.

Usage

TableHandle$ncol()

Returns

The number of columns in the table.

Method `dim()`

Gets the dimensions of the table referenced by this TableHandle. Equivalent to c(nrow, ncol).

Usage

TableHandle$dim()

Returns

A vector of length 2, where the first element is the number of rows in the table and the second element is the number of columns in the table.

Method `merge()`

Merges several tables into one table on the server. All tables must have the same schema as this table, and can be supplied as a list of TableHandles, any number of TableHandles, or a mix of both.

Usage

TableHandle$merge(...)

Arguments

...: Arbitrary number of TableHandles or vectors of TableHandles with a schema matching this table.

Returns

A TableHandle referencing the new table.

Method `as_record_batch_reader()`

Converts the table referenced by this TableHandle to an Arrow RecordBatchStreamReader.

Usage

TableHandle$as_record_batch_reader()

Returns

An Arrow RecordBatchStreamReader constructed from the data of this TableHandle.

Method `as_arrow_table()`

Converts the table referenced by this TableHandle to an Arrow Table.

Usage

TableHandle$as_arrow_table()

Returns

An Arrow Table constructed from the data of this TableHandle.

Method `as_tibble()`

Converts the table referenced by this TableHandle to a dplyr tibble.

Usage

TableHandle$as_tibble()

Returns

A dplyr tibble constructed from the data of this TableHandle.

Method `as_data_frame()`

Converts the table referenced by this TableHandle to an R data frame.

Usage

TableHandle$as_data_frame()

Returns

An R data frame constructed from the data of this TableHandle.

Method `select()`

Creates a new in-memory table that includes one column for each formula. If no formula is specified, all columns will be included.

Usage

TableHandle$select(formulas = character())

Arguments

formulas: String or list of strings denoting the column formulas.

Returns

A TableHandle referencing the new table.

Method `view()`

Creates a new formula table that includes one column for each formula.

Usage

TableHandle$view(formulas = character())

Arguments

formulas: String or list of strings denoting the column formulas.

Returns

A TableHandle referencing the new table.

Method `update()`

Creates a new table containing a new, in-memory column for each formula.

Usage

TableHandle$update(formulas = character())

Arguments

formulas: String or list of strings denoting the column formulas.

Returns

A TableHandle referencing the new table.

Method `update_view()`

Creates a new table containing a new formula column for each formula.

Usage

TableHandle$update_view(formulas = character())

Arguments

formulas: String or list of strings denoting the column formulas.

Returns

A TableHandle referencing the new table.

Method `drop_columns()`

Creates a new table that has the same number of rows as this table, but omits the columns specified in cols.

Usage

TableHandle$drop_columns(cols = character())

Arguments

cols: String or list of strings denoting the names of the columns to drop.

Returns

A TableHandle referencing the new table.

Method `where()`

Creates a new table containing only the rows meeting the filter condition.

Usage

TableHandle$where(filter)

Arguments

filter: String denoting the filter condition.

Returns

A TableHandle referencing the new table.

Method `group_by()`

Creates a new table containing grouping columns and grouped data, with column content is grouped into arrays. If no group-by column is given, the content of each column is grouped into its own array.

Usage

TableHandle$group_by(by = character())

Arguments

by: String or list of strings denoting the names of the columns to group by.

Returns

A TableHandle referencing the new table.

Method `ungroup()`

Creates a new table in which array columns from the source table are unwrapped into separate rows. The ungroup columns should be of array types.

Usage

TableHandle$ungroup(by = character())

Arguments

by: String or list of strings denoting the names of the columns to ungroup.

Returns

A TableHandle referencing the new table.

Method `update_by()`

Creates a table with additional columns calculated from window-based aggregations of columns in this table. The aggregations are defined by the provided operations, which support incremental aggregations over the corresponding rows in the table. The aggregations will apply position or time-based windowing and compute the results over the entire table or each row group as identified by the provided key columns. See more detailed documentation here or run ?UpdateBy.

Usage

TableHandle$update_by(ops, by = character())

Arguments

ops: UpdateByOp or list of UpdateByOps to perform on non-grouping columns.
by: String or list of strings denoting the names of the columns to group by.

Returns

A TableHandle referencing the new table.

Method `agg_by()`

Creates a new table containing grouping columns and grouped data. The resulting grouped data is defined by the aggregation(s) specified. See more detailed documentation here or run ?AggBy.

Usage

TableHandle$agg_by(aggs, by = character())

Arguments

aggs: AggOp or list of AggOps to perform on non-grouping columns.
by: String or list of strings denoting the names of the columns to group by.

Returns

A TableHandle referencing the new table.

Method `agg_all_by()`

Creates a new table containing grouping columns and grouped data. The resulting grouped data is defined by the aggregation(s) specified. See more detailed documentation here or run ?AggBy. This method applies the aggregation to all non-grouping columns of the table, so it can only accept one aggregation at a time.

Usage

TableHandle$agg_all_by(agg, by = character())

Arguments

agg: AggOp to perform on non-grouping columns.
by: String or list of strings denoting the names of the columns to group by.

Returns

A TableHandle referencing the new table.

Method `first_by()`

Creates a new table containing the first row of each group.

Usage

TableHandle$first_by(by = character())

Arguments

by: String or list of strings denoting the names of the columns to group by.

Returns

A TableHandle referencing the new table.

Method `last_by()`

Creates a new table containing the last row of each group.

Usage

TableHandle$last_by(by = character())

Arguments

by: String or list of strings denoting the names of the columns to group by.

Returns

A TableHandle referencing the new table.

Method `head_by()`

Creates a new table containing the first num_rows rows of each group.

Usage

TableHandle$head_by(num_rows, by = character())

Arguments

num_rows: Positive integer specifying the number of rows to return.
by: String or list of strings denoting the names of the columns to group by.

Returns

A TableHandle referencing the new table.

Method `tail_by()`

Creates a new table containing the last num_rows rows of each group.

Usage

TableHandle$tail_by(num_rows, by = character())

Arguments

num_rows: Positive integer specifying the number of rows to return.
by: String or list of strings denoting the names of the columns to group by.

Returns

A TableHandle referencing the new table.

Method `min_by()`

Creates a new table containing the column-wise minimum of each group.

Usage

TableHandle$min_by(by = character())

Arguments

by: String or list of strings denoting the names of the columns to group by.

Returns

A TableHandle referencing the new table.

Method `max_by()`

Creates a new table containing the column-wise maximum of each group.

Usage

TableHandle$max_by(by = character())

Arguments

by: String or list of strings denoting the names of the columns to group by.

Returns

A TableHandle referencing the new table.

Method `sum_by()`

Creates a new table containing the column-wise sum of each group.

Usage

TableHandle$sum_by(by = character())

Arguments

by: String or list of strings denoting the names of the columns to group by.

Returns

A TableHandle referencing the new table.

Method `abs_sum_by()`

Creates a new table containing the column-wise absolute sum of each group.

Usage

TableHandle$abs_sum_by(by = character())

Arguments

by: String or list of strings denoting the names of the columns to group by.

Returns

A TableHandle referencing the new table.

Method `avg_by()`

Creates a new table containing the column-wise average of each group.

Usage

TableHandle$avg_by(by = character())

Arguments

by: String or list of strings denoting the names of the columns to group by.

Returns

A TableHandle referencing the new table.

Method `w_avg_by()`

Creates a new table containing the column-wise weighted average of each group.

Usage

TableHandle$w_avg_by(wcol, by = character())

Arguments

wcol: String denoting the name of the column to use as weights.
by: String or list of strings denoting the names of the columns to group by.

Returns

A TableHandle referencing the new table.

Method `median_by()`

Creates a new table containing the column-wise median of each group.

Usage

TableHandle$median_by(by = character())

Arguments

by: String or list of strings denoting the names of the columns to group by.

Returns

A TableHandle referencing the new table.

Method `var_by()`

Creates a new table containing the column-wise variance of each group.

Usage

TableHandle$var_by(by = character())

Arguments

by: String or list of strings denoting the names of the columns to group by.

Returns

A TableHandle referencing the new table.

Method `std_by()`

Creates a new table containing the column-wise standard deviation of each group.

Usage

TableHandle$std_by(by = character())

Arguments

by: String or list of strings denoting the names of the columns to group by.

Returns

A TableHandle referencing the new table.

Method `percentile_by()`

Creates a new table containing the column-wise percentile of each group.

Usage

TableHandle$percentile_by(percentile, by = character())

Arguments

percentile: Numeric scalar between 0 and 1 denoting the percentile to compute.
by: String or list of strings denoting the names of the columns to group by.

Returns

A TableHandle referencing the new table.

Method `count_by()`

Creates a new table containing the number of rows in each group.

Usage

TableHandle$count_by(col, by = character())

Arguments

col: String denoting the name of the new column to hold the counts of each group.
by: String or list of strings denoting the names of the columns to group by.

Returns

A TableHandle referencing the new table.

Method `join()`

Creates a new table containing rows that have matching values in both tables. Rows that do not have matching criteria will not be included in the result. If there are multiple matches between a row from the left table and rows from the right table, all matching combinations will be included. If no columns to match (on) are specified, every combination of left and right table rows is included.

Usage

TableHandle$join(table, on = character(), joins = character())

Arguments

table: TableHandle referencing the table to join with.
on: String or list of strings denoting the names of the columns to join on.
joins: String or list of strings denoting the names of the columns to add from table.

Returns

A TableHandle referencing the new table.

Method `natural_join()`

Creates a new table containing all the rows and columns of this table, plus additional columns containing data from the right table. For columns appended to the left table (joins), row values equal the row values from the right table where the key values in the left and right tables are equal. If there is no matching key in the right table, appended row values are NULL.

Usage

TableHandle$natural_join(table, on = character(), joins = character())

Arguments

table: TableHandle referencing the table to join with.
on: String or list of strings denoting the names of the columns to join on.
joins: String or list of strings denoting the names of the columns to add from table.

Returns

A TableHandle referencing the new table.

Method `exact_join()`

Usage

TableHandle$exact_join(table, on = character(), joins = character())

Arguments

table: TableHandle referencing the table to join with.
on: String or list of strings denoting the names of the columns to join on.
joins: String or list of strings denoting the names of the columns to add from table.

Returns

A TableHandle referencing the new table.

Method `sort()`

Creates a new table containing all the rows and columns of this table, sorted by the specified columns.

Usage

TableHandle$sort(order_by, descending = FALSE, abs_sort = FALSE)

Arguments

order_by: String or list of strings denoting the names of the columns to sort by.
descending: Boolean or list of booleans denoting whether to sort in descending order. If a list is supplied, it must be the same length as order_by.
abs_sort: Boolean or list of booleans denoting whether to sort by absolute value. If a list is supplied, it must be the same length as order_by.

Returns

A TableHandle referencing the new table.

Examples

if (FALSE) { # \dontrun{
library(rdeephaven)

# connecting to Deephaven server
client <- Client$new("localhost:10000", auth_type = "psk", auth_token = "my_secret_token")

# create a data frame, push it to the server, and retrieve a TableHandle referencing the new table
df <- data.frame(
  timeCol = seq.POSIXt(as.POSIXct(Sys.Date()), as.POSIXct(Sys.Date() + 0.01), by = "1 sec")[1:50],
  boolCol = sample(c(TRUE, FALSE), 50, TRUE),
  col1 = sample(1000, size = 50, replace = TRUE),
  col2 = sample(1000, size = 50, replace = TRUE),
  col3 = 1:50
)
th <- client$import_table(df)

# get the dimension of the table
dim(th)

# get the last 10 rows of the table
th2 <- tail(th, 10)
as.data.frame(th2)

# create several new columns
th3 <- th$update(c("col4 = col1 + col2", "charCol = col3 % 2 == 0 ? `A` : `B`"))
as.data.frame(th3)

# filter based on parity of col3
th4 <- th3$where("charCol == `A`")
as.data.frame(th4)

# select a subset of columns
th5 <- th3$select(c("timeCol", "col1", "col4"))
as.data.frame(th5)

# drop timestamp column and get sum of remaining columns grouped by boolCol and charCol
th6 <- th3$
  drop_columns("timeCol")$
  sum_by(c("boolCol", "charCol"))
as.data.frame(th6)

client$close()
} # }

Naming tables on the server

Methods

Public methods

Method new()

Usage

Arguments

Method is_static()

Usage

Returns

Method bind_to_variable()

Usage

Arguments

Method head()

Usage

Arguments

Returns

Method tail()

Usage

Arguments

Returns

Method nrow()

Usage

Returns

Method ncol()

Usage

Returns

Method dim()

Usage

Returns

Method merge()

Usage

Arguments

Returns

Method as_record_batch_reader()

Usage

Returns

Method as_arrow_table()

Usage

Returns

Method as_tibble()

Usage

Returns

Method as_data_frame()

Usage

Returns

Method select()

Usage

Arguments

Returns

Method view()

Usage

Arguments

Returns

Method update()

Usage

Arguments

Returns

Method update_view()

Usage

Arguments

Returns

Method drop_columns()

Usage

Arguments

Returns

Method where()

Usage

Arguments

Returns

Method group_by()

Usage

Arguments

Returns

Method ungroup()

Usage

Arguments

Returns

Method update_by()

Usage

Method `new()`

Method `is_static()`

Method `bind_to_variable()`

Method `head()`

Method `tail()`

Method `nrow()`

Method `ncol()`

Method `dim()`

Method `merge()`

Method `as_record_batch_reader()`

Method `as_arrow_table()`

Method `as_tibble()`

Method `as_data_frame()`

Method `select()`

Method `view()`

Method `update()`

Method `update_view()`

Method `drop_columns()`

Method `where()`

Method `group_by()`

Method `ungroup()`

Method `update_by()`

Method `agg_by()`

Method `agg_all_by()`

Method `first_by()`

Method `last_by()`

Method `head_by()`

Method `tail_by()`

Method `min_by()`

Method `max_by()`

Method `sum_by()`

Method `abs_sum_by()`

Method `avg_by()`

Method `w_avg_by()`

Method `median_by()`

Method `var_by()`

Method `std_by()`

Method `percentile_by()`

Method `count_by()`