Posts tagged with big data

Why partitioned tables are powerful

The benefits of partitioning data

September 27 2024 • 9 min read

Combined aggregations for efficient analysis

Using Deephaven's Combined Aggregators

September 9 2024 • 3 min read

Deephaven and Iceberg

Big data storage meets big data processing

September 6 2024 • 5 min read

In response to Rockset-OpenAI: a brief real-time analytics manifesto

Why Deephaven is the best for real-time analytics

July 15 2024 • 8 min read

Store historical crypto data without file bloat

Make Parquet your new data storage standard

June 1 2022 • 3 min read

Batch process CSVs 60x faster than pandas with Parquet

Parquet files for big data

April 27 2022 • 4 min read

Data science without big costs: how to have a $3800 Laptop for $20 a month

Why I run on Google Cloud

April 18 2022 • 9 min read

The r/place dataset

Translating the r/place dataset from CSV to Parquet

April 8 2022 • 3 min read

Kafka + Parquet: Maximize speed, minimize storage

Parquet offers a smart storage solution for streaming data

February 22 2022 • 5 min read

RSS meta-data discovery and Podcast exploration

How to pull data from multiple RSS feeds simultaneously into Deephaven

February 16 2022 • 5 min read

Display a quadrillion rows of data in the browser

Use canvas to get around limitations of DOM based data grids

January 24 2022 • 7 min read

A DIY Reddit sentiment analyzer (of meme stocks)

Performing sentiment analysis on Reddit posts scraped via RSS

January 13 2022 • 5 min read

How to implement streaming analytics with Redpanda & Deephaven

Redpanda and Deephaven combined make the future of data

January 12 2022 • 10 min read

Take Twitter's temperature with Deephaven: a sentiment analysis tutorial

Pull live Twitter data for specified cryptocurrencies to perform sentiment analysis with Deephaven

January 11 2022 • 9 min read

Crypto made easy: import live trade data

Pull live and historical data for specified cryptocurrencies from the [CoinGecko](https://www.coingecko.com/) website into [Deephaven](https://github.com/deephaven/deephaven-core).

December 16 2021 • 6 min read

Crossing the streams is good

When you harness all the data streams available—both batch and real-time streams—you’re empowered with a much clearer, more accurate, more complete picture of the situation and what the next best course of action should be.

October 18 2021 • 3 min read

Deephaven Community Core with real-time data capabilities now available

Free and open time-series database designed for developers and data scientists removes barriers associated with complex data processing to fuel innovation and improve productivity.

October 13 2021 • 4 min read