Posts tagged with big data
Why partitioned tables are powerful
The benefits of partitioning data
September 27 2024 • 9 min read
Combined aggregations for efficient analysis
Using Deephaven's Combined Aggregators
September 9 2024 • 3 min read
Deephaven and Iceberg
Big data storage meets big data processing
September 6 2024 • 5 min read
In response to Rockset-OpenAI: a brief real-time analytics manifesto
Why Deephaven is the best for real-time analytics
July 15 2024 • 8 min read
Store historical crypto data without file bloat
Make Parquet your new data storage standard
June 1 2022 • 3 min read
Batch process CSVs 60x faster than pandas with Parquet
Parquet files for big data
April 27 2022 • 4 min read
Data science without big costs: how to have a $3800 Laptop for $20 a month
Why I run on Google Cloud
April 18 2022 • 9 min read
The r/place dataset
Translating the r/place dataset from CSV to Parquet
April 8 2022 • 3 min read
Kafka + Parquet: Maximize speed, minimize storage
Parquet offers a smart storage solution for streaming data
February 22 2022 • 5 min read
RSS meta-data discovery and Podcast exploration
How to pull data from multiple RSS feeds simultaneously into Deephaven
February 16 2022 • 5 min read
Display a quadrillion rows of data in the browser
Use canvas to get around limitations of DOM based data grids
January 24 2022 • 7 min read
A DIY Reddit sentiment analyzer (of meme stocks)
Performing sentiment analysis on Reddit posts scraped via RSS
January 13 2022 • 5 min read
How to implement streaming analytics with Redpanda & Deephaven
Redpanda and Deephaven combined make the future of data
January 12 2022 • 10 min read
Take Twitter's temperature with Deephaven: a sentiment analysis tutorial
Pull live Twitter data for specified cryptocurrencies to perform sentiment analysis with Deephaven
January 11 2022 • 9 min read
Crypto made easy: import live trade data
Pull live and historical data for specified cryptocurrencies from the [CoinGecko](https://www.coingecko.com/) website into [Deephaven](https://github.com/deephaven/deephaven-core).
December 16 2021 • 6 min read
Crossing the streams is good
When you harness all the data streams available—both batch and real-time streams—you’re empowered with a much clearer, more accurate, more complete picture of the situation and what the next best course of action should be.
October 18 2021 • 3 min read
Deephaven Community Core with real-time data capabilities now available
Free and open time-series database designed for developers and data scientists removes barriers associated with complex data processing to fuel innovation and improve productivity.
October 13 2021 • 4 min read