Open Dataset
- Kaggle DataSet Explore, analyze, and share quality data. About Computer Science 、Education、Classification、Computer Vision、NLP、 Data Visualization、Pre-Trained Model
- MovieLens | GroupLens
- Wikimedia Downloads
- Tianchi Datasets
- 和鲸社区 - Heywhale.com
- NYCtaxidata :纽约市出租车与豪华礼车管理局(New York City Taxi & Limousine Commission)公开的数据集
- The UK Property Price Paid dataset is a good starting point with some interesting SQL queries
- The New York Taxi Data has an example of how to insert data from S3 into ClickHouse
- The Cell Towers dataset imports a CSV into ClickHouse
- The NYPD Complaint Data demonstrates how to use data inference to simplify creating tables
- The “What’s on the Menu?” dataset has an example of denormalizing data
- The Laion dataset has an example of Approximate nearest neighbor search indexes usage
- The TPC-H, TPC-DS, and Star Schema (SSB) industry benchmarks for analytics databases
- Getting Data Into ClickHouse - Part 1 provides examples of defining a schema and loading a small Hacker News dataset
- Getting Data Into ClickHouse - Part 3 - Using S3 has examples of loading data from s3
- Generating random data in ClickHouse shows how to generate random data if none of the above fit your needs.