Skip to content
Paul Rogers edited this page Nov 28, 2016 · 3 revisions

Test Data

When developing Drill it is handy to have a variety of test data available. Below is a partial list of such resources.

TPC-H

Drill includes the TPC-H data and queries. Scan the specification for details, especially the ER diagram on page 11 (reproduced below.)

(insert image)

  • TestTpchDistributedConcurrent tests a variety of TPC-H queries. Look at it for links to the queries and data.
  • Queries are in drill-java-exec/src/test/resources/queries/tpch.
  • Data is available to Drill in cp.`tpch/something.parquet`
  • Data is packaged in tpch-sample-data-x.y.z.jar
  • Data is available in the development environment in drill-contrib-data-parent/tpch-sample-data/target/classes/tpch.

FoodMart Analytic Data

Drill ships the FoodMart data set maintained by Julian Hyde, adapted from the original Microsoft version.

Clone this wiki locally