Test Framework for SQL on Hadoop technologies. Currently supports Apache Drill, a schema-free SQL query engine for Hadoop, NoSQL and cloud storage.
The framework is built for regression, integration & sanity testing. Includes test coverage (with baselines) for core Drill functionality, and supported features. A subset of these tests are used by the Apache Drill community for pre-commit and pre-release criteria.
- Clone the repository
- Configure test environment
- Review tests
- Build test framework
- Execute tests
git clone [email protected]:mapr/drill-test-framework.git
Refer to Github documentation on how to clone a repository.
- The test framework requires a distributed file system such as HDFS or MapR-FS to be configured. Hive and HBase are additional dependencies to run all tests (refer to the example in the Execute Tests section to exclude tests with dependencies).
- The test framework expects Drill services to be setup on a clustered environment. Refer to Drill documentation for details on how to setup Drill.
- Copy the
drillTestConfig
file fromframework/src/main/resources
to your home directory as~/.drillTestConfig
. Edit suitably, following instructions in the file. - Source
.drillTestConfig
. ConfirmDRILL_HOME
and other required environment variables are set.
drill-test-framework
|_ framework
|_ resources
|_ Functional (default location for test suites)
|_ Advanced (test suites requiring large datasets)
|_ Datasources (datasets and scripts)
- In the
framework/resources/[Functional/Advanced]
directory, create a directory for a new test suite (orcd
to an existing test suite). - In the test suite directory, add testcases, expected results, and test definition file(s). Optionally include test suite sub directories to organize tests.
- Pairs of testcase (ex:
query1.sql
) and expected result files (ex:query1.e_tsv
) are co-located and share the same name. - You could generate expected result files using Postgres or any such database.
- In the
framework/resources/Datasources
directory, create corresponding datasource directories and copy over any required scripts and datasets required by the tests.
{
"testId": "...", // Unique identification string for test suite. Ex: window_function_tests
"type": "group", // Group of tests
"negative": false, // Tests are positive or negative (expected success or expected failures)
"description": "...", // Description for test suites
"submit-type": "jdbc", // Currently only support jdbc
"queryType": "sql",
"timeout": null,
"categories": [
"...", // Labels for categorizing test suites. Ex: smoke
"..."
],
"dependencies": [
"...", // Additional dependencies in setup. Ex: hive, hbase, maprdb
"..."
],
"matrices": [
{
"query-file": `".*.sql"`, // File name pattern of query files (REGEX). The example will match any filename with ".sql" extension
"schema": "dfs.tmp", // Drill storage plugin.workspace, or just the storage plugin name in case of hive, hbase
"output-format": "tsv", // Delimiter to seperate columns in expected result files
"expected-file": `".*.e_tsv"`, // File name pattern of expected result files (REGEX)
"username": "user",
"password": "pass",
"verification-type": [ // Supported type "in-memory", "regex"
"in-memory",
"..."
]
}
],
"datasources": [
{
"mode": "cp", // Use "cp" to copy datasets
"src": "`
In the framework
directory, execute mvn clean install
first, to build the project and also download any dependent datasets configured in pom.xml
In the framework
directory, execute the following command to run tests:
./run.sh -s <suites> -g <groups> -t <Timeout> -x <Exclude> -n <Concurrency>
Example:
./run.sh -s `Functional/aggregates,Functional/joins` -g `smoke,regression` -x `hbase` -t `180` -n `2`
-s suites (required)
Here `Functional/aggregates,Functional/joins` are directories inside `framework/resources/Functional`. All test suites
and sub-suites within a directory are included.
-g groups (required)
Here `smoke, regression` are categories of tests to execute
-t timeout (optional)
Here `180` seconds is the max time for a query to execute.
-n concurrency (optional)
Here `2` queries can execute concurrently.
-x exclude dependencies (optional)
Here any `hbase` test suites within the specified directory are excluded.
-h help (optional)
Use this option to provide the usage of the command, which includes additional options.
Zhiyong
Chun
Krystal
Ramana
Abhishek
Vicky
Suresh
Khurram
Rahul
Steven
Jacques
Jason
Sudheesh