Can Apache Drill connect to Amazon Redshift? If so, can anyone help me with the configuration and plugin needed for Apache Drill to connect to Amazon Redshift?
2 Answers
I was not able to get the example from @pavel to work. I am sure it has to do with the driver name and which specific driver I downloaded and installed.
However, I was able to successfully connect to AWS Redshift using the following steps.
Navigate to the folder you downloaded Drill to:
cd ~/Downloads/apache-drill-1.11.0
Download the required JDBC driver from AWS (see the "Download the Amazon Redshift JDBC Driver" page).
Save the JAR files into the folder ~/Downloads/apache-drill-1.11.0/jars/3rdparty
Start Drill using:
./bin/drill-embedded
Navigate to http://localhost:8047/storage to configure a new data source (a storage plugin).
Create a new storage plugin and name your data source accordingly. For my test I called it redshift.
Once you have named the source "redshift", paste in the following configuration, replacing the host, database name, username, and password with your own:
{
  "type" : "jdbc",
  "driver" : "com.amazon.redshift.jdbc42.Driver",
  "url" : "jdbc:redshift://redshiftserver.mydomain.net:5439/databaseName?ssl=true&sslfactory=com.amazon.redshift.ssl.NonValidatingFactory",
  "username" : "username",
  "password" : "password",
  "enabled" : true
}
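If you would rather script this step than paste into the web UI, Drill also exposes a REST endpoint for storage plugins. The following is only a rough sketch, not something I have run against Redshift: it assumes Drill is listening on localhost:8047 as in the steps above, and the helper names build_plugin_payload and register_plugin are made up for illustration. The host, database, and credentials are the same placeholders as in the configuration above.

```python
import json
import urllib.request

# Assumption: embedded Drill started as above, web/REST port 8047.
DRILL_URL = "http://localhost:8047"


def build_plugin_payload(name, host, database, username, password, port=5439):
    """Build the JSON body for a JDBC storage plugin (hypothetical helper)."""
    url = (
        f"jdbc:redshift://{host}:{port}/{database}"
        "?ssl=true&sslfactory=com.amazon.redshift.ssl.NonValidatingFactory"
    )
    config = {
        "type": "jdbc",
        "driver": "com.amazon.redshift.jdbc42.Driver",
        "url": url,
        "username": username,
        "password": password,
        "enabled": True,
    }
    return {"name": name, "config": config}


def register_plugin(payload, drill_url=DRILL_URL):
    """POST the payload to /storage/<name>.json; needs a running Drill."""
    request = urllib.request.Request(
        f"{drill_url}/storage/{payload['name']}.json",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(request) as response:
        # Return the raw response text so the caller can inspect it.
        return response.read().decode("utf-8")
```

To use it, build the payload with your own values, for example build_plugin_payload("redshift", "redshiftserver.mydomain.net", "databaseName", "username", "password"), and pass the result to register_plugin while Drill is running. The driver JAR still has to be in jars/3rdparty before Drill starts.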
Next, test connectivity to the server using the following query:
select * from redshift.autonation.information_schema.columns
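The same check can be run without the web UI through Drill's REST query endpoint. This is a minimal sketch under the assumption that Drill is reachable at localhost:8047 and that a storage plugin named redshift is already enabled; build_query_payload and run_query are illustrative names, not part of Drill.

```python
import json
import urllib.request

# Assumption: Drill's REST interface is on the default local port.
DRILL_URL = "http://localhost:8047"


def build_query_payload(sql):
    """Wrap a SQL string in the body expected by POST /query.json."""
    return {"queryType": "SQL", "query": sql}


def run_query(sql, drill_url=DRILL_URL):
    """Submit the query to a running Drill and return the parsed JSON reply."""
    request = urllib.request.Request(
        f"{drill_url}/query.json",
        data=json.dumps(build_query_payload(sql)).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(request) as response:
        return json.loads(response.read().decode("utf-8"))
```

Calling run_query("select * from redshift.autonation.information_schema.columns") against a running Drill should return a JSON object with the result columns and rows if the plugin is configured correctly; an error response usually points at the driver class name or the JDBC URL.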
Yep, just use the Example-Postgres-Configuration.
Then use the web GUI to add a storage plugin:
{
  "type" : "jdbc",
  "driver" : "org.postgresql.Driver",
  "url" : "jdbc:postgresql://URL:5439/DB_NAME_HERE",
  "username" : "user",
  "password" : "password",
  "enabled" : true
}