
I am sending a query to Apache Drill from Apache Spark, and I am getting the following error:

java.sql.SQLException: Failed to create prepared statement: PARSE ERROR: Encountered "\"" at line 1, column 23.

When I traced it, I found that I need to write a custom SQL dialect. The problem is that I cannot find any examples for PySpark; all the examples are for Scala or Java. Any help is highly appreciated!

Here is the PySpark code:

```python
dataframe_mysql = spark.read.format("jdbc") \
    .option("url", "jdbc:drill:zk=ip:2181;schema=dfs") \
    .option("driver", "org.apache.drill.jdbc.Driver") \
    .option("dbtable", "dfs.`/user/titanic_data/test.csv`") \
    .load()
```
2 Answers


It looks like you have used a double quote in your SQL query (please share your SQL).

By default, Drill uses the backtick (`` ` ``) for quoting identifiers. You can change this by setting the system/session option (for example, when you are already connected to Drill via JDBC), or you can specify it in the JDBC connection string. You can find more information here: https://drill.apache.org/docs/lexical-structure/#identifier-quotes
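As a minimal sketch of the connection-string approach: the linked page describes a `quoting_identifiers` connection parameter that sets the quote character when the session is opened, so no web UI change is needed. The host name here is a placeholder, and the parameter name is taken from the Drill documentation rather than from the question:

```python
def drill_jdbc_url(zk_host, schema="dfs", quote='"'):
    """Build a Drill JDBC URL that sets the identifier-quoting
    character at connection time via quoting_identifiers."""
    return (
        f"jdbc:drill:zk={zk_host}:2181;"
        f"schema={schema};"
        f"quoting_identifiers={quote}"
    )

url = drill_jdbc_url("ip")
print(url)  # jdbc:drill:zk=ip:2181;schema=dfs;quoting_identifiers="
```

The resulting URL is then passed to Spark exactly like the one in the question, e.g. `spark.read.format("jdbc").option("url", url)...`.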


I navigated to the Drill web UI and updated the `planner.parser.quoting_identifiers` parameter to `"`. Then I edited my query as below:

```python
dataframe_mysql = spark.read.format("jdbc") \
    .option("url", "jdbc:drill:zk=ip:2181;schema=dfs;") \
    .option("driver", "org.apache.drill.jdbc.Driver") \
    .option("dbtable", "dfs.\"/user/titanic_data/test.csv\"") \
    .load()
```

And it worked like a charm!
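To make the escaping explicit: the double quotes around the file path must survive into the SQL that Spark sends to Drill, so inside the Python string they are written as `\"`. A small sketch of just that piece (the path is the one from the question):

```python
# The dbtable value Spark embeds in its generated SQL: the quotes
# belong to the value itself, hence the escaping in the option() call.
path = "/user/titanic_data/test.csv"
dbtable = f'dfs."{path}"'
print(dbtable)  # dfs."/user/titanic_data/test.csv"
```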