I'm processing the table "Content" with a user-defined function "TrasformData":
@result =
    SELECT Id,
           TrasformData(Data) AS TrasformedData
    FROM Content;
The table "Content" is big (about 100M records) and the "TrasformData" function is slow: it is very complex and takes about 20 milliseconds per record.
Azure Data Lake splits my query into 25 vertices by default. That is not enough: it may take hours to finish on 25 AU. I would like to allocate at least 200 AU for this job and finish it as fast as I can. As far as I understand, it is useless to allocate more than 25 AU for this query as long as it is split into only 25 vertices.
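To put rough numbers on this, here is a back-of-the-envelope estimate in Python (not part of the U-SQL job). It assumes the rows are spread evenly over the vertices, that each vertex runs on its own AU, and that the only cost is the 20 ms per call; I/O and scheduling overhead are ignored, so the real figures will differ. The name `estimated_hours` is just a helper I made up for this calculation.

```python
# Rough runtime estimate; assumes perfect scaling and no I/O or scheduling overhead.
RECORDS = 100_000_000        # about 100M rows in "Content"
SECONDS_PER_RECORD = 0.020   # about 20 ms per TrasformData call


def estimated_hours(vertices: int) -> float:
    """Wall-clock hours if the rows are split evenly over `vertices` parallel vertices."""
    total_cpu_seconds = RECORDS * SECONDS_PER_RECORD
    return total_cpu_seconds / vertices / 3600


if __name__ == "__main__":
    for v in (25, 200):
        print(f"{v} vertices: ~{estimated_hours(v):.1f} hours")
```

Under these assumptions the job needs roughly 22 hours on 25 vertices and a little under 3 hours on 200, which is why I want the query split into more vertices.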
Can I somehow increase the parallelism of my query? Could anyone help me with this? Any options are acceptable.