3
votes

I've created a Sagemaker notebook to dev AWS Glue jobs, but when running through the provided example ("Joining, Filtering, and Loading Relational Data with AWS Glue") I get the following error:

enter image description here

Does anyone know what I've setup wrong/haven't setup to cause the import to not work?

4

4 Answers

1
votes

I had the same issue and the selected solution did not work for me.

I did manage to get working by using cloud formation (AWS::Glue::DevEndpoint). Through trial and error I noticed that you can't specify both NumberOfNodes and NumberOfWorkers at the same time. You have to specify one or the other.

Using NumberOfNodes: 5 resulted in the exact same error as specified in the question. But using the 2nd option worked perfectly.

So to conclude, to fix this error you can use CloudFormation and make sure to use the NumberOfWorkers property.

0
votes

You'll need to download the library files from here for Glue 0.9 or here for Glue 1.0 (Check your Glue jobs for the version).

Put the zip in S3 and reference it in the "Python library path" on your Dev Endpoint.

0
votes

hm... this approach doesn't work for me. I've just put zip to "Python library path", referenced to it and it doesn't work

0
votes

Add AWSGlueServiceNotebookRole to your Dev Endpoint IAM Role, restart your kernel and rerun