I'm using EMR Notebooks with pyspark and livy.
I'm reading the data from s3 which is in parquet format and string into a pyspark dataframe. there are approx. 2 million rows. when i do a join operation. I am getting 400 session isn't active. for which i have already set the livy timeout to 5h.
An error was encountered: Invalid status code '400' from https://172.31.12.103:18888/sessions/5/statements/20 with error payload: "requirement failed: Session isn't active."