Unable to install pandas on AWS Lambda

Question

I'm trying to install and run pandas on an Amazon Lambda instance. I've used the recommended zip method of packaging my code file model_a.py and related python libraries (pip install pandas -t /path/to/dir/) and uploaded the zip to Lambda. When I try to run a test, this is the error message I get:

Unable to import module 'model_a': C extension: /var/task/pandas/hashtable.so: undefined symbol: PyFPE_jbuf not built. If you want to import pandas from the source directory, you may need to run 'python setup.py build_ext --inplace' to build the C extensions first.

Looks like an error in a variable defined in hashtable.so that comes with the pandas installer. Googling for this did not turn up any relevant articles. There were some references to a failure in numpy installation but nothing concrete. Would appreciate any help in troubleshooting this! Thanks.

Why don't you try the virtualenv-based approach? That way you won't miss any dependencies required by the python packages that you include in your lambda deployment package. — Leon
I thought they were different, but cannot find any evidence supporting that point of view. — Leon
Check the answer at stackoverflow.com/a/43766512/345606 for advice on including Python packages, like Pandas, that have compiled code. — Kevin
I've deployed Pandas projects to AWS Lambda several times using Zappa and I haven't hit the problem you're running into. Zappa also works out of virtual environments. So I'm not sure if it's the venv step or how Zappa packages up the libraries that deserves the credit for avoiding the problem. — Chad Parmet

Szilárd Kálosi Szilárd Kálosi · Accepted Answer · 2020-09-30T14:33:59

I would advise you to use Lambda layers to use additional libraries. The size of a lambda function package is limited, but layers can be used up to 250MB (more here).

AWS has open sourced a good package, including Pandas, for dealing with data in Lambdas. AWS has also packaged it making it convenient for Lambda layers. You can find instructions here.

Unable to install pandas on AWS Lambda

3 Answers