Get column names of Excel worksheet with OpenPyXL in readonly mode

Question

How could I retrieve

1. the column names (values of the cells in the first row) in an openpyxl Read-only worksheet?
   - City, Population, Country in the below example worksheet
2. all column names in an openpyxl Read-only workbook?
   - City, Population, Country, frames from worksheet 1 and the other column names from all other worksheets

Example Excel worksheet:

| City       | Population  |    Country   |
| -----------|------------ | ------------ |
| Madison    |   252,551   |     USA      |
| Bengaluru  | 10,178,000  |    India     |
| ...        |       ...   |     ...      |

Example code:

from openpyxl import load_workbook

wb = load_workbook(filename='large_file.xlsx', read_only=True)
sheet = wb.worksheets[0]

... (not sure where to go from here)

Notes:

- I have to use read-only mode because the Excel file has over 1 million rows (don't ask).
- I'd like the column names so I can eventually infer the column types and import the Excel data into a PostgreSQL database.

You're still talking about print_titles which are something different. As are headers and footers. — Charlie Clark
So, what's the question now? [c.value for c in ws.iter_rows(min_row=1, max_row=1)] not sufficient? — Charlie Clark
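A minimal sketch of the `iter_rows` approach from the comment above, for reference. Note that `iter_rows` yields one tuple per row, so the row has to be unpacked before reading values. The sketch assumes openpyxl 2.6 or later (for `values_only=True`); the helper name `first_row_values` and the small temporary file standing in for `large_file.xlsx` are made up for illustration.

```python
import os
import tempfile

from openpyxl import Workbook, load_workbook


def first_row_values(ws):
    """Return the values of the first row of a worksheet as a list."""
    # iter_rows yields one tuple per row; with values_only=True the tuple
    # holds plain values instead of cell objects.
    for row in ws.iter_rows(min_row=1, max_row=1, values_only=True):
        return list(row)
    return []  # no rows were yielded


# Build a tiny stand-in for the large file (hypothetical data).
path = os.path.join(tempfile.mkdtemp(), "example.xlsx")
wb_out = Workbook()
ws_out = wb_out.active
ws_out.append(["City", "Population", "Country"])
ws_out.append(["Madison", 252551, "USA"])
wb_out.save(path)

# Re-open in read-only mode, as in the question.
wb = load_workbook(filename=path, read_only=True)
headers = first_row_values(wb.worksheets[0])
wb.close()  # read-only workbooks keep the file open until closed
print(headers)
```

Only the first row is requested, so the remaining rows of a large sheet should not need to be read.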

HaR HaR · Accepted Answer · 2018-08-24T07:41:45

This collects every value from row 1 into a list:

# ws is the worksheet, e.g. ws = wb.worksheets[0]
list_with_values = []
for cell in ws[1]:
    list_with_values.append(cell.value)

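If for some reason you want a list of the column letters that are filled in, you can use:

# In openpyxl 2.6 and later, cell.column returns the integer column index;
# cell.column_letter returns the letter ('A', 'B', ...).
column_list = [cell.column_letter for cell in ws[1]]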
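As an alternative that does not depend on cell attributes, the letters can also be derived from the position of each header value. This is only a sketch: `header_letters` is a hypothetical helper, and it assumes the header list starts at column A.

```python
from openpyxl.utils import get_column_letter


def header_letters(header_values):
    """Pair each non-empty header value with its column letter."""
    # Positions are 1-based: column 1 is 'A', column 2 is 'B', and so on.
    return [
        (get_column_letter(index), value)
        for index, value in enumerate(header_values, start=1)
        if value is not None
    ]


letters = header_letters(["City", "Population", "Country"])
print(letters)  # [('A', 'City'), ('B', 'Population'), ('C', 'Country')]
```

Returning pairs rather than a dictionary keeps duplicate header names distinct.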

For your second question: assuming you have stored the header values in a list called "list_with_values", you can write them as the first row of a new workbook:

from openpyxl import Workbook
wb = Workbook()
ws = wb['Sheet']
# 'Sheet' is the default sheet name; you can rename it or create additional sheets with wb.create_sheet()
ws.append(list_with_values)
wb.save('OutPut.xlsx')
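If the second question is read as "collect the column names of every worksheet in a read-only workbook", one possible sketch is to loop over `wb.worksheets` and read only the first row of each. The function name `headers_by_sheet`, the sheet names, and the demo data are assumptions for illustration; `values_only=True` assumes openpyxl 2.6 or later.

```python
import os
import tempfile

from openpyxl import Workbook, load_workbook


def headers_by_sheet(path):
    """Map each worksheet title to the list of values in its first row."""
    wb = load_workbook(filename=path, read_only=True)
    try:
        result = {}
        for ws in wb.worksheets:
            rows = ws.iter_rows(min_row=1, max_row=1, values_only=True)
            # next() with a default avoids StopIteration if no row is yielded.
            result[ws.title] = list(next(rows, ()))
        return result
    finally:
        wb.close()  # release the file handle held in read-only mode


# Hypothetical two-sheet workbook standing in for the real file.
path = os.path.join(tempfile.mkdtemp(), "multi.xlsx")
wb_out = Workbook()
cities = wb_out.active
cities.title = "Cities"
cities.append(["City", "Population", "Country"])
cities.append(["Bengaluru", 10178000, "India"])
people = wb_out.create_sheet("People")
people.append(["Id", "Name"])
wb_out.save(path)

all_headers = headers_by_sheet(path)
print(all_headers)
```

Keeping the result keyed by sheet title makes it straightforward to map each worksheet to its own database table later.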
