Python's alphabetical sort file error

Question

i'm new to python & here is my question: Open the file romeo.txt and read it line by line. For each line, split the line into a list of words using the split() method. The program should build a list of words. For each word on each line check to see if the word is already in the list and if not append it to the list. When the program completes, sort and print the resulting words in alphabetical order.

This is the file:

But soft what light through yonder window breaks

It is the east and Juliet is the sun

Arise fair sun and kill the envious moon

Who is already sick and pale with grief

Desired Output:

['Arise', 'But', 'It', 'Juliet', 'Who', 'already', 'and', 'breaks', 'east', 'envious', 'fair', 'grief', 'is', 'kill', 'light', 'moon', 'pale', 'sick', 'soft', 'sun', 'the', 'through', 'what', 'window', 'with', 'yonder']

This is my code:

fname = raw_input("Enter file name: ")
fh = open(fname)

lst = list()

#loop through the text to get the lines
for line in fh:

    line = line.rstrip()

    #loop through the line to get the words       
    for word in line:
        words = line.split()

        #if a word is not in the empty list, append it       
        if not word in lst: lst.append(word)

lst.sort()
print lst

My output:

[' ', 'A', 'B', 'I', 'J', 'W', 'a', 'b', 'c', 'd', 'e', 'f', 'g', 'h', 'i', 'k', 'l', 'm', 'n', 'o', 'p', 'r', 's', 't', 'u', 'v', 'w', 'y']

If you could tell me what is wrong (to get only the first letters of the words with a space in the beginning instead of the whole words), it would be great..

Note: I want the code using these instructions, not other advanced instructions (to keep my learning sequence)

Thank you

When you write for word in line:, Python will loop through what it considers the "elements" of the string line: That is, it will loop through the string character by character. So word is always a single character. (Note that there is some confusion here anyway: you define words, but never use it.) — Mees de Vries

SaxyPandaBear SaxyPandaBear · Accepted Answer · 2017-07-04T11:11:56

You should be calling split() on the line, not the words in the line.

for line in file:
    result = line.split()  # this returns a list of values
    for word in result:
        # check if it already is in your list of words
list.sort()

Python's alphabetical sort file error

4 Answers