I'm a freshman who is studying LDA (Latent Dirichlet Allocation) model nowadays. But, I faced a problem.
How is the theta drawn from the alpha?
theta ~ Dir(alpha)
According to my short understanding, the variable theta is a vector with its length K and its components represent the topic proportions in a document. And, the thetas are different with each other for each document. And, in corpus level, the alpha is still a K-vector whereas the theta is a M(# of docs) by K(# of topics) sized matrix.
First question: What I mentioned above is true?
Second question: If true, over the documents, how can the different thetas (K-vectors) be drawn from the same Dirichlet distribution?