I've been reading Prince's book Computer Vision: Models, Inference, and Learning, specifically with the aim of understanding camera parameters and the pose estimation problem and I'm having some trouble with the extrinsic camera parameters. As I understand it, the extrinsic camera parameters consist of a rotation matrix and a translation vector. The rotation matrix transforms the world co-ordinate system into the camera co-ordinate frame. My question is whether the rotation matrix is a rotation matrix in the strict sense; as in it's orthogonal and has determinant 1.
I ask because in a subsequent chapter on geometric transformations, he describes the case where the camera is viewing a plane (w/z co-ordinate = 0), and introduces affine and projective transformations represented by the extrinsic camera matrix. I'm confused because such transformations can't be achieved using a rotation matrix, or am I wrong? Generally confused