How does X*theta in cost function from?

Question

In Linear Regression, there is a cost function as:

The code in Octave is:

function J = computeCost(X, y, theta)
%COMPUTECOST Compute cost for linear regression
%   J = COMPUTECOST(X, y, theta) computes the cost of using theta as the
%   parameter for linear regression to fit the data points in X and y

% Initialize some useful values
m = length(y); % number of training examples

% You need to return the following variables correctly 
J = 0;

% ====================== YOUR CODE HERE ======================
% Instructions: Compute the cost of a particular choice of theta
%               You should set J to the cost.

H = X*theta; 
S = (H - y).^2;
J = 1 / (2*m) * sum(S);

% =========================================================================

end

Could someone tell me why sigma(h0(x(i))) is equal to a vectorization X*theta?

Thanks

Ash Ash · Accepted Answer · 2020-01-14T12:02:10

Could someone tell me why sigma(h0(x(i))) is equal to a vectorization X*theta?

That is not the case. At no point in this code does sigma(h(x_i)) get computed separately. The variable H is not equal to that value but it's a (column) vector that stores the values

 `h(x_i)=dot_product(x_i,theta)`

for all examples.

The formula that you give in Latex just says that it wants us to sum the ((h(x_i)-y_i))^2 over all examples. What you want to avoid doing is to compute h(x_i) for all of those examples in a sequential manner, because that would be time consuming. From the definition of h(x), you know that

#I've written a more general case, and the case `n==1` will correspond to your Latex formula)
h(x_i)=[1 x_i1 ... x_in]*[theta_0 theta_1 ... theta_n]'

The matrix X is of size m*n, where m is the number of examples. So each row of the vector

H=X*theta #H is a vector of size m*1

will correspond to a single h(x_i).

Knowing this, you can see that

S=(H-y).^2 #S is a vector of size m*1

is a vector such that each element is one of the (h(x_i)-y_i)^2. So, you just need to sum all of them with sum(S) to get the value of the sigma from your Latex formula.

How does X*theta in cost function from?

2 Answers