39
votes

In Python I have a function that takes many parameters. I want to fit this function to a data set while fitting only one parameter; the remaining parameters I want to supply myself. Here is an example:

def func(x,a,b):
   return a*x*x + b

for b in range(10):
   popt,pcov = curve_fit(func,x1,x2)

Here I want the fit to be done only for a, with the parameter b taking the value of the loop variable. How can this be done?

7
There're infinite ways to define what it means to "fit" a curve, and for each method, many ways to implement it. The type of curve-fitting you want is often dependent on the problem you're trying to solve. Assuming you don't care, one simple way is called least squares, which minimizes the sum of the squares of the errors. Here is a pre-made library that calculates the solution to a "damped" least squares: docs.scipy.org/doc/scipy/reference/generated/… Question is incomplete though; I suggest to close and reopen with a specific question about curve-fitting. - ninjagecko
I don't care about the algorithm; I will just use curve_fit from scipy.optimize. What I can't understand is where I should specify that one of the parameters takes my value, and which parameter should be fitted. - lovespeed
@ninjagecko His question is very specific and has a very clear purpose. He is not asking how the process of curve fitting works. - PaulMag
I have another suggestion which might be more intuitive - Azerila

7 Answers

53
votes

You can wrap func in a lambda, as follows:

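from scipy.optimize import curve_fit

def func(x,a,b):
   return a*x*x + b

for b in range(10):
   # curve_fit sees a function of (x, a) only; b comes from the loop
   popt,pcov = curve_fit(lambda x, a: func(x, a, b), x1, x2)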

A lambda is an anonymous function, which in Python is limited to a single expression. It is normally used to reduce the amount of code when you don't need to assign a name to the function. A more detailed description is given in the official documentation: http://docs.python.org/tutorial/controlflow.html#lambda-forms

In this case, a lambda is used to fix one of the arguments of func. The newly created function accepts only two arguments: x and a, whereas b is fixed to the value taken from the local b variable. This new function is then passed into curve_fit as an argument.
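To make the lambda approach concrete, here is a minimal self-contained sketch. The synthetic, noise-free data (generated with a = 2 and b = 3) and the dictionary name fitted_a are assumptions added for illustration; they are not part of the original question.

```python
import numpy as np
from scipy.optimize import curve_fit

def func(x, a, b):
    return a * x * x + b

# Hypothetical data for illustration: generated with a = 2.0 and b = 3.0
x1 = np.linspace(0.0, 5.0, 50)
x2 = func(x1, 2.0, 3.0)

fitted_a = {}
for b in range(10):
    # The lambda exposes only (x, a) to curve_fit; b is read from the loop
    # variable while curve_fit runs inside this iteration.
    popt, pcov = curve_fit(lambda x, a: func(x, a, b), x1, x2)
    fitted_a[b] = popt[0]

print(fitted_a[3])  # fitted a for the b value that generated the data
```

Because curve_fit calls the lambda immediately within each iteration, the value of b it sees is always the current loop value.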

4
votes

A better approach would be to use lmfit, which provides a higher-level interface to curve fitting. Among other features, lmfit makes fitting parameters first-class objects that can have bounds or be explicitly fixed.

Using lmfit, this problem might be solved as:

from lmfit import Model
def func(x,a,b):
   return a*x*x + b

# create model
fmodel = Model(func)
# create parameters -- these are named from the function arguments --
# giving initial values
params = fmodel.make_params(a=1, b=0)

# fix b:
params['b'].vary = False

# fit parameters to data with various *static* values of b
# (x1 holds the x values and x2 the y values, as in the question):
for b in range(10):
   params['b'].value = b
   result = fmodel.fit(x2, params, x=x1)
   print("b=%f, a=%f+/-%f, chi-square=%f" % (b, result.params['a'].value,
                                            result.params['a'].stderr,
                                            result.chisqr))
2
votes

Instead of using a lambda, which might be less intuitive to digest, I would recommend the bounds parameter of SciPy's curve_fit, which forces each parameter to be searched within custom boundaries.

All you have to do is let a range between -inf and +inf and restrict b to the interval between (b - epsilon) and (b + epsilon). Note that this pins b only to within epsilon rather than fixing it exactly.

In your example:

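import numpy as np
from scipy.optimize import curve_fit

epsilon = 0.00001

def func(x,a,b):
    return a*x*x + b

for b in range(10):
    # bounds = (lower bounds, upper bounds), ordered as (a, b)
    popt,pcov = curve_fit(func,x1,x2, bounds=((-np.inf,b-epsilon), (np.inf,b+epsilon)))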
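As a quick check of the bounds approach, the sketch below fits synthetic data and inspects both fitted values. The data (generated with a = 2 and b = 3) and the name b_fixed are assumptions chosen for illustration only.

```python
import numpy as np
from scipy.optimize import curve_fit

def func(x, a, b):
    return a * x * x + b

# Hypothetical data for illustration: generated with a = 2.0 and b = 3.0
x1 = np.linspace(0.0, 5.0, 50)
x2 = func(x1, 2.0, 3.0)

epsilon = 1e-5
b_fixed = 3.0

# a is unbounded; b is confined to a narrow window around b_fixed
popt, pcov = curve_fit(
    func, x1, x2,
    bounds=((-np.inf, b_fixed - epsilon), (np.inf, b_fixed + epsilon)),
)

print(popt)  # popt[0] is the fitted a, popt[1] the (nearly fixed) b
```

The returned popt still contains two entries, and b can sit anywhere inside the window, so the result is an approximation to a true fixed-parameter fit.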
1
votes

I effectively use Anton Beloglazov's solution, though I like to avoid using lambda functions for readability so I do the following:

def func(x,a,b):
   return a*x*x + b

def helper(x,a):
   # b is not an argument: it is looked up as a global when helper is called
   return func(x,a,b)

for b in range(10):
   popt,pcov = curve_fit(helper, x1, x2)

This ends up being reminiscent of Rick Berg's answer, but I like having one function dedicated to the "physics" of the problem and a helper function to get the code to work.
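One possible variation, sketched here under stated assumptions, keeps the named-helper style without relying on a global b: a small factory function returns a helper with b frozen. The names make_helper and fit_for_each_b, and the synthetic data, are hypothetical and introduced only for this illustration.

```python
import numpy as np
from scipy.optimize import curve_fit

def func(x, a, b):
    return a * x * x + b

def make_helper(b):
    """Return a one-parameter model with b frozen (hypothetical helper)."""
    def helper(x, a):
        return func(x, a, b)
    return helper

def fit_for_each_b(x, y, b_values):
    """Fit a once per fixed b and return a list of (b, fitted a) pairs."""
    results = []
    for b in b_values:
        popt, pcov = curve_fit(make_helper(b), x, y)
        results.append((b, popt[0]))
    return results

# Hypothetical data for illustration: generated with a = 2.0 and b = 3.0
x1 = np.linspace(0.0, 5.0, 50)
x2 = func(x1, 2.0, 3.0)
results = fit_for_each_b(x1, x2, range(10))
```

Since b is passed explicitly to make_helper, this version also works when the loop lives inside a function or in a different module from func.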

0
votes

There is a simpler option if you are willing/able to edit the original function.

Redefine your function as:

def func(x,a):
    return a*x*x + b

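Then you can simply call it in your loop over b; func reads the current global value of b each time it is evaluated:

for b in range(10):
   popt,pcov = curve_fit(func, x1, x2)

Caveat: b is looked up as a global in the module where func is defined, so the function needs to be defined in the same script as the loop for this to work.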

0
votes

Scipy's curve_fit takes three positional arguments, func, xdata and ydata. So an alternative approach (to using a function wrapper) is to treat 'b' as xdata (i.e. independent variable) by building a matrix that contains both your original xdata (x1) and a second column for your fixed parameter b.

Assuming x1 and x2 are arrays:

def func(xdata,a):
   x, b = xdata[:,0], xdata[:,1]  # Extract your x and b
   return a*x*x + b

for b in range(10):
   xdata = np.zeros((len(x1),2))  # initialize a matrix
   xdata[:,0] = x1  # your original x-data
   xdata[:,1] = b  # your fixed parameter
   popt,pcov = curve_fit(func,xdata,x2)  # x2 is your y-data
0
votes

Another way is to set the lower and upper bounds of the fixed parameter to its desired value minus and plus a small epsilon. Using the same example with initial values and bounds:

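from numpy import inf
from scipy.optimize import curve_fit

def func(x,a,b):
   return a*x*x + b

# free for a and b
popt,pcov = curve_fit(func, x1, x2, 
                      p0=[1,1], 
                      bounds=[(-inf,-inf),(inf,inf)])

# free for a; b held within eps of 1
eps=0.01  # written as a float so it is not truncated to 0 by integer division
popt,pcov = curve_fit(func, x1, x2, 
                      p0=[1,1], 
                      bounds=[(-inf,(1-eps)),(inf,(1+eps))])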

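Remember to include the epsilon: curve_fit requires each lower bound to be strictly less than the corresponding upper bound, so the two bounds for b cannot be identical.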