Pandas Sliding/Rolling Window over Irregular Time Series

Question

Please excuse poor style and inefficient solutions. All help is greatly appreciated.

Context:

Attempting to isolate the best rate of cycling performance gain over a 6-week block over the course of one year. Performance is measured as the maximum effort produced for any given time period for one cycling record, i.e. 1, 5, 20 min effort, etc...

Tasks:

Create rolling window
Best fit trend line for each window
Keep window corresponding to largest positive slope

Data:

ap1 = np.array([[datetime(2015, 10, 17, 12, 45, 13),
   datetime(2015, 10, 18, 11, 56, 35),
   datetime(2015, 10, 20, 9, 24, 52),
   datetime(2015, 10, 23, 9, 27, 12),
   datetime(2015, 10, 24, 12, 26, 33)], 
[281.0, 343.0, 270.0, 312.0, 320.0], 
[246.0, 305.0, 260.0, 283.0, 289.0], 
[236.0, 250.0, 239.0, 257.0, 245.0]], dtype=object)

Issue: I am currently stuck on Task 1. I have been attempting to follow user2689410's response to computing a rolling_mean over irregular time series. I am hoping to grab his data slicing method.

I only want to slice the dataset into rolling intervals of 45 days. Below is the progress:

from pandas import Series, DataFrame
import pandas as pd
from datetime import datetime, timedelta
import numpy as np

idx = ap1[0]
idx = pd.Index(idx)

ap1=np.transpose(ap1)
ap1=pd.DataFrame(ap1, index = idx, columns = ['date', 'cp1', 'cp2', 'cp3'])
ap2=ap1.drop('date', 1)

ap2 = DataFrame(ap2.copy())
idx = Series(ap2.index.to_pydatetime(), index=ap2.index)

for colname, col in ap2.iteritems():
    dslice = col[idx-pd.tseries.frequencies.to_offset('42D').delta:idx]

The for loop gives me the error:

Traceback (most recent call last):
File "<stdin>", line 2, in <module>
File "/usr/local/lib64/python2.7/site-packages/pandas/core/series.py", line 642, in __getitem__
return self._get_with(key)
File "/usr/local/lib64/python2.7/site-packages/pandas/core/series.py", line 647, in _get_with
indexer = self.index._convert_slice_indexer(key, kind='getitem')
File "/usr/local/lib64/python2.7/site-packages/pandas/indexes/base.py", line 1208, in _convert_slice_indexer
indexer = self.slice_indexer(start, stop, step, kind=kind)
File "/usr/local/lib64/python2.7/site-packages/pandas/tseries/index.py", line 1497, in slice_indexer
return Index.slice_indexer(self, start, end, step, kind=kind)
File "/usr/local/lib64/python2.7/site-packages/pandas/indexes/base.py", line 2962, in slice_indexer
kind=kind)
File "/usr/local/lib64/python2.7/site-packages/pandas/indexes/base.py", line 3141, in slice_locs
start_slice = self.get_slice_bound(start, 'left', kind)
File "/usr/local/lib64/python2.7/site-packages/pandas/indexes/base.py", line 3084, in get_slice_bound
slc = self.get_loc(label)
File "/usr/local/lib64/python2.7/site-packages/pandas/tseries/index.py", line 1419, in get_loc
stamp = Timestamp(key, tz=self.tz)
File "pandas/tslib.pyx", line 405, in pandas.tslib.Timestamp.__new__ (pandas/tslib.c:9932)
File "pandas/tslib.pyx", line 1475, in pandas.tslib.convert_to_tsobject (pandas/tslib.c:26432)
TypeError: Cannot convert input to Timestamp

Where do I go from here?

Robert Pollak Robert Pollak · Accepted Answer · 2019-12-16T12:27:50

2

votes

Nowadays, pandas.DataFrame.rolling can deal with irregular time series.

Pandas Sliding/Rolling Window over Irregular Time Series

2 Answers