I am working with a data set that has a timestamp, event duration, and mean value. I would like to resample the data into 15s and 60s intervals. The problem is the timestamps are unevenly spaced.
This is what I've got so far:
from datetime import datetime
import pandas as pd
df = pd.DataFrame([dict(length=pd.to_timedelta(30, unit='s'), value=10),
dict(length=pd.to_timedelta(90, unit='s'), value=30),
dict(length=pd.to_timedelta(180, unit='s'), value=60),
dict(length=pd.to_timedelta(30, unit='s'), value=10)],
index=[datetime(2000, 1, 1),
datetime(2000, 1, 1, 0, 0, 30),
datetime(2000, 1, 1, 0, 3, 0),
datetime(2000, 1, 1, 0, 6, 0)])
print(df.resample('30s').mean())
Sample output:
timestamp value
2000-01-01 00:00:00 10.0
2000-01-01 00:00:30 30.0
2000-01-01 00:01:00 NaN
...
Corrected My desiared output would be:
print(df.resample('15s').mean())
timestamp value
2000-01-01 00:00:00 5.0
2000-01-01 00:00:15 5.0
2000-01-01 00:00:30 5.0
2000-01-01 00:00:45 5.0
2000-01-01 00:01:00 5.0
...
print(df.resample('60s').mean())
timestamp value
2000-01-01 00:00:00 20.0
2000-01-01 00:01:00 20.0
2000-01-01 00:02:00 20.0
...
An idea I had would be to manually upsample the data creating a record in the series for each second but this seems extremely inefficient. Any tips would be appreciated.