Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Interpolation creates null values in resampled dataframe #320

Open
lambertsbennett opened this issue Apr 27, 2023 · 9 comments
Open

Interpolation creates null values in resampled dataframe #320

lambertsbennett opened this issue Apr 27, 2023 · 9 comments
Assignees
Labels
bug Something isn't working question Further information is requested

Comments

@lambertsbennett
Copy link

I am having an issue with resampling/interpolation that I think must be a simple misunderstanding. I have a signal at 10 Hz and I want to upsample to 100 Hz and fill values with a linear interpolation. My current code is as follows:

interp_df = start_tsdf.resample(freq=f"{1/100} sec", func='mean').interpolate(target_cols=['value'], method='linear', show_interpolated=True)

However this results in a value column that contains only nulls in between actual sensor readings. Is it not possible to upsample a signal and linearly interpolate to fill missing values?

@tnixon
Copy link
Contributor

tnixon commented Apr 28, 2023

You should be able to do this @lambertsbennett, if not this is definitely a bug. Could you provide some sample data we could test with?

@tnixon tnixon closed this as completed Apr 28, 2023
@tnixon tnixon reopened this Apr 28, 2023
@tnixon tnixon added bug Something isn't working question Further information is requested labels Apr 28, 2023
@lambertsbennett
Copy link
Author

@tnixon thanks for the response, I will take a small sample of the data when I'm next at work and provide it as soon as possible!

@lambertsbennett
Copy link
Author

lambertsbennett commented May 2, 2023

sample.csv
@tnixon - Hi here are some sample data that I can use to reproduce the error. In this case I am using the signal and the file_name columns as partition columns. What I am thinking could be the problem is that many of the values are identical and it could be that this is a strange case?

@tnixon tnixon self-assigned this May 2, 2023
@tnixon
Copy link
Contributor

tnixon commented May 2, 2023

Thanks @lambertsbennett - I will investigate ASAP!

@lambertsbennett
Copy link
Author

@tnixon I have also been looking more into this and things are even a bit stranger... with a different signal in the data there is a large time gap and this large time gap is interpolated properly, but the smaller gaps are filled with nulls.

@lambertsbennett
Copy link
Author

Hi @tnixon, I was just wondering if there are any updates on this issue?

Thanks!

@lambertsbennett
Copy link
Author

@tnixon we eventually completely switched up our interpolation process, but it would be nice in the future to use tempo. Did anything ever come out of this?

@tnixon
Copy link
Contributor

tnixon commented Nov 28, 2023

Hi @lambertsbennett, thanks for providing the sample data. I'm sorry we haven't been able to follow up on this yet, but it is still in the queue and valuable to us to figure out what is going on here. I am setting aside some time to look at it in the near term

@jplavins
Copy link

jplavins commented Aug 30, 2024

Hello! I have the same problem where I end up with null values as a result of resampling/interpolation.
I've attached the sample data. Some combination of parameters work fine, some return null values (func='ceil' and method='linear', freq = 60seconds).
data_sample.csv

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working question Further information is requested
Projects
None yet
Development

No branches or pull requests

3 participants