-
Notifications
You must be signed in to change notification settings - Fork 176
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Shape Issue When Masking A Single Timesteps Feature Values #38
Comments
After looking through the code more, I suspect the shape of my source data might need to be altered. |
Hi, the expected shape and format of the dataframe currently is I suspect that in your case, maybe you are always considering 1 "sample" = 1 time step (where sample is used in the signal processing sense, not the machine learning sense). In this case, feel free to use "pseudo-IDs", e.g., one per row, by using the simple (ordinal) row index. It really depends on what you are trying to do (i.e. what is your data and what are your labels). |
Thanks for the detailed response. This clarifies things, and is along the lines I was thinking what the intended use case of the ID's was. My original approach was to set ID's == index (i.e. the "pseudo-ID's" idea you mentioned) but hit the above shape issue in the original post and couldn't find an obvious way to get this line to return a 2 dimension numpy array. It returns ( My temporary solution was to do this right after the line linked to above:
Which I believe does something like this:
And from the sounds of the intention of the |
I'm attempting a classification on custom data. There are 8 features and 447 time steps or samples in the train/val set. I'm guessing the issue is with my dataset, so I provide some shape prints below.
The issue occurs in dataset.py, line ~263 (added a lot of comments so might be a bit off) which reads:
for m in range(X.shape[1]): # feature dimension
throwsIndexError: tuple index out of range
Printing out some variables to debug, just before the above problematic line, we can see X is a single dimension array with size = number of features:
Gives:
Further up, around line 35 or so is where the
noise_mask
is called. I've printed out some variables there to debug too:Gives:
Edit: The command I'm using to run is:
The text was updated successfully, but these errors were encountered: