Custom Dataset format #34
Comments
Hi, the format is still CSV, where the first line gives the labels of the variables. For multinomial data, the second line should contain the number of states of each variable. Have a look at this example for continuous data and this example for multinomial data. Just let me know if you run into trouble or have any questions.
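As a rough illustration of the layout described above, here is a minimal sketch that builds a multinomial CSV using only the Python standard library. The helper name `to_multinomial_csv`, the comma delimiter, and the rule "number of states = largest observed code + 1" are assumptions made for this example and are not taken from the project; if a state never appears in your data, set the counts by hand instead.

```python
import csv
import io


def to_multinomial_csv(labels, rows):
    """Return CSV text: labels, then states per variable, then the data rows.

    Assumption (hypothetical helper): states are coded 0..k-1 and every
    state is observed, so k is inferred as the largest code plus one.
    """
    n_states = [max(col) + 1 for col in zip(*rows)]
    buf = io.StringIO()
    writer = csv.writer(buf, lineterminator="\n")
    writer.writerow(labels)    # line 1: variable labels
    writer.writerow(n_states)  # line 2: number of states (multinomial data only)
    writer.writerows(rows)     # remaining lines: one observation per row
    return buf.getvalue()


csv_text = to_multinomial_csv(
    ["A", "B", "C"],
    [[0, 1, 2], [1, 0, 0], [1, 1, 1]],
)
print(csv_text)
```

For continuous data the same sketch would apply without the second `writerow` call, since only the label line is described for that case.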
Sure, thanks.
Is any pre-processing required for a new dataset?
Are there any constraints on the format?
Thanks
On Sep 4, 2022, at 23:34, Felix Rios wrote:
Hi,
It is CSV files still, where the first line tells the labels of the variables. For multinomial data, the second line should contain the number of states for the variables.
Have a look at this example for continuous data and this example for multinomial data.
Just let me know if you run into trouble or have any questions
For continuous data, it is recommended to standardize first. For multinomial data, some algorithms may require the k states of a variable to be enumerated from 0 to k-1. Also, have a look at the Benchpress paper for a further description of the data format. I hope this helps.
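The two pre-processing steps mentioned above could be sketched as follows, again with the standard library only. Both function names are hypothetical, the use of the sample standard deviation is an assumption (the thread does not say which convention is expected), and the sorted order used to assign codes is an arbitrary choice.

```python
import statistics


def standardize(column):
    """Rescale a continuous column to zero mean and unit sample standard deviation."""
    mean = statistics.fmean(column)
    sd = statistics.stdev(column)  # assumption: sample (n-1) standard deviation
    return [(x - mean) / sd for x in column]


def enumerate_states(column):
    """Map the distinct values of a discrete column to integer codes 0..k-1.

    Returns the recoded column and k, the number of distinct states.
    """
    codes = {value: i for i, value in enumerate(sorted(set(column)))}
    return [codes[value] for value in column], len(codes)


z = standardize([1.0, 2.0, 3.0])
coded, k = enumerate_states(["low", "high", "mid", "low"])
print(z, coded, k)
```

The value of `k` returned by `enumerate_states` is the number that would go on the second line of a multinomial CSV for that variable.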
Sure, thanks.
Hello,
if we want to use our own dataset (tabular CSV format), what should the format of the dataset be?
Thanks