Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Data Preprocess Source and Custom Dataset #205

Closed
cpzz50 opened this issue Jun 28, 2024 · 1 comment
Closed

Data Preprocess Source and Custom Dataset #205

cpzz50 opened this issue Jun 28, 2024 · 1 comment

Comments

@cpzz50
Copy link

cpzz50 commented Jun 28, 2024

This question is about BTC high frequency trading dataset provided.

From the csv file. Some columns like [bid1_price] has number [23090.7], but other columns like [high] have number [-0.8507737401414914]. Seems like some columns are normalized but others are not. Please advise how these data are preprocessed so I can put my own data into training.

Also about preprocessing, README indicate that the data is from Kaggle, which I found the original data doesn't contain indicators in the csv file. Wondering if it's possible disclose how those technical indicators are preprocessed.

Thank you

@qinmoelei
Copy link
Contributor

EarnHFT contains a more detailed description of the data preprocess, which members also develop from TradeMaster. For high-frequency trading for crypto, the two repos share the same setting, and EarnHFT is much easier to read.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants