Table of Contents
statespace
is an automated portfolio construction process using advanced optuna
fine-tuning techniques.
optuna
is an automatic hyperparameter optimization software framework, particularly designed for machine learning. It features an imperative, define-by-run style user API. Thanks to our define-by-run API, the code written with Optuna enjoys high modularity, and the user of Optuna can dynamically construct the search spaces for the hyperparameters.
- Eager search spaces: Automated search for optimal hyperparameters using Python conditionals, loops, and syntax
- State-of-the-art algorithms: Efficiently search large spaces and prune unpromising trials for faster results
- Easy parallelization: Parallelize hyperparameter searches over multiple threads or processes without modifying code
statespace
is designed to automatically optimize portfolio hyper-parameters base on a configuration file.
Optuna enables efficient hyperparameter optimization by adopting state-of-the-art algorithms for sampling hyperparameters and pruning efficiently unpromising trials.
-
Sampling Algorithms: Samplers basically continually narrow down the search space using the records of suggested parameter values and evaluated objective values, leading to an optimal search space which giving off parameters leading to better objective values.
optuna
provides the following sampling algorithms:- Grid Search
- Random Search
- Tree-structured Parzen Estimator algorithm
- CMA-ES based algorithm
- Gaussian process-based algorithm
- Algorithm to enable partial fixed parameters
- Nondominated Sorting Genetic Algorithm II
- A Quasi Monte Carlo sampling algorithm
-
Pruning Algorithms: Pruners automatically stop unpromising trials at the early stages of the training (a.k.a., automated early-stopping). Optuna provides the following pruning algorithms:
- Median pruning algorithm
- Non-pruning algorithm
- Algorithm to operate pruner with tolerance
- Algorithm to prune specified percentile of trials
- Asynchronous Successive Halving algorithm
- Hyperband algorithm
- Threshold pruning algorithm
- A pruning algorithm based on Wilcoxon signed-rank test
More information could be found in the Official Documentation.
optuna = "^3.6.1"
numpy = "^1.26.4"
pandas = "^2.2.2"
plotly = "^5.22.0"
$ pip install python-statespace
Follow these steps to get started with developing an investment strategy using the statespace
framework.
Make sure you have sklearn
installed to follow this example:
$ pip install scikit-learn
- Develop an Investment Strategy
- Create a Custom Objective Object
- Set Up a Configuration File with Hyperparameters
- Execute the Trials
- Analyze the Results
- Visualize
In this example, we create a simple sklearn
pipeline with a processor and a regressor.
>>> from statespace.base import BaseStudy, Listed
>>> from statespace.decorators import run_study
Note: sklearn
imports are not shown here. Please see our running examples in the examples folder for more details.
>>> def strategy(estimator: BaseEstimator, preprocessor: BaseEstimator) -> Pipeline:
... return make_pipeline(preprocessor, estimator)
We then create a custom objective MyCustomObjective
to minimize the mean squared error on the validation set:
>>> class MyCustomObjective(BaseStudy):
... @run_study
... def objective(self, trial: Trial) -> float:
... X_train, X_test, y_train, y_test = train_test_split(*self.data, test_size=0.33)
... pred = self.model.fit(y_train, X_train).predict(y_test)
... return mean_squared_error(y_test, pred)
>>> config = {
... 'preprocessor': Listed([MinMaxScaler(), RobustScaler(), StandardScaler()]),
... 'estimator': Listed([LinearRegression(), Ridge(), Lasso(), ElasticNet()]),
... }
>>> custom_model = MyCustomObjective(config, strategy, X, y)
>>> model = custom_model.execute(n_trials=20)
# A new study created in memory with name: statespace
Get best parameters
>>> print(model.best_trial.params)
# {'preprocessor': MinMaxScaler(), 'estimator': ElasticNet()}
Get best value
>>> print(model.best_value)
# 0.9758159549070534.
>>> from optuna import visualization
Contour Plot
>>> fig = visualization.plot_contour(model, params=["estimator", "preprocessor"])
>>> fig.show()
Hyperparameter Importance
>>> fig = visualization.plot_param_importances(model)
>>> fig.show()
Contributions are what make the open source community such an amazing place to learn, inspire, and create. Any contributions you make are greatly appreciated.
If you have a suggestion that would make this better, please fork the repo and create a pull request. You can also simply open an issue with the tag "enhancement". Don't forget to give the project a star! Thanks again!
- Fork the Project
- Create your Feature Branch (
git checkout -b feature/AmazingFeature
) - Commit your Changes (
git commit -m 'Add some AmazingFeature'
) - Push to the Branch (
git push origin feature/AmazingFeature
) - Open a Pull Request
Distributed under the BSD-3 License. See LICENSE.txt
for more information.
- Takuya Akiba, Shotaro Sano, Toshihiko Yanase, Takeru Ohta, and Masanori Koyama. 2019.
- Optuna: A Next-generation Hyperparameter Optimization Framework. In KDD.