2.4.0rc6: improvements, more examples, documentations and bug fixes
Pre-release
Pre-release
What's Changed
- Fix RestoreState callback by @YuanTingHsieh in #2115
- Update gnn example by @ZiyueXu77 in #2114
- Update swarm learning example by @SYangster in #2064
- Merge duplicate sag_pt_deploy_map/sag_pt_ddp job templates [skip ci] by @YuanTingHsieh in #2116
- Update the document on large model experiments. by @IsaacYangSLA in #2118
- Fixes readthedocs config and link in docs [skip ci] by @nvkevlu in #2119
- NeMo peft example by @holgerroth in #2094
- Changed default GPRC driver to sync version by @nvidianz in #2117
- Make client api pipe configurable by @YuanTingHsieh in #2103
- Enhance TF Client API examples by @YuanTingHsieh in #2056
- Fix docs build by @SYangster in #2123
- Print logs and write to logfile at the same time by @YuanTingHsieh in #2120
- Job CLI Upgrade to support path notation by @chesterxgchen in #2128
- Update configurations of LauncherExecutor by @YuanTingHsieh in #2126
- Remove receive_global_model by @YuanTingHsieh in #2129
- CLI Job upgrade, convert string to number if possible by @chesterxgchen in #2130
- Update gnn template generation part by @ZiyueXu77 in #2131
- Fix gnn job templates link [skip ci] by @YuanTingHsieh in #2135
- Updates for Client API examples, add cse and swarm by @SYangster in #2134
- Update NeMo PEFT example inference by @holgerroth in #2124
- Fix doc format and docstring format [skip ci] by @YuanTingHsieh in #2139
- Add discussion Q&A template [skip ci] by @YuanTingHsieh in #2136
- enhance the rc handling for MPM. by @yhwen in #1985
- Update client api doc and docstrings by @YuanTingHsieh in #2137
- Add NumPy + Client API example by @YuanTingHsieh in #2031
- Remove mlflow.utils.time_utils dependency. by @chesterxgchen in #2146
- Job Cli Upgrade: support custom configurations by @chesterxgchen in #2143
- Job CLI Upgrade: including configuration key removal by @chesterxgchen in #2147
- Add Step by step fed stats tabular example by @chesterxgchen in #2151
- Update q-a.yml question rendering by @YuanTingHsieh in #2154
- Support sys vars for job config and support parameterized template in job config by @yanchengnv in #2145
- Fix simulator by @YuanTingHsieh in #2156
- Add release notes by @SYangster in #2144
- Added serialized size in the message header by @nvidianz in #2150
- Add TaskExchanger and FlareAgent by @YuanTingHsieh in #2161
- Support graceful end_run processing by @yanchengnv in #2158
- Add Sklearn-linear step-to-step example by @chesterxgchen in #2163
- Fix getting_started doc by @SYangster in #2169
- Add variable doc build version for github links by @SYangster in #2162
- Add experimental decorator and test by @holgerroth in #2171
- Remove gen2 restriction from ad-hoc policy by @yanchengnv in #2170
- Fix Swarm learning README [skip ci] by @YuanTingHsieh in #2172
- Fix job cli issues by @YuanTingHsieh in #2141
- support getTask and submitResult timeout in job config by @yanchengnv in #2173
- Fixed the SystemVarName.SECURE_MODE error in simulator by @yhwen in #2174
- Experimental decorator for functions and classes by @holgerroth in #2185
- Add example for HuggingFace SFT/PEFT by @ZiyueXu77 in #2092
- Add step-by-step tutorial for tabular data on svm, kmeans, and xgboost by @ZiyueXu77 in #2168
- Refactor LauncherExecutor and client api by @YuanTingHsieh in #2164
- Update job templates and README by @YuanTingHsieh in #2187
- Setup poc github rendering fix by @chesterxgchen in #2188
- Add migration guide, add/update various sections by @SYangster in #2160
- Fix meta file processing in storage and improve schedule job retrieval by @yanchengnv in #2186
- update pub list for 2023 by @holgerroth in #2178
- Fixed the missing simulator END_RUN event by @yhwen in #2182
- Enable run cross-validation without training workflow and examples by @yhwen in #2035
- Add overview, comm config, 3rd party integration docs [skip ci] by @SYangster in #2194
- Add metrics exchange mechanism by @YuanTingHsieh in #2189
- change Fed Stats output format by @chesterxgchen in #2199
- Simple FedAvg workflow by @holgerroth in #2157
- Latest Werkzeug is not compatible with Flask by @IsaacYangSLA in #2201
- Update links, example section, various cleanup [skip ci] by @SYangster in #2207
- Added PT Tensor decomposer registration on server side by @nvidianz in #2192
- KeyCloak authentication integration example by @yhwen in #2205
- Disable link check for example links by @YuanTingHsieh in #2216
- POC bug fix by @chesterxgchen in #2212
- Lightning api: add strict option for load_state_dict by @holgerroth in #2211
- Check if fl model to be sent is empty by @YuanTingHsieh in #2217
- save_config() function: fix to_hocon() issue by @chesterxgchen in #2220
- Update xgboost example requirements [skip ci] by @SYangster in #2221
- Fixed a few issues with streaming error logging by @nvidianz in #2213
- fixed the custom_authentication example readme by @yhwen in #2218
- Add sag mlflow step by step example by @YuanTingHsieh in #2208
- MONAI integration and metrics refactoring by @SYangster in #2223
- Don't run init() twice by @holgerroth in #2224
- Fix LauncherExecutor racing issue by @YuanTingHsieh in #2209
- misc doc enhancements, update monai metrics example readme by @SYangster in #2226
- Fix stats global key bug by @SYangster in #2229
- examples reformatting by @SYangster in #2228
- Fix hello examples README file and prepare data section by @YuanTingHsieh in #2231
- Fix config read in lightning api by @YuanTingHsieh in #2232
- POC Tutorial add HE support documentation in the notebook by @chesterxgchen in #2230
- Add fl_ctx in convert method of ParamsConverter by @YuanTingHsieh in #2233
- Display messages to users when cloud scripts unable to find public ip… by @IsaacYangSLA in #2234
- Fix excepthook error by @yanchengnv in #2236
- Fix typo in model locator by @holgerroth in #2239
- Update monai nvflare setup.py file by @IsaacYangSLA in #2237
- az vm run-command invoke does not take multi-line scripts by @IsaacYangSLA in #2214
- Fixed a race condition issue during the server start by @yhwen in #2235
- handle launcher failure condition in launch monitor by @chesterxgchen in #2240
- Add he support to pt params converter by @SYangster in #2238
Full Changelog: 2.4.0rc5...2.4.0rc6