why is softmax applied twice when actions are transferred to portfolio weights? #196

tengyaolong2000 · 2024-01-24T10:34:18Z

Softmax is applied on action,

TradeMaster/trademaster/trainers/portfolio_management/trainer.py

Line 149 in bc5a30a

action = np.exp(action)/np.sum(np.exp(action))

then in,

TradeMaster/trademaster/environments/portfolio_management/environment.py

Line 125 in bc5a30a

weights = self.softmax(actions)

softmax is applied again to transfer action into portfolio weights. Is there a specific reason why this is done? Thanks for your time

qinmoelei · 2024-07-30T06:50:23Z

Technically, you only need to use Softmax once to get the portfolio weights.

However, during training, we found that the PnL fluctuations are too big, and the agent finds it very hard to converge. This is due to the high stochasticity in the market. Applying Softmax twice will somewhat make the weights more even, and therefore, the PnL will not fluctuate too much, making it easier for RL agents to converge.

In short, it is a compromise due to the previous methods' inability to handle a high stochastic environment. You can remove this if your algorithms can handle the fluctuations.

qinmoelei closed this as completed Jul 30, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

why is softmax applied twice when actions are transferred to portfolio weights? #196

why is softmax applied twice when actions are transferred to portfolio weights? #196

tengyaolong2000 commented Jan 24, 2024 •

edited

Loading

qinmoelei commented Jul 30, 2024

why is softmax applied twice when actions are transferred to portfolio weights? #196

why is softmax applied twice when actions are transferred to portfolio weights? #196

Comments

tengyaolong2000 commented Jan 24, 2024 • edited Loading

qinmoelei commented Jul 30, 2024

tengyaolong2000 commented Jan 24, 2024 •

edited

Loading