fix portfolio_learning_strategies by maurerle · Pull Request #794 · assume-framework/assume

maurerle · 2026-04-15T16:10:35Z

Related Issue

Closes #784

Description

Improvements:

Add tests for the portfolio learning strategy and the min_max_scale function (including the rescaling use case formerly expected from min_max_rescale) to ensure their functionality and stability.

Bug Fixes:

Fix errors in portfolio learning strategies: The min_max_rescale function was missing from utils.py, causing an ImportError in portfolio_learning_strategies.py. Resolved by extending min_max_scale to cover the rescaling use case. Also fixes a minor bug in the construction of the observation space.
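To illustrate how one general helper can cover both scaling and rescaling, here is a minimal sketch. The name `min_max_scale_sketch`, its parameter names, and its defaults are assumptions made for this example; it is not the code in utils.py.

```python
import numpy as np


def min_max_scale_sketch(x, min_val, max_val, new_min=0.0, new_max=1.0):
    """Linearly map x from [min_val, max_val] to [new_min, new_max].

    Hypothetical helper for illustration; not the ASSUME implementation.
    """
    x = np.asarray(x, dtype=float)
    span = max_val - min_val
    if span == 0:
        raise ValueError("min_val and max_val must differ")
    return new_min + (x - min_val) * (new_max - new_min) / span


# Scaling: values in [0, 100] are mapped to [0, 1]
scaled = min_max_scale_sketch([0.0, 25.0, 100.0], 0.0, 100.0)

# Rescaling: the same function with the bounds switched, [0, 1] back to [0, 100]
restored = min_max_scale_sketch(scaled, 0.0, 1.0, 0.0, 100.0)
```

Switching the source and target bounds inverts the mapping, which is why a separate rescale function is not strictly needed under this assumption.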

Checklist

Documentation updated (docstrings, READMEs, user guides, inline comments, doc folder updates etc.)
New unit/integration tests added (if applicable)
Changes noted in release notes (if any)
Consent to release this PR's code under the GNU Affero General Public License v3.0

codecov · 2026-04-15T16:14:31Z

Codecov Report

❌ Patch coverage is 82.50000% with 7 lines in your changes missing coverage. Please review.
✅ Project coverage is 82.01%. Comparing base (4998487) to head (4410a34).

| Files with missing lines | Patch % | Lines |
| --- | --- | --- |
| assume/reinforcement_learning/learning_utils.py | 60.00% | 4 Missing ⚠️ |
| assume/common/fast_pandas.py | 62.50% | 3 Missing ⚠️ |

Additional details and impacted files

@@            Coverage Diff             @@
##             main     #794      +/-   ##
==========================================
+ Coverage   80.51%   82.01%   +1.49%     
==========================================
  Files          56       56              
  Lines        9013     9033      +20     
==========================================
+ Hits         7257     7408     +151     
+ Misses       1756     1625     -131

| Flag | Coverage Δ |
| --- | --- |
| pytest | `82.01% <82.50%> (+1.49%)` ⬆️ |

chiara-fb

Consider changing line 453 of EnergyLearningStrategy from:
bid_prices = actions * self.max_bid_price
to:
bid_prices = min_max_scale(actions, -1, 1, self.min_bid_price, self.max_bid_price)

to allow for more flexibility.
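A small sketch of the difference between the two mappings, using hypothetical bounds. `np.interp` stands in for `min_max_scale` here, which is an assumption about its behaviour; note that `np.interp` also clamps inputs outside [-1, 1].

```python
import numpy as np

# Hypothetical bounds for illustration only
min_bid_price = -20.0
max_bid_price = 100.0

actions = np.array([-1.0, 0.0, 0.5, 1.0])

# Current behaviour: symmetric scaling, bids span [-max_bid_price, max_bid_price]
bids_symmetric = actions * max_bid_price

# Suggested behaviour, approximated with np.interp:
# [-1, 1] is mapped linearly onto [min_bid_price, max_bid_price]
bids_affine = np.interp(actions, [-1.0, 1.0], [min_bid_price, max_bid_price])
```

With the symmetric version an action of 0 always yields a bid of 0, while the affine version maps it to the midpoint of the configured range.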

kim-mskw

This will break the initial exploration mode.

kim-mskw · 2026-04-30T13:22:11Z

        # actions are in the range [-1,1], we need to transform them into actual bids
        # we can use our domain knowledge to guide the bid formulation
-        bid_prices = actions * self.max_bid_price
+        bid_prices = min_max_scale(


If we want to change this, we need to adjust the initial exploration as well, because it currently relies on the scaling being symmetric around zero, see:

marginal_cost = next_observation[-1].detach()  # ensure no gradients flow through
# Add marginal cost to the action directly for initial random exploration
curr_action += marginal_cost

and because both are scaled only with max_bid_price beforehand.
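The concern can be sketched with plain numbers. The values below are hypothetical, and the example assumes that the marginal cost in the observation is scaled by max_bid_price, as described above.

```python
# Hypothetical values for illustration only
max_bid_price = 100.0
min_bid_price = 0.0
marginal_cost = 25.0

# Observation holds the marginal cost scaled by max_bid_price (assumption)
mc_scaled = marginal_cost / max_bid_price

noise = 0.0  # exploration noise, set to zero to show the centre of exploration
curr_action = noise + mc_scaled

# Symmetric scaling: the bid is centred exactly on the marginal cost
bid_symmetric = curr_action * max_bid_price

# Affine scaling to [min_bid_price, max_bid_price]: the centre shifts
bid_affine = min_bid_price + (curr_action + 1.0) / 2.0 * (max_bid_price - min_bid_price)
```

Here the symmetric bid equals the marginal cost of 25.0, whereas the affine mapping yields 62.5, so exploration would no longer be centred on the marginal cost unless it is adjusted too.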

Correct. This is not the case in the portfolio strategy, which uses the current load status as a proxy for the level of price markup that the operator can apply, see:

curr_action = 2 * next_observation[2] - 1 + noise

Possibly the initial experience could be shifted to be centered around (max_bid_price - min_bid_price) / 2 + noise? That would make it independent from marginal costs though ...

Yeah, you're completely right @kim-mskw, I missed that.

To achieve sensible exploration around the marginal costs, we could change the scaled marginal cost (mc) in the observation space to the [min_bid_price, max_bid_price] range?

Changing the action space scaling for the other RL strategies needs further thought. My proposal: revert it for the normal RL strategies and push the rest regarding the portfolio learning strategy. OK, @chiara-fb, @kim-mskw?

maurerle · 2026-05-13T08:28:47Z

We will merge the initial PR to fix the missing function and later work on the improvements @mthede

Removed changes under discussion and only kept the bugfix.

maurerle requested a review from mthede April 15, 2026 16:20

mthede requested a review from chiara-fb April 17, 2026 13:54

chiara-fb reviewed Apr 24, 2026

mthede requested a review from kim-mskw April 29, 2026 15:06

mthede changed the title ~~utils: add min_max_rescale~~ fix portfolio_learning_strategies and reuse improvements for other learning strategies Apr 29, 2026

mthede requested a review from chiara-fb April 29, 2026 15:56

kim-mskw previously requested changes Apr 30, 2026

maurerle and others added 8 commits May 13, 2026 11:44

utils: add min_max_rescale

ad49747

this function was omitted in the addition of portfolio strategies

refactor min_max_scale function to be a general wrapper -> works like…

186306c

… min_max_rescale by switching bounds

replace rescale function with general min_max_scale()

1524040

fix: 0-dim array cannot be concatenated

c38ee4d

add tests for portfolio learning strategy + remove pandas date_range(…

5f56fd7

…) deprecation warning

add release notes

8798c73

fix numpy and tensor compatibility of scaling function

54ac47b

split date related features into separate functions

e5c11eb

maurerle force-pushed the minmaxrescale branch 2 times, most recently from 7831c29 to f87c81c May 13, 2026 09:56

mthede changed the title ~~fix portfolio_learning_strategies and reuse improvements for other learning strategies~~ fix portfolio_learning_strategies May 13, 2026

improve min_max_scale default

80ee919

maurerle force-pushed the minmaxrescale branch from f87c81c to 80ee919 May 13, 2026 11:29

maurerle added 2 commits May 13, 2026 13:41

use is_prepared instead of getattr

425b7b7

fix copy by introducing ones_like to fast_pandas

7068782

mthede approved these changes May 15, 2026

minor doc adjustments

4410a34

mthede force-pushed the minmaxrescale branch from 74587e6 to 4410a34 May 15, 2026 12:24

maurerle wants to merge 12 commits into main from minmaxrescale

4 participants
