Custom curves upload, validation and to/from df #68
Conversation
Pull Request Overview
This PR introduces custom curve upload functionality to pyetm, allowing users to upload custom curves from pandas DataFrames to the API. The implementation includes validation for curve data, serialization/deserialization to/from DataFrames, and a new upload runner service.
- Adds custom curve upload via the update_custom_curves() method on scenarios (see the usage sketch below)
- Implements validation for curve length (8760 values) and numeric data types
- Provides DataFrame serialization/deserialization for custom curves
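To make the entry point concrete, here is a minimal usage sketch. Only the `update_custom_curves()` method name and the 8760-value, one-column-per-curve DataFrame shape come from this PR; the `Scenario` import path, the constructor call, and the curve key are illustrative assumptions, and whether the method accepts the DataFrame directly or a model built from it is not shown in the diff hunks below.

```python
import pandas as pd
from pyetm.models.scenario import Scenario  # module path taken from the file table below

# Hypothetical input: one column per custom curve, 8760 hourly values each
df = pd.DataFrame({
    "my_custom_price_curve": pd.read_csv("price_curve.csv", header=None).iloc[:, 0],
})

scenario = Scenario(scenario_id=12345)  # illustrative construction, not defined by this PR
scenario.update_custom_curves(df)       # validates length/dtype, then uploads via the runner
```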
Reviewed Changes
Copilot reviewed 12 out of 15 changed files in this pull request and generated 6 comments.
File | Description |
---|---|
src/pyetm/services/scenario_runners/update_custom_curves.py | New runner service for uploading custom curves to the API |
src/pyetm/models/custom_curves.py | Adds DataFrame serialization, validation, and upload functionality |
src/pyetm/models/scenario.py | Adds update_custom_curves() method for uploading curves |
src/pyetm/models/base.py | Implements from_dataframe() class method infrastructure |
src/pyetm/models/warnings.py | Removes deprecated to_legacy_dict() method |
tests/ | Comprehensive test coverage for new functionality |
examples/excel_to_scenario.ipynb | Example notebook demonstrating curve upload workflow |
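Based on the base.py and custom_curves.py entries above, the DataFrame round-trip presumably looks roughly like the sketch below. Only `from_dataframe()` is named in this PR; the `CustomCurves` class name and a `to_dataframe()` counterpart are assumptions made for illustration.

```python
import pandas as pd
from pyetm.models.custom_curves import CustomCurves  # class name assumed

# One column per curve, 8760 rows, no header row
df = pd.read_csv("curves.csv", header=None)
df.columns = ["my_custom_curve"]

curves = CustomCurves.from_dataframe(df)  # from_dataframe() infrastructure added in base.py
round_trip = curves.to_dataframe()        # serialization back to a DataFrame (name assumed)
assert len(round_trip) == 8760
```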
```python
try:
    if curve.file_path and curve.file_path.exists():
        # Use file
        with open(curve.file_path, "rb") as f:
```
Copilot AI · Aug 7, 2025
Opening the file in binary mode ("rb") but then iterating over curve data as text in the else branch (line 45) creates an inconsistency. The file should be opened in text mode ("r") for CSV files, or the string conversion logic needs to handle bytes properly.
Suggested change:
```diff
-        with open(curve.file_path, "rb") as f:
+        with open(curve.file_path, "r") as f:
```
```python
    pd.to_numeric(raw_data.iloc[:, 0], errors='raise')
except (ValueError, TypeError):
    curve_warnings.add(
        curve.key, "Curve contains non-numeric values"
```
Copilot AI · Aug 7, 2025
The error handling for non-numeric values uses a generic ValueError/TypeError catch, but the user-facing error message 'Curve contains non-numeric values' doesn't provide information about which specific values failed or where they are located in the data.
Suggested change:
```diff
-        curve.key, "Curve contains non-numeric values"
+    # Find non-numeric values and their indices
+    col = raw_data.iloc[:, 0]
+    numeric_col = pd.to_numeric(col, errors='coerce')
+    non_numeric_mask = numeric_col.isna() & ~col.isna()
+    non_numeric_indices = col[non_numeric_mask].index.tolist()
+    non_numeric_values = col[non_numeric_mask].tolist()
+    # Show up to 5 offending values for brevity
+    details = ", ".join(
+        f"row {i+1}: '{v}'" for i, v in zip(non_numeric_indices[:5], non_numeric_values[:5])
+    )
+    if len(non_numeric_indices) > 5:
+        details += f", ... ({len(non_numeric_indices)} total)"
+    curve_warnings.add(
+        curve.key,
+        f"Curve contains non-numeric values at {details}"
```
```python
    raise ScenarioError(f"Could not update custom curves: {error_summary}")

# Upload curves
result = UpdateCustomCurvesRunner.run(BaseClient(), self, custom_curves)
```
Copilot AI · Aug 7, 2025
Creating a new BaseClient() instance instead of using the existing client could lead to authentication or configuration issues. The client should be passed as a parameter or obtained from the scenario context.
Suggested change:
```diff
-result = UpdateCustomCurvesRunner.run(BaseClient(), self, custom_curves)
+result = UpdateCustomCurvesRunner.run(self.client, self, custom_curves)
```
```python
def temp_curve_files():
    """Fixture that creates temporary curve files for testing"""
    temp_dir = Path("/tmp/test_update_curves")
    temp_dir.mkdir(exist_ok=True)
```
Copilot AI · Aug 7, 2025
Using a hardcoded `/tmp` path is not portable across operating systems (Windows doesn't have `/tmp`). Use `tempfile.mkdtemp()` or pytest's `tmp_path` fixture for cross-platform temporary directories.
Suggested change:
```diff
-    temp_dir.mkdir(exist_ok=True)
+def temp_curve_files(tmp_path):
+    """Fixture that creates temporary curve files for testing"""
+    temp_dir = tmp_path
```
```python
    test_data.to_csv(temp_file, index=False, header=False)
    original.file_path = temp_file

    try:
```
Copilot AI · Aug 7, 2025
Using a hardcoded `/tmp` path is not portable across operating systems. Use `tempfile.mkdtemp()` or pytest's `tmp_path` fixture for better cross-platform compatibility and automatic cleanup.
Suggested change:
```diff
-    try:
+    with tempfile.TemporaryDirectory() as temp_dir:
+        temp_dir_path = Path(temp_dir)
+        temp_file = temp_dir_path / "test_curve.csv"
+        test_data.to_csv(temp_file, index=False, header=False)
+        original.file_path = temp_file
```
```python
if len(raw_data) != 8760:
    curve_warnings.add(
        curve.key,
        f"Curve must contain exactly 8,760 values, found {len(raw_data)}",
```
Copilot AI · Aug 7, 2025
The magic number 8760 (hours in a year) is hardcoded in multiple places. Consider defining it as a named constant like `HOURS_PER_YEAR = 8760` to improve maintainability.
f"Curve must contain exactly 8,760 values, found {len(raw_data)}", | |
if len(raw_data) != HOURS_PER_YEAR: | |
curve_warnings.add( | |
curve.key, | |
f"Curve must contain exactly {HOURS_PER_YEAR:,} values, found {len(raw_data)}", |
* Custom curves to and from dataframe
* Upload custom curves runner, scenario methods and validation on the model
* Custom curves: tests for runner, scenario and validation
Reviewed by myself and with Copilot. These changes allow you to upload custom curves from a pandas DataFrame to the API. For now the example reads in CSV files manually and converts them to a pandas DataFrame, in advance of the reverse packer functionality which will handle unpacking Excel/CSV.
Uploaded curves are given the type 'custom' inside pyetm. The actual type (capacity profile, etc.) will be inferred by the API on upload, so 'custom' really just signifies the source of the curve within pyetm.
Validation checks the length and the value type (float); it also tries to coerce values into floats if they are not already.
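For reference, here is a standalone sketch of that validation, reconstructed from the review snippets above (the 8,760-value length check and the `pd.to_numeric` coercion). The function name and return shape are illustrative; the PR itself records problems through a `curve_warnings` collection rather than returning a list.

```python
import pandas as pd

HOURS_PER_YEAR = 8760  # named constant, as suggested in the review

def validate_curve(series: pd.Series) -> list[str]:
    """Return human-readable problems for a single curve column (illustrative helper)."""
    problems = []
    if len(series) != HOURS_PER_YEAR:
        problems.append(
            f"Curve must contain exactly {HOURS_PER_YEAR:,} values, found {len(series)}"
        )
    # Coerce to float; values that cannot be coerced become NaN and are reported
    coerced = pd.to_numeric(series, errors="coerce")
    bad = series[coerced.isna() & series.notna()]
    if not bad.empty:
        problems.append(f"Curve contains {len(bad)} non-numeric values (e.g. {bad.iloc[0]!r})")
    return problems
```

For a clean 8,760-row float column, `validate_curve(df["my_curve"])` returns an empty list.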