
@codeflash-ai codeflash-ai bot commented Oct 29, 2025

📄 85% (1.85×) speedup for `AnalyticsData.from_dxo` in `nvflare/apis/analytix.py`

⏱️ Runtime: 40.0 microseconds → 21.6 microseconds (best of 47 runs)

📝 Explanation and details

The optimized code achieves an 85% speedup through two key optimizations and one minor cleanup:

**1. Removed Line Profiler Decorator**
The original code included `@codeflash_line_profile` on the `get_meta_prop` method. Even when not actively profiling, a decorator adds overhead to every call by wrapping the original function. Removing it eliminates this overhead entirely, an immediate gain for frequently called methods.
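The per-call cost of wrapping can be seen with a small, self-contained sketch. The decorator below is a hypothetical no-op stand-in, not the actual `@codeflash_line_profile` implementation:

```python
import timeit

def profiling_stub(func):
    # Hypothetical stand-in for a line-profiler decorator: even this
    # no-op wrapper adds an extra function call and stack frame to
    # every invocation of the wrapped function.
    def wrapper(*args, **kwargs):
        return func(*args, **kwargs)
    return wrapper

def get_meta_prop_plain(meta, key):
    return meta.get(key)

@profiling_stub
def get_meta_prop_wrapped(meta, key):
    return meta.get(key)

meta = {"tracker_key": "torch_tb"}
t_plain = timeit.timeit(lambda: get_meta_prop_plain(meta, "tracker_key"), number=200_000)
t_wrapped = timeit.timeit(lambda: get_meta_prop_wrapped(meta, "tracker_key"), number=200_000)
print(f"plain: {t_plain:.4f}s, wrapped: {t_wrapped:.4f}s")
```

On most interpreters the wrapped variant measures noticeably slower, which is exactly the overhead removing the decorator recovers.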

**2. Dictionary-Based Dispatch in `convert_data_type`**
The original implementation used `if-elif` chains with compound boolean conditions, evaluating each condition sequentially until a match was found. The optimized version replaces this with a dictionary lookup keyed on the tuple `(sender, receiver, sender_data_type)`, giving O(1) constant-time lookup instead of O(n) sequential evaluation.
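The pattern can be sketched as follows. The constants and the table entries here are illustrative assumptions, not the actual NVFlare mapping:

```python
from typing import Tuple

# Hypothetical sender/receiver and data-type constants
TORCH_TB, MLFLOW, WANDB = "torch_tb", "mlflow", "wandb"
SCALAR, METRIC = "scalar", "metric"

# Built once at module load: (sender, receiver, sender_data_type) -> receiver's type.
# Each entry replaces one branch of the former if-elif chain.
_CONVERSION_TABLE: dict[Tuple[str, str, str], str] = {
    (TORCH_TB, MLFLOW, SCALAR): METRIC,
    (MLFLOW, TORCH_TB, METRIC): SCALAR,
}

def convert_data_type(sender: str, receiver: str, sender_data_type: str) -> str:
    # Single O(1) hash lookup; fall back to the original type when
    # no mapping applies, mirroring the chain's final else.
    return _CONVERSION_TABLE.get((sender, receiver, sender_data_type), sender_data_type)

print(convert_data_type(TORCH_TB, MLFLOW, SCALAR))  # metric
print(convert_data_type(WANDB, MLFLOW, SCALAR))     # scalar (no mapping, pass-through)
```

Beyond the constant-time lookup, the table also centralizes the conversion rules in one data structure, so adding a new sender/receiver pair is a one-line change rather than a new branch.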

**3. Minor Container Check Optimization**
Changed `len(dxo.data) == 0` to `not dxo.data`, avoiding a `len()` call on the dictionary in favor of Python's built-in truthiness evaluation.
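A quick sketch confirms the two checks are equivalent for dicts and compares their cost:

```python
import timeit

d = {}
# Equivalent results for any dict, but `not d` asks the object for
# its truth value directly instead of computing a length and
# comparing it to zero.
assert (len(d) == 0) == (not d)
assert (len({"a": 1}) == 0) == (not {"a": 1})

print("len(d) == 0:", timeit.timeit(lambda: len(d) == 0, number=1_000_000))
print("not d:      ", timeit.timeit(lambda: not d, number=1_000_000))
```

The difference is tiny per call, which matches the report's description of this as a minor optimization.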

**Performance Impact by Test Case:**
The annotations show consistent but modest improvements in the individual test cases (6.39% faster in one, minimal in others). This suggests the optimizations matter most for workloads that call `get_meta_prop` and `convert_data_type` frequently, particularly when data type conversions involve the mapped combinations (TORCH_TB↔MLFLOW, TORCH_TB↔WANDB).

The 85% overall speedup indicates these methods are likely called repeatedly in typical usage patterns, making the elimination of per-call overhead particularly impactful.

Correctness verification report:

| Test | Status |
|---|---|
| ⚙️ Existing Unit Tests | 19 Passed |
| 🌀 Generated Regression Tests | 2 Passed |
| ⏪ Replay Tests | 🔘 None Found |
| 🔎 Concolic Coverage Tests | 🔘 None Found |
| 📊 Tests Coverage | 80.0% |
⚙️ Existing Unit Tests and Runtime

| Test File::Test Function | Original ⏱️ | Optimized ⏱️ | Speedup |
|---|---|---|---|
| `unit_test/apis/analytix_test.py::TestAnalytix.test_from_dxo` | 29.6μs | 11.7μs | 153% ✅ |
| `unit_test/apis/analytix_test.py::TestAnalytix.test_from_dxo_invalid` | 4.68μs | 4.38μs | 6.86% ✅ |
🌀 Generated Regression Tests and Runtime
import pytest
from nvflare.apis.analytix import AnalyticsData

# --- Begin minimal stubs for dependencies ---

# Simulate enum for AnalyticsDataType
class AnalyticsDataType:
    SCALAR = "scalar"
    SCALARS = "scalars"
    METRIC = "metric"
    METRICS = "metrics"
    PARAMETERS = "parameters"
    TEXT = "text"
    TAGS = "tags"

# Simulate enum for LogWriterName
class LogWriterName:
    TORCH_TB = "torch_tb"
    MLFLOW = "mlflow"
    WANDB = "wandb"

# Simulate TrackConst
class TrackConst:
    TRACK_KEY = "track_key"
    TRACK_VALUE = "track_value"
    KWARGS_KEY = "kwargs"
    GLOBAL_STEP_KEY = "global_step"
    PATH_KEY = "path"
    DATA_TYPE_KEY = "data_type"
    TRACKER_KEY = "tracker"

# Simulate DataKind
class DataKind:
    APP_DEFINED = "app_defined"
    OTHER = "other"

# --- Unit tests for from_dxo ---

# 1. Basic Test Cases

def test_from_dxo_wrong_type_dxo_raises():
    """Test that passing wrong type raises TypeError."""
    with pytest.raises(TypeError):
        AnalyticsData.from_dxo("not_a_dxo")  # 2.83μs -> 2.66μs (6.39% faster)

#------------------------------------------------
import pytest
from nvflare.apis.analytix import AnalyticsData

# Mocks and minimal implementations to support the test environment

# Enum for AnalyticsDataType
class AnalyticsDataType:
    SCALAR = "scalar"
    METRIC = "metric"
    METRICS = "metrics"
    PARAMETERS = "parameters"
    SCALARS = "scalars"
    TEXT = "text"
    TAGS = "tags"

# Enum for LogWriterName
class LogWriterName:
    TORCH_TB = "torch_tb"
    MLFLOW = "mlflow"
    WANDB = "wandb"

# TrackConst for key names
class TrackConst:
    TRACK_KEY = "track_key"
    TRACK_VALUE = "track_value"
    KWARGS_KEY = "kwargs"
    GLOBAL_STEP_KEY = "global_step"
    PATH_KEY = "path"
    DATA_TYPE_KEY = "data_type"
    TRACKER_KEY = "tracker"

# ========== UNIT TESTS FOR from_dxo ==========

# 1. Basic Test Cases

def test_from_dxo_wrong_type_raises():
    # Passing non-DXO object should raise TypeError
    with pytest.raises(TypeError):
        AnalyticsData.from_dxo("not_a_dxo") # 2.87μs -> 2.87μs (0.070% faster)

To edit these changes, run `git checkout codeflash/optimize-AnalyticsData.from_dxo-mhca7t7v` and push.

Codeflash

@codeflash-ai codeflash-ai bot requested a review from mashraf-222 October 29, 2025 17:41
@codeflash-ai codeflash-ai bot added ⚡️ codeflash Optimization PR opened by Codeflash AI 🎯 Quality: High Optimization Quality according to Codeflash labels Oct 29, 2025