
Conversation

Contributor

Copilot AI commented Dec 27, 2025

  • Understand the issue and codebase structure
  • Add a new SumAbsoluteErrorBelow function in neuralfit.pas that counts a "hit" when the sum of absolute output errors is below 0.001 (a sketch follows this list)
  • Add the function to the interface section so it can be used as InferHitFn
  • Add a test for the new function in TestNeuralFit.pas
  • Test the function with a hypotenuse-like example to verify it works
  • Run code review
  • Run CodeQL security check
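
For context, a minimal sketch of what SumAbsoluteErrorBelow could look like is shown below. It assumes the InferHitFn callback signature function(A, B: TNNetVolume; ThreadId: integer): boolean used by TNeuralFit in neuralfit.pas; the 0.001 threshold comes from the task list above.

// Sketch only: assumes InferHitFn callbacks receive the network output in A
// and the expected output in B, plus the worker thread id.
function SumAbsoluteErrorBelow(A, B: TNNetVolume; ThreadId: integer): boolean;
var
  i: integer;
  ErrorSum: TNeuralFloat;
begin
  ErrorSum := 0;
  // Accumulate the absolute difference over every output element.
  for i := 0 to A.Size - 1 do
    ErrorSum := ErrorSum + Abs(A.FData[i] - B.FData[i]);
  // Count a "hit" only when the total absolute error stays below 0.001.
  Result := ErrorSum < 0.001;
end;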
Original prompt

This section details the original issue you should resolve.

<issue_title>TNeuralFit completes training successfully but reports TrainingAccuracy, ValidationAccuracy, and TestAccuracy as 0.00% even when the network demonstrably learns</issue_title>
<issue_description># TNeuralFit Accuracy Metrics Report Zero Percent on Training/Validation/Test Sets

Description

TNeuralFit completes training successfully but reports TrainingAccuracy, ValidationAccuracy, and TestAccuracy as 0.00% even when the network demonstrably learns (outputs vary appropriately, loss decreases, predictions are non-trivial).

This makes TNeuralFit's accuracy metrics unreliable for monitoring training progress and model evaluation.

Steps to Reproduce

Test Case: Hypotenuse Function (Regression)

Based on the official examples/Hypotenuse example pattern:

program TestNeuralFitMetrics;
{$mode objfpc}{$H+}

uses
  Classes, SysUtils, Math,
  neuralnetwork, neuralvolume, neuraldatasets, neuralfit;

const
  TRAINING_SAMPLES = 1000;
  VALIDATION_SAMPLES = 100;
  TEST_SAMPLES = 100;

var
  NN: TNNet;
  Trainer: TNeuralFit;
  TrainingPairs, ValidationPairs, TestPairs: TNNetVolumePairList;
  i: integer;
  vOutput: TNNetVolume;

function CreateSimplePairs(Count: integer): TNNetVolumePairList;
var
  i: integer;
  vIn, vOut: TNNetVolume;
  X, Y, Z: single;
begin
  Result := TNNetVolumePairList.Create;
  for i := 0 to Count - 1 do
  begin
    X := Random * 100;
    Y := Random * 100;
    Z := Sqrt(X*X + Y*Y);
    
    vIn := TNNetVolume.Create(2, 1, 1);
    vIn.FData[0] := X / 100;
    vIn.FData[1] := Y / 100;
    
    vOut := TNNetVolume.Create(1, 1, 1);
    vOut.FData[0] := Z / 141.42;
    
    Result.Add(TNNetVolumePair.Create(vIn, vOut));
  end;
end;

begin
  Randomize; // seed Random so each run draws fresh samples
  TrainingPairs := CreateSimplePairs(TRAINING_SAMPLES);
  ValidationPairs := CreateSimplePairs(VALIDATION_SAMPLES);
  TestPairs := CreateSimplePairs(TEST_SAMPLES);

  NN := TNNet.Create;
  NN.AddLayer(TNNetInput.Create(2, 1, 1));
  NN.AddLayer(TNNetFullConnectReLU.Create(32));
  NN.AddLayer(TNNetFullConnectReLU.Create(32));
  NN.AddLayer(TNNetFullConnect.Create(1));

  Trainer := TNeuralFit.Create;
  Trainer.InitialLearningRate := 0.00001;
  Trainer.LearningRateDecay := 0;
  Trainer.L2Decay := 0;
  Trainer.Verbose := True;

  WriteLn('Training...');
  Trainer.Fit(NN, TrainingPairs, ValidationPairs, TestPairs, 32, 50);

  // Note: FormatFloat's '%' specifier already multiplies the value by 100,
  // so the accuracies (stored as fractions) are passed through unscaled.
  WriteLn('Final Training Accuracy: ', FormatFloat('0.00%', Trainer.TrainingAccuracy));
  WriteLn('Final Validation Accuracy: ', FormatFloat('0.00%', Trainer.ValidationAccuracy));
  WriteLn('Final Test Accuracy: ', FormatFloat('0.00%', Trainer.TestAccuracy));
  WriteLn('');
  WriteLn('Sample Output vs Expected:');
  vOutput := TNNetVolume.Create(1, 1, 1);
  for i := 0 to 4 do
  begin
    NN.Compute(TestPairs[i].A);
    NN.GetOutput(vOutput);
    WriteLn(Format('  Output: %.4f, Expected: %.4f',
      [vOutput.FData[0], TestPairs[i].B.FData[0]]));
  end;

  vOutput.Free;
  TrainingPairs.Free;
  ValidationPairs.Free;
  TestPairs.Free;
  Trainer.Free;
  NN.Free;
end.

Expected Behavior

After 50 epochs of training on the hypotenuse function:

  • TrainingAccuracy should be >50% (network learned mapping)
  • ValidationAccuracy should be >40% (reasonable generalization)
  • TestAccuracy should be >40% (unseen data performance)

Actual Behavior

Final Training Accuracy: 0.00%
Final Validation Accuracy: 0.00%
Final Test Accuracy: 0.00%

Sample Output vs Expected:
  Output: 0.5585, Expected: 0.8089   (network outputs vary, learning occurred)
  Output: 0.6195, Expected: 0.9582
  Output: 0.3811, Expected: 0.4340
  Output: 0.5848, Expected: 0.8370

Analysis

  • ✅ Training completes successfully (no crashes)
  • ✅ Network demonstrably learns (outputs vary, are non-trivial, not random)
  • ❌ Reported accuracies are always 0.00% across all three sets
  • ❌ Makes TNeuralFit's metric reporting unreliable for monitoring

Possible Root Causes

  1. InferHitFn default implementation incompatible with regression targets (see the sketch after this list)
  2. Threading (TNeuralFit allocates 32 threads by default) interferes with metric calculation
  3. Accuracy calculation expects different target encoding/format
  4. Silent failure in metric computation while training proceeds
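
If cause 1 is the culprit, a hedged workaround is to plug in a regression-aware hit function before calling Fit. This sketch assumes TNeuralFit exposes an InferHitFn property taking the callback signature above; the 0.05 tolerance is an arbitrary illustrative choice, not a library default:

// Count a hit when the single normalized output lands within 0.05 of the target.
function RegressionHit(A, B: TNNetVolume; ThreadId: integer): boolean;
begin
  Result := Abs(A.FData[0] - B.FData[0]) < 0.05;
end;

// In {$mode objfpc}, procedural assignment needs the @ operator:
Trainer.InferHitFn := @RegressionHit;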

Workaround

Use manual training loops with NN.Compute() + NN.Backpropagate() for reliable accuracy metrics:

for epoch := 1 to MAX_EPOCHS do
begin
  for i := 0 to TrainingPairs.Count - 1 do
  begin
    NN.Compute(TrainingPairs[i].A);
    NN.Backpropagate(TrainingPairs[i].B);
  end;
  // Manual accuracy computation via forward passes
end;
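
The "manual accuracy" comment above could be filled in along these lines (a sketch: Hits is a local integer, vOutput a preallocated TNNetVolume, and the 0.05 tolerance is the same arbitrary choice as above):

// After each epoch: count predictions within the tolerance on the validation set.
Hits := 0;
for i := 0 to ValidationPairs.Count - 1 do
begin
  NN.Compute(ValidationPairs[i].A);
  NN.GetOutput(vOutput);
  if Abs(vOutput.FData[0] - ValidationPairs[i].B.FData[0]) < 0.05 then
    Inc(Hits);
end;
WriteLn(Format('Epoch %d accuracy: %.2f%%', [epoch, 100 * Hits / ValidationPairs.Count]));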

Environment

  • CAI Neural API: (latest from master branch)
  • FreePascal: 3.3.1+
  • Platform: Windows 10/11
  • Lazarus: 3.x

Impact

This prevents reliable use of TNeuralFit for model evaluation and progress monitoring, forcing users back to manual training loops even when batching would be beneficial.
</issue_description>

<a...



@joaopauloschuler
Owner

@copilot Please try again.
