New tabpfn transformers #239

jackjii79 · 2026-01-02T22:00:03Z

https://github.com/h2oai/h2oai/issues/34827

Due to the complexity of TabPFN, automated regression testing is skipped; manual testing results are shown below.

Copilot

Pull request overview

This PR introduces three new TabPFN-based transformers for Driverless AI that leverage pre-trained TabPFN models for outlier detection and embedding generation. The implementation includes both transformer and model components for unsupervised outlier detection, along with a supervised embedding transformer.

Key Changes:

Adds TabPFN-based outlier detection transformer with chunked processing and memory optimization
Implements TabPFN embedding transformer using supervised learning with SVD dimensionality reduction
Introduces unsupervised outlier detection model with Random Forest-based feature selection and density-aware sampling
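To illustrate the chunked processing mentioned above, here is a minimal sketch of scoring a large array in fixed-size row chunks so that only one chunk is passed to the scorer at a time. The names `score_in_chunks`, `score_fn`, and `chunk_size` are hypothetical and do not come from the PR; the actual transformer may chunk differently.

```python
import numpy as np

def score_in_chunks(X, score_fn, chunk_size=1024):
    """Apply score_fn to X in row chunks and collect one score per row.

    score_fn is a hypothetical stand-in for the model call; it must return
    one value per row of the chunk it receives.
    """
    n_rows = X.shape[0]
    out = np.empty(n_rows, dtype=np.float32)
    for start in range(0, n_rows, chunk_size):
        stop = min(start + chunk_size, n_rows)
        # Only X[start:stop] is handed to the scorer in this iteration.
        out[start:stop] = score_fn(X[start:stop])
    return out

# Usage with a toy scorer (row sums) in place of a real model.
X = np.arange(10, dtype=np.float32).reshape(5, 2)
scores = score_in_chunks(X, lambda chunk: chunk.sum(axis=1), chunk_size=2)
```

Preallocating the output avoids holding a list of per-chunk arrays and a concatenated copy at the same time.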

Reviewed changes

Copilot reviewed 1 out of 1 changed files in this pull request and generated 10 comments.

File	Description
transformers/outliers/tabpfn_outlier.py	Implements outlier detection transformer with chain-rule probability estimation across feature permutations, supporting chunked processing for large datasets
transformers/generic/tabpfn_embedding.py	Provides supervised embedding extraction from TabPFN models with automatic classification/regression detection and SVD-based dimensionality reduction
models/unsupervised/tabpfn_outlier.py	Implements unsupervised outlier model with surrogate RF for feature selection, density-aware sampling, and score calibration for probabilistic interpretation
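The chain-rule estimation named in the table factorizes a row's density as p(x) = ∏ p(x_j | earlier features) under some feature order, and averages the log-density over several random orders. The sketch below shows that structure only. It assumes a simple linear-Gaussian conditional as a stand-in for the TabPFN predictor, and every function name in it is hypothetical rather than taken from the PR.

```python
import numpy as np

def conditional_log_density(X_prev, y):
    """Stand-in for a learned p(y | X_prev): least-squares fit with Gaussian noise."""
    n = len(y)
    A = np.hstack([np.ones((n, 1)), X_prev])      # intercept + conditioning features
    coef, *_ = np.linalg.lstsq(A, y, rcond=None)
    resid = y - A @ coef
    var = max(resid.var(), 1e-9)                  # guard against zero variance
    return -0.5 * (np.log(2 * np.pi * var) + resid ** 2 / var)

def chain_rule_log_density(X, n_permutations=4, seed=0):
    """Average over feature orders of sum_j log p(x_j | features earlier in the order)."""
    rng = np.random.default_rng(seed)
    n, d = X.shape
    total = np.zeros(n)
    for _ in range(n_permutations):
        order = rng.permutation(d)
        for k, j in enumerate(order):
            total += conditional_log_density(X[:, order[:k]], X[:, j])
    return total / n_permutations

# Usage: plant one extreme row; lower log-density means more outlying.
rng = np.random.default_rng(1)
X = rng.normal(size=(200, 3))
X[0] = [10.0, 10.0, 10.0]
log_density = chain_rule_log_density(X)
```

With this linear-Gaussian stand-in the feature order has little or no effect on the result; averaging over permutations matters for a learned conditional estimator, whose predictions can depend on which features it conditions on.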


Copilot · 2026-01-02T22:03:43Z

transformers/outliers/tabpfn_outlier.py

+        finals = None
+        if full > final_output.shape[0]:
+            finals = np.full((full, 2 if self.return_flag else 1,), fill_value=0.0, dtype=np.float32)


The variable name 'finals' is unclear and doesn't convey its purpose. A more descriptive name like 'full_output' or 'padded_output' would better indicate that this array holds the full dataset with zero-filled values for unsampled rows.
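A minimal sketch of the padding step this comment refers to, using the suggested name `padded_output`. The diff does not show how sampled rows are mapped back to their original positions, so the `sampled_idx` argument and the `scatter_to_full` helper are assumptions made for illustration.

```python
import numpy as np

def scatter_to_full(sampled_scores, sampled_idx, n_rows, fill_value=0.0):
    """Place scores for sampled rows into a full-length array.

    Rows that were not sampled keep fill_value (0.0, matching the diff).
    """
    sampled_scores = np.asarray(sampled_scores, dtype=np.float32)
    if sampled_scores.ndim == 1:
        sampled_scores = sampled_scores[:, None]   # one output column
    padded_output = np.full(
        (n_rows, sampled_scores.shape[1]), fill_value=fill_value, dtype=np.float32
    )
    padded_output[sampled_idx] = sampled_scores
    return padded_output

# Usage: 2 of 5 rows were scored; the other 3 stay zero-filled.
out = scatter_to_full([0.5, 0.9], [1, 3], n_rows=5)
```

One design point to weigh: a 0.0 fill is indistinguishable from a genuine score of 0.0, so a sentinel such as NaN may be preferable if downstream code can handle it.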


Copilot

Pull request overview

Copilot reviewed 2 out of 2 changed files in this pull request and generated 20 comments.



jackjii79 · 2026-01-05T16:37:31Z

Testing results:

TabPFNEmbeddingTransformer
✅ Multiclassification
✅ Binary
✅ Regression

jackjii79 · 2026-01-06T05:09:45Z

TabPFNOutlierScorerModel
✅ Unsupervised learning

jackjii79 added 3 commits December 22, 2025 09:45

Add TabPFNOutlierScoreTransformer impl

b2c25dd

Add unsupervised models + embedding transformer

7f27a6d

Polish implementation

d782cc3

Copilot AI review requested due to automatic review settings January 2, 2026 22:00

Copilot started reviewing on behalf of jackjii79 January 2, 2026 22:00

jackjii79 added 2 commits January 2, 2026 14:00

Remove extra

21cc7bd

Remove extra

ad7566d

Copilot AI reviewed Jan 2, 2026


jackjii79 added 2 commits January 2, 2026 14:05

Add back

c4549ea

Remove extra

1bc2bef

jackjii79 self-assigned this Jan 2, 2026

jackjii79 requested a review from Copilot January 3, 2026 20:58

Copilot started reviewing on behalf of jackjii79 January 3, 2026 20:59

Copilot AI reviewed Jan 3, 2026


jackjii79 added 4 commits January 3, 2026 13:22

nit

898c469

feedback

b79de4d

Fix multiclasification

e579f2c

More fix

ee6eff4

Address parameter issues

e5fefe1

jackjii79 requested review from Mathanraj-Sharma, carmilea and rsujeevan January 6, 2026 05:12
