Skip to content
@capitalone

Capital One

We’re an open source-first organization — actively using, contributing to and managing open source software projects.

Pinned Loading

  1. DataProfiler DataProfiler Public

    What's in your data? Extract schema, statistics and entities from datasets

    Python 1.5k 176

  2. datacompy datacompy Public

    Pandas, Polars, Spark, and Snowpark DataFrame comparison for humans and more!

    Python 601 146

  3. locopy locopy Public

    locopy: Loading/Unloading to Redshift and Snowflake using Python.

    Python 113 50

  4. rubicon-ml rubicon-ml Public

    Capture all information throughout your model's development in a reproducible way and tie results directly to the model code!

    Jupyter Notebook 137 36

  5. dataCompareR dataCompareR Public

    dataCompareR is an R package that allows users to compare two datasets and view a report on the similarities and differences.

    R 75 26

  6. edgetest edgetest Public

    edgetest is a tox-inspired python library that will loop through your project's dependencies, and check if your project is compatible with the latest version of each dependency

    Python 25 8

Repositories

Showing 10 of 49 repositories
  • datacompy Public

    Pandas, Polars, Spark, and Snowpark DataFrame comparison for humans and more!

    capitalone/datacompy’s past year of commit activity
    Python 601 Apache-2.0 146 11 (1 issue needs help) 4 Updated Sep 10, 2025
  • rubicon-ml Public

    Capture all information throughout your model's development in a reproducible way and tie results directly to the model code!

    capitalone/rubicon-ml’s past year of commit activity
    Jupyter Notebook 137 Apache-2.0 36 8 2 Updated Sep 9, 2025
  • DataProfiler Public

    What's in your data? Extract schema, statistics and entities from datasets

    capitalone/DataProfiler’s past year of commit activity
    Python 1,516 Apache-2.0 176 69 (8 issues need help) 9 Updated Sep 8, 2025
  • synthetic-data Public

    Generating complex, nonlinear datasets appropriate for use with deep learning/black box models which 'need' nonlinearity


    capitalone/synthetic-data’s past year of commit activity
    Python 44 Apache-2.0 29 3 3 Updated Sep 8, 2025
  • capitalone/c1s-slingshot-sdk-py’s past year of commit activity
    Python 1 Apache-2.0 2 0 1 Updated Sep 2, 2025
  • edgetest Public

    edgetest is a tox-inspired python library that will loop through your project's dependencies, and check if your project is compatible with the latest version of each dependency

    capitalone/edgetest’s past year of commit activity
    Python 25 Apache-2.0 8 4 (1 issue needs help) 0 Updated Aug 25, 2025
  • federated-model-aggregation Public

    The Federated Model Aggregation (FMA) Service is a collection of installable python components that make up the generic workflow/infrastructure needed for federated learning.

    capitalone/federated-model-aggregation’s past year of commit activity
    Python 32 Apache-2.0 11 19 (1 issue needs help) 2 Updated Aug 20, 2025
  • .github Public
    capitalone/.github’s past year of commit activity
    0 1 0 0 Updated Aug 13, 2025
  • acronym-decoder Public

    Acronym Decoder

    capitalone/acronym-decoder’s past year of commit activity
    TypeScript 44 Apache-2.0 26 3 10 Updated Aug 12, 2025
  • Stratum-Observability Public

    A no-dependency library to send standardized events to observability and data platforms. Based on plugins, Stratum enables the cataloging of app-specific logic to define, validate, and publish events to your entire stack.

    capitalone/Stratum-Observability’s past year of commit activity
    TypeScript 24 Apache-2.0 9 6 2 Updated Aug 5, 2025