Skip to content
Change the repository type filter

All

    Repositories list

    • auctus_v2

      Public
      Auctus++ is an advanced dataset discovery engine powered by multi-provider ingestion (Socrata), vector-based k-NN search, and dynamic on-demand data profiling.
      Python
      2001Updated Jun 29, 2026Jun 29, 2026
    • Next.js based website to showcase all research in VIDA lab.
      MDX
      3000Updated Jun 26, 2026Jun 26, 2026
    • Python
      3000Updated Jun 22, 2026Jun 22, 2026
    • bdi-viz

      Public
      TypeScript
      MIT License
      0411Updated Jun 22, 2026Jun 22, 2026
    • tile2net

      Public
      Automated mapping of pedestrian networks from aerial imagery tiles
      Python
      BSD 3-Clause "New" or "Revised" License
      3522112Updated Jun 21, 2026Jun 21, 2026
    • AutoDDG

      Public
      [SIGMOD '26] Automated Dataset Description Generation using Large Language Models
      Python
      Apache License 2.0
      62211Updated Jun 16, 2026Jun 16, 2026
    • Data Gatherer is a retrieval-augmented extraction tool to extract structured dataset references from scientific articles.
      Jupyter Notebook
      3800Updated Jun 16, 2026Jun 16, 2026
    • BugDoc

      Public
      BugDoc: python package to debug computational pipelines
      Python
      MIT License
      21100Updated Jun 12, 2026Jun 12, 2026
    • Python
      MIT License
      2311Updated Jun 10, 2026Jun 10, 2026
    • BDF

      Public
      HTML
      MIT License
      168000Updated Jun 3, 2026Jun 3, 2026
    • bdi-kit

      Public
      A toolkit for biomedical data integration and harmonization
      Python
      Apache License 2.0
      32541Updated May 29, 2026May 29, 2026
    • D3M

      Public
      Less
      0000Updated May 22, 2026May 22, 2026
    • API deployment on NYU HSRN Cluster
      Python
      1000Updated May 4, 2026May 4, 2026
    • A node-based ETL tool for urban planning and geospatial analysis.
      Jupyter Notebook
      1100Updated Apr 28, 2026Apr 28, 2026
    • harmonia

      Public
      An LLM-based agent for tabular data harmonization, powered by the bdi-kit library.
      Python
      MIT License
      4600Updated Apr 19, 2026Apr 19, 2026
    • Python
      0200Updated Apr 3, 2026Apr 3, 2026
    • https://vida-nyu.github.io/DISN-Wildlife-website
      HTML
      2100Updated Mar 27, 2026Mar 27, 2026
    • discovera

      Public
      Python
      0300Updated Mar 4, 2026Mar 4, 2026
    • Lightweight implementation of visflow
      TypeScript
      0000Updated Feb 13, 2026Feb 13, 2026
    • reprozip

      Public
      ReproZip is a tool that simplifies the process of creating reproducible experiments from command-line executions, a frequently-used common denominator in comput…
      Python
      BSD 3-Clause "New" or "Revised" License
      373626810Updated Feb 4, 2026Feb 4, 2026
    • Data collection and enhancement efforts for the USDOT Complete Streets Artificial Intelligence (CSAI) Initiative — Phase I. This project focuses on generating g…
      Python
      Apache License 2.0
      4421Updated Jan 29, 2026Jan 29, 2026
    • CitySurfaces semantic segmentation of sidewalk surfaces
      Python
      BSD 3-Clause "New" or "Revised" License
      125510Updated Jan 27, 2026Jan 27, 2026
    • Python
      BSD 3-Clause "New" or "Revised" License
      1200Updated Dec 30, 2025Dec 30, 2025
    • Spatial Join & Enrich any urban layer given any external urban dataset of interest, streamline your urban analysis with Scikit-Learn-Like pipelines, and share …
      Python
      MIT License
      36622Updated Dec 20, 2025Dec 20, 2025
    • [VLDB '25] Magneto combines small and large language models to provide cost-effective schema matching.
      Jupyter Notebook
      Apache License 2.0
      61831Updated Dec 18, 2025Dec 18, 2025
    • OpenSpace WebRTC enables real-time streaming of OpenSpace-rendered visualizations to a standard web browser using GStreamer and WebRTC.
      1000Updated Dec 18, 2025Dec 18, 2025
    • Python
      2301Updated Dec 9, 2025Dec 9, 2025
    • hilts

      Public
      A tool for data exploration and labeling using multi-modal embedding models.
      Svelte
      1000Updated Dec 9, 2025Dec 9, 2025
    • LTS

      Public
      Python
      4400Updated Dec 9, 2025Dec 9, 2025
    • HTML
      Apache License 2.0
      0000Updated Oct 28, 2025Oct 28, 2025
    ProTip! When viewing an organization's repositories, you can use the props. filter to filter by custom property.