Skip to content
Change the repository type filter

All

    Repositories list

    • Python
      Apache License 2.0
      51924Updated Mar 3, 2026Mar 3, 2026
    • align-app

      Public
      Python
      Other
      0280Updated Feb 13, 2026Feb 13, 2026
    • Code for the paper "Aligning Machiavellian Agents: Behavior Steering via Test-Time Policy Shaping"
      Python
      Apache License 2.0
      0200Updated Dec 19, 2025Dec 19, 2025
    • Python
      0141Updated Oct 2, 2025Oct 2, 2025
    • Steerable Pluralism: Pluralistic Alignment via Few-Shot Comparative Regression
      Python
      Apache License 2.0
      0000Updated Sep 11, 2025Sep 11, 2025
    • Python
      0100Updated Jan 24, 2025Jan 24, 2025
    • LLMs as alignable Decision-Makers
      Python
      Apache License 2.0
      1501Updated Jul 29, 2024Jul 29, 2024