Python Language Generation #808

ryanpeach · 2024-10-01T19:17:44Z

What kind of change does this PR introduce?

Feature

What is the current behavior?

Does not generate python,

What is the new behavior?

Practicing TDD, designing tests first and asking the community for feedback.

Additional Comments

I'd like everyones feedback on this format for the python types. I'm using pydantic and mimicing the go structure, since its closest to python dataclasses (basically structs).

ryanpeach · 2024-10-01T21:41:16Z

I'm not really sure what a lot of these things other than "Tables" are. Materialized Views? Why are there different types for different operations? CompositeTypes?

Anyway, if we agree on the TDD output, I'll make it work 🙂

ryanpeach · 2024-10-03T16:19:42Z

I'd also like to do this again, but for rust.

lwih · 2024-10-31T18:29:01Z

Man I'm looking forward to that :)

yangcheng · 2024-11-14T02:27:02Z

happy to be early tester!

ryanpeach · 2024-11-21T20:55:49Z

I've dropped my work on it atm if someone wants to take it the final mile, or wants to answer my questions on this pr, that would be great! Eventually i'll get back to working on supabase.

troyshu · 2024-12-14T12:40:25Z

This is really cool @ryanpeach! Not a supabase team member, just a supabase user who'd love to have python types generated by the CLI. Just wondering, what questions or todos remain before making this an open PR (instead of draft)?

ryanpeach · 2024-12-16T01:09:37Z

I need someone to check the tests. I know I’ve not covered all cases, but I don’t understand some of the test cases in the other code generators. Basically I need a code review.

mikelong10 · 2025-02-08T19:57:36Z

Hey, y'all 👋 just wanted to join the convo here

I'm working on a project with a Next.js frontend, Python FastAPI backend, and Supabase, and recently realized there was no python type gen support :(

I came across #795 when I was Googling around and then saw this PR here. Hoping to spark the convo and get the ball rolling again here! I did a quick look through, and the changes seem on the right track to me.

I'm def a little out of my domain here, but I saw @sweatybridge you commented on the original #795 saying you'd be more than happy to review a PR for this issue. Seems like @ryanpeach would love to get your or someone else from Supabase's eyes on this!

grdsdev · 2025-02-11T13:21:17Z

Hi @ryanpeach, thanks a bunch for all the hard work you put into this PR!

I’d love to chat about this and figure out how we want to approach Python type generation. The way you implemented it, as raw Python models, sounds great, but we might want to take it a step further and try something more aligned with what we have in JS.

In postgrest-js, the generated types are integrated into the PostgrestClient as you can see on https://github.com/supabase/postgrest-js/blob/master/src/PostgrestClient.ts#L18-L24.

This makes types available during runtime usage of the library.

I’m not a Python expert, so I’m not sure how far Python types can go.

ryanpeach · 2025-02-11T15:19:33Z

@grdsdev I did see that, but I think we should go for language availability before we go for complexity. That's why I emulated the go structs. Maybe all that can go in a V2.

In fact, I think most languages can extend your go examples. Rust for instance. As it has no inheritance and no complex typing as far as I remember. I'd like to do a rust PR next, as I'm a python/rust primary dev.

I think the JS way would be very unpythonic, but I understand the reasons for it and we should try to emulate those reasons in a pythonic way in a V2.

ryanpeach · 2025-02-11T15:26:40Z

Either way, I think the best solution is this:

Create a canonical "schema" for the database in TDD. Release a readme describing all the system features it describes.
Create an agreed upon "test" for the language generation in a PR. Evaluate it and vote on it.
Write the code to generate the test (the easiest part). Release.

Right now I don't understand all the system features described in the database schema, so I have a hard time implementing a code generator for the ones I don't understand.

soedirgo · 2025-02-14T04:55:11Z

we should go for language availability before we go for complexity. That's why I emulated the go structs

I'll have to disagree on this - the typegen needs to be well integrated into and narrowly scoped into the use case of the SDK. I don't see how generating structs/interfaces on its own would improve the Supabase DX.

I'd actually use the Go types as a reason against adding this, because to this day supabase-go still doesn't use the Go types (cmiiw), and it's not clear how you're meant to use the two together.

soedirgo · 2025-02-14T05:00:03Z

To elaborate further: some of the reasons we added typings to supabase-js are because it affords us:

autocompletion (table name in .from(), column name in filters)
static type checking for missing columns in .select()
typed result based on the query in .select()

I'm not convinced of the benefits of adding typings if it doesn't afford us these things.

ryanpeach · 2025-02-14T16:35:24Z

@soedirgo you aren't convinced that having structs which match your tables, and having those structs autogenerated, is useful?

I'm sorry but I can't even begin to see the logic in that perspective... It's a basic serialization/deserialization paradigm. It's 1000x better than having an untyped object without data validation, static type checking, or IDE autocompletion.

And I think the interest in this issue is clear that people care about it. How many of the tools developed here ad-hoc by developers to fix this issue just for python go beyond struct generation?

The more advanced options of getting certain types from certain parameterizations of from or select, I'm not sure that's even possible in python typing. If so it's very complicated, using typing.overload and the use of typing.Literal. But it's definitely a v2 IMO.

soedirgo · 2025-02-17T06:35:59Z

I see where you're coming from; if Python's typings doesn't support these features then it is what it is. I'll leave it to the supabase-py maintainers to decide if we want to add this in, because if we do want to add these features in v2 they'd be responsible for integrating the types with supabase-py. cc @juancarlospaco @silentworks @grdsdev

soedirgo · 2025-02-17T06:42:42Z

What would also help is if you could show an example of how you'd use these types in your code, because certain approaches end up making the code much more verbose and it gets unclear whether it's a DX improvement or not (example here)

ryanpeach · 2025-02-17T22:29:21Z

I'd just use pydantics deserialization and from dict capabilities after running a query, just to associate a type with the return and get data validation.

Any future complexity addition would need this capability first anyway.

silentworks · 2025-02-18T10:07:27Z

Great work here @ryanpeach. Can you provide a code example of how this would be used in a Python project?

ryanpeach · 2025-02-18T15:15:04Z

import PublicUsersSelect, PublicUsersInsert, PublicUsersUpdate from generated_table_classes
import supabase

# Select
selected = [PublicUsersSelect(x) from x in supabase.table("users").select("*").execute().data]

# Insert
to_insert = PublicUsersInsert(name="foo", status="bar")
inserted = [PublicUsersInsert(x) for x in supabase.table("users").insert(to_insert.as_dict()).execute().data]

# Update
to_update = selected.as_dict()
id = to_update.pop("id")
updated = [PublicUsersUpdate(x) for x in supabase.table("users").update(selected.as_dict()).eq("id", id).execute().data]

Things I don't understand are like:

Why is that table called public? Remember I copied from Go.
Could we combine "PublicUsers*" responses into one?

ryanpeach · 2025-02-18T15:26:10Z

From this exercise I think this DX is ideal:

import User, Users, UsersUpdate, UserInsert from generated_table_classes
import supabase

# Select
selected = Users(supabase.table("users").select("*").execute().data)

# Insert
to_insert = UserInsert(name="foo", status="bar")
inserted = Users(supabase.table("users").insert(to_insert.as_dict()).execute().data)

# Update
to_update = UsersUpdate(name="bar")
updated = Users(supabase.table("users").update(to_update.as_dict()).eq("id", selected.id).execute().data)

The difference between User, UsersUpdate, and UsersInsert would be:

# Nothing is optional
class User(BaseModel):
  id: Annotated[int, Field(alias="id")]
  name: Annotated[str, Field(alias="name")]
  status: Annotated[UserStatus, Field(alias="status")]

# Easily handle data lists
Users = lambda users: [User(x) from x in users]  # For now

# The id field is missing but all the others are non optional
class UserInsert(BaseModel):
  name: Annotated[str, Field(alias="name")]
  status: Annotated[UserStatus, Field(alias="status")]

# Every field is optional, no id field
class UsersUpdate(BaseModel):
  name: Annotated[str | None, Field(alias="name")]
  status: Annotated[UserStatus | None, Field(alias="status")]

ryanpeach · 2025-02-18T15:40:01Z

From the above comment, adding these functions to each object would make the DX even better:

import User, Users, UsersUpdate, UserInsert from generated_table_classes
# import supabase not needed

# Select
selected  = Users(Users.select("*").execute().data)

# Insert
to_insert = UserInsert(name="foo", status="bar")
inserted = Users(to_insert.insert().execute().data)

# Update
to_update = UsersUpdate(name="bar")
updated = Users(to_update.update().eq("id", selected.id).execute().data)

Very roughly pseudocoded:

import supabase
from pydantic import BaseModel

# Nothing is optional
class User(BaseModel):
  id: Annotated[int, Field(alias="id")]
  name: Annotated[str, Field(alias="name")]
  status: Annotated[UserStatus, Field(alias="status")]

# Easily handle data lists
class Users(list):
  def __init__(self, users: list[dict]):
    super().__init__(User(x) for x in users)

  @staticmethod
  def select(self, *args, **kwargs):
    return supabase.table("users").select(*args, **kwargs)

# The id field is missing but all the others are non optional
class UserInsert(BaseModel):
  name: Annotated[str, Field(alias="name")]
  status: Annotated[UserStatus, Field(alias="status")]
  
  def insert(self, *args, **kwargs):
    return supabase.table("users").insert(self.as_dict(), *args, **kwargs)

# Every field is optional, no id field
class UsersUpdate(BaseModel):
  name: Annotated[str | None, Field(alias="name")]
  status: Annotated[UserStatus | None, Field(alias="status")]
  
  def update(self, *args, **kwargs):
    return supabase.table("users").update(self.as_dict(), *args, **kwargs)

Much further and I think it would be rather complicated, basically rewriting the features of the python library.

silentworks · 2025-02-22T11:14:58Z

I think this would be a great addition to the CLI for sure.

mikelong10 · 2025-02-27T17:30:12Z

Yeah I think this looks great! In terms of what's useful/valuable and would improve DX, I'd have to agree with @ryanpeach.

Essentially, even just simple auto-generated pydantic model classes that help you automatically stay up to date with your supabase schema and give you static type-checked, validated data to work with is enough of a DX improvement to make this effort worth it in my opinion 🙏

From your last example, maybe we could do this to make it even cleaner and clearer? Let me know your guys' thoughts and if you guys think something like this would be doable! cc: @silentworks

Usage:

from generated_table_classes import Users, UsersInsert, UsersUpdate

insert_user_1 = Users.insert(UsersInsert(name="Ryan", email="ryan@gmail.com"))[0]
insert_user_2 = Users.insert(UsersInsert(name="Mike"))[0]

select_user_1 = Users.select("*", filters={"id": insert_user_1.id})[0]
select_all_users = Users.select("*")

update_user_1 = Users.update(
    UsersUpdate(email="ryan@gmail.com"), filters={"id": insert_user_1.id}
)[0]
update_user_2 = Users.update(
    UsersUpdate(name="Michael"), filters={"id": insert_user_2.id}
)[0]

Implementation:

# generated_table_classes.py

# User schema, required name, optional email
class User(BaseModel):
    id: Annotated[int, Field(alias="id")]
    name: Annotated[str, Field(alias="name")]
    email: Annotated[str | None, Field(alias="email")] = None


# UsersInsert schema, required name, optional email
class UsersInsert(BaseModel):
    name: Annotated[str, Field(alias="name")]
    email: Annotated[str | None, Field(alias="email")] = None


# UsersUpdate schema, optional name, optional email
class UsersUpdate(BaseModel):
    name: Annotated[str | None, Field(alias="name")] = None
    email: Annotated[str | None, Field(alias="email")] = None


# Single Users class used to interact with the users table
class Users:
    def __init__(self, users: list[dict]):
        self.users = [User(**x) for x in users]

    def __iter__(self):
        return iter(self.users)

    @staticmethod
    def select(
        *columns: str,
        filters: Dict[str, Any] | None = None,
        limit: int | None = None,
        order_by: str | None = None,
        descending: bool = True,
    ) -> List[User]:
        query = supabase.table("users").select(*columns)

        if filters:
            for column, value in filters.items():
                if column == "or":
                    query = query.or_(value)
                else:
                    query = query.eq(column, value)

        if order_by:
            query = query.order(order_by, desc=descending)

        if limit:
            query = query.limit(limit)

        response = query.execute().data
        return [User(**x) for x in response]

    @staticmethod
    def insert(user: UsersInsert) -> List[User]:
        response = supabase.table("users").insert(user.model_dump()).execute().data
        return [User(**x) for x in response]

    @staticmethod
    def update(
        user: UsersUpdate,
        filters: Dict[str, Any] | None = None,
    ) -> List[User]:
        query = supabase.table("users").update(user.model_dump())
        if filters:
            for column, value in filters.items():
                if column == "or":
                    query = query.or_(value)
                else:
                    query = query.eq(column, value)

        response = query.execute().data
        return [User(**x) for x in response]

mikelong10 · 2025-02-27T17:41:01Z

^ from testing locally, this seems to get the static type checking.

However, I will say that I have the "Mypy Type Checker" VSCode extension installed, and I don't believe by default Pydantic is able to catch the type errors of missing required fields in the UsersInsert class. For example, the Mypy extension picks this up, but if I disable the extension I don't get this red squiggly anymore:

Everything else seems good and working, though with intellisense picking up on it.

ryanpeach · 2025-03-04T08:58:10Z

Yeah I think this looks great! In terms of what's useful/valuable and would improve DX, I'd have to agree with @ryanpeach.

Essentially, even just simple auto-generated pydantic model classes that help you automatically stay up to date with your supabase schema and give you static type-checked, validated data to work with is enough of a DX improvement to make this effort worth it in my opinion 🙏

From your last example, maybe we could do this to make it even cleaner and clearer? Let me know your guys' thoughts and if you guys think something like this would be doable! cc: @silentworks

Usage:

from generated_table_classes import Users, UsersInsert, UsersUpdate

insert_user_1 = Users.insert(UsersInsert(name="Ryan", email="ryan@gmail.com"))[0]
insert_user_2 = Users.insert(UsersInsert(name="Mike"))[0]

select_user_1 = Users.select("*", filters={"id": insert_user_1.id})[0]
select_all_users = Users.select("*")

update_user_1 = Users.update(
    UsersUpdate(email="ryan@gmail.com"), filters={"id": insert_user_1.id}
)[0]
update_user_2 = Users.update(
    UsersUpdate(name="Michael"), filters={"id": insert_user_2.id}
)[0]

Implementation:

# generated_table_classes.py

# User schema, required name, optional email
class User(BaseModel):
    id: Annotated[int, Field(alias="id")]
    name: Annotated[str, Field(alias="name")]
    email: Annotated[str | None, Field(alias="email")] = None


# UsersInsert schema, required name, optional email
class UsersInsert(BaseModel):
    name: Annotated[str, Field(alias="name")]
    email: Annotated[str | None, Field(alias="email")] = None


# UsersUpdate schema, optional name, optional email
class UsersUpdate(BaseModel):
    name: Annotated[str | None, Field(alias="name")] = None
    email: Annotated[str | None, Field(alias="email")] = None


# Single Users class used to interact with the users table
class Users:
    def __init__(self, users: list[dict]):
        self.users = [User(**x) for x in users]

    def __iter__(self):
        return iter(self.users)

    @staticmethod
    def select(
        *columns: str,
        filters: Dict[str, Any] | None = None,
        limit: int | None = None,
        order_by: str | None = None,
        descending: bool = True,
    ) -> List[User]:
        query = supabase.table("users").select(*columns)

        if filters:
            for column, value in filters.items():
                if column == "or":
                    query = query.or_(value)
                else:
                    query = query.eq(column, value)

        if order_by:
            query = query.order(order_by, desc=descending)

        if limit:
            query = query.limit(limit)

        response = query.execute().data
        return [User(**x) for x in response]

    @staticmethod
    def insert(user: UsersInsert) -> List[User]:
        response = supabase.table("users").insert(user.model_dump()).execute().data
        return [User(**x) for x in response]

    @staticmethod
    def update(
        user: UsersUpdate,
        filters: Dict[str, Any] | None = None,
    ) -> List[User]:
        query = supabase.table("users").update(user.model_dump())
        if filters:
            for column, value in filters.items():
                if column == "or":
                    query = query.or_(value)
                else:
                    query = query.eq(column, value)

        response = query.execute().data
        return [User(**x) for x in response]

I think you need to keep it returning the Insert object and the Select object, otherwise you can't follow up with filters and modifiers which would be complicated to reimplement.

Added the test first for python

bd931c8

ryanpeach mentioned this pull request Oct 1, 2024

Generating types in python format #795

Open

ryanpeach added 2 commits October 1, 2024 15:40

First stab at modifying go.ts into python.ts

4e7a2fd

npm run format

77672e6

ryanpeach changed the title ~~Python Language Generation Tests~~ Python Language Generation Oct 1, 2024

ryanpeach added 2 commits October 3, 2024 14:18

Getting really close with our output

a900dfd

Now using the more modern Annotated format

661e881

silentworks mentioned this pull request Mar 11, 2025

Add return types supabase/postgrest-py#301

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Python Language Generation #808

Python Language Generation #808

ryanpeach commented Oct 1, 2024

ryanpeach commented Oct 1, 2024 •

edited

Loading

ryanpeach commented Oct 3, 2024

lwih commented Oct 31, 2024

yangcheng commented Nov 14, 2024

ryanpeach commented Nov 21, 2024

troyshu commented Dec 14, 2024

ryanpeach commented Dec 16, 2024 •

edited

Loading

mikelong10 commented Feb 8, 2025

grdsdev commented Feb 11, 2025

ryanpeach commented Feb 11, 2025 •

edited

Loading

ryanpeach commented Feb 11, 2025 •

edited

Loading

soedirgo commented Feb 14, 2025

soedirgo commented Feb 14, 2025

ryanpeach commented Feb 14, 2025 •

edited

Loading

soedirgo commented Feb 17, 2025

soedirgo commented Feb 17, 2025

ryanpeach commented Feb 17, 2025 •

edited

Loading

silentworks commented Feb 18, 2025

ryanpeach commented Feb 18, 2025 •

edited

Loading

ryanpeach commented Feb 18, 2025 •

edited

Loading

ryanpeach commented Feb 18, 2025 •

edited

Loading

silentworks commented Feb 22, 2025

mikelong10 commented Feb 27, 2025 •

edited

Loading

mikelong10 commented Feb 27, 2025 •

edited

Loading

ryanpeach commented Mar 4, 2025 •

edited

Loading

Usage:

Implementation:

Python Language Generation #808

Are you sure you want to change the base?

Python Language Generation #808

Conversation

ryanpeach commented Oct 1, 2024

What kind of change does this PR introduce?

What is the current behavior?

What is the new behavior?

Additional Comments

ryanpeach commented Oct 1, 2024 • edited Loading

ryanpeach commented Oct 3, 2024

lwih commented Oct 31, 2024

yangcheng commented Nov 14, 2024

ryanpeach commented Nov 21, 2024

troyshu commented Dec 14, 2024

ryanpeach commented Dec 16, 2024 • edited Loading

mikelong10 commented Feb 8, 2025

grdsdev commented Feb 11, 2025

ryanpeach commented Feb 11, 2025 • edited Loading

ryanpeach commented Feb 11, 2025 • edited Loading

soedirgo commented Feb 14, 2025

soedirgo commented Feb 14, 2025

ryanpeach commented Feb 14, 2025 • edited Loading

soedirgo commented Feb 17, 2025

soedirgo commented Feb 17, 2025

ryanpeach commented Feb 17, 2025 • edited Loading

silentworks commented Feb 18, 2025

ryanpeach commented Feb 18, 2025 • edited Loading

ryanpeach commented Feb 18, 2025 • edited Loading

ryanpeach commented Feb 18, 2025 • edited Loading

silentworks commented Feb 22, 2025

mikelong10 commented Feb 27, 2025 • edited Loading

Usage:

Implementation:

mikelong10 commented Feb 27, 2025 • edited Loading

ryanpeach commented Mar 4, 2025 • edited Loading

Usage:

Implementation:

ryanpeach commented Oct 1, 2024 •

edited

Loading

ryanpeach commented Dec 16, 2024 •

edited

Loading

ryanpeach commented Feb 11, 2025 •

edited

Loading

ryanpeach commented Feb 11, 2025 •

edited

Loading

ryanpeach commented Feb 14, 2025 •

edited

Loading

ryanpeach commented Feb 17, 2025 •

edited

Loading

ryanpeach commented Feb 18, 2025 •

edited

Loading

ryanpeach commented Feb 18, 2025 •

edited

Loading

ryanpeach commented Feb 18, 2025 •

edited

Loading

mikelong10 commented Feb 27, 2025 •

edited

Loading

mikelong10 commented Feb 27, 2025 •

edited

Loading

ryanpeach commented Mar 4, 2025 •

edited

Loading