ENH: first-class UUID support

### Feature Type

- [x] Adding new functionality to pandas
- [ ] Changing existing functionality in pandas
- [ ] Removing existing functionality in pandas


### Problem Description

A lot of people want to use UUID arrays, so we should provide first-class support.

If you want to play around with it, I prototyped this functionality here: https://pypi.org/project/pandas-uuid/

### Feature Description

I suggest to do something similar as for `{Arrow,}StringArray`, i.e. have a dyad of `ExtensionArray` types, one backed by numpy and one by pyarrow.

They don’t need a lot of features except for comparison (`self == other` / `self == elem`) and membership tests (`self.__contains__(elem)`/`elem in self` and `self.isin(other)`).

The numpy variant needs to be backed by a `np.void(16)`/`"V16"` array, since [`np.bytes_`/`"S"`](https://numpy.org/doc/stable/reference/arrays.scalars.html#numpy.bytes_) has special treatment of null bytes, whereas numeric types (like UUIDs) assign no special meaning to them.

### Alternative Solutions

- We could use `MaskedArray` as base for the numpy-backed variant to allow missing values in both cases.
- We could just force people to rely on pyarrow for this functionality, but I feel that wrapping `np.void(16)` is simple enough. People didn’t like adding the pyarrow dependency for basic functionality, so I assume you want to keep adding basic features that don’t rely on pyarrow.
- We could keep the `pandas-uuid` package around, see [here](https://icb-pandas-uuid.readthedocs-hosted.com/en/latest/#pandas-integration) for its limitations.

### Additional Context

- Turns out the following issue isn’t actually a blocker if we have the `ExtensionDtype` report its `kind` to be `"O"`: #54810 
- We should test that things like this work by default: #59132


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

ENH: first-class UUID support #63511

Feature Type

Problem Description

Feature Description

Alternative Solutions

Additional Context

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Uh oh!

ENH: first-class UUID support #63511

Description

Feature Type

Problem Description

Feature Description

Alternative Solutions

Additional Context

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions