Support reading lists of Parquet files with `read_parquet`

Pandas should support reading lists of parquet files which contain the same schema. Currently, the `path` argument of `read_parquet` in Pandas must direct to either a directory or a single file. However, users may want to read from a subset of files that are in a single directory or files from different directories. This is a common use case that other systems like Bodo, Spark, Dask, ... all support.

Assuming `~/path/to/pqs` contains `part1.pq` and `part2.pq` with the same schema, we wish to do

```python3
pd.read_parquet(['~/path/to/pqs/part1.pq', '~/path/to/pqs/part2.pq'])
```
rather than read the files individually.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Support reading lists of Parquet files with `read_parquet` #7

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Support reading lists of Parquet files with read_parquet #7

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions

Support reading lists of Parquet files with `read_parquet` #7