RayReadParquetSettings¶
- class awswrangler.typing.RayReadParquetSettings¶
Bases:
dictTyped dictionary defining the settings for distributing reading calls using Ray.
Attributes
True to enable a faster reading of a large number of Parquet files.
Attributes Documentation
-
parallelism:
NotRequired[int]¶
-
override_num_blocks:
NotRequired[int]¶
-
bulk_read:
NotRequired[bool]¶ True to enable a faster reading of a large number of Parquet files. Offers improved performance due to not gathering the file metadata in a single node. The drawback is that it does not offer schema resolution, so it should only be used when the Parquet files are all uniform.
-
parallelism: