You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
When use delta kernel to read delta lake table encountered such an error
list not found in optional group readerFeatures (LIST) {
repeated group bag {
optional binary array (STRING);
}
}
and the schema of parquet file is
And I think delta kernel currently does not supports reading this legacy array type.
Steps to reproduce
Observed results
Expected results
Further details
Environment information
Delta Lake version:
Spark version:
Scala version:
Willingness to contribute
The Delta Lake Community encourages bug fix contributions. Would you or another member of your organization be willing to contribute a fix for this bug to the Delta Lake code base?
Yes. I can contribute a fix for this bug independently.
Yes. I would be willing to contribute a fix for this bug with guidance from the Delta Lake community.
No. I cannot contribute a bug fix at this time.
The text was updated successfully, but these errors were encountered:
…repeated types (#3083)
## Description
When legacy mode is enabled in Spark, array physical types are stored
slightly different from the standard format.
Standard mode (default):
```
optional group readerFeatures (LIST) {
repeated group list {
optional binary element (STRING);
}
}
```
When write legacy mode is enabled (`spark.sql.parquet.writeLegacyFormat
= true`):
```
optional group readerFeatures (LIST) {
repeated group bag {
optional binary array (STRING);
}
}
```
TODO: We need to handle the 2-level lists. Will post a separate PR. The
challenge is with generating or finding the Parquet files with 2-level
lists.
## How was this patch tested?
Added tests
Fixes#3082
…repeated types (#3083)
## Description
When legacy mode is enabled in Spark, array physical types are stored
slightly different from the standard format.
Standard mode (default):
```
optional group readerFeatures (LIST) {
repeated group list {
optional binary element (STRING);
}
}
```
When write legacy mode is enabled (`spark.sql.parquet.writeLegacyFormat
= true`):
```
optional group readerFeatures (LIST) {
repeated group bag {
optional binary array (STRING);
}
}
```
TODO: We need to handle the 2-level lists. Will post a separate PR. The
challenge is with generating or finding the Parquet files with 2-level
lists.
## How was this patch tested?
Added tests
Fixes#3082
Bug
Which Delta project/connector is this regarding?
Describe the problem
When use delta kernel to read delta lake table encountered such an error
and the schema of parquet file is

And I think delta kernel currently does not supports reading this legacy array type.
Steps to reproduce
Observed results
Expected results
Further details
Environment information
Willingness to contribute
The Delta Lake Community encourages bug fix contributions. Would you or another member of your organization be willing to contribute a fix for this bug to the Delta Lake code base?
The text was updated successfully, but these errors were encountered: