public interface VectorizedReader<T>
Modifier and Type | Method and Description |
---|---|
void | close() - Release any resources allocated. |
T | read(T reuse, int numRows) - Reads a batch of type T and of size numRows. |
void | setBatchSize(int batchSize) |
void | setRowGroupInfo(org.apache.parquet.column.page.PageReadStore pages, java.util.Map<org.apache.parquet.hadoop.metadata.ColumnPath,org.apache.parquet.hadoop.metadata.ColumnChunkMetaData> metadata, long rowPosition) - Sets the row group information to be used with this reader. |
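
The methods summarized above suggest a calling pattern: set the batch size, read repeatedly while handing the previous batch back as `reuse`, then close the reader. The following is a minimal sketch of that pattern, not Iceberg's implementation. `BatchReader` is a hypothetical stand-in that mirrors three of the interface's methods (it omits `setRowGroupInfo` so the example needs no Parquet classes), and `CountingReader` and `readAll` are toy names invented for illustration.

```java
import java.util.ArrayList;
import java.util.List;

public class VectorizedReadSketch {

  // Hypothetical stand-in mirroring three methods of VectorizedReader<T>.
  // setRowGroupInfo is omitted so the sketch has no Parquet dependency.
  interface BatchReader<T> {
    T read(T reuse, int numRows);

    void setBatchSize(int batchSize);

    void close();
  }

  // Toy reader (an assumption for illustration): emits consecutive integers.
  static final class CountingReader implements BatchReader<int[]> {
    private int batchSize = 0;
    private int next = 0;
    boolean closed = false;

    @Override
    public int[] read(int[] reuse, int numRows) {
      // Reuse the previous container when it is large enough, else allocate.
      int[] batch = (reuse != null && reuse.length >= numRows)
          ? reuse
          : new int[Math.max(numRows, batchSize)];
      for (int i = 0; i < numRows; i++) {
        batch[i] = next++;
      }
      return batch;
    }

    @Override
    public void setBatchSize(int batchSize) {
      this.batchSize = batchSize;
    }

    @Override
    public void close() {
      closed = true;
    }
  }

  // Drains totalRows rows in batches, passing each batch back in as `reuse`.
  static List<Integer> readAll(BatchReader<int[]> reader, int totalRows, int batchSize) {
    List<Integer> out = new ArrayList<>();
    reader.setBatchSize(batchSize);
    int[] batch = null;
    int remaining = totalRows;
    try {
      while (remaining > 0) {
        int numRows = Math.min(batchSize, remaining);
        batch = reader.read(batch, numRows);
        // Copy values out before the next read overwrites the container.
        for (int i = 0; i < numRows; i++) {
          out.add(batch[i]);
        }
        remaining -= numRows;
      }
    } finally {
      reader.close();
    }
    return out;
  }

  public static void main(String[] args) {
    System.out.println(readAll(new CountingReader(), 5, 2)); // prints [0, 1, 2, 3, 4]
  }
}
```

Because the container passed as `reuse` may be overwritten by the next call, the sketch copies values out of each batch before reading again.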
T read(T reuse, int numRows)
reuse - container for the last batch to be reused for next batch
numRows - number of rows to read

void setBatchSize(int batchSize)

void setRowGroupInfo(org.apache.parquet.column.page.PageReadStore pages, java.util.Map<org.apache.parquet.hadoop.metadata.ColumnPath,org.apache.parquet.hadoop.metadata.ColumnChunkMetaData> metadata, long rowPosition)
pages - row group information for all the columns
metadata - map of ColumnPath -> ColumnChunkMetaData for the row group
rowPosition - the row group's row offset in the parquet file

void close()
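
The `rowPosition` parameter of `setRowGroupInfo` is described as the row group's row offset in the file. One plausible reading, which is an assumption here rather than something this page states, is that it is the index of the row group's first row, that is, the total number of rows in all preceding row groups. Under that assumption, and with a hypothetical helper `rowGroupOffsets` that takes the per-row-group row counts, the offsets could be computed like this:

```java
import java.util.Arrays;

public class RowPositionSketch {

  // Hypothetical helper: offsets[i] is the total row count of row groups 0..i-1,
  // i.e. the assumed rowPosition of row group i.
  static long[] rowGroupOffsets(long[] rowCounts) {
    long[] offsets = new long[rowCounts.length];
    long position = 0;
    for (int i = 0; i < rowCounts.length; i++) {
      offsets[i] = position;
      position += rowCounts[i];
    }
    return offsets;
  }

  public static void main(String[] args) {
    long[] offsets = rowGroupOffsets(new long[] {100, 50, 75});
    System.out.println(Arrays.toString(offsets)); // prints [0, 100, 150]
  }
}
```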