VectorizedArrowReader

java.lang.Object
- org.apache.iceberg.arrow.vectorized.VectorizedArrowReader

All Implemented Interfaces:

VectorizedReader<VectorHolder>

Direct Known Subclasses:

VectorizedArrowReader.ConstantVectorReader, VectorizedArrowReader.DeletedVectorReader
```
public class VectorizedArrowReader
extends java.lang.Object
implements VectorizedReader<VectorHolder>
```
VectorReader(s) that read in a batch of values into Arrow vectors. It also takes care of allocating the right kind of Arrow vectors depending on the corresponding Iceberg/Parquet data types.

Nested Class Summary

Nested Classes
Modifier and Type	Class and Description
`static class`	`VectorizedArrowReader.ConstantVectorReader<T>` A Dummy Vector Reader which doesn't actually read files, instead it returns a dummy VectorHolder which indicates the constant value which should be used for this column.
`static class`	`VectorizedArrowReader.DeletedVectorReader` A Dummy Vector Reader which doesn't actually read files.

Field Summary

Fields
Modifier and Type Field and Description

static int DEFAULT_BATCH_SIZE

Fields
Modifier and Type	Field and Description
`static int`	`DEFAULT_BATCH_SIZE`

Constructor Summary

Constructors
Constructor and Description
`VectorizedArrowReader(org.apache.parquet.column.ColumnDescriptor desc, Types.NestedField icebergField, org.apache.arrow.memory.BufferAllocator ra, boolean setArrowValidityVector)`

Method Summary

All Methods Static Methods Instance Methods Concrete Methods
Modifier and Type	Method and Description
`void`	`close()` Release any resources allocated.
`static VectorizedArrowReader`	`nulls()`
`static VectorizedArrowReader`	`positions()`
`static VectorizedArrowReader`	`positionsWithSetArrowValidityVector()`
`VectorHolder`	`read(VectorHolder reuse, int numValsToRead)` Reads a batch of type @param <T> and of size numRows
`void`	`setBatchSize(int batchSize)`
`void`	`setRowGroupInfo(org.apache.parquet.column.page.PageReadStore source, java.util.Map<org.apache.parquet.hadoop.metadata.ColumnPath,org.apache.parquet.hadoop.metadata.ColumnChunkMetaData> metadata, long rowPosition)` Sets the row group information to be used with this reader
`java.lang.String`	`toString()`

Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, wait, wait, wait

- Field Detail
  - DEFAULT_BATCH_SIZE
```
public static final int DEFAULT_BATCH_SIZE
```
    See Also:
    
    Constant Field Values
- Constructor Detail
  - VectorizedArrowReader
```
public VectorizedArrowReader(org.apache.parquet.column.ColumnDescriptor desc,
                             Types.NestedField icebergField,
                             org.apache.arrow.memory.BufferAllocator ra,
                             boolean setArrowValidityVector)
```
- Method Detail
  - setBatchSize
```
public void setBatchSize(int batchSize)
```
    Specified by:
    
    setBatchSize in interface VectorizedReader<VectorHolder>
  - read
```
public VectorHolder read(VectorHolder reuse,
                         int numValsToRead)
```
    Description copied from interface: VectorizedReader
    
    Reads a batch of type @param <T> and of size numRows
    
    Specified by:
    
    read in interface VectorizedReader<VectorHolder>
    
    Parameters:
    
    reuse - container for the last batch to be reused for next batch
    
    numValsToRead - number of rows to read
    
    Returns:
    
    batch of records of type @param <T>
  - setRowGroupInfo
```
public void setRowGroupInfo(org.apache.parquet.column.page.PageReadStore source,
                            java.util.Map<org.apache.parquet.hadoop.metadata.ColumnPath,org.apache.parquet.hadoop.metadata.ColumnChunkMetaData> metadata,
                            long rowPosition)
```
    Description copied from interface: VectorizedReader
    
    Sets the row group information to be used with this reader
    
    Specified by:
    
    setRowGroupInfo in interface VectorizedReader<VectorHolder>
    
    Parameters:
    
    source - row group information for all the columns
    
    metadata - map of ColumnPath -> ColumnChunkMetaData for the row group
    
    rowPosition - the row group's row offset in the parquet file
  - close
```
public void close()
```
    Description copied from interface: VectorizedReader
    
    Release any resources allocated.
    
    Specified by:
    
    close in interface VectorizedReader<VectorHolder>
  - toString
```
public java.lang.String toString()
```
    Overrides:
    
    toString in class java.lang.Object
  - nulls
```
public static VectorizedArrowReader nulls()
```
  - positions
```
public static VectorizedArrowReader positions()
```
  - positionsWithSetArrowValidityVector
```
public static VectorizedArrowReader positionsWithSetArrowValidityVector()
```

Class VectorizedArrowReader

Nested Class Summary

Field Summary

Constructor Summary

Method Summary

Methods inherited from class java.lang.Object

Field Detail

DEFAULT_BATCH_SIZE

Constructor Detail

VectorizedArrowReader

Method Detail

setBatchSize

read

setRowGroupInfo

close

toString

nulls

positions

positionsWithSetArrowValidityVector