public class SparkOrcReader extends java.lang.Object implements OrcRowReader<org.apache.spark.sql.catalyst.InternalRow>
It minimizes allocations by reusing most of the objects in the implementation.
Constructor and Description |
---|
SparkOrcReader(Schema expectedSchema,
org.apache.orc.TypeDescription readSchema) |
SparkOrcReader(Schema expectedSchema,
org.apache.orc.TypeDescription readOrcSchema,
java.util.Map<java.lang.Integer,?> idToConstant) |
Modifier and Type | Method and Description |
---|---|
org.apache.spark.sql.catalyst.InternalRow |
read(org.apache.orc.storage.ql.exec.vector.VectorizedRowBatch batch,
int row)
Reads a row.
|
void |
setBatchContext(long batchOffsetInFile) |
public SparkOrcReader(Schema expectedSchema, org.apache.orc.TypeDescription readSchema)
public SparkOrcReader(Schema expectedSchema, org.apache.orc.TypeDescription readOrcSchema, java.util.Map<java.lang.Integer,?> idToConstant)
public org.apache.spark.sql.catalyst.InternalRow read(org.apache.orc.storage.ql.exec.vector.VectorizedRowBatch batch, int row)
OrcRowReader
read
in interface OrcRowReader<org.apache.spark.sql.catalyst.InternalRow>
public void setBatchContext(long batchOffsetInFile)
setBatchContext
in interface OrcRowReader<org.apache.spark.sql.catalyst.InternalRow>