public class VectorizedOrcAcidRowBatchReader extends Object implements org.apache.hadoop.mapred.RecordReader<org.apache.hadoop.io.NullWritable,org.apache.hadoop.hive.ql.exec.vector.VectorizedRowBatch>
| Modifier and Type | Class and Description |
|---|---|
| protected static interface | VectorizedOrcAcidRowBatchReader.DeleteEventRegistry. An interface that can determine which rows have been deleted from a given vectorized row batch. |
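The registry is a protected detail of the reader rather than part of its public API. Purely as a hypothetical sketch of what such an interface can look like (the type and method names below are illustrative, not the actual Hive declarations), assuming deletions are communicated by clearing bits in a BitSet of candidate row positions:

```java
import java.io.IOException;
import java.util.BitSet;

import org.apache.hadoop.hive.ql.exec.vector.ColumnVector;

// Hypothetical sketch of a delete-event registry: given the column vectors of a
// row batch, clear the bits of rows that a delete event has removed.
interface DeleteRegistrySketch extends AutoCloseable {

  /** Clears selectedBitSet bits for every row in cols[0..size) that has been deleted. */
  void findDeletedRecords(ColumnVector[] cols, int size, BitSet selectedBitSet) throws IOException;

  /** True if no delete events exist for this split, so filtering can be skipped. */
  boolean isEmpty();

  @Override
  void close() throws IOException;
}
```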
| Modifier and Type | Field and Description |
|---|---|
| protected Object[] | partitionValues |
| protected float | progress |
| Constructor and Description |
|---|
| VectorizedOrcAcidRowBatchReader(OrcSplit inputSplit, org.apache.hadoop.mapred.JobConf conf, org.apache.hadoop.mapred.Reporter reporter, org.apache.hadoop.mapred.RecordReader<org.apache.hadoop.io.NullWritable,org.apache.hadoop.hive.ql.exec.vector.VectorizedRowBatch> baseReader, VectorizedRowBatchCtx rbCtx, boolean isFlatPayload). LLAP IO constructor. |
| Modifier and Type | Method and Description |
|---|---|
| void | close() |
| org.apache.hadoop.io.NullWritable | createKey() |
| org.apache.hadoop.hive.ql.exec.vector.VectorizedRowBatch | createValue() |
| long | getPos() |
| float | getProgress() |
| boolean | next(org.apache.hadoop.io.NullWritable key, org.apache.hadoop.hive.ql.exec.vector.VectorizedRowBatch value). There are 2 types of schema from the baseReader that this handles. |
| void | setBaseAndInnerReader(org.apache.hadoop.mapred.RecordReader<org.apache.hadoop.io.NullWritable,org.apache.hadoop.hive.ql.exec.vector.VectorizedRowBatch> baseReader) |
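Taken together, these methods are the standard org.apache.hadoop.mapred.RecordReader contract, so the reader can be driven like any other key/value reader. A minimal consumer sketch, assuming only that contract; the AcidBatchConsumer class and countRows helper are illustrative names, not part of Hive:

```java
import java.io.IOException;

import org.apache.hadoop.hive.ql.exec.vector.VectorizedRowBatch;
import org.apache.hadoop.io.NullWritable;
import org.apache.hadoop.mapred.RecordReader;

public class AcidBatchConsumer {

  /**
   * Drives any RecordReader<NullWritable, VectorizedRowBatch>, such as
   * VectorizedOrcAcidRowBatchReader, through the standard key/value loop
   * and returns the number of rows delivered.
   */
  static long countRows(RecordReader<NullWritable, VectorizedRowBatch> reader) throws IOException {
    NullWritable key = reader.createKey();
    VectorizedRowBatch batch = reader.createValue();
    long rows = 0;
    try {
      // next() returns false once there is no more data (the value batch is empty).
      while (reader.next(key, batch)) {
        rows += batch.size; // number of rows populated in this batch
      }
    } finally {
      reader.close();
    }
    return rows;
  }
}
```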
protected float progress
protected Object[] partitionValues
public VectorizedOrcAcidRowBatchReader(OrcSplit inputSplit, org.apache.hadoop.mapred.JobConf conf, org.apache.hadoop.mapred.Reporter reporter, org.apache.hadoop.mapred.RecordReader<org.apache.hadoop.io.NullWritable,org.apache.hadoop.hive.ql.exec.vector.VectorizedRowBatch> baseReader, VectorizedRowBatchCtx rbCtx, boolean isFlatPayload) throws IOException
Throws: IOException

public void setBaseAndInnerReader(org.apache.hadoop.mapred.RecordReader<org.apache.hadoop.io.NullWritable,org.apache.hadoop.hive.ql.exec.vector.VectorizedRowBatch> baseReader)
public boolean next(org.apache.hadoop.io.NullWritable key,
org.apache.hadoop.hive.ql.exec.vector.VectorizedRowBatch value)
throws IOException
There are 2 types of schema from the baseReader that this handles. In the case
the data was written to a transactional table from the start, every row is decorated with
transaction related info in the form of a RecordIdentifier. For "original" files, written
before the table became transactional, these values are instead generated at read time: they are
assigned each time the table is read in a way that needs to project VirtualColumn.ROWID.
Major compaction will attach these values to each row permanently.
It's critical that these generated column values are assigned exactly the same way by each
read of the same row and by the Compactor.
See CompactorMR and
OrcRawRecordMerger.OriginalReaderPairToCompact for the Compactor read path.
(Longer term, the compactor should be made to use this class.)
This only decorates original rows with metadata if something above is requesting these values
or if there are Delete events to apply.
Specified by: next in interface org.apache.hadoop.mapred.RecordReader<org.apache.hadoop.io.NullWritable,org.apache.hadoop.hive.ql.exec.vector.VectorizedRowBatch>
Returns: false when there is no more data, i.e. value is empty
Throws: IOException
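A robust way to consume the batches produced by next() is to honor the batch's selected vector; that also covers the case where this reader masks out deleted rows rather than compacting the batch (an assumption about the implementation, not something stated on this page). A minimal sketch using only the standard VectorizedRowBatch fields (size, selectedInUse, selected); the BatchRows helper is an illustrative name:

```java
import java.util.function.IntConsumer;

import org.apache.hadoop.hive.ql.exec.vector.VectorizedRowBatch;

// Illustrative helper: visit the logical rows of a batch, honoring the
// 'selected' indirection that vectorized operators use to mask filtered rows.
final class BatchRows {
  static void forEachRow(VectorizedRowBatch batch, IntConsumer body) {
    for (int i = 0; i < batch.size; i++) {
      // When selectedInUse is set, 'selected' holds the live row positions;
      // otherwise rows 0..size-1 are all live.
      int row = batch.selectedInUse ? batch.selected[i] : i;
      body.accept(row); // 'row' indexes into batch.cols[c] for each projected column c
    }
  }
}
```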
public org.apache.hadoop.io.NullWritable createKey()
Specified by: createKey in interface org.apache.hadoop.mapred.RecordReader<org.apache.hadoop.io.NullWritable,org.apache.hadoop.hive.ql.exec.vector.VectorizedRowBatch>

public org.apache.hadoop.hive.ql.exec.vector.VectorizedRowBatch createValue()
Specified by: createValue in interface org.apache.hadoop.mapred.RecordReader<org.apache.hadoop.io.NullWritable,org.apache.hadoop.hive.ql.exec.vector.VectorizedRowBatch>

public long getPos()
throws IOException
Specified by: getPos in interface org.apache.hadoop.mapred.RecordReader<org.apache.hadoop.io.NullWritable,org.apache.hadoop.hive.ql.exec.vector.VectorizedRowBatch>
Throws: IOException

public void close()
throws IOException
Specified by: close in interface Closeable
Specified by: close in interface AutoCloseable
Specified by: close in interface org.apache.hadoop.mapred.RecordReader<org.apache.hadoop.io.NullWritable,org.apache.hadoop.hive.ql.exec.vector.VectorizedRowBatch>
Throws: IOException

public float getProgress()
throws IOException
Specified by: getProgress in interface org.apache.hadoop.mapred.RecordReader<org.apache.hadoop.io.NullWritable,org.apache.hadoop.hive.ql.exec.vector.VectorizedRowBatch>
Throws: IOException