java.lang.Object
  org.apache.hadoop.hive.ql.io.RCFile
public class RCFile
RCFiles, short for Record Columnar Files, are flat files
consisting of binary key/value pairs, and share much similarity with
SequenceFile.
RCFile stores the columns of a table in a record columnar way. It first
partitions rows horizontally into row splits, and then vertically
partitions each row split in a columnar way. RCFile stores the metadata
of a row split as the key part of a record, and all the data of the row
split as the value part. When writing, RCFile.Writer first holds records'
value bytes in memory, and ends a row split once the raw byte size of the
buffered records exceeds the given parameter Writer.columnsBufferSize,
which can be set like: conf.setInt(COLUMNS_BUFFER_SIZE_CONF_STR,
4 * 1024 * 1024).
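The row-split behaviour described above can be illustrated with a small, self-contained sketch. This is not the RCFile.Writer implementation: the class `RowSplitSketch`, its `partition` method, and the use of UTF-8 string lengths as the "raw byte size" are all assumptions made for illustration. It only shows the idea of buffering rows, ending a split once the buffered size exceeds a threshold, and keeping each split column by column.

```java
import java.util.ArrayList;
import java.util.List;
import java.nio.charset.StandardCharsets;

// Illustrative sketch only; hypothetical names, not part of Hive.
final class RowSplitSketch {

    /** One row split: columns.get(c) holds column c's values for every row in the split. */
    static final class RowSplit {
        final List<List<String>> columns = new ArrayList<>();
        int rowCount;
    }

    /**
     * Partitions rows horizontally into splits, then stores each split column-wise.
     * A split ends once the raw UTF-8 size of its buffered rows exceeds bufferSizeBytes.
     */
    static List<RowSplit> partition(List<String[]> rows, int columnCount, int bufferSizeBytes) {
        List<RowSplit> splits = new ArrayList<>();
        RowSplit current = newSplit(columnCount);
        int bufferedBytes = 0;
        for (String[] row : rows) {
            for (int c = 0; c < columnCount; c++) {
                current.columns.get(c).add(row[c]);
                bufferedBytes += row[c].getBytes(StandardCharsets.UTF_8).length;
            }
            current.rowCount++;
            if (bufferedBytes > bufferSizeBytes) {
                splits.add(current);
                current = newSplit(columnCount);
                bufferedBytes = 0;
            }
        }
        if (current.rowCount > 0) {
            splits.add(current); // flush the final, partially filled split
        }
        return splits;
    }

    private static RowSplit newSplit(int columnCount) {
        RowSplit split = new RowSplit();
        for (int c = 0; c < columnCount; c++) {
            split.columns.add(new ArrayList<String>());
        }
        return split;
    }

    public static void main(String[] args) {
        List<String[]> rows = new ArrayList<>();
        rows.add(new String[] {"1", "alice"});
        rows.add(new String[] {"2", "bob"});
        rows.add(new String[] {"3", "carol"});
        for (RowSplit split : partition(rows, 2, 8)) {
            System.out.println(split.rowCount + " rows, columns = " + split.columns);
        }
    }
}
```

In the real writer the threshold comes from the configuration value mentioned above rather than from a method argument; the sketch passes it directly to stay independent of Hadoop.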
RCFile provides the RCFile.Writer and RCFile.Reader classes for
writing and reading, respectively.
RCFile compresses values in a more fine-grained manner than record-level
compression. However, it does not currently support compressing the key
part. The actual compression algorithm used to compress keys and/or values can
be specified by using the appropriate CompressionCodec.
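As a rough illustration of what finer-grained value compression can look like, the sketch below compresses each column of a row split independently, so a single column can be decompressed without touching the others. It is an assumption-laden sketch, not RCFile's code: `ColumnCompressionSketch` and its methods are hypothetical, and `java.util.zip.Deflater` stands in for whatever CompressionCodec is configured.

```java
import java.io.ByteArrayOutputStream;
import java.nio.charset.StandardCharsets;
import java.util.zip.DataFormatException;
import java.util.zip.Deflater;
import java.util.zip.Inflater;

// Illustrative sketch only; hypothetical names, Deflater used in place of a Hadoop codec.
final class ColumnCompressionSketch {

    /** Compresses one byte array with DEFLATE. */
    static byte[] compress(byte[] raw) {
        Deflater deflater = new Deflater();
        deflater.setInput(raw);
        deflater.finish();
        ByteArrayOutputStream out = new ByteArrayOutputStream();
        byte[] chunk = new byte[1024];
        while (!deflater.finished()) {
            int n = deflater.deflate(chunk);
            out.write(chunk, 0, n);
        }
        deflater.end();
        return out.toByteArray();
    }

    /** Reverses compress(); wraps format errors in an unchecked exception. */
    static byte[] decompress(byte[] compressed) {
        Inflater inflater = new Inflater();
        inflater.setInput(compressed);
        ByteArrayOutputStream out = new ByteArrayOutputStream();
        byte[] chunk = new byte[1024];
        try {
            while (!inflater.finished()) {
                int n = inflater.inflate(chunk);
                if (n == 0 && inflater.needsInput()) {
                    throw new IllegalStateException("truncated compressed data");
                }
                out.write(chunk, 0, n);
            }
        } catch (DataFormatException e) {
            throw new IllegalStateException("corrupt compressed data", e);
        } finally {
            inflater.end();
        }
        return out.toByteArray();
    }

    /** Compresses every column separately; columns[c] holds the values of column c. */
    static byte[][] compressColumns(String[][] columns) {
        byte[][] result = new byte[columns.length][];
        for (int c = 0; c < columns.length; c++) {
            // Newline-joined values are a simplification of a real column layout.
            String joined = String.join("\n", columns[c]);
            result[c] = compress(joined.getBytes(StandardCharsets.UTF_8));
        }
        return result;
    }

    public static void main(String[] args) {
        String[][] columns = { {"1", "2", "3"}, {"alice", "bob", "carol"} };
        byte[][] compressed = compressColumns(columns);
        // Only the second column is decompressed here.
        System.out.println(new String(decompress(compressed[1]), StandardCharsets.UTF_8));
    }
}
```

The design point is that a reader interested in one column pays the decompression cost for that column only, which per-record compression would not allow.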
The RCFile.Reader is used to read and interpret the bytes of an RCFile.
CompressionCodec class which is used for compression of keys and/or values (if compression is enabled).
SequenceFile.Metadata for this file.
| Nested Class Summary | |
|---|---|
| static class | RCFile.KeyBuffer: KeyBuffer is the key of each record in RCFile. |
| static class | RCFile.Reader: Read KeyBuffer/ValueBuffer pairs from a RCFile. |
| static class | RCFile.ValueBuffer: ValueBuffer is the value of each record in RCFile. |
| static class | RCFile.Writer: Write KeyBuffer/ValueBuffer pairs to a RCFile. |
| Field Summary | |
|---|---|
| static String | BLOCK_MISSING_MESSAGE |
| static String | COLUMN_NUMBER_CONF_STR |
| static String | COLUMN_NUMBER_METADATA_STR |
| static String | RECORD_INTERVAL_CONF_STR |
| static int | SYNC_INTERVAL: The number of bytes between sync points. |
| static String | TOLERATE_CORRUPTIONS_CONF_STR |
| Constructor Summary | |
|---|---|
| RCFile() | |
| Method Summary | |
|---|---|
| static org.apache.hadoop.io.SequenceFile.Metadata | createMetadata(org.apache.hadoop.io.Text... values): Create a metadata object with alternating key-value pairs. |
| Methods inherited from class java.lang.Object |
|---|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
| Field Detail |
|---|
public static final String RECORD_INTERVAL_CONF_STR
public static final String COLUMN_NUMBER_METADATA_STR
public static final String COLUMN_NUMBER_CONF_STR
public static final String TOLERATE_CORRUPTIONS_CONF_STR
public static final String BLOCK_MISSING_MESSAGE
public static final int SYNC_INTERVAL
| Constructor Detail |
|---|
public RCFile()
| Method Detail |
|---|
public static org.apache.hadoop.io.SequenceFile.Metadata createMetadata(org.apache.hadoop.io.Text... values)
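The "alternating key-value pairs" convention means the varargs are read as key, value, key, value, and so on, as in a call such as RCFile.createMetadata(new Text("k1"), new Text("v1"), new Text("k2"), new Text("v2")). The sketch below mimics that pairing with plain strings so it runs without Hadoop on the classpath. `MetadataPairsSketch` and `fromAlternating` are hypothetical names, and rejecting an odd number of arguments is an assumption of this sketch rather than something this page documents.

```java
import java.util.Map;
import java.util.TreeMap;

// Illustrative sketch only; hypothetical names, strings stand in for org.apache.hadoop.io.Text.
final class MetadataPairsSketch {

    /** Builds a sorted key/value map from arguments given as key, value, key, value, ... */
    static Map<String, String> fromAlternating(String... values) {
        if (values.length % 2 != 0) {
            // Assumption: an unmatched trailing key is treated as a caller error.
            throw new IllegalArgumentException(
                "Expected matched key-value pairs, got " + values.length + " strings");
        }
        Map<String, String> result = new TreeMap<>();
        for (int i = 0; i < values.length; i += 2) {
            result.put(values[i], values[i + 1]);
        }
        return result;
    }

    public static void main(String[] args) {
        System.out.println(fromAlternating("k1", "v1", "k2", "v2"));
    }
}
```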