class PortableDataStream extends Serializable
A class that allows DataStreams to be serialized and moved around by not creating them until they need to be read
- Source
- PortableDataStream.scala
- Note
TaskAttemptContext is not serializable resulting in the confBytes construct
,CombineFileSplit is not serializable resulting in the splitBytes construct
- Alphabetic
- By Inheritance
- PortableDataStream
- Serializable
- Serializable
- AnyRef
- Any
- Hide All
- Show All
- Public
- All
Instance Constructors
- new PortableDataStream(isplit: CombineFileSplit, context: TaskAttemptContext, index: Integer)
Value Members
-
def
getConfiguration: Configuration
- Annotations
- @Since( "2.2.0" )
-
def
getPath(): String
- Annotations
- @Since( "1.2.0" )
-
def
open(): DataInputStream
Create a new DataInputStream from the split and context.
Create a new DataInputStream from the split and context. The user of this method is responsible for closing the stream after usage.
- Annotations
- @Since( "1.2.0" )
-
def
toArray(): Array[Byte]
Read the file as a byte array
Read the file as a byte array
- Annotations
- @Since( "1.2.0" )