Class AmazonS3Source
java.lang.Object
org.apache.wayang.core.plan.wayangplan.OperatorBase
org.apache.wayang.core.plan.wayangplan.UnarySource<String>
org.apache.wayang.basic.operators.AmazonS3Source
- All Implemented Interfaces:
Serializable, ActualOperator, ElementaryOperator, Operator
- Direct Known Subclasses:
JavaAmazonS3Source
This source reads a blob file stored in Amazon S3 and outputs its lines as data units.
-
Nested Class Summary
Nested Classes
Modifier and Type: protected class
Description: Custom CardinalityEstimator for FlatMapOperators.
Nested classes/interfaces inherited from class org.apache.wayang.core.plan.wayangplan.OperatorBase
OperatorBase.GsonSerializer
-
Field Summary
Fields inherited from class org.apache.wayang.core.plan.wayangplan.OperatorBase
inputSlots, outputSlots, STANDARD_OPERATOR_ARGS
Fields inherited from interface org.apache.wayang.core.plan.wayangplan.Operator
FIRST_EPOCH
-
Constructor Summary
Constructors
AmazonS3Source(String bucket, String blobName, String filePathToCredentialsFile)
AmazonS3Source(String bucket, String blobName, String filePathToCredentialsFile, String encoding)
AmazonS3Source(AmazonS3Source that): Copies an instance (exclusive of broadcasts).
-
Method Summary
Methods
getInputStream: Retrieves an InputStream to the specified S3 blob file.
Methods inherited from class org.apache.wayang.core.plan.wayangplan.UnarySource
getOutput, getType
Methods inherited from class org.apache.wayang.core.plan.wayangplan.OperatorBase
accept, addBroadcastInput, addTargetPlatform, at, collectMappedInputSlots, collectMappedOutputSlots, copy, createCopy, getAllInputs, getAllOutputs, getCardinalityEstimator, getContainer, getEpoch, getName, getOriginal, getSimpleClassName, getTargetPlatforms, isAuxiliary, isSupportingBroadcastInputs, propagateInputCardinality, propagateOutputCardinality, setAuxiliary, setCardinalityEstimator, setContainer, setEpoch, setName, toString
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, wait, wait, wait
Methods inherited from interface org.apache.wayang.core.plan.wayangplan.ActualOperator
accept
Methods inherited from interface org.apache.wayang.core.plan.wayangplan.ElementaryOperator
createCardinalityEstimator, getCardinalityEstimator, isAuxiliary, setAuxiliary, setCardinalityEstimator
Methods inherited from interface org.apache.wayang.core.plan.wayangplan.Operator
addBroadcastInput, addTargetPlatform, broadcastTo, broadcastTo, collectMappedInputSlots, collectMappedOutputSlots, connectTo, connectTo, getAllInputs, getAllOutputs, getCardinalityPusher, getContainer, getEffectiveOccupant, getEffectiveOccupant, getEpoch, getEstimationContextProperties, getForwards, getInnermostLoop, getInput, getInput, getLoopStack, getName, getNumBroadcastInputs, getNumInputs, getNumOutputs, getNumRegularInputs, getOuterInputSlot, getOutermostInputSlot, getOutermostOutputSlots, getOutput, getOutput, getParent, getTargetPlatforms, isAlternative, isConversion, isElementary, isExecutionOperator, isFeedbackInput, isFeedforwardOutput, isLoopHead, isLoopSubplan, isOwnerOf, isReading, isSink, isSource, isSubplan, isSupportingBroadcastInputs, isUnconnected, propagateInputCardinality, propagateOutputCardinality, propagateOutputCardinality, replaceWith, setContainer, setEpoch, setInput, setName, setOutput
-
Constructor Details
-
AmazonS3Source
-
AmazonS3Source
-
AmazonS3Source
Copies an instance (exclusive of broadcasts).
- Parameters:
that - that should be copied
-
-
Method Details
-
getEncoding
-
getFilePathToCredentialsFile
-
getBucket
-
getBlobName
-
getBlobByteSize
- Returns:
- the total size of the blob file in bytes.
-
getInputStream
Retrieves an InputStream to the specified S3 blob file.
- Returns:
- InputStream to the blob file.
- Throws:
Exception
- if an error occurs during S3 client creation or file retrieval.
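Because this source outputs the blob's lines as data units, a consumer of getInputStream() would typically decode the stream line by line. The following is a minimal sketch of that step under stated assumptions, not Wayang's actual implementation: BlobLineReaderSketch and readLines are hypothetical names, the ByteArrayInputStream is a stand-in for the S3 stream, and the encoding argument is assumed to correspond to the value returned by getEncoding().

```java
import java.io.BufferedReader;
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.InputStreamReader;
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper, not part of Apache Wayang.
public class BlobLineReaderSketch {

    /**
     * Decodes the given stream with the named encoding and returns its lines.
     * The stream is closed when reading finishes or fails.
     */
    public static List<String> readLines(InputStream in, String encoding) throws IOException {
        List<String> lines = new ArrayList<>();
        try (BufferedReader reader = new BufferedReader(
                new InputStreamReader(in, Charset.forName(encoding)))) {
            String line;
            while ((line = reader.readLine()) != null) {
                lines.add(line); // one line corresponds to one data unit
            }
        }
        return lines;
    }

    public static void main(String[] args) throws IOException {
        // Assumption: in real use this stream would come from getInputStream().
        InputStream blob = new ByteArrayInputStream(
                "first\nsecond\nthird".getBytes(StandardCharsets.UTF_8));
        for (String line : readLines(blob, "UTF-8")) {
            System.out.println(line);
        }
    }
}
```

Wrapping the stream in try-with-resources matters here because getInputStream() is declared to throw Exception and the underlying connection should be released even when reading fails partway.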
-