Class SparkShufflePartitionSampleOperator<Type>
java.lang.Object
org.apache.wayang.core.plan.wayangplan.OperatorBase
org.apache.wayang.core.plan.wayangplan.UnaryToUnaryOperator<Type,Type>
org.apache.wayang.basic.operators.SampleOperator<Type>
org.apache.wayang.spark.operators.SparkShufflePartitionSampleOperator<Type>
- All Implemented Interfaces:
Serializable, ActualOperator, ElementaryOperator, ExecutionOperator, Operator, SparkExecutionOperator
public class SparkShufflePartitionSampleOperator<Type>
extends SampleOperator<Type>
implements SparkExecutionOperator
Spark implementation of the SampleOperator.
-
Nested Class Summary
Nested classes/interfaces inherited from class org.apache.wayang.basic.operators.SampleOperator
SampleOperator.Methods
Nested classes/interfaces inherited from class org.apache.wayang.core.plan.wayangplan.OperatorBase
OperatorBase.GsonSerializer
-
Field Summary
Fields inherited from class org.apache.wayang.basic.operators.SampleOperator
datasetSize, logger, sampleSizeFunction, seedFunction, UNKNOWN_DATASET_SIZE
Fields inherited from class org.apache.wayang.core.plan.wayangplan.OperatorBase
inputSlots, outputSlots, STANDARD_OPERATOR_ARGS
Fields inherited from interface org.apache.wayang.core.plan.wayangplan.Operator
FIRST_EPOCH
-
Constructor Summary
Constructors
Constructor and Description
SparkShufflePartitionSampleOperator(that)
Copies an instance (exclusive of broadcasts).
SparkShufflePartitionSampleOperator(FunctionDescriptor.SerializableIntUnaryOperator sampleSizeFunction, DataSetType<Type> type, FunctionDescriptor.SerializableLongUnaryOperator seedFunction)
Creates a new instance.
-
Method Summary
Modifier and Type, Method, and Description
boolean containsAction()
Tell whether this instance is a Spark action.
protected ExecutionOperator createCopy
Tuple<Collection<ExecutionLineageNode>,Collection<ChannelInstance>> evaluate(ChannelInstance[] inputs, ChannelInstance[] outputs, SparkExecutor sparkExecutor, OptimizationContext.OperatorContext operatorContext)
Evaluates this operator.
getLoadProfileEstimatorConfigurationKeys
Provide the Configuration keys for the LoadProfileEstimator specification of this instance.
getSupportedInputChannels(int index)
getSupportedOutputChannels(int index)
Display the supported Channels for a certain OutputSlot.
Methods inherited from class org.apache.wayang.basic.operators.SampleOperator
createCardinalityEstimator, getDatasetSize, getSampleMethod, getSampleSize, getSeed, getType, isDataSetSizeKnown, randomSeed, setDatasetSize, setSampleMethod, setSeedFunction
Methods inherited from class org.apache.wayang.core.plan.wayangplan.UnaryToUnaryOperator
getInput, getInputType, getOutput, getOutputType
Methods inherited from class org.apache.wayang.core.plan.wayangplan.OperatorBase
accept, addBroadcastInput, addTargetPlatform, at, collectMappedInputSlots, collectMappedOutputSlots, copy, getAllInputs, getAllOutputs, getCardinalityEstimator, getContainer, getEpoch, getName, getOriginal, getSimpleClassName, getTargetPlatforms, isAuxiliary, isSupportingBroadcastInputs, propagateInputCardinality, propagateOutputCardinality, setAuxiliary, setCardinalityEstimator, setContainer, setEpoch, setName, toString
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, wait, wait, wait
Methods inherited from interface org.apache.wayang.core.plan.wayangplan.ActualOperator
accept
Methods inherited from interface org.apache.wayang.core.plan.wayangplan.ElementaryOperator
createCardinalityEstimator, getCardinalityEstimator, isAuxiliary, setAuxiliary, setCardinalityEstimator
Methods inherited from interface org.apache.wayang.core.plan.wayangplan.ExecutionOperator
copy, createLoadProfileEstimator, createOutputChannelInstances, getLimitBaseKey, getLoadProfileEstimatorConfigurationKey, getOriginal, getOutputChannelDescriptor, isFiltered
Methods inherited from interface org.apache.wayang.core.plan.wayangplan.Operator
addBroadcastInput, addTargetPlatform, broadcastTo, broadcastTo, collectMappedInputSlots, collectMappedOutputSlots, connectTo, connectTo, getAllInputs, getAllOutputs, getCardinalityPusher, getContainer, getEffectiveOccupant, getEffectiveOccupant, getEpoch, getEstimationContextProperties, getForwards, getInnermostLoop, getInput, getInput, getLoopStack, getName, getNumBroadcastInputs, getNumInputs, getNumOutputs, getNumRegularInputs, getOuterInputSlot, getOutermostInputSlot, getOutermostOutputSlots, getOutput, getOutput, getParent, getTargetPlatforms, isAlternative, isConversion, isElementary, isExecutionOperator, isFeedbackInput, isFeedforwardOutput, isLoopHead, isLoopSubplan, isOwnerOf, isReading, isSink, isSource, isSubplan, isSupportingBroadcastInputs, isUnconnected, propagateInputCardinality, propagateOutputCardinality, propagateOutputCardinality, replaceWith, setContainer, setEpoch, setInput, setName, setOutput
Methods inherited from interface org.apache.wayang.spark.operators.SparkExecutionOperator
getPlatform, name, name
-
Constructor Details
-
SparkShufflePartitionSampleOperator
public SparkShufflePartitionSampleOperator(FunctionDescriptor.SerializableIntUnaryOperator sampleSizeFunction, DataSetType<Type> type, FunctionDescriptor.SerializableLongUnaryOperator seedFunction)
Creates a new instance.
-
SparkShufflePartitionSampleOperator
Copies an instance (exclusive of broadcasts).
- Parameters:
that - the instance that should be copied
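The first constructor takes two functions, sampleSizeFunction and seedFunction. As a rough illustration of their shape, the sketch below declares hypothetical stand-in interfaces locally instead of using Wayang's FunctionDescriptor types, and it assumes that the argument passed to each function is an iteration number. That reading is an assumption made for illustration; this page does not state what the argument means.

```java
import java.io.Serializable;
import java.util.function.IntUnaryOperator;
import java.util.function.LongUnaryOperator;

final class SampleFunctionsSketch {

    // Hypothetical stand-ins for FunctionDescriptor.SerializableIntUnaryOperator
    // and FunctionDescriptor.SerializableLongUnaryOperator (not the Wayang types).
    interface SerializableIntUnaryOperator extends IntUnaryOperator, Serializable {}
    interface SerializableLongUnaryOperator extends LongUnaryOperator, Serializable {}

    // Returns the same sample size regardless of the (assumed) iteration number.
    static SerializableIntUnaryOperator fixedSampleSize(int size) {
        return iteration -> size;
    }

    // Derives a different but reproducible seed for each (assumed) iteration.
    static SerializableLongUnaryOperator seedPerIteration(long baseSeed) {
        return iteration -> baseSeed + iteration;
    }

    public static void main(String[] args) {
        SerializableIntUnaryOperator sampleSize = fixedSampleSize(10);
        SerializableLongUnaryOperator seed = seedPerIteration(42L);
        System.out.println("sample size in iteration 0: " + sampleSize.applyAsInt(0));
        System.out.println("seed in iteration 3: " + seed.applyAsLong(3));
    }
}
```

Keeping both functions serializable matters in a Spark setting because they may have to be shipped to worker processes along with the operator.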
-
-
Method Details
-
evaluate
public Tuple<Collection<ExecutionLineageNode>,Collection<ChannelInstance>> evaluate(ChannelInstance[] inputs, ChannelInstance[] outputs, SparkExecutor sparkExecutor, OptimizationContext.OperatorContext operatorContext)
Description copied from interface: SparkExecutionOperator
Evaluates this operator. Takes a set of ChannelInstances according to the operator inputs and manipulates a set of ChannelInstances according to the operator outputs, unless the operator is a sink, in which case it triggers execution.
In addition, this method should give feedback on what this instance was doing by wiring the LazyExecutionLineageNodes of the input and output ChannelInstances and providing a Collection of executed ExecutionLineageNodes.
- Specified by:
evaluate in interface SparkExecutionOperator
- Parameters:
inputs - ChannelInstances that satisfy the inputs of this operator
outputs - ChannelInstances that accept the outputs of this operator
sparkExecutor - SparkExecutor that executes this instance
operatorContext - optimization information for this instance
- Returns:
Collections of what has been executed and produced
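This page does not describe the sampling strategy that evaluate applies. Judging only by the class name, one plausible reading is: pick a partition at random, shuffle it, take tuples from the front, and move on to the next partition if more tuples are needed. The sketch below illustrates that reading on plain in-memory lists standing in for Spark partitions. Every name in it is hypothetical, and it is not the operator's actual implementation.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Collections;
import java.util.List;
import java.util.Random;

final class ShufflePartitionSampleSketch {

    // Draws up to sampleSize elements: starts at a randomly chosen partition,
    // shuffles a copy of it, takes elements from the front, and continues with
    // the following partitions (wrapping around) until enough are collected.
    static <T> List<T> sample(List<List<T>> partitions, int sampleSize, long seed) {
        List<T> result = new ArrayList<>();
        if (partitions.isEmpty() || sampleSize <= 0) {
            return result;
        }
        Random random = new Random(seed);
        int start = random.nextInt(partitions.size());
        for (int i = 0; i < partitions.size() && result.size() < sampleSize; i++) {
            // Copy before shuffling so the caller's partition is left untouched.
            List<T> shuffled = new ArrayList<>(partitions.get((start + i) % partitions.size()));
            Collections.shuffle(shuffled, random);
            int needed = sampleSize - result.size();
            result.addAll(shuffled.subList(0, Math.min(needed, shuffled.size())));
        }
        return result;
    }

    public static void main(String[] args) {
        List<List<Integer>> partitions = Arrays.asList(
                Arrays.asList(1, 2, 3, 4),
                Arrays.asList(5, 6));
        System.out.println(sample(partitions, 3, 42L));
    }
}
```

Only the partitions that are actually needed get shuffled, which is the appeal of this style of sampling compared with shuffling the whole dataset. The same seed always yields the same sample.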
-
createCopy
- Overrides:
createCopy in class OperatorBase
-
getLoadProfileEstimatorConfigurationKeys
Description copied from interface: ExecutionOperator
Provide the Configuration keys for the LoadProfileEstimator specification of this instance.
- Specified by:
getLoadProfileEstimatorConfigurationKeys in interface ExecutionOperator
- Returns:
the Configuration keys
-
getSupportedInputChannels
Description copied from interface: ExecutionOperator
- Specified by:
getSupportedInputChannels in interface ExecutionOperator
- Parameters:
index - the index of the InputSlot
- Returns:
a List of Channels' Classes, ordered by their preference of use
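Because the returned list is ordered by preference, a caller can take the first entry that is actually available. The sketch below shows that selection with plain strings standing in for channel descriptors. The labels and the pickChannel helper are hypothetical and are not part of the Wayang API.

```java
import java.util.Arrays;
import java.util.Collection;
import java.util.List;
import java.util.Optional;

final class ChannelPreferenceSketch {

    // Returns the most preferred supported channel that is also available,
    // or an empty Optional if none of the supported channels is available.
    static Optional<String> pickChannel(List<String> supportedInPreferenceOrder,
                                        Collection<String> available) {
        return supportedInPreferenceOrder.stream()
                .filter(available::contains)
                .findFirst();
    }

    public static void main(String[] args) {
        // Hypothetical channel labels, most preferred first.
        List<String> supported = Arrays.asList("uncached", "cached");
        List<String> available = Arrays.asList("cached", "uncached");
        System.out.println(pickChannel(supported, available).orElse("none"));
    }
}
```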
-
getSupportedOutputChannels
Description copied from interface: ExecutionOperator
Display the supported Channels for a certain OutputSlot.
- Specified by:
getSupportedOutputChannels in interface ExecutionOperator
- Parameters:
index - the index of the OutputSlot
- Returns:
a List of Channels' Classes, ordered by their preference of use
-
containsAction
public boolean containsAction()
Description copied from interface: SparkExecutionOperator
Tell whether this instance is a Spark action. This is important for keeping track of when Spark is actually initialized.
- Specified by:
containsAction in interface SparkExecutionOperator
- Returns:
whether this instance issues Spark actions
-