Class SparkMaterializedGroupByOperator<Type,KeyType>
java.lang.Object
org.apache.wayang.core.plan.wayangplan.OperatorBase
org.apache.wayang.core.plan.wayangplan.UnaryToUnaryOperator<Type,Iterable<Type>>
org.apache.wayang.basic.operators.MaterializedGroupByOperator<Type,KeyType>
org.apache.wayang.spark.operators.SparkMaterializedGroupByOperator<Type,KeyType>
- All Implemented Interfaces:
Serializable
,ActualOperator
,ElementaryOperator
,ExecutionOperator
,Operator
,SparkExecutionOperator
public class SparkMaterializedGroupByOperator<Type,KeyType>
extends MaterializedGroupByOperator<Type,KeyType>
implements SparkExecutionOperator
Spark implementation of the
MaterializedGroupByOperator
.- See Also:
-
Nested Class Summary
Nested classes/interfaces inherited from class org.apache.wayang.core.plan.wayangplan.OperatorBase
OperatorBase.GsonSerializer
-
Field Summary
Fields inherited from class org.apache.wayang.basic.operators.MaterializedGroupByOperator
keyDescriptor
Fields inherited from class org.apache.wayang.core.plan.wayangplan.OperatorBase
inputSlots, outputSlots, STANDARD_OPERATOR_ARGS
Fields inherited from interface org.apache.wayang.core.plan.wayangplan.Operator
FIRST_EPOCH
-
Constructor Summary
ConstructorsConstructorDescriptionCopies an instance (exclusive of broadcasts).SparkMaterializedGroupByOperator
(TransformationDescriptor<Type, KeyType> keyDescriptor, DataSetType<Type> inputType, DataSetType<Iterable<Type>> outputType) -
Method Summary
Modifier and TypeMethodDescriptionboolean
Tell whether this instances is a Spark action.protected ExecutionOperator
createLoadProfileEstimator
(Configuration configuration) Developers ofExecutionOperator
s can provide a defaultLoadProfileEstimator
via this method.evaluate
(ChannelInstance[] inputs, ChannelInstance[] outputs, SparkExecutor sparkExecutor, OptimizationContext.OperatorContext operatorContext) Evaluates this operator.getSupportedInputChannels
(int index) getSupportedOutputChannels
(int index) Display the supportedChannel
s for a certainOutputSlot
.Methods inherited from class org.apache.wayang.basic.operators.MaterializedGroupByOperator
createCardinalityEstimator, getKeyDescriptor
Methods inherited from class org.apache.wayang.core.plan.wayangplan.UnaryToUnaryOperator
getInput, getInputType, getOutput, getOutputType
Methods inherited from class org.apache.wayang.core.plan.wayangplan.OperatorBase
accept, addBroadcastInput, addTargetPlatform, at, collectMappedInputSlots, collectMappedOutputSlots, copy, getAllInputs, getAllOutputs, getCardinalityEstimator, getContainer, getEpoch, getName, getOriginal, getSimpleClassName, getTargetPlatforms, isAuxiliary, isSupportingBroadcastInputs, propagateInputCardinality, propagateOutputCardinality, setAuxiliary, setCardinalityEstimator, setContainer, setEpoch, setName, toString
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, wait, wait, wait
Methods inherited from interface org.apache.wayang.core.plan.wayangplan.ActualOperator
accept
Methods inherited from interface org.apache.wayang.core.plan.wayangplan.ElementaryOperator
createCardinalityEstimator, getCardinalityEstimator, isAuxiliary, setAuxiliary, setCardinalityEstimator
Methods inherited from interface org.apache.wayang.core.plan.wayangplan.ExecutionOperator
copy, createOutputChannelInstances, getLimitBaseKey, getLoadProfileEstimatorConfigurationKeys, getOriginal, getOutputChannelDescriptor, isFiltered
Methods inherited from interface org.apache.wayang.core.plan.wayangplan.Operator
addBroadcastInput, addTargetPlatform, broadcastTo, broadcastTo, collectMappedInputSlots, collectMappedOutputSlots, connectTo, connectTo, getAllInputs, getAllOutputs, getCardinalityPusher, getContainer, getEffectiveOccupant, getEffectiveOccupant, getEpoch, getEstimationContextProperties, getForwards, getInnermostLoop, getInput, getInput, getLoopStack, getName, getNumBroadcastInputs, getNumInputs, getNumOutputs, getNumRegularInputs, getOuterInputSlot, getOutermostInputSlot, getOutermostOutputSlots, getOutput, getOutput, getParent, getTargetPlatforms, isAlternative, isConversion, isElementary, isExecutionOperator, isFeedbackInput, isFeedforwardOutput, isLoopHead, isLoopSubplan, isOwnerOf, isReading, isSink, isSource, isSubplan, isSupportingBroadcastInputs, isUnconnected, propagateInputCardinality, propagateOutputCardinality, propagateOutputCardinality, replaceWith, setContainer, setEpoch, setInput, setName, setOutput
Methods inherited from interface org.apache.wayang.spark.operators.SparkExecutionOperator
getPlatform, name, name
-
Constructor Details
-
SparkMaterializedGroupByOperator
public SparkMaterializedGroupByOperator(TransformationDescriptor<Type, KeyType> keyDescriptor, DataSetType<Type> inputType, DataSetType<Iterable<Type>> outputType) -
SparkMaterializedGroupByOperator
Copies an instance (exclusive of broadcasts).- Parameters:
that
- that should be copied
-
-
Method Details
-
evaluate
public Tuple<Collection<ExecutionLineageNode>,Collection<ChannelInstance>> evaluate(ChannelInstance[] inputs, ChannelInstance[] outputs, SparkExecutor sparkExecutor, OptimizationContext.OperatorContext operatorContext) Description copied from interface:SparkExecutionOperator
Evaluates this operator. Takes a set ofChannelInstance
s according to the operator inputs and manipulates a set ofChannelInstance
s according to the operator outputs -- unless the operator is a sink, then it triggers execution.In addition, this method should give feedback of what this instance was doing by wiring the
LazyExecutionLineageNode
s of input and ouputChannelInstance
s and providing aCollection
of executedExecutionLineageNode
s.- Specified by:
evaluate
in interfaceSparkExecutionOperator
- Parameters:
inputs
-ChannelInstance
s that satisfy the inputs of this operatoroutputs
-ChannelInstance
s that accept the outputs of this operatorsparkExecutor
-SparkExecutor
that executes this instanceoperatorContext
- optimization information for this instance- Returns:
Collection
s of what has been executed and produced
-
createCopy
- Overrides:
createCopy
in classOperatorBase
-
getLoadProfileEstimatorConfigurationKey
- Specified by:
getLoadProfileEstimatorConfigurationKey
in interfaceExecutionOperator
-
createLoadProfileEstimator
Description copied from interface:ExecutionOperator
Developers ofExecutionOperator
s can provide a defaultLoadProfileEstimator
via this method.- Specified by:
createLoadProfileEstimator
in interfaceExecutionOperator
- Parameters:
configuration
- in which theLoadProfile
should be estimated.- Returns:
- an
Optional
that might contain theLoadProfileEstimator
(butOptional.empty()
by default)
-
getSupportedInputChannels
Description copied from interface:ExecutionOperator
- Specified by:
getSupportedInputChannels
in interfaceExecutionOperator
- Parameters:
index
- the index of theInputSlot
- Returns:
- an
List
ofChannel
s'Class
es, ordered by their preference of use
-
getSupportedOutputChannels
Description copied from interface:ExecutionOperator
Display the supportedChannel
s for a certainOutputSlot
.- Specified by:
getSupportedOutputChannels
in interfaceExecutionOperator
- Parameters:
index
- the index of theOutputSlot
- Returns:
- an
List
ofChannel
s'Class
es, ordered by their preference of use - See Also:
-
containsAction
public boolean containsAction()Description copied from interface:SparkExecutionOperator
Tell whether this instances is a Spark action. This is important to keep track on when Spark is actually initialized.- Specified by:
containsAction
in interfaceSparkExecutionOperator
- Returns:
- whether this instance issues Spark actions
-