Package org.apache.wayang.basic.operators
Class: Description

This source reads a blob file stored in Amazon S3 and outputs the lines as data units.
This source reads a blob file stored in Azure Blob Storage and outputs the lines as data units.
CartesianOperator<InputType0,InputType1>: This operator returns the cartesian product of elements of input datasets.
CoGroupOperator<InputType0,InputType1,Key>: This operator groups both inputs by some key and then matches groups with the same key.
This source takes as input a Java Collection.
CountOperator<Type>: This operator returns the count of elements in this stream.
DistinctOperator<Type>: This operator returns the distinct elements in this dataset.
DLTrainingOperator<X,Y>
DoWhileOperator<InputType,ConvergenceType>: This operator has three inputs and two outputs.
FilterOperator<Type>: This operator returns a new dataset after filtering by applying predicateDescriptor.
FlatMapOperator<InputType,OutputType>: A flatmap operator represents semantics as they are known from frameworks, such as Spark and Flink.
This operator groups the elements of a data set into a single data quantum.
GlobalReduceOperator<Type>: This operator groups the elements of a data set and aggregates the groups.
GroupByOperator<Input,Key>: This is the auxiliary GroupBy operator, i.e., it behaves differently depending on its context.
IntersectOperator<Type>: This operator returns the set intersection of elements of input datasets.
JoinOperator<InputType0,InputType1,Key>: This operator returns the cartesian product of elements of input datasets.
This UnarySink writes all incoming data quanta to a single Kafka topic.
This source reads messages from a Kafka topic and outputs the messages as data units.
This sink executes a callback on each received data unit into a Java Collection.
LoopOperator<InputType,ConvergenceType>: This operator has three inputs and three outputs.
MapOperator<InputType,OutputType>: A map operator represents semantics as they are known from frameworks, such as Spark and Flink.
MapPartitionsOperator<InputType,OutputType>: This operator takes as input potentially multiple input data quanta and outputs multiple input data quanta.
MaterializedGroupByOperator<Type,Key>: This operator collocates the data units in a data set w.r.t. a key function.
Deprecated.
This UnarySink writes all incoming data quanta to an Object file.
This source reads a text file and outputs the lines as data units.
Operator for the PageRank algorithm.
This source reads a parquet file and outputs the lines as Record units.
PredictOperator<X,Y>
PyWayangReduceByOperator<Type,Key>
ReduceByOperator<Type,Key>: This operator groups the elements of a data set and aggregates the groups.
ReduceOperator<Type>: This operator is context dependent: after a GroupByOperator, it is meant to be a ReduceByOperator; otherwise, it is a GlobalReduceOperator.
RepeatOperator<Type>
SampleOperator<Type>: A random sample operator randomly selects its inputs from the input slot and pushes that element to the output slot.
SortOperator<Type,Key>: This operator sorts the elements in this dataset.
UnarySource that provides the tuples from a database table.
TextFileSink<T>: This UnarySink writes all incoming data quanta to a text file.
This source reads a text file and outputs the lines as data units.
UnionAllOperator<Type>: This Operator creates the union (bag semantics) of two .
ZipWithIdOperator<InputType>: This operator attaches a unique ID to each input data quantum.
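
The summaries above describe each operator only in terms of what it does to a data set. As a rough illustration of those semantics, the following plain-Java sketch mimics three of them on in-memory lists: a whole-data-set reduce (as described for GlobalReduceOperator), a keyed reduce (as described for ReduceByOperator), and a cartesian product (as described for CartesianOperator). It does not use the Wayang API; the class name OperatorSemanticsSketch and its methods are hypothetical and exist only for this example. It assumes Java 11 or later.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.function.BinaryOperator;
import java.util.function.Function;

// Hypothetical helper class; NOT part of org.apache.wayang.basic.operators.
public class OperatorSemanticsSketch {

    // Global reduce: aggregate the whole data set into a single value.
    public static <T> T globalReduce(List<T> data, BinaryOperator<T> reducer) {
        return data.stream().reduce(reducer).orElseThrow();
    }

    // Keyed reduce: group elements by key, then aggregate each group.
    // A LinkedHashMap keeps keys in first-seen order so output is deterministic.
    public static <T, K> Map<K, T> reduceBy(List<T> data,
                                            Function<T, K> keyExtractor,
                                            BinaryOperator<T> reducer) {
        Map<K, T> result = new LinkedHashMap<>();
        for (T item : data) {
            result.merge(keyExtractor.apply(item), item, reducer);
        }
        return result;
    }

    // Cartesian product: pair every left element with every right element.
    public static <A, B> List<Map.Entry<A, B>> cartesian(List<A> left, List<B> right) {
        List<Map.Entry<A, B>> pairs = new ArrayList<>();
        for (A a : left) {
            for (B b : right) {
                pairs.add(Map.entry(a, b));
            }
        }
        return pairs;
    }

    public static void main(String[] args) {
        List<Integer> numbers = List.of(1, 2, 3, 4, 5);

        // One value for the whole data set.
        System.out.println(globalReduce(numbers, Integer::sum)); // 15

        // One value per key.
        Map<String, Integer> sums =
                reduceBy(numbers, n -> n % 2 == 0 ? "even" : "odd", Integer::sum);
        System.out.println(sums); // {odd=9, even=6}

        // 2 x 2 inputs yield 4 pairs.
        System.out.println(cartesian(List.of("a", "b"), List.of(1, 2)).size()); // 4
    }
}
```

The keyed and global variants differ only in whether a key function partitions the data before aggregation, which matches the ReduceOperator summary: its behaviour depends on whether a grouping step precedes it.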