Class AmazonS3Source

All Implemented Interfaces:
Serializable, ActualOperator, ElementaryOperator, Operator
Direct Known Subclasses:
JavaAmazonS3Source

public class AmazonS3Source extends UnarySource<String>
This source reads a blob file stored in Amazon s3 and outputs the lines as data units.
See Also:
  • Constructor Details

    • AmazonS3Source

      public AmazonS3Source(String bucket, String blobName, String filePathToCredentialsFile)
    • AmazonS3Source

      public AmazonS3Source(String bucket, String blobName, String filePathToCredentialsFile, String encoding)
    • AmazonS3Source

      public AmazonS3Source(AmazonS3Source that)
      Copies an instance (exclusive of broadcasts).
      Parameters:
      that - that should be copied
  • Method Details

    • getEncoding

      public String getEncoding()
    • getFilePathToCredentialsFile

      public String getFilePathToCredentialsFile()
    • getBucket

      public String getBucket()
    • getBlobName

      public String getBlobName()
    • getBlobByteSize

      public OptionalLong getBlobByteSize()
      Returns:
      the total size of the bytes in the Blob file.
    • getInputStream

      public InputStream getInputStream() throws Exception
      Retrieves an InputStream to the specified S3 blob file.
      Returns:
      InputStream to the Blob file.
      Throws:
      Exception - if an error occurs during S3 client creation or file rertrieval.