Class: Google::Apis::DataprocV1beta2::PySparkJob

Inherits:
Object
  • Object
show all
Includes:
Core::Hashable, Core::JsonObjectSupport
Defined in:
lib/google/apis/dataproc_v1beta2/classes.rb,
lib/google/apis/dataproc_v1beta2/representations.rb,
lib/google/apis/dataproc_v1beta2/representations.rb

Overview

A Dataproc job for running Apache PySpark (https://spark.apache.org/docs/0.9.0/ python-programming-guide.html) applications on YARN.

Instance Attribute Summary collapse

Instance Method Summary collapse

Constructor Details

#initialize(**args) ⇒ PySparkJob

Returns a new instance of PySparkJob.



2785
2786
2787
# File 'lib/google/apis/dataproc_v1beta2/classes.rb', line 2785

def initialize(**args)
   update!(**args)
end

Instance Attribute Details

#archive_urisArray<String>

Optional. HCFS URIs of archives to be extracted into the working directory of each executor. Supported file types: .jar, .tar, .tar.gz, .tgz, and .zip. Corresponds to the JSON property archiveUris

Returns:

  • (Array<String>)


2739
2740
2741
# File 'lib/google/apis/dataproc_v1beta2/classes.rb', line 2739

def archive_uris
  @archive_uris
end

#argsArray<String>

Optional. The arguments to pass to the driver. Do not include arguments, such as --conf, that can be set as job properties, since a collision may occur that causes an incorrect job submission. Corresponds to the JSON property args

Returns:

  • (Array<String>)


2746
2747
2748
# File 'lib/google/apis/dataproc_v1beta2/classes.rb', line 2746

def args
  @args
end

#file_urisArray<String>

Optional. HCFS URIs of files to be placed in the working directory of each executor. Useful for naively parallel tasks. Corresponds to the JSON property fileUris

Returns:

  • (Array<String>)


2752
2753
2754
# File 'lib/google/apis/dataproc_v1beta2/classes.rb', line 2752

def file_uris
  @file_uris
end

#jar_file_urisArray<String>

Optional. HCFS URIs of jar files to add to the CLASSPATHs of the Python driver and tasks. Corresponds to the JSON property jarFileUris

Returns:

  • (Array<String>)


2758
2759
2760
# File 'lib/google/apis/dataproc_v1beta2/classes.rb', line 2758

def jar_file_uris
  @jar_file_uris
end

#logging_configGoogle::Apis::DataprocV1beta2::LoggingConfig

The runtime logging config of the job. Corresponds to the JSON property loggingConfig



2763
2764
2765
# File 'lib/google/apis/dataproc_v1beta2/classes.rb', line 2763

def logging_config
  @logging_config
end

#main_python_file_uriString

Required. The HCFS URI of the main Python file to use as the driver. Must be a .py file. Corresponds to the JSON property mainPythonFileUri

Returns:

  • (String)


2769
2770
2771
# File 'lib/google/apis/dataproc_v1beta2/classes.rb', line 2769

def main_python_file_uri
  @main_python_file_uri
end

#propertiesHash<String,String>

Optional. A mapping of property names to values, used to configure PySpark. Properties that conflict with values set by the Dataproc API may be overwritten. Can include properties set in /etc/spark/conf/spark-defaults.conf and classes in user code. Corresponds to the JSON property properties

Returns:

  • (Hash<String,String>)


2777
2778
2779
# File 'lib/google/apis/dataproc_v1beta2/classes.rb', line 2777

def properties
  @properties
end

#python_file_urisArray<String>

Optional. HCFS file URIs of Python files to pass to the PySpark framework. Supported file types: .py, .egg, and .zip. Corresponds to the JSON property pythonFileUris

Returns:

  • (Array<String>)


2783
2784
2785
# File 'lib/google/apis/dataproc_v1beta2/classes.rb', line 2783

def python_file_uris
  @python_file_uris
end

Instance Method Details

#update!(**args) ⇒ Object

Update properties of this object



2790
2791
2792
2793
2794
2795
2796
2797
2798
2799
# File 'lib/google/apis/dataproc_v1beta2/classes.rb', line 2790

def update!(**args)
  @archive_uris = args[:archive_uris] if args.key?(:archive_uris)
  @args = args[:args] if args.key?(:args)
  @file_uris = args[:file_uris] if args.key?(:file_uris)
  @jar_file_uris = args[:jar_file_uris] if args.key?(:jar_file_uris)
  @logging_config = args[:logging_config] if args.key?(:logging_config)
  @main_python_file_uri = args[:main_python_file_uri] if args.key?(:main_python_file_uri)
  @properties = args[:properties] if args.key?(:properties)
  @python_file_uris = args[:python_file_uris] if args.key?(:python_file_uris)
end