class Google::Apis::DataprocV1::PySparkJob

A Cloud Dataproc job for running PySpark applications on YARN.

Attributes

archive_uris[RW]
Optional

HCFS URIs of archives to be extracted in the working directory of .

jar, .tar, .tar.gz, .tgz, and .zip. Corresponds to the JSON property `archiveUris` @return [Array<String>]

args[RW]
Optional

The arguments to pass to the driver. Do not include arguments, such

as `–conf`, that can be set as job properties, since a collision may occur that causes an incorrect job submission. Corresponds to the JSON property `args` @return [Array<String>]

file_uris[RW]
Optional

HCFS URIs of files to be copied to the working directory of Python

drivers and distributed tasks. Useful for naively parallel tasks. Corresponds to the JSON property `fileUris` @return [Array<String>]

jar_file_uris[RW]
Optional

HCFS URIs of jar files to add to the CLASSPATHs of the Python

driver and tasks. Corresponds to the JSON property `jarFileUris` @return [Array<String>]

logging_config[RW]

The runtime logging config of the job. Corresponds to the JSON property `loggingConfig` @return [Google::Apis::DataprocV1::LoggingConfig]

main_python_file_uri[RW]
Required

The HCFS URI of the main Python file to use as the driver. Must be

a .py file. Corresponds to the JSON property `mainPythonFileUri` @return [String]

properties[RW]
Optional

A mapping of property names to values, used to configure PySpark.

Properties that conflict with values set by the Cloud Dataproc API may be overwritten. Can include properties set in /etc/spark/conf/spark-defaults.conf and classes in user code. Corresponds to the JSON property `properties` @return [Hash<String,String>]

python_file_uris[RW]
Optional

HCFS file URIs of Python files to pass to the PySpark framework.

Supported file types: .py, .egg, and .zip. Corresponds to the JSON property `pythonFileUris` @return [Array<String>]

Public Class Methods

new(**args) click to toggle source
# File generated/google/apis/dataproc_v1/classes.rb, line 1015
def initialize(**args)
   update!(**args)
end

Public Instance Methods

update!(**args) click to toggle source

Update properties of this object

# File generated/google/apis/dataproc_v1/classes.rb, line 1020
def update!(**args)
  @main_python_file_uri = args[:main_python_file_uri] if args.key?(:main_python_file_uri)
  @args = args[:args] if args.key?(:args)
  @python_file_uris = args[:python_file_uris] if args.key?(:python_file_uris)
  @jar_file_uris = args[:jar_file_uris] if args.key?(:jar_file_uris)
  @file_uris = args[:file_uris] if args.key?(:file_uris)
  @archive_uris = args[:archive_uris] if args.key?(:archive_uris)
  @properties = args[:properties] if args.key?(:properties)
  @logging_config = args[:logging_config] if args.key?(:logging_config)
end