Reference documentation and code samples for the Cloud Dataproc V1 API class Google::Cloud::Dataproc::V1::PySparkJob.
A Dataproc job for running Apache PySpark applications on YARN.
Inherits
- Object
Extended By
- Google::Protobuf::MessageExts::ClassMethods
Includes
- Google::Protobuf::MessageExts
Methods
#archive_uris
def archive_uris() -> ::Array<::String>
Returns
- (::Array<::String>) — Optional. HCFS URIs of archives to be extracted into the working directory of each executor. Supported file types: .jar, .tar, .tar.gz, .tgz, and .zip.
#archive_uris=
def archive_uris=(value) -> ::Array<::String>
Parameter
- value (::Array<::String>) — Optional. HCFS URIs of archives to be extracted into the working directory of each executor. Supported file types: .jar, .tar, .tar.gz, .tgz, and .zip.
Returns
- (::Array<::String>) — Optional. HCFS URIs of archives to be extracted into the working directory of each executor. Supported file types: .jar, .tar, .tar.gz, .tgz, and .zip.
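The supported archive types listed above can be checked client-side before a job is submitted. The following is a minimal sketch in plain Ruby; the helper names, the bucket, and the paths are hypothetical and not part of the Dataproc API. It assumes extensions are matched case-sensitively, which the description does not specify.

```ruby
# Extensions taken from the archive_uris description above.
SUPPORTED_ARCHIVE_EXTENSIONS = %w[.jar .tar .tar.gz .tgz .zip].freeze

# Hypothetical helper: true when the URI ends in a supported archive extension.
# Assumption: matching is case-sensitive.
def supported_archive_uri?(uri)
  SUPPORTED_ARCHIVE_EXTENSIONS.any? { |ext| uri.end_with?(ext) }
end

# Hypothetical helper: returns the URIs that do not match any supported type.
def unsupported_archive_uris(uris)
  uris.reject { |uri| supported_archive_uri?(uri) }
end

# Placeholder bucket and object names, for illustration only.
archive_uris = [
  "gs://example-bucket/deps/env.tar.gz",
  "gs://example-bucket/deps/data.rar"
]

rejected = unsupported_archive_uris(archive_uris)
puts "Unsupported archive URIs: #{rejected.join(', ')}" unless rejected.empty?
```

A check like this only catches obvious mistakes early; the service remains the authority on what it accepts.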
#args
def args() -> ::Array<::String>
Returns
- (::Array<::String>) — Optional. The arguments to pass to the driver. Do not include arguments, such as --conf, that can be set as job properties, since a collision may occur that causes an incorrect job submission.
#args=
def args=(value) -> ::Array<::String>
Parameter
- value (::Array<::String>) — Optional. The arguments to pass to the driver. Do not include arguments, such as --conf, that can be set as job properties, since a collision may occur that causes an incorrect job submission.
Returns
- (::Array<::String>) — Optional. The arguments to pass to the driver. Do not include arguments, such as --conf, that can be set as job properties, since a collision may occur that causes an incorrect job submission.
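One way to follow the guidance on --conf is to move such settings out of the argument list and into a properties hash before assigning either field. The sketch below uses plain Ruby only; split_conf_args is a hypothetical helper, and the argument values are placeholders.

```ruby
# Hypothetical helper: separates "--conf key=value" settings from driver args.
# Returns a two-element array: [driver_args, properties].
def split_conf_args(raw_args)
  driver_args = []
  properties = {}
  pending = raw_args.dup

  until pending.empty?
    arg = pending.shift
    if arg == "--conf" && !pending.empty?
      # Form: --conf key=value (two separate elements)
      key, value = pending.shift.split("=", 2)
      properties[key] = value.to_s
    elsif arg.start_with?("--conf=")
      # Form: --conf=key=value (single element)
      key, value = arg.delete_prefix("--conf=").split("=", 2)
      properties[key] = value.to_s
    else
      driver_args << arg
    end
  end

  [driver_args, properties]
end

# Placeholder arguments, for illustration only.
raw_args = [
  "--input", "gs://example-bucket/in",
  "--conf", "spark.executor.memory=4g",
  "--conf=spark.executor.cores=2"
]

driver_args, properties = split_conf_args(raw_args)
puts "args:       #{driver_args.join(' ')}"
properties.each { |name, value| puts "property:   #{name}=#{value}" }
```

The resulting driver_args would be assigned to args and the extracted settings merged into properties, so each setting is specified in only one place.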
#file_uris
def file_uris() -> ::Array<::String>
Returns
- (::Array<::String>) — Optional. HCFS URIs of files to be placed in the working directory of each executor. Useful for naively parallel tasks.
#file_uris=
def file_uris=(value) -> ::Array<::String>
Parameter
- value (::Array<::String>) — Optional. HCFS URIs of files to be placed in the working directory of each executor. Useful for naively parallel tasks.
Returns
- (::Array<::String>) — Optional. HCFS URIs of files to be placed in the working directory of each executor. Useful for naively parallel tasks.
#jar_file_uris
def jar_file_uris() -> ::Array<::String>
Returns
- (::Array<::String>) — Optional. HCFS URIs of jar files to add to the CLASSPATHs of the Python driver and tasks.
#jar_file_uris=
def jar_file_uris=(value) -> ::Array<::String>
Parameter
- value (::Array<::String>) — Optional. HCFS URIs of jar files to add to the CLASSPATHs of the Python driver and tasks.
Returns
- (::Array<::String>) — Optional. HCFS URIs of jar files to add to the CLASSPATHs of the Python driver and tasks.
#logging_config
def logging_config() -> ::Google::Cloud::Dataproc::V1::LoggingConfig
Returns
- (::Google::Cloud::Dataproc::V1::LoggingConfig) — Optional. The runtime log config for job execution.
#logging_config=
def logging_config=(value) -> ::Google::Cloud::Dataproc::V1::LoggingConfig
Parameter
- value (::Google::Cloud::Dataproc::V1::LoggingConfig) — Optional. The runtime log config for job execution.
Returns
- (::Google::Cloud::Dataproc::V1::LoggingConfig) — Optional. The runtime log config for job execution.
#main_python_file_uri
def main_python_file_uri() -> ::String
Returns
- (::String) — Required. The HCFS URI of the main Python file to use as the driver. Must be a .py file.
#main_python_file_uri=
def main_python_file_uri=(value) -> ::String
Parameter
- value (::String) — Required. The HCFS URI of the main Python file to use as the driver. Must be a .py file.
Returns
- (::String) — Required. The HCFS URI of the main Python file to use as the driver. Must be a .py file.
#properties
def properties() -> ::Google::Protobuf::Map{::String => ::String}
Returns
- (::Google::Protobuf::Map{::String => ::String}) — Optional. A mapping of property names to values, used to configure PySpark. Properties that conflict with values set by the Dataproc API may be overwritten. Can include properties set in /etc/spark/conf/spark-defaults.conf and classes in user code.
#properties=
def properties=(value) -> ::Google::Protobuf::Map{::String => ::String}
Parameter
- value (::Google::Protobuf::Map{::String => ::String}) — Optional. A mapping of property names to values, used to configure PySpark. Properties that conflict with values set by the Dataproc API may be overwritten. Can include properties set in /etc/spark/conf/spark-defaults.conf and classes in user code.
Returns
- (::Google::Protobuf::Map{::String => ::String}) — Optional. A mapping of property names to values, used to configure PySpark. Properties that conflict with values set by the Dataproc API may be overwritten. Can include properties set in /etc/spark/conf/spark-defaults.conf and classes in user code.
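Because the map type is String to String, numeric or boolean settings need to be converted before they are assigned. A minimal sketch, assuming the settings start out in an ordinary Ruby Hash; stringify_properties is a hypothetical helper, and the property names are ordinary Spark settings chosen for illustration.

```ruby
# Hypothetical helper: converts every key and value to a String so the
# result fits a String => String map.
def stringify_properties(settings)
  settings.to_h { |key, value| [key.to_s, value.to_s] }
end

# Illustrative settings with non-String keys and values.
settings = {
  "spark.executor.cores" => 2,
  :"spark.dynamicAllocation.enabled" => true
}

properties = stringify_properties(settings)
properties.each { |name, value| puts "#{name}=#{value}" }
```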
#python_file_uris
def python_file_uris() -> ::Array<::String>
Returns
- (::Array<::String>) — Optional. HCFS file URIs of Python files to pass to the PySpark framework. Supported file types: .py, .egg, and .zip.
#python_file_uris=
def python_file_uris=(value) -> ::Array<::String>
Parameter
- value (::Array<::String>) — Optional. HCFS file URIs of Python files to pass to the PySpark framework. Supported file types: .py, .egg, and .zip.
Returns
- (::Array<::String>) — Optional. HCFS file URIs of Python files to pass to the PySpark framework. Supported file types: .py, .egg, and .zip.
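The fields documented above can be collected into a Hash, checked against the stated constraints, and passed to the message constructor as keyword arguments. The sketch below is illustrative only: build_pyspark_job_attrs is a hypothetical helper, the bucket and paths are placeholders, and the message is constructed only when the google-cloud-dataproc-v1 gem can be loaded.

```ruby
# Extensions taken from the python_file_uris description above.
PYTHON_FILE_EXTENSIONS = %w[.py .egg .zip].freeze

# Hypothetical helper: validates the documented constraints and returns a
# Hash of attributes keyed by the PySparkJob field names.
def build_pyspark_job_attrs(main_python_file_uri:, python_file_uris: [], args: [], properties: {})
  unless main_python_file_uri.end_with?(".py")
    raise ArgumentError, "main_python_file_uri must be a .py file: #{main_python_file_uri}"
  end

  unsupported = python_file_uris.reject do |uri|
    PYTHON_FILE_EXTENSIONS.any? { |ext| uri.end_with?(ext) }
  end
  unless unsupported.empty?
    raise ArgumentError, "unsupported python_file_uris: #{unsupported.join(', ')}"
  end

  {
    main_python_file_uri: main_python_file_uri,
    python_file_uris: python_file_uris,
    args: args,
    properties: properties
  }
end

# Placeholder bucket and object names, for illustration only.
attrs = build_pyspark_job_attrs(
  main_python_file_uri: "gs://example-bucket/jobs/main.py",
  python_file_uris: ["gs://example-bucket/jobs/helpers.zip"],
  args: ["--input", "gs://example-bucket/in"],
  properties: { "spark.executor.memory" => "4g" }
)

begin
  # Requires the google-cloud-dataproc-v1 gem; skipped when it is not installed.
  require "google/cloud/dataproc/v1"
  job = Google::Cloud::Dataproc::V1::PySparkJob.new(**attrs)
  puts "Built PySparkJob for #{job.main_python_file_uri}"
rescue LoadError
  puts "google-cloud-dataproc-v1 not available; attributes prepared for #{attrs[:main_python_file_uri]}"
end
```

Keeping validation in a plain Hash builder means the checks can run and be tested without the gem or network access.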