Class: Aws::SageMaker::Types::TabularJobConfig

Inherits:
Struct
  • Object
show all
Includes:
Aws::Structure
Defined in:
lib/aws-sdk-sagemaker/types.rb

Overview

The collection of settings used by an AutoML job V2 for the tabular problem type.

Constant Summary collapse

SENSITIVE =
[]

Instance Attribute Summary collapse

Instance Attribute Details

#candidate_generation_configTypes::CandidateGenerationConfig

The configuration information of how model candidates are generated.



40098
40099
40100
40101
40102
40103
40104
40105
40106
40107
40108
40109
# File 'lib/aws-sdk-sagemaker/types.rb', line 40098

class TabularJobConfig < Struct.new(
  :candidate_generation_config,
  :completion_criteria,
  :feature_specification_s3_uri,
  :mode,
  :generate_candidate_definitions_only,
  :problem_type,
  :target_attribute_name,
  :sample_weight_attribute_name)
  SENSITIVE = []
  include Aws::Structure
end

#completion_criteriaTypes::AutoMLJobCompletionCriteria

How long a job is allowed to run, or how many candidates a job is allowed to generate.



40098
40099
40100
40101
40102
40103
40104
40105
40106
40107
40108
40109
# File 'lib/aws-sdk-sagemaker/types.rb', line 40098

class TabularJobConfig < Struct.new(
  :candidate_generation_config,
  :completion_criteria,
  :feature_specification_s3_uri,
  :mode,
  :generate_candidate_definitions_only,
  :problem_type,
  :target_attribute_name,
  :sample_weight_attribute_name)
  SENSITIVE = []
  include Aws::Structure
end

#feature_specification_s3_uriString

A URL to the Amazon S3 data source containing selected features from the input data source to run an Autopilot job V2. You can input ‘FeatureAttributeNames` (optional) in JSON format as shown below:

‘{ “FeatureAttributeNames”:[“col1”, “col2”, …] }`.

You can also specify the data type of the feature (optional) in the format shown below:

‘{ “FeatureDataTypes”:{“col1”:“numeric”, “col2”:“categorical” … } }`

<note markdown=“1”> These column keys may not include the target column.

</note>

In ensembling mode, Autopilot only supports the following data types: ‘numeric`, `categorical`, `text`, and `datetime`. In HPO mode, Autopilot can support `numeric`, `categorical`, `text`, `datetime`, and `sequence`.

If only ‘FeatureDataTypes` is provided, the column keys (`col1`, `col2`,..) should be a subset of the column names in the input data.

If both ‘FeatureDataTypes` and `FeatureAttributeNames` are provided, then the column keys should be a subset of the column names provided in `FeatureAttributeNames`.

The key name ‘FeatureAttributeNames` is fixed. The values listed in `[“col1”, “col2”, …]` are case sensitive and should be a list of strings containing unique values that are a subset of the column names in the input data. The list of columns provided must not include the target column.

Returns:

  • (String)


40098
40099
40100
40101
40102
40103
40104
40105
40106
40107
40108
40109
# File 'lib/aws-sdk-sagemaker/types.rb', line 40098

class TabularJobConfig < Struct.new(
  :candidate_generation_config,
  :completion_criteria,
  :feature_specification_s3_uri,
  :mode,
  :generate_candidate_definitions_only,
  :problem_type,
  :target_attribute_name,
  :sample_weight_attribute_name)
  SENSITIVE = []
  include Aws::Structure
end

#generate_candidate_definitions_onlyBoolean

Generates possible candidates without training the models. A model candidate is a combination of data preprocessors, algorithms, and algorithm parameter settings.

Returns:

  • (Boolean)


40098
40099
40100
40101
40102
40103
40104
40105
40106
40107
40108
40109
# File 'lib/aws-sdk-sagemaker/types.rb', line 40098

class TabularJobConfig < Struct.new(
  :candidate_generation_config,
  :completion_criteria,
  :feature_specification_s3_uri,
  :mode,
  :generate_candidate_definitions_only,
  :problem_type,
  :target_attribute_name,
  :sample_weight_attribute_name)
  SENSITIVE = []
  include Aws::Structure
end

#modeString

The method that Autopilot uses to train the data. You can either specify the mode manually or let Autopilot choose for you based on the dataset size by selecting ‘AUTO`. In `AUTO` mode, Autopilot chooses `ENSEMBLING` for datasets smaller than 100 MB, and `HYPERPARAMETER_TUNING` for larger ones.

The ‘ENSEMBLING` mode uses a multi-stack ensemble model to predict classification and regression tasks directly from your dataset. This machine learning mode combines several base models to produce an optimal predictive model. It then uses a stacking ensemble method to combine predictions from contributing members. A multi-stack ensemble model can provide better performance over a single model by combining the predictive capabilities of multiple models. See

Autopilot algorithm support][1

for a list of algorithms supported

by ‘ENSEMBLING` mode.

The ‘HYPERPARAMETER_TUNING` (HPO) mode uses the best hyperparameters to train the best version of a model. HPO automatically selects an algorithm for the type of problem you want to solve. Then HPO finds the best hyperparameters according to your objective metric. See

Autopilot algorithm support][1

for a list of algorithms supported

by ‘HYPERPARAMETER_TUNING` mode.

[1]: docs.aws.amazon.com/sagemaker/latest/dg/autopilot-model-support-validation.html#autopilot-algorithm-support

Returns:

  • (String)


40098
40099
40100
40101
40102
40103
40104
40105
40106
40107
40108
40109
# File 'lib/aws-sdk-sagemaker/types.rb', line 40098

class TabularJobConfig < Struct.new(
  :candidate_generation_config,
  :completion_criteria,
  :feature_specification_s3_uri,
  :mode,
  :generate_candidate_definitions_only,
  :problem_type,
  :target_attribute_name,
  :sample_weight_attribute_name)
  SENSITIVE = []
  include Aws::Structure
end

#problem_typeString

The type of supervised learning problem available for the model candidates of the AutoML job V2. For more information, see [ SageMaker Autopilot problem types].

<note markdown=“1”> You must either specify the type of supervised learning problem in ‘ProblemType` and provide the [AutoMLJobObjective] metric, or none at all.

</note>

[1]: docs.aws.amazon.com/sagemaker/latest/dg/autopilot-datasets-problem-types.html#autopilot-problem-types [2]: docs.aws.amazon.com/sagemaker/latest/APIReference/API_CreateAutoMLJobV2.html#sagemaker-CreateAutoMLJobV2-request-AutoMLJobObjective

Returns:

  • (String)


40098
40099
40100
40101
40102
40103
40104
40105
40106
40107
40108
40109
# File 'lib/aws-sdk-sagemaker/types.rb', line 40098

class TabularJobConfig < Struct.new(
  :candidate_generation_config,
  :completion_criteria,
  :feature_specification_s3_uri,
  :mode,
  :generate_candidate_definitions_only,
  :problem_type,
  :target_attribute_name,
  :sample_weight_attribute_name)
  SENSITIVE = []
  include Aws::Structure
end

#sample_weight_attribute_nameString

If specified, this column name indicates which column of the dataset should be treated as sample weights for use by the objective metric during the training, evaluation, and the selection of the best model. This column is not considered as a predictive feature. For more information on Autopilot metrics, see [Metrics and validation].

Sample weights should be numeric, non-negative, with larger values indicating which rows are more important than others. Data points that have invalid or no weight value are excluded.

Support for sample weights is available in [Ensembling] mode only.

[1]: docs.aws.amazon.com/sagemaker/latest/dg/autopilot-metrics-validation.html [2]: docs.aws.amazon.com/sagemaker/latest/APIReference/API_AutoMLAlgorithmConfig.html

Returns:

  • (String)


40098
40099
40100
40101
40102
40103
40104
40105
40106
40107
40108
40109
# File 'lib/aws-sdk-sagemaker/types.rb', line 40098

class TabularJobConfig < Struct.new(
  :candidate_generation_config,
  :completion_criteria,
  :feature_specification_s3_uri,
  :mode,
  :generate_candidate_definitions_only,
  :problem_type,
  :target_attribute_name,
  :sample_weight_attribute_name)
  SENSITIVE = []
  include Aws::Structure
end

#target_attribute_nameString

The name of the target variable in supervised learning, usually represented by ‘y’.

Returns:

  • (String)


40098
40099
40100
40101
40102
40103
40104
40105
40106
40107
40108
40109
# File 'lib/aws-sdk-sagemaker/types.rb', line 40098

class TabularJobConfig < Struct.new(
  :candidate_generation_config,
  :completion_criteria,
  :feature_specification_s3_uri,
  :mode,
  :generate_candidate_definitions_only,
  :problem_type,
  :target_attribute_name,
  :sample_weight_attribute_name)
  SENSITIVE = []
  include Aws::Structure
end