public abstract class Transformer extends PipelineStage
| Constructor and Description | 
|---|
Transformer()  | 
| Modifier and Type | Method and Description | 
|---|---|
abstract Transformer | 
copy(ParamMap extra)
Creates a copy of this instance with the same UID and some extra params. 
 | 
abstract Dataset<Row> | 
transform(Dataset<?> dataset)
Transforms the input dataset. 
 | 
Dataset<Row> | 
transform(Dataset<?> dataset,
         ParamMap paramMap)
Transforms the dataset with provided parameter map as additional parameters. 
 | 
Dataset<Row> | 
transform(Dataset<?> dataset,
         ParamPair<?> firstParamPair,
         ParamPair<?>... otherParamPairs)
Transforms the dataset with optional parameters 
 | 
Dataset<Row> | 
transform(Dataset<?> dataset,
         ParamPair<?> firstParamPair,
         scala.collection.Seq<ParamPair<?>> otherParamPairs)
Transforms the dataset with optional parameters 
 | 
transformSchemaequals, getClass, hashCode, notify, notifyAll, toString, wait, wait, waitclear, copyValues, defaultCopy, defaultParamMap, explainParam, explainParams, extractParamMap, extractParamMap, get, getDefault, getOrDefault, getParam, hasDefault, hasParam, isDefined, isSet, paramMap, params, set, set, set, setDefault, setDefault, shouldOwntoString, uidinitializeLogging, initializeLogIfNecessary, initializeLogIfNecessary, isTraceEnabled, log_, log, logDebug, logDebug, logError, logError, logInfo, logInfo, logName, logTrace, logTrace, logWarning, logWarningpublic abstract Transformer copy(ParamMap extra)
ParamsdefaultCopy().copy in interface Paramscopy in class PipelineStageextra - (undocumented)public Dataset<Row> transform(Dataset<?> dataset, ParamPair<?> firstParamPair, ParamPair<?>... otherParamPairs)
dataset - input datasetfirstParamPair - the first param pair, overwrite embedded paramsotherParamPairs - other param pairs, overwrite embedded paramspublic Dataset<Row> transform(Dataset<?> dataset, ParamPair<?> firstParamPair, scala.collection.Seq<ParamPair<?>> otherParamPairs)
dataset - input datasetfirstParamPair - the first param pair, overwrite embedded paramsotherParamPairs - other param pairs, overwrite embedded paramspublic Dataset<Row> transform(Dataset<?> dataset, ParamMap paramMap)
dataset - input datasetparamMap - additional parameters, overwrite embedded params