org.apache.spark.mllib.classification
Train a logistic regression model given an RDD of (label, features) pairs.
Train a logistic regression model given an RDD of (label, features) pairs. We run a fixed number of iterations of gradient descent using a step size of 1.0. We use the entire data set to update the gradient in each iteration. NOTE: Labels used in Logistic Regression should be {0, 1}
RDD of (label, array of features) pairs.
Number of iterations of gradient descent to run.
a LogisticRegressionModel which has the weights and offset from training.
Train a logistic regression model given an RDD of (label, features) pairs.
Train a logistic regression model given an RDD of (label, features) pairs. We run a fixed number of iterations of gradient descent using the specified step size. We use the entire data set to update the gradient in each iteration. NOTE: Labels used in Logistic Regression should be {0, 1}
RDD of (label, array of features) pairs.
Number of iterations of gradient descent to run.
Step size to be used for each iteration of Gradient Descent.
a LogisticRegressionModel which has the weights and offset from training.
Train a logistic regression model given an RDD of (label, features) pairs.
Train a logistic regression model given an RDD of (label, features) pairs. We run a fixed
number of iterations of gradient descent using the specified step size. Each iteration uses
miniBatchFraction
fraction of the data to calculate the gradient.
NOTE: Labels used in Logistic Regression should be {0, 1}
RDD of (label, array of features) pairs.
Number of iterations of gradient descent to run.
Step size to be used for each iteration of gradient descent.
Fraction of data to be used per iteration.
Train a logistic regression model given an RDD of (label, features) pairs.
Train a logistic regression model given an RDD of (label, features) pairs. We run a fixed
number of iterations of gradient descent using the specified step size. Each iteration uses
miniBatchFraction
fraction of the data to calculate the gradient. The weights used in
gradient descent are initialized using the initial weights provided.
NOTE: Labels used in Logistic Regression should be {0, 1}
RDD of (label, array of features) pairs.
Number of iterations of gradient descent to run.
Step size to be used for each iteration of gradient descent.
Fraction of data to be used per iteration.
Initial set of weights to be used. Array should be equal in size to the number of features in the data.
Top-level methods for calling Logistic Regression using Stochastic Gradient Descent. NOTE: Labels used in Logistic Regression should be {0, 1}
(Since version 2.0.0) Use ml.classification.LogisticRegression or LogisticRegressionWithLBFGS