Implementing Linear Regression in Python: Lasso Regression
Lasso regression is very similar to ridge regression; the difference lies in the regularization term each one uses. Both constrain the parameters and thereby prevent overfitting. But Lasso matters for another reason: it can drive the parameters of less useful features exactly to 0, producing a sparse solution. In other words, training the model this way also performs dimensionality reduction (feature selection).
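To see the sparsity effect concretely, here is a minimal sketch using scikit-learn (not part of the from-scratch code below; the toy data is made up purely for illustration):

import numpy as np
from sklearn.linear_model import Lasso, Ridge

rng = np.random.RandomState(0)
X = rng.randn(100, 10)
# Only the first two features actually influence y
y = 3 * X[:, 0] - 2 * X[:, 1] + 0.1 * rng.randn(100)

lasso = Lasso(alpha=0.1).fit(X, y)
ridge = Ridge(alpha=0.1).fit(X, y)

# Lasso typically drives the eight irrelevant coefficients to exactly 0,
# while ridge only shrinks them toward (but not to) zero.
print("lasso coefs:", np.round(lasso.coef_, 3))
print("ridge coefs:", np.round(ridge.coef_, 3))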
The cost function of Lasso regression is:

J(w) = \frac{1}{2}\sum_{i}\left(y^{(i)} - w^{T}x^{(i)}\right)^{2} + \lambda\lVert w\rVert_{1}

In the formula above, ||w||_1 = \sum_i |w_i| is the L1 norm of the weight vector w. Because the absolute value is not differentiable at 0, the (sub)gradient of the penalty with respect to each parameter is taken as \lambda\,\mathrm{sign}(\theta_i), where sign(θ_i) is +1, 0, or −1 according to the sign of that parameter.
The explanation above is adapted from an external reference.
The implementation code that follows comes from the ML-From-Scratch project (the mlfromscratch package imported later).
First, we define a base class that every linear regression variant inherits from:
import math
import numpy as np

class Regression(object):
    """ Base regression model. Models the relationship between a scalar dependent
    variable y and the independent variables X.
    Parameters:
    -----------
    n_iterations: float
        The number of training iterations the algorithm will tune the weights for.
    learning_rate: float
        The step length that will be used when updating the weights.
    """
    def __init__(self, n_iterations, learning_rate):
        self.n_iterations = n_iterations
        self.learning_rate = learning_rate

    def initialize_weights(self, n_features):
        """ Initialize weights randomly [-1/N, 1/N] """
        limit = 1 / math.sqrt(n_features)
        self.w = np.random.uniform(-limit, limit, (n_features, ))

    def fit(self, X, y):
        # Insert constant ones for bias weights
        X = np.insert(X, 0, 1, axis=1)
        self.training_errors = []
        self.initialize_weights(n_features=X.shape[1])
        # Do gradient descent for n_iterations
        for i in range(self.n_iterations):
            y_pred = X.dot(self.w)
            # Calculate l2 loss plus the regularization penalty, and record it
            mse = np.mean(0.5 * (y - y_pred)**2 + self.regularization(self.w))
            self.training_errors.append(mse)
            # Gradient of l2 loss w.r.t. w, plus the gradient of the regularization term
            grad_w = -(y - y_pred).dot(X) + self.regularization.grad(self.w)
            # Update the weights
            self.w -= self.learning_rate * grad_w

    def predict(self, X):
        # Insert constant ones for bias weights
        X = np.insert(X, 0, 1, axis=1)
        y_pred = X.dot(self.w)
        return y_pred
Note that the squared-error term of the MSE loss here is scaled only by 0.5 (np.mean handles the averaging over samples). Each call to fit() also records the loss per iteration in self.training_errors, which the example script plots later.
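The 0.5 factor is there purely for convenience: it cancels the 2 that appears when differentiating the square, so the gradient used in fit() comes out clean. As a one-line derivation (matching the code, with λ playing the role of reg_factor):

\frac{\partial}{\partial w}\left[\tfrac{1}{2}\,(y - Xw)^{2} + \lambda\lVert w\rVert_{1}\right]
  = -(y - Xw)\,X + \lambda\,\mathrm{sign}(w)

which is exactly what grad_w computes before the update w \leftarrow w - \eta \cdot \mathrm{grad\_w}.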
The core of Lasso regression is the L1 regularization term, implemented as follows:
class l1_regularization():
    """ Regularization for Lasso Regression """
    def __init__(self, alpha):
        self.alpha = alpha

    def __call__(self, w):
        # Penalty term: alpha times the L1 norm of the weights
        return self.alpha * np.linalg.norm(w, 1)

    def grad(self, w):
        # (Sub)gradient of the L1 norm: alpha * sign(w)
        return self.alpha * np.sign(w)
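A quick check of how this class behaves on a hand-picked weight vector (the numbers are made up for illustration):

import numpy as np

reg = l1_regularization(alpha=0.05)
w = np.array([1.5, -2.0, 0.0])

print(reg(w))       # 0.05 * (|1.5| + |-2.0| + |0.0|) = 0.175
print(reg.grad(w))  # 0.05 * sign(w) = [ 0.05 -0.05  0.  ]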
Then comes the Lasso regression class itself:
class LassoRegression(Regression):
    """Linear regression model with a regularization factor which does both variable selection
    and regularization. Model that tries to balance the fit of the model with respect to the training
    data and the complexity of the model. A large regularization factor decreases the variance of
    the model.
    Parameters:
    -----------
    degree: int
        The degree of the polynomial that the independent variable X will be transformed to.
    reg_factor: float
        The factor that will determine the amount of regularization and feature
        shrinkage.
    n_iterations: float
        The number of training iterations the algorithm will tune the weights for.
    learning_rate: float
        The step length that will be used when updating the weights.
    """
    def __init__(self, degree, reg_factor, n_iterations=3000, learning_rate=0.01):
        self.degree = degree
        # The L1 penalty defined above is what makes this Lasso rather than plain linear regression
        self.regularization = l1_regularization(alpha=reg_factor)
        super(LassoRegression, self).__init__(n_iterations, learning_rate)

    def fit(self, X, y):
        X = normalize(polynomial_features(X, degree=self.degree))
        super(LassoRegression, self).fit(X, y)

    def predict(self, X):
        X = normalize(polynomial_features(X, degree=self.degree))
        return super(LassoRegression, self).predict(X)
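As a rough usage sketch (toy data made up here; it assumes the classes above plus the two helpers shown next, normalize() and polynomial_features(), are already defined):

import numpy as np

# Toy data: y depends quadratically on x, plus a little noise
np.random.seed(0)
X = np.linspace(0, 1, 50).reshape(-1, 1)
y = 2 * X[:, 0]**2 - X[:, 0] + 0.05 * np.random.randn(50)

model = LassoRegression(degree=3, reg_factor=0.01,
                        n_iterations=3000, learning_rate=0.001)
model.fit(X, y)
print(model.w)               # bias weight plus weights for the normalized polynomial features
print(model.predict(X[:5]))  # predictions for the first five samples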
Here normalize() and polynomial_features() live in utils.data_manipulation:
def normalize(X, axis=-1, order=2):
    """ Normalize the dataset X """
    l2 = np.atleast_1d(np.linalg.norm(X, order, axis))
    l2[l2 == 0] = 1
    return X / np.expand_dims(l2, axis)
np.linalg.norm(): computes a norm; ord=order specifies which norm to compute (order=2 gives the L2 norm used here).
np.atleast_1d(): views the input as an array with at least one dimension; for example, an input of 1 becomes [1] after this function.
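A small sketch of what normalize() does to a toy array (values chosen only for illustration):

import numpy as np

X = np.array([[3.0, 4.0],
              [0.0, 2.0]])
# Row-wise L2 norms are [5.0, 2.0], so each row is rescaled to unit length
print(normalize(X))
# [[0.6 0.8]
#  [0.  1. ]]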
from itertools import combinations_with_replacement

def polynomial_features(X, degree):
    n_samples, n_features = np.shape(X)

    def index_combinations():
        # All ways of picking features (with repetition) for each power from 0 to degree
        combs = [combinations_with_replacement(range(n_features), i) for i in range(0, degree + 1)]
        flat_combs = [item for sublist in combs for item in sublist]
        return flat_combs

    combinations = index_combinations()
    n_output_features = len(combinations)
    X_new = np.empty((n_samples, n_output_features))

    for i, index_combs in enumerate(combinations):
        # Each output column is the product of the chosen input columns
        X_new[:, i] = np.prod(X[:, index_combs], axis=1)

    return X_new
This function computes polynomial features. What are polynomial features?
Take sklearn as an example: sklearn.preprocessing.PolynomialFeatures constructs new features from polynomials of the originals. Given two features a and b, the degree-2 polynomial features are (1, a, b, a^2, ab, b^2).
PolynomialFeatures takes three parameters (see the sketch after this list):
degree: controls the degree of the polynomial.
interaction_only: defaults to False; if set to True, no feature is combined with itself, so the degree-2 features above would not include a^2 or b^2.
include_bias: defaults to True; if True, the constant 1 term above is included.
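A short sketch comparing sklearn's PolynomialFeatures with the polynomial_features() helper above (the single toy sample is made up):

import numpy as np
from sklearn.preprocessing import PolynomialFeatures

X = np.array([[2.0, 3.0]])  # one sample with features a=2, b=3

poly = PolynomialFeatures(degree=2, interaction_only=False, include_bias=True)
print(poly.fit_transform(X))
# [[1. 2. 3. 4. 6. 9.]]  -> (1, a, b, a^2, ab, b^2)

# The from-scratch helper produces the same six columns for this input
print(polynomial_features(X, degree=2))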
Finally, let's put it to use.
First, part of the dataset (the time column is the fraction of the year, temp is the temperature):
time    temp
0.00273224    0.1
0.005464481    -4.5
0.008196721    -6.3
0.010928962    -9.6
0.013661202    -9.9
0.016393443    -17.1
0.019125683    -11.6
0.021857923    -6.2
0.024590164    -6.4
0.027322404    -0.5
0.030054645    0.5
0.032786885    -2.4
0.035519126    -7.5
Then the script that runs Lasso regression:
from __future__ import print_function
import sys
sys.path.append("/content/drive/My Drive/learn/ML-From-Scratch/")
import matplotlib.pyplot as plt
import numpy as np
import pandas as pd
# Import helper functions
from mlfromscratch.supervised_learning import LassoRegression
from mlfromscratch.utils import k_fold_cross_validation_sets, normalize, mean_squared_error
from mlfromscratch.utils import train_test_split, polynomial_features, Plot

def main():
    # Load temperature data
    data = pd.read_csv('mlfromscratch/', sep="\t")
    time = np.atleast_2d(data["time"].values).T
    temp = data["temp"].values

    X = time  # fraction of the year [0, 1]
    y = temp

    X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.4)

    poly_degree = 13

    model = LassoRegression(degree=15,
                            reg_factor=0.05,
                            learning_rate=0.001,
                            n_iterations=4000)
    model.fit(X_train, y_train)

    # Training error plot
    n = len(model.training_errors)
    training, = plt.plot(range(n), model.training_errors, label="Training Error")
    plt.legend(handles=[training])
    plt.title("Error Plot")
    plt.ylabel('Mean Squared Error')
    plt.xlabel('Iterations')
    plt.show()

    y_pred = model.predict(X_test)
    mse = mean_squared_error(y_test, y_pred)
    print("Mean squared error: %s (given by reg. factor: %s)" % (mse, 0.05))

    y_pred_line = model.predict(X)

    # Color map
    cmap = plt.get_cmap('viridis')

    # Plot the results
    m1 = plt.scatter(366 * X_train, y_train, color=cmap(0.9), s=10)
    m2 = plt.scatter(366 * X_test, y_test, color=cmap(0.5), s=10)
    plt.plot(366 * X, y_pred_line, color='black', linewidth=2, label="Prediction")
    plt.suptitle("Lasso Regression")
    plt.title("MSE: %.2f" % mse, fontsize=10)
    plt.xlabel('Day')
    plt.ylabel('Temperature in Celsius')
    plt.legend((m1, m2), ("Training data", "Test data"), loc='lower right')
    plt.show()

if __name__ == "__main__":
    main()
This script also uses several functions under utils; let's go through them one by one.
def train_test_split(X, y, test_size=0.5, shuffle=True, seed=None):
    """ Split the data into train and test sets """
    if shuffle:
        X, y = shuffle_data(X, y, seed)
    # Split the training data from test data in the ratio specified in
    # test_size
    split_i = len(y) - int(len(y) // (1 / test_size))
    X_train, X_test = X[:split_i], X[split_i:]
    y_train, y_test = y[:split_i], y[split_i:]
    return X_train, X_test, y_train, y_test
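A quick illustration of how the split sizes work out (toy arrays made up for this example):

import numpy as np

X = np.arange(20).reshape(10, 2)
y = np.arange(10)

# With test_size=0.4 and 10 samples, int(10 // (1 / 0.4)) = 4 samples go to the test set
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.4, seed=1)
print(X_train.shape, X_test.shape)  # (6, 2) (4, 2)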
The code here is quite simple; it also uses shuffle_data():
def shuffle_data(X, y, seed=None):
    """ Random shuffle of the samples in X and y """
    if seed:
        np.random.seed(seed)
    idx = np.arange(X.shape[0])
    np.random.shuffle(idx)
    return X[idx], y[idx]
which shuffles the samples.
def k_fold_cross_validation_sets(X, y, k, shuffle=True):
    """ Split the data into k sets of training / test data """
    if shuffle:
        X, y = shuffle_data(X, y)

    n_samples = len(y)
    left_overs = {}
    n_left_overs = (n_samples % k)
    if n_left_overs != 0:
        left_overs["X"] = X[-n_left_overs:]
        left_overs["y"] = y[-n_left_overs:]
        X = X[:-n_left_overs]
        y = y[:-n_left_overs]

    X_split = np.split(X, k)
    y_split = np.split(y, k)
    sets = []
    for i in range(k):
        X_test, y_test = X_split[i], y_split[i]
        X_train = np.concatenate(X_split[:i] + X_split[i + 1:], axis=0)
        y_train = np.concatenate(y_split[:i] + y_split[i + 1:], axis=0)
        sets.append([X_train, X_test, y_train, y_test])

    # Add left over samples to last set as training samples
    # (np.append returns a new array, so the result must be assigned back)
    if n_left_overs != 0:
        sets[-1][0] = np.append(sets[-1][0], left_overs["X"], axis=0)
        sets[-1][2] = np.append(sets[-1][2], left_overs["y"], axis=0)

    # dtype=object because each fold holds arrays of different shapes
    return np.array(sets, dtype=object)
This performs the splitting for k-fold cross-validation.
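A minimal sketch of calling it (toy data and k chosen arbitrarily; note the example script above imports this function but never actually uses it):

import numpy as np

X = np.arange(22).reshape(11, 2)
y = np.arange(11)

# 11 samples with k=5: the one leftover sample is appended to the last training set
sets = k_fold_cross_validation_sets(X, y, k=5)
for X_train, X_test, y_train, y_test in sets:
    print(X_train.shape, X_test.shape)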
All of these helper functions have counterparts in sklearn; reading the from-scratch versions helps deepen understanding. Running the script gives:
Mean squared error: 11.302155412035969 (given by reg. factor: 0.05)
