【PyTorch】Softmax Regression
Contents
1. Model and Code Implementation
  1.1. Model
  1.2. Code Implementation
2. Q&A
1. Model and Code Implementation

1.1. Model

Background: in a classification problem, the model's output layer is a fully connected layer with one output per class. We would like the output $\hat{y}_j$ to be interpretable as the probability that an example belongs to class $j$, so that we can predict the class with the largest output value.

The softmax function transforms the unnormalized outputs into non-negative values that sum to 1, keeps the model differentiable, and does not change the ordering of the unnormalized outputs:

$$\hat{\mathbf{y}} = \mathrm{softmax}(\mathbf{o}), \qquad \hat{y}_j = \frac{\exp(o_j)}{\sum_k \exp(o_k)}$$

Softmax itself is a nonlinear function, but the outputs of softmax regression are still determined by an affine transformation of the input features, so softmax regression is a linear model.

Passing the softmax output directly into the cross-entropy loss causes numerical stability problems: exponentiating a large logit overflows, and taking the logarithm of a probability that has underflowed to zero gives negative infinity. The fix is to combine softmax and cross-entropy: instead of passing softmax probabilities to the loss function, pass the unnormalized outputs and compute the softmax and its logarithm together inside the loss (the log-sum-exp trick). This is why the cross-entropy loss used below also performs the softmax internally.
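A quick check of this failure mode, with arbitrary example logits (not taken from the model below): a hand-rolled softmax overflows, while PyTorch's F.cross_entropy, which fuses log-softmax and negative log-likelihood, stays finite.

```python
import torch
import torch.nn.functional as F

logits = torch.tensor([[1000.0, 0.0, -1000.0]])  # arbitrary large logit
label = torch.tensor([0])

# Naive route: exp(1000) overflows to inf, so normalizing gives nan.
naive_probs = torch.exp(logits) / torch.exp(logits).sum(dim=1, keepdim=True)
print(-torch.log(naive_probs[0, label]))  # tensor([nan])

# Stable route: hand the logits to cross_entropy, which applies the
# log-sum-exp trick internally.
print(F.cross_entropy(logits, label))     # tensor(0.)
```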
1.2. Code Implementation

```python
import torch
from torchvision.datasets import FashionMNIST
from torchvision import transforms
from torch.utils.data import DataLoader
from torch import nn
from tensorboardX import SummaryWriter

batch_size = 256
num_workers = 0
num_epochs = 10
device = torch.device('cuda' if torch.cuda.is_available() else 'cpu')
writer = SummaryWriter()

# Load Fashion-MNIST and convert the images to tensors.
root = "./dataset"
transform = transforms.Compose([transforms.ToTensor()])
mnist_train = FashionMNIST(root=root, train=True, transform=transform, download=True)
mnist_test = FashionMNIST(root=root, train=False, transform=transform, download=True)
dataloader_train = DataLoader(mnist_train, batch_size, shuffle=True, num_workers=num_workers)
dataloader_test = DataLoader(mnist_test, batch_size, shuffle=False, num_workers=num_workers)

# Softmax regression: flatten each 28x28 image, then one linear layer.
# No explicit softmax layer -- CrossEntropyLoss takes the raw logits.
net = nn.Sequential(nn.Flatten(), nn.Linear(784, 10)).to(device)

def init_weights(m):
    if type(m) == nn.Linear:
        nn.init.normal_(m.weight, mean=0, std=0.01)
        nn.init.constant_(m.bias, val=0)

net.apply(init_weights)

# reduction='none' keeps the per-example losses, so we can both average
# them for the gradient step and sum them for logging.
criterion = nn.CrossEntropyLoss(reduction='none')
optimizer = torch.optim.SGD(net.parameters(), lr=0.1)

class Accumulator:
    """Accumulate sums over n variables."""
    def __init__(self, n):
        self.data = [0.0] * n

    def add(self, *args):
        self.data = [a + float(b) for a, b in zip(self.data, args)]

    def reset(self):
        self.data = [0.0] * len(self.data)

    def __getitem__(self, idx):
        return self.data[idx]

def accuracy(y_hat, y):
    """Count the number of correct predictions."""
    if len(y_hat.shape) > 1 and y_hat.shape[1] > 1:
        y_hat = y_hat.argmax(axis=1)
    cmp = y_hat.type(y.dtype) == y
    return float(cmp.type(y.dtype).sum())

for epoch in range(num_epochs):
    # Training: accumulate summed loss, correct count, example count.
    net.train()
    train_metrics = Accumulator(3)
    for X, y in dataloader_train:
        X, y = X.to(device), y.to(device)
        y_hat = net(X)
        loss = criterion(y_hat, y)
        optimizer.zero_grad()
        loss.mean().backward()
        optimizer.step()
        train_metrics.add(float(loss.sum()), accuracy(y_hat, y), y.numel())
    train_loss = train_metrics[0] / train_metrics[2]
    train_acc = train_metrics[1] / train_metrics[2]

    # Evaluation: accuracy only, with gradients disabled.
    net.eval()
    with torch.no_grad():
        test_metrics = Accumulator(2)
        for X, y in dataloader_test:
            X, y = X.to(device), y.to(device)
            y_hat = net(X)
            test_metrics.add(accuracy(y_hat, y), y.numel())
        test_acc = test_metrics[0] / test_metrics[1]

    writer.add_scalars("metrics",
                       {'train_loss': train_loss,
                        'train_acc': train_acc,
                        'test_acc': test_acc},
                       epoch)

writer.close()
```

Output: TensorBoard curves of train_loss, train_acc, and test_acc over the training epochs.
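To view the logged curves, run tensorboard --logdir runs (tensorboardX's SummaryWriter() writes to ./runs by default) and open the address it prints. After training, the fitted model can also be used for prediction directly; a minimal sketch, assuming the script above has just run in the same session (the class names come from torchvision's mnist_test.classes attribute):

```python
# Predict labels for one test batch and show a few class names.
X, y = next(iter(dataloader_test))
with torch.no_grad():
    preds = net(X.to(device)).argmax(axis=1).cpu()
print("predicted:", [mnist_test.classes[int(i)] for i in preds[:6]])
print("actual:   ", [mnist_test.classes[int(i)] for i in y[:6]])
```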
2. Q&A

The following warning appeared while running the code:
UserWarning: The given NumPy array is not writeable, and PyTorch does not support non-writeable tensors. This means you can write to the underlying (supposedly non-writeable) NumPy array using the tensor. You may want to copy the array to protect its data or make it writeable before converting it to a tensor. This type of warning will be suppressed for the rest of this program. (Triggered internally at …\torch\csrc\utils\tensor_numpy.cpp:180.) return torch.from_numpy(parsed.astype(m[2], copy=False)).view(*s)
The warning roughly means: the given NumPy array is not writeable, and PyTorch does not support non-writeable tensors, so the returned tensor could be used to write to the underlying (supposedly non-writeable) NumPy array; you may want to copy the array to protect its data, or make it writeable, before converting it to a tensor, and warnings of this type are suppressed for the rest of the program. One fix is to edit line 498 of C:\Users\%UserName%\anaconda3\envs\%conda_env_name%\lib\site-packages\torchvision\datasets\mnist.py, changing False to True in return torch.from_numpy(parsed.astype(m[2], copy=False)).view(*s); with copy=True, astype returns a fresh, writeable copy of the buffer, and the warning disappears.
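Since edits to an installed package are lost on upgrade, an alternative that leaves torchvision untouched is to filter just this warning from your own script, before the FashionMNIST(...) calls run; a sketch using the standard warnings module (the message argument only needs to match the start of the warning text):

```python
import warnings

# Silence only the non-writeable-array warning raised while
# torchvision converts the raw MNIST buffers to tensors.
warnings.filterwarnings(
    "ignore",
    message="The given NumPy array is not writeable",
)
```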