卷积神经网络(CNN)是一种广泛应用于图像识别的深度学习模型。在PyTorch中,我们可以使用PyTorch的torchvision库来构建和训练卷积神经网络。以下是一个简单的示例,展示了如何使用PyTorch构建一个用于图像分类的卷积神经网络。
首先,我们需要导入所需的库:
```python
import torch
import torch.nn as nn
import torch.optim as optim
from torchvision import datasets, transforms
from torchvision.models import resnet50
```
接下来,我们定义一个自定义的ResNet模型,这个模型将继承自resnet50模型,并添加一些额外的层以适应我们的图像分类任务。在这个例子中,我们将添加一个全连接层,用于输出分类结果。
```python
class CustomResNet(nn.Module):
def __init__(self):
super(CustomResNet, self).__init__()
# 继承自resnet50模型
self.conv1 = nn.Conv2d(3, 64, kernel_size=7, stride=2, padding=3, bias=False)
self.bn1 = nn.BatchNorm2d(64)
self.conv2 = nn.Conv2d(64, 128, kernel_size=3, stride=1, padding=1, bias=False)
self.bn2 = nn.BatchNorm2d(128)
self.conv3 = nn.Conv2d(128, 256, kernel_size=3, stride=1, padding=1, bias=False)
self.bn3 = nn.BatchNorm2d(256)
self.avgpool = nn.AdaptiveAvgPool2d((1, 1))
self.fc = nn.Linear(256 * 8 * 8, 10) # 假设我们有10个类别
def forward(self, x):
x = self.conv1(x)
x = self.bn1(x)
x = torch.relu(x)
x = self.conv2(x)
x = self.bn2(x)
x = torch.relu(x)
x = self.conv3(x)
x = self.bn3(x)
x = self.avgpool(x)
x = torch.flatten(x, 1)
x = self.fc(x)
return x
```
现在,我们可以创建一个数据集,并对它进行预处理:
```python
transform = transforms.Compose([
transforms.Resize(256),
transforms.CenterCrop(224),
transforms.ToTensor(),
transforms.Normalize(mean=[0.485, 0.456, 0.406], std=[0.229, 0.224, 0.225])
])
trainset = datasets.ImageFolder('path/to/train/images', transform=transform)
testset = datasets.ImageFolder('path/to/test/images', transform=transform)
trainloader = torch.utils.data.DataLoader(trainset, batch_size=64, shuffle=True)
testloader = torch.utils.data.DataLoader(testset, batch_size=64, shuffle=False)
```
接下来,我们可以创建网络、损失函数和优化器:
```python
model = CustomResNet()
criterion = nn.CrossEntropyLoss()
optimizer = optim.SGD(model.parameters(), lr=0.001, momentum=0.9)
```
然后,我们可以编译模型:
```python
model.train()
```
最后,我们可以训练模型:
```python
for epoch in range(10): # 假设我们训练10个epoch
running_loss = 0.0
for i, data in enumerate(trainloader, 0):
inputs, labels = data
inputs = inputs.view(-1, 256 * 8 * 8)
optimizer.zero_grad()
outputs = model(inputs)
loss = criterion(outputs, labels)
loss.backward()
optimizer.step()
running_loss += loss.item()
print('Epoch %d loss: %.3f' % (epoch + 1, running_loss / (i + 1)))
```
训练完成后,我们可以使用测试集评估模型的性能:
```python
correct = 0
total = 0
with torch.no_grad():
for data in testloader:
images, labels = data
outputs = model(images)
_, predicted = torch.max(outputs.data, 1)
total += labels.size(0)
correct += (predicted == labels).sum().item()
print('Accuracy of the network on the test images: %d %%' % (100 * correct / total))
```
这个示例展示了如何使用PyTorch构建一个简单的卷积神经网络进行图像分类。你可以根据需要修改这个示例,例如添加更多的层、调整学习率等。