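[Computer Vision | Object Detection | Image Segmentation] arXiv Academic Digest on Object Detection and Image Segmentation in Computer Vision (July 17 Paper Collection)
Published: 2025-02-10 09:28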
**1. Object Detection**
### 1.1 **"YOLOv5: An Incremental Improvement Over YOLOv4"**
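
    import torch
    from torchvision import models


    # YOLOv5 model definition (a toy stand-in: ResNet-50 features plus a small
    # conv head, not the actual YOLOv5 architecture)
    class YOLOv5(torch.nn.Module):
        def __init__(self):
            super().__init__()
            # `weights=` replaces the deprecated `pretrained=True` (torchvision >= 0.13)
            resnet = models.resnet50(weights=models.ResNet50_Weights.DEFAULT)
            # Drop avgpool and fc so the backbone returns a (N, 2048, H/32, W/32)
            # feature map instead of (N, 1000) class logits
            self.backbone = torch.nn.Sequential(*list(resnet.children())[:-2])
            self.head = torch.nn.Sequential(
                # ResNet-50 ends with 2048 channels, not 512
                torch.nn.Conv2d(2048, 256, kernel_size=3, padding=1),
                torch.nn.ReLU(),
                torch.nn.MaxPool2d(kernel_size=2, stride=2),
                torch.nn.Flatten(),
            )

        def forward(self, x):
            x = self.backbone(x)  # spatial feature map
            x = self.head(x)      # one flattened feature vector per image
            return x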
### 1.2 **"EfficientDet: Rethinking Efficient Object Detection with a Unified Network Architecture"**
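
    import torch.nn as nn


    # EfficientDet model definition (a toy stand-in: one conv block plus an MLP
    # head that outputs 10 scores; no EfficientNet backbone or BiFPN)
    class EfficientDet(nn.Module):
        def __init__(self):
            super().__init__()
            self.backbone = nn.Sequential(
                nn.Conv2d(3, 64, kernel_size=3),
                nn.ReLU(),
                nn.MaxPool2d(kernel_size=2, stride=2),
            )
            self.head = nn.Sequential(
                # Pool to (N, 64, 1, 1) and flatten to (N, 64) so the linear
                # layers receive a 2-D tensor whose width matches the backbone
                nn.AdaptiveAvgPool2d(1),
                nn.Flatten(),
                nn.Linear(64, 256),
                nn.ReLU(),
                nn.Linear(256, 10),
            )

        def forward(self, x):
            x = self.backbone(x)  # (N, 64, H', W')
            x = self.head(x)      # (N, 10)
            return x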
**2. Image Segmentation**
### 2.1 **"U-Net: Deep Learning for Image Segmentation with Fully Convolutional Networks"**
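
    import torch.nn as nn


    # U-Net model definition (a toy encoder-decoder; no skip connections)
    class UNet(nn.Module):
        def __init__(self):
            super().__init__()
            self.encoder = nn.Sequential(
                # padding=1 keeps the 3x3 conv from shrinking the feature map
                nn.Conv2d(3, 64, kernel_size=3, padding=1),
                nn.ReLU(),
                nn.MaxPool2d(kernel_size=2, stride=2),
            )
            self.decoder = nn.Sequential(
                nn.Upsample(scale_factor=2),
                # in_channels must match the encoder's 64 output channels
                nn.Conv2d(64, 16, kernel_size=3, padding=1),
                nn.ReLU(),
            )

        def forward(self, x):
            x = self.encoder(x)  # (N, 64, H/2, W/2)
            x = self.decoder(x)  # (N, 16, H, W) for even H and W
            return x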
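
A 3x3 convolution shrinks each spatial axis by 2 unless it is padded, which matters for segmentation because a predicted mask is usually compared with the input pixel by pixel. The sketch below traces one axis through a conv → max-pool → upsample → conv stack like the encoder-decoder above, using the output-size formula given in the PyTorch `Conv2d` documentation. The helper names `conv_out` and `trace` are hypothetical, and the code is plain Python so it can be checked without PyTorch installed.

```python
def conv_out(size, kernel=3, stride=1, padding=0, dilation=1):
    """Output length along one axis for a conv or pooling layer."""
    return (size + 2 * padding - dilation * (kernel - 1) - 1) // stride + 1


def trace(size, padding):
    """Trace one axis through conv3x3 -> maxpool2 -> upsample x2 -> conv3x3."""
    after_conv = conv_out(size, kernel=3, padding=padding)
    after_pool = conv_out(after_conv, kernel=2, stride=2)  # MaxPool2d(2, 2)
    after_up = after_pool * 2                              # Upsample(scale_factor=2)
    after_dec = conv_out(after_up, kernel=3, padding=padding)
    return [size, after_conv, after_pool, after_up, after_dec]


print(trace(64, padding=0))  # [64, 62, 31, 62, 60]
print(trace(64, padding=1))  # [64, 64, 32, 64, 64]
```

With `padding=0` a 64-pixel axis comes out as 60 pixels; with `padding=1` the output matches the input size, as long as that size is even.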
### 2.2 **"DeepLab: Semantic Image Segmentation with Deep Convolutional Nets and Fully Connected CRFs"**
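
    import torch.nn as nn


    # DeepLab model definition (a toy encoder-decoder; no atrous convolution or CRF)
    class DeepLab(nn.Module):
        def __init__(self):
            super().__init__()
            self.encoder = nn.Sequential(
                # padding=1 keeps the 3x3 conv from shrinking the feature map
                nn.Conv2d(3, 64, kernel_size=3, padding=1),
                nn.ReLU(),
                nn.MaxPool2d(kernel_size=2, stride=2),
            )
            self.decoder = nn.Sequential(
                nn.Upsample(scale_factor=2),
                # in_channels must match the encoder's 64 output channels
                nn.Conv2d(64, 16, kernel_size=3, padding=1),
                nn.ReLU(),
            )

        def forward(self, x):
            x = self.encoder(x)  # (N, 64, H/2, W/2)
            x = self.decoder(x)  # (N, 16, H, W) for even H and W
            return x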
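
The model above uses only ordinary convolutions. DeepLab-family models are generally associated with atrous (dilated) convolution, which widens the receptive field of a kernel without adding weights or downsampling. As a rough sketch of the arithmetic, the snippet below computes the effective extent of a dilated kernel and the padding that preserves spatial size at stride 1. The helper names `effective_kernel` and `same_padding` are hypothetical, and the dilation rates are illustrative values, not taken from the text above.

```python
def effective_kernel(kernel, dilation):
    """Span, in input pixels, covered by a dilated kernel along one axis."""
    return dilation * (kernel - 1) + 1


def same_padding(kernel, dilation):
    """Padding that keeps the output size equal to the input (odd kernel, stride 1)."""
    return dilation * (kernel - 1) // 2


# Illustrative dilation rates for a 3x3 kernel
for rate in (1, 6, 12, 18):
    print(rate, effective_kernel(3, rate), same_padding(3, rate))
# 1 3 1
# 6 13 6
# 12 25 12
# 18 37 18
```

In PyTorch terms, this corresponds to passing `dilation=rate, padding=rate` to `nn.Conv2d` with `kernel_size=3`.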