• 摘要: 无监督行人重识别因其广泛的实际应用前景而受到越来越多的关注。大多数基于聚类的对比学习方法将每个集群视为一个伪身份类,忽略了由相机风格差异造成的类内差异。一些方法引入了相机感知对比学习,根据相机视角将单一集群划分为多个子集群,但它们容易受到噪声伪标签的误导。为解决这一问题,本文首先基于实例在特征空间中的相似性,采用最近邻的预测标签和原始聚类结果的加权组合细化伪标签。然后,采用细化伪标签动态地关联实例可能属于的类别中心,同时剔除可能存在的假阴性样本。这一方法改进了相机感知对比学习中正负样本的选择机制,有效地减轻了噪声伪标签对对比学习任务的误导。在Market-1501、MSMT17、Personx数据集上mAP/Rank-1分别达到了85.2%/94.4%、44.3%/74.1%、88.7%/95.9%。

       

      Abstract: Unsupervised person re-identification has attracted more and more attention due to its extensive practical application prospects. Most clustering-based contrastive learning methods treat each cluster as a pseudo-identity class, overlooking intra-class variances caused by differences in camera styles. While some methods have introduced camera-aware contrastive learning by partitioning a single cluster into multiple sub-clusters based on camera views, they are susceptible to misguidance from noisy pseudo-labels. To address this issue, we first refine pseudo-labels by leveraging the similarity between instances in the feature space, using a weighted combination of the nearest neighboring predicted labels and the original clustering results. Subsequently, it dynamically associates instances with possible category centers based on refined pseudo-labels while eliminating potential false negative samples. This method enhances the selection mechanism for positive and negative samples in camera-aware contrastive learning, effectively mitigating the influence of noisy pseudo-labels on the contrastive learning task. On Market-1501, MSMT17 and Personx datasets, mAP/Rank-1 reached 85.2%/94.4%, 44.3%/74.1% and 88.7%/95.9%.