Pattern Recognition and Computer Vision: 6th Chinese Conference, PRCV 2023, Xiamen, China, October 13–15, 2023, Proceedings, Part IV (Lecture Notes in Computer Science) 9819984610, 9789819984619

The 13-volume set LNCS 14425-14437 constitutes the refereed proceedings of the 6th Chinese Conference on Pattern Recognition and Computer Vision, PRCV 2023, held in Xiamen, China, in October 2023.


English Pages 520 [514] Year 2024


Table of contents:
Preface
Organization
Contents – Part IV
Pattern Classification and Cluster Analysis
Shared Nearest Neighbor Calibration for Few-Shot Classification
1 Introduction
2 Related Work
3 Method
3.1 Preliminaries
3.2 Shared Nearest Neighbor Calibration (SNNC)
4 Experiments
4.1 Experimental Setup
4.2 Results
4.3 Ablation Study
5 Conclusion
References
Prototype Rectification with Region-Wise Foreground Enhancement for Few-Shot Classification
1 Introduction
2 Method
2.1 Preliminaries
2.2 Overall Framework
2.3 Region-Wise Foreground Enhancement
2.4 Prototype Rectification
3 Experiments
3.1 Experimental Setup
3.2 Implementation Details
3.3 Results
3.4 Ablation Studies
4 Conclusion
References
Rotation Augmented Distillation for Exemplar-Free Class Incremental Learning with Detailed Analysis
1 Introduction
2 Related Work
3 Methodology
3.1 Preliminary
3.2 A Baseline Method: Feat*
3.3 Rotation Augmented Distillation
4 Experiments
4.1 Detailed Analysis Under Conventional EFCIL Setting
4.2 Detailed Analysis Under Challenging EFCIL Setting
5 Conclusion
References
Nonconvex Tensor Hypergraph Learning for Multi-view Subspace Clustering
1 Introduction
2 Related Work
2.1 Multi-view Subspace Clustering
2.2 Hypergraph Learning
3 The Proposed Approach
3.1 The Proposed Model
3.2 Optimization
3.3 Complexity Analysis
4 Experimental Results and Analysis
4.1 Database and Competitors
4.2 Experimental Results and Analysis
4.3 Parameter Analysis
4.4 Convergence Analysis
5 Conclusion
References
A Novel Method for Identifying Bipolar Disorder Based on Diagnostic Texts
1 Introduction
2 Related Work
3 Proposed Method
3.1 Text Embedding
3.2 Emotional Feature Extraction
3.3 Multiscale Feature Extraction
3.4 Feature Fusion
4 Experiments and Results
4.1 Experimental Setup
4.2 Comparison with Recent Works
4.3 Ablation Study
5 Conclusion
References
Deep Depression Detection Based on Feature Fusion and Result Fusion
1 Introduction
2 Related Work
3 Proposed Method
3.1 Model Structure
4 Experiment
4.1 Dataset
4.2 Experiment Settings
4.3 Comparative Experiments
4.4 Ablation Study
5 Conclusion
References
Adaptive Cluster Assignment for Unsupervised Semantic Segmentation
1 Introduction
2 Related Work
2.1 Unsupervised Semantic Segmentation
2.2 Self-supervised Visual Feature Learning
2.3 Zero-Shot Semantic Segmentation with Pretrained Image-Language Model
3 Method
3.1 Overview
3.2 Adaptive Clustering Assignment Module
3.3 Objective Function
3.4 Inference
4 Experiments
4.1 Datasets and Experimental Settings
4.2 Quantitative Comparative Results
4.3 Ablation Studies
5 Conclusion
References
Confidence-Guided Open-World Semi-supervised Learning
1 Introduction
2 Related Work
3 Methodology
3.1 Problem Formulation
3.2 Overall Framework
3.3 Consistency-Guided Temperature Scaling
3.4 Adaptive Class-Specific Re-weighting Contrastive Module
4 Experiments
4.1 Datasets and Evaluation Metrics
4.2 Implementation Details
4.3 Experimental Results and Analysis
4.4 Ablation and Analysis
5 Conclusion
References
SSCL: Semi-supervised Contrastive Learning for Industrial Anomaly Detection
1 Introduction
2 Related Works
3 Method
3.1 Self-supervised Contrastive Learning
3.2 Supervised Contrastive Learning
3.3 Semi-supervised Contrastive Learning
4 Experiments
4.1 Competition Methods
4.2 Evaluation Metrics
4.3 Scenario Setup
4.4 Experimental Scenarios on MNIST and CIFAR-10
4.5 Experimental Scenarios on Industrial Anomaly Detection MVtec and STC
4.6 Ablation Study
5 Conclusion and Future Work
References
One Step Large-Scale Multi-view Subspace Clustering Based on Orthogonal Matrix Factorization with Consensus Graph Learning
1 Introduction
2 Proposed Method
2.1 Overview and Objective Function
2.2 Optimization
2.3 Complexity Analysis
3 Experiments
3.1 Datasets and Comparison Methods
3.2 Experimental Setup
3.3 Experiments Results
4 Conclusion
References
Deep Multi-task Image Clustering with Attention-Guided Patch Filtering and Correlation Mining
1 Introduction
2 Multi-task Image Clustering with Attention-Guided Patch Filtering and Correlation Mining
2.1 Overview
2.2 Key Patch Filter
2.3 Weights Estimation
2.4 Multi-task Clustering
3 Experiments
3.1 Data Sets
3.2 Baselines
3.3 Experimental Setup
3.4 Compared with the Latest Method
3.5 Impact of the Number of Clusters for Auxiliary Clustering
3.6 Impact of the Number of Extracted Patches
3.7 Impact of Word Count in Visual Dictionaries
4 Conclusions
References
Deep Structure and Attention Aware Subspace Clustering
1 Introduction
2 Proposed Method
2.1 Content Feature Learning
2.2 Structure Feature Learning
2.3 Training Strategy
3 Experiments
3.1 Experiments Settings
3.2 Results and Discussion
4 Conclusion
References
Broaden Your Positives: A General Rectification Approach for Novel Class Discovery
1 Introduction
2 Related Work
3 Proposed Approach
3.1 Problem Definition
3.2 Positive-Augmented Contrastive Learning
3.3 Theoretical Analysis for PACL
3.4 PACL for NCD and GCD
4 Experiments
4.1 Experimental Setup
4.2 PACL for Novel Category Discovery
4.3 PACL on Generalized Category Discovery
4.4 Additional Study
5 Conclusion
References
CE2: A Copula Entropic Mutual Information Estimator for Enhancing Adversarial Robustness
1 Introduction
2 Background
2.1 Copula Entropy and MI
2.2 Adversarial Attacks and Defenses
3 Methodology
3.1 Motivation
3.2 Copula Entropic MI Estimator
4 Experiments
4.1 Experiment Setups
4.2 Effectiveness of MI Estimator
4.3 Robustness Evaluation and Analysis
5 Conclusion
References
Two-Step Projection of Sparse Discrimination Between Classes for Unsupervised Domain Adaptation
1 Introduction
2 Related Work
2.1 Distribution Matching
2.2 Structure Maintenance
3 Two-Step Projection of Sparse Discrimination Between Classes (TSPSDC) Method
3.1 Notation
3.2 Model Formulation
3.3 Overall Objective and Optimization Algorithm
4 Experiments
4.1 The Cross-Domain Experiment
4.2 Analysis of the Convergence
4.3 The Parameter Analysis
5 Conclusion
References
Enhancing Adversarial Robustness via Stochastic Robust Framework
1 Introduction
2 Related Work
2.1 Adversarial Attack
2.2 Adversarial Defense
3 Methodology
3.1 Model Definition
3.2 Method
4 Experiments
4.1 Adversarial Defenses Under White Box Attacks
4.2 Hyperparameter Selection
4.3 Ablation Study
5 Conclusion
References
Pseudo Labels Refinement with Stable Cluster Reconstruction for Unsupervised Re-identification
1 Introduction
2 Related Work
3 Proposed Approach
3.1 Revisit USL Re-ID Model Based on Contrast Learning
3.2 Stable Cluster Reconstruction Module
3.3 Similarity Recalculation Module
3.4 Contrast Learning with Refinement of Labels
4 Experiments
4.1 Datasets and Evaluation Metrics
4.2 Implementation Details
4.3 Comparison with State-of-the-Arts
4.4 Ablation Studies
5 Conclusion
References
Ranking Variance Reduced Ensemble Attack with Dual Optimization Surrogate Search
1 Introduction
2 Related Work
3 Method
3.1 Preliminaries
3.2 Perturbation Machine with RVRE
3.3 Dual Optimization Weight Update Strategy
4 Experiments
4.1 Experiment Setup
4.2 Attack Effect Analysis
4.3 Ablation Study
4.4 Comparison on Hyper-parameters
5 Conclusion
References
Performance Evaluation and Benchmarks
PCR: A Large-Scale Benchmark for Pig Counting in Real World
1 Introduction
2 Related Work
2.1 Existing Animal Counting Datasets
2.2 Object Counting Methods
2.3 Pig Counting Methods
3 PCR Dataset
3.1 Data Collection
3.2 Data Pruning and Pre-processing
3.3 Data Annotation
4 Proposed Approach
4.1 Network Architecture
4.2 The SSR Loss Function
5 Experiments
5.1 Implementation Details and Evaluation Setup
5.2 Results of the Proposed Approach
5.3 Comparisons Against State-of-the-Art
5.4 Ablation Analysis
6 Conclusion
References
A Hierarchical Theme Recognition Model for Sandplay Therapy
1 Introduction
2 Related Work
2.1 Image Recognition Tasks
2.2 Projection Test and Sandplay Therapy
3 Method
3.1 Basic Semantic Perception Module
3.2 External Knowledge Incorporation Module
3.3 Theme Classification Module
4 SP2 Dataset
4.1 Dataset Construction
4.2 Psychological Attributes of Sand Objects
5 Experiments
5.1 Experimental Setup
5.2 Comparison with the State-of-the-Art
5.3 Ablation Study
6 Conclusion and Future Work
References
Remote Sensing Image Interpretation
Change-Aware Network for Damaged Roads Recognition and Assessment Based on Multi-temporal Remote Sensing Imageries
1 Introduction
2 Related Works
2.1 Change Detection Methods Based on Deep Learning
2.2 Transformer and UNet Architecture
3 Method
3.1 Multi-scale Feature Extraction Module (MFE)
3.2 Trans and Res Skip Connection (TRSC)
3.3 Dense Cased Upsample Module (DCU)
3.4 Prediction Heads and Loss Function
4 Datasets
4.1 RSRDD
4.2 LEVIR-CD
5 Experiments
5.1 Implemented Details
5.2 Comparison to State-of-the-Art Methods
5.3 Qualitative Result
5.4 Ablation Study
6 Conclusion
References
UAM-Net: An Attention-Based Multi-level Feature Fusion UNet for Remote Sensing Image Segmentation
1 Introduction
2 Related Work
2.1 Encoder-Decoder Designs
2.2 Attention Mechanism
3 Proposed Method
3.1 Philosophy of Model Design
3.2 Network Design
3.3 Global Context Guidance Module
3.4 Attention Module
4 Experiment
4.1 Datasets
4.2 Experimental Setting
4.3 Quantitative Results
4.4 Qualitative Results
4.5 Efficiency Comparison
4.6 Ablation Study
5 Conclusion
References
Improved Conditional Generative Adversarial Networks for SAR-to-Optical Image Translation
1 Introduction
2 Related Work
2.1 Descriptions of GAN
2.2 SAR-to-Optical Image Translation
3 Method
3.1 The Generator with Style-Based Recalibration Module
3.2 Multi-scale Discriminator
3.3 Loss Functions
4 Experiments
4.1 Experimental Setup
4.2 Experimental Analysis
5 Conclusion
References
A Novel Cross Frequency-Domain Interaction Learning for Aerial Oriented Object Detection
1 Introduction
2 Proposed Method
2.1 Extraction of Frequency-Domain Features Module
2.2 Interaction of Frequency-Domain Features Module
2.3 Combined with Existing Framework
3 Experiments and Analysis
3.1 Datasets
3.2 Implementation Detail
3.3 Comparisons with the State-of-the-Art
3.4 Ablation Studies
3.5 Visualization Results
4 Conclusion
References
DBDAN: Dual-Branch Dynamic Attention Network for Semantic Segmentation of Remote Sensing Images
1 Introduction
2 Related Work
2.1 Semantic Segmentation of RSIs
2.2 Attention-Based Methods
3 Method
3.1 Overview
3.2 Correlation Attention Module
3.3 Local Dynamic Attention Branch
3.4 Global Dynamic Attention Branch
3.5 Dual-Branch Feature Fusion
4 Results
4.1 Datasets and Evaluation Metrics
4.2 Comparison with the State-of-the-Art Methods
4.3 Ablation Study
5 Conclusion
References
Multi-scale Contrastive Learning for Building Change Detection in Remote Sensing Images
1 Introduction
2 Related Works
2.1 Self-supervised Contrastive Learning
2.2 SSL in Remote Sensing
3 Methodology
3.1 Data Preprocessing and Representation Generation
3.2 Instance-Level CL Module
3.3 Pixel-Level CL Module
4 Experiments
4.1 Datasets
4.2 Implementation Details
4.3 Experiment on Change Detection and Building Extraction
4.4 Ablation Studies
5 Conclusion
References
Shadow Detection of Remote Sensing Image by Fusion of Involution and Shunted Transformer
1 Introduction
2 Related Work
3 Methods
3.1 Proposed STAISD-Net
3.2 Improved Multiscale Asymmetric Involution
3.3 Improved Shunted Transformer
3.4 Optimization
4 Experiments
4.1 Dataset and Evaluation Metrics
4.2 Comparisons
4.3 Ablation Experiments
5 Conclusion
References
Few-Shot Infrared Image Classification with Partial Concept Feature
1 Introduction
2 Related Work
2.1 Few-Shot Learning
2.2 Partial-Based Methods
3 Problem Definition
4 Method
4.1 Concept Feature Extraction
4.2 Similarity Metric
5 Experiment
5.1 Infrared Dataset
5.2 Experiment Settings
5.3 Ablation Study
5.4 Analysis of Classification Effect
5.5 Real-Time Analysis
6 Conclusion
References
AGST-LSTM: The ConvLSTM Model Combines Attention and Gate Structure for Spatiotemporal Sequence Prediction Learning
1 Introduction
2 Related Work
3 Method
3.1 GM-LSTM Module
3.2 Attention Module
4 Experiment
4.1 Moving-MNIST Dataset
4.2 AirTemperature
4.3 Ablation Study
5 Conclusion
References
A Shape-Based Quadrangle Detector for Aerial Images
1 Introduction
2 Method
2.1 Overview
2.2 Vertex Sorting Function
2.3 PolyIoU Loss Function
2.4 Implementation Details
3 Experiments
3.1 Datasets
3.2 Ablation Experiments
3.3 Comparison with State-of-the-Art Methods
4 Conclusions
References
End-to-End Unsupervised Style and Resolution Transfer Adaptation Segmentation Model for Remote Sensing Images
1 Introduction
2 EUSRTA Model
2.1 Style Transfer Network
2.2 Segmentation Network
2.3 Output-Level Discrimination Network
2.4 Loss Function
3 Experiment Results
3.1 Dataset
3.2 Implementation Details
3.3 State-of-the-Art Comparison
3.4 Ablation Study
4 Conclusion
References
A Physically Feasible Counter-Attack Method for Remote Sensing Imaging Point Clouds
1 Introduction
2 Related Work
3 Methods
3.1 Real Mapping to Digital Module
3.2 Digital Project Back to Real Module
4 Experimental Analysis and Discussion
4.1 Datasets and Classifiers
4.2 Attack Performance
4.3 Simulation of Physical Scene Attacks
5 Conclusion
References
Adversarial Robustness via Multi-experts Framework for SAR Recognition with Class Imbalanced
1 Introduction
2 Related Work
2.1 Adversarial Robustness
2.2 Long-Tailed Recognition
3 Methodology
3.1 Adversarial Robustness
3.2 Personalized Expert Training
3.3 Contrastive Self-supervised Aggregation Strategy
3.4 Loss Function
4 Experiments
4.1 Setup
4.2 Experimental Results
4.3 Ablation Study
5 Conclusion
References
Recognizer Embedding Diffusion Generation for Few-Shot SAR Recognization
1 Introduction
2 Related Work
2.1 SAR Few-Shot Learning
2.2 SAR Generative Models
3 Method
3.1 Overall
3.2 Generation Module
3.3 Recognition Module
4 Experiment
4.1 Dataset
4.2 Implementation
4.3 Similarity Results
4.4 Recognition Results
4.5 Ablation Studies
5 Conclusion
References
A Two-Stage Federated Learning Framework for Class Imbalance in Aerial Scene Classification
1 Introduction
2 Related Work
2.1 Class Imbalance Aerial Image Classification
2.2 Federated Learning
3 Method
3.1 A Two-Stage Federated Learning Framework (TS-FL)
3.2 Feature Representation Method Based on Federated Knowledge Distillation
3.3 Federated Classifier Learning
4 Experiment
4.1 Dataset
4.2 Implementation Details
4.3 Experimental Results
4.4 Feature Visualization
4.5 Ablation Studies
5 Conclusion
References
SAR Image Authentic Assessment with Bayesian Deep Learning and Counterfactual Explanations
1 Introduction
2 Method
2.1 Overview
2.2 Bayesian Convolutional Neural Network
2.3 Counterfactual Explanation Generation
3 Experiments
3.1 Dataset and Settings
3.2 Performance of BayesCNN
3.3 Inauthentic Assessment Based on BayesCNN's Prediction
3.4 Counterfactual Explanation
4 Conclusion
References
Circle Representation Network for Specific Target Detection in Remote Sensing Images
1 Introduction
2 Methods
2.1 Anchor-Free Structure
2.2 Radius and Adaptive Network Heads
2.3 Circle-IOU
3 Experiment and Analysis
3.1 Data
3.2 Experimental Environment and Evaluation Indicators
3.3 Results
4 Conclusion
References
A Transformer-Based Adaptive Semantic Aggregation Method for UAV Visual Geo-Localization
1 Introduction
2 Related Work
3 Method
3.1 Transformer-Based Backbone
3.2 Adaptive Semantic Aggregation Module
3.3 Classification Module
4 Experiments
4.1 Dataset
4.2 Evaluation Protocol
4.3 Implementation Details
4.4 Experimental Results
5 Conclusion
References
Lightweight Multiview Mask Contrastive Network for Small-Sample Hyperspectral Image Classification
1 Introduction
2 Proposed Method
2.1 SVE Module
2.2 PVE Module
2.3 Lightweight Multiview Mask Contrastive Network
3 Experimental Results and Analysis
3.1 Datasets
3.2 Comparative Experiment
3.3 Ablation Study
4 Conclusion
References
Dim Moving Target Detection Based on Imaging Uncertainty Analysis and Hybrid Entropy
1 Introduction
2 Theoretical and Method
2.1 Uncertainty Model
2.2 Method Procedure
3 Experiment and Analysis
3.1 Evaluation Metrics
3.2 Datasets
3.3 Comparison of Detection Performance
4 Conclusion
References
Author Index


LNCS 14428

Qingshan Liu · Hanzi Wang · Zhanyu Ma · Weishi Zheng · Hongbin Zha · Xilin Chen · Liang Wang · Rongrong Ji (Eds.)

Pattern Recognition and Computer Vision
6th Chinese Conference, PRCV 2023
Xiamen, China, October 13–15, 2023
Proceedings, Part IV

Lecture Notes in Computer Science

Founding Editors
Gerhard Goos
Juris Hartmanis

Editorial Board Members
Elisa Bertino, Purdue University, West Lafayette, IN, USA
Wen Gao, Peking University, Beijing, China
Bernhard Steffen, TU Dortmund University, Dortmund, Germany
Moti Yung, Columbia University, New York, NY, USA

14428

The series Lecture Notes in Computer Science (LNCS), including its subseries Lecture Notes in Artificial Intelligence (LNAI) and Lecture Notes in Bioinformatics (LNBI), has established itself as a medium for the publication of new developments in computer science and information technology research, teaching, and education. LNCS enjoys close cooperation with the computer science R & D community, the series counts many renowned academics among its volume editors and paper authors, and collaborates with prestigious societies. Its mission is to serve this international community by providing an invaluable service, mainly focused on the publication of conference and workshop proceedings and postproceedings. LNCS commenced publication in 1973.

Qingshan Liu · Hanzi Wang · Zhanyu Ma · Weishi Zheng · Hongbin Zha · Xilin Chen · Liang Wang · Rongrong Ji
Editors

Pattern Recognition and Computer Vision
6th Chinese Conference, PRCV 2023
Xiamen, China, October 13–15, 2023
Proceedings, Part IV

Editors
Qingshan Liu, Nanjing University of Information Science and Technology, Nanjing, China
Hanzi Wang, Xiamen University, Xiamen, China
Zhanyu Ma, Beijing University of Posts and Telecommunications, Beijing, China
Weishi Zheng, Sun Yat-sen University, Guangzhou, China
Hongbin Zha, Peking University, Beijing, China
Xilin Chen, Chinese Academy of Sciences, Beijing, China
Liang Wang, Chinese Academy of Sciences, Beijing, China
Rongrong Ji, Xiamen University, Xiamen, China

ISSN 0302-9743  ISSN 1611-3349 (electronic)
Lecture Notes in Computer Science
ISBN 978-981-99-8461-9  ISBN 978-981-99-8462-6 (eBook)
https://doi.org/10.1007/978-981-99-8462-6

© The Editor(s) (if applicable) and The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2024

This work is subject to copyright. All rights are reserved by the Publisher, whether the whole or part of the material is concerned, specifically the rights of translation, reprinting, reuse of illustrations, recitation, broadcasting, reproduction on microfilms or in any other physical way, and transmission or information storage and retrieval, electronic adaptation, computer software, or by similar or dissimilar methodology now known or hereafter developed.

The use of general descriptive names, registered names, trademarks, service marks, etc. in this publication does not imply, even in the absence of a specific statement, that such names are exempt from the relevant protective laws and regulations and therefore free for general use.

The publisher, the authors, and the editors are safe to assume that the advice and information in this book are believed to be true and accurate at the date of publication. Neither the publisher nor the authors or the editors give a warranty, expressed or implied, with respect to the material contained herein or for any errors or omissions that may have been made. The publisher remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

This Springer imprint is published by the registered company Springer Nature Singapore Pte Ltd.
The registered company address is: 152 Beach Road, #21-01/04 Gateway East, Singapore 189721, Singapore

Paper in this product is recyclable.

Preface

Welcome to the proceedings of the Sixth Chinese Conference on Pattern Recognition and Computer Vision (PRCV 2023), held in Xiamen, China. PRCV is formed from the combination of two distinguished conferences: CCPR (Chinese Conference on Pattern Recognition) and CCCV (Chinese Conference on Computer Vision). Both have consistently been the top-tier conference in the fields of pattern recognition and computer vision within China's academic field. Recognizing the intertwined nature of these disciplines and their overlapping communities, the union into PRCV aims to reinforce the prominence of the Chinese academic sector in these foundational areas of artificial intelligence and enhance academic exchanges. Accordingly, PRCV is jointly sponsored by China's leading academic institutions: the Chinese Association for Artificial Intelligence (CAAI), the China Computer Federation (CCF), the Chinese Association of Automation (CAA), and the China Society of Image and Graphics (CSIG).

PRCV's mission is to serve as a comprehensive platform for dialogues among researchers from both academia and industry. While its primary focus is to encourage academic exchange, it also places emphasis on fostering ties between academia and industry. With the objective of keeping abreast of leading academic innovations and showcasing the most recent research breakthroughs, pioneering thoughts, and advanced techniques in pattern recognition and computer vision, esteemed international and domestic experts have been invited to present keynote speeches, introducing the most recent developments in these fields.

PRCV 2023 was hosted by Xiamen University. From our call for papers, we received 1420 full submissions. Each paper underwent rigorous reviews by at least three experts, either from our dedicated Program Committee or from other qualified researchers in the field. After thorough evaluations, 522 papers were selected for the conference, comprising 32 oral presentations and 490 posters, giving an acceptance rate of 37.46%. The proceedings of PRCV 2023 are proudly published by Springer.

Our heartfelt gratitude goes out to our keynote speakers: Zongben Xu from Xi'an Jiaotong University, Yanning Zhang of Northwestern Polytechnical University, Shutao Li of Hunan University, Shi-Min Hu of Tsinghua University, and Tiejun Huang from Peking University.

We give sincere appreciation to all the authors of submitted papers, the members of the Program Committee, the reviewers, and the Organizing Committee. Their combined efforts have been instrumental in the success of this conference. A special acknowledgment goes to our sponsors and the organizers of various special forums; their support made the conference a success. We also express our thanks to Springer for taking on the publication and to the staff of Springer Asia for their meticulous coordination efforts.


We hope these proceedings will be both enlightening and enjoyable for all readers. October 2023

Qingshan Liu Hanzi Wang Zhanyu Ma Weishi Zheng Hongbin Zha Xilin Chen Liang Wang Rongrong Ji

Organization

General Chairs
Hongbin Zha, Peking University, China
Xilin Chen, Institute of Computing Technology, Chinese Academy of Sciences, China
Liang Wang, Institute of Automation, Chinese Academy of Sciences, China
Rongrong Ji, Xiamen University, China

Program Chairs
Qingshan Liu, Nanjing University of Information Science and Technology, China
Hanzi Wang, Xiamen University, China
Zhanyu Ma, Beijing University of Posts and Telecommunications, China
Weishi Zheng, Sun Yat-sen University, China

Organizing Committee Chairs
Mingming Cheng, Nankai University, China
Cheng Wang, Xiamen University, China
Yue Gao, Tsinghua University, China
Mingliang Xu, Zhengzhou University, China
Liujuan Cao, Xiamen University, China

Publicity Chairs
Yanyun Qu, Xiamen University, China
Wei Jia, Hefei University of Technology, China

Local Arrangement Chairs
Xiaoshuai Sun, Xiamen University, China
Yan Yan, Xiamen University, China
Longbiao Chen, Xiamen University, China

International Liaison Chairs
Jingyi Yu, ShanghaiTech University, China
Jiwen Lu, Tsinghua University, China

Tutorial Chairs
Xi Li, Zhejiang University, China
Wangmeng Zuo, Harbin Institute of Technology, China
Jie Chen, Peking University, China

Thematic Forum Chairs
Xiaopeng Hong, Harbin Institute of Technology, China
Zhaoxiang Zhang, Institute of Automation, Chinese Academy of Sciences, China
Xinghao Ding, Xiamen University, China

Doctoral Forum Chairs
Shengping Zhang, Harbin Institute of Technology, China
Zhou Zhao, Zhejiang University, China

Publication Chair
Chenglu Wen, Xiamen University, China

Sponsorship Chair
Yiyi Zhou, Xiamen University, China

Exhibition Chairs
Bineng Zhong, Guangxi Normal University, China
Rushi Lan, Guilin University of Electronic Technology, China
Zhiming Luo, Xiamen University, China

Program Committee
Baiying Lei, Shenzhen University, China
Changxin Gao, Huazhong University of Science and Technology, China
Chen Gong, Nanjing University of Science and Technology, China
Chuanxian Ren, Sun Yat-Sen University, China
Dong Liu, University of Science and Technology of China, China
Dong Wang, Dalian University of Technology, China
Haimiao Hu, Beihang University, China
Hang Su, Tsinghua University, China
Hui Yuan, School of Control Science and Engineering, Shandong University, China
Jie Qin, Nanjing University of Aeronautics and Astronautics, China
Jufeng Yang, Nankai University, China
Lifang Wu, Beijing University of Technology, China
Linlin Shen, Shenzhen University, China
Nannan Wang, Xidian University, China
Qianqian Xu, Key Laboratory of Intelligent Information Processing, Institute of Computing Technology, Chinese Academy of Sciences, China
Quan Zhou, Nanjing University of Posts and Telecommunications, China
Si Liu, Beihang University, China
Xi Li, Zhejiang University, China
Xiaojun Wu, Jiangnan University, China
Zhenyu He, Harbin Institute of Technology (Shenzhen), China
Zhonghong Ou, Beijing University of Posts and Telecommunications, China

Contents – Part IV

Pattern Classification and Cluster Analysis

Shared Nearest Neighbor Calibration for Few-Shot Classification
  Rundong Qi, Sa Ning, Yong Jiang, Yuwei Zhang, and Wenyu Yang (p. 3)
Prototype Rectification with Region-Wise Foreground Enhancement for Few-Shot Classification
  Rundong Qi, Sa Ning, and Yong Jiang (p. 15)
Rotation Augmented Distillation for Exemplar-Free Class Incremental Learning with Detailed Analysis
  Xiuwei Chen and Xiaobin Chang (p. 27)
Nonconvex Tensor Hypergraph Learning for Multi-view Subspace Clustering
  Xue Yao and Min Li (p. 39)
A Novel Method for Identifying Bipolar Disorder Based on Diagnostic Texts
  Hua Gao, Li Chen, Yi Zhou, Kaikai Chi, and Sixian Chan (p. 52)
Deep Depression Detection Based on Feature Fusion and Result Fusion
  Hua Gao, Yi Zhou, Li Chen, and Kaikai Chi (p. 64)
Adaptive Cluster Assignment for Unsupervised Semantic Segmentation
  Shengqi Li, Qing Liu, Chaojun Zhang, and Yixiong Liang (p. 75)
Confidence-Guided Open-World Semi-supervised Learning
  Jibang Li, Meng Yang, and Mao Feng (p. 87)
SSCL: Semi-supervised Contrastive Learning for Industrial Anomaly Detection
  Wei Cai and Jiechao Gao (p. 100)
One Step Large-Scale Multi-view Subspace Clustering Based on Orthogonal Matrix Factorization with Consensus Graph Learning
  Xinrui Zhang, Kai Li, and Jinjia Peng (p. 113)
Deep Multi-task Image Clustering with Attention-Guided Patch Filtering and Correlation Mining
  Zhongyao Tian, Kai Li, and Jinjia Peng (p. 126)
Deep Structure and Attention Aware Subspace Clustering
  Wenhao Wu, Weiwei Wang, and Shengjiang Kong (p. 139)
Broaden Your Positives: A General Rectification Approach for Novel Class Discovery
  Yaqi Cai, Nan Pu, Qi Jia, Weimin Wang, and Yu Liu (p. 151)
CE2: A Copula Entropic Mutual Information Estimator for Enhancing Adversarial Robustness
  Lin Liu, Cong Hu, and Xiao-Jun Wu (p. 163)
Two-Step Projection of Sparse Discrimination Between Classes for Unsupervised Domain Adaptation
  Jianhong Xie and Lu Liang (p. 175)
Enhancing Adversarial Robustness via Stochastic Robust Framework
  Zhenjiang Sun, Yuanbo Li, and Cong Hu (p. 187)
Pseudo Labels Refinement with Stable Cluster Reconstruction for Unsupervised Re-identification
  Zhenyu Liu, Jiawei Lian, Jiahua Wu, Da-Han Wang, Yun Wu, Shunzhi Zhu, and Dewu Ge (p. 199)
Ranking Variance Reduced Ensemble Attack with Dual Optimization Surrogate Search
  Zhichao He and Cong Hu (p. 212)

Performance Evaluation and Benchmarks

PCR: A Large-Scale Benchmark for Pig Counting in Real World
  Jieru Jia, Shuorui Zhang, and Qiuqi Ruan (p. 227)
A Hierarchical Theme Recognition Model for Sandplay Therapy
  Xiaokun Feng, Shiyu Hu, Xiaotang Chen, and Kaiqi Huang (p. 241)

Remote Sensing Image Interpretation

Change-Aware Network for Damaged Roads Recognition and Assessment Based on Multi-temporal Remote Sensing Imageries
  Jiaxin Chen, Ming Wu, Haotian Yan, Binzhu Xie, and Chuang Zhang (p. 255)
UAM-Net: An Attention-Based Multi-level Feature Fusion UNet for Remote Sensing Image Segmentation
  Yiwen Cao, Nanfeng Jiang, Da-Han Wang, Yun Wu, and Shunzhi Zhu (p. 267)
Improved Conditional Generative Adversarial Networks for SAR-to-Optical Image Translation
  Tao Zhan, Jiarong Bian, Jing Yang, Qianlong Dang, and Erlei Zhang (p. 279)
A Novel Cross Frequency-Domain Interaction Learning for Aerial Oriented Object Detection
  Weijie Weng, Weiming Lin, Feng Lin, Junchi Ren, and Fei Shen (p. 292)
DBDAN: Dual-Branch Dynamic Attention Network for Semantic Segmentation of Remote Sensing Images
  Rui Che, Xiaowen Ma, Tingfeng Hong, Xinyu Wang, Tian Feng, and Wei Zhang (p. 306)
Multi-scale Contrastive Learning for Building Change Detection in Remote Sensing Images
  Mingliang Xue, Xinyuan Huo, Yao Lu, Pengyuan Niu, Xuan Liang, Hailong Shang, and Shucai Jia (p. 318)
Shadow Detection of Remote Sensing Image by Fusion of Involution and Shunted Transformer
  Yifan Wang, Jianlin Wang, Xian Huang, Tong Zhou, Wenjun Zhou, and Bo Peng (p. 330)
Few-Shot Infrared Image Classification with Partial Concept Feature
  Jinyu Tan, Ruiheng Zhang, Qi Zhang, Zhe Cao, and Lixin Xu (p. 343)
AGST-LSTM: The ConvLSTM Model Combines Attention and Gate Structure for Spatiotemporal Sequence Prediction Learning
  Xuechang Wang, Hui Lv, and Jiawei Chen (p. 355)
A Shape-Based Quadrangle Detector for Aerial Images
  Chaofan Rao, Wenbo Li, Xingxing Xie, and Gong Cheng (p. 368)
End-to-End Unsupervised Style and Resolution Transfer Adaptation Segmentation Model for Remote Sensing Images
  Zhengwei Li and Xili Wang (p. 380)
A Physically Feasible Counter-Attack Method for Remote Sensing Imaging Point Clouds
  Bo Wei, Huanchun Wei, Cong Cao, Teng Huang, Huagang Xiong, Aobo Lang, Xiqiu Zhang, and Haiqing Zhang (p. 394)
Adversarial Robustness via Multi-experts Framework for SAR Recognition with Class Imbalanced
  Chuyang Lin, Senlin Cai, Hailiang Huang, Xinghao Ding, and Yue Huang (p. 405)
Recognizer Embedding Diffusion Generation for Few-Shot SAR Recognization
  Ying Xu, Chuyang Lin, Yijin Zhong, Yue Huang, and Xinghao Ding (p. 418)
A Two-Stage Federated Learning Framework for Class Imbalance in Aerial Scene Classification
  Zhengpeng Lv, Yihong Zhuang, Gang Yang, Yue Huang, and Xinghao Ding (p. 430)
SAR Image Authentic Assessment with Bayesian Deep Learning and Counterfactual Explanations
  Yihan Zhuang and Zhongling Huang (p. 442)
Circle Representation Network for Specific Target Detection in Remote Sensing Images
  Xiaoyu Yang, Yun Ge, Haokang Peng, and Lu Leng (p. 455)
A Transformer-Based Adaptive Semantic Aggregation Method for UAV Visual Geo-Localization
  Shishen Li, Cuiwei Liu, Huaijun Qiu, and Zhaokui Li (p. 465)
Lightweight Multiview Mask Contrastive Network for Small-Sample Hyperspectral Image Classification
  Minghao Zhu, Heng Wang, Yuebo Meng, Zhe Shan, and Zongfang Ma (p. 478)
Dim Moving Target Detection Based on Imaging Uncertainty Analysis and Hybrid Entropy
  Erwei Zhao, Zixu Huang, and Wei Zheng (p. 491)

Author Index (p. 503)

Pattern Classification and Cluster Analysis

Shared Nearest Neighbor Calibration for Few-Shot Classification

Rundong Qi, Sa Ning, Yong Jiang(B), Yuwei Zhang, and Wenyu Yang

School of Computer Science and Technology, Southwest University of Science and Technology, Mianyang 621010, China
jiang [email protected]

Abstract. Few-shot classification aims to classify query samples using very few labeled examples. Most existing methods follow the Prototypical Network and classify query samples by matching them to the nearest centroid. However, scarce labeled examples tend to bias the centroids, which leads to query samples matching the wrong centroids. In this paper, we address the mismatch between centroids and query samples by optimizing the matching strategy. The idea is to combine Shared Nearest Neighbor similarity with cosine similarity proportionally to calibrate the matching results of the latter. Furthermore, we also improve a bias-diminishing approach to increase the number of shared nearest neighbors between query samples and the centroid of their class. We validate the effectiveness of our method with extensive experiments on three few-shot classification datasets: miniImageNet, tieredImageNet, and CUB-200-2011 (CUB). Moreover, our method achieves competitive performance across different settings and datasets.

Keywords: Few-shot classification · metric learning · transductive

(This study is supported by the Sichuan Science and Technology Program, No. 2021YFG0031.)

1 Introduction

Few-shot classification [1–9] aims to classify unseen samples with few labeled examples. This is a challenging problem because learning a neural network with few examples is prone to overfitting. A mainstream framework for few-shot classification is based on meta-learning, or learning-to-learn. Among the studies that have achieved remarkable results, metric-based methods adopting the meta-learning framework have attracted much research attention due to their malleability and effectiveness.

Metric-based few-shot classification methods [3–6] mostly consist of a feature encoder E and a metric function M. Given a few-shot task consisting of labeled examples (the support set Xs) and unlabeled samples (the query set Xq), E extracts their features. M calculates the centroid of each class and predicts the categories of the unlabeled query samples by matching each of them to the nearest centroid (the nearest-centroid matching strategy). This supposes that each query sample and the centroid closest to it belong to the same class. However, since there are very few support examples, the obtained centroids are inevitably biased, which prevents some query samples from being correctly classified.

Fig. 1. Problem formulation.

As illustrated in Fig. 1, there are only three labeled examples in each class, and the centroids derived from them can hardly satisfy the assumption of the nearest-centroid matching strategy. As a result, the query sample "?" located at the boundary between the blue and green classes is assigned a blue label because it is closer to the blue centroid, while its true label is green. Hence, the matching strategy between query samples and centroids needs to be optimized.

Many transductive inference methods have been proposed to optimize the matching strategy with task-level information [10–14]. BD-CSPN [10] proposes and verifies the existence of intra-class bias and cross-class bias, and diminishes these two biases using label propagation and feature shifting, respectively. ProtoCom [12] utilizes the extra primitive knowledge of categories and visual attribute features to learn to complement the original centroids. LaplacianShot [13] minimizes a Laplacian-regularization objective to encourage neighboring unlabeled samples to have potentially identical label assignments. RDC-FT [14] uses the ranking of query samples in a task to calibrate the pairwise distances. However, none of them utilizes Shared Nearest Neighbors (SNN) to improve the matching strategy.

Our key observation is that the neighborhood overlap of two points in the same category tends to be larger than that of two points from different categories. As illustrated in Fig. 1, if we find the k-nearest neighbors among all centroids and query samples for the query sample "?", the blue centroid, and the green centroid, respectively, we will find that the centroid sharing the greatest number of nearest neighbors with the query sample "?" is the green one. SNN measures similarity by the overlap between the neighborhoods of two samples. Thus, we proportionally combine the SNN similarity with cosine similarity to calibrate the biased results of the nearest-centroid matching strategy. Nevertheless, due to the diversity of image feature distributions, the weight between SNN and cosine similarity may require careful and tedious hand-tuning. Because of this, we develop a learning-to-learn algorithm to search for the weight hyperparameters. However, the centroids are still biased, which decreases the number of nearest neighbors shared between centroids and query samples of the same category. To further diminish the bias, we also improve a centroid rectification method proposed in [10].

Fig. 2. Method overview. ⊕ denotes a weighted summation.

Our three main contributions are summarized below:
– To the best of our knowledge, we introduce SNN into few-shot classification for the first time. Our work shows that SNN can help to improve classification performance.
– We combine SNN and cosine similarity proportionally to calibrate the matching results of the latter. Moreover, we adopt a learning-to-learn approach to search for the weight hyperparameters. We also develop a bias-diminishing method to increase the shared nearest neighbors between query samples and the centroid of the same class.
– We conduct extensive experiments and ablation studies on four few-shot classification benchmarks (i.e., miniImageNet, tieredImageNet, CUB, and mini → CUB). The experimental results show that our method can achieve competitive performance across different settings and datasets.

2 Related Work

Few-Shot Classification. The key idea of few-shot classification is to identify unlabeled samples with few labeled examples. An early attempt [2] introduces the concept of one-shot learning and proposes a variational Bayesian framework to solve this issue. Recently, researchers have made many attempts to address the problem of few-shot classification. There are two main streams of existing few-shot classification methods: optimization-based methods and metric-based methods. Optimization-based methods [15–19] aim to improve the model's generalization ability, enabling it to adapt to unseen few-shot classification tasks efficiently. Metric-based methods predict categories for unlabeled query samples by calculating the similarity between unlabeled samples and labeled examples.

Metric-based methods are more malleable and effective than optimization-based approaches. Prototypical Network [4] proposes to use the mean vectors (centroids) of each class in the support set to represent the classes and to classify query samples by matching them to the nearest centroid. TADAM [5] proposes a dynamic task-conditional feature extractor and a scaling method for distance metrics. Chen et al. [6] propose Meta-Baseline, which uses meta-learning to optimize the model and assigns labels to query samples by matching them to the nearest centroid. However, the obtained centroids are biased due to the scarcity of labeled data. BD-CSPN [10] figures out the key influencing factors of the centroid generation process and proposes a centroid rectification method. ProtoCom [12] proposes to improve the representative ability of centroids by complementing the original centroids with prior textual knowledge describing the characteristics of the image. RestoreNet [11] develops a regression model to restore prototypes and adds a label classification branch to the feature extractor to exploit the query samples by self-training. LaplacianShot [13] encourages neighboring samples in the feature space to have potentially identical label assignments using a Laplacian regularization term. Very recently, RDC-FT [14] used the ranking of distances in a task to calibrate the pairwise distances.

Shared Nearest Neighbors. Shared Nearest Neighbors (SNN) was first proposed in [20], which defines the number of nearest neighbors common to two points as their similarity; two points are considered similar only when they are among each other's k nearest neighbors. [21] summarizes that when the points of different clusters are pairwise stable and the size of the nearest-neighbor query equals the size of the cluster, the nearest neighbors of a query point tend to be in the same cluster (class) as the query point. [22] finds that shared nearest neighbor similarity performs more steadily in high-dimensional data spaces than the related primary distance measures. In this work, we tackle the problem of biased centroid matching results in few-shot classification using SNN similarity. We note that few studies have used SNN in computer vision; the SNN-based approaches we have found so far address problems in data mining.

3 Method

3.1 Preliminaries

Metric-Based Few-Shot Classification Method. In the standard few-shot setup, we are given two datasets: the base dataset Dbase and the novel dataset Dnovel, which do not intersect. The base dataset Dbase = {x_i, y_i}_{i=1}^{O}, where x_i denotes the i-th sample in Dbase and y_i denotes its label. The goal of few-shot learning is to utilize the experience learned on Dbase to perform few-shot tasks on Dnovel. Each few-shot task T contains N × K + N × Q objects (N-way K-shot with Q query samples per class), where N represents the number of categories, and K and Q denote the number of labeled support examples and unlabeled query samples in a class. The goal of the few-shot task is to use the N × K labeled examples x_s ∈ Xs to classify the N × Q unlabeled samples x_q ∈ Xq.

Metric-based methods usually adopt the meta-learning framework and consist of two training stages: pre-training and meta-learning. In the pre-training stage, we train a convolutional neural network on Dbase and remove the last fully-connected layer to obtain the feature extractor E. In the meta-learning stage, we compute the centroids of the few-shot task and classify the query samples in this task using the metric function M, which consists of a nearest-neighbor classifier and a cosine similarity function. The meta-learning stage can be viewed as adapting E to the validation stage using few-shot tasks sampled from Dbase. Specifically, E extracts the features, and M then takes the extracted features as input and computes the centroid v_i:

v_i = \frac{1}{K} \sum_{x_s \in X_s^i} E(x_s),    (1)

where X_s^i represents the set of support examples in class i. Subsequently, M predicts the probability for each query sample:

p(y = i \mid x_q) = \frac{\exp(\mathrm{sim}\langle E(x_q), v_i \rangle)}{\sum_{j=1}^{N} \exp(\mathrm{sim}\langle E(x_q), v_j \rangle)},    (2)

where sim⟨·,·⟩ denotes the similarity of two vectors. The loss function is the cross-entropy loss. We evaluate the model's performance using few-shot tasks sampled from Cnovel.
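For concreteness, the sketch below implements the centroid computation of Eq. (1) and the cosine-softmax prediction of Eq. (2) with NumPy. It is an illustrative reimplementation under our own naming (not the authors' released code) and assumes the support and query features have already been extracted by E.

```python
import numpy as np

def l2_normalize(x, eps=1e-8):
    return x / (np.linalg.norm(x, axis=-1, keepdims=True) + eps)

def centroids_from_support(support, labels, n_way):
    """Eq. (1): each class centroid is the mean of its K support features."""
    labels = np.asarray(labels)
    return np.stack([support[labels == c].mean(axis=0) for c in range(n_way)])

def cosine_scores(query, centroids):
    """Cosine similarity between every query feature and every centroid."""
    return l2_normalize(query) @ l2_normalize(centroids).T      # shape (N*Q, N)

def predict_probs(query, centroids):
    """Eq. (2): softmax over cosine similarities gives class probabilities."""
    logits = cosine_scores(query, centroids)
    logits -= logits.max(axis=1, keepdims=True)                 # numerical stability
    exp = np.exp(logits)
    return exp / exp.sum(axis=1, keepdims=True)
```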

Shared Nearest Neighbor. The basic form of SNN is "overlap". For a few-shot task, we first integrate the query samples' feature vectors Fq = E(Xq) and the centroids V = {v_1, ..., v_N} into a set O and find the k-nearest neighbors of each object o_i ∈ O according to cosine similarity. The overlap of objects o_i and o_j is computed as follows:

SNN_k(o_i, o_j) = \left| NN_k(o_i) \cap NN_k(o_j) \right|,    (3)

where NN_k(o) ⊆ O represents the k-nearest neighbors of o, k ∈ N+. If o_i and o_j are not both among each other's k-nearest neighbors, SNN_k(o_i, o_j) is set to 0. Moreover, we set k = Q + 1 (1 centroid and Q query features per class).
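A minimal sketch of the overlap in Eq. (3), again under our own naming: knn_sets pools centroids and query features into the set O and ranks neighbors by cosine similarity, and snn_overlap counts the shared neighbors, with a flag for the mutual-kNN restriction that Sect. 3.2 later removes.

```python
import numpy as np

def knn_sets(objects, k):
    """k-nearest-neighbour index set of every object in the pooled set O,
    ranked by cosine similarity; the object itself is excluded."""
    normed = objects / (np.linalg.norm(objects, axis=1, keepdims=True) + 1e-8)
    sim = normed @ normed.T
    np.fill_diagonal(sim, -np.inf)
    order = np.argsort(-sim, axis=1)[:, :k]
    return [set(row) for row in order]

def snn_overlap(i, j, neighbours, mutual=True):
    """Eq. (3): |NN_k(o_i) ∩ NN_k(o_j)|.  With mutual=True the raw definition
    is used (zero unless o_i and o_j are in each other's k-NN lists);
    mutual=False corresponds to the de-restricted variant of Sect. 3.2."""
    if mutual and not (j in neighbours[i] and i in neighbours[j]):
        return 0
    return len(neighbours[i] & neighbours[j])
```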

3.2 Shared Nearest Neighbor Calibration (SNNC)

Mostly, M matches the nearest centroid for each query feature. Although this matching strategy is suitable for most cases, it performs poorly for query features located at the inter-class junctions. To solve this problem, we propose SNNC. As shown in Fig. 2, SNNC consists of a bias-diminishing (BD) module and a similarity calculation function that uses a combination of cosine similarity and SNN similarity to match centroids for query features:

\mathrm{sim}\langle f_q, v_i \rangle = \alpha\,SNN(f_q, v_i) + \beta\,\mathrm{cosine}(f_q, v_i),    (4)

where cosine(·, ·) represents the cosine similarity of the two vectors, f_q ∈ Fq, and α, β ∈ R+. To better balance the two metrics and avoid an exhaustive hand-tuning process, we set α and β as learnable parameters and search for suitable values in the meta-learning stage. Since SNN requires both objects to be in each other's k-nearest neighbors, it leads to a sparse SNN similarity matrix. To fully use the neighborhood overlap information to calibrate the matching results, we remove this restriction from SNN.

Bias Diminishing. Although the matching strategy is improved, the centroids are still biased. Intuitively, the process of bias diminishing can help query samples share more nearest neighbors with the centroid of their category. We follow recent works [10,13] and divide the bias-diminishing process into two steps: cross-domain bias reduction and intra-domain bias reduction. The cross-domain bias b_c is the difference between the mean vector of the centroids (visible domain) and the mean feature vector of the query samples (invisible domain):

b_c = \frac{1}{|V|}\sum_{v_i \in V} v_i - \frac{1}{|F_q|}\sum_{f_q \in F_q} f_q.    (5)

We diminish the cross-domain bias for each query sample: f_q = f_q + b_c. The intra-domain bias is the difference between the expected and the computed centroid. To reduce the intra-domain bias, we rectify the centroids with the cross-domain bias-reduced query features. Previous works [10,13] assign weights according to the prediction score. Unlike them, we only select query samples with a significant matching score difference between the most similar centroid and the second most similar centroid. Specifically, we assign pseudo-labels to query features when the score difference exceeds a threshold t. The matching score W of query feature f_q and centroid v_i is defined as Equation (6):

W(f_q, v_i) = \frac{\exp(\mathrm{cosine}(f_q, v_i))}{\sum_{v_j \in V} \exp(\mathrm{cosine}(f_q, v_j))},    (6)

where v_i ∈ V.
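The calibrated similarity of Eq. (4) amounts to a weighted sum of the SNN overlap and the cosine score over the pooled set O. The self-contained sketch below treats α and β as plain constants (initialized as in Sect. 4.1), whereas the paper learns them during meta-learning; the function names are our own.

```python
import numpy as np

def calibrated_similarity(query_feats, centroids, alpha=0.1, beta=10.0):
    n_way, n_query = len(centroids), len(query_feats)
    k = n_query // n_way + 1                                   # k = Q + 1, as in Sect. 3.1
    pool = np.concatenate([centroids, query_feats], axis=0)    # the set O: centroids, then queries
    normed = pool / (np.linalg.norm(pool, axis=1, keepdims=True) + 1e-8)
    sim = normed @ normed.T
    np.fill_diagonal(sim, -np.inf)
    neighbours = [set(row) for row in np.argsort(-sim, axis=1)[:, :k]]
    cosine = normed[n_way:] @ normed[:n_way].T                 # query-vs-centroid cosine scores
    snn = np.array([[len(neighbours[n_way + q] & neighbours[c]) for c in range(n_way)]
                    for q in range(n_query)], dtype=float)     # de-restricted SNN overlap
    return alpha * snn + beta * cosine                         # Eq. (4)
```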

Table 1. Comparison to prior works on miniImageNet and tieredImageNet. The best results are reported in bold font.

Method                 | mini 5-way 1-shot | mini 5-way 5-shot | tiered 5-way 1-shot | tiered 5-way 5-shot
Meta-Baseline [6]      | 63.17 ± 0.23      | 79.26 ± 0.17      | 68.62 ± 0.27        | 83.29 ± 0.18
BD-CSPN [10]           | 70.31 ± 0.93      | 81.89 ± 0.60      | 78.74 ± 0.95        | 86.92 ± 0.63
ProtoComNet [12]       | 73.13 ± 0.85      | 82.06 ± 0.54      | 81.04 ± 0.89        | 87.42 ± 0.57
LaplacianShot [13]     | 72.11 ± 0.19      | 82.31 ± 0.14      | 78.98 ± 0.21        | 86.39 ± 0.16
EPNet [23]             | 66.50 ± 0.89      | 81.06 ± 0.60      | 76.53 ± 0.87        | 87.32 ± 0.64
RGTransformer [24]     | 71.41 ± 0.91      | 83.46 ± 0.57      | 78.91 ± 0.99        | 86.99 ± 0.64
ALFA+MeTAL [33]        | 66.61 ± 0.28      | 81.43 ± 0.25      | 70.29 ± 0.40        | 86.17 ± 0.35
BaseTransformers [34]  | 70.88 ± 0.17      | 82.37 ± 0.19      | 72.46               | 84.96
MCGN [35]              | 67.32 ± 0.43      | 83.03 ± 0.54      | 71.21 ± 0.85        | 85.98 ± 0.98
SNNC (ours)            | 73.75 ± 0.91      | 83.39 ± 0.59      | 82.13 ± 0.94        | 87.65 ± 0.74

Then we rectify the centroids using the mean feature of the selected query samples:

v_i' = \frac{1}{2}\left(v_i + \frac{1}{|F'_{q,i}|}\sum_{x_j \in F'_{q,i}} E(x_j)\right),    (7)

where F'_{q,i} denotes the features of the query samples with an i-class pseudo-label. We detail the process of intra-domain bias diminution in Algorithm 1.

Algorithm 1. Intra-domain Bias Diminution
Input: threshold t, centroids V, query samples' features Fq
Output: rectified centroids V'
1: Initialize i = 1
2: repeat
3:   Calculate W(f_i, v) using (6), where f_i ∈ Fq and v ∈ V
4:   Find the top two scores: S_1st and S_2nd
5:   if S_1st − S_2nd > t: assign the label of the centroid v attaining S_1st to f_i as a pseudo-label
6:   i = i + 1
7: until i = |Fq|
8: Initialize l = 1
9: repeat
10:  Calculate v_l' using (7)
11:  l = l + 1
12: until l = N
13: V' = {v_1', ..., v_N'}
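Putting Eqs. (5)-(7) together, the whole bias-diminishing step can be sketched as follows. This is our own illustrative reading of Algorithm 1 (not the authors' implementation); the threshold default mirrors the value given in Sect. 4.1.

```python
import numpy as np

def diminish_bias(query_feats, centroids, t=0.005):
    # Cross-domain bias, Eq. (5): shift query features towards the support domain.
    query_feats = query_feats + (centroids.mean(axis=0) - query_feats.mean(axis=0))

    # Intra-domain bias, Algorithm 1: matching scores of Eq. (6) ...
    q = query_feats / (np.linalg.norm(query_feats, axis=1, keepdims=True) + 1e-8)
    c = centroids / (np.linalg.norm(centroids, axis=1, keepdims=True) + 1e-8)
    scores = np.exp(q @ c.T)
    scores /= scores.sum(axis=1, keepdims=True)
    top2 = np.sort(scores, axis=1)[:, -2:]
    confident = (top2[:, 1] - top2[:, 0]) > t       # keep queries with a large top-1/top-2 margin
    pseudo = scores.argmax(axis=1)                  # pseudo-label = most similar centroid

    # ... then rectify each centroid with its confidently pseudo-labelled queries, Eq. (7).
    rectified = centroids.copy()
    for i in range(len(centroids)):
        members = query_feats[confident & (pseudo == i)]
        if len(members) > 0:
            rectified[i] = 0.5 * (centroids[i] + members.mean(axis=0))
    return query_feats, rectified
```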

4 Experiments

4.1 Experimental Setup

Dataset. We conduct few-shot classification experiments on four benchmarks: miniImageNet [3], tieredImageNet [25], CUB-200-2011 (CUB) [26], and miniImageNet → CUB.

Table 2. Comparison to prior works on the CUB and mini → CUB benchmarks. The best results are reported in bold font.

Method             | CUB 5-way 1-shot | CUB 5-way 5-shot | miniImageNet → CUB 5-way 5-shot
RestoreNet [11]    | 76.85 ± 0.95     | –                | –
LaplacianShot [13] | 80.96            | 88.68            | 66.33
RDC-FT [14]        | –                | –                | 67.77 ± 0.4
EPNet [23]         | 82.85 ± 0.81     | 91.32 ± 0.41     | –
NSAE [27]          | –                | –                | 68.51 ± 0.8
Fine-tuning [28]   | –                | –                | 63.76 ± 0.4
CPDE [29]          | 80.11 ± 0.34     | 89.28 ± 0.33     | –
ECKPN [30]         | 77.43 ± 0.54     | 92.21 ± 0.41     | –
SNNC (ours)        | 84.26 ± 0.87     | 90.12 ± 0.45     | 68.77 ± 0.61

Implementation Details. We take an 18-layer ResNet [31] as the backbone network of our method SNNC. In the pre-training phase, we train for 100 epochs on Dbase and use an SGD optimizer with a momentum of 0.9. We set the batch size to 128 and the learning rate to 0.001. Moreover, the weight decay is 0.0005, and standard data augmentation methods are also applied. In the meta-learning phase, we train SNNC for 100 epochs and choose SGD with a momentum of 0.9 as the optimizer. In particular, the learning rate is 0.001, which decays at epochs 30 and 50 with a decay factor of 0.1. The learnable parameters α and β are initialized to 0.1 and 10, while the fixed threshold parameter t is initialized to 0.005.

Evaluation Protocol. To evaluate the performance of our proposed method, we sample 10,000 N-way K-shot classification tasks from Cnovel. We focus on the standard 5-way 1-shot and 5-way 5-shot settings. The average accuracy (in %) over these few-shot tasks is reported with a 95% confidence interval.
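For completeness, here is a minimal sketch of how such N-way K-shot evaluation episodes can be sampled. The data layout (a dict mapping each novel class to its extracted feature array) is our own assumption, not part of the paper.

```python
import numpy as np

def sample_episode(features_by_class, n_way=5, k_shot=5, q_query=15, rng=None):
    """Draw one N-way K-shot episode from pre-extracted novel-class features."""
    rng = np.random.default_rng() if rng is None else rng
    classes = rng.choice(list(features_by_class), size=n_way, replace=False)
    support, s_labels, query, q_labels = [], [], [], []
    for label, cls in enumerate(classes):
        feats = features_by_class[cls]
        idx = rng.choice(len(feats), size=k_shot + q_query, replace=False)
        support.append(feats[idx[:k_shot]])
        s_labels += [label] * k_shot
        query.append(feats[idx[k_shot:]])
        q_labels += [label] * q_query
    return (np.concatenate(support), np.array(s_labels),
            np.concatenate(query), np.array(q_labels))
```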

4.2 Results

General Few-Shot Classification. Following the standard setup, we conduct few-shot experiments on miniImageNet and tieredImageNet, and the results are shown in Table 1. We find that (i) SNNC outperforms ProtoComNet, BD-CSPN, EPNet, and LaplacianShot, which also employ transductive settings for few-shot classification. This demonstrates the effectiveness and advantage of SNNC. (ii) SNNC is 0.07% lower than RGTransformer on the 5-way 5-shot tasks of miniImageNet, and outperforms RGTransformer by 0.6% to 3.2% in the other settings. This shows that SNNC achieves competitive performance across different datasets and settings.

Fine-Grained Few-Shot Classification. We further evaluate the performance of SNNC on CUB [32]. Observing the experimental results in Table 2, we can find that our method achieves the best performance on 5-way 1-shot tasks on CUB.


Table 3. Experiments of SNN on miniImageNet. BD‡ denotes the bias-diminishing method we propose, S denotes the raw SNN, S† denotes the unrestricted SNN, and S∗ denotes the metric we propose.

BD‡ | S | S† | S∗ | 5-way 1-shot | 5-way 5-shot
 ✓  |   |    |    | 70.21 ± 0.97 | 81.93 ± 0.21
 ✓  | ✓ |    |    | 62.45 ± 0.99 | 72.03 ± 0.87
 ✓  |   | ✓  |    | 72.14 ± 0.93 | 82.27 ± 0.62
 ✓  |   |    | ✓  | 73.75 ± 0.91 | 83.39 ± 0.59

Table 4. Experiments of the centroid rectification method on miniImageNet. S∗ denotes the metric we propose, BD denotes the original bias-diminishing method, and BD‡ denotes the bias-diminishing method we improved.

S∗ | BD | BD‡ | 5-way 1-shot | 5-way 5-shot
✓  |    |     | 67.23 ± 0.85 | 81.43 ± 0.57
✓  | ✓  |     | 72.92 ± 0.97 | 82.64 ± 0.62
✓  |    | ✓   | 73.75 ± 0.91 | 83.39 ± 0.59

For 5-way 5-shot tasks on CUB, SNNC outperforms LaplacianShot by 0.6%–1.2% and is on par with the second-best model (EPNet). In general, SNNC performs competitively in fine-grained few-shot classification scenarios.

Cross-Domain Few-Shot Classification. We evaluate the performance of SNNC on 5-way 5-shot tasks in a cross-domain scenario (train on miniImageNet and test on CUB). The results are shown in Table 2, and SNNC achieves the best performance. Specifically, SNNC outperforms LaplacianShot by about 2.4%. This demonstrates that SNNC has competitive domain transfer capability.

4.3 Ablation Study

Are We Using SNN in the Right Way? In Table 3, we disassemble SNNC and verify the effect of each proposed modification on the metric function. The first row shows the performance of a model with the improved bias-diminishing method, and the second row shows the impact of replacing cosine similarity with the raw SNN as the metric. We can see that the model's performance is significantly suppressed in the second row. When we remove the restriction of mutual k-nearest neighbors from the SNN, the model's performance is enhanced. In particular, the model's performance is maximized when we proportionally combine the de-restricted SNN with cosine similarity. Specifically, the fourth row gains at least 1.4% improvement over the first row in both the 1-shot and 5-shot cases.


Can We Obtain a More Accurate Centroid? To demonstrate that our method can obtain a more accurate centroid, we change the bias-diminishing method in SNNC to the original method proposed in BD-CSPN [10]; the results are reported in Table 4. The first row shows the performance of SNNC without bias diminishing, and the second row shows the effect of changing the bias-diminishing method to the original one. Our improved bias-diminishing approach outperforms the original one by about 0.7% in both the 1-shot and the 5-shot cases.

Can SNN Work with Other Distance-Based Metrics? To verify the applicability of SNNC to other distance-based metrics, we reproduce the code of Meta-Baseline using ResNet18 as the backbone network, combine SNN with Euclidean distance proportionally, and search for the weight hyperparameters during meta-learning. The experimental results are shown in Table 5. Our proposed SNNC (Euclid) is much improved compared to Meta-Baseline (Euclid), and this improvement comes from each component of SNNC.

Table 5. Experiments of the distance-based metric on miniImageNet.

Method                 | 5-way 1-shot | 5-way 5-shot
Meta-Baseline (Euclid) | 62.21        | 80.47
Meta-Baseline (Cosine) | 65.34        | 81.33
SNNC (Euclid)          | 69.63        | 82.05
SNNC (Cosine)          | 73.75        | 83.39

5 Conclusion

In this paper, we propose the SNNC method for few-shot classification. The core idea of SNNC is to use the Shared Nearest Neighbor similarity to calibrate the matching results of the cosine similarity. SNNC effectively weakens the impact of data scarcity in two aspects: matching-result calibration and bias diminishment. Extensive experiments show the effectiveness of our method; either the metric, the bias-diminishing module, or the combination of both helps to improve classification performance.

References 1. Lake, B.M., Salakhutdinov, R., Tenenbaum, J.B.: Human-level concept learning through probabilistic program induction. Science 350(6266), 1332–1338 (2015) 2. Fei-Fei, L., Fergus, R., Perona, P.: One-shot learning of object categories. IEEE Trans. Pattern Anal. Mach. Intell. 28(4), 594–611 (2006)


3. Vinyals, O., Blundell, C., Lillicrap, T., Wierstra, D., et al.: Matching networks for one shot learning. In: Advances in Neural Information Processing Systems, vol. 29 (2016) 4. Snell, J., Swersky, K., Zemel, R.: Prototypical networks for few-shot learning. In: Advances in Neural Information Processing Systems, vol. 30 (2017) 5. Oreshkin, B., L´ opez, P.R., Lacoste, A.: TADAM: task dependent adaptive metric for improved few-shot learning. In: Advances in Neural Information Processing Systems, vol. 31 (2018) 6. Chen, Y., Liu, Z., Xu, H., Darrell, T., Wang, X.: Meta-baseline: exploring simple meta-learning for few-shot learning In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 9062–9071 (2021) 7. Gao, Z., Wu, Y., Jia, Y., Harandi, M.: Curvature generation in curved spaces for few-shot learning. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 8691–8700 (2021) 8. Wu, Y., et al.: Object-aware long-short-range spatial alignment for few-shot finegrained image classification. arXiv preprint arXiv:2108.13098 (2021) 9. Wu, Z., Li, Y., Guo, L., Jia, K.: PARN: Position-aware relation networks for fewshot learning. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 6659–6667 (2019) 10. Liu, J., Song, L., Qin, Y.: Prototype rectification for few-shot learning. In: Vedaldi, A., Bischof, H., Brox, T., Frahm, J.-M. (eds.) ECCV 2020. LNCS, vol. 12346, pp. 741–756. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-58452-8 43 11. Xue, W., Wang, W.: One-shot image classification by learning to restore prototypes. Proc. AAAI Conf. Artif. Intell. 34, 6558–6565 (2020) 12. Zhang, B., Li, X., Ye, Y., Huang, Z., Zhang, L.: Prototype completion with primitive knowledge for few-shot learning. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 3754–3762 (2021) 13. Ziko, I., Dolz, J., Granger, E., Ayed, I.B.: Laplacian regularized few-shot learning. In: International Conference on Machine Learning, pp. 11660–11670. PMLR (2020) 14. Li, P., Gong, S., Wang, C., Fu, Y.: Ranking distance calibration for cross-domain few-shot learning. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 9099–9108 (2022) 15. Finn, C., Abbeel, P., Levine, S.: Model-agnostic meta-learning for fast adaptation of deep networks. In: International Conference on Machine Learning, pp. 1126– 1135. PMLR (2017) 16. Lee, K., Maji, S., Ravichandran, A., Soatto, S.: Meta-learning with differentiable convex optimization. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 10-657–10-665 (2019) 17. Tian, Y., Wang, Y., Krishnan, D., Tenenbaum, J.B., Isola, P.: Rethinking few-shot image classification: a good embedding is all you need? In: Vedaldi, A., Bischof, H., Brox, T., Frahm, J.-M. (eds.) ECCV 2020. LNCS, vol. 12359, pp. 266–282. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-58568-6 16 18. Nichol, A., Schulman, J.: Reptile: a scalable metalearning algorithm. arXiv preprint arXiv:1803.02999, vol. 2, no. 3, p. 4 (2018) 19. Lee, K., Maji, S., Ravichandran, A., Soatto, S.: Meta-learning with differentiable convex optimization. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 10-657–10-665 (2019) 20. Jarvis, R.A., Patrick, E.A.: Clustering using a similarity measure based on shared near neighbors. IEEE Trans. Comput. 100(11), 1025–1034 (1973)


21. Bennett, K.P., Fayyad, U., Geiger, D.: Density-based indexing for approximate nearest-neighbor queries. In: Proceedings of the Fifth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 233–243 (1999) 22. Houle, M.E., Kriegel, H.-P., Kr¨ oger, P., Schubert, E., Zimek, A.: Can sharedneighbor distances defeat the curse of dimensionality? In: Gertz, M., Lud¨ ascher, B. (eds.) SSDBM 2010. LNCS, vol. 6187, pp. 482–500. Springer, Heidelberg (2010). https://doi.org/10.1007/978-3-642-13818-8 34 23. Rodr´ıguez, P., Laradji, I., Drouin, A., Lacoste, A.: Embedding propagation: smoother manifold for few-shot classification. In: Vedaldi, A., Bischof, H., Brox, T., Frahm, J.-M. (eds.) ECCV 2020. LNCS, vol. 12371, pp. 121–138. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-58574-7 8 24. Jiang, B., Zhao, K., Tang, J.: Rgtransformer: region-graph transformer for image representation and few-shot classification. IEEE Signal Process. Lett. 29, 792–796 (2022) 25. Ren, M., et al.: Meta-learning for semi-supervised few-shot classification. arXiv preprint arXiv:1803.00676 (2018) 26. Wah, C., Branson, S., Welinder, P., Perona, P., Belongie, S.: The Caltech-UCSD birds-200-2011 dataset (2011) 27. Liang, H., Zhang, Q., Dai, P., Lu, J.: Boosting the generalization capability in cross-domain few-shot learning via noise-enhanced supervised autoencoder. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 9424–9434 (2021) 28. Guo, Y., et al.: A broader study of cross-domain few-shot learning. In: Vedaldi, A., Bischof, H., Brox, T., Frahm, J.-M. (eds.) ECCV 2020. LNCS, vol. 12372, pp. 124–141. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-58583-9 8 29. Zou, Y., Zhang, S., Chen, K., Tian, Y., Wang, Y., Moura, J.M.F.: Compositional few-shot recognition with primitive discovery and enhancing. In: Proceedings of the 28th ACM International Conference on Multimedia, pp. 156–164 (2020) 30. Chen, C., Yang, X., Xu, C., Huang, X., Ma, Z.: ECKPN: explicit class knowledge propagation network for transductive few-shot learning. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 6596– 6605 (2021) 31. He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 770–778 (2016) 32. Hilliard, N., Phillips, L., Howland, S., Yankov, A., Corley, C.D., Hodas, N.O.: Few-shot learning with metric-agnostic conditional embeddings. arXiv preprint arXiv:1802.04376 (2018) 33. Baik, S., Choi, J., Kim, H., Cho, D., Min, J., Lee, K.M.: Meta-learning with taskadaptive loss function for few-shot learning. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 9465–9474 (2021) 34. Maniparambil, M., McGuinness, K., O’Connor, N.: Basetransformers: attention over base data-points for one shot learning. arXiv preprint arXiv:2210.02476 (2022) 35. Tang, S., Chen, D., Bai, L., Liu, K., Ge, Y., Ouyang, W.: Mutual CRF-GNN for few-shot learning. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 2329–2339 (2021)

Prototype Rectification with Region-Wise Foreground Enhancement for Few-Shot Classification Rundong Qi, Sa Ning, and Yong Jiang(B) School of Computer Science and Technology, Southwest University of Science and Technology, Mianyang 621010, China jiang [email protected]

Abstract. Few-Shot Classification is a challenging problem as it uses only very few labeled examples to assign labels to query samples. The Prototypical Network effectively addresses the issue by matching the nearest mean-based prototype for each query sample. However, the scarce labeled examples limit the ability of a model to represent the data distribution, which in turn biases the computed prototypes. In this paper, we propose a transductive bias-diminishing method based on spatial similarity, which consists of a region-wise foreground enhancement (RFE) module and a prototype rectification (PR) module. RFE reconstructs the query sample's features to emphasize the discriminative parts while preserving local region consistency. PR uses the difference in prediction scores between the nearest and second nearest prototypes of the enhanced query samples to rectify the prototypes by label propagation. We validate the effectiveness of our method with extensive experiments on four Few-Shot Classification datasets: miniImageNet, tieredImageNet, Stanford Dogs, and CUB. Our method achieves competitive performance across different settings and datasets.

Keywords: Few-Shot Classification · metric learning · transductive

1 Introduction

Few-Shot Classification (FSC) has recently gained widespread attention thanks to its imitation of the human ability to learn new things [1,2]. FSC aims to improve the generalization ability of a model, allowing it to generalize efficiently to new classes when only very few labeled examples are provided. In FSC, we aim to utilize the prior knowledge learned on base classes with abundant labeled examples to classify samples in novel classes with scarce labeled examples. The classification process is performed on N-way K-shot few-shot tasks, where each task contains N classes with K labeled examples (support set) and some unlabeled samples (query set) to be classified. This study is supported by the Sichuan Science and Technology Program (NO. 2021YFG0031).


Researchers have proposed many methods [3–7] to perform few-shot tasks, among which metric-based methods attracted considerable attention due to their simplicity and effectiveness. Metric-based methods aim to learn a transferable embedding feature space and predict labels by measuring the correlation between samples. For example, Prototypical Network [4] matches the nearest prototype for each query sample and assumes that the prototype shares the same label with the query sample. However, with sparse data, prototypes are always biased. As a result, the representation ability of prototypes is also weakened. Many existing studies have attempted to improve the representation ability of prototypes. TAFSSL [8] proposes to remove noisy attributes utilizing feature space projection and rectifies the prototypes in a label propagation manner. BD-CSPN [9] proposes to rectify prototypes by diminishing cross-class bias and intra-class bias in two steps, where cross-class bias is reduced by eliminating the average feature difference between the support and query sets, and intraclass bias is diminished by updating prototypes by query samples with pseudo labels and weights. Nevertheless, these two methods do not fully exploit the correlation among the features’ spatial locations. Some fine-grained classification studies [10,17,19] use spatial similarity to reduce the difference in discriminative object locations between the labeled and unlabeled instances. For example, BSFA [10] uses the spatial similarity between the query and the support features to reconstruct the feature maps of the support examples for aligning the discriminative objects’ locations. However, the calculation of spatial similarity is highly susceptible to noisy deep local descriptors, impacting the consistency of regions within the features. In this paper, we propose a method to improve the representation ability of prototypes using spatial similarity, which consists of a region-wise foreground enhancement (RFE) module and a prototype rectification (PR) module. RFE reconstructs the features of query samples, which divides the feature of each query sample into several connected regions according to the spatial similarity between deep local descriptors and assigns equal weight to all deep local descriptors within the corresponding area. PR uses the difference in predicted scores between the nearest and second nearest prototypes of the enhanced query samples to calculate the weights in prototype updates for improving the prototypes’ representation ability. Moreover, to reduce the noise attributes in the feature space, we use the Principal Component Analysis (PCA) to search for a task-adaptive subspace during the test [8]. Our three main contributions are summarized below: – We develop an FSC framework that can be optimized end-to-end, with only image-level labels available. – Two well-designed modules are instantiated in our framework to implement region-wise foreground enhancement and prototype rectification, i.e., regionwise foreground enhancement module, and prototype rectification module. – We conduct extensive experiments and ablation studies on the four FSC benchmarks. The experimental results show that our method can achieve competitive performance across different settings and datasets.


Fig. 1. Framework overview.

2 Method

2.1 Preliminaries

For the standard setting of FSC, two datasets are provided: the base classes dataset (denoted as D_base) and the novel classes dataset (denoted as D_novel). D_base = {(x_j, y_j)}_{j=1}^{W} consists of W labeled images, where each image x_j is labeled with a class y_j ∈ C_base (C_base represents the set of base classes). FSC aims to transfer the knowledge learned on D_base to perform few-shot tasks on D_novel. A few-shot task contains N classes, each with K labeled examples (support set S) and some unlabeled samples (query set Q). The goal of a few-shot task is to classify the unlabeled samples in Q using the labeled examples in S. Denote the set of novel classes as C_novel; there is no intersection between C_novel and C_base. For transductive FSC, we mark the set of query images x ∈ Q as I. We aim to learn a task-specific classifier with the help of the query images I, the support set S, and the prior knowledge learned on D_base. In contrast, inductive FSC can only learn classifiers using S and D_base. We focus on transductive FSC in the following subsections to present our approach.
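For readers unfamiliar with the episodic protocol, the following sketch shows one way an N-way K-shot task could be sampled from a labeled pool; the helper name and the 15-query default are illustrative assumptions, not part of the paper.

```python
import random
from collections import defaultdict

def sample_episode(dataset, n_way=5, k_shot=1, q_queries=15):
    # dataset: list of (image, label) pairs from the novel (or base) classes.
    by_class = defaultdict(list)
    for x, y in dataset:
        by_class[y].append(x)
    classes = random.sample(list(by_class), n_way)
    support, query = [], []
    for episode_label, c in enumerate(classes):
        samples = random.sample(by_class[c], k_shot + q_queries)
        support += [(x, episode_label) for x in samples[:k_shot]]
        query += [(x, episode_label) for x in samples[k_shot:]]
    return support, query  # S and Q for one N-way K-shot task
```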

2.2 Overall Framework

As shown in Fig. 1, our framework consists of three stages: pre-training, meta-learning, and testing.

Pre-training. In this stage, we train a convolutional neural network f_θ with parameters θ by minimizing the classification loss on D_base. We use the Cross-Entropy Loss (CE Loss) as the loss function. Then, we remove the last fully connected (FC) layer of f_θ and take the remainder as the feature encoder E. Following Meta-Baseline [6], we do not freeze E or introduce any additional parameters.

Meta-learning. We propose a region-wise foreground enhancement (RFE) module and a prototype rectification (PR) module. RFE learns to enhance the query samples in an episodic training paradigm, and PR rectifies prototypes using the reconstructed query samples. The purpose of such a design is to learn


task-agnostic meta-knowledge from base classes about how to regionally enhance the foreground objects in query features for rectifying prototypes, and then apply this meta-knowledge to novel classes to obtain more reliable prototypes. The main details of RFE and PR will be elaborated in Sect. 2.3 and Sect. 2.4, respectively.

As depicted in the workflow of Fig. 1, we first construct N-way K-shot tasks (called episodes) from D_base to mimic the test settings. We then train the above framework to let RFE learn how to enhance the query samples by minimizing the negative log-likelihood on Q:

\min_{\theta,\alpha} \; \mathbb{E}_{(S,Q)\in T} \sum_{(x,y)\in Q} -\log P(y \mid x, S, I, \theta, \alpha),   (1)

where T denotes the set of constructed N-way K-shot tasks, and α denotes the learnable parameter of RFE. Next we present the procedure for calculating the category probability P(y | x, S, I, θ, α). For each task, we first calculate the prototypes:

p_i = \frac{1}{K} \sum_{x \in S_i} f_\theta(x),   (2)

where S_i denotes the labeled examples of class i. Subsequently, we extract the query features F = f_θ(I). Then, RFE is applied to enhance the query features:

\hat{F} = RFE(F, p, \alpha),   (3)

where p = {p_1, ..., p_N}. Further, to obtain more reliable prototypes, we rectify the prototypes using the PR module:

\hat{p} = PR(\hat{F}, p),   (4)

where \hat{p} = {\hat{p}_1, ..., \hat{p}_N}. The probability of each query sample x ∈ Q belonging to class z is estimated according to the similarity between its enhanced feature and the rectified prototypes \hat{p}. That is,

P(y = z \mid x, S, I, \theta, \alpha) = \frac{\exp(\cos\langle RFE(f_\theta(x), p, \alpha), \hat{p}_z \rangle \cdot \tau)}{\sum_{o} \exp(\cos\langle RFE(f_\theta(x), p, \alpha), \hat{p}_o \rangle \cdot \tau)},   (5)

where cos⟨·,·⟩ denotes the cosine similarity of two vectors, and τ is a learnable scale parameter [5,6].

Test. Following Eqs. (2)–(5), we construct and perform N-way K-shot tasks for the novel classes. Differently, to reduce the noisy attributes in the feature space, we map the features of the query samples and the prototypes to an "optimal" (for the given task) subspace before Eq. (3), as TAFSSL [8] does.
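A compact PyTorch sketch of the workflow in Eqs. (2)–(5) is given below; the encoder, rfe and pr callables are placeholders for the modules described in Sects. 2.3 and 2.4, and the signature is our own assumption rather than the authors' code.

```python
import torch
import torch.nn.functional as F

def episode_probs(encoder, support_x, support_y, query_x, rfe, pr,
                  tau=10.0, n_way=5):
    # Eq. (2): class prototypes as mean support features.
    feats = encoder(support_x)                       # (N*K, d)
    protos = torch.stack([feats[support_y == c].mean(0) for c in range(n_way)])
    # Eq. (3): region-wise enhancement of the query features.
    q_feats = rfe(encoder(query_x), protos)          # (Q, d)
    # Eq. (4): prototype rectification with the enhanced queries.
    protos_hat = pr(q_feats, protos)                 # (N, d)
    # Eq. (5): cosine-similarity logits scaled by tau, softmaxed into P(y=z|x).
    logits = tau * F.cosine_similarity(
        q_feats.unsqueeze(1), protos_hat.unsqueeze(0), dim=-1)
    return logits.softmax(dim=-1)
```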


Fig. 2. Region-wise Foreground Enhancement (RFE) Module.

2.3 Region-Wise Foreground Enhancement

Previous methods reconstruct prototypes mainly by label propagation [8,9] or feature fusion [11]. Few methods rectify prototypes using the spatial similarity between features and prototypes. Inspired by the fine-grained FSC approaches [10,17,19], highlighting the deep local descriptors in a query feature that have a high average similarity to the deep local descriptors within the prototype helps to reduce the differences in foreground objects between the query and support sets. However, purely spatial-wise enhancement of features can destroy the method's resistance to noisy deep local descriptors. In light of this, our proposed module takes two steps to enhance the features of query samples:

Step 1. As shown in Fig. 2, we first find, for each query feature, the prototype that is most similar to it according to cosine similarity. Subsequently, for a query feature f_q ∈ R^{c×h×w} and its most similar prototype p_b ∈ R^{c×h×w}, we reshape them to R^{hw×c}, i.e., f_q = {q_i}_{i=1}^{h×w} and p_b = {b_i}_{i=1}^{h×w}, where q_i and b_i represent the i-th deep local descriptor in f_q and p_b, respectively. Then, we calculate the spatial similarity matrix m_{p|q} between the reshaped p_b and f_q and the spatial correlation matrix m_{q|q} within f_q. The resulting m_{p|q} ∈ R^{hw×hw} and m_{q|q} ∈ R^{hw×hw} are computed by cosine similarity as

m_{p|q}(i, j) = \frac{q_i^{T} b_j}{\|q_i\| \times \|b_j\|}, \qquad m_{q|q}(i, j) = \frac{q_i^{T} q_j}{\|q_i\| \times \|q_j\|}.

Further, a row-by-row mean operation is applied to m_{p|q} to reduce the impact of noisy deep local descriptors on the similarity calculation. We then obtain the spatial-wise attention matrix \hat{m}_{p|q}:

\hat{m}_{p|q}(i) = \mathrm{mean}(m_{p|q}) = \frac{\sum_{j=1}^{h×w} m_{p|q}(i, j)}{h×w}.   (6)
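The Step 1 computation can be sketched for a single query feature map as follows; the tensor shapes and function name are assumptions for illustration.

```python
import torch
import torch.nn.functional as F

def step1_attention(f_q, p_b):
    # f_q, p_b: (c, h, w) query feature map and its most similar prototype.
    c, h, w = f_q.shape
    q = F.normalize(f_q.reshape(c, h * w).t(), dim=1)   # (hw, c) descriptors
    b = F.normalize(p_b.reshape(c, h * w).t(), dim=1)   # (hw, c)
    m_pq = q @ b.t()            # spatial similarity matrix, (hw, hw)
    m_qq = q @ q.t()            # spatial correlation within f_q, (hw, hw)
    # Eq. (6): row-wise mean over prototype locations suppresses noisy
    # deep local descriptors and yields a spatial attention vector.
    attn = m_pq.mean(dim=1)     # (hw,)
    return attn, m_qq
```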


Algorithm 1: Details of Step 2.
Input: α, m_{q|q}, f_q, \hat{m}_{p|q}, h, w
Output: \hat{f}_q
1: Initialize i = 1, j = 1, v = 1, m_r = zeros(h, w);
2: Let pos_{i,j} denote the position in the reshaped f_q of the descriptor at location (i, j) in f_q;
3: Iterate over every location (i, j) in f_q:
4:   if i = 1 and j = 1 then m_r(i, j) = v, v = v + 1
     else if j ≠ 1 and m_{q|q}(pos_{i,j}, pos_{i,j−1}) ≥ α then m_r(i, j) = m_r(i, j − 1)
     else if j = 1 and m_{q|q}(pos_{i,j}, pos_{i−1,j}) ≥ α then m_r(i, j) = m_r(i − 1, j)
     else if i ≠ 1 and m_{q|q}(pos_{i,j}, pos_{i−1,j}) ≥ α then m_r(i, j) = m_r(i − 1, j)
     else m_r(i, j) = v, v = v + 1;
     then calculate \hat{m}_{p|q} using Eq. (7);
5: Calculate \hat{f}_q using Eq. (8);

Step 2. To keep the region consistency of the query feature f_q, we divide f_q into several connected regions according to m_{q|q}. Specifically, suppose the similarity of two adjacent deep local descriptors exceeds a threshold α. In that case, we consider them to belong to the same connected region and assign the same value to the corresponding spatial locations in the area matrix m_r. Notably, α is a learnable parameter. For spatial locations that have the same tag in m_r, we correspondingly average their values in \hat{m}_{p|q}:

\hat{m}_{p|q}(x_i, y_i) = \frac{\sum_{(x_j, y_j) \in V_z} \hat{m}_{p|q}(x_j, y_j)}{|V_z|},   (7)

where V_z represents the set of all spatial locations in m_r with value z, and (x_i, y_i) ∈ V_z. As shown in Fig. 2, we employ the RFE module to calculate the region-wise attention matrix \hat{m}_{p|q}, which is used to enhance the query feature f_q with its most similar prototype p_b. Formally, the enhanced feature \hat{f}_q is obtained by a spatial-wise multiplication:

\hat{f}_q = \hat{m}_{p|q} \cdot f_q.   (8)

We detail Step 2 in Algorithm 1.
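A sketch of Step 2 under the reading of Algorithm 1 given above (the four branches collapse into left/up merging); treating α as a plain float and operating on a single feature map are simplifying assumptions.

```python
import torch

def step2_enhance(f_q, attn, m_qq, alpha=0.25):
    # f_q: (c, h, w); attn: (h*w,) attention from Step 1; m_qq: (h*w, h*w).
    c, h, w = f_q.shape
    region = torch.zeros(h, w, dtype=torch.long)
    next_tag = 1
    for i in range(h):
        for j in range(w):
            pos, left, up = i * w + j, i * w + (j - 1), (i - 1) * w + j
            if i == 0 and j == 0:
                region[i, j] = next_tag; next_tag += 1
            elif j > 0 and m_qq[pos, left] >= alpha:
                region[i, j] = region[i, j - 1]     # merge with left neighbour
            elif i > 0 and m_qq[pos, up] >= alpha:
                region[i, j] = region[i - 1, j]     # merge with upper neighbour
            else:
                region[i, j] = next_tag; next_tag += 1
    # Eq. (7): average the attention values inside each connected region.
    attn = attn.clone().reshape(h, w)
    for tag in region.unique():
        mask = region == tag
        attn[mask] = attn[mask].mean()
    # Eq. (8): spatial-wise multiplication enhances the query feature.
    return f_q * attn.reshape(1, h, w)
```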

2.4 Prototype Rectification

To reduce the intra-class bias, BD-CSPN uses a pseudo-labeling strategy to rectify prototypes, which assigns labels and computes relation weights based on the prediction scores [12]. However, the prediction score alone does not reflect the contribution of a pseudo-labeled query feature to prototype rectification as well as the difference between prediction scores does: a query feature in the class boundary region is not very useful for prototype rectification, yet it may have high similarity to several prototypes, and the differences between those similarities will be slight.


In this paper, we develop a prototype rectification (PR) module to reduce the intra-class bias. It calculates pseudo-prototypes using the similarity difference between the closest and second closest prototypes to the query features, and takes the mean of the pseudo-prototype and the base prototype as the new prototype. Specifically, we assign each query feature a pseudo-label that is identical to its nearest prototype based on cosine similarity, and obtain a pseudo support set S'. Since some pseudo-labeled features located at class boundaries may be misclassified, we use the difference between the maximum and sub-maximum prediction scores of each query feature as the basis for the weight computation. The pseudo-prototype of class n is computed as

p'_n = \sum_{\hat{f}_q \in S'_n} t_{n,q} \times \hat{f}_q,   (9)

where S'_n denotes the set of query features pseudo-labeled with class n, and t_{n,q} is the weight indicating the relation between the pseudo-labeled query feature \hat{f}_q and the prototype of class n. Let COS_{\hat{f}_q} denote the set of cosine similarities between each prototype and the query feature \hat{f}_q; we then compute the weight as

t_{n,q} = \frac{e^{\varepsilon(\max(COS_{\hat{f}_q}) - \mathrm{submax}(COS_{\hat{f}_q}))}}{\sum_{\hat{f}_i \in S'_n} e^{\varepsilon(\max(COS_{\hat{f}_i}) - \mathrm{submax}(COS_{\hat{f}_i}))}},   (10)

where ε is a scalar parameter. We then calculate the new prototypes by an average operation, that is,

\hat{p}_n = \frac{p_n + p'_n}{2}.   (11)

In the pseudo-prototype computation, the higher the score difference of a query feature, the higher its proportion. In the new prototype computation, the proportion of each labeled support feature (1/(2K)) is theoretically larger than that of each query feature with a pseudo-label (≈ 1/(2(K + |Q|/N))). The rectified prototypes will be closer to the expectation than the basic prototypes.
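A minimal sketch of the rectification in Eqs. (9)–(11); the tensor layout and the per-class loop are our own choices for illustration, not the authors' implementation.

```python
import torch
import torch.nn.functional as F

def rectify_prototypes(q_feats, protos, eps=10.0):
    # q_feats: (Q, d) enhanced query features; protos: (N, d) base prototypes.
    cos = F.cosine_similarity(q_feats.unsqueeze(1), protos.unsqueeze(0), dim=-1)
    pseudo = cos.argmax(dim=1)                         # pseudo-labels
    top2 = cos.topk(2, dim=1).values
    margin = top2[:, 0] - top2[:, 1]                   # max - submax score gap
    new_protos = protos.clone()
    for n in range(protos.size(0)):
        idx = (pseudo == n).nonzero(as_tuple=True)[0]
        if idx.numel() == 0:
            continue                                   # no pseudo-labelled query
        # Eq. (10): softmax over eps-scaled score differences within class n.
        w = torch.softmax(eps * margin[idx], dim=0)
        # Eq. (9): weighted pseudo-prototype; Eq. (11): average with base one.
        pseudo_proto = (w.unsqueeze(1) * q_feats[idx]).sum(0)
        new_protos[n] = (protos[n] + pseudo_proto) / 2
    return new_protos
```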

3 Experiments

3.1 Experimental Setup

Datasets. We conduct FSC experiments on four datasets, including: mini ImageNet [3], tiered ImageNet [21], CUB [22], and Stanford Dogs [23]. The input images of all datasets are resized to 84×84 [3,25].


Table 1. Comparison to prior works on miniImageNet and tieredImageNet. Average accuracy (in %) is reported. The best results are reported in bold font. ∗ denotes the results we reproduced.

Method             | Backbone   | miniImageNet 5-way 1-shot | miniImageNet 5-way 5-shot | tieredImageNet 5-way 1-shot | tieredImageNet 5-way 5-shot
ProtoNet [4]       | ConvNet-64 | 60.37 ± 0.83 | 78.02 ± 0.57 | 65.65 ± 0.92 | 83.40 ± 0.65
Meta-Baseline [6]  | ResNet-12  | 63.17 ± 0.23 | 79.26 ± 0.17 | 68.62 ± 0.27 | 83.29 ± 0.18
BD-CSPN [9]        | ResNet-12  | 65.94 ± 0.93 | 81.89 ± 0.60 | 78.74 ± 0.95 | 86.92 ± 0.63
ProtoComNet [11]   | ResNet-12  | 73.13 ± 0.85 | 82.06 ± 0.54 | 81.04 ± 0.89 | 87.42 ± 0.57
EPNet [14]         | ResNet-12  | 66.50 ± 0.89 | 81.06 ± 0.60 | 76.53 ± 0.87 | 87.32 ± 0.64
RGTransformer [15] | ResNet-12  | 71.41 ± 0.91 | 83.46 ± 0.57 | 78.91 ± 0.99 | 86.99 ± 0.64
CGC [16]           | ResNet-12  | 66.73 ± 0.22 | 80.57 ± 0.14 | 77.19 ± 0.24 | 86.18 ± 0.15
TAFSSL∗ [8]        | ResNet-18  | 73.17 ± 0.27 | 82.14 ± 0.24 | 81.87 ± 0.26 | 87.43 ± 0.23
LaplacianShot [13] | ResNet-18  | 72.11 ± 0.19 | 82.31 ± 0.14 | 78.98 ± 0.21 | 86.39 ± 0.16
ours               | ResNet-12  | 73.21 ± 0.84 | 83.53 ± 0.87 | 81.09 ± 0.81 | 87.86 ± 0.92

3.2 Implementation Details

We take the 12-layer ResNet [24] as the backbone network of our method. In the pre-training phase, we train 200 epochs on D_base using an SGD optimizer with a momentum of 0.9. We set the batch size to 128 and the learning rate to 0.001. Moreover, the weight decay is 0.0005, and standard data augmentation methods like random resized crops are also applied. In the meta-learning phase, we train 20 epochs and choose SGD with a momentum of 0.9 as the optimizer. The learning rate is 0.001, which decays at epochs 10 and 15 with a decay factor of 0.1. The learnable parameters α and τ are initialized to 0.25 and 10, respectively. The fixed scalar parameter ε is set to 10. In addition, for the 1-shot and 5-shot settings, we set the number of principal components retained in the PCA to 4 and 10, respectively.

Evaluation Protocol. To evaluate the performance of our proposed method, we sample 10,000 N-way K-shot classification tasks from D_novel. We focus on the standard 5-way 1-shot and 5-way 5-shot task settings. The average accuracy of these few-shot tasks is reported with a 95% confidence interval.

Table 2. Comparison to prior works on CUB and Stanford Dogs. Average accuracy (in %) is reported. The best results are reported in bold font.

Method     | Backbone  | CUB 5-way 1-shot | CUB 5-way 5-shot | Stanford Dogs 5-way 1-shot | Stanford Dogs 5-way 5-shot
OLSA [17]  | ResNet-18 | 77.77 ± 0.44 | 89.87 ± 0.24 | 64.15 ± 0.49 | 78.28 ± 0.32
BSNet [18] | ResNet-18 | 73.48 ± 0.92 | 83.84 ± 0.59 | 61.95 ± 0.97 | 79.62 ± 0.63
TOAN [19]  | ResNet-12 | 66.10 ± 0.86 | 82.27 ± 0.60 | 49.77 ± 0.86 | 69.29 ± 0.70
CAN [20]   | ResNet-12 | 76.98 ± 0.48 | 87.77 ± 0.30 | 64.73 ± 0.52 | 77.93 ± 0.35
BSFA [10]  | ResNet-12 | 82.27 ± 0.46 | 90.76 ± 0.26 | 69.58 ± 0.50 | 82.59 ± 0.33
ours       | ResNet-12 | 84.62 ± 0.89 | 90.08 ± 0.97 | 71.02 ± 0.84 | 83.44 ± 0.63

3.3 Results

Results on General Few-Shot Classification. Following the standard setup [3,6], we conduct general FSC experiments on the miniImageNet and tieredImageNet datasets. The experimental results are shown in Table 1. On miniImageNet, the performance of our method is higher than Meta-Baseline [6] by 10.04% and 4.27% for the 5-way 1-shot and 5-way 5-shot settings, respectively. Our method achieves the best performance compared to the other methods, with 73.21% and 83.53% accuracy in the 5-way 1-shot and 5-way 5-shot settings. On tieredImageNet, the performance of our method is higher than Meta-Baseline [6] by 12.27% and 4.57% for the 5-way 1-shot and 5-way 5-shot settings, respectively. Specifically, for the 1-shot setting, our method is second only to TAFSSL, achieving 81.09% accuracy. For the 5-shot setting, our method outperforms TAFSSL by 0.43%. The performance improvement of our method in the 5-shot setting is lower than that in the 1-shot setting. We attribute this to the increase in the number of labeled examples in the 5-shot setting, where the computed prototypes are already closer to expectation, leaving less room for improvement to our method.

Results on Fine-Grained Datasets. We also conduct fine-grained FSC experiments on the CUB and Stanford Dogs datasets. The performance of our proposed method is shown in Table 2. On CUB, our method achieves competitive or the best performance for the 5-way 1-shot and 5-way 5-shot settings. Specifically, our approach achieves 84.62% accuracy in the 1-shot setting, 2.35% higher than the BSFA method. For the 5-shot setting, our method achieves 90.08% accuracy, second only to BSFA. On Stanford Dogs, our method outperforms BSFA by 3.3% and 0.45% for the 5-way 1-shot and 5-way 5-shot tasks, respectively, achieving 71.02% and 83.44% accuracy.

3.4 Ablation Studies

Evaluating the Effect of Each Component. In Table 3, we disassemble our method into four parts: the backbone network (B), the region-wise foreground enhancement module (Ms), the prototype rectification module (Mr), and the PCA method (Mp). The first row shows the performance of B, and the second row shows the benefit of Ms to B. The third and fourth rows add Mp and Mr to the model in the second row, respectively. The fifth row adds both Mp and Mr to the second row's model. From the third and fourth rows, we find that using RFE or PR alone can improve the model's performance. And as the fifth row shows, the model performance can be further enhanced when we use both modules.

Table 3. Method disassembly. Average accuracy (%) is reported. The backbone network (B) is ResNet-12. The best results are reported in bold font.

B | Mp | Ms | Mr | miniImageNet 5-way 1-shot | 5-way 5-shot
✓ | ×  | ×  | ×  | 63.17 ± 0.23 | 79.26 ± 0.17
✓ | ×  | ✓  | ×  | 65.41 ± 1.67 | 78.85 ± 1.05
✓ | ✓  | ✓  | ×  | 67.19 ± 0.76 | 80.52 ± 0.79
✓ | ×  | ✓  | ✓  | 72.33 ± 0.92 | 82.94 ± 0.88
✓ | ✓  | ✓  | ✓  | 73.21 ± 0.84 | 83.53 ± 0.87

Is the Region-Wise Foreground Enhancement Module More Effective? We conduct a comparison experiment to demonstrate the superiority of the region-wise foreground enhancement (RFE) module. Specifically, we replace our RFE module with the foreground object alignment (FOA) module proposed in BSFA [10] while keeping other conditions constant. We show the experimental results in Table 4, where FOA+PR denotes our method with RFE replaced by FOA, and RFE+PR denotes our method. The results demonstrate that our RFE module performs better than the FOA module under the same conditions.

Table 4. Comparison of regional enhancement methods. Average accuracy (in %) is reported. The best results are reported in bold font.

Method | miniImageNet 5-way 1-shot | 5-way 5-shot
FOA+PR | 72.03 ± 1.37 | 82.86 ± 1.13
RFE+PR | 73.21 ± 0.84 | 83.53 ± 0.87

Is Our Prototype Rectification Module More Effective? We conduct a comparison experiment to demonstrate the effectiveness of our proposed prototype rectification (PR) module. Specifically, we replace our PR module with the intra-class bias diminishing method (BDintra) proposed in BD-CSPN [9] while keeping other conditions similar. The experimental results are shown in Table 5, where RFE+BDintra denotes our method with PR replaced by BDintra, and RFE+PR denotes our method. Our PR module outperforms the BDintra method under the same conditions.


Table 5. Comparison of prototype rectification methods. Average accuracy (in %) is reported. The best results are reported in bold font.

Method      | miniImageNet 5-way 1-shot | 5-way 5-shot
RFE+BDintra | 71.95 ± 0.99 | 82.71 ± 0.83
RFE+PR      | 73.21 ± 0.84 | 83.53 ± 0.87

4 Conclusion

The critical challenge of FSC is improving the prototypes’ representation ability. While many approaches explore using prior knowledge to complete prototypes, we propose a prototype bias reduction method based on spatial similarity, which consists of a region-wise foreground enhancement (RFE) module for reconstructing query features using spatial similarity and a prototype rectification (PR) module for rectifying prototypes using the enhanced query features. Experimental results show the superior performance of our method, and the ablation studies demonstrate the proposed modules’ effectiveness.

References 1. Lake, B.M., Salakhutdinov, R., Tenenbaum, J.B.: Human-level concept learning through probabilistic program induction. Science 350(6266), 1332–1338 (2015) 2. Fei-Fei, L., Fergus, R., Perona, P.: One-shot learning of object categories. IEEE Trans. Pattern Anal. Mach. Intell. 28(4), 594–611 (2006) 3. Vinyals, O., Blundell, C., Lillicrap, T., Wierstra, D., et al.: Matching networks for one shot learning. In: Advances in Neural Information Processing Systems, vol. 29 (2016) 4. Snell, J., Swersky, K., Zemel, R.: Prototypical networks for few-shot learning. In: Advances in Neural Information Processing Systems, vol. 30 (2017) 5. Oreshkin, B., Rodr´ıguez L´ opez, P., Lacoste, A.: TADAM: task dependent adaptive metric for improved few-shot learning. In: Advances in Neural Information Processing Systems, vol. 31 (2018) 6. Chen, Y., Liu, Z., Xu, H., Darrell, T., Wang, X.: Meta-baseline: exploring simple meta-learning for few-shot learning. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 9062–9071 (2021) 7. Sung, F., Yang, Y., Zhang, L., Xiang, T., Torr, P.H., Hospedales, T.M.: Learning to compare: relation network for few-shot learning. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 1199–1208 (2018) 8. Lichtenstein, M., Sattigeri, P., Feris, R., Giryes, R., Karlinsky, L.: TAFSSL: taskadaptive feature sub-space learning for few-shot classification. In: Vedaldi, A., Bischof, H., Brox, T., Frahm, J.-M. (eds.) ECCV 2020. LNCS, vol. 12352, pp. 522–539. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-58571-6 31 9. Liu, J., Song, L., Qin, Y.: Prototype rectification for few-shot learning. In: Vedaldi, A., Bischof, H., Brox, T., Frahm, J.-M. (eds.) ECCV 2020. LNCS, vol. 12346, pp. 741–756. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-58452-8 43


10. Zha, Z., Tang, H., Sun, Y., Tang, J.: Boosting few-shot fine-grained recognition with background suppression and foreground alignment. IEEE Trans. Circ. Syst. Video Technol. (2023) 11. Zhang, B., Li, X., Ye, Y., Huang, Z., Zhang, L.: Prototype completion with primitive knowledge for few-shot learning. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 3754–3762 (2021) 12. Li, X., et al.: Learning to self-train for semi-supervised few-shot classification. In: Advances in Neural Information Processing Systems, vol. 32 (2019) 13. Ziko, I., Dolz, J., Granger, E., Ayed, I.B.: Laplacian regularized few-shot learning. In: International Conference on Machine Learning. PMLR, pp. 11660–11670 (2020) 14. Rodr´ıguez, P., Laradji, I., Drouin, A., Lacoste, A.: Embedding propagation: smoother manifold for few-shot classification. In: Vedaldi, A., Bischof, H., Brox, T., Frahm, J.-M. (eds.) ECCV 2020. LNCS, vol. 12371, pp. 121–138. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-58574-7 8 15. Jiang, B., Zhao, K., Tang, J.: Rgtransformer: region-graph transformer for image representation and few-shot classification. IEEE Signal Process. Lett. 29, 792–796 (2022) 16. Gao, Z., Wu, Y., Jia, Y., Harandi, M.: Curvature generation in curved spaces for few-shot learning. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 8691–8700 (2021) 17. Wu, Y., Zhang, B., Yu, G., Zhang, W., Wang, B., Chen, T., Fan, J.: Object-aware long-short-range spatial alignment for few-shot fine-grained image classification. arXiv preprint arXiv:2108.13098 (2021) 18. Li, X., Wu, J., Sun, Z., Ma, Z., Cao, J., Xue, J.-H.: BSNet: bi-similarity network for few-shot fine-grained image classification. IEEE Trans. Image Process. 30, 1318– 1331 (2020) 19. Huang, H., Zhang, J., Yu, L., Zhang, J., Wu, Q., Xu, C.: TOAN: target-oriented alignment network for fine-grained image categorization with few labeled samples. IEEE Trans. Circuits Syst. Video Technol. 32(2), 853–866 (2021) 20. Hou, R., Chang, H., Ma, B., Shan, S., Chen, X.: Cross attention network for fewshot classification. In: Advances in Neural Information Processing Systems, vol. 32 (2019) 21. Ren, M., et al.: Meta-learning for semi-supervised few-shot classification. arXiv preprint arXiv:1803.00676 (2018) 22. Wah, C., Branson, S., Welinder, P., Perona, P., Belongie, S.: The caltech-UCSD birds-200-2011 dataset (2011) 23. Khosla, A., Jayadevaprakash, N., Yao, B., Li, F.-F.: Novel dataset for fine-grained image categorization: Stanford dogs. In: Proceedings of the CVPR Workshop on Fine-Grained Visual Categorization (FGVC), vol. 2, no. 1. Citeseer (2011) 24. He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 770–778 (2016) 25. Hilliard, N., et al.: Few-Shot learning with metric-agnostic conditional embeddings. arXiv preprint arXiv:1802.04376 (2018)

Rotation Augmented Distillation for Exemplar-Free Class Incremental Learning with Detailed Analysis Xiuwei Chen1 and Xiaobin Chang1,2,3(B) 1

School of Artificial Intelligence, Sun Yat-sen University, Guangzhou, China [email protected],[email protected] 2 Guangdong Key Laboratory of Big Data Analysis and Processing, Guangzhou 510006, People’s Republic of China 3 Key Laboratory of Machine Intelligence and Advanced Computing, Ministry of Education, Guangzhou, China

Abstract. Class incremental learning (CIL) aims to recognize both the old and new classes along the increment tasks. Deep neural networks in CIL suffer from catastrophic forgetting and some approaches rely on saving exemplars from previous tasks, known as the exemplar-based setting, to alleviate this problem. On the contrary, this paper focuses on the Exemplar-Free setting with no old class sample preserved. Balancing the plasticity and stability in deep feature learning with only supervision from new classes is more challenging. Most existing Exemplar-Free CIL methods report the overall performance only and lack further analysis. In this work, different methods are examined with complementary metrics in greater detail. Moreover, we propose a simple CIL method, Rotation Augmented Distillation (RAD), which achieves one of the top-tier performances under the Exemplar-Free setting. Detailed analysis shows our RAD benefits from the superior balance between plasticity and stability. Finally, more challenging exemplar-free settings with fewer initial classes are undertaken for further demonstrations and comparisons among the state-of-the-art methods.

Keywords: Class Incremental Learning · Catastrophic Forgetting · Exemplar Free

1 Introduction

AI agents, e.g., deep neural networks (DNNs), deployed in the real world face an ever-changing environment, i.e., with new concepts and categories continually emerging [3–5]. Therefore, class incremental learning (CIL) [6–8] has attracted much attention as it aims to equip deep models with the capacity to continuously handle new categories. However, learning to discriminate sets of disjoint classes sequentially with deep models is challenging, as they are prone to entirely forgetting the previous knowledge, thus resulting in severe performance degradation of old tasks. It is known as catastrophic forgetting [9]. To maintain the old


Fig. 1. Incremental Accuracy of TinyImageNet and CIFAR100 with 10 incremental steps. The top-1 accuracy (%) after learning each task is shown. Existing SOTA methods achieve similar performance. The proposed Rotation Augmented Distillation (RAD) achieves the SOTA performance as well. The black dot in the upper right corner indicates the upper bound that the model trained on all the data. The black dotted line indicates the lower bound, a simple finetune method.

knowledge, a small number of exemplars from the previous tasks can be stored in the memory buffer, known as the exemplar-based methods [6,8,10]. However, exemplars from previous tasks can be unavailable due to constraints, e.g., user privacy or device limitations. To this end, this paper focuses on exemplar-free class incremental learning (EFCIL) setting [11–13,15,17]. Model distillation [16] plays an important role in preserving past knowledge by existing methods [11–14]. Both the feature extractor and new classifier are learned with the classification loss on the new data and knowledge distillation from the previous model. The variations of deep features during the new task training are penalized by distillation loss. Other methods [15,18] learn a feature extractor in the initial task only and then fixed it for increment tasks. New classifiers are learned on the extracted features of new tasks while the old class statistics, e.g., prototype vectors and covariance matrices, are also computed and preserved. The state-of-the-art (SOTA) methods [11– 13,15] follow either the two paradigms mentioned above. As shown in Fig. 1, these SOTA methods report similar overall performance, i.e., average incremental accuracy. However, further detailed analysis is not conducted within these methods. In this work, we further incorporate the measures of Forgetting and Intransigence as in [34]. The performance of SOTA EFCIL methods [11,12,15] under various settings is reproduced for more detailed comparisons and analysis. As EFCIL is a challenging task, the existing experimental setting is with half of the dataset in the initial task. Therefore, a model trained on such an initial task may be strong enough for the following incremental tasks. A baseline method, named Feat∗ , is proposed for such demonstration and comparison. Specifically, a deep feature extractor is trained with the initial data and frozen. NME classifier


is then used to discriminate different classes across incremental steps. However, Feat∗ achieves SOTA-level performance under the existing setting. In this work, we propose a simple yet effective method, Rotation Augmented Distillation (RAD), to enable the continuous training of the entire deep model along the increment tasks. On the one hand, the data augmentation provides plasticity by introducing varied training samples during training. On the other hand, the knowledge distillation provides stability by alleviating the forgetting of past knowledge. To alleviate the bias of the strong initial model, a more challenging setting with much less data in the initial task is introduced. Nevertheless, our RAD still achieves one of the best results among the SOTA methods due to its superior balance of stability and plasticity, as revealed by the detailed analysis. The main contributions of this work are summarized as follows:

1. We provide a more detailed analysis of the EFCIL methods, rather than the overall results only as in existing works. Specifically, the complementary metrics, forgetting and intransigence, are also used to evaluate SOTA methods. Detailed comparisons and analyses are thus enabled.
2. A simple and intuitive method, Rotation Augmented Distillation (RAD), is designed to better alleviate the plasticity-stability dilemma. Its effectiveness is demonstrated by its superior performance under various EFCIL settings.
3. A new challenging setting is provided to alleviate the bias brought by the strong initial model. Detailed comparisons and analyses are also conducted.

2 Related Work

Class incremental learning aims for a well-performing deep learner that can sequentially learn from the streaming data. Its main challenge is Catastrophic Forgetting [9] which depicts the deep models prone to entirely forgetting the knowledge learned from previous tasks. Different strategies have been proposed to handle this issue. Regularization strategies such as elastic weight consolidation (EWC) [19] use different metrics to identify and penalize the changes of important parameters of the original network when learning a new task. Rehearsal strategies [1,8,26,27] are widely adopted as well. The model is permitted to access data from previous tasks partially by maintaining a relatively small memory, enabling it to directly recall the knowledge from the previous data and mitigate forgetting. iCaRL [8] stores a subset of samples per class by selecting the good approximations to class means in the feature space. However, access to the exemplars of old tasks is not guaranteed and could be limited to data security and privacy constraints [25]. The exemplar-free class incremental learning (EFCIL) [2,11–13,15,17,18] is a challenging setting, where no data sample of old tasks can be directly stored. Existing EFCIL methods either use regularization to update the deep model for each incremental step [19,20] or adapt distillation to preserve past knowledge by penalizing variations for past classes during model updates [11–13]. Moreover, the prototypes of old tasks can be exploited in conjunction with distillation to


improve overall performance, as shown in [11–13]. Specifically, PASS [12] proposes prototype augmentation in the deep feature space to improve the discrimination of classes learned in different increment tasks. A prototype selection mechanism is proposed in SSRE [11] to better enhance the discrimination between old and new classes. Feature generation for past classes is introduced in IL2A [13] by leveraging information about the class distribution. However, IL2A is inefficient to scale up for a large number of classes since a covariance matrix needs to be stored for each class. Inspired by the transfer learning scheme [28], independent shallow classifiers can be learned based on a fixed feature extractor, as in DeeSIL [18]. A pseudo-features generator is further exploited in FeTrIL [15] to create representations of past classes to improve classifier learning across tasks. The proposed Rotation Augmented Distillation (RAD) method is end-to-end trainable on the full model and simply based on a distillation strategy [16] and rotation data augmentation [21]. Detailed comparisons show that RAD benefits from the superior balance between stability and plasticity in EFCIL model training.

3 Methodology

3.1 Preliminary

In class incremental learning, a model is learned from an initial task (0) and a sequence of T incremental tasks, where each task t has a set of n_t different classes C_t = {c_{t,1}, ..., c_{t,n_t}}. The classes in different tasks are disjoint, C_i ∩ C_j = ∅ for i ≠ j, i, j ∈ {0, ..., T}. The training data of task t is denoted as D_t. D_t consists of data tuples (x, y), where x is an image and y is its corresponding ground-truth class label. A deep classification model Φ consists of two modules, the feature extractor F and the classifier head h. F is parameterized with θ, and the feature representation of an image x is obtained via F(x) ∈ R^d. The classifier head h is parameterized with ω. Under the exemplar-free class incremental learning (EFCIL) setting, a model can only access D_t when training on task t. At the initial task, the classification model Φ_0 is trained under the full supervision of D_0, resulting in F_0 and h_0. At an incremental task t, t ∈ {1, ..., T}, Φ_t is partially initialized with θ_{t−1} and trained with D_t. The corresponding feature extractor F_t and classifier head h_t are learned. The overall classifier H_t is an aggregation of the task-specific classifiers, H_t = {h_0, h_1, ..., h_t}. During testing, data samples are drawn from all classes observed so far with balanced distributions.

3.2 A Baseline Method: Feat∗

The feature extractor learned at the initial task is frozen and denoted as F∗0 . The corresponding classifier h0 is abandoned. F∗0 is used as the feature extractor across all tasks and no further training on the deep classifier is needed. This method is denoted as Feat∗ . F∗0 forwards all training samples in Dt , t ∈ {0, ..., T }


Fig. 2. Illustrations of the proposed Rotation Augmented Distillation (RAD) method for exemplar-free class incremental learning at task t. ∗ indicates the corresponding module is frozen at training.

once for the class-specific mean feature vectors, and such prototypes are preserved. The NME classifier can then be applied to the prototypes seen so far during testing. Different from Feat∗, a holistic classifier H_t for the classes seen up to task t is trained in FeTrIL based on pseudo-feature generation. During the testing of FeTrIL, H_t rather than the NME classifier is exploited.
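A minimal sketch of the Feat∗/NME idea: a frozen extractor, class-mean prototypes accumulated per task, and nearest-prototype prediction. The use of cosine similarity and the small class API are assumptions for illustration only.

```python
import torch
import torch.nn.functional as F

class NMEClassifier:
    """Nearest-mean classifier over class prototypes from a frozen extractor."""

    def __init__(self, extractor):
        self.extractor = extractor.eval()   # F0*, never updated after task 0
        self.prototypes = {}                # class id -> mean feature vector

    @torch.no_grad()
    def add_task(self, images, labels):
        # Forward the new task data once and store class-mean prototypes.
        feats = self.extractor(images)
        for c in labels.unique().tolist():
            self.prototypes[c] = feats[labels == c].mean(0)

    @torch.no_grad()
    def predict(self, images):
        feats = F.normalize(self.extractor(images), dim=1)
        classes = sorted(self.prototypes)
        protos = F.normalize(
            torch.stack([self.prototypes[c] for c in classes]), dim=1)
        return torch.tensor(classes)[(feats @ protos.t()).argmax(dim=1)]
```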

3.3 Rotation Augmented Distillation

The deep classification model Φ_t can be learned end-to-end with two simple techniques, rotation data augmentation and knowledge distillation, as illustrated in Fig. 2. Specifically, for each class, we rotate each training sample x by 90, 180 and 270 degrees [21] and obtain the augmented sample x':

x' = \mathrm{rotate}(x, \delta), \quad \delta \in \{90, 180, 270\}.   (1)

Each augmented sample x', subject to its rotation degree, is assigned a new label y', extending the original K-class problem to a 4K-class problem. The augmented dataset of task t is denoted as D'_t. Both the feature extractor F_t and the classifier H_t of Φ_t are jointly optimized in RAD. Based on the dataset D_t and the augmented one D'_t, the cross-entropy loss is computed as

L_c = \sum_{(x,y)\in D_t} \mathrm{CE}(\Phi_t(x), y) + \sum_{(x',y')\in D'_t} \mathrm{CE}(\Phi_t(x'), y').   (2)

To alleviate the mismatch between the saved old prototypes and the feature extractor, knowledge distillation [16] is employed to regularize the learning of the feature extractor. Specifically, we restrain the feature extractor by matching the features of new data extracted by the current model with those of the initial model F^*_0:

L_{distil} = \sum_{(x,y)\in D_t} KL(F_t(x;\theta_t) \,\|\, F^*_0(x;\theta_0)) + \sum_{(x',y')\in D'_t} KL(F_t(x';\theta_t) \,\|\, F^*_0(x';\theta_0)).   (3)

The total loss of RAD comprises two terms,

L_{all} = \alpha L_c + \beta L_{distil},   (4)

where α and β are balancing hyper-parameters; both are set to 1 across all experiments. The learning objective of RAD is \min_{\theta_t, \omega_t} L_{all}.
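A sketch of one RAD training step following Eqs. (1)–(4). The assumption that the model returns (features, logits), the 4K-way relabeling convention y + r·K, and the exact form of the KL feature matching are illustrative choices, not the authors' released code.

```python
import torch
import torch.nn.functional as F

def rad_step(model, frozen_init, images, labels, num_classes,
             alpha=1.0, beta=1.0):
    # Eq. (1): rotate each sample by 90/180/270 degrees; rotation r relabels
    # class y as y + r * num_classes, turning K classes into 4K classes.
    xs, ys = [images], [labels]
    for r in (1, 2, 3):
        xs.append(torch.rot90(images, r, dims=(2, 3)))
        ys.append(labels + r * num_classes)
    x, y = torch.cat(xs), torch.cat(ys)

    feats, logits = model(x)              # current Ft and classifier Ht
    with torch.no_grad():
        init_feats, _ = frozen_init(x)    # frozen initial model F0*
    # Eq. (2): cross-entropy on the original plus rotated samples.
    ce = F.cross_entropy(logits, y)
    # Eq. (3): KL-based feature distillation against the initial extractor
    # (one common way to instantiate the KL term; treated as a sketch here).
    kd = F.kl_div(F.log_softmax(feats, dim=1),
                  F.softmax(init_feats, dim=1), reduction="batchmean")
    # Eq. (4): total loss with balancing weights alpha and beta.
    return alpha * ce + beta * kd
```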

4 Experiments

Datasets. The exemplar-free class incremental learning (EFCIL) is conducted on three datasets. Two large-scale datasets are TinyImageNet [29] and ImageNet100 [30]. The medium-size dataset used is CIFAR100 [31]. Protocols. Two EFCIL settings are followed in our experiments. (1) Conventional EFCIL setting. The conventional setting usually with half of the data as initial tasks. TinyImageNet contains images from 200 classes in total and the initial task (0) includes half of the dataset, i.e., images from 100 classes, denoted as B100. The data of the remaining classes are split into T tasks. For example, T =5 corresponds to 5 incremental steps and each step contains the data of 20 classes. Therefore, this EFCIL setting of TinyImageNet is denoted as B100 5 steps. B100 10 steps and B100 20 steps are the other two EFCIL protocols of TinyImageNet. Both CIFAR100 and ImageNet100 are with 100 classes. They have the following three EFCIL protocols: B50 5 steps, B50 10 steps and B40 20 steps. (2) Challenging EFCIL setting. Training a method with half of the data can result in a strong initial model, which can introduce a non-negligible bias towards the following incremental learning performance. Therefore, a more challenging EFCIL setting is proposed by reducing the data in the initial task to half or even less. Specifically, the challenging EFCIL settings of TinyImageNet are B50 5 steps, B50 10 steps and B50 25 steps. Implementation Details. The proposed Rotation Augmented Distillation (RAD) and the reproduced SOTA EFCIL methods [11,12,15] are implemented with PyTorch [32] and trained and tested on NVIDIA GTX 3080Ti GPU. Other results are from [15]. The initial model is trained for 200 epochs, and the learning rate is 0.1 and gradually reduces to zero with a cosine annealing scheduler. For incremental tasks, we train models for 30 epochs, and the initial learning rate is 0.001 with a cosine annealing scheduler. For a fair comparison, we adopt the default ConvNet backbone: an 18-layer ResNet [33]. The reproduced results of SOTA methods are with the same augmentation and assigned to incremental tasks using the same random seed as ours. Evaluation Metrics. Three complementary metrics are used throughout the experiments. Overall performance is typically evaluated by average incremental accuracy [8]. After each batch of classes, the evaluation result is the classification accuracy curve. If a single number is preferable, the average of these accuracies is reported as average incremental accuracy. Forgetting [34] is


Table 1. Results of different EFCIL methods under the conventional setting. The overall performance, average incremental accuracy (%), is reported. Best results in red, second best in blue. Methods

TinyImageNet ImageNet100 CIFAR100 5 steps 10 steps 20 steps 5 steps 10 steps 20 steps 5 steps 10 steps 20 steps

Finetune EWC [19] LwF-MC [8] DeeSIL [18] LUCIR [24] MUC [22] SDC [17] ABD [23] IL2A [13] PASS [12] SSRE [11] FeTrIL [15]

28.8 18.8 29.1 49.8 41.7 32.6 47.3 49.6 50.4 54.8

24.0 15.8 23.1 43.9 28.1 26.6 44.7 47.3 48.9 53.1

21.6 12.4 17.4 34.1 18.9 21.9 40.0 42.1 48.2 52.2

31.5 67.9 56.8 64.4 65.4 72.2

25.7 20.4 31.2 60.1 41.4 35.1 61.2 61.8 62.1 71.2

20.2 50.5 28.5 51.3 58.8 67.1

27.8 24.5 45.9 60.0 51.2 49.4 56.8 63.8 66.0 63.5 65.9 66.3

22.4 21.2 27.4 50.6 41.1 30.2 57.0 62.5 60.3 61.8 65.0 65.2

19.5 15.9 20.1 38.1 25.2 21.3 58.9 57.4 57.9 58.1 61.7 61.5

Feat∗ RAD

53.3 55.9

53.0 55.6

52.8 55.2

70.1 72.4

69.9 71.8

65.1 67.4

63.9 66.5

63.6 65.3

59.8 61.9

defined to estimate the forgetting of previous tasks. The forgetting measure for the t-th task after the model has been incrementally trained up to task k is

f_t^k = \max_{l \in \{0,\dots,k-1\}} (a_{l,t} - a_{k,t}), \quad \forall t < k,   (5)

where a_{m,n} is the accuracy on task n after training task m. The average forgetting at the k-th task is then defined as F_k = \frac{1}{k}\sum_{i=0}^{k-1} f_i^k. A lower F_k implies less forgetting on previous tasks. Intransigence [34] measures the inability of a model to learn new tasks. We train a reference model with the dataset \cup_{t=0}^{k} D_t and measure its accuracy on the held-out set of the k-th task, denoted as a_k^-. The intransigence for the k-th task is

I_k = a_k^- - a_{k,k},   (6)

where a_{k,k} denotes the accuracy on the k-th task when trained up to task k in an incremental manner. The lower the I_k, the better the model.
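Both metrics can be computed directly from the matrix of task accuracies; a small sketch, assuming acc is indexed as a[m, n] and ref_acc holds the jointly trained reference accuracies:

```python
import numpy as np

def forgetting_and_intransigence(acc, ref_acc, k):
    # acc[m, n]: accuracy on task n after training task m (a_{m,n}).
    # ref_acc[n]: accuracy of the jointly trained reference model on task n.
    # Eq. (5): f_t^k = max_{l<k} (a_{l,t} - a_{k,t}); F_k is their average.
    f = [max(acc[l, t] for l in range(k)) - acc[k, t] for t in range(k)]
    avg_forgetting = float(np.mean(f))
    # Eq. (6): I_k = a_k^- - a_{k,k}.
    intransigence = ref_acc[k] - acc[k, k]
    return avg_forgetting, intransigence
```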

4.1 Detailed Analysis Under Conventional EFCIL Setting

The overall results of different methods under conventional EFCIL settings of TinyImageNet, ImageNet100 and CIFAR100 are reported in Table 1. The simple baseline method, Feat∗ , achieves similar results compared to the SOTA methods. For example, the margins between FeTrIL and Feat∗ are no more than 2%. Feat∗ is observed to be slightly better (0.6% improvements) than FeTrIL under the TinyImageNet B100 20 steps setting. Therefore, it demonstrates that


Fig. 3. Incremental Accuracy Curves of the SOTA methods. Each point represents the incremental classification accuracy (%) after model learning on each task.

Table 2. Results on TinyImageNet and ImageNet100 with intransigence and forgetting metrics. I represents Intransigence (%), F represents Average Forgetting (%). Best results in red, second best in blue.

TinyImageNet:
Methods     | 5 steps I / F | 10 steps I / F | 20 steps I / F
PASS [12]   | 22.5 / 15.4   | 24.3 / 20.6    | 29.6 / 25.2
SSRE [11]   | 21.3 / 16.1   | 23.2 / 21.1    | 25.4 / 24.3
FeTrIL [15] | 18.9 / 12.6   | 19.4 / 11.8    | 21.0 / 12.9
Feat∗       | 20.9 / 8.9    | 20.9 / 9.0     | 20.9 / 8.9
RAD         | 18.0 / 8.7    | 18.1 / 9.6     | 18.4 / 10.2

ImageNet100:
Methods     | 5 steps I / F | 10 steps I / F | 20 steps I / F
PASS [12]   | 27.0 / 19.3   | 32.0 / 25.7    | 42.4 / 31.6
SSRE [11]   | 25.7 / 24.9   | 30.2 / 30.5    | 34.0 / 35.0
FeTrIL [15] | 21.7 / 14.7   | 23.4 / 15.4    | 27.7 / 18.1
Feat∗       | 25.1 / 8.9    | 25.1 / 9.8     | 31.9 / 10.4
RAD         | 21.5 / 9.7    | 23.0 / 11.4    | 27.6 / 11.0



Table 3. Ablative Study of the proposed RAD. Results of TinyImageNet B100 10 steps and ImageNet100 B50 10 steps are reported.

| Rotation | Distillation | TinyImageNet Avg (↑) / I (↓) / F (↓) | ImageNet100 Avg (↑) / I (↓) / F (↓) |
| – | – | 24.0 / 57.0 / 76.1 | 25.7 / 77.3 / 90.2 |
| ✓ | – | 11.2 / 60.1 / 74.3 | 13.7 / 78.7 / 87.7 |
| – | ✓ | 52.9 / 21.1 / 11.6 | 70.0 / 24.2 / 26.7 |
| ✓ | ✓ | 55.6 / 18.1 / 9.6 | 71.8 / 23.0 / 11.4 |

Fig. 4. Impacts of varying α and β on overall results of ImgNet100 B50 10 steps.

of Feat∗ still achieves mid-level performance. This further demonstrates that the initial model trained on half of the dataset can provide sufficient discriminative power for the remaining classes, as described above. FeTrIL achieves the best overall performance among the existing SOTA methods, as shown in Table 1, and accordingly both its I and F are lower than those of its counterpart SOTA methods, as shown in Table 2. Similarly, the proposed RAD achieves the best overall performance on various datasets, which is consistent with the results in Table 2: the I and F of RAD are either the best or second-best results. Specifically, RAD is only slightly worse than Feat∗ on the F metric, which is compensated by the best I performance of RAD. Moreover, RAD is consistently better than FeTrIL on both metrics. Therefore, the superiority of our RAD relies on its more balanced performance between plasticity and stability.

Ablation Study. Contributions of the different components of the proposed RAD are illustrated in Table 3. First, applying data rotation alone in model fine-tuning harms the overall performance: the augmented images seem to alleviate the forgetting issue (lower F) but make the newly learned knowledge less discriminative (higher I). Second, model distillation clearly boosts the overall performance; it helps the model not only resist forgetting old knowledge but also better capture the discriminative information from new tasks, as its F and I are relatively low. Finally, RAD combines these two techniques. The detailed analysis shows that model distillation benefits from rotation data augmentation and thus achieves a superior balance between plasticity and stability.
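To illustrate how rotation augmentation and distillation can be combined in a single training step, here is a minimal PyTorch-style sketch. It is not the authors' implementation: the function and parameter names (`rad_step`, `frozen_teacher`, `kd_weight`, `temperature`) and the choice to distill only the old-class logits are assumptions made for illustration.

```python
import torch
import torch.nn.functional as F

def rad_step(student, frozen_teacher, x, y, optimizer, kd_weight=1.0, temperature=2.0):
    """One illustrative step: rotation-augment the batch, classify, and distill
    the old-class logits from a frozen initial-task model."""
    # Stack 0/90/180/270-degree rotations of the image batch (NCHW layout).
    rotated = torch.cat([torch.rot90(x, k, dims=(2, 3)) for k in range(4)], dim=0)
    labels = y.repeat(4)

    student_logits = student(rotated)
    ce_loss = F.cross_entropy(student_logits, labels)

    with torch.no_grad():
        teacher_logits = frozen_teacher(rotated)   # logits over old classes only
    num_old = teacher_logits.size(1)
    kd_loss = F.kl_div(
        F.log_softmax(student_logits[:, :num_old] / temperature, dim=1),
        F.softmax(teacher_logits / temperature, dim=1),
        reduction="batchmean",
    ) * temperature ** 2

    loss = ce_loss + kd_weight * kd_loss
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return float(loss)
```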



Table 4. Results of TinyImageNet under the challenging EFCIL setting with average incremental accuracy (Avg.), intransigence (I) and forgetting (F) metrics reported. Best results in red, second best in blue.

| Methods | 5 steps Avg. (↑) / I (↓) / F (↓) | 10 steps Avg. / I / F | 25 steps Avg. / I / F |
| PASS [12] | 45.1 / 30.3 / 15.5 | 42.5 / 32.0 / 20.2 | 39.7 / 34.6 / 26.5 |
| SSRE [11] | 43.6 / 32.2 / 12.2 | 41.1 / 33.7 / 14.8 | 40.1 / 33.9 / 21.5 |
| FeTrIL [15] | 47.9 / 29.7 / 13.5 | 46.5 / 30.4 / 13.4 | 45.1 / 30.8 / 13.9 |
| Feat∗ | 44.1 / 34.6 / 10.1 | 43.3 / 34.6 / 9.9 | 42.7 / 34.6 / 10.1 |
| RAD | 48.6 / 30.2 / 11.6 | 47.7 / 30.3 / 11.5 | 47.0 / 30.2 / 12.3 |

Impacts of Balancing Hyper-parameters. There are two balancing hyper-parameters, α and β, both set to 1.0 by default. As shown in Fig. 4, our method is not sensitive to variations of these hyper-parameters.

4.2 Detailed Analysis Under Challenging EFCIL Setting

The results of the challenging EFCIL setting are reported in Table 4. Comparing the overall performance (the average incremental accuracy) of the methods in Table 4 with those in Table 1, all of them experience significant drops, usually more than 6%, because much less data is provided in the initial task under the new setting. In particular, the baseline Feat∗ suffers the worst performance degradation, ranging from 9.2% to 10.1% across the different numbers of incremental steps. This is because Feat∗ is a simple CIL method that relies on the initial-task training only; with less initial data, the corresponding model generalizes much worse to the new data from unseen classes. Under the challenging EFCIL setting, the proposed RAD still achieves the best overall performance against the existing SOTA methods. Comparing the I and F of the different methods in Table 4, RAD consistently achieves either the best or the second-best performance on these metrics. This demonstrates that RAD can better alleviate the plasticity-stability dilemma in EFCIL than existing methods.

5 Conclusion

We provide a more detailed analysis and comparison of different exemplar-free class incremental learning (EFCIL) methods than many existing works. Besides the overall performance, i.e., the average incremental accuracy, two complementary metrics, the Forgetting and the Intransigence, are included to measure the EFCIL methods from the perspectives of stability and plasticity. A simple yet effective EFCIL method, Rotation Augmented Distillation (RAD), has been proposed. RAD consistently achieves one of the state-of-the-art overall performances under various EFCIL settings. Based on the detailed analysis, we find that the



superiority of RAD comes from its more balanced performance between plasticity and stability. Moreover, a new challenging EFCIL setting with much less data in the initial task is proposed. It aims to alleviate the bias of a strong initial pre-trained model and highlight the incremental learning performance.

Acknowledgement. This research is supported by the National Science Foundation for Young Scientists of China (No. 62106289).

References 1. Zhong, C., et al.: Discriminative distillation to reduce class confusion in continual learning. In: Proceedings of the Pattern Recognition and Computer Vision (PRCV) (2022) 2. Huang, T., Qu, W., Zhang, J.: Continual representation learning via auto-weighted latent embeddings on person ReIDD. In: Ma, H., et al. (eds.) PRCV 2021. LNCS, vol. 13021, pp. 593–605. Springer, Cham (2021). https://doi.org/10.1007/978-3030-88010-1 50 3. Shaheen, K., Hanif, M.A., Hasan, O., et al.: Continual learning for real-world autonomous systems: algorithms, challenges and frameworks. J. Intell. Robot. Syst. (2022) 4. Geiger, A., Lenz, P., Urtasun, R.: Are we ready for autonomous driving? the KITTI vision benchmark suite. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition(CVPR) (2012) 5. Li, K., Chen, K., Wang, H., et al.: CODA: a real-world road corner case dataset for object detection in autonomous driving. In: Proceedings of the European Conference on Computer Vision (ECCV) (2022) 6. Rolnick, D., Ahuja, A., Schwarz, J., et al.: Experience replay for continual learning. In: Advances in Neural Information Processing Systems (NeurIPS) (2019) 7. Masana, M., Liu, X., Twardowski, B., et al.: Class-incremental learning: survey and performance evaluation on image classification. IEEE Trans. Pattern Anal. Mach. Intell. (TPAMI) (2022) 8. Rebuffi, S.A., Kolesnikov, A., Sperl, G., et al.: iCaRL: incremental classifier and representation learning. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2017) 9. French, R.M.: Catastrophic forgetting in connectionist networks. Trends Cogn. Sci. (1999) 10. Zhou, D.W., Yang, Y., Zhan, D.C.: Learning to classify with incremental new class. IEEE Trans. Neural Netw. Learn. Syst. (2021) 11. Zhu, K., et al.: Self-sustaining representation expansion for non-exemplar classincremental learning. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2022) 12. Zhu, F., et al.: Prototype augmentation and self-supervision for incremental learning. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2021) 13. Zhu, F., et al.: Class-incremental learning via dual augmentation. In: Advances in Neural Information Processing Systems (NeurIPS) (2021) 14. Xu, Q., et al.: Constructing deep spiking neural networks from artificial neural networks with knowledge distillation. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2023)



15. Petit, G., et al.: FetrIL: feature translation for exemplar-free class-incremental learning. In: Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) (2023) 16. Hinton, G., Vinyals, O., Dean, J.: Distilling the knowledge in a neural network. arXiv preprint arXiv:1503.02531 (2015) 17. Yu, L., et al.: Semantic drift compensation for class-incremental learning. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2020) 18. Belouadah, E., Popescu, A.: DeeSIL: deep-shallow incremental learning. In: Proceedings of the European Conference on Computer Vision (ECCV) Workshop (2018) 19. Kirkpatrick, J., et al.: Overcoming catastrophic forgetting in neural networks. In: Proceedings of the National Academy of Sciences(PNAS) (2017) 20. Dhar, P., et al.: Learning without memorizing. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2019) 21. Lee, H., Ju Hwang, S., Shin, J.: Self-supervised label augmentation via input transformations. In: International Conference on Machine Learning (ICML) (2020) 22. Liu, Y., et al.: More classifiers, less forgetting: a generic multi-classifier paradigm for incremental learning. In: Proceedings of the European Conference on Computer Vision (ECCV) (2020) 23. Smith, J., et al.: Always be dreaming: a new approach for data-free classincremental learning. In: Proceedings of the IEEE International Conference on Computer Vision (ICCV) (2021) 24. Hou, S., et al.: Learning a unified classifier incrementally via rebalancing. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2019) 25. Wang, Y., Huang, Z., Hong. X.: S-prompts learning with pre-trained transformers: an Occam’s Razor for domain incremental learning. In: Advances in Neural Information Processing Systems (NeurIPS) (2022) 26. Wu, Y., et al.: Large scale incremental learning. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2019) 27. Zhao, B., et al.: Maintaining discrimination and fairness in class incremental learning. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2020) 28. Ginsca, A.L., Popescu, A., Le Borgne, H., Ballas, N., Vo, P., Kanellos, I.: Largescale image mining with flickr groups. In: He, X., Luo, S., Tao, D., Xu, C., Yang, J., Hasan, M.A. (eds.) MMM 2015. LNCS, vol. 8935, pp. 318–334. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-14445-0 28 29. Le, Y., Xuan Y.: Tiny imageNet visual recognition challenge. CS 231N 7.7, 3 (2015) 30. Russakovsky, O., et al.: ImageNet large scale visual recognition challenge. Int. J. Comput. Vis. (IJCV) (2015) 31. Krizhevsky, A., Hinton, G.: Learning multiple layers of features from tiny images. Technical Report TR 2009, University of Toronto, Toronto (2009) 32. Grossberg, S.T.: Studies of mind and brain: neural principles of learning, perception, development, cognition, and motor control. Springer Science & Business Media (2012). https://doi.org/10.1007/978-94-009-7758-7 33. He, K., et al.: Deep residual learning for image recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2016) 34. Chaudhry, A., et al.: Riemannian walk for incremental learning: understanding forgetting and intransigence. In: Proceedings of the European Conference on Computer Vision (ECCV) (2018)

Nonconvex Tensor Hypergraph Learning for Multi-view Subspace Clustering

Xue Yao and Min Li(B)

College of Mathematics and Statistics, Shenzhen University, Shenzhen 518060, China
[email protected]

Abstract. Low-rank representation has been widely used in multi-view clustering. But the existing methods are matrix-based, which cannot well capture high-order low-rank correlation embedded in multiple views and fail to retain the local geometric structure of features resided in multiple nonlinear subspaces simultaneously. To handle this problem, we propose a nonconvex tensor hypergraph learning for multi-view subspace clustering. In this model, the hyper-Laplacian regularization is used to capture high-order global and local geometric information of all views. The nonconvex weighted tensor Schatten-p norm can better characterize the high-order correlations of multi-view data. In addition, we design an effective alternating direction algorithm to optimize this nonconvex model. Extensive experiments on five datasets prove the robustness and superiority of the proposed method.

Keywords: Hypergraph learning · Tensor Schatten-p norm · Low-rank representation

1 Introduction

Multi-view clustering has been a hot topic in artificial intelligence and machine learning [1,12,15,21]. Multi-view clustering aims to use heterogeneous features of data samples to partition the data into several clusters. Extensive methods have been proposed for multi-view clustering, among which multi-view graph clustering and multi-view subspace clustering (MVSC) [3,7] are the most popular. The graph-based methods usually represent each data point as a vertex and the pairwise similarities by edges [14]. For instance, Nie et al. [10] proposed a framework via the reformulation of the standard spectral learning model, and learnt an optimal weight for each graph automatically. Tang et al. [11] first proposed an effective and parameter-free method to learn a unified graph via cross-view graph diffusion. These methods have achieved good results due

This work is supported in part by the National Nature Science Foundation of China (62072312, 61972264), in part by Shenzhen Basis Research Project (JCYJ20210324094009026, JCYJ20200109105832261).



to their effective exploration of the complementary information and global geometric structure. However, these graph methods ignore the high-order information embedded in multi-view data.

The existing methods for multi-view subspace clustering typically use self-representation to construct the similarity matrix or coefficient tensor [5]. Among them, low-rank representation (LRR) [16,25] is the most representative technique. There are two categories of LRR-based multi-view subspace clustering: 1) matrix-based optimization methods [6,19], and 2) tensor-based optimization methods [22,27]. Although these methods have achieved impressive results, they still have shortcomings. The matrix-based approaches cannot well capture the high-order information among all views of the data, while the tensor-based methods often use the tensor nuclear norm to regularize the coefficient tensor, which overlooks the differences between singular values. In addition, the tensor-based methods usually rely solely on low-rank representation, which fails to characterize the multi-linear local geometric structure [16] that is important for clustering complex structured datasets, such as textual data and gene data.

To address these deficiencies, we propose nonconvex tensor hypergraph learning for multi-view subspace clustering (THMSC). The main contributions of our method are summarized as follows:

– We propose to use the hyper-Laplacian regularization to capture the global and local geometric structure in multiple self-representation matrices. We advocate adopting the weighted tensor Schatten p-norm to characterize the high-order low-rank information embedded in the coefficient tensor. We integrate the tensor Schatten p-norm regularization and the hyper-Laplacian regularization into a unified framework, which considers the prior information of singular values.
– We design an effective algorithm to solve this nonconvex model. Extensive experiments on several datasets show the robustness and effectiveness of our proposed method.

2 Related Work

In this section, we briefly review the multi-view clustering methods and the hyper-Laplacian learning. The notation and preliminaries of tensors used in this paper are introduced in works [4,20,28].

2.1 Multi-view Subspace Clustering

Multi-view subspace clustering has attracted extensive attention in machine learning and artificial intelligence [9], which aims to learn a new affinity matrix shared by different views. Based on multi-view subspace clustering, many methods have emerged in the past few years. For instance, Cao et al. [1] utilized the Hilbert Schmidt Independence Criterion to explore the complementarity information of multi-view data. Xia et al. [19] proposed a Markov chain method for



Robust Multi-view Spectral Clustering (RMSC) to construct a shared transition probability matrix. Wang et al. [18] proposed a multi-graph Laplacian regularized LRR to characterize local manifold structure. Zhang et al. [26] sought the underlying latent representation and explored the complementary information of multiple views. Zhang et al. [27] proposed a low-rank tensor constrained multi-view subspace clustering (LTMSC) method, which performs Tucker tensor decomposition on the coefficient tensor. Xie et al. [22] proposed t-SVD-based multi-view subspace clustering (t-SVD-MSC).

2.2 Hypergraph Learning

Given a hypergraph G = (V, E, W), V represents the set of vertices, E is the set of hyperedges, and W is the weight matrix, whose elements are positive. We define the degrees of vertex v and hyperedge e as $d(v) = \sum_{e \in E} w(e) h(v, e)$ and $d(e) = \sum_{v \in V} h(v, e)$, where $h(v, e)$ is the entry of the incidence matrix $H \in R^{|V| \times |E|}$, and $w(e)$ is the entry associated with each hyperedge in the weight matrix W. H represents the relationship between the vertices and the hyperedges, and is defined as follows:

$h(v, e) = 1$ if $v \in e$, and $h(v, e) = 0$ otherwise.  (1)

Let $D_V$ and $D_E$ be the diagonal matrices of the vertex degrees $d(v)$ and the hyperedge degrees $d(e)$, respectively. Then, the normalized hypergraph Laplacian matrix is $L_h = D_V - H W D_E^{-1} H^T$.

The hyper-Laplacian regularizer rests on the assumption that if two data points $x_i$ and $x_j$ are close in the geometric structure of the original data distribution, they also remain close in the new representation space, where neighboring points are linearly related [24]. Therefore, the hyper-Laplacian regularizer on the self-representation coefficient matrices can preserve the local geometric structure embedded in the high-dimensional representation space [23]. According to [24], the hyper-Laplacian regularizer on the self-representation coefficients can be written as

$\min_{Z^{(v)}} \operatorname{tr}\big(Z^{(v)T} L_h^{(v)} Z^{(v)}\big),$  (2)

where $Z^{(v)}$ and $L_h^{(v)}$ are the representation coefficient matrix and hypergraph Laplacian matrix of the v-th view, respectively.
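As a concrete illustration of the construction above, here is a small NumPy sketch that builds $L_h = D_V - H W D_E^{-1} H^T$ from an incidence matrix and hyperedge weights; the function name and example data are our own and only illustrative.

```python
import numpy as np

def hypergraph_laplacian(H: np.ndarray, w: np.ndarray) -> np.ndarray:
    """L_h = D_V - H W D_E^{-1} H^T for incidence matrix H (|V| x |E|)
    and positive hyperedge weights w of length |E|."""
    W = np.diag(w)
    d_v = H @ w                   # vertex degrees: sum_e w(e) h(v, e)
    d_e = H.sum(axis=0)           # hyperedge degrees: sum_v h(v, e)
    return np.diag(d_v) - H @ W @ np.diag(1.0 / d_e) @ H.T

# Tiny usage example: 4 vertices, 2 hyperedges.
H = np.array([[1, 0],
              [1, 1],
              [1, 1],
              [0, 1]], dtype=float)
L_h = hypergraph_laplacian(H, np.array([1.0, 2.0]))
```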

3 The Proposed Approach

3.1 The Proposed Model

Tensor-based multi-view subspace clustering methods have achieved some promising results. However, most of them do not consider the prior information of



matrix singular values when using the tensor nuclear norm. In addition, the tensor nuclear norm regularizer can capture the global low-dimensional structure well, but cannot obtain the local geometric structure of data features embedded in multiple nonlinear subspaces. To handle these limitations, we propose a new hyper-Laplacian regularized tensor low-rank representation for multi-view subspace clustering.

Let $X^{(v)} = [x_1^{(v)}, x_2^{(v)}, \dots, x_N^{(v)}] \in R^{d^{(v)} \times N}$ denote the data matrix of the v-th view, where $d^{(v)}$ is the dimension of the data points and N is the number of data points. $Z^{(v)} = [z_1^{(v)}, z_2^{(v)}, \dots, z_N^{(v)}] \in R^{N \times N}$ is the self-representation coefficient matrix of the v-th view, and $E^{(v)}$ denotes the noise and outliers. The proposed model is formulated as:

$\min_{Z, E} \|Z\|^p_{\omega, Sp} + \lambda_1 \|E\|_{2,1} + \lambda_2 \sum_{v=1}^{M} \operatorname{tr}\big(Z^{(v)T} L_h^{(v)} Z^{(v)}\big)$
$\text{s.t. } X^{(v)} = X^{(v)} Z^{(v)} + E^{(v)}, \ v = 1, \dots, M,$
$\quad Z = \Phi\big(Z^{(1)}, Z^{(2)}, \dots, Z^{(M)}\big),$
$\quad E = \big[E^{(1)}; E^{(2)}; \cdots; E^{(M)}\big],$  (3)

where Φ denotes the construction of a tensor $Z \in R^{N \times M \times N}$ from the $Z^{(v)}$; for details, see [20]. $Z(:, v, :) = Z^{(v)}$ is the v-th lateral slice of the tensor Z, E assembles the error matrices of all views by column alignment, $L_h^{(v)}$ is the hyper-Laplacian matrix of the v-th view, and M is the number of views. λ1 and λ2 are both non-negative parameters.

The weighted tensor Schatten-p norm (WTSNM) $\|Z\|^p_{\omega, Sp}$ in model (3) is defined as follows:

$\|Z\|^p_{\omega, Sp} = \Big( \sum_{i=1}^{N} \sum_{j=1}^{h} \omega_j \cdot \sigma_j\big(\bar{Z}^{(i)}\big)^p \Big)^{\frac{1}{p}},$  (4)

where p is the power parameter, $\omega_j$ denotes the weight of the j-th singular value, $\sigma_j(\bar{Z}^{(i)})$ denotes the j-th singular value of the frontal slice $\bar{Z}^{(i)}$, and $h = \min(N, M)$. $\bar{Z}$ is the result of the fast Fourier transform of Z along the third mode. Intuitively, the WTSNM can use suitable weights and power to shrink the larger singular values less and thus better capture the high-order low-rank structure of the tensor, while the hyper-Laplacian regularizer characterizes the local geometric information.

To solve model (3), we use the Augmented Lagrange Multiplier (ALM) method and adopt an alternating direction minimizing strategy. We introduce M auxiliary matrices $Q^{(v)}$ (v = 1, ..., M) and one auxiliary tensor J to replace $Z^{(v)}$ and Z, respectively.
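The following NumPy sketch evaluates the WTSNM of Eq. (4) directly from its definition (FFT along the third mode, then a weighted sum of singular values of each frontal slice). The function name and argument layout are our own; the paper does not provide code.

```python
import numpy as np

def weighted_tensor_schatten_p(Z: np.ndarray, omega: np.ndarray, p: float) -> float:
    """Weighted tensor Schatten-p norm of a 3-way tensor Z (N x M x N), Eq. (4)."""
    Z_bar = np.fft.fft(Z, axis=2)   # frontal slices in the Fourier domain
    total = 0.0
    for i in range(Z_bar.shape[2]):
        # Singular values of the i-th frontal slice; length h = min(N, M).
        sigma = np.linalg.svd(Z_bar[:, :, i], compute_uv=False)
        total += np.sum(omega[: sigma.size] * sigma ** p)
    return total ** (1.0 / p)
```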



Then, the original problem can be rewritten as follows:

$L\big(Z^{(1)}, \dots, Z^{(M)}, J, E^{(1)}, \dots, E^{(M)}, Q^{(1)}, \dots, Q^{(M)}\big) = \|J\|^p_{\omega, Sp} + \lambda_1 \|E\|_{2,1} + \lambda_2 \sum_{v=1}^{M} \operatorname{tr}\big(Q^{(v)T} L_h^{(v)} Q^{(v)}\big)$
$\quad + \sum_{v=1}^{M} \Big[ \big\langle Y_1^{(v)}, X^{(v)} - X^{(v)} Z^{(v)} - E^{(v)} \big\rangle + \frac{\mu_1}{2} \big\|X^{(v)} - X^{(v)} Z^{(v)} - E^{(v)}\big\|_F^2 \Big]$
$\quad + \sum_{v=1}^{M} \Big[ \big\langle Y_2^{(v)}, Z^{(v)} - Q^{(v)} \big\rangle + \frac{\mu_2}{2} \big\|Z^{(v)} - Q^{(v)}\big\|_F^2 \Big]$
$\quad + \langle G, Z - J \rangle + \frac{\rho}{2} \|Z - J\|_F^2,$  (5)

where G, $Y_1^{(v)}$, $Y_2^{(v)}$ are the Lagrange multipliers, and μ1, μ2 and ρ are the penalty parameters.

3.2 Optimization

The optimization scheme can be divided into four steps.

$Z^{(v)}$-Subproblem: With E, $Q^{(v)}$, J fixed, and $J^{(v)} = \Phi^{-1}_{(v)}(J)$, $G^{(v)} = \Phi^{-1}_{(v)}(G)$, $Z^{(v)}$ can be updated by solving the following subproblem:

$\arg\min_{Z^{(v)}} \big\langle Y_1^{(v)}, X^{(v)} - X^{(v)} Z^{(v)} - E^{(v)} \big\rangle + \frac{\mu_1}{2} \big\|X^{(v)} - X^{(v)} Z^{(v)} - E^{(v)}\big\|_F^2 + \big\langle Y_2^{(v)}, Z^{(v)} - Q^{(v)} \big\rangle + \frac{\mu_2}{2} \big\|Z^{(v)} - Q^{(v)}\big\|_F^2 + \big\langle G^{(v)}, Z^{(v)} - J^{(v)} \big\rangle + \frac{\rho}{2} \big\|Z^{(v)} - J^{(v)}\big\|_F^2.$  (6)

By setting the derivative of Eq. (6) to zero, we obtain

$Z^{(v)*} = \big(\mu_1 X^{(v)T} X^{(v)} + \mu_2 I + \rho I\big)^{-1} \big(\mu_1 X^{(v)T} X^{(v)} + X^{(v)T} Y_1^{(v)} + \rho J^{(v)} - \mu_1 X^{(v)T} E^{(v)} - G^{(v)} - Y_2^{(v)} + \mu_2 Q^{(v)}\big).$  (7)



$Q^{(v)}$-Subproblem: With E, $Z^{(v)}$, J fixed, $Q^{(v)}$ can be updated by solving the following subproblem:

$Q^{(v)*} = \arg\min_{Q^{(v)}} \lambda_2 \operatorname{tr}\big(Q^{(v)T} L_h^{(v)} Q^{(v)}\big) + \big\langle Y_2^{(v)}, Z^{(v)} - Q^{(v)} \big\rangle + \frac{\mu_2}{2} \big\|Z^{(v)} - Q^{(v)}\big\|_F^2 = \big(\mu_2 Z^{(v)} + Y_2^{(v)}\big)\big(2\lambda_2 L_h^{(v)} + \mu_2 I\big)^{-1}.$  (8)

E-Subproblem: With the $Z^{(v)}$ fixed, we have:

$E^* = \arg\min_{E} \frac{\lambda_1}{\mu_1} \|E\|_{2,1} + \frac{1}{2} \|E - B\|_F^2.$  (9)

The solution of (9) is

$E^*_{:,i} = \begin{cases} \dfrac{\|B_{:,i}\|_2 - \lambda_1/\mu_1}{\|B_{:,i}\|_2}\, B_{:,i}, & \|B_{:,i}\|_2 > \frac{\lambda_1}{\mu_1} \\ 0, & \text{otherwise,} \end{cases}$  (10)

where $B_{:,i}$ is the i-th column of $B = [B^{(1)}; \dots; B^{(M)}]$ and $B^{(v)} = X^{(v)} - X^{(v)} Z^{(v)} + \frac{1}{\mu_1} Y_1^{(v)}$, $v = 1, \dots, M$.

J-Subproblem: Fixing the variables $Z^{(v)}$, we have:

$J^* = \arg\min_{J} \|J\|^p_{\omega, Sp} + \frac{\rho}{2} \Big\|Z - J + \frac{G}{\rho}\Big\|_F^2.$  (11)
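The column-wise shrinkage in Eq. (10) is straightforward to implement. Below is a minimal NumPy sketch of this proximal step, assuming the threshold `tau` equals $\lambda_1/\mu_1$; the function name is illustrative.

```python
import numpy as np

def prox_l21(B: np.ndarray, tau: float) -> np.ndarray:
    """Column-wise shrinkage solving min_E tau*||E||_{2,1} + 0.5*||E - B||_F^2 (Eq. 10)."""
    E = np.zeros_like(B)
    norms = np.linalg.norm(B, axis=0)        # ||B_{:,i}||_2 for each column
    keep = norms > tau                       # columns that survive the threshold
    E[:, keep] = B[:, keep] * (norms[keep] - tau) / norms[keep]
    return E
```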

According to the algorithm introduced in [20], the solution of model (11) is

$J^* = \Gamma_{\frac{1}{\rho} \cdot n_3 \cdot \omega}\Big(Z + \frac{G}{\rho}\Big).$  (12)

Updating the Lagrange multipliers G, $Y_1^{(v)}$, $Y_2^{(v)}$, they are solved by

$G = G + \rho(Z - J),$  (13)
$Y_1^{(v)} = Y_1^{(v)} + \mu_1\big(X^{(v)} - X^{(v)} Z^{(v)} - E^{(v)}\big),$  (14)
$Y_2^{(v)} = Y_2^{(v)} + \mu_2\big(Z^{(v)} - Q^{(v)}\big).$  (15)

In summary, the above optimization procedure is denoted as Algorithm 1.

3.3 Complexity Analysis

The computational complexity of our proposed method mainly involves three variables: J, E, and $L_h^{(v)}$. The complexities of solving these variables iteratively are $O(2N^2 M \log(N) + N^2 M^2)$, $O(2N^2 M)$ and $O(N^2 M \log(N))$, respectively. Adding the computational complexity of spectral clustering, $O(N^3)$, the total complexity of our method is $O(T(2N^2 M \log(N) + N^2 M^2)) + O(N^3)$, where T is the number of iterations.



Algorithm 1. THMSC

Input: multi-view data ($X^{(v)}$, v = 1, 2, ..., M); λ1, λ2, ω.
Initialize: $Z^{(v)}$, $E^{(v)}$, $Y_1^{(v)}$, $Y_2^{(v)}$, J, G initialized to 0; ρ = μ1 = μ2 = 10^{-5}, γ = 2, ρ_max = μ_max = 10^{10}, ε = 10^{-7}.
1: while not converged do
2:   Update $Z^{(v)}$ by (7);
3:   Update E by (10);
4:   Update $Q^{(v)}$ by (8);
5:   Update J by (12);
6:   Update the Lagrange multipliers G, $Y_1^{(v)}$, $Y_2^{(v)}$ according to (13), (14), (15);
7:   Update the parameters μ1, μ2 and ρ by μi = min(γμi, μ_max), i = 1, 2, and ρ = min(γρ, ρ_max);
8:   Check the convergence conditions: max{ $\|X^{(v)} - X^{(v)} Z^{(v)} - E^{(v)}\|_\infty$, $\|Z^{(v)} - J^{(v)}\|_\infty$ } < ε;
9: end while
10: Obtain the affinity matrix $S = \frac{1}{M} \sum_{v=1}^{M} \big(|Z^{(v)}| + |Z^{(v)}|^T\big)$;
11: Apply spectral clustering on matrix S;
Output: Clustering result C.

4 Experimental Results and Analysis

4.1 Database and Competitors

We conduct the experiments on ORL, Prokaryotic, BBC4view, BBCSport and Scene-15 [22]; the dataset URLs are listed in the footnotes below. The experimental results of all compared methods are obtained from their open-source code, and the parameter settings follow the original papers. We choose nine state-of-the-art clustering methods as competitors: RMSC [19], DiMSC [1], LT-MSC [27], MLAN [8], ECMSC [17], t-SVD-MSC [22], GMC [13], LMSC [26] and LRTG [2].

Parameter Setting. In our experiments, we tune the parameters λ1 and λ2 in the ranges [0.001, 0.01, 0.1, 0.2, 0.3, 0.4] and [1e-6, 1e-4, 1e-2, 0.1, 0.2, 0.4, 0.8, 1, 1.2, 1.4, 1.6, 1.8, 2, 5], respectively, the power p ∈ (0, 1], and the weights ωi ∈ (0, 100] to obtain the best results. Specifically, λ1 and λ2 are set to 0.2 and 0.4, p is set to 0.5, and the weight vector ω is set to (1;10;100) on the ORL database; λ1 and λ2 are set to 0.2 and 2, p is set to 0.9, and the weight vector ω is set to (5;10) on the Prokaryotic database; λ1 and λ2 are set to 0.01 and 1.2, p is set to 0.9, and the weight vector ω is set to (1;10) on the BBCSport database;

1 http://www.uk.research.att.com/facedatabase.html
2 http://mlg.ucd.ie/datasets/segment.html
3 http://mlg.ucd.ie/datasets/segment.html
4 http://www-cvr.ai.uiuc.edu/ponce_grp/data/


Table 1. Clustering Results on BBC4view, BBCSport Dataset

Method

BBC4view RMSC DiMSC

ACC

NMI

AR

0.775 ± 0.003

0.616 ± 0.004

0.560 ± 0.002

F-score 0.656 ± 0.002

0.892 ± 0.001

0.728 ± 0.002

0.752 ± 0.002

0.810 ± 0.002 0.546 ± 0.000

LT-MSC

0.591 ± 0.000

0.442 ± 0.005

0.400 ± 0.001

ECMSC

0.308 ± 0.028

0.047 ± 0.009

0.008 ± 0.018

0.322 ± 0.017

MLAN

0.853 ± 0.007

0.698 ± 0.010

0.716 ± 0.005

0.783 ± 0.004

t-SVD-MSC 0.858 ± 0.001

0.685 ± 0.002

0.725 ± 0.002

0.789 ± 0.001

GMC

0.693 ± 0.000

0.563 ± 0.000

0.479 ± 0.000

0.633 ± 0.000

LMSC

0.883 ± 0.000

0.699 ± 0.000

0.746 ± 0.000

0.806 ± 0.000

LRTG

0.894 ± 0.000

0.769 ± 0.000

0.791 ± 0.000

0.839 ± 0.000

THMSC

0.999 ± 0.000 0.995 ± 0.000 0.975 ± 0.007 0.981 ± 0.005

BBCSport RMSC DiMSC

0.826 ± 0.001

0.666 ± 0.001

0.637 ± 0.001

0.719 ± 0.001

0.922 ± 0.000

0.785 ± 0.000

0.813 ± 0.000

0.858 ± 0.000 0.428 ± 0.014

LT-MSC

0.460 ± 0.046

0.222 ± 0.028

0.167 ± 0.043

ECMSC

0.285 ± 0.014

0.027 ± 0.013

0.009 ± 0.011

0.267 ± 0.020

MLAN

0.721 ± 0.000

0.779 ± 0.010

0.591 ± 0.000

0.714 ± 0.000

t-SVD-MSC 0.879 ± 0.000

0.765 ± 0.000

0.784 ± 0.000

0.834 ± 0.000

GMC

0.807 ± 0.000

0.760 ± 0.000

0.722 ± 0.000

0.794 ± 0.000

LMSC

0.847 ± 0.003

0.739 ± 0.001

0.749 ± 0.001

0.810 ± 0.001

LRTG

0.943 ± 0.005

0.869 ± 0.009

0.840 ± 0.012

0.879 ± 0.010

THMSC

1.000 ± 0.000 1.000 ± 0.000 1.000 ± 0.000 1.000 ± 0.000

Table 2. Clustering Results on Scene-15, ORL Dataset

Method

Scene-15 RMSC DiMSC

ORL

ACC

NMI

AR

0.503 ± 0.000

0.495 ± 0.000

0.325 ± 0.000

F-score 0.371 ± 0.000

0.300 ± 0.010

0.269 ± 0.009

0.117 ± 0.012

0.181 ± 0.010 0.465 ± 0.007

LT-MSC

0.574 ± 0.009

0.571 ± 0.011

0.424 ± 0.010

ECMSC

0.457 ± 0.001

0.463 ± 0.002

0.303 ± 0.001

0.357 ± 0.001

MLAN

0.331 ± 0.000

0.475 ± 0.000

0.151 ± 0.000

0.248 ± 0.000

t-SVD-MSC 0.812 ± 0.007

0.858 ± 0.007

0.771 ± 0.003

0.788 ± 0.001

GMC

0.381 ± 0.000

0.519 ± 0.000

0.191 ± 0.000

0.281 ± 0.000

LMSC

0.563 ± 0.000

0.525 ± 0.000

0.397 ± 0.000

0.440 ± 0.000

LRTG

0.615 ± 0.016

0.657 ± 0.005

0.486 ± 0.016

0.525 ± 0.014

THMSC

0.978 ± 0.000 0.958 ± 0.000 0.955 ± 0.000 0.958 ± 0.000

RMSC

0.723 ± 0.007

0.872 ± 0.012

0.645 ± 0.003

0.654 ± 0.007

DiMSC

0.838 ± 0.001

0.940 ± 0.003

0.802 ± 0.000

0.807 ± 0.003 0.768 ± 0.004

LT-MSC

0.795 ± 0.007

0.930 ± 0.003

0.750 ± 0.003

ECMSC

0.854 ± 0.011

0.947 ± 0.009

0.810 ± 0.012

0.821 ± 0.015

MLAN

0.705 ± 0.022

0.854 ± 0.018

0.384 ± 0.010

0.376 ± 0.015

t-SVD-MSC 0.970 ± 0.003

0.993 ± 0.002

0.967 ± 0.002

0.968 ± 0.003

GMC

0.633 ± 0.000

0.857 ± 0.000

0.337 ± 0.000

0.360 ± 0.000

LMSC

0.877 ± 0.024

0.949 ± 0.006

0.839 ± 0.022

0.843 ± 0.021

LRTG

0.933 ± 0.003

0.970 ± 0.002

0.905 ± 0.005

0.908 ± 0.005

THMSC

1.000 ± 0.000 1.000 ± 0.000 1.000 ± 0.000 1.000 ± 0.000

Nonconvex Tensor Hypergraph Learning for Multi-view Subspace Clustering

47

Table 3. Clustering Results on Prokaryotic Dataset

Method

Prokaryotic RMSC DiMSC

ACC

NMI

AR

0.461 ± 0.049

0.315 ± 0.041

0.198 ± 0.044

F-score 0.447 ± 0.027

0.395 ± 0.001

0.070 ± 0.000

0.053 ± 0.000

0.346 ± 0.000 0.401 ± 0.006

LT-MSC

0.431 ± 0.007

0.156 ± 0.020

0.051 ± 0.016

ECMSC

0.432 ± 0.001

0.193 ± 0.001

0.078 ± 0.001

0.383 ± 0.002

MLAN

0.712 ± 0.002

0.387 ± 0.003

0.425 ± 0.003

0.618 ± 0.002

t-SVD-MSC 0.523 ± 0.000

0.197 ± 0.000

0.137 ± 0.000

0.486 ± 0.000

GMC

0.496 ± 0.000

0.193 ± 0.000

0.091 ± 0.000

0.461 ± 0.000

LMSC

0.686 ± 0.002

0.306 ± 0.001

0.262 ± 0.001

0.603 ± 0.001

LRTG

0.788 ± 0.000

0.484 ± 0.000

0.492 ± 0.000

0.671 ± 0.000

THMSC

0.799 ± 0.000 0.501 ± 0.000 0.561 ± 0.000 0.736 ± 0.000

λ1 and λ2 are set to 0.01 and 1.2, p is set to 0.3, and the weight vector ω is set to (1;10;10;1) on the BBC4view database; λ1 and λ2 are set to 0.01 and 1e-6, p is set to 0.5, and the weight vector ω is set to (1;10;100) on the Scene-15 database. Each experiment is run 10 times and we report the mean and standard deviation (average ± standard deviation). All experiments are run in Matlab 2021a on a workstation with a 3.2 GHz CPU and 16 GB RAM.

4.2 Experimental Results and Analysis

To evaluate the performance of our method, we use four common metrics: accuracy (ACC), normalized mutual information (NMI), adjusted rand index (AR), and F-score. Tables 1, 2 and 3 list the results on the five databases, from which we make the following observations.

– Our method obtains the best results on all five databases. For example, on the BBC4view database, our method shows a significant increase of 10.4%, 22.6%, 18.4% and 14.2% w.r.t. ACC, NMI, AR and F-score, respectively, over the second best baseline. On the Scene-15 database, our method also shows a significant increase of 16.6%, 10.0%, 18.3% and 17.0% w.r.t. ACC, NMI, AR and F-score, respectively, over the second best baseline. In particular, on BBCSport and ORL, our method clusters all instances correctly.
– On the Scene-15 and ORL databases, the tensor-based method (t-SVD-MSC) performs better than the graph-based method (LRTG). By contrast, on the BBCSport, BBC4view and Prokaryotic datasets, which have more complicated local geometric structures, LRTG is superior to t-SVD-MSC.
– Our method combines hypergraph learning and tensor-constrained multi-view subspace clustering in a joint framework. It not only captures the global and local geometric structure, but also characterizes the high-order correlation among multi-view data. Thus, our method has superior performance compared with the other state-of-the-art methods.

Figure 1 visualizes the confusion matrices of t-SVD-MSC and our method. We can see that our method wins in all categories.



Fig. 1. Confusion matrices on Scene-15 dataset

Fig. 2. (a) ACC on all datasets w.o. or w. error matrix E. (b) ACC on all datasets w.o. or w. hyper-Laplacian regularizer

4.3 Parameter Analysis

We verify the effectiveness of the different components through ablation studies on the hyper-Laplacian regularizer and the error matrix E. Figure 2 shows the ACC on all datasets with (w.) or without (w.o.) the hyper-Laplacian regularizer and the error matrix E, respectively. It can be seen that the error matrix E has a great influence on the clustering performance on all datasets. The BBC4view database (textual data) and the Prokaryotic database (gene data) are particularly sensitive to the hyper-Laplacian regularization. This phenomenon shows that the hyper-Laplacian regularization can well retain the non-linear geometric structure and is helpful for clustering more complex structured datasets.

4.4 Convergence Analysis

Existing works have demonstrated that, when the number of variables is more than two, it is an open problem to prove the convergence of the inexact ALM. Algorithm 1 has four variables, so we cannot prove the convergence of the proposed THMSC theoretically. Instead, we show the convergence of the reconstruction error $\|X^{(v)} - X^{(v)} Z^{(v)} - E^{(v)}\|_\infty$ and the variable error $\|Z^{(v)} - J^{(v)}\|_\infty$ of our algorithm on the Scene-15 dataset in Fig. 3. It can be seen that our method exhibits good convergence behavior.



Fig. 3. Convergence curves on the Scene-15 dataset

5 Conclusion

We develop a nonconvex tensor hypergraph learning for multi-view subspace clustering. We propose to use the hyper-Laplacian regularization to capture highorder global and local geometric information of all views. We advocate adopting the nonconvex weighted tensor Schatten-p norm to characterize the high-order correlations of multi-view data. In addition, we also give an effective alternating direction algorithm to optimize this nonconvex model. Extensive experiments on five datasets illustrate the robustness and superiority of the proposed method.

References 1. Cao, X., Zhang, C., Fu, H., Liu, S., Zhang, H.: Diversity-induced multi-view subspace clustering. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 586–594 (2015) 2. Chen, Y., Xiao, X., Peng, C., Lu, G., Zhou, Y.: Low-rank tensor graph learning for multi-view subspace clustering. IEEE Trans. Circuits Syst. Video Technol. 32(1), 92–104 (2022). https://doi.org/10.1109/TCSVT.2021.3055625 3. Chen, Y., Xiao, X., Zhou, Y.: Multi-view clustering via simultaneously learning graph regularized low-rank tensor representation and affinity matrix. In: 2019 IEEE International Conference on Multimedia and Expo (ICME), pp. 1348–1353. IEEE (2019) 4. Kilmer, M.E., Braman, K., Hao, N., Hoover, R.C.: Third-order tensors as operators on matrices: a theoretical and computational framework with applications in imaging. SIAM J. Matrix Anal. Appl. 34(1), 148–172 (2013) 5. Lu, C., Min, H., Zhao, Z., Zhu, L., Shuang Huang, D., Yan, S.: Robust and efficient subspace segmentation via least squares regression. In: ECCV (2012) 6. Luo, S., Zhang, C., Zhang, W., Cao, X.: Consistent and specific multi-view subspace clustering. In: Thirty-second AAAI Conference on Artificial Intelligence, pp. 3730–3737 (2018) 7. Najafi, M., He, L., Philip, S.Y.: Error-robust multi-view clustering. In: 2017 IEEE International Conference on Big Data (Big Data), pp. 736–745. IEEE (2017) 8. Nie, F., Cai, G., Li, J., Li, X.: Auto-weighted multi-view learning for image clustering and semi-supervised classification. IEEE Trans. Image Process. 27(3), 1501– 1511 (2018). https://doi.org/10.1109/TIP.2017.2754939 9. Nie, F., Cai, G., Li, X.: Multi-view clustering and semi-supervised classification with adaptive neighbours. In: AAAI (2017)



10. Nie, F., Li, J., Li, X.: Parameter-free auto-weighted multiple graph learning: a framework for multiview clustering and semi-supervised classification. In: IJCAI (2016) 11. Tang, C., et al.: CGD: multi-view clustering via cross-view graph diffusion. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 34, pp. 5924– 5931 (2020) 12. Tang, Y., Xie, Y., Yang, X., Niu, J., Zhang, W.: Tensor multi-elastic kernel selfpaced learning for time series clustering. IEEE Trans. Knowl. Data Eng. 33(3), 1223–1237 (2021). https://doi.org/10.1109/TKDE.2019.2937027 13. Wang, H., Yang, Y., Liu, B.: GMC: graph-based multi-view clustering. IEEE Trans. Knowl. Data Eng. 32(6), 1116–1129 (2020). https://doi.org/10.1109/TKDE.2019. 2903810 14. Wang, H., Cen, Y., He, Z., Zhao, R., Cen, Y., Zhang, F.: Robust generalized lowrank decomposition of multimatrices for image recovery. IEEE Trans. Multimedia 19(5), 969–983 (2016) 15. Wang, S., Chen, Y., Jin, Y., Cen, Y., Li, Y., Zhang, L.: Error-robust low-rank tensor approximation for multi-view clustering. Knowl.-Based Syst. 215, 106745 (2021). https://doi.org/10.1016/j.knosys.2021.106745 16. Wang, S., Chen, Y., Zhang, L., Cen, Y., Voronin, V.: Hyper-Laplacian regularized nonconvex low-rank representation for multi-view subspace clusteringd. IEEE Trans. Signal Inf. Process. Over Netw. 8, 376–388 (2022) 17. Wang, X., Guo, X., Lei, Z., Zhang, C., Li, S.Z.: Exclusivity-consistency regularized multi-view subspace clustering. In: 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1–9 (2017). https://doi.org/10.1109/CVPR. 2017.8 18. Wang, Y., Zhang, W., Wu, L., Lin, X., Fang, M., Pan, S.: Iterative views agreement: an iterative low-rank based structured optimization method to multi-view spectral clustering. arXiv preprint arXiv:1608.05560 (2016) 19. Xia, R., Pan, Y., Du, L., Yin, J.: Robust multi-view spectral clustering via low-rank and sparse decomposition. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 28 (2014) 20. Xia, W., Zhang, X., Gao, Q., Shu, X., Han, J., Gao, X.: Multiview subspace clustering by an enhanced tensor nuclear norm. IEEE Trans. Cybern., 1–14 (2021). https://doi.org/10.1109/TCYB.2021.3052352 21. Xie, Y., et al.: Robust kernelized multiview self-representation for subspace clustering. IEEE Trans. Neural Netw. Learn. Syst. 32(2), 868–881 (2021). https://doi. org/10.1109/TNNLS.2020.2979685 22. Xie, Y., Tao, D., Zhang, W., Liu, Y., Zhang, L., Qu, Y.: On unifying multi-view selfrepresentations for clustering by tensor multi-rank minimization. Int. J. Comput. Vis. 126(11), 1157–1179 (2018) 23. Xie, Y., Zhang, W., Qu, Y., Dai, L., Tao, D.: Hyper-laplacian regularized multilinear multiview self-representations for clustering and semisupervised learning. IEEE Trans. Cybern. 50(2), 572–586 (2020) 24. Yin, M., Gao, J., Lin, Z.: Laplacian regularized low-rank representation and its applications. IEEE Trans. Pattern Anal. Mach. Intell. 38(3), 504–517 (2016) 25. Zha, Z., Wen, B., Yuan, X., Zhou, J., Zhu, C.: Image restoration via reconciliation of group sparsity and low-rank models. IEEE Trans. Image Process. 30, 5223–5238 (2021) 26. Zhang, C., et al.: Generalized latent multi-view subspace clustering. IEEE Trans. Pattern Anal. Mach. Intell. 42(1), 86–99 (2020)



27. Zhang, C., Fu, H., Liu, S., Liu, G., Cao, X.: Low-rank tensor constrained multiview subspace clustering. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 1582–1590 (2015) 28. Zhang, Z., Ely, G., Aeron, S., Hao, N., Kilmer, M.: Novel methods for multilinear data completion and de-noising based on tensor-SVD. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 3842–3849 (2014)

A Novel Method for Identifying Bipolar Disorder Based on Diagnostic Texts

Hua Gao, Li Chen, Yi Zhou, Kaikai Chi(B), and Sixian Chan

College of Computer Science and Technology, Zhejiang University of Technology, Hangzhou 310001, China
[email protected]

Abstract. Bipolar disorder is a complex condition characterized by episodes of mixed manic and depressive states, exhibiting complexity and diversity in symptoms, with high rates of misdiagnosis and mortality. To improve the accuracy of automated diagnosis for bipolar disorder, this study proposes a text-based identification method. Our approach focuses on two extreme emotional features, utilizing two temporal networks in the emotion feature module to extract depressive phase features and manic phase features from the text. Simultaneously, mixed-dilated convolutions are introduced in TextCNN to extract local features with a larger receptive field. By integrating feature information captured from different perspectives, we construct a multi-scale feature model that emphasizes both state features. We utilized a self-collected dataset comprising symptom descriptions of bipolar disorder patients from hospitals, achieving an accuracy of 92.5%. This work provides an accurate assessment of bipolar disorder, facilitating individuals to gain a rapid understanding of their condition and holds significant social implications.

Keywords: Bipolar disorder · Text recognition · TextCNN · Deep learning

1 Introduction

Bipolar disorder, also known as manic-depressive disorder, is a severe mental health disease [13]. Classified as a mood disorder, it is characterized by intense fluctuations in mood, encompassing two extreme emotional states: manic episodes and depressive episodes. These two states exhibit distinct features and are sometimes even contradictory. It has been proven that such disorders impact social and psychological functioning in domains such as work, cognition, and interpersonal relationships. Moreover, they are associated with high rates of disability and mortality [9]. Currently, the diagnosis of bipolar disorder mainly relies on assessments conducted by experts in hospitals or related institutions. The assessments by experts are based on various sources, including psychological assessment questionnaires completed by patients, responses from patients and their family members, and



some medical examinations. Automated diagnosis of bipolar disorder can provide quantitative indicators and assist doctors in their diagnosis, thereby reducing the workload caused by the significant shortage of psychiatrists.

In recent years, there have been significant advancements and refinements in automated detection methods for bipolar disorder. Zain et al. [19] developed a three-class method for recognizing the states of bipolar disorder based on clinical text analysis of patients' medical records. They first extracted features using Term Frequency-Inverse Document Frequency (TF-IDF) to focus on important words, trained decision trees, random forests, and AdaBoost models with grid-search optimization, and finally used a voting-based fusion approach to enhance accuracy and robustness. Aich et al. [2] extracted various features from patient dialogues, including time-related features such as the maximum duration of conversations and the average duration of each conversation, classic Linguistic Inquiry and Word Count (LIWC) features, emotion features, and lexical diversity features. They achieved an F1 score of 93%-96% on their dataset by classifying these features with models such as Logistic Regression (LR), Support Vector Machine (SVM), and Random Forest (RF).

Although existing research has achieved automated recognition of bipolar affective disorder, these methods still have limitations: the feature extraction stage lacks deep feature extraction, and machine learning models are used for classification. As mentioned in [6], there is very little research on bipolar affective disorder analysis using natural language processing; most models rely on traditional machine learning methods such as SVM, random forest, and logistic regression, or on deep learning methods such as CNN, and these approaches have not effectively exploited deep learning for bipolar affective disorder research. Moreover, due to the scarcity of publicly available datasets for bipolar disorder, particularly in Chinese, it is challenging to obtain relevant data. Therefore, after communicating with medical experts and analyzing their diagnostic methods, we propose a bipolar disorder recognition method using a Chinese dataset collected from hospitals. The main contributions of this paper are as follows:

1) Our approach emphasizes the separate extraction of the emotional features corresponding to the two extreme states of bipolar affective disorder, enabling the model to capture a more comprehensive range of emotional characteristics.
2) By introducing mixed dilated convolutions into TextCNN, we enhance its feature-capturing performance through a larger receptive field.
3) We propose a hybrid model that emphasizes both state features. By integrating the two extreme emotion characteristics with multi-scale features, we further enhance the performance of the model.

2 Related Work

In earlier research on the automated detection of bipolar disorder, statistical feature-based methods were commonly employed. These methods included the



use of LIWC features to classify the vocabulary appearing in the text, thereby extracting information related to emotions, attitudes, and psychological states. Additionally, statistical information based on word frequency in the text was utilized to calculate TF-IDF features, which represent the importance of vocabulary. Kadkhoda et al. [7] created a dataset by tracking tweets from bipolar disorder patients and non-patients on the social network Twitter. Then they proposed a novel method for detecting bipolar disorder in Twitter users by extracting both static and dynamic features from user tweets. Abaeikoup et al. [1] extracted statistical features from the text, such as word count, type count, and average letter count per word. These features were then combined with simple acoustic features, and classification was performed using an ensemble classifier consisting of CNN and MLP stacked together, achieving an unweighted average recall(UAR) of 59.3%. Baki et al. [3] used the Google Speech Recognition tool to convert Turkish audio into English text. Then they extracted features from the LIWC text analysis tool, term frequency-inverse document frequency (TF-IDF), and three sentiment analysis tools (NLTK Vader, TextBlob, and Flair) to represent textual features. By combining audio and video features and using a Kernel ELM classifier, they achieved a maximum unweighted average recall of 64.8%. Although statistical features are easy to interpret and offer good explainability, they overlook the complex relationships and semantic information between words, limiting their ability to handle more sophisticated natural language processing tasks. To better extract contextual information, textual semantics, and emotions from bipolar disorder text, deep learning models have been increasingly applied in research. Zhang et al. [20] proposed a fixed-length session-level paragraph vector representation for the textual modality in multi-modal feature extraction. They utilized paragraph vectors (PV) to embed interview records into doc2vec document representations, aiming to capture disorder-related features. Furthermore, they employed early fusion of multi-modal features for classification. Murarka et al. [12] employed the RoBERTa model to perform a five-class classification on mental health-related posts collected from social networks. However, in current research on bipolar disorder, deep-learning techniques have not been fully utilized. In natural language processing, Cheng et al. [5] proposed a text classification model based on hierarchical self-attention capsule networks. The model first vectorizes the text and then directs it to the selfattention network for feature extraction at both word and sentence levels. The extracted features are then passed to the capsule network to refine the relationship between different parts of the text and further capture richer semantic information from the text. Performance improvement was achieved compared to baselines on public datasets. Liu et al. [11] combined contextualized features with a multi-stage attention model based on Time Convolutional Network (TCN) and CNN. By utilizing attention weights and incorporating multi-scale TCN, the model improved its parallel processing capability and ability to capture contextual features. The effectiveness of this approach in short texts was demonstrated on six public datasets. In addition to recurrent neural networks (RNN), CNN,



and their variants, graph neural networks (GCN) have also been widely applied and developed in text classification [15,17,18]. GCN is used to model relationships between vocabulary in the text, such as co-occurrence and dependency, to improve model performance. The development of deep learning has provided support for the accurate diagnosis of bipolar disorder.

3 Proposed Method

We proposed a hybrid model that focused on two extreme emotional features. In the feature extraction stage, we extracted the emotional features and multiscale features from the text. Specifically, the emotional features included manic features and depressive features. We trained two neural networks separately on preprocessed manic-phase text and depressive-phase text to extract the two types of emotional features. In the process of multi-scale feature extraction, we introduced mixed dilated convolutions to capture features with a larger receptive field in the text. Finally, we fused and classified the features from different perspectives, enabling a comprehensive analysis of bipolar disorder.

Fig. 1. The structure of our model.

Our model architecture is illustrated in Fig. 1. The left part represents the preprocessing process, where the processed text serves as the input. The right



part represents the overall structure of the model, with the middle branch indicating the extraction of multi-scale features, and the upper and lower branches representing the extraction of emotional features.

3.1 Text Embedding

The BERT model consists of 12 layers of Transformer encoders. By fine-tuning BERT on our dataset, specifically retraining the 11th and 12th layers along with the fully connected (FC) layer, we aim to enhance the model's applicability to our task. We input the text sequence, represented as $x = [x_1, x_2, \dots, x_n]$, into BERT, which represents the entire text by mapping each word to a vector representation. The output is a matrix $E \in R^{n \times d}$, where n denotes the number of words in the text and d represents the vector dimension.
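For illustration, the following sketch obtains token-level embeddings with the Hugging Face transformers library and unfreezes only the last two encoder layers. The checkpoint name (`bert-base-chinese`) and the freezing strategy are assumptions consistent with the description above, not details stated by the authors.

```python
import torch
from transformers import BertModel, BertTokenizer

tokenizer = BertTokenizer.from_pretrained("bert-base-chinese")
bert = BertModel.from_pretrained("bert-base-chinese")

# Freeze everything, then unfreeze the last two encoder layers for fine-tuning.
for p in bert.parameters():
    p.requires_grad = False
for layer in bert.encoder.layer[-2:]:
    for p in layer.parameters():
        p.requires_grad = True

inputs = tokenizer("今天心情时好时坏，晚上也睡不着。", return_tensors="pt")  # example symptom description
with torch.no_grad():
    E = bert(**inputs).last_hidden_state   # shape (1, n, d): the embedding matrix E
```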

3.2 Emotional Feature Extraction

As a variant of recurrent neural networks (RNNs), BiLSTM is well-suited for handling data with sequential structures. Due to the forget gate mechanism in LSTM units, BiLSTM can effectively alleviate the vanishing gradient problem, thus better capturing long-distance dependencies. The forget gate mechanism can be expressed as:

$f_t = \sigma(W_f [h_{t-1}, e_t] + b_f),$  (1)

where $f_t$ represents the output of the forget gate, indicating the information that the model needs to forget from the previous hidden state, σ denotes the sigmoid function, $W_f$ represents the weight matrix of the forget gate, $h_{t-1}$ represents the hidden state from the previous time step, and $e_t$ represents the input at the current time step.

Additionally, BiLSTM considers both forward and backward information in a sequence, allowing it to learn more global contextual information. During the encoding process, the BiLSTM model encodes the forward and backward sequences of a sentence separately, and the forward and backward encoded vectors are concatenated as the final output. For an input sequence E, the forward state sequence can be represented as $\overrightarrow{h} = \{\overrightarrow{h_1}, \overrightarrow{h_2}, \dots, \overrightarrow{h_n}\}$, the backward state sequence as $\overleftarrow{h} = \{\overleftarrow{h_1}, \overleftarrow{h_2}, \dots, \overleftarrow{h_n}\}$, and the final output is obtained by concatenating these two sequences as $h = [\overrightarrow{h}, \overleftarrow{h}]$.

To capture the features of the two extreme emotions comprehensively, we choose BiLSTM to capture the emotional characteristics and decompose the task into two subproblems. For each subproblem, we train a corresponding BiLSTM network using curated depressive texts and manic texts, respectively. Formulas (2) and (3) are the loss functions used for training the depressive network and the manic network, respectively:

$L_d = -\frac{1}{N_d} \sum_{i=1}^{N_d} \big[ d_i \cdot \log(p(d_i)) + (1 - d_i) \cdot \log(1 - p(d_i)) \big],$  (2)



where $L_d$ represents the loss during the training of the depressive network, $N_d$ is the number of depressive samples, $d_i$ is the binary label indicating depression or non-depression, and $p(d_i)$ is the predicted probability of the label $d_i$.

$L_m = -\frac{1}{N_m} \sum_{i=1}^{N_m} \big[ m_i \cdot \log(p(m_i)) + (1 - m_i) \cdot \log(1 - p(m_i)) \big],$  (3)

where $L_m$ represents the loss during the training of the manic network, $N_m$ is the number of manic samples, $m_i$ is the binary label indicating manic or non-manic, and $p(m_i)$ is the predicted probability of the label $m_i$. The two BiLSTM networks learn the semantic information of different emotional features, capturing the emotional characteristics from multiple levels and perspectives.

In the embedding layer, we obtained the vectorized representation $E \in R^{n \times d}$ of the text. By separately feeding this representation into the two BiLSTM models dedicated to the different emotional subtasks, we obtain distinct feature vectors $h_m = [\overrightarrow{h_m}, \overleftarrow{h_m}]$ for manic feature extraction and $h_d = [\overrightarrow{h_d}, \overleftarrow{h_d}]$ for depressive feature extraction. This approach ensures that each model focuses on its respective target emotional feature, thereby enhancing the overall diagnostic performance.
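A minimal PyTorch sketch of one such BiLSTM branch is given below. The hidden size of 128 follows the experimental settings reported later (yielding a 256-dimensional feature); the class name, the pooling of the final forward/backward states, and the binary head are our own assumptions for illustration.

```python
import torch
import torch.nn as nn

class EmotionBranch(nn.Module):
    """One BiLSTM branch (manic or depressive), pre-trained with a binary loss (Eqs. 2-3)."""
    def __init__(self, emb_dim=768, hidden=128):
        super().__init__()
        self.hidden = hidden
        self.bilstm = nn.LSTM(emb_dim, hidden, batch_first=True, bidirectional=True)
        self.head = nn.Linear(2 * hidden, 1)        # binary logit for the branch's task

    def forward(self, E):                           # E: (batch, n, emb_dim) BERT embeddings
        out, _ = self.bilstm(E)                     # (batch, n, 2*hidden)
        fwd = out[:, -1, :self.hidden]              # last step of the forward direction
        bwd = out[:, 0, self.hidden:]               # last step of the backward direction
        feat = torch.cat([fwd, bwd], dim=1)         # h = [h_fwd, h_bwd], 256-dimensional
        return feat, self.head(feat)

manic_branch, depressive_branch = EmotionBranch(), EmotionBranch()
bce = nn.BCEWithLogitsLoss()                        # binary cross-entropy as in Eqs. (2) and (3)
```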

3.3 Multiscale Feature Extraction

In addition to emotional features, individuals with bipolar disorder also exhibit unique characteristics such as sleep disturbances, self-harm behaviors, attention deficits, and anxiety disorders. By extracting these features, further assistance can be provided in detecting bipolar disorder. TextCNN utilizes multiple convolutional kernels of different sizes for feature extraction. It performs a sliding window operation with fixed-sized kernels on the input and obtains features by computing the dot product between the kernel and the input. Standard convolution is the most commonly used convolution operation in TextCNN. Unlike standard convolution, dilated convolution can combine dilated convolutions with different dilation rates to increase the receptive field and capture features at different scales. It focuses on different granularities of information, allowing the model to better capture multiple features within the text. We only applied dilated convolution operations with dilation rates of 1, 2, and 5 on the height of the feature matrix. By introducing mixed dilated convolutions in TextCNN, we extract features at different scales to obtain a larger receptive field. For the input matrix E ∈ Rn×d , we perform mixed dilated convolution operations on matrix E to generate a new feature map C ∈ Rm×e , where m represents the number of convolutional kernels and e represents the output channels. Specifically, for the convolutional kernel Wi,j in the i-th group of convolution operations, each element ci,j in the feature map can be calculated



as follows:

$c_{i,j} = f\big( (W_{i,j} * E_{j:j+s-1}) + b_{i,j} \big),$  (4)

where ∗ denotes the convolution operation, $E_{j:j+s-1}$ denotes the submatrix of E ranging from column j to column j + s − 1, $b_{i,j}$ denotes the bias term, and f represents the activation function. This operation is equivalent to applying mixed dilated convolutions to each subsequence of length s in E, resulting in a new feature map.
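The sketch below illustrates a TextCNN branch with mixed dilated convolutions. Combining the stated kernel sizes (3, 4, 5), 32 kernels, and dilation rates (1, 2, 5) into a full 3×3 grid is an inference of ours, chosen because it yields the 288-dimensional multi-scale feature mentioned in the feature fusion section; the class name is illustrative.

```python
import torch
import torch.nn as nn

class MixedDilatedTextCNN(nn.Module):
    """TextCNN-style branch with dilation rates 1, 2 and 5 applied along the sequence."""
    def __init__(self, emb_dim=768, n_kernels=32, kernel_sizes=(3, 4, 5), dilations=(1, 2, 5)):
        super().__init__()
        self.convs = nn.ModuleList(
            nn.Conv1d(emb_dim, n_kernels, kernel_size=k, dilation=d)
            for k in kernel_sizes for d in dilations
        )

    def forward(self, E):                       # E: (batch, n, emb_dim)
        x = E.transpose(1, 2)                   # -> (batch, emb_dim, n) for Conv1d
        feats = [torch.relu(conv(x)).max(dim=2).values for conv in self.convs]  # global max-pool
        return torch.cat(feats, dim=1)          # (batch, 9 * n_kernels) = 288-dim with defaults
```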

3.4 Feature Fusion

We feed the feature matrix $E \in R^{n \times d}$ obtained from the embedding layer into both the emotion feature model and the multi-scale feature model, resulting in the 256-dimensional manic feature $h_m$, the 256-dimensional depressive feature $h_d$, and the 288-dimensional feature vector m containing local information of the text. Then, we fuse the features to obtain a vector o and use fully connected layers and the softmax function for classification. The vector o is given by:

$o = [h_m, h_d, m],$  (5)

where $h_m$ represents the manic features, $h_d$ the depressive features, and m the multi-scale features. We use the cross-entropy loss function for training:

$L_b = -\frac{1}{N_b} \sum_{i=1}^{N_b} \big[ b_i \cdot \log(p(b_i)) + (1 - b_i) \cdot \log(1 - p(b_i)) \big],$  (6)

where Lb represents the loss of the model, Nb is the number of bipolar disorder samples, bi is the binary label indicating bipolar or non-bipolar, · represents the multiplication operation between two scalars, and p(bi ) is the probability of belonging to the label bi .
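A minimal sketch of the fusion and classification step of Eqs. (5)–(6): the manic, depressive, and multi-scale features are concatenated and fed to a fully connected classifier trained with cross-entropy. The feature dimensions follow the values stated above; the toy tensors and layer names are illustrative assumptions rather than the authors' code.

```python
import torch
import torch.nn as nn

h_m = torch.randn(4, 256)                    # manic features
h_d = torch.randn(4, 256)                    # depressive features
m   = torch.randn(4, 288)                    # multi-scale features
labels = torch.randint(0, 2, (4,))           # bipolar / non-bipolar

o = torch.cat([h_m, h_d, m], dim=1)          # Eq. (5): o = [h_m, h_d, m]
classifier = nn.Linear(256 + 256 + 288, 2)   # fully connected + softmax (in CE)
logits = classifier(o)
loss_b = nn.CrossEntropyLoss()(logits, labels)   # Eq. (6)-style loss L_b
```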

4 Experiments and Results

4.1 Experimental Setup

Dataset. The dataset we used consists of responses from interviewees regarding symptoms related to bipolar affective disorder. It includes questions about emotions, sleep, health, appetite, and daily life, among others. The emotional questions encompass both manic and depressive symptoms. Our dataset comprises 67 samples from individuals with bipolar affective disorder and 72 samples from healthy individuals. We divided the dataset into training, validation, and test sets in a ratio of 3:1:2.


Experiment Settings. The model was trained using Adam optimization with a batch size of 12 and a learning rate of 2e–5. The training process lasted for 100 epochs and included dropout regularization with a rate of 0.2. The hidden layer size of each BiLSTM layer is 128. For the convolutional layers, we used kernel sizes of [3, 4, 5], with 32 kernels and dilation factors of 1, 2, and 5.

4.2 Comparison with Recent Works

In classification tasks, recall, precision, accuracy, and F1 score are important evaluation metrics. We use these metrics to compare the performance of our model with recent research models.

Table 1. Comparison Results with Related Works.

Model           Recall  Precision  Accuracy  F1-score
SVM [4]         0.789   0.714      0.750     0.750
LR [14]         0.789   0.750      0.775     0.769
BERT-TCN [16]   0.842   0.889      0.875     0.865
BERTGCN [10]    0.842   0.842      0.850     0.842
Bi-GRU [8]      0.895   0.850      0.875     0.872
Ours            0.947   0.900      0.925     0.923

According to the results in Table 1, our model outperforms the other models overall, and deep learning-based methods [8,10,16] show significant advantages over machine learning methods [4,14]. This is because machine learning methods struggle to understand contextual semantics and extract deeper-level features, which often leads to inferior performance in complex tasks. Among the deep learning models, BERTGCN [10] falls slightly short in terms of performance. BERT-TCN [16] represents sequence features by using BERT's output as the input to a TCN; however, the TCN, which captures local features of the input sequence through convolutional layers for classification, is less effective than Bi-GRU. Bi-GRU [8], a variant of BiLSTM, effectively captures long-term dependencies in sequences. The experimental evidence shows that using an RNN such as Bi-GRU to capture features yields better results when dealing with features related to bipolar disorder.

4.3 Ablation Study

Our model extracts emotion features and multi-scale features. In the process of emotion feature extraction, we utilize Ld and Lm loss functions to ensure that the model captures features from different emotional states. To validate the effectiveness of our approach, we evaluate the impact of the two features and the Ld and Lm losses on the model in Experiment 1. In Experiment 2, we conduct ablation comparisons for each feature extraction method, demonstrating the effectiveness of both feature extraction methods.


Experiment 1. We defined four scenarios: Multi-scale Feature, Emotion Feature, Multi-scale Feature + Emotion Feature (without Ld + Lm), and Multi-scale Feature + Emotion Feature (with Ld + Lm). We evaluated the performance differences of the model under these different scenarios. The experimental results are shown in Table 2.

Table 2. Experimental Results of Different Feature Extraction Methods.

Model                                            Recall  Precision  Accuracy  F1-score
Multi-scale Feature                              0.947   0.857      0.900     0.900
Emotion Feature (with Ld + Lm)                   0.895   0.895      0.900     0.895
Multi-scale + Emotion Feature (without Ld + Lm)  0.947   0.818      0.875     0.878
Multi-scale + Emotion Feature (with Ld + Lm)     0.947   0.900      0.925     0.923

From the experimental results in the table, it can be seen that when we solely use the emotion feature, the recall rate of the model decreases to 89.5%. We believe that this is because our emotion feature focuses on two emotional states. While this helps to avoid misclassifying samples with partially unrelated features, it also leads to misclassification of samples with less prominent emotions. When using the multi-scale feature alone, the model's precision dropped to 85.7%. This is because, compared to emotion features, multi-scale features are more prone to learning some unrelated features while learning local features, resulting in the misclassification of samples. Therefore, models using multi-scale features have lower precision compared to models using emotion features.

In addition, we also removed the Ld and Lm losses from the model in the experiment. The results showed a significant decrease in model precision, even dropping to 81.8%. This further confirms the effectiveness of extracting two types of emotion features. Extracting these two emotion features can improve the model's precision and enhance overall performance.

Experiment 2. In our emotion feature extraction process, we trained two different neural networks to extract features corresponding to two emotional states. In the multi-scale feature extraction process, we introduced mixed dilated convolutions in TextCNN. To further demonstrate the effectiveness of the two feature extraction methods, we conducted ablation experiments for each extraction method separately. Ablation experiments of the two feature extraction methods are shown in Table 3.

Multi-scale Feature Extraction. When extracting multi-scale features, the TextCNN model with mixed dilated convolutions outperforms the TextCNN model with regular convolutions. This is because mixed dilated convolutions have a larger receptive field, allowing for more comprehensive feature extraction.


Table 3. Ablation Experiments of the Two Feature Extraction Methods.

Method                  Model                              Recall  Precision  Accuracy  F1-score
Multi-scale extraction  BERT-TextCNN                       0.842   0.800      0.825     0.821
                        Multi-scale Feature                0.947   0.857      0.900     0.900
Emotion extraction      BERT-BiLSTM                        0.895   0.810      0.850     0.850
                        Emotion Feature (without Ld + Lm)  0.947   0.818      0.875     0.878
                        Emotion Feature (with Ld + Lm)     0.895   0.895      0.900     0.895

Emotion Feature Extraction. The performance of the model using one BiLSTM network, or two BiLSTM networks without separate losses, to extract emotion features is lower than that of our model, which uses two separate losses to extract the different state features. Although using two BiLSTM networks achieves a recall rate of 94.7%, the precision of the model is significantly lower, at only 81.8%. This is because the two networks may not effectively learn the two different state features during the training process.

Experiment 3. In order to test the robustness of the model, we trained our model with learning rates of 1e–6, 1e–5, 3e–5, and 5e–5. The accuracy, recall, precision, and F1 scores are shown in Table 4.

Table 4. The impacts of learning rate on the model.

Learning Rate  Recall  Precision  Accuracy  F1-score
1e–6           0.895   0.850      0.875     0.872
1e–5           0.947   0.900      0.925     0.923
3e–5           0.947   0.900      0.925     0.923
5e–5           0.895   0.895      0.895     0.900

We also tested the model with 50, 300, and 800 training epochs. The experimental results are shown in Table 5.

Table 5. The impacts of the number of epochs on the model.

Epoch  Recall  Precision  Accuracy  F1-score
50     0.842   0.842      0.842     0.850
300    0.947   0.900      0.925     0.923
800    0.947   0.900      0.925     0.923

It can be seen from the results that when the learning rate is 1e–6 or 5e–5, the performance of the model degrades: if the learning rate is too small, the model tends to get stuck in a local optimum, while if it is too large, the model cannot learn the features effectively. When the number of epochs is 50, the model has not yet converged, so its performance has not reached the optimal level.

5 Conclusion

This paper proposes a method for identifying bipolar disorder based on diagnostic texts. By extracting two types of emotion features and analyzing the multi-scale local features of the text, the model achieves higher accuracy in identifying bipolar disorder. Effectively recognizing and extracting the two types of extreme emotion features provides strong evidence for detecting bipolar disorder. There is still room for further improvement in our method. For example, in this study, we only analyzed the two types of emotion features from a textual perspective. However, the manic phase of bipolar disorder is characterized by an increased tone and accelerated speech rate; therefore, combining text and audio information could be considered when extracting emotion features. Moreover, the symptoms of bipolar disorder overlap with those of other mental disorders. In future work, we therefore hope to adopt multimodal approaches to differentiate various diseases more accurately.

References
1. AbaeiKoupaei, N., Al Osman, H.: A multi-modal stacked ensemble model for bipolar disorder classification. IEEE Trans. Affect. Comput. 14(1), 236–244 (2023)
2. Aich, A., et al.: Towards intelligent clinically-informed language analyses of people with bipolar disorder and schizophrenia. In: Findings of the Association for Computational Linguistics: EMNLP 2022, pp. 2871–2887 (2022)
3. Baki, P., Kaya, H., Çiftçi, E., Güleç, H., Salah, A.A.: A multimodal approach for mania level prediction in bipolar disorder. IEEE Trans. Affect. Comput. 13(4), 2119–2131 (2022)
4. Chang, C.C., Lin, C.J.: LIBSVM: a library for support vector machines. ACM Trans. Intell. Syst. Technol. (TIST) 2(3), 1–27 (2011)
5. Cheng, Y., et al.: HSAN-capsule: a novel text classification model. Neurocomputing 489, 521–533 (2022)
6. Jan, Z., et al.: The role of machine learning in diagnosing bipolar disorder: scoping review. J. Med. Internet Res. 23(11), e29749 (2021)
7. Kadkhoda, E., Khorasani, M., Pourgholamali, F., Kahani, M., Ardani, A.R.: Bipolar disorder detection over social media. Inf. Med. Unlocked 32, 101042 (2022)
8. Khodeir, N.A.: Bi-GRU urgent classification for MOOC discussion forums based on BERT. IEEE Access 9, 58243–58255 (2021)
9. Laksshman, S., Bhat, R.R., Viswanath, V., Li, X.: DeepBipolar: identifying genomic mutations for bipolar disorder via deep learning. Hum. Mutat. 38(9), 1217–1224 (2017)
10. Lin, Y., et al.: BertGCN: transductive text classification by combining GCN and BERT. arXiv preprint arXiv:2105.05727 (2021)
11. Liu, Y., Li, P., Hu, X.: Combining context-relevant features with multi-stage attention network for short text classification. Comput. Speech Lang. 71, 101268 (2022)
12. Murarka, A., Radhakrishnan, B., Ravichandran, S.: Classification of mental illnesses on social media using RoBERTa. In: Proceedings of the 12th International Workshop on Health Text Mining and Information Analysis, pp. 59–68 (2021)
13. Rowland, T.A., Marwaha, S.: Epidemiology and risk factors for bipolar disorder. Therap. Adv. Psychopharmacol. 8(9), 251–269 (2018)
14. Shah, K., Patel, H., Sanghvi, D., Shah, M.: A comparative analysis of logistic regression, random forest and KNN models for the text classification. Augment. Hum. Res. 5, 1–16 (2020)
15. She, X., Chen, J., Chen, G.: Joint learning with BERT-GCN and multi-attention for event text classification and event assignment. IEEE Access 10, 27031–27040 (2022)
16. Wang, F., Liu, G., Hu, Y., Wu, X.: Affective tendency of movie reviews based on BERT and TCN. In: 2021 2nd International Seminar on Artificial Intelligence, Networking and Information Technology (AINIT), pp. 244–247. IEEE (2021)
17. Wang, K., Han, S.C., Poon, J.: InducT-GCN: inductive graph convolutional networks for text classification. In: 2022 26th International Conference on Pattern Recognition (ICPR), pp. 1243–1249. IEEE (2022)
18. Yao, L., Mao, C., Luo, Y.: Graph convolutional networks for text classification. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 33, pp. 7370–7377 (2019)
19. Zain, S.M., Mumtaz, W.: Tri-model ensemble with grid search optimization for bipolar disorder diagnosis. In: 2022 International Conference on Frontiers of Information Technology (FIT), pp. 24–29. IEEE (2022)
20. Zhang, Z., Lin, W., Liu, M., Mahmoud, M.: Multimodal deep learning framework for mental disorder recognition. In: 2020 15th IEEE International Conference on Automatic Face and Gesture Recognition (FG 2020), pp. 344–350. IEEE (2020)

Deep Depression Detection Based on Feature Fusion and Result Fusion

Hua Gao, Yi Zhou, Li Chen, and Kaikai Chi(B)

College of Computer Science and Technology, Zhejiang University of Technology, Hangzhou 310023, China
[email protected]

Abstract. Depression, as a severe mental disorder, has significant impacts on individuals, families, and society, so accurate depression detection is of great significance. To this end, we propose a deep depression detection method based on feature fusion and result fusion. We introduce a dataset consisting of answers to four questions. For the answers to the first three questions provided by the patients, we propose a text feature extraction method based on a pre-trained BERT model and an improved TextCNN model, and introduce an attention mechanism for feature fusion. The vector obtained by fusing the features of the first three questions is fed into a classifier to obtain a classification result. This result, together with the classification result obtained from the fourth question, is combined by result fusion to obtain the final outcome. To evaluate our method, we conducted experiments on the dataset collected from a specialized psychiatric hospital and compared our depression detection algorithm with mainstream ones. Our method achieves an accuracy of 94.2% on our dataset, 7.7% higher than the best accuracy among the compared mainstream models.

Keywords: Depression recognition · Feature fusion · Result fusion · BERT · TextCNN

1 Introduction

Depression is a prevalent and debilitating mental disorder characterized by persistent feelings of sadness, loss of interest, and a decreased ability to experience pleasure in daily life. Accurately identifying depression [25] and providing timely intervention is of utmost importance, benefiting not only individuals but also their families and society as a whole.

Text-based depression detection methods are a promising approach for detecting depression, supported by substantial evidence [4], and have demonstrated high performance. Early text-based methods for depression detection predominantly employed a bottom-up approach, incorporating deep learning and machine learning techniques. Traditional machine learning algorithms, such as Support Vector Machines (SVM) and Random Forests (RF), were commonly applied to depression classification tasks. Deep learning methods have made significant advancements in depression detection, facilitating automatic feature extraction from raw data and end-to-end model training. Most research on depression detection focuses on analyzing text generated by users on social media platforms. However, it has been noted [7,20] that a simplistic analysis of users' posts may not accurately identify their depressive tendencies. In order to improve depression detection, this study focuses on detecting depression based on text features, using a deep depression detection model based on feature fusion and result fusion.

The main contributions of this paper are as follows:
(1) We collected a dataset from a specialized psychiatric hospital to enhance the professionalism and persuasiveness of our results. We propose a depression detection method that trains the model on specific questions: the acquired text data was categorized into recent situations, mood, sleep, and tendencies and behaviors related to suicidal inclination. This method improves the accuracy of depression detection.
(2) We improved the TextCNN model by incorporating both k-max pooling and max pooling, which enables the extraction of richer and more comprehensive local feature representations. This approach captures a wider range of contextual information and extracts multi-scale features.
(3) Exploiting the specificity of the suicide-related question, we constructed a dual-branch model. This model incorporates an attention mechanism to fuse the features of the first three questions and then merges the results from the two branches to obtain the final outcome.

2 Related Work

Traditional methods for detecting depression primarily rely on two aspects: the subjective description provided by patients and the assessment through questionnaires. However, depressive individuals often fail to seek timely medical attention when their condition goes unnoticed by themselves and their family members, leading to worsening symptoms and increased difficulty in achieving recovery. It is crucial to identify depression accurately in order to facilitate rehabilitation. To address this issue, numerous researchers have embarked on relevant studies in this field. Early depression detection methods used machine learning technologies, where artificial intelligence (AI) and natural language processing (NLP) techniques aid in identifying depression from emotions or sentiments [22]. Researchers typically employ classical machine learning (ML) algorithms such as RF, SVM, Decision Trees (DT), etc. Priya et al. [16] proposed a five-level prediction of depression, stress, and anxiety using five ML algorithms. Mori et al. [14] and others utilized machine learning algorithms on four types of Twitter information, including network, statistical word, temporal, and bag-of-words. This study


considered 24 personality traits and 239 features. However, the efficacy of traditional ML techniques is limited due to the significant increase in data volume and the number of correlations [18]. Currently, mainstream research on depression primarily relies on deep learning models. Kim et al. [9] collected posts from a mental health community and accurately identified whether users' posts belonged to specific mental disorders using CNN and XGBoost models. Lin et al. [12] proposed a deep visual-text multimodal learning approach to reveal the psychological state of users on social networks: deep features were extracted from the images and text posted by users using a Convolutional Neural Network (CNN) classifier and Bidirectional Encoder Representations from Transformers (BERT), respectively, and these visual and textual features were then combined to reflect users' emotional expressions. Li et al. [11] proposed a Multimodal Hierarchical Attention (MHA) model for depression detection on social media. Orabi et al. [15] employed CNN and Recurrent Neural Networks (RNN) for diagnosing depression. He et al. [8] reviewed the recognition of depression using both unimodal and multimodal audio-visual cues, mainly focusing on deep learning-based depression recognition tasks. Ansari et al. [1] improved depression detection performance by examining and comparing the integration and fusion of these two kinds of methods. Kour et al. [10] explored the possibility of predicting users' mental states by classifying depression and non-depression using Twitter data; deep learning models were used to analyze the semantic context in the textual content of users' tweets, and the proposed model was a hybrid of two deep learning architectures, a Convolutional Neural Network (CNN) and a Bidirectional Long Short-Term Memory (BiLSTM). Mao et al. [13] proposed a depression severity prediction model based on bidirectional LSTM and time-distributed CNN, utilizing prosodic and semantic features.

3 Proposed Method

This study proposes a question-level text feature extraction model, where features are extracted separately for each question's corresponding answers. A two-step procedure is performed based on the specific nature of the questions. In the first step, a pre-trained BERT model and an enhanced TextCNN model are used to extract text features with improved local information extraction; the feature representation is enriched by incorporating k-max pooling, and feature fusion is achieved through an attention mechanism in a hybrid model. In the second step, leveraging the specificity of the suicidal-tendency question, features from the answers related to suicide are extracted using the BERT model and the improved TextCNN model. The results obtained from the softmax layer of this step are then combined with the results obtained in the first step. The proposed method demonstrates an improved accuracy rate compared to the first-step model alone and outperforms existing models. Our model structure is shown in Fig. 1.


Fig. 1. The framework of our proposed method

3.1 Model Structure

Input Module. In this paper, the input is a text data sample X, which includes answers to four questions about recent events, sleep, mood, and suicidal tendencies, denoted as X1, X2, X3, and X4. The corresponding texts are separately fed into the corresponding models for training:

X = ({X1, X2, X3, X4}, y)    (1)

where y is the label of the data. Feature Extraction. The primary task of feature extraction is to select or extract an appropriate subset of features. Feature selection involves choosing the most relevant or representative features from the original feature set to reduce dimensionality and improve the generalization ability of the model. On the other hand, feature extraction creates new feature representations by transforming, converting, or computing derived features to capture useful information in the data. BERT. It is a language representation model developed by Google [5]. It is a pretrained model based on the Transformer architecture and has been trained on a large-scale unlabeled corpus to learn deep contextual language representations.
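As a concrete illustration of this text-vectorization step, the snippet below obtains contextual token embeddings from a pre-trained BERT. The Hugging Face transformers package and the bert-base-uncased checkpoint are our own illustrative assumptions, not details specified in the paper.

```python
import torch
from transformers import BertTokenizer, BertModel

tokenizer = BertTokenizer.from_pretrained("bert-base-uncased")
bert = BertModel.from_pretrained("bert-base-uncased")

answers = ["I have been sleeping badly and feel low."]   # one answer text X_i
inputs = tokenizer(answers, return_tensors="pt",
                   padding=True, truncation=True)
with torch.no_grad():
    outputs = bert(**inputs)
V = outputs.last_hidden_state          # (batch, n, 768) contextual token vectors
```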


This study utilizes the BERT model as a method for text vectorization to obtain word vector representations with rich contextual semantic information. Compared to traditional static word vectors, the BERT model provides more accurate and comprehensive semantic expressions by considering contextual information. By applying the BERT model to text vectorization, we can enhance the performance of text-processing tasks. This approach transforms input text into vector representations in a high-dimensional semantic space, capturing complex semantic relationships between words. Inputting Xi into the BERT model gives

Vi = BERT(Xi), i = 1, 2, 3, 4    (2)

Improved TextCNN. Zhang et al. [24] proposed a Chinese short text classification model based on TextCNN. The TextCNN model consists of several modules, including input, convolution, pooling, and output. We adopt an enhanced TextCNN approach by incorporating a k-max pooling layer, aiming to enrich the feature representation and capture multi-scale features. The TextCNN model used in this paper is structured as shown in Fig. 2.

Fig. 2. The structure of improved TextCNN
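The following sketch illustrates the structure shown in Fig. 2 — convolution over the BERT vectors followed by max pooling and k-max pooling, whose outputs are concatenated into Qi, as formalized in Eqs. (3)–(5) below. It is a minimal PyTorch illustration; the single kernel size, channel count, and value of k are our own assumptions.

```python
import torch
import torch.nn as nn

class DualPoolTextCNN(nn.Module):
    """Convolution followed by max pooling and k-max pooling (cf. Fig. 2)."""
    def __init__(self, embed_dim=768, n_kernels=32, kernel_size=3, k=3):
        super().__init__()
        self.conv = nn.Conv1d(embed_dim, n_kernels, kernel_size)
        self.k = k

    def forward(self, V):                       # V: (batch, n, d) from BERT
        C = torch.relu(self.conv(V.transpose(1, 2)))    # Eq. (3)-style conv
        M = C.max(dim=2).values                         # Eq. (4): max pooling
        # Eq. (5): keep the k largest activations per feature map
        # (topk returns them sorted by value, a simplification of k-max pooling)
        K = C.topk(self.k, dim=2).values.flatten(1)
        return torch.cat([M, K], dim=1)                 # Q_i = [M_i, K_i]

Q = DualPoolTextCNN()(torch.randn(4, 50, 768))   # shape (4, 32 + 32*3)
```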

The input feature vectors Vi are fed into the convolution module, where multiple convolutional filters of different sizes are applied to capture local patterns and extract features from the input vectors. Specifically, with 2 channels, a filter size of 3, and kernel sizes of 2, 3, and 4, the following equation is used:

Ci^(j) = σ(W^(j) ∗ Vi)    (3)

where Ci^(j) represents the j-th channel feature map obtained by convolving the j-th filter with the input vector Vi, W^(j) represents the weights of the j-th filter, ∗ denotes the convolution operation, and σ represents the activation function applied element-wise to the convolution result.

After the convolution operation, the TextCNN model employs a pooling module to reduce the dimensionality of the feature maps Ci^(j) and capture the most important information. In this study, two types of pooling are used: max pooling and k-max pooling. Max pooling selects the maximum value from each feature map, capturing the most salient features within each map. On the other hand, k-max pooling [6] selects the k largest values from each feature map, capturing multiple important features:

Mi = Pmax(Ci^(j))    (4)
Ki = Pkmax(Ci^(j))    (5)

where Pmax(·) represents the max pooling operation and Pkmax(·) represents the k-max pooling operation. Subsequently, the output module takes the pooled feature maps and combines Mi and Ki into a vector Qi.

Feature Fusion. Feature fusion is the combination of features from different layers or branches and is widely used in modern network architectures. It is typically achieved through simple operations such as summation or concatenation, but this may not always be the optimal choice. To better fuse semantic features, we introduce an attention mechanism, which weights the input features for fusion, and utilize linear layers for feature transformation and prediction. In the attention mechanism, we extract features from the answers provided for the three specific questions and then calculate attention weights. Attention weights measure the level of attention each answer receives in relation to the question and can be computed by comparing the similarity between the question and the answer. This is achieved through the following formula:

Ai = exp(f(Qi)) / Σ_{j=1}^{N} exp(f(Qj))    (6)

where N is the total number of features and f(·): R^108 → R is a learnable function that computes the weight for each feature. By applying the attention weights, we can perform a weighted fusion of the input features to better capture semantic information. Next, we perform feature transformation and prediction on the fused features using linear layers. A linear layer can be represented by the following equation:

U = AQi + b    (7)

Through the linear transformation, we map the fused features to the final output space for the prediction task.

Output Module. In depression detection, result fusion is a widely used approach to combine the output results of multiple models or features to improve overall accuracy and robustness. Considering question four regarding suicidal tendencies, the feature vectors Qi obtained from the aforementioned feature extraction are inputted into a fully connected layer for classification; we denote this classification output as R1. Simultaneously, we input the feature-fused result U into a classifier to obtain another classification result R2. Next, we perform a result ensemble on R1 and R2 to obtain a more accurate final classification result. Various methods can be employed for the result ensemble, such as voting, stacking, or weighted averaging; we use weighted averaging to fuse R1 and R2. Let the weight assigned to the prediction result R1 be e1 and the weight assigned to R2 be e2, with e1 + e2 = 1. The final result after weighted fusion is

R = e1 R1 + e2 R2    (8)

After conducting experiments, we set both e1 and e2 to 0.5. If R is greater than or equal to 0.5, we classify the sample as depression. By performing the result ensemble, we can leverage the strengths of both classification results, and our experiments demonstrate the resulting improvement in accuracy.
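The sketch below ties together the attention-based feature fusion (Eqs. (6)–(7)) and the weighted result fusion (Eq. (8)): the first three question features are fused with learned attention weights and classified, the fourth (suicide-related) feature is classified separately, and the two probabilities are averaged with e1 = e2 = 0.5 and thresholded at 0.5. The tensor shapes, the scorer f(·), and the weighted-sum approximation of Eq. (7) are illustrative assumptions rather than the authors' exact implementation.

```python
import torch
import torch.nn as nn

class DualBranchFusion(nn.Module):
    def __init__(self, feat_dim=128):
        super().__init__()
        self.scorer = nn.Linear(feat_dim, 1)       # f(.) for attention weights
        self.cls_fused = nn.Linear(feat_dim, 1)    # branch on fused Q1..Q3
        self.cls_suicide = nn.Linear(feat_dim, 1)  # branch on Q4

    def forward(self, Q123, Q4):                   # Q123: (B, 3, D), Q4: (B, D)
        A = torch.softmax(self.scorer(Q123), dim=1)    # Eq. (6): attention weights
        U = (A * Q123).sum(dim=1)                      # weighted fusion, cf. Eq. (7)
        R2 = torch.sigmoid(self.cls_fused(U))          # fused-branch probability
        R1 = torch.sigmoid(self.cls_suicide(Q4))       # suicide-branch probability
        R = 0.5 * R1 + 0.5 * R2                        # Eq. (8) with e1 = e2 = 0.5
        return (R >= 0.5).long()                       # 1 = depression

pred = DualBranchFusion()(torch.randn(2, 3, 128), torch.randn(2, 128))
```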

4 Experiment

4.1 Dataset

The dataset used in this study is derived from hospital outpatient data. It contains data from a total of 156 individuals, including 77 patients with depression and 79 individuals in the control group. During data processing, we excluded records that were unrelated to the condition of interest or that contained only medication prescriptions. Subsequently, we organized the remaining valid text data and divided each patient's response into four samples, based on the information contained in the text and the issues of concern to the hospital. These samples cover the patient's recent condition, recent sleep status, recent mood, and whether the patient has suicidal tendencies.

4.2 Experiment Settings

This study focuses on the classification of depression based on text. We utilize the dataset introduced in Sect. 4.1, where the samples undergo necessary preprocessing. The dataset is then split into training, validation, and testing sets with a ratio of 3:1:1. Subsequently, model training and parameter tuning are performed using the training set. Various techniques such as regularization, optimization algorithms, and hyperparameter tuning are applied to improve the model’s generalization and performance. The model is iteratively updated based on a chosen loss function, such as cross-entropy, to minimize the discrepancy between predicted and actual labels. After model training and tuning, the trained models are evaluated on the testing set to obtain the final performance metrics, including accuracy, precision, recall, and F1 score. Overall, this study follows a rigorous experimental setup involving dataset preparation, training-validation-testing split, model training, parameter tuning,


and performance evaluation using standard metrics. These steps ensure a systematic and objective assessment of the proposed method's performance in text-based depression classification.

4.3 Comparative Experiments

In order to evaluate the performance of the proposed model on the task of text-based depression classification, this study selected seven representative methods for experimental validation. In the experiments, we compare the proposed method with the existing methods in terms of accuracy, precision, recall, and F1 score. The approach presented in this paper divides the text into four segments and trains them separately, while the compared methods train on the four segments together to obtain their results. Through this comparison, we aim to demonstrate the effectiveness of our method.

Table 1. The Table of Comparative Experimental Results.

Model                   Accuracy  Recall  Precision  F1 Score
LR [19]                 0.769     0.679   0.814      0.760
SVM [3]                 0.828     0.750   0.875      0.808
BERT [5]                0.808     0.714   0.909      0.800
TextCNN [23]            0.846     0.928   0.813      0.867
TCN [2]                 0.808     0.750   0.875      0.808
BERT+BiLSTM [21]        0.865     0.892   0.862      0.877
BiLSTM+Attention [17]   0.827     0.786   0.880      0.830
Ours                    0.942     0.964   0.931      0.947

According to the data in Table 1, apart from the method proposed in this paper, the BERT+BiLSTM [21] method achieved the highest accuracy among the comparison models, reaching 86.5%. The BERT [5] pre-trained model achieved a precision of 90.9%, and the TextCNN [23] model achieved a recall rate of 92.8%. In comparison, the method proposed in this paper shows superior performance in terms of accuracy, recall, precision, and F1 score, achieving an accuracy of 94.2%, a recall rate of 96.4%, a precision of 93.1%, and an F1 score of 94.7%. Therefore, the method proposed in this paper outperforms the existing models.

4.4 Ablation Study

In the subsequent experiments, we have conducted two ablation studies. Experiment 1 focused on the fusion module, while Experiment 2 targeted the k-max pooling module.


Experiment 1. We compared four different model configurations: single-branch without fusion (Baseline), four-branch feature fusion, four-branch result fusion, and our final model (three-branch feature fusion combined with the specific suicide-related question). Table 2 presents the results of our comparative analysis.

Table 2. Ablation experiments of our method.

Model                           Accuracy  Recall  Precision  F1 Score
Baseline                        0.865     0.857   0.889      0.873
Result Fusion                   0.904     0.821   1.00       0.901
Feature Fusion                  0.923     0.929   0.929      0.929
Feature Fusion + Result Fusion  0.942     0.964   0.931      0.947

As seen in Table 2, our model achieves a significant 3.8% improvement in accuracy compared to the model that solely performs result fusion, and outperforms the model that relies exclusively on feature fusion by 1.9%. Remarkably, compared to the approach of inputting the text into the model all together, our model exhibits a substantial accuracy improvement of 7.7%.

Experiment 2. We conducted an ablation study on the k-max pooling layer in our final model by removing it, comparing the results between the baseline TextCNN module and our final model. The comparative results are shown in Table 3.

Table 3. Ablation study with/without k-max pooling in our method.

Model                         Accuracy  Recall  Precision  F1 Score
Ours (without k-max pooling)  0.885     0.929   0.867      0.897
Ours (with k-max pooling)     0.942     0.964   0.931      0.940

According to Table 3, the addition of the k-max pooling layer results in a significant improvement in the performance metrics: specifically, an increase in accuracy of 5.7%, a rise in recall of 3.5%, a boost in precision of 6.4%, and a substantial enhancement in the F1 score of 4.3%.

5 Conclusion

Traditional depression detection methods mostly involve directly inputting text into models for recognition. In this paper, a question-level approach is proposed, where the answers to the four questions are trained and inputted separately, and we utilize the proposed deep depression detection model based on feature fusion and result fusion, significantly increasing the accuracy of recognition. We use a large-scale pre-trained model, BERT, to encode text and obtain word vectors with rich semantic information. These vectors are then fed into an improved TextCNN model for text feature extraction. By adding a dual-channel k-max pooling layer, combined with max pooling, more comprehensive local features can be obtained. This enables the model to capture a wider range of contextual information and extract multi-scale features, enhancing the robustness of the model. An attention mechanism is applied to fuse the features obtained from this layer and to perform dual-branch classification with the specific question. The final results are then fused, achieving an accuracy of 94.2%.

This work has great potential for further research. For example, biological markers such as genes, electroencephalograms (EEG), and neuroimaging can be combined with textual, audio, and other information during training to improve the accuracy of the model. Since depression is a complex mental health issue involving multiple interacting levels and factors, future research needs to strengthen interdisciplinary cooperation. Our future work aims to detect other mental disorders related to depression and to capture prevalent mental health issues in daily life.

References
1. Amanat, A., et al.: Deep learning for depression detection from textual data. Electronics 11(5), 676 (2022)
2. Bai, S., Kolter, J.Z., Koltun, V.: An empirical evaluation of generic convolutional and recurrent networks for sequence modeling. arXiv preprint arXiv:1803.01271 (2018)
3. Chang, C.C., Lin, C.J.: LIBSVM: a library for support vector machines. ACM Trans. Intell. Syst. Technol. (TIST) 2(3), 1–27 (2011)
4. Deng, T., Shu, X., Shu, J.: A depression tendency detection model fusing Weibo content and user behavior. In: 2022 5th International Conference on Artificial Intelligence and Big Data (ICAIBD), pp. 304–309. IEEE (2022)
5. Devlin, J., Chang, M.W., Lee, K., Toutanova, K.: BERT: pre-training of deep bidirectional transformers for language understanding. In: 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT 2019), pp. 4171–4186 (2019)
6. Grefenstette, E., Blunsom, P., et al.: A convolutional neural network for modelling sentences. In: The 52nd Annual Meeting of the Association for Computational Linguistics, Baltimore, Maryland (2014)
7. Harris, J.R.: No Two Alike: Human Nature and Human Individuality. WW Norton & Company, New York (2010)
8. He, L., et al.: Deep learning for depression recognition with audiovisual cues: a review. Inf. Fusion 80, 56–86 (2022)
9. Kim, J., Lee, J., Park, E., Han, J.: A deep learning model for detecting mental illness from user content on social media. Sci. Rep. 10(1), 1–6 (2020)
10. Kour, H., Gupta, M.K.: An hybrid deep learning approach for depression prediction from user tweets using feature-rich CNN and bi-directional LSTM. Multimedia Tools Appl. 81(17), 23649–23685 (2022)
11. Li, Z., An, Z., Cheng, W., Zhou, J., Zheng, F., Hu, B.: MHA: a multimodal hierarchical attention model for depression detection in social media. Health Inf. Sci. Syst. 11(1), 6 (2023)
12. Lin, C., et al.: SenseMood: depression detection on social media. In: Proceedings of the 2020 International Conference on Multimedia Retrieval, pp. 407–411 (2020)
13. Mao, K., et al.: Prediction of depression severity based on the prosodic and semantic features with bidirectional LSTM and time distributed CNN. IEEE Trans. Affect. Comput. (2022)
14. Mori, K., Haruno, M.: Differential ability of network and natural language information on social media to predict interpersonal and mental health traits. J. Pers. 89(2), 228–243 (2021)
15. Orabi, A.H., Buddhitha, P., Orabi, M.H., Inkpen, D.: Deep learning for depression detection of Twitter users. In: Proceedings of the Fifth Workshop on Computational Linguistics and Clinical Psychology: From Keyboard to Clinic, pp. 88–97 (2018)
16. Priya, A., Garg, S., Tigga, N.P.: Predicting anxiety, depression and stress in modern life using machine learning algorithms. Procedia Comput. Sci. 167, 1258–1267 (2020)
17. Rahali, A., Akhloufi, M.A., Therien-Daniel, A.M., Brassard-Gourdeau, E.: Automatic misogyny detection in social media platforms using attention-based bidirectional-LSTM. In: 2021 IEEE International Conference on Systems, Man, and Cybernetics (SMC), pp. 2706–2711. IEEE (2021)
18. Rao, G., Zhang, Y., Zhang, L., Cong, Q., Feng, Z.: MGL-CNN: a hierarchical posts representations model for identifying depressed individuals in online forums. IEEE Access 8, 32395–32403 (2020)
19. Shah, K., Patel, H., Sanghvi, D., Shah, M.: A comparative analysis of logistic regression, random forest and KNN models for the text classification. Augment. Hum. Res. 5, 1–16 (2020)
20. Sisask, M., Värnik, A., Kolves, K., Konstabel, K., Wasserman, D.: Subjective psychological well-being (WHO-5) in assessment of the severity of suicide attempt. Nord. J. Psychiatry 62(6), 431–435 (2008)
21. Zeberga, K., Attique, M., Shah, B., Ali, F., Jembre, Y.Z., Chung, T.S.: A novel text mining approach for mental health prediction using Bi-LSTM and BERT model. Computational Intelligence and Neuroscience 2022 (2022)
22. Zehra, W., Javed, A.R., Jalil, Z., Khan, H.U., Gadekallu, T.R.: Cross corpus multilingual speech emotion recognition using ensemble learning. Complex Intell. Syst. 7(4), 1845–1854 (2021). https://doi.org/10.1007/s40747-020-00250-4
23. Zhang, Q., Zheng, R., Zhao, Z., Chai, B., Li, J.: A TextCNN based approach for multi-label text classification of power fault data. In: 2020 IEEE 5th International Conference on Cloud Computing and Big Data Analytics (ICCCBDA), pp. 179–183. IEEE (2020)
24. Zhang, T., You, F.: Research on short text classification based on TextCNN. J. Phys. Conf. Ser. 1757, 012092. IOP Publishing (2021)
25. Zou, B., et al.: Semi-structural interview-based Chinese multimodal depression corpus towards automatic preliminary screening of depressive disorders. IEEE Trans. Affect. Comput. 1–16 (2022)

Adaptive Cluster Assignment for Unsupervised Semantic Segmentation

Shengqi Li, Qing Liu, Chaojun Zhang, and Yixiong Liang(B)

School of Computer Science and Engineering, Central South University, Changsha, China
[email protected]

Abstract. Unsupervised semantic segmentation (USS) aims to identify semantically consistent regions and assign correct categories without annotations. Since self-supervised pre-trained vision transformers (ViT) can provide pixel-level features containing rich class-aware information and object distinctions, they have recently been widely used as the backbone for unsupervised semantic segmentation. Although these methods achieve exceptional performance, they often rely on parametric classifiers and therefore require prior knowledge of the number of categories. In this work, we investigate clustering adaptively over the current mini-batch of images without such a prior and propose the Adaptive Cluster Assignment Module (ACAM) to replace parametric classifiers. Furthermore, we optimize ACAM to generate weights via contrastive learning, which are used to re-weight features and thereby produce semantically consistent clusters. Additionally, we leverage the image-text pre-trained model CLIP to assign a specific label to each mask obtained from clustering and pixel assignment. Our method achieves new state-of-the-art results on the COCO-Stuff and Cityscapes datasets.

Keywords: Unsupervised semantic segmentation · Contrastive learning · Image-text retrieval

1 Introduction

In computer vision, semantic segmentation is a fundamental task that involves assigning every pixel in an image to a particular known category. The applications of semantic segmentation are numerous, ranging from medical image analysis to autonomous driving [11,37,40,43,47]. However, the pixel-level annotation required for semantic segmentation is labor-intensive and time-consuming. As such, researchers have focused on developing methods that can guide model training with few or no labels. A large number of weakly supervised semantic segmentation models have been explored that use image- [1,38], scribble- [26], box- [14,34], or point-level [4] supervision to reduce the need for manual annotation. In contrast, the task of unsupervised semantic segmentation (USS) is even more challenging, as it involves capturing pixel-level semantic information from unlabeled datasets by mining the intrinsic characteristics of the data.

Early USS approaches usually utilized convolutional neural networks (CNN) as the backbone and leveraged clustering as a proxy task to facilitate representation learning [12,20]. However, these methods required training both the backbone network and the clustering process from scratch, which often leads to optimization difficulties. Recently, the seminal ViT [15] brought the attention-based transformer [36] to computer vision and has taken this community by storm [46]. As self-supervised ViTs (e.g. DINO [8,28]) can generate patch-level feature relationships with rich semantic information, recent USS approaches [17,27,31,33,42] often directly leverage them as a frozen backbone to extract generic features, which effectively circumvents the optimization difficulties arising from training from scratch. Although significant advancements have been made, existing USS methods usually rely on parameterized classifiers to acquire class centers that represent the whole dataset. Consequently, they necessitate knowing the number of categories in the dataset beforehand. Furthermore, as discussed in [39,50], using a fixed parameterized classifier fails to account for intra-class variance, leading to suboptimal performance.

Fig. 1. Qualitative results of ACAM and DINO on the COCO-Stuff-27 dataset.

In this paper, we propose a simple but effective adaptive cluster assignment module (ACAM) to dynamically weigh the pixel-level features generated by pretrained ViT to produce adaptive cluster centers, while assigning pixels to the nearest cluster centers for unsupervised semantic segmentation. Adaptive pixel assignment is reflected in the fact that the representation and number of cluster centers change with the image features in the current mini-batch, so the cluster representation varies with different scene images, effectively alleviating the influence of intra-class variance. We use the text features of CLIP [29] as a classifier


to replace a parameterized classifier, so we do not need to know the number of categories in the dataset in advance. We assign a specific category to each cluster by calculating the similarity between cluster features and text features. In our method, we incorporate a contrastive loss and a diversity loss to optimize the network. Figure 1 shows qualitative results of our method. To sum up, the main contributions of our method are as follows:

– We propose ACAM to generate adaptive clusters for unsupervised semantic segmentation.
– We leverage both the supervised contrastive loss and the diversity loss to learn the proposed ACAM in an end-to-end manner.
– We perform extensive experiments on unsupervised semantic segmentation datasets, including COCO-Stuff and Cityscapes, and achieve new state-of-the-art (SOTA) performance on both.

2 Related Work

2.1 Unsupervised Semantic Segmentation

We have witnessed unprecedented progress in semantic segmentation during the past decade, whereas massive methods have tackled segmentation in either full-supervised [11,37,39,40,43] or semi-/weakly-supervised setting [1,14,34,38]. Recently there are also a few emerging works focused on achieving semantic segmentation without labels [8,12,20,25,35,42]. These techniques often draw inspiration from self-supervised visual representation learning. Early techniques usually focused on pixel-level self-supervised training from scratch [12,20,21,35]. However, their training process highly relies on data augmentation and prior knowledge. Recently, using the self-supervised ViT (i.e. DINO [8]) as a backbone is a successful attempt for USS [17,31,42,45]. For instance, STEGO [17] and HP [31] add a task-related segmentation head after the backbone and further generate new features to transfer knowledge. TransFGU [42] activates semantically related regions based on the relationship between features from self-supervised ViT and then trains a segmentation model with generated pseudo-labels. While the aforementioned studies obtain discriminative features by fine-tuning or selftraining on the dataset, these methods need to train parametric classifiers using pseudo-labels, requiring prior knowledge about the number of categories in the dataset. In contrast to the utilization of parametric classifiers, AcSeg [25] takes another direction by directly optimizing a set of learnable prototypes which indicate semantic segments. The optimization process for prototypes interacts with image features and adaptively generates the underlying concepts at the dataset level. However, features within the same class of different scenes often exhibit significant variations [39] and therefore using prototypes for the whole dataset may not be flexible enough to match the distribution of pixels belonging to that class in the feature space. Instead, our method generates adaptive cluster centers in the mini-batch and each clustering feature only accounts for the scene-level


semantic information in the batch, enabling a more accurate representation of the data.

2.2 Self-supervised Visual Feature Learning

Learning meaningful visual representations without human annotation is a challenging and long-standing goal in computer vision. Self-supervised learning (SSL) [2] aims to achieve this goal by using proxy tasks that provide supervised information for unlabeled data, enabling the learning of general features in datasets without manual labeling. In recent years, SSL has evolved from using simple proxy tasks such as learning spatial context [9,23,24] to contrastive learning [7,10,16,19] and masked image modeling (MIM) [3,18,41,49]. SSL has been successfully used to pre-train both CNN and ViT [15]. Specifically, DINO [8] explores a self-distillation strategy on ViT, showing that ViT contains explicit information that can be used for the semantic segmentation of an image. DINOv2 [28] further explores the potential of SSL by enhancing data quality, refining models, and employing advanced training strategies. Many unsupervised semantic segmentation methods have benefited from the properties of DINO [8,28], exploiting the frozen features extracted from self-supervised ViT [8,28]. In our method, we also adopt the frozen ViT trained via DINO [8] as the backbone for a fair comparison.

2.3 Zero-Shot Semantic Segmentation with Pretrained Image-Language Model

The pretrained image-language model has exhibited a strong ability of zero-shot transfer on various downstream vision tasks [29,30]. For semantic segmentation, MaskCLIP [48] modifies the visual encoder of CLIP to align the text-based classifier at the pixel level. RECO [32] leverages text to retrieve images via CLIP [29], and uses the correspondence between image features to generate prototypes to co-segment entities. In our method, we adopt MaskCLIP [48] to assign class labels to the generated clusters.

3 Method

3.1 Overview

We approach USS via a two-stage frame as illustrated in Fig. 2. To begin with, we leverage a frozen self-supervised ViT, i.e. DINO [8], to produce the pixellevel features of images in the batch. These valuable features are then fed into the proposed ACAM for clustering and segmentation. In the inference stage, we simply input the image and corresponding segmented masks into a pretrained image-language model [29] to assign a specific category to each segmented mask. Specifically, in ACAM, we first learn K pointwise convolutions to directly predict the confidence of each pixel to K clusters and allocate pixels to their respective

clusters. By doing so, we can obtain the clusters in the mini-batch and adaptively generate cluster centers by weighting the pixel-level features in the clusters with their respective confidences. Ultimately, we obtain a class-agnostic segmentation by assigning each pixel to the nearest cluster center in the feature space. We will illustrate each module in detail next.

Fig. 2. The framework of our method. In the training stage, ACAM generates class-agnostic segmentations for a mini-batch of training images. In the inference stage, features extracted from the CLIP image encoder within each cluster are combined by region average pooling, and then the pre-defined semantic category is assigned to every cluster by calculating the similarity with text-encoded features, achieving semantic segmentation without ground truth.

3.2 Adaptive Clustering Assignment Module

Given a set of training images X = {xb}_{b=1}^{B} with a batch size of B, we first obtain a feature map F ∈ R^{N×D} by feeding them into the frozen feature extractor F, where N = B × H × W is the number of pixel features in the mini-batch, D is the dimension of the pixel features, and H and W are the height and width of the feature maps, respectively. We then generate the confidence scores of pixel features belonging to each cluster, M ∈ R^{N×K}, via pointwise convolutions. By multiplying the pixel features F with the confidence map M through matrix multiplication, we adaptively obtain the class centers for the mini-batch images, i.e.

C = M^T ⊗ F,    (1)

where ⊗ is matrix multiplication and C ∈ R^{K×D} is the matrix of generated cluster centers.


Notice that in Eq. (1) all pixels are weighted by the confidence map. However, we observe that in the early stages of network training such a scheme results in a lack of adequate discrimination among the cluster centers. Simultaneously, incorporating all K clusters in the mini-batch may also lead to over-clustering and the generation of additional segmentation fragments. To address these issues, we propose to adaptively generate cluster centers based on pixel assignment. Concretely, we first assign pixels to clusters by performing an arg max operation on the confidence map to generate the pseudo mask, and then obtain the one-hot encoded matrix, i.e.

M̃ = one_hot( arg max_k (M_ik) ),  i ∈ [1, ..., N], k ∈ [1, ..., K],    (2)

where M_ik is the confidence of pixel i belonging to cluster k, M̃ ∈ R^{N×K} is the encoded matrix which indicates the pixel assignment, and one_hot is the one-hot encoding operation. Then we obtain new cluster centers by assigning weights to the pixels belonging to each cluster that appear in the pseudo mask. In this way, Eq. (1) can be reformulated as

C = (M ⊙ M̃)^T ⊗ F,    (3)

where ⊙ is the Hadamard product. It should be noticed that in Eq. (2) we presumably assign pixels to the clusters based on the confidence M_ik. Some clusters may not be assigned any pixels, and therefore the corresponding columns of M̃ are zeros, resulting in some dummy zero-clusters in C. We simply remove them, while the remaining non-zero cluster centers are denoted by C̃ ∈ R^{K'×D} (K' ≤ K). Then the similarity matrix of each pixel to all K' valid clusters can be obtained by

S = F ⊗ C̃^T ∈ R^{N×K'}.    (4)

Objective Function

In order to train our ACAM, we first consider the supervised contrastive loss (SCL) [22] by contrasting semantically similar (positive) and dissimilar (negative) pairs. Intuitively, the cluster centers obtained by Eq. (3) can be directly used as the anchors and the pixels assigned to each anchor can be treated as the positive samples. However, as the final segmentation is based on the arg max operation on the soft assignment of pixels in Eq. (4), we use them instead of Eq. (3) to determine the anchors and corresponding positive samples. Specifically, we re-assign each pixel according to the soft assignments in Eq. (4) and based on them we select the cluster centers of non-empty clusters as anchor features and all pixels belonging to the current cluster as positive samples of anchors

Adaptive Cluster Assignment for Unsupervised Semantic Segmentation

81

while other the remaining as negatives, resulting the following contrastive loss 

Lcon = −

K 1  1  exp(cos(ck , fp )/τ ) , log N K |Pk | n=1 exp(cos(ck , fn )/τ ) k

(5)

p∈Pk

where Pk is the index set of pixels assigned to valid cluster ck (i.e. anchor) and |Pk | is the cardinality of Pk , cos(·, ·) is the cosine similarity function and τ denotes the scalar temperature parameter. In addition to the above contrastive loss, we also introduce the diversity loss [42] to enforce the diversity of cluster centers. Specifically, the diversity loss aims to maximize the distance between pairwise cluster centers by minimizing their cosine similarity Ldiv =

1 K  (K 

 cos(ck , cj ) √ . − 1) D k=j

(6)

The overall loss function for end-to-end training of our ACAM is as follows L = Ldiv + βLcon . 3.4

(7)

3.4 Inference

Inference is a two-stage process. Firstly, ACAM dynamically generates cluster centers for the mini-batch images, and the pixels are assigned to the nearest cluster centers in the feature space, thereby generating a class-agnostic object segmentation. In the second stage, following [48], we exploit the image encoder of CLIP [29] to obtain pixel-level representations and then leverage region average pooling to produce the representation of each cluster. We also generate text features via the CLIP text encoder [29] for the predefined categories and compute the similarity between cluster representations and text features for classification. By doing this, the final semantic segmentation is obtained.
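The sketch below illustrates this second stage: dense CLIP image features are averaged within each class-agnostic cluster (region average pooling) and matched to CLIP text embeddings of the candidate class names by cosine similarity. The dense image features and text embeddings are assumed to be pre-computed (e.g. in the MaskCLIP style), so the CLIP encoders themselves are abstracted away here.

```python
import torch
import torch.nn.functional as F_nn

def label_clusters(pixel_feats, segmentation, text_feats):
    """pixel_feats: (N, D) dense CLIP image features,
    segmentation:  (N,) cluster index per pixel from ACAM,
    text_feats:    (C, D) CLIP text embeddings of the candidate class names."""
    labels = {}
    for k in segmentation.unique().tolist():
        region = pixel_feats[segmentation == k].mean(dim=0)   # region average pooling
        sims = F_nn.cosine_similarity(region.unsqueeze(0), text_feats, dim=1)
        labels[k] = int(sims.argmax())          # most similar class name
    return labels                               # cluster id -> class id

labels = label_clusters(torch.randn(4096, 512),
                        torch.randint(0, 5, (4096,)),
                        torch.randn(27, 512))
```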

4 Experiments

4.1 Datasets and Experimental Settings

Datasets. COCO-Stuff [5] is a large-scale scene-centric dataset that contains 80 thing classes and 91 stuff classes. The training set and validation set contain 117,266 and 5,000 images, respectively. Following previous work [17,31,42], we evaluate COCO-Stuff on the "curated" subset (2,175 images in total). Concretely, to demonstrate the flexibility of our method, we evaluate it under two category-granularity settings: 1) COCO-Stuff-27, which merges all classes into 27 superclasses (12 thing categories and 15 stuff categories), and 2) COCO-Stuff-171, which keeps the original 171 thing and stuff categories. Cityscapes [13] is a dataset of urban street scenes, containing 2,975 training images and 500 validation images. Following [12,17], we categorize this dataset into 27 classes, which makes it a challenging benchmark. We adopt mean intersection over union (mIoU) for quantitative evaluation, following previous works [17,25]; other detailed experimental settings are provided in the supplementary material.

Table 1. Comparative results on the COCO-Stuff-27 validation set. † denotes our reproduction results.

method                        backbone   CLIP   mIoU
IIC [20] (CVPR2019)           R18+FPN           6.7
PICIE [12] (CVPR2021)         R18+FPN           13.8
MaskCLIP [48] (ECCV2022)      ViT-B/16   √      19.0
RECO [32] (Neurips2022)       DeiT-SIN   √      26.3
DINO [8] (ICCV2021)           ViT-S/8           11.3
+TransFGU [42] (ECCV2022)     ViT-S/8           17.5
+STEGO [17] (ICLR2022)        ViT-S/8           24.5
+HP [31] (CVPR2023)           ViT-S/8           24.6
+AcSeg [25] (CVPR2023)        ViT-S/8    √      28.1
+ACAM (ours)                  ViT-S/8    √      29.6
DINO† [8] (ICCV2021)          ViT-B/8           13.6
+STEGO [17] (ICLR2022)        ViT-B/8           28.2
+ACAM (ours)                  ViT-B/8    √      31.0

4.2 Quantitative Comparative Results

We compare our proposed method with previous SOTA techniques, and the results are listed in Table 1 and Table 2. Our method consistently outperforms previous SOTA methods on both the COCO-Stuff-27 and Cityscapes datasets. We defer the comparative results on the COCO-Stuff-171 dataset to the supplementary material; they show a similar conclusion and reflect that our method is not limited by the granularity of categories.

4.3 Ablation Studies

This section presents ablation studies to analyze our method.

Ablation Study of ACAM. To assess the efficacy of ACAM, we conduct a comparative analysis with a newly introduced baseline, DINO+CLIP, which we implemented ourselves. Specifically, we employ the K-means algorithm to cluster the features generated by the DINO model, with the number of clusters aligned with the number of categories of the respective dataset, and subsequently employ the text encoder of the CLIP model to assign a label to each cluster. Our proposed ACAM surpasses this traditional clustering procedure. The experiments are conducted on the COCO-Stuff-27 and Cityscapes datasets, and the results are presented in Table 3.

Ablation Study of Backbones. We explore the performance of various backbones trained with different self-supervised objectives for unsupervised semantic segmentation. The three backbones generate generic features by discriminative learning: DINO [8] operates at the image level, SelfPatch [44] at the patch level, and DINOv2 [28] is trained at both the image and patch levels. As the baseline, we apply the K-means algorithm to the features generated by each backbone to produce a semantic segmentation, whereas our method adaptively generates the cluster centers from the images within the mini-batch at inference time. As shown in Table 4, our method consistently improves performance across the three representative backbones.

Table 2. Comparative results on the Cityscapes validation set.

method                        backbone   CLIP   mIoU
MDC [6] (ECCV2018)            R18+FPN           7.1
IIC [20] (CVPR2019)           R18+FPN           6.4
PICIE [12] (CVPR2021)         R18+FPN           12.3
MaskCLIP [48] (ECCV2022)      ViT-B/16   √      14.1
RECO [32] (Neurips2022)       DeiT-SIN   √      19.3
DINO [8] (ICCV2021)           ViT-S/8           10.9
+TransFGU [42] (ECCV2022)     ViT-S/8           16.8
+HP [31] (CVPR2023)           ViT-S/8           18.4
+ACAM (ours)                  ViT-S/8    √      22.3
DINO [8] (ICCV2021)           ViT-B/8           11.8
+STEGO [17] (ICLR2022)        ViT-B/8           21.0
+HP [31] (CVPR2023)           ViT-B/8           18.4
+ACAM (ours)                  ViT-B/8    √      22.5

Table 3. Ablation results for ACAM.

dataset          method      backbone   mIoU
COCO-Stuff-27    DINO+CLIP   ViT-B/8    26.5
COCO-Stuff-27    +ACAM       ViT-B/8    30.53
Cityscapes       DINO+CLIP   ViT-B/8    15.96
Cityscapes       +ACAM       ViT-B/8    22.5

Table 4. Ablation results with various backbones on the COCO-Stuff-27 validation set.

method            backbone   mIoU
DINO [8]          ViT-S/16   8.0
ours              ViT-S/16   29.0
SelfPatch [44]    ViT-S/16   12.3
ours              ViT-S/16   28.0
DINOv2 [28]       ViT-S/14   23.6
ours              ViT-S/14   30.0

5 Conclusion

In this paper, we proposed a novel adaptive cluster assignment module for unsupervised semantic segmentation. It dynamically adjusts the number of clusters within a mini-batch of images, effectively addressing the challenges of over-clustering and under-clustering, and generates semantically consistent clusters that are assigned to specific categories by an image-language pre-trained model. The experimental results demonstrate that our method achieves new SOTA performance on both the COCO-Stuff and Cityscapes datasets.

Acknowledgments. The authors wish to acknowledge the High Performance Computing Center of Central South University for computational resources.

References 1. Ahn, J., Kwak, S.: Learning pixel-level semantic affinity with image-level supervision for weakly supervised semantic segmentation. In: CVPR, pp. 4981–4990 (2018) 2. Balestriero, R., et al.: A cookbook of self-supervised learning. arXiv preprint arXiv:2304.12210 (2023) 3. Bao, H., Dong, L., Piao, S., Wei, F.: BEiT: BERT pre-training of image transformers. In: ICLR (2022) 4. Bearman, A., Russakovsky, O., Ferrari, V., Fei-Fei, L.: What’s the point: semantic segmentation with point supervision. In: Leibe, B., Matas, J., Sebe, N., Welling, M. (eds.) ECCV 2016. LNCS, vol. 9911, pp. 549–565. Springer, Cham (2016). https:// doi.org/10.1007/978-3-319-46478-7 34 5. Caesar, H., Uijlings, J., Ferrari, V.: COCO-stuff: thing and stuff classes in context. In: CVPR, pp. 1209–1218 (2018) 6. Caron, M., Bojanowski, P., Joulin, A., Douze, M.: Deep clustering for unsupervised learning of visual features. In: Ferrari, V., Hebert, M., Sminchisescu, C., Weiss, Y. (eds.) Computer Vision – ECCV 2018. LNCS, vol. 11218, pp. 139–156. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-01264-9 9


7. Caron, M., Misra, I., Mairal, J., Goyal, P., Bojanowski, P., Joulin, A.: Unsupervised learning of visual features by contrasting cluster assignments. In: NeurIPS, vol. 33, pp. 9912–9924 (2020) 8. Caron, M., et al.: Emerging properties in self-supervised vision transformers. In: ICCV, pp. 9650–9660 (2021) 9. Chen, P., Liu, S., Jia, J.: Jigsaw clustering for unsupervised visual representation learning. In: CVPR, pp. 11526–11535 (2021) 10. Chen, T., Kornblith, S., Norouzi, M., Hinton, G.: A simple framework for contrastive learning of visual representations. In: ICML, pp. 1597–1607. PMLR (2020) 11. Cheng, B., Misra, I., Schwing, A.G., Kirillov, A., Girdhar, R.: Masked-attention mask transformer for universal image segmentation. In: CVPR, pp. 1290–1299 (2022) 12. Cho, J.H., Mall, U., Bala, K., Hariharan, B.: PiCIE: unsupervised semantic segmentation using invariance and equivariance in clustering. In: CVPR, pp. 16794– 16804 (2021) 13. Cordts, M., et al.: The cityscapes dataset for semantic urban scene understanding. In: CVPR, pp. 3213–3223 (2016) 14. Dai, J., He, K., Sun, J.: Boxsup: exploiting bounding boxes to supervise convolutional networks for semantic segmentation. In: ICCV, pp. 1635–1643 (2015) 15. Dosovitskiy, A., et al.: An image is worth 16x16 words: transformers for image recognition at scale. In: ICLR (2021) 16. Grill, J.B., et al.: Bootstrap your own latent-a new approach to self-supervised learning. In: NeurIPS, vol. 33, pp. 21271–21284 (2020) 17. Hamilton, M., Zhang, Z., Hariharan, B., Snavely, N., Freeman, W.T.: Unsupervised semantic segmentation by distilling feature correspondences. In: ICLR (2022) 18. He, K., Chen, X., Xie, S., Li, Y., Doll´ ar, P., Girshick, R.: Masked autoencoders are scalable vision learners. In: CVPR, pp. 16000–16009 (2022) 19. He, K., Fan, H., Wu, Y., Xie, S., Girshick, R.: Momentum contrast for unsupervised visual representation learning. In: CVPR, pp. 9729–9738 (2020) 20. Ji, X., Henriques, J.F., Vedaldi, A.: Invariant information clustering for unsupervised image classification and segmentation. In: ICCV, pp. 9865–9874 (2019) 21. Ke, T.W., Hwang, J.J., Guo, Y., Wang, X., Yu, S.X.: Unsupervised hierarchical semantic segmentation with multiview cosegmentation and clustering transformers. In: CVPR, pp. 2571–2581 (2022) 22. Khosla, P., et al.: Supervised contrastive learning. In: NeurIPS, vol. 33, pp. 18661– 18673 (2020) 23. Kim, D., Cho, D., Yoo, D., Kweon, I.S.: Learning image representations by completing damaged jigsaw puzzles. In: WACV, pp. 793–802. IEEE (2018) 24. Komodakis, N., Gidaris, S.: Unsupervised representation learning by predicting image rotations. In: ICLR (2018) 25. Li, K., et al.: ACSeg: adaptive conceptualization for unsupervised semantic segmentation. In: CVPR (2023) 26. Lin, D., Dai, J., Jia, J., He, K., Sun, J.: Scribblesup: scribble-supervised convolutional networks for semantic segmentation. In: CVPR, pp. 3159–3167 (2016) 27. Melas-Kyriazi, L., Rupprecht, C., Laina, I., Vedaldi, A.: Deep spectral methods: a surprisingly strong baseline for unsupervised semantic segmentation and localization. In: CVPR, pp. 8364–8375 (2022) 28. Oquab, M., et al.: DINOv2: learning robust visual features without supervision. arXiv preprint arXiv:2304.07193 (2023) 29. Radford, A., et al.: Learning transferable visual models from natural language supervision. In: ICML, pp. 8748–8763. PMLR (2021)


30. Rao, Y., et al.: DenseCLIP: language-guided dense prediction with context-aware prompting. In: CVPR, pp. 18082–18091 (2022) 31. Seong, H.S., Moon, W., Lee, S., Heo, J.P.: Leveraging hidden positives for unsupervised semantic segmentation. In: CVPR (2023) 32. Shin, G., Xie, W., Albanie, S.: ReCo: retrieve and co-segment for zero-shot transfer. In: NeurIPS (2022) 33. Shin, G., Xie, W., Albanie, S.: Namedmask: distilling segmenters from complementary foundation models. In: CVPRW (2023) 34. Song, C., Huang, Y., Ouyang, W., Wang, L.: Box-driven class-wise region masking and filling rate guided loss for weakly supervised semantic segmentation. In: CVPR, pp. 3136–3145 (2019) 35. Van Gansbeke, W., Vandenhende, S., Georgoulis, S., Van Gool, L.: Unsupervised semantic segmentation by contrasting object mask proposals. In: ICCV, pp. 10052– 10062 (2021) 36. Vaswani, A., et al.: Attention is all you need. In: NeurIPS, vol. 30 (2017) 37. Wang, J., et al.: Deep high-resolution representation learning for visual recognition. TPAMI 43(10), 3349–3364 (2020) 38. Wang, Y., Zhang, J., Kan, M., Shan, S., Chen, X.: Self-supervised equivariant attention mechanism for weakly supervised semantic segmentation. In: CVPR, pp. 12275–12284 (2020) 39. Wu, D., Guo, Z., Li, A., Yu, C., Gao, C., Sang, N.: Semantic segmentation via pixel-to-center similarity calculation. arXiv preprint arXiv:2301.04870 (2023) 40. Xie, E., Wang, W., Yu, Z., Anandkumar, A., Alvarez, J.M., Luo, P.: SegFormer: simple and efficient design for semantic segmentation with transformers. In: NeurIPS, vol. 34, pp. 12077–12090 (2021) 41. Xie, Z., et al.: SimMIM: a simple framework for masked image modeling. In: CVPR, pp. 9653–9663 (2022) 42. Yin, Z., et al.: TransFGU: a top-down approach to fine-grained unsupervised semantic segmentation. In: Avidan, S., Brostow, G., Ciss´e, M., Farinella, G.M., Hassner, T. (eds.) ECCV 2022. LNCS, vol. 13689, pp. 73–89. Springer, Cham (2022). https://doi.org/10.1007/978-3-031-19818-2 5 43. Yuan, Y., Chen, X., Wang, J.: Object-contextual representations for semantic segmentation. In: Vedaldi, A., Bischof, H., Brox, T., Frahm, J.-M. (eds.) ECCV 2020. LNCS, vol. 12351, pp. 173–190. Springer, Cham (2020). https://doi.org/10.1007/ 978-3-030-58539-6 11 44. Yun, S., Lee, H., Kim, J., Shin, J.: Patch-level representation learning for selfsupervised vision transformers. In: CVPR, pp. 8354–8363 (2022) 45. Zadaianchuk, A., Kleindessner, M., Zhu, Y., Locatello, F., Brox, T.: Unsupervised semantic segmentation with self-supervised object-centric representations. In: ICLR (2023) 46. Zhai, X., Kolesnikov, A., Houlsby, N., Beyer, L.: Scaling vision transformers. In: CVPR, pp. 12104–12113 (2022) 47. Zheng, S., et al.: Rethinking semantic segmentation from a sequence-to-sequence perspective with transformers. In: CVPR, pp. 6881–6890 (2021) 48. Zhou, C., Loy, C.C., Dai, B.: Extract free dense labels from CLIP. In: Avidan, S., Brostow, G., Ciss´e, M., Farinella, G.M., Hassner, T. (eds.) ECCV 2022. LNCS, vol. 13688, pp. 696–712. Springer, Cham (2022). https://doi.org/10.1007/978-3031-19815-1 40 49. Zhou, J., et al.: Image BERT pre-training with online tokenizer. In: ICLR (2022) 50. Zhou, T., Wang, W., Konukoglu, E., Van Gool, L.: Rethinking semantic segmentation: a prototype view. In: CVPR, pp. 2582–2593 (2022)

Confidence-Guided Open-World Semi-supervised Learning

Jibang Li, Meng Yang(B), and Mao Feng

School of Computer Science and Engineering, Sun Yat-sen University, Guangzhou, China
[email protected], [email protected]

This work is partially supported by the National Natural Science Foundation of China (Grant no. 62176271) and the Science and Technology Program of Guangzhou (Grant no. 202201011681).

Abstract. Open-world semi-supervised learning differs from traditional semi-supervised learning by allowing labeled and unlabeled data to come from different data distributions, i.e., the unlabeled data may contain novel classes. The goal is to identify samples of known classes while simultaneously detecting and clustering samples of novel classes found in the unlabeled data. To address this challenging task, we propose a novel open-world semi-supervised learning model, which integrates two new confidence-guided techniques to better leverage unlabeled data, especially those from novel classes. Specifically, we first approximate the confidence of the model on unlabeled samples by inspecting the prediction consistency through the training process, which guides the temperature scaling for the cross-entropy loss. Then, we introduce an adaptive class-specific re-weighting contrastive learning method, which further synchronizes the learning speed of known and unknown classes while reducing the influence of noisy labels. Extensive empirical results show our approach achieves significant performance improvements on both seen and unseen classes compared with previous studies, especially in challenging scenarios with limited labeled data.

Keywords: open-world semi-supervised learning · confidence estimation · contrastive learning

1 Introduction

Deep learning has achieved great success in various tasks by leveraging a sufficient amount of labeled training data. Recently, many robust semi-supervised learning (SSL) methods [2,5,20] have been used to supplement annotated training data with abundant unlabeled data, achieving outstanding performance while reducing the dependence on labeled data. Despite recent success, SSL approaches work under the closed-set assumption, in which the categories in labeled and unlabeled subsets share the same underlying class label space. This assumption is difficult to satisfy in many real-world scenarios, which limits their application in practical situations. To address this issue, [12,15] proposed the task of Novel Category Discovery (NCD), which aims to learn novel classes from unlabeled data based on a set of known classes. Recent work ORCA [3] extends the NCD problem to a more challenging setting, termed Open-World Semi-Supervised Learning (OWSSL), where the unlabeled data contains both known and novel classes. The goal is not only to classify known classes, but also to identify and cluster samples of novel classes.

Generally, existing OWSSL methods can be roughly divided into two categories: two-stage methods and single-stage methods. Most current methods are two-stage, such as ORCA [3], OPENLDN [18] and NACH [10]. Typically, two-stage methods rely on some form of feature pretraining and then fine-tune the network through semi-supervised learning paradigms; they provide a complicated solution and require more time for training. Recently, TRSSL [19] proposed the first single-stage method, which introduces an effective approach for generating pseudo-labels and does not require pretraining. However, we find that single-stage methods depend on the accuracy of pseudo-labels, and for novel classes the generated pseudo-labels are less reliable. While some previous works have explored the use of pseudo-labels for OWSSL, few have paid attention to the quality and consistency of the generated pseudo-labels. Moreover, this kind of method relies heavily on the label space and overlooks the importance of the underlying feature space.

To solve the OWSSL problem while avoiding cumbersome solutions, we focus on the single-stage approach and introduce two confidence-guided techniques to facilitate the model's learning: a confidence-guided temperature scaling technique and an adaptive class-specific re-weighting contrastive module. The confidence-guided temperature scaling technique uses the consistency of predictions through the training process as a reference for estimating sample confidence: we treat samples that frequently receive the same prediction in sequential training iterations as reliable. Encouraging sharper predicted distributions for highly confident data helps the model mitigate the impact of unreliable pseudo-labels. We then introduce an adaptive threshold adjusting scheme to guide contrastive learning. Specifically, we adaptively adjust the threshold based on the learning state of each class. For samples with a prediction probability higher than the threshold, we consider samples from the same class as positive pairs; when the probability is lower than the threshold, we only use their augmented versions as positive samples to prevent wrongly predicted labels from misleading the model's learning. Through this threshold strategy, we select more reliable labels to guide the model's feature learning, and the re-weighting strategy further emphasizes the learning of samples with high confidence while reducing the influence of uncertain noisy samples.

The main contributions of our work can be summarized as follows:
• We propose a new consistency-guided temperature scaling technique to alleviate the impact of uncertainty in pseudo-labeling.


• We introduce an adaptive class-specific re-weighting contrastive loss that reduces confirmation bias by combining two contrastive learning schemes under adaptive thresholding.
• We conduct extensive experiments on three popular datasets. The results show that our method offers competitive performance compared with other methods.

2 Related Work

ORCA [3] is the first work that addresses this challenging problem (OWSSL). It is a two-stage approach: it first obtains more discriminative features by pretraining and then, in the second stage, gradually reduces the model's overfitting and enhances its discriminability using an uncertainty-adaptive margin mechanism. GCD [22] considers a problem setting similar to OWSSL; different from our setting, it primarily focuses on leveraging large-scale pretraining and on the performance on the Semantic Shift Benchmark. [10] discovers novel class samples through the pairwise similarity between examples and then synchronizes the learning speed between known and novel classes through an adaptive threshold with distribution alignment. [21] introduces prototype-based contrastive learning algorithms for discovering novel classes and learning discriminative representations. OPENLDN [18] uses pairwise similarity to discover novel classes and introduces an iterative pseudo-labeling technique to achieve additional performance gains. Overall, two-stage methods provide a cumbersome solution.

To the best of our knowledge, TRSSL [19] is the only prior work that solves this challenging problem with a single-stage method, and it achieves very promising performance in comparison to two-stage methods. To alleviate the computational burden and improve efficiency, TRSSL generates pseudo-labels for unlabeled data using a prior target distribution and trains the network using a common semi-supervised paradigm. However, it does not take into account the information provided by the feature space (Fig. 1).

3 Methodology

Due to the mixture of known and novel classes in the unlabeled data, the reliability of pseudo-labels is even more important in OWSSL than in SSL. We propose a dynamic temperature technique that uses prediction consistency as guidance for sample confidence, so as to make sharper predictions on more reliable samples. Unlike previous methods that mainly focus on label or feature information, we select reliable samples for contrastive learning based on their confidence scores to fully exploit the feature-space information, and use re-weighting to emphasize learning on confident samples and prevent overfitting to confident but incorrect samples.


Fig. 1. In open-world semi-supervised learning, the unlabeled data may contain classes that are not present in the labeled data. During testing, we need to identify the known classes and partition the examples of novel classes into appropriate clusters.

3.1 Problem Formulation

In OWSSL, the training data consists of a labeled set D_L and an unlabeled set D_U. The labeled set D_L = {(x_i^l, y_i)}_{i=1}^{n} includes n labeled samples, where x_i^l is a labeled sample and y_i is its corresponding label belonging to one of the C_L known classes. The unlabeled set D_U = {x_i^u}_{i=1}^{m} consists of m (in practice, m >> n) unlabeled samples, where x_i^u is an unlabeled sample belonging to one of the C_U classes. The main difference between closed-world and open-world semi-supervised learning is that the closed-world setting assumes C_L = C_U, whereas in OWSSL, C_L ⊆ C_U. We refer to C_U \ C_L as the novel classes C_N. (Following [3,10,18,19], we assume that the number of novel classes is known.) During testing, the goal is to assign samples from novel classes to the corresponding clusters in C_N and to classify samples from known classes into one of the classes in C_L. In the rest of this section, we first introduce the overall framework of our method and then present the consistency-guided temperature scaling method and the adaptive class-specific re-weighting contrastive loss.

3.2 Overall Framework

The overall framework of our method is shown in Fig. 2. It consists of a classification branch and a contrastive learning branch. We build the classification branch on the existing work TRSSL [19] (upper half of Fig. 2) and additionally incorporate a contrastive learning branch to enhance the quality of the representations (lower half of Fig. 2).

Fig. 2. Architecture of the proposed method. Aug1 and Aug2 are the two augmented views of the classification branch, while Aug3 and Aug4 are the two augmented views of the contrastive learning branch. "Consistency Record" stores the historical consistency information of sample predictions. "Mean Predicted Probability" stores the average predicted probabilities of all samples in each class. "Predicted Probability" represents the predicted probability for each sample. The red lines represent the two confidence-based guiding strategies proposed in our paper. (Color figure online)

For the classification branch, following the self-labeling loss method [1], we first generate pseudo-labels for the unlabeled data and then use them to perform self-training on the model. The cross-entropy loss is defined as

L_{ce}(\hat{y}^{(i)}, y^{(i)}) = -\frac{1}{N} \sum_{i=0}^{N} \sum_{j=0}^{C} y_j^{(i)} \log \hat{y}_j^{(i)},    (1)

where \hat{y}_j^{(i)} is the j-th element of the model's predictive distribution \hat{y}^{(i)} and y_j^{(i)} is the j-th element of the pseudo-label y^{(i)}. For labeled data, we can directly use the ground-truth labels. To generate pseudo-labels for unlabeled data, following [19], we model the problem as an optimal transport problem and use the fast Sinkhorn-Knopp algorithm [7] to find an approximate assignment. These pseudo-labels are soft probabilities instead of discrete one-hot codes, which are reported to give better performance [8]. Particularly, following [19], we convert soft labels to hard labels for unlabeled samples of novel classes with a prediction confidence exceeding 0.5, which encourages confident learning for novel classes. In the classification branch, we generate pseudo-labels for two different views of each training image. To encourage the network to output consistent predictions for two augmented views of the same image, we adopt the swapped prediction task [4]:

L_{CE} = L_{ce}(\hat{y}_2, sk(\hat{y}_1)) + L_{ce}(\hat{y}_1, sk(\hat{y}_2)),    (2)


where sk(\hat{y}_1) and sk(\hat{y}_2) are the soft pseudo-labels obtained through the Sinkhorn-Knopp algorithm. To transfer knowledge between labeled and unlabeled data, we couple their learning by sharing the encoder. Additionally, we adopt the mixup technique [16] between labeled and unlabeled data to enhance the robustness of the network. Furthermore, we add an adaptive class-specific re-weighting contrastive loss as an auxiliary loss in the contrastive learning branch.
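For illustration, a condensed Sinkhorn-Knopp pseudo-labeling and swapped-prediction loss in PyTorch, in the spirit of [4,7,19]; the prior-target-distribution refinement of TRSSL and the hard-label conversion for novel classes are omitted, and the values of ε and the iteration count follow Sect. 4.2:

```python
import torch

@torch.no_grad()
def sinkhorn(logits, eps=0.05, n_iters=3):
    """Sinkhorn-Knopp normalization used as sk(.): turns a batch of logits into soft
    pseudo-labels with (approximately) uniform class marginals. logits: (B, C)."""
    logits = logits - logits.max()            # shift for numerical stability (result is invariant)
    Q = torch.exp(logits / eps).t()           # (C, B)
    Q /= Q.sum()
    C, B = Q.shape
    for _ in range(n_iters):
        Q /= Q.sum(dim=1, keepdim=True); Q /= C   # rows: uniform class marginal
        Q /= Q.sum(dim=0, keepdim=True); Q /= B   # columns: one soft label per sample
    return (Q * B).t()                        # (B, C) soft assignments, each row sums to 1

def swapped_ce(logits_v1, logits_v2):
    """Swapped-prediction cross entropy of Eq. (2): each view is trained against the
    Sinkhorn pseudo-label of the other view."""
    p1, p2 = logits_v1.log_softmax(dim=1), logits_v2.log_softmax(dim=1)
    q1, q2 = sinkhorn(logits_v1), sinkhorn(logits_v2)
    return -(q1 * p2).sum(dim=1).mean() - (q2 * p1).sum(dim=1).mean()
```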

3.3 Consistency-Guided Temperature Scaling

Temperature scaling is a strategy for modifying the softness of output probability distributions. According to [14], samples with high confidence should be assigned lower temperatures: if the network's prediction for a specific sample is certain, making that prediction more confident will benefit the network's learning, and vice versa. To estimate the sample uncertainty, we introduce a convenient proxy, i.e., the consistency of the sample's prediction across iterations, and estimate a consistency score c_i for each sample:

c_i = \frac{1}{T} \sum_{t=1}^{T} \mathbb{1}(\hat{y}_i^t = \hat{y}_i^{t+1}),    (3)

where T is the total number of iterations, t indexes the t-th iteration, and \hat{y}_i^t is the label of sample i at iteration t. For sample i, we take the average of the two predictions of the classification branch and select the class with the highest value as its label. In this way, we only need a small amount of additional space to store the previous prediction of each sample and the number of times it has changed. Since the network has low confidence on all unlabeled data in the early stages, to accelerate convergence we weight the uncertainty by the training iteration. The final definition of the uncertainty at iteration t is

u_t(x^{(i)}) = G(c_i)\, F(t/T),    (4)

where G(·) and F(·) are mapping functions. G should be monotonically decreasing, while F should be monotonically increasing. The former is because, as the consistency of a sample's predictions increases, i.e., its confidence becomes higher, the temperature should be lower; the latter is because the low confidence of early model predictions (especially for novel classes) would otherwise lead to a high initial temperature, making it difficult for the model to learn from unlabeled samples, so the temperature in the early stage should be lowered according to the iteration. We have found that simply setting G(x) = 1 − x and F(x) = x^2 yields promising results. In the final step, we utilize the temperature to adjust the predicted distribution \hat{y} of the model. For convenience, we denote the new softmax probability by \tilde{y}, which is calculated as

\tilde{y}_j^{(i)} = \frac{\exp(\hat{y}_j^{(i)} / u(x^{(i)}))}{\sum_c \exp(\hat{y}_c^{(i)} / u(x^{(i)}))}.    (5)
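A minimal sketch of the bookkeeping behind Eqs. (3)-(5), assuming per-sample dataset indices are available in each batch; the class and variable names are illustrative, and only the clipping range (0.1, 1.0) is taken from Sect. 4.2:

```python
import torch

class ConsistencyTemperature:
    """Track how often each unlabeled sample keeps the same predicted label across
    iterations (Eq. 3) and turn that running consistency into a per-sample temperature
    u_t = G(c_i) * F(t/T) with G(x) = 1 - x and F(x) = x^2 (Eq. 4)."""

    def __init__(self, num_samples, total_iters):
        self.prev_label = torch.full((num_samples,), -1, dtype=torch.long)
        self.same_count = torch.zeros(num_samples)
        self.total_iters = total_iters

    def update(self, idx, probs, t):
        """idx: (B,) dataset indices, probs: (B, C) mean prediction of the two views, t: iteration."""
        label = probs.argmax(dim=1)
        self.same_count[idx] += (label == self.prev_label[idx]).float()
        self.prev_label[idx] = label
        c = self.same_count[idx] / max(t, 1)                 # running estimate of c_i
        u = (1.0 - c) * (t / self.total_iters) ** 2          # Eq. (4)
        return u.clamp(0.1, 1.0)                             # clipping range from Sect. 4.2

def scaled_softmax(logits, u):
    """Eq. (5): sharpen or soften the predicted distribution with the per-sample temperature u."""
    return torch.softmax(logits / u.unsqueeze(1), dim=1)
```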

3.4 Adaptive Class-Specific Re-weighting Contrastive Module

In the open-world setting, unsupervised contrastive learning cannot effectively assist sample discrimination in the label space. When pseudo-labels are used to guide supervised contrastive learning on unlabeled data, the noise of the pseudo-labels, especially on novel classes, can bias the model towards known classes and decrease the accuracy of discriminating novel classes. We address this issue through simple adaptive thresholding and class-specific re-weighting, seamlessly integrating clustering and contrastive learning in the feature space and applying re-weighting to training.

Recent semi-supervised learning methods have shown that thresholding pseudo-labels can further improve semi-supervised performance. In the recent method [5], the mean confidence of all samples in a class serves as an indicator of the network's learning progress for that class. By selecting samples above the per-class mean prediction, we obtain high-quality samples for contrastive learning while mitigating the learning discrepancies across classes. Specifically, we maintain a prediction-mean table for each class. To guarantee the stability of the estimation, we update the confidence using an exponential moving average (EMA) with momentum λ over previous batches:

\widetilde{Tp}_t(c) = \begin{cases} 0, & t = 0 \\ \lambda\, \widetilde{Tp}_{t-1}(c) + (1-\lambda)\frac{1}{N(c)} \sum_{D_U} \max(\hat{y})\, \mathbb{1}(\max(\hat{y}) = c), & \text{otherwise} \end{cases}    (6)

where N(c) = \sum_{D_U} \mathbb{1}(\max(\hat{y}) = c) is the number of samples of class c in the unlabeled dataset, and \widetilde{Tp}_t = [\widetilde{Tp}_t(1), \widetilde{Tp}_t(2), \cdots, \widetilde{Tp}_t(C)] contains the average probability per class predicted by the model at iteration t, which also serves as the dynamic threshold for each class. \hat{y} is the mean of the predicted probabilities of the two augmented versions.

We define the set of samples with confidence scores higher than the threshold as Z^h (with \max(\hat{y}) > \widetilde{Tp}_t), and the set of samples with confidence scores below the threshold as Z^l (with \max(\hat{y}) < \widetilde{Tp}_t). For an anchor sample z_i in Z^h belonging to class c, we select its positive sample set as

Z_i^{+} = \{ z_j \in Z^h \mid z_j \text{ belongs to } c \text{ and } q_j > \widetilde{Tp}_t(c) \},    (7)

where q_j = \max(\hat{y}^{(j)}) is the confidence score of sample j. For samples with high confidence scores, we define the loss as

L_i^{h} = -\frac{1}{|Z_i^{+}|} \sum_{z_j \in Z_i^{+}} q_i \cdot q_j \log \frac{\exp(z_i \cdot z_j / \tau)}{\sum_{z_k \in Z^h} \mathbb{1}(k \neq i) \exp(z_i \cdot z_k / \tau)},    (8)

where z_i and z_j are the embeddings of samples i and j, and q_i and q_j are their confidence scores. We use these probabilities to re-weight each sample so that learning focuses on high-confidence clean data. For samples with low confidence scores, i.e., \max(\hat{y}) < \widetilde{Tp}_t(c), we follow the contrastive learning mechanism of [6] and treat the augmented version z_i^{*} as the positive sample and the other instances as negatives:

L_i^{l} = -q_i \cdot q_j \log \frac{\exp(z_i \cdot z_i^{*} / \tau)}{\sum_{j=1}^{2N} \mathbb{1}(j \neq i) \exp(z_i \cdot z_j / \tau)}.    (9)

Finally, we have the following training loss:

L = L_{CE} + \lambda_{cl} \Big( \sum_{i \in H} L_i^{h} + \sum_{i \in L} L_i^{l} \Big),    (10)

where λcl is a hyperparameter, which controls the weight of contrastive learning loss. H and L represent the sets of samples with confidence scores above and below the threshold, respectively.
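An illustrative PyTorch approximation of the re-weighting module (Eqs. (7)-(9)), assuming two augmented views per image and L2-normalized embeddings; for brevity the denominator of the high-confidence term runs over all other samples rather than only Z^h, so this is a simplified sketch rather than the exact loss:

```python
import torch

def reweighted_contrastive(z1, z2, probs, thresholds, tau=0.1):
    """z1, z2: (B, D) normalized embeddings of the two views; probs: (B, C) mean prediction
    of the two views; thresholds: (C,) per-class EMA threshold from Eq. (6)."""
    B = z1.size(0)
    z = torch.cat([z1, z2], dim=0)                           # (2B, D) multiviewed batch
    conf, label = probs.max(dim=1)
    conf, label = conf.repeat(2), label.repeat(2)            # both views share confidence / pseudo-label
    high = conf > thresholds[label]                          # membership in Z^h

    sim = z @ z.t() / tau
    sim.fill_diagonal_(float('-inf'))                        # 1(j != i): never contrast with oneself
    log_prob = sim - torch.logsumexp(sim, dim=1, keepdim=True)

    total, count = 0.0, 0
    for i in range(2 * B):
        if high[i]:
            pos = (label == label[i]) & high                 # Eq. (7): confident same-class samples
            pos[i] = False
        else:
            pos = torch.zeros(2 * B, dtype=torch.bool)
            pos[(i + B) % (2 * B)] = True                    # Eq. (9): only the other view is positive
        if pos.any():
            w = conf[i] * conf[pos]                          # q_i * q_j re-weighting
            total = total - (w * log_prob[i, pos]).sum() / pos.sum()
            count += 1
    return total / max(count, 1)
```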

4 Experiments

4.1 Datasets and Evaluation Metrics

To demonstrate the effectiveness of our method, we conduct experiments on three common benchmark datasets: CIFAR-10, CIFAR-100 and Tiny ImageNet. Following [19], we divide each of these datasets according to the percentage of known and novel classes: we first split the classes into 50% known classes and 50% novel classes, then select 50% or 10% of the seen-class samples as labeled data, and treat the rest as unlabeled data. We follow the evaluation strategy proposed by ORCA [3] and report the following metrics: (1) accuracy on known classes, (2) accuracy on novel classes, and (3) accuracy on all classes. When reporting accuracy on novel classes and on all classes, we use the Hungarian algorithm [17] to align the predictions and the ground-truth labels. We compare our method with various existing methods, including modified semi-supervised methods [20] and novel class discovery methods [9,11–13]. Additionally, we also compare against two-stage methods [3,10,18,21] and the single-stage method [19].
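The Hungarian alignment used for the novel- and all-class accuracies can be implemented with scipy; a small sketch under the assumption that cluster indices and labels share the same range:

```python
import numpy as np
from scipy.optimize import linear_sum_assignment

def clustering_accuracy(y_true, y_pred, num_classes):
    """Align predicted cluster ids with ground-truth labels via the Hungarian algorithm [17]
    before computing accuracy."""
    cost = np.zeros((num_classes, num_classes), dtype=np.int64)
    for t, p in zip(y_true, y_pred):
        cost[p, t] += 1                                  # co-occurrence counts
    row, col = linear_sum_assignment(cost.max() - cost)  # maximize total matches
    mapping = dict(zip(row, col))
    remapped = np.array([mapping[p] for p in y_pred])
    return (remapped == np.asarray(y_true)).mean()

# Toy check: a permuted-but-perfect clustering gets accuracy 1.0.
print(clustering_accuracy([0, 0, 1, 1, 2], [2, 2, 0, 0, 1], 3))
```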

4.2 Implementation Details

For a fair comparison with existing methods, we use ResNet-18 as the backbone. The model is trained using standard SGD with a momentum of 0.9 and a weight decay of 0.0001. We train the model for 200 epochs with a batch size of 256. For the contrastive loss, we set the weight λ_cl to 0.2 for CIFAR-10 and 0.5 for the other datasets. For unlabeled data, we clip the consistency-guided temperature between 0.1 and 1.0. For labeled data, the temperature is set to 0.1 by default. Since the CIFAR-10 dataset is relatively simple, the network learns to classify it faster, and setting the temperature too low for the labeled data may cause the model to overfit on the known classes and cause insufficient learning for novel classes. Therefore, we set the temperature to 0.15 with 10% labeled data and 0.5 with 50%. We primarily use UNO [9] augmentations for the classification branch and SimCLR [6] augmentations for the contrastive branch. Finally, we inherit the hyperparameters of the Sinkhorn-Knopp algorithm [7] from the former work [19], i.e., ε = 0.05 and 3 iterations.

Table 1. Average accuracy on the CIFAR-10, CIFAR-100, and Tiny ImageNet datasets with 10% labeled data. The results are averaged over three independent runs.

Method            CIFAR-10              CIFAR-100             Tiny ImageNet
                  seen  novel  all      seen  novel  all      seen  novel  all
FixMatch [20]     64.3  49.4   47.3     30.9  18.5   15.3     –     –     –
DS3L [11]         70.5  46.6   43.5     33.7  15.8   15.1     –     –     –
DTC [13]          42.7  31.8   32.4     22.1  10.5   13.7     13.5  12.7  11.5
RankStats [12]    71.4  63.9   66.7     20.4  16.7   17.8     9.6   8.9   6.4
UNO [9]           86.5  71.2   78.9     53.7  33.6   42.7     28.4  14.4  20.4
ORCA [3]          82.8  85.5   84.1     52.5  31.8   38.6     –     –     –
OpenCon [21]      –     –      –        62.5  44.4   48.2     –     –     –
NACH [10]         91.8  89.4   90.6     65.8  37.5   49.2     –     –     –
OpenLDN [18]      92.4  93.2   92.8     55.0  40.0   47.7     –     –     –
TRSSL [19]        94.9  89.6   92.2     68.5  52.1   60.3     39.5  20.5  30.3
Ours              95.0  91.5   93.2     71.1  53.1   62.1     45.8  20.0  33.3

Table 2. Average accuracy on the CIFAR-10, CIFAR-100, and Tiny ImageNet datasets with 50% labeled data. The results are averaged over three independent runs.

Method            CIFAR-10              CIFAR-100             Tiny ImageNet
                  seen  novel  all      seen  novel  all      seen  novel  all
FixMatch [20]     71.5  50.4   49.5     39.6  23.5   20.3     –     –     –
DS3L [11]         77.6  45.3   40.2     55.1  23.7   24.0     –     –     –
DTC [13]          53.9  39.5   38.3     31.3  22.9   18.3     28.8  16.3  19.9
RankStats [12]    86.6  81.0   82.9     36.4  28.4   23.1     5.7   5.4   3.4
UNO [9]           91.6  69.3   80.5     68.3  36.5   51.5     46.5  15.7  30.3
ORCA [3]          88.2  90.4   89.7     66.9  43.0   48.1     –     –     –
NACH [10]         89.5  92.2   91.3     68.7  47.0   52.1     –     –     –
OpenCon [21]      89.3  91.1   90.4     69.1  47.8   52.7     –     –     –
OpenLDN [18]      95.7  95.1   95.4     73.5  46.8   60.1     58.3  25.5  41.9
TRSSL [19]        96.8  92.8   94.8     80.2  49.3   64.7     59.1  24.2  41.7
Ours              95.1  95.1   95.1     80.3  51.5   65.9     60.0  26.7  43.4

4.3 Experimental Results and Analysis

In Table 1, we report the results when only 10% of the labels are available for the seen categories. Across all three datasets, our method achieves overall classification accuracy that outperforms the state-of-the-art (SOTA) methods. On CIFAR-10, we observe that two-stage approaches demonstrate superior performance compared to the single-stage method; since the CIFAR-10 dataset is relatively simple, performance on it is close to saturation, yet we still achieve a 1.0% improvement in overall accuracy. Our proposed method achieves significant improvements on all three evaluation metrics on CIFAR-100, with performance gains of 2.6%, 1.0%, and 1.8% over TRSSL, respectively. Notably, the improvements become more significant as the difficulty of the dataset increases, with particularly impressive gains on the challenging Tiny ImageNet dataset with 200 classes, where we achieve improvements of 6.3% on seen classes and 3.0% on overall accuracy over the current SOTA method.

To further demonstrate the effectiveness of our proposal under different label sizes, we conducted additional experiments with 50% of the labels available for the seen categories. As shown in Table 2, we still achieve overall performance improvements on CIFAR-100 and Tiny ImageNet, and on CIFAR-10 we achieve performance close to the current SOTA method.

4.4 Ablation and Analysis

Temperature Module and Contrastive Module. In Table 3, we investigate the two main techniques of our method: consistency-guided temperature scaling and the self-adaptive thresholding class-aware contrastive module. The temperature module allows the model to distinguish confidence levels between different samples, which is beneficial for sample selection in contrastive learning. Meanwhile, contrastive learning utilizes the information in the sample feature space to reduce label noise. Our results show that both the temperature module and the contrastive module improve accuracy, and combining the two components yields the highest performance.

Table 3. Ablation experiments on the CIFAR-100 dataset with 10% labeled data. "uncer" denotes consistency-guided temperature scaling, and "CL" denotes the self-adaptive thresholding class-aware contrastive module.

uncer CL seen novel all



65.7 48.1 68.4 51.8 69.4 49.6



71.1 53.1 62.1

 

56.9 60.2 59.5


Table 4. Ablation of different contrastive learning methods on the CIFAR-100 dataset with 10% labeled data. "unsupCon" refers to the unsupervised contrastive loss, "supCon" to the supervised contrastive loss.

Method              seen   novel   all
unsupCon            64.4   45.4    54.1
supCon              72.1   46.4    59.2
w/o threshold       72.1   48.2    60.1
w/o re-weighting    68.1   51.2    59.8
Ours                71.1   53.1    62.1

Threshold Selection and Re-weighting. Table 4 serves as a validation of the efficacy of threshold selection and re-weighting in our model. The experiments demonstrate that unsupervised contrastive loss significantly reduces the performance, whereas supervised contrastive loss biases the model towards the known classes. Our proposed threshold selection and re-weighting techniques effectively balance the learning rates between visible and novel classes, resulting in an overall optimal performance.

5 Conclusion

In this paper, we introduced consistency-guided temperature scaling and adaptive threshold-based class-specific contrastive learning strategies for open-world semi-supervised learning. The consistency-guided temperature scaling approach fully leverages the information of high-confidence pseudo-labels while avoiding overfitting on low-confidence ones with almost no additional computational overhead. Additionally, the class-specific contrastive learning method under the adaptive threshold strategy can effectively reduce the noise of the pseudo-labels. The performance improvements achieved on multiple benchmark image datasets validate the effectiveness of our proposed methods. Our approach also achieves more significant improvements in situations with fewer labeled samples and more classes.

References 1. Asano, Y.M., Rupprecht, C., Vedaldi, A.: Self-labelling via simultaneous clustering and representation learning. In: International Conference on Learning Representations (ICLR) (2020) 2. Berthelot, D., Carlini, N., Goodfellow, I., Papernot, N., Oliver, A., Raffel, C.A.: Mixmatch: a holistic approach to semi-supervised learning. In: Advances in Neural Information Processing Systems, vol. 32 (2019) 3. Cao, K., Brbi´c, M., Leskovec, J.: Open-world semi-supervised learning. In: International Conference on Learning Representations (ICLR) (2022)


4. Caron, M., Misra, I., Mairal, J., Goyal, P., Bojanowski, P., Joulin, A.: Unsupervised learning of visual features by contrasting cluster assignments. In: Advances in Neural Information Processing Systems, vol. 33, pp. 9912–9924 (2020) 5. Chen, H., et al.: Softmatch: addressing the quantity-quality trade-off in semisupervised learning. arXiv preprint arXiv:2301.10921 (2023) 6. Chen, T., Kornblith, S., Norouzi, M., Hinton, G.: A simple framework for contrastive learning of visual representations. In: International Conference on Machine Learning, pp. 1597–1607. PMLR (2020) 7. Cuturi, M.: Sinkhorn distances: lightspeed computation of optimal transport. In: Advances in Neural Information Processing Systems, vol. 26 (2013) 8. Doersch, C., Gupta, A., Efros, A.A.: Unsupervised visual representation learning by context prediction. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 1422–1430 (2015) 9. Fini, E., Sangineto, E., Lathuili`ere, S., Zhong, Z., Nabi, M., Ricci, E.: A unified objective for novel class discovery. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 9284–9292 (2021) 10. Guo, L.Z., Zhang, Y.G., Wu, Z.F., Shao, J.J., Li, Y.F.: Robust semi-supervised learning when not all classes have labels. In: Advances in Neural Information Processing Systems, vol. 35, pp. 3305–3317 (2022) 11. Guo, L.Z., Zhang, Z.Y., Jiang, Y., Li, Y.F., Zhou, Z.H.: Safe deep semi-supervised learning for unseen-class unlabeled data. In: International Conference on Machine Learning, pp. 3897–3906. PMLR (2020) 12. Han, K., Rebuffi, S.A., Ehrhardt, S., Vedaldi, A., Zisserman, A.: Automatically discovering and learning new visual categories with ranking statistics. arXiv preprint arXiv:2002.05714 (2020) 13. Han, K., Vedaldi, A., Zisserman, A.: Learning to discover novel visual categories via deep transfer clustering. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 8401–8409 (2019) 14. Hinton, G., Vinyals, O., Dean, J.: Distilling the knowledge in a neural network. In: NIPS’15 Proceedings of the 28th International Conference on Neural Information Processing Systems, vol. 2, pp. 287–295 (2015) 15. Hsu, Y.C., Lv, Z., Kira, Z.: Learning to cluster in order to transfer across domains and tasks. arXiv preprint arXiv:1711.10125 (2017) 16. Huang, L., Zhang, C., Zhang, H.: Self-adaptive training: beyond empirical risk minimization. In: Advances in Neural Information Processing Systems, vol. 33, pp. 19365–19376 (2020) 17. Kuhn, H.W.: The Hungarian method for the assignment problem. Naval Res. Logist. Q. 2(1–2), 83–97 (1955) 18. Rizve, M.N., Kardan, N., Khan, S., Shahbaz Khan, F., Shah, M.: OpenLDN: learning to discover novel classes for open-world semi-supervised learning. In: Avidan, S., Brostow, G., Ciss´e, M., Farinella, G.M., Hassner, T. (eds.) ECCV 2022. LNCS, vol. 13691, pp. 382–401. Springer, Cham (2022). https://doi.org/10.1007/978-3031-19821-2 22 19. Rizve, M.N., Kardan, N., Shah, M.: Towards realistic semi-supervised learning. In: Avidan, S., Brostow, G., Ciss´e, M., Farinella, G.M., Hassner, T. (eds.) ECCV 2022, Part XXXI. LNCS, vol. 13691, pp. 437–455. Springer, Cham (2022). https://doi. org/10.1007/978-3-031-19821-2 25 20. Sohn, K., et al.: Fixmatch: simplifying semi-supervised learning with consistency and confidence. In: Advances in Neural Information Processing Systems, vol. 33, pp. 596–608 (2020)


21. Sun, Y., Li, Y.: OpenCon: open-world contrastive learning. Trans. Mach. Learn. Res. (2023). https://openreview.net/forum?id=2wWJxtpFer 22. Vaze, S., Han, K., Vedaldi, A., Zisserman, A.: Generalized category discovery. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 7492–7501 (2022)

SSCL: Semi-supervised Contrastive Learning for Industrial Anomaly Detection

Wei Cai1(B) and Jiechao Gao2

1 University of Science and Technology Beijing, Haidian District, Beijing, China
[email protected]
2 University of Virginia, Charlottesville, VA 22904, USA
[email protected]

Abstract. Anomaly detection is an important machine learning task that aims to identify data points that are inconsistent with normal data patterns. In real-world scenarios, it is common to have access to some labeled and unlabeled samples that are known to be either normal or anomalous. To make full use of both types of data, we propose a semi-supervised contrastive learning method that combines self-supervised contrastive learning and supervised contrastive learning, forming a new framework: SSCL. Our method can learn a data representation that distinguishes between normal and anomalous data patterns based on limited labeled data and abundant unlabeled data. We evaluate our method on multiple benchmark datasets, including MNIST and CIFAR-10 as well as the industrial anomaly detection benchmarks MVTec and STC. The experimental results show that our method achieves superior performance on all datasets compared to existing state-of-the-art methods.

Keywords: Semi-supervised classification · Anomaly detection · Contrastive learning · Data representation

1 Introduction

Anomaly detection (AD) [1] has emerged as a critical task in the realm of data analysis, aiming to identify unusual samples within datasets. Traditional unsupervised AD methods [2] assume that the majority of samples are normal and endeavour to learn a compact representation of the data. For instance, in the field of one-class classification [3], the main objective is to identify data points that belong to the majority of the dataset, while considering any data points not belonging to this subset as anomalies. Traditional unsupervised AD methods, such as the One-Class SVM [4], Kernel Density Estimation [5] and Isolation Forest [6], often rely on manual feature engineering to perform well with complex, high-dimensional datasets. Additionally, these methods face limitations when applied to large-scale datasets due to scalability issues. Consequently, there has been considerable interest in developing new deep-learning approaches for unsupervised anomaly detection [7,14].

In practical situations, it is typical to have datasets that contain a combination of unlabeled and labeled data. In such cases, the problem becomes semi-supervised anomaly detection. The task is to learn a model that effectively characterizes the "normal class" using a combination of labeled and unlabeled data. The samples consist of unlabeled instances, which are mostly normal but may include some anomalous samples, along with labeled samples that include both normal and anomalous examples (see Fig. 1 for details). The goal is to create a model that accurately and concisely describes the typical, normal data. This model should capture the key features and patterns of the normal class, enabling it to distinguish anomalies from the majority of normal samples effectively.

The majority of existing methods for semi-supervised anomaly detection, both shallow and deep, focus solely on labeled normal samples and do not take labeled anomalies into account; these methods are categorized as "Learning from Positive and Unlabeled Examples". In contrast, most research on deep semi-supervised learning has concentrated on classification tasks, where the cluster assumption states that similar data points belong to the same class. However, this assumption does not hold for the "anomaly class", because anomalies can vary significantly from one another. Thus, in semi-supervised anomaly detection, the main challenge lies in finding a concise representation of the normal class while accurately discriminating the labeled anomalies by utilizing all labeled and unlabeled data.

Contrastive learning [8] has extensive applications in domains such as computer vision [17], natural language processing [19], and recommender systems [18]. It offers several advantages for anomaly detection. First, contrastive learning eliminates the need for labeled data, reducing the burden of data collection and annotation. Second, it enables the automatic learning of discriminative feature representations that effectively differentiate between normal and abnormal patterns. Finally, contrastive learning facilitates efficient anomaly detection by clustering normal samples together, enabling effective detection based on distance measurements. Building upon these advantages, we introduce contrastive learning to the semi-supervised AD setting for improved anomaly detection.

In this paper, we propose Semi-supervised Contrastive Learning (SSCL) for industrial anomaly detection. Our main contributions are summarized as follows:
• We consider a realistic industrial anomaly detection scenario: semi-supervised anomaly detection. Due to the scarcity of labeled data in reality, we combine labeled and unlabeled data to break through the limitations of the traditional anomaly detection paradigm.
• Combining self-supervised contrastive learning with supervised contrastive learning, we propose SSCL, which encourages the model to learn more informative and discriminative representations by jointly considering labeled and unlabeled samples.


• We conduct extensive experiments on MNIST, CIFAR-10, MVtec and ShanghaiTech Campus (STC) [22] datasets. The proposed method outperforms state-of-the-art methods on multiple AD benchmarks, yielding appreciable performance improvements even when provided with only little labeled data.

Fig. 1. The framework of semi-supervised contrastive learning for anomaly detection. We start with a training dataset that contains normal, anomaly and unlabeled samples. Next, we leverage specific loss functions (see details in Sect. 3) to train a model that effectively learns the underlying features of the data. Subsequently, we employ a classifier to further train the model, enabling it to effectively differentiate between normal and anomalous data instances. The classifier leverages the learned features to make accurate distinctions.

2 Related Works

Unsupervised Anomaly Detection. Unsupervised anomaly detection methods [9], such as OC-SVM, KDE and Isolation Forest (IF), aim to detect anomalies in unlabeled data without relying on explicit class labels. While this is an advantage in scenarios where labeled data is scarce or unavailable, it also means that these methods cannot use explicit class information to guide the learning process. Without labels, the model may struggle to differentiate accurately between anomalies and normal instances. Furthermore, unsupervised anomaly detection methods assume that normal data follow a certain distribution, such as being clustered or having a specific density. If the data distribution is complex or exhibits significant variations, these methods may fail to capture the normal behaviour accurately, leading to high false-positive rates or missed anomalies. Therefore, we consider semi-supervised anomaly detection to address this problem.

Semi-supervised Anomaly Detection [10] involves detecting anomalies when only a limited amount of labeled data is available, along with a larger amount of unlabeled data. This approach combines the benefits of supervised and unsupervised methods, using labeled data to guide the anomaly detection process while leveraging unlabeled data for a broader understanding of the data distribution, which addresses the limitations of unsupervised anomaly detection methods. Deep Semi-Supervised Anomaly Detection (DeepSAD) [10] is a popular method in this category. DeepSAD combines deep learning techniques with a semi-supervised framework, using an autoencoder architecture to learn a compact representation of the data. It minimizes the reconstruction error of normal instances, allowing it to accurately reconstruct normal patterns. However, a drawback of semi-supervised anomaly detection methods is their limited generalization to unseen anomalies.

Contrastive Learning [8]. To address the limited generalization of semi-supervised anomaly detection, we introduce contrastive learning. It effectively learns meaningful and discriminative representations of both normal and anomalous instances. By maximizing the similarity of positive pairs and minimizing the similarity of negative pairs, contrastive learning captures subtle differences between them, enhancing the model's discriminative capability in anomaly detection. Data augmentation in contrastive learning enables the model to generalize better to unseen variations and anomalies, and self-supervision provides additional guidance for differentiating between normal and anomalous patterns. These advantages effectively address the problem of limited generalization.

3 Method

We now introduce our new method. SSCL combines the advantages of self-supervised contrastive learning [12] and supervised contrastive learning [8] to form a unique structure. Given a batch of input data, we apply data augmentation (Aug(·)) twice, resulting in two copies of the batch. Both copies are then passed through an encoder network (Enc(·)). During training, this embedding is further processed by a projection network (Proj(·)), which is not used during inference. The self-supervised contrastive loss and the supervised contrastive loss are calculated based on the outputs of the projection network. To use the trained model for classification, we train a linear classifier on top of the fixed embeddings with a cross-entropy loss.
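A minimal sketch of this two-head design, with a toy encoder standing in for the actual backbone; sizes and module names are placeholders, not the authors' configuration:

```python
import torch
import torch.nn as nn

class SSCLModel(nn.Module):
    """Shared encoder Enc(.) followed by a projection head Proj(.) that is used only
    for the contrastive losses; the encoder embedding feeds the downstream linear classifier."""

    def __init__(self, feat_dim=512, proj_dim=128):
        super().__init__()
        self.encoder = nn.Sequential(nn.Flatten(), nn.Linear(3 * 32 * 32, feat_dim), nn.ReLU())
        self.projector = nn.Sequential(nn.Linear(feat_dim, feat_dim), nn.ReLU(), nn.Linear(feat_dim, proj_dim))

    def forward(self, x):
        h = self.encoder(x)                                      # embedding kept for classification
        z = nn.functional.normalize(self.projector(h), dim=1)    # normalized projection for contrastive losses
        return h, z

model = SSCLModel()
x1, x2 = torch.randn(8, 3, 32, 32), torch.randn(8, 3, 32, 32)    # two augmented views of a batch
(_, z1), (_, z2) = model(x1), model(x2)
print(z1.shape, z2.shape)
```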

Self-supervised Contrastive Learning

For a randomly sampled set of N sample/label pairs, {xk , y k }k=1...N , the cor˜  }=1...2 N , where x ˜ 2k responding training batch consists of 2N pairs, {˜ x , y ˜ 2k−1 are two random augmentations or “views” of xk (k = 1 . . . N ), and and x ˜ 2k = y k . Throughout the remainder of this paper, we will refer to ˜ 2k−1 = y y a set of N samples as a “batch” and the set of 2N augmented samples as a “multiviewed batch”. In other words, for each original sample, we generate two different augmented views, and they are included in the training batch along with their corresponding original samples and labels. The term “batch” is used

104

W. Cai and J. Gao

to refer to the set of original samples, while “multiviewed batch” refers to the set containing the augmented samples. Within a multiviewed batch, we denote the index of an augmented sample as i, belonging to the set I, which encompasses values from 1 to 2N . Additionally, we define j(i) as the index of the other augmented sample originating from the same source sample. In self-supervised contrastive learning approaches [13], the loss function is formulated as follows:     exp z i · z j(i) /τ i (1) Lself = − log  Lself = a∈A(i) exp (z i · z a /τ ) i∈I

i∈I

where z  = P roj (Enc (˜ x )) ∈ RDP represents the projection of the encoded ˜  in the RDP space. The DP representation (Enc) of the augmented sample x means output vector of size or just a single linear layer of size. The • denotes the inner product operation. The scalar temperature parameter τ ∈ R+ is introduced to control the sharpness of the probability distribution. Each value of i serves as an anchor index, associated with one positive pair j(i) and 2(N − 1) negative pairs ({k ∈ A(i)\{j(i)}), where N is the total number of samples. It’s important to note that for every anchor i, there are a total of 2N − 2 negative pairs and 2N − 1 terms (including both the positive and negative pairs) in the denominator of the loss function. 3.2

Supervised Contrastive Learning

In the case of supervised learning, the self-supervised contrastive loss is not suitable for handling scenarios where multiple samples belonging to the same class are known due to the availability of labels. To generalize the contrastive loss to handle an arbitrary number of positive pairs, different approaches can be considered. The supervised contrastive loss [8,15] present straightforward methods to extend the self-supervised contrastive loss and incorporate supervision: Lsup =

 i∈I

Lisup =

 −1  exp (z i · z p /τ ) log  |P (i)| a∈A(i) exp (z i · z a /τ ) i∈I

(2)

p∈P (i)

  ˜ i is defined as the set of indices of all positive pairs in ˜p = y P (i) ≡ p ∈ A(i) : y ˜ p and y ˜ i are the same. the multiviewed batch, excluding i, and where the labels y The cardinality of this set, denoted as |P (i)|, represents the number of positive pairs. In supervised contrastive loss, the summation over the positive pairs (P (i)) is positioned outside the logarithm function, specifically in the expression Lsup .

SSCL

3.3

105

Semi-supervised Contrastive Learning

In our approach, we assume the availability of two types of samples: unlabeled samples x1 , ...xn ⊆ χ, and labeled samples (x1 , y1 )...(xn , yn ) ⊆ χ × ψ. Here, χ represents a subset of D-dimensional real numbers, while ψ is a label set consisting of{−1, 1} . In the labeled samples, y = +1 denotes known normal samples, while y = −1 indicates known anomalies. Consequently, we have three types of samples: labeled normal samples, labeled anomaly samples, and unlabeled samples. For unlabeled data, we employ contrastive learning loss (i.e., Eq. 1). While we introduce supervised contrastive learning loss (i.e., Eq. 2) for the labeled data in our SSCL objective. Depending on self-supervised contrastive learning and Supervised Contrastive Learning. So, We can define our SSCL objective as follows: L = Lself + αLsup

(3)

   −1  exp z i · z j(i) /τ ) L = −(1 + α log  |P (i)| a∈A(i) exp (z i · z a /τ )

(4)

i∈I

i∈I

where α represents the trade-off parameter.
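A compact sketch of the combined objective in Eq. 3, reusing the two loss sketches given above, could look as follows; how the unlabeled and labeled mini-batches are formed in practice is not specified here and is our assumption.

```python
def sscl_loss(z_unlabeled, z_labeled, labels, alpha=1.0, temperature=0.5):
    """Eq. (3): self-supervised term on unlabeled views, supervised term on labeled views."""
    l_self = self_supervised_contrastive_loss(z_unlabeled, temperature)
    l_sup = supervised_contrastive_loss(z_labeled, labels, temperature)
    return l_self + alpha * l_sup
```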

4 Experiments

We conduct an extensive evaluation of our proposed SSCL method on multiple benchmark datasets, including MNIST, CIFAR-10 and the industrial anomaly detection benchmarks MVtec and STC. We compare the performance of SSCL against various existing approaches, including shallow unsupervised, semi-supervised, and deep semi-supervised methods.

4.1 Competing Methods

We consider several baselines for our experiments, including the shallow unsupervised methods OC-SVM [4], IF [6] and KDE [5]. For semi-supervised AD approaches that make use of labeled anomalies, we employ the state-of-the-art shallow SSAD method with a Gaussian kernel [11]. For deep semi-supervised methods, we use DeepSAD [10], PaDiM [20] and PatchCore [21].


4.2 Evaluation Metrics

We use AUROC (area under the ROC curve) as the evaluation metric, which quantifies the overall performance of a classification model and is widely used in binary classification tasks; a higher AUROC score indicates better detection quality. By employing AUROC, we can objectively assess and compare the effectiveness of the different approaches with a standardized measure of their predictive capabilities.
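For reference, AUROC can be computed directly from anomaly scores with scikit-learn; the data below is a toy illustration, not part of the paper's experiments.

```python
from sklearn.metrics import roc_auc_score

# y_true marks anomalous test images with 1 and normal ones with 0,
# scores are the detector's anomaly scores (higher = more anomalous)
y_true = [0, 0, 1, 1, 0, 1]
scores = [0.10, 0.40, 0.80, 0.90, 0.20, 0.35]
print(roc_auc_score(y_true, scores))
```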

4.3 Scenario Setup

In our semi-supervised anomaly detection setup, we evaluate the performance of SSCL on the MNIST and CIFAR-10 datasets, which are commonly used for image classification and contain images of handwritten digits and real-world objects, respectively. Additionally, we apply SSCL to MVtec, a real-world industrial defect detection dataset, and to STC, which allows us to assess SSCL's performance in more challenging environments and to detect and localize various types of defects across multiple object categories. Following previous works, we adopt the standard one-vs-rest AD setups on MNIST and CIFAR-10, while for MVtec we create an AD setup for each subset depending on its number of classes. In each setup, we designate one class as the normal class, while the remaining classes represent anomalies. To train our model, we use the original training data of the normal class as the unlabeled part of our training set. This ensures that we start with a clean AD setting in which all unlabeled samples are assumed to be normal. The training data of the anomaly classes then form a data pool from which we randomly select anomalies for training, allowing us to create different scenarios.

Table 1. Complete results of experimental scenario (i), where we increase the ratio of labeled anomalies γl in the training set. We report the avg. AUC. (The shallow unsupervised methods OC-SVM, IF and KDE do not use labels, so their results are listed once per dataset.)

Data      γl    OC-SVM  IF    KDE   SSAD  DeepSAD  PaDiM  PatchCore  SSCL
MNIST     .00   96.0    85.4  95.0  96.0  92.8     96.8   95.8       97.5
          .01   -       -     -     96.6  96.4     97.2   97.6       98.6
          .05   -       -     -     93.3  96.1     98.8   98.5       99.2
          .10   -       -     -     90.7  96.9     99.3   99.1       99.5
          .20   -       -     -     87.2  96.9     99.6   99.4       99.6
CIFAR10   .00   62.0    60.0  59.9  62.0  60.9     80.2   79.3       78.5
          .01   -       -     -     73.0  72.6     82.3   83.2       83.6
          .05   -       -     -     71.5  77.9     86.5   87.1       87.5
          .10   -       -     -     70.1  79.8     87.2   88.0       88.5
          .20   -       -     -     67.4  81.9     88.9   89.1       89.2


In our study, we explore three distinct scenarios that are based on prior research and involve the widely used datasets MNIST, CIFAR-10, MVTec and STC. In short, we conduct simulation experiments on MNIST and CIFAR-10 to verify the superiority of SSCL, and we then evaluate the method on the real MVTec and STC datasets to assess its performance in more challenging environments and to detect and localize different classes of defects. These scenarios are carefully chosen to provide representative examples for our experimentation.

4.4 Experimental Scenarios on MNIST and CIFAR-10

(i) Labeled Anomalies Experiment: In this scenario, we examine the impact of incorporating labeled anomalies during training on the detection performance, and determine the advantage of a general semi-supervised anomaly detection approach over other methods. To conduct this experiment, we gradually increase the ratio of labeled training data, γl = m/(n + m), by adding more known anomalies x̃_1,...,x̃_m with corresponding labels ỹ_j = −1 to the training set; in each experiment the labeled anomalies are sampled from one of the anomaly classes. For the unlabeled portion of the training set, we retain the training data of the respective normal class, keeping it free from pollution (γp = 0) in this specific setup. During testing, we consider the remaining nine classes as anomalies, simulating the unpredictable nature of anomalies; this means that there are eight novel classes at testing time. We repeat the process of generating the training set for each AD setup, considering all nine respective anomaly classes, and report the average results over the ten AD setups multiplied by the nine anomaly classes, i.e., a total of 90 experiments for each labeled ratio γl.

(ii) Polluted Training Data Experiment: In this experiment, we examine the robustness of the different methods to an increasing pollution ratio γp of the training set caused by unlabeled anomalies. To accomplish this, we introduce anomalies from all nine respective anomaly classes into the unlabeled portion of the training set for each AD setup. By doing so, we simulate the effect of increasing pollution in the training data. To maintain consistency, we fix the ratio of labeled training samples at γl = 0.05. We iterate the process of generating the training set for each AD setup, considering all nine respective anomaly classes, and report the average results over the resulting 90 experiments for each pollution ratio γp. We hypothesize that a semi-supervised anomaly detection approach, which learns from labeled anomalies, mitigates the negative impact of pollution in the unlabeled data, because similar unknown anomalies present in the unlabeled data may be identified through the use of the labeled anomalies.

Table 2. Complete results of experimental scenario (ii), where we pollute the unlabeled part γp of the training set with (unknown) anomalies. We report the avg. AUC.

Data      γp    OC-SVM  IF    KDE   SSAD  DeepSAD  PaDiM  PatchCore  SSCL
MNIST     .00   96.0    85.4  95.0  97.9  96.7     98.5   98.3       98.5
          .01   94.3    85.2  91.2  96.6  95.5     96.5   97.0       97.2
          .05   91.4    83.9  85.5  93.3  93.5     95.1   94.6       94.8
          .10   88.8    82.3  82.1  90.7  91.2     92.0   91.1       92.2
          .20   84.1    78.7  77.4  87.4  86.6     86.9   87.3       87.9
CIFAR10   .00   62.0    60.0  59.9  73.8  77.9     83.1   83.8       83.5
          .01   61.9    59.9  59.2  72.8  76.5     80.6   80.9       81.7
          .05   61.4    59.6  58.1  71.5  74.0     77.3   78.1       79.6
          .10   60.8    58.8  57.3  69.3  71.8     72.3   74.5       76.3
          .20   60.3    57.9  56.2  67.8  68.5     69.3   70.9       72.1


Table 3. Results of experiments on MVtec. We set γk = 0.6 for each dataset category. The best result is highlighted in boldface for each texture and object category in the original table.

Category         SSCL  PaDiM  PatchCore  DeepSAD  SSAD  OC-SVM  KDE   IF
Texture
  carpet         0.72  0.74   0.68       0.47     0.46  0.50    0.36  0.37
  grid           0.69  0.75   0.71       0.80     0.79  0.78    0.82  0.67
  leather        0.79  0.73   0.76       0.75     0.79  0.88    0.44  0.57
  tile           0.83  0.81   0.81       0.80     0.80  0.77    0.76  0.78
  wood           0.87  0.83   0.80       0.86     0.82  0.82    0.87  0.93
  average        0.78  0.77   0.75       0.74     0.73  0.75    0.65  0.66
Object
  hazelnut       0.87  0.79   0.73       0.72     0.71  0.68    0.63  0.04
  bottle         0.80  0.85   0.82       0.95     0.91  0.88    0.88  0.73
  cable          0.82  0.80   0.83       0.82     0.73  0.79    0.79  0.65
  capsule        0.81  0.77   0.75       0.78     0.71  0.57    0.52  0.49
  metal nut      0.75  0.78   0.76       0.74     0.56  0.54    0.55  0.55
  pill           0.82  0.75   0.77       0.82     0.70  0.66    0.65  0.69
  screw          0.72  0.65   0.69       0.28     0.16  0.01    0.03  0.00
  toothbrush     0.75  0.83   0.81       0.98     0.90  0.78    0.94  0.94
  transistor     0.82  0.78   0.74       0.77     0.75  0.73    0.73  0.67
  zipper         0.75  0.81   0.77       0.80     0.80  0.75    0.66  0.72
  average        0.79  0.78   0.77       0.77     0.69  0.64    0.64  0.55
Overall average  0.79  0.78   0.76       0.76     0.71  0.68    0.64  0.59

Results on MNIST and CIFAR-10: The outcomes of scenarios (i) and (ii) are presented in Tables 1 and 2, showcasing the key findings of our study. Table 1 shows that SSCL performs well on the MNIST and CIFAR-10 datasets, and its advantage grows as the ratio of labeled anomalies γl increases. Table 2 demonstrates a decrease in the detection performance of all methods as the level of data pollution increases; once again, SSCL exhibits the highest level of robustness, particularly on CIFAR-10. In terms of both detection performance and stability, SSCL thus shows excellent performance and potential in the simulation experiments on MNIST and CIFAR-10.


Fig. 2. Qualitative results of anomaly localization for SSCL on the MVTec dataset. From left to right: input images, ground-truth mask, predicted mask, predicted heat map and segmentation result.

4.5 Experimental Scenarios on Industrial Anomaly Detection MVtec and STC

SSCL Experimental Setting on MVtec and STC: In our approach, we assume access to a subset of labeled normal samples (χn) and labeled abnormal samples (χa) during the training phase. We consider one class as the normal set, while the remaining classes are treated as anomalies; the samples in both χn and χa are randomly selected. To quantify the amount of labeled data available, we define the labeled ratio γk for both χn and χa, i.e., the proportion of labeled samples relative to the total number of samples. We then train our model with a ResNet-50 backbone for about 1000 epochs to make sure that it has learned good features, and subsequently train a classifier to separate defective from non-defective images. Results on MVtec: We present the results in Table 3, which show that SSCL also performs well on the MVtec dataset, which is closer to real-world conditions. Table 3 illustrates the advantages of our semi-supervised approach to anomaly detection (AD), particularly on the challenging MVtec dataset. SSCL emerges


as the top performer in this scenario. Furthermore, Table 3 highlights the vulnerability of a supervised classification approach when facing novel anomalies during testing, especially when the amount of labeled training data is limited. In contrast, SSCL demonstrates the ability to generalize to novel anomalies while leveraging the labeled examples. For defect localization, we use a visual explanation technique, Grad-CAM [16], to highlight the regions that affect the decision of the anomaly detector. We show qualitative results in Fig. 2: the predicted masks are accurate, the predicted heat maps (fourth column) fit the defects well, and the last column shows the segmentation results, which segment the defects correctly. Results on STC: For further evaluation, we also test SSCL on the STC dataset, which simulates video surveillance from a static camera. The results are shown in Table 4. SSCL also exhibits excellent performance on STC, confirming the contribution of our SSCL model to semi-supervised anomaly detection.

Table 4. Comparison of our SSCL model with the state of the art for anomaly detection on STC (avg. AUC).

SSCL  PaDiM  PatchCore  DeepSAD  SSAD  OC-SVM  KDE   IF
83.8  83.2   81.9       79.3     76.6  73.3    72.5  74.8

4.6 Ablation Study

In our ablation studies, we evaluated the effectiveness of the SCL and CL modules by removing each component individually. The results in Table 5 show significant performance improvements and a higher AUC score compared to the baseline when both modules are included, which confirms the effectiveness of both SCL and CL. The best performance is achieved when the SCL and CL modules are combined, indicating their highly complementary nature.

Table 5. Results of the ablation studies, where γl = 0.05, γp = 0.1 and γk = 0.6. We report the avg. AUC ("✓" indicates the module is included, "×" that it is removed).

SCL  CL   MNIST  CIFAR10  MVtec
×    ×    87.5   73.6     71.2
×    ✓    90.5   76.9     75.3
✓    ×    91.2   78.1     78.2
✓    ✓    93.5   81.3     79.6

5 Conclusion and Future Work

We proposed a semi-supervised contrastive learning method, SSCL, for industrial anomaly detection. By combining self-supervised contrastive learning with supervised contrastive learning, our approach effectively utilizes both labeled and unlabeled data to learn informative and discriminative representations. Experimental results on benchmark datasets, including MNIST, CIFAR-10, MVtec and STC, demonstrated the superior performance of SSCL compared to state-of-the-art methods, even with limited labeled data. For future work, it is important to evaluate SSCL on larger and more diverse datasets, explore alternative contrastive learning techniques, investigate the interpretability of the learned representations, and extend the method to other related tasks such as outlier and novelty detection.

References

1. Pang, G., Shen, C., Cao, L., Hengel, A.V.D.: Deep learning for anomaly detection: a review. ACM Comput. Surv. (CSUR) 54(2), 1–38 (2021)
2. Goldstein, M., Uchida, S.: A comparative evaluation of unsupervised anomaly detection algorithms for multivariate data. PLoS ONE 11(4), e0152173 (2016)
3. Seliya, N., Abdollah Zadeh, A., Khoshgoftaar, T.M.: A literature review on one-class classification and its potential applications in big data. J. Big Data 8(1), 1–31 (2021). https://doi.org/10.1186/s40537-021-00514-x
4. Amer, M., Goldstein, M., Abdennadher, S.: Enhancing one-class support vector machines for unsupervised anomaly detection. In: Proceedings of the ACM SIGKDD Workshop on Outlier Detection and Description, pp. 8–15 (2013)
5. Chen, Y.C.: A tutorial on kernel density estimation and recent advances. Biostat. Epidemiol. 1(1), 161–187 (2017)
6. Liu, F.T., Ting, K.M., Zhou, Z.H.: Isolation forest. In: 2008 Eighth IEEE International Conference on Data Mining, pp. 413–422. IEEE (2008)
7. Pang, G., Shen, C., van den Hengel, A.: Deep anomaly detection with deviation networks. In: Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 353–362 (2019)
8. Khosla, P., et al.: Supervised contrastive learning. In: Advances in Neural Information Processing Systems, vol. 33, pp. 18661–18673 (2020)
9. Budiarto, E.H., Permanasari, A.E., Fauziati, S.: Unsupervised anomaly detection using k-means, local outlier factor and one class SVM. In: 2019 5th International Conference on Science and Technology (ICST), vol. 1, pp. 1–5. IEEE (2019)
10. Ruff, L., et al.: Deep semi-supervised anomaly detection. arXiv preprint arXiv:1906.02694 (2019)
11. Görnitz, N., Kloft, M., Rieck, K., Brefeld, U.: Toward supervised anomaly detection. J. Artif. Intell. Res. 46, 235–262 (2013)
12. Jaiswal, A., Babu, A.R., Zadeh, M.Z., Banerjee, D., Makedon, F.: A survey on contrastive self-supervised learning. Technologies 9(1), 2 (2020)
13. Tian, Y., Krishnan, D., Isola, P.: Contrastive multiview coding. In: Vedaldi, A., Bischof, H., Brox, T., Frahm, J.-M. (eds.) ECCV 2020. LNCS, vol. 12356, pp. 776–794. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-58621-8_45


14. Hendrycks, D., Mazeika, M., Dietterich, T.: Deep anomaly detection with outlier exposure. arXiv preprint arXiv:1812.04606 (2018)
15. Zheng, M., et al.: Weakly supervised contrastive learning. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 10042–10051 (2021)
16. Selvaraju, R.R., Cogswell, M., Das, A., Vedantam, R., Parikh, D., Batra, D.: Grad-CAM: visual explanations from deep networks via gradient-based localization. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 618–626 (2017)
17. Chen, T., Kornblith, S., Norouzi, M., Hinton, G.: A simple framework for contrastive learning of visual representations. In: International Conference on Machine Learning, pp. 1597–1607. PMLR (2020)
18. Liu, Z., Ma, Y., Ouyang, Y., Xiong, Z.: Contrastive learning for recommender system. arXiv preprint arXiv:2101.01317 (2021)
19. Wu, Z., Wang, S., Gu, J., Khabsa, M., Sun, F., Ma, H.: CLEAR: contrastive learning for sentence representation. arXiv preprint arXiv:2012.15466 (2020)
20. Defard, T., Setkov, A., Loesch, A., Audigier, R.: PaDiM: a patch distribution modeling framework for anomaly detection and localization. In: Del Bimbo, A., et al. (eds.) ICPR 2021. LNCS, vol. 12664, pp. 475–489. Springer, Cham (2021). https://doi.org/10.1007/978-3-030-68799-1_35
21. Roth, K., Pemula, L., Zepeda, J., Schölkopf, B., Brox, T., Gehler, P.: Towards total recall in industrial anomaly detection. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 14318–14328 (2022)
22. Liu, W., Luo, W., Lian, D., Gao, S.: Future frame prediction for anomaly detection: a new baseline. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 6536–6545 (2018)

One Step Large-Scale Multi-view Subspace Clustering Based on Orthogonal Matrix Factorization with Consensus Graph Learning

Xinrui Zhang1, Kai Li1,2(B), and Jinjia Peng1,2(B)

1 School of Cyber Security and Computer, Hebei University, Baoding 071000, China
[email protected], {likai,pengjinjia}@hbu.edu.cn
2 Hebei Machine Vision Engineering Research Center, Baoding, China

Supported by Central Government Guides Local Science and Technology Development Fund Projects (236Z0301G); Hebei Natural Science Foundation (F2022201009); Science and Technology Project of Hebei Education Department (QN2023186).

Abstract. Multi-view clustering has always been a widely studied problem due to its wide range of applications. Since real-world datasets are usually very large, clustering large-scale multi-view datasets has become a research hotspot. Most existing methods for large-scale multi-view data consist of several independent steps, namely anchor point generation, graph construction, and generation of the clustering result; they produce inflexible anchor points, and the processes of obtaining the cluster indicator matrix and constructing the graph are separated from each other, which leads to suboptimal results. To address these issues, a one-step multi-view subspace clustering model based on orthogonal matrix factorization with consensus graph learning (CGLMVC) is proposed. Specifically, our method puts anchor point learning, graph construction, and clustering result generation into a unified learning framework; these three processes are learned adaptively to boost each other, which yields flexible anchor representations and improves the clustering quality. In addition, no post-processing steps are needed. We also propose an alternating optimization algorithm with guaranteed convergence, which is shown to have linear time complexity. Experiments on several real-world large-scale multi-view datasets demonstrate its efficiency and scalability.

Keywords: Multi-view clustering · Self-expressive subspace clustering · Large-scale datasets · Orthogonal matrix factorization

1 Introduction

In the last decade, multi-view clustering of data containing heterogeneous information has received a lot of attention [15,16]. Data can be obtained from different


feature extractors; for example, a website can be composed of Chinese text, English text and pictures, so how to effectively utilize such heterogeneous information for multi-view clustering is a research hotspot. A lot of research has been done in the field of multi-view clustering, mainly including classical clustering methods [2], non-negative matrix factorization methods [9], spectral clustering methods [7,8], and other typical methods [3,11,13]. However, with the explosive growth of visual data, clustering large-scale image data effectively has become a challenging problem. Existing single-view and multi-view clustering methods often cannot be applied effectively to large-scale multi-view data because of their high computational complexity and storage costs.

To address the high computational cost and memory overhead of large-scale multi-view clustering, binary code learning provides a good solution. Zhang et al. [21] learn collaborative discrete representations and a binary clustering structure jointly. Zhao et al. [22] orthogonalized the mapping matrix of each view and embedded a binary graph structure into a unified binary multi-view clustering model. Jiang et al. [5] performed discrete hash encoding based on graph clustering to learn efficient binary codes. Wang et al. [17] performed discrete hash encoding based on an auto-encoder to learn efficient binary codes. In addition, there are anchor-based methods, which improve on multi-view subspace clustering by selecting a group of anchor points as representatives. Kang et al. [6] construct a bipartite graph between the original data points and the generated anchor points. Wang et al. [18] captured the correspondence between feature information and structural information to obtain aligned anchor points. Liu et al. [10] first utilized graph filtering to eliminate high-frequency noise and designed a new strategy to select anchor points. However, with most of the above strategies the obtained anchor points and the subsequent graph fusion process are split, and a separate step is still required to obtain the clustering results, which makes the clustering results inflexible and leads to suboptimal performance.

Therefore, to solve these problems, the proposed framework integrates the three steps of the traditional anchor-based pipeline: the quality of the obtained consensus graph is improved through adaptive optimization of the generated anchor points, and both act on the orthogonal matrix factorization to improve the cluster quality. The three steps reinforce each other and are carried out adaptively throughout the optimization process. The contributions of the proposed method are summarized below:

• This paper proposes an adaptively weighted large-scale multi-view subspace clustering method which integrates anchor learning, graph construction, and matrix factorization into a unified framework. All three components can negotiate with each other to promote clustering quality. Moreover, an orthogonal constraint is imposed on the learned anchors, which makes the anchors more flexible and discriminating.
• The clustering results are obtained directly through orthogonal matrix factorization without any additional steps. An orthogonal constraint is put on the


centroid matrix so that it has a strict cluster interpretation, while the nuclear norm is introduced on the centroid matrix to obtain a more compact structure.
• An alternating optimization algorithm is proposed to solve the optimization problem, which gives the method nearly linear complexity. Extensive experiments have been carried out on multiple real-world large-scale datasets to verify the effectiveness of the proposed method.

2 Proposed Method

Notations. This paper uses capital letters to denote matrices. ‖·‖_F denotes the Frobenius norm of a matrix and ‖·‖_* denotes its nuclear norm. We use A_v to denote the v-th view of A, I_d to denote the identity matrix of dimension d, and 1 to denote the vector with all elements equal to 1.

2.1 Overview and Objective Function

To obtain high-quality bases anchor, this paper proposes an effective algorithm which combines anchor selection, graph construction, and cluster allocation into an overall framework, these three processes can promote each other and optimize together. The overall flowchart of this method is shown in Fig. 1.

Fig. 1. The overview of the proposed CGLMVC. This method adaptively learns the consensus low-dimension anchors with multi-view information and simultaneously constructs the anchor graph, meanwhile, the orthogonal matrix factorization has been put on the self-expressive subspace graph Z to obtain clustering results directly. These three parts are jointly optimized and boosted.

We add an orthogonal constraint to the learned anchors. Since multi-view clustering is based on a common assumption that multiple views share a potentially consistent representation of the data distribution [20], we jointly learn a common self-representation graph for all views. Meanwhile, we introduce an orthogonal non-negative matrix factorization to the obtained self-expressive matrix Z, for more representative results, we remove the non-negative constraints imposed on the matrix. Then, the nuclear norm is introduced on the common

116

X. Zhang et al.

centroid representation matrix to improve clustering performance. Our overall objective function is as follows:

$$\min_{\alpha_v, A_v, Z, W, H} \mathcal{J}(\alpha_v, A_v, Z, W, H) = \sum_{v=1}^{m} \alpha_v^2 \|X_v - A_v Z\|_F^2 + \beta \|Z\|_F^2 + \lambda \|Z - WH\|_F^2 + \|W\|_*$$
$$\text{s.t.}\ \ \alpha^\top \mathbf{1} = 1,\ A_v^\top A_v = I_k,\ Z \ge 0,\ Z^\top \mathbf{1}_k = \mathbf{1}_n,\ W^\top W = I_d,\ H \in \{0,1\}^{c\times n},\ \sum_{i=1}^{c} H_{ij} = 1,\ \forall j = 1,\dots,n, \qquad (1)$$

where α_v adaptively learns the weight of each view, since the views are weighted differently, and β is a non-negative balance coefficient. X_v ∈ R^{d_v×n} denotes the v-th view of the original data, where d_v is the dimension of the view and n is the number of samples. A_v ∈ R^{d_v×k} is the anchor matrix of the v-th view, where k is the number of anchor points. Z ∈ R^{k×n} is the learned consensus graph, a common subspace representation matrix that combines the anchor representations of all views into a single graph. W ∈ R^{k×c} is the centroid matrix, where c is the number of clusters, and H ∈ R^{c×n} is the cluster assignment matrix. λ is a non-negative regularization parameter, and the last term is the nuclear norm applied to W to guarantee the quality of the learned centroid matrix.

2.2 Optimization

Our objective function in Eq. 1 is clearly non-convex as a joint optimization problem over all variables, so we design an alternating optimization algorithm that optimizes each variable while keeping the others fixed. We introduce the augmented Lagrange multiplier method to turn Eq. 1 into

$$\min_{\alpha_v, A_v, Z, W, H, G, S} \mathcal{J} = \sum_{v=1}^{m} \alpha_v^2 \|X_v - A_v Z\|_F^2 + \beta\|Z\|_F^2 + \lambda\|Z - GH\|_F^2 + \|W\|_* + \mathrm{Tr}\!\left(S^\top (W - G)\right) + \frac{\mu}{2}\|W - G\|_F^2$$
$$\text{s.t.}\ \ \alpha^\top \mathbf{1} = 1,\ A_v^\top A_v = I_k,\ Z \ge 0,\ Z^\top \mathbf{1} = \mathbf{1},\ W^\top W = I_d,\ H \in \{0,1\}^{c\times n},\ \sum_{i=1}^{c} H_{ij} = 1,\ \forall j = 1,\dots,n, \qquad (2)$$

S ∈ Rk×c here stands for the Lagrange multiplier. G ∈ Rk×c is used for substituting W to complete the optimization of the nuclear norm, which accomplishes the requirement of parameters for the Lagrange multiplier method. μ is a positive penalty parameter.


Update A_v. Fixing the other variables, the objective function can be rewritten as

$$\min_{A_v} \sum_{v=1}^{m} \alpha_v^2 \|X_v - A_v Z\|_F^2, \quad \text{s.t.}\ A_v^\top A_v = I_k. \qquad (3)$$

Expanding the Frobenius norm in Eq. 3, we obtain the equivalent form

$$\max_{A_v} \mathrm{Tr}(A_v^\top B_v), \quad \text{s.t.}\ A_v^\top A_v = I_k, \qquad (4)$$

where B_v = X_v Z^⊤. According to [19], if the singular value decomposition (SVD) of B_v is U_{b_v} Σ_{b_v} V_{b_v}^⊤, then the optimal solution for A_v is U_{b_v} V_{b_v}^⊤.
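A small NumPy sketch of this SVD-based update (our illustration; the names follow the notation above) could look as follows.

```python
import numpy as np

def update_anchor(X_v, Z):
    """A_v update (Eqs. 3-4): argmax Tr(A^T B) s.t. A^T A = I_k, with B_v = X_v Z^T."""
    B = X_v @ Z.T                                     # shape (d_v, k)
    U, _, Vt = np.linalg.svd(B, full_matrices=False)  # thin SVD of B_v
    return U @ Vt                                     # optimal orthogonal anchor matrix
```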

s.t.

m 

αv2 Xv − Av Z2F + βZ2F + λZ − GH2F ,

v=1

(5)

Z ≥ 0, Z  1 = 1,

the above optimization problem on Z can be divided into the following n subproblems, which can be rewritten as the following quadratic programming (QP) problem 1  min Z:,j CZ:,j + f  Z:,j , 2 (6)  s.t. Z:,j 1 = 1, Z ≥ 0, m m    Av − 2λH:,j G , it where C = 2( v=1 αv2 + β + λ)Ik , f  = −2 v=1 αv2 Xv[:,j] can be optimized by solving the QP problem for each column of Z. Update W. To obtain W , we fix the rest of the extraneous variables so that we can get the following optimization problem about W μ min W ∗ + T r(S  (W − G)) + W − G2F , W 2 (7) s.t. W  W = Id , by folding Eq. 7, we can obtain the following form 1 1 1 W ∗ + W − (G − S)2F , μ 2 μ s.t. W  W = Id , min W

(8)

The optimal solution of W in this form can be obtained by the singular value thresholding (SVT) method; the obtained result is then orthogonalized.
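A NumPy sketch of this step is given below. The soft-thresholding of the singular values follows the standard SVT operator for Eq. 8; the final projection onto W^⊤W = I_d is our reading of "the obtained results are then orthogonalized", which the paper does not spell out.

```python
import numpy as np

def update_w(G, S, mu):
    """W update (Eq. 8): singular value thresholding of M = G - S/mu, then re-orthogonalization."""
    M = G - S / mu
    U, sig, Vt = np.linalg.svd(M, full_matrices=False)
    W = U @ np.diag(np.maximum(sig - 1.0 / mu, 0.0)) @ Vt   # soft-threshold singular values by 1/mu
    Uw, _, Vtw = np.linalg.svd(W, full_matrices=False)      # project back so that W^T W = I_d
    return Uw @ Vtw
```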


Update H. The optimization problem for H can be obtained by treating the quantities unrelated to H as constants:

$$\min_{H} \lambda\|Z - GH\|_F^2, \quad \text{s.t.}\ H \in \{0,1\}^{c\times n},\ \sum_{i=1}^{c} H_{ij} = 1,\ \forall j = 1,\dots,n. \qquad (9)$$

Since each column of H is a one-hot vector, optimizing H as a whole is difficult. Because each column of H indicates the cluster to which the corresponding sample belongs, and the samples are independent of each other, the problem can be divided into the following subproblems:

$$\min_{H_{:,j}} \|Z_{:,j} - G H_{:,j}\|_F^2, \quad \text{s.t.}\ H_{:,j} \in \{0,1\}^{c},\ \sum_{i=1}^{c} H_{ij} = 1. \qquad (10)$$

For the problem in Eq. 10 we set H_{i*,j} = 1, where i* is the optimal row, and all other entries of this column are set to 0. The index i* is obtained by

$$i^{*} = \arg\min_{i} \|Z_{:,j} - G_{:,i}\|_F^2. \qquad (11)$$
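In code, this update is simply a nearest-centroid assignment of every column of Z to a column of G; the following NumPy sketch (names ours) illustrates Eqs. 9-11.

```python
import numpy as np

def update_h(Z, G):
    """H update (Eqs. 9-11): assign each column of Z to its nearest column of G."""
    k, n = Z.shape
    c = G.shape[1]
    # squared distances between every Z[:, j] and every centroid G[:, i], shape (c, n)
    d2 = ((Z[:, None, :] - G[:, :, None]) ** 2).sum(axis=0)
    idx = d2.argmin(axis=0)                 # i* for each column j
    H = np.zeros((c, n))
    H[idx, np.arange(n)] = 1.0              # one-hot columns
    return H
```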

Update α_v. Fixing the variables unrelated to α_v, we obtain the optimization problem

$$\min_{\alpha_v} \sum_{v=1}^{m} \alpha_v^2 \|X_v - A_v Z\|_F^2, \quad \text{s.t.}\ \alpha^\top \mathbf{1} = 1. \qquad (12)$$

Let D_v = ‖X_v − A_v Z‖_F. According to the Cauchy-Schwarz inequality, the optimal solution of α_v can be obtained directly as α_v = M/D_v, with M = 1/(Σ_{v=1}^{m} 1/D_v).
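The closed-form weight update can be transcribed as the short NumPy sketch below (our illustration, following the formula as reconstructed above; the small epsilon is only a numerical guard we add).

```python
import numpy as np

def update_alpha(X_list, A_list, Z):
    """View-weight update: alpha_v = (1/D_v) / sum_v (1/D_v), with D_v = ||X_v - A_v Z||_F."""
    D = np.array([np.linalg.norm(X - A @ Z, 'fro') for X, A in zip(X_list, A_list)])
    inv = 1.0 / np.maximum(D, 1e-12)    # guard against a perfectly reconstructed view
    return inv / inv.sum()
```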

Update G. Treating the other variables as constants, the optimization function of G becomes

$$\min_{G} \lambda\|Z - GH\|_F^2 + \mathrm{Tr}\!\left(S^\top(W - G)\right) + \frac{\mu}{2}\|W - G\|_F^2. \qquad (13)$$

As can be seen, this objective is convex, so the solution can be obtained directly by setting its derivative to zero, which gives

$$G = \frac{1}{2\lambda + \mu}\left(S + \mu W + 2\lambda Z H^\top\right). \qquad (14)$$
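The following sketch transcribes the closed-form G update of Eq. 14 together with the multiplier step for S described next; function and variable names are ours.

```python
import numpy as np

def update_g_and_s(Z, H, W, S, lam, mu):
    """Closed-form G update (Eq. 14) followed by the Lagrange multiplier step for S."""
    G = (S + mu * W + 2.0 * lam * Z @ H.T) / (2.0 * lam + mu)
    S = S + mu * (W - G)
    return G, S
```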

CGLMVC

119

Update S. S is the Lagrange multiplier, so it is updated according to the following rule:

$$S = S + \mu(W - G). \qquad (15)$$

2.3 Complexity Analysis

The computational burden of the proposed method consists of the optimization of each variable. The time cost of updating A_v is O(d_v k) for the matrix multiplication and O(d_v k²) for the SVD. Updating Z requires O(nk³) for solving the QP problems over all columns. Updating W needs O(k²c) for the SVT step. Updating H costs O(knc), and updating G requires O(nck); the costs of updating α_v and S are both O(1). Since n ≫ k and n ≫ c, the time complexity of the proposed method is nearly linear in n. As for the space complexity, the major memory costs are the matrices A_v ∈ R^{d_v×k}, Z ∈ R^{k×n}, W ∈ R^{k×c}, H ∈ R^{c×n}, S ∈ R^{k×c} and G ∈ R^{k×c}, i.e., O((k + c)n + (m + 3c)k) with m = Σ_{i=1}^{v} d_i, so the space complexity is also nearly O(n).

3 Experiments

3.1 Datasets and Comparison Methods

We evaluate our proposed method on eight widely used large-scale datasets: Caltech101-20, CCV, Caltech101-all, SUNRGBD, NUSWIDEOBJ, AwA, MNIST and YouTubeFace. Caltech101 is an object image dataset and Caltech101-20 is a subset of it. CCV is a database of YouTube videos containing 20 semantic categories. SUNRGBD is an indoor image dataset. NUS-WIDE is a multi-label web image dataset. The Animals With Attributes (AwA) dataset contains animal images. MNIST is a handwritten digit dataset. YouTubeFace is a dataset of face videos collected from YouTube. In our experiments, we compare our method with the following state-of-the-art multi-view clustering methods: Multi-view Subspace Clustering (MVSC) [4], Parameter-Free Auto-Weighted Multiple Graph Learning (AMGL) [12], Multi-view Low-rank Sparse Subspace Clustering (MLRSSC) [1], Large-scale Multi-view Subspace Clustering in Linear Time (LMVSC) [6], Scalable Multi-view Subspace Clustering with Unified Anchors (SMVSC) [14], and Fast Parameter-Free Multi-View Subspace Clustering with Consensus Anchor Guidance (FPMVS-CAG) [20].

3.2 Experimental Setup

In our experiments, we initialize A, Z, and W to zero and H to the identity matrix; α_v is initialized to 1/v, where v is the number of views. We take the number of clusters c as input and then select the number of anchor points within


{c, 2c, 3c} for our experiments, and set the common dimension d equal to the number of clusters c. Our method has two hyperparameters, β and λ; β is varied over [0.01, 0.1, 1, 10] and λ over [0.1, 1, 10, 20, 30]. For the compared methods, we use the best results they published. We use the widely used Normalized Mutual Information (NMI), Accuracy (ACC), Purity, and Fscore to evaluate clustering performance. Our experiments were run on a standard Linux computer with an Intel(R) Xeon(R) Silver 4214R CPU @ 2.40 GHz and 503 GB RAM, using MATLAB 2021a (64-bit).

3.3 Experimental Results

Table 1 lists the clustering performance of our method and the performance of the other 6 comparison algorithms on 8 benchmark datasets. From the results of the comparison, the following can be observed

Fig. 2. Clustering performance in terms of ACC on eight datasets.

Clustering Performance. The clustering results on the eight datasets are shown in Table 1, which reports the ACC, NMI, Purity and Fscore of the six compared multi-view clustering methods and of our method. The best clustering results are highlighted in bold and the second best are underlined. In addition, "–" means that the method runs out of memory on the corresponding dataset or that no result is available. Figure 2 shows the clustering performance in terms of ACC to illustrate the comparison more directly. According to Table 1, the proposed method always achieves the best or second-best result. For example, on the Caltech101-20, Caltech101-all, and SUNRGBD datasets, the proposed method achieves improvements of 2.67%, 1.68%, and 1.46% in ACC, respectively, and it achieves optimal or suboptimal results on the datasets with more than 10,000 samples. At the same time, it can be seen from the table that anchor-based methods (LMVSC, FPMVS-CAG, SMVSC and the proposed method) are more suitable


Table 1. Clustering performance of compared methods. "–" means out-of-memory failure. The best results are highlighted in boldface and the underlines indicate the second-best competitors in the original table.

Datasets        Metrics  MVSC    AMGL    MLRSSC  LMVSC   SMVSC   FPMVS-CAG  Ours
Caltech101-20   ACC      0.5080  0.1876  0.3600  0.4304  0.6132  0.6547     0.6814
                NMI      0.5271  0.1101  0.2008  0.5553  0.5873  0.6326     0.6439
                Purity   0.7125  0.6313  0.4476  0.7125  0.6999  0.7368     0.7841
                Fscore   0.4329  0.4661  0.3069  0.3414  0.6699  0.6905     0.7123
CCV             ACC      –       0.1102  0.1259  0.2014  0.2182  0.2399     0.2426
                NMI      –       0.0758  0.0471  0.1657  0.1684  0.1760     0.1753
                Purity   –       0.2021  0.1307  0.2396  0.2439  0.2605     0.2680
                Fscore   –       0.1215  0.1215  0.1194  0.1307  0.1419     0.1521
Caltech101-all  ACC      –       0.0359  0.1365  0.2005  0.2750  0.3015     0.3183
                NMI      –       0.0187  0.1066  0.4155  0.3510  0.3549     0.3676
                Purity   –       0.4311  0.1371  0.3975  0.3395  0.3460     0.4048
                Fscore   –       0.3617  0.0815  0.1586  0.2224  0.2326     0.2296
SUNRGBD         ACC      –       0.0643  0.1741  0.1858  0.1930  0.2392     0.2538
                NMI      –       0.0371  0.1108  0.2607  0.2007  0.2418     0.2644
                Purity   –       0.2411  0.1741  0.3818  0.2971  0.3400     0.3521
                Fscore   –       0.1894  0.1453  0.1201  0.1279  0.1597     0.1613
NUSWIDEOBJ      ACC      –       –       –       0.1583  0.1916  0.1946     0.1922
                NMI      –       –       –       0.1337  0.1272  0.1351     0.1286
                Purity   –       –       –       0.2488  0.2331  0.2382     0.2310
                Fscore   –       –       –       0.0990  0.1365  0.1372     0.1428
AwA             ACC      –       –       –       0.0770  0.0878  0.0919     0.0990
                NMI      –       –       –       0.0879  0.1061  0.1083     0.1033
                Purity   –       –       –       0.0957  0.0993  0.0961     0.0939
                Fscore   –       –       –       0.0378  0.0636  0.0640     0.0687
MNIST           ACC      –       –       –       0.9852  0.9875  0.9884     0.9891
                NMI      –       –       –       0.9576  0.9627  0.9651     0.9625
                Purity   –       –       –       0.9852  0.9875  0.9884     0.9896
                Fscore   –       –       –       0.9704  0.9751  0.9768     0.9753
YouTubeFace     ACC      –       –       –       0.1479  0.2587  0.2414     0.2562
                NMI      –       –       –       0.1327  0.2292  0.2433     0.2486
                Purity   –       –       –       0.2816  0.3321  0.3279     0.3313
                Fscore   –       –       –       0.0849  0.0849  0.1433     0.1328

for large-scale datasets than traditional methods, and traditional methods usually encounter errors such as insufficient memory to calculate clustering results, so these anchor-based methods can be suitable for more situations. The good performance of the proposed method on large-scale datasets proves the superiority of combining anchor learning, graph construction and clustering processes into one step, so that better results can be obtained than other methods.


Fig. 3. Comparison results on the large-scale multi-view datasets in terms of running time in seconds.

Running Time. Figure 3 shows the time taken by the anchor-based methods on the five datasets with more than 10,000 samples; the values are plotted on a logarithmic scale for visualization. It can be seen that the proposed method achieves a good balance between computation time and clustering performance: compared with these methods, it consumes less time while remaining very effective in terms of performance. Therefore, in terms of both clustering results and required time, the proposed method, which adaptively generates orthogonal anchor bases for each view while obtaining clustering results directly from the constructed graph, behaves better over the whole clustering process. In summary, the proposed method strikes a balance between time and performance, so it is well suited to clustering large-scale datasets. Parameter Analysis. In this section, we illustrate the parameter sensitivity on two typical datasets, Caltech101-20 and Caltech101-all. The proposed method has two parameters of concern, β and λ, and both are related to the number of selected anchors, so we show their sensitivity with respect to the number of anchors in Fig. 4(1). We can see that the proposed

Fig. 4. (1) Parameter analysis results on the Caltech101 datasets: a, b for Caltech101-20 and c, d for Caltech101-all. (2) The learned view coefficients on eight datasets.


method is robust to these two parameters: the results do not fluctuate much when the number of anchors changes, which demonstrates the robustness of the proposed method. Coefficients of Views. As shown in Fig. 4(2), we report the learned coefficient of each view on the eight datasets. In the proposed method, the coefficients are learned adaptively during the whole optimization process. We impose a constraint that makes the view coefficients sum to one; with this constraint, the weight of each view is learned according to its values, so that the more important views have a bigger effect on the clustering process. From Fig. 4(2) we can see that the self-adaptive weights are jointly optimized, which makes the method more practical and reasonable in application. Convergence Analysis. Since the objective function in Eq. 1 is non-convex and the nuclear norm has to be optimized, the alternating minimization algorithm based on the Lagrange multiplier method is used to solve the objective. Since the objective value is non-increasing as the number of iterations grows, the final convergence of the algorithm can be guaranteed. We record the objective values on some of the benchmark datasets, as shown in Fig. 5: as the number of iterations increases, the objective generally becomes stable after about 15 iterations, which means that the proposed method converges after a limited number of iterations.

Fig. 5. The convergence analysis through iterations on 8 benchmark datasets.

4 Conclusion

In this paper, a new unified model has been proposed for large-scale multi-view clustering. It combines anchor learning, graph construction, and clustering into a unified framework. We also propose an alternating minimization optimization strategy for the objective function, which has almost linear time complexity. Experiments on several large real-world datasets demonstrate the superiority of the proposed method.


References

1. Brbić, M., Kopriva, I.: Multi-view low-rank sparse subspace clustering. Pattern Recogn. 73, 247–258 (2018)
2. Cai, X., Nie, F., Huang, H.: Multi-view k-means clustering on big data. In: Twenty-Third International Joint Conference on Artificial Intelligence (2013)
3. Chen, M.S., Huang, L., Wang, C.D., Huang, D., Lai, J.H.: Relaxed multi-view clustering in latent embedding space. Inf. Fusion 68, 8–21 (2021)
4. Gao, H., Nie, F., Li, X., Huang, H.: Multi-view subspace clustering. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 4238–4246 (2015)
5. Jiang, G., Wang, H., Peng, J., Chen, D., Fu, X.: Graph-based multi-view binary learning for image clustering. Neurocomputing 427, 225–237 (2021)
6. Kang, Z., Zhou, W., Zhao, Z., Shao, J., Han, M., Xu, Z.: Large-scale multi-view subspace clustering in linear time. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 34, pp. 4412–4419 (2020)
7. Kumar, A., Rai, P., Daume, H.: Co-regularized multi-view spectral clustering. In: Advances in Neural Information Processing Systems, vol. 24 (2011)
8. Li, Y., Nie, F., Huang, H., Huang, J.: Large-scale multi-view spectral clustering via bipartite graph. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 29 (2015)
9. Liu, J., Wang, C., Gao, J., Han, J.: Multi-view clustering via joint nonnegative matrix factorization. In: Proceedings of the 2013 SIAM International Conference on Data Mining, pp. 252–260. SIAM (2013)
10. Liu, L., Chen, P., Luo, G., Kang, Z., Luo, Y., Han, S.: Scalable multi-view clustering with graph filtering. Neural Comput. Appl. 34(19), 16213–16221 (2022)
11. Liu, X., et al.: Late fusion incomplete multi-view clustering. IEEE Trans. Pattern Anal. Mach. Intell. 41(10), 2410–2423 (2018)
12. Nie, F., Li, J., Li, X., et al.: Parameter-free auto-weighted multiple graph learning: a framework for multiview clustering and semi-supervised classification. In: IJCAI, pp. 1881–1887 (2016)
13. Nie, F., Li, J., Li, X., et al.: Self-weighted multiview clustering with multiple graphs. In: IJCAI, pp. 2564–2570 (2017)
14. Sun, M., et al.: Scalable multi-view subspace clustering with unified anchors. In: Proceedings of the 29th ACM International Conference on Multimedia, pp. 3528–3536 (2021)
15. Wang, H., Feng, L., Yu, L., Zhang, J.: Multi-view sparsity preserving projection for dimension reduction. Neurocomputing 216, 286–295 (2016)
16. Wang, H., Jiang, G., Peng, J., Deng, R., Fu, X.: Towards adaptive consensus graph: multi-view clustering via graph collaboration. IEEE Trans. Multimedia (2022)
17. Wang, H., Yao, M., Jiang, G., Mi, Z., Fu, X.: Graph-collaborated auto-encoder hashing for multiview binary clustering. IEEE Trans. Neural Netw. Learn. Syst. (2023)
18. Wang, S., et al.: Align then fusion: generalized large-scale multi-view clustering with anchor matching correspondences. arXiv preprint arXiv:2205.15075 (2022)
19. Wang, S., et al.: Multi-view clustering via late fusion alignment maximization. In: IJCAI, pp. 3778–3784 (2019)
20. Wang, S., et al.: Fast parameter-free multi-view subspace clustering with consensus anchor guidance. IEEE Trans. Image Process. 31, 556–568 (2021)


21. Zhang, Z., Liu, L., Shen, F., Shen, H.T., Shao, L.: Binary multi-view clustering. IEEE Trans. Pattern Anal. Mach. Intell. 41(7), 1774–1782 (2018)
22. Zhao, J., Kang, F., Zou, Q., Wang, X.: Multi-view clustering with orthogonal mapping and binary graph. Expert Syst. Appl. 213, 118911 (2023)

Deep Multi-task Image Clustering with Attention-Guided Patch Filtering and Correlation Mining Zhongyao Tian1 , Kai Li1,2(B) , and Jinjia Peng1,2(B) 1 2

School of Cyber Security and Computer, Hebei University, Baoding 071000, China Hebei Machine Vision Engineering Research Center, Baoding 071000, Hebei, China [email protected] Abstract. Deep Multi-task image clustering endeavors to leverage deep learning techniques for the simultaneous processing of multiple clustering tasks. Current multi-task deep image clustering approaches typically rely on conventional deep convolutional neural networks and transfer learning technologies. However, suboptimal clustering results are produced in the execution of each task including irrelevant redundant information. This paper proposes a novel end-to-end deep multi-task clustering framework named Deep Multi-Task Image Clustering with Attention-guided Patch Filtering and Correlation Mining (APFMTC) that eliminates redundant information between different tasks while extracting relevant information to achieve improved cluster division. Specifically, APFMTC partitions image samples into several patches, treating each patch as a word thus each image is regarded as an article, and the process of determining the cluster to which an image belongs is likened to categorizing articles. During the clustering process, several parts of each image sample generally carry more significance. Therefore, a weights estimation module is designed to evaluate the importance of different visual words extracted by the key patch filter for different categories. Ultimately, in each task, the final cluster division is determined by assigning weights to the words contained within the image samples. To evaluate the effectiveness of the proposed method, it is tested on multi-task datasets created from four datasets: NUS-Wide, Pascal VOC, Caltech-256, and Cifar-100. The experimental results substantiate the efficacy of the proposed method. Keywords: Multi-task clustering

1

· Deep clustering · Image clustering

Introduction

In the era of big data, the rapid development of information technology has led to the proliferation of a substantial amount of unlabeled image data, thus the utilization of unsupervised image clustering methods [7,14] have become prevalent in the field of machine vision. It is common for image data from different Supported by Central Government Guides Local Science and Technology Development Fund Projects (236Z0301G); Hebei Natural Science Foundation (F2022201009); Science and Technology Project of Hebei Education Department (QN2023186). c The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2024  Q. Liu et al. (Eds.): PRCV 2023, LNCS 14428, pp. 126–138, 2024. https://doi.org/10.1007/978-981-99-8462-6_11

APFMTC

127

domains to be associated with distinct tasks. Besides that, the samples within these tasks often exhibit a high degree of correlation. For instance, consider a scenario where one task involves image samples related to bicycles, while another task comprises image samples associated with motorcycles. Although these two categories belong to different classes, they share similar local features such as wheels, handlebars, etc. Consequently, exploring and extracting relevant information from dissimilar but analogous samples across multiple tasks can greatly facilitate the attainment of enhanced cluster division within each individual task, which benefits from the discovery of shared patterns and features. However, each traditional image clustering algorithm is a single-task clustering algorithm, which cannot mine the correlation between different but related image data sets belonging to different tasks. Consequently, many methods of multi-task image clustering have been proposed. Multi-task Clustering (MTC) aims to improve the performance of each task by exploiting related information between tasks. In recent years, many researchers applied the MTC method to different multi-task environments and achieved better results. For example, Cao et al. [2] proposed an MTC method for clustering by exploring inter-task, inter-cluster and feature-to-feature correlations, Yan et al. [20] and Zhang et al. [25] adopted EM distance and JS Divergence to measure the distance between paired clusters from different tasks and discover potential correlations. Zhang et al. [23] and Yang et al. [21] represent task dependencies by mapping multi-task data to subspaces [13] to achieve good results. Zhong and Pun [26] transfer knowledge between tasks by introducing joint projections of heterogeneous features while exploring discriminative information in low-dimensional subspaces to obtain clustering results for multiple tasks. Yan et al. [17] formulated the MTC problem as an information loss minimization function, and quantified multiple action clustering tasks through the distribution correlation of clusters in different tasks, sharing information among them to improve the performance of individual tasks. However, due to the problems of the existing MTC methods such as they cannot capture the nonlinear nature of complex multi-task data, researchers began to focus on combining deep image clustering methods to solve the MTC problem. Motivated by the powerful nonlinear learning ability of Deep Neural Network (DNN), many deep image clustering methods have been proposed in recent years, such as some deep clustering methods based on semi-supervised learning [9] [10] [22] [11] and unsupervised methods [3,8,15] that use pseudo-labels as supervised information to reduce DNN loss. In addition, Xu et al. [16] combined contrastive learning with neighborhood relationship mining to propose nearest-neighbor contrastive clustering. Vilhagra et al. [12] used DNN to perform feature conversion on samples and then used traditional clustering loss for clustering. Asano et al. [1] transformed the image clustering problem into a self-labeling task. Ji et al. [6] directly train a randomly initialized neural network into a classification function. Promising performance have been obtained for all the above deep image clustering methods focusing on improving the clustering performance of a single task through the powerful feature extraction ability of DNN. Therefore, research

128

Z. Tian et al.

on using DNN to solve MTC problems has emerged. For example, Yan et al. [18] performed clustering of multiple tasks by maximizing the relevant information and minimizing the irrelevant information between multiple tasks. Zhang et al. [24] employed graph neural networks to learn node embeddings that facilitate downstream clustering, and proposed a multi-task embedding learning method for attribute graph clustering. Yan et al. [19] explored multi-task correlation and clustered images simultaneously by transferring training parameters among multiple deep convolutional neural networks. However, this transfer learning method based on the parameter sharing mechanism shares relevant information between multiple tasks while also making irrelevant redundant information participate in the execution of each task, which may lead to poor clustering results. The proposed APFMTC addresses the challenge of removing redundant information between different tasks while extracting relevant information to enhance cluster division. Specifically, the method partitions the image sample into multiple patches, considering each patch as a discrete “word.” Consequently, the image is treated as an “article,” and the clustering process is framed as categorizing the “article” into specific categories. Key patches are extracted and bag of visual words is constructed from multi-task image samples. Furthermore, a weight estimation module is devised to assess the significance of different visual words with respect to various categories. Ultimately, within each task, the final cluster division is determined by assigning appropriate weights to the words contained in the image samples. This approach enables the extraction of crucial information while minimizing the influence of redundant information, ultimately leading to improved cluster division within each individual task. The main contributions of this study are summarized as follows: (1) A novel key information extraction mechanism is proposed, patches with higher attention scores obtained by the Transformer training are extracted as key information to remove redundant information in image samples and form bag of visual words. (2) A weight estimation module is devised within APFMTC, which aims to effectively capture the relevance and significance of each word in the clustering process, and allows for a more accurate representation of the visual characteristics and discriminative information contained within the image samples.

2 2.1

Multi-task Image Clustering with Attention-Guided Patch Filtering and Correlation Mining Overview

The overall framework of the proposed APFMTC is depicted in Fig. 1. Initially, the original image data undergoes auxiliary clustering to generate pseudo-labels. These pseudo-labels are then utilized for training the Vision Transformer (ViT) network comprehensively. Subsequently, key patch filter is designed to extract patches from each image sample that contain more valuable information, and a distance-based clustering approach is employed to construct a bag of visual words (BoVW) with extracted patches.

APFMTC

129

To assign appropriate weights to each word in BoVW, a specifically designed weights estimation module is incorporated. This module evaluates the importance and relevance of each word for different categories. Finally, the weights obtained from the weights estimation module are utilized in subsequent multiple clustering tasks. These weights play a crucial role in influencing the clustering outcomes within each task. The overall framework integrates the steps of auxiliary clustering, Vision Transformer training, key patch filter, BoVW construction, and weights estimation module. This method enables improved performance and accuracy in multi-task image clustering.

Fig. 1. The overall framework of the algorithm proposed in this paper.

2.2

Key Patch Filter

Building upon the recent success of ViT, which inspired us to partition images into patches and treat each patch as a “word,” this paper has developed an endto-end deep multitasking clustering network. To begin, the method proposed in this paper employs a distance-based clustering algorithm to facilitate the clustering process and generate pseudo labels. These pseudo labels are then used for comprehensive training of the ViT network, where the parameters are optimized using the pseudo labels obtained from the auxiliary clustering step. Subsequently, attention scores obtained from the trained attention layers are utilized to construct an attention heatmap. This heatmap can identify the regions of the image sample that significantly influence the classification results. By leveraging the attention heatmap, the more valuable patches regarded as key information can be extracted effectively. The attention heatmap generation method employed in this study is derived from the work of Chefer et al. [4], which builds upon the deep Taylor decomposition approach. It facilitates the identification of key regions and aids in the

130

Z. Tian et al.

subsequent extraction of more informative patches, thus contributing to the overall performance improvement of the proposed clustering algorithm. The heatmap generation formula is as follows: sji = Sθ (xji ),

1 ≤ j ≤ m, 1 ≤ i ≤ N j

(1)

where sji is the attention score metric of the ith image in the jth data set generated by the attention heatmap generating function, N j is the number of images in the ith data set, θ is the parameters of ViT. Then the matrix is divided into T patches, and the attention score for each patch is calculated. The n patches with the highest attention score can be obtained: max{(s1p )ji , (s2p )ji , ..., (sTp )ji } n

(2)

where (stp )ji is the attention score of tth patch of ith image in jth data set. According to the n patches with the highest attention score, the most valuable n patches in each image can be extracted for constructing a visual dictionary. 2.3

Weights Estimation

In the field of natural language processing (NLP), it is common to weight keywords in articles according to TF-IDF and then perform sentiment or category analysis on the articles, which has been widely used with good performance. Inspired by the successful application of TF-IDF in NLP and of image patching in computer vision, the method proposed in this paper first regards each patch as a word and each image sample as an article composed of words. Unlike the textual setting of TF-IDF, where words are discrete symbols and identical words recur across different articles, patches in images are never exactly identical; however, objects belonging to the same category are represented by similar patches with small distances between them. For example, in different vehicle images, patches containing wheels lie close to each other because they describe the same type of object, so we can treat these different patches as the same word. Therefore, after obtaining the valuable patches based on the attention heatmap, we cluster them with the popular k-means algorithm, record the centroids, and represent each patch by the index of the cluster it belongs to, thereby constructing the BoVW. After obtaining the visual dictionary, the designed weights estimation formula is used to assign weights to different words:

$w_k = \frac{pw_k}{aw_k} \times \left(1 - \frac{npw_k}{naw_k}\right) \qquad (3)$

where $pw_k$ is the number of occurrences of a given visual word in category $k$, $aw_k$ is the total number of words in category $k$, $npw_k$ is the number of occurrences of this visual word in all classes except class $k$, and $naw_k$ is the total number of words in all classes except class $k$.
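As a concrete reading of Eq. (3), the following sketch computes one weight per (cluster, visual word) pair from word-count statistics. It is a minimal illustration under the assumption that per-cluster word histograms are available; the variable names are mine, not the paper's.

```python
import numpy as np

def word_weights(counts: np.ndarray) -> np.ndarray:
    """counts[k, w] = occurrences of visual word w in cluster/category k.
    Returns weights[k, w] following Eq. (3): (pw_k / aw_k) * (1 - npw_k / naw_k).
    """
    total_per_class = counts.sum(axis=1, keepdims=True)        # aw_k
    pw_over_aw = counts / np.maximum(total_per_class, 1)       # pw_k / aw_k
    npw = counts.sum(axis=0, keepdims=True) - counts           # word occurrences outside k
    naw = counts.sum() - total_per_class                       # total words outside k
    return pw_over_aw * (1.0 - npw / np.maximum(naw, 1))

# Example: 4 categories, a visual dictionary of 250 words
counts = np.random.randint(0, 20, size=(4, 250))
W = word_weights(counts)   # shape (4, 250)
```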

2.4 Multi-task Clustering

In each individual task, after dividing the image into several patches with the same dimension as the words in the visual dictionary, the Euclidean distance between each patch and each word in the visual dictionary is calculated, so that each patch is assigned to the visual word it belongs to. The word representation function is as follows:

$w_t = v\big((x_p^t)_i\big) \qquad (4)$

where $(x_p^t)_i$ is the $t$th patch of the $i$th image sample, and $v(\cdot)$ is the word representation function. Thus the visual word $w_t$ to which patch $(x_p^t)_i$ belongs can be computed. The $i$th image sample can then be represented as a vector of visual words:

$x_i = \{w_1, w_2, \ldots, w_t, \ldots, w_T\} \qquad (5)$

where $T$ is the number of patches of each image. Each element in the vector represents a visual word, and these words should have different weights according to their importance to different categories. Using the weights obtained in the weights estimation module, the words are weighted as follows:

$S_k = \sum_{t=1}^{T} w_k^t \qquad (6)$

where $w_k^t$ is the weight of the $t$th patch with respect to the $k$th cluster, and $S_k$ is the score of the sample for the $k$th cluster. Each sample yields $K_{gt}$ scores $S_k$, and the highest one is selected after comparison; the corresponding $k$ is taken as the cluster the sample should belong to. The final cluster division is obtained by computing $S_k$ for all samples and selecting the largest $S_k$ of each sample.
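Reading Eqs. (4)-(6) together, the assignment step maps each patch to its nearest visual word, sums the per-cluster weights of those words, and picks the cluster with the highest score. A minimal sketch under that reading follows; the helper name and the use of precomputed centroids and weights are assumptions for illustration, not the authors' implementation.

```python
import numpy as np

def assign_cluster(patches: np.ndarray, centroids: np.ndarray, weights: np.ndarray) -> int:
    """patches:   (T, d)   patch descriptors of one image
    centroids: (n_w, d) visual-word centroids produced by k-means
    weights:   (K, n_w) per-cluster word weights from Eq. (3)
    Returns the index k of the cluster with the largest score S_k (Eq. (6)).
    """
    # Eqs. (4)-(5): nearest visual word for every patch (Euclidean distance)
    d2 = ((patches[:, None, :] - centroids[None, :, :]) ** 2).sum(-1)  # (T, n_w)
    words = d2.argmin(axis=1)                                          # (T,)
    # Eq. (6): accumulate the weight of each assigned word for every cluster
    scores = weights[:, words].sum(axis=1)                             # (K,)
    return int(scores.argmax())
```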

3 Experiments

3.1 Data Sets

In order to verify the effectiveness of APFMTC, we used four multi-task image data sets [19], whose samples are drawn from the Caltech-256, NUS-Wide, Cifar-100 and Pascal VOC datasets. Each multi-task image data set contains several different tasks, and the categories in different tasks share certain similarities. The details of the data sets are shown in Tables 1, 2, 3 and 4.

Table 1. Detailed information of the constructed multi-task data sets, from the NUS-Wide dataset.

        | Task 1               | Task 2             | Task 3                 | Task 4
Class 1 | Activity.Dance(314)  | Activity.Run(341)  | Activity.Throw(470)    | Activity.Walk(225)
Class 2 | Animal.Horse(356)    | Animal.Zebra(412)  | Animal.Cow(412)        | Animal.Tiger(361)
Class 3 | Vehicle.Bicycle(471) | Vehicle.Car(435)   | Vehicle.Motorbike(451) | Vehicle.Bus(472)
Class 4 | Flower.Lily(439)     | Flower.Tulip(392)  | Flower.Rose(298)       | Flower.Orchild(471)


Table 2. Detailed information of the constructed multi-task data sets, from the Caltech-256 dataset.

        | Task 1               | Task 2              | Task 3
Class 1 | Gun.Ak47(314)        | Gun.Rifle(106)      | Gun.Ak47(99)
Class 2 | Aircraft.Jet(99)     | Aircraft.Copter(88) | Aircraft.Plane(140)
Class 3 | Hat.Tophat(80)       | Hat.Cowboyhat(110)  | Hat.Yarmulke(84)
Class 4 | Bike.Tourbike(110)   | Bike.Motorbike(98)  | Bike.Hillbike(82)
Class 5 | Bird.Cormorant(106)  | Bird.Ibis(101)      | Bird.Duck(87)
Class 6 | Ball.Soccer(106)     | Ball.Tennis(87)     | Ball.Golf(101)

Table 3. Detailed information of the constructed multi-task data sets, from the Cifar-100 dataset.

        | Task 1                | Task 2                 | Task 3
Class 1 | Rodents.hamster(400)  | Rodents.mouse(400)     | Rodents.squirrel(400)
Class 2 | Tree.oak(400)         | Tree.palm(400)         | Tree.pine(400)
Class 3 | Pantherinae.cat(400)  | Pantherinae.lion(400)  | Pantherinae.tiger(400)
Class 4 | Fish.dolphin(400)     | Fish.whale(400)        | Fish.shark(400)
Class 5 | Insect.bee(400)       | Insect.beetle(400)     | Insect.cock(400)

Table 4. Detailed information of the constructed multi-task data sets, from the Pascal VOC dataset.

        | Task 1                | Task 2
Class 1 | Livestock.sheep(300)  | Livestock.cow(300)
Class 2 | Bike.bicycle(300)     | Bike.motor(300)
Class 3 | Vehicle.car(300)      | Vehicle.bus(300)
Class 4 | Furniture.chair(300)  | Furniture.sofa(300)

3.2 Baselines

The method proposed in this paper is compared with three deep clustering algorithms, DC [3], DCCM [15] and SL [1], and their full-task versions ALLDC, ALLDCCM and ALLSL; with the multi-task clustering algorithms MTMVC [25] and MICCP [5]; and with the multi-task deep clustering algorithm DMTC [19].

3.3 Experimental Setup

The experiments use the Vision Transformer as the backbone network for auxiliary clustering, during which several different values of the auxiliary clustering number $K_{fake}$ are used as a guide. After obtaining the patches with high attention scores, several different values of $n_w$ are used to cluster these patches and generate the BoVW. The experiments also try different values of $n$ for patch extraction, i.e., different numbers of patches with the highest attention scores are extracted from each sample. All baselines are reproduced according to the settings reported by their authors. Two commonly used evaluation indicators in image clustering, ACC and NMI, are selected as performance references, and the reported values of each method are averaged over 10 runs. All experiments are conducted on hardware consisting of eight Intel(R) Xeon(R) Platinum 8352Y CPUs @ 2.20 GHz and two NVIDIA A6000 GPUs.

3.4 Comparison with the Latest Methods

The results of comparing APFMTC with the reference baselines on the multi-task dataset constructed from NUS-Wide are shown in Table 5. Except for the NMI of Task 3 and Task 4, which are slightly lower than those of DMTC, all remaining indicators are the best results among all baselines. On the multi-task dataset constructed from Caltech-256, shown in Table 6, only the NMI of Task 3 is not ideal, and the remaining indicators are the best results among all baselines.

Table 5. Clustering results (%) on NUS-Wide. The best results are highlighted in bold.

Method   | Task 1 ACC / NMI | Task 2 ACC / NMI | Task 3 ACC / NMI | Task 4 ACC / NMI
DC       | 20.45 / 17.08    | 25.62 / 19.73    | 38.01 / 25.10    | 32.40 / 31.41
DCCM     | 19.76 / 9.93     | 18.82 / 16.85    | 23.31 / 19.49    | 25.35 / 20.43
SL       | 28.36 / 14.99    | 37.09 / 16.26    | 41.46 / 15.60    | 34.91 / 21.77
ALLDC    | 22.48 / 22.69    | 21.99 / 16.44    | 34.09 / 20.34    | 27.24 / 22.38
ALLDCCM  | 24.71 / 11.95    | 22.90 / 12.62    | 20.31 / 11.45    | 22.24 / 19.44
ALLSL    | 33.30 / 9.55     | 36.66 / 14.85    | 36.19 / 15.17    | 33.35 / 14.39
MTMVC    | 35.68 / 5.62     | 38.03 / 7.10     | 40.56 / 6.09     | 37.48 / 12.67
MICCP    | 58.11 / 18.33    | 55.52 / 18.98    | 51.13 / 17.31    | 49.32 / 15.51
DMTC     | 58.69 / 28.12    | 62.94 / 29.81    | 68.00 / 40.09    | 72.66 / 41.75
APFMTC   | 62.26 / 29.02    | 64.29 / 31.83    | 71.49 / 39.66    | 73.22 / 39.36


Table 6. Clustering results (%) on Caltech-256. The best results are highlighted in bold.

Method   | Task 1 ACC / NMI | Task 2 ACC / NMI | Task 3 ACC / NMI
DC       | 26.43 / 24.99    | 28.96 / 22.47    | 28.12 / 26.45
DCCM     | 27.68 / 17.67    | 28.58 / 26.83    | 33.39 / 24.78
SL       | 32.93 / 19.14    | 33.77 / 13.94    | 39.61 / 18.41
ALLDC    | 25.50 / 18.26    | 21.43 / 18.77    | 34.11 / 27.50
ALLDCCM  | 29.03 / 17.66    | 28.62 / 21.63    | 31.24 / 24.99
ALLSL    | 27.46 / 18.38    | 33.44 / 20.07    | 42.62 / 20.31
MTMVC    | 34.80 / 18.29    | 34.04 / 15.96    | 43.21 / 25.31
MICCP    | 56.92 / 27.97    | 56.26 / 32.49    | 47.43 / 26.07
DMTC     | 64.09 / 50.44    | 60.87 / 46.06    | 64.74 / 57.65
APFMTC   | 65.29 / 53.39    | 63.76 / 57.93    | 65.59 / 52.19

According to Table 7 and Table 8, the proposed APFMTC achieves the best results among the baselines on Cifar-100. On Pascal VOC, APFMTC obtains the best clustering results on Task 1, while its ACC and NMI on Task 2 are lower than those of DMTC.

Table 7. Clustering results (%) on Cifar-100. The best results are highlighted in bold.

Method   | Task 1 ACC / NMI | Task 2 ACC / NMI | Task 3 ACC / NMI
DC       | 31.15 / 30.74    | 23.28 / 11.09    | 28.01 / 28.75
DCCM     | 47.59 / 32.12    | 32.86 / 23.23    | 37.94 / 32.14
SL       | 43.89 / 17.90    | 37.88 / 17.49    | 49.83 / 26.17
ALLDC    | 31.26 / 24.07    | 24.39 / 9.82     | 27.65 / 24.06
ALLDCCM  | 50.47 / 34.04    | 35.11 / 21.66    | 37.33 / 32.46
ALLSL    | 54.62 / 26.84    | 35.65 / 12.33    | 45.30 / 22.53
MTMVC    | 32.17 / 17.22    | 30.98 / 17.42    | 40.64 / 23.71
MICCP    | 24.90 / 4.85     | 21.31 / 2.28     | 23.19 / 4.40
DMTC     | 66.55 / 41.28    | 55.96 / 31.33    | 53.94 / 29.11
APFMTC   | 71.38 / 47.14    | 62.33 / 38.29    | 61.02 / 42.32

3.5 Impact of the Number of Clusters for Auxiliary Clustering

In order to explore the influence of the number of clusters used in auxiliary clustering on the final clustering results, different values of $K_{fake}$ are used in the auxiliary clustering process, namely $\{K_{gt}-2, K_{gt}-1, K_{gt}, K_{gt}+1, K_{gt}+2\}$, where $K_{gt}$ is the ground-truth number of clusters in each task.

Table 8. Clustering results (%) on Pascal VOC. The best results are highlighted in bold.

Method   | Task 1 ACC / NMI | Task 2 ACC / NMI
DC       | 30.88 / 7.20     | 35.36 / 19.49
DCCM     | 24.79 / 4.97     | 21.40 / 9.98
SL       | 35.75 / 16.32    | 35.79 / 15.04
ALLDC    | 32.72 / 11.23    | 32.70 / 16.39
ALLDCCM  | 28.99 / 7.91     | 20.35 / 8.10
ALLSL    | 34.10 / 11.89    | 35.76 / 14.07
MTMVC    | 32.14 / 6.82     | 34.24 / 9.91
MICCP    | 25.71 / 3.15     | 29.76 / 4.81
DMTC     | 66.55 / 30.74    | 79.39 / 55.90
APFMTC   | 69.72 / 47.87    | 78.26 / 51.28

As shown in Fig. 2, the best results are obtained when $K_{fake}$ is set to the same value as $K_{gt}$. The reason may be that the designed key patch filter relies on the statistics of the attention score of each patch, and the attention score is strongly related to the supervision information. When $K_{fake}$ equals $K_{gt}$, most of the clusters formed by auxiliary clustering consist of samples with strong correlation and high similarity, so the best results can be obtained.

Fig. 2. The impact of $K_{fake}$.

3.6 Impact of the Number of Extracted Patches

In the designed key patch filter, the $n$ patches with the highest attention scores in each sample are extracted. To explore the influence of different values of $n$ on the clustering results, we set $n$ to {5, 10, 13, 15, 18, 20}. As shown in Fig. 3, the best results are obtained when $n$ is set to 13. The reason may be that when $n$ is less than 13, the extracted local features are too few to represent the key information in the sample, while when $n$ is greater than 13, redundant information that is weakly correlated with the category of each sample is extracted, resulting in degraded clustering performance.

Fig. 3. The Impact of n.

3.7 Impact of Word Count in Visual Dictionaries

A BoVW is constructed in the proposed APFMTC for clustering. In this process, the $n$ patches extracted from each sample are clustered into $n_w$ clusters, and the obtained $n_w$ centroids are taken as the representations of the words in the visual dictionary. To observe the influence of different $n_w$, five experiments on the four datasets were performed with $n_w \in \{120, 150, 200, 250, 300\}$. As shown in Fig. 4, the clustering results change to different degrees with $n_w$, and the best results are obtained when $n_w$ is set to 250.

Fig. 4. The impact of $n_w$.

4 Conclusions

Inspired by the successful application of many NLP methods and ideas in machine vision, the proposed APFMTC divides each image into patches of the same size, regards each patch as a word and each image sample as an article composed of several words, and likens the process of determining the cluster to which an image belongs to categorizing articles. Ultimately, the clustering performance of multiple tasks can be improved by sharing local details and key information across different images. In the future, more methods and ideas from NLP can be applied to multi-task deep image clustering and other computer vision tasks. Researchers in computer vision should not restrict themselves to methods from this field alone; we believe that existing methods can be greatly improved by drawing on methods from other fields.


References

1. Asano, Y., Rupprecht, C., Vedaldi, A.: Self-labelling via simultaneous clustering and representation learning. In: International Conference on Learning Representations (2020)
2. Cao, W., Wu, S., Yu, Z., Wong, H.S.: Exploring correlations among tasks, clusters, and features for multitask clustering. IEEE Trans. Neural Netw. Learn. Syst. 30(2), 355–368 (2019)
3. Caron, M., Bojanowski, P., Joulin, A., Douze, M.: Deep clustering for unsupervised learning of visual features. In: Proceedings of the European Conference on Computer Vision (ECCV), pp. 132–149 (2018)
4. Chefer, H., Gur, S., Wolf, L.: Transformer interpretability beyond attention visualization. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 782–791 (2021)
5. Hu, S., Yan, X., Ye, Y.: Multi-task image clustering through correlation propagation. IEEE Trans. Knowl. Data Eng. 33(03), 1113–1127 (2021)
6. Ji, X., Henriques, J.F., Vedaldi, A.: Invariant information clustering for unsupervised image classification and segmentation. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 9865–9874 (2019)
7. Jiang, G., Wang, H., Peng, J., Chen, D., Fu, X.: Graph-based multi-view binary learning for image clustering. Neurocomputing 427, 225–237 (2021)
8. Park, S., et al.: Improving unsupervised image clustering with robust learning. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 12278–12287 (2021)
9. Ren, Y., Hu, K., Dai, X., Pan, L., Hoi, S.C., Xu, Z.: Semi-supervised deep embedded clustering. Neurocomputing 325, 121–130 (2019)
10. Shukla, A., Cheema, G.S., Anand, S.: Semi-supervised clustering with neural networks. In: 2020 IEEE Sixth International Conference on Multimedia Big Data (BigMM), pp. 152–161. IEEE (2020)
11. Sun, B., Zhou, P., Du, L., Li, X.: Active deep image clustering. Knowl.-Based Syst. 252, 109346 (2022)
12. Vilhagra, L.A., Fernandes, E.R., Nogueira, B.M.: TextCSN: a semi-supervised approach for text clustering using pairwise constraints and convolutional siamese network. In: Proceedings of the 35th Annual ACM Symposium on Applied Computing, pp. 1135–1142 (2020)
13. Wang, H., Feng, L., Yu, L., Zhang, J.: Multi-view sparsity preserving projection for dimension reduction. Neurocomputing 216, 286–295 (2016)
14. Wang, H., Yao, M., Jiang, G., Mi, Z., Fu, X.: Graph-collaborated auto-encoder hashing for multiview binary clustering. IEEE Transactions on Neural Networks and Learning Systems (2023)
15. Wu, J., et al.: Deep comprehensive correlation mining for image clustering. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 8150–8159 (2019)
16. Xu, C., Lin, R., Cai, J., Wang, S.: Deep image clustering by fusing contrastive learning and neighbor relation mining. Knowl.-Based Syst. 238, 107967 (2022)
17. Yan, X., Hu, S., Ye, Y.: Multi-task clustering of human actions by sharing information. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 6401–6409 (2017)
18. Yan, X., Mao, Y., Li, M., Ye, Y., Yu, H.: Multitask image clustering via deep information bottleneck. IEEE Transactions on Cybernetics (2023)


19. Yan, X., Shi, K., Ye, Y., Yu, H.: Deep correlation mining for multi-task image clustering. Expert Syst. Appl. 187, 115973 (2022)
20. Yan, Y., Ricci, E., Liu, G., Sebe, N.: Egocentric daily activity recognition via multitask clustering. IEEE Trans. Image Process. 24(10), 2984–2995 (2015)
21. Yang, Y., Ma, Z., Yang, Y., Nie, F., Shen, H.T.: Multitask spectral clustering by exploring intertask correlation. IEEE Trans. Cybern. 45(5), 1069–1080 (2015)
22. Zhang, H., Zhan, T., Basu, S., Davidson, I.: A framework for deep constrained clustering. Data Min. Knowl. Disc. 35, 593–620 (2021)
23. Zhang, X.L.: Convex discriminative multitask clustering. IEEE Trans. Pattern Anal. Mach. Intell. 37(01), 28–40 (2015)
24. Zhang, X., Liu, H., Zhang, X., Liu, X.: Attributed graph clustering with multi-task embedding learning. Neural Netw. 152, 224–233 (2022)
25. Zhang, X., Zhang, X., Liu, H., Liu, X.: Multi-task multi-view clustering. IEEE Trans. Knowl. Data Eng. 28(12), 3324–3338 (2016)
26. Zhong, G., Pun, C.M.: Local learning-based multi-task clustering. Knowl.-Based Syst. 255, 109798 (2022)

Deep Structure and Attention Aware Subspace Clustering

Wenhao Wu, Weiwei Wang(B), and Shengjiang Kong

School of Mathematics and Statistics, Xidian University, Xi'an 710071, China
[email protected]

Abstract. Clustering is a fundamental unsupervised representation learning task with wide application in computer vision and pattern recognition. Deep clustering utilizes deep neural networks to learn latent representation, which is suitable for clustering. However, previous deep clustering methods, especially image clustering, focus on the features of the data itself and ignore the relationship between the data, which is crucial for clustering. In this paper, we propose a novel Deep Structure and Attention aware Subspace Clustering (DSASC), which simultaneously considers data content and structure information. We use a vision transformer to extract features, and the extracted features are divided into two parts, structure features, and content features. The two features are used to learn a more efficient subspace structure for spectral clustering. Extensive experimental results demonstrate that our method significantly outperforms state-of-the-art methods. Our code will be available at https://github.com/cs-whh/DSASC.

Keywords: Deep clustering · Subspace clustering · Transformer

1 Introduction

Clustering, aiming to partition a collection of data into distinct groups according to similarities, is one fundamental task in machine learning and has wide applications in computer vision, pattern recognition, and data mining. In recent years, a significant amount of research has been dedicated to Self-Expressive-based (SE) subspace clustering due to its effectiveness in dealing with high-dimensional data. SE-based subspace clustering primarily assumes that each data point can be expressed as a linear combination of other data points within the same subspace. It generally consists of two phases. In the first phase, a self-representation matrix is computed to capture similarity between data points. Subsequently, spectral clustering is employed to obtain the data segmentation. Learning an effective self-representation matrix is critical for clustering. Therefore, various regularization methods have been designed to pursue clustering-friendly properties of the self-representation matrix. For example, Sparse Subspace Clustering (SSC) [10] uses the $\ell_1$-norm regularization to learn sparse representation.


The Low-Rank Representation (LRR) [16] uses the nuclear norm regularization to induce low-rank representation. You et al. [25] introduced an elastic net regularization to balance subspace preservation and connectivity. Although these methods have demonstrated impressive clustering performance, they are ineffective in dealing with complicated real data, which have non-linearity [13] and contain redundant information and corruptions.

Thanks to the powerful capability of Deep Neural Networks (DNNs) in representation learning, deep SE-based subspace clustering employs DNNs to integrate feature learning and self-representation, hoping that the latent features can reduce the redundancy and corruptions while satisfying the self-expressive assumption. The Convolutional AutoEncoder (CAE) has been widely used for image clustering. For instance, the Deep Subspace Clustering network (DSC) [13] adds a self-representation layer between the convolutional encoder and the decoder to learn the self-representation matrix in the latent subspaces. However, the convolutional operations focus on local spatial neighborhoods, having limited ability to capture long-range dependencies in images. To exploit the long-term dependence and extract more effective features, self-attention mechanisms and transformer architectures have emerged as leading techniques, delivering state-of-the-art performance in various computer vision tasks, including image classification [9], object recognition [4], and semantic segmentation [22].

The traditional DNNs learn the feature representation of each data point independently, ignoring the relationship between data points. In contrast, the Graph Convolutional Network (GCN) [15] learns the feature representation of connected data points collaboratively, which is helpful in exploiting the inherent feature of data. Based on GCN, the Attributed Graph Clustering (AGC) [27] and the Structural Deep Clustering Network (SDCN) [1] have been proposed for clustering data. However, these works focus on data that naturally exist in graphs, such as citation and community networks, and few pay attention to image datasets.

The self-attention mechanisms and the transformer architectures are effective in capturing long-range dependencies in images. At the same time, the GCN is advantageous in exploring the inherent feature of similar data. That inspires us to propose the Deep Structure and Attention aware Subspace Clustering (DSASC) for image datasets. Specifically, the proposed network couples the Vision Transformer (ViT) [9] and the GCN. The ViT is pre-trained on a large-scale image dataset using the state-of-the-art representation learning framework DINO (self-DIstillation with NO labels) [6], which is trained with two ViTs of the same architecture, one called the teacher network and the other called the student network. The teacher network receives input from a global perspective, and the student network receives input from a local perspective. DINO expects the output of the student network and the teacher network to be consistent, thus learning the semantic features of the data. Once the training is completed, the teacher network is used for feature extraction. However, DINO only considers the information within each image, ignoring the cluster structure within similar images. To capture the cluster structure within similar images, we construct a K-Nearest Neighbor (KNN) [8] graph for the intermediate features of the ViT and use the GCN to extract the features from the graph perspective.
We refer to the output of GCN as structural features and the output of ViT as content features.


To explore the underlying subspace structure of images, we learn the self-representation matrices for the content and structure features separately. It should be noted that the self-representation matrices of the content features and the structure features are learned collaboratively to facilitate each other. Finally, we use the fused self-representation matrix for spectral clustering. Our main contributions can be summarized as follows:

• We propose a novel Deep Structure and Attention aware Subspace Clustering method that takes into account both data content and structural information, enhancing the performance of image clustering.
• We combine the strengths of a vision transformer and a graph convolutional network to enhance feature learning efficiency and clustering performance.
• Extended experiments show that the proposed method outperforms some state-of-the-art baselines.

2 Proposed Method

ViT effectively captures long-range dependencies within images through self-attention mechanisms and the transformer architecture. We propose to use the pre-trained ViT to learn the content features of images. GCN can capture the structural features of data that exist in graph form. To capture the cluster structure within similar images, we organize the images as a graph using KNN and use GCN to learn the structural features. Note that the ViT and the GCN are coupled. The ViT, comprising 12 layers, extracts content features across different depths. Our approach employs features from the third, sixth, and ninth ViT layers to create KNN graphs and utilizes GCN to learn structural features. Finally, we fuse the features extracted from multiple layers into a unified structural feature. The self-representation matrices for the content features and structure features are learned to explore the underlying subspace structure of images.

Fig. 1 illustrates our proposed network, which contains two coupled feature learning modules and two self-representation modules. The Content Feature Learning Module includes a ViT with an added learnable class embedding (cls token) and image patches, which are processed by the Transformer Encoder. The cls token's output serves as features (yellow patches). The Structure Feature Learning Module utilizes ViT's intermediate features and employs GCN for feature extraction. The Self-Representation Module is a fully-connected layer without bias and activation function to simulate the self-expressive process.

To show the effectiveness of our method, we select 1000 sample images for 10 clusters (100 samples for each cluster) from the dataset STL-10 and obtain the affinity matrices using SSC [10] and our method, respectively. Fig. 2 illustrates the results. Note that SSC computes the self-representation of the raw images directly, while our method learns the self-representation of the features. The affinity matrix is obtained by adding the absolute value of the self-representation and its transpose. SSC's affinity matrix lacks favorable diagonal blocks for subspace clustering. Conversely, our method's matrix displays a distinct diagonal block structure.
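The final clustering step described above, building an affinity matrix from the self-representation matrix and feeding it to spectral clustering, can be sketched as follows. This is a minimal illustration that assumes a learned coefficient matrix C is already available; the use of scikit-learn's SpectralClustering is my choice here, not something the paper prescribes.

```python
import numpy as np
from sklearn.cluster import SpectralClustering

def cluster_from_self_representation(C: np.ndarray, n_clusters: int) -> np.ndarray:
    """C: (n, n) self-representation matrix; returns one cluster label per sample."""
    affinity = np.abs(C) + np.abs(C).T          # symmetric, non-negative affinity
    return SpectralClustering(
        n_clusters=n_clusters, affinity="precomputed", assign_labels="kmeans"
    ).fit_predict(affinity)
```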


Fig. 1. The overall framework of our model.

2.1 Content Feature Learning

DINO uses two networks with the same architecture during training, the teacher network $g_{\theta_t}$ and the student network $g_{\theta_s}$ with parameters $\theta_t$ and $\theta_s$, respectively. The output of both networks is followed by a softmax layer with temperature parameters to map the features into probability distributions $P_t(x)$ and $P_s(x)$. To train the two networks, DINO employs a multi-crop strategy [5] to segment each image $x$ into multiple views, each with distinct resolutions. More precisely, starting from an input image $x$, a set $V$ of diverse views is generated, comprising two global views denoted as $x_1^g$ and $x_2^g$, along with several local views of lower resolutions. The student network mainly receives input from the local views, while the teacher network receives input from the global views. At each iteration, the weights $\theta_s$ are updated through gradient descent to minimize the cumulative cross-entropy loss over a minibatch of input images:

$\min_{\theta_s} \sum_{x \in \{x_1^g, x_2^g\}} \; \sum_{x' \in V,\, x' \neq x} -P_t(x)\log P_s(x') \qquad (1)$
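A minimal sketch of this multi-crop cross-entropy objective is given below, assuming the teacher and student probability outputs for each view have already been computed; tensor names, shapes, and the averaging over terms are illustrative choices, not the authors' code.

```python
import torch

def dino_loss(teacher_probs: torch.Tensor, student_log_probs: torch.Tensor) -> torch.Tensor:
    """teacher_probs:     (2, B, K)  teacher softmax outputs on the two global views
    student_log_probs: (V, B, K)  student log-softmax outputs on all V views
    Implements Eq. (1): sum of -P_t(x) log P_s(x') over view pairs with x' != x.
    """
    total, n_terms = 0.0, 0
    for g in range(teacher_probs.shape[0]):          # global views fed to the teacher
        for v in range(student_log_probs.shape[0]):  # all views fed to the student
            if v == g:                                # skip the identical pair (assumes the
                continue                              # first two student views are the global crops)
            total = total + (-teacher_probs[g] * student_log_probs[v]).sum(dim=-1).mean()
            n_terms += 1
    return total / n_terms                            # averaged here for readability
```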

The parameters $\theta_t$ of the teacher network are derived by using the exponential moving average (EMA) technique on the student parameters $\theta_s$:

$\theta_t \leftarrow \lambda \theta_t + (1 - \lambda)\theta_s \qquad (2)$


Fig. 2. (a) Affinity matrix obtained by SSC (b) Affinity matrix obtained by our method.

where the value of $\lambda$ is selected based on a cosine schedule. To avoid training collapse, a centering operation is applied to the teacher network:

$g_t(x) \leftarrow g_t(x) + c \qquad (3)$

where $c$ is updated every mini-batch:

$c \leftarrow m c + (1 - m)\frac{1}{|B|}\sum_{i: x_i \in B} g_{\theta_t}(x_i) \qquad (4)$

where $m > 0$ is a rate parameter and $|B|$ is the batch size. In the training process, the local views are required to match the global views, thus compelling the network to learn semantic features rather than low-level features. After DINO training, $g_{\theta_t}$ is used for feature extraction, and the content features are obtained as $F_A = g_{\theta_t}(x)$.
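The teacher update in Eqs. (2)-(4) can be written compactly as below. This is a sketch under the assumption of standard PyTorch modules; lambda and m follow the schedules described in the text but are shown here as fixed scalars.

```python
import torch

@torch.no_grad()
def update_teacher(student: torch.nn.Module, teacher: torch.nn.Module,
                   center: torch.Tensor, teacher_out: torch.Tensor,
                   lam: float = 0.996, m: float = 0.9) -> torch.Tensor:
    # Eq. (2): EMA of the student parameters into the teacher
    for p_t, p_s in zip(teacher.parameters(), student.parameters()):
        p_t.mul_(lam).add_(p_s, alpha=1.0 - lam)
    # Eq. (4): EMA update of the center over the current teacher batch outputs
    batch_center = teacher_out.mean(dim=0)            # (K,)
    center.mul_(m).add_(batch_center, alpha=1.0 - m)
    # Eq. (3) is applied elsewhere as: teacher logits = teacher(x) + center, before the softmax
    return center
```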

2.2 Structure Feature Learning

ViT Intermediate Feature Extraction. ViT consists of a sequence of Transformer Encoders that take a sequence as input and produce an output sequence of equal length, in which features have been extracted by the attention mechanism. Let the raw image be $X$; the image is divided into $b$ blocks, and an additional cls token is added. Consider these $b + 1$ blocks as the input sequence, denoted as $M_0 \in \mathbb{R}^{(b+1)\times d}$, where $d$ is the dimension of each block. Feeding data into a ViT can be formalized as follows:

$M_{i+1} = T(M_i), \quad i = 0, 1, \ldots, L_1 \qquad (5)$

where $T$ is the transformer encoder and $L_1$ is the number of layers of the ViT. We extract some intermediate layers and let the indices of the selected layers be $S$. The extracted features can then be denoted as $\{M_k[0,:],\ k \in S\}$, where $M_k[0,:]$ refers to the first row of the matrix $M_k$, i.e., the feature corresponding to the cls token.


We denote the features as $H_1, H_2, \ldots, H_t$ with $t = |S|$ for simplicity, construct a KNN graph measured by cosine similarity for each feature, and obtain the adjacency matrices $A_1, A_2, \ldots, A_t$.

GCN Module. To effectively extract features and learn node representations from graph structures, we utilize the GCN module for feature extraction. The GCN module can be expressed as follows:

$H_i^{(0)} = H_i, \quad i = 1, 2, \ldots, t$
$H_i^{(l)} = \phi\left(D_i^{-\frac{1}{2}} \tilde{A}_i D_i^{-\frac{1}{2}} H_i^{(l-1)} W_i^{(l-1)}\right), \quad l = 1, 2, \ldots, L_2 \qquad (6)$

where $H_i$ is the feature obtained from the ViT intermediate layer, $W_i^{(l-1)}$ is the learnable parameter, $D_i$ is the degree matrix, $L_2$ is the number of graph convolution layers, $\tilde{A}_i = A_i + I$ takes into account the self-connectivity of nodes, and $\phi$ is the activation function.

Features Fusion. To fuse the learned structure features, we use an adaptive method with learnable parameters $W_i$. This method avoids some deviations caused by artificial design and allows the network to identify which feature is more important:

$F_S = \sum_{i=1}^{t} H_i^{(L_2)} W_i \qquad (7)$
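The sketch below illustrates one reading of Eq. (6): build a cosine-similarity KNN adjacency, symmetrically normalize it, and apply a graph-convolution layer. It is a simplified NumPy illustration rather than the authors' implementation; in particular the symmetrization of the KNN graph and the choice of ReLU as the activation are assumptions.

```python
import numpy as np

def knn_adjacency(H: np.ndarray, k: int = 10) -> np.ndarray:
    """H: (n, d) features. Returns a symmetric 0/1 KNN adjacency based on cosine similarity."""
    Hn = H / np.linalg.norm(H, axis=1, keepdims=True)
    sim = Hn @ Hn.T
    np.fill_diagonal(sim, -np.inf)                   # exclude self from the neighbour search
    A = np.zeros_like(sim)
    idx = np.argsort(sim, axis=1)[:, -k:]            # k most similar nodes per row
    np.put_along_axis(A, idx, 1.0, axis=1)
    return np.maximum(A, A.T)                        # symmetrize

def gcn_layer(A: np.ndarray, H: np.ndarray, W: np.ndarray) -> np.ndarray:
    """One layer of Eq. (6) with ReLU as the activation phi."""
    A_tilde = A + np.eye(A.shape[0])                 # add self-connections
    d = A_tilde.sum(axis=1)
    D_inv_sqrt = np.diag(1.0 / np.sqrt(d))
    return np.maximum(D_inv_sqrt @ A_tilde @ D_inv_sqrt @ H @ W, 0.0)
```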

2.3 Training Strategy

To explore the internal subspace structure of the data, we use a fully-connected layer without bias and activation function to simulate the self-expressive process and learn the self-expressive matrix of content features. In particular, we minimize the following objective:

$\min_{C_A} \|C_A\|_1 + \lambda_1 \|F_A - F_A C_A\|_F^2 \qquad (8)$

where $C_A \in \mathbb{R}^{n\times n}$ is the self-expressive matrix of content. Similar to the content self-representation, we adopt the same way to learn the self-representation matrix $C_S$ of structural features:

$\min_{C_S} \|C_S\|_1 + \lambda_2 \|F_S - F_S C_S\|_F^2 \qquad (9)$

After acquiring $C_A$ and $C_S$, we combine them to construct the final affinity graph $C_F$ for spectral clustering. Instead of adopting the method proposed by previous work [18], we adopt a simple yet effective fusion strategy: we add the two self-expressive matrices to obtain the fused self-expressive matrix:

$C_F = C_A + C_S \qquad (10)$


This approach can enhance self-expressive consistency and alleviate the error caused by random noise. We also add a sparsity constraint to $C_F$, ensuring that a meaningful representation is learned:

$\min_{C_F} \|C_F\|_1 \qquad (11)$

The total loss function is:

$L = \|C_A\|_1 + \lambda_1 \|F_A - F_A C_A\|_F^2 + \|C_S\|_1 + \lambda_2 \|F_S - F_S C_S\|_F^2 + \|C_F\|_1 \qquad (12)$
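For concreteness, the total objective in Eq. (12) can be written as a differentiable loss over the two self-expressive matrices, as sketched below; the PyTorch formulation and the convention that feature columns are samples are my assumptions, not the authors' released code.

```python
import torch

def dsasc_loss(F_A: torch.Tensor, F_S: torch.Tensor,
               C_A: torch.Tensor, C_S: torch.Tensor,
               lam1: float = 1.0, lam2: float = 1.0) -> torch.Tensor:
    """Eq. (12). F_A, F_S: (d, n) feature matrices whose columns are samples;
    C_A, C_S: (n, n) learnable self-expressive matrices."""
    C_F = C_A + C_S                                   # Eq. (10)
    rec_A = ((F_A - F_A @ C_A) ** 2).sum()            # ||F_A - F_A C_A||_F^2
    rec_S = ((F_S - F_S @ C_S) ** 2).sum()            # ||F_S - F_S C_S||_F^2
    return (C_A.abs().sum() + lam1 * rec_A
            + C_S.abs().sum() + lam2 * rec_S
            + C_F.abs().sum())
```

After training, the fused matrix C_F is the affinity used for the spectral clustering step sketched earlier.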

3 Experiments

In this section, we first present the datasets used in our experiments, the implementation details, and the baselines. Then we present the experimental results and conduct an analysis to validate the efficacy of our proposed method.

3.1 Experiments Settings

Datasets. We evaluate our method on four image benchmark datasets including CIFAR10, STL-10, Fashion-MNIST and CIFAR100. The CIFAR-10 dataset comprises 60,000 RGB images depicting 10 distinct objects. The STL-10 dataset consists of 13,000 RGB images, depicting 10 distinct objects, with each image having a resolution of 96 × 96 pixels. The Fashion-MNIST dataset consists of 70,000 grayscale images of various fashion products, classified into 10 categories, with a resolution of 28 × 28 pixels. The CIFAR100 dataset is a more challenging dataset that consists of 60,000 images for 100 different objects. There are 600 images per class. For the computation efficiency of the self-representation matrix, we randomly take 1000 images from CIFAR10, STL-10, and Fashion-MNIST for cluster analysis. For the CIFAR100 dataset, we randomly take 3000 images of the 100 objects (300 images for each object) for clustering.

Implementation Details. We use the ViT-S/8 architecture, which was trained on ImageNet-1k (without labels), for content feature extraction. We use the pretrained parameters provided by the authors. During our training, the parameters of the ViT are frozen. The GCN module consists of 2 layers of graph convolution, and the ReLU activation function is used. To construct the KNN graph, we set K = 10. The $\lambda_1$ and $\lambda_2$ in Eq. 12 are set at 1. The Adam optimizer is used to minimize the loss, and the learning rate is set to $10^{-5}$. We train for 2000 epochs and report the final results.


Table 1. Clustering results of different methods on benchmark datasets. The top-performing results are highlighted in bold, while the second-best results are marked with an underline.

Method  | CIFAR10 ACC / NMI | STL10 ACC / NMI  | Fashion-MNIST ACC / NMI | CIFAR100 ACC / NMI
k-means | 0.2290 / 0.0840   | 0.1920 / 0.1250  | 0.4740 / 0.5120         | 0.1300 / 0.0840
LSSC    | 0.2114 / 0.1089   | 0.1875 / 0.1168  | 0.4740 / 0.5120         | 0.1460 / 0.0792
LPMF    | 0.1910 / 0.0810   | 0.1800 / 0.0960  | 0.4340 / 0.4250         | 0.1180 / 0.0790
DEC     | 0.3010 / 0.2570   | 0.3590 / 0.2760  | 0.5180 / 0.5463         | 0.1850 / 0.1360
IDEC    | 0.3699 / 0.3253   | 0.3253 / 0.1885  | 0.5290 / 0.5570         | 0.1916 / 0.1458
DCN     | 0.3047 / 0.2458   | 0.3384 / 0.2412  | 0.5122 / 0.5547         | 0.2017 / 0.1254
DKM     | 0.3526 / 0.2612   | 0.3261 / 0.2912  | 0.5131 / 0.5557         | 0.1814 / 0.1230
VaDE    | 0.2910 / 0.2450   | 0.2810 / 0.2000  | 0.5039 / 0.5963         | 0.1520 / 0.1008
DSL     | 0.8340 / 0.7132   | 0.9602 / 0.9190  | 0.6290 / 0.6358         | 0.5030 / 0.4980
EDESC   | 0.6270 / 0.4640   | 0.7450 / 0.6870  | 0.6310 / 0.6700         | 0.3850 / 0.3700
CDEC    | 0.5640 / 0.5778   | 0.7328 / 0.7183  | 0.5351 / 0.5404         | -      / -
DML     | 0.8415 / 0.7170   | 0.9645 / 0.9211  | 0.6320 / 0.6480         | 0.5068 / 0.5019
SENet   | 0.7650 / 0.6550   | 0.8232 / 0.7541  | 0.6970 / 0.6630         | -      / -
Our     | 0.8740 / 0.8059   | 0.9740 / 0.9491  | 0.6680 / 0.6739         | 0.5200 / 0.6865

Comparison Algorithms. We compare our method DSASC with three traditional methods: k-means [17], large-scale spectral clustering (LSSC) [7], locality preserving non-negative matrix factorization (LPMF) [2], and ten deep clustering methods: deep embedding clustering (DEC) [23], improved deep embedded clustering (IDEC) [12], deep clustering network (DCN) [24], deep k-means (DKM) [11], variational deep embedding (VaDE) [14], deep successive subspace learning (DSL) [19], efficient deep embedded subspace clustering (EDESC) [3], contrastive deep embedded clustering (CDEC) [21], deep multi-representation learning (DML) [20], and self-expressive network (SENet) [26].

3.2 Results and Discussion

We use two commonly used metrics to evaluate clustering performance, namely clustering accuracy (ACC) and normalized mutual information (NMI). The clustering results are presented in Table 1, where the best results are emphasized in bold and the second-best results are marked with an underline. As evident from the table, DSASC significantly improves the clustering performance. DSASC improves the ACC and NMI of DML on CIFAR10 by 3.25% and 8.89%, respectively. The non-significant improvement of DSASC on Fashion-MNIST is due to the fact that Fashion-MNIST consists of gray-scale images, while our ViT is pre-trained on color images and suffers from domain shift. But our method still achieves competitive results on Fashion-MNIST. That suggests that combining structure features and content features effectively improves the discrimination of the feature representation and the clustering performance.


Fig. 3. The first column is the anchor image. We sort the numerical values in the self-expressive matrix. Columns 2 to 6 are the images corresponding to the top-5 values (most similar) and columns 7 to 11 are the images corresponding to the bottom-5 values (least similar).

Table 2. The ablation experiment results. The top-performing results are highlighted in bold.

Datasets      | Content SE ACC / NMI | Our Method ACC / NMI
CIFAR10       | 0.8370 / 0.7430      | 0.8740 / 0.8059
STL10         | 0.9380 / 0.8886      | 0.9740 / 0.9491
Fashion-MNIST | 0.6560 / 0.6741      | 0.6680 / 0.6739
CIFAR100      | 0.3527 / 0.5220      | 0.5200 / 0.6865

In order to confirm the efficacy of our proposed method, we conduct ablation experiments on the structure feature learning module. We remove the structural feature learning module and learn the content self-representation matrix $C_A$ using Eq. 8, then perform spectral clustering. We denote this method as Content SE. Table 2 presents the results. It can be seen that jointly using content features and structural features significantly improves the clustering performance.


Fig. 4. Cluster convergence analysis and parameter sensitivity analysis. The first row shows the results for CIFAR10 and the second row shows the results for Fashion-MNIST. (a)(e) Changes of clustering ACC and NMI with training epochs. (b)(f) The influence of different K on the clustering results. (c)(g) The influence of different λ1 on the clustering results. (d)(h) The influence of different λ2 on the clustering results.

To further examine the quality of the self-expressive matrix obtained through our approach, we randomly select three anchor images from the STL10 and Fashion-MNIST datasets, respectively. Subsequently, we determine the five largest and five smallest representation coefficients of the anchor images derived from the self-representation matrix $C_F$. These coefficients indicate the most similar and least similar images, respectively. This analysis provides insights into the effectiveness of the matrix and its ability to capture meaningful patterns. The result is shown in Fig. 3, where the first column shows the anchor images, the next five columns the most similar images, and the last five the most dissimilar images. It can be seen that our method can effectively learn the subspace structure, leading to satisfactory clustering outcomes.

Figure 4(a) and Fig. 4(e) show the change of ACC and NMI during training; the accuracy gradually improves as training proceeds, which indicates that our method converges effectively. We additionally investigate the impact of hyperparameters on the experimental results. The hyperparameters include the value of K in the GCN Module, and $\lambda_1$ and $\lambda_2$ in Eq. 12. In the following experiments, the default value of K is 10, the default value of $\lambda_1$ and $\lambda_2$ is 1, and when one parameter is investigated, the other parameters are kept at their defaults. We select K ∈ {3, 5, 8, 10, 15, 20} and $\lambda_1, \lambda_2$ ∈ {0.01, 0.1, 1, 10, 100} to evaluate their influence. The corresponding ACC and NMI of the clustering results are presented in Figs. 4(b) to 4(d) for CIFAR10 and Figs. 4(f) to 4(h) for Fashion-MNIST, respectively. As we can see from these figures, our method is not sensitive to the selection of the number of neighbors, and $\lambda_2$ also has little impact on the clustering performance. The content self-representation coefficient $\lambda_1$ dominates the clustering results.


When $\lambda_1$ increases, the clustering performance on CIFAR10 decreases, while the trend is the opposite on Fashion-MNIST. We chose $\lambda_1 = 1$ for all datasets in our experiments, which produced competitive results.

4 Conclusion

We propose a Deep Structure and Attention aware Subspace Clustering framework. Specifically, to discover the latent subspace structure of images, our network learns content features and structure features, as well as their sparse self-representations. Extended experiments show that our proposed method is highly effective in unsupervised representation learning and clustering.

Acknowledgements. The authors would like to thank the editors and the anonymous reviewers for their constructive comments and suggestions. This paper is supported by the National Natural Science Foundation of China (Grant Nos. 61972264, 62072312), Natural Science Foundation of Guangdong Province (Grant No. 2019A1515010894) and Natural Science Foundation of Shenzhen (Grant No. 20200807165235002).

References

1. Bo, D., Wang, X., Shi, C., Zhu, M., Lu, E., Cui, P.: Structural deep clustering network. In: Proceedings of the Web Conference 2020, pp. 1400–1410 (2020)
2. Cai, D., He, X., Wang, X., Bao, H., Han, J.: Locality preserving nonnegative matrix factorization. In: Twenty-First International Joint Conference on Artificial Intelligence, pp. 1010–1015 (2009)
3. Cai, J., Fan, J., Guo, W., Wang, S., Zhang, Y., Zhang, Z.: Efficient deep embedded subspace clustering. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 1–10 (2022)
4. Carion, N., Massa, F., Synnaeve, G., Usunier, N., Kirillov, A., Zagoruyko, S.: End-to-end object detection with transformers. In: Vedaldi, A., Bischof, H., Brox, T., Frahm, J.-M. (eds.) Computer Vision – ECCV 2020: 16th European Conference, Glasgow, UK, August 23–28, 2020, Proceedings, Part I, pp. 213–229. Springer International Publishing, Cham (2020). https://doi.org/10.1007/978-3-030-58452-8_13
5. Caron, M., Misra, I., Mairal, J., Goyal, P., Bojanowski, P., Joulin, A.: Unsupervised learning of visual features by contrasting cluster assignments. Adv. Neural. Inf. Process. Syst. 33, 9912–9924 (2020)
6. Caron, M., et al.: Emerging properties in self-supervised vision transformers. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 9650–9660 (2021)
7. Chen, X., Cai, D.: Large scale spectral clustering with landmark-based representation. In: Twenty-Fifth AAAI Conference on Artificial Intelligence (2011)
8. Cover, T., Hart, P.: Nearest neighbor pattern classification. IEEE Trans. Inf. Theory 13(1), 21–27 (1967)
9. Dosovitskiy, A., et al.: An image is worth 16x16 words: transformers for image recognition at scale. In: International Conference on Learning Representations (2020)


10. Elhamifar, E., Vidal, R.: Sparse subspace clustering: algorithm, theory, and applications. IEEE Trans. Pattern Anal. Mach. Intell. 35(11), 2765–2781 (2013)
11. Fard, M.M., Thonet, T., Gaussier, E.: Deep k-means: jointly clustering with k-means and learning representations. Pattern Recogn. Lett. 138, 185–192 (2020)
12. Guo, X., Gao, L., Liu, X., Yin, J.: Improved deep embedded clustering with local structure preservation. In: Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, pp. 1753–1759 (2017)
13. Ji, P., Zhang, T., Li, H., Salzmann, M., Reid, I.: Deep subspace clustering networks. In: Advances in Neural Information Processing Systems 30 (2017)
14. Jiang, Z., Zheng, Y., Tan, H., Tang, B., Zhou, H.: Variational deep embedding: an unsupervised and generative approach to clustering. arXiv preprint arXiv:1611.05148 (2016)
15. Kipf, T.N., Welling, M.: Semi-supervised classification with graph convolutional networks. arXiv preprint arXiv:1609.02907 (2016)
16. Liu, G., Lin, Z., Yan, S., Sun, J., Yu, Y., Ma, Y.: Robust recovery of subspace structures by low-rank representation. IEEE Trans. Pattern Anal. Mach. Intell. 35(1), 171–184 (2012)
17. Lloyd, S.: Least squares quantization in PCM. IEEE Trans. Inf. Theory 28(2), 129–137 (1982)
18. Peng, Z., Liu, H., Jia, Y., Hou, J.: Adaptive attribute and structure subspace clustering network. IEEE Trans. Image Process. 31, 3430–3439 (2022)
19. Sadeghi, M., Armanfard, N.: Deep successive subspace learning for data clustering. In: 2021 International Joint Conference on Neural Networks (IJCNN), pp. 1–8. IEEE (2021)
20. Sadeghi, M., Armanfard, N.: Deep multirepresentation learning for data clustering. IEEE Transactions on Neural Networks and Learning Systems (2023)
21. Sheng, G., Wang, Q., Pei, C., Gao, Q.: Contrastive deep embedded clustering. Neurocomputing 514, 13–20 (2022)
22. Strudel, R., Garcia, R., Laptev, I., Schmid, C.: Segmenter: transformer for semantic segmentation. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 7262–7272 (2021)
23. Xie, J., Girshick, R., Farhadi, A.: Unsupervised deep embedding for clustering analysis. In: International Conference on Machine Learning, pp. 478–487. PMLR (2016)
24. Yang, B., Fu, X., Sidiropoulos, N.D., Hong, M.: Towards k-means-friendly spaces: simultaneous deep learning and clustering. In: International Conference on Machine Learning, pp. 3861–3870. PMLR (2017)
25. You, C., Li, C.G., Robinson, D.P., Vidal, R.: Oracle based active set algorithm for scalable elastic net subspace clustering. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 3928–3937 (2016)
26. Zhang, S., You, C., Vidal, R., Li, C.G.: Learning a self-expressive network for subspace clustering. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 12393–12403 (2021)
27. Zhang, X., Liu, H., Li, Q., Wu, X.M.: Attributed graph clustering via adaptive graph convolution. arXiv preprint arXiv:1906.01210 (2019)

Broaden Your Positives: A General Rectification Approach for Novel Class Discovery

Yaqi Cai1, Nan Pu2, Qi Jia1, Weimin Wang1, and Yu Liu1(B)

1 International School of Information Science and Engineering, Dalian University of Technology, Dalian, China
[email protected], {jiaqi,wangweimin,liuyu8824}@dlut.edu.cn
2 Department of Information Engineering and Computer Science, University of Trento, Trento, Italy
[email protected]

Abstract. Novel category discovery (NCD), which is a challenging and emerging task, aims to cluster unlabelled instances with knowledge information transferred from labelled ones. A majority of recent state-of-the-art methods leverage contrastive learning to model labelled and unlabelled data simultaneously. Nevertheless, they suffer from inaccurate and insufficient positive samples, which are detrimental to NCD and even its generalized class discovery (GCD) setting. To solve this problem, we propose positive-augmented contrastive learning (PACL), which can mine more positive samples and additional pseudo-positive samples, while augmenting the loss cost corresponding to these positive pairs. Consequently, PACL alleviates the imbalance between positive and negative pairs in contrastive learning, and facilitates the knowledge transfer for novel class discovery. In addition, we develop a general feature rectification approach based on PACL to rectify the representation learning achieved by existing NCD or GCD models. Extensive experiments on three datasets exhibit the necessity and effectiveness of our approach for both NCD and GCD tasks, without loss of generality.

Keywords: Novel Class Discovery · Contrastive Learning · Rectification

1 Introduction

Supervised deep learning models have demonstrated exceptional performance in both upstream and downstream tasks when trained on extensive sets of labelled data, even surpassing human capabilities [1,2,19,23]. However, such superior performance is always evaluated via recognizing or segmenting images from a set of pre-defined categories/classes, which have been fully annotated by human experts during the training phase. Consequently, their effectiveness is heavily restricted when encountering unlabelled data or unknown classes in real-world scenarios. To explore and address this problem, Novel Class Discovery (NCD) [5,7,8] has been proposed to leverage knowledge information from labelled instances of known classes so as to group unlabelled instances into new and unknown classes, which are related yet disjoint with the known classes.


Fig. 1. Conceptual comparison between vanilla contrastive learning and our positive-augmented contrastive learning (PACL). Our PACL can rescue positive samples from false negative relations, and further explore more pseudo-positive relations.

However, it is unrealistic to assume that the unlabelled data are all from unknown classes and never from known classes. To this end, NCD has recently been extended to Generalized Category Discovery (GCD) [21], enabling the model to cluster arbitrary unlabelled data from both known and unknown classes.

Broaden Your Positives

153

By the use of the positive and pseudo-positive samples, we formulate a positiveaugmented contrastive learning loss, and provide additional theoretical analysis to prove its effectiveness. In a word, PACL helps to identify the visual cues shared between labelled and unlabelled classes, thereby easing the knowledge transfer for novel class discovery. For instance, when “brown bear” acts as a pseudopositive sample for “black bear”, it makes the model learn the same shape cues between the two classes. Likewise, when we construct a pseudo-positive relation between “polar bear” and “dog”, the model learns to identify they have the same color. Consequently, we propose a simple yet general feature rectification approach based on PACL. Our rectification approach can be seamlessly built on top of existing NCD and GCD models, so as to rectify their representation learning in an on-the-fly fashion. Our contributions can be summarized as follows: 1) we propose positive-augmented contrastive learning (PACL), which can mine richer positive/pseudo-positive samples and augment their importance in the training loss. We provide a theoretical analysis of how to define PACL. 2) We develop a feature rectification approach based on PACL, which is the first to investigate how to rectify the feature representation already learned from existing NCD and GCD models. 3) Extensive experiments on generic and fine-grained datasets demonstrate that, our approach benefits improving the performance of several state-of-the-art NCD and GCD methods, without loss of generality.

2

Related Work

Novel and Generalized Class Discovery. Based on the assumption that labelled and unlabelled images are from disjoint classes, current methods on NCD normally use a shared backbone and individual classifiers for recognizing labelled and unlabelled classes [5,7,8,14,26]. In this way, NCD acts as a special case under semi-supervised learning setting [15,20]. In order to strengthen the discrimination of feature representation, RankStat [7] proposes to pre-train the model via self-supervised learning with labelled and unlabelled data. Likewise, UNO [5] unifies the objective of labelled and unlabelled data by generating one-hot pseudo labels for unlabelled images. Considering the fact that the unlabelled images may come from either labelled or unlabelled classes, GCD [21] eliminates the above-mentioned disjoint assumption and builds a simple baseline for generalized class discovery. Another assumption is semantic similarity between labelled and unlabelled data, which supports the success of NCD and GCD [14], but only a few methods take full advantage of this assumption. Motivated by contrastive learning, some methods treat two data augmentation views of one image sample as a positive pair [4,10,21,24], and further enforce their predictions to be consistent [7,10,25–27]. However, these approaches exploit few positive samples, and neglect most semantic information between labelled and unlabelled samples. Instead, we propose to augment positive samples via statistical information, while defining and exploring pseudo-positive samples based on their similarity relationships. The experimental results verify the advantage of our positive-augmented method.

154

Y. Cai et al.

Contrastive Learning. It has been widely used for self-supervised and semisupervised learning scenarios [6,11,16]. Typically, contrastive learning encourages pulling features of a positive pair together, and at the same time pushing features of a negative pair apart. The positive or negative pairs are normally defined as a binary contrastive label. For example in GCD [21], the positive pair of unlabelled data are two augmentation views of each image, while labelled samples which belong to the same class are positive pairs. In [26], the positive pairs are from the five nearest samples. In addition, the potential positive pairs are mostly ignored, which leads to the imbalance problem between positive and negative pairs in contrastive representation learning. Moreover, binary contrastive relationships are unable to reflect semantic similarity between samples, which is important for representation learning in NCD and GCD. Recently, relaxed contrastive loss (RCL) [11] has been proposed to exploit pairwise similarities between samples for better knowledge transfer. Motivated by [11] but differently, our work is the first to broaden the proportion of potential positive samples for NCD/GCD, while augmenting their importance in contrastive learning.

3

Proposed Approach

This section elaborates positive-augmented contrastive learning (PACL), followed by theoretical analysis. Then, we introduce how to exploit PACL to rectify feature representation already learned by existing NCD and GCD models. 3.1

Problem Definition

Given an image set D from known classes C k and unknown classes C u , where lab C k , C u are known a priori, we divide D into a labelled split Dlab = {(xi , yi )}N i=1 unlab and an unlabelled split Dunlab = {(xi , yi )}N . The class labels in Dlab and i=1 unlab lab are denoted with {y1 , ..., yN lab } ∈ C and {y1 , ..., yN unlab } ∈ C unlab , D respectively. The standard NCD task assumes unlabelled images only come from unknown classes, i.e., C unlab = C u , while GCD considers unlabelled data from both known and unknown classes, resulting in C unlab = C u ∪ C k . For both NCD and GCD, they make C lab = C k . The aim is to generate class assignments yˆ for unlabelled images from C unlab classes. 3.2

Positive-Augmented Contrastive Learning

Contrastive learning has become an essential and effective technique to make the model learn from unlabelled data. However, its performance on feature representation is significantly limited by inaccurate and insufficient positive samples. As a result, the model may be biased on negative samples, while overlooking semantic information from potential positive samples. To overcome this problem, we propose a novel positive-augmented contrastive learning (PACL) which can enhance the representation learning of positive pairs and explore pseudopositive relations between labelled and unlabelled data based on the similarity

Broaden Your Positives

155

score in contrastive learning. With this aim, we define a positive set Xi+ for anchor image xi . Xi+ is expected to contain more potential positive samples, no matter they are labelled or unlabelled. We define a threshold β for selecting potential positive samples in Xi+ by s Xi+ = {xj |∀j ∈ {1, ..., N }, wij < β},

(1)

s where wij ∈ (0, 1] indicates the similarity score calculated with the features derived from a source feature extractor f s . Since using class labels to define the threshold may overlook the unlabelled positive samples, we instead define the N samples, with threshold β with the average feature distance of the nearest C−1 no need of using any label information. Therefore, we define β with

β=

1 k

 s xj ∈topk wij

s wij ,

k=

N , C −1

(2)

where $N$ is the number of input images and $C$ is the class number in the dataset. Through $X_i^+$, we augment the loss proportion of positive pairs during contrastive learning. Different from the conventional contrastive loss, our PACL loss becomes

$$L(X) = (C-1) \cdot \frac{1}{N} \sum_{i=1}^{N} \sum_{j \in X_i^+} l(x_i, x_j) + \frac{1}{N} \sum_{i=1}^{N} \sum_{j \notin X_i^+} l(x_i, x_j). \qquad (3)$$

It can be noted that we further weigh the loss term of $X_i^+$ by multiplying a coefficient $C-1$, so as to enhance the representation learning of positive pairs. Additional theoretical analysis on the coefficient $C-1$ will be given in Sect. 3.3. Here, we compute $l(x_i, x_j)$ with the relaxed contrastive loss [11], which implicitly defines pseudo-positive samples by replacing binary contrastive relationships with real-valued similarity scores. The loss cost of $l(x_i, x_j)$ we use becomes

$$l(x_i, x_j) = w_{ij}^s \left(d_{ij}^t\right)^2 + (1 - w_{ij}^s)\left[\delta - d_{ij}^t\right]_+^2, \qquad (4)$$

where $[\cdot]_+$ is a hinge function that is determined by a distance margin $\delta$.
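To make the loss construction concrete, the following PyTorch-style sketch assembles Eqs. (1)-(4) for a single batch. It is a minimal illustration rather than the authors' implementation: the function name `pacl_loss` is ours, the target-space distances are used without the batch-level normalisation introduced later in Eq. (6), and the positive-set comparison follows Eq. (1) exactly as printed.

```python
import torch

def pacl_loss(z_s, z_t, num_classes, delta=1.0):
    """Illustrative PACL loss for one batch, following Eqs. (1)-(4).

    z_s: features from the frozen source extractor f^s, shape (N, d)
    z_t: features from the target extractor f^t being fine-tuned, shape (N, d)
    """
    N = z_s.size(0)
    k = max(int(N / (num_classes - 1)), 1)

    # Similarity scores from the source features (cf. Eq. (5)): w_ij = exp(-d_ij^2)
    d_s = torch.cdist(z_s, z_s)
    w_s = torch.exp(-d_s.pow(2))                      # values in (0, 1]

    # Threshold beta, Eq. (2): average of the top-k similarities per anchor
    beta = w_s.topk(k, dim=1).values.mean(dim=1, keepdim=True)

    # Positive set X_i^+, Eq. (1) exactly as printed (comparison against beta)
    pos = (w_s < beta).float()
    neg = 1.0 - pos

    # Relaxed contrastive term, Eq. (4); self-pairs contribute zero because
    # w_ii = 1 and d_ii = 0, so the diagonal needs no explicit masking
    d_t = torch.cdist(z_t, z_t)
    l_pair = w_s * d_t.pow(2) + (1.0 - w_s) * torch.clamp(delta - d_t, min=0.0).pow(2)

    # PACL loss, Eq. (3): the positive term is re-weighted by (C - 1)
    loss_pos = (l_pair * pos).sum(dim=1).mean()
    loss_neg = (l_pair * neg).sum(dim=1).mean()
    return (num_classes - 1) * loss_pos + loss_neg
```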

3.3 Theoretical Analysis for PACL

To reveal the imbalance issue between positive and negative pairs in contrastive representation learning, it is necessary to define and compare the gradients with respect to the positive and negative pairs.

Definition 1 ($\delta_i^+$, $\delta_i^-$). Given a batch of image samples from $C$ classes, that is, $B = \{(x_i, y_i)\}_{i=1}^{N} \in \mathbb{R}^C$, we define the gradient $\delta_i$ for any $x_i$ with

$$\delta_i = \sum_{x_j \in B} \frac{\partial L(X)}{\partial d_{ij}^t} = N\left[\frac{\partial L(X)}{\partial d_{ij}^t}\right]_{\mathrm{mean}}.$$

Then we define $B^+ = \{(x_j, y_j) \mid x_j \in B, y_j = y_i\}$ and $B^- = \{(x_j, y_j) \mid x_j \in B, y_j \neq y_i\}$, which represent samples from the same class and from other classes, respectively. Hence, we have

$$\delta_i^{+/-} = N^{+/-}\left[\frac{\partial L(X)}{\partial d_{ij}^t}\right]_{\mathrm{mean}}^{+/-},$$

where $\delta_i^+$ is the gradient of samples from the same class as $x_i$, and $\delta_i^-$ is the opposite. The numbers $N^+$ and $N^-$ count the number of samples in the two cases.

Theorem 1. $\mathbb{E}[\delta_i^+ : \delta_i^-]$ in contrastive loss is negatively correlated with the total class number $C$.

Proof. We decompose the gradient of positive and negative pairs:

$$\mathbb{E}[\delta_i^+ : \delta_i^-] = \mathbb{E}\left\{[N^+ : N^-]\left[\left[\frac{\partial L(X)}{\partial d_{ij}^t}\right]_{\mathrm{mean}}^{+} : \left[\frac{\partial L(X)}{\partial d_{ij}^t}\right]_{\mathrm{mean}}^{-}\right]\right\} = \frac{1}{C-1}\cdot\mathbb{E}\left[\left[\frac{\partial L(X)}{\partial d_{ij}^t}\right]_{\mathrm{mean}}^{+} : \left[\frac{\partial L(X)}{\partial d_{ij}^t}\right]_{\mathrm{mean}}^{-}\right].$$

For a certain $x_i$, its positive samples are from class $y_i$, while negative samples are from the remaining $C-1$ classes. Under random sampling, the expected ratio of positive and negative samples is their proportional ratio in the whole dataset, i.e., $\mathbb{E}[N^+ : N^-] = \frac{1}{C} : \frac{C-1}{C} = \frac{1}{C-1}$. Meanwhile, the value of the remaining expectation is conditioned on the feature extractor and learning strategy used in NCD or GCD methods. As a result, negative samples can dominate the gradient of the contrastive loss, especially when the number of classes $C$ is large. Moreover, in an open-world scenario, the gradient of positive samples will be gradually diluted as $C$ increases over time. Based on these observations, we define a coefficient $C-1$ in Eq. (3) to weigh the loss term, which helps to alleviate the imbalance issue between positive and negative samples in contrastive learning.

3.4 PACL for NCD and GCD

PACL can act as a simple and general method to rectify the feature representation learned by any NCD and GCD models. In other words, PACL is an on-the-fly feature rectification that is performed on top of a trained feature extractor. After fine-tuning with our PACL loss, the rectified feature representation allows correcting some mistaken pseudo labels and transferring knowledge between positive and pseudo-positive samples with respect to unlabelled images. Figure 2 depicts the overall pipeline of our approach. Specifically, let $f^s$ be a source feature extractor that has been trained by any NCD or GCD method, and $f^t$ be a target feature extractor whose representations are rectified by our PACL. Then, we use $f^s$ to generate a feature distance matrix $D^s$, a similarity matrix $W$ and a threshold $\beta$, and use $f^t$ to generate a feature distance matrix $D^t$. Afterward, we rectify the feature representation with the proposed positive-augmented contrastive loss. Hence, the similarity matrix $W$ with $f^s$ is formulated as

$$W = \left[w_{ij}^s\right]_{i,j \in [1, \dots, N]}, \qquad w_{ij}^s = e^{-(d_{ij}^s)^2} \in (0, 1]. \qquad (5)$$


Fig. 2. Overall pipeline of our PACL. We use features $z^s$ and $z^t$ to generate Euclidean distance matrices $D^s$ and $D^t$, respectively. Then, we use $D^s$ to generate a similarity matrix $W$. A threshold $\beta$ is calculated based on $D^s$, which controls the division of positive and negative pairs. The model is optimized by the PACL loss we develop.

As $f^s$ is the model trained with existing methods, the feature distances in the source embedding space can measure the similarity relation between samples. In order to increase the discrimination of the feature $z^t$, we employ a batch-level relative distance to calculate the distance matrix $D^t$, which is given by:

$$D^t = \left[\hat{d}_{ij}^t\right]_{i,j \in [1, \dots, N]}, \qquad \hat{d}_{ij}^t = \frac{d_{ij}^t}{\frac{1}{n^2}\sum_{i=1}^{n}\sum_{j=1}^{n} d_{ij}^t}. \qquad (6)$$
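A minimal sketch of this batch-level normalisation, assuming plain Euclidean distances and a helper name of our own:

```python
import torch

def relative_distance_matrix(z_t, eps=1e-12):
    """Batch-level relative distances as in Eq. (6): pairwise Euclidean
    distances divided by their batch average, with no l2 normalisation."""
    d_t = torch.cdist(z_t, z_t)          # (n, n) pairwise distances
    return d_t / (d_t.mean() + eps)
```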

It can be noted that we use the average feature distance to normalize $d_{ij}^t$, and remove $l_2$ normalization, which releases the learned representation to the whole embedding space rather than the unit hypersphere. Moreover, to maintain representation learning within labelled and unlabelled data respectively, we jointly fine-tune the model with the proposed loss on labelled data, unlabelled data, and their union. Finally, the overall loss for our feature rectification approach becomes

$$L = \alpha L_{label} + \beta L_{unlabel} + \gamma L_{all}, \qquad (7)$$

where $\alpha, \beta, \gamma$ are weight parameters. Here, we refer to $L_{label}$, $L_{unlabel}$ and $L_{all}$ as the proposed PACL loss with respect to labelled data, unlabelled data, and their union, respectively.
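A hedged sketch of how the three terms could be combined, assuming a `pacl_loss` helper like the one sketched in Sect. 3.2 and subset index tensors provided by the data pipeline (all names here are ours):

```python
def rectification_loss(z_s, z_t, lab_idx, unlab_idx,
                       c_lab, c_unlab, c_all, alpha, beta, gamma):
    # L_label, L_unlabel and L_all of Eq. (7): one PACL loss per data subset
    loss_lab = pacl_loss(z_s[lab_idx], z_t[lab_idx], num_classes=c_lab)
    loss_unlab = pacl_loss(z_s[unlab_idx], z_t[unlab_idx], num_classes=c_unlab)
    loss_all = pacl_loss(z_s, z_t, num_classes=c_all)
    return alpha * loss_lab + beta * loss_unlab + gamma * loss_all
```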

4 Experiments

4.1 Experimental Setup

Datasets. We conduct experiments on a generic image classification dataset CIFAR100 [13], and two fine-grained classification datasets including CUB [22] and Stanford Cars [12]. Following [5,7,21], we take the first 50% classes in each dataset as known, and the remaining 50% classes as unknown. Apart from that,

Table 1. Statistics of the dataset splits for NCD and GCD.

            | CIFAR100-20   | CIFAR100-50 | CUB   | SCars
            | NCD   | GCD   | NCD         | GCD   | GCD
  C^lab     | 80    | 80    | 50          | 100   | 98
  C^unlab   | 20    | 100   | 50          | 200   | 196
  N^lab     | 40k   | 20k   | 25k         | 1.5k  | 2.0k
  N^unlab   | 10k   | 30k   | 25k         | 4.5k  | 6.1k

we employ another split of CIFAR100, which takes the first 80 classes as known [5] and the other 20 classes as unknown. For clarity, we denote the two splits as CIFAR100-50 and CIFAR100-20, indicating 50 and 20 unknown classes respectively. Regarding GCD, the unlabelled training data consists of 50% of the images from $C^k$ and all of the images from $C^u$.
Evaluation Metrics. Following NCD methods [7], we evaluate their performance with a classification accuracy for labelled images and a clustering accuracy (ACC) for unlabelled images. As for GCD methods [21], it is common to evaluate the performance with the ACC metric for samples from the unlabelled training set $D^{unlab}$. The ACC metric is formulated as $ACC = \frac{1}{N^u}\sum_{i=1}^{N^u} \mathbb{1}(y_i = \mathrm{perm}^*(\hat{y}_i))$, where $N^u$ is the number of samples from unknown classes, and $\mathrm{perm}^*$ is the optimal permutation between the predicted cluster assignment $\hat{y}_i$ and the ground truth $y_i$. We report the accuracy results on the “All”, “Old” and “New” class subsets, which represent all samples, samples from known classes, and samples from unknown classes respectively. Following [5], we evaluate NCD approaches with task-aware and task-agnostic protocols, which means that instances from known and unknown classes are evaluated separately or jointly, respectively.
Implementation Details. Following the methods in the literature, we use ResNet-18 [9] and ViT-B-16 [3] backbones. For all the models, our method only fine-tunes the final block of the backbone with the proposed PACL loss. We use SGD with momentum [18] as the optimizer. For NCD and GCD, the batch size is set to {1024, 256} and the initial learning rate is set to {0.001, 0.00001}, decayed with a cosine annealing schedule, respectively. The weight coefficients in Eq. (7) are set to $\{\alpha, \beta, \gamma\} = \min\{C^{lab}, C^{unlab}\}/\{C^{lab}, C^{unlab}, C^{lab} \cup C^{unlab}\}$, and the distance margin $\delta$ in Eq. (4) is set to 1 for NCD and 0.9 for GCD. As is standard practice in contrastive learning, we generate two randomly augmented views for all samples. We fine-tune the model for 10/40 epochs on CIFAR100-20/CIFAR100-50 in the NCD setting, and for 20/100/5 epochs on CIFAR100-20/CUB/SCars in the GCD setting.
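The optimal permutation perm* is usually obtained with the Hungarian algorithm; the sketch below shows one standard way to compute ACC using SciPy's linear_sum_assignment (our own choice of tooling, not necessarily the authors' evaluation code).

```python
import numpy as np
from scipy.optimize import linear_sum_assignment

def clustering_accuracy(y_true, y_pred):
    """Clustering accuracy with the optimal permutation perm* (Hungarian matching)."""
    y_true, y_pred = np.asarray(y_true), np.asarray(y_pred)
    n_cls = int(max(y_true.max(), y_pred.max())) + 1
    # match[i, j] = number of samples assigned to cluster i whose true label is j
    match = np.zeros((n_cls, n_cls), dtype=np.int64)
    for t, p in zip(y_true, y_pred):
        match[p, t] += 1
    rows, cols = linear_sum_assignment(match.max() - match)  # maximise agreement
    return match[rows, cols].sum() / y_true.size
```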

4.2 PACL for Novel Category Discovery

For novel category discovery (NCD), we implement our PACL on top of two representative and state-of-the-art baselines: RankStats [7] and UNO [5]. Table 2 reports the accuracy results on CIFAR100 dataset in terms of three class sets


Table 2. Compared results on CIFAR100 for the NCD task. RankStats+ represents the one trained with an incremental classifier, and UNO-v2 is the improved version beyond UNO. The test set is evaluated with both task-agnostic and task-aware protocols, while the train set follows a task-aware protocol only.

                      | CIFAR100-20                        | CIFAR100-50
                      | task-agnostic  | task-aware        | task-agnostic  | task-aware
                      | Test           | Test   | Train    | Test           | Test   | Train
  Method              | All  Old  New  | New    | New      | All  Old  New  | New    | New
  RankStats+          | 68.3 71.2 56.8 | 75.8   | 75.2     | 55.3 69.7 40.9 | 49.1   | 44.1
  RankStats+ w/ RCL   | 70.0 71.5 64.1 | 75.1   | 77.3     | 57.3 68.7 46.5 | 48.4   | 49.8
  RankStats+ w/ PACL  | 70.8 72.1 65.7 | 76.2   | 77.4     | 59.0 73.0 46.0 | 49.7   | 50.4
  UNO-v2              | 73.6 73.9 72.6 | 83.5   | 85.1     | 66.4 75.6 57.4 | 60.3   | 61.5
  UNO-v2 w/ RCL       | 71.0 70.9 71.5 | 81.0   | 82.2     | 63.0 68.2 57.9 | 59.7   | 62.0
  UNO-v2 w/ PACL      | 74.3 74.5 73.6 | 84.2   | 85.1     | 67.7 78.6 57.0 | 61.7   | 63.1

Table 3. Compared results on three datasets for the GCD task.

                 | CIFAR100-20      | CUB              | SCars
  Method         | All   Old   New  | All   Old   New  | All   Old   New
  GCD            | 68.2  75.4  53.9 | 52.9  51.7  53.5 | 35.6  46.5  30.4
  GCD w/ RCL     | 68.7  77.1  51.8 | 54.6  55.4  54.2 | 37.8  44.2  34.7
  GCD w/ PACL    | 70.8  78.0  56.5 | 55.2  57.6  54.0 | 38.3  45.6  34.8
  XCon           | 73.2  79.1  61.5 | 54.4  55.2  53.9 | 37.7  43.2  35.1
  XCon w/ RCL    | 73.5  79.1  62.4 | 54.5  59.2  52.1 | 40.0  44.4  37.9
  XCon w/ PACL   | 74.7  81.2  61.7 | 56.3  59.8  54.6 | 40.8  45.1  38.7

(“All”, “Old” and “New”). In addition, the experiments are conducted under both task-agnostic and task-aware settings. Overall, we can see that PACL improves the baseline methods by a promising margin over almost all of the metrics, except the “New” accuracy on the test set of CIFAR100-50. This implies the effectiveness of our feature rectification method. To further verify the advantage of our PACL, we also implement the relaxed contrastive loss (RCL) to rectify the baselines. It can be noted that RCL improves RankStats+ on all subsets of CIFAR100-20, while degrading the performance of UNO-v2. Instead, our PACL achieves consistent improvements on the two baselines, which further proves the necessity and superiority of exploring more positive samples.

4.3 PACL on Generalized Category Discovery

For generalized category discovery (GCD), we choose two solid baselines: GCD [21] and XCon [4]. We report the results in Table 3, which shows the accuracy performance on “All”, “Old” and “New” classes of the unlabelled train set [4,21,24]. In particular for the “All” accuracy, the baseline with PACL significantly


Fig. 3. (a) Visualization of the similarity matrix W and several positive-augmented samples in X + . (b) Hyper-parameter tuning of δ for XCon w/ PACL on CIFAR100-20.

Fig. 4. Distribution of inter-class and intra-class feature distances according to XCon w/o and w/ PACL on CUB dataset.

outperforms the original baseline and its counterpart with RCL. Although several results achieved by RCL are higher than those by PACL, this does not negate the importance of positive-augmented contrastive learning. Last but not least, PACL achieves more significant accuracy gains on the fine-grained datasets (i.e., CUB and SCars), where positive samples are more easily identified incorrectly.

4.4 Additional Study

In addition to the compared results above, we conduct the following experiments to provide more insights on PACL: 1) Recall that the set of positive-augmented samples $X^+$ is constructed based on the similarity matrix $W$ defined in Eq. (5). In Fig. 3(a), we illustrate the $W$ matrix when performing XCon with PACL on CIFAR100-20. It can be seen that richer positive samples are allowed to be used in our contrastive learning. In addition, we tune the hyper-parameter $\delta$ in Eq. (4), and the results in Fig. 3(b) show that it should be set to 0.9 for GCD. 2) GCD on fine-grained datasets is challenged by small inter-class discrimination and large intra-class variation. We compare the distributions of inter-class and intra-class feature distances between XCon without and with PACL on the CUB dataset, as shown in Fig. 4. We can see that using PACL in the model


Fig. 5. Illustration of t-SNE feature embeddings according to RankStats+ w/o and w/ PACL on CIFAR100-20 setting for NCD task.

prevents the inter-class distance distribution from becoming smooth, while helping the intra-class distance distribution become smoother. 3) Qualitatively, we visualize the t-SNE embeddings based on our rectified features. As shown in Fig. 5, the feature embeddings with PACL become more discriminative in the space. In particular, the “wolf” and “willow tree” classes are separated from “trout”, “whale” and “worm”.

5 Conclusion

In this paper, we have proposed positive-augmented contrastive learning that rectifies the feature representation already learned by NCD and GCD models. We mine positive and pseudo-positive samples with statistical information based on similarity relations between samples. Then, we theoretically formulate a PACL loss, which can augment the importance of positive samples in contrastive learning. Experimental results on generic and fine-grained classification datasets demonstrate the consistent improvements achieved by PACL. In future work, we are interested in extending PACL with noise-robust strategies to maintain more reliable positive samples, benefiting knowledge transfer in NCD and GCD. Acknowledgements. This work was supported in part by the NSF of China under Grant Nos. 62102061 and 62272083, and in part by the Liaoning Provincial NSF under Grant 2022-MS-137 and 2022-MS-128, and in part by the Fundamental Research Funds for the Central Universities under Grant DUT21RC(3)024.

References 1. Cheng, B., et al.: Panoptic-deeplab: a simple, strong, and fast baseline for bottomup panoptic segmentation. In: CVPR, pp. 12475–12485 (2020) 2. Deng, J., Dong, W., Socher, R., Li, L.J., Li, K., Fei-Fei, L.: Imagenet: a large-scale hierarchical image database. In: CVPR, pp. 248–255 (2009) 3. Dosovitskiy, A., et al.: An image is worth 16x16 words: transformers for image recognition at scale. In: ICLR (2021)


4. Fei, Y., Zhao, Z., Yang, S., Zhao, B.: Xcon: learning with experts for fine-grained category discovery. In: BMVC (2022) 5. Fini, E., Sangineto, E., Lathuili`ere, S., Zhong, Z., Nabi, M., Ricci, E.: A unified objective for novel class discovery. In: ICCV, pp. 9284–9292 (2021) 6. Hadsell, R., Chopra, S., LeCun, Y.: Dimensionality reduction by learning an invariant mapping. In: CVPR, pp. 1735–1742 (2006) 7. Han, K., Rebuffi, S.A., Ehrhardt, S., Vedaldi, A., Zisserman, A.: Autonovel: Automatically discovering and learning novel visual categories. IEEE TPAMI 44(10), 6767–6781 (2021) 8. Han, K., Vedaldi, A., Zisserman, A.: Learning to discover novel visual categories via deep transfer clustering. In: ICCV, pp. 8401–8409 (2019) 9. He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: CVPR, pp. 770–778 (2016) 10. Jia, X., Han, K., Zhu, Y., Green, B.: Joint representation learning and novel category discovery on single-and multi-modal data. In: ICCV, pp. 610–619 (2021) 11. Kim, S., Kim, D., Cho, M., Kwak, S.: Embedding transfer with label relaxation for improved metric learning. In: CVPR, pp. 3967–3976 (2021) 12. Krause, J., Stark, M., Deng, J., Fei-Fei, L.: 3D object representations for finegrained categorization. In: ICCV Workshop, pp. 554–561 (2013) 13. Krizhevsky, A., Hinton, G., et al.: Learning multiple layers of features from tiny images (2009) 14. LI, Z., Otholt, J., Dai, B., Hu, D., Meinel, C., Yang, H.: A closer look at novel class discovery from the labeled set. In: NeurIPS (2022) 15. Oliver, A., Odena, A., Raffel, C.A., Cubuk, E.D., Goodfellow, I.: Realistic evaluation of deep semi-supervised learning algorithms. In: NeurIPS (2018) 16. Oord, A.v.d., Li, Y., Vinyals, O.: Representation learning with contrastive predictive coding. arXiv:1807.03748 (2018) 17. Pu, N., Zhong, Z., Sebe, N.: Dynamic conceptional contrastive learning for generalized category discovery. In: CVPR (2023) 18. Sutskever, I., Martens, J., Dahl, G., Hinton, G.: On the importance of initialization and momentum in deep learning. In: ICML, pp. 1139–1147 (2013) 19. Tan, M., Pang, R., Le, Q.V.: Efficientdet: scalable and efficient object detection. In: CVPR, pp. 10781–10790 (2020) 20. Tarvainen, A., Valpola, H.: Mean teachers are better role models: weight-averaged consistency targets improve semi-supervised deep learning results. In: NeurIPS (2017) 21. Vaze, S., Han, K., Vedaldi, A., Zisserman, A.: Generalized category discovery. In: CVPR, pp. 7492–7501 (2022) 22. Wah, C., Branson, S., Welinder, P., Perona, P., Belongie, S.: The caltech-ucsd birds-200-2011 dataset (2011) 23. Wang, H., Zhu, Y., Adam, H., Yuille, A., Chen, L.C.: Max-deeplab: end-to-end panoptic segmentation with mask transformers. In: CVPR, pp. 5463–5474 (2021) 24. Wen, X., Zhao, B., Qi, X.: Parametric classification for generalized category discovery: a baseline study. arXiv:2211.11727 (2022) 25. Yang, M., Zhu, Y., Yu, J., Wu, A., Deng, C.: Divide and conquer: compositional experts for generalized novel class discovery. In: CVPR, pp. 14268–14277 (2022) 26. Zhong, Z., Fini, E., Roy, S., Luo, Z., Ricci, E., Sebe, N.: Neighborhood contrastive learning for novel class discovery. In: CVPR, pp. 10867–10875 (2021) 27. Zhong, Z., Zhu, L., Luo, Z., Li, S., Yang, Y., Sebe, N.: Openmix: reviving known knowledge for discovering novel visual categories in an open world. In: CVPR, pp. 9462–9470 (2021)

CE2: A Copula Entropic Mutual Information Estimator for Enhancing Adversarial Robustness

Lin Liu, Cong Hu(B), and Xiao-Jun Wu

School of Artificial Intelligence and Computer Science, Jiangnan University, Wuxi, China
[email protected]

Abstract. Deep neural networks are vulnerable to adversarial examples, which exploit imperceptible perturbations to mislead classifiers. To improve adversarial robustness, recent methods have focused on estimating mutual information (MI). However, existing MI estimators struggle to provide stable and reliable estimates for high-dimensional data. To this end, we propose a Copula Entropic MI Estimator (CE2) to address these limitations. CE2 leverages copula entropy to estimate MI in high dimensions, allowing target models to harness information from both clean and adversarial examples to withstand attacks. Our empirical experiments demonstrate that CE2 achieves a trade-off between variance and bias in MI estimates, resulting in stable and reliable estimates. Furthermore, the defense algorithm based on CE2 significantly enhances adversarial robustness against multiple attacks. The experimental results underscore the effectiveness of CE2 and its potential for improving adversarial robustness.

Keywords: Adversarial robustness · Copula entropy · Mutual information estimation

1 Introduction

Deep neural networks (DNNs) have propelled advancements in computer vision tasks [5,11,12,14,32], demonstrating remarkable performance in areas such as image classification [10,38], object detection [33,34,39] and face recognition [7,8]. Albeit triumphing on predictive performance, recent literature has shown that DNNs are vulnerable to adversarial attacks [2,6,24,29]. In an adversarial attack, undetectable but targeted perturbations, which can drastically degrade the performance of a model, are added to input samples. Such attacks have imposed serious threats to the safety and robustness of technologies enabled by

This work was supported in part by the National Natural Science Foundation of China (Grant No. 62006097, U1836218), in part by the Natural Science Foundation of Jiangsu Province (Grant No. BK20200593) and in part by the China Postdoctoral Science Foundation (Grant No. 2021M701456).


DNNs. In order to improve the robustness of DNNs against adversarial attacks, researchers have focused on understanding the dependence between the target model’s output and input adversarial examples. However, this particular aspect remains largely unexplored in the literature. Recently, mutual information (MI) has gained renewed interest in the field of computer vision, inspired by the Infomax principle [15]. MI is an entropy-based measure that quantifies the dependence between two variables. In the context of deep learning, MI is used to assess the relationship between features and labels, with higher MI values indicating greater information about the labels with the features. While estimating MI in high-dimensional spaces is a notoriously difficult task, in practice researchers often rely on maximizing tractable lower bounds on MI [1,20,22,25]. Nonetheless, these distribution-free high-confidence lower bounds require exponentially increasing sample sizes [18,21,23], making them inadequate for providing low-variance, low-bias estimates [25,31]. These challenges become fatal in adversarial training, as uncertain and fluctuating MI estimates can hinder the target model’s ability to capture meaningful and relevant information about the input-output relationship. Biased estimates can further lead to sub-optimal training and provide misleading guidance. In addition, directly leveraging MI between the input adversarial example and its corresponding output (called standard MI ) has limitations in improving classification accuracy and adversarial robustness. Adversarial examples contain both natural and adversarial patterns, causing the standard MI to reflect a mixed dependence of the output on these patterns. Utilizing the standard MI of adversarial examples to guide the target model may inadvertently reinforce the dependence between the output and the adversarial pattern, resulting in misclassification from the correct label to an incorrect one [9]. To this end, we propose a Copula Entropic mutual information Estimator (CE2 ) on the basis of the adversarial training manner. CE2 is designed specifically to estimate high-dimensional MI and aims to enhance adversarial robustness. By leveraging copula entropy, it offers a method for explicitly estimating the MI between the input and corresponding output in two distinct patterns. This approach provides several advantages, including more stable and reliable estimates of the input-output relationship and an effective trade-off between variance and bias, which is crucial in the face of adversarial attacks. Empirical experiments show that CE2 outperforms existing estimators in synthetic experiments and produce realistic estimates of MI in real-world datasets. The main contributions in this paper are as follows: • We investigate copula entropy to fit the high-dimensional MI more precisely. Instead of directly or implicitly modeling the marginals of Px,y , CE2 decouples the computation of MI from marginal distributions, effectively reducing both variance and bias of estimates. • To improve adversarial robustness, a defense algorithm based on CE2 is proposed. It guides the target model to prioritize the natural pattern while reducing attention to the adversarial pattern, thereby improving the model’s resilience against adversarial attacks.


• Empirical experiments, which encompass various adversarial attacks, demonstrate the effectiveness of the proposed defense algorithm in improving adversarial robustness.

2 Background

2.1 Copula Entropy and MI

Copula is the theory on representation of dependence relationships [26]. According to Sklar's theorem [30], any probabilistic distribution can be represented as a copula function with marginal functions as its inputs. Thus, we can construct bivariate distributions with given univariate marginals $F_1$ and $F_2$ by using a copula $C_F$ [26], such that

$$F(x, y) = C_F(F_1(x), F_2(y)). \qquad (1)$$

Based on this representation, Ma and Sun [16] proposed a mathematical concept for statistical independence measurement, named copula entropy, and then proved the equivalence between MI and copula entropy.

Definition 1. Let $X$ be a two-dimensional random variable with copula density $c(u, v)$. The copula entropy of $X$ is defined as

$$H_c(X) = -\int_{u,v} c(u, v) \ln c(u, v)\, du\, dv. \qquad (2)$$

Kullback-Leibler information:

$$D_{KL}(F, F_1 F_2) = -\int_0^1 \int_0^1 c(u, v) \ln c(u, v)\, du\, dv. \qquad (3)$$

In this case, the Kullback-Leibler information is called mutual information.

Theorem 1. The mutual information of the random variables is equal to the negative entropy of their corresponding copula function:

$$D_{KL}(F, F_1 F_2) = -H_c(X). \qquad (4)$$

Copula entropy enjoys several properties which an ideal statistical independence measure should have, such as multivariate, symmetric, non-negative (0 if independence), invariant to monotonic transformation, and equivalent to correlation coefficient in Gaussian cases.

2.2 Adversarial Attacks and Defenses

Adversarial noise can be crafted by optimization-based attacks, such as PGD [17], AA [3], CW [2] and DDN [27]. Besides, some attacks, such as FWA [36], focus on mimicking non-suspicious vandalism by exploiting the geometry and spatial information. These attacks constrain the perturbation boundary by a small norm-ball $\|\cdot\|_p \le \epsilon$, so that their adversarial instances can be perceptually similar to natural instances.


The issue of adversarial attacks has spurred the development of adversarial defenses, with a major focus on enhancing adversarial robustness through adversarial training [4,17,37]. These methods augment training data with adversarial examples and employ a min-max formulation to train the target model. However, they do not explicitly measure the dependence between adversarial examples and corresponding outputs. Furthermore, certain data pre-processing based methods aim to remove adversarial noise by learning denoising functions or feature-squeezing functions [13,19]. Nonetheless, these methods may encounter challenges such as human-observable loss and residual adversarial noise, which can impact the final prediction. To avoid the problems above, our proposed method utilizes natural MI and adversarial MI to train an adversarially robust classification model, providing an effective and comprehensive defense against adversarial attacks.

3 Methodology

3.1 Motivation

Estimating MI is core to many problems in machine learning, but at the same time, accurately bounding MI in high dimensions remains a challenging problem. Several recent methods, such as NWJ [20], DV [1], f-divergence [28] and MIAT [40], have attempted to compute gradients of lower bounds on MI with respect to the parameters θ of stochastic encoders pθ (y | x). These methods aim to avoid direct estimation of MI by introducing flexible parametric distributions or critics parameterized by neural networks to approximate the unknown densities (p(y), p(y | x)) or density ratios involved (p(x | y)/p(x) = p(y | x)/p(y)). However, these estimators have shown limitations in providing low-variance, low-bias estimates when dealing with high-dimensional data. Theoretical findings of McAllester & Stratos [18] and empirical experiments conducted by Ben Poole [25] have shed light on these limitations. Specifically, it has been demonstrated that any distribution-free high-confidence lower bound on MI estimated from N samples cannot exceed O (ln N ). In the context of adversarial training, high variance leads to uncertainty and fluctuations in MI estimates, while high bias results in estimated MI deviating from the true MI. Consequently, unstable and unreliable estimates can lead to sub-optimal training and provide misleading information about the input-output relationship. To address the trade-off between variance and bias in MI estimation, we propose a novel approach based on copula entropy, named CE2 (Copula Entropic MI Estimator), which allows for more precise estimation of high-dimensional MI. Unlike existing methods that rely on modeling the marginals of the joint distribution, CE2 decouples the computation of MI from marginal distributions. This decoupling is motivated by the observation that the properties of the marginals are irrelevant to accurately estimate MI. By separating the relevant entropy from the irrelevant entropy, CE2 effectively reduces the number of implicit quantities that contribute to the final MI estimates. This reduction mitigates the sensitivity of the estimator to choices, such as encoder parameters or architectural


configurations. As a result, CE2 provides more reliable and stable estimates of MI, which in turn enhances the adversarial robustness of target models.

3.2 Copula Entropic MI Estimator

The copula entropy can be expressed as an expectation over the copula density $c$:

$$h(c) = -\mathbb{E}_c[\log_2 c(U)], \qquad (5)$$

where $U = (U_1, \dots, U_d)$ denotes a random vector from the copula space. This expectation can then be approximated by the empirical average over a large number of $d$-dimensional samples $u_j = ((u_j)_1, \dots, (u_j)_d)$ from the random vector $U$:

$$-\mathbb{E}_c[\log_2 c(U)] \approx \hat{h}_k := -\frac{1}{k}\sum_{j=1}^{k} \log_2\big(c(u_j)\big). \qquad (6)$$

By the strong law of large numbers, $\hat{h}_k$ converges almost surely to $h(c)$. Hence, we can describe the process of our copula entropic MI estimator (CE2) as follows.
• Sort the $N$ samples of the two random variables $X$ and $Y$, respectively, as $x_{1:N}$, $y_{1:N}$;
• Approximate the empirical distribution function of the original distribution by probability integration, i.e., $\hat{F}(x) = \frac{1}{N}\sum_{i=1}^{N} I_{[x_i \le x]}$;
• Estimate the copula function between $X$ and $Y$: $C(u, v \mid \theta)$;
• Calculate the mutual information:

$$I(X, Y) = \int_0^1 \int_0^1 \log\frac{C(u, v)}{u \cdot v}\, d\,\mathrm{cdf}(X)\, d\,\mathrm{cdf}(Y);$$

• Use a parameter estimation method: maximum likelihood estimation or canonical estimation.
Theoretically, CE2 introduced above offers the distinct advantage of being independent from the marginal distributions, which in turn enhances its robustness to potential irregularities that may exist within those marginals. This key feature sets it apart from density-dependent methods, like MINE [1] and NWJ [20], which struggle with marginal irregularities. Note that adversarial examples contain natural and adversarial patterns [40]. The MI between the output and the natural pattern is called natural MI, and the MI between the output and the adversarial pattern is called adversarial MI. Let $E_{\phi_n}$ denote the CE2 network for the natural MI and $E_{\phi_a}$ denote the CE2 network for the adversarial MI. During training, the optimization mechanism for them is minimizing the natural MI of adversarial examples and the adversarial MI of natural samples. In addition, to estimate MI more accurately, we select samples that are correctly predicted by the target model and whose corresponding


Fig. 1. The overview of the defense algorithm based on CE2 .

adversarial samples are wrongly predicted, to train the estimation networks. The optimization goals are as follows:

$$\hat{\phi}_n = \arg\max_{\phi_n \in \Phi_N}\left[E_{\phi_n}(f_{\theta_0}(X')) - E_{\phi_n}(f_{\theta_0}(\tilde{X}'))\right], \qquad \hat{\phi}_a = \arg\max_{\phi_a \in \Phi_A}\left[E_{\phi_a}(f_{\theta_0}(\tilde{X}')) - E_{\phi_a}(f_{\theta_0}(X'))\right], \qquad (7)$$

where $\Phi_N$ and $\Phi_A$ denote the sets of model parameters, and $f_{\theta_0}$ denotes the pretrained target model. $X'$ is the selected data:

$$X' = \arg_X\left[\delta(f_{\theta_0}(X)) = Y,\ \delta(f_{\theta_0}(\tilde{X})) \neq Y\right], \qquad (8)$$

where $\delta$ is the operation that transforms the logit output into the prediction label. Based on the CE2 networks above, we develop an adversarial defense algorithm to enhance the adversarial robustness of the target model. We aim to guide the target model towards emphasizing the natural pattern in the input while reducing its focus on the adversarial pattern. We provide the overview of the adversarial defense method in Fig. 1. During the training process, the defense algorithm based on CE2 adopts an optimization strategy that simultaneously maximizes the natural MI of input adversarial examples and minimizes their adversarial MI. The optimization goal for the target model is as follows:

$$\hat{\theta} = \arg\max_{\theta \in \Theta}\left[E_{\hat{\phi}_n}(f_\theta(\tilde{X})) - E_{\hat{\phi}_a}(f_\theta(\tilde{X}))\right]. \qquad (9)$$

Thus the loss function is formulated as:

$$L_{mi}(\theta) = \frac{1}{m}\sum_{i=1}^{m}\Big\{ L_{cos}\big(E_{\phi_n}(f_\theta(\tilde{x}_i)), E_{\phi_n}(f_\theta(x_i))\big) + L_{cos}\big(E_{\phi_a}(f_\theta(\tilde{x}_i)), E_{\phi_a}(f_\theta(x_i))\big) + \big[E_{\phi_a}(f_\theta(\tilde{x}_i)) - E_{\phi_n}(f_\theta(\tilde{x}_i))\big]\Big\}, \qquad (10)$$


Algorithm 1. CE2 defense algorithm
Input: Target model $f_\theta(\cdot)$ parameterized by $\theta$, natural CE2 $E_{\hat{\phi}_n}$, adversarial CE2 $E_{\hat{\phi}_a}$, batch size $n$, and the perturbation budget $\epsilon$.
Output: optimized $\theta$
1: repeat
2:   Read mini-batch $B = \{x_i\}_{i=1}^{n}$ from the training set;
3:   for $i = 1$ to $n$ do
4:     Use PGD-10 to generate adversarial examples $\tilde{x}_i$ at the given perturbation budget $\epsilon$ for $x_i$;
5:     Forward-pass $x_i$ and $\tilde{x}_i$ through $f_\theta(\cdot)$ and obtain $f_\theta(x_i)$ and $f_\theta(\tilde{x}_i)$;
6:     Select adversarial examples according to Equation (8);
7:   end for
8:   Calculate $L_{all}$ using Equation (11) and optimize $\theta$;
9: until training converged

where $m$ is the number of the selected data $X'$ and $L_{cos}$ is the cosine-similarity-based loss function, i.e., $L_{cos} = \|1 - \mathrm{sim}(a, b)\|_1$, where $\mathrm{sim}(\cdot,\cdot)$ denotes the cosine similarity measure. The optimization strategy above can be exploited together with the adversarial training manner $L_{adv}(\theta)$, for which we use the cross-entropy loss between the adversarial outputs and the ground-truth labels: $L_{adv}(\theta) = -\frac{1}{n}\sum_{i=1}^{n}\left[y_i \cdot \log_2\big(\sigma(h_\theta(\tilde{x}_i))\big)\right]$, where $n$ is the number of training samples and $\sigma$ denotes the softmax function. The overall loss function for training the target model is as follows ($\alpha$ is a trade-off hyperparameter):

$$L_{all}(\theta) = L_{adv}(\theta) + \alpha \cdot L_{mi}(\theta). \qquad (11)$$
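As an illustration of how Eq. (10) could be computed for one batch, the following PyTorch-style sketch treats the estimator outputs as per-sample vectors; the function and argument names are ours, and this is a hedged sketch rather than the authors' released code.

```python
import torch.nn.functional as F

def mi_loss(e_nat, e_adv, f_nat, f_adv):
    """Sketch of L_mi in Eq. (10) on the selected pairs of one batch.

    e_nat, e_adv : natural and adversarial CE2 estimation networks
    f_nat, f_adv : target-model features f_theta(x_i) and f_theta(x~_i), shape (m, d)
    """
    def cos_loss(a, b):                                   # L_cos = 1 - cosine similarity
        return (1.0 - F.cosine_similarity(a, b, dim=-1)).mean()

    loss = cos_loss(e_nat(f_adv), e_nat(f_nat))           # align natural-MI estimates
    loss = loss + cos_loss(e_adv(f_adv), e_adv(f_nat))    # align adversarial-MI estimates
    loss = loss + (e_adv(f_adv) - e_nat(f_adv)).mean()    # suppress adversarial MI, favour natural MI
    return loss
```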

We perform adversarial training through two main procedures: generating adversarial examples and optimizing the target model parameters. The details of the overall procedure are presented in Algorithm 1.
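For reference, a minimal L-infinity PGD sketch of the adversarial-example generation step used in Algorithm 1 (PGD-10 with step size epsilon/4, following the settings reported in Sect. 4.1); it is a generic implementation, not the authors' code.

```python
import torch

def pgd_linf(model, x, y, eps=8 / 255, steps=10,
             loss_fn=torch.nn.CrossEntropyLoss()):
    """Generic L_inf PGD attack with random start and epsilon-ball projection."""
    alpha = eps / 4
    x_adv = (x + torch.empty_like(x).uniform_(-eps, eps)).clamp(0, 1).detach()
    for _ in range(steps):
        x_adv.requires_grad_(True)
        loss = loss_fn(model(x_adv), y)
        grad = torch.autograd.grad(loss, x_adv)[0]
        x_adv = x_adv.detach() + alpha * grad.sign()
        # project back into the epsilon-ball around x and the valid pixel range
        x_adv = torch.min(torch.max(x_adv, x - eps), x + eps).clamp(0, 1)
    return x_adv.detach()
```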

4 Experiments

4.1 Experiment Setups

Datasets. The effectiveness of our defense algorithm is verified on two popular benchmark datasets, i.e., CIFAR-10 and CIFAR-100. CIFAR-10 has 10 classes of images, including 50,000 training images and 10,000 test images. CIFAR-100 has 100 classes of images, including 50,000 training images and 10,000 test images.
Model Architectures. We use a ResNet-18 as the target model for all the datasets. For the MI estimation network, we utilize the same neural network as MIAT [40]. The estimation networks for the natural MI and the adversarial MI have the same model architectures.
Attack Settings. Adversarial data for evaluating defense models is crafted by applying state-of-the-art attacks. These attacks are divided into two categories:


$L_\infty$-norm attacks and $L_2$-norm attacks. The $L_\infty$-norm attacks include PGD [17], AA [3] and FWA [36]. The $L_2$-norm attacks include PGD, CW [2] and DDN [27]. Among them, the AA attack algorithm integrates three non-targeted attacks and a targeted attack; the other attack algorithms are utilized as non-targeted attacks. The iteration number of PGD and FWA is set to 40 with step size $\epsilon/4$. The iteration number of CW and DDN is set to 20 with step size 0.01. For CIFAR-10 and CIFAR-100, the perturbation budgets for $L_\infty$-norm attacks and $L_2$-norm attacks are $\epsilon = 8/255$ and $\epsilon = 0.5$ respectively.
Defense Settings. The adversarial training data for the $L_\infty$-norm and $L_2$-norm settings is generated by using $L_\infty$-norm PGD-10 and $L_2$-norm PGD-10 respectively. The step size is $\epsilon/4$ and the perturbation budget is 8/255 and 0.5 respectively. The epoch number is set to 100. For fair comparisons, all the methods are trained using SGD with momentum 0.9, weight decay $2 \times 10^{-4}$, batch size 128 and an initial learning rate of 0.1, which is divided by 10 at the 75th and 90th epoch. In addition, we set $\alpha = 5$ for our default algorithm.

4.2 Effectiveness of MI Estimator

To gain a better understanding of CE2, we compare it with existing estimators in a standard synthetic setting based on correlated Gaussians. Following Poole et al. [25], $X, Y \in \mathbb{R}^d$ are random variables, where $(X_i, Y_i)$ are standard normal with correlation $\rho \in [-1, 1]$. It can be checked that $I(X, Y) = -(d/2)\ln(1-\rho^2)$. The following lower-bound estimators are compared: MINE [1], NWJ [20], and CPC [22]. We refer the reader to Poole et al. for a detailed exposition of these estimators. The results are presented in Fig. 2.
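The closed form above can be checked numerically; the sketch below draws correlated Gaussians and computes a simple copula-based MI estimate via rank (probability-integral) transforms and normal scores. This Gaussian-copula shortcut is our own simplification for this synthetic setting (working in nats), not the full CE2 procedure.

```python
import numpy as np
from scipy.stats import rankdata, norm

def true_mi(d, rho):
    """Closed-form MI for d independent pairs of standard normals with correlation rho."""
    return -(d / 2.0) * np.log(1.0 - rho ** 2)

def gaussian_copula_mi(x, y):
    """Per-dimension rank transform -> normal scores -> Gaussian MI, summed over dims."""
    n, d = x.shape
    mi = 0.0
    for k in range(d):
        u = rankdata(x[:, k]) / (n + 1.0)      # empirical CDF values (copula coordinates)
        v = rankdata(y[:, k]) / (n + 1.0)
        zu, zv = norm.ppf(u), norm.ppf(v)      # normal scores
        r = np.corrcoef(zu, zv)[0, 1]
        mi += -0.5 * np.log(1.0 - r ** 2)
    return mi

rng = np.random.default_rng(0)
d, rho, n = 20, 0.8, 5000
x = rng.standard_normal((n, d))
y = rho * x + np.sqrt(1 - rho ** 2) * rng.standard_normal((n, d))
print(true_mi(d, rho), gaussian_copula_mi(x, y))
```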

Fig. 2. The performance of MI estimation in different settings: (a) shows how well CE2 fits the true MI when d is 128 and ρ ranges from -1.0 to 1.0; (b) compares the deviations of the four estimators from the true MI when ρ is 0.8 and d ranges from 2.0 to 50.0.


Table 1. MI values estimated by four estimators, when the true MI is known, the dimension is 4, 12, 23, 34 and 45. dim True CE2 4 12

2.04

3.02

6.13

MINE NWJ 4.17

CPC

2.81

5.77

5.09

9.51

6.95

4.47

23

11.75 10.94

10.11

34

17.37 17.49 18.75

22.38

38.33

45

22.99 24.26 16.02

19.46

31.25

11.77 18.52

Based on the experimental results in Table 1, CE2 can estimate and fit the true MI in high dimensions precisely. The performances of the four estimators can be contrasted by calculating the standard deviations as a measure of variance and the means as a measure of bias. A smaller standard deviation indicates greater precision and stability in the estimation method, while a larger standard deviation suggests higher estimation errors and variability. Therefore, based on the standard deviation, CE2 performs better with smaller estimation error and variance. A smaller mean indicates lower bias, meaning the estimation is closer to the true value on average. From the perspective of the mean, CE2 exhibits the lowest bias and is closest to the true value. Considering precision, variance and bias, CE2 performs well with smaller variance and bias compared to the other methods.

4.3 Robustness Evaluation and Analysis

To demonstrate the effectiveness of our adversarial defense algorithm, we evaluate the adversarial accuracy under white-box and black-box adversarial attacks, respectively, and compare the results with three representative adversarial training methods, i.e., Standard AT [17], MART [35] and MIAT [40]. White-Box Attacks. In the white-box settings, all attacks can access the architectures and parameters of target models. We evaluate the robustness by exploiting five types of adversarial attacks for CIFAR-10 and CIFAR-100: $L_\infty$-norm PGD, AA, FWA attacks and $L_2$-norm PGD, CW, DDN attacks. The average natural accuracy and the average adversarial accuracy of the defenses are shown in Table 2, where the most successful defense is shown in bold. The results show that our method (CE2) can achieve better robustness compared with MIAT. The performance of CE2 on the natural accuracy is competitive (83.86% vs. 83.41% on CIFAR-10, 62.35% vs. 61.43% on CIFAR-100), and it provides more gains on adversarial accuracy (e.g., 6.60% against CW on CIFAR-10, 5.14% against DDN on CIFAR-100). Compared with MART and Standard AT, the results show that our proposed MI estimator can estimate and maximize mutual information more precisely and helps improve adversarial robustness.


Table 2. Adversarial accuracy (%) of defense methods against white-box attacks. dataset

Method

L∞ -norm Attack None

CIFAR-10

L2 -norm Attack

PGD-40 AA

FWA-40 None

PGD-40 CW

Standard AT 83.39

42.38

39.01

15.44

83.97

61.69

30.96

29.34

MART

78.21

50.23

43.96

25.56

81.29

58.36

28.41

27.13

MIAT

83.41

44.79

39.26

15.67

84.35

62.38

34.48

32.41

CE2

83.86 49.99

CIFAR-100 Standard AT 58.03 22.20 MART

53.40

31.34

46.68 15.83

88.04 67.35

20.62

10.20

62.29

22.03

12.27

62.78 43.34

11.70

MIAT

56.01

26.24

21.71

CE2

56.82

31.20

22.09 12.67

36.23

DDN

41.08 36.45 18.52

16.68

15.02

22.16 12.57

61.43

38.29

14.29

62.35

36.23

20.24 17.71

Black-Box Attacks. Block-box adversarial examples are crafted by attacking a surrogate model. We use a VGG-19 as the surrogate model. The surrogate models and defense models are trained separately. We use Standard AT method to train the surrogate model and use several attacks to generate adversarial test data in CIFAR-100. The performance of our defense method is reported in Table 3. Among the defense methods, CE2 demonstrates competitive performance against Standard AT, MART, and MIAT. Specifically, CE2 achieves 2.53% adversarial accuracy higher than MART against FWA attack and a 1.34% adversarial accuracy higher than MIAT against PGD-L2 . Table 3. Adversarial accuracy (%) of defense methods against black-box attacks on CIFAR-100. Method

5

L∞ -norm Attack None PGD-40 AA

L2 -norm Attack FWA-40 None PGD-40 CW

DDN

Standard AT 58.03 43.56

50.11

32.01

62.29

53.04

56.64

55.23

MART

53.40

40.10

46.67

30.12

62.78 53.47

56.14

57.08

MIAT

56.01

41.98

50.18

31.53

61.43

52.21

55.18

55.08

CE2

56.82

42.16

50.28 32.65

62.75

53.55

56.78 55.91

Conclusion

In this paper, we propose CE2 , a copula entropic MI estimator that effectively balances variance and bias of estimates, even in high-dimensional data. By capturing stable and reliable MI from both clean and adversarial examples, our defense method based on CE2 enhances the target model’s robustness against adversarial attacks. The empirical results demonstrate the effectiveness of our approach in enhancing adversarial robustness against multiple attacks. In future, we will focus on developing more efficient methods for estimating MI and optimize CE2 further to enhancing its performance against stronger attacks.

CE2

173

References 1. Belghazi, M.I., et al.: Mutual information neural estimation. In: ICML. Proceedings of Machine Learning Research, vol. 80, pp. 530–539. PMLR (2018) 2. Carlini, N., Wagner, D.A.: Towards evaluating the robustness of neural networks. In: IEEE Symposium on Security and Privacy, pp. 39–57. IEEE Computer Society (2017) 3. Croce, F., Hein, M.: Reliable evaluation of adversarial robustness with an ensemble of diverse parameter-free attacks. In: ICML. Proceedings of Machine Learning Research, vol. 119, pp. 2206–2216. PMLR (2020) 4. Ding, G.W., Lui, K.Y.C., Jin, X., Wang, L., Huang, R.: On the sensitivity of adversarial robustness to input data distributions. In: ICLR (Poster). OpenReview.net (2019) 5. Duan, Y., Lu, J., Zheng, W., Zhou, J.: Deep adversarial metric learning. IEEE Trans. Image Process. 29, 2037–2051 (2020) 6. Goodfellow, I.J., Shlens, J., Szegedy, C.: Explaining and harnessing adversarial examples. In: ICLR (Poster) (2015) 7. Hu, C., Li, Y., Feng, Z., Wu, X.: Attention-guided evolutionary attack with elasticnet regularization on face recognition. Pattern Recogn. 109760 (2023) 8. Hu, C., Xu, H.Q., Wu, X.J.: Substitute meta-learning for black-box adversarial attack. IEEE Sig. Process. Lett. 29, 2472–2476 (2022). https://doi.org/10.1109/ LSP.2022.3226118 9. Ilyas, A., Santurkar, S., Tsipras, D., Engstrom, L., Tran, B., Madry, A.: Adversarial examples are not bugs, they are features. In: NeurIPS, pp. 125–136 (2019) 10. Krizhevsky, A., Sutskever, I., Hinton, G.E.: Imagenet classification with deep convolutional neural networks. In: NIPS, pp. 1106–1114 (2012) 11. Li, H., Wu, X., Kittler, J.: MDLatLRR: a novel decomposition method for infrared and visible image fusion. IEEE Trans. Image Process. 29, 4733–4746 (2020) 12. Li, X., Wang, W., Hu, X., Yang, J.: Selective kernel networks. In: CVPR, pp. 510–519. Computer Vision Foundation/IEEE (2019) 13. Liao, F., Liang, M., Dong, Y., Pang, T., Hu, X., Zhu, J.: Defense against adversarial attacks using high-level representation guided denoiser. In: CVPR, pp. 1778–1787. Computer Vision Foundation/IEEE Computer Society (2018) 14. Lin, S., et al.: Towards optimal structured CNN pruning via generative adversarial learning. In: CVPR, pp. 2790–2799. Computer Vision Foundation/IEEE (2019) 15. Linsker, R.: Self-organization in a perceptual network. Computer 21(3), 105–117 (1988) 16. Ma, J., Sun, Z.: Mutual information is copula entropy. CoRR abs/0808.0845 (2008) 17. Madry, A., Makelov, A., Schmidt, L., Tsipras, D., Vladu, A.: Towards deep learning models resistant to adversarial attacks. In: ICLR (Poster). OpenReview.net (2018) 18. McAllester, D., Stratos, K.: Formal limitations on the measurement of mutual information. In: AISTATS. Proceedings of Machine Learning Research, vol. 108, pp. 875–884. PMLR (2020) 19. Naseer, M., Khan, S.H., Hayat, M., Khan, F.S., Porikli, F.: A self-supervised approach for adversarial robustness. In: CVPR, pp. 259–268. Computer Vision Foundation/IEEE (2020) 20. Nguyen, X., Wainwright, M.J., Jordan, M.I.: Estimating divergence functionals and the likelihood ratio by convex risk minimization. IEEE Trans. Inf. Theory 56(11), 5847–5861 (2010)

174

L. Liu et al.

21. Noshad, M., Zeng, Y., III, A.O.H.: Scalable mutual information estimation using dependence graphs. In: ICASSP, pp. 2962–2966. IEEE (2019) 22. van den Oord, A., Li, Y., Vinyals, O.: Representation learning with contrastive predictive coding. CoRR abs/1807.03748 (2018) 23. Paninski, L.: Estimation of entropy and mutual information. Neural Comput. 15(6), 1191–1253 (2003) 24. Papernot, N., McDaniel, P.D., Jha, S., Fredrikson, M., Celik, Z.B., Swami, A.: The limitations of deep learning in adversarial settings. In: EuroS&P, pp. 372–387. IEEE (2016) 25. Poole, B., Ozair, S., van den Oord, A., Alemi, A.A., Tucker, G.: On variational bounds of mutual information. In: ICML. Proceedings of Machine Learning Research, vol. 97, pp. 5171–5180. PMLR (2019) 26. Ravens, B.: An introduction to copulas. Technometrics 42(3), 317 (2000) 27. Rony, J., Hafemann, L.G., Oliveira, L.S., Ayed, I.B., Sabourin, R., Granger, E.: Decoupling direction and norm for efficient gradient-based L2 adversarial attacks and defenses. In: CVPR, pp. 4322–4330. Computer Vision Foundation/IEEE (2019) 28. Rubenstein, P.K., Bousquet, O., Djolonga, J., Riquelme, C., Tolstikhin, I.O.: Practical and consistent estimation of f-divergences. In: NeurIPS, pp. 4072–4082 (2019) 29. Shi, Y., Liao, B., Chen, G., Liu, Y., Cheng, M., Feng, J.: Understanding adversarial behavior of DNNs by disentangling non-robust and robust components in performance metric. CoRR abs/1906.02494 (2019) 30. Sklar, M.J.: Fonctions de repartition a n dimensions et Leurs Marges (1959) 31. Song, J., Ermon, S.: Understanding the limitations of variational mutual information estimators. In: ICLR. OpenReview.net (2020) 32. Tian, C., Xu, Y., Li, Z., Zuo, W., Fei, L., Liu, H.: Attention-guided CNN for image denoising. Neural Netw. 124, 117–129 (2020) 33. Tong, J., Chen, T., Wang, Q., Yao, Y.: Few-shot object detection via understanding convolution and attention. In: Yu, S., et al. (eds.) PRCV 2022. LNCS, vol. 13534, pp. 674–687. Springer, Cham (2022). https://doi.org/10.1007/978-3-03118907-4 52 34. Wang, M., Deng, W.: Deep face recognition: a survey. Neurocomputing 429, 215– 244 (2021) 35. Wang, Y., Zou, D., Yi, J., Bailey, J., Ma, X., Gu, Q.: Improving adversarial robustness requires revisiting misclassified examples. In: ICLR. OpenReview.net (2020) 36. Wu, K., Wang, A.H., Yu, Y.: Stronger and faster Wasserstein adversarial attacks. In: ICML. Proceedings of Machine Learning Research, vol. 119, pp. 10377–10387. PMLR (2020) 37. Zhang, H., Yu, Y., Jiao, J., Xing, E.P., Ghaoui, L.E., Jordan, M.I.: Theoretically principled trade-off between robustness and accuracy. In: ICML. Proceedings of Machine Learning Research, vol. 97, pp. 7472–7482. PMLR (2019) 38. Zhang, W., Gou, Y., Jiang, Y., Zhang, Y.: Adversarial VAE with normalizing flows for multi-dimensional classification. In: Yu, S., et al. (eds.) PRCV 2022. LNCS, vol. 13534, pp. 205–219. Springer, Cham (2022). https://doi.org/10.1007/978-3031-18907-4 16 39. Zhao, Z., Zheng, P., Xu, S., Wu, X.: Object detection with deep learning: a review. IEEE Trans. Neural Netw. Learn. Syst. 30(11), 3212–3232 (2019) 40. Zhou, D., et al.: Improving adversarial robustness via mutual information estimation. In: ICML. Proceedings of Machine Learning Research, vol. 162, pp. 27338– 27352. PMLR (2022)

Two-Step Projection of Sparse Discrimination Between Classes for Unsupervised Domain Adaptation

Jianhong Xie

and Lu Liang(B)

Guangdong University of Technology, Guangzhou, China [email protected]

Abstract. In the past few years, researchers have developed domain adaptive (DA) techniques which aim to address the domain shift between training and testing sets. However, most existing unsupervised domain adaptation techniques only use a projection matrix to train the classifier, which do not impose any restrictions on the same class samples. In this paper, we introduce a novel approach to unsupervised domain adaptation, called two-step projection of sparse discrimination between classes for unsupervised domain adaptation (TSPSDC). Unlike existing methods that use a single matrix for classifier learning, TSPSDC leverages twostep projection learning and integrates inter-class sparsity constraints to extract domain-invariant features from the class level. Specifically, 1) aiming to mitigate any adverse consequences arising from domain shift, distribution alignments are implemented to decrease the distribution disparity between the source and target domains in the shared subspace. 2) Our approach involves integrating two types of regularization: manifold regularization and inter-class sparsity regularization. The feature representation exhibits the same row sparsity structure within each class. Simultaneously, the distance between similar classes is minimized as a result. The resulting feature representation is thus advantageous for achieving highly discriminative representations. Extensive experimental results on three different sets of data demonstrate that the proposed approach outperforms many contemporary techniques for domain adaptation. Keywords: Domain adaptation · Domain shift · Distribution alignment

1 Introduction Traditional statistical machine learning algorithms are based on the assumption that sample data are independently and identically distributed [1]. However, there are not many scenarios that satisfy the independent identical distribution in nature, but more scenarios with different distributions. Traditional machine learning methods cannot be applied well in scenarios with different distributions. In the field of computer vision, this kind of problem is called domain shift. Figure 1 shows some images with the same semantic information but different distributions. It can be seen from Fig. 1 that even though the images belong to the same class, they have different distributions. In this © The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2024 Q. Liu et al. (Eds.): PRCV 2023, LNCS 14428, pp. 175–186, 2024. https://doi.org/10.1007/978-981-99-8462-6_15

176

J. Xie and L. Liang

case, if these images are used to train a classifier, it is conceivable that the effectiveness of the trained classifier demonstrated on other images will be greatly reduced. To solve such problems, an effective way is usually to extract enough label information from the specified domain and then use this label information to train the classifier [2–4]. However, it is known that obtaining labeled samples is very time-consuming and laborintensive. An effective model can be trained alternatively from a source domain with the same task as the target domain. There should be enough labeling information to guide the generation of classifiers in the target domain. The challenge of this approach is to overcome the domain bias problem to achieve cross-domain knowledge transfer.

Fig. 1. Several images with the same semantic information have different domains.

In order to solve the above problem, numerous unsupervised domain adaptation (UDA) algorithms have been proposed in the past decades. Generally speaking, UDA algorithms can be categorized into two types, instance-based adaptation and featurebased adaptation [5, 6]. In order to minimize the distribution dissimilarity between the source and target domains, instance-based adaptation algorithms typically adjust the sample weights in the source domain. Feature-based adaptation algorithms are more flexible. They strive to find a general representation that renders the two domains more similar. This facilitates classification via conventional machine learning algorithms. Despite the proven effectiveness of current UDA algorithms, there remain three issues with the existing methods. (1) Most existing methods assume that there exists a common subspace in which the source and target domains can have the same data distribution. To learn the classifier, they often use only one projection matrix to transform the data. In fact, it is not reasonable to use only one projection matrix. When data is acquired through a single projection matrix transformation, the resulting data appears quite coarse. A simple one-step projection will bring a lot of information compression, which cannot fully utilize the labeled data in the source domain. This greatly reduces the performance of the classifier. (2) Many existing approaches focus on domain alignment solely from either the statistical or geometrical aspect. They fail to incorporate both aspects together to obtain a better alignment of the domain distribution. Statistical attributes and geometric arrangements complement each other in reality. Utilizing them simultaneously produces superior results compared to using them independently. (3)

Two-Step Projection of Sparse Discrimination

177

Prior methods had primarily concentrated on acquiring local discriminative information to learn discriminative features. They disregarded global discriminative information at the class-level. The occurrence known as class confusion serves as a reminder to take into account global discriminative information. To overcome these issues, in this paper, we propose a novel method, called Two-step projection of sparse discrimination between classes (TSPSDC) for unsupervised domain adaptation, which can effectively solve the above issues, as illustrated in Fig. 2. Extensive experiments were conducted on cross-domain object to demonstrate the effectiveness of the proposed TSPSDC. The main contributions of this paper can be summarized as follows: (1) To address the problem of limited performance of classifiers trained by one projection matrix, we propose a two-step projection method to make the acquisition of a common subspace more reasonable and robust. (2) By considering both statistical and geometric properties in the domain distribution, we align for the conditional and marginal distributions and use manifold regularization, which allows for a more efficient use of the geometric structure of the domain distribution. (3) To attain the discriminative information of the class-level, a regularization technique referred to as inter-class sparsity is incorporated in the training process. This regularization enforces each class to have an identical row sparsity structure in the learned feature representation. The rest of the paper is organized as follows. Section 2 provides an overview of relevant research. Our framework is outlined in Sect. 3. The comparison results obtained from actual data sets are presented in Sect. 4. Ultimately, pertinent conclusions are drawn in Sect. 5.

Fig. 2. Flow chart of our method. First, a two-step projection transformation matrix is used and after the learned transformation matrix, distribution alignment and manifold regularization are performed on the common subspace. Finally, row sparsity is achieved on the label space.

178

J. Xie and L. Liang

2 Related Work In this section, we discuss prior research in domain adaptation and relevant methods for this paper. 2.1 Distribution Matching MMD (Maximum Mean Discrepancy) is commonly employed to quantify distribution dissimilarity. Long et al. introduced JDA (Joint Distribution Adaptation) [13], a method that addresses both conditional and marginal distribution. JDA focuses on discovering a shared feature space while neglecting diverse conditional relationships. Additionally, Long et al. introduced a TJM [8] approach to tackle distribution mismatch through sample selection. JGSA (Joint Geometrical and Statistical Alignment) [7] is proposed to enhance within-class compactness and between-class separability. Wang et al. extend JDA with the introduction of BDA (Balanced Distribution Adaptation), which bridges the gap between conditional and marginal distribution. The DICD (Domain Invariant and Class Discriminative) [9] method extracts discriminative information at the class level from both the source and target domains. Although DICD achieves satisfactory performance, it lacks utilization of structural information. Zhang and Wu introduced JPDA [10], a variant method that aims to minimize joint probability differences. 2.2 Structure Maintenance The authors of JDA also introduced manifold regularization. The purpose is to preserve the local geometric structure of the data to eliminate the differences between domain distributions. This, together with joint distribution adaptation, resulted in a new method named Adaptation regularization for transfer learning (ARTL) [14]. Later, manifold embedded distribution alignment (MEDA) [15] was proposed to improve ARTL by assessing the significance of both the marginal distribution and conditional distribution by Wang et al. Gong et al. proposed a geodesic flow kernel (GFK) [16] method. This method models the migration of domains by integrating all intermediate subspaces on the geodesic path without using a sampling strategy to model the migration of domains. These subspaces describe the changes in geometric and statistical properties between the source domain to the target domain to solve the domain adaptation problem. Tahmoresnezhad et al. proposed a Visual domain adaptation (VDA) [17] method. It introduces intra-class scatter to bring similar classes closer together. This improves the accuracy of classification. In this paper, a two-step projection classifier approximation method with unsupervised domain adaptation is proposed by combining the data center and structure-centered approaches. At the data level, this paper uses the MMD method to align the source and target domain distributions. At the structure level, class-level row sparsity and streamwise learning are combined to maintain the global and local structure of the data. The common subspace can be learned by these means. In addition, to improve the discriminative power, a classifier containing two projection matrices is learned to fit the source domain data samples to their labels.


3 Two-Step Projection of Sparse Discrimination Between Classes (TSPSDC) Method

3.1 Notation

In this paper, we focus on unsupervised domain adaptation. The data of the source domain and the target domain are denoted as (X_s, Y_s) and (X_t, Y_t), respectively, where X_s ∈ R^{m×n_s} and X_t ∈ R^{m×n_t} are the samples of the source and target domains, and Y_s ∈ R^{C×n_s} and Y_t ∈ R^{C×n_t} are the corresponding binary label matrices. Since we work on unsupervised domain adaptation, Y_t is unavailable in this work. n_s and n_t denote the numbers of source and target samples, m is the dimensionality of the original samples, and C is the number of classes. The projection matrix from the original feature space to the common subspace is denoted as A ∈ R^{m×d}, where d is the dimensionality of the common subspace, and X = [X_s, X_t] is the combination of source and target domain samples. The transformation matrix is denoted as W ∈ R^{C×d}. ||A||_p denotes the l_p-norm of a matrix.

3.2 Model Formulation

The goal of domain adaptation is to learn a domain-invariant classifier that can predict Y_t for the target domain using samples from both domains. To achieve this goal, we train a standard classifier on the source domain, which provides the basis for the subsequent transfer to the target domain. In addition, we improve the transfer ability of the classifier from three aspects: 1) distribution alignment, 2) local discriminative information, and 3) class-level structural consistency.

Structural Risk Minimization Principle. TSPSDC's initial goal is to acquire an adaptable classifier that accurately categorizes source samples. We first establish a standard classifier using the labeled source samples. Following the structural risk minimization principle, our objective is to minimize the source empirical error:

$$\min_{A,W} \|W A^T X_s - Y_s\|_F^2 + \alpha \|A\|_F^2 \quad (1)$$

It is worth noting that we use a two-step projection to map the source domain data to the labels. By doing so, we prevent loss of data information and better retain the information of the source domain; this is preferable to the structural risk minimization formulation that uses only one projection.

Distribution Alignment. Maximum Mean Discrepancy (MMD) is a kernel-based statistical test for determining whether two given distributions are the same [11]. The basic idea is as follows: for two samples with different distributions, find a continuous mapping from the sample space to a reproducing kernel Hilbert space (RKHS), compute the mean of each sample under the mapping, and compare the distance between the two means. If the distributions are the same, their means under the mapping should be close, and vice versa. Thus, MMD can be used as a measure of the distance between two distributions.
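As a concrete illustration of this idea (not part of the original method description), the following is a minimal NumPy sketch of the empirical MMD with an RBF kernel; the function names are illustrative.

```python
import numpy as np

def rbf_kernel(A, B, sigma=1.0):
    # Pairwise RBF kernel values between rows of A (n x d) and B (m x d).
    d2 = np.sum(A**2, 1)[:, None] + np.sum(B**2, 1)[None, :] - 2 * A @ B.T
    return np.exp(-d2 / (2 * sigma**2))

def mmd2(Xs, Xt, sigma=1.0):
    # Squared MMD: mean k(s,s') + mean k(t,t') - 2 * mean k(s,t).
    return (rbf_kernel(Xs, Xs, sigma).mean()
            + rbf_kernel(Xt, Xt, sigma).mean()
            - 2 * rbf_kernel(Xs, Xt, sigma).mean())

# Identical distributions give a value near zero; shifted ones do not.
rng = np.random.default_rng(0)
print(mmd2(rng.normal(0, 1, (200, 8)), rng.normal(0, 1, (200, 8))))
print(mmd2(rng.normal(0, 1, (200, 8)), rng.normal(2, 1, (200, 8))))
```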


For marginal distribution alignment, we project all data into the Hilbert space and then pull the two domains closer:

$$\min_A \Big\| \frac{1}{n_s}\sum_{i=1}^{n_s} A^T x_i - \frac{1}{n_t}\sum_{j=1}^{n_t} A^T x_j \Big\|_F^2 = \min_A \mathrm{tr}(A^T X M_0 X^T A) \quad (2)$$

where

$$M_0 = \begin{bmatrix} \frac{1}{n_s^2}\mathbf{1}_{n_s\times n_s} & -\frac{1}{n_s n_t}\mathbf{1}_{n_s\times n_t} \\ -\frac{1}{n_s n_t}\mathbf{1}_{n_t\times n_s} & \frac{1}{n_t^2}\mathbf{1}_{n_t\times n_t} \end{bmatrix} \quad (3)$$

For conditional distribution alignment, we project the data class by class into the Hilbert space and then pull them closer:

$$\min_A \sum_{k=1}^{C} \Big\| \frac{1}{n_s^{(k)}}\sum_{x_i\in D_s^{(k)}} A^T x_i - \frac{1}{n_t^{(k)}}\sum_{x_j\in D_t^{(k)}} A^T x_j \Big\|_F^2 = \min_A \sum_{k=1}^{C} \mathrm{tr}(A^T X M_k X^T A) \quad (4)$$

where

$$(M_k)_{ij} = \begin{cases} \frac{1}{(n_s^{(k)})^2}, & \text{if } x_i, x_j \in D_s^{(k)};\\[2pt] \frac{1}{(n_t^{(k)})^2}, & \text{if } x_i, x_j \in D_t^{(k)};\\[2pt] -\frac{1}{n_s^{(k)} n_t^{(k)}}, & \text{if } x_i \in D_s^{(k)}, x_j \in D_t^{(k)} \text{ or } x_i \in D_t^{(k)}, x_j \in D_s^{(k)};\\[2pt] 0, & \text{otherwise.} \end{cases} \quad (5)$$

We can combine (2) and (4) as follows:

$$\min_A \mathrm{tr}(A^T X M_0 X^T A) + \min_A \sum_{k=1}^{C} \mathrm{tr}(A^T X M_k X^T A) = \min_A \sum_{k=0}^{C} \mathrm{tr}(A^T X M_k X^T A) \quad (6)$$
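For concreteness, here is a small NumPy sketch of how the MMD matrices of Eqs. (3) and (5) can be assembled, assuming samples are ordered as X = [X_s, X_t] and pseudo-labels are available for the target domain; the function and variable names are illustrative, not the authors' code.

```python
import numpy as np

def build_M0(ns, nt):
    # Marginal MMD matrix of Eq. (3): outer product of the signed weight vector.
    e = np.concatenate([np.full(ns, 1.0 / ns), np.full(nt, -1.0 / nt)])
    return np.outer(e, e)

def build_Mk(ys, yt_pseudo, k):
    # Class-conditional MMD matrix of Eq. (5) for class k.
    ns, nt = len(ys), len(yt_pseudo)
    e = np.zeros(ns + nt)
    idx_s = np.where(ys == k)[0]
    idx_t = np.where(yt_pseudo == k)[0] + ns
    if len(idx_s):
        e[idx_s] = 1.0 / len(idx_s)
    if len(idx_t):
        e[idx_t] = -1.0 / len(idx_t)
    return np.outer(e, e)
```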

Local Discriminative Information. Discriminability is a key criterion that characterizes the superiority of feature representations to enable domain adaptations. Manifold regularization is a widely used method to extract local discriminative features. According to the manifold assumption, if two points are close in the intrinsic geometry, then the corresponding labels are similar [19]. Following this assumption, the manifold regularization is computed as

$$\min_A \sum_{i,j=1}^{n_s+n_t} W_{ij}\,\|A^T x_i - A^T x_j\|_2^2 = \min_A \mathrm{tr}(A^T X L X^T A) \quad (7)$$

where W_ij is the graph affinity between x_i and x_j, L = I − G^{-1/2} W G^{-1/2} is the graph Laplacian matrix, G is a diagonal matrix with G_ii = Σ_{j=1}^{n_s+n_t} W_ij, and W is defined as

$$W_{ij} = \begin{cases} \cos(x_i, x_j), & \text{if } x_i \in kNN(x_j) \text{ or } x_j \in kNN(x_i);\\ 0, & \text{otherwise,} \end{cases} \quad (8)$$

where kNN(x_i) is the set of k-nearest neighbors of x_i.
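Below is a minimal NumPy sketch of the cosine kNN affinity graph of Eq. (8) and the normalized Laplacian used in Eq. (7), assuming one sample per column of X; the function name is an illustrative choice, not the authors' code.

```python
import numpy as np

def normalized_laplacian(X, k=5):
    # X: m x n feature matrix with one sample per column.
    Xn = X / (np.linalg.norm(X, axis=0, keepdims=True) + 1e-12)
    S = Xn.T @ Xn                            # cosine similarity between samples
    n = S.shape[0]
    W = np.zeros((n, n))
    for i in range(n):
        nn = np.argsort(-S[i])[1:k + 1]      # k nearest neighbours, skipping self
        W[i, nn] = S[i, nn]
    W = np.maximum(W, W.T)                   # keep edge if i in kNN(j) or j in kNN(i)
    deg = np.maximum(W.sum(axis=1), 1e-12)
    G_inv_sqrt = np.diag(1.0 / np.sqrt(deg))
    return np.eye(n) - G_inv_sqrt @ W @ G_inv_sqrt   # L = I - G^{-1/2} W G^{-1/2}
```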


Structural Consistency of Class Level. Studies suggest that enforcing structural consistency can align the data of the source and target domains in the feature space or decision space, which reduces the domain gap and enhances the classifier's performance in the target domain [20]. The expression is as follows:

$$\sum_{c=1}^{C} \|W A^T X_s^c\|_{2,1} \quad (9)$$

Introducing the above term makes the transformed features W A^T X_s share a common sparsity structure within each class. Since the labels Y_s possess strong class-sparsity characteristics that provide inherent distinctiveness among samples, we exploit them for classification.

3.3 Overall Objective and Optimization Algorithm

By combining (1), (6), (7) and (9), the final objective function of the proposed method is obtained as follows:

$$\min_{A,W} \|W A^T X_s - Y_s\|_F^2 + \alpha\|A\|_F^2 + \sum_{k=0}^{C}\beta\,\mathrm{tr}(A^T X M_k X^T A) + \gamma\,\mathrm{tr}(A^T X L X^T A) + \sum_{c=1}^{C}\|W A^T X_s^c\|_{2,1} \quad \text{s.t. } W^T W = I \quad (10)$$

The introduction of the constraint function in our model serves the purpose of preventing trivial solutions. 1) Update A: To make the optimization problem separable, an extra variable Q is introduced:



$$\min_A \alpha\|A\|_F^2 + \beta\,\mathrm{tr}(A^T X M X^T A) + \gamma\,\mathrm{tr}(A^T X L X^T A) \quad \text{s.t. } Q = W A^T \quad (11)$$

For solving A, the other variables are fixed and the following convex model is obtained:

$$\min_A \alpha\|A\|_F^2 + \beta\,\mathrm{tr}(A^T X M X^T A) + \gamma\,\mathrm{tr}(A^T X L X^T A) + \delta\|Q - W A^T\|_F^2 \quad (12)$$

Taking the derivative of (12) with respect to A and setting it to zero, we have

$$\big[(\alpha+\delta)I + X(\beta M + \gamma L)X^T\big]\,A = \delta Q^T W \quad (13)$$

$$A = \big[(\alpha+\delta)I + X(\beta M + \gamma L)X^T\big]^{-1}\,\delta Q^T W \quad (14)$$


2) Update W: Fixing variables Q and A, variable W can be obtained by minimizing

$$\min_W \|Q - W A^T\|_F^2 \quad \text{s.t. } W^T W = I \quad (15)$$

Expanding (15) under the orthogonality constraint, the problem is equivalent to

$$\max_W \mathrm{tr}(Q A W^T) \quad (16)$$

We denote B = QA. Under the orthogonality constraint on W, we perform a singular value decomposition of B and obtain the solution

$$W = U V^T \quad (17)$$

where U and V are the matrices of left and right singular vectors of B.

3) Update Q: By fixing variables A and W, Q can be obtained by minimizing the following objective:

$$\min_Q \|Q X_s - Y_s\|_F^2 + \sum_{c=1}^{C}\|Q X_s^c\|_{2,1} \quad \text{s.t. } Q = W A^T \quad (18)$$

Let D^c = Q X_s^c; the problem is equivalent to

$$\min_Q \|Q X_s - Y_s\|_F^2 + \sum_{c=1}^{C}\|D^c\|_{2,1} \quad \text{s.t. } Q = W A^T,\ D^c = Q X_s^c \quad (19)$$

Then, the above equation reduces to solving C subproblems on D^c:

$$\min_{D^c} \mu\|D^c\|_{2,1} + \lambda\|D^c - Q X_s^c\|_F^2 \quad (20)$$

For D^c, referring to the theorems and optimization in the literature [21–23], we obtain

$$[D^c]_{:,i} = \begin{cases} \dfrac{\|[Q X_s^c]_{:,i}\|_2 - \frac{\mu}{\lambda}}{\|[Q X_s^c]_{:,i}\|_2}\,[Q X_s^c]_{:,i}, & \text{if } \|[Q X_s^c]_{:,i}\|_2 > \frac{\mu}{\lambda};\\[4pt] 0, & \text{otherwise.} \end{cases} \quad (21)$$

For Q, we have

$$\min_Q \|Q X_s - Y_s\|_F^2 + \sum_{c=1}^{C}\|D^c - Q X_s^c\|_F^2 + \|Q - W A^T\|_F^2 \quad (22)$$


At last, we get the formula for Q:

$$Q = \Big(Y_s X_s^T + \sum_{c=1}^{C} D^c (X_s^c)^T + W A^T\Big)\Big(X_s X_s^T + \sum_{c=1}^{C} X_s^c (X_s^c)^T + I\Big)^{-1} \quad (23)$$

The proposed algorithm is summarized in Algorithm 1.

Algorithm 1 Two-step projection of sparse discrimination between classes (TSPSDC)
Input: Data X = [X_s, X_t], source domain labels Y_s, subspace dimension d = 20, maximum number of iterations T = 20, regularization parameters α, β, γ, δ
Output: Q, W, A
While not converged do
  1: Update A by using (14);
  2: Update W by using (17);
  3: Update Q by using (23);
End while
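To make the alternating updates concrete, the following is a minimal NumPy sketch of one possible realisation of Algorithm 1, assuming the aggregated MMD matrix M (Eq. 6) and the Laplacian L (Eq. 7) have been precomputed (e.g. with the helpers above); function and variable names, initializations, and the μ, λ defaults are illustrative assumptions, not the authors' implementation.

```python
import numpy as np

def shrink_l21(B, tau):
    # Column-wise l2,1 shrinkage used to solve the D^c subproblems (Eq. 21).
    norms = np.linalg.norm(B, axis=0, keepdims=True)
    return B * np.maximum(norms - tau, 0.0) / np.maximum(norms, 1e-12)

def tspsdc(X, Xs_list, Ys, M, L, alpha, beta, gamma, delta,
           mu=1.0, lam=1.0, d=20, iters=20, seed=0):
    """X: m x n (source + target), Xs_list: per-class source blocks X_s^c,
    Ys: C x n_s one-hot source labels, M/L: precomputed MMD and Laplacian."""
    rng = np.random.default_rng(seed)
    m, C = X.shape[0], Ys.shape[0]
    Xs = np.hstack(Xs_list)
    W = rng.standard_normal((C, d))
    Q = 0.01 * rng.standard_normal((C, m))
    K = X @ (beta * M + gamma * L) @ X.T                  # fixed part of Eq. (13)
    right = Xs @ Xs.T + sum(Xc @ Xc.T for Xc in Xs_list) + np.eye(m)
    for _ in range(iters):
        # Update A (Eq. 14).
        A = np.linalg.solve((alpha + delta) * np.eye(m) + K, delta * Q.T @ W)
        # Update W (Eq. 17): SVD of B = QA under the orthogonality constraint.
        U, _, Vt = np.linalg.svd(Q @ A, full_matrices=False)
        W = U @ Vt
        # Update D^c (Eq. 21) and then Q in closed form (Eq. 23).
        Dcs = [shrink_l21(Q @ Xc, mu / lam) for Xc in Xs_list]
        left = Ys @ Xs.T + sum(Dc @ Xc.T for Dc, Xc in zip(Dcs, Xs_list)) + W @ A.T
        Q = left @ np.linalg.inv(right)
    return A, W, Q
```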

4 Experiments

4.1 The Cross-Domain Experiment

PIE Dataset, COIL Dataset and Office + Caltech10 (Surf). In the experiment on the PIE dataset, the parameters α, β, γ and δ are set to 0.005, 0.05, 0.001 and 0.01, respectively. As evidenced by the data, the proposed method achieves higher accuracy than the comparison methods. In the experiment on the COIL dataset, the parameters α, β, γ and δ are set to 0.01, 0.01, 0.001 and 0.05, respectively; TSPSDC obtains the highest average accuracy, reaching 99.83%. In the experiment on Office + Caltech10 (surf), the parameters α, β, γ and δ are set to 0.01, 0.01, 0.001 and 0.05, respectively. Figure 3 displays the outcomes of the PIE, COIL and Office + Caltech10 (surf) experiments. As attested by the data, the proposed approach shows superior precision in comparison with the competing methods.


Fig. 3. The results of PIE dataset, COIL dataset and Office + Caltech10 (surf) dataset.


From the above experiments, we can draw the following conclusions. 1) Compared with TCA [12], JDA [13] and GFK [16], TSPSDC makes full use of geometric structure information through manifold regularization. 2) Compared with ARTL [14], VDA [17], MEDA [15] and SPDA [18], TSPSDC effectively generates discriminative features by means of the inter-class sparsity constraint. 3) Compared with ICS_RTSL [24], TSPSDC adopts a two-step projection method that prevents the loss of data information and thus better retains the information contained in the source domain samples.

4.2 Analysis of the Convergence

To demonstrate the convergence of the proposed approach, experiments were conducted on several cross-domain datasets. The outcomes are depicted in Fig. 4, from which we can draw the following conclusions. First, during the iterative process all curves exhibit an upward trend and eventually reach stability, providing evidence for the convergence of the proposed method. Second, after around ten iterations nearly all curves stabilize, which indicates that the proposed method converges quickly.


Fig. 4. The results of convergence experiments.

4.3 The Parameter Analysis

The proposed method has four parameters, namely α, β, γ and δ. To conduct parameter sensitivity experiments, three cross-domain datasets were selected, with the values of α, β, γ and δ ranging over {1e−3, 5e−3, 1e−2, 5e−2, 0.1, 0.5, 1, 5, 10, 100, 1000}. The experimental outcomes are depicted in Fig. 5 and Fig. 6. Figure 6 indicates that our method is robust to the parameters γ and δ. However, for the parameters α and β, the optimal values differ among the image databases, as can be seen in Fig. 5.


Fig. 5. The results of parameter sensitivity experiments.


Fig. 6. The results of parameter sensitivity experiments.

5 Conclusion

In this paper, we have proposed a novel transfer learning framework, namely two-step projection of sparse discrimination between classes for unsupervised domain adaptation (TSPSDC). TSPSDC is trained to develop a domain-invariant classifier within a discriminative feature space. Specifically, TSPSDC utilizes MMD to reduce both the marginal and conditional distribution discrepancies, while simultaneously acquiring local and global discriminative characteristics through manifold regularization and inter-class sparsity constraints, respectively. In addition, TSPSDC uses a two-step projection technique to ensure that sample information is not lost during the transformation. This technique helps to better explore the correct common subspace, thereby strengthening the prediction ability of the classifier. Experiments on several cross-domain image datasets show that our algorithm outperforms current leading algorithms.

References
1. Lu, Y., Luo, X., Wen, J., et al.: Cross-domain structure learning for visual data recognition. Pattern Recogn. 134, 109127 (2023)
2. Huang, L.Q., Liu, Z.G., Dezert, J.: Cross-domain pattern classification with distribution adaptation based on evidence theory. IEEE Trans. Cybern. (2021)
3. Deng, W., Zheng, L., Sun, Y., et al.: Rethinking triplet loss for domain adaptation. IEEE Trans. Circuits Syst. Video Technol. 31(1), 29–37 (2020)
4. Lee, J.H., Lee, G.: Unsupervised domain adaptation based on the predictive uncertainty of models. Neurocomputing 520, 183–193 (2023)


5. Li, Y., Zhang, P., Cheng, L., Peng, Y., Shen, C.: Strict subspace and label-space structure for domain adaptation. In: Douligeris, C., Karagiannis, D., Apostolou, D. (eds.) KSEM 2019. LNCS (LNAI), vol. 11775, pp. 301–313. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-29551-6_26
6. Xia, H., Jing, T., Ding, Z.: Maximum structural generation discrepancy for unsupervised domain adaptation. IEEE Trans. Pattern Anal. Mach. Intell. (2022)
7. Zhang, J., Li, W., Ogunbona, P.: Joint geometrical and statistical alignment for visual domain adaptation. In: Proceedings of IEEE Conference on CVPR, pp. 1859–1867 (2017)
8. Long, M., Wang, J., Ding, G., Sun, J., Yu, P.S.: Transfer joint matching for unsupervised domain adaptation. In: Proceedings of IEEE Conference on CVPR, pp. 1410–1417 (2014)
9. Li, S., Song, S., Huang, G., Ding, Z., Wu, C.: Domain invariant and class discriminative feature learning for visual domain adaptation (2018)
10. Zhang, W., Wu, D.: Discriminative joint probability maximum mean discrepancy (DJP-MMD) for domain adaptation (2020)
11. Gretton, A., Borgwardt, K.M., Rasch, M.J., et al.: A kernel two-sample test. J. Mach. Learn. Res. 13(1), 723–773 (2012)
12. Pan, S.J., Tsang, I.W., Kwok, J.T., et al.: Domain adaptation via transfer component analysis. IEEE Trans. Neural Netw. 22(2), 199–210 (2010)
13. Long, M., Wang, J., Ding, G., et al.: Transfer feature learning with joint distribution adaptation. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 2200–2207 (2013)
14. Long, M., Wang, J., Ding, G., et al.: Adaptation regularization: a general framework for transfer learning. IEEE Trans. Knowl. Data Eng. 26(5), 1076–1089 (2013)
15. Wang, J., Feng, W., Chen, Y., et al.: Visual domain adaptation with manifold embedded distribution alignment. In: Proceedings of the 26th ACM International Conference on Multimedia, pp. 402–410 (2018)
16. Gong, B., Shi, Y., Sha, F., et al.: Geodesic flow kernel for unsupervised domain adaptation. In: 2012 IEEE Conference on Computer Vision and Pattern Recognition, pp. 2066–2073. IEEE (2012)
17. Tahmoresnezhad, J., Hashemi, S.: Visual domain adaptation via transfer feature learning. Knowl. Inf. Syst. 50, 585–605 (2017)
18. Xiao, T., Liu, P., Zhao, W., et al.: Structure preservation and distribution alignment in discriminative transfer subspace learning. Neurocomputing 337, 218–234 (2019)
19. Belkin, M., Niyogi, P., Sindhwani, V.: Manifold regularization: a geometric framework for learning from labeled and unlabeled examples. J. Mach. Learn. Res. 7(11) (2006)
20. Wen, J., Xu, Y., Li, Z., et al.: Inter-class sparsity based discriminative least square regression. Neural Netw. 102, 36–47 (2018)
21. Yang, J., Yin, W., Zhang, Y., et al.: A fast algorithm for edge-preserving variational multichannel image restoration. SIAM J. Imag. Sci. 2(2), 569–592 (2009)
22. Liu, G., Lin, Z., Yan, S., et al.: Robust recovery of subspace structures by low-rank representation. IEEE Trans. Pattern Anal. Mach. Intell. 35(1), 171–184 (2012)
23. Yang, L., Zhong, P.: Robust adaptation regularization based on within-class scatter for domain adaptation. Neural Netw. 124, 60–74 (2020)
24. Yang, L., Lu, B., Zhou, Q., et al.: Unsupervised domain adaptation via re-weighted transfer subspace learning with inter-class sparsity. Knowl.-Based Syst. 110277 (2023)

Enhancing Adversarial Robustness via Stochastic Robust Framework Zhenjiang Sun, Yuanbo Li, and Cong Hu(B) School of Artificial Intelligence and Computer Science, Jiangnan University, Wuxi, China [email protected]

Abstract. Although deep neural networks (DNNs) have attained remarkable success in image classification, their vulnerability to adversarial attacks poses significant security risks to their reliability. The design of robust modules for adversarial defense often focuses excessively on individual layers of the model architecture, overlooking the important facilitation between modules. To address this issue, this paper proposes a novel stochastic robust framework that employs a random Local Winner-Take-All module and a random Normalization Aggregation module (RLNA). RLNA designs a random competitive selection mechanism to filter out the outputs with high confidence for classification. This filtering process improves the model's robustness against adversarial attacks. Moreover, we employ a novel balance strategy in adversarial training (AT) to optimize the trade-off between robust accuracy and natural accuracy. Empirical evidence demonstrates that RLNA achieves state-of-the-art robust accuracy against powerful adversarial attacks on two benchmark datasets, CIFAR-10 and CIFAR-100. Compared to a method that focuses on individual network layers, RLNA achieves a remarkable 24.78% improvement in robust accuracy on CIFAR-10.

Keywords: Adversarial robustness · Adversarial training · Local winner take all

1 Introduction

Although DNNs have achieved remarkable success in the field of computer vision [18,20,26,29], they still raise serious security concerns due to their vulnerability to adversarial examples [2,15,16,27]. Adversarial examples are images that are misclassified after an elaborate, imperceptible adversarial perturbation has been added to a natural image [33]. Much endeavor has been devoted to enhancing the adversarial robustness of DNNs [11].

This work was supported in part by the National Natural Science Foundation of China (Grant No. 62006097, U1836218), in part by the Natural Science Foundation of Jiangsu Province (Grant No. BK20200593) and in part by the China Postdoctoral Science Foundation (Grant No. 2021M701456).


Among the various methods explored, AT [12,22] stands out as one of the most effective approaches; it incorporates powerful adversarial examples into the training process and encourages the target model to classify them correctly. AT significantly improves the robustness of the target model, but this enhancement is often accompanied by a trade-off: reduced natural classification accuracy and increased training time. Additionally, recent research has highlighted the crucial role of architecture and module design in enhancing robustness [13]. Random Normalization Aggregation (RNA) [7] introduces randomness by modifying the normalization layer to enhance robustness. Nevertheless, excessive randomness in the model prolongs training time and induces training instability. Moreover, RNA's excessive emphasis on the normalization layer tends to neglect the importance of the activation layer. Consequently, RNA fails to strike an optimal balance between natural classification accuracy and robust accuracy, and it remains a pressing challenge to achieve an effective trade-off between these two factors [19,32].

In this paper, we are committed to studying the improvement of robustness obtained by introducing different robust components into the target model. To enhance adversarial robustness, we propose a novel stochastic robust framework named RLNA to defend against adversarial attacks. Randomness is introduced into the model structure through the RNA module and the local winner-take-all (LWTA) [25] module. This dual stochasticity gives the target model a more diverse structure, so it is more difficult for an attacker to obtain valid information about the model. RLNA designs a random competitive selection mechanism to filter out the outputs with high confidence for classification. To balance natural classification accuracy and robust accuracy, the loss function is divided into a natural loss and an adversarial loss during AT. Therefore, RLNA significantly improves the robustness of the target model while achieving an improved trade-off between natural and robust accuracy. We demonstrate that RLNA outperforms the state of the art remarkably on several benchmark datasets. Our contributions are as follows:

– We propose a stochastic robust framework named RLNA to demonstrate the effects of different basic components on robustness.
– RLNA divides the loss function into a natural loss and a robust loss during AT.
– Extensive experiments demonstrate the superiority of RLNA on different benchmark datasets and networks, obtaining state-of-the-art robust results.

2 Related Work

2.1 Adversarial Attack

Although DNNs have achieved great success in the field of deep learning, they are extremely vulnerable to adversarial attacks. Consequently, researchers have proposed a series of attack methods [28]. The Fast Gradient Sign Method (FGSM) [12] generates adversarial examples based on the gradient of the model, moving along the direction of gradient ascent. Madry et al. propose a multi-step version of FGSM called projected gradient descent (PGD) [22] in order to solve the nonlinear inner maximization problem. Moosavi-Dezfooli et al. propose Deepfool [23], a simple and effective attack method based on hyperplane classification. In addition, Carlini and Wagner propose C&W [3], an optimization-based, powerful attack method for evaluating the robustness of deep neural networks. Dong et al. propose a class of momentum-based iterative algorithms, called MIFGSM [9], to enhance adversarial attacks. Croce et al. combine two new versions of the PGD attack (APGD-CE, APGD-DLR) with the FAB [4] and Square [1] attacks to propose the powerful AutoAttack [5].

2.2 Adversarial Defense

In the past few years, researchers have devoted significant effort to enhancing the robustness of target models [6,7].

Adversarial Training. AT adds powerful adversarial examples to the model training process and encourages the model to classify them correctly [22]. Zhang et al. propose to divide the loss function into a natural loss and a robust loss during AT and to apply KL regularization to the latter, thus achieving a good trade-off between natural and robust accuracy [32]. Li et al. identify a new type of example, called the collaborative example, and add it to AT in a method called squeeze training [19].

Stochastic Network. Recently, increased attention has been given to investigating the impact of the basic components of DNNs on adversarial robustness, including activation functions [30], operations [8], and neural structures [21]. Dong et al. investigate the impact of different normalization methods on robustness and propose to aggregate different normalization methods into the RNA [7] module. Panousis et al. introduce randomness into the LWTA module to improve adversarial robustness [24].

3 Methodology

Inspired by the LWTA mechanism, we propose a novel stochastic robust framework. Section 3.1 provides a comprehensive description of our model. In Sect. 3.2, we elaborate on our proposed method, highlighting its unique characteristics and advantages, and introduce a novel balance loss function.

3.1 Model Definition

Let us assume that an input image x_i and its label y_i come from S = {(x_i, y_i)}_{i=1}^{n}, where x_i ∈ X and y_i ∈ Y = {0, ..., C − 1}. The target model is a convolutional network designed for image classification, denoted as ŷ = f_w(x_i), where ŷ is the predicted label of the target model and w represents the model parameters. For simplicity, we refer to f_w(x) as f(·) in the following. The function f(·) consists


of K hidden units and a weight matrix W ∈ R^{I×K}. Each hidden unit k within a layer first performs an inner product operation:

$$h_k = w_k^T x = \sum_{i=1}^{I} w_{ik}\, x_i \quad (1)$$

where h_k ∈ H, and H is the distribution of the inner products of the neurons. The output of the current layer is formed by the LWTA module σ(·); subsequently, a normalization method g ∈ G is randomly selected from the RNA module to process the data, where G is the aggregation of different normalization methods. As a result, the output vector of the final fully connected layer is obtained by concatenating the processed outputs of the hidden units, such that y = [y_1, ..., y_K], where y_k = g(σ(h_k)). The entire classification task can be described as the target network f_w(x) ∈ F optimizing the loss function L for the input X and the label Y. The optimization objective can be expressed as:

$$f(\cdot) = \underset{f\in\mathcal{F}}{\arg\min}\; \mathbb{E}_{x,y\sim D}\,[\mathcal{L}(f(x), y)] \quad (2)$$

where D denotes the data distribution and L represents the cross-entropy loss, which can be described as:

$$\mathrm{CrossEntropy} = \mathbb{E}_{x\sim X}\,[-\log Q(x)] \quad (3)$$

3.2 Method

Our method advocates the use of an LWTA layer instead of the conventional nonlinear activation layer. RLNA designs a random competitive selection mechanism that filters out the outputs with high confidence for classification, resulting in a sparse output. RLNA aims to enhance the model's performance by striking a better balance between natural and robust accuracy. Each LWTA layer contains U linear competing units, and each linear competing unit contains two neural units. The output of each linear competing unit is processed with one-hot coding after passing through a Softmax layer: the output of the winner is 1 and is passed to the next layer, while the rest of the outputs are 0. Obviously, this leads to a sparse output, since each block has only one winner. From the perspective of the baseline, the possibility of two attacks visiting the same path is negligible if the random space is huge. Therefore, leveraging the basic components to generate a larger random space can be an effective defense against adversarial attacks. Inspired by the baseline, we introduce randomness into the input of the linear competing units, sampling a fraction of the linear units each time the input passes through the activation layer. As the network deepens, this sampling creates a random space whose size increases exponentially. This random space makes the structure of the target model more diverse and makes it challenging for existing attack methods to extract meaningful information from the target model. We randomly sample a path p_a and perform a white-box attack to obtain the adversarial example X_a.
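A minimal PyTorch sketch of an LWTA block as described above is given here: units compete in groups of two and, following the text, only the winner's position is emitted as a one-hot output. The module name and the hard argmax (which in practice would typically need a straight-through or Gumbel-softmax relaxation for training) are illustrative choices, not the authors' code.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class LWTA(nn.Module):
    """Local winner-take-all activation: units compete in small groups."""
    def __init__(self, group_size: int = 2):
        super().__init__()
        self.group_size = group_size

    def forward(self, x):                                  # x: (batch, channels)
        b, c = x.shape
        g = x.view(b, c // self.group_size, self.group_size)
        probs = F.softmax(g, dim=-1)                       # competition within each group
        winners = F.one_hot(probs.argmax(dim=-1), self.group_size).to(x.dtype)
        # The winner emits 1 and the losers emit 0, giving a sparse output;
        # a common variant instead passes g * winners (the winner's activation).
        return winners.view(b, c)

# Usage: swap the nonlinearity of an MLP block for the LWTA activation.
layer = nn.Sequential(nn.Linear(128, 64), LWTA(group_size=2), nn.Linear(64, 10))
out = layer(torch.randn(4, 128))
```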


Fig. 1. The illustration of RLNA. The random framework consists of the random local winner take all module and the random normalization aggregation module (RLNA). The LWTA module uses One-Hot coding to process logits in order to obtain a unique winning unit, and the RNA module randomly selects one of the multiple normalization methods from a huge random space to process the data.

During the AT process, X_a is fed to another randomly sampled path p_b, forming a black-box adversarial training scheme. In addition, the sparse output produced by the LWTA layer significantly reduces the amount of data the network needs to process, which significantly improves the training speed.

Novel Formulation of Adversarial Training. Given a target network and inputs x, y, the adversarial example is defined as x̃ = x + δ. The classification loss is maximized so that the network misclassifies the input:

$$\tilde{x} = \underset{\|\tilde{x}-x\|_p \le \epsilon}{\arg\max}\; \mathcal{L}(f(\tilde{x}), y) \quad (4)$$

where δ is the carefully crafted adversarial perturbation, constrained by its l_p-norm, and ε is an upper bound on the perturbation size. The AT process can be described as a min-max optimization problem [22], using the labels of the adversarial and natural examples to calculate the loss. The objective function is defined as follows:

$$\min_w \mathbb{E}_{(x,y)\sim D}\Big[\max_{\delta\in\Delta} \mathcal{L}(f_w(x+\delta), y)\Big] \quad (5)$$

where Δ denotes the perturbation set, x denotes the natural image, δ denotes the adversarial perturbation, x̃ denotes the adversarial example, f_w(·) denotes the target network, and L(f_w(x), y) denotes the loss function of the target network. The difference in our formulation is that we introduce the information of natural


Algorithm 1. RLNA
Input: A set of benign images x and labels y, the DNN model f(·), the number of training iterations M, learning rate η, the number of inner optimization steps T, maximum perturbation ε, step factor α, path set P
Output: A robust classifier f(·) parameterized by w
1: Initialization: random initialization for f(·)
2: for m = 0, ..., M − 1 do
3:   Given a batch of training data {(x_i, y_i)}_{i=1}^{k}
4:   for k = 1, ..., K − 1 do
5:     x̃_i ← x_i + 0.001 · N(0, I)
6:     Randomly sample a path p_a from P
7:     for t = 0, ..., T do
8:       g_in = max_{x̃∈B(x,ε)} CE(f_w(x̃_i), y_i)
9:       x̃_i ← Π_{x̃∈B(x,ε)}(x̃_i − α · sign(∇_{x̃_i} g_in))
10:     end for
11:     Randomly sample a path p_b from P
12:     L_natural = min_{x∈X} L(f_w(x_i), y_i)
13:     L_adv = max_{x̃∈B(x,ε)} L(f_w(x̃_i), y_i)
14:     L = min_w E_{(x,y)∼D} {β · L_natural + L_adv}
15:   end for
16: end for

examples in the training process. Our novel formulation for AT can be defined as:

$$\min_w \mathbb{E}_{(x,y)\sim D}\Big[\beta\cdot \mathcal{L}(f(x), y) + \max_{\tilde{x}\in B(x,\epsilon)} \mathcal{L}(f_w(\tilde{x}), y)\Big] \quad (6)$$

where B(x, ε) is the region of the data distribution used for generating the adversarial examples and β is an important hyperparameter. Compared with vanilla AT, the most distinct difference is that the loss term is divided into a natural loss and a robust loss, and the hyperparameter β adjusts the weight of the natural examples in AT, so that the model pays more attention to them. As shown in Fig. 2, we achieve higher robustness with fewer training rounds compared to the baseline approach, which demonstrates the efficiency of our training method. Empirical evidence shows that reinforcing the importance of natural examples during AT greatly improves the performance of the model. Enhancing the weights of the natural examples corrects some of the misconceptions the model forms about natural examples during AT. Experimental results show that this achieves higher natural accuracy and performs better in the trade-off between natural and robust accuracy.
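The following is a sketch of the balanced objective of Eq. (6), assuming a standard PGD-style inner maximisation and omitting the random-path sampling of Algorithm 1; the function names and default values mirror the settings reported in the paper but are otherwise illustrative, not the authors' implementation.

```python
import torch
import torch.nn.functional as F

def pgd_attack(model, x, y, eps=8/255, alpha=2/255, steps=10):
    x_adv = x + 0.001 * torch.randn_like(x)
    for _ in range(steps):
        x_adv = x_adv.detach().requires_grad_(True)
        loss = F.cross_entropy(model(x_adv), y)
        grad = torch.autograd.grad(loss, x_adv)[0]
        x_adv = x_adv + alpha * grad.sign()              # ascend the classification loss
        x_adv = x + torch.clamp(x_adv - x, -eps, eps)    # project back into B(x, eps)
    return x_adv.detach()

def balanced_at_loss(model, x, y, beta=4.0):
    x_adv = pgd_attack(model, x, y)
    l_nat = F.cross_entropy(model(x), y)      # natural loss term of Eq. (6)
    l_adv = F.cross_entropy(model(x_adv), y)  # robust loss term of Eq. (6)
    return beta * l_nat + l_adv
```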

4 Experiments

To evaluate our method, we conducted experiments on two benchmark datasets, CIFAR-10 [17] and CIFAR-100 [17], and two models, ResNet18 [14] and WideResNet34-1 [31].

4.1 Adversarial Defenses Under White-Box Attacks

Competitive Models. To verify the impact of our method on adversarial robustness, we compare it with several state-of-the-art AT methods and empirically demonstrate its effectiveness. RLNA is compared with the following baselines: (1) PGD-AT [22], (2) TRADES [32], (3) RNA [7], and (4) ST [19]. In the AT phase, the same attack strategy with the same number of iterations is used.

Evaluation. We choose several attack methods to evaluate our model, including FGSM [12], PGD [22], C&W [3], MIFGSM [10], and AutoAttack (AA) [5], where AA includes APGD-CE [5], APGD-DLR [5], FAB [4] and Square [1]. The maximum perturbation strength ε is set to 8/255 and the number of attack iteration steps to 10; the rest is consistent with the default AT settings. The main evaluation criteria are natural accuracy and robust accuracy.

Implementation Details. We train the models WideResNet34-1 [31] and ResNet18 [14] on two datasets, CIFAR-10 [17] and CIFAR-100 [17]. These datasets contain 50K training images and 10K test images from 10/100 categories, each of size 32×32. For the target model, we use the SGD optimizer with momentum 0.9, an initial learning rate of 0.1 with a piecewise-decay learning rate scheduler, and weight decay 5 × 10−4. In the AT setting, the attack strategy is PGD-10, where ε is set to 8/255 and the step size to 2/255. The batch size of all comparison methods is set to 128. The experiments are performed on one RTX3090 GPU.

Our results are shown in Tables 1 and 2. We implemented the methods PGD-AT, TRADES, RNA and ST under the same experimental setup, as they are also based on AT to improve model robustness. Our method achieves state-of-the-art results on both datasets. For example, under the PGD-20 attack on the adversarially trained ResNet18 model, our method surpasses the baseline by 24.78% on the CIFAR-10 dataset and by 24.17% on the CIFAR-100 dataset.

CIFAR10 Setup. The number of output classes of the fully connected layer is set to 10, the maximum perturbation strength ε to 8/255, the number of attack iteration steps to 10, and the step size α to 2/255. Randomtype is set to Batch Normalization and gntype to Group Normalization for the normalization layer. To evaluate robust accuracy, a PGD (white-box) attack with 10 iterations and step size 0.003 is applied.


Table 1. Test robustness (%) on the CIFAR-10 dataset using ResNet18 and WideResNet34. Numbers in bold indicate the best.

Model          Method       Natural  FGSM   PGD-20  C&W    MIFGSM  AutoAttack
ResNet18       PGD-AT       85.39    54.89  45.50   49.72  51.31   42.45
               TRADES       82.49    57.96  56.10   78.84  55.68   53.40
               RNA          84.58    63.70  59.27   83.93  60.88   64.54
               ST           83.10    59.51  54.62   51.43  57.67   50.50
               RLNA (ours)  85.32    84.41  84.05   84.31  84.09   84.48
WideResNet34   PGD-AT       84.04    50.51  41.55   41.37  47.23   53.97
               TRADES       84.92    60.86  55.33   81.82  58.85   52.50
               RNA          85.36    66.02  61.87   85.80  62.62   67.88
               ST           85.08    62.30  56.94   55.01  60.12   53.71
               RLNA (ours)  86.68    85.47  85.83   86.03  85.87   86.19

Table 2. Test robustness (%) on the CIFAR-100 dataset using ResNet18 and WideResNet34. Numbers in bold indicate the best.

Model          Method       Natural  FGSM   PGD-20  C&W    MIFGSM  AutoAttack
ResNet18       PGD-AT       60.90    26.85  21.41   54.08  24.78   19.76
               TRADES       53.82    30.18  27.38   48.62  28.88   23.48
               RNA          56.79    36.76  35.55   56.86  34.00   42.12
               ST           58.44    33.35  30.53   26.70  31.80   25.61
               RLNA (ours)  60.96    58.94  59.72   60.40  59.52   59.13
WideResNet34   PGD-AT       56.85    24.84  19.64   50.91  22.87   17.00
               TRADES       57.95    33.33  30.68   52.93  32.02   27.25
               RNA          60.57    37.87  36.04   60.21  35.58   42.43
               ST           57.71    31.88  27.80   27.86  32.77   26.95
               RLNA (ours)  63.35    61.70  62.41   63.35  62.30   62.62

4.2 Hyperparameter Selection

The parameter β is an important hyperparameter in our method. We demonstrate the sensitivity of our robust classifier to this hyperparameter through extensive numerical experiments on CIFAR-10 and CIFAR-100. For both datasets, the cross-entropy loss in Eq. (6) is minimized. As shown in Table 3, we observe that as β increases, the natural accuracy improves significantly, followed by the robust accuracy, which does not drop significantly. It is worth noting that the difference between natural and robust accuracy is smaller on CIFAR-10 than on CIFAR-100, probably because the classification task in CIFAR-10 is simpler. Empirically, when β is selected in [0, 4], our method trains models with both high accuracy and high robustness. In the following experiments, β is set to 4.

4.3 Ablation Study

In this section, we illustrate the effectiveness of our method from two aspects: the novel training loss function and the novel network structure.

Fig. 2. Extensive experiments conducted on the value of the hyperparameter β based on Eq. (6).

Novel Adversarial Training Loss. To keep the model's attention from focusing excessively on the adversarial examples during AT, we divide the loss function into a natural term and a robust term. As shown in Fig. 2, after adding our method the natural accuracy is significantly improved while the robust accuracy does not drop significantly. Omitting the natural loss term in the AT phase and using only the adversarial loss leads to a severe reduction in natural accuracy, which fundamentally limits the upper bound of robustness. This experiment demonstrates that adding information from natural examples during AT helps the model learn more useful information. The natural accuracy is also improved to some extent by controlling the weights of the natural loss terms.

Novel Structure. We advocate introducing the competition mechanism of LWTA into the baseline, expecting the model to select more expressive neurons through competition. As shown in Fig. 2, introducing LWTA on top of the baseline makes the model focus more on the adversarial examples during AT, which causes the model to excessively treat adversarial examples as if they were natural examples during training. This causes a label-confusion problem, making the model regard the label of the adversarial example as the correct label, resulting in a lower natural


Table 3. Sensitivity of hyperparameter β based on ResNet18 on the CIFAR-10 dataset. Numbers in bold indicate the best.

β    Natural  FGSM   PGD-20
0.1  73.81    72.48  72.6
0.2  74.45    73.59  73.5
0.4  77.04    76.16  76.75
0.6  79.15    77.56  78.21
0.8  79.59    78.76  79.52
1.0  80.54    79.58  79.91
2.0  82.92    82.00  82.4
3.0  84.01    82.93  83.35
4.0  85.32    83.5   84.05

Table 4. Robust accuracy (%) obtained for different structural components on the CIFAR-10 dataset using a white-box attack approach.

Method       Natural  FGSM   PGD-20
RNA          84.58    63.7   59.27
Ours (Bibn)  86.02    71.85  73.58
Ours         85.32    84.41  84.05

accuracy of the final trained model. To solve this problem, information from the natural examples is added and its weight is increased. As shown in Table 4, the weight update of the normalization layer is a key factor for improving model robustness in AT. It is empirically demonstrated that using a single normalization layer to generate adversarial examples and training to update its parameters produces good robustness in AT. Conversely, an additional auxiliary normalization layer can be introduced into adversarial training to generate adversarial examples and used only during the attack (Bibn). This operation reduces robust accuracy to some extent but improves natural accuracy, which means that introducing an additional normalization module can raise the upper limit of the robust model's image recognition performance; the experiments show that this approach is widely applicable to existing models.

5 Conclusion

In this paper, we investigate the issue of defending against imperceptibly small adversarial perturbations in the context of computer vision. Our research focuses primarily on the interplay among various robust components within the model. Specifically, the combination of competing activations and random normalization layers produces a large number of different paths, which essentially prevents attackers from successfully attacking the model and thus improves its robustness. A balance loss is minimized to establish an exact balance between robustness and accuracy. Extensive experiments on two benchmark datasets demonstrate that RLNA achieves higher robust accuracy than state-of-the-art adversarial defense methods. In the future, we will continue to explore the complex relationships between the robust components of the model and hope that this work can provide a new research direction for adversarial defense.

References 1. Andriushchenko, M., Croce, F., Flammarion, N., Hein, M.: Square attack: a queryefficient black-box adversarial attack via random search. In: Vedaldi, A., Bischof, H., Brox, T., Frahm, J.-M. (eds.) ECCV 2020. LNCS, vol. 12368, pp. 484–501. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-58592-1_29 2. Cai, Z., Song, C., Krishnamurthy, S., Roy-Chowdhury, A., Asif, S.: Blackbox attacks via surrogate ensemble search. In: NeurIPS (2022) 3. Carlini, N., Wagner, D.A.: Towards evaluating the robustness of neural networks. In: IEEE Symposium on Security and Privacy, pp. 39–57. IEEE Computer Society (2017) 4. Croce, F., Hein, M.: Minimally distorted adversarial examples with a fast adaptive boundary attack. In: ICML. vol. 119. Proceedings of Machine Learning Research, pp. 2196–2205. PMLR (2020) 5. Croce, F., Hein, M.: Reliable evaluation of adversarial robustness with an ensemble of diverse parameter-free attacks. In: ICML, vol. 119. Proceedings of Machine Learning Research, pp. 2206–2216. PMLR (2020) 6. Dhillon, G.S.: Stochastic activation pruning for robust adversarial defense. In: ICLR (Poster). OpenReview.net (2018) 7. Dong, M., Chen, X., Wang, Y., Xu, C.: Random normalization aggregation for adversarial defense. In: Oh, A.H., Agarwal, A., Belgrave, D., Cho, K., (eds.) Advances in Neural Information Processing Systems (2022) 8. Dong, M., Wang, Y., Chen, X., Xu, C.: Towards stable and robust addernets. In: NeurIPS, pp. 13255–13265 (2021) 9. Dong, Y., et al.: Boosting adversarial attacks with momentum. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 9185–9193 (2018) 10. Dong, Y.: Boosting adversarial attacks with momentum. In: CVPR, pp. 9185–9193. Computer Vision Foundation. IEEE Computer Society (2018) 11. Duan, Y., Jiwen, L., Zheng, W., Zhou, J.: Deep adversarial metric learning. IEEE Trans. Image Process. 29, 2037–2051 (2020) 12. Goodfellow, I., Shlens, J., Szegedy, C.: Explaining and harnessing adversarial examples. In: International Conference on Learning Representations (ICLR) (2015) 13. Guo, M., Yang, Y., Xu, R., Liu, Z., Lin, D.: When nas meets robustness: In search of robust architectures against adversarial attacks. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (June 2020) 14. He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: CVPR, pp. 770–778. IEEE Computer Society (2016) 15. Cong, H., Xiao-Jun, W., Li, Z.-Y.: Generating adversarial examples with elastic-net regularized boundary equilibrium generative adversarial network. Pattern Recognit. Lett. 140, 281–287 (2020)


16. Cong, H., Hao-Qi, X., Xiao-Jun, W.: Substitute meta-learning for black-box adversarial attack. IEEE Signal Process. Lett. 29, 2472–2476 (2022) 17. Krizhevsky, A.: Learning multiple layers of features from tiny images (2009) 18. Li, H., Xiao-Jun, W., Kittler, J.: Mdlatlrr: a novel decomposition method for infrared and visible image fusion. IEEE Trans. Image Process. 29, 4733–4746 (2020) 19. Li, Q., Guo, Y., Zuo, W., Chen, H.: Squeeze training for adversarial robustness (2023) 20. Li, X., Wang, W., Hu, X., Yang, J.: Selective kernel networks. In CVPR, pp. 510– 519. Computer Vision Foundation/IEEE (2019) 21. Li, Y., Yang, Z., Wang, Y., Xu, C.: Neural architecture dilation for adversarial robustness. In: NeurIPS, pp. 29578–29589 (2021) 22. Madry, A., Makelov, A., Schmidt, L., Tsipras, D., Vladu, A.: Towards deep learning models resistant to adversarial attacks. In: ICLR (Poster). OpenReview.net (2018) 23. Moosavi-Dezfooli, S.-M., Fawzi, A., Frossard, P.: Deepfool: a simple and accurate method to fool deep neural networks. In: CVPR, pp. 2574–2582. IEEE Computer Society (2016) 24. Panousis, K.P., Chatzis, S., Theodoridis, S.: Stochastic local winner-takes-all networks enable profound adversarial robustness. CoRR, abs/ arXiv: 2112.02671 (2021) 25. Srivastava, R.-K., Masci, J., Gomez, F.J., Schmidhuber, J.: Understanding locally competitive networks. In: ICLR (Poster) (2015) 26. Tong, J., Chen, T., Wang, Q., Yao, Y.: Few-Shot Object Detection via Understanding Convolution and Attention. In: Yu, S., et al. (eds.) Pattern Recognition and Computer Vision. PRCV 2022. LNCS, vol 13534. Springer, Cham (2022). https:// doi.org/10.1007/978-3-031-18907-4_52 27. Vakhshiteh, F., Nickabadi, A., Ramachandra, R.: Adversarial attacks against face recognition: a comprehensive study. IEEE Access 9, 92735–92756 (2021) 28. Wang, G., Yan, H., Wei, X.: Enhancing Transferability of Adversarial Examples with Spatial Momentum. In: Yu, S., et al. (eds.) Pattern Recognition and Computer Vision. PRCV 2022. LNCS, vol. 13534. Springer, Cham (2022). https://doi.org/ 10.1007/978-3-031-18907-4_46 29. Wang, M., Deng, W.: Deep face recognition: a survey. Neurocomputing 429, 215– 244 (2021) 30. Xie, C., Tan, M., Gong, B., Yuille, A.L., Le, Q.V.: Smooth adversarial training. CoRR, abs/ arXiv: 2006.14536 (2020) 31. Zagoruyko, S., Komodakis, N.: Wide residual networks. In: BMVC. BMVA Press (2016) 32. Zhang, H., Yu, Y., Jiao, J., Xing, E.P., Ghaoui, L.E., Jordan, M.I.: Theoretically principled trade-off between robustness and accuracy. In: ICML, vol. 97. Proceedings of Machine Learning Research, pp. 7472–7482. PMLR (2019) 33. Zhang, W., Gou, Y., Jiang, Y., Zhang, Y.: Adversarial VAE with Normalizing Flows for Multi-Dimensional Classification. In: Yu, S., et al. (eds.) Pattern Recognition and Computer Vision. PRCV 2022. LNCS, vol 13534. Springer, Cham (2022). https://doi.org/10.1007/978-3-031-18907-4_16

Pseudo Labels Refinement with Stable Cluster Reconstruction for Unsupervised Re-identification Zhenyu Liu1,2 , Jiawei Lian1,2 , Jiahua Wu1,2(B) , Da-Han Wang1,2 , Yun Wu1,2 , Shunzhi Zhu1,2 , and Dewu Ge3 1

School of Computer and Information Engineering, Xiamen University of Technology, Xiamen 361024, China 2 Fujian Key Laboratory of Pattern Recognition and Image Understanding, Xiamen 361024, China {liuzy,lianjw}@s.xmut.edu.cn, [email protected], {wangdh,ywu,szzhu}@s.xmut.edu.cn 3 Xiamen KEYTOP Communication Technology Co., Xiamen 361024, China Abstract. Most existing unsupervised re-identification methods use a clustering-based approach to generate pseudo-labels as supervised signals, allowing deep neural networks to learn discriminative representations without annotations. However, drawbacks of the clustering algorithms and the absence of discriminative ability early in training seriously limit further performance gains. A severe problem arises from path dependency, wherein noisy samples rarely have a chance to escape from their assigned clusters during iterative training. To tackle this challenge, we propose a novel label refinement strategy based on stable cluster reconstruction. Our approach contains two modules, the stable cluster reconstruction (SCR) module and the similarity recalculation (SR) module. It reconstructs more stable clusters and re-evaluates the relationship between samples and clearer cluster representatives, providing complementary information for pseudo-labels at the instance level. Our proposed approach effectively improves unsupervised re-ID performance, achieving state-of-the-art results on four benchmark datasets. Specifically, our method achieves 46.0% and 39.1% mAP on the challenging VeRi776 and MSMT17 datasets.

Keywords: Re-identification · Unsupervised learning · Label refinement

1 Introduction

Re-identification aims to train a neural network to select similar samples from many candidate samples with different camera angles and backgrounds. In recent years, supervised methods [1–3] have shown remarkable accuracy and efficiency as deep learning algorithms have improved. However, expensive annotation hinders the practical application of deep learning techniques, since real-world applications typically require large amounts of high-quality labeled data.


Therefore, more deep-learning researchers have become interested in better unsupervised methods. Traditional unsupervised re-identification methods [4–6] use clustering algorithms (e.g., DBSCAN [7]) to generate labels. These pseudo-labels completely replace expensive manual labeling but leave some non-negligible drawbacks. Samples assigned to the same cluster may carry inherent label noise, resulting in stagnation of network performance improvement. For example, as shown in Fig. 1(a), samples from the same cluster are drawn to the same center regardless of their original class. Even after subsequent generations of clustering, these noisy samples remain as close to the cluster center as the correctly clustered samples. The deep neural network pulls the samples together according to the labels, which makes it harder to distinguish different samples. In recent research, a contrastive-learning-based approach [8] further integrates the pseudo-label assignment phase with the neural network training phase, achieving satisfactory results on several datasets. It further optimizes the intra-class feature space by linking the label assignment and training phases through a contrast loss, such as the InfoNCE loss [9]. However, this approach does not tackle the noise problem: incorrect labels continue to influence network training from start to finish. Refining pseudo-labels is therefore the key to optimizing unsupervised re-identification algorithms.

Fig. 1. (a) Noise samples in traditional unsupervised training. (b) Contrast learning with refinement label in representation level. (c) stable cluster reconstruction from two information sources

To address this problem, we propose a label refinement strategy called pseudo-label refinement with stable cluster reconstruction (LRCR), which includes a label refinement module and a stable cluster reconstruction module. Our approach reduces the harmful effects of noisy labels and progressively guides the training process to separate noisy labels from correct ones. In the first step, we improve the accuracy of pseudo-labels from the


perspective of cluster representation through the stable cluster reconstruction (SCR) module. Specifically, our approach makes the cluster representation more reliable by considering the consistency between the inference results and the pre-assigned pseudo-labels. We use an Intersection over Union (IoU) criterion to measure the consistency of sample labels generated at different times, where a larger IoU score implies a higher degree of similarity between the two categories. After that, unstable samples are eliminated to create stable clusters, as in Fig. 1(c), reducing the impact of noisy labels on training. In the second stage, refinement labels are reassigned by the proposed similarity recalculation (SR) module based on the similarity between the stable cluster representation and each sample. These refinement labels are used to construct a contrast loss for separating noisy samples, and both kinds of labels guide the network in this process. Our contributions are summarized as follows:

– We propose a novel two-step pseudo-label refinement strategy for unsupervised re-ID. It complements the information at the instance level, allowing for the progressive separation of noisy labels.
– We propose a module that generates refinement labels from a cluster-representation perspective, which adds information to the original cluster-based approach and improves the network's performance.
– We propose a reconstruction module that filters out more stable and trustworthy samples and reconstructs the cluster. It explores the relationships of label information from different times and further enhances the label refinement through the reconstructed clusters.
– We experimented extensively on several re-identification datasets and achieved performance improvements of 3.0%, 1.1%, 7.7%, and 8.7% on the Market1501, Duke, MSMT17, and VeRi datasets, respectively, compared to the baseline model.

2 Related Work

Unsupervised re-identification intends to train a deep neural network without labels to search for the most similar images in a gallery. There are two types of methods, depending on the proportion of unlabeled data. The first category is unsupervised domain adaptation (UDA) [5,6,8,10,11], which does not use labels in the target domain; UDA attempts to transfer the knowledge learned in the source domain to the target domain. The other category, which we focus on, is purely unsupervised learning (USL) for re-ID [4,11–16], where label information is completely excluded from training. In USL, clustering methods are commonly used to generate pseudo-labels, but these labels tend to be noisy. These inaccurate pseudo-labels are the primary challenge in unsupervised learning for re-identification. Contrastive learning in USL re-ID is inspired by self-supervised learning [17,18], which optimizes the network with category prototypes stored in a memory bank. This approach has achieved good performance and has become an important research area for purely unsupervised re-identification. With the help of contrastive learning techniques, SPCL [8] creatively integrates unsupervised re-ID work and creates


a stronger connection between the pseudo-labels and the clusters. Taking this a step further, Cluster-Contrast [19] improves the memory bank and the momentum update method, increasing unsupervised re-identification performance. ICE [14] generates more robust soft labels with an instance-level contrast loss. Label refinement aims to correct pseudo-labels by extracting potential knowledge or differences in the image. To improve domain adaptation performance, MMT [11] builds two collaborative networks and compares the two asynchronous networks to produce more reliable soft labels. RLCC [20] learns possible associations between different classes from temporal ensembling; it refines pseudo-labels with temporally propagated and ensembled pseudo-labels. PPLR [15] complements the original pseudo-labels with potential information from fine-grained local features, exploiting the reliable complementary relationship between global and local features to improve network performance. In contrast to previous work, our work searches for stable samples from a temporal-integration perspective, providing refinement information at the level of category representation. This approach has not been attempted in previous studies.

3 Proposed Approach

We define an unlabelled re-ID dataset X = {x_1, x_2, ..., x_N}, where x_i represents the i-th image in the dataset. The USL re-ID task aims to train a deep neural network f_θ(x) to extract an image feature embedding f = f_θ(x). We expect features of the same ID to be naturally close in the feature space, and features of different IDs to be further apart.

Fig. 2. The proposed framework of label refinement with stable cluster construction (LRCR) Strategy. The upper part of the dark color is based on our LRCR strategy, and the lower part is the baseline based on contrast learning.

Our proposed label refinement strategy consists of two main steps. First, we reconstruct clusters based on stable samples, and then we generate refined labels for the categories from the representative features of the reconstructed clusters. As shown in Fig. 2, after the pseudo-labels have been assigned, we use the stable


cluster reconstruction (SCR) module to reconstruct another cluster representation. Then, the similarity recalculation (SR) module generates refinement labels based on these clustering representatives as supplementary supervisory signals.

3.1 Revisit USL Re-ID Model Based on Contrast Learning

As shown in the lower part of Fig. 2, the contrast-based approach recycles the features $f$ and computes representative information from these features. During training, the query feature is compared with all the representative features to obtain similarity scores, and the contrast-based approach maximizes this similarity so that the network learns the most appropriate feature representation. Compared to the traditional method, the contrast learning approach makes the training process closer to the query process. There are, however, potential risks associated with cluster-based methods. Noisy pseudo-labels harm the feature extraction ability, and the representative features in the memory bank become blurred and lose their uniqueness. All of this leads the clustering algorithm to incorrectly perpetuate the noise in subsequent rounds. We argue that providing purer clusters to the network and refining pseudo-labels based on cluster representatives can effectively mitigate these problems.

3.2 Stable Cluster Reconstruction Module

We first propose a clustering reconstruction module based on stable samples. Due to the poor feature extraction capability in the early stages, the clustering algorithm introduces noise, and these noisy samples contaminate the cluster representation. To address this issue, we re-evaluate all the samples before each training step. During the reconstruction of the cluster centers, unstable samples are discarded, allowing the cluster centers to be refined and better represent the underlying clusters.

Formally, after each label assignment process, we define the pseudo-labels of the images in epoch $t$ as $Y_p^t = \{y_1^t, y_2^t, \ldots, y_N^t\}$. Additionally, we use the network $f_\theta(x)$ to extract the sample features $f$ before the start of the current training round. These features are multiplied with the clustering representatives from the previous round to obtain the sample query results:

$$\hat{y}_i^{(t-1)} = \max\bigl(M^{(t-1)} \cdot f_i^t\bigr) \tag{1}$$

where $M^{(t-1)}$ is the set of clustering representative features in epoch $t-1$. The $i$-th sample can be considered part of the cluster with the maximum similarity. All query results constitute a query set $\hat{Y}_q^{(t-1)} = \{\hat{y}_1^{(t-1)}, \hat{y}_2^{(t-1)}, \ldots, \hat{y}_N^{(t-1)}\}$. We then determine which clusters are worth refining by calculating the cross-agreement score between the pseudo-labels and the query results:

$$C^t(i, j) = \frac{\bigl|D(\hat{Y}_q^{(t-1)}, i) \cap D(Y_p^t, j)\bigr|}{\bigl|D(\hat{Y}_q^{(t-1)}, i) \cup D(Y_p^t, j)\bigr|} \tag{2}$$


where $D(\hat{Y}_q^{(t-1)}, i)$ denotes the slice of samples of the $i$-th category in the set $\hat{Y}_q^{(t-1)}$, and $|\cdot|$ counts the number of instances in a set. Two categories with a higher IoU score have a higher correlation, and their intersection remains stable at different times. However, it is not fair to directly assume that all samples in the two categories are related. Since a category in the previous round may overlap with more than one category in the current round, we only reconstruct the categories with the highest scores:

$$Y_{\mathrm{Rec}\,j}^{t} = D(\hat{Y}_q^{(t-1)}, i) \cap D(Y_p^t, j), \qquad \bigl|Y_{\mathrm{Rec}\,j}^{t}\bigr| > m \tag{3}$$

where $Y_{\mathrm{Rec}\,j}^{t}$ is the largest intersection set for the $j$-th cluster in $Y_p^t$, and $m$ is the minimum reconstruction threshold. We obtain the reconstructed stable clusters and deposit them in a new, independent memory bank:

$$M_{\mathrm{stable}\,j}^{t} = \frac{1}{\bigl|D(Y_{\mathrm{Rec}\,j}^{t}, k)\bigr|} \sum_{k} D(Y_{\mathrm{Rec}\,j}^{t}, k) \tag{4}$$

One step further, we can use the reconstructed clusters from the previous round instead of the query results to obtain cleaner and more stable clusters, for example by replacing $\hat{Y}_q^{(t-1)}$ in Eqs. (2) and (3) with $M_{\mathrm{stable}}^{(t-1)}$. Although the reconstructed clusters are more stable and representative, this may also exclude too many samples. Therefore, instead of directly replacing the original memory bank with the new one, we use the reconstructed stable clusters exclusively for the computation of the refinement labels.
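To make the SCR procedure more concrete, the following sketch shows one way the cross-agreement matching of Eq. (2) and the stable centroid averaging of Eq. (4) could be implemented; the function and variable names (e.g. `stable_cluster_reconstruction`, the threshold `m`) are illustrative choices of ours rather than code released with the paper.

```python
import numpy as np

def stable_cluster_reconstruction(prev_labels, cur_labels, features, num_clusters, m=10):
    """Minimal sketch of SCR: for each current cluster, keep only the samples that
    also co-occurred in its best-matching cluster from the previous round, then
    average their features as the stable centroid.
    prev_labels, cur_labels: (N,) integer label arrays from two consecutive rounds.
    features: (N, D) L2-normalized embeddings from the current model.
    m: minimum size of the stable intersection (hypothetical threshold).
    """
    stable_centroids = {}
    prev_ids = np.unique(prev_labels)
    for j in range(num_clusters):
        cur_set = set(np.where(cur_labels == j)[0])
        best_iou, best_inter = 0.0, set()
        for i in prev_ids:
            prev_set = set(np.where(prev_labels == i)[0])
            inter = cur_set & prev_set
            union = cur_set | prev_set
            iou = len(inter) / max(len(union), 1)   # cross-agreement score, Eq. (2)
            if iou > best_iou:
                best_iou, best_inter = iou, inter
        if len(best_inter) > m:                      # keep only sufficiently stable clusters
            idx = np.fromiter(best_inter, dtype=int)
            center = features[idx].mean(axis=0)      # Eq. (4): mean of stable samples
            stable_centroids[j] = center / np.linalg.norm(center)
    return stable_centroids
```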

3.3 Similarity Recalculation Module

The clustering algorithm groups the image features according to a predefined pattern. The potential risk of this approach is that individual samples are only supervised at the cluster level. This challenge has prompted us to reconsider the relationship between individual samples and groups from a different perspective. Here, following Eq. (1), we recalculate the similarity between each individual sample and all representatives of the stable groups:

$$y_i^t = \max\bigl(M_{\mathrm{stable}}^{t} \cdot f_i^t\bigr) \tag{5}$$

We complement the association information between individual samples and the stable cluster representatives, providing the network with more refined supervision. Once the refined labels are produced, the new labels and stable clusters are combined with the original ones for contrast learning.

3.4 Contrast Learning with Refinement of Labels

Prior research [8] typically implemented cluster-level contrast learning using a tailored InfoNCE loss. Samples are pushed either towards or away from the cluster representatives, which can be described by:

$$L_q = -\log \frac{\exp(f \cdot M_{+} / \tau)}{\sum_{i=0}^{N} \exp(f \cdot M_i / \tau)} \tag{6}$$


where $f$ is the extracted feature, $M_+$ is the positive cluster representative feature stored in the memory bank, and $\tau$ is the temperature hyper-parameter. The loss in Eq. (6) provides cluster-level optimization for the network. However, simply providing cluster-level information to the samples cannot separate the noisy samples from the cluster. Therefore, we propose a loss at the level of the cluster representation, based on the reconstructed clusters:

$$L_{\mathrm{refine}} = -\log \frac{\exp(f \cdot M_{\mathrm{stable}+} / \tau)}{\sum_{i=0}^{N} \exp(f \cdot M_{\mathrm{stable}\,i} / \tau)} \tag{7}$$

The refinement labels and stable clusters favor the association information between instances and cluster representatives; ideally, they supply complementary information at the instance level. Refinement labels and stable clustering representatives provide the network with differentiated learning signals. Even when the original labels and the refinement labels are consistent, the stable cluster representatives may still differ, which can lead the network to produce differentiated training results for similar samples. In the opposite condition, this difference is more pronounced: the new labels and the reconstructed clustering representations provide the network with opposing training signals, suppressing potentially noisy labels and thus providing better representation learning. Since our method can deliver signals in opposite directions, a hyper-parameter $\alpha$ is necessary to balance the losses:

$$L_{\mathrm{total}} = (1 - \alpha) L_q + \alpha L_{\mathrm{refine}} \tag{8}$$
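To make the roles of the two memory banks concrete, the following PyTorch-style sketch shows one way the combined objective in Eqs. (6)–(8) could be computed; variable names such as `memory`, `stable_memory`, and `alpha`, and the assumption of L2-normalized features, are our own illustrative choices, not code from the paper.

```python
import torch
import torch.nn.functional as F

def lrcr_loss(feat, label, refine_label, memory, stable_memory, tau=0.05, alpha=0.5):
    """Sketch of Eq. (8): InfoNCE against the original cluster memory (Eq. 6)
    plus InfoNCE against the stable reconstructed memory (Eq. 7).
    feat: (B, D) L2-normalized query features.
    label / refine_label: (B,) cluster indices for the two memories.
    memory / stable_memory: (C, D) and (C_s, D) cluster representative features.
    """
    logits = feat @ memory.t() / tau                 # similarities to all cluster centers
    loss_q = F.cross_entropy(logits, label)          # Eq. (6)

    logits_stable = feat @ stable_memory.t() / tau   # similarities to stable centers
    loss_refine = F.cross_entropy(logits_stable, refine_label)  # Eq. (7)

    return (1 - alpha) * loss_q + alpha * loss_refine            # Eq. (8)
```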

4 Experiments

4.1 Datasets and Evaluation Metrics

Table 1. Statistics of the datasets used in the experimental section.

Dataset     | total images | total IDs | train images | gallery images | query IDs | query images
Market-1501 | 32,668       | 1,501     | 12,936       | 19,732         | 750       | 3,368
DukeMTMC    | 36,441       | 1,110     | 16,522       | 17,661         | 702       | 2,228
MSMT17      | 126,441      | 4,101     | 32,621       | 11,659         | 3,060     | 82,161
VeRi-776    | 51,003       | 776       | 12,936       | 11,579         | 201       | 1,678

Datasets: We evaluated our approach on four widely used re-identification datasets, including three pedestrian datasets and one vehicle dataset. Market-1501 [21], DukeMTMC-reID [22], and MSMT17 [23] are widely used pedestrian re-identification datasets, and VeRi-776 [24] is a vehicle re-ID dataset. Among them, it is more difficult to achieve high performance on MSMT17 than on the other pedestrian datasets due to its diversity of scenes, lighting, and clothing. In addition, the vehicle dataset is equally challenging for unsupervised re-ID due to minor inter-class differences and larger viewpoint differences. Table 1 presents the statistics of the datasets.


Evaluation Metrics: Re-identification performance is evaluated with mean average precision (mAP) and Top-1 accuracy. For a fair comparison, we do not use post-processing techniques such as re-ranking [25] in the query process.

4.2 Implementation Details

We use ResNet-50, pre-trained on ImageNet, as the backbone network. The model settings are based on Cluster-Contrast [19]. We resize all images to 256 × 128. At the label assignment stage, we use DBSCAN to generate pseudo-labels; the maximum distances were set to 0.35, 0.6, 0.67, and 0.6 for the Market-1501, DukeMTMC-reID, MSMT17, and VeRi datasets, respectively. At each round of label assignment, the memory bank is initialized with the average cluster centers and is updated with a momentum of 0.1 after each iteration. The temperature factor τ is 0.05 for all datasets. We use the Adam optimizer with a weight decay of 5 × 10^-4. The initial learning rate is 3.5 × 10^-4 and the basis degree λ0 is 1.0. The learning rate is decayed by a factor of 0.1 every 20 epochs, and training ends at the 50th epoch.
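As a concrete illustration of the label-assignment step described above, the following sketch clusters normalized features with scikit-learn's DBSCAN and builds the initial memory bank; `min_samples` and the helper name are illustrative assumptions, and re-ID pipelines often use a re-ranked Jaccard distance rather than the plain cosine distance shown here.

```python
import numpy as np
from sklearn.cluster import DBSCAN

def assign_pseudo_labels(features, eps=0.35, min_samples=4):
    """Sketch of one label-assignment round: cluster L2-normalized features with
    DBSCAN and initialize one memory-bank entry per cluster with its mean feature.
    eps follows the per-dataset maximum distance quoted above (e.g. 0.35 for
    Market-1501); min_samples is an illustrative choice, not taken from the paper.
    """
    dist = 1.0 - features @ features.T            # cosine distance between normalized features
    np.clip(dist, 0.0, None, out=dist)
    labels = DBSCAN(eps=eps, min_samples=min_samples, metric="precomputed").fit_predict(dist)

    centroids = []
    for c in sorted(set(labels) - {-1}):          # -1 marks DBSCAN outliers
        center = features[labels == c].mean(axis=0)
        centroids.append(center / np.linalg.norm(center))
    return labels, np.stack(centroids)
```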

Fig. 3. (a) The impact of the hyper-parameters α and m on Market-1501. (b) Clustering accuracy of the pseudo-labels obtained by the clustering algorithm on Market-1501.

4.3 Comparison with State-of-the-Art Methods

Table 2 presents a comparison between the proposed method and other state-of-the-art USL re-ID methods. Our method achieves 84.0% mAP on Market-1501 and 73.4% mAP on DukeMTMC-reID. In particular, the ISE method [26] generates support samples from actual samples and their neighbouring clusters in the embedding space; the results on the MSMT dataset indicate that our method has better generalization performance. On the two larger datasets, MSMT17 and VeRi, our method obtains better results, achieving 39.1% and 46.0% mAP, respectively.


Table 2. Comparison of the proposed method to other state-of-the-art models on the Market-1501, DukeMTMC, MSMT17, and VeRi-776 datasets. The bold font represents the best value in each metric.

Model                 | Venue      | Market mAP | Top-1 | DukeMTMC mAP | Top-1 | MSMT mAP | Top-1 | VeRi mAP | Top-1
BUC [13]              | AAAI-19    | 38.3       | 66.2  | 27.5         | 47.4  | –        | –     | –        | –
MMT (UDA) [11]        | ICLR-20    | 71.2       | 87.7  | 63.1         | 76.8  | –        | –     | –        | –
SPCL [8]              | NeurIPS-20 | 73.1       | 88.1  | –            | –     | 19.1     | 42.3  | 36.9     | 79.9
RLCC [20]             | CVPR-21    | 77.7       | 90.8  | 69.2         | 83.2  | 27.9     | 56.5  | 39.6     | 83.4
ICE [14]              | ICCV-21    | 79.5       | 92.0  | 67.2         | 81.3  | 29.8     | 59.0  | –        | –
Cluster-Contrast [19] | ACCV-22    | 82.1       | 92.3  | 72.6         | 84.9  | 27.6     | 56.0  | 40.3     | 84.6
PPLR [15]             | CVPR-22    | 81.5       | 92.8  | –            | –     | 31.4     | 61.1  | 41.6     | 85.6
SECRET (UDA) [10]     | AAAI-22    | 83.0       | 92.9  | 69.2         | 82.0  | 33.0     | 62.0  | –        | –
SECRET (USL)          | AAAI-22    | 81.0       | 92.6  | 63.9         | 77.9  | 31.3     | 60.4  | –        | –
ISE [26]              | CVPR-22    | 84.7       | 94.0  | –            | –     | 35.0     | 64.7  | –        | –
LRCR                  | this paper | 83.8       | 93.2  | 73.4         | 85.5  | 39.1     | 65.2  | 46.0     | 82.9

Fig. 4. T-SNE visualization of 10 random samples on Market-1501 and query results.

The experimental results on the four datasets show a steady improvement of our method compared to the baseline approach. In Fig. 4, our method obtains more correct matches in the query results for the ten random samples. Likewise, the t-SNE result reveals that our method produces more compact clusters than the baseline.

4.4 Ablation Studies

We investigated the effects of our proposed design on the Market-1501 dataset. The experiments include the effects of hyper-parameters and modules. In each experiment, all settings were kept constant except the one mentioned.

Table 3. Performance (%) of the ablation experiments over different modules. * denotes the information used as the source of reconstruction.

Method                    | Market mAP | Top1 | Duke mAP | Top1 | MSMT mAP | Top1 | VeRi mAP | Top1
baseline                  | 79.6       | 91.0 | 65.7     | 79.6 | 27.5     | 56.1 | 36.3     | 78.4
+SR                       | 81.7       | 92.5 | 71.8     | 84.2 | 28.8     | 56.2 | 38.0     | 79.7
+SCR                      | 55.0       | 78.3 | 57.0     | 74.9 | 17.1     | 40.1 | 28.4     | 67.2
+SR+SCR* (label)          | 82.1       | 92.8 | 72.6     | 84.6 | 37.2     | 63.3 | 45.3     | 83.4
+SR+SCR* (query)          | 84.0       | 93.6 | 73.1     | 85.4 | 37.7     | 64.8 | 45.7     | 83.3
+SR+SCR* (stable cluster) | 83.8       | 93.2 | 73.4     | 85.5 | 39.1     | 65.2 | 46.0     | 82.9

Similarity Recalculation Module. We removed the cluster reconstruction and computed the refinement labels with the original clustering representation. As shown in row 2 of Table 3, the generated refinement labels still help improve performance. We argue that the refinement labels provide complementary information at the instance level, although the noise in the clusters blurs the cluster representation.

Stable Cluster Reconstruction Module. We also tested the effect of reconstruction in isolation. In row 3 of Table 3, we directly replace the original cluster representation with the reconstructed one. We argue that the inconsistency between the representation and the cluster leads to a significant degradation in performance. Besides the pseudo-labels of the current epoch, additional information is required to reconstruct the clusters. In the rest of Table 3, we tried three different kinds of information for reconstructing the clusters: the pseudo-labels from the previous epoch, the multi-class query results after the previous epoch's training process, and the reconstructed clusters from the previous epoch. All of the information sources contributed to the results; we obtained the best results using the multi-class query results on Market-1501 and the previously reconstructed clusters on the other datasets.

The Proportion of Refinement Strategy. Our method helps to improve the accuracy of the pseudo-labels. Figure 3(b) shows the cluster accuracy for α = 0.1, α = 0.5 and α = 0.8. When α = 0.8, the pseudo-labels are close to the ground truth but


reduces the number of available samples by half, resulting in severe performance degradation. If only the refinement labels are used for training (α set above 0.9), the clustering algorithm cannot even generate enough categories, causing training to collapse. The refinement label provides information different from that of the cluster, and it is this information that helps to separate the noisy labels.

5 Conclusion

This paper proposes a label refinement strategy based on stable reconstructed clusters. The strategy comprises a stable cluster reconstruction module and a representative-based similarity recalculation module. Initially, stable clusters are reconstructed across generations, making the cluster representation explicit and generating refined labels. Our label refinement strategy provides the deep neural network with instance-level complementary information and progressively separates the noise in the pseudo-labels during iterative training. Experimental results demonstrate that our method outperforms the baseline approach and achieves state-of-the-art performance on four datasets.

Acknowledgement. This work is supported by the Industry-University Cooperation Project of Fujian Science and Technology Department (No. 2021H6035), the Science and Technology Planning Project of Fujian Province (No. 2021J011191), the Fujian Key Technological Innovation and Industrialization Projects (No. 2023XQ023), and the Fu-Xia-Quan National Independent Innovation Demonstration Project (No. 2022FX4).

References 1. Ming, Z., et al.: Deep learning-based person re-identification methods: a survey and outlook of recent works. Image Vis. Comput. 119, 104394 (2022) 2. He, S., Luo, H., Wang, P., Wang, F., Li, H., Jiang, W.: TransReID: transformerbased object re-identification. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 15013–15022 (2021) 3. Chen, Y., Wang, H., Sun, X., Fan, B., Tang, C., Zeng, H.: Deep attention aware feature learning for person re-identification. Pattern Recogn. 126, 108567 (2022) 4. Wang, D., Zhang, S.: Unsupervised person re-identification via multi-label classification. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 10981–10990 (2020) 5. Fu, Y., Wei, Y., Wang, G., Zhou, Y., Shi, H., Huang, T.S.: Self-similarity grouping: a simple unsupervised cross domain adaptation approach for person reidentification. In: proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 6112–6121 (2019) 6. Zhai, Y., et al.: AD-Cluster: augmented discriminative clustering for domain adaptive person re-identification. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 9021–9030 (2020) 7. Ester, M., Kriegel, H.P., Sander, J., Xu, X., et al.: A density-based algorithm for discovering clusters in large spatial databases with noise. In: KDD, vol. 96, pp. 226–231 (1996)


8. Ge, Y., Zhu, F., Chen, D., Zhao, R., et al.: Self-paced contrastive learning with hybrid memory for domain adaptive object re-id. Adv. Neural. Inf. Process. Syst. 33, 11309–11321 (2020) 9. Oord, A.v.d., Li, Y., Vinyals, O.: Representation learning with contrastive predictive coding. arXiv preprint arXiv:1807.03748 (2018) 10. He, T., Shen, L., Guo, Y., Ding, G., Guo, Z.: Secret: Self-consistent pseudo label refinement for unsupervised domain adaptive person re-identification. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 36, pp. 879–887 (2022) 11. Ge, Y., Chen, D., Li, H.: Mutual mean-teaching: Pseudo label refinery for unsupervised domain adaptation on person re-identification. arXiv preprint arXiv:2001.01526 (2020) 12. Lin, Y., Xie, L., Wu, Y., Yan, C., Tian, Q.: Unsupervised person re-identification via softened similarity learning. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 3390–3399 (2020) 13. Lin, Y., Dong, X., Zheng, L., Yan, Y., Yang, Y.: A bottom-up clustering approach to unsupervised person re-identification. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 33, pp. 8738–8745 (2019) 14. Chen, H., Lagadec, B., Bremond, F.: Ice: inter-instance contrastive encoding for unsupervised person re-identification. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14960–14969 (2021) 15. Cho, Y., Kim, W.J., Hong, S., Yoon, S.E.: Part-based pseudo label refinement for unsupervised person re-identification. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 7308–7318 (2022) 16. Lian, J., Wang, D.H., Du, X., Wu, Y., Zhu, S.: Exploiting robust memory features for unsupervised reidentification. In: Yu, S., et al. Pattern Recognition and Computer Vision. PRCV 2022. Lecture Notes in Computer Science, vol. 13535, pp. 655–667. Springer, Cham (2022). https://doi.org/10.1007/978-3-031-18910-4 52 17. He, K., Fan, H., Wu, Y., Xie, S., Girshick, R.: Momentum contrast for unsupervised visual representation learning. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 9729–9738 (2020) 18. Caron, M., Misra, I., Mairal, J., Goyal, P., Bojanowski, P., Joulin, A.: Unsupervised learning of visual features by contrasting cluster assignments. Adv. Neural. Inf. Process. Syst. 33, 9912–9924 (2020) 19. Dai, Z., Wang, G., Yuan, W., Zhu, S., Tan, P.: Cluster contrast for unsupervised person re-identification. In: Proceedings of the Asian Conference on Computer Vision, pp. 1142–1160 (2022) 20. Zhang, X., Ge, Y., Qiao, Y., Li, H.: Refining pseudo labels with clustering consensus over generations for unsupervised object re-identification. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 3436– 3445 (2021) 21. Zheng, L., Shen, L., Tian, L., Wang, S., Wang, J., Tian, Q.: Scalable person reidentification: a benchmark. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 1116–1124 (2015) 22. Ristani, E., Solera, F., Zou, R., Cucchiara, R., Tomasi, C.: Performance measures and a data set for multi-target, multi-camera tracking. In: Hua, G., J´egou, H. (eds.) ECCV 2016. LNCS, vol. 9914, pp. 17–35. Springer, Cham (2016). https://doi.org/ 10.1007/978-3-319-48881-3 2 23. Wei, L., Zhang, S., Gao, W., Tian, Q.: Person transfer GAN to bridge domain gap for person re-identification. 
In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 79–88 (2018)


24. Liu, X., Liu, W., Mei, T., Ma, H.: A deep learning-based approach to progressive vehicle re-identification for urban surveillance. In: Leibe, B., Matas, J., Sebe, N., Welling, M. (eds.) ECCV 2016. LNCS, vol. 9906, pp. 869–884. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-46475-6 53 25. Zhong, Z., Zheng, L., Kang, G., Li, S., Yang, Y.: Random erasing data augmentation. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 34, pp. 13001–13008 (2020) 26. Zhang, X., et al.: Implicit sample extension for unsupervised person reidentification. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 7369–7378 (2022)

Ranking Variance Reduced Ensemble Attack with Dual Optimization Surrogate Search

Zhichao He and Cong Hu(B)

School of Artificial Intelligence and Computer Science, Jiangnan University, Wuxi, China
[email protected]

Abstract. Deep neural networks have achieved remarkable success, but they are vulnerable to adversarial attacks. Previous studies have shown that combining transfer-based and query-based attacks in ensemble attacks can generate highly transferable adversarial examples. However, simply aggregating the output results of various models without considering the gradient variance among different models will lead to a suboptimal result. Moreover, fixed weights or inefficient methods for model weight updates can result in excessive query iterations. This paper proposes a novel method called Ranking Variance Reduced Ensemble attack with Dual Optimization surrogate search (RVREDO) as an enhanced ensemble attack method for improving adversarial transferability. Specifically, RVREDO mitigates the influence of gradient variance among models to improve the attack success rate of generated adversarial examples during the attack process. Simultaneously, a dual optimization weight updating strategy is employed to dynamically adjust the surrogate weight set, enhancing the efficiency of perturbation optimization. Experimental results on the ImageNet dataset demonstrate that our method outperforms previous state-of-the-art methods in terms of both attack success rate and average number of queries.

Keywords: Adversarial attack · Adversarial transferability · Ensemble attack

1 Introduction

Deep neural networks (DNNs) have achieved remarkable performance in various computer vision tasks [6,16,22,24,29]. However, their susceptibility to adversarial examples, which involve imperceptible perturbations added to original images [8–10], poses a significant security threat to DNNs and their real-world applications. Adversarial examples can deceive the target model and cause misclassification of the original images.

This work was supported in part by the National Natural Science Foundation of China (Grant No. 62006097, U1836218), in part by the Natural Science Foundation of Jiangsu Province (Grant No. BK20200593) and in part by the China Postdoctoral Science Foundation (Grant No. 2021M701456).


To address this vulnerability, research on adversarial attacks in DNNs focuses on enhancing the security, robustness, and transferability of models, leading to advancements in adversarial training and defense strategies [25]. Adversarial attacks can be divided into two categories: white-box attacks and black-box attacks. In black-box attacks, the adversary does not have access to the information of the target model, including its training process and parameter details. Black-box attacks can be further classified into transfer-based attacks and query-based attacks. The former suffers from low attack success rates, while the latter requires a large number of queries. In order to achieve high attack success rates with fewer queries, recent works have combined transfer-based and query-based attack methods, such as BASES [1]. Compared to previous state-of-the-art methods [4,11,18,21], BASES achieves higher attack success rates with very few query iterations. Although BASES is simple and efficient, its weight update strategy does not take into account the impact of different models on perturbation generation, resulting in excessive query consumption. Moreover, traditional ensemble attacks can result in a decrease in attack success rates due to the diverse optimization paths among different models.

To address the limitations mentioned above, we propose a novel approach called Ranking Variance Reduced Ensemble Attack with Dual Optimization Surrogate Search (RVREDO). This approach aims to enhance the adversarial transferability of ensemble attacks. More specifically, in our attack framework, a dual optimization strategy is proposed for weight updates. This approach dynamically adjusts the weights of the models by leveraging the output of the ensemble model. In targeted attacks, when the adversarial example has a low probability of being classified as the target class in a white-box model, it suggests that certain robust features need to be disrupted further. By increasing the weight of that model, the generation of perturbations targeting those features is enhanced. Conversely, for models that have already been successfully attacked, their weights are decreased to maintain a balanced weight distribution. Furthermore, inspired by the work of Xiong et al. [26], the method of ranking variance reduction in ensemble attacks is introduced. In contrast to traditional ensemble attacks, this approach incorporates M internal iterations while preserving the external ensemble gradients. During each internal iteration, the "unbiased gradient" is calculated and accumulated by selecting a white-box model in a ranked order. Adversarial examples updated using these carefully crafted gradients exhibit a higher likelihood of successful transfer to the target black-box model. The contributions of this paper are as follows:

– We propose an effective black-box attack method called RVREDO. This method achieves high success rates in targeted attacks by reducing the gradient variance among models.
– RVREDO utilizes a simple yet efficient weight update strategy to improve query efficiency. This strategy can be combined with ensemble attack methods, further reducing the number of queries required.
– Extensive experiments are conducted on the ImageNet dataset, demonstrating the superiority of RVREDO in terms of attack success rate and average number of queries.

2 Related Work

Query-Based Attacks. This attack strategy involves querying the target model and utilizing its feedback to construct adversarial examples. In order to minimize the complexity of queries, certain methods aim to reduce the dimensionality of the search space or leverage transferable priors and surrogate models for query generation [7,11,15,20,21]. For instance, TREMBA [11] trains a generator to initially generate adversarial examples that deceive the source white-box model and then conducts further exploration in low-dimensional spaces. ODS [21] focuses on maximizing diversity in the outputs of the target model to generate more varied perturbations. GFCS [18] performs the search along the direction of the substitute gradient and switches to ODS if the search is unsuccessful. Simulator Attack [19] reduces query complexity by transferring a majority of queries to the simulator. Transfer-Based Attacks. The fundamental concept behind transfer-based attacks is to utilize local surrogate models to create perturbations and exploit the transferability of these perturbations to deceive the target black-box model. Li et al. [14] propose a method to enhance the transferability of gradient-based targeting attacks by identifying and constraining two characteristics of white-box targeting attacks. AA [13] aims to improve the transferability of generated perturbations across different models by perturbing the feature space of the model. FDA [12] focuses on modeling layer-wise and class-wise feature distributions of a white-box model and utilizes this information to modify the label of an adversarial image. Additionally, [21,28,30] train surrogate models in a data-free manner for transferable attacks. Ensemble Attacks. Liu et al. [17] discovered that simultaneously attacking multiple models can enhance the transferability of the attack. By leveraging the differences and diversities among various models, ensemble attacks can overcome the limitations of individual models and offer more effective attack methods [2,26]. MGAA [27] randomly selects a subset of surrogate models from an ensemble and performs meta-training and meta-testing steps to reduce the disparity between white-box and black-box gradient directions. BASES [1] introduces a two-layer optimization framework that iteratively updates the adversarial perturbation based on the linear combination of fixed surrogate models and adjusts the weights of each white-box model according to the query feedback from the target model.

3 Method

3.1 Preliminaries

Adversarial attacks involve iteratively adding imperceptible noise to the original image, resulting in an adversarial example denoted as x∗ = x + δ [23]. The objective is to generate an adversarial perturbation δ for a benign example x to deceive the model fθ (·), where θ represents the unknown model parameter. This


Fig. 1. The structure of RVREDO. (Top) We first define a perturbation machine consisting of n surrogate models and its weight set w. The initial example x_init is fed into the perturbation machine, and the ranking variance reduced ensemble attack is employed to optimize the adversarial example x*(w). (Bottom) The dual optimization weight update strategy is utilized to effectively optimize the perturbations by searching for the optimal weight set. This process continues until the attack succeeds or the maximum number of queries is reached.

is known as a black-box adversarial attack. To maintain human imperceptibility, a threshold is set on the $\ell_p$ norm of the perturbation, such that $\|\delta\|_p \leq \varepsilon$. Untargeted attacks aim to generate an adversarial example whose predicted class differs from the true class, while targeted attacks seek to match a pre-defined class. The generation of adversarial examples can be formulated as the following optimization problem:

$$\begin{cases} f(x^*) \neq y & \text{for untargeted attack} \\ f(x^*) = y^* & \text{for targeted attack} \end{cases} \qquad \text{s.t. } \|\delta\|_p \leq \varepsilon \tag{1}$$

Here, $\|\delta\|_p$ represents the $\ell_p$ norm of the perturbation, $f(x^*)$ denotes the output of the target model for the adversarial example $x^*$, $y$ represents the class of the original image, and $y^*$ is the desired target class. The objective is to find the minimal perturbation $\delta$ that satisfies the attack objective while keeping the perturbation within the chosen norm constraint.
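As a small illustration of the norm constraint above, the sketch below projects a perturbed image back into the ℓ∞ ball around the clean image; the helper name and the eps value are illustrative (the experiments later use a budget of 16 on 0–255 pixel values, i.e. 16/255 on normalized pixels).

```python
import torch

def project_linf(x_adv, x, eps=16 / 255):
    """Enforce ||x_adv - x||_inf <= eps and keep pixels in the valid [0, 1] range."""
    delta = torch.clamp(x_adv - x, min=-eps, max=eps)
    return torch.clamp(x + delta, min=0.0, max=1.0)
```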

3.2 Perturbation Machine with RVRE

The perturbation machine is defined as an ensemble of surrogate models, with its weight set represented by w. Using this ensemble, queries are generated to evaluate the target model, as illustrated in Fig. 1. The Ranking Variance Reduced


Ensemble (RVRE) attack method is employed to optimize adversarial perturbations. For efficient attacks, we update the weights of the ensemble models to guide the perturbation towards effectively deceiving the target model.

Ensemble on Losses. Assume our perturbation machine (PM) consists of $N$ surrogate models, denoted as $F = \{f_1, \ldots, f_n\}$, with corresponding weights $w = [w_1, \ldots, w_n]$, where $\sum_{i=1}^{n} w_i = 1$. For any given image $x$ and weight vector $w$, we aim to find an adversarial example $x^*$ that deceives the surrogate model ensemble. Dong et al. [5] proposed a method for performing ensemble attacks by averaging the adversarial losses of the $N$ models. The formula is as follows:

$$J(x, y) = \sum_{i=1}^{N} w_i \, L\bigl(f_i(x), y^*\bigr) \tag{2}$$

where $L$ represents some adversarial loss function (e.g., the C&W loss [3]) and $f_i(x)$ is the logits of the $i$-th model. In our experiments, we observed that the weighted-loss formulation outperformed the weighted-probability and weighted-logit formulations, leading us to specifically employ it.

Loss Function. We primarily use the margin loss, which has been proven to be effective in C&W [3] attacks. The problem is formulated as follows:

$$L\bigl(f(x), y^*\bigr) = \max\Bigl(\max_{j \neq y^*} f(x)_j - f(x)_{y^*},\; -\kappa\Bigr) \tag{3}$$

where the parameter κ is a margin or threshold value. If the difference between the highest non-target logit and the target logit is larger than κ, the loss becomes zero. Otherwise, the loss is negative, penalizing the model for not sufficiently separating the target class from other classes.

Ranking Variance Reduced Ensemble Attack. The concept of variance reduction in ensemble attacks was introduced to address the issue of different optimization paths resulting from variations in the structures of the models involved in traditional ensemble attacks. To mitigate the influence of gradient variance among different models, we incorporate an internal loop with M iterations. As shown in Algorithm 1, we compute the gradient $g^{ens}$ of the ensemble by backpropagating the weighted loss; this value remains unchanged during the M iterations of the internal loop. Subsequently, based on the adversarial losses of the external examples on all surrogate models, ranked in descending order, we sequentially select a surrogate model and compute the unbiased gradient $\tilde{g}$. We use the accumulated unbiased gradient to update the internal adversarial examples, while the gradient accumulated in the last internal loop is utilized to optimize the external adversarial examples. By reducing the gradient variance among different models, the unbiased gradient enhances the transferability of the generated perturbations.
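Before turning to the full procedure in Algorithm 1, the following sketch shows one way the margin loss of Eq. (3) and the weighted ensemble loss of Eq. (2) could be computed in PyTorch; the function names and the batch-mean reduction are illustrative choices of ours, not code released with the paper.

```python
import torch

def margin_loss(logits, target, kappa=0.0):
    """C&W-style margin loss of Eq. (3) for a targeted attack: highest
    non-target logit minus the target logit, floored at -kappa."""
    idx = torch.arange(logits.size(0))
    others = logits.clone()
    others[idx, target] = float("-inf")          # mask out the target class
    margin = others.max(dim=1).values - logits[idx, target]
    return torch.clamp(margin, min=-kappa)

def ensemble_loss(x_adv, target, models, weights, kappa=0.0):
    """Weighted ensemble loss of Eq. (2) over the surrogate set F with weights w."""
    total = 0.0
    for f_i, w_i in zip(models, weights):
        total = total + w_i * margin_loss(f_i(x_adv), target, kappa).mean()
    return total
```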


Algorithm 1. Perturbation Machine with RVRE
Input: An initial example x_init (original or perturbed image) and its target class y* (y* = y for an untargeted attack), N surrogate models F = {f_1, ..., f_n} and the corresponding weight set w = [w_1, ..., w_n], external step size α, internal step size β, number of outer update steps T, number of inner update steps M, the perturbation norm ℓ∞ and the bound ε. The loss function L is as in Eq. (3).
Output: Adversarial example x*(w)
1: Initialize G_0 = 0, x*_0 = x_init
2: for t = 0 to T − 1 do
3:   Get the ensemble loss J_ens = Σ_{i=1}^{N} w_i · L_i(f(x*_t), y*)
4:   Calculate the gradient of the ensemble model g_t^ens = ∇_x J_ens
5:   Initialize G̃_0 = 0, x̃_0 = x*_t
6:   Get the model index set: list = argsort{L_1, ..., L_n}
7:   for m = 0 to M − 1 do
8:     Pick the model index k = list[m % N] and its corresponding loss L_k
9:     g̃_m = w_k · (∇_x L_k(f(x̃_m), y*) − ∇_x L_k(f(x*_t), y*)) + g_t^ens
10:    G̃_{m+1} = G̃_m + g̃_m
11:    Update x̃_{m+1} = Clip_x{x̃_m − β · sign(G̃_{m+1})}
12:  end for
13:  G_{t+1} = G_t + G̃_M
14:  Update x*_{t+1} = Clip_x{x*_t − α · sign(G_{t+1})}
15: end for
16: return x*(w)
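A rough PyTorch rendering of Algorithm 1 is given below for orientation; it reuses the margin_loss and project_linf helpers sketched earlier, the descending-loss ordering in line 6 is our reading of the ranking step, and the step sizes are illustrative rather than the paper's exact values.

```python
import torch

def rvre_attack(x, target, models, weights, loss_fn, T=10, M=8,
                alpha=2 / 255, beta=2 / 255, eps=16 / 255):
    """Sketch of Algorithm 1 (RVRE); loss_fn is a per-model adversarial loss
    such as margin_loss."""
    x_adv = x.clone()
    G = torch.zeros_like(x)
    for _ in range(T):
        # Lines 3-4: weighted ensemble loss and its gradient g^ens.
        x_out = x_adv.clone().requires_grad_(True)
        losses = [loss_fn(f(x_out), target).mean() for f in models]
        j_ens = sum(w * l for w, l in zip(weights, losses))
        g_ens, = torch.autograd.grad(j_ens, x_out)

        # Line 6: rank surrogates by their adversarial loss (descending).
        order = sorted(range(len(models)), key=lambda i: -losses[i].item())

        x_in, G_in = x_adv.clone(), torch.zeros_like(x)
        for m in range(M):                                   # inner variance-reduction loop
            k = order[m % len(models)]
            x_tmp = x_in.clone().requires_grad_(True)
            g_new, = torch.autograd.grad(loss_fn(models[k](x_tmp), target).mean(), x_tmp)
            x_ref = x_adv.clone().requires_grad_(True)
            g_old, = torch.autograd.grad(loss_fn(models[k](x_ref), target).mean(), x_ref)
            g_tilde = weights[k] * (g_new - g_old) + g_ens   # line 9
            G_in = G_in + g_tilde                            # line 10
            x_in = project_linf(x_in - beta * G_in.sign(), x, eps)   # line 11
        G = G + G_in                                         # line 13
        x_adv = project_linf(x_adv - alpha * G.sign(), x, eps)       # line 14
    return x_adv
```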

3.3 Dual Optimization Weight Update Strategy

Many ensemble attack methods simply average the weights of surrogate models or generate queries using a random weight allocation. These methods overlook the impact of different surrogate models on perturbation generation, leading to low attack success rates and excessive query iterations. To overcome this issue, we propose a Dual Optimization (DO) weight update strategy that optimizes the surrogate weight set based on the output results of different models. The basic idea is as follows: if the adversarial loss of an adversarial example is high in a particular surrogate model, it suggests that certain features of the current adversarial example are robust to that model. In order to deceive the target model, we increase the weight of that model to guide the generation of perturbations and disrupt those robust features. Conversely, for models with lower adversarial loss, we decrease their weights while maintaining the sum of all model weights equal to 1. The optimization objective can be formulated as follows:

$$w = \arg\min_{w} L_v\bigl(f_v(x^*(w)),\, y'\bigr) \tag{4}$$

where fv represents the target black-box model and Lv represents the adversarial loss of the target model. The objective is to find the optimal weights w that


minimize the adversarial loss between the predictions of the target black-box model $f_v$ on the adversarial examples $x^*(w)$ and the desired target labels $y'$.

$$w_{n1} = \arg\max\,[L_1, \ldots, L_N], \qquad w_{n2} = \arg\min\,[L_1, \ldots, L_N] \tag{5}$$

Initially, we set the weights of the ensemble models to equal values of 1/n and generate an initial adversarial example x∗ (w). Based on the feedback obtained by querying the target model, we determine whether to update the model weights. If the attack fails, we select the model indices wn1 and wn2 that correspond to the maximum and minimum adversarial loss values of the adversarial example among the ensemble models, as shown in Eq. 5. We then update wn1 as wn1 + η and wn2 as wn2 − η, where η represents the step size. Next, we input the adversarial example into the PM with the updated weights for further perturbation. This process continues until the attack succeeds or reaches the maximum allowed number of queries.
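The weight search just described reduces to a few lines; in the sketch below the function name is ours and eta plays the role of the step size η.

```python
def dual_optimization_update(weights, losses, eta=0.005):
    """Sketch of the DO update built on Eq. (5): after a failed query, move weight
    eta from the surrogate with the smallest adversarial loss to the one with the
    largest, so the weights keep summing to 1."""
    n1 = max(range(len(losses)), key=lambda i: losses[i])   # hardest surrogate
    n2 = min(range(len(losses)), key=lambda i: losses[i])   # easiest surrogate
    new_w = list(weights)
    new_w[n1] += eta
    new_w[n2] -= eta
    return new_w
```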

4 Experiments

4.1 Experiment Setup

Surrogate and Victim Models. In our method, both the surrogate models and the target black-box models are pretrained image classification models from the PyTorch torchvision library. In the experiments, we selected three different model architectures as the target black-box models: VGG19, DenseNet121, and ResNeXt50. As for the surrogate models, we chose a set of 20 models, including VGG-16-BN, ResNet-18, SqueezeNet-1.1, GoogleNet, MNASNet-1.0, DenseNet-161, EfficientNet-B0, RegNet-y-400, ResNeXt-101, Convnext-Small, ResNet-50, VGG-13, DenseNet-201, Inception-v3, ShuffleNet-1.0, MobileNet-v3Small, Wide-ResNet-50, EfficientNet-B4, RegNet-x-400, and VIT-B-16. These models have diverse architectures, which is helpful for constructing an efficient ensemble attack. Dataset. We primarily used a dataset of 1000 ImageNet images from the NeurIPS-17 Adversarial Attack and Defense Competition. This dataset consists of carefully selected target classes and is highly suitable for testing our attack method. Baseline. As BASES [1] has shown excellent performance, our experiments mainly compare with it. In addition, two other powerful methods, TREMBA [11] and GFCS [18], are also included in our comparison. Query Budget. In the real world scenarios, most commercial models are charged based on the number of queries made. Therefore, we set a very limited maximum number of queries of 50 for both our method and BASES. To provide a fair comparison with GFCS and TREMBA, we set their maximum number of queries to 500.
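For reference, one possible way to assemble such a surrogate ensemble from the PyTorch torchvision library is sketched below; the particular subset of model names is an illustrative choice rather than the paper's exact configuration, and `get_model` with `weights="DEFAULT"` assumes a reasonably recent torchvision release.

```python
import torch
from torchvision import models

# Illustrative subset of the 20 surrogates listed above.
surrogate_names = ["vgg16_bn", "resnet18", "squeezenet1_1", "googlenet",
                   "densenet161", "efficientnet_b0", "resnet50", "vit_b_16"]
surrogates = [models.get_model(name, weights="DEFAULT").eval() for name in surrogate_names]
for m in surrogates:
    for p in m.parameters():
        p.requires_grad_(False)        # only the input image needs gradients

victim = models.densenet121(weights="DEFAULT").eval()   # one of the black-box targets
weights = [1.0 / len(surrogates)] * len(surrogates)     # equal initial weights (1/n)
```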


Table 1. Average number of queries and attack success rate of different attack methods (each cell: average queries / success rate).

Method  | VGG19 Targeted | VGG19 Untargeted | DenseNet121 Targeted | DenseNet121 Untargeted | ResNeXt50 Targeted | ResNeXt50 Untargeted
TREMBA  | 60.45 / 88.1%  | 2.03 / 99.7%     | 45.98 / 89.9%        | 4.9 / 99.5%            | 71.52 / 85.4%      | 4.19 / 98.9%
GFCS    | 124.9 / 85.5%  | 13.88 / 99.8%    | 87.03 / 93.2%        | 16.07 / 99.3%          | 107.28 / 89.1%     | 16.4 / 99.7%
BASES   | 3.05 / 96.2%   | 1.2 / 99.9%      | 1.68 / 99.6%         | 1.13 / 99.9%           | 1.79 / 99.8%       | 1.18 / 100%
RVREDO  | 2.58 / 97.2%   | 1.1 / 100%       | 1.56 / 99.8%         | 1.1 / 100%             | 1.62 / 99.7%       | 1.1 / 100%

Hyper-parameters Setup. We evaluated our method under the ℓ∞ norm, with the commonly used perturbation budget of ℓ∞ ≤ 16. In all experiments, we set the number of external iterations T to 10 and the number of internal iterations M to 8. The step sizes for both the internal update α and the external update β are set to 3 × 1.6. The weight update step size η is set to 0.005.

4.2 Attack Effect Analysis

We conducted a comprehensive comparison of our attack method with BASES [1], TREMBA [11], and GFCS [18]. Since our attack method builds upon BASES, the attack settings are generally consistent. However, TREMBA requires training a generator for each target label, which can be time-consuming, particularly when testing against 1,000 target classes from ImageNet. To ensure a fair comparison with TREMBA, we utilized the pre-trained generators from TREMBA for six target labels (0, 20, 40, 60, 80, 100). These generators were employed to attack the same target model, and we averaged the attack results and numbers of queries. Additionally, GFCS is also an ensemble attack method, so we integrated the same 20 surrogate models and fine-tuned its hyper-parameters to achieve optimal performance. As illustrated in Table 1, our method yields nearly the best results for both targeted and untargeted attacks.

Figure 2 compares the black-box targeted attack performance of the four methods on three different target models. It can be observed that our method achieves higher attack success rates with fewer queries than BASES. When the number of queries approaches 50, both methods achieve nearly perfect attack performance, but our method still outperforms BASES. TREMBA and GFCS show a significant decrease in attack performance when the number of queries is limited. Even when their number of queries is set to four times that of our method, RVREDO achieves success rates that exceed those of TREMBA and GFCS by 34.2% and 15.5%, respectively, in the experiments conducted on ResNeXt-50. In the real world, our method is therefore more efficient than these two methods.


Fig. 2. Four attack methods were compared in targeted attacks against three victim models, with a perturbation budget of ℓ∞ ≤ 16. We set the maximum number of queries to 50 for both our method and BASES, while GFCS and TREMBA are set to 200.

Table 2. Ablation study by cutting off different components (each cell: average queries / success rate).

Method   | VGG19 Targeted | VGG19 Untargeted | DenseNet121 Targeted | DenseNet121 Untargeted | ResNeXt50 Targeted | ResNeXt50 Untargeted
RVREDO   | 2.579 / 97.2%  | 1.1 / 100%       | 1.56 / 99.8%         | 1.1 / 100%             | 1.62 / 99.7%       | 1.1 / 100%
w/o DO   | 2.88 / 96.3%   | 1.12 / 100%      | 1.74 / 99.7%         | 1.16 / 100%            | 1.77 / 99.7%       | 1.1 / 100%
w/o RVRE | 2.583 / 96.6%  | 1.17 / 99.9%     | 1.63 / 99.5%         | 1.14 / 99.9%           | 1.67 / 99.9%       | 1.13 / 100%
BASES    | 3.05 / 96.2%   | 1.2 / 99.9%      | 1.68 / 99.6%         | 1.13 / 99.9%           | 1.79 / 99.8%       | 1.18 / 100%

4.3 Ablation Study

In this section, we investigate the impact of RVRE and DO on the experimental results. "w/o RVRE" indicates that the perturbation is optimized with the traditional ensemble attack method, and "w/o DO" denotes the use of the surrogate weight search from BASES, in which the model weights are updated sequentially every two queries. To ensure a fair experimental comparison, we employed the same attack settings on the three black-box target models. The experimental results are presented in Table 2. They indicate that both RVRE and DO can enhance the transferability of adversarial examples. Specifically, in the targeted attack experiments on VGG19, RVRE achieves 4 more successful attacks than the baseline while reducing the average number of queries by approximately 0.5. DO achieves one additional successful attack and reduces the average number of queries by 0.2. The combination of RVRE and DO leads to a 1% improvement in attack success rate, with the average number of queries reduced to 2.579.


Fig. 3. Left: comparison of targeted attack success rates with different numbers of ensemble models N. Right: targeted attack success rates for different internal update frequencies M.

4.4 Comparison on Hyper-parameters

In this analysis, we investigate the influence of two critical hyperparameters, namely the ensemble model size (N ) and the internal update frequency (M ), on the performance of our method. We consider the number of ensemble surrogate models as a form of training data, analogous to the training process of models. Ensemble of multiple models helps improve the transferability of adversarial examples. Here, we choose N as 10 and 20 to evaluate the performance of our method on three black-box models. As shown in Fig. 3(a), when N is set to 20, we achieve nearly perfect attack success rates on all target models within the 50-query limit. The attack performance is compromised when N is reduced to 10. This is particularly evident in the case of VGG19, where the attack success rate drops from 97.2% to 87.5%. In theory, a larger number of ensemble models leads to better transferability of generated perturbations. However, excessive ensemble models result in increased computational complexity. Therefore, in our experiments, we primarily set the ensemble model count to 20. In Fig. 3(b), we investigate the influence of the internal update frequency on the transferability of adversarial examples, specifically focusing on experiments conducted on VGG19. Given that DenseNet121 and ResNext50 have already achieved near-optimal performance, we analyze the results obtained with VGG19. The experiments reveal that the best attack performance is attained when M is set to 7. Increasing M further helps mitigate the impact of intermodel variance on the perturbations, thereby improving their transferability. However, continued increases in M result in a decline in performance, likely due to overfitting of the perturbations to the ensemble models during multiple iterations, which ultimately diminishes their attack effectiveness.

5 Conclusion

This paper focuses on improving adversarial transferability in black-box attacks. Our approach consists of two components: a ranking variance reduced ensemble attack and a dual optimization weight update strategy. The former reduces the variance of gradients between different models, enabling more accurate gradient updates for perturbations. The latter leverages the output of different models and dynamically adjusts the surrogate weight set before each query, facilitating rapid optimization of perturbations to deceive the target model. Extensive experiments demonstrate that RVREDO achieves high attack performance with a low average number of queries.

References 1. Cai, Z., Song, C., Krishnamurthy, S., Roy-Chowdhury, A., Asif, S.: Blackbox attacks via surrogate ensemble search. In: NeurIPS (2022) 2. Chen, H., Zhang, Y., Dong, Y., Zhu, J.: Rethinking model ensemble in transferbased adversarial attacks. CoRR abs/2303.09105 (2023) 3. Chen, J., Wu, X., Guo, Y., Liang, Y., Jha, S.: Towards evaluating the robustness of neural networks learned by transduction. In: ICLR. OpenReview.net (2022) 4. Cheng, S., Dong, Y., Pang, T., Su, H., Zhu, J.: Improving black-box adversarial attacks with a transfer-based prior. In: NeurIPS, pp. 10932–10942 (2019) 5. Dong, Y., et al.: Boosting adversarial attacks with momentum. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2018) 6. Duan, Y., Lu, J., Zheng, W., Zhou, J.: Deep adversarial metric learning. IEEE Trans. Image Process. 29, 2037–2051 (2020) 7. Guo, C., Gardner, J.R., You, Y., Wilson, A.G., Weinberger, K.Q.: Simple blackbox adversarial attacks. In: ICML, Proceedings of Machine Learning Research, vol. 97, pp. 2484–2493. PMLR (2019) 8. Hu, C., Li, Y., Feng, Z., Wu, X.: Attention-guided evolutionary attack with elasticnet regularization on face recognition. Pattern Recogn. 143, 109760 (2023) 9. Hu, C., Wu, X., Li, Z.: Generating adversarial examples with elastic-net regularized boundary equilibrium generative adversarial network. Pattern Recognit. Lett. 140, 281–287 (2020) 10. Hu, C., Xu, H.Q., Wu, X.J.: Substitute meta-learning for black-box adversarial attack. IEEE Signal Process. Lett. 29, 2472–2476 (2022). https://doi.org/10.1109/ LSP.2022.3226118 11. Huang, Z., Zhang, T.: Black-box adversarial attack with transferable model-based embedding. In: ICLR. OpenReview.net (2020) 12. Inkawhich, N., Liang, K.J., Carin, L., Chen, Y.: Transferable perturbations of deep feature distributions. In: ICLR. OpenReview.net (2020) 13. Inkawhich, N., Wen, W., Li, H.H., Chen, Y.: Feature space perturbations yield more transferable adversarial examples. In: 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 7059–7067 (2019). https://doi.org/ 10.1109/CVPR.2019.00723 14. Li, M., Deng, C., Li, T., Yan, J., Gao, X., Huang, H.: Towards transferable targeted attack. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 641–649 (2020)


15. Li, S., et al.: Adversarial attacks on black box video classifiers: leveraging the power of geometric transformations. In: NeurIPS, pp. 2085–2096 (2021) 16. Lin, S., et al.: Towards optimal structured CNN pruning via generative adversarial learning. In: CVPR, pp. 2790–2799. Computer Vision Foundation/IEEE (2019) 17. Liu, Y., Chen, X., Liu, C., Song, D.: Delving into transferable adversarial examples and black-box attacks. In: ICLR (Poster). OpenReview.net (2017) 18. Lord, N.A., M¨ uller, R., Bertinetto, L.: Attacking deep networks with surrogatebased adversarial black-box methods is easy. In: ICLR. OpenReview.net (2022) 19. Ma, C., Chen, L., Yong, J.: Simulating unknown target models for queryefficient black-box attacks. In: CVPR, pp. 11835–11844. Computer Vision Foundation/IEEE (2021) 20. Suya, F., Chi, J., Evans, D., Tian, Y.: Hybrid batch attacks: finding black-box adversarial examples with limited queries. In: USENIX Security Symposium, pp. 1327–1344. USENIX Association (2020) 21. Tashiro, Y., Song, Y., Ermon, S.: Diversity can be transferred: output diversification for white- and black-box attacks. In: NeurIPS (2020) 22. Tian, C., Xu, Y., Li, Z., Zuo, W., Fei, L., Liu, H.: Attention-guided CNN for image denoising. Neural Netw. 124, 117–129 (2020) 23. Wang, G., Yan, H., Wei, X.: Enhancing transferability of adversarial examples with spatial momentum. In: Yu, S., et al. Pattern Recognition and Computer Vision. PRCV 2022. LNCS, vol. 13534. Springer, Cham (2022). https://doi.org/10.1007/ 978-3-031-18907-4 46 24. Wang, M., Deng, W.: Deep face recognition: a survey. Neurocomputing 429, 215– 244 (2021) 25. Xie, C., Wu, Y., van der Maaten, L., Yuille, A.L., He, K.: Feature denoising for improving adversarial robustness. In: CVPR, pp. 501–509. Computer Vision Foundation/IEEE (2019) 26. Xiong, Y., Lin, J., Zhang, M., Hopcroft, J.E., He, K.: Stochastic variance reduced ensemble adversarial attack for boosting the adversarial transferability. In: CVPR, pp. 14963–14972. IEEE (2022) 27. Yuan, Z., Zhang, J., Jia, Y., Tan, C., Xue, T., Shan, S.: Meta gradient adversarial attack. In: ICCV, pp. 7728–7737. IEEE (2021) 28. Zhang, J., Li, B., Xu, J., Wu, S., Ding, S., Zhang, L., Wu, C.: Towards efficient data free blackbox adversarial attack. In: 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 15094–15104 (2022). https://doi. org/10.1109/CVPR52688.2022.01469 29. Zhang, W., Gou, Y., Jiang, Y., Zhang, Y.: Adversarial VAE with normalizing flows for multi-dimensional classification. In: Yu, S., et al. (eds.) Pattern Recognition and Computer Vision. PRCV 2022. LNCS, vol. 13534. Springer, Cham (2022). https:// doi.org/10.1007/978-3-031-18907-4 16 30. Zhou, M., Wu, J., Liu, Y., Liu, S., Zhu, C.: DaST: data-free substitute training for adversarial attacks. In: CVPR, pp. 231–240. Computer Vision Foundation/IEEE (2020)

Performance Evaluation and Benchmarks

PCR: A Large-Scale Benchmark for Pig Counting in Real World

Jieru Jia1,2,3(B), Shuorui Zhang1,2,3, and Qiuqi Ruan4

1 Institute of Big Data Science and Industry, Shanxi University, Taiyuan, China
[email protected]
2 Engineering Research Center for Machine Vision and Data Mining of Shanxi Province, Taiyuan 030006, China
3 School of Information and Technology, Shanxi University, Taiyuan 030006, China
4 Institute of Information Science, Beijing Jiaotong University, Beijing 100044, China

Abstract. Automatic pig counting with pattern recognition and computer vision techniques, despite its significance in intelligent agriculture, remains a relatively unexplored area and calls for further study. In this paper, we propose a large-scale image-based Pig Counting in Real world (PCR) dataset, covering a variety of real-world scenarios and environmental factors. The dataset consists of two subsets, i.e., PartA captured on real-world pig pens and PartB collected from the Internet, with center point annotations of pig torsos in 4844 images. Moreover, we develop an automatic pig counting algorithm based on weakly-supervised instance segmentation, which can output a single segmentation blob per instance via the proposed Segmentation-Split-Regression (SSR) loss, utilizing point-level annotations only. Experiments show that the proposed algorithm achieves state-of-the-art counting accuracy and exhibits superior robustness against challenging environmental factors. The dataset and source codes are available at https://github.com/jierujia0506/PCR.

Keywords: pig counting · computer vision · instance segmentation · deep neural network

1 Introduction

Counting the number of pigs in a barn is a critical task in modern large-scale farming management. Accurate pig counting can greatly assist in pig management and in the early detection of abnormal events such as missing pigs. However, up to now, this task still relies heavily on manual labor, which is error-prone and inefficient. Moreover, frequent trading and the birth of new pigs result in significant fluctuations in the number of pigs, making the task even more challenging for humans. Therefore, there is an urgent need to automate pig counting using pattern recognition and computer vision techniques.

Supplementary Information: The online version contains supplementary material available at https://doi.org/10.1007/978-981-99-8462-6_19.


It is fascinating to see artificial intelligence meet agriculture. For example, computer vision algorithms have been exploited to count oil palm trees [5], cranberries [1], and banana bunches [23], and to estimate animal poses [14], to name a few. A few attempts have also been made at pig counting [4,20]. However, the task remains quite challenging due to factors such as heavy pig overlapping, occlusions, changes in illumination, and variations in viewpoint, as shown in Fig. 1. Generally speaking, the problem of pig counting is still far from being solved, with limited works in the literature and inferior accuracy. Furthermore, there is a lack of publicly available large-scale benchmarks and datasets for pig counting. Although there are some existing animal counting datasets such as AnimalDrone [29], CIW [2] and the Aerial Elephant Dataset [13], they primarily focus on wildlife species and are captured from drone views, which may be difficult for farmers to access. Currently, there is no publicly available flat-view dataset specifically designed for pig counting.

Fig. 1. Illustrations of pig counting challenges, including (a) overlapping of pigs, (b) occlusions, (c) illumination changes, (d) variations in viewpoint.

To address the lack of publicly available datasets for pig counting, we collect a large-scale dataset named PCR (Pig Counting in Real World), which consists of two subsets, i.e., PCR-PartA and PCR-PartB. Specifically, images in PCR-PartA are shot on real-world pig pens, while PCR-PartB is collected from Internet. PartA contains 1860 images with 13,172 center point annotated pigs and PartB consists of 2984 images with 27,453 annotated pigs. To sum up, the PCR dataset contains 4844 images with 40,625 labeled pigs in diverse scenes. The images are taken under different viewpoints, lighting conditions and occlusions, which closely mimic the challenges that may be encountered in real-world scenarios. In summary, the images in PCR-PartA reflects the actual conditions in farms and adds to the authenticity of the dataset, while the addition of internet-collected images in PCR-PartB further broadens the dataset, capturing a wider range of scenarios and increasing the diversity of the dataset. In addition, we put forward a novel solution to address the pig counting task based on weakly-supervised instance segmentation, which only requires point-level annotations of the center of each pig. Concretely, we design a novel Segmentation-Split-Regression (SSR) loss function, which combines multiple loss terms to guide the model in learning accurate instance segmentation and counting. The Segmentation loss encourages the model to predict the semantic label


of each pixel, while the Split loss separates overlapping instances so that the model outputs a single segmentation blob per instance. The Regression loss directly predicts the correct number of pigs in the image, which is an essential component for accurate pig counting. The whole network is trained in an end-to-end manner with the proposed SSR loss. Extensive experiments are carried out on the PCR dataset, and the proposed algorithm achieves state-of-the-art counting accuracy and exhibits superior robustness against challenging environmental factors. To summarize, the main contributions of this paper are:

– We collect and annotate a large-scale image-based pig counting dataset, i.e., Pig Counting in Real world (PCR), which can greatly benefit the research community by providing a valuable resource for training and evaluating pig counting algorithms.
– We propose a practical and cost-effective pig counting approach based on weakly-supervised instance segmentation via the proposed Segmentation-Split-Regression (SSR) loss, which can output a single segmentation blob per instance given point-level annotations, thus sparing the requirement of exhaustive bounding-box or pixel-level annotation.
– Extensive experiments on the PCR dataset are conducted, demonstrating that the proposed method can achieve state-of-the-art performance with evidently small Mean Absolute Error and Mean Squared Error (these two counting metrics are sketched after this list).
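For clarity, the two counting metrics referenced above are the standard per-image count errors; a minimal sketch follows, with the caveat that whether the paper reports MSE with or without a square root is not visible in this excerpt, so both variants are returned.

```python
import numpy as np

def counting_errors(pred_counts, gt_counts):
    """Mean absolute error and (root) mean squared error between predicted and
    ground-truth counts per image."""
    pred = np.asarray(pred_counts, dtype=float)
    gt = np.asarray(gt_counts, dtype=float)
    mae = np.abs(pred - gt).mean()
    mse = ((pred - gt) ** 2).mean()
    return mae, mse, np.sqrt(mse)
```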

2 Related Work

2.1 Existing Animal Counting Datasets

Several animal-based counting datasets exist. For example, a thermal infrared video dataset is proposed in [25] to count bats within given bounding boxes. Arteta et al. [2] propose a large-scale Counting in the Wild (CIW) dataset of penguins in Antarctica, which contains 500 thousand images collected at over 40 different sites. The Aerial Elephant Dataset [13] includes 2,101 images with 15,511 African bush elephants for the aerial object detection task. Rey et al. [17] collect 654 images of animals detected in the African Savanna with UAVs and annotate the animals with convex hull polygons. Shao et al. [18] construct two subsets of pasture aerial images for the cattle detection and counting task. The AnimalDrone [29] dataset contains 53,644 frames with over 4 million object annotations. The majority of the aforementioned datasets focus on wildlife species such as elephants, yaks, and others, and none of them is specifically dedicated to pig counting. Furthermore, most of these datasets are collected using drones or unmanned aerial vehicles (UAVs), where only the backs of the animals can be seen; moreover, such collection devices are not easily accessible for farmers in real-life scenarios. As a result, existing datasets are not suitable for the pig counting task in agricultural and farming automation. To the best of our knowledge, the PCR dataset is the first large-scale dataset specifically designed for pig counting, filling a significant gap in the research literature.

2.2 Object Counting Methods

In recent years, numerous methods have been introduced for counting various types of objects, with a particular focus on crowd counting. These methods can be broadly categorized into four main branches: regression-based, density map estimation-based, detection-based, and segmentation-based approaches.

Regression Based Methods: Early approaches in object counting utilize regression-based algorithms, which map extracted image features to the output counts through machine learning. For example, Glance [3] learns to count using only image-level labels and is accurate when the object count is small. However, regression-based methods have gradually fallen out of favor with the rise of deep learning, as they require manual feature design, which can be time-consuming and challenging.

Density Based Methods: Density map estimation-based methods require annotated object positions in the image, and the ground-truth density map is obtained through Gaussian kernel convolution. Machine learning methods, such as deep learning, are then utilized to estimate density maps for training samples. For example, MCNN [27] uses filters of different sizes to generate density maps with improved robustness to scale changes. CSRNet [9] employs dilated convolutions to yield larger receptive fields. Wang et al. [22] introduce a Spatial Fully Convolutional Network with a spatial encoder that uses a sequence of convolutions in four directions. Zhou et al. [28] design a double recursive sparse self-attention module to capture long-distance dependencies. Zand et al. [26] propose a multiscale and multitask approach based on point supervision for crowd counting and person localization. OT-M [11] presents a parameter-free crowd localization method which alternates between transport plan estimation and point map updating. Despite their effectiveness and the exciting progress in recent years, this line of methods is not suitable for counting pigs, as they assume objects have fixed sizes and shapes, which clearly does not hold in the case of pig counting. Pigs can vary greatly in size and shape, making it challenging to accurately estimate density maps using Gaussian kernels with fixed sizes.

Detection Based Methods: Counting by detection is a straightforward approach that first detects the objects of interest and then outputs their locations and classes to determine the count. There are numerous well-developed object detection algorithms, such as YOLO [15], RetinaNet [10], and Faster R-CNN [16]. However, these methods may not be suitable for counting pigs in dense scenes where objects are heavily occluded, as shown in Fig. 1(a). Moreover, these object detection methods typically rely on bounding-box labels, which can be more expensive to obtain compared to our point-level annotations.

Segmentation Based Methods: These methods share a similar principle as detection-based methods but utilize segmentation blobs to represent the instances instead of complete bounding boxes. The intuition is that predicting the exact size and shape of objects may not be necessary for counting


tasks, and a rough segmentation blob per instance would suffice. LCFCN [8] is a representative method that has shown promising performance in counting various types of objects, including people, cars, and animals. However, LCFCN has limitations in handling highly overlapped and mutually occluded objects, especially when objects are only a few pixels apart, as shown in Fig. 1(a). To address this limitation, the loss function of LCFCN is improved in this paper by directly imposing a regression loss on the error between the predicted and ground-truth counts. This mitigates the over-prediction of counts caused by too many tiny segmentation regions, which can occur due to over-segmentation. The effectiveness of the proposed improvement is verified through extensive experiments.

2.3 Pig Counting Methods

As for the pig counting task, Tian et al. [20] present a modified counting network based on ResNeXt to output density maps. The approach belongs to the density map estimation-based methods, so its performance is highly sensitive to the quality of the ground-truth density maps and the choice of kernel parameters, which can be challenging to determine accurately. Chen et al. [4] put forward a network to detect five keypoints of pig body parts and associate the keypoints to identify individual pigs. However, the method fails to perform well when only single-point annotation is available. A semantic segmentation and counting network based on DeepLab V3 is proposed in [12], where pixel-level annotation is required. Different from previous methods, we present a novel and efficient solution for pig counting based on weakly-supervised instance segmentation, where only a center point annotation of each instance is needed. Unlike Chen et al. [4], which requires keypoint detection and association, the proposed method works well for a single image with a limited field of view, which is common in practical scenarios. In addition, the proposed method does not require the tedious manual tuning of kernel parameters as in [20] or the full per-pixel annotation in [12].

3 PCR Dataset

3.1 Data Collection

The PCR dataset consists of two subsets, i.e., PCR-PartA and PCR-PartB. The images of PCR-PartA are captured on site in Beijing, Hebei and Shanxi Provinces of China. These images are taken by pig owners or employees of a farming insurance company with various types of cameras, including a Canon EOS 90D SLR camera, surveillance cameras, and mobile phone cameras. As a result, the images in PCR-PartA have significantly different resolutions and shooting perspectives, reflecting the diversity of real-world scenarios. The collection environment includes both indoor and outdoor settings, with varying weather conditions. Meanwhile, the pigs in the images range from newborn piglets to those


weighing over 200 kg, representing a wide range of pig ages and sizes commonly encountered in pig farming. In a nutshell, the collected images differ in image resolution, shooting perspective, environmental factors, and pig characteristics that affect pig appearance, which makes PCR a challenging and realistic benchmark for evaluating pig counting approaches. Importantly, proper permissions have been obtained for data usage, and privacy concerns have been addressed. The images in PCR-PartB are collected from the Baidu website using specific keywords such as “pig herds,” “pig pens,” and “hogs.” After a careful selection process, 2984 valid piggery images are chosen. These images are selected based on their similarity in shooting scenes and perspectives with PCR-PartA, ensuring consistency and comparability between the two subsets. The inclusion of PCR-PartB adds further diversity to the dataset.

3.2 Data Pruning and Pre-processing

Since the raw collected images vary dramatically in size, resolution, visual quality, etc., several strategies have been employed to clean and preprocess them: (1) we remove duplicated and redundant images (some images were submitted to us multiple times); (2) we remove extremely blurred images in which the pigs are difficult to recognize. Finally, 1860 qualified images are selected into the PCR-PartA dataset. Similar strategies are applied to the PartB images to ensure that only high-quality and relevant images are included.

3.3 Data Annotation

In order to label the data, we have compared several kinds of annotations, including image-level labels, points, bounding boxes and full pixel-level annotations. Among these options, center point annotation is chosen as a suitable trade-off between counting accuracy and annotation cost. Specifically, a Matlab-based GUI is developed to obtain the coordinates of the center point of each pig body through a single click. Pigs of which less than 10% of the whole body area appears in the image are considered invalid and are not labeled. Each image is annotated by three groups of annotators separately, and the results are cross-validated between the groups. If the difference between the three annotation results is within a given threshold, they are included in the dataset; if the difference exceeds the acceptable threshold, the image is re-annotated until the difference falls below the threshold. In total, 4844 valid images are obtained with a total of 40,625 pigs labeled; each image contains at least 1 pig and at most 52 pigs, with an average of 8.39 pigs per image.
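The cross-validation between annotator groups can be expressed as a simple acceptance rule. The sketch below assumes a count-based interpretation of the "difference" between groups and uses a placeholder threshold; the authors' exact criterion and threshold value are not specified here.

```python
# Hedged sketch: accept a sample only if the three annotator groups' pig
# counts agree within a tolerance; otherwise the image is re-annotated.
def annotations_consistent(counts_by_group, threshold: int = 1) -> bool:
    """counts_by_group: one pig count per annotator group, e.g. [9, 9, 10]."""
    return max(counts_by_group) - min(counts_by_group) <= threshold


# e.g. annotations_consistent([9, 9, 10]) -> True (accepted with threshold 1)
```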

4 Proposed Approach

4.1 Network Architecture

The proposed network architecture is an encoder-decoder based convolutional neural network, as shown in Fig. 2. The input image, denoted as $X \in \mathbb{R}^{H \times W \times 3}$,


is passed through an encoder to obtain feature embeddings $E \in \mathbb{R}^{H' \times W' \times C}$, where $H' \times W'$ denotes the spatial dimensions and $C$ is the number of channels. The encoder leverages pre-trained models such as ResNet [6], HRNet [19], or other similar architectures to extract meaningful features. The feature embeddings are then passed through the decoder to progressively recover the spatial information. To preserve fine-grained spatial details, skip connections are utilized, where the decoder features are concatenated or element-wise summed with the corresponding encoder features. Finally, a classification layer with softmax activation is added after the decoder to obtain the pixel-wise class probabilities, which form the segmentation map $S \in \mathbb{R}^{H \times W \times c}$, where $c$ represents the number of classes. This segmentation map is then compared with the ground-truth annotations to calculate the SSR loss.

Fig. 2. The proposed network for pig counting. The input image is fed into the network to get the segmentation map, which is then exploited to compute the loss functions.
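The following is a minimal PyTorch sketch of the encoder-decoder network described above. The paper uses HRNet (or ResNet) as the encoder; here a torchvision ResNet-18 is used for brevity, and the specific decoder and skip-connection design is an assumption rather than the authors' exact architecture.

```python
# Encoder-decoder counting network: encoder features at several strides are
# fused back through a lightweight decoder with skip connections, and a final
# softmax layer yields the pixel-wise class probability map S.
import torch
import torch.nn as nn
import torch.nn.functional as F
from torchvision.models import resnet18


class SegCountNet(nn.Module):
    def __init__(self, num_classes: int = 2):
        super().__init__()
        backbone = resnet18(weights="IMAGENET1K_V1")
        self.stem = nn.Sequential(backbone.conv1, backbone.bn1, backbone.relu, backbone.maxpool)
        self.layer1, self.layer2 = backbone.layer1, backbone.layer2
        self.layer3, self.layer4 = backbone.layer3, backbone.layer4
        # Decoder convolutions that fuse upsampled features with encoder skips.
        self.up3 = nn.Conv2d(512 + 256, 256, 3, padding=1)
        self.up2 = nn.Conv2d(256 + 128, 128, 3, padding=1)
        self.up1 = nn.Conv2d(128 + 64, 64, 3, padding=1)
        self.classifier = nn.Conv2d(64, num_classes, 1)

    def forward(self, x):
        h, w = x.shape[-2:]
        f0 = self.stem(x)        # stride 4
        f1 = self.layer1(f0)     # stride 4, 64 channels
        f2 = self.layer2(f1)     # stride 8, 128 channels
        f3 = self.layer3(f2)     # stride 16, 256 channels
        f4 = self.layer4(f3)     # stride 32, 512 channels
        d3 = F.relu(self.up3(torch.cat([F.interpolate(f4, size=f3.shape[-2:]), f3], dim=1)))
        d2 = F.relu(self.up2(torch.cat([F.interpolate(d3, size=f2.shape[-2:]), f2], dim=1)))
        d1 = F.relu(self.up1(torch.cat([F.interpolate(d2, size=f1.shape[-2:]), f1], dim=1)))
        logits = self.classifier(F.interpolate(d1, size=(h, w)))
        return logits.softmax(dim=1)  # pixel-wise class probabilities S


# Usage: S = SegCountNet()(torch.randn(1, 3, 384, 512))  -> shape (1, 2, 384, 512)
```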

4.2 The SSR Loss Function

With the proposed network, we can obtain the segmentation map $S \in \mathbb{R}^{H \times W \times c}$, where each element $S_{ic}$ denotes the probability that pixel $i \in H \times W$ belongs to category $c$ (for the pig counting task, there are only two categories, pig and background, hence $c \in \{0, 1\}$). $Y$ represents the ground-truth point annotation, which has label $c$ at the center location of each pig instance and zero elsewhere. The proposed SSR (Segmentation-Split-Regression) loss function is defined by combining three different components: the segmentation loss, the split loss, and the regression loss. These components work together to enable the model to accurately predict pig counts in images. Although the SSR loss does not recover the exact size and shape of the objects, it can flexibly output segmentation blobs for objects of different sizes and obtain the accurate location and count of the instances. The overall SSR loss function is defined by

$$L_{SSR}(S, Y) = L_{seg}(S, Y) + L_{split}(S, Y) + L_{reg}(S, Y) \tag{1}$$

Segmentation Loss. Guided by positive point annotations, $L_{seg}$ aims to correctly predict the semantic label of each pixel in the image. Let $y_{c=1}$ and $y_{c=0}$


denote the positive and negative ground-truth points derived from $Y$ respectively, which indicate the presence or absence of pig instances. The segmentation loss can then be defined as

$$L_{seg}(S, Y) = -\sum_{y_{c=1}} \log(S) - \sum_{y_{c=0}} \log(1 - S) \tag{2}$$
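A small sketch of this point-supervised term is given below. How the negative points $y_{c=0}$ are derived from the point annotation $Y$ is not spelled out above, so the sketch simply takes two index sets (positive pig-center pixels and negative background pixels) as inputs; it is an illustrative reconstruction of Eq. (2), not the authors' implementation.

```python
# Point-supervised segmentation loss: cross-entropy evaluated only at the
# annotated positive points and at a given set of negative points.
import torch


def segmentation_loss(S, pos_idx, neg_idx, eps: float = 1e-8):
    """S: (H, W, 2) softmax map, S[..., 1] is the pig probability.
    pos_idx / neg_idx: (N, 2) long tensors of (row, col) pixel coordinates."""
    pig_prob = S[..., 1]
    loss_pos = -torch.log(pig_prob[pos_idx[:, 0], pos_idx[:, 1]] + eps).sum()
    loss_neg = -torch.log(1.0 - pig_prob[neg_idx[:, 0], neg_idx[:, 1]] + eps).sum()
    return loss_pos + loss_neg
```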

Split Loss. Utilizing the segmentation loss alone would not suffice for the counting task since the segmented regions usually contain multiple instances. Therefore, it is necessary to split segmentation regions containing more than one point annotation to ensure that each blob corresponds to only one pig instance. The split loss penalizes blobs that contain multiple instances, forcing the model to separate them into distinct blobs during training. Specifically, a binary mask matrix $F$ is generated based on the segmentation map $S$, where $F_i = 1$ if $\arg\max_k S_{ik} > 0$ and $F_i = 0$ otherwise. The image regions in $F$ consisting of adjacent foreground pixels with the same value are called Connected Components [24]. By applying the connected components algorithm, we can find the connected areas in the foreground mask $F$ and assign each a unique identity to distinguish it from the others. Based on this, if a connected region contains $n$ labeled points, it has to be split into $n$ regions, ensuring that each region corresponds to one instance. In order to split a connected region containing multiple labeled points, we need to first find the boundaries between the instances and encourage the model to predict the boundaries as background. Specifically, we set the ground-truth annotations within each connected region as seeds and apply the watershed segmentation algorithm [21] to obtain $n$ segments and the set of boundary pixels $U$. Therefore, the loss derived from watershed segmentation can be computed as

$$L_{split}(S, Y) = -\sum_{i \in U} \alpha_i \log(S_{i0}) \tag{3}$$

where $U$ denotes the set of pixels representing the boundaries, $S_{i0}$ stands for the probability that pixel $i$ belongs to the background class, and $\alpha_i$ refers to the number of point annotations in the region where pixel $i$ resides.

Regression Loss. For the sake of accurate counting, a regression loss based on the Huber loss (i.e., smoothed L1 loss) is introduced to guide the model to directly output the correct number of instances. The count prediction, denoted as $\tilde{g}$, is obtained from the number of connected components derived from $F$, and the ground-truth count $g$ is obtained from $Y$. Hence, the regression loss is defined as the Huber loss between the count prediction $\tilde{g}$ and the ground-truth count $g$:

$$L_{reg} = \begin{cases} 0.5(\tilde{g} - g)^2 & \text{if } |\tilde{g} - g| < 1 \\ |\tilde{g} - g| - 0.5 & \text{otherwise} \end{cases} \tag{4}$$

The loss term is deduced by setting the parameter $\delta$ in the original Huber loss to 1, i.e., when the error between the true and predicted count is greater


than 1, it is minimized by L1 loss, which is less sensitive to outliers and can help avoid missing minimum values. When the error is less than 1, it is minimized by L2 loss, which provides a more stable gradient. Albeit simple, the incorporation of the regression loss helps to make the SSR loss function more robust to outliers and reduces the issue of over-prediction due to over-segmentation, where the segmented regions are too small and numerous. Hence, the model is encouraged to accurately predict the count of pig instances in the image, making the model more suitable for accurate counting tasks.
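The sketch below is a hedged reconstruction of the split loss (Eq. 3), the Huber regression loss (Eq. 4) and the total SSR loss (Eq. 1), using SciPy connected components and scikit-image watershed. The elevation map used for the watershed and the handling of the (non-differentiable) connected-component count are assumptions; the sketch mirrors the formulas above, not the authors' released training code. It reuses the `segmentation_loss` helper from the earlier sketch.

```python
# Split loss: watershed, seeded at annotated centers inside each connected
# component, marks boundary pixels (set U), which are pushed toward background.
# Regression loss: Huber loss between the blob count and the ground truth.
import numpy as np
import torch
from scipy import ndimage
from skimage.segmentation import watershed


def split_loss(S, points, eps: float = 1e-8):
    """S: (H, W, 2) softmax map (torch); points: list of (row, col) pig centers."""
    fg = (S.argmax(dim=-1) > 0).detach().cpu().numpy()    # foreground mask F
    blobs, n_blobs = ndimage.label(fg)                     # connected components
    elevation = -S[..., 1].detach().cpu().numpy()          # -P(pig) as elevation: an assumption
    loss = S.new_zeros(())
    for blob_id in range(1, n_blobs + 1):
        region = blobs == blob_id
        inside = [(r, c) for r, c in points if region[r, c]]
        if len(inside) < 2:                                # at most one instance: nothing to split
            continue
        markers = np.zeros_like(blobs)
        for k, (r, c) in enumerate(inside, start=1):
            markers[r, c] = k                              # seeds at annotated centers
        ws = watershed(elevation, markers, mask=region, watershed_line=True)
        boundary = region & (ws == 0)                      # watershed-line pixels -> set U
        rows, cols = np.nonzero(boundary)
        alpha = len(inside)                                # alpha_i: points in this region
        loss = loss - alpha * torch.log(S[torch.as_tensor(rows), torch.as_tensor(cols), 0] + eps).sum()
    return loss


def regression_loss(S, points):
    fg = (S.argmax(dim=-1) > 0).detach().cpu().numpy()
    _, pred_count = ndimage.label(fg)                      # predicted count = number of blobs
    err = abs(pred_count - len(points))
    return 0.5 * err ** 2 if err < 1 else err - 0.5        # Huber loss with delta = 1


def ssr_loss(S, points, pos_idx, neg_idx):
    # Eq. (1): total SSR loss (segmentation_loss is the sketch given earlier).
    return segmentation_loss(S, pos_idx, neg_idx) + split_loss(S, points) + regression_loss(S, points)
```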

5 Experiments

5.1 Implementation Details and Evaluation Setup

We evaluate the proposed method on the PCR dataset. To be specific, the overall 4844 images in PCR-PartA and PCR-PartB are mixed together. Among the 4844 images, 3874 images are randomly selected for the training set, 485 for the validation set, and the remaining 485 for testing, following a ratio of 0.8:0.1:0.1. As for image resolutions, we first resize all the images to 576×768 and then randomly crop a patch of 384×512 for training. All the experiments are conducted on a workstation with a 2.10 GHz Intel(R) Xeon(R) Silver 4216 CPU and one NVIDIA GeForce RTX 2080 GPU card. The network is trained with the Adam optimizer [7] for 100 epochs (β1 = 0.99, β2 = 0.999, weight decay 5e-4) with the initial learning rate set to 1e-5. The batch size is set to 1 since the watershed segmentation algorithm does not support parallel processing of multiple images. Random horizontal flipping (with a probability of 0.5), color jitter and random erasing are applied to augment the training images. In addition, normalization with mean (0.485, 0.456, 0.406) and standard deviation (0.229, 0.224, 0.225) is applied to both the training and testing images. HRNet [19] is chosen as the default encoder owing to its ability to maintain high-resolution spatial details. For evaluation metrics, we adopt the commonly used counting metrics Mean Absolute Error (MAE) and Root Mean Square Error (RMSE) to evaluate the performance of various methods. MAE indicates the accuracy of the counting algorithm, while RMSE measures the robustness of a method and represents the degree of dispersion of the differences. They are formally defined as:

$$MAE = \frac{1}{N}\sum_{i=1}^{N} |g_i - \tilde{g}_i| \tag{5}$$

$$RMSE = \sqrt{\frac{1}{N}\sum_{i=1}^{N} |g_i - \tilde{g}_i|^2} \tag{6}$$

where $N$ is the number of test images, and $g_i$ and $\tilde{g}_i$ are the ground-truth and predicted counts respectively.
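For reference, the two metrics in Eqs. (5)-(6) can be computed with a few lines of NumPy:

```python
# MAE and RMSE over the ground-truth and predicted counts of the test images.
import numpy as np


def counting_metrics(gt_counts, pred_counts):
    gt = np.asarray(gt_counts, dtype=float)
    pred = np.asarray(pred_counts, dtype=float)
    mae = np.mean(np.abs(gt - pred))
    rmse = np.sqrt(np.mean((gt - pred) ** 2))
    return mae, rmse


# e.g. counting_metrics([8, 12, 3], [7, 12, 5]) -> (1.0, ~1.29)
```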

5.2 Results of the Proposed Approach

Figure 3 presents the results of the proposed approach on the PCR dataset, where the predicted segmentation blobs in the third row are assigned random colors for better visualization. The heatmaps in the fourth row are obtained by rendering the magnitude of the blobs in red. From Fig. 3, it can be observed that superior counting accuracy and robustness are achieved in various situations, such as overlapping (first column), occlusions (second column), illumination changes (third column) and viewpoint influence (last column).

Fig. 3. Qualitative results of the proposed method on the PCR dataset. From top to bottom: testing images, ground truth, prediction masks and heatmaps. In the heatmap, red regions indicate a higher probability that the pixel belongs to a pig, and regions with lower probability are shown in blue. (Color figure online)

5.3 Comparisons Against State-of-the-Art

To verify the effectiveness of the proposed method, we conduct a comparison with other approaches on the PCR dataset in Table 1. To the best of our knowledge, PigCount [20] is so far the only work that performs pig counting with single-point annotation. The method in [4] needs to label five keypoints and [12] needs full pixel-level annotation, so comparisons with them would not be entirely fair. In addition, we compare with other object counting methods, including density map estimation-based approaches such as MCNN [27], and prevalent object detection approaches YOLO [15], RetinaNet [10] and Faster R-CNN [16]. Finally, a comparison with the representative segmentation-based method LCFCN [8] is conducted, which is also the baseline of the proposed method. From Table 1, the following observations can be made: (1) The proposed method outperforms [20] by a considerable margin, with an MAE lower than [20] by 1.38 and an RMSE lower by 1.82, displaying much better accuracy and robustness.


This can be attributed to the fact that [20] heavily depends on the quality of the ground-truth density map and requires manual tuning of Gaussian kernel parameters, which may not be optimal when the target scales vary dramatically, as in the case of pig counting. (2) Similarly, density map estimation-based crowd counting methods such as MCNN [27] also yield inferior performance in the pig counting task, where the size of the targets changes significantly, due to the intrinsic defects of density maps. (3) Object detection-based methods such as YOLO [15] also perform poorly in the highly congested scenes of pig counting, as they rely on bounding-box representations that are difficult to obtain when objects are heavily overlapped. Moreover, the commonly used Non-Maximum Suppression (NMS) strategy in object detection assigns a single label to overlapping objects, resulting in a large number of false negatives and a much lower predicted count. (4) The proposed method significantly and consistently improves the performance of LCFCN [8], with an MAE and RMSE lower by 0.63 and 0.71, respectively. This indicates that the improved SSR loss brings substantial performance improvements. It is worth noting that the encoder of LCFCN is also replaced with HRNet to ensure a fair comparison. The SSR loss helps circumvent the problem of overly high predicted counts caused by over-segmentation, leading to notable performance boosts.

Table 1. Comparison with state-of-the-art counting methods on the PCR dataset.

Category          | Method             | MAE  | RMSE
Density-map       | PigCount [20]      | 1.89 | 2.83
Density-map       | MCNN [27]          | 2.68 | 3.32
                  | CSRNet [9]         | 2.42 | 2.76
                  | SFCN [22]          | 2.12 | 2.50
                  | MSCCL [26]         | 1.86 | 2.04
Object detection  | YOLO [15]          | 1.64 | 2.37
                  | RetinaNet [10]     | 1.53 | 2.21
                  | Faster R-CNN [16]  | 2.23 | 4.27
Segmentation      | LCFCN [8]          | 1.14 | 1.72
                  | Ours               | 0.51 | 1.01

5.4 Ablation Analysis

To evaluate the contribution of each component in the proposed SSR loss, we separately add each part to the loss function and compare the performance in Table 2. We can see that training the model only with the segmentation loss Lseg results in deteriorated counting performance with a large MAE and RMSE. This stems from the fact that the segmentation loss alone may produce a single region containing multiple instances, which is inadequate for the counting task. When


the instance split loss Lsplit is introduced, the model is encouraged to predict blobs that contain no more than one point annotation, which contributes notably to the increase in counting accuracy. However, there still exist some false positive predictions, mainly caused by over-segmentation, and false negative results due to extremely heavy overlapping, leaving the predicted count deviating from the ground truth by a considerable margin. Therefore, the introduced regression loss Lreg leads to a further performance gain by penalizing large errors between the predicted and true counts. Overall, every component in the proposed SSR loss is essential, and they complement each other well. It is the joint effort of these components that makes the proposed loss function effective.

Table 2. Ablation study on the impact of different components in the SSR loss.

Lseg | Lsplit | Lreg | MAE  | RMSE
✓    |        |      | 3.28 | 4.23
✓    | ✓      |      | 0.97 | 1.63
✓    | ✓      | ✓    | 0.51 | 1.01

6 Conclusion

In this paper, we have introduced a comprehensive framework for pig counting in real-world scenarios. The largest image-based pig counting dataset (PCR) is collected and annotated, providing a valuable benchmark for pig counting research. The dataset contains 4844 images with 40,625 labeled pigs. Furthermore, we have proposed a strong pig counting baseline based on weakly-supervised instance segmentation using point supervision. The novel Segmentation-Split-Regression (SSR) loss function is developed to minimize the discrepancy between the predicted and ground-truth pig counts, while also ensuring accurate pixel-wise segmentation and blob splitting. Extensive evaluations have been conducted to validate the effectiveness of the proposed approach. We believe the benchmark dataset and proposed approach can contribute to the advancement of pig counting research. In the future, we will try to improve our network with better loss functions and data augmentation methods. Adapting this approach to count other animal groups for agriculture and wildlife protection is another avenue to investigate. Acknowledgment. This work was supported by the National Natural Science Foundation of China (62106133).


References

1. Akiva, P., Dana, K.J., Oudemans, P.V., Mars, M.: Finding berries: segmentation and counting of cranberries using point supervision and shape priors. In: CVPR Workshops, pp. 219–228 (2020)
2. Arteta, C., Lempitsky, V., Zisserman, A.: Counting in the wild. In: Leibe, B., Matas, J., Sebe, N., Welling, M. (eds.) ECCV 2016. LNCS, vol. 9911, pp. 483–498. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-46478-7_30
3. Chattopadhyay, P., Vedantam, R., Selvaraju, R.R., Batra, D., Parikh, D.: Counting everyday objects in everyday scenes. In: CVPR, pp. 4428–4437 (2017)
4. Chen, G., Shen, S., Wen, L., Luo, S., Bo, L.: Efficient pig counting in crowds with keypoints tracking and spatial-aware temporal response filtering. In: ICRA, pp. 10052–10058 (2020)
5. Chowdhury, P.N., Shivakumara, P., Nandanwar, L., Samiron, F., Pal, U., Lu, T.: Oil palm tree counting in drone images. Pattern Recognit. Lett. 153, 1–9 (2022)
6. He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: CVPR, pp. 770–778 (2016)
7. Kingma, D.P., Ba, J.: Adam: a method for stochastic optimization. In: ICLR (2015)
8. Laradji, I.H., Rostamzadeh, N., Pinheiro, P.O., Vazquez, D., Schmidt, M.: Where are the blobs: counting by localization with point supervision. In: Ferrari, V., Hebert, M., Sminchisescu, C., Weiss, Y. (eds.) ECCV 2018. LNCS, vol. 11206, pp. 560–576. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-01216-8_34
9. Li, Y., Zhang, X., Chen, D.: CSRNet: dilated convolutional neural networks for understanding the highly congested scenes. In: CVPR, pp. 1091–1100 (2018)
10. Lin, T.Y., Goyal, P., Girshick, R.B., He, K., Dollár, P.: Focal loss for dense object detection. IEEE Trans. Pattern Anal. Mach. Intell. 42, 318–327 (2020)
11. Lin, W., Chan, A.B.: Optimal transport minimization: crowd localization on density maps for semi-supervised counting. In: CVPR, pp. 21663–21673 (2023)
12. Liu, C., Su, J., Wang, L., Lu, S., Li, L.: LA-DeepLab V3+: a novel counting network for pigs. Agriculture 12(2), 284 (2022)
13. Naudé, J.J., Joubert, D.: The aerial elephant dataset: a new public benchmark for aerial object detection. In: CVPR Workshops (2019)
14. Rao, J., Xu, T., Song, X., Feng, Z., Wu, X.: KITPose: keypoint-interactive transformer for animal pose estimation. In: Yu, S., et al. (eds.) Pattern Recognition and Computer Vision. PRCV 2022. LNCS, vol. 13534. Springer, Cham. https://doi.org/10.1007/978-3-031-18907-4_51
15. Redmon, J., Divvala, S.K., Girshick, R.B., Farhadi, A.: You only look once: unified, real-time object detection. In: CVPR, pp. 779–788 (2016)
16. Ren, S., He, K., Girshick, R.B., Sun, J.: Faster R-CNN: towards real-time object detection with region proposal networks. IEEE Trans. Pattern Anal. Mach. Intell. 39, 1137–1149 (2015)
17. Rey, N., Volpi, M., Joost, S., Tuia, D.: Detecting animals in African savanna with UAVs and the crowds. Remote Sens. Environ. 200, 341–351 (2017)
18. Shao, W., Kawakami, R., Yoshihashi, R., You, S., Kawase, H., Naemura, T.: Cattle detection and counting in UAV images based on convolutional neural networks. Int. J. Remote Sens. 41, 31–52 (2020)
19. Sun, K., Xiao, B., Liu, D., Wang, J.: Deep high-resolution representation learning for human pose estimation. In: CVPR, pp. 5686–5696 (2019)


20. Tian, M., Guo, H., Chen, H., Wang, Q., Long, C., Ma, Y.: Automated pig counting using deep learning. Comput. Electron. Agric. 163, 104840 (2019)
21. Vincent, L.M., Soille, P.: Watersheds in digital spaces: an efficient algorithm based on immersion simulations. IEEE Trans. Pattern Anal. Mach. Intell. 13, 583–598 (1991)
22. Wang, Q., Gao, J., Lin, W., Yuan, Y.: Learning from synthetic data for crowd counting in the wild. In: CVPR, pp. 8190–8199 (2019)
23. Wu, F., et al.: Detection and counting of banana bunches by integrating deep learning and classic image-processing algorithms. Comput. Electron. Agric. 209, 107827 (2023)
24. Wu, K., Otoo, E.J., Shoshani, A.: Optimizing connected component labeling algorithms. In: SPIE Medical Imaging (2005)
25. Wu, Z., Fuller, N.W., Theriault, D.H., Betke, M.: A thermal infrared video benchmark for visual analysis. In: CVPR Workshops, pp. 201–208 (2014)
26. Zand, M., Damirchi, H., Farley, A., Molahasani, M., Greenspan, M., Etemad, A.: Multiscale crowd counting and localization by multitask point supervision. In: ICASSP (2022)
27. Zhang, Y., Zhou, D., Chen, S., Gao, S., Ma, Y.: Single-image crowd counting via multi-column convolutional neural network. In: CVPR, pp. 589–597 (2016)
28. Zhou, B., Wang, S., Xiao, S.: Double recursive sparse self-attention based crowd counting in the cluttered background. In: PRCV (2022)
29. Zhu, P., Peng, T., Du, D., Yu, H., Zhang, L., Hu, Q.: Graph regularized flow attention network for video animal counting from drones. IEEE Trans. Image Process. 30, 5339–5351 (2021)

A Hierarchical Theme Recognition Model for Sandplay Therapy

Xiaokun Feng1,2, Shiyu Hu1,2, Xiaotang Chen1,2,3, and Kaiqi Huang1,2,3(B)

1 School of Artificial Intelligence, University of Chinese Academy of Sciences, Beijing, China
{fengxiaokun2022,hushiyu2019}@ia.ac.cn
2 Institute of Automation, Chinese Academy of Sciences, Beijing, China
3 Center for Excellence in Brain Science and Intelligence Technology, Chinese Academy of Sciences, Beijing, China
{xtchen,kaiqi.huang}@nlpr.ia.ac.cn

Abstract. Sandplay therapy functions as a pivotal tool for psychological projection, where testers construct a scene to mirror their inner world while psychoanalysts scrutinize the testers’ psychological state. In this process, recognizing the theme (i.e., identifying the content and emotional tone) of a sandplay image is a vital step in facilitating higher-level analysis. Unlike traditional visual recognition that focuses solely on basic information (e.g., category, location, shape, etc.), sandplay theme recognition needs to consider the overall content of the image and then rely on a hierarchical knowledge structure to complete the reasoning process. Nevertheless, research on sandplay theme recognition is hindered by the following challenges: (1) Gathering enough high-quality sandplay images paired with expert analyses to form a scientific dataset is difficult, because this task relies on a specialized sandplay environment. (2) The theme is a comprehensive, high-level piece of information, making it difficult to adopt existing works directly for this task. In summary, we have tackled the above challenges from the following aspects: (1) Based on a careful analysis of the challenges (e.g., small-scale dataset and complex information), we present the HIST (HIerarchical Sandplay Theme recognition) model, which incorporates external knowledge to emulate the psychoanalysts’ reasoning process. (2) Taking the split theme (a representative and evenly distributed theme) as an example, we propose a high-quality dataset called SP2 (SandPlay SPlit) to evaluate our proposed method. Experimental results demonstrate the superior performance of our algorithm compared to other baselines, and ablation experiments confirm the importance of incorporating external knowledge. We anticipate this work will contribute to research on sandplay theme recognition. The relevant datasets and code will be released continuously.

Keywords: Sandplay therapy · Sandplay theme recognition · Visual recognition

Supplementary Information The online version contains supplementary material available at https://doi.org/10.1007/978-981-99-8462-6_20.

1 Introduction

Sandplay therapy functions as a pivotal tool for psychological projection, where individuals construct a scene to mirror their inner world while psychoanalysts scrutinize the individual’s psychological state [1,2]. In this process, recognizing the theme (i.e., identifying the content and emotional tone) of a sandplay image is an important step in facilitating higher-level analysis. As shown in Fig. 1, generating a sandplay image and recognizing its theme involve several steps – a client first constructs a sandplay work based on inner thoughts, and the psychoanalyst then analyzes the theme by synthesizing basic semantic information and high-level semantic information (e.g., psychological knowledge).

Although identifying the theme of a sandplay image is an important and valuable problem, current visual recognition algorithms cannot adapt well to this task. Early visual recognition studies mainly focus on identifying basic semantic information in images, such as category [3], location [4], shape [5], etc. Recently, several researchers have shifted their focus to recognizing high-level visual information such as emotions [6,7]. The emotion recognition task enables computer systems to process and comprehend emotional information conveyed by humans, and facilitates natural human-computer interaction [8]. However, unlike existing visual recognition tasks that focus solely on basic semantic information or emotion, the sandplay theme recognition task is proposed for a more challenging and specialized application scenario. It aims to identify the content and emotional tone expressed in a sandplay image, which can support higher-level tasks such as understanding the psychological state of the creator, much as psychologists do. Therefore, a well-designed sandplay theme recognition method should execute the above analysis process like an expert: it should first consider the overall content of the image, then rely on a hierarchical structure to utilize various kinds of knowledge (e.g., basic semantic information, emotion information, and even external knowledge from psychologists), and finally complete the reasoning process.

Nevertheless, two challenges hinder the design of a suitable model for the sandplay theme recognition task: (1) Obtaining enough high-quality data samples is difficult. ❶ Unlike general-scene visual recognition tasks that can easily collect numerous data samples from the Internet [6,9], the sandplay theme recognition task relies on a specialized sandplay environment. ❷ Besides, existing research commonly annotates data samples through crowd-sourcing platforms like Amazon Mechanical Turk, whereas the annotation of sandplay images should be provided by psychology experts. Thus, gathering enough high-quality sandplay images paired with expert analyses to form a scientific dataset is challenging. (2) A sandplay image’s theme information is complex, intensifying the difficulty of recognition. Existing works have focused on recognizing basic semantics or emotions, but they lack a hierarchical framework for comprehensive understanding, making them difficult to adopt directly in the sandplay theme recognition task.


Fig. 1. Framework of the sandplay theme recognition process on the 3D electronic sandplay platform. First, the client constructs a sandplay scene based on inner thoughts. Then, the psychoanalyst analyzes the sandplay themes by synthesizing basic semantic information and high-level semantic information (i.e., psychological knowledge).

In this paper, we address the difficulties mentioned above and carry out our work, aiming to accomplish the challenging sandplay theme recognition task: (1) Design a hierarchical theme recognition model. Based on carefully analyzing the challenges of the sandplay theme recognition task (e.g., small-scale datasets and complex information), we present a recognition model named HIST (HIerarchical Sandplay Theme recognition) that incorporates external knowledge. In light of the analysis process of psychologists, our proposed model comprises two fundamental steps. Firstly, we focus on perceiving the basic semantics of the image, which specifically refers to extracting the categorical information of the objects present in the image. Secondly, based on the perceived categorical information, we incorporate the external knowledge by indexing the corresponding high-level attribute information. Then, our model recognizes the sandplay theme by leveraging the above information. Specifically, to alleviate the effects of the small-scale dataset caused by the characteristics of the sandplay scenario, we also employ several training strategies to enhance the learning capacity of the model. (2) Propose a high-quality dataset to train and evaluate the proposed HIST model. ❶ We take psychological sandplay as our experimental environment and collect data samples from a 3D electronic platform (Fig. 1). Specifically, we invite a substantial number of testers to participate in the sandplay test and collect data samples following the sandplay analysis process, which ensures the professionalism and scientific validity of the data samples. According to statistics, each sandplay sample contains an average of 15 sand objects (selected from 494 sand categories), which reflects the diversity of the sandplay samples. After screening and sorting, we collect 5,000 samples. ❷ We engage professional psychoanalysts to annotate the theme of each sandplay sample. Without affecting the integrity of the sandplay analysis


process, we select one of the representative themes, namely the split theme, as the object of our psychological recognition (Fig. 1). The definition of the split theme refers to a state of isolation and separation between the various parts of the whole sandplay scene (split samples are shown in Fig. 3). It reflects the inner integration of the tester and is related to many emotional and personality issues. Finally, we construct a dataset with split theme annotations, denoted as SP2 (SandPlay SPlit). Statistical analysis reveals that the acquisition of each sandplay sample demands an average of 10 min, which reflects the expense and time consumption of obtaining it. In general, our contributions are as follows:

– We propose a hierarchical sandplay theme recognition model (i.e., HIST) that incorporates external knowledge. By modeling the process of perceiving the basic semantics of images and incorporating corresponding high-level attribute knowledge, our model emulates the analytical ability of psychologists (Sect. 3).

– Based on the 3D electronic sandplay platform, we construct a dataset named SP2. All the sandplay samples are carefully collected and annotated, aiming to provide a high-quality dataset for recognizing themes in the sandplay environment (Sect. 4). Besides, experimental results indicate the excellent performance of our model, and the ablation experiment demonstrates the importance of introducing external knowledge (Sect. 5).

We anticipate this work will contribute to research on sandplay theme recognition. The relevant datasets and code will be released continuously.

2 Related Work

The recognition of sandplay themes can be regarded as an image recognition task, wherein the sandplay image serves as the input data and the sandplay theme represents the output. In this section, we first introduce existing image recognition tasks to highlight the characteristics of the sandplay theme recognition task (Sect. 2.1). Then we introduce the relevant background knowledge in the field of psychological sandplay (Sect. 2.2).

2.1 Image Recognition Tasks

Recognizing semantic information from image data is a fundamental research problem in the field of computer vision. Various research tasks aim to model different aspects of human abilities by focusing on different semantic information. For instance, basic image classification [3] tasks aim to emulate human visual perception abilities, while image emotion recognition tasks [9,10] aim to capture human emotional cognitive abilities. Although these tasks center around different forms of semantic information, their research [9–11] typically utilizes data collected from the Internet (e.g., TV shows, social networks, etc.) and relies on crowd-sourcing platforms (like Amazon Mechanical Turk) for semantic label


annotations. Consequently, the modeling object of these tasks is primarily centered around ordinary people. In the context of sandplay theme recognition, the data samples come from a specialized sandplay environment, and the annotation of sandplay themes requires the expertise of psychologists, who employ hierarchical cognitive analysis. Through the sandplay theme recognition task, we can explore modeling the hierarchical analytical capabilities of psychoanalysts.

2.2 Projection Test and Sandplay Therapy

The projective test [12] is a well-known and widely used psychoanalytic test in which clients offer responses to ambiguous stimuli (e.g., words, images, etc.), so as to reveal the hidden conflicts or emotions that they project onto the stimuli. Deeply held feelings and motivations are often not verbalized or even consciously recognized by the client, yet these subjective psychological semantics can be detected and analyzed through their projection onto the stimuli. The visual stimulus is the main carrier of the projective test, as in the Rorschach inkblot image [13], the house-tree-person painting and the sandplay image [1]. The image carrier of the inkblot test consists of only 10 inkblot pictures, and the painting in the house-tree-person test can only contain 3 elements (i.e., house, tree, person). On the other hand, sandplay therapy usually involves hundreds of miniature figures, which can be combined and placed in any way. This high degree of interactivity ensures a diverse range of sandplay samples, making it an ideal research scene for our work. However, despite the potential of sandplay to generate numerous data samples, a large-scale sandplay dataset is currently unavailable. For the purpose of using existing data-driven deep neural network models, we invite a substantial number of testers to collect samples. As a result, we build a sandplay dataset consisting of 5,000 diverse samples. Furthermore, according to the sandplay theme theory developed by Mitchell and Friedman [2], around 20 themes have been identified to encode psychological states (see A.1). This work takes the split theme as the research label, mainly for the following two reasons: (1) the positive and negative sample sizes of the split theme are relatively balanced (see A.3), which is advantageous for model training; (2) the split theme exhibits connections with various psychological symptoms, including some common emotional issues (e.g., depression, negative study experiences) and personality problems (e.g., compulsion, paranoia, marginalization, aggression), which reflects the practical application value of sandplay. Annotation and research on other themes are left for future work.

3 Method

Based on the preceding analysis, the sandplay theme recognition task encounters two distinct challenges: small-scale datasets and the intricate processing of complex information (i.e., sandplay themes). To address them, we propose the HIST model, whose framework is shown in Fig. 2. Corresponding to the hierarchical analysis process of psychologists, we model three hierarchical processes: the perception of


Fig. 2. Overview of the HIST model. Corresponding to the hierarchical analysis process of psychoanalysts, this model consists of a basic semantic perception module, an external knowledge incorporation module and a theme classification module.

basic semantics (Sect. 3.1), the incorporation of external knowledge (Sect. 3.2), and the final theme classification (Sect. 3.3).

3.1 Basic Semantic Perception Module

Perceiving basic semantic information (such as a sand object's category, bounding box, etc.) from images is a crucial step for recognizing the theme information. In this process, our model focuses on the category information of the sand objects within the image, and we employ existing multi-label classification [14] techniques to handle this task. Firstly, we adopt the common ResNet [15] backbone as the image encoder. Given the image $I \in \mathbb{R}^{3 \times 960 \times 540}$, we feed it into the image encoder and obtain the corresponding feature vector $F_i \in \mathbb{R}^{1 \times L}$, where the length of the feature vector is $L = 1{,}000$. To alleviate the limitations imposed by the small-scale sandplay dataset, we utilize image augmentation techniques to expand the dataset's size. Additionally, we leverage a pre-trained model to extract the initial image features (see Sect. 5.1). Then, $F_i$ is fed into fully connected layers (accompanied by the ReLU activation function) to obtain the multi-label classification feature vector $F_{cl} \in \mathbb{R}^{1 \times N_{cl}}$. Here, $N_{cl}$ is the total number of sand-object categories, and $N_{cl} = 494$ in our sandplay environment. Finally, we perform multi-label classification based on $F_{cl}$. We apply the Sigmoid activation function on $F_{cl}$ to obtain $F_{clp}$. Each element $f^i_{clp}$ in $F_{clp}$ represents the probability of the model classifying the $i$th sand object. We construct the classification error $L_{cl}$ based on the true labels $Y_c = \{y^1, y^2, ..., y^{N_{cl}}\}$:

$$L_{cl} = -\frac{1}{N_{cl}} \sum_{i=1}^{N_{cl}} \left[ p_i\, y^i \log f^i_{clp} + (1 - y^i)\log(1 - f^i_{clp}) \right], \quad p_i = \frac{neg_i}{pos_i} \tag{1}$$

where $p_i$ is the weight of the $i$th object. In order to reduce the long-tail distribution problem [16] between different objects, we define $p_i$ as the ratio between the $i$th object's negative sample number $neg_i$ and positive sample number $pos_i$.
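A minimal PyTorch sketch of this module is given below: a pretrained ResNet-50 encoder, a fully connected multi-label head over the 494 sand-object categories, and the weighted binary cross-entropy of Eq. (1). The exact head architecture is an assumption.

```python
# Basic semantic perception: image encoder + sigmoid multi-label head,
# trained with the class-weighted BCE of Eq. (1).
import torch
import torch.nn as nn
from torchvision.models import resnet50


class SemanticPerception(nn.Module):
    def __init__(self, n_classes: int = 494, feat_dim: int = 1000):
        super().__init__()
        self.encoder = resnet50(weights="IMAGENET1K_V1")     # F_i in R^{1 x 1000}
        self.head = nn.Sequential(nn.Linear(feat_dim, feat_dim), nn.ReLU(),
                                  nn.Linear(feat_dim, n_classes))

    def forward(self, image):
        f_i = self.encoder(image)          # image feature F_i
        f_cl = self.head(f_i)              # multi-label logits F_cl
        f_clp = torch.sigmoid(f_cl)        # per-object probabilities F_clp
        return f_i, f_clp


def weighted_multilabel_bce(f_clp, y, pos_count, neg_count, eps: float = 1e-8):
    """Eq. (1): p_i = neg_i / pos_i weights the positive term of each object."""
    p = neg_count.float() / pos_count.float().clamp(min=1)
    loss = -(p * y * torch.log(f_clp + eps) + (1 - y) * torch.log(1 - f_clp + eps))
    return loss.mean(dim=-1).mean()
```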

3.2 External Knowledge Incorporation Module

For the sandplay theme recognition task, we consider the psychological attributes of each sand object as external knowledge. According to sandplay therapy, the number of these psychological attributes (denoted as $L_h$) is fixed, which means that the psychological attributes of each object can be represented by a one-dimensional vector $k_h \in \mathbb{R}^{1 \times L_h}$. With $N_{cl}$ objects, we can utilize a feature matrix $K_h \in \mathbb{R}^{N_{cl} \times L_h}$ to encode the external knowledge. We set $L_h = 7$ (see Sect. 4.2) in subsequent experiments. We employ object category indexing to incorporate the external knowledge $K_h$ into our model. Leveraging $F_{clp}$, we utilize the probability of each object's existence, $f^i_{clp}$, to weight the corresponding high-level semantic vector $k^i_h$. This process results in the construction of the matrix $K_w \in \mathbb{R}^{N_{cl} \times L_h}$:

$$k^i_w = f^i_{clp} \times k^i_h \tag{2}$$

Then, in order to comprehensively analyze the weighted high-level semantic information, we employ self-attention operations to achieve high-level feature extraction. Specifically, we use a transformer encoder [17] module to process it and obtain $F_{we} \in \mathbb{R}^{1 \times N_{cl}}$, which we consider as the integrated external knowledge feature.
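A hedged sketch of this module: the fixed attribute matrix $K_h$ ($N_{cl} \times L_h$) is weighted by the predicted existence probabilities (Eq. 2) and fused with a Transformer encoder. The encoder depth, number of heads, and the final projection to $\mathbb{R}^{1 \times N_{cl}}$ are assumptions.

```python
# External knowledge incorporation: probability-weighted attribute vectors
# fused by self-attention, then projected to the knowledge feature F_we.
import torch
import torch.nn as nn


class KnowledgeIncorporation(nn.Module):
    def __init__(self, attr_matrix, n_heads: int = 1, n_layers: int = 2):
        super().__init__()
        n_classes, attr_dim = attr_matrix.shape            # e.g. 494 x 7
        self.register_buffer("K_h", attr_matrix.float())   # external knowledge, not trained
        layer = nn.TransformerEncoderLayer(d_model=attr_dim, nhead=n_heads, batch_first=True)
        self.encoder = nn.TransformerEncoder(layer, num_layers=n_layers)
        self.proj = nn.Linear(n_classes * attr_dim, n_classes)

    def forward(self, f_clp):
        # Eq. (2): k_w^i = f_clp^i * k_h^i, broadcast over the batch.
        k_w = f_clp.unsqueeze(-1) * self.K_h               # (B, N_cl, L_h)
        fused = self.encoder(k_w)                          # self-attention over objects
        return self.proj(fused.flatten(1))                 # F_we in R^{B x N_cl}
```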

3.3 Theme Classification Module

Building upon $F_{we}$ and $F_i$, we can integrate the basic semantic information and the external knowledge information of the objects. Firstly, we employ fully connected layers (accompanied by the ReLU activation function) to further process $F_i$, obtaining $F_i'$. Next, we concatenate $F_i'$ with $F_{we}$, resulting in the final feature vector, denoted as $F_u$:

$$F_u = \mathrm{concat}\{F_{we}, \mathrm{ReLU}(\mathrm{FC}(F_i))\} \tag{3}$$

Finally, we feed $F_u$ into fully connected layers (accompanied by the ReLU activation function) to obtain the theme category vector $F_t \in \mathbb{R}^{1 \times 2}$ (i.e., recognition of a specific sandplay theme can be regarded as a binary classification task). Based on the ground-truth labels $Y_t$, we construct the classification error $L_t$ using the CrossEntropy loss function. Based on $L_{cl}$ and $L_t$, we compute the final error $L_a$ with a relative weight coefficient $w_{cl}$; $L_a$ is used for backpropagation:

$$L_a = L_t + w_{cl} \times L_{cl} \tag{4}$$
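A small sketch of the classification head and the total objective (Eqs. 3-4) follows; the hidden layer sizes are assumptions, while the weight $w_{cl} = 2$ matches the setting reported in the experiments.

```python
# Theme classification: concatenate the knowledge feature with the processed
# image feature, classify into two theme classes, and combine the losses.
import torch
import torch.nn as nn


class ThemeClassifier(nn.Module):
    def __init__(self, img_dim: int = 1000, know_dim: int = 494, n_themes: int = 2):
        super().__init__()
        self.img_branch = nn.Sequential(nn.Linear(img_dim, know_dim), nn.ReLU())
        self.cls = nn.Sequential(nn.Linear(2 * know_dim, know_dim), nn.ReLU(),
                                 nn.Linear(know_dim, n_themes))

    def forward(self, f_i, f_we):
        f_u = torch.cat([f_we, self.img_branch(f_i)], dim=-1)   # Eq. (3)
        return self.cls(f_u)                                     # theme logits F_t


def total_loss(theme_logits, theme_labels, l_cl, w_cl: float = 2.0):
    l_t = nn.functional.cross_entropy(theme_logits, theme_labels)
    return l_t + w_cl * l_cl                                     # Eq. (4)
```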

4 SP2 Dataset

Considering the challenge of gathering high-quality sandplay images paired with theme annotations from psychoanalysts, we take the representative split theme as an example and construct the SP2 dataset. In this section, we first present the process of dataset construction (Sect. 4.1). Then, we present the psychological attribute information of each sand object (Sect. 4.2), which can be considered as the external knowledge employed by psychoanalysts to achieve hierarchical analysis.


Fig. 3. Examples from the SP2 dataset. Panel A represents sandplay samples with the split theme, accompanied by explanatory descriptions for the judgment (bold font indicates the concerned sand objects during judgment); Panel B represents sandplay samples without the split theme.

4.1 Dataset Construction

Sampling of Sandplay Samples. We invited a substantial number of testers to participate in the sandplay test and obtained the corresponding sandplay samples. In order to ensure authenticity, we only collected one sandplay sample from each tester. In the end, we sorted out 5,000 sandplay samples.

Labeling of Sandplay Samples. For each sandplay sample, we provide the basic semantic labels (i.e., the name and bounding box of each sand object) and the high-level sandplay theme label (i.e., the split label). For the former, with the help of the 3D electronic sandplay platform, we can obtain them directly from the terminal; for the latter, we engaged psychoanalysts to discern the split theme through binary classification in order to ensure the objectivity of the split label. Initially, we selected 200 samples and each sample was labeled by five psychoanalysts. Through discussions and deliberations, we established consistent criteria for the recognition of the split theme. Then, in order to improve annotation efficiency, each sample in the remaining data was labeled by one psychoanalyst.

Format of Sandplay Samples. A sandplay sample consists of a global image $I \in \mathbb{R}^{3 \times 960 \times 540}$, a set of basic semantic labels, and a split label. In addition, we conducted statistics on the number of positive and negative samples of the split label (a positive sample means the sandplay has the split theme). In SP2, there are 2,303 positive samples and 2,697 negative samples, which indicates that the numbers of positive and negative samples are relatively balanced.

4.2 Psychological Attributes of Sand Objects

During the analysis of sandplay images (as depicted in Fig. 1), psychoanalysts integrate the psychological attributes of each sand object to make their final


assessments. For instance, a boy represents a positive object, whereas a cannon represents a negative object. These psychological attributes associated with the sand objects can be considered as external knowledge. Drawing from the principles of sandplay analysis, we have organized 7 key psychological attributes for each sand object, including polarity, life attribute, spiritual/material attribute, static/dynamic attribute, etc. For more details, please refer to A.2.

5 Experiments

Table 1. Comparison with the state-of-the-art models on the SP2 dataset. The best and second-best results are marked in bold and underline.

Model            | Acc   | F1
AlexNet          | 0.717 | 0.676
VGG              | 0.727 | 0.707
ResNet           | 0.742 | 0.727
ViT              | 0.731 | 0.712
MldrNet          | 0.723 | 0.710
Zhang et al. [18] | 0.729 | 0.707
BiGRU            | 0.734 | 0.712
PadNet           | 0.745 | 0.722
HIST (Ours)      | 0.790 | 0.765

Table 2. Results of ablation experiments. The best and second-best results are marked in bold and underline.

Variants     | Acc   | F1
ViT-based    | 0.761 | 0.745
Without-Lcl  | 0.764 | 0.750
Without-Kh   | 0.747 | 0.723
HIST (Ours)  | 0.790 | 0.765

In this section, we conduct experiments on the SP2 dataset for evaluation.

5.1 Experimental Setup

Implementation Details. We employ ResNet-50 [15] as the visual encoder and initialize it with model weights pretrained on ImageNet [11]. We first reshape the input sandplay image $I \in \mathbb{R}^{3 \times 960 \times 540}$ into $I' \in \mathbb{R}^{3 \times 224 \times 224}$ to match the input of the model. We use basic image rotation and flipping operations for data augmentation. The batch size is set to 64, and the learning rate is set to 1e-3. We use the SGD optimizer, and the relative weight coefficient $w_{cl}$ is set to 2.

Datasets and Metrics. The SP2 dataset is divided into training, testing and validation sets in an 8:1:1 ratio (see A.3 for detailed information). We evaluate the performance of different models with the Accuracy and F1 metrics.
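The stated preprocessing and optimizer setup can be sketched as follows; torchvision is assumed, and any value not given above (e.g., the rotation range) is a placeholder.

```python
# Resize to 224x224, apply basic rotation/flipping augmentation, and use SGD
# with the stated learning rate of 1e-3.
import torch
from torchvision import transforms

train_transform = transforms.Compose([
    transforms.Resize((224, 224)),            # reshape I (960x540) to 224x224
    transforms.RandomHorizontalFlip(),
    transforms.RandomRotation(degrees=15),    # rotation range is a placeholder
    transforms.ToTensor(),
])


def make_optimizer(model):
    # Learning rate 1e-3 as stated; other SGD hyper-parameters are assumptions.
    return torch.optim.SGD(model.parameters(), lr=1e-3)
```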

5.2 Comparison with the State-of-the-Art

Following two recent authoritative works [6,18], we compare our model with representative state-of-the-art image emotion recognition methods that are publicly available and adaptable to sandplay images, including BiGRU [19], MldrNet [20], Zhang et al. [18] and PadNet [21]. Additionally, we compare with some classic image classification models, including AlexNet [22], VGG [23], ResNet [15], and ViT [17]. When training our proposed model, two types of supervised information (sand object category and split label) are used. For the sake of fairness, we therefore provide these two types of supervised information in a multi-task format when training the baseline models. Specifically, a sand object classification head is added to the final output layer of each model, and the model is trained using both the object classification loss and the split classification loss. Based on the results presented in Table 1, it is evident that the performance of our proposed model surpasses that of the other models. The essential distinction between our model and the others lies in the incorporation of external knowledge, highlighting the necessity of external knowledge for sandplay theme recognition.

5.3 Ablation Study

Effect of Visual Backbone. The visual backbone in our model is used to encode the image. In addition to ResNet [15], ViT [17] serves as another popular choice of backbone network. Hence, we conduct experiments to evaluate the performance of our model using the ViT backbone network (ViT-based). The experimental results shown in Table 2 indicate that the ResNet-based network outperforms ViT (which is consistent with the experimental results in Table 1). This discrepancy may be attributed to the dataset size, as prior studies [24] have shown that ViT is advantageous on large-scale datasets, while the SP2 dataset is relatively small.

Effect of Semantic Information. Our proposed model aims to emulate the reasoning process of psychoanalysts by considering both basic semantic information and high-level semantic information. To evaluate the effects of different semantic inputs, we conduct experiments under two settings. Firstly, we evaluate the model's performance when the sand object category information is not provided, which means setting $L_{cl}$ to 0 (Without-Lcl). Secondly, we evaluate the model's performance when external knowledge is not incorporated, which is achieved by masking $K_h$ with ones (Without-Kh). The experimental results shown in Table 2 indicate that the lack of category information or external knowledge reduces the performance of the model. Moreover, the lack of external knowledge results in more severe performance degradation. This result once again reflects the importance of incorporating external knowledge.
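The two ablation switches described above can be expressed as a small configuration sketch. The attribute path `model.knowledge.K_h` is hypothetical naming, not the authors' code structure.

```python
# Ablation switches: "Without-Lcl" removes the object-classification term from
# the total loss, and "Without-Kh" replaces the external knowledge matrix with
# all-ones so that no attribute information is injected.
import torch


def ablate(model, w_cl: float, variant: str):
    if variant == "Without-Lcl":
        w_cl = 0.0                        # drop L_cl from Eq. (4)
    elif variant == "Without-Kh":
        model.knowledge.K_h.fill_(1.0)    # hypothetical buffer path; mask attributes with ones
    return model, w_cl
```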

6 Conclusion and Future Work

Building on sandplay therapy, the sandplay theme recognition task relies on sandplay images and the corresponding theme annotations provided by psychoanalysts. This task offers an opportunity to explore the modeling of psychoanalysts' hierarchical analysis capabilities. Inspired by the analysis process of psychoanalysts, we propose the HIST model. To facilitate the sandplay theme recognition task, we construct the SP2 dataset, focusing on the split theme. Our proposed model outperforms existing baseline models on the SP2 dataset, and the ablation experiments demonstrate the significance of incorporating external knowledge. In this work, we leverage a sandplay-based research environment to highlight the significance of external knowledge in the assessment of the sandplay theme, which represents high-level psychological semantics. Moving forward, we intend to explore incorporating external knowledge into more general scenarios, such as emotion recognition tasks for common images. By incorporating external knowledge into these scenarios, we aim to enhance the machine's recognition capabilities and facilitate more natural human-computer interaction.

References 1. Roesler, C.: Sandplay therapy: an overview of theory, applications and evidence base. Arts Psychother. 64, 84–94 (2019) 2. Mitchell, R.R., Friedman, H.S.: Sandplay: Past, Present, and Future. Psychology Press (1994) 3. Lu, D., Weng, Q.: A survey of image classification methods and techniques for improving classification performance. Int. J. Remote Sens. 28(5), 823–870 (2007) 4. Zou, Z., Chen, K., Shi, Z., Guo, Y., Ye, J.: Object detection in 20 years: a survey. In: Proceedings of the IEEE (2023) 5. Guo, Y., Liu, Y., Georgiou, T., Lew, M.S.: A review of semantic segmentation using deep neural networks. Int. J. Multimed. Inf. Retrieval 7, 87–93 (2018) 6. Zhao, S., et al.: Affective image content analysis: two decades review and new perspectives. IEEE Trans. Pattern Anal. Mach. Intell. 44(10), 6729–6751 (2021) 7. Tao, J., Tan, T.: Affective computing: a review. In: Tao, J., Tan, T., Picard, R.W. (eds.) ACII 2005. LNCS, vol. 3784, pp. 981–995. Springer, Heidelberg (2005). https://doi.org/10.1007/11573548 125 8. Picard, R.W.: Building HAL: computers that sense, recognize, and respond to human emotion. In: Human Vision and Electronic Imaging VI, vol. 4299, pp. 518– 523. SPIE (2001) 9. You, Q., Luo, J., Jin, H., Yang, J.: Robust image sentiment analysis using progressively trained and domain transferred deep networks. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 29 (2015) 10. Jou, B., Chen, T., Pappas, N., Redi, M., Topkara, M., Chang, S.-F.: Visual affect around the world: a large-scale multilingual visual sentiment ontology. In: Proceedings of the 23rd ACM International Conference on Multimedia, pp. 159–168 (2015) 11. Deng, J., Dong, W., Socher, R., Li, L.-J., Li, K., Fei-Fei, L.: ImageNet: a large-scale hierarchical image database. In: 2009 IEEE Conference on Computer Vision and Pattern Recognition, pp. 248–255. IEEE (2009)



12. Otsuna, H., Ito, K.: Systematic analysis of the visual projection neurons of drosophila melanogaster. I lobula-specific pathways. J. Comp. Neurol. 497(6), 928– 958 (2006) 13. Gamble, K.R.: The Holtzman inkblot technique. Psychol. Bull. 77(3), 172 (1972) 14. Tsoumakas, G., Katakis, I.: Multi-label classification: an overview. Int. J. Data Warehousing Min. (IJDWM) 3(3), 1–13 (2007) 15. He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 770–778 (2016) 16. Zhang, Y., Kang, B., Hooi, B., Yan, S., Feng, J.: Deep long-tailed learning: a survey. IEEE Trans. Pattern Anal. Mach. Intell. (2023) 17. Dosovitskiy, A., et al.: An image is worth 16 × 16 words: transformers for image recognition at scale. arXiv preprint arXiv:2010.11929 (2020) 18. Zhang, W., He, X., Lu, W.: Exploring discriminative representations for image emotion recognition with CNNs. IEEE Trans. Multimed. 22(2), 515–523 (2019) 19. Zhu, X., et al.: Dependency exploitation: a unified CNN-RNN approach for visual emotion recognition. In: IJCAI, pp. 3595–3601 (2017) 20. Rao, T., Li, X., Min, X.: Learning multi-level deep representations for image emotion classification. Neural Process. Lett. 51, 2043–2061 (2020) 21. Zhao, S., Jia, Z., Chen, H., Li, L., Ding, G., Keutzer, K.: PDANet: polarityconsistent deep attention network for fine-grained visual emotion regression. In: Proceedings of the 27th ACM International Conference on Multimedia, pp. 192– 201 (2019) 22. Krizhevsky, A., Sutskever, I., Hinton, G.E.: ImageNet classification with deep convolutional neural networks. In: Advances in Neural Information Processing Systems, vol. 25 (2012) 23. Simonyan, K., Zisserman, A.: Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556 (2014) 24. Raghu, M., Unterthiner, T., Kornblith, S., Zhang, C., Dosovitskiy, A.: Do vision transformers see like convolutional neural networks? In: Advances in Neural Information Processing Systems, vol. 34, pp. 12116–12128 (2021)

Remote Sensing Image Interpretation

Change-Aware Network for Damaged Roads Recognition and Assessment Based on Multi-temporal Remote Sensing Imageries

Jiaxin Chen, Ming Wu(B), Haotian Yan, Binzhu Xie, and Chuang Zhang

School of Artificial Intelligence, Beijing University of Posts and Telecommunications, Beijing 100876, China
[email protected]

Abstract. Road damage assessment holds tremendous potential for evaluating damage and reducing disaster risks to human lives during emergency responses to unforeseen events. Change Detection (CD) methods detect changes on the land surface by comparing bi-temporal remote sensing imageries. Existing research that uses CD for post-disaster assessment mainly focuses on buildings, while for roads both the datasets and the methodology need to be improved. In response, we propose an innovative multi-tasking network that combines Vision Transformer and UNet (BiTransUNet) for identifying road change areas and assessing damage from bi-temporal remote sensing imageries acquired before and after natural disasters, which is the first road damage assessment model. Notably, our BiTransUNet comprises three efficient modules: a Multi-scale Feature Extraction (MFE) module for extracting multi-scale features, a Trans and Res Skip Connection (TRSC) module for modeling spatial-temporal global information, and a Dense Cased Upsample (DCU) module for change map reconstruction. In addition, to facilitate our study, we create a new Remote Sensing Road Damage Dataset, RSRDD, designed to contain 1,212 paired imageries before and after disasters together with the corresponding road change masks and road damage levels. Experimental results on the proposed RSRDD show that BiTransUNet outperforms current state-of-the-art approaches. BiTransUNet also achieves the best performance on the LEVIR-CD building change detection dataset, which demonstrates its applicability to detecting changes of different important ground objects.

Keywords: Remote sensing dataset · Change detection · Vision transformer · Damaged road

1 Introduction

Road damage assessment is the process of evaluating bi-temporal remote sensing imageries acquired before and after natural disasters to identify the condition of roads. This process entails road change detection and assessing the extent of damage, playing a pivotal role in road monitoring, emergency rescue, damage assessment, and subsequent road reconstruction.

Change Detection (CD) represents a crucial aspect of road damage assessment and has been widely adopted in remote sensing applications, such as disaster monitoring [28], urban planning [27], and land cover detection [2]. Traditional pixel-based [9] and object-based [23] algorithms are unable to model complex mapping relationships as they primarily focus on extracting simple texture features. Recently, most mainstream CD algorithms, including Autoencoder-based [17], Recurrent Neural Network-based [21], Convolutional Neural Network-based [14], Generative Adversarial Network-based [15], and Transformer-based [4] methods, are end-to-end networks that form an all-inclusive framework.

Deep learning has also emerged as an essential tool for damage assessment. Nia et al. [19] proposed a high-performance hierarchical model to assess building damage. Hamdi et al. [11] presented a modified UNet [22] architecture for determining damaged areas in large forests. However, existing methods have limitations when applied to road damage assessment: 1) building change detection methods are unsuitable for road change detection, as they cannot capture fine-grained, continuous road changes; 2) roads are more resistant to damage and exhibit continuity compared to buildings, so road damage assessment poses greater challenges than building change detection and lacks benchmark datasets in remote sensing.

To tackle the aforementioned issues with change detection and damage assessment, we first propose BiTransUNet, a novel multi-tasking network combining Vision Transformer and UNet. To facilitate and validate our approach, we then introduce an entirely new Remote Sensing Road Damage Dataset (RSRDD), comprising 1,212 pre- and post-disaster imagery pairs together with the corresponding road change masks and road damage levels. BiTransUNet includes three efficient modules: a Multi-scale Feature Extraction (MFE) module, a Trans and Res Skip Connection (TRSC) module, and a Dense Cased Upsample (DCU) module. MFE extracts multi-scale feature maps from the CNN backbone, producing both coarse-grained and fine-grained features. To model long-short contextual relations, TRSC is designed to work with hierarchical rich features at four different stages. Finally, DCU enhances the reconstruction from features to change maps by leveraging densely connected layers from the four stages.

To sum up, the main contributions of this paper are as follows:
– We introduce a new remote sensing road damage dataset, RSRDD, which serves as a benchmark for road damage assessment and complements previous studies that mainly centered on building change detection.
– An end-to-end multi-tasking network architecture, BiTransUNet, is proposed for road change detection and road damage assessment, which fully exploits multi-scale features to obtain finer details in the change maps.
– Extensive experiments confirm the versatility of our proposed framework, which shows excellent performance not only in road damage assessment but also in building change detection.


2 Related Works

2.1 Change Detection Methods Based on Deep Learning

CNN-based methods such as FC-Siam-diff [7] and STANet [5] are widely used in CD because of their efficiency in feature extraction. Daudt et al. [7] presented three fully convolutional networks based on the UNet model, modified into a Siamese architecture for the first time. Chen et al. [5] proposed STANet with spatial-temporal attention to obtain better feature representations. While CNNs have proven to be effective in change detection, the features extracted solely by CNNs lack global contextual information, making them susceptible to pseudo-changes induced by factors such as noise, shadow, and illumination. Therefore, many improved methods that integrate Transformers have been proposed to better model long-term spatial-temporal relationships and enhance the ability to discriminate bi-temporal imageries. Chen et al. [4] proposed an efficient Transformer-based model (BiT), transforming spatial information into semantic information for change detection in remote sensing imageries. Bandara et al. [1] proposed a Transformer-based Siamese network (ChangeFormer), utilizing a hierarchical Transformer encoder with a simple MLP decoder. Although previous methods have demonstrated strong performance in building change detection, they fall short in identifying the continuity of road changes and capturing fine-grained features.

2.2 Transformer and UNet Architecture

Transformer was first introduced in 2017 [24], which was originally used for long-range dependency modeling in the field of Natural Language Processing (NLP). The attention mechanism of the transformer enables it to efficiently model global contextual information. Beginning with the Vision Transformer (ViT) [8], transformers have shown promise in Computer Vision (CV). UNet, a seminal architecture proposed in 2015 [22], gained popularity for image segmentation tasks. Researchers have applied the UNet architecture to other fields such as image classification [3] and object detection [26]. TransUNet is a combination of Transformer and UNet architecture. Chen et al. [6] first proposed TransUNet for general medical image segmentation, which encodes strong global context by treating image features as sequences while also utilizing low-level CNN features through a u-shaped hybrid architectural design. Since then, many studies [20,25] presented corresponding TransUNet architecture based on the same framework due to its powerful feature extraction and detail recovery capabilities. These methods combine high-level semantic information with low-level texture information through skip connections and model spatial-temporal relationships with a transformer to generate more accurate change maps. In contrast to previous TransUNet models, which incorporate global context solely in the bottom stage, our BiTransUNet models long-short contextual relations in four different stages.

3 Method

Our overview architecture is depicted in Fig. 1. We incorporate change detection and damage assessment into a new pipeline. Given a couple of imageries, our model starts with Multi-scale Feature Extraction module (MFE) to obtain multiscale features of bi-temporal imageries. Then, Trans and Res Skip Connection (TRSC) helps to model spatial-temporal global information of features. Besides, Dense Cased Upsample module (DCU) is applied to enhance the reconstruction process from features to change map. Finally, the resulting features are fed to prediction heads and produce the change map and damage level.

Fig. 1. Architecture of BiTransUNet. (a) Multi-scale Feature Extraction module (MFE); (b) Trans and Res Skip Connection (TRSC); (c) Dense Cased Upsample module (DCU); (d) Prediction Heads for change detection and damage assessment.

3.1 Multi-scale Feature Extraction Module (MFE)

Our intuition is that multi-scale features contain both coarse-grained and fine-grained information, which is better than single-scale features. ResNet [12] is an efficient and portable network that has been widely used in [3–5,26], so we utilize ResNet-18 as our CNN backbone for feature extraction. We use the first three layers of ResNet-18 and a normal convolutional block to generate feature maps at four different scales. The features extracted at the different stages are fed into TRSC to enhance and refine the bi-temporal features.
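To make the MFE idea concrete, the sketch below shows one way such a multi-scale extractor could be wired up in PyTorch, using a torchvision ResNet-18 stem, its first three residual stages, and one extra convolutional block for a fourth, coarser scale. The channel width of the extra block and the exact layer split are assumptions for illustration, not the authors' exact configuration; in the Siamese setting the same extractor is applied to both temporal images.

```python
import torch
import torch.nn as nn
from torchvision.models import resnet18

class MultiScaleFeatureExtractor(nn.Module):
    """Sketch of an MFE-style module: ResNet-18 stem + first three residual
    stages, plus one plain conv block producing a fourth (coarsest) scale."""
    def __init__(self, out_channels=256):
        super().__init__()
        net = resnet18(weights=None)
        self.stem = nn.Sequential(net.conv1, net.bn1, net.relu, net.maxpool)
        self.layer1, self.layer2, self.layer3 = net.layer1, net.layer2, net.layer3
        # Extra convolutional block for the coarsest scale (assumed design).
        self.layer4 = nn.Sequential(
            nn.Conv2d(256, out_channels, 3, stride=2, padding=1),
            nn.BatchNorm2d(out_channels), nn.ReLU(inplace=True))

    def forward(self, x):
        x = self.stem(x)
        f1 = self.layer1(x)   # 1/4 resolution, 64 channels
        f2 = self.layer2(f1)  # 1/8 resolution, 128 channels
        f3 = self.layer3(f2)  # 1/16 resolution, 256 channels
        f4 = self.layer4(f3)  # 1/32 resolution, coarsest scale
        return f1, f2, f3, f4

feats = MultiScaleFeatureExtractor()(torch.randn(1, 3, 256, 256))
print([f.shape for f in feats])
```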

3.2 Trans and Res Skip Connection (TRSC)

Trans and Res Skip Connection can be divided into two sub-blocks, Trans.Block and Res.Block. Trans.Block (Fig. 2 (a)) is designed to model the spatial-temporal global information of the bi-temporal features at different stages, which contains high-level semantic information. Res.Block (Fig. 2 (b)) consists of two convolutional layers and aims to obtain the difference map of the bi-temporal features, which contains low-level geolocation information.

Fig. 2. Architecture of Trans and Res Skip Connection (TRSC).

Trans.Block is composed of a Tokenizer and a Transformer. Inspired by the semantic tokenizer in BiT [4], our tokenizer has the same architecture, which is depicted in Fig. 3 (a). Suppose the height, width, and channel dimension of the feature map are $H$, $W$, $C$. Let $X^1, X^2 \in \mathbb{R}^{HW \times C}$ be the bi-temporal features and $T^1, T^2 \in \mathbb{R}^{L \times C}$ be the semantic tokens. The semantic tokens $T^i \in \mathbb{R}^{L \times C}$ $(i = 1, 2)$ are computed as

$$T^i = (A^i)^T X^i = \left(\sigma(\varphi(X^i))\right)^T X^i, \qquad (1)$$

where $\varphi(\cdot)$ is a pointwise convolution, $\sigma(\cdot)$ denotes the softmax function, and $A^i$ is the attention map computed by the pointwise convolution.

The Transformer encoder and decoder follow the default architecture of ViT [8] with the pre-norm residual unit shown in Fig. 3 (b) and (c). At each encoder layer $l$, the $Q$ (query), $K$ (key), and $V$ (value) are mapped from the merging of the two semantic tokens $T^{(l-1)} \in \mathbb{R}^{2L \times C}$. Multi-head self-attention (MSA) is then applied:

$$\mathrm{Attention}(Q, K, V) = \mathrm{Softmax}\!\left(\frac{QK^T}{\sqrt{d}}\right)V, \qquad (2)$$

$$\mathrm{MSA}(Q, K, V) = \mathrm{Concat}(\mathrm{head}_1, \ldots, \mathrm{head}_h)\,W^O, \quad \text{where} \quad \mathrm{head}_j = \mathrm{Attention}(QW_j^q, KW_j^k, VW_j^v), \qquad (3)$$

where $W_j^q, W_j^k, W_j^v \in \mathbb{R}^{C \times d}$ and $W^O \in \mathbb{R}^{hd \times C}$ are the parameter matrices of the different attention heads. Here $h$ and $C$ denote the number of attention heads and the dimension of the embedding tokens, and typically we set $d = C/h$.

3.3 Dense Cased Upsample Module (DCU)

For change map reconstruction, we design a dense cased upsample module (DCU), depicted in Fig. 1 (c). Different from the direct upsampling in [6,20,25], the dense block [13] has advantages in feature reuse, alleviating vanishing gradients, and efficient memory usage, making it a suitable choice for maintaining the continuity characteristics of roads. In this way, each stage in DCU receives feature maps from all preceding stages, so coarse-grained and fine-grained features can be fused for the subsequent prediction.
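The exact layer configuration of DCU is not spelled out here, but the hedged sketch below illustrates the dense idea: each decoding stage fuses the current skip feature with all previously decoded stage outputs, resized to the current resolution. The channel widths, stage count, and fusion block are assumptions for illustration only.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class DenseCascadedUpsample(nn.Module):
    """Sketch of a DCU-style decoder: every stage concatenates the current skip
    feature with ALL earlier stage outputs (bilinearly resized), so coarse- and
    fine-grained features are reused densely before a fusion convolution."""
    def __init__(self, skip_channels=(256, 256, 128, 64), width=64):
        super().__init__()
        self.blocks = nn.ModuleList()
        for i, c in enumerate(skip_channels):
            in_ch = c + i * width              # current skip + i earlier outputs
            self.blocks.append(nn.Sequential(
                nn.Conv2d(in_ch, width, 3, padding=1),
                nn.BatchNorm2d(width),
                nn.ReLU(inplace=True)))

    def forward(self, skips):                  # skips ordered coarsest -> finest
        outputs = []
        for block, skip in zip(self.blocks, skips):
            prev = [F.interpolate(o, size=skip.shape[-2:], mode='bilinear',
                                  align_corners=False) for o in outputs]
            outputs.append(block(torch.cat([skip] + prev, dim=1)))
        return outputs[-1]                     # finest reconstructed feature map

# Example with assumed channel widths and resolutions (coarsest to finest).
feats = [torch.randn(1, c, s, s) for c, s in zip((256, 256, 128, 64), (8, 16, 32, 64))]
print(DenseCascadedUpsample()(feats).shape)    # torch.Size([1, 64, 64, 64])
```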


Fig. 3. Detail of Tokenizer, Transformer Encoder and Transformer Decoder.

3.4 Prediction Heads and Loss Function

After the Transformer encoder and DCU, two prediction heads are employed for damage assessment and change detection, as shown in Fig. 1 (d). For damage assessment, a class embedding [8] and the semantic tokens are concatenated into new tokens in stage 1, so the token length is changed from L to L + 1. After the Transformer encoder, the class embedding is split off and fed into an MLP head to predict the damage level of roads. For change detection, the reconstructed features are fed into several convolutional layers for upsampling and channel reduction, and the softmax function is then employed to produce the binary change map of roads.

Since the positive and negative samples are imbalanced for change detection, the plain cross-entropy loss is not suitable for this task, so we utilize the focal loss [16] and dice loss [18] to better handle class imbalance. For damage assessment we use the cross-entropy loss. Formally,

$$L_{focal} = -\sum_{i=1}^{n} y\,\alpha_t (1-\hat{y})^{\gamma} \log \hat{y}, \qquad (4)$$

$$L_{dice} = 1 - \frac{2\sum \hat{y}\,y + \epsilon}{\sum \hat{y} + \sum y + \epsilon}, \qquad (5)$$

$$L_{ce} = -\frac{1}{n}\sum_{i=1}^{n} y \log \hat{y}, \qquad (6)$$

where $\hat{y}$ and $y$ are the prediction and the ground truth, $\alpha_t$ is a weighting hyperparameter, $\gamma$ is a focusing parameter, and $\epsilon$ is a small constant that prevents division by zero. The total loss is the sum of the three losses:

$$L_{total} = L_{focal} + L_{dice} + L_{ce}. \qquad (7)$$
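A possible PyTorch realization of the combined objective of Eq. (7) is sketched below. It uses a single-logit (sigmoid) formulation of the binary change map and the common two-sided focal-loss variant, so it differs in minor details from Eqs. (4)–(6); the alpha and gamma values are assumptions rather than the paper's settings.

```python
import torch
import torch.nn.functional as F

def focal_loss(logits, target, alpha=0.75, gamma=2.0):
    """Binary focal loss on change-map logits (alpha/gamma are assumed values)."""
    prob = torch.sigmoid(logits)
    ce = F.binary_cross_entropy_with_logits(logits, target, reduction='none')
    p_t = prob * target + (1 - prob) * (1 - target)          # prob of the true class
    alpha_t = alpha * target + (1 - alpha) * (1 - target)
    return (alpha_t * (1 - p_t) ** gamma * ce).mean()

def dice_loss(logits, target, eps=1e-6):
    prob = torch.sigmoid(logits).flatten(1)
    target = target.flatten(1)
    inter = (prob * target).sum(dim=1)
    return (1 - (2 * inter + eps) / (prob.sum(dim=1) + target.sum(dim=1) + eps)).mean()

def total_loss(change_logits, change_gt, damage_logits, damage_gt):
    """L_total = L_focal + L_dice + L_ce, following Eq. (7)."""
    return (focal_loss(change_logits, change_gt)
            + dice_loss(change_logits, change_gt)
            + F.cross_entropy(damage_logits, damage_gt))

# Toy example: 2 samples, 64x64 change maps, 3 damage levels.
cl = torch.randn(2, 64, 64); cg = torch.randint(0, 2, (2, 64, 64)).float()
dl = torch.randn(2, 3); dg = torch.randint(0, 3, (2,))
print(total_loss(cl, cg, dl, dg))
```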

4 Datasets

4.1 RSRDD

Considering the lack of a benchmark dataset for road damage assessment, we aim to fill this gap and provide a new remote sensing road damage dataset (RSRDD). We collected two batches of pre- and post-disaster very high-resolution imagery pairs from 6 different regions located in several cities in China. Each region has a varied number of image patches and covers a diversity of natural disasters, such as floods, earthquakes, torrential rains, mudslides, and landslides. Our image dataset also exhibits temporal variability, with a collection timeframe ranging from 2014 to 2022. We located the position of roads with the pre-disaster imageries and detected the change areas of roads with the post-disaster imageries. We then classified road damage into three categories (0/1/2 for no or less/medium/severe damage). Finally, we cut each image into 256 × 256 patches, yielding a total of 1,212 bi-temporal imagery pairs annotated with change masks and damage levels, split into 726/242/244 pairs for training/validation/testing, respectively. Some examples are shown in Fig. 4 (b).
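For illustration, a loader for RSRDD-style samples might look like the sketch below. The directory layout, file naming, and label-file format are hypothetical assumptions, since the released structure of the dataset is not described here.

```python
import os
import torch
from torch.utils.data import Dataset
from PIL import Image
import numpy as np

class RoadDamageDataset(Dataset):
    """Sketch of a loader for one RSRDD-style sample: a pre/post RGB image pair,
    a binary road-change mask, and a damage level in {0, 1, 2}.
    Directory layout and file naming below are assumed, not official."""
    def __init__(self, root, split='train'):
        self.root = os.path.join(root, split)
        self.ids = sorted(os.listdir(os.path.join(self.root, 'pre')))
        # damage_levels.txt: "<patch_name> <level>" per line (assumed format).
        with open(os.path.join(self.root, 'damage_levels.txt')) as f:
            self.levels = dict(line.split() for line in f)

    def __len__(self):
        return len(self.ids)

    def __getitem__(self, idx):
        name = self.ids[idx]
        pre = np.array(Image.open(os.path.join(self.root, 'pre', name))) / 255.0
        post = np.array(Image.open(os.path.join(self.root, 'post', name))) / 255.0
        mask = np.array(Image.open(os.path.join(self.root, 'mask', name))) > 0
        return (torch.from_numpy(pre).permute(2, 0, 1).float(),
                torch.from_numpy(post).permute(2, 0, 1).float(),
                torch.from_numpy(mask).float(),
                int(self.levels[name]))
```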

4.2 LEVIR-CD

LEVIR-CD [5] is a public dataset for building change detection, including 637 pairs of photos from Google Earth at extremely high resolution (1024 × 1024 pixels). It contains various types of buildings with different shapes and sizes, such as factories, apartments, warehouses, etc. We follow the default dataset split with 256 × 256 pixels and obtain 7120 training sets, 1024 validation sets, and 2048 test sets, respectively. Figure 4 (a) illustrates some examples of LEVIR-CD.

Fig. 4. Bi-temporal imageries and ground truth maps from the LEVIR-CD and the RSRDD datasets.

5 Experiments

5.1 Implementation Details

Our BiTransUNet is implemented in PyTorch and trained on 2 NVIDIA GeForce GTX 1080 Ti GPUs. We apply standard data augmentation to the input imageries, such as flipping, rotation, cropping, and blurring. For RSRDD, we train BiTransUNet with a batch size of 8 for 400 epochs. Specifically, the initial learning rate is 0.001, and the AdamW optimizer is used with β1 = 0.9, β2 = 0.99, and a weight decay of 0.0005. For LEVIR-CD, we follow the same settings as [4] for the initial learning rate and the other network parameters.
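The optimizer settings above translate directly into PyTorch as in the runnable sketch below. The tiny stand-in network and the plain binary cross-entropy loss are placeholders so that the snippet is self-contained; only the AdamW hyperparameters, batch size, and patch size come from the text.

```python
import torch
import torch.nn as nn
from torch.optim import AdamW

class TinyChangeNet(nn.Module):
    """Stand-in Siamese model; the real BiTransUNet would replace it."""
    def __init__(self):
        super().__init__()
        self.enc = nn.Conv2d(3, 16, 3, padding=1)
        self.head = nn.Conv2d(16, 1, 1)
    def forward(self, pre, post):
        return self.head(self.enc(post) - self.enc(pre)).squeeze(1)

model = TinyChangeNet()
optimizer = AdamW(model.parameters(), lr=1e-3, betas=(0.9, 0.99), weight_decay=5e-4)

pre = torch.randn(8, 3, 256, 256)              # batch size 8, 256x256 patches
post = torch.randn(8, 3, 256, 256)
mask = torch.randint(0, 2, (8, 256, 256)).float()

for step in range(2):                          # the paper trains for 400 epochs
    logits = model(pre, post)
    loss = nn.functional.binary_cross_entropy_with_logits(logits, mask)
    optimizer.zero_grad(); loss.backward(); optimizer.step()
    print(f'step {step}: loss {loss.item():.4f}')
```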

5.2 Comparison to State-of-the-Art Methods

We conduct experiments on RSRDD and LEVIR-CD to compare the performance of our BiTransUNet with other SOTA methods [1,4,5,10]. Specifically, we demonstrate that BiTransUNet shows better performance in road change detection compared to other methods and can get a good result in road damage assessment on RSRDD. For LEVIR-CD, our BiTransUNet achieves state-of-theart performance in building change detection tasks. Table 1. Damage assessment results on RSRDD (0/1/2 for no or less/medium/severe damage). All the scores are described in percentage (%). Parameter1 Value1 Parameter2 Value2 Accuracy

84.02

-

Precision

84.37

Precision 0 86.00 Precision 1 84.95 Precision 2 82.18

-

Recall

82.88

Recall 0 Recall 1 Recall 2

74.14 82.29 92.22

F1

83.38

F1 0 F1 1 F1 2

79.63 83.60 86.91

Table 1 shows the results for road damage assessment together with refined results for each evaluation metric. The F1-score reaches 83.38, which demonstrates the efficacy of BiTransUNet in precisely discerning damage levels. We also observe a slight increase in the F1-score from F1 0 to F1 2, primarily because severe damage carries more spatial information than less or medium damage, making the network more sensitive to it.

Table 2 shows the comparison results on RSRDD and LEVIR-CD, where BiTransUNet is clearly more effective than the other methods.


Table 2. Change detection results on RSRDD and LEVIR-CD. All the scores are described in percentage (%).

Method           | RSRDD Pre. / Rec. / F1 / IoU / OA      | LEVIR-CD Pre. / Rec. / F1 / IoU / OA
STANet [5]       | 78.47 / 95.61 / 84.92 / 76.48 / 98.31  | 94.54 / 83.98 / 88.95 / 80.10 / 98.94
SNUNet [10]      | 72.13 / 54.43 / 62.04 / 44.97 / 98.54  | 91.67 / 88.96 / 90.29 / 82.11 / 99.04
BiT [4]          | 86.73 / 73.94 / 78.91 / 69.90 / 98.30  | 89.24 / 89.37 / 89.31 / 80.68 / 98.92
ChangeFormer [1] | 86.59 / 81.27 / 83.71 / 75.10 / 98.52  | 92.05 / 88.80 / 90.40 / 82.48 / 99.04
BiTransUNet      | 90.27 / 88.48 / 89.36 / 82.30 / 99.11  | 95.24 / 93.74 / 94.47 / 89.95 / 98.95

Note that BiTransUNet outperforms BiT [4] by 10.45 and 12.40 points in F1-score and IoU on RSRDD, which shows that BiTransUNet is better suited to road change detection. On LEVIR-CD, our method pays more attention to fine-grained features and exceeds the other methods by a large margin, with improvements of 4.07 and 7.47 in F1-score and IoU over ChangeFormer [1]. These results show that BiTransUNet achieves state-of-the-art performance on both datasets.

5.3 Qualitative Results

Some qualitative examples of BiTransUNet are shown in Fig. 5. The first two columns are bi-temporal imageries and the last column is ground truth, while the remaining columns exhibit the binary change maps that are predicted by several different models. STANet recalls the road change areas as much as possible, but causes a high false positive rate. SNUNet and BiT only detect partial road change regions, resulting in incomplete detection results. ChangeFormer insufficiently captures the continuity of the road. It is readily discernible that BiTransUNet surpasses its counterparts in accurately detecting both the coarse-grained and fine-grained change areas, demonstrating its proficiency and robustness in the face of challenging real-world scenarios.

Fig. 5. Qualitative results on RSRDD, and the red boxes are used to highlight the predictions of the different models. (Color figure online)

5.4 Ablation Study

In order to demonstrate the effectiveness of BiTransUNet, we conducted an ablation study on RSRDD.

Table 3. Effect of the loss function, token length, and Res.Block for change detection. All the scores are described in percentage (%).

Parameter     | Index      | Precision | Recall | F1    | IoU
loss function | CE         | 89.59     | 85.64  | 87.43 | 79.95
              | Focal+Dice | 90.27     | 88.48  | 89.36 | 82.30
token length  | 4          | 90.27     | 88.48  | 89.36 | 82.30
              | 8          | 90.97     | 87.97  | 89.38 | 82.32
              | 16         | 90.38     | 86.82  | 88.45 | 81.18
              | 32         | 91.56     | 87.01  | 89.06 | 81.93
Res.Block     | ✓          | 90.27     | 88.48  | 89.36 | 82.30
              | ×          | 90.35     | 86.45  | 88.23 | 80.91

Table 3 exhibits the ablation results for change detection with different parameters. When the plain cross-entropy loss is used, we observe significant drops in F1-score and IoU. When the Res.Block is employed, both F1-score and IoU improve noticeably, which indicates that TRSC is critical in our network. Considering the computational complexity of the model, although the best performance is achieved with a token length of 8, we choose 4 because the F1-score and IoU drop by only 0.02 in comparison.

6 Conclusion

In this paper, we introduce a novel multi-tasking network, BiTransUNet, which is based on a combination of Transformer and UNet architecture, and is the first framework for road damage assessment in remote sensing. Our BiTransUNet has the ability to extract multi-scale features and model spatial-temporal global information through three auxiliary modules: MFE, TRSC, and DCU. Furthermore, we present the Remote Sensing Road Damage Dataset (RSRDD), which is the first road damage dataset in remote sensing. Experimental results demonstrate that BiTransUNet achieves state-of-the-art performance in change detection and damage assessment on two related datasets. This paper presents new ideas for future work in this field.

References 1. Bandara, W.G.C., Patel, V.M.: A transformer-based Siamese network for change detection. In: IGARSS 2022–2022 IEEE International Geoscience and Remote Sensing Symposium, pp. 207–210. IEEE (2022)


2. Bolorinos, J., Ajami, N.K., Rajagopal, R.: Consumption change detection for urban planning: monitoring and segmenting water customers during drought. Water Resources Res. 56(3), e2019WR025812 (2020) 3. Cao, K., Zhang, X.: An improved Res-UNet model for tree species classification using airborne high-resolution images. Remote Sens. 12(7), 1128 (2020) 4. Chen, H., Qi, Z., Shi, Z.: Remote sensing image change detection with transformers. IEEE Trans. Geosci. Remote Sens. 60, 1–14 (2021) 5. Chen, H., Shi, Z.: A spatial-temporal attention-based method and a new dataset for remote sensing image change detection. Remote Sens. 12(10), 1662 (2020) 6. Chen, J., et al.: TransuNet: transformers make strong encoders for medical image segmentation. arXiv preprint arXiv:2102.04306 (2021) 7. Daudt, R.C., Le Saux, B., Boulch, A.: Fully convolutional Siamese networks for change detection. In: 2018 25th IEEE International Conference on Image Processing (ICIP), pp. 4063–4067. IEEE (2018) 8. Dosovitskiy, A., et al.: An image is worth 16x16 words: transformers for image recognition at scale. arXiv preprint arXiv:2010.11929 (2020) 9. El-Hattab, M.M.: Applying post classification change detection technique to monitor an Egyptian coastal zone (Abu Qir Bay). Egypt. J. Remote Sens. Space Sci. 19(1), 23–36 (2016) 10. Fang, S., Li, K., Shao, J., Li, Z.: SNUNet-CD: a densely connected Siamese network for change detection of VHR images. IEEE Geosci. Remote Sens. Lett. 19, 1–5 (2021) 11. Hamdi, Z.M., Brandmeier, M., Straub, C.: Forest damage assessment using deep learning on high resolution remote sensing data. Remote Sens. 11(17), 1976 (2019) 12. He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 770–778 (2016) 13. Huang, G., Liu, Z., Van Der Maaten, L., Weinberger, K.Q.: Densely connected convolutional networks. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 4700–4708 (2017) 14. Li, H., Wu, K., Xu, Y.: An integrated change detection method based on spectral unmixing and the CNN for hyperspectral imagery. Remote Sens. 14(11), 2523 (2022) 15. Li, X., Du, Z., Huang, Y., Tan, Z.: A deep translation (GAN) based change detection network for optical and SAR remote sensing images. ISPRS J. Photogramm. Remote. Sens. 179, 14–34 (2021) 16. Lin, T.Y., Goyal, P., Girshick, R., He, K., Doll´ ar, P.: Focal loss for dense object detection. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 2980–2988 (2017) 17. L´ opez-Fandi˜ no, J., Garea, A.S., Heras, D.B., Arg¨ uello, F.: Stacked autoencoders for multiclass change detection in hyperspectral images. In: IGARSS 2018–2018 IEEE International Geoscience and Remote Sensing Symposium, pp. 1906–1909. IEEE (2018) 18. Milletari, F., Navab, N., Ahmadi, S.A.: V-Net: fully convolutional neural networks for volumetric medical image segmentation. In: 2016 Fourth International Conference on 3D Vision (3DV), pp. 565–571. IEEE (2016) 19. Nia, K.R., Mori, G.: Building damage assessment using deep learning and groundlevel image data. In: 2017 14th Conference on Computer and Robot Vision (CRV), pp. 95–102. IEEE (2017)


20. Pang, L., Sun, J., Chi, Y., Yang, Y., Zhang, F., Zhang, L.: CD-TransUNet: a hybrid transformer network for the change detection of urban buildings using L-band SAR images. Sustainability 14(16), 9847 (2022) 21. Papadomanolaki, M., Verma, S., Vakalopoulou, M., Gupta, S., Karantzalos, K.: Detecting urban changes with recurrent neural networks from multitemporal sentinel-2 data. In: IGARSS 2019–2019 IEEE International Geoscience and Remote Sensing Symposium, pp. 214–217. IEEE (2019) 22. Ronneberger, O., Fischer, P., Brox, T.: U-Net: convolutional networks for biomedical image segmentation. In: Navab, N., Hornegger, J., Wells, W.M., Frangi, A.F. (eds.) MICCAI 2015. LNCS, vol. 9351, pp. 234–241. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-24574-4 28 23. Tomowski, D., Ehlers, M., Klonus, S.: Colour and texture based change detection for urban disaster analysis. In: 2011 Joint Urban Remote Sensing Event, pp. 329– 332. IEEE (2011) 24. Vaswani, A., et al.: Attention is all you need. In: Advances in Neural Information Processing Systems, vol. 30 (2017) 25. Yang, Y., Mehrkanoon, S.: AA-TransUNet: attention augmented TransUNet for nowcasting tasks. In: 2022 International Joint Conference on Neural Networks (IJCNN), pp. 01–08. IEEE (2022) 26. Yuan, L., et al.: Multi-objects change detection based on Res-Unet. In: 2021 IEEE International Geoscience and Remote Sensing Symposium IGARSS, pp. 4364–4367. IEEE (2021) 27. Zhang, Z., Vosselman, G., Gerke, M., Tuia, D., Yang, M.Y.: Change detection between multimodal remote sensing data using Siamese CNN. arXiv preprint arXiv:1807.09562 (2018) 28. Zheng, Z., Zhong, Y., Wang, J., Ma, A., Zhang, L.: Building damage assessment for rapid disaster response with a deep object-based semantic change detection framework: from natural disasters to man-made disasters. Remote Sens. Environ. 265, 112636 (2021)

UAM-Net: An Attention-Based Multi-level Feature Fusion UNet for Remote Sensing Image Segmentation

Yiwen Cao1,2, Nanfeng Jiang1,2, Da-Han Wang1,2, Yun Wu1,2(B), and Shunzhi Zhu1,2

1 School of Computer and Information Engineering, Xiamen University of Technology, Xiamen 361024, China
[email protected], {wangdh,szzhu}@xmut.edu.cn
2 Fujian Key Laboratory of Pattern Recognition and Image Understanding, Xiamen 361024, China
[email protected]

Abstract. Semantic segmentation of Remote Sensing Images (RSIs) is an essential application for precision agriculture, environmental protection, and economic assessment. While UNet-based networks have made significant progress, they still face challenges in capturing long-range dependencies and preserving fine-grained details. To address these limitations and improve segmentation accuracy, we propose an effective method, namely UAM-Net (UNet with Attention-based Multi-level feature fusion), to enhance global contextual understanding and maintain fine-grained information. Specifically, UAM-Net incorporates three key modules. Firstly, the Global Context Guidance Module (GCGM) integrates semantic information from the Pyramid Pooling Module (PPM) into each decoder stage. Secondly, the Triple Attention Module (TAM) effectively addresses feature discrepancies between the encoder and decoder. Finally, the computation-effective Linear Attention Module (LAM) seamlessly fuses coarse-level feature maps with multiple decoder stages. With the cooperation of these modules, UAM-Net significantly outperforms state-of-the-art methods on two popular benchmarks.

Keywords: Semantic segmentation · U-shape architecture · Attention mechanism · Feature fusion · Remote sensing images

1 Introduction

Semantic segmentation of Remote Sensing Images (RSIs) is a crucial task that assigns a semantic label to each pixel of an image. It is widely used in a variety of applications, such as precision agriculture [6,15], environmental protection [17,24], and economic assessment [18,25]. Existing RSI semantic segmentation methods mainly rely on the convolutional U-shape structure. The typical U-shape network, namely UNet [16], combines an encoder and a decoder with skip connections. This structure fuses features at different scales to realize accurate segmentation while preserving more spatial information. Following this route, many UNet-based variants such as ResUnet [26], MAResU-Net [8], and UNetFormer [22] have been developed for this task and obtained great success.

Although UNet-based networks have achieved excellent performance in RSI semantic segmentation, they still cannot fully meet the strict accuracy requirements of remote sensing applications. Firstly, their capacity to learn long-range dependencies is limited by localized receptive fields [27]. Such a deficiency in capturing multi-scale information leads to sub-optimal segmentation of objects with different shapes and scales (e.g., cars and buildings). Some studies have tried to address this problem with atrous convolutions [4,13] or feature pyramid structures [2,11,27], but these methods still have limitations in modeling long-range dependencies. Secondly, the hierarchical structure of UNet and its upsampling operations introduce spatial scale differences between low-level and high-level features. Moreover, the direct transmission of low-level features from the encoder to the decoder through skip connections can interfere with high-level features, leading to unfavorable segmentation results.

Based on the above analysis, our model design needs to consider the following aspects: 1) how to effectively utilize global long-range information to enhance the discriminative ability of feature extraction; 2) how to make full use of the detailed information in low-level features while maintaining semantic accuracy on high-level features.

To address these issues, we combine attention mechanisms with multi-level feature fusion and design a novel UAM-Net for RSI semantic segmentation. Firstly, we introduce a Global Context Guidance Module (GCGM) that incorporates a modified Pyramid Pooling Module (PPM) and Global Context Flows (GCF). This enables the integration of semantic information from the PPM into each stage of the decoder, ensuring sufficient global contextual information. Secondly, we employ a Triple Attention Module (TAM) to narrow the feature discrepancies between the encoder and decoder. TAM captures cross-dimensional interactions via a three-branch structure, which can effectively compute attention weights to assign and fuse features at different levels. Thirdly, we introduce a computation-effective Linear Attention Module (LAM) to seamlessly fuse coarse-level feature maps from the GCGM with feature maps at multiple decoder stages. This allows the fusion of global and local information, which effectively improves the ability to capture fine-grained details while maintaining semantic accuracy.

The main contributions can be concluded as follows:
– We introduce a GCGM, which consists of a modified PPM and GCFs, to ensure the incorporation of semantic information at different decoder levels. In this way, global context information can be effectively learned to improve segmentation accuracy.
– We employ a TAM to adaptively assign features at different levels, enhancing consistency between the encoder and decoder. In addition, we introduce the LAM to fuse coarse-level feature maps from the GCGM to enrich semantic information.


– With the help of these modules, our proposed UAM-Net is of high accuracy and efficiency. Extensive experiments conducted on popular datasets also demonstrate the superiority of our approach over other state-of-the-art methods.

2 Related Work

2.1 Encoder-Decoder Designs

Based on the UNet architecture, ResUnet [26] adopted residual units as basic blocks to retain high-level semantic information, and ResUNet-a [4] incorporated atrous convolutions and residual connections. Both introduce short skip connections within a single module, which limits their ability to capture rich information. In addition, some networks focus on enhancing the learning ability of skip connections. GSN [19] designed an entropy control module that adaptively weights feature maps to integrate contextual information and details. HCANet [1] replaced the copy-and-crop operation in UNet with Compact Atrous Spatial Pyramid Pooling (CASPP) modules to extract multi-scale context information. These networks still lose details in the segmentation results and ignore the importance of learning long-range information. Unlike them, our proposed network fuses coarse-grained and fine-grained features reasonably while making full use of global contextual information.

2.2 Attention Mechanism

Attention modules have gained widespread adoption in various tasks due to their ability to enhance feature learning. DC-Swin [20] selected the Swin Transformer as the encoder and utilized CNN-based decoders that significantly improved model performance. However, Transformer-based encoders are often of high computational complexity, which limits their suitability for real-time applications. Other popular attention networks [3,5,14,23] have employed spatial attention, channel attention, or a combination of both to capture dependencies among channels and spatial dimensions. While these mechanisms excel at capturing local dependencies, there is a potential drawback of neglecting long-range information, which is crucial for tasks requiring a broader contextual understanding. Differently, our proposed approach introduces the Triple Attention Module (TAM) to capture cross-dimension dependencies. By incorporating TAM, our model can address the limitation of neglecting long-range information by effectively capturing interactions and dependencies across different dimensions.

3 Proposed Method

3.1 Philosophy of Model Design

Fig. 1. The overall pipeline of the proposed UAM-Net.

In RSI semantic segmentation tasks, several inherent characteristics need to be considered. Firstly, RSIs often contain objects with varying scales and complex structures. Secondly, long-range dependencies are crucial for accurate segmentation, as context information plays a significant role in understanding the spatial relationships between objects. Lastly, this task requires sufficient semantic information and the preservation of fine-grained spatial details to guarantee accurate segmentation. In this paper, we achieve these goals by introducing the Global Context Guidance Module (GCGM), the Triple Attention Module (TAM) [12], and the Linear Attention Module (LAM) [9] on a U-shape structure.

3.2 Network Design

As shown in Fig. 1, we adopt ResNet-18 [7] as the backbone. Different from the original UNet and its variants [8,22,26], we propose a Global Context Guidance Module (GCGM) and place it on top of the encoder to capture multi-scale context information. To avoid the dilution of high-level semantic information in the later stages of the upsampling phase, we aggregate the global prior generated by the GCGM with the feature maps at each level of the decoder. In the feature merging process, we further introduce the Linear Attention Module (LAM) to ensure that feature maps at different scales can be seamlessly merged. Furthermore, we add the Triple Attention Module (TAM) to the skip connections for the feature fusion of the encoder and decoder. These modules are described below.

3.3 Global Context Guidance Module

We propose the GCGM to address these issues in U-shape architectures. The GCGM consists of two parts: a modified Pyramid Pooling Module (PPM) for capturing global context information, and a series of Global Context Flows (GCF) for transferring contextual information to each stage of the decoder. In our approach, the PPM is designed with four levels of bin sizes (1×1, 2×2, 3×3, and 6×6), and the feature maps from these levels are fused as the global prior. Unlike the original PPM, we perform cascaded upsampling on adjacent feature maps and sum them up to preserve local details. By introducing the GCF, high-level semantic information is effectively delivered to each stage of the decoder, ensuring that global context information is retained during the upsampling phase. Overall, our GCGM modifies the PPM and incorporates the GCF to mitigate the disturbance and dilution of high-level features while preserving global context information during the upsampling process in U-shape architectures.

Fig. 2. Structure of the modified Pyramid Pooling Module (PPM).
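The sketch below shows one plausible realization of the modified PPM described above: pooled features at bin sizes 1, 2, 3, and 6 are upsampled in a cascade (coarse to fine) and summed before the final fusion. The branch channel width and the fusion convolution are assumptions, not the authors' exact configuration.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class ModifiedPPM(nn.Module):
    """Sketch of a modified PPM with cascaded upsample-and-sum across adjacent
    pooling levels, followed by fusion with the input to form the global prior."""
    def __init__(self, in_channels=512, branch_channels=128, bins=(1, 2, 3, 6)):
        super().__init__()
        self.branches = nn.ModuleList(
            nn.Sequential(nn.AdaptiveAvgPool2d(b),
                          nn.Conv2d(in_channels, branch_channels, 1, bias=False),
                          nn.BatchNorm2d(branch_channels),
                          nn.ReLU(inplace=True))
            for b in bins)
        self.fuse = nn.Sequential(
            nn.Conv2d(in_channels + branch_channels, in_channels, 3, padding=1, bias=False),
            nn.BatchNorm2d(in_channels), nn.ReLU(inplace=True))

    def forward(self, x):
        h, w = x.shape[-2:]
        feats = [branch(x) for branch in self.branches]   # bin 1 -> bin 6
        acc = feats[0]
        for f in feats[1:]:                               # cascaded upsample + sum
            acc = F.interpolate(acc, size=f.shape[-2:], mode='bilinear',
                                align_corners=False) + f
        acc = F.interpolate(acc, size=(h, w), mode='bilinear', align_corners=False)
        return self.fuse(torch.cat([x, acc], dim=1))      # global prior

prior = ModifiedPPM()(torch.randn(1, 512, 16, 16))
print(prior.shape)  # torch.Size([1, 512, 16, 16])
```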

3.4 Attention Module

Remote sensing images commonly contain objects with large-scale variations and significant intra-class variance, while the inter-class variance tends to be small. It is beneficial to integrate multi-level features to leverage rich information for semantic segmentation. However, challenges arise due to semantic gaps between different features and the potential inclusion of redundant information, which can adversely affect prediction accuracy. Thus, effective feature selection and refinement are of paramount importance in enhancing the segmentation performance. To tackle these issues, this work introduces two attention modules: the Triple Attention Module (TAM) and the Linear Attention Module (LAM). Triple Attention Module. The TAM addresses the fusion of low-level and high-level features by capturing cross-dimensional relationships. Unlike conventional attention mechanisms that separately model spatial and channel dependencies, TAM considers both spatial and channel relationships simultaneously. It enables a more comprehensive fusion of features captured by the encoder and decoder, enhancing the discriminative ability of the model. Especially, TAM achieves this without introducing significant computational overhead.


Linear Attention Module. The LAM focuses on capturing spatial relationships to improve the semantic consistency of features. It recognizes that features obtained from small receptive fields contain rich spatial information but lack semantic consistency, while those from large receptive fields have strong semantic consistency but lack spatial details. By fusing multi-scale context information, LAM enhances the semantic consistency of features, enabling more accurate predictions. In particular, LAM leverages a linear attention mechanism, which reduces the computational complexity from $O(N^2)$ to $O(N)$, making it computationally efficient. Given the significant variation in spatial dimensions compared to channel dimensions, LAM emphasizes modeling spatial relationships when summing features from the Pyramid Pooling Module (PPM) and the decoder.
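To illustrate how the quadratic cost can be avoided, the sketch below implements a generic linear attention of the kind the LAM builds on: queries and keys pass through a positive feature map so that the key-value contraction over the N spatial positions is computed first, giving O(N) complexity. This is a generic formulation for illustration, not claimed to be the exact LAM of [9].

```python
import torch
import torch.nn as nn

class LinearAttention(nn.Module):
    """Sketch of an O(N) spatial attention: with positive feature maps on Q and K,
    out = Q' (K'^T V) / (Q' sum(K')) is computed by contracting K with V first."""
    def __init__(self, channels, key_dim=64):
        super().__init__()
        self.q = nn.Conv2d(channels, key_dim, 1)
        self.k = nn.Conv2d(channels, key_dim, 1)
        self.v = nn.Conv2d(channels, channels, 1)

    def forward(self, x):                                   # x: (B, C, H, W)
        b, c, h, w = x.shape
        q = torch.relu(self.q(x)).flatten(2)                # (B, D, N), positive map
        k = torch.relu(self.k(x)).flatten(2)                # (B, D, N)
        v = self.v(x).flatten(2)                            # (B, C, N)
        kv = torch.einsum('bdn,bcn->bdc', k, v)             # (B, D, C), O(N) cost
        z = 1.0 / (torch.einsum('bdn,bd->bn', q, k.sum(-1)) + 1e-6)  # normalizer
        out = torch.einsum('bdn,bdc,bn->bcn', q, kv, z)     # (B, C, N)
        return x + out.view(b, c, h, w)                     # residual fusion

y = LinearAttention(channels=64)(torch.randn(2, 64, 32, 32))
print(y.shape)  # torch.Size([2, 64, 32, 32])
```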

4 Experiment

4.1 Datasets

We evaluate the effectiveness of the proposed framework on two public 2D semantic labeling datasets, Vaihingen and Potsdam, which are provided by the International Society for Photogrammetry and Remote Sensing (ISPRS). The images and corresponding labels of both datasets are cropped into 512 × 512 patches.

4.2 Experimental Setting

To demonstrate the segmentation performance of the proposed UAM-Net, we compare it with seven state-of-the-art methods, including ABCNet (P&RS'21) [10], BANet (Remote Sensing'21) [21], MANet (TGRS'21) [9], DCSwin (GRSL'22) [20], MAResUnet (GRSL'21) [8], and UnetFormer (P&RS'22) [22]. For a fair comparison, these methods are trained, validated, and tested using the source codes provided by the original authors. All experiments are implemented with PyTorch on a single TITAN Xp. We adopt ResNet-18 as the backbone and initialize it with the model pre-trained on ImageNet. We train the network using AdamW with a batch size of 8, a learning rate of 0.0003, and a weight decay of 0.0025. The number of training epochs is 100 for the Vaihingen dataset and 80 for the Potsdam dataset. For each method, the soft cross-entropy loss function is used for training, and the performance on the two datasets is assessed using the F1 score (F1), Overall Accuracy (OA), and mean Intersection over Union (mIoU).
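For reference, the three reported metrics can be computed from a confusion matrix as in the sketch below. Note that the official ISPRS protocol additionally uses eroded boundaries and may exclude the clutter class, which this generic version does not handle.

```python
import numpy as np

def segmentation_metrics(pred, gt, num_classes):
    """Per-class F1, Overall Accuracy (OA) and mean IoU from a confusion matrix.
    pred and gt are integer label maps of identical shape."""
    mask = (gt >= 0) & (gt < num_classes)
    cm = np.bincount(num_classes * gt[mask].astype(int) + pred[mask],
                     minlength=num_classes ** 2).reshape(num_classes, num_classes)
    tp = np.diag(cm).astype(float)
    fp = cm.sum(axis=0) - tp          # predicted as class c but actually another class
    fn = cm.sum(axis=1) - tp          # pixels of class c that were missed
    f1 = 2 * tp / np.maximum(2 * tp + fp + fn, 1e-12)
    oa = tp.sum() / cm.sum()
    miou = np.mean(tp / np.maximum(tp + fp + fn, 1e-12))
    return f1, oa, miou

pred = np.random.randint(0, 6, (512, 512))
gt = np.random.randint(0, 6, (512, 512))
f1, oa, miou = segmentation_metrics(pred, gt, num_classes=6)
print(f1.round(3), round(oa, 3), round(miou, 3))
```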

4.3 Quantitative Results

From Table 1 and Table 2, we can observe that our proposed network outperforms almost all previous state-of-the-art results. In particular, UAM-Net achieves a mIoU of 83.1% on the Vaihingen dataset, the highest among all the listed methods. Additionally, UAM-Net shows competitive per-class F1 scores, achieving the highest F1 for three classes: "Low vegetation" (85.0%), "Tree" (90.2%), and "Car" (89.4%). In terms of OA, UAM-Net and DCSwin achieve the most satisfactory overall accuracy among all the methods (91.0% and 91.1%, respectively). Overall, UAM-Net appears to be a promising method for semantic segmentation.

Table 1. Quantitative Results on the Vaihingen Dataset

Method     | Imp.surf. | Building | Low veg. | Tree | Car  | Mean F1 (%) | OA (%) | mIoU (%)
ABCNet     | 92.2      | 94.6     | 83.7     | 89.9 | 87.2 | 89.5        | 90.2   | 81.3
BANet      | 92.2      | 94.7     | 83.5     | 89.6 | 87.5 | 89.5        | 90.2   | 81.2
MANet      | 91.9      | 95.0     | 83.4     | 89.6 | 87.9 | 89.6        | 90.1   | 81.4
UnetFormer | 93.0      | 95.4     | 84.4     | 89.9 | 88.0 | 90.1        | 91.0   | 82.7
MAResUnet  | 92.5      | 95.1     | 83.3     | 89.6 | 87.1 | 89.5        | 90.3   | 81.3
DCSwin     | 93.3      | 95.8     | 84.5     | 90.1 | 88.2 | 90.4        | 91.1   | 82.7
LANet      | 92.4      | 94.6     | 84.0     | 89.7 | 87.0 | 89.5        | 90.3   | 81.3
UAM-Net    | 92.9      | 95.5     | 85.0     | 90.2 | 89.4 | 90.6        | 91.0   | 83.1

(The five per-class columns report the F1 score in %.)

Table 2. Quantitative Results on the Potsdam Dataset

Method     | Imp.surf. | Building | Low veg. | Tree | Car  | Mean F1 (%) | OA (%) | mIoU (%)
ABCNet     | 92.3      | 96.5     | 87.5     | 89.0 | 95.6 | 92.2        | 90.5   | 85.7
BANet      | 92.7      | 96.0     | 87.0     | 88.5 | 95.7 | 92.0        | 90.5   | 85.4
MANet      | 93.1      | 96.2     | 87.3     | 88.7 | 96.1 | 92.3        | 90.9   | 85.9
UnetFormer | 93.4      | 96.9     | 87.7     | 88.7 | 96.5 | 92.6        | 91.2   | 86.5
MAResUnet  | 92.2      | 95.9     | 86.8     | 87.9 | 95.4 | 91.6        | 90.1   | 84.8
DCSwin     | 93.7      | 97.2     | 88.5     | 89.4 | 96.1 | 93.0        | 91.7   | 87.2
LANet      | 93.3      | 96.4     | 87.4     | 89.2 | 96.5 | 92.6        | 90.9   | 86.4
UAM-Net    | 93.5      | 96.6     | 88.0     | 89.1 | 96.4 | 92.7        | 91.3   | 86.7

(The five per-class columns report the F1 score in %.)

4.4 Qualitative Results

We provide visualizations of the segmentation results of the compared models in Fig. 3 to demonstrate the effectiveness of our proposed UAM-Net in handling challenging situations. The red boxes highlight regions where UAM-Net outperforms the other methods. In the first row, most networks incorrectly segment some cars as buildings. In contrast, UAM-Net accurately segments these cars as separate objects, which demonstrates its ability to capture detailed features and make precise distinctions between different object classes. In the second row of Fig. 3, it can be observed that UAM-Net distinguishes trees, low vegetation, and roads with blurred edges in shadows more accurately than the other networks. The boundaries between these classes are clearly defined by UAM-Net, indicating its robustness in handling challenging lighting conditions.

Fig. 3. Qualitative comparisons to previous state-of-the-art methods on the Vaihingen dataset (top) and Potsdam dataset (bottom). Obviously, compared to other methods, our approach is capable of extracting more discriminating features to better segment small objects and confusing objects in shadows.

Table 3. Complexity Comparisons on the Vaihingen Dataset

Method     | mIoU (%) | FLOPs (G) | Inference Time (s)
ABCNet     | 81.3     | 15.63     | 0.88
BANet      | 81.2     | 13.06     | 7.00
MANet      | 81.4     | 22.20     | 3.17
UnetFormer | 82.7     | 11.68     | 0.98
MAResUnet  | 81.3     | 25.42     | 3.30
DCSwin     | 82.7     | 70.15     | 5.84
LANet      | 81.3     | 33.24     | 2.01
UAM-Net    | 83.4     | 17.39     | 0.79

4.5 Efficiency Comparison

For real-time scene tasks, inference speed is a crucial metric. To measure the complexity of all methods, we further evaluate the average inference time per image and the floating point operations (FLOPs) on the Vaihingen dataset; the results are shown in Table 3 and Fig. 4. UAM-Net has a FLOPs value of 17.39 G, which is relatively low compared to some other methods such as DCSwin (70.15 G). Furthermore, UAM-Net achieves the lowest inference time among all the listed methods, 0.79 s per image, meaning that it can process images quickly enough for real-time tasks.
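Per-image inference time of this kind is typically measured as in the hedged sketch below; the warm-up count, number of runs, and input size are illustrative choices rather than the paper's exact protocol, and a tiny stand-in network keeps the snippet self-contained.

```python
import time
import torch
import torch.nn as nn

@torch.no_grad()
def average_inference_time(model, input_shape=(1, 3, 512, 512), runs=50, warmup=10):
    """Average forward-pass time per image, with GPU synchronization if available."""
    device = 'cuda' if torch.cuda.is_available() else 'cpu'
    model = model.to(device).eval()
    x = torch.randn(*input_shape, device=device)
    for _ in range(warmup):                      # warm up kernels / caches
        model(x)
    if device == 'cuda':
        torch.cuda.synchronize()
    start = time.perf_counter()
    for _ in range(runs):
        model(x)
    if device == 'cuda':
        torch.cuda.synchronize()
    return (time.perf_counter() - start) / runs

# Stand-in network; the real segmentation model would be passed instead.
print(f"{average_inference_time(nn.Conv2d(3, 6, 3, padding=1)):.4f} s/image")
```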


Fig. 4. Comparison with state-of-the-art methods in inference speed and FLOPs

4.6 Ablation Study

In this section, we systematically decompose our approach to examine the impact of each component. We adopt a U-shape architecture with a ResNet-18 backbone as the baseline. All ablation experiments are conducted on the ISPRS Vaihingen and Potsdam datasets; the comparison results are presented in Table 4.

Table 4. Quantitative Results of Different Configurations

Method                  | Vaihingen Mean F1 (%) / OA (%) / mIoU (%) | Potsdam Mean F1 (%) / OA (%) / mIoU (%)
Baseline                | 88.6 / 90.2 / 79.9                        | 92.3 / 90.6 / 85.5
Baseline+PPM            | 89.8 / 90.5 / 81.8                        | 92.4 / 90.8 / 85.9
Baseline+GCGM (PPM+GCF) | 90.0 / 90.7 / 82.1                        | 92.4 / 90.9 / 86.1
Baseline+GCGM+LAM       | 90.4 / 90.9 / 82.7                        | 92.6 / 91.2 / 86.5
Baseline+GCGM+LAM+TAM   | 90.6 / 91.0 / 83.1                        | 92.7 / 91.3 / 86.7

Ablation for the Global Context Guidance Module. The GCGM enables our network to use global prior knowledge to label pixels. The results in Table 4 show that this configuration greatly improves segmentation performance in terms of all metrics. Besides, using the modified PPM in the proposed network also brings an improvement of 0.9% in mIoU. However, the GCFs only improve the mIoU by 0.3%, because the features are fused directly without refinement.

Ablation for the Attention Modules. As mentioned above, we introduce lightweight attention modules for better feature fusion with fewer model parameters. The former (TAM) is used for the encoder and decoder features, and the latter (LAM) for the features from the GCGM and the decoder. Table 4 shows that these two modules improve the performance by 0.6% and 0.4% on the Vaihingen dataset, and by 0.4% and 0.2% on the Potsdam dataset. By comparing the visualization results in Column c and Column d of Fig. 5, we can easily observe that, with the GCGM, some indistinguishable pixels, such as buildings and impervious surfaces with similar colors, are classified better thanks to the global prior. The comparison of Column d and Column e shows that reasonable feature fusion with the attention modules can better segment small objects and further refine the segmentation results. These results demonstrate that the attention modules more fully exploit the features to improve segmentation performance.

Fig. 5. Visual comparisons for prediction with different configurations of GCGM and attention modules on the Vaihingen dataset and Potsdam dataset. (a) Image; (b) Ground truth; (c) Results of U-shape structure with a ResNet-18 backbone baseline; (d) Results of baseline + GCGM; (e) Results of our proposed UAM-Net.

5 Conclusion

In this paper, we propose an effective RSI semantic segmentation network, UAM-Net, which compensates for several shortcomings of the U-shape network to improve segmentation performance. We design a GCGM consisting of a modified PPM, which collects global context information, and a series of GCFs that transmit the context features to each decoder stage. Additionally, the TAM and the LAM are adopted to fuse different features reasonably; both attention modules are computation-effective. A wide range of experiments shows the effectiveness of UAM-Net, which achieves the best trade-off between complexity and accuracy.


Acknowledgements. This work is supported by Industry-University Cooperation Project of Fujian Science and Technology Department (No. 2021H6035), the Science and Technology Planning Project of Fujian Province (No. 2021J011191), and Fujian Key Technological Innovation and Industrialization Projects (No. 2023XQ023), and FuXia-Quan National Independent Innovation Demonstration Project (No. 2022FX4).

References 1. Bai, H., Cheng, J., Huang, X., Liu, S., Deng, C.: HCANet: a hierarchical context aggregation network for semantic segmentation of high-resolution remote sensing images. IEEE Geosci. Remote Sens. Lett. 19, 1–5 (2021) 2. Chen, L.C., Papandreou, G., Schroff, F., Adam, H.: Rethinking atrous convolution for semantic image segmentation. arXiv preprint arXiv:1706.05587 (2017) 3. Chen, L., et al.: SCA-CNN: spatial and channel-wise attention in convolutional networks for image captioning. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 5659–5667 (2017) 4. Diakogiannis, F.I., Waldner, F., Caccetta, P., Wu, C.: ResUNet-a: a deep learning framework for semantic segmentation of remotely sensed data. ISPRS J. Photogramm. Remote. Sens. 162, 94–114 (2020) 5. Fu, J., et al.: Dual attention network for scene segmentation. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 3146– 3154 (2019) 6. Griffiths, P., Nendel, C., Hostert, P.: Intra-annual reflectance composites from Sentinel-2 and Landsat for national-scale crop and land cover mapping. Remote Sens. Environ. 220, 135–151 (2019) 7. He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 770–778 (2016) 8. Li, R., Zheng, S., Duan, C., Su, J., Zhang, C.: Multistage attention resU-Net for semantic segmentation of fine-resolution remote sensing images. IEEE Geosci. Remote Sens. Lett. 19, 1–5 (2021) 9. Li, R., et al.: Multi attention network for semantic segmentation of fine-resolution remote sensing images. IEEE Trans. Geosci. Remote Sens. 60, 1–13 (2021) 10. Li, R., Zheng, S., Zhang, C., Duan, C., Wang, L., Atkinson, P.M.: ABCNet: attentive bilateral contextual network for efficient semantic segmentation of fineresolution remotely sensed imagery. ISPRS J. Photogramm. Remote. Sens. 181, 84–98 (2021) 11. Lin, T.Y., Doll´ ar, P., Girshick, R., He, K., Hariharan, B., Belongie, S.: Feature pyramid networks for object detection. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2117–2125 (2017) 12. Misra, D., Nalamada, T., Arasanipalai, A.U., Hou, Q.: Rotate to attend: convolutional triplet attention module. In: Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, pp. 3139–3148 (2021) 13. Oda, H., et al.: BESNet: boundary-enhanced segmentation of cells in histopathological images. In: Frangi, A.F., Schnabel, J.A., Davatzikos, C., Alberola-L´ opez, C., Fichtinger, G. (eds.) MICCAI 2018. LNCS, vol. 11071, pp. 228–236. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-00934-2 26 14. Park, J., Woo, S., Lee, J., Kweon, I.S.: BAM: bottleneck attention module. In: British Machine Vision Conference 2018, BMVC 2018, Newcastle, UK, September 3–6, 2018. p. 147. BMVA Press (2018)


15. Picoli, M.C.A., et al.: Big earth observation time series analysis for monitoring Brazilian agriculture. ISPRS J. Photogramm. Remote. Sens. 145, 328–339 (2018) 16. Ronneberger, Olaf, Fischer, Philipp, Brox, Thomas: U-Net: convolutional networks for biomedical image segmentation. In: Navab, Nassir, Hornegger, Joachim, Wells, William M.., Frangi, Alejandro F.. (eds.) MICCAI 2015. LNCS, vol. 9351, pp. 234–241. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-24574-4 28 17. Samie, A., et al.: Examining the impacts of future land use/land cover changes on climate in Punjab province, Pakistan: implications for environmental sustainability and economic growth. Environ. Sci. Pollut. Res. 27, 25415–25433 (2020) 18. Tong, X.Y., et al.: Land-cover classification with high-resolution remote sensing images using transferable deep models. Remote Sens. Environ. 237, 111322 (2020) 19. Wang, H., Wang, Y., Zhang, Q., Xiang, S., Pan, C.: Gated convolutional neural network for semantic segmentation in high-resolution images. Remote Sens. 9(5), 446 (2017) 20. Wang, L., Li, R., Duan, C., Zhang, C., Meng, X., Fang, S.: A novel transformer based semantic segmentation scheme for fine-resolution remote sensing images. IEEE Geosci. Remote Sens. Lett. 19, 1–5 (2022) 21. Wang, L., Li, R., Wang, D., Duan, C., Wang, T., Meng, X.: Transformer meets convolution: a bilateral awareness network for semantic segmentation of very fine resolution urban scene images. Remote Sens. 13(16), 3065 (2021) 22. Wang, L., et al.: UNetFormer: a UNet-like transformer for efficient semantic segmentation of remote sensing urban scene imagery. ISPRS J. Photogramm. Remote. Sens. 190, 196–214 (2022) 23. Woo, Sanghyun, Park, Jongchan, Lee, Joon-Young., Kweon, In So.: CBAM: convolutional block attention module. In: Ferrari, Vittorio, Hebert, Martial, Sminchisescu, Cristian, Weiss, Yair (eds.) ECCV 2018. LNCS, vol. 11211, pp. 3–19. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-01234-2 1 24. Yin, H., Pflugmacher, D., Li, A., Li, Z., Hostert, P.: Land use and land cover change in inner Mongolia-understanding the effects of China’s re-vegetation programs. Remote Sens. Environ. 204, 918–930 (2018) 25. Zhang, C., et al.: Joint deep learning for land cover and land use classification. Remote Sens. Environ. 221, 173–187 (2019) 26. Zhang, Z., Liu, Q., Wang, Y.: Road extraction by deep residual U-Net. IEEE Geosci. Remote Sens. Lett. 15(5), 749–753 (2018) 27. Zhao, H., Shi, J., Qi, X., Wang, X., Jia, J.: Pyramid scene parsing network. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2881–2890 (2017)

Improved Conditional Generative Adversarial Networks for SAR-to-Optical Image Translation

Tao Zhan, Jiarong Bian, Jing Yang, Qianlong Dang, and Erlei Zhang(B)

College of Information Engineering, Northwest A&F University, Yangling 712100, Shaanxi, China
{zhantao,jiarong98,yangjinghit,Dangqianlong,erlei.zhang}@nwafu.edu.cn

Abstract. Synthetic aperture radar (SAR) has the potential to operate effectively in all weather conditions, making it a desirable tool in various fields. However, the inability of untrained individuals to visually identify ground cover in SAR images poses challenges in practical applications such as environmental monitoring, disaster assessment, and land management. To address this issue, generative adversarial networks (GANs) have been used to transform SAR images into optical images, a technique commonly referred to as SAR-to-optical image translation. Despite their common use, traditional methods often generate optical images with color distortion and blurred contours. Therefore, a novel approach utilizing conditional generative adversarial networks (CGANs) is introduced as an enhanced method for SAR-to-optical image translation. A style-based calibration module is incorporated, which learns the style features of the input SAR images and matches them with the style of real optical images to achieve color calibration, thereby minimizing the differences between the generated output and real optical images. Furthermore, a multi-scale strategy is incorporated in the discriminator. Each branch of the multi-scale discriminator is dedicated to capturing texture and edge features at different scales, thereby enhancing the texture and edge information of the image at both local and global levels. Experimental results demonstrate that the proposed approach surpasses existing image translation techniques in terms of both visual effects and evaluation metrics.

Keywords: Optical images · Synthetic Aperture Radar (SAR) · SAR-to-optical translation · Conditional Generative Adversarial Networks (CGANs)

1 Introduction

Synthetic aperture radar (SAR) images offer high-resolution radar images with penetrating capabilities, containing valuable information such as geomorphological structures and surface features.

(This work was supported by the National Natural Science Foundation of China (No. 62006188, No. 62103311); the Chinese Universities Scientific Fund (No. 2452022341); and the QinChuangyuan High-Level Innovation and Entrepreneurship Talent Program of Shaanxi (No. 2021QCYRC4-50).)


Due to these unique characteristics, SAR images have become indispensable in many applications, such as military [22], agriculture [6], and environmental monitoring [1]. Radar is a valuable remote sensing instrument, but its ability to discriminate land cover types and extract meaningful information from SAR images is severely limited by the presence of speckle noise. These obstacles pose significant challenges to the widespread application and promotion of SAR imagery. In contrast, optical satellites have emerged as an indispensable tool for Earth observation. Optical images are generally rich in color detail and information, which makes them well suited to applications that require high precision, such as scene classification [25], image denoising [4], and change detection [24]. Consequently, converting SAR images to optical images makes land-type identification easier for non-experts and expands the applicability of SAR images in various domains.

Recently, SAR-to-optical image translation has garnered considerable attention, and many methods have been proposed. Conventional techniques generally rely on colorization to translate SAR imagery; such methods can only separate ground objects and fail to accurately restore the real ground truth. More recently, researchers have proposed many deep learning-based methods for image translation, among which methods based on generative adversarial networks (GANs) [7] have shown impressive performance, leading to an increasing number of studies that use GAN as the fundamental model [3]. Nevertheless, the generation process of GAN-based image translation is typically uncontrollable, so additional constraints must be incorporated into the network to achieve the intended result. Bermudez et al. [14] proposed a technique utilizing Pix2Pix [8] networks to translate optical images into SAR images. Similarly, Merkle et al. [10] utilized the generated images for image matching and explored the translation of SAR images into optical images. Wang et al. [18] converted optical images to SAR images using CycleGAN [27] networks, which maintain the accuracy and stability of the conversion by performing cycle-consistency training between the two domains. Ma et al. [20] proposed a hybrid CGAN [11] coupling global and local features for SAR-to-optical image translation. Nevertheless, this task remains challenging because the translation model must effectively extract image information while maintaining precise details amidst strong interference, such as the inevitable speckle noise. Unfortunately, the aforementioned algorithms suffer from contour blurring, texture loss, and color distortion.

To resolve these issues, a novel model for SAR-to-optical image translation is introduced. The main contributions of this work are as follows:

1. A style-based calibration module is introduced to enhance the translation quality and stability by effectively capturing image style features.
2. A multi-scale discriminator is proposed to improve the texture details of generated images by capturing features of different sizes using distinct receptive fields, thereby generating high-quality images with rich details and styles.


3. The impressive results obtained on different datasets demonstrate that the proposed method is more effective than other existing techniques at preserving image textures and preventing color distortion.

The remaining sections are structured as follows: Sect. 2 presents an overview of related work. Section 3 provides a detailed description of the improved CGAN model. Section 4 introduces the experimental dataset and analyzes the experimental results. Finally, Sect. 5 summarizes the paper and presents future work plans.

2 Related Work

2.1 Descriptions of GAN

In 2014, GAN was introduced by Goodfellow et al. [7], showcasing its inherent capacity for data generation. Since then, with the continuous progress of deep learning, GAN has become increasingly significant in numerous computer vision applications, including image super-resolution [12], style transfer [26], semantic segmentation [23], image compression [21], and object detection [15]. A GAN consists of two networks: a generator and a discriminator. The generator's primary task is to produce synthetic data from random noise, whereas the discriminator is responsible for distinguishing between real and fake data. Through adversarial training, the generator learns to produce more realistic data while the discriminator improves its ability to discriminate between real and fake data.

Despite being a powerful unsupervised learning method, GAN has limited application due to the lack of direct control over its generated results. To address this issue, CGAN was proposed. CGAN differs from standard GAN by incorporating additional conditional information, such as labels, text descriptions, or images, into the generator and discriminator. This allows CGAN to control the generated content during image generation, resulting in more controllable results. Inspired by this, the Pix2Pix [8] image translation model was developed and validated on multiple datasets to demonstrate its effectiveness. On the framework of Pix2Pix, a modified method dubbed Pix2PixHD [19] was proposed to better handle high-resolution images and generate more precise details. Recently, a novel unsupervised CGAN model called NiceGAN [2] was developed, which achieves outstanding performance by reusing the image encoding process.

2.2 SAR-to-Optical Image Translation

SAR-to-optical image translation is a fascinating and rapidly evolving field within remote sensing research. Over the past years, a significant amount of research has been dedicated to developing effective methods for generating high-quality optical images from SAR images. To cater to the needs of visual recognition, SAR image adjustment techniques are broadly categorized into two types: image improvement algorithms and coloring algorithms.


Fig. 1. Framework of the proposed method. RF stands for the receptive field.

However, these traditional techniques are limited in the image quality and interpretability they can achieve. As a result, researchers have turned to deep learning techniques to overcome these challenges. In particular, the emergence of GAN provides novel directions for research on SAR-to-optical image translation. Schmitt et al. [16] introduced the SEN1-2 dataset, a valuable resource for deep learning studies on the fusion of SAR and optical images, using Pix2Pix for SAR-to-optical image translation. Fu et al. [5] introduced a cascaded-residual adversarial network as a novel approach for bidirectional translation between SAR and optical remote sensing images. Wang et al. [17] proposed a method for translating SAR images to optical images that makes use of hierarchical latent features; by incorporating a hierarchical feature extractor, this approach enhances the capability to capture information between SAR and optical images. Niu et al. [13] provided a solution for cross-modality image translation in remote sensing using the Pix2Pix network. However, most of these methods still suffer from limitations, including blurred boundaries, loss of spatial information, and color inaccuracies, making it difficult to handle speckle noise and adapt to a diverse range of image styles.

3 Method

This paper introduces an improved CGAN model for SAR-to-optical image translation, as shown in Fig. 1. This section provides a comprehensive explanation of the generator and discriminator architectures, as well as the loss function employed in the model.

3.1 The Generator with Style-Based Recalibration Module

A novel generator framework is introduced, as illustrated in Fig. 2. The input image is encoded using a three-layer convolutional encoder module. Meanwhile, the decoder module, which includes one convolutional layer and two fractionally strided convolutions, is utilized for image reconstruction. During the translation


Fig. 2. The architecture of our generator.

process, the transformation module integrates 9 residual blocks to manipulate the feature maps. The generator framework in this study includes a style-based recalibration module (SRM) [9] within each residual block to enhance the capture of style features in the feature space. The SRM architecture consists of two main components: style pooling and style integration. The style pooling operator aggregates the feature responses across spatial dimensions to extract style features from each channel, enabling a more comprehensive representation of style information from SAR images. The style integration operator employs these style features in channel-level operations to generate style weights that are specific to each sample. These style weights subsequently recalibrate the feature maps by amplifying or suppressing their information, resulting in a unified style representation. The SRMConvBlock, which incorporates SRM, is the residual block illustrated in Fig. 3.

The style pooling operation takes the mean and standard deviation of each feature map as the style feature. Given the input feature map $F \in \mathbb{R}^{C\times H\times W}$, C denotes the number of channels, while H and W represent the height and width, respectively; $f_{c,i,j}$ denotes the activation value of the c-th channel at position (i, j) in F. The style feature $T \in \mathbb{R}^{C\times 2}$ is calculated as follows:

$\mu_c = \frac{1}{HW}\sum_{i=1}^{H}\sum_{j=1}^{W} f_{c,i,j}$  (1)

$\sigma_c = \sqrt{\frac{1}{HW}\sum_{i=1}^{H}\sum_{j=1}^{W} (f_{c,i,j} - \mu_c)^2}$  (2)

$t_c = [\mu_c, \sigma_c]$  (3)

here, $t_c$ represents a style vector that provides a concise representation of the style information of channel c, formed by joining the mean $\mu_c$ and the standard deviation $\sigma_c$. During style integration, the style features are converted into channel-level style weights, which adjust the significance of the styles of each channel and facilitate the subsequent feature recalibration. Previous studies have utilized


Fig. 3. The architecture of SRMConvBlock.

a fully-connected layer in this module to implement style integration, but such a design leads to a heavy computational burden. To overcome this issue, style integration is performed with a 1-D convolutional layer that has far fewer parameters, which reduces the computational cost while maintaining the performance of the model. The style integration module also includes a sigmoid activation function. Using the output $t_c$ obtained from style pooling as input, the style integration module computes the style integration weights $S \in \mathbb{R}^{C\times 1}$ for each channel as follows:

$s_c = \sigma(k * t_c)$  (4)

where $s_c$ denotes the weight of the c-th channel, $\sigma(\cdot)$ is the sigmoid activation function, k is the convolution kernel, and $*$ denotes the convolution operation. The input features $F \in \mathbb{R}^{C\times H\times W}$ are finally recalibrated by the channel-wise style integration weights $S \in \mathbb{R}^{C\times 1}$ to obtain the output:

$\hat{F} = S \cdot F$  (5)

the output $\hat{F} \in \mathbb{R}^{C\times H\times W}$ is obtained by channel-wise multiplication of the style integration weights S and the input features F.
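As a concrete illustration of Eqs. (1)-(5), the following is a minimal PyTorch sketch of such a style-based recalibration step, assuming per-channel mean and standard deviation as the style features, a channel-wise 1-D convolution for style integration, and a sigmoid gate; the module and variable names are illustrative and are not taken from the authors' code.

```python
import torch
import torch.nn as nn

class StyleRecalibration(nn.Module):
    """Sketch of SRM-style recalibration: style pooling + 1-D conv style integration."""
    def __init__(self, channels: int):
        super().__init__()
        # One channel-wise 1-D convolution mixes the (mean, std) style pair of each channel.
        self.style_integration = nn.Conv1d(channels, channels, kernel_size=2,
                                           groups=channels, bias=False)
        self.gate = nn.Sigmoid()

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        b, c, h, w = x.shape
        # Style pooling: per-channel mean and standard deviation (Eqs. 1-3).
        mu = x.mean(dim=(2, 3))                     # (B, C)
        sigma = x.std(dim=(2, 3), unbiased=False)   # (B, C)
        t = torch.stack([mu, sigma], dim=-1)        # (B, C, 2) style features
        # Style integration: 1-D conv + sigmoid -> per-channel weights s_c (Eq. 4).
        s = self.gate(self.style_integration(t))    # (B, C, 1)
        # Recalibration: channel-wise rescaling of the input features (Eq. 5).
        return x * s.view(b, c, 1, 1)

# usage
feat = torch.randn(2, 64, 32, 32)
out = StyleRecalibration(64)(feat)
```

The grouped 1-D convolution is what keeps the parameter count low compared with a fully-connected style-integration layer.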

3.2 Multi-scale Discriminator

A multi-scale discriminator is employed to improve the model's discriminative capacity. The discriminator is composed of an encoder and three compact classifiers that correspond to local, medium, and global receptive fields (RFs) at varying scales, respectively. The loss function for the multi-scale discriminator can be formulated as:

$L_{D(p=\mathrm{Real},\mathrm{Fake})} = \sum_{q=l,m,g} \frac{\|K_q - p_q\|_2^2}{3}$  (6)


Table 1. Selected categories of scenes from the SEN1-2 dataset.

Scene        Training Set   Testing Set
Vegetation   900            225
Desert       702            176
Residence    840            210
Farmland     1111           278
Mountain     801            180
Others       1536           384
Total        5890           1453

here, the discriminator classifies the results as true or false, and the receptive fields at the local, medium, and global scales are denoted by $K_l$, $K_m$, and $K_g$, with sizes of 70 × 70, 140 × 140, and 280 × 280, respectively. Furthermore, $p_q$ represents the feature representation of real or generated samples at the corresponding receptive-field scale q. By extracting features at multiple scales, the multi-scale discriminator enhances the texture of the generated images at both local and global levels and improves the accuracy of the network.
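One possible reading of Eq. (6) is a least-squares objective averaged over the three branches; the sketch below assumes each branch returns a prediction map and that the targets are all-ones for real inputs and all-zeros for fake inputs, which is an interpretation rather than the authors' exact formulation.

```python
import torch
import torch.nn.functional as F

def multiscale_d_loss(branch_outputs, is_real: bool) -> torch.Tensor:
    """Least-squares loss averaged over the local/medium/global branches (one reading of Eq. 6)."""
    losses = []
    for k in branch_outputs:  # e.g. [K_l, K_m, K_g], one prediction map per scale
        target = torch.ones_like(k) if is_real else torch.zeros_like(k)
        losses.append(F.mse_loss(k, target))
    return torch.stack(losses).mean()

# usage: three prediction maps corresponding to 70x70 / 140x140 / 280x280 receptive fields
outs = [torch.rand(2, 1, 30, 30), torch.rand(2, 1, 14, 14), torch.rand(2, 1, 6, 6)]
loss_real = multiscale_d_loss(outs, is_real=True)
```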

Loss Functions

The model utilizes two loss functions in the full objective. Adversarial Loss. The purpose of this loss is to align the distribution of the generated optical images with that of the real optical images, thereby reducing the average pixel difference between them, and it is defined as follows: min max LCGAN (G, D) = E[log(D(y, x))] + E[log(1 − D(y, G(x)))] G

D

(7)

here, G donates the generator, and D is the discriminator. E donates the calculation of the expected value of a random variable. x donates the SAR image and y donates the optical image. D(y, G(x)) is the discriminator’s output when it classifies the generated image G(x), which is the result of transforming the SAR image x into the optical domain, along with the optical image y. L1 Loss. This loss helps the model in capturing the real color distribution and reducing the average pixel difference between the generated and real optical images. It is defined as follows: LL1 (G) = E[||G(x) − y||1 ]

(8)

Full Objective. In conclusion, the full objective of the method is defined as: Ltotal = LCGAN (G, D) + λ · LL1 (G) where λ is the hyperparameter that control the importance of LL1 (G).

(9)

286

T. Zhan et al. Table 2. Impact of hyperparameter λ on evaluation metrics for LL1 .

4 4.1

λ

PSNR↑ SSIM↑ MSE↓ VIF↑

10 20 30 40 50 60 70 80 90 100 110 120

14.9326 15.0824 15.3679 15.6399 15.8698 15.9764 16.0382 16.1653 16.3824 16.6164 16.6052 16.5883

0.2893 0.3158 0.3121 0.3369 0.3277 0.3464 0.3353 0.3472 0.3563 0.3754 0.3649 0.3602

0.0274 0.0265 0.0283 0.0278 0.0289 0.0212 0.0213 0.0228 0.0202 0.0186 0.0197 0.0189

0.3107 0.3140 0.3122 0.3120 0.3134 0.3155 0.3175 0.3172 0.3186 0.3291 0.3279 0.3266

Experiments Experimental Setup

Dataset. The SEN1-2 dataset comprises SAR images and multispectral optical images acquired globally throughout the year 2017. All images are 256 × 256 pixels in size with a spatial resolution of 10 meters. The dataset is divided into two parts: Sentinel-1 and Sentinel-2. Sentinel-1 includes fully polarized and monopolized SAR images, while Sentinel-2 consists of multi-spectral and highresolution optical images. A total of 5890 SAR-optical image pairs were randomly sampled from the SEN1-2 dataset for training the model. Additionally, 1453 pairs were allocated as test data for evaluation purposes. The selected images represent diverse land types and topography to ensure the generalizability of the proposed method. More detailed information about the dataset can be found in Table 1. Evaluation Metrics. Four metrics were used to evaluate the proposed model’s performance: peak signal-to-noise ratio (PSNR), structural similarity measure (SSIM), mean square error (MSE), and visual information fidelity (VIF). These metrics provide quantitative assessments of the generated image quality from various aspects. Higher PSNR, SSIM, and VIF values, and lower MSE values indicate better image quality. Implementation Details. The experiments were performed using PyTorch on a computer with an Intel(R) Xeon(R) Platinum 8124M CPU, 64GB of RAM, and an NVIDIA GeForce RTX 3090 graphics card. The network was trained using Adam optimization with a learning rate of 0.0002, β1 of 0.9, and β2 of 0.999.

Improved Conditional Generative Adversarial Networks

287

Fig. 4. Visualization results of SAR-to-optical image translation of different models.

The batch size was set to 8, and the model was trained for 200 epochs. The same hardware setup and parameter settings provided by the authors’ official code were used for the comparison with other algorithms. 4.2

Experimental Analysis

In this section, the influence of hyperparameters, ablation study, and experimental results will be presented. Impact of Hyperparameters. Table 2 presents the results of the evaluation metrics for different values of λ. As shown in the table, it can be observed that when λ is set to 100, all evaluation metrics achieve their optimal values. This value achieves a balance between the adversarial loss and the L1 loss, enabling the model to generate visually accurate outputs while preserving important input features. Based on these results, the value of λ was set to 100. Ablation Study. To validate the effectiveness of each component of the proposed method, SRM and multi-scale discriminator, ablation experiments were conducted. All experiments were performed on the same dataset. The results are shown in Table 3. Analyzing the various evaluation metrics in the table reveals

288

T. Zhan et al. Table 3. Performance comparison of different configurations. SRM Multi-scale D PSNR↑

SSIM↑

MSE↓

VIF↑

×

×

14.0318

0.2696

0.0261

0.2472



×

15.2362

0.3374

0.0201

0.2968

×



15.3279

0.3468

0.0197

0.3042





16.6164 0.3754 0.0186 0.3291

that both SRM and multi-scale discriminator have a positive impact on enhancing image translation quality. Furthermore, the combination of SRM and multiscale discriminator exhibits the optimal performance, affirming the significance of the synergistic effect between these two components in significantly enhancing image translation results. Qualitative Evaluation. Figure 4 presents the results of all methods using the same input image across different categories, with red boxes emphasizing important regions. Comparing the first and second rows, the proposed method stands out in contour preservation. The desert scene generated in the second row closely mirrors the original image’s contours, whereas alternative methods display issues like blurry and distorted edges. Furthermore, in the fourth and fifth rows, the generated images exhibit minimal color disparity from the real images. Notably, the proposed method enhances texture features while maintaining color precision, evident in the third and sixth rows. The riverbank’s contours in the third row and the river’s color closely resemble the original image. Overall, the proposed approach produces more authentic images with vivid colors, sharper edges, and enhanced textures in contrast to alternative methods. Quantitative Evaluation. Table 4 presents the evaluation results for different scenarios in the SEN1-2 dataset. The proposed method surpasses other algorithms in quantitative evaluation of the SEN1-2 dataset. It consistently achieves higher PSNR values, up to a maximum improvement of 9.1559, indicating enhanced accuracy in reconstructing pixel intensities. Additionally, it obtains lower MSE values, with a maximum reduction of 0.0823, showcasing better resemblance to the ground truth images. Moreover, the proposed method attains the highest VIF values, with a maximum improvement of 0.4144, emphasizing its superior preservation and reproduction of visible details. Overall, the proposed SAR-to-optical translation model exhibits superior performance across various scenarios, affirming its practical efficacy.

Improved Conditional Generative Adversarial Networks

289

Table 4. Results of evaluation metrics on different scenarios of the SEN1-2 dataset. Scene

5

PSNR↑

SSIM↑

MSE↓

VIF↑

Vegetation CylceGAN Pix2Pix Pix2PixHD Ours

Method

12.8663 12.7919 12.9476 13.9593

0.1491 0.1149 0.1378 0.1382

0.0513 0.0469 0.0419 0.0414

0.1997 0.0777 0.0909 0.2038

Desert

CylceGAN Pix2Pix Pix2PixHD Ours

13.4822 11.2846 16.5809 19.0318

0.2681 0.3145 0.4563 0.4670

0.0392 0.0758 0.0132 0.0101

0.4532 0.5584 0.7392 0.8679

Residence

CylceGAN Pix2Pix Pix2PixHD Ours

13.0263 12.8637 13.9332 15.6138

0.1260 0.1651 0.1872 0.1884

0.0653 0.0607 0.0336 0.0303

0.0746 0.0498 0.0632 0.1161

Farmland

CylceGAN Pix2Pix Pix2PixHD Ours

10.2489 11.0748 13.9058 16.0567

0.1391 0.2148 0.3360 0.3356

0.0990 0.0916 0.0337 0.0260

0.0844 0.0717 0.1169 0.2481

Mountain

CylceGAN Pix2Pix Pix2PixHD Ours

9.2646 11.9893 17.4524 18.4205

0.3119 0.2061 0.3405 0.3607

0.0938 0.0516 0.0118 0.0115

0.0319 0.0570 0.1387 0.2096

Conclusion

This paper introduces a novel improved CGAN model for SAR-to-optical image translation. The proposed model incorporates a style-based calibration module to capture style features and mitigate color distortion in the generated images. Additionally, a multi-scale discriminator is introduced to enhance the sharpness of the image contours. The proposed method is evaluated through comprehensive qualitative and quantitative assessments, demonstrating superior performance compared to existing state-of-the-art methods. Future research will focus on exploring new network architectures to further enhance the quality of image-toimage translation.

290

T. Zhan et al.

References 1. Ardila, J., Laurila, P., Kourkouli, P., Strong, S.: Persistent monitoring and mapping of floods globally based on the iceye sar imaging constellation. In: IGARSS 2022, pp. 6296–6299 (2022) 2. Chen, R., Huang, W., Huang, B., Sun, F., Fang, B.: Reusing discriminators for encoding: towards unsupervised image-to-image translation. In: CVPR 2020, pp. 8165–8174 (2020) 3. Fang, Y., Deng, W., Du, J., Hu, J.: Identity-aware CycleGAN for face photo-sketch synthesis and recognition. PR 102, 107249 (2020) 4. Feng, L., Wang, J.: Research on image denoising algorithm based on improved wavelet threshold and non-local mean filtering. In: ICSIP 2021, pp. 493–497 (2021) 5. Fu, S., Xu, F., Jin, Y.Q.: Reciprocal translation between SAR and optical remote sensing images with cascaded-residual adversarial networks. Sci. China Inf. Sci. 64, 122301 (2021) 6. Gonzalez-Audicana, M., Lopez-Saenz, S., Arias, M., Sola, I., Alvarez-Mozos, J.: Sentinel-1 and sentinel-2 based crop classification over agricultural regions of navarre (spain). In: IGARSS 2021, pp. 5977–5980 (2021) 7. Goodfellow, I.J., et al.: Generative adversarial networks (2014). arXiv:1406.2661 8. Isola, P., Zhu, J.Y., Zhou, T., Efros, A.A.: Image-to-image translation with conditional adversarial networks (2018), arXiv:1611.07004 9. Lee, H., Kim, H.E., Nam, H.: Srm: a style-based recalibration module for convolutional neural networks. In: ICCV 2019, pp. 1854–1862 (2019) 10. Merkle, N., Auer, S., M¨ uller, R., Reinartz, P.: Exploring the potential of conditional adversarial networks for optical and SAR image matching. IEEE J. Sel. Top. Appl. Earth Observ. Remote Sens. 11(6), 1811–1820 (2018) 11. Mirza, M., Osindero, S.: Conditional generative adversarial nets (2014). arXiv:1411.1784 12. Naik, D.A., Sangeetha, V., Sandhya, G.: Generative adversarial networks based method for generating photo-realistic super resolution images. In: ETI 4.0 2021, pp. 1–6 (2021) 13. Niu, X., Yang, D., Yang, K., Pan, H., Dou, Y.: Image translation between highresolution remote sensing optical and SAR data using conditional GAN. In: PCM 2018, pp. 245–255 (2018) 14. Vishwakarma, D.K.: Comparative analysis of deep convolutional generative adversarial network and conditional generative adversarial network using hand written digits. In: ICICCS 2020, pp. 1072–1075 (2020) 15. Prakash, C.D., Karam, L.J.: It gan do better: GAN-Based detection of objects on images with varying quality. IEEE Trans. Image Process. 30, 9220–9230 (2021) 16. Schmitt, M., Hughes, L.H., Zhu, X.X.: The SEN1-2 dataset for deep learning in SAR-optical data fusion (2018). arXiv:1807.01569 17. Wang, H., Zhang, Z., Hu, Z., Dong, Q.: SAR-to-optical image translation with hierarchical latent features. IEEE Trans. Geosci. Remote Sens. 60, 1–12 (2022) 18. Wang, L., et al.: SAR-to-optical image translation using supervised cycle-consistent adversarial networks. IEEE Access 7, 129136–129149 (2019) 19. Wang, T.C., Liu, M.Y., Zhu, J.Y., Tao, A., Kautz, J., Catanzaro, B.: Highresolution image synthesis and semantic manipulation with conditional GANs. In: CVPR 2018, pp. 8798–8807 (2018) 20. Wang, Z., Ma, Y., Zhang, Y.: Hybrid cgan: coupling global and local features for sar-to-optical image translation. IEEE Trans. Geosci. Remote Sens. 60, 1–16 (2022)

Improved Conditional Generative Adversarial Networks

291

21. Wu, M., He, Z., Zhao, X., Zhang, S.: Generative adversarial networks-based image compression for consumer photo storage. In: GCCE 2019, pp. 333–334 (2019) 22. Xiao, M., He, Z., Lou, A., Li, X.: Center-to-corner vector guided network for arbitrary-oriented ship detection in synthetic aperture radar images. In: ICGMRS 2022, pp. 293–297 (2022) 23. Yang, H., Zhang, J.: Self-attentive semantic segmentation model based on generative adversarial network. In: AICIT 2022, pp. 1–5 (2022) 24. Zhan, T., Gong, M., Jiang, X., Zhao, W.: Transfer learning-based bilinear convolutional networks for unsupervised change detection. Remote Sens. Lett. 19, 1–5 (2022) 25. Zhang, E., et al.: Attention-embedded triple-fusion branch CNN for hyperspectral image classification. Remote Sens. 15(8) (2023) 26. Zhao, L., Jiao, Y., Chen, J., Zhao, R.: Image style transfer based on generative adversarial network. In: ICCNEA 2021, pp. 191–195 (2021) 27. Zhu, J.Y., Park, T., Isola, P., Efros, A.A.: Unpaired image-to-image translation using cycle-consistent adversarial networks (2020). arXiv:1703.10593

A Novel Cross Frequency-Domain Interaction Learning for Aerial Oriented Object Detection Weijie Weng1 , Weiming Lin1(B) , Feng Lin1 , Junchi Ren2 , and Fei Shen3 1

School of Opto-Electronic and Communication Engineering, Xiamen University of Technology, Xiamen 361024, China [email protected], {linwm,linfeng}@xmut.edu.cn 2 China Telecom Co., Ltd., Beijing, China [email protected] 3 Tencent AI Lab, Shenzhen, China

Abstract. Aerial oriented object detection is a vital task in computer vision, receiving significant attention for its role in remote image understanding. However, most Convolutional Neural Networks (CNNs) methods easily ignore the frequency domain because they only focus on the spatial/channel interaction. To address these limitations, we propose a novel approach called Cross Frequency-domain Interaction Learning (CFIL) for aerial oriented object detection. Our method consists of two modules: the Extraction of Frequency-domain Features (EFF) module and the Interaction of Frequency-domain Features (IFF) module. The EFF module extracts frequency-domain information from the feature maps, enhancing the richness of feature information across different frequencies. The IFF module facilitates efficient interaction and fusion of the frequency-domain feature maps obtained from the EFF module across channels. Finally, these frequency-domain weights are combined with the time-domain feature maps. By enabling full and efficient interaction and fusion of EFF feature weights across channels, the IFF module ensures effective utilization of frequency-domain information. Extensive experiments are conducted on the DOTA V1.0, DOTA V1.5, and HRSC2016 datasets to demonstrate the competitive performance of the proposed CFIL in the aerial oriented object detection. Our code and models will be publicly released. Keywords: Oriented object detection Interaction

1

· Frequency-domain ·

Introduction

Aerial oriented object detection [6–8,20] is an increasingly important task in the field of computer vision. It involves identifying and locating targets [19,21,26], This work was supported by the Natural Science Foundation of Fujian Province of China under Grant 2022J011271, the Foundation of Educational and Scientific Research Projects for Young and Middle-aged Teachers of Fujian Province under Grant JAT200471, as well as the High-level Talent Project of Xiamen University of Technology under Grant YKJ20013R. c The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2024  Q. Liu et al. (Eds.): PRCV 2023, LNCS 14428, pp. 292–305, 2024. https://doi.org/10.1007/978-981-99-8462-6_24

A Novel Cross Frequency-Domain Interaction Learning

293

Fig. 1. Comparisons of different approaches for feature extraction from feature maps. We obtain richer frequency domain features by crossing frequency domains and interact fully and efficiently without dimension reduction.

such as vehicles, buildings, or other objects, in aerial images captured from varying heights and angles. The significance of this task is attributed to its broad range of applications, including surveillance, security, and disaster management. However, aerial object detection presents a formidable challenge since it includes objects exhibiting diverse scales and orientations, densely packed distributions amidst complex backgrounds, and variations in target appearance caused by changes in illumination, pose, or occlusion. Considerable progress is currently being made in detecting rotated objects. Specifically, extensive analysis is being conducted on various representations of rotated objects [6–8,12,20,37] and the development of more appropriate loss functions for these representations [3]. Furthermore, substantial exploration is being undertaken to extract rotated regions of interest [34] and assign labels [16]. Additionally, comprehensive studies are being conducted on the structure of detection networks, including the neck [38] and head [6] of detectors, as well as rotated region proposal networks [34]. However, limited attention is being given to acquiring additional feature information of target objects. The existing studies on the identification of directional targets in aerial images predominantly employ CNNs for extracting target features [22,32,35,36]. As shown in Fig. 1 (a), CNNs capture powerful image descriptions by utilizing stacked convolutional layers, interspersed with non-linearities and downsampling, to capture hierarchical patterns with global receptive fields [24,27–29]. For example, Oriented R-CNN [34] employs a lightweight fully convolutional

294

W. Weng et al.

network to extract features and generate high-quality oriented proposals. Oriented RepPoints [12] utilizes deformable convolutional networks that incorporate spatial deformations to enhance feature representation and adaptability. However, CNNs have limitations when it comes to extracting feature information from the frequency domain. Moreover, the independent nature of the channels within CNN-based networks [23,25] restricts their ability to effectively interact with one another. Channel attention has gained popularity in the deep learning community due to its simplicity and effectiveness in feature modeling. This approach directly learns to assign importance weights to different channels, serving as a powerful tool, as shown in Fig. 1 (b). For instance, SCRDet [39] utilizes Squeeze-and-Excitation Network (SENet) [9] to enhance the model’s feature capturing capability by leveraging channel attention mechanisms, leading to improved object detection performance. However, including channel attention mechanisms does not guarantee a direct and optimal mapping relationship between inputs and outputs, which may result in degrading the resulting quality. Although adjusting the weights of individual channels can identify the targets that require attention, it may not capture the entirety of the feature information. Additionally, representing attention weights using scalar values can lead to the loss of valuable and nuanced information. For that, this paper presents a novel Cross Frequency-domain Interaction Learning (CFIL) for aerial oriented object detection, as shown in Fig. 1 (c). CFIL comprises two key modules: The Extraction of Frequency-domain Features (EFF) module and the Interaction of Frequency-domain Features (IFF) module. EFF extracts frequency-domain information from the input feature maps, enriching the understanding of aerial objects. By incorporating frequencydomain analysis, we capture diverse feature information across different frequencies. IFF facilitates efficient interaction and fusion of frequency-domain feature maps obtained from the EFF module. It enables cross-channel interactions, producing feature weights optimized for the frequency domain. This module ensures effective utilization of frequency-domain information by enabling complete and efficient interaction of EFF feature weights. The frequency-domain weights obtained from the IFF are combined with the time-domain feature maps, resulting in a more robust representation for aerial object detection. This fusion process integrates complementary information from both domains. This paper makes the following contributions: • We propose a plug-and-play Cross Frequency-domain Interaction Learning (CFIL) for aerial oriented object detection that addresses the interaction of frequency domain problem. • We present a new Extraction of Frequency-domain Features (EFF) module to extract frequency-domain information from the feature maps, enhancing the richness of feature information across different frequencies. • We devise a new Interaction of Frequency-domain Features (IFF) module to facilitate efficient interaction and fusion of the frequency-domain feature maps obtained from the EFF module across channels.

A Novel Cross Frequency-Domain Interaction Learning

295

Fig. 2. Pipline of CFIL. The pipline comprises two modules: the EFF module and IFF module, represented by the light blue region and the light purple separately. (Color figure online)

2 2.1

Proposed Method Extraction of Frequency-Domain Features Module

Let x2D ∈ RH×W be a two-dimensional input, where x2D i,j represents the pixel value at position (i, j) in the input. where i represents the row index and j represents the column index of the input, i ∈ {0, 1, . . . , H −1}, j ∈ {0, 1, . . . , W − 1}. The 2D discrete cosine transform (2D DCT) is denoted as Eq. 1, as follows, 2d fh,w

=

−1 H−1 W 

 x2D i,j

cos

i=0 j=0

πh H



1 i+ 2



 cos

πw W



1 j+ 2

 ,

(1)

where f 2D ∈ RH×W is the 2D DCT frequency spectrum, h and w represent the positions in the frequency spectrum of the 2D DCT, h ∈ {0, 1, . . . , H − 1}, 2D 2D w ∈ {0, 1, . . .  , W −1}, H is the  height of x , and W is the width of x . πh 1 πw 1 cos H i + 2 cos W j + 2 is defined as the basis function for 2D DCT. i,j Let the basis function in Eq. 1 for 2D DCT be denoted as Bh,w , then the 2D DCT can be written as Eq. 2. 2D fh,w =

H−1 −1 W 

i,j x2D i,j Bh,w

(2)

i=0 j=0

When both h and w in Eq. 1 are zero, it is equivalent to Global Average Pooling h−1  w−1  1 (GAP), defined as GAP(x) = h·w xij . The result of GAP is directly i=0 j=0

proportional to the lowest frequency component of 2D DCT. This indicates that

296

W. Weng et al.

GAP is a special case of 2D DCT, and utilizing GAP in the channel attention mechanism results in the preservation of only the lowest frequency information, discarding all components from other frequencies. More features in the frequency domain can be obtained by using other more frequency components of 2D DCT. Let X ∈ RC×H×W be the image feature tensor in the networks. Initially, the input X is divided equally into multiple segments along the channel dimension, As indicated by the light blue section in Fig. 2. Let [X0 , X1 , . . . , Xn−1 ] denote these segments, where n is the number of equally split, Xi ∈ RC×H×W for i ∈ {0, 1, . . . , n-1}, and C0 = C1 = . . . = Cn−1 . Each segment is then assigned a corresponding 2D discrete cosine transform (DCT) frequency component, as Eq. 3, Freq i =

H−1 −1 W 

ui ,vi i X:,h,w Bh,w ,

(3)

h=0 w=0

where ui and vi represent the 2D indices of the frequency components corresponding to Xi , while Freqi ∈ RCi is the Ci -dimensional vector resulting from aggregation. As shown in Fig. 2, The Frequency-domain Features Extraction module ultimately produces a complete aggregation vector by concatenating various feature information from different frequency components in the frequency domain, as described in Eq. 4.   (4) Freq = cat Freq0 , Freq1 , · · · , Freqn−1 . 2.2

Interaction of Frequency-Domain Features Module

The EFF Module ultimately yields a vector containing frequency-domain information at different frequencies, introducing additional information into the embedded channels. However, there is a lack of interaction among the different frequency information. Therefore, As indicated by the light purple area in Fig. 2, we present our Interaction of Frequency-domain Features Module which efficient interaction and fusion of the frequency-domain feature maps obtained from the EFF module across channels without channel dimensionality reduction, aiming at guaranteeing both efficiency and effectiveness. We found that two convolutional kernels can achieve the best interactive effect. Calculation based on the number of input channels, we conducted 1D convolution operations on the aggregated vectors obtained from the EFF module using m different kernel sizes, followed by fusion of the m results, as described in Eq. 5, m 

( Freq ∗ Conv1Dk=2l−1 ),

(5)

l=1

where m is the number of 1D convolutions fusions, l represents the index, l ∈ {0, 1, . . . , m} and k represents the kernal size of the corresponding 1D convolution. We obtained a vector, effective interplay between different frequencies,

A Novel Cross Frequency-Domain Interaction Learning

297

Fig. 3. The overall framework of CFIL. The framework comprises two modules, the EFF module, represented by the light blue region, and the IFF module, represented by the light purple region.

comprising feature weights obtained from the frequency domain. Our IFF module, with convolutional kernels of size 3 and 15, achieves superior results when compared to a Fully Connected (FC) layer, while maintaining significantly lower model complexity. This ensures both efficiency and effectiveness by effectively capturing local cross-channel interaction. The frequency domain feature are ultimately fused with the time-domain feature maps following interaction (Fig. 3). 2.3

Combined with Existing Framework

This paper proposes a novel method called CFIL, incorporated into each Feature Pyramid Network (FPN) upsampling operation. CFIL involves extracting additional oriented object features from the frequency domain and enabling comprehensive interaction. By introducing frequency-domain interaction, we address the limitations of the existing framework and offer a fresh perspective on feature extraction and representation. By integrating these additional oriented object features, CFIL achieves a more comprehensive understanding of objects in the image, significantly improving model accuracy.

3 3.1

Experiments and Analysis Datasets

DOTA [4,33] is a comprehensive dataset for object detection in aerial images with two released versions: DOTA V1.0 [33] and DOTA V1.5 [4]. DOTA V1.0 comprises 2,806 images and 188,282 instances with annotated oriented bounding boxes and includes 15 object classes. DOTA-v1.5 uses the same images as DOTA V1.0 and also annotates the extremely small instances (below 10 pixels). It has a total of 403,318 instances. HRSC2016 [14] is a widely-used dataset for the detection of ships with arbitrary orientation, which contains 1,061 images that range from 300 to 1500 pixels and contains 2,976 objects in Google Earth. We employ the mean average precision (mAP) as the evaluation criterion, which is consistent with the PASCAL VOC2007 and VOC2012.

298

W. Weng et al.

Table 1. Comparison with state-of-the-art methods on DOTA V1.0 dataset. The top three results are highlighted using RGB. The results with red color denote the best results, blue color and green color indicating the second-best and third-best results, respectively. The gray entries employed deeper backbone networks than our approach. Method

Backbone

PL

BD

BR

GTF SV

LV

SH

TC

BC

ST

SBF

RA

HA

SP

HC

mAP (%)

ICN [1] CAD-Net [40] RoI Transformer [4] CenterMap-Net [31] SCRDet [39] FAOD [11] FR-Est [5] Mask OBB [30] Gliding Vertex [37] Oriented R-CNN [34]

ResNet101 ResNet101 ResNet101 ResNet50 ResNet101 ResNet101 ResNet101 ResNet50 ResNet101 ResNet101

81.36 87.80 88.64 88.88 89.98 90.21 89.63 89.61 89.64 88.86

74.30 82.40 78.52 81.24 80.65 79.58 81.17 85.09 85.00 83.48

47.70 49.40 43.44 53.15 52.09 45.49 50.44 51.85 52.26 55.27

70.32 73.50 75.92 60.65 68.36 76.41 70.19 72.90 77.34 76.92

67.82 63.50 73.68 66.55 60.32 68.27 77.98 73.23 73.14 82.10

69.98 76.60 83.59 78.10 72.41 79.56 86.44 85.57 86.82 87.52

90.76 90.90 90.74 88.83 90.85 90.83 90.82 90.37 90.74 90.90

79.06 79.20 77.27 77.80 87.94 83.40 84.13 82.08 79.02 85.56

78.20 73.30 81.46 83.61 86.86 84.68 83.56 85.05 86.81 85.33

53.64 48.40 58.39 49.36 65.02 53.40 60.64 55.73 59.55 65.51

62.90 60.90 53.54 66.19 66.68 65.42 66.59 68.39 70.91 66.82

67.02 62.00 62.83 72.10 66.25 74.17 70.59 71.61 72.94 74.36

64.17 67.00 58.93 72.36 68.24 69.69 66.72 69.87 70.86 70.15

50.23 62.20 47.67 58.70 65.21 64.86 60.55 66.33 57.32 57.28

68.16 69.90 69.56 71.74 72.61 73.28 74.20 74.86 75.02 76.28

Baseline Ours

ResNet50 ResNet50

89.46 82.12 54.78 70.86 78.93 83.00 88.20 90.90 87.50 84.68 63.97 67.69 74.94 68.84 52.28 75.87 89.78 85.03 62.16 80.85 81.02 84.91 88.63 90.88 87.64 87.75 73.71 70.50 82.67 82.26 71.95 81.32

64.89 71.10 68.81 78.62 68.36 73.18 73.52 75.28 73.01 74.27

Table 2. Comparison with state-of-the-art methods on DOTA V1.5 dataset. Method

Backbone

PL

BD

BR

GTF SV

LV

SH

TC

BC

ST

SBF

RA

HA

SP

HC

CC

mAP (%)

RetinaNet-OBB [13] FR-OBB [20] MASK RCNN [8] HTC [2] ReDet [7]

ResNet50 ResNet50 ResNet50 ResNet50 ReResNet50

71.43 71.89 76.84 77.80 79.20

77.64 74.47 73.51 73.67 82.81

42.12 44.45 49.90 51.40 51.92

64.65 59.87 57.80 63.99 71.41

56.79 69.98 71.34 73.31 75.73

73.31 79.37 79.75 80.31 80.92

90.84 90.78 90.46 90.48 90.83

76.02 77.38 74.21 75.12 75.81

59.96 67.50 66.07 67.34 68.64

46.95 47.75 46.21 48.51 49.29

69.24 69.72 70.61 70.63 72.03

59.65 61.22 63.07 64.84 73.36

64.52 65.28 64.46 64.48 70.55

48.06 60.47 57.81 55.87 63.33

0.83 1.54 9.42 5.15 11.53

59.16 62.00 62.67 63.40 66.86

Baseline Ours

ResNet50 ResNet50

79.55 81.83 56.56 72.18 52.43 76.95 88.13 90.86 78.77 68.88 62.06 72.93 75.16 66.09 60.90 8.11 68.21 80.77 81.18 59.49 76.95 67.26 81.88 89.73 90.86 80.81 78.39 70.89 76.29 80.07 76.69 72.53 51.44 75.95

3.2

44.53 51.28 51.31 51.54 52.38

Implementation Detail

The deep learning model in this study was implemented with consistent details compared to the baseline model. The same hyperparameters and learning rates were used for both models. The training was performed on a single Nvidia RTX 2080Ti with a batch size of 2. The mmdetection platform is used, with ResNet50 as the backbone. Data augmentation techniques such as horizontal and vertical flipping were applied during training. The network was optimized using the SGD algorithm with a momentum of 0.9 and weight decay of 0.0001. 3.3

Comparisons with the State-of-the-Art

As shown in Table 1, CFIL is compared with state-of-the-art (SOTA) methods on the DOTA V1.0 dataset. Firstly, we observe that when using ResNet50 as the backbone network, CFIL achieves the highest mAP performance. In particular, the overall mAP reaches 81.32%. Our ResNet50-based method outperforms the current best ResNet50-based algorithm (baseline), by 2.09% and 7.38% in the SV and BR categories, respectively. This indicates the superiority of the CFIL proposed in our approach. Lastly, we find that when we compare with the use of deeper backbone networks (such as Oriented R-CNN and Gliding Vertex with ResNet101), our proposed module still surpasses it in terms of mAP on ResNet50. This further demonstrates that our algorithm exhibits better generalization capabilities for downstream tasks.

A Novel Cross Frequency-Domain Interaction Learning

299

Table 3. Results on HRSC2016 test set. mAP50 (07) and mAP50 (12) represents the reported results under VOC2007 and VOC2012 mAP metrics.

Method

Backbone

mAP50 (07) mAP50 (12)

R2CNN [10] Rotated RPN [15] RoI Transformer [4] Gliding Vertex [37] PIoU [3] DRN [18] CenterMap-Net [31] DAL [17] S2 ANet [6] R3Det [38] Oriented R-CNN [34]

ResNet101 ResNet101 ResNet101 ResNet101 DLA-34 H-34 ResNet50 ResNet101 ResNet101 ResNet101 ResNet101

73.07 79.08 86.20 88.20 89.20 – – 89.77 90.17 89.26 90.50

79.73 85.64 – – – 92.70 92.80 – 95.01 96.01 96.50

Baseline Ours

ResNet50 ResNet50

90.40 90.70

96.50 98.47

Table 4. Evaluation of the role of EFF and IFF module on HRSC2016 with ResNet50.

No. EFF Module IFF Module mAP50 (12) 1

×

×

2 3 4 5

n = 1 (GAP) ×  × n = 1 (GAP)   

96.43 96.61 (+0.18) 97.03 (+0.60) 97.22 (+0.79) 98.47 (+2.04)

Comparing CFIL with SOTA approaches on the DOTA V1.5 dataset, we present the results in Table 2. Firstly, we can find that when ResNet50 is used as the backbone, our algorithm gets the highest performance in 15 specific categories and overall mAP, especially the overall mAP reaches 75.95%, and our method is higher than the baseline, for example on SV subclasses by 14.83% on the SV class and 8.83% on the SBF subclass. Particularly, we find that our algorithm has a substantial improvement (39.91%) over the previous algorithm on the container crane (CC) subclass, which demonstrates the superiority of CFIL. We conducted a comparative analysis of CFIL with state-of-the-art (SOTA) methods using the HRSC2016 dataset. In order to provide a comprehensive evaluation, we present the results using both the VOC2007 and VOC2012 metrics. The experimental results are summarized in Table 3. Firstly, our method based on ResNet50 is higher than the best current algorithm based on ResNet50

300

W. Weng et al.

Fig. 4. (a) Accuracy of the different number of equally split, i.e., n in Eq. 4. (b) Accuracy of different numbers of 1D convolutional fusions, i.e., m in Eq. 5.

backbone, Oriented R-CNN. Our method achieves SOTA on VOC2007 and VOC2012 evaluation metrics with 90.07 and 98.47 respectively, which shows the superiority of CFIL, and secondly, we find that when comparing deeper networks, our ResNet50-based method can also outperform them on mAp. This reinforces the fact that our proposed algorithm has better generalization for downstream tasks. 3.4

Ablation Studies

To examine the effectiveness of each component in our proposed method, we perform a series of ablation experiments. Impact of the Number of Channels. To evaluate the influence of different numbers of channels in EFF module, i.e., n in Eq. 4, we verify the results of using n highest frequency components of the 2D discrete cosine transform (2D DCT) on Dota V1.0 datasets with ResNet50. Due to the total number of feature map channels being 256 in the experiment, n is set to 1, 2, 4, 8, 16, and 32. As discussed in Sect. 3.2, when n = 1, it is equivalent to Global Average Pooling (GAP). As shown in Fig. 4 (a), we can observe two phenomena. Firstly, compared to the experiment with n = 1 (GAP), experiments that introduce more frequency domain features (n > 1) show significant performance gains. This validates our idea of introducing more frequency domain features. Secondly, dividing the feature maps into 16 parts (n=16) and selecting the 16 highest frequency components of the 2D DCT yields optimal performance. By employing this configuration, we utilize the same setting in our approach as well as in all other experiments. Influence of the Number of 1D Convolutional. As shown in Fig. 4 (b), to assess the impact of varying numbers of 1D convolutional fusions in the IFF

A Novel Cross Frequency-Domain Interaction Learning

301

module, we examine the outcomes obtained by employing the m highest 1D convolutional fusions (arranged according to different kernel sizes) on the Dota V1.0 datasets with ResNet50. The input incorporates the results of an EFF module with n = 16. The best performance is achieved by the two highest 1D convolutional fusions (m = 2). In our approach and all other experiments, we utilize this configuration to ensure consistent results. Our algorithm can combine different backbone networks and FPNs in a good and generic way. Role of Different Settings in EFF and IFF Module. To investigate the effectiveness of the EFF and IFF modules, We compare them respectively against the baseline method. Table 4 shows the experimental results. We set up five groups for the ablative experiments (No. 1–No. 5) on the Dota V1.0 datasets with ResNet50. No. 1 is the baseline. No. 2 serves as the control group for No. 3 and No. 4 experiments. Since the input of the IFF module needs to be a onedimensional vector, No. 2 output directly utilizes the sigmoid function to learn its weights without any interaction. No. 3 only employs the EFF module with n = 16, and its output is directly learned using the sigmoid function for its weights without any interaction. No. 4 utilizes the IFF module with m = 2. Lastly, No. 5 combines the EFF module with n = 16 and the IFF module with m = 2 simultaneously. Comparison No. 1 and No. 2, the improvement in No. 3 demonstrates the effectiveness of the EFF module, proving the validity of introducing richer features from the frequency domain and integrating temporal and frequency domain features. The improvement in No. 4 demonstrates the effectiveness of the IFF module. No. 5, in comparison to No. 1, shows a significant improvement, providing evidence for the effectiveness of combining the EFF and IFF modules to fully utilize frequency domain features and promote effective interaction. Universality for Different Backbones. To verify the applicability of our method, a comparison between using CFIL and not using CFIL in different backbone networks, as shown in Fig. 5(a). It can be clearly observed that using CFIL has a great improvement over not using CFIL. Using CFIL, the network with ResNet101 and SENet as the Backbone network increased by 5.39% and 5.38% respectively. This shows that the proposed CFIL can enhance the performance of capturing aerial oriented object features in different backbone networks. Universality for Different FPNs. We conducted a comparative analysis between employing CFIL and not across various FPNs, as shown in Fig. 5(b). It can be observed that using our method has a great improvement over not using CFIL. Using CFIL, The accuracy of all networks has increased by no less than 5%. This shows that the proposed CFIL can adapt to various FPN variants and enhance the performance of aerial oriented object detection.

302

W. Weng et al.

Fig. 5. (a) Accuracy of the Dota V1.0 test set under different backbones. (b) Accuracy of the Dota V1.0 test set under different FPNs. All results are performed with ResNet50-FPN backbone.

Fig. 6. Visualization results. Compared our method with baseline. The red bounding boxes indicate the presence of error detection and omission detection for ground truth.

3.5

Visualization Results

As shown in Fig. 6, we visualized the results, comparing the results of the baseline and our algorithm with ground truth. In comparison with ground truth all missed detections, wrong detections, etc. are marked with red bounding boxes. It can be observed that our algorithm detects more target objects than baseline, especially in the scenes of light transformation, occlusion, etc. Our algorithm can capture more information about the target objects and detect them correctly.

4

Conclusion

In this paper, we propose CFIL, which includes the EFF module and IFF module, to enhance the feature learning capability of the network through frequency

A Novel Cross Frequency-Domain Interaction Learning

303

domain feature extraction and comprehensive interaction. Our CFIL is a plugand-play method that can be inserted into any backbone network with convolutional layers. We conduct extensive experiments on three challenging oriented object detection benchmarks. The experimental results demonstrate that equipped with CFIL, the performance of variously oriented object detectors is significantly improved on commonly used rotation object detection benchmarks. In the future, we will apply our CFIL to more CNN architectures and further explore its potential in the frequency domain.


DBDAN: Dual-Branch Dynamic Attention Network for Semantic Segmentation of Remote Sensing Images

Rui Che1, Xiaowen Ma1, Tingfeng Hong1, Xinyu Wang1, Tian Feng1, and Wei Zhang1,2(B)

1 School of Software Technology, Zhejiang University, Hangzhou 310027, China
[email protected]
2 Innovation Center of Yangtze River Delta, Zhejiang University, Jiaxing 314103, China

Abstract. The attention mechanism is capable of capturing long-range dependence. However, its independent calculation of correlations can hardly account for the complex background of remote sensing images, which causes noisy and ambiguous attention weights. To address this issue, we design a correlation attention module (CAM) to enhance appropriate correlations and suppress erroneous ones by seeking consensus among all correlation vectors, which facilitates feature aggregation. Simultaneously, we introduce the CAM into a local dynamic attention (LDA) branch and a global dynamic attention (GDA) branch to obtain information on local texture details and global context, respectively. In addition, considering the different demands of complex and diverse geographical objects for both local texture details and global context, we devise a dynamic weighting mechanism to adaptively adjust the contributions of both branches, thereby constructing a more discriminative feature representation. Experimental results on three datasets suggest that the proposed dual-branch dynamic attention network (DBDAN), which integrates the CAM and both branches, considerably improves the performance of semantic segmentation of remote sensing images and outperforms representative state-of-the-art methods.

Keywords: Remote Sensing · Semantic Segmentation · Attention Mechanism · Deep Learning

1 Introduction

Recent advances in sensors enable increasingly available high-resolution remote sensing images (RSIs) to be captured worldwide with rich spatial details and semantic contents. In particular, semantic segmentation of RSIs can serve various applications, such as land cover classification [14], road extraction [13] and urban planning [28]. The drastic development of deep learning techniques has driven a series of methods based on convolutional neural networks (CNNs) for semantic segmentation, given their capability to capture fine-grained local context information, which benefits feature representation and pattern recognition. The convolution operation with a limited receptive field is designed to extract local patterns, but by its nature lacks the power to model global context information or long-range dependence.

Introducing the attention mechanism is an effective means to reduce the confusion in predicted classes without losing spatial information. With the global statistics aggregated from the entire image, scene information can be embedded to highlight the features with appropriate correlations or suppress those with erroneous ones. However, the correlation of each query-key pair is calculated independently, ignoring the correlations of other pairs. This may cause erroneous correlations due to the imperfection of feature representation as well as the interference of complex background, and lead to additional background noise and ambiguous attention weights. We therefore argue that typical attention-based methods [4,19,22] can hardly be applied to the semantic segmentation of RSIs directly.

Generally speaking, if a key is highly correlated with a query, its adjacent keys are supposed to have high correlations with the query; otherwise, the correlations might be noisy. Following this observation, we design a correlation attention module (CAM), which extends the general attention mechanism, to refine the correlations by seeking the consensus among all correlation vectors. Meanwhile, achieving the trade-off between the effectiveness of feature embedding and that of spatial localization is critical for the semantic segmentation of RSIs. Hence, we embed the CAM into a local dynamic attention (LDA) branch and a global dynamic attention (GDA) branch to extract the information on the local texture details and the global context, respectively. In addition, considering the different demands of complex and diverse geographical objects for both local texture details and global context, we devise a dynamic weighting mechanism to fuse the outputs of both branches so as to capture more discriminative feature representations of land cover. Experimental results on three datasets suggest the effectiveness of the proposed dual-branch dynamic attention network (DBDAN) for semantic segmentation of RSIs.

The contributions of this study can be summarized as follows:
– a correlation attention module that effectively strengthens appropriate relationships while suppressing erroneous ones to promote feature aggregation for visual recognition;
– a dynamic weighting mechanism fusing the information on local texture details and global context to obtain more discriminative features in RSIs; and
– a dual-branch dynamic attention network for semantic segmentation of RSIs, whose superiority to other representative methods is demonstrated by experimental results on three datasets.

2 Related Work

2.1 Semantic Segmentation of RSIs

RSIs are normally captured by sensors from a distance, typically aboard aircraft or satellites, and often cover diverse and complex ground objects. Semantic segmentation, which assigns a semantic label to each pixel in an image based on the class of the ground object or region it represents, plays a crucial role in image processing. Deep neural networks have facilitated considerable advancements in semantic segmentation accuracy because of their capacity for automatic extraction of more informative image features and integration of richer context information.

FCN [10] is a pioneering CNN architecture that can effectively perform end-to-end semantic segmentation. After its introduction, many methods were innovatively developed based on FCN. For example, PSPNet [29] uses a pyramid pooling module to perform multi-scale processing on input images, and DeepLab [1] uses dilated convolution to increase the receptive field and retain more detailed information. STLNet [32] utilizes global information in the lower-level feature mapping to effectively extract multi-scale statistical texture features, thereby enhancing texture details. An adjacent pixel affinity loss [3] is introduced to improve the capability to discriminate semantically heterogeneous regions. HBCNet [24] is proposed to improve the accuracy of semantic segmentation by utilizing boundary information, semantic classes, and region feature representations. CTMFNet [18], a multi-scale fusion network, integrates CNN and Transformer mechanisms into its backbone encoder-decoder framework. DMF-CLNet [17] extracts dense multi-scale features based on contrastive learning.

2.2 Attention-Based Methods

The attention mechanism refers to the strategy of allocating biased computational resources to highlight the informative parts of processed signals. Non-Local [22] is the first to apply the self-attention mechanism to image processing, allowing the feature at any position to perceive the features at all other positions. Based on this, CBAM [23] and DANet [4] combine spatial attention and channel attention to model features in serial and in parallel, respectively. In addition, FLANet [19] and SAPNet [30] reconstruct the self-attention mechanism as a fully attentional approach that can be obtained in a single non-local block.

Semantic segmentation of RSIs benefits considerably from attention-based methods. For example, Li et al. [7] propose the use of deep CAM and shallow spatial attention modules to segment large-scale satellite images. AFNet [25] designs a multi-path encoder structure, a multi-path attention fusion module, and a refined attention fusion module, the latter being used for semantic segmentation of ultra-high-resolution RSIs. HMANet [15] combines representations from the spatial domain, channel domain and class domain to enhance feature extraction. Similarly, Li et al. [8] propose HCANet, which extracts and aggregates background information across levels from pixels, superpixels, and the global space. In addition, Liu et al. design an adaptive fusion network [25] to enhance feature pixels with low-level or high-level feature maps using attention modules. LANet [2] introduces a patch-wise attention module to incorporate regional information, and an attention embedding module to fuse cross-level attention.

Inspired by the success of the attention-based methods above and considering their limitations, we rethink the attention mechanism from a different perspective. An inherent limitation of the attention mechanism is that the spatial correlations are computed independently, which may introduce noise and ambiguity due to the complex background of RSIs, resulting in unsatisfactory segmentation performance. Consequently, we propose a dual-branch dynamic attention network for semantic segmentation of RSIs, seeking consensus among all correlation vectors to enhance visual focus and associating local details with global information to obtain enhanced feature representations.

Fig. 1. Architecture of the proposed DBDAN.

3 Method

3.1 Overview

As shown in Fig. 1, the proposed method takes as input an image I, whose feature representation R is extracted from the backbone. Two branches, namely, a local dynamic attention (LDA) branch and a global dynamic attention (GDA) branch, each of which includes a correlation attention module (CAM), process R to reach the local feature representation Dl and the global feature representation Dg , respectively. The weights of both branches αl and αg are generated by a dynamic weighting mechanism from R to fuse Dl and Dg into the refined feature representation S. The element-wise summation of R and S is then employed to reach the enhanced feature representation P. Finally, the output segmentation map M is obtained by the concatenation of P and the low-level feature representation L from the backbone followed by upsampling.
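For concreteness, the following is a minimal sketch of how the data flow described above could be wired together in PyTorch. It is not the authors' released implementation: the LDA and GDA branches are reduced to placeholder convolutions (their actual structure is given in Sects. 3.2-3.4), and the module names, channel sizes and upsampling factors are assumptions.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class DBDANSketch(nn.Module):
    """Hypothetical wiring of the DBDAN pipeline, not the authors' code."""
    def __init__(self, high_ch=512, low_ch=256, num_classes=6):
        super().__init__()
        self.lda = nn.Conv2d(high_ch, high_ch, 3, padding=1)   # placeholder for the LDA branch
        self.gda = nn.Conv2d(high_ch, high_ch, 3, padding=1)   # placeholder for the GDA branch
        self.gate = nn.Sequential(                              # dynamic weighting of the two branches
            nn.AdaptiveAvgPool2d(1),
            nn.Conv2d(high_ch, high_ch // 16, 1), nn.ReLU(inplace=True),
            nn.Conv2d(high_ch // 16, 2, 1),
        )
        self.fuse = nn.Conv2d(high_ch, high_ch, 1)
        self.head = nn.Conv2d(high_ch + low_ch, num_classes, 1)

    def forward(self, R, L):
        # R: high-level backbone feature; L: low-level backbone feature (larger spatial size)
        Dl, Dg = self.lda(R), self.gda(R)
        alpha = torch.softmax(self.gate(R), dim=1)              # alpha_l + alpha_g = 1 per image
        S = self.fuse(alpha[:, 0:1] * Dl + alpha[:, 1:2] * Dg)  # refined representation S
        P = R + S                                               # element-wise summation with R
        P = F.interpolate(P, size=L.shape[-2:], mode="bilinear", align_corners=False)
        M = self.head(torch.cat([P, L], dim=1))                 # concatenate with low-level feature
        return F.interpolate(M, scale_factor=4, mode="bilinear", align_corners=False)  # assumed output stride
```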


Fig. 2. Structures of key components in our DBDAN: (a) a local dynamic attention (LDA) branch to extract the information on the local texture details, (b) a global dynamic attention (GDA) branch to extract the information on the global context, and (c) a correlation attention module (CAM) embedded in both branches to denoise the feature representations.

3.2 Correlation Attention Module

We briefly review the general attention mechanism in computer vision as follows. An attention function takes as input a query and a set of key-value pairs and outputs a weighted sum of the values. Specifically, the weights of the values are obtained by calling the Softmax function on the scaled dot products of the query and the corresponding keys. Suppose a feature representation X ∈ R^{H×W×Ĉ}, where H, W and Ĉ denote X's height, width and number of channels, respectively. 1 × 1 convolutions W_q, W_k, and W_v ∈ R^{Ĉ×C} are adopted to transform X into three embeddings Q, K, V ∈ R^{H×W×C} as

Q = W_q(X), K = W_k(X), V = W_v(X),    (1)

where C denotes the number of channels. These embeddings are then flattened to size N × C, where N = H × W represents the total number of spatial locations. Afterwards, the attention map A ∈ R^{N×N}, which involves the correlation vectors of all queries, is calculated via matrix multiplication as

A = Q × K^T.    (2)

For each location in V, the output of the attention function is formulated as

O = (f(A) × V)W_o + X,    (3)

where f(·) represents a function that applies normalization to A, and W_o ∈ R^{C×Ĉ} denotes the linear transformation weights that adjust the importance of the attention function and recover the channel number from C to Ĉ.


In the general attention mechanism, however, the correlation of each query-key pair in the attention map A is calculated independently, ignoring its relationship with other query-key pairs. This strategy may introduce erroneous correlations due to the imperfection of feature representation as well as the interference of complex background, and thus cause additional background noise and ambiguous attention weights. These issues negatively impact the feature aggregation in the attention function, which results in sub-optimal performance for semantic segmentation of RSIs.

To address the above-mentioned issue, we observe that a high query-key correlation should imply a similar relationship between the query and the key's adjacent pixels. Therefore, we design a correlation attention module (CAM) to improve correlation map generation by exploiting the clues in the general attention mechanism. Specifically, our CAM seeks the correlation consistency around each key to enhance the appropriate correlations extracted from the relevant query-key pairs and suppress the erroneous ones from the irrelevant pairs.

As a variant of the general attention mechanism, our CAM considers the columns in the attention map A as a sequence of correlation vectors and takes them as queries Q′, keys K′ and values V′ to generate a residual attention map. As shown in Fig. 2 (c), our CAM first applies a linear transformation to reduce the dimensions of Q′ and K′ to N × D, where D ≪ N, for computational efficiency, and Q̄′ and K̄′ are then generated after normalization. Besides, V′ is processed by layer normalization to reach the normalized correlation vector V̄′. With Q̄′, K̄′ and V̄′, the residual attention map A′ is generated as

A′ = (f(Q̄′K̄′^T / √C)V̄′)(1 + W_a),    (4)

where W_a ∈ R^{N×N} denotes the linear transformation weights to adjust the aggregated correlations with an identity connection. Consequently, the attention function with our CAM can be formulated as

Attn(Q, K, V) = (f(A + A′) × V)W_o + X.    (5)
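A compact sketch of the CAM computation in Eqs. (4)-(5) is given below. It is an illustration under stated assumptions rather than the authors' code: softmax is used for the unspecified normalization of Q̄′ and K̄′, the output projection W_o and the residual skip of Eq. (3) are omitted, and the reduced dimension D is a free hyper-parameter.

```python
import torch
import torch.nn as nn

class CorrelationAttention(nn.Module):
    """Sketch of the CAM: refine an attention map by letting its correlation
    vectors (the columns of A) attend to each other."""
    def __init__(self, num_tokens, channels, reduced_dim=64):
        super().__init__()
        self.scale = channels ** 0.5
        self.q_proj = nn.Linear(num_tokens, reduced_dim)          # reduce correlation vectors to D << N
        self.k_proj = nn.Linear(num_tokens, reduced_dim)
        self.v_norm = nn.LayerNorm(num_tokens)
        self.w_a = nn.Linear(num_tokens, num_tokens, bias=False)  # W_a applied with an identity connection

    def forward(self, q, k, v):
        # q, k, v: (B, N, C) flattened embeddings of one patch / image
        attn = q @ k.transpose(-1, -2)                            # A = Q K^T, shape (B, N, N)
        corr = attn.transpose(-1, -2)                             # columns of A as a sequence of N vectors
        qc = torch.softmax(self.q_proj(corr), dim=-1)             # normalised, reduced queries (assumption)
        kc = torch.softmax(self.k_proj(corr), dim=-1)
        vc = self.v_norm(corr)                                    # layer-normalised correlation vectors
        res = torch.softmax(qc @ kc.transpose(-1, -2) / self.scale, dim=-1) @ vc
        res = res + self.w_a(res)                                 # (1 + W_a) as a residual linear map
        attn = torch.softmax(attn + res.transpose(-1, -2), dim=-1)  # f(A + A')
        return attn @ v                                           # aggregated values (W_o and skip omitted)
```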

3.3 Local Dynamic Attention Branch

In most cases, an RSI covers a large area and thus includes massive local information (e.g., building rooftops, trees, and utility poles). However, these details are usually represented by a limited number of pixels, and are likely to be obscured, blurred, or covered by noise, which causes difficulty in accurate segmentation. To solve this problem, we design the local dynamic attention (LDA) branch to enhance the extraction of local information in an RSI.

As shown in Fig. 2 (a), the attention function is limited to local patches for feature aggregation, since global attention is not suitable for processing high-resolution RSIs. Meanwhile, the CAM is introduced for denoising so that the enhanced feature representation contains accurate and detailed local information. Specifically, the original feature representation R ∈ R^{H×W×Ĉ} is first split into patches X ∈ R^{(N_h×N_w)×(h×w)×Ĉ}. Each patch x_i ∈ R^{h×w×Ĉ} is processed into q_i, k_i, v_i ∈ R^{h×w×C} via linear transformation, and the enhanced feature D_l ∈ R^{H×W×Ĉ} can then be calculated as follows,

D_l = Recover(Attn(q_i, k_i, v_i)),    (6)

where Recover(·) denotes recovering the patches to the original spatial dimensions, h and w refer to the height and width of a patch, and N_h = H/h and N_w = W/w represent the numbers of patches along the height and width dimensions.
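The split-and-recover bookkeeping of Eq. (6) can be sketched as follows; the attention itself (e.g. the CAM-equipped function above) is passed in as `attn_fn`, and the helper only assumes that H and W are divisible by the patch size.

```python
import torch

def local_patch_attention(R, attn_fn, patch=32):
    """Sketch of the LDA split/recover step: run an attention function independently
    inside non-overlapping patches and stitch the result back together."""
    B, C, H, W = R.shape
    nh, nw = H // patch, W // patch
    x = R.reshape(B, C, nh, patch, nw, patch)
    x = x.permute(0, 2, 4, 3, 5, 1).reshape(B * nh * nw, patch * patch, C)  # patches as token sequences
    x = attn_fn(x)                                                          # per-patch feature aggregation
    x = x.reshape(B, nh, nw, patch, patch, C).permute(0, 5, 1, 3, 2, 4)
    return x.reshape(B, C, H, W)                                            # Recover(.)

# Shape check with an identity "attention":
# out = local_patch_attention(torch.randn(2, 64, 128, 128), lambda t: t)
```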

3.4 Global Dynamic Attention Branch

There is a certain spatial dependence between the geographical regions of RSIs. For example, buildings are distributed on both sides of a road, and lakes are surrounded by farmland. The LDA branch models the pixel relationships within regions to enhance local features, but ignores the global context. Therefore, we design a global dynamic attention (GDA) branch to model the long-range dependence between regions, thus enhancing the pixels' perception of the global context.

As shown in Fig. 2 (b), the key of the GDA branch is to apply the CAM in the global dimension to further enrich the feature representation. Specifically, a pooling operation and a linear embedding are applied to the original feature representation R to generate global descriptors Z. Each global descriptor z_i ∈ R^{N_h×N_w×C} is processed into k̂_i, v̂_i, while q̂_i ∈ R^{h×w×C} is generated as in the LDA branch. It is noteworthy that k̂_i and v̂_i need to be extended to the same dimension as q̂_i in advance. The final output D_g ∈ R^{H×W×Ĉ} is calculated as

D_g = Recover(Attn(q̂_i, k̂_i, v̂_i)).    (7)
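A simplified sketch of the GDA branch follows. Note one deliberate simplification: the paper extends k̂_i and v̂_i to the spatial size of q̂_i so that the same CAM-equipped attention as in the LDA branch can be reused, whereas this sketch lets each patch's queries attend directly to the pooled descriptors with plain cross-attention; the pooling grid size is also an assumption.

```python
import torch
import torch.nn.functional as F

def global_descriptors(R, nh=4, nw=4):
    # R: (B, C, H, W) -> pooled global descriptors Z with shape (B, nh*nw, C)
    z = F.adaptive_avg_pool2d(R, (nh, nw))
    return z.flatten(2).transpose(1, 2)

def gda_attention(q_patches, z):
    # q_patches: (B*n_patches, patch*patch, C) local queries (as in the LDA branch)
    # z:         (B, Nd, C) global descriptors, repeated for every patch of the image
    n = q_patches.shape[0] // z.shape[0]
    kv = z.repeat_interleave(n, dim=0)                          # (B*n_patches, Nd, C)
    attn = torch.softmax(q_patches @ kv.transpose(-1, -2) / kv.shape[-1] ** 0.5, dim=-1)
    return attn @ kv                                            # each pixel perceives scene-level context
```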

3.5 Dual-Branch Feature Fusion

Considering the various demands of complex and diverse geographical objects for both local texture details and global context in RSIs, we propose a dynamic weight allocation mechanism to adaptively integrate the above-mentioned local and global information and thus effectively capture a more discriminative feature for segmentation. Specifically, a weighted summation is adopted to control the contributions of the LDA and GDA branches as

S = f_{1×1}(α_l × D_l + α_g × D_g),    (8)

where f_{1×1}(·) denotes the 1 × 1 convolution, and α_l and α_g refer to the weights of the LDA and GDA branches, respectively. In particular, both weights are calculated from the original feature representation R as

α_l, α_g = σ(H_d(δ(H_r(AvgPool(R))))),    (9)

where σ and δ denote the Softmax and ReLU functions, respectively, and H_r ∈ R^{Ĉ×(Ĉ/16)} and H_d ∈ R^{(Ĉ/16)×2} represent the 1 × 1 convolutions for dimension reduction. It is noteworthy that the sum-to-one constraint (i.e., α_l + α_g = 1) is employed to compress the kernel space and simplify the learning of both weights.

4 Results

We implement our DBDAN using Python and PyTorch on a workstation with two NVIDIA GTX A6000 graphics cards (96 GB GPU memory in total). For training, the AdamW optimizer with weight decay of 1e-4 is adopted, and the initial learning rate is set to 1e-4 with the poly decay strategy. The training epochs for the Vaihingen, Potsdam, and LoveDA datasets are 250, 100, and 50, respectively, with a batch size of 4 for all. Following previous work [2,6,11,12], we use data augmentation methods such as random scaling (0.5, 0.75, 1.0, 1.25, 1.5), random vertical flipping, random horizontal flipping, and random rotation during the training process. The augmented images are then randomly cropped into patches of 512×512 pixels. During the inference process, data augmentation techniques such as random flipping and multi-scale prediction are used.
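A minimal sketch of the optimizer and learning-rate schedule named above (AdamW with poly decay) is shown below; the poly power is not stated in the text, so 0.9 is assumed, and the exact hyper-parameter values should be taken from the description above.

```python
import torch

def build_optimizer(model, max_iters, base_lr=1e-4, weight_decay=1e-4, power=0.9):
    # AdamW with a polynomial ("poly") learning-rate decay; power=0.9 is an assumption.
    optimizer = torch.optim.AdamW(model.parameters(), lr=base_lr, weight_decay=weight_decay)
    scheduler = torch.optim.lr_scheduler.LambdaLR(
        optimizer, lr_lambda=lambda it: (1.0 - it / max_iters) ** power
    )
    return optimizer, scheduler
```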

4.1 Datasets and Evaluation Metrics

We evaluate the proposed method on the LoveDA dataset, the ISPRS Potsdam dataset, and the ISPRS Vaihingen dataset using three common metrics, namely, the average F1 score per class (AF), the mean intersection over union (mIoU) and the overall accuracy (OA).

The LoveDA dataset [21] contains 5987 fine-resolution optical RSIs (GSD 0.3 m) of size 1024×1024 pixels and includes 7 land-cover classes (i.e., building, road, water, barren, forest, agriculture, and background). We use 2522 images for training, 1669 images for validation, and the remaining 1796 images for testing.

The ISPRS Potsdam dataset [16] contains 38 true orthophoto (TOP) tiles and the corresponding digital surface models (DSMs) collected from a historic city with large building blocks. All images (GSD 0.05 m) are of size 6000 × 6000 pixels. The ground truth labels comprise six land-cover classes (i.e., impervious surfaces, building, low vegetation, tree, car, and clutter/background). We adopt 24 images for training and the remaining 14 for testing.

The ISPRS Vaihingen dataset [16] contains 33 TOP tiles and the corresponding DSMs collected from a small village. The spatial size varies from 1996 × 1995 to 3816 × 2550 pixels, with a GSD of 0.09 m. The ground truth labels comprise the same six classes as the ISPRS Potsdam dataset. We adopt 16 images for training and the remaining 17 for testing.

4.2 Comparison with the State-of-the-Art Methods

We compare our DBDAN with other state-of-the-art methods on three datasets. As shown in Table 1, the proposed method achieves the best performance on all datasets. Specifically, our DBDAN achieves an increase of 1.8% in mIoU on the LoveDA dataset compared to PoolFormer [26], and on the ISPRS Vaihingen dataset compared to ConvNeXt [9]. On the ISPRS Potsdam dataset, the proposed method reaches a less significant improvement of 0.4% in mIoU compared to FLANet [19]. In addition, our DBDAN provides considerable performance for classes with large variance (e.g., buildings), obtaining an increase of 3% in mIoU.

Table 1. Comparison between our DBDAN and several state-of-the-art methods on the LoveDA, ISPRS Vaihingen and ISPRS Potsdam datasets. Best results are in bold. Second best results are underscored. All results are in percentage.

| Method | Back | Buil | Road | Water | Barren | Forest | Agri | mIoU (LoveDA) | AF (Vai.) | mIoU (Vai.) | OA (Vai.) | AF (Pot.) | mIoU (Pot.) | OA (Pot.) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| PSPNet [29] | 44.4 | 52.1 | 53.5 | 76.5 | 9.7 | 44.1 | 57.9 | 48.3 | 86.47 | 76.78 | 89.36 | 89.98 | 81.99 | 90.14 |
| DeepLabv3+ [1] | 43.0 | 50.9 | 52.0 | 74.4 | 10.4 | 44.2 | 58.5 | 47.6 | 86.77 | 77.13 | 89.12 | 90.86 | 84.24 | 89.18 |
| DANet [4] | 44.8 | 55.5 | 53.0 | 75.5 | 17.6 | 45.1 | 60.1 | 50.2 | 86.88 | 77.32 | 89.47 | 89.60 | 81.40 | 89.73 |
| FarSeg [31] | 43.1 | 51.5 | 53.9 | 76.6 | 9.8 | 43.3 | 58.9 | 48.2 | 87.88 | 79.14 | 89.57 | 91.21 | 84.36 | 89.87 |
| OCRNet [27] | 44.2 | 55.1 | 53.5 | 74.3 | 18.5 | 43.0 | 60.5 | 49.9 | 89.22 | 81.71 | 90.47 | 92.25 | 86.14 | 90.03 |
| LANet [2] | 40.0 | 50.6 | 51.1 | 78.0 | 13.0 | 43.2 | 56.9 | 47.6 | 88.09 | 79.28 | 89.83 | 91.95 | 85.15 | 90.84 |
| ISNet [5] | 44.4 | 57.4 | 58.0 | 77.5 | 21.8 | 43.9 | 60.6 | 51.9 | 90.19 | 82.36 | 90.52 | 92.67 | 86.58 | 91.27 |
| Segmenter [20] | 38.0 | 50.7 | 48.7 | 77.4 | 13.3 | 43.5 | 58.2 | 47.1 | 88.23 | 79.44 | 89.93 | 92.27 | 86.48 | 91.04 |
| MANet [6] | 38.7 | 51.7 | 42.6 | 72.0 | 15.3 | 42.1 | 57.7 | 45.7 | 90.41 | 82.71 | 90.96 | 92.90 | 86.95 | 91.32 |
| FLANet [19] | 44.6 | 51.8 | 53.0 | 74.1 | 15.8 | 45.8 | 57.6 | 49.0 | 87.44 | 78.08 | 89.60 | 93.12 | 87.50 | 91.87 |
| ConvNeXt [9] | 46.9 | 53.5 | 56.8 | 76.1 | 15.9 | 47.5 | 61.8 | 51.2 | 90.50 | 82.87 | 91.36 | 93.03 | 87.17 | 91.66 |
| PoolFormer [26] | 45.8 | 57.1 | 53.3 | 80.7 | 19.8 | 45.6 | 64.5 | 52.4 | 89.59 | 81.35 | 90.30 | 92.62 | 86.45 | 91.12 |
| BiFormer [33] | 43.6 | 55.3 | 55.9 | 79.5 | 16.9 | 45.4 | 61.5 | 51.2 | 89.65 | 81.50 | 90.63 | 91.47 | 84.51 | 90.17 |
| DBDAN (ours) | 48.4 | 60.4 | 59.7 | 80.2 | 18.6 | 46.5 | 65.3 | 54.2 | 91.57 | 84.64 | 91.74 | 93.28 | 87.86 | 91.94 |

Table 2. Ablation study on the feature fusion strategy. Best results are in bold. All results are in percentage.

| Strategy | LoveDA mIoU | Vaihingen AF | Vaihingen mIoU | Vaihingen OA |
|---|---|---|---|---|
| Addition | 53.5 | 91.19 | 84.02 | 91.46 |
| Concatenation | 53.3 | 91.35 | 84.29 | 91.55 |
| Dynamic Weighting | 54.2 | 91.57 | 84.64 | 91.74 |

These findings suggest that the proposed method is superior to the methods used for comparison and enables stable performance for semantic segmentation of RSIs.

Figure 3 shows the visualization of several example outputs from our DBDAN, as well as DANet [4], OCRNet [27], FLANet [19], and BiFormer [33]. It is observable that the proposed method not only better preserves the integrity and regularity of ground objects (e.g., vehicles and low vegetation) but also significantly reduces the number of noise pixels. The qualitative and quantitative results above validate the effectiveness and efficiency of our DBDAN in improving the performance of semantic segmentation of RSIs.

4.3 Ablation Study

We conduct ablation experiments on the LoveDA dataset and the Vaihingen dataset regarding the strategy of fusing the features from the LDA and GDA branches and the patch size used in both branches, as both variables significantly influence the effectiveness of the network.


Fig. 3. Example outputs from our DBDAN and other methods for comparison on the ISPRS Vaihingen dataset (rows 1 and 2) and the ISPRS Potsdam dataset (rows 3 and 4). Best viewed in color and zoomed in.

Table 3. Ablation study on the patch size in both branches. Best results are in bold. All results are in percentage.

| Patch Size | LoveDA mIoU | Vaihingen AF | Vaihingen mIoU | Vaihingen OA |
|---|---|---|---|---|
| 8×8 | 52.9 | 90.38 | 82.69 | 90.75 |
| 16×16 | 53.8 | 91.37 | 84.33 | 91.57 |
| 32×32 | 54.2 | 91.57 | 84.64 | 91.74 |
| 64×64 | 53.3 | 90.82 | 83.52 | 91.12 |

Dual-Branch Feature Fusion Strategy. To demonstrate the effectiveness of our feature fusion strategy, we fix the patch size in the LDA and GDA branches to 32 × 32 and compare the dynamic weighting mechanism with other representative feature fusion strategies, namely, addition and concatenation. As shown in Table 2, our dynamic weighting mechanism achieves a considerable improvement over the others.

Patch Size in LDA and GDA Branches. Using the dynamic weighting mechanism, we explore the impact of the patch size in the LDA and GDA branches on the performance of semantic segmentation, following the previous work [2]. As shown in Table 3, the performance reaches its optimum when the patch size is set to 32 × 32. Therefore, we adopt a patch size of 32 × 32 in the final model.

5 Conclusion

In this paper, we propose DBDAN, a dual-branch dynamic attention network for semantic segmentation of RSIs, which focuses on using the information on both local texture details and global context to enhance feature extraction for efficient and effective modeling of pixel relationships. It is noteworthy that we exploratorily introduce pixel-pair correlations based on the general attention mechanism. Besides, we devise a dynamic weighting mechanism to balance the contributions of the LDA and GDA branches to the performance of semantic segmentation. Experimental results on three datasets validate the effectiveness of our DBDAN.

Considering the large object-scale variation in RSIs, we plan to incorporate a feature pyramid in the future. Specifically, we will employ multiple pooling blocks operating at different spatial scales to separately extract local and global features. Simultaneously, by integrating our proposed dynamic weight allocation strategy, we aim to achieve adaptive fusion of these multi-scale features.

References

1. Chen, L.C., Zhu, Y., Papandreou, G., Schroff, F., Adam, H.: Encoder-decoder with atrous separable convolution for semantic image segmentation. In: ECCV, pp. 801-818 (2018)
2. Ding, L., Tang, H., Bruzzone, L.: LANet: local attention embedding to improve the semantic segmentation of remote sensing images. TGARS 59(1), 426-435 (2020)
3. Feng, Y., et al.: Npaloss: neighboring pixel affinity loss for semantic segmentation in high-resolution aerial imagery. ISPRS Ann. 5(2), 475-482 (2020)
4. Fu, J., et al.: Dual attention network for scene segmentation. In: CVPR, pp. 3146-3154 (2019)
5. Jin, Z., Liu, B., Chu, Q., Yu, N.: Isnet: integrate image-level and semantic-level context for semantic segmentation. In: ICCV, pp. 7189-7198 (2021)
6. Li, R., et al.: Multiattention network for semantic segmentation of fine-resolution remote sensing images. TGARS 60, 1-13 (2021)
7. Li, X., et al.: Dual attention deep fusion semantic segmentation networks of large-scale satellite remote-sensing images. IJRS 42(9), 3583-3610 (2021)
8. Li, X., Xu, F., Xia, R., Lyu, X., Gao, H., Tong, Y.: Hybridizing cross-level contextual and attentive representations for remote sensing imagery semantic segmentation. Remote Sens. 13(15), 2986 (2021)
9. Liu, Z., Mao, H., Wu, C.Y., Feichtenhofer, C., Darrell, T., Xie, S.: A convnet for the 2020s. In: CVPR, pp. 11976-11986 (2022)
10. Long, J., Shelhamer, E., Darrell, T.: Fully convolutional networks for semantic segmentation. In: CVPR, pp. 3431-3440 (2015)
11. Ma, X., et al.: Sacanet: scene-aware class attention network for semantic segmentation of remote sensing images. arXiv preprint arXiv:2304.11424 (2023)
12. Ma, X., et al.: Log-can: local-global class-aware network for semantic segmentation of remote sensing images. In: ICASSP 2023, pp. 1-5. IEEE (2023)
13. Maboudi, M., Amini, J., Malihi, S., Hahn, M.: Integrating fuzzy object based image analysis and ant colony optimization for road extraction from remotely sensed images. ISPRS PRS 138, 151-163 (2018)
14. Marcos, D., Volpi, M., Kellenberger, B., Tuia, D.: Land cover mapping at very high resolution with rotation equivariant cnns: towards small yet accurate models. ISPRS PRS 145, 96-107 (2018)
15. Niu, R., Sun, X., Tian, Y., Diao, W., Chen, K., Fu, K.: Hybrid multiple attention network for semantic segmentation in aerial images. TGARS 60, 1-18 (2021)
16. Rottensteiner, F., et al.: International society for photogrammetry and remote sensing, 2d semantic labeling contest. Accessed 29 Oct 2020. https://www.isprs.org/education/benchmarks
17. Song, M., Li, B., Wei, P., Shao, Z., Wang, J., Huang, J.: DMF-CL: dense multi-scale feature contrastive learning for semantic segmentation of remote-sensing images. In: PRCV 2022, pp. 152-164. Springer, Heidelberg (2022). https://doi.org/10.1007/978-3-031-18916-6_13
18. Song, P., Li, J., An, Z., Fan, H., Fan, L.: CTMFNet: CNN and transformer multiscale fusion network of remote sensing urban scene imagery. TGARS 61, 1-14 (2022)
19. Song, Q., Li, J., Li, C., Guo, H., Huang, R.: Fully attentional network for semantic segmentation. In: AAAI, vol. 36, pp. 2280-2288 (2022)
20. Strudel, R., Garcia, R., Laptev, I., Schmid, C.: Segmenter: transformer for semantic segmentation. In: ICCV, pp. 7262-7272 (2021)
21. Wang, J., Zheng, Z., Ma, A., Lu, X., Zhong, Y.: Loveda: a remote sensing land-cover dataset for domain adaptive semantic segmentation. arXiv preprint arXiv:2110.08733 (2021)
22. Wang, X., Girshick, R., Gupta, A., He, K.: Non-local neural networks. In: CVPR, pp. 7794-7803 (2018)
23. Woo, S., Park, J., Lee, J.Y., Kweon, I.S.: CBAM: convolutional block attention module. In: ECCV, pp. 3-19 (2018)
24. Xu, Y., Jiang, J.: High-resolution boundary-constrained and context-enhanced network for remote sensing image segmentation. Remote Sens. 14(8), 1859 (2022)
25. Yang, X., et al.: An attention-fused network for semantic segmentation of very-high-resolution remote sensing imagery. ISPRS PRS 177, 238-262 (2021)
26. Yu, W., et al.: Metaformer is actually what you need for vision. In: CVPR, pp. 10819-10829 (2022)
27. Yuan, Y., Chen, X., Wang, J.: Object-contextual representations for semantic segmentation. In: Vedaldi, A., Bischof, H., Brox, T., Frahm, J.-M. (eds.) ECCV 2020. LNCS, vol. 12351, pp. 173-190. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-58539-6_11
28. Zhang, Q., Seto, K.C.: Mapping urbanization dynamics at regional and global scales using multi-temporal DMSP/OLS nighttime light data. Remote Sens. Environ. 115(9), 2320-2329 (2011)
29. Zhao, H., Shi, J., Qi, X., Wang, X., Jia, J.: Pyramid scene parsing network. In: CVPR, pp. 2881-2890 (2017)
30. Zheng, S., Lu, C., Wu, Y., Gupta, G.: Sapnet: segmentation-aware progressive network for perceptual contrastive deraining. In: WACV, pp. 52-62 (2022)
31. Zheng, Z., Zhong, Y., Wang, J., Ma, A.: Foreground-aware relation network for geospatial object segmentation in high spatial resolution remote sensing imagery. In: CVPR, pp. 4096-4105 (2020)
32. Zhu, L., Ji, D., Zhu, S., Gan, W., Wu, W., Yan, J.: Learning statistical texture for semantic segmentation. In: CVPR, pp. 12537-12546 (2021)
33. Zhu, L., Wang, X., Ke, Z., Zhang, W., Lau, R.W.: Biformer: vision transformer with bi-level routing attention. In: CVPR, pp. 10323-10333 (2023)

Multi-scale Contrastive Learning for Building Change Detection in Remote Sensing Images

Mingliang Xue1, Xinyuan Huo1, Yao Lu2(B), Pengyuan Niu1, Xuan Liang1, Hailong Shang1, and Shucai Jia1

1 School of Computer Science and Engineering, Dalian Minzu University, Dalian 116650, China
2 Beijing Institute of Remote Sensing Information, Beijing, China
[email protected]

This work was supported by the Research Foundation of Liaoning Educational Department (Grant No. LJKMZ20220400). Please contact Yao Lu ([email protected]) for access to the pre-training dataset.

Abstract. Self-supervised contrastive learning (CL) methods can utilize large-scale label-free data to mine discriminative feature representations for vision tasks. However, most existing CL-based approaches focus on image-level tasks, which are insufficient for pixel-level prediction tasks such as change detection (CD). This paper proposes a multi-scale CL pre-training method for CD tasks in remote sensing (RS) images. Firstly, unlike most existing methods that rely on random augmentation to enhance model robustness, we collect a publicly available multi-temporal RS dataset and leverage its temporal variations to enhance the robustness of the CD model. Secondly, an unsupervised RS building extraction method is proposed to separate the representation of buildings from background objects, which aims to balance the samples of building and background areas in instance-level CL. In addition, we select an equal number of local regions of buildings and background for the pixel-level CL task, which prevents the domination of the local background class. Thirdly, a position-based matching measurement is proposed to construct local positive sample pairs, which aims to prevent mismatch issues in RS images caused by object similarity in local areas. Finally, the proposed multi-scale CL method is evaluated on the benchmark OSCD and SZTAKI databases, and the results demonstrate the effectiveness of our method.

Keywords: Remote sensing images · Contrastive learning · Change detection

1 Introduction

Remote sensing (RS) image change detection (CD) aims to identify changes in geographical features over time. It has been widely applied in various domains [1]. Recently, a plethora of deep learning-based CD methods in RS have been proposed [2]. These methods, leveraging the potent representation capabilities of convolutional neural networks, have become the predominant strategies [3]. Nevertheless, their reliance on high-quality annotated samples introduces substantial labor and time expenses [4], impeding their practical utility [5].

Recently, self-supervised learning (SSL) has gained attention as an annotation-free approach for extracting discriminative feature representations. Contrastive learning (CL), a potent SSL technique, has rapidly advanced in natural image processing [6,7]. It has also found success in RS image CD tasks [8,9]. However, existing CL approaches primarily concentrate on image-level representation learning, which may not be sufficient for pixel-level classification tasks such as CD in RS images. To overcome these limitations, [10] proposed a multi-task self-supervised approach that learns low-level and high-level features simultaneously, while [11] introduced a global and local matching CL method for effective local-level representation modeling. Furthermore, [4] designs a global-local CL method that demonstrates its effectiveness for fine-grained CD tasks.

Although the above CL methods have achieved remarkable results on RS images, there are still some limitations. Firstly, due to the lack of multi-temporal RS image data [11], most methods rely on random augmentation to simulate real imaging environment changes, aiming to enhance model robustness. However, random augmentation is not able to capture the rich semantic information provided by actual imaging environment changes, limiting the improvement in model robustness. Secondly, RS images are characterized by imbalanced class distributions [12], which many CL methods commonly overlook. These methods obtain the image representations by pooling the entire image feature into a single vector, which may lead to the dominance of background categories in the CL process. Furthermore, in pixel-level CL, there also exists the issue of local background class dominance [11]. Meanwhile, the utilization of distance-based measurements for local instance matching in RS images usually causes mismatches due to the intricate nature of the objects contained. As a result, the location-sensitive characteristics that are especially important for CD tasks [13] are not adequately learned in CL.

In this paper, we propose a multi-scale CL pre-training method for CD tasks in RS images. Firstly, we collect a multi-temporal RS dataset, which can provide temporal variation information for CL pre-training. Secondly, an unsupervised RS building extraction method is designed to separate the buildings from background objects in instance-level CL, which prevents the background class-dominated problem. Furthermore, an equal number of local regions of buildings and background are selected in pixel-level CL, which aims to balance the representation learning of local areas. The local positive sample pairs are then constructed based on position matching, which resolves the mismatch issues. The contributions of this paper can be summarized as follows:
1) We construct a multi-temporal urban RS dataset for CL pre-training in CD. The temporal variation information provided by the constructed dataset can be utilized to enhance the robustness of the pre-trained models.
2) A multi-scale CL method is proposed for CD in RS images. The proposed method considers the category distribution imbalance in RS images in both instance-level and pixel-level CL, which can obtain robust feature representations for CD in RS images.
3) We propose a position-based matching method for the construction of local positive sample pairs in pixel-level CL, which encourages the pre-trained model to learn location-sensitive information for the CD task.

2 Related Works

2.1 Self-supervised Contrastive Learning

Recent SSL progress has been driven by CL methods [14], an unsupervised approach [15-17]. SimCLR [6] achieves excellent performance in CL using a large batch size. To reduce the need for large batch sizes, MoCo [18] introduces momentum encoders and negative sample queues to learn effective feature representations. Furthermore, SimSiam [19] acquires representations without relying on negative samples. Recently, VICReg [7] effectively addresses model collapse by using two distinct regularization methods separately. SMoG [20] demonstrates superior results with its group-level CL approach. [21] proposed a dense CL method (DenseCL) that better adapts to dense prediction downstream tasks.

2.2 SSL in Remote Sensing

With these SSL advancements, notable research has successfully applied them to RS. [8] introduces an SSL signal acquisition method and constructs the publicly available S2MTCP dataset. [9] proposed SeCo, a CL method that leverages seasonal variations to effectively utilize unlabeled RS imagery. Although the mentioned SSL methods show promise, they lack fine-grained information processing for the pixel-level classification of different ground objects [4]. Therefore, a series of pretext tasks based on pixel-level CL have been proposed [4,10,11], guiding the pre-trained model to learn both high-level and low-level image semantic information. However, these methods have some limitations. Firstly, their simulated imaging environment changes fall short of fully capturing the semantic information inherent in complex real-world imaging conditions. Secondly, randomly selecting local regions in pixel-level CL can lead to the dominance of the local background class. Thirdly, matching local instances using distance-based measurements in RS images can result in pairing errors.

3 Methodology

We outline the steps involved in the proposed pre-training method as follows. Firstly, the two input temporal-variation images are augmented to generate augmented view pairs. The pairs undergo feature extraction using the Siamese encoder network. The extracted features then follow two processing paths: 1) the mask obtained by the building extraction method is used to pool the extracted features, resulting in representation vectors with different semantics; these vectors are then fed into the instance-level CL module for representation learning. 2) The features are decoded by the decoder; the mask obtained by selecting equal amounts of local regions from different classes is then used to pool the decoded features into representation vectors for the different categories. These category-specific vectors are subsequently input into the pixel-level CL module for representation learning. The overall architecture is illustrated in Fig. 1.

Fig. 1. Overall architecture of the proposed multi-scale contrastive learning method for change detection in remote sensing images.

3.1 Data Preprocessing and Representation Generation

In data preprocessing, two images (x^i, i ∈ {1, 2}) capturing temporal variations are randomly chosen from a set of four RS images taken at three-month intervals at a specific location. Next, the building extraction module extracts the building area M from x^i. Based on M, we choose an equal number of building and background local areas, generating the mask K. Meanwhile, data augmentation is applied to x^1 and x^2, resulting in x^{t1} and x^{t2}. The semantic mask M^{ti} and the local region mask K^{ti} are also derived from M and K by employing the same geometric augmentations as their counterparts x^{ti}. Next, x^{t1} and x^{t2} are fed into the Siamese encoder to obtain e^{t1} and e^{t2}. In the instance-level CL module, the semantic masks M^{t1} and M^{t2} are respectively resampled to the same size as the corresponding feature e^{ti}. Through the mask M^{ti}, the corresponding feature e^{ti} is decoupled into different semantic representation vectors e^{ti}_j, j ∈ {1, 2}, where j stands for the semantic category (building or background). In the pixel-level CL module, the feature e^{ti} is decoded by the decoder to obtain the feature d^{ti}; then K^{ti} pools d^{ti} into the representation vectors P^{ti}_n, n ∈ {1, 2, ..., 10}, where n ranges from 1 to 5 for the building areas and from 6 to 10 for the background areas.
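The decoupling of e^{ti} into building and background vectors amounts to masked average pooling; a small sketch is given below. The function name and the nearest-neighbour resizing of the mask are assumptions, not the authors' implementation.

```python
import torch
import torch.nn.functional as F

def masked_class_pooling(feat, mask):
    """Pool a feature map into one building vector and one background vector.
    feat: (B, C, H, W) encoder features; mask: (B, 1, H, W) binary building mask."""
    mask = F.interpolate(mask.float(), size=feat.shape[-2:], mode="nearest")
    eps = 1e-6
    building = (feat * mask).sum(dim=(2, 3)) / (mask.sum(dim=(2, 3)) + eps)               # e^{ti}_1
    background = (feat * (1 - mask)).sum(dim=(2, 3)) / ((1 - mask).sum(dim=(2, 3)) + eps)  # e^{ti}_2
    return building, background   # each (B, C)
```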

Instance-Level CL Module

Most existing CL methods obtain image representations by pooling the entire image feature into a single vector. However, in RS images, there exists a characteristic of imbalanced class distributions among categories [12], which may result in the dominance of background classes in the CL process. Therefore, expanding on the building extraction method we have designed, we separate the representation of the building object from the background. This approach effectively addresses the issue of background class dominance in representation learning and simultaneously enhances the learning of semantic relationships between instances in CL. While [22] has alleviated the problem of background classdominated representation learning to some extent. However, the difference is that the instance-level CL method we design does not rely on any of the labeled data. The instance-level CL architecture is shown on the left side of Fig. 2. Firstly, we are inspired by SimSiam and adopt a positive example-based CL method. Differently, within the CL process, we decouple the representations of distinct semantic categories. For example, each semantic representation vector d eti j is projected into the embedding space z ∈ R by the Projector, resulting ti t1 in vector gj . Next, the vector (e.g. gj .) is mapped to a vector (e.g. ht1 j .) by the predictor, and its cosine-like distance from the corresponding projection vector (e.g. gjt2 .) is calculated. Then, optimize the architecture by minimizing

322

M. Xue et al.

Fig. 1. Overall architecture of the proposed multi-scale contrastive learning method for change detection in remote sensing images.

the negative cosine similarity distance between the projection vector and the corresponding prediction vector. The corresponding loss function is defined as: 1 t1 t2 t2 t1 Lti pu,j = 1 − (C(hj , st(gj )) + C(hj , st(gj ))) 2

(1)

where st represents gradient clipping. SimSiam shows the effectiveness of gradient clipping in preventing model collapse. Finally, 2 the overall similarity loss for the current sample is denoted as Lpu = 12 j=1 Lti pu,j , where j represents different semantic categories. As a result of the decoupling of representations for distinct semantic classes, the CL process inherently avoids being dominated by background classes. Minimizing Lpu ensures that the model encodes similar data similarly. By generating similar semantic representations for buildings at the same location but at different times, the model learns robustness to complex imaging environment changes. Additionally, learning similar semantic representations for the background area at the same location and at different times enhances the model’s robustness to seasonal variations. Furthermore, to further explore the relationship between different semantic categories, we utilize the cosine similarity loss to separate their representations in a high-dimensional space. According to the two types of semantic representation vectors eti 1 and i ti corresponding to the input image x , the dissimilarity loss L we design is eti 2 p,j ti ti Lti = 1 + (C(e , e )), where C(·) denotes cosine similarity, and the additional 1 2 p,j term guarantees a positive loss value.Thus, the overall dissimilarity loss for the 2 current sample is denoted as Lp = 12 i=1 Lti p,j . Ultimately, the total loss Lg for the current sample in the instance CL model is denoted as Lg = Lp + αLpu . The positive coefficient α is used to balance the learning speed of the model between tasks Lp and Lpu .

Multi-scale CL for Building CD in RS Images

323

Fig. 2. Instance and pixel-level Contrastive Learning architecture.

3.3

Pixel-Level CL Module

We enhance local-level representations with a pixel-level CL module, advantageous for dense prediction tasks [21]. Additionally, [11] suggests balancing category selection for local regions could lead to further improvements. Therefore, we balance local region selection based on building extraction results. Meantime, due to the intricate spectral responses exhibited by objects in RS images and the image enhancement, there may be a mismatch issue with commonly used distance-based matching methods for local instances. We match local instances at the same location to address this issue. Specifically, we apply the same spatial augmentation from the augmented image pair xti to the selected local region mask K to obtain K ti . Through K ti , each local instance corresponding to the current augmented image is matched with another augmented image’s local instances. Each pair shares the same position in the original image. This matching process forms corresponding dense positive sample pairs (Pnt1 ,Pnt2 ). The pixel-level CL architecture is shown on the right side of Fig. 2. The encoder-decoder network is optimized to ensure dissimilarity between different category local representations and similarity within the same category. Firstly, we introduce the local region positive pair CL method based on matching corresponding positions. For example, each vector representation Pnti is mapped to the vector P gnti by the Projector. Next, through the Predictor, the projection vector (e.g. P gnt1 ) is mapped to the prediction vector (e.g. P ht1 n ), and the cosine similarity distance between it and the corresponding matching projection vector (e.g. P gnt2 ) is calculated. The corresponding loss function is defined as: Lr =

10 1 1 t2 t2 t1 (1 − (C(P ht1 n , st(P gn )) + C(P hn , st(P gn )))) n n=1 2

(2)

The CL of local areas from categories building (n = 1 to 5) and background (n = 6 to 10) is balanced. Next, we introduce the CL for dissimilarity between local representations belonging to different categories. Based on the local representations of different categories, the semantic dissimilarity loss Lti d,p follows. 1 ti ti ti ti ti ti Lti d,p = 1 + (C(P1 , P6 ) + C(P2 , P7 ) + C(P3 , P8 )) 3

(3)

324

M. Xue et al.

The model learns fine-grained category differences through dissimilarity loss based on semantic representations. The 2 total local dissimilarity loss for the current sample is calculated as Ld = 12 i=1 Lti d,p . Finally, the total loss Lt of the current sample in the pixel-level CL module is calculated as Lt = Ld + βLr . By applying a positive coefficient β to weight and balance the learning rate across different tasks. Finally, the overall loss La of the current sample is defined as La = (1 − λ)Lg + λLt . [4] highlight the significance of high-level semantic and low-level detail information for accurate CD. Accordingly, we set a constant value of 0.5 for λ.

4 4.1

Experiments Datasets

Firstly, we automatically collect a global multi-temporal urban dataset from the Sentinel-2 platform [23]. Sentinel-2 satellite imagery consists of 12 spectral bands with resolutions of 10 m, 20 m, and 60 m, offering a wide coverage and short revisit cycles. More specifically, we use the Google Earth Engine [24] to extract, process, and download a sample of satellite image patches from approximately 24000 cities worldwide. For cities with a population below 0.1 million, we conduct normal distribution sampling within a 1.4 km radius around the city center. Each location was associated with four images, taken three months apart. For cities with a population exceeding 0.1 million, normal distribution sampling is performed with a standard deviation of 2 km around the city center. To mitigate cloud interference, patches with a cloud coverage exceeding 5% are filtered out. The image size was standardized to 256×256 pixels. Finally, approximately 78,000 images are obtained. Additionally, we evaluate our proposed building extraction module using a test dataset provided in [25], which includes 101 locations across six continents from 2018 to early 2020. The dataset focuses on 60 rapidly developed test sites. Finally, we evaluate our pre-training method on two CD datasets: 1) OSCD [26], containing 24 sets of Sentinel-2 multispectral images from 2015 and 2018. We adopt Seco’s partitioning, using 14 images for training and 10 for validation. 2) SZTAKI [27], an aerial dataset with three subsets: SZADA, TIZADOB, and ARCHIVE. For our evaluation of a small dataset, we focus on the SZADA subset, which consists of 7 pairs of dual-temporal images. The first pair is for testing, cropped into 6 non-overlapping 256 × 256 regions. The remaining 6 pairs form a training set of 168 images with 50% overlap. 4.2

Implementation Details

Building Extraction Method. We accurately extract building areas from images based on factors such as: 1) Buildings display relatively less visual variation compared to the background across different periods, such as seasonal change. 2) Visual disparities between buildings and vegetation regions. 3)

Multi-scale CL for Building CD in RS Images

325

Texture differences between buildings and the background regions. First, after adjusting image brightness, we identify subtle building changes compared to the background across seasons using image differencing and thresholding, approximating building areas as MS . Next, we adapted the vegetation index [28] to 3 × g − 2.4 × r − 1.3 × b for precise vegetation extraction in RS images, obtaining non-vegetation areas as MT . Furthermore, we employ a novel GLCM-based approach for pixel-level texture segmentation of RS images, refining target area selection based on texture differences among objects. Initially, the grayscale image based on the B channel is extracted from the image. It is then quantized into eight gray levels. For each pixel, GLCMs are computed within its 1-pixel neighborhood for the average values along four directions: 0◦ , 45◦ , 90◦ , and 135◦ . Next, contrast texture features are further calculated:  Contrast = p(i, j) × (i − j)2 (4) i

j

where i and j represent the rows and columns of the GLCM, p(i, j) represents the current texture map in the corresponding grayscale image. Next, after thresholding, we obtain the final building extraction result MG , which accurately delineates the building area M in the remote sensing image through a bitwise AND operation with the precomputed masks MS and MT . Training Details. 1) Pre-training details: we employ ResNet-18 [29] as the CL backbone. The decoder consists of bilinear interpolation and convolution modules. The Projector and Predictor structures are 2-layer MLPs with output dimensions of 1024 and hidden dimensions of 1048 and 256. Data augmentation includes color jittering, Gaussian blur, random rotation, and flipping. Stochastic Gradient Descent (SGD) with an initial learning rate of 0.01, weight decay of 0.0005, and momentum of 0.9 served as the optimizer. Pre-training is conducted for 15 epochs with a batch size of 64. 2) Change detection details: we compare our method with random initialization, ImageNet pre-training, Seco [9], SimSiam [19], VICReg [7] and SMoG [20]. We adopt Seco’s CD task model architecture. For the OSCD dataset, we use Seco’s training parameter settings to ensure fair comparison. For the SZADA dataset, we employ Adam optimizer to train U-net network with frozen backbone weights. The training was done for 50 epochs with an initial learning rate of 0.0001, gradually decreasing by 0.95 per epoch, using a batch size of 32. 4.3

4.3 Experiment on Change Detection and Building Extraction

We evaluate the pre-training methods on the OSCD and SZADA datasets using precision, recall, and F1 score as metrics; we focus on the F1 score, which combines precision and recall and therefore provides a comprehensive assessment. On the OSCD dataset, as shown in Table 1, our proposed method achieves superior performance. This can likely be attributed to the introduction of instance-level CL, which effectively captures the relationships between different land cover classes (e.g., buildings and background).


Table 1. The fine-tuning results on the OSCD and SZADA datasets.

Pre-training | OSCD Precision | OSCD Recall | OSCD F1 | SZADA Precision | SZADA Recall | SZADA F1
Random       | 0.705 | 0.191 | 0.294 | 0.672 | 0.130 | 0.218
ImageNet     | 0.704 | 0.251 | 0.362 | 0.713 | 0.158 | 0.258
SimSiam [19] | 0.657 | 0.291 | 0.399 | 0.539 | 0.203 | 0.295
SeCo [9]     | 0.654 | 0.380 | 0.469 | 0.467 | 0.216 | 0.295
VICReg [7]   | 0.671 | 0.365 | 0.459 | 0.588 | 0.137 | 0.222
SMoG [20]    | 0.674 | 0.315 | 0.409 | 0.356 | 0.364 | 0.360
Ours         | 0.696 | 0.395 | 0.488 | 0.451 | 0.382 | 0.414

Fig. 3. Visualization results on SZADA dataset.

In contrast, most baseline methods overlook the importance of learning such discriminative features. Furthermore, by introducing and improving the pixel-level CL pretext task, our approach enhances the model's ability to analyze and process fine-grained information within images, which matches the requirements of dense CD tasks. On the SZADA dataset, as shown in Table 1 and Fig. 3, our method demonstrates superior performance even in the presence of complex environmental factors such as varying lighting and atmospheric conditions, and it excels at capturing fine-grained information within images; this can again be attributed to the introduction and enhancement of pixel-level CL. Finally, we evaluate the proposed building extraction method, as shown in Table 2; it achieves high accuracy for various ground object types.

4.4 Ablation Studies

We ablate the key modules of the proposed pre-training approach: utilizing temporal variation information in the pre-training dataset (TV), instance-level CL (IL), pixel-level CL (PL), and class balance in pixel-level CL (CB).


Table 2. Accuracy on building extraction datasets

F1    | Precision | Recall | CPA(building) | CPA(background) | mPA
0.507 | 0.431     | 0.613  | 0.852         | 0.613           | 0.733

Table 3. Ablation experiment on OSCD dataset

Base | TV | IL | PL | CB | F1
✓    | ×  | ×  | ×  | ×  | 0.377
✓    | ✓  | ×  | ×  | ×  | 0.399
✓    | ✓  | ✓  | ×  | ×  | 0.467
✓    | ✓  | ✓  | ✓  | ×  | 0.478
✓    | ✓  | ✓  | ✓  | ✓  | 0.488

We use SimSiam (which does not exploit the temporal structure of the dataset) as the baseline. Modules are added to the baseline incrementally, and the downstream OSCD results are presented in Table 3. First, integrating the TV module improves the F1 score by 2.2% over random augmentation, as realistic temporal changes introduce variations in imaging conditions that bolster model robustness more effectively. Second, adding the IL module raises the F1 score by 6.8%, because it decouples the representations of buildings from backgrounds, overcoming the dominance of the background class inherent in traditional CL methods that condense the entire image into a low-dimensional representation. Moreover, the inclusion of the PL module adds a further 1.1% F1 improvement, driven by the importance of local contextual information in CD [30]. Additionally, adopting the CB module adds a 1% F1 improvement, as it balances representation learning across local areas. Within the PL module, our position-matching scheme for constructing local positive pairs achieves a 1% F1 improvement over traditional local-feature similarity matching (F1 score: 0.478), resolving the mismatches that similarity matching suffers from due to the object complexity of RS images.

5

Conclusion

In this paper, numerous multi-temporal urban change images are sampled to make the pre-trained model robust to changes in complex imaging conditions. Furthermore, we decouple the representations of objects of interest and background in instance-level CL, which better models the relationships between instances. Considering the characteristics of the downstream dense CD task, we also introduce and improve the pixel-level CL module to make representation learning over local regions more balanced; it overcomes the mismatch


problem that may be caused when the traditional local feature similarity matching method is applied to RS images. Finally, extensive experiments demonstrate the effectiveness of our proposed method.

References 1. Zhang, M., Liu, Z., Feng, J., Liu, L., Jiao, L.: Remote sensing image change detection based on deep multi-scale multi-attention siamese transformer network. Remote Sens. 15(3), 842 (2023) 2. Bai, T., et al.: Deep learning for change detection in remote sensing: a review. Geo-Spatial Inf. Sci. 26, 262–288 (2022) 3. Cheng, G., Xie, X., Han, J., Guo, L., Xia, G.S.: Remote sensing image scene classification meets deep learning: challenges, methods, benchmarks, and opportunities. IEEE J. Sel. Topics Appl. Earth Obs. Remote Sens. 13, 3735–3756 (2020) 4. Jiang, F., Gong, M., Zheng, H., Liu, T., Zhang, M., Liu, J.: Self-supervised globallocal contrastive learning for fine-grained change detection in VHR images. IEEE Trans. Geosci. Remote Sens. 61, 1–13 (2023) 5. Zhu, X.X., et al.: Deep learning in remote sensing: a comprehensive review and list of resources. IEEE Geosci. Remote Sens. Maga. 5(4), 8–36 (2017) 6. Chen, T., Kornblith, S., Norouzi, M., Hinton, G.: A simple framework for contrastive learning of visual representations. In: International Conference on Machine Learning, pp. 1597–1607. PMLR (2020) 7. Bardes, A., Ponce, J., LeCun, Y.: VICReg: variance-invariance-covariance regularization for self-supervised learning. In: International Conference on Learning Representations (2022) 8. Leenstra, M., Marcos, D., Bovolo, F., Tuia, D.: Self-supervised pre-training enhances change detection in sentinel-2 imagery. In: Del Bimbo, A., et al. (eds.) ICPR 2021. LNCS, vol. 12667, pp. 578–590. Springer, Cham (2021). https://doi. org/10.1007/978-3-030-68787-8 42 9. Manas, O., Lacoste, A., Gir´ o-i Nieto, X., Vazquez, D., Rodriguez, P.: Seasonal contrast: unsupervised pre-training from uncurated remote sensing data. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 9414– 9423 (2021) 10. Li, W., Chen, H., Shi, Z.: Semantic segmentation of remote sensing images with self-supervised multitask representation learning. IEEE J. Sel. Topics Appl. Earth Obs. Remote Sens. 14, 6438–6450 (2021) 11. Li, H., et al.: Global and local contrastive self-supervised learning for semantic segmentation of HR remote sensing images. IEEE Trans. Geosci. Remote Sens. 60, 1–14 (2022) 12. Gu, X., Li, S., Ren, S., Zheng, H., Fan, C., Xu, H.: Adaptive enhanced swin transformer with u-net for remote sensing image segmentation. Comput. Electr. Eng. 102, 108223 (2022) 13. Fang, S., Li, K., Shao, J., Li, Z.: SNUNET-CD: a densely connected siamese network for change detection of VHR images. IEEE Geosci. Remote Sens. Lett. 19, 1–5 (2021) 14. Miyai, A., Yu, Q., Ikami, D., Irie, G., Aizawa, K.: Rethinking rotation in selfsupervised contrastive learning: adaptive positive or negative data augmentation. In: Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, pp. 2809–2818 (2023)


15. Wang, H., Yao, M., Jiang, G., Mi, Z., Fu, X.: Graph-collaborated auto-encoder hashing for multiview binary clustering. IEEE Trans. Neural Netw. Learn. Syst. (2023) 16. Wang, H., Peng, J., Fu, X.: Co-regularized multi-view sparse reconstruction embedding for dimension reduction. Neurocomputing 347, 191–199 (2019) 17. Feng, L., Meng, X., Wang, H.: Multi-view locality low-rank embedding for dimension reduction. Knowl.-Based Syst. 191, 105172 (2020) 18. He, K., Fan, H., Wu, Y., Xie, S., Girshick, R.: Momentum contrast for unsupervised visual representation learning. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 9729–9738 (2020) 19. Chen, X., He, K.: Exploring simple siamese representation learning. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 15750–15758 (2021) 20. Pang, B., Zhang, Y., Li, Y., Cai, J., Lu, C.: Unsupervised visual representation learning by synchronous momentum grouping. In: Avidan, S., Brostow, G., Ciss´e, M., Farinella, G.M., Hassner, T. (eds.) ECCV 2022. LNCS, vol. 13690, pp. 265–282. Springer, Cham (2022). https://doi.org/10.1007/978-3-031-20056-4 16 21. Wang, X., Zhang, R., Shen, C., Kong, T., Li, L.: Dense contrastive learning for self-supervised visual pre-training. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 3024–3033 (2021) 22. Chen, H., Zao, Y., Liu, L., Chen, S., Shi, Z.: Semantic decoupled representation learning for remote sensing image change detection. In: IGARSS 2022–2022 IEEE International Geoscience and Remote Sensing Symposium, pp. 1051–1054. IEEE (2022) 23. Drusch, M., et al.: Sentinel-2: esa’s optical high-resolution mission for GMES operational services. Remote Sens. Environ. 120, 25–36 (2012) 24. Gorelick, N., Hancher, M., Dixon, M., Ilyushchenko, S., Thau, D., Moore, R.: Google earth engine: planetary-scale geospatial analysis for everyone. Remote Sens. Environ. 202, 18–27 (2017) 25. Hafner, S., Ban, Y., Nascetti, A.: Unsupervised domain adaptation for global urban extraction using sentinel-1 sar and sentinel-2 msi data. Remote Sens. Environ. 280, 113192 (2022) 26. Daudt, R.C., Le Saux, B., Boulch, A., Gousseau, Y.: Urban change detection for multispectral earth observation using convolutional neural networks. In: IGARSS 2018–2018 IEEE International Geoscience and Remote Sensing Symposium, pp. 2115–2118. IEEE (2018) 27. Benedek, C., Szir´ anyi, T.: Change detection in optical aerial images by a multilayer conditional mixed markov model. IEEE Trans. Geosci. Remote Sens. 47(10), 3416– 3430 (2009) 28. Meyer, G.E., Neto, J.C.: Verification of color vegetation indices for automated crop imaging applications. Comput. Electron. Agric. 63(2), 282–293 (2008) 29. He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 770–778 (2016) 30. Ailimujiang, G., Jiaermuhamaiti, Y., Jumahong, H., Wang, H., Zhu, S., Nurmamaiti, P.: A transformer-based network for change detection in remote sensing using multiscale difference-enhancement. Comput. Intell. Neurosci. 2022 (2022)

Shadow Detection of Remote Sensing Image by Fusion of Involution and Shunted Transformer Yifan Wang1(B) , Jianlin Wang1 , Xian Huang1,2 , Tong Zhou1 , Wenjun Zhou1 , and Bo Peng1 1

School of Computer Science, Southwest Petroleum University, Chengdu, China [email protected] 2 Sichuan Xinchuan Aviation Instrument Co., Ltd., Guanghan, China

Abstract. In the field of remote sensing image analysis, this paper addresses the challenges of accurately detecting edge details and small shadow regions through the introduction of STAISD-Net. This end-to-end network synergistically combines the capabilities of CNN and Transformer to enhance shadow detection accuracy. Our network architecture incorporates several innovations. In the encoder, we propose an improved multiscale asymmetric involution to extract detailed features across multiple scales. Additionally, we improve the Shunted Transformer to extract global features, generating global feature information at four scales. In the decoder, we employ bilinear interpolation for upsampling and skip connections for fusing high-level and low-level image features. Finally, we generate shadow masks by integrating feature maps of different scales. Comprehensive experiments on the Aerial Imagery dataset for Shadow Detection (AISD) validate the effectiveness of STAISD-Net, showing that, compared with several state-of-the-art methods, our method achieves higher accuracy and produces shadow detection results that are more consistent with actual shadows.

Keywords: Shadow detection · Remote sensing image · Involution · Shunted Transformer · Multiscale feature

1 Introduction

Shadows in satellite remote sensing images are an inherent phenomenon, cast when direct sunlight encounters obstructions such as vehicles, trees, or architectural structures, resulting in shadows on roads, rooftops, or other surfaces [1]. The presence of shadows introduces substantial complexities in the processing of high-spatial-resolution remote sensing images, thereby necessitating shadow detection as a critical precursor to shadow removal. Numerous research endeavors aim to mitigate the detrimental impacts of shadows in remote sensing images. The ubiquitous presence of shadows in these images leads to information loss and


disrupts grayscale continuity, hindering the interpretation and limiting application potential [2,3]. Tasks of significant importance, such as building identification, super-resolution reconstruction, and object tracking [4,5], require precise shadow detection to minimize the influence of shadows [6,7]. Therefore, the development of an effective shadow detection methodology is of paramount importance in the field of remote sensing image analysis. Shadow detection in imagery is a specialized task within the broader context of semantic segmentation or pixel classification, essentially constituting a binary classification problem. Unlike natural image datasets, which often contain sparse information, remote sensing images are characterized by their rich information content. However, the inherent complexity of remote sensing images presents unique challenges. These images exhibit significant variations in object size and color texture, even among objects belonging to the same class. This contrasts sharply with natural images, adding an additional layer of complexity to shadow detection. In addition, shadows in remote sensing images are not uniformly distributed but are scattered across the image, displaying a wide range of shapes and sizes. This variability further complicates the task of shadow detection. In the context of remote sensing images, the shadow detection of prominent objects such as vegetation or buildings is typically straightforward due to their large size and distinct features. However, the identification of edge information for smaller objects, e.g. utility poles situated in corners, poses a more significant challenge. This is primarily due to their smaller size and the complex interplay of light and shadow in these regions. The application of deep learning-based shadow detection methods within the domain of remote sensing images has been constrained due to the scarcity of relevant datasets. It was not until 2020 that the first public dataset in this domain, the Aerial Imagery Dataset for Shadow Detection (AISD), was made available. Most networks for shadow detection are built on the principles of Convolutional Neural Networks (CNNs) and Residual Networks, primarily using an encoder-decoder framework for end-to-end feature extraction and training. Within the encoder, CNNs are leveraged for feature extraction, systematically reducing the resolution of the feature map to enrich the semantic information. Conversely, the decoder employs CNNs to utilize the encoded features as input, decoding them to generate the final segmentation prediction results. However, these methodologies have limitations in identifying prominent shadows and are less effective at segmenting object edges. In this study, we introduce a pioneering deep learning approach that synergistically integrates CNN and Transformer [8], specifically designed to enhance the efficiency and accuracy of shadow detection in remote sensing images. The primary contributions of this research are three-fold: 1. By harnessing the strengths of both CNN and Transformer, our network retains rich global and local features, thus significantly improving the efficiency and accuracy of shadow detection in remote sensing images. 2. We leverage the Transformer’s capability to extract global features from images, overcoming the inherent limitations of CNN’s receptive fields that impede the capture of comprehensive global information.


3. We design Multiscale Asymmetric Involution (MAI) modules by substituting conventional convolution with involution to amalgamate multiscale feature information, thereby capturing local image details and forming effective contextual information.

2

Related Work

Currently, there have been various developments in shadow detection methods, which can be mainly divided into three categories: geometry-based methods, threshold-based methods, and machine learning and deep learning-based methods [9]. Geometry-Based Methods: These methods necessitate prior knowledge about the environment and sensors, heavily relying on prior positioning. Additionally, shadow extraction methods based on physical models are computationally complex and have high data requirements, making them challenging to apply in practice [10,11]. Threshold-Based Methods: While simple and efficient, these methods overlook the spatial correlation between pixels, complicating the distinction between dark objects illuminated by sunlight and bright shadow objects [12,13]. Machine Learning-Based Methods: These methods primarily rely on the basic visual features of non-shadow pixels for shadow recognition, but they ignore the spatial correlation between adjacent pixels. Machine learning methods struggle to differentiate between dark objects and bright shadow objects, resulting in poor performance in remote sensing image processing [14]. Deep Learning-Based Methods: In recent years, CNNs [15–17] have been widely used in image processing due to their excellent feature extraction capability and ability to automatically extract image information, overcoming the limitations of traditional methods requiring prior knowledge, cumbersome steps, and poor performance. In the field of semantic segmentation, CNNs, including fully convolutional network (FCN) [18], U-Net [19], and SegNet [20], have achieved commendable results. In recent years, deep learning-based methods have also been introduced to shadow detection in high-resolution images. In 2020, Jin et al. [21] proposed a method combining spatial attention modules and adaptive weighted binary cross-entropy loss function to enhance shadow detection effectiveness. Luo et al. [22] proposed a deep supervised convolutional neural network (DSSDNet) and established a remote sensing shadow detection dataset (AISD) to provide training data for deep learning shadow detection. In 2021, Liu et al. [23] proposed a multiscale space-based shadow detection network with an attention mechanism (MSASDNet) to address the problem of incomplete shadows.


Ufuktepe et al. [24] proposed a shadow mask estimation method that helps capture global contextual information better. However, deep learning-based methods still face challenges in handling shadows of varying scales in remote sensing images, particularly when the shadows are scattered, small, irregular, and randomly distributed. Although these methods have advanced the state-of-the-art, all of them still inevitably generate wrong edges for small shadow regions because they fail to obtain rich edge details correctly. Besides, they paid no attention to forming effective contextual information, which leads to inaccurate shadow detection as well. Different from these methods, our method demonstrates efficiency and accuracy in this aspect.

3 Methods

3.1 Proposed STAISD-Net

We introduce STAISD-Net, a comprehensive end-to-end framework for shadow detection in remote sensing images, ingeniously integrating CNN [25] and Shunted Transformer. The architecture of the network, illustrated in Fig. 1, is bifurcated into an encoder and a decoder, each encompassing four stages that progressively extract image features from superficial to profound levels. The encoder of our network adeptly captures high-level semantic features by diminishing the dimensions of the input feature map, thereby facilitating the effective extraction of shadow-specific feature information embedded within the image. Each stage within the encoder amalgamates MAI and Shunted Transformer to concurrently extract local and global shadow image features. Every MAI layer is processed through Batch Normalization (BN) and ReLU activation function, further refining the image by filtering noise and smoothing via a 3×3 filter. The MAI module actualizes multiscale asymmetric involution through a combination of involution, asymmetric convolution blocks, and feature fusion. Notably, involution supplants the traditional pooling layer, thus reducing computational burden while attenuating information loss and preserving the image’s

Fig. 1. The architecture of proposed STAISD-Net framework.


low-resolution information. The convolution kernel employed has a size of 3×3, a stride of 2, and padding of 1, thereby effectively extracting intricate local feature details and contextual information from the image.

The decoder of our network combines low-level details and high-level semantic information from different-scale feature maps through skip connections. It comprises four stages, each incorporating a fused MAI module. Each fused MAI module is succeeded by a convolution operation with a 3×3 kernel and a stride of 2 to further amplify the feature map. In the terminal module, a 1×1 convolution layer coupled with a sigmoid activation function is employed for thresholding, culminating in the final binary segmentation image that represents the shadow distribution. The ultimate output of the network is the shadow image.

3.2 Improved Multiscale Asymmetric Involution

The MAI introduces an enhanced asymmetric involution structure. As depicted in Fig. 2, our approach replaces conventional convolutions with involution, which possesses symmetric and localized properties. The square involution kernel captures features at varying scales, and our framework employs asymmetric involution to approximate the square kernel, thus promoting model compression and acceleration. The cross-shaped receptive field of involution also reduces the capture of redundant feature information.

Fig. 2. The internal structure of involution.

In this study, we introduce an improved MAI module constituted by multiscale asymmetric involution, as illustrated in Fig. 3. The module captures image features across different scales via four branches (1×3 horizontal involution kernel, 3×1 vertical involution kernel, 1×1 involution, and 5×5 involution), resulting in a cruciform receptive field. The 5×5 convolution is tasked with capturing large-scale features, while the horizontal and vertical kernels ensure the significance of features within the network structure and broaden the network’s width. Each branch is followed by a normalization layer BN and a ReLU activation function. Ultimately, the feature maps generated by the four branches are merged to obtain features encompassing more contextual information.


Fig. 3. Improved asymmetric MAI convolution block.

After multiple convolutions and fusions, the cascaded features may contain noise. Consequently, an initial fusion is required, utilizing a 3×3 convolution as a filter to diminish the parameter count and prevent overfitting. The formula is expressed as
$$f_{(P,Q)}^{m}(V) = \mathrm{ReLU}\left(w_{(P,Q)}^{m} \otimes V + b_{(P,Q)}^{m}\right), \qquad (1)$$
where $V$ is the input feature, $f_{(P,Q)}^{m}(V)$ represents the feature produced by $w_{(P,Q)}^{m}$, the $m$-th convolution kernel of size $P \times Q$, $b_{(P,Q)}^{m}$ denotes the bias of the $m$-th convolution kernel, and ReLU signifies the rectified linear activation function. The features after initial fusion are
$$F_{fir}^{m}(V) = f_{(3,3)}^{m}\left(C\left(f_{(1,1)}^{m}(V),\, f_{(1,3)}^{m}(V),\, f_{(3,1)}^{m}(V),\, f_{(5,5)}^{m}(V)\right)\right), \qquad (2)$$

where $F_{fir}^{m}$ denotes the initial feature following the MAI module, and $C$ symbolizes feature fusion. Subsequently, involution with a kernel of 2 is applied to the initial feature $F_{fir}^{m}$ to amplify effective contextual information, thereby obtaining more discriminative features across various scales. Global features from the corresponding layer of the Shunted Transformer are then input and combined with the pooled features to acquire more image-scale information. After several rounds of convolutions and fusions, the cascaded features may exhibit noise; as a result, refinement is performed once more with a 3×3 filter kernel on the MAI features, which also reduces the feature dimensions and averts overfitting. The improved MAI module is expressed as
$$F_{MAI}^{m}(V) = f_{(3,3)}^{m}\left(C\left(I\left(F_{fir}^{m}(V)\right),\, F_{con}^{m}\right)\right), \qquad (3)$$
where $I$ indicates involution with a stride of 2, padding of 1, and a kernel of 2, $F_{con}^{m}$ represents the regular features, and $F_{MAI}^{m}(V)$ represents the improved MAI module features. The MAI feature is then input into the following MAI module


for further abstraction using the next regular scale, ultimately encoding robust context across various scales.
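A rough PyTorch sketch of the multi-branch structure described in this subsection is given below. It is an approximation under stated assumptions: standard convolutions stand in for involution (which is not a built-in PyTorch operator), and channel counts are illustrative; only the 1×3, 3×1, 1×1, and 5×5 branches, the per-branch BN+ReLU, and the 3×3 fusion follow the text.

```python
# Hedged sketch of the multi-branch MAI idea (not the authors' code).
import torch
import torch.nn as nn

class MAIBlock(nn.Module):
    def __init__(self, in_ch, branch_ch):
        super().__init__()
        def branch(kernel, padding):
            return nn.Sequential(
                nn.Conv2d(in_ch, branch_ch, kernel, padding=padding, bias=False),
                nn.BatchNorm2d(branch_ch),
                nn.ReLU(inplace=True),
            )
        self.b1x1 = branch(1, 0)
        self.b1x3 = branch((1, 3), (0, 1))   # horizontal kernel
        self.b3x1 = branch((3, 1), (1, 0))   # vertical kernel
        self.b5x5 = branch(5, 2)             # large-scale features
        # 3x3 fusion acts as a filter over the concatenated branches (cf. Eq. 2).
        self.fuse = nn.Sequential(
            nn.Conv2d(4 * branch_ch, branch_ch, 3, padding=1, bias=False),
            nn.BatchNorm2d(branch_ch),
            nn.ReLU(inplace=True),
        )

    def forward(self, x):
        feats = torch.cat([self.b1x1(x), self.b1x3(x), self.b3x1(x), self.b5x5(x)], dim=1)
        return self.fuse(feats)

# Example: fuse a 64-channel feature map into 32 channels.
# y = MAIBlock(64, 32)(torch.randn(1, 64, 56, 56))   # -> (1, 32, 56, 56)
```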

3.3 Improved Shunted Transformer

Our STAISD-Net integrates an enhanced Shunted Transformer [26] to extract global information from shadow regions in remote sensing images. The refined Shunted Transformer network architecture, shown in Fig. 4, comprises a normalization layer, multi-head self-attention, residual connections, activation functions, and feed-forward network layers. The network employs normalization and regularization techniques to stabilize the data distribution. This refined architecture effectively leverages high-resolution spatial information from CNN features and global semantic information from Shunted Transformers, consequently facilitating the capture of global context information.

Fig. 4. Improved Shunted Transformer.

The introduction of self-attention modules helps the model better understand the semantic information of different regions in the image. The self-attention structure is hierarchically divided into three branches: query, key, and value:
$$\begin{cases} Q_i = X W_i^{Q} \\ K_i,\, V_i = \mathrm{MTA}(X, r_i)\, W_i^{K},\ \mathrm{MTA}(X, r_i)\, W_i^{V} \\ V_i = V_i + \mathrm{LE}(V_i) \end{cases} \qquad (4)$$
In the above equation, $Q$, $K$, and $V$ represent the query, key, and value matrices, respectively, and $X$ signifies the input features. $i$ denotes the number of scales in $K$ and $V$; MTA (Multiscale Token Aggregation) refers to the feature aggregation module with downsampling rate $r_i$ (implemented via strided convolution); and LE represents a depth-wise separable convolution layer. The updated multi-head self-attention sub-module is as follows:

$$\mathrm{Attention}(Q, K, V) = \mathrm{Softmax}\left(\frac{QK^{T}}{\sqrt{d}} + B\right) V. \qquad (5)$$


It is utilized to enhance the connections between adjacent pixels in $V$, thus enabling the model to effectively integrate rich local and global feature information.
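Equation (5) can be written compactly in code. The snippet below is a generic sketch of scaled dot-product attention with an additive bias; the tensor shapes and the origin of $B$ (e.g., a relative-position bias) are assumptions, and it does not reproduce the multi-scale token aggregation of the full module.

```python
# Minimal sketch of Eq. (5): softmax(QK^T / sqrt(d) + B) V.
import torch

def attention_with_bias(Q, K, V, B):
    """Q, K, V: (batch, heads, tokens, d); B: (heads, tokens, tokens)."""
    d = Q.size(-1)
    scores = Q @ K.transpose(-2, -1) / d ** 0.5 + B   # QK^T / sqrt(d) + B
    return torch.softmax(scores, dim=-1) @ V          # softmax(...) V
```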

3.4 Optimization

In practice, the loss value is calculated by averaging the loss across all samples in the batch, and the computed loss is subsequently used to update the model's weights. However, an imbalanced sample distribution, characterized by a large proportion of negative samples, can bias the model toward predicting negative samples, potentially deteriorating the network's detection performance. To address this issue, we employ weighted cross-entropy to balance the weight difference between positive and negative samples, and we precompute the weights from the samples to avoid calculating them for each batch. The loss function is expressed as follows:
$$Loss = -\frac{1}{N}\sum_{i=1}^{N} \Big[\, y_i \log p(y_i) + (1 - y_i)\log\big(1 - p(y_i)\big) \Big], \qquad (6)$$

where $y_i \in \{0, 1\}$ represents the label of the $i$-th pixel (0 for non-shadow, 1 for shadow), and $p(y_i)$ is the probability that the $i$-th pixel is predicted to be a shadow pixel.
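As a concrete illustration of the class-balancing idea, the sketch below uses PyTorch's built-in weighted binary cross-entropy with a positive-class weight precomputed once from the training masks; the specific weighting scheme is an assumption, since the text does not give its exact formula.

```python
# Hedged sketch of the weighted cross-entropy setup (not the authors' code).
import torch
import torch.nn as nn

def build_loss(train_masks):
    """train_masks: tensor of 0/1 shadow labels used to precompute the weight."""
    n_pos = train_masks.float().sum()
    n_neg = train_masks.numel() - n_pos
    pos_weight = n_neg / n_pos.clamp(min=1)       # up-weight the scarce shadow pixels
    return nn.BCEWithLogitsLoss(pos_weight=pos_weight)

# criterion = build_loss(all_training_masks)
# loss = criterion(logits, target_masks.float())  # logits: raw (pre-sigmoid) outputs
```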

4 Experiments

4.1 Dataset and Evaluation Metrics

The network proposed in this paper is evaluated for shadow detection performance on the AISD dataset, which contains 514 remote sensing images with resolutions ranging from 256×256 to 1688×1688 pixels. To enhance the model’s generalization ability and enrich the training data, we use data augmentation to process the dataset images (shifting, scaling, rotation, flipping, and sliding window cropping). After this step, we get 12936 images with the same resolution of 256×256, in which 80% are used for training, 10% are used for validation and 10% are used for testing, respectively. We use accuracy, precision, recall, F 1 score, and BER (Balance Error Rate) as indicators to evaluate the model’s performance. The true positive (T P ) sample refers to the cumulative count of correctly recognized shadow pixels. The false negative (F N ) represents the cumulative count of shadow pixels mistakenly identified as non-shadow pixels. The false positive (F P ) indicates the number of nonshadow pixels incorrectly classified as shadow pixels. The true negative (T N ) is the accurate identification of non-shadow pixels. BER is a commonly used


metric for shadow detection evaluation. The definitions of accuracy, precision, recall, F1 score, and BER are as follows:

Accuracy = (TP + TN) / (TP + TN + FP + FN)    (7)
Precision = TP / (TP + FP)    (8)
Recall = TP / (TP + FN)    (9)
F1 = (2 × Precision × Recall) / (Precision + Recall)    (10)
BER = 1 − 0.5 × (Precision + Recall)    (11)
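The metrics in Eqs. (7)-(11) can be computed directly from accumulated pixel counts, for example:

```python
# Direct transcription of Eqs. (7)-(11); tp, tn, fp, fn are pixel counts
# accumulated over the test set.
def shadow_metrics(tp, tn, fp, fn):
    accuracy = (tp + tn) / (tp + tn + fp + fn)
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    f1 = 2 * precision * recall / (precision + recall)
    ber = 1 - 0.5 * (precision + recall)   # as defined in Eq. (11)
    return dict(accuracy=accuracy, precision=precision, recall=recall, f1=f1, ber=ber)
```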

Table 1. Quantitative comparison (%) of shadow detection on AISD.

Methods   | Accuracy | Precision | Recall | F1     | BER
FCN       | 95.719   | 95.416    | 84.487 | 89.620 | 10.049
SegNet    | 94.536   | 91.774    | 82.891 | 87.107 | 12.668
U-Net     | 95.058   | 95.971    | 80.478 | 87.544 | 11.776
Swin-T    | 96.405   | 90.975    | 91.903 | 91.437 | 8.561
Shunted-T | 96.632   | 93.351    | 90.832 | 92.074 | 7.909
CADNet    | 96.936   | 94.037    | 91.239 | 92.617 | 7.362
MSASDNet  | 96.996   | 93.455    | 92.303 | 92.875 | 7.121
Ours      | 97.112   | 92.972    | 93.121 | 93.046 | 6.954

Fig. 5. Comparison of shadow detection results in AISD dataset (a) Input image (b) Ground truth (c) FCN (d) SegNet (e) U-Net (f) Swin Transformer (g) Shunted Transformer (h) CADNet (i) MSASDNet (j) Ours.

4.2 Comparisons

We compare our method with several semantic segmentation networks (see Table 1), including deep learning baselines such as FCN [18], SegNet [20], and U-Net [19], two Transformer networks (Swin Transformer [27] and Shunted Transformer [26]), and two popular shadow detection methods (CADNet [28] and MSASDNet [23]). We use both quantitative and qualitative evaluations. For the quantitative evaluation, given our use of the augmented AISD dataset, we adhere to the standard evaluation protocol and report the accuracy, precision, recall, F1 score, and BER in Table 1, which shows that our method generally outperforms the other seven methods on the AISD dataset; in particular, we obtain the lowest BER (i.e., the best detection result). Our qualitative comparisons are depicted in Fig. 5. All deep learning models perform well when the image is simple and the shadow features are distinct, but when examining the details, our method surpasses the others in detecting small shadows and preserving image smoothness. The predicted shadows may appear slightly larger than the actual shadows, leading to a slight decrease in precision. In Fig. 5(a), for the small rectangular shadow in the lower right, our method's result closely aligns with the ground truth. Overall, our method exhibits fewer omissions and achieves more complete shadow detection.

4.3 Ablation Experiments

In order to further evaluate the effectiveness of each module in our STAISD-Net, we conduct ablation experiments. As shown in Table 2, we compare the performance of our network with different combinations of modules. The evaluation metrics in Table 2 demonstrate the superior performance of our proposed method with all modules enabled, while the red boxes in Fig. 6 indicate that the combination of our modules yields better detection results for small shadows.

Table 2. Ablation study of our methods (%).

Ablations                        | Accuracy | Precision | Recall | F1     | BER
Shunted Transformer              | 96.632   | 93.350    | 90.831 | 92.073 | 7.910
Shunted Transformer + Involution | 96.902   | 93.161    | 91.677 | 92.413 | 7.581
Involution + MAI                 | 94.936   | 91.669    | 90.223 | 90.940 | 9.054
Ours                             | 97.112   | 92.981    | 93.121 | 93.051 | 6.949

Fig. 6. Ablation experiment results: (a) Input image (b) GroundTruth (c) Shunted Transformer (d) Shunted Transformer+Involution (e) Involution+MAI (f) Ours.

5

Conclusion

We propose an end-to-end shadow detection network for remote sensing images, integrating CNN and Transformer. Our STAISD-Net network successfully mitigates the issue of false positives, achieving significant improvements in both accuracy and efficiency. Through experiments on the AISD dataset, we have validated the effectiveness of our method in detecting shadows of small objects, demonstrating superior detection accuracy. Future work will focus on designing a more lightweight version of the Transformer and CNN fusion network to further enhance both efficiency and accuracy. Acknowledgment. This work was supported by the Natural Science Foundation of Sichuan, China (No. 2023NSFSC1393, No. 2023NSFSC0504) and the Scientific Research Starting Project of SWPU (No. 2021QHZ001).

References 1. Azevedo, S., Silva, E., Pedrosa, M.: Shadow detection improvement using spectral indices and morphological operators in urban areas in high resolution images. In: International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences-ISPRS Archives, pp. 587–592 (2015) 2. Li, H., Zhang, L., Shen, H.: An adaptive nonlocal regularized shadow removal method for aerial remote sensing images. IEEE Trans. Geosci. Remote Sens. 52(1), 106–120 (2013) 3. Mo, N., Zhu, R., Yan, L., Zhao, Z.: Deshadowing of urban airborne imagery based on object-oriented automatic shadow detection and regional matching compensation. IEEE J. Sel. Topics Appl. Earth Obs. Remote Sens. 11(2), 585–605 (2018) 4. Luo, S., Shen, H., Li, H., Chen, Y.: Shadow removal based on separated illumination correction for urban aerial remote sensing images. Signal Process. 165, 197–208 (2019) 5. Ok, A.O.: Automated detection of buildings from single VHR multispectral images using shadow information and graph cuts. ISPRS J. Photogramm. Remote Sens. 86, 21–40 (2013)


6. Okabe, T., Sato, I., Sato, Y.: Attached shadow coding: estimating surface normals from shadows under unknown reflectance and lighting conditions. In: 2009 IEEE 12th International Conference on Computer Vision, pp. 1693–1700. IEEE (2009) 7. Tolt, G., Shimoni, M., Ahlberg, J.: A shadow detection method for remote sensing images using VHR hyperspectral and lidar data. In: 2011 IEEE International Geoscience and Remote Sensing Symposium, pp. 4423–4426. IEEE (2011) 8. Zheng, S., et al.: Rethinking semantic segmentation from a sequence-to-sequence perspective with transformers. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 6881–6890 (2021) 9. Alvarado-Robles, G., Osornio-Rios, R.A., Solis-Munoz, F.J., Morales-Hernandez, L.A.: An approach for shadow detection in aerial images based on multi-channel statistics. IEEE Access 9, 34240–34250 (2021) 10. Salvador, E., Cavallaro, A., Ebrahimi, T.: Cast shadow segmentation using invariant color features. Comput. Vis. Image Underst. 95(2), 238–259 (2004) 11. Hsieh, J.W., Hu, W.F., Chang, C.J., Chen, Y.S.: Shadow elimination for effective moving object detection by gaussian shadow modeling. Image Vis. Comput. 21(6), 505–516 (2003) 12. Silva, G.F., Carneiro, G.B., Doth, R., Amaral, L.A., de Azevedo, D.F.: Near realtime shadow detection and removal in aerial motion imagery application. ISPRS J. Photogramm. Remote. Sens. 140, 104–121 (2018) 13. Ma, H., Qin, Q., Shen, X.: Shadow segmentation and compensation in high resolution satellite images. In: IGARSS 2008-2008 IEEE International Geoscience and Remote Sensing Symposium, vol. 2, pp. II–1036. IEEE (2008) 14. Zhang, R., Sun, D., Li, S., Yu, Y.: A stepwise cloud shadow detection approach combining geometry determination and SVM classification for MODIS data. Int. J. Remote Sens. 34(1), 211–226 (2013) 15. Zhu, L., et al.: Bidirectional feature pyramid network with recurrent attention residual modules for shadow detection. In: Proceedings of the European Conference on Computer Vision (ECCV), pp. 121–136 (2018) 16. Hu, X., Fu, C.W., Zhu, L., Qin, J., Heng, P.A.: Direction-aware spatial context features for shadow detection and removal. IEEE Trans. Pattern Anal. Mach. Intell. 42(11), 2795–2808 (2019) 17. Le, H., Vicente, T.F.Y., Nguyen, V., Hoai, M., Samaras, D.: A + D net: training a shadow detector with adversarial shadow attenuation. In: Proceedings of the European Conference on Computer Vision (ECCV), pp. 662–678 (2018) 18. Long, J., Shelhamer, E., Darrell, T.: Fully convolutional networks for semantic segmentation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 3431–3440 (2015) 19. Ronneberger, O., Fischer, P., Brox, T.: U-Net: convolutional networks for biomedical image segmentation. In: Navab, N., Hornegger, J., Wells, W.M., Frangi, A.F. (eds.) MICCAI 2015. LNCS, vol. 9351, pp. 234–241. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-24574-4 28 20. Badrinarayanan, V., Kendall, A., Cipolla, R.: SegNet: a deep convolutional encoder-decoder architecture for image segmentation. IEEE Trans. Pattern Anal. Mach. Intell. 39(12), 2481–2495 (2017) 21. Jin, Y., Xu, W., Hu, Z., Jia, H., Luo, X., Shao, D.: GSCA-UNet: towards automatic shadow detection in urban aerial imagery with global-spatial-context attention module. Remote Sens. 12(17), 2864 (2020) 22. Luo, S., Li, H., Shen, H.: Deeply supervised convolutional neural network for shadow detection based on a novel aerial shadow imagery dataset. ISPRS J. Photogramm. Remote. 
Sens. 167, 443–457 (2020)


23. Liu, D., Zhang, J., Wu, Y., Zhang, Y.: A shadow detection algorithm based on multiscale spatial attention mechanism for aerial remote sensing images. IEEE Geosci. Remote Sens. Lett. 19, 1–5 (2021) 24. Ufuktepe, D.K., Collins, J., Ufuktepe, E., Fraser, J., Krock, T., Palaniappan, K.: Learning-based shadow detection in aerial imagery using automatic training supervision from 3D point clouds. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 3926–3935 (2021) 25. Zhang, Y., Zhang, L., Ding, Z., Gao, L., Xiao, M., Liao, W.H.: A multiscale topological design method of geometrically asymmetric porous sandwich structures for minimizing dynamic compliance. Mater. Des. 214, 110404 (2022) 26. Ren, S., Zhou, D., He, S., Feng, J., Wang, X.: Shunted self-attention via multi-scale token aggregation. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 10853–10862 (2022) 27. Liu, Z., et al.: Swin transformer: hierarchical vision transformer using shifted windows. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 10012–10022 (2021) 28. Yang, Y., Guo, M., Zhu, Q.: Cadnet: top-down contextual saliency detection network for high spatial resolution remote sensing image shadow detection, pp. 4075– 4078 (2021)

Few-Shot Infrared Image Classification with Partial Concept Feature Jinyu Tan1 , Ruiheng Zhang1(B) , Qi Zhang2 , Zhe Cao1 , and Lixin Xu1 1

2

Beijing Institute of Technology, Beijing 100081, China {ruiheng.zhang,3120220172,lxxu}@bit.edu.cn Technical University of Munich, 80333 Munchen, Germany [email protected]

Abstract. Few infrared image samples can deal a catastrophic blow to the recognition performance of a model. Existing few-shot learning methods mostly utilize the global features of the object to classify infrared images; however, their inability to sufficiently extract the most representative features for classification results in degraded recognition performance. To tackle these shortcomings, we propose a few-shot infrared image classification method based on the partial conceptual features of the object. It enables the flexible selection of local features from targets. With these partial features integrated into the concept feature space, the method uses the Euclidean distance as the similarity measure to accomplish infrared target classification. The experimental results demonstrate that our proposed method outperforms previous approaches on a new infrared few-shot recognition dataset. It effectively mitigates the adverse effects caused by background blurring in infrared images and significantly improves classification accuracy.

Keywords: Deep learning · Infrared image · Partial feature · Few-shot classification

1 Introduction

In recent years, infrared target recognition has gained increasing attention due to its affordability, high concealment, and ability to penetrate through smoke and other obstacles [1]. With the vigorous development of deep learning techniques, infrared target recognition based on deep learning has gradually become the mainstream in research. These methods employ end-to-end feature learning to analyze deep nonlinear features at multiple levels in the data, demonstrating strong generalization ability and robustness. However, the effectiveness of deep learning largely relies on the availability of massive annotated training data [2]. Acquiring a large-scale infrared image dataset in complex environments incurs high costs, and the size of collected samples is often insufficient to meet the training requirements of the model.


Motivated by the remarkable rapid learning ability observed in humans, the concept of few-shot learning has emerged as a solution to address the aforementioned challenges [3–5]. Unlike traditional deep learning methods, few-shot learning enables efficient learning and rapid generalization to new tasks, even with a limited number of samples. Consequently, it has attracted significant research attention in recent years, leading to the proposal of numerous innovative solutions. Among these approaches, metric learning stands out as a prominent methodology. The fundamental idea behind metric-based few-shot learning methods is to acquire an embedding function that maps the input space to a distinct embedding space, and then leverage the similarity metric for classification within this embedding space. Distinguished methods in this domain encompass prototype networks [6], matching networks [7], and relation networks [8], all of which have demonstrated promising outcomes. However, these methods primarily rely on global features at the image level for feature embedding, disregarding the crucial utilization of local features for classifying challenging targets. Due to the inherent redundancy in global features, placing excessive emphasis on them introduces confounding contextual information, ultimately misleading the model and resulting in severe degradation of its recognition performance. In addition to this, the global-based approach results in insignificant label correlations in the case of few samples. In human perception, the parts constitute the whole, where an object consists of multiple parts. Each part is a reusable and learnable concept, and combining these concepts determines an object. Given that certain objects are only deterministically distinct from each other in terms of parts with the same semantics, focusing on distinguishing between these concepts contributes to improved recognition performance. It is worth noting that the partial feature attributes possessed by the concepts as well reveal the potential relevance of few label classification. To this end, stimulated by the experience that human beings perceive objects with multiple concept patterns [9], we propose a Partial Representation-based Concept Network (PRCNet) for few-shot classification in infrared images. Specifically, we first extract the concept prototype of the object with the learner, so as to flexibly select the most representative local feature information and eliminate the adverse effects brought by the messy background information of infrared images, then adopt the Euclidean distance as the similarity measure between samples, hence accomplish the classification of the infrared object. The derived approach offers a robust model to recognize distinctive features within a given infrared dataset and increase holistic performance. Upon closer examination, the reasons for this can be attributed to two factors: (a) Partial features are distinguishable between categories, which leads to more accurate classification ability of the model. (b) Partial features provide condensed and reduced the information compared to global features, making it easier for the model to learn. In this paper, our main contributions can be summarized as the following three points:


1. We propose a new method for few-shot infrared image classification, which fully leverages the partial features of the object and resists the disadvantage of cluttered backgrounds.
2. We construct a new infrared image dataset for few-shot learning, which has 17 categories with 50 images each.
3. Sufficient experiments have been conducted on the infrared dataset to demonstrate the performance of the proposed method. The experiments show that it is advantageous for few-shot infrared image classification.

2 Related Work

2.1 Few-Shot Learning

Few-shot learning has received a great deal of attention in the past few years, with most of the mainstream approaches being based on meta-learning. Metalearning-based approaches aim to learn a good initialization of the model with a meta-learner using the learn-to-learn paradigm [10,11]. The kernel ideology is that the model acquires meta-knowledge by performing training on a large number of similar tasks, and learns and applies it quickly on the new tasks. This is helpful for the model to adjust parameters and converge quickly under the condition of a few samples. Some representative methods are referred to [12–15]. MAML [12] and its variants [16–18] are designed to acquire favorable model initialization such that the meta-learner allows for rapid adaptation to new tasks. Among the many few-shot learning algorithms based on meta-learning, metricbased methods are widely used. The basic idea of the metric-based few-shot learning method is to learn a feature embedding space, so that the model in this feature embedding space is more effective in measuring the similarity between samples, with higher similarity between samples of the same category and lower similarity between samples of different categories. The common metric-based methods can be divided into three main types: matching network [7] and their variants [19,20], prototype network [6] and their variants [9,21,22], relation network [8] and their variants [23–26]. 2.2

Partial-Based Methods

Several few-shot learning methods [9,23,24,27–30] are beginning to use local features for image recognition and classification. DN4 [23] learns image class metrics through the measurement of the cosine similarity between the depth part descriptor of each query instance supporting the class and its neighbors. DeepEMD [27] acquires the global distance by solving the optimization problem of the earth movement distance with many-to-many matching between partial patches. ConstellationNet [30] extends CNNs with a constellation model that clusters and encodes the cell features in a dense partial representation.

346

3

J. Tan et al.

Problem Definition

Contrasting with traditional image identification, few-shot learning intends to train an effective model to classify categories that have not been seen before in a way that exploits a handful of annotated samples, which implies that the sample label spaces are mutually exclusive for testing and training. In this paper, a standard few-shot learning setting, the episodic training mechanism [7], is applied to train the model. Supposing the distribution of training and testing tasks is consistent, a series of tasks are randomly selected from the task distribution of the training set, each of which includes a support set and a query set. In this case, the support set separately samples K labelled samples from N various classes, and the query set is composed of q samples per class. Such is the N-way K-shot classification problem in few-shot learning. In the training stage, learning samples of the support set S thereby allows the classification of unknown samples of the query set Q. In a similar way, the task is arbitrarily drawn out of the testing set and subjected to model testing.

4

Method

The model framework proposed in this paper is shown in Fig. 1, which is mainly composed of two modules: feature extraction and similarity measurement. The method enhances the generalization ability of the model via studying along the human interpretable concept dimension. The way to think about it is, assuming that each class of objects is composed of several specific concepts, i.e., the object is regarded as consisting of some partial features, such as the wheels, seat and head of a bicycle. The concept learner defines concept prototypes along each concept dimension which capture the class-level differences in the metric space of the underlying concepts. The PRCNet leverages a concept embedding function parameterized by a neural network, named concept learner, to learn a unique concept embedding for each concept dimension. Then, the concept prototypes are compared with the concept embeddings to assign concept significance scores along each concept dimension. Finally, the concept learner efficiently integrates information from different concept learners and concept prototypes across concept dimensions to complete the classification task. 4.1

Concept Feature Extraction

Supposing that the samples of each category contain a collection of M concepts j as prior knowledge, namely C = {cj }M j=1 , where each concept denoted as c ∈ D (0, 1) is a vector consisting of 0 and 1, and D stands for the dimensionality of the output. In the individual training task, images are partitioned into the ×K and the query set Q = {(xj , yj )}qj=1 , where xi support set S = {(xi , yi )}N i=1 and xj indicate the samples, and yi and yj are the corresponding labels. Dividing the sample space into multiple subspaces of predefined concepts and learning an

Few-Shot Infrared Image Classification with Partial Concept Feature

347

Fig. 1. Architecture of the proposed method. Firstly, the PRCNet leverages the neural network parameterized embedding function to learn the concept embeddings of the samples, then verifies them with the existing concept prototypes to obtain the similarity scores of the concepts in each category, and finally integrates them effectively to achieve the classification. (j)

independent embedding function fθ for each concept. For each category n in (j) the support set, each concept learner learns the concept prototype Pn of its corresponding concept, which is calculated as follows: Pn(j) =

1 | Sn |



fθ (xi ◦ cj ) (j)

(1)

(xi ,yi )∈Sn

where Sn denotes the subset consisting of samples belonging to category n in the support set S, ◦ means Hadamard product, and xi and cj Multiply by bit. Each category n can be represented by the union of M concept prototypes, that (j) is, {Pn }M j=1 . Given a query sample xq , corresponding concept embeddings are gained by exploiting the concepts of the query set samples: Pq(j) = fθ (xq ◦ cj ) (j)

4.2

(2)

Similarity Metric

Mapping the samples to the concept feature space with embedding function, the concept prototypes of the support set samples and the concept embeddings of the query set samples are acquired. To obtain an efficient similarity measure between samples in the concept feature space, we adopt the Euclidean distance to measure the distance between the concept embedding of the query set samples and the concept prototype of each category and aggregate all the conceptual contributions by summing the distances between the concept embedding and the concept prototype. Finally, the model predicts the probability that the query set sample xq belongs to the nth category by converting it to the form of probability through the softmax function.

$$p_{\theta}(y = n \mid x_q) = \frac{\exp\left(-\sum_{j} d\left(f_{\theta}^{(j)}(x_q \circ c^j),\, P_n^{(j)}\right)\right)}{\sum_{n'} \exp\left(-\sum_{j} d\left(f_{\theta}^{(j)}(x_q \circ c^j),\, P_{n'}^{(j)}\right)\right)} \qquad (3)$$

where $f_{\theta}^{(j)}(x_q \circ c^j)$ refers to the concept embedding of the query sample $x_q$ and $d(\cdot, \cdot)$ is the Euclidean distance metric. Eventually, the cross-entropy loss function is deployed to update the model parameters:
$$L_{\theta} = -\lg p_{\theta}(y = n \mid x_q) \qquad (4)$$

5 Experiment

5.1 Infrared Dataset

The infrared image dataset employed for the experiments in this paper has a total of 17 categories, each containing 50 infrared images. Some of the data are obtained from publicly available thermal image datasets, and others are manually captured (Fig. 2).

Fig. 2. Display of some images in infrared dataset. Shown here are various infrared images of vehicles taken on the ground.

Few-Shot Infrared Image Classification with Partial Concept Feature

5.2

349

Experiment Settings

Randomly selected 10 categories of 17 infrared datasets with a total of 500 infrared images as the training set and the remaining 7 categories including 350 infrared images as the testing set, while all the training tasks are carried out with the 5-way K-shot (where K = 1 or K = 5) strategy. Specifically, after randomly selecting 5 categories from the training dataset, K labelled samples from each category are selected as the support set, 16 of the remaining samples are arbitrarily picked as the query set, and then the category labels of the query set samples are inferred based on the support set. All experiments utilize the Adam optimization algorithm [31] to update the model parameters with an initial learning rate of 0.001. Furthermore, we applied various backbone networks for feature extraction, which are specifically Conv4, Conv6, ResNet10, and ResNet18. During the experiments, the size of all input images is standardized to 84 × 84 pixels, as well as image dithering, regularization and other operations are implemented to achieve sample enhancement. For the 5-way 1-shot training task, with epoch set to 150 and batch size set to 85, we randomly select 100 meta-tasks from the query set once every 100 training cycles to verify the classification accuracy of the model for sample data. As for the 5-way 5-shot training task, with the epoch of 100 and batch size of 105, we evaluate the correct classification rate of the model for small sample data by selecting 100 meta-tasks randomly from the query set every 100 training cycles. Suppose the sum of testing samples correctly predicted in 100 meta-tasks is Nright , then the accuracy rate is calculated as: acc = 5.3

Nright 100 × 80

(5)

Ablation Study

With this IR dataset, we verified the effectiveness of each module on PRCNet. For the purpose of equitable comparison, we consistently implemented the standard ResNet18 as the backbone, employed two few-shot experimental setups of 5-way 1-shot and 5-way 5-shot, and conducted an ablation study for both modules of concept feature extraction and similarity metric. Table 1 shows the results of our ablation study on the infrared dataset. According to the table, for 1-shot, it can be seen that with only the partial concept feature extraction module, the average accuracy of the experiment is 58.42%. If there is only the similarity metric module, the average accuracy of the experiment decreases to 53.74%. The accuracy, when both modules are present, which means that PRCNet is used, however, increases to 75.74%. The trend of accuracy change for the 5-shot experiments is similar to that of the 1-shot. Owing to the fact that the conceptual component of the feature extraction module resists the spurious contextual message in the infrared images, more robust and representative partial characteristics are derived. Alternatively, it contributes to the classification by the similarity measure of Euclidean distance that integrates these conceptual characteristics in the embedding space comparatively.


Moreover, to reflect the effectiveness of the two modules in the ablation experiment more visually, we visualized the data with t-SNE, as shown in Fig. 3. Both few-shot settings, 5-way 1-shot and 5-way 5-shot, are displayed; from left to right are the visualized distributions of the data after the backbone network, the partial concept feature extraction, and the similarity metric, respectively. It can be observed that, as modules are added, the distribution of data from the same class becomes more compact.

Table 1. Ablation study results on the infrared dataset. The impact of PRCNet's concept feature extraction module and metric module. ResNet18 was selected as the backbone, with a confidence interval of 95%.

Concept Feature   Similarity Metric   5-way 1-shot (%)   5-way 5-shot (%)
√                                     58.42 ± 1.15       68.91 ± 0.96
                  √                   53.74 ± 1.09       61.46 ± 0.63
√                 √                   75.74 ± 1.04       78.60 ± 0.38

5.4 Analysis of Classification Effect

As the proposed PRCNet belongs to the metric-based few-shot learning methods, we conduct comparison experiments with ProtoNet [6], MatchingNet [7] and RelationNet [8] on the infrared dataset to validate the effectiveness of our method for target recognition in infrared images. The experimental results under four backbone networks and two few-shot task settings are shown in Table 2. PRCNet significantly outperforms the other three typical few-shot learning methods in both the 5-way 5-shot and 5-way 1-shot tasks. The reason is that PRCNet can flexibly introduce the most reliable partial feature attributes into the structured conceptual space of the image; compared with the other three methods, which map the global image, it removes the adverse influence of cluttered contextual information by ignoring the complex background of infrared images. Regarding the choice of feature extraction module, Conv4 performs comparatively well in the 5-way 5-shot task for the prototype network, matching network and relation network, while its improvement in the 5-way 1-shot task is erratic. For PRCNet, the different backbones are stable in both task settings, with ResNet18 performing best, dropping only 3% in the 5-way 1-shot task compared with its 5-way 5-shot accuracy. We suppose this is because a few-shot learning method that maps global images can lose generalization ability if it adopts too deep a structure, whereas the concept learner based on partial feature representation considerably mitigates this effect owing to the higher stability of feature mapping on partial features.
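As a point of reference for how such metric-based classification typically works, the sketch below scores query embeddings against class prototypes with negative squared Euclidean distance, in the spirit of ProtoNet-style methods. It is an illustrative sketch, not the authors' PRCNet implementation, and all tensor names are assumptions.

import torch

def prototype_logits(support, support_labels, query, n_way=5):
    """support: (n_way*k, d) embeddings; query: (q, d) embeddings.

    Returns (q, n_way) logits as negative squared Euclidean distance to each prototype.
    """
    prototypes = torch.stack(
        [support[support_labels == c].mean(dim=0) for c in range(n_way)]
    )                                                  # (n_way, d)
    dists = torch.cdist(query, prototypes, p=2) ** 2   # (q, n_way)
    return -dists                                      # closer prototype -> larger logit

# predicted class of each query sample:
# preds = prototype_logits(support, support_labels, query).argmax(dim=1)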


Table 2. Comparison to prior work. A quantitative comparison of PRCNet with three classical metric-based few-shot learning methods. Average classification accuracy (%) on the testing set with a 95% confidence interval.

Method         Backbone   5-way 1-shot    5-way 5-shot
MatchingNet    Conv4      56.55 ± 1.85    74.33 ± 1.23
MatchingNet    Conv6      58.88 ± 1.60    70.98 ± 1.29
MatchingNet    ResNet10   61.25 ± 1.61    70.77 ± 1.27
MatchingNet    ResNet18   58.10 ± 1.65    70.44 ± 1.23
ProtoNet       Conv4      62.32 ± 1.67    74.08 ± 1.09
ProtoNet       Conv6      59.33 ± 1.73    73.45 ± 1.15
ProtoNet       ResNet10   60.62 ± 1.73    75.08 ± 1.09
ProtoNet       ResNet18   57.39 ± 1.65    74.98 ± 1.14
RelationNet    Conv4      53.98 ± 1.66    71.63 ± 1.29
RelationNet    Conv6      59.58 ± 1.96    70.97 ± 1.22
RelationNet    ResNet10   56.94 ± 1.73    67.90 ± 1.19
RelationNet    ResNet18   53.38 ± 1.82    66.02 ± 1.34
Ours           Conv4      70.04 ± 1.61    76.08 ± 0.63
Ours           Conv6      70.01 ± 1.24    76.86 ± 0.55
Ours           ResNet10   71.76 ± 1.23    78.26 ± 0.42
Ours           ResNet18   75.74 ± 1.04    78.60 ± 0.38

Table 3. Comparison of inference time. Time-consumption tests are performed with the trained models on the testing set under the same conditions, counting the time each model needs to finish one round of inference.

Method         Backbone   5-way 1-shot (ms)   5-way 5-shot (ms)
MatchingNet    Conv4      0.33                0.58
MatchingNet    Conv6      0.51                1.81
MatchingNet    ResNet10   0.38                0.67
MatchingNet    ResNet18   0.29                0.69
ProtoNet       Conv4      0.03                0.04
ProtoNet       Conv6      0.05                0.01
ProtoNet       ResNet10   0.04                0.05
ProtoNet       ResNet18   0.04                0.04
RelationNet    Conv4      1.84                2.11
RelationNet    Conv6      1.84                1.91
RelationNet    ResNet10   4.00                2.86
RelationNet    ResNet18   3.10                2.83
Ours           Conv4      10.01               12.91
Ours           Conv6      9.89                15.43
Ours           ResNet10   12.43               14.35
Ours           ResNet18   13.4                18.60


Fig. 3. The t-SNE visualization of the ablation experiments. From the query set, 50 samples are selected for each category; the clustering of the data in the feature embedding becomes gradually more evident as modules are added. The backbone network is ResNet18.

5.5 Real-Time Analysis

In this section, we systematically evaluate the real-time performance of the proposed model on the infrared dataset and compare it with several exemplary few-shot classification approaches. Specifically, under the same hardware conditions, we run time-consumption tests of the trained models on the testing set and record the time each model needs to complete one round of inference; the results are shown in Table 3. Compared with the three typical metric-based few-shot learning methods, PRCNet consumes the longest time, taking about 15 ms to accomplish the 5-way 5-shot task and about 10 ms for the 5-way 1-shot task. The higher accuracy of its inference therefore comes at the cost of response time, so the trade-off needs to be chosen according to the actual requirements of the application.
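Such per-round latency numbers can be collected with a simple timing loop; the sketch below is a generic way to measure average inference time per episode in PyTorch and is only an assumption about the measurement procedure, not the authors' benchmarking code.

import time
import torch

@torch.no_grad()
def mean_inference_ms(model, episodes, device="cuda"):
    """episodes: iterable of input batches; returns average milliseconds per episode."""
    model.eval()
    start = time.perf_counter()
    n = 0
    for batch in episodes:
        model(batch.to(device))
        n += 1
    if device == "cuda":
        torch.cuda.synchronize()   # wait for queued GPU work before stopping the clock
    return (time.perf_counter() - start) * 1000.0 / max(n, 1)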

6 Conclusion

In this paper, we propose a partial representation-based concept network for infrared image target recognition with few-shot learning. PRCNet uses concept learners to obtain concept prototypes of the support set and concept embeddings of the query set, computes the similarity of query samples to each category of the support set with the Euclidean distance, and thus obtains the probability that a sample belongs to a certain category. The experimental results show that the proposed network performs well but is time-consuming. In future work, we will modify the model, for example with infrared-visible fusion and data augmentation, in order to improve the inference speed and the generalization ability of the model while maintaining or improving accuracy under sample scarcity.

References

1. Jiangrong, X., Fanming, L., Hong, W.: An enhanced single-transmission multiframe detector method for airborne infrared target detection. Acta Optica Sinica 39(6), 0615001 (2019)
2. Antonie, M.L., Zaiane, O.R., Coman, A.: Application of data mining techniques for medical image classification. In: Proceedings of the Second International Conference on Multimedia Data Mining, pp. 94–101 (2001)
3. Jankowski, N., Duch, W., Grabczewski, K.: Meta-Learning in Computational Intelligence, vol. 358. Springer, Heidelberg (2011). https://doi.org/10.1007/978-3-642-20980-2
4. Lake, B.M., Salakhutdinov, R.R., Tenenbaum, J.: One-shot learning by inverting a compositional causal process. Adv. Neural Inf. Process. Syst. 26, 1–9 (2013)
5. Phaphuangwittayakul, A., Guo, Y., Ying, F.: Fast adaptive meta-learning for few-shot image generation. IEEE Trans. Multimedia 24, 2205–2217 (2021)
6. Snell, J., Swersky, K., Zemel, R.: Prototypical networks for few-shot learning. Adv. Neural Inf. Process. Syst. 30, 1–11 (2017)
7. Vinyals, O., Blundell, C., Lillicrap, T., Wierstra, D., et al.: Matching networks for one shot learning. Adv. Neural Inf. Process. Syst. 29 (2016)
8. Sung, F., Yang, Y., Zhang, L., Xiang, T., Torr, P.H., Hospedales, T.M.: Learning to compare: relation network for few-shot learning. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 1199–1208 (2018)
9. Cao, K., Brbic, M., Leskovec, J.: Concept learners for few-shot learning. arXiv preprint arXiv:2007.07375 (2020)
10. Thrun, S.: Lifelong learning algorithms. In: Learning to Learn, pp. 181–209. Springer, Heidelberg (1998). https://doi.org/10.1007/978-1-4615-5529-2_8
11. Vilalta, R., Drissi, Y.: A perspective view and survey of meta-learning. Artif. Intell. Rev. 18, 77–95 (2002)
12. Finn, C., Abbeel, P., Levine, S.: Model-agnostic meta-learning for fast adaptation of deep networks. In: International Conference on Machine Learning, pp. 1126–1135. PMLR (2017)
13. Gidaris, S., Komodakis, N.: Dynamic few-shot visual learning without forgetting. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 4367–4375 (2018)
14. Ravi, S., Larochelle, H.: Optimization as a model for few-shot learning. In: International Conference on Learning Representations (2017)
15. Santoro, A., Bartunov, S., Botvinick, M., Wierstra, D., Lillicrap, T.: Meta-learning with memory-augmented neural networks. In: International Conference on Machine Learning, pp. 1842–1850. PMLR (2016)


16. Antoniou, A., Edwards, H., Storkey, A.: How to train your MAML. arXiv preprint arXiv:1810.09502 (2018)
17. Jamal, M.A., Qi, G.J.: Task agnostic meta-learning for few-shot learning. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 11719–11727 (2019)
18. Sun, Q., Liu, Y., Chua, T.S., Schiele, B.: Meta-transfer learning for few-shot learning. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 403–412 (2019)
19. Cai, Q., Pan, Y., Yao, T., Yan, C., Mei, T.: Memory matching networks for one-shot image recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 4080–4088 (2018)
20. Prol, H., Dumoulin, V., Herranz, L.: Cross-modulation networks for few-shot learning. arXiv preprint arXiv:1812.00273 (2018)
21. Oreshkin, B., Rodríguez López, P., Lacoste, A.: TADAM: task dependent adaptive metric for improved few-shot learning. Adv. Neural Inf. Process. Syst. 31, 1–11 (2018)
22. Xing, C., Rostamzadeh, N., Oreshkin, B., O. Pinheiro, P.O.: Adaptive cross-modal few-shot learning. Adv. Neural Inf. Process. Syst. 32, 1–11 (2019)
23. Li, W., Wang, L., Xu, J., Huo, J., Gao, Y., Luo, J.: Revisiting local descriptor based image-to-class measure for few-shot learning. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 7260–7268 (2019)
24. Li, W., Xu, J., Huo, J., Wang, L., Gao, Y., Luo, J.: Distribution consistency based covariance metric networks for few-shot learning. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 33, pp. 8642–8649 (2019)
25. Hui, B., Zhu, P., Hu, Q., Wang, Q.: Self-attention relation network for few-shot learning. In: 2019 IEEE International Conference on Multimedia & Expo Workshops (ICMEW), pp. 198–203. IEEE (2019)
26. Wu, Z., Li, Y., Guo, L., Jia, K.: PARN: position-aware relation networks for few-shot learning. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 6659–6667 (2019)
27. Zhang, C., Cai, Y., Lin, G., Shen, C.: DeepEMD: few-shot image classification with differentiable earth mover's distance and structured classifiers. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 12203–12213 (2020)
28. Chen, H., Li, H., Li, Y., Chen, C.: Multi-scale adaptive task attention network for few-shot learning. In: 2022 26th International Conference on Pattern Recognition (ICPR), pp. 4765–4771. IEEE (2022)
29. He, J., Kortylewski, A., Yuille, A.: CompAS: representation learning with compositional part sharing for few-shot classification. arXiv preprint arXiv:2101.11878 (2021)
30. Xu, W., Xu, Y., Wang, H., Tu, Z.: Attentional constellation nets for few-shot learning. In: International Conference on Learning Representations (2021)
31. Kingma, D.P., Ba, J.: Adam: a method for stochastic optimization. arXiv preprint arXiv:1412.6980 (2014)

AGST-LSTM: The ConvLSTM Model Combines Attention and Gate Structure for Spatiotemporal Sequence Prediction Learning

Xuechang Wang1, Hui Lv1(B), and Jiawei Chen2

1 Key Laboratory of Advanced Design and Intelligent Computing, Ministry of Education, School of Software Engineering, Dalian University, Dalian 116622, China
[email protected]
2 Department of Computer Science, Old Dominion University, Norfolk, VA 23529, USA

Abstract. Spatiotemporal sequence prediction learning generates one or more future frames by learning from multiple frames of historical input. Most current spatiotemporal sequence prediction methods do not adequately consider the importance of long-term features for spatial reconstruction. Based on the convolutional LSTM (ConvLSTM) network unit, this paper adds a memory storage unit that updates its information through the original memory unit of the ConvLSTM and uses the same zigzag memory flow as the PredRNN network, so that the model can attend to long-term and short-term spatiotemporal features at the same time. An attention module is then proposed to extract important information from the long-term hidden states and aggregate it with the short-term hidden state, expanding the temporal receptive field of the hidden state. The resulting attention gate spatiotemporal LSTM (AGST-LSTM) model further enhances the capacity to capture spatiotemporal correlations. We validate the model on two different prediction tasks, where AGST-LSTM achieves competitive performance compared with the comparison models.

Keywords: Deep Learning · Spatiotemporal Prediction · ConvLSTM

This work is supported by 111 Project (No. D23006), the National Natural Science Foundation of China (Nos. 62272079, 61972266), the Natural Science Foundation of Liaoning Province (Nos. 2021-MS-344, 2021-KF-11-03, 2022-KF-12-14), the Scientific Research Fund of Liaoning Provincial Education Department (No. LJKZZ20220147), the Scientific Research Platform Project of Dalian University (No. 202301YB02), the State Key Laboratory of Synthetical Automation for Process Industries, the State Key Laboratory of Light Alloy Casting Technology for High-end Equipment (No. LACT-006), the Postgraduate Education Reform Project of Liaoning Province (No. LNYJG2022493), the Dalian Outstanding Young Science and Technology Talent Support Program (No. 2022RJ08).

1 Introduction

For the past few years, deep learning has been extensively applied in a variety of complex real-world scenarios. It has been adopted for many kinds of prediction tasks, including video prediction [15,16,18,19], traffic prediction [10], power prediction [17], wind speed prediction [4], stock price prediction [6], and more. As a key application of predictive learning, spatiotemporal sequence prediction, which forecasts multiple future frames from a given sequence of images, has attracted increasing attention in computer vision.

Capturing spatiotemporal correlation is the key to spatiotemporal sequence prediction, and how to better extract temporal and spatial dependence has been the focus of researchers in recent years. Convolutional neural networks (CNNs) [8] extract spatial features through convolution kernels, while recurrent neural networks (RNNs) [14,21] capture temporal dependence through their recurrent structure, so both architectures are naturally suited to spatiotemporal sequence prediction. ConvLSTM [15] combines convolutional operations with a recurrent architecture: it replaces the linear operations in the LSTM [5] with convolutions, giving the model the ability to extract spatiotemporal correlations. The PredRNN network [18,19] adds a spatiotemporal memory unit to the ConvLSTM and proposes a spatiotemporal memory flow in which, in a stacked network, the spatiotemporal memory state of the first layer at the current time step is obtained from the top layer of the previous time step.

In spatiotemporal sequence prediction, short-term characteristics play the major role in the prediction results, but the influence of long-term characteristics cannot be ignored. Long-term features reflect the main trend of change in the overall sequence and allow the model to focus on that overall trend. However, PredRNN and most of its variants fail to grasp the importance of long-term features for image spatial reconstruction. The motivation of this paper is therefore to make the network able to capture long-term features, increase the influence of long-term features on the prediction results, and enhance the model's ability to capture spatiotemporal correlation. Based on the ConvLSTM unit, adding a new memory unit that captures both short-term and long-term spatiotemporal characteristics, and enlarging the temporal receptive field of the hidden state, are the focus of this paper.

Aiming at the shortcoming that existing spatiotemporal sequence prediction models do not pay enough attention to long-term spatiotemporal features, this paper adds a spatiotemporal memory unit that uses the original memory unit to update its information and adopts a stacked network structure based on the ConvLSTM unit, so that the model can extract long-term and short-term features and better capture spatiotemporal correlation. In view of the insensitivity of the hidden state to long-term features, this paper further adds an attention module that aggregates the long-term and short-term hidden states, expanding the temporal receptive field of the hidden state.


Based on the above, a new spatiotemporal prediction module, GM-LSTM, is proposed in this paper. On top of the ConvLSTM network unit, it adds a new memory unit M that interacts with the original temporal memory unit C to store spatiotemporal dependence information. Then, inspired by the MAU [3] network, an attention module is introduced to enlarge the temporal receptive field of the hidden state, forming the final spatiotemporal prediction model, AGST-LSTM. Throughout the AGST-LSTM network, M uses the same spatiotemporal memory flow as the memory unit M in PredRNN, which enables it to effectively attend to long-term and short-term spatiotemporal features. We verify the effectiveness of AGST-LSTM on multiple datasets, and the experiments show that it is more competitive than the comparison models.

The central contributions of this article are as follows:
1. A spatiotemporal memory unit that uses the original memory unit to update its information is added to the ConvLSTM unit, and the GM-LSTM module is proposed.
2. An attention module is added on top of the GM-LSTM module to expand the temporal receptive field of the hidden states, and a new spatiotemporal sequence prediction model, AGST-LSTM, is proposed.
3. The proposed model is competitive compared with the comparison models on multiple datasets.

The remainder of this article is organized as follows: related work is introduced in Sect. 2; the AGST-LSTM method is described in detail in Sect. 3; Sect. 4 presents the experimental results of AGST-LSTM compared with the competing models; and Sect. 5 concludes the paper.

2 Related Work

The previous spatiotemporal sequence prediction learning methods are introduced in this section. Owing to the exceptional capability of RNNs in temporal modeling, RNNs and their variants were naturally the first networks applied to video prediction. Srivastava et al. [16] used an LSTM network to learn video sequences and proposed the FC-LSTM model. For better extraction of spatiotemporal features, Shi et al. [15] proposed the ConvLSTM network, which combines the LSTM with convolution operations and can model time and space jointly. Wang et al. [18,19] proposed the PredRNN network with a Spatiotemporal LSTM (ST-LSTM) unit that, compared with the ConvLSTM unit, adds a memory state in the longitudinal data stream, improving the ability to capture spatiotemporal correlations. Wang et al. [20] used differential signals to mitigate the non-stationarity hidden in spatiotemporal data and proposed the Memory in Memory (MIM) network, which designs a non-stationary module and a stationary module and cascades them in place of the forget gate in the ST-LSTM unit. The RST-LSTM network proposed by Luo et al. [12] addresses the mismatch and inaccurate intensity prediction caused by convolution through a global reconstruction module and a local reconstruction module. Chang et al. [3] argue that accurately predicting the motion information between frames is the key to video prediction and propose the Motion-Aware Unit (MAU). Wu et al. [22] propose MotionRNN, which adapts to spatiotemporal changes by capturing internal variations of motion. Ye et al. [23,24] propose VPTR, a novel video prediction model based on the Transformer, with an effective local spatiotemporal separation attention mechanism that reduces the complexity of the Transformer; however, its performance still lags behind the mainstream RNN-type prediction models.

The PredRNN series of networks mentioned above do not pay enough attention to long-term spatial features. The method proposed in this article uses data from the original temporal memory to update the newly added spatiotemporal memory state, which can capture long-term spatiotemporal features. Then, inspired by the MAU network, an attention mechanism is used to enlarge the temporal receptive field of the hidden state and increase the influence of long-term features on the prediction results.

3 Method

This article adds a memory state to the ConvLSTM unit structure to extract long-term spatiotemporal features and constructs a Gate Memory LSTM (GM-LSTM) module; see Sect. 3.1 for details. Inspired by the MAU network model, this paper then adds an attention module on top of the GM-LSTM module and proposes the AGM-LSTM network module. The final AGST-LSTM network model proposed in this paper consists of stacked AGM-LSTM network units; see Sect. 3.2 for details.

Problem Definition: given a spatiotemporal sequence $X_T = \{X_t \in \mathbb{R}^{C \times H \times W}\}$, where $t = 1, 2, \ldots, T$ and $X_t$ is a frame with $C$ channels, height $H$ and width $W$, it is possible to predict the sequence $\hat{X}_{T'} = \{\hat{X}_{T+t'} \mid t' = 1, 2, \ldots, T'\}$.

3.1 GM-LSTM Module

The GM-LSTM structural unit is designed on the basis of the ConvLSTM network unit, and a new spatiotemporal memory unit M is added compared with the ConvLSTM network. The structure inside the red box in Fig. 1 is the GM-LSTM block. To conveniently prove the effectiveness of the GM-LSTM component in spatiotemporal sequence prediction, this section stacks GM-LSTM units to form a GST-LSTM network model. Figure 2 shows the specific network structure of AGST-LSTM; the GST-LSTM model is formed by stacking the GM-LSTM structural elements in the same way as the AGST-LSTM structure. $C_{t-1}^{l}$ is the time memory transmitted from the previous time slice $t-1$ to the current moment in each GM-LSTM layer, and $C_t^{l}$ is the time memory calculated at the current moment, which will be transmitted to the next moment, where $l$ is the network layer. $H_{t-1}^{l}$ is the hidden state transmitted from the previous moment to the current moment. The input at the current moment is $X_t$ in the first layer; otherwise it is the hidden state $H_t^{l-1}$ produced by the GM-LSTM unit in the previous layer at the current moment.

A gate structure for information update is constructed for the newly added spatiotemporal memory state M, consisting of an input gate $i_t'$ and an input modulation gate $g_t'$. The Hadamard product of $i_t'$ and $g_t'$ is added to the Hadamard product of $1 - i_t'$ and the $M_t^{l-1}$ passed from the previous network layer, giving the spatiotemporal memory state $M_t^{l}$ at the current time. The state transition of $M_t^{l}$ adopts the same zigzag transfer as the PredRNN network structure: the memory state $M_{t-1}^{l=L}$ at the highest level of the prior time slice is transmitted to the memory state $M_t^{l=1}$ of the first layer at the current time slice, where $L$ is the number of stacked network layers. The zigzag transfer of spatial characteristics between the layers of the stacked network efficiently reduces the gradient vanishing phenomenon, and the network considers the spatial inter-correlation of memory states at different levels and different times, which helps the model capture spatiotemporal correlation. $H_t^{l}$ is the hidden state at the current time after calculation: $C_t^{l}$ and $M_t^{l}$ are concatenated in the channel dimension, dimensionally reduced, passed through the nonlinear hyperbolic tangent activation function tanh, and multiplied elementwise with the output gate $O_t$ at the current time. The specific state transfer of the GST-LSTM unit is:

$i_t = \sigma(W_{ix} * X_t + W_{ih} * H_{t-1}^{l})$   (1)

$g_t = \tanh(W_{gx} * X_t + W_{gh} * H_{t-1}^{l})$   (2)

$f_t = \sigma(W_{fx} * X_t + W_{fh} * H_{t-1}^{l})$   (3)

$C_t^{l} = i_t \odot g_t + f_t \odot C_{t-1}^{l}$   (4)

$A_t^{l} = [\,C_{t-1}^{l}, M_t^{l-1}\,]$   (5)

$i_t' = \sigma(W_{i'a} * A_t^{l})$   (6)

$g_t' = \tanh(W_{g'a} * A_t^{l})$   (7)

$M_t^{l} = i_t' \odot g_t' + (1 - i_t') \odot M_t^{l-1}$   (8)

$O_t = \sigma(W_{ox} * X_t + W_{oh} * H_{t-1}^{l} + W_{om} * M_t^{l} + W_{oc} * C_t^{l})$   (9)

$H_t^{l} = O_t \odot \tanh(W_{1 \times 1} * [\,C_t^{l}, M_t^{l}\,])$   (10)

Here, W is the weight, $\odot$ is the Hadamard product, * is the convolution operation, and $\sigma$ is the sigmoid function.
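To make the data flow of Eqs. (1)–(10) concrete, the following PyTorch sketch implements one GM-LSTM step with a single convolution per group of gates. It is a minimal re-implementation written for this text under the notation above (inputs X_t, H_{t-1}^l, C_{t-1}^l, M_t^{l-1}); it is not the authors' released code, and it fuses some of the separate weight matrices of Eqs. (1)–(3) and (9) for brevity.

import torch
import torch.nn as nn

class GMLSTMCell(nn.Module):
    """Minimal sketch of one GM-LSTM step following Eqs. (1)-(10)."""

    def __init__(self, in_ch, hid_ch, k=5):
        super().__init__()
        p = k // 2
        self.conv_xh = nn.Conv2d(in_ch + hid_ch, 4 * hid_ch, k, padding=p)  # i, g, f, o from X_t and H_{t-1}
        self.conv_a = nn.Conv2d(2 * hid_ch, 2 * hid_ch, k, padding=p)       # i', g' from A_t = [C_{t-1}, M^{l-1}]
        self.conv_om = nn.Conv2d(2 * hid_ch, hid_ch, k, padding=p)          # output-gate contribution of M and C
        self.conv_1x1 = nn.Conv2d(2 * hid_ch, hid_ch, 1)                    # W_{1x1} in Eq. (10)

    def forward(self, x, h_prev, c_prev, m_below):
        i, g, f, o = torch.chunk(self.conv_xh(torch.cat([x, h_prev], dim=1)), 4, dim=1)
        i, f, g = torch.sigmoid(i), torch.sigmoid(f), torch.tanh(g)
        c = i * g + f * c_prev                                              # Eq. (4)

        a = torch.cat([c_prev, m_below], dim=1)                             # Eq. (5)
        i2, g2 = torch.chunk(self.conv_a(a), 2, dim=1)
        i2, g2 = torch.sigmoid(i2), torch.tanh(g2)
        m = i2 * g2 + (1 - i2) * m_below                                    # Eq. (8)

        o = torch.sigmoid(o + self.conv_om(torch.cat([m, c], dim=1)))       # Eq. (9), gate terms fused
        h = o * torch.tanh(self.conv_1x1(torch.cat([c, m], dim=1)))         # Eq. (10)
        return h, c, m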


Fig. 1. The specific construction of the AGM-LSTM network unit

3.2 Attention Module

Due to the attention mechanism's ability to focus on key features, this article is inspired by the MAU network structure to design a module that uses attention to aggregate hidden states according to their importance. The GM-LSTM network unit with this module added is called the Attention Gate Memory LSTM (AGM-LSTM) unit, and stacking multiple AGM-LSTM units forms the AGST-LSTM network model. The attention module structure is inside the green box in the left half of Fig. 1. Considering that the input at each moment has the primary impact on the hidden state, it is a good choice to use the correlation between inputs to assign different weights to the long-term hidden states. Let $\tau$ denote the number of time slices used. This paper takes the inputs $X_{t-\tau:t-1}$ of multiple time slices as the long-term input, the current input $X_t$ as the short-term input, the hidden states $H_{t-\tau:t-1}^{l}$ of multiple time slices as the extended hidden state, and the hidden state $H_{t-1}^{l}$ transmitted from the previous moment as the short-term hidden state. In the attention mechanism, Q is the long-term input, K is the short-term input, and the long-term hidden states serve as V. The correlation between the long-term input and the short-term input is used to extract the important information from the long-term hidden states. The short-term hidden state is also important for generating the predicted image at the current moment, so the information extracted from the long-term hidden states is aggregated with the short-term hidden state $H_{t-1}^{l}$ through a gate structure, forming a new hidden state $H_{t-1}^{l\,\prime}$ with the same size as the original hidden state $H_{t-1}^{l}$. The state switching equations of the attention block are:

$B_h = \sigma(W_h * H_{t-1}^{l})$   (11)

$H_{t-1}^{l\,\prime} = B_h \odot H_{t-1}^{l} + (1 - B_h) \odot \mathrm{sum}\big(H_{t-\tau:t-1}^{l} \odot ((X_{t-\tau:t-1} \odot X_t)/\sqrt{d})\big)$   (12)

where $d$ is the product of the dimensions of $X_t$, $\sigma$ is the sigmoid function, and $\tau$ is the length of time. Note that $X_t$ is the current input in the first layer; otherwise it is the hidden state $H_t^{l-1}$ passed up by the AGM-LSTM unit at the current moment, and likewise $X_{t-\tau:t-1}$ is $H_{t-\tau:t-1}^{l-1}$ when not in the first layer. The attention mechanism increases the temporal receptive field of the model's hidden state, increases the persistence of data features, and helps the model extract long-term features.

Afterwards, the new hidden state $H_{t-1}^{l\,\prime}$ generated by the attention module replaces $H_{t-1}^{l}$ in the GM-LSTM network element for calculation, thus forming a new structural unit, the AGM-LSTM, whose framework is shown in Fig. 1. Finally, the final AGST-LSTM model is formed by stacking AGM-LSTM structural elements, as in Fig. 2. This stacked structure enables the network to obtain spatiotemporal correlations at different levels, so that the top-level predicted images benefit from both top-level macro features and low-level detailed features; the model then depicts the future image through the calculation of the detailed and macroscopic features of each layer at the current moment.

Fig. 2. Main Framework Structure of AGST-LSTM
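A rough PyTorch sketch of Eqs. (11)–(12) is shown below: the (scaled) correlation between the long-term inputs and the current input weights the stacked long-term hidden states, and a learned gate B_h blends the result with the short-term hidden state. Tensor shapes are assumptions made for illustration; the authors' implementation may differ.

import torch
import torch.nn as nn

class HiddenStateAttention(nn.Module):
    """Sketch of the hidden-state aggregation in Eqs. (11)-(12)."""

    def __init__(self, hid_ch, k=5):
        super().__init__()
        self.gate_conv = nn.Conv2d(hid_ch, hid_ch, k, padding=k // 2)   # W_h in Eq. (11)

    def forward(self, h_prev, h_long, x_long, x_now):
        # h_prev, x_now: (B, C, H, W); h_long, x_long: (tau, B, C, H, W)
        d = float(x_now[0].numel())                                      # product of X_t's dimensions
        # Eq. (12), read literally: weight each long-term hidden state by the
        # scaled elementwise correlation between its input and the current input.
        corr = x_long * x_now.unsqueeze(0) / d ** 0.5
        h_attn = (h_long * corr).sum(dim=0)                              # sum over the tau time slices
        b_h = torch.sigmoid(self.gate_conv(h_prev))                      # Eq. (11)
        return b_h * h_prev + (1 - b_h) * h_attn                         # Eq. (12)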

4 Experiment

In this section, two different datasets (Moving-MNIST [16], AirTemperature [13]) are used to validate the proposed model. For a fair comparison, all experimental settings are identical: the number of hidden-state channels is set to 64, the convolution kernel size is 5, the batch size is 8, and the number of stacked layers is 4. The Adam optimizer and the MSE loss function are used. The experiments are run on a GTX 2080ti. * indicates results obtained by running the publicly available code.
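As a rough illustration of this common training setup, the snippet below configures an Adam optimizer and MSE loss in PyTorch; the model here is only a stand-in placeholder for the stacked AGST-LSTM (which uses 4 layers, 64 hidden channels and 5 × 5 kernels), not code released with the paper.

import torch
import torch.nn as nn

# Placeholder standing in for the stacked AGST-LSTM described in the text.
model = nn.Conv3d(in_channels=1, out_channels=1, kernel_size=(1, 5, 5), padding=(0, 2, 2))

criterion = nn.MSELoss()                              # MSE loss, as in the paper
optimizer = torch.optim.Adam(model.parameters())      # Adam optimizer

def train_step(inputs, targets):
    """inputs/targets: (batch, channels, time, H, W) video tensors."""
    optimizer.zero_grad()
    loss = criterion(model(inputs), targets)          # predict future frames, compare with MSE
    loss.backward()
    optimizer.step()
    return loss.item()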

4.1 Moving-MNIST Dataset

The Moving-MNIST dataset is widely used in the field of video prediction and consists of image sequences. Each sequence contains 20 frames showing two randomly moving digits, with a frame size of 64 × 64. In this paper, a training set of 10000 sequences randomly generated from the standard MNIST dataset [7] is used, and the test set is the one collected by Srivastava et al. [16]. In this experiment, the first 10 frames are used as input to predict the next 10 frames, and τ is set to 5 following MAU. The comparison between AGST-LSTM and the competing models is shown in Table 1, and Fig. 3 shows a visualization example of the results compared with some of the comparison models. MSE (mean squared error) and SSIM (structural similarity index) are used as evaluation metrics.

Table 1. Assessment results of AGST-LSTM and comparison models on the Moving-MNIST dataset (10 frames by 10 frames).

Method              SSIM    MSE
ConvLSTM [15]       0.707   103.3
PredRNN [18,19]     0.869   56.8
PredRNN-V2 [19]     0.891   48.4
RST-LSTM [12]*      0.907   39.0
LMC-Memory [9]      0.924   41.5
MSPN [11]           0.926   39.5
VPTR [23,24]        0.882   63.6
GST-LSTM (ours)     0.927   31.7
AGST-LSTM (ours)    0.929   31.0

From the results in Table 1, the AGST-LSTM model performs better than the comparison models: its MSE value is 45.4% lower than that of PredRNN, and its SSIM value is 7.0% higher. As can be seen from Fig. 3, the proposed AGST-LSTM also generates relatively clearer images than the comparison models. These experiments demonstrate the effectiveness of the AGST-LSTM model.

Fig. 3. Qualitative results of AGST-LSTM and comparison models on the Moving-MNIST dataset.

4.2 AirTemperature

Weather prediction is closely related to people's lives, and the accuracy of meteorological prediction has strong practical significance. It is another challenging application of spatiotemporal sequence forecasting: continuous multi-frame meteorological images are fed to the prediction model, which extrapolates the upcoming meteorological situation and outputs future meteorological images. AirTemperature [13] is based on the NCEP Global Data Assimilation System (GDAS) provided by NOAA PSL, Boulder, Colorado, USA; the global daily mean air temperature data are available at https://psl.noaa.gov (accessed April 1, 2023). In this paper, the daily mean temperature data at the pressure level of 200 are selected for the experiment. The data from 2011 to 2019 are used as the training set, the data from 2020 as the validation set, and the data from 2021 to 2022 as the test set. To conveniently feed the data to the model, all samples are cropped to 72 × 144 (clipping out the last row of pixels) and recomposed into sequences of length 8, with the first 4 frames used as input to predict the last 4 frames. τ is set to 3 for MAU, STAU, STRPM, and AGST-LSTM in this experiment. MSE and MAE are used as evaluation indices. Table 2 presents the comparison of evaluation results between the AGST-LSTM model and the competing models, and Fig. 4 shows one visualization sample of the test results. From the experimental results, AGST-LSTM performs better on temperature prediction than the comparison models, with lower MAE and MSE values, which validates its effectiveness on the temperature prediction task.

Table 2. Assessment results of AGST-LSTM model and comparison models on the temperature prediction task (4 frames by 4 frames).

Method              MAE    MSE
RST-LSTM [12]*      87.3   1.19
PredRNN-V2 [19]*    91.5   1.25
MAU [3]*            82.5   1.33
STAU [1]*           81.7   1.31
STRPM [2]*          81.5   1.32
AGST-LSTM (ours)    81.0   1.12
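The data preparation described above (cropping to 72 × 144 and cutting the daily fields into length-8 windows with 4 input and 4 target frames) can be sketched roughly as follows; the array names and the assumed 73 × 144 raw grid size are illustrative assumptions, not the authors' preprocessing script.

import numpy as np

def make_sequences(daily_fields, seq_len=8, n_input=4):
    """daily_fields: (num_days, 73, 144) daily temperature grids.

    Returns (inputs, targets) with shapes (N, 4, 72, 144) each.
    """
    cropped = daily_fields[:, :72, :]                  # clip the last latitude row -> 72 x 144
    windows = [cropped[i:i + seq_len] for i in range(len(cropped) - seq_len + 1)]
    windows = np.stack(windows)                        # (N, 8, 72, 144)
    return windows[:, :n_input], windows[:, n_input:]  # first 4 frames predict the last 4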

Fig. 4. Visual results of AGST-LSTM model and comparison models on the temperature prediction task using AirTemperature data.

4.3 Ablation Study

Looking at the data in Table 1, comparing ConvLSTM, PredRNN, and GST-LSTM shows that GST-LSTM improves over both ConvLSTM and PredRNN. This demonstrates the effectiveness of adding to ConvLSTM a memory unit that utilizes the original memory unit to update its information and uses a spatiotemporal memory flow to focus on long-term and short-term features. To illustrate the effectiveness of the attention module, which aggregates the long-term and short-term hidden states to increase the temporal receptive field of the hidden state, we compare the results of the AGST-LSTM and GST-LSTM models on the Moving-MNIST dataset. For easier comparison, the ablation results are included in Table 1 and Fig. 3. The SSIM value of the AGST-LSTM model shows some improvement over the GST-LSTM model, while the MSE value is reduced by 2.2%, demonstrating that the attention module helps improve the performance of the model.

5 Conclusion

On the basis of the ConvLSTM model unit, this paper studies the possibility of adding a spatiotemporal memory state whose information is updated from the original temporal memory state and designs a new network structure unit, GM-LSTM. The memory unit added by GM-LSTM can focus on long-term and short-term spatial features. Inspired by the network structure of MAU, the temporal receptive field of the hidden state is then increased, and a model with an attention module, AGST-LSTM, is designed. Two experiments are carried out to examine the AGST-LSTM model. The experimental outcomes show that updating the spatiotemporal memory state M with information from the original memory cell C of the ConvLSTM unit performs better than the PredRNN network model, and that combining long-term and short-term hidden states with the attention module helps improve the capability of the model by increasing the temporal receptive field of the hidden state. At the same time, comparing GST-LSTM and AGST-LSTM shows that the gain from the attention module is not obvious while its computational cost is relatively large, so in the future we will study how to increase the temporal receptive field more effectively and will experiment with AGST-LSTM on more types of datasets.


References

1. Chang, Z., Zhang, X., Wang, S., Ma, S., Gao, W.: STAU: a spatiotemporal-aware unit for video prediction and beyond. arXiv preprint arXiv:2204.09456 (2022)
2. Chang, Z., Zhang, X., Wang, S., Ma, S., Gao, W.: STRPM: a spatiotemporal residual predictive model for high-resolution video prediction. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 13946–13955 (2022)
3. Chang, Z., et al.: MAU: a motion-aware unit for video prediction and beyond. Adv. Neural. Inf. Process. Syst. 34, 26950–26962 (2021)
4. Gao, Y., Wang, J., Zhang, X., Li, R.: Ensemble wind speed prediction system based on envelope decomposition method and fuzzy inference evaluation of predictability. Appl. Soft Comput. 124, 109010 (2022)
5. Hochreiter, S., Schmidhuber, J.: Long short-term memory. Neural Comput. 9(8), 1735–1780 (1997)
6. Kurani, A., Doshi, P., Vakharia, A., Shah, M.: A comprehensive comparative study of artificial neural network (ANN) and support vector machines (SVM) on stock forecasting. Ann. Data Sci. 10(1), 183–208 (2023)
7. LeCun, Y.: The MNIST database of handwritten digits (1998). http://yann.lecun.com/exdb/mnist/
8. LeCun, Y., Bottou, L., Bengio, Y., Haffner, P.: Gradient-based learning applied to document recognition. Proc. IEEE 86(11), 2278–2324 (1998)
9. Lee, S., Kim, H.G., Choi, D.H., Kim, H.I., Ro, Y.M.: Video prediction recalling long-term motion context via memory alignment learning. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 3054–3063 (2021)
10. Li, M., Zhu, Z.: Spatial-temporal fusion graph neural networks for traffic flow forecasting. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 35, pp. 4189–4196 (2021)
11. Ling, C., Zhong, J., Li, W.: Predictive coding based multiscale network with encoder-decoder LSTM for video prediction. arXiv preprint arXiv:2212.11642 (2022)
12. Luo, C., Xu, G., Li, X., Ye, Y.: The reconstitution predictive network for precipitation nowcasting. Neurocomputing 507, 1–15 (2022)
13. NOAA PSL, Boulder, Colorado, USA: NCEP global data assimilation system GDAS. https://psl.noaa.gov/data/gridded/data.ncep.html. Accessed 1 Apr 2023
14. Rumelhart, D.E., Hinton, G.E., Williams, R.J.: Learning representations by back-propagating errors. Nature 323(6088), 533–536 (1986)
15. Shi, X., Chen, Z., Wang, H., Yeung, D.Y., Wong, W.K., Woo, W.: Convolutional LSTM network: a machine learning approach for precipitation nowcasting. Adv. Neural. Inf. Process. Syst. 28, 802–810 (2015)
16. Srivastava, N., Mansimov, E., Salakhudinov, R.: Unsupervised learning of video representations using LSTMs. In: International Conference on Machine Learning, pp. 843–852. PMLR (2015)
17. Wang, K., Wang, J., Zeng, B., Lu, H.: An integrated power load point-interval forecasting system based on information entropy and multi-objective optimization. Appl. Energy 314, 118938 (2022)
18. Wang, Y., Long, M., Wang, J., Gao, Z., Yu, P.S.: PredRNN: recurrent neural networks for predictive learning using spatiotemporal LSTMs. Adv. Neural. Inf. Process. Syst. 30, 879–888 (2017)


19. Wang, Y., et al.: PredRNN: a recurrent neural network for spatiotemporal predictive learning. IEEE Trans. Pattern Anal. Mach. Intell. 45(2), 2208–2225 (2022)
20. Wang, Y., Zhang, J., Zhu, H., Long, M., Wang, J., Yu, P.S.: Memory in memory: a predictive neural network for learning higher-order non-stationarity from spatiotemporal dynamics. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 9154–9162 (2019)
21. Werbos, P.J.: Backpropagation through time: what it does and how to do it. Proc. IEEE 78(10), 1550–1560 (1990)
22. Wu, H., Yao, Z., Wang, J., Long, M.: MotionRNN: a flexible model for video prediction with spacetime-varying motions. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 15435–15444 (2021)
23. Ye, X., Bilodeau, G.A.: VPTR: efficient transformers for video prediction. In: 2022 26th International Conference on Pattern Recognition (ICPR), pp. 3492–3499. IEEE (2022)
24. Ye, X., Bilodeau, G.A.: Video prediction by efficient transformers. Image Vis. Comput. 130, 104612 (2023)

A Shape-Based Quadrangle Detector for Aerial Images

Chaofan Rao, Wenbo Li, Xingxing Xie, and Gong Cheng(B)

School of Automation, Northwestern Polytechnical University, Xi'an, China
[email protected]

Abstract. The performance of oriented object detectors has been adversely impacted by the substantial variations in object orientation. In this paper, we propose a simple but efficient object detection framework for oriented objects in aerial images, termed QuadDet. Instead of adopting oriented bounding box to represent the object, we directly predict the four vertices of the object’s quadrilateral. Specially, we introduce a fast sorting method for four vertexes of quadrangles, called the Vertex Sorting Function. The function confirms that the vertexes can compose a valid quadrangle by sorting tangents of the vertexes. Furthermore, we employ an efficient polygon IoU loss function, named the PolyIoU Loss Function, to progressively align the predicted quadrangle’s shape with the ground truth. Under these strategies, our model achieves competitive performance. Without bells and whistles, our method with ResNet50 achieves 73.63% mAP on the DOTA-v1.0 dataset running at 23.4 FPS, which surpasses all recent one-stage oriented object detectors by a significant margin. Moreover, on the largest dataset DOTA-v2.0, our QuadDet with ResNet50 obtains 51.54% mAP. The code and models are available at https://github.com/DDGRCF/QuadDet.

Keywords: Oriented Object Detection · Quadrangle Representation · Vertex Sorting · Aerial Images

1 Introduction

Object detection [6,9,10,18,24–26] is fundamental yet challenging in computer vision. It aims to classify and locate objects in images. With significant advancements in deep learning, object detection has not only demonstrated its prowess in various domains such as face detection, autonomous driving, and optical text detection, but it has also laid the foundation for high-level image analysis and applications, including instance segmentation, scene understanding, object tracking, etc. In generic object detection, models commonly apply horizontal bounding boxes to represent objects, which meets the positioning requirements for most objects in natural images. However, aerial images, being captured from a bird's-eye view, often contain objects with diverse orientations and large aspect ratios, such as bridges and harbors. Representing objects with oriented bounding boxes can enable more precise localization, but detectors with oriented bounding boxes suffer from imprecise angle prediction [2–5,16,22,23,28,31,38]. To address this, many researchers have explored quadrangle representations for objects [19,21,30] in aerial images. However, the boundary discontinuity caused by vertex exchange in edge cases hampers the performance of quadrangle-based detection.

Fig. 1. The pipeline of the proposed method. The red quadrangle represents the original prediction, the blue quadrangle signifies the sorted prediction, and the green quadrangle represents the ground truth. (Color figure online)

Most existing quadrangle-based approaches primarily focus on introducing novel quadrangle representations or improving the loss function to mitigate the boundary discontinuity problem. For instance, Gliding Vertex [30] proposes a unique quadrangle representation for the oriented object by sliding the vertexes of the horizontal bounding box (HBB) to predict the quadrangle. While it avoids complex vertex sorting, the boundary discontinuity problem still persists, especially for nearly horizontal objects; moreover, constraining the vertexes to the HBB limits the learning capacity of the model. RSDet [21] alleviates the boundary discontinuity problem by introducing a dynamic loss function, where the regression loss is determined by the distance between the sliding vertexes and the original vertexes of the horizontal box. However, the constraint on the learning capacity remains, and the method increases training time by computing the loss multiple times.

In this paper, we propose a simple yet effective framework for quadrangle prediction, termed QuadDet. Unlike the aforementioned methods, which regard quadrangle regression as a one-to-one length regression between each predicted vertex and the ground truth, QuadDet regresses all quadrangle vertexes as a whole. QuadDet employs a vertex sorting function to ensure that the vertexes form a valid quadrangle without crossed diagonal lines, and then gradually aligns the predicted quadrangle's shape with the ground truth using a specialized polygon IoU loss function. The whole pipeline is illustrated in Fig. 1.

Fig. 2. The architecture of our proposed method is based on the efficient object detector ATSS [39]. To predict the quadrangle, we replace the original 4D convolutional layer with an 8D convolutional layer in the regression branch.

The contributions of this paper are as follows:
– We propose an intuitive shape-based quadrangle detector, QuadDet, that addresses the boundary discontinuity problem;
– Two shape-based functions, the Vertex Sorting Function and the PolyIoU Loss Function, are introduced to achieve fast vertex sorting and better optimization, respectively;
– Extensive experiments demonstrate the effectiveness of QuadDet. QuadDet with ResNet50 achieves 73.63% mAP on DOTA-v1.0 at 23.4 FPS, surpassing all recent one-stage oriented object detectors. Additionally, QuadDet with ResNet50 attains 51.54% mAP on the largest DOTA-v2.0 dataset.

2 Method

2.1 Overview

Our proposed method introduces modifications to enhance the accuracy of quadrangle prediction based on the detector ATSS [39]. As depicted in Fig. 2, our model comprises two main branches: a classification branch and a regression branch. In the classification branch, we employ a convolutional layer with C dimensions (C being the number of categories) and a 1D convolutional layer, which together generate separate predictions for the category probabilities and the centerness.


Fig. 3. The schematic diagram of the search for the convex hull in the PolyIoU Loss Function. Here, Q1 and Q2 are the ground truth and predicted bounding boxes, respectively. v1–v10 are the key points in the intersection of Q1 and Q2.

Our primary contribution lies in the regression branch. To predict the quadrangle, we replace the original 4D convolutional layer with an 8D convolutional layer, which predicts offsets for the vertexes of the quadrangle. The predicted vertex offsets are added to the corresponding anchor points, yielding unsorted vertex coordinates denoted as V_unsorted ∈ R^(H×W×8). To ensure that the vertexes form a valid quadrangle without intersecting diagonal lines, we employ the Vertex Sorting Function, which sorts the vertexes based on a specific criterion and yields sorted vertex coordinates V_sorted ∈ R^(H×W×8), as sketched below. To optimize the model, we utilize a specialized IoU loss function named the PolyIoU Loss Function, which operates on the sorted vertex coordinates and progressively aligns the predicted quadrangle with the ground truth. With these modifications, our method achieves more accurate quadrangle prediction; the whole pipeline is shown in Fig. 1.
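As a concrete illustration of the decoding step described above, the sketch below adds the predicted per-location offsets to the anchor points to obtain the unsorted vertex coordinates; the tensor layout is an assumption made for illustration and is not taken from the released QuadDet code.

import torch

def decode_vertices(offsets, anchor_points, stride):
    """offsets: (H*W, 8) predicted offsets; anchor_points: (H*W, 2) grid centers.

    Returns unsorted vertices of shape (H*W, 4, 2).
    """
    offsets = offsets.view(-1, 4, 2) * stride       # scale offsets to image coordinates
    return anchor_points[:, None, :] + offsets      # broadcast each anchor over its 4 vertices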

2.2 Vertex Sorting Function

The Vertex Sorting Function is a fast shape-based vertex sorting method. Unlike Gliding Vertex, which constrains vertexes to the sides of HBBs, the Vertex Sorting Function only ensures that the four vertexes compose a valid-shaped quadrangle with non-crossing lines. Therefore, our method neither has the boundary discontinuity problem nor limits the learning capacity of the model. The Vertex Sorting Function does not involve complex computation and includes only three steps. First, it obtains the vertex with the smallest Y among the four vertexes of a quadrangle; if there are multiple vertexes with the same smallest Y, the one with the smallest X is taken as the minimum one. After that, it calculates the tangent value of the four vertexes relative to this minimum point. Finally, the tangent values are sorted in descending order, giving four vertexes sorted in clockwise order. These three steps are easily parallelized, so there is only a slight drop in inference speed. The pseudocode of the Vertex Sorting Function is given in Algorithm 1.


Algorithm 1. Vertex Sorting Function
Input: Unsorted vertices V_unsorted ∈ R^(N×8), where N is the number of input quadrangles. EPS is a small constant, e.g. 1e−6. Reshape, Gather, Argmin, Argsort and Atan2 behave as the corresponding functions in PyTorch.
Output: Sorted vertices V_sorted ∈ R^(N×8)
1: V_unsorted ← Reshape(V_unsorted, dim = (N, 4, 2))
2: X ← V_unsorted[:, :, 0], Y ← V_unsorted[:, :, 1]
3: index_Xmin ← Argmin(X, dim = 1)
4: Y_xmin ← Gather(Y, index_Xmin, dim = 1); Y_xmin ← Y_xmin − EPS
5: index_Ymin ← Argmin(Y, dim = 1)
6: X_o ← Gather(X, index_Ymin, dim = 1); Y_o ← Gather(Y, index_Ymin, dim = 1)
7: X_rel ← X − X_o, Y_rel ← Y − Y_o
8: A ← Atan2(Y_rel, X_rel); index_A ← Argsort(A)
9: V_sorted ← Gather(V, index_A, dim = 1); V_sorted ← Reshape(V_sorted, dim = (N, 8))
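For readers who prefer runnable code, the following PyTorch sketch mirrors the spirit of Algorithm 1: it picks a reference vertex (lowest Y, ties broken toward the lowest X via an epsilon nudge) and orders the vertexes by their angle around it. It is a hedged re-implementation written for this text, not the released QuadDet code; whether the resulting order is clockwise depends on the y-axis convention of the coordinates.

import torch

def sort_vertices(v_unsorted, eps=1e-6):
    """v_unsorted: (N, 8) tensor of quadrangle vertices -> (N, 8) sorted vertices."""
    v = v_unsorted.view(-1, 4, 2)
    x, y = v[..., 0], v[..., 1]

    # Tie-break: nudge the Y of the vertex with the smallest X so that,
    # among vertices sharing the minimal Y, the one with smaller X wins.
    y_adj = y.clone()
    idx_xmin = x.argmin(dim=1)
    y_adj[torch.arange(v.size(0)), idx_xmin] -= eps

    # Reference vertex with the smallest (adjusted) Y.
    idx_ymin = y_adj.argmin(dim=1)
    x_o = x.gather(1, idx_ymin[:, None])
    y_o = y.gather(1, idx_ymin[:, None])

    # Order all four vertices by their angle around the reference vertex
    # (descending, following Algorithm 1).
    angles = torch.atan2(y - y_o, x - x_o)
    order = angles.argsort(dim=1, descending=True)
    v_sorted = v.gather(1, order[..., None].expand(-1, -1, 2))
    return v_sorted.view(-1, 8)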

2.3 PolyIoU Loss Function

The PolyIoU Loss Function is a shape-based IoU loss applied to the optimization of quadrangle regression. It directly enforces maximal overlap between the predicted quadrangle and the ground-truth box and jointly regresses all quadrangle vertexes as a whole. For two irregular quadrangles, the polygon formed by their intersection points is likely to be concave, and determining the order of the points of a concave polygon is complicated and time-consuming, so we adopt a polygon IoU calculation based on the convex hull. As shown in Fig. 3, the polygon surrounded by green lines is the union of the two quadrangles Q1 and Q2, while the polygon surrounded by yellow dotted lines is their intersection. We run Graham's scan [11] on both the union and the intersection, obtaining two convex hulls surrounded by the red line and the dotted blue line, respectively, and then compute the areas of the two convex polygons with the shoelace formula [1]. The IoU value of this method lies in [0, 1] and reaches 1 when Q1 and Q2 coincide, the same as the ideal IoU loss.
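To make the geometric ingredients concrete, the sketch below shows a shoelace-formula polygon area and a minimal convex hull (Andrew's monotone chain, a common alternative to Graham's scan) on which such a convex-hull-based IoU can be built. It is an illustrative sketch under those assumptions, not the PolyIoU implementation from the QuadDet repository.

import torch

def shoelace_area(pts):
    """pts: (K, 2) polygon vertices in order -> area via the shoelace formula."""
    x, y = pts[:, 0], pts[:, 1]
    return 0.5 * torch.abs(torch.sum(x * torch.roll(y, -1) - y * torch.roll(x, -1)))

def convex_hull(pts):
    """pts: (K, 2) points -> hull vertices in counter-clockwise order (monotone chain)."""
    pts = pts[torch.argsort(pts[:, 0] * 1e6 + pts[:, 1])]   # approximate lexicographic sort
    def half(points):
        hull = []
        for p in points:
            while len(hull) >= 2:
                a, b = hull[-2], hull[-1]
                # pop while the last two points and p do not make a left turn
                if (b[0] - a[0]) * (p[1] - a[1]) - (b[1] - a[1]) * (p[0] - a[0]) <= 0:
                    hull.pop()
                else:
                    break
            hull.append(p)
        return hull
    lower = half(pts)
    upper = half(torch.flip(pts, dims=[0]))
    return torch.stack(lower[:-1] + upper[:-1])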

Implementation Details

Unless specified, we adopt ResNet50 [14] with FPN [17] as the backbone and use the same hyper-parameters as ATSS. All experiments are conducted on an NVIDIA GeForce RTX 3090 with the batch size of 2. In the training, our loss L is defined as L=

λ1  1  Lcls + Ic >0 Lctr Npos x,y Npos x,y x,y +

, λ2  Icx,y >0 Liou Npos x,y

(1)

A Shape-Based Quadrangle Detector for Aerial Images

373

where Lcls is the focal loss [18], Lctr is binary cross entropy (BCE) loss, and Liou is calculated by Vertex Sorting Function. λ1 and λ2 are the balance weights for Lctr and Liou . Here, we set both λ1 and λ2 as 1. The summation is calculated over all locations on the feature maps. Icx,y >0 is the indicator function, being 1 if the sample is positive and 0 otherwise. Then, we optimize the model by SGD algorithm with the initial learning rate of 0.0025, the momentum of 0.9 and the weight decay of 0.0001. Random horizontal and vertical flipping are applied during training to avoid over-fitting. All experiments run 12 epochs and reduce the learning rate by a factor of 10 at the end of epoch 9 and epoch 11. In the inference, we choose the location with scorex,y > 0.05 as positive samples and get the sorted vertices following the pipeline in Fig. 1.

3 3.1

Experiments Datasets

DOTA-v1.0 [27] is a universal aerial image object detection dataset containing 2,806 images and 188,282 instances of 15 common object classes: Harbor (HA), Storage Tank (ST), Large Vehicle (LV), Swimming Pool (SP), Soccer Ball Field (SBF), Bridge (BR), Ship (SH), Plane (PL), Baseball Diamond (BD), Tennis Court (TC), Ground Track Field (GTF), Basketball Court (BC), Small Vehicle (SV), Roundabout (RA) and Helicopter (HC). DOTAv-v2.0 [8] contains 11268 images and 1793658 instances, covering 18 common object classes, including Harbor (HA), Storage Tank (ST), Large Vehicle (LV), Swimming Pool (SP), Soccer Ball Field (SBF), Bridge (BR), Ship (SH), Plane (PL), Baseball Diamond (BD), Tennis Court (TC), Ground Track Field (GTF), Basketball Court (BC), Small Vehicle (SV), Roundabout (RA), Helicopter (HC), Container Crane (CC), Airport (AI), and Helipad (HP). The more images and more instances make it more challenging than DOTA-v1.0. Since DOTA-v1.0 and DOTA-v2.0 have over 4000 × 4000 super-sized images, we adopt the same preprocessing strategy as some advanced methods (e.g. [7,12,

Fig. 4. The visualization results of ATSS+OBB, ATSS+RSDet and QuadDet. To facilitate better comparison, we have enlarged a portion of the image and positioned it in the upper right corner.

374

C. Rao et al.

13,29]), which crop the images into 1024 × 1024 pathces with the stride of 824. For multi-scale experiments, we first resize original images into three scales (0.5, 1.0, 1.5) and then crop all the images of these three scales into 1024 × 1024 with a stride of 524. On the DOTA-v1.0 and DOTA-v2.0 datasets, the union of the training set and validation set is used for training, and the test set is used for model evaluation. All evaluation results are obtained from the official evaluation server. Table 1. The results of QuadDet when using different detectors as baselines. Here, ATSS+OBB, ATSS+GV and ATSS+RSDet mean that implementing the representations of the oriented bounding box, Gliding Vertex and RSDet in ATSS separately. The red and blue fonts indicate the top two performances respectively. mAP50 and mAP75 are the results obtained at IoU thresholds of 0.5 and 0.75, respectively. Method

GLOPS #Param mAP50 (%) mAP75 (%) FPS

ATSS+OBB

201.95 31.87M 72.28

40.20

ATSS+GV [30]

204.35

32.35M

72.45

42.25

22.9

ATSS+RSDet [21] 204.20

32.30M

72.82

43.02

23.1

202.10 31.88M 73.63

44.48

23.4

QuadDet

3.2

23.7

Ablation Experiments

Baseline: We employ ATSS as our baseline, where the horizontal bounding box are simply replaced with the oriented bounding box with the angle range within [−π/4, 3π/4) [19]. For the model structure, we add a 1D convolutional layer on the last regression feature map of ATSS to predict the oriented bounding boxes. For the sample assignment, we only consider the points which fall in the oriented bounding box as positive sample points. Except for the above changes, other settings keep the same as the original ATSS. Comparisons of Different Box Modeling Methods: In order to verify the effectiveness of our method, we compared it with other representation methods, and the experiments are carried out on DOTA-v1.0. The results are reported Table 2. Comparisons of baseline and QuadDet implementation of different detectors. Here, baselines are based on oriented bounding box representation. Method

Table 2. Comparison of the baselines and their QuadDet implementations for different detectors. Here, the baselines are based on the oriented bounding box representation.

Method               Baseline mAP (%)   QuadDet mAP (%)   Baseline FPS   QuadDet FPS
RetinaNet [18]       68.43              69.45 (+1.02)     19.8           19.5 (−0.3)
FCOS [26]            71.71              72.71 (+1.00)     23.8           23.4 (−0.4)
ATSS [39]            72.28              73.63 (+1.35)     23.7           23.4 (−0.3)
Oriented RCNN [29]   75.87              76.01 (+0.14)     15.8           15.6 (−0.2)


Table 3. Comparison with state-of-the-art methods on the DOTA-v1.0 dataset. R50FPN stands for ResNet50 with FPN (likewise R101-FPN). ATSS-O means ATSS modified by adding angle prediction. All Results with † are reported at MMRotate [40]. * means multi-scale training and testing with rotation augmentation. The red and blue fonts indicate the top two performances respectively. Method

Backbone

PL

BD

BR

GTF

SV

LV

SH

TC

BC

ST

SBF

RA

HA

SP

HC

mAP

FPS

R50-FPN R101-FPN R50-FPN R50-FPN

88.34 89.98 89.64 89.46

77.07 80.65 85.00 82.12

51.63 52.09 52.26 54.78

69.62 68.36 77.34 70.86

77.45 68.36 73.01 78.93

77.15 60.32 73.14 83.00

87.11 72.41 86.82 88.20

90.75 90.85 90.74 90.90

84.90 87.94 79.02 87.50

83.14 86.86 86.81 84.68

52.95 65.02 59.55 63.97

63.75 66.68 70.91 67.69

74.45 66.25 72.94 74.94

74.45 68.24 70.86 68.84

59.24 65.21 57.32 52.28

73.76 72.61 75.02 75.87

12.8 13.2 15.8

89.05 87.78 80.80 89.11

78.46 77.68 66.37 82.84

44.15 49.54 45.41 48.37

65.16 66.46 65.58 71.11

75.09 78.52 72.08 78.11

72.88 73.12 72.35 78.39

86.04 86.59 79.83 87.25

90.83 90.87 90.90 90.83

81.41 83.75 75.69 84.90

82.53 84.35 83.06 85.64

56.24 53.14 49.06 60.36

61.01 65.63 59.47 62.60

56.49 63.70 63.76 65.26

67.67 68.71 59.42 69.13

52.92 45.91 33.19 57.94

69.80 71.72 66.46 74.12

13.5 16.1 15.3 14.3

66.80 69.94 70.37 73.09 63.90 79.12 58.23 69.85

67.00 77.69 77.90 78.08 66.80 73.65 78.92 76.85

76.76 62.04 63.84 68.84 62.00 59.80 76.38 72.52

79.74 77.49 80.68 84.72 67.30 72.94 86.04 82.46

90.84 90.87 90.90 90.90 90.90 90.39 90.91 90.83

79.54 82.88 82.41 83.30 85.30 86.45 81.62 80.31

78.45 82.07 82.19 83.76 82.40 87.24 84.62 82.96

57.71 59.97 56.27 57.19 62.30 65.02 56.99 62.35

62.27 65.30 65.57 62.98 65.40 65.55 63.24 64.63

69.05 53.56 56.82 59.50 65.70 67.09 64.19 64.86

73.14 64.04 65.06 67.21 68.60 70.72 71.90 66.76

60.11 46.61 44.18 48.34 64.60 58.44 47.39 53.98

71.44 69.49 69.70 71.30 70.80 73.30 71.71 72.28

18.9 20.3 20.5 19.8 23.8 23.7

81.94 83.81 84.22

83.97 85.54 86.91

58.35 66.56 58.06 65.85 67.92 69.73

Two-stage RoI Transformer [7] SCRDet [34] Gliding Vertex* [30] Oriented R-CNN [29] Refine-stage R3 Det† [32] R50-FPN Oriented Reppoints† [37] R50-FPN R50-FPN SASM† [15] R50-FPN S2 ANet [12] One-stage DAL [20] CSL† [31] GWD†- [33] KLD† [35] RSDet [21] RetinaNet-O*† [18] FCOS-O [26] ATSS-O [39]

R50-FPN R50-FPN R50-FPN R50-FPN R50-FPN R50-FPN R50-FPN R50-FPN

88.68 89.34 89.40 89.16 89.30 88.10 89.01 88.50

76.55 79.68 79.64 80.00 82.70 84.43 78.32 77.73

45.08 40.86 40.27 42.41 47.70 50.54 47.99 49.60

QuadDet QuadDet QuadDet*

R50-FPN 88.93 R101-FPN 89.52 R50-FPN 89.30

78.57 79.86 82.71

50.30 70.65 49.10 72.70 57.38 76.44

80.12 78.87 87.00 90.90 79.62 78.87 86.85 90.90 81.38 84.10 88.38 90.80

64.89 68.51 54.90 73.63 23.4 70.26 69.22 51.91 74.14 18.2 75.70 80.27 67.08 78.82 23.4

ATSS with the oriented bounding box, Gliding Vertex and RSDet representations obtains 72.28%, 72.45% and 72.82% mAP50 and 40.20%, 42.25% and 43.02% mAP75, respectively. In contrast, our method achieves the highest scores, 73.63% mAP50 and 44.48% mAP75, with only negligible parameter growth and a slight drop in speed, which demonstrates that our method is simple yet effective. The visualized results of the different representation methods are compared in Fig. 4; QuadDet achieves more precise regression than the oriented bounding box and RSDet representations.

The Performance of QuadDet When Using Different Detectors as Baselines: To verify the generalizability of QuadDet, we compare the results of QuadDet when using different detectors (RetinaNet, FCOS, and ATSS) as baselines. The results are reported in Table 2. For the one-stage detectors, QuadDet achieves an average improvement of more than 1% over the baselines with only a slight speed drop. For the two-stage detector Oriented RCNN, QuadDet also improves accuracy, but because the two-stage regression is already very precise, the gain is limited. These results show that our method is not only suitable for ATSS but also effective for common object detectors.


Fig. 5. The visualization results of QuadDet. Bounding boxes of different categories are rendered in different colors.

Table 4. Comparison with state-of-the-art methods on the DOTA-v2.0 dataset. All results marked with † are reported by MMRotate. Method

Backbone PL

BD

BR

GTF

SV

LV

SH

TC

BC

ST

SBF

RA

HA

SP

HC

CC

AI

HP

mAP

FPS

R50-FPN 71.61 R50-FPN 72.89

47.20 49.62

39.28 41.80

58.70 61.49

35.55 38.52

48.88 49.77

51.51 56.64

78.97 72.67

58.36 58.55 59.83 59.30

36.11 38.37

51.73 55.12

43.57 46.27

55.33 51.45

57.07 53.72

3.51 13.96

52.94 65.73

2.79 8.17

47.31 49.74

15.5 13.2

53.80 78.41 55.00 58.44 80.16 56.47 57.96 79.72 59.25

60.32 38.23 65.76 33.74 65.36 39.26

51.85 50.98 53.35

46.81 48.45 45.72

48.30 51.10 52.45

33.22 35.63 34.08

0.01 1.99 0.67

57.84 0.11 65.24 1.49 67.21 5.03

46.60 48.41 49.88

13.5 16.1 14.3

36.17 41.19 41.14 33.73 45.26

0.01 0.41 0.00 0.00 14.63

68.15 68.54 66.29 64.41 65.37

0.22 10.59 0.12 0.11 10.53

45.82 46.77 45.82 44.57 50.77

20.3 20.5 19.7 19.8 23.8

64.35

12.18 51.54 23.4

Two-stage Faster R-CNN-O [25] Gliding Vertex [30] Refine-stage R3 Det† [32] R50-FPN 71.72 50.71 39.95 62.54 42.73 47.33 Oriented Reppoints† [37] R50-FPN 73.43 46.23 43.58 61.70 46.94 50.05 R50-FPN 77.83 54.57 44.22 62.99 47.38 50.87 S2 ANet [12] One-stage GWD† [33] KLD† [35] KFIoU† [36] RetinaNet-O† [18] ATSS-O [26]

R50-FPN R50-FPN R50-FPN R50-FPN R50-FPN

72.64 73.50 71.64 71.10 72.90

51.84 47.22 50.87 51.37 48.03

36.22 38.58 37.55 36.46 43.01

59.88 60.96 60.60 57.63 60.05

40.03 39.57 41.43 39.83 46.25

40.25 43.46 38.85 37.03 51.67

46.92 47.24 45.65 45.44 58.29

77.49 76.31 74.24 77.11 79.40

56.31 56.60 53.88 53.66 57.74

58.95 57.72 58.29 57.85 64.41

37.83 38.96 38.19 36.13 41.68

53.90 52.64 54.31 52.30 52.79

39.70 40.74 38.85 38.29 46.79

48.28 47.61 52.82 49.85 55.02

QuadDet

R50-FPN 76.40

50.16

42.04

60.86

46.16

51.31

58.37

79.75

59.25

63.83

41.61

52.53

45.90

55.38 51.52 14.20

3.3 Comparison with State-of-the-Art Methods

DOTA-v1.0: We divide the advanced oriented object detection methods into three groups based on their architectures: one-stage, two-stage, and refine-stage detectors. Here, refine-stage means that there are cascade modules in the head of the model, which may bring a slight performance improvement at a large speed cost. The comparison between our method and the state of the art on DOTA-v1.0 is shown in Table 3. Without any bells and whistles, our method based on ResNet50-FPN achieves 73.63% mAP, which surpasses all one-stage detectors and is very close to the most advanced refine-stage detector. Using ResNet101-FPN, our method obtains 74.14% mAP. With multi-scale training and testing and rotation augmentation, our model reaches 78.82% mAP with ResNet50-FPN as the backbone, which is competitive on the DOTA-v1.0 dataset. We visualize some detection results in Fig. 5.


DOTA-v2.0: To verify the robustness of QuadDet, we also compare our model with other state-of-the-art methods on the more challenging DOTA-v2.0 dataset. The results are reported in Table 4. QuadDet achieves 51.54% mAP, which is the highest result among all one-stage and refine-stage detectors and leaves only a tiny gap to the advanced two-stage methods.

4 Conclusions

In this paper, we introduced QuadDet, a simple yet effective framework for object detection in aerial images. QuadDet uses the proposed Vertex Sorting Function to sort the four vertices, ensuring the validity of the predicted quadrangle without constraining the learning capacity of the model. The PolyIoU Loss Function then computes the IoU loss between the sorted quadrangle and the ground truth based on a convex-hull search, and the model is optimized accordingly. Because QuadDet avoids rigid box definitions that limit the capacity to learn, it achieves competitive results. The results on DOTA-v1.0 and DOTA-v2.0 verify the effectiveness of our method.
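To make the two geometric ingredients concrete, the sketch below shows one common way to order four vertices counter-clockwise around their centroid and to compute a polygon area with the surveyor's (shoelace) formula [1]. It illustrates the underlying geometry only and is not the paper's exact Vertex Sorting Function or PolyIoU implementation:

    import math

    def sort_vertices_ccw(pts):
        # Order quadrangle vertices counter-clockwise around their centroid.
        cx = sum(p[0] for p in pts) / len(pts)
        cy = sum(p[1] for p in pts) / len(pts)
        return sorted(pts, key=lambda p: math.atan2(p[1] - cy, p[0] - cx))

    def shoelace_area(pts):
        # Surveyor's formula for the area of a simple polygon.
        area = 0.0
        for (x1, y1), (x2, y2) in zip(pts, pts[1:] + pts[:1]):
            area += x1 * y2 - x2 * y1
        return abs(area) / 2.0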

References 1. Braden, B.: The surveyor’s area formula. Coll. Math. J. 17(4), 326–337 (1986) 2. Chen, W., Miao, S., Wang, G., Cheng, G.: Recalibrating features and regression for oriented object detection. Remote Sens. 15(8), 2134 (2023). https://doi.org/ 10.3390/rs15082134 3. Cheng, G., Li, Q., Wang, G., Xie, X., Min, L., Han, J.: SFRNet: fine-grained oriented object recognition via separate feature refinement. IEEE Trans. Geosci. Remote Sens. 61, 1–10 (2023). https://doi.org/10.1109/TGRS.2023.3277626 4. Cheng, G., et al.: Anchor-free oriented proposal generator for object detection. IEEE Trans. Geosci. Remote Sens. 60, 1–11 (2022). https://doi.org/10.1109/ TGRS.2022.3183022 5. Cheng, G., et al.: Dual-aligned oriented detector. IEEE Trans. Geosci. Remote Sens. 60, 1–11 (2022). https://doi.org/10.1109/TGRS.2022.3149780 6. Cheng, G., et al.: Towards large-scale small object detection: survey and benchmarks. IEEE Trans. Pattern Anal. Mach. Intell. (2023) 7. Ding, J., Xue, N., Long, Y., Xia, G.S., Lu, Q.: Learning RoI transformer for oriented object detection in aerial images. In: Proceedings of the IEEE International Conference on Computer Vision and Pattern Recognition, pp. 2849–2858 (2019) 8. Ding, J., et al.: Object detection in aerial images: a large-scale benchmark and challenges. IEEE Trans. Pattern Anal. Mach. Intell. 44(11), 7778–7796 (2022). https://doi.org/10.1109/TPAMI.2021.3117983 9. Girshick, R.: Fast R-CNN. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 1440–1448 (2015) 10. Girshick, R., Donahue, J., Darrell, T., Malik, J.: Rich feature hierarchies for accurate object detection and semantic segmentation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 580–587 (2014)

378

C. Rao et al.

11. Graham, R.L.: An efficient algorithm for determining the convex hull of a finite planar set. Inf. Process. Lett. 1, 132–133 (1972) 12. Han, J., Ding, J., Li, J., Xia, G.S.: Align deep features for oriented object detection. IEEE Trans. Geosci. Remote Sens. 1–11 (2021). https://doi.org/10.1109/TGRS. 2021.3062048 13. Han, J., Ding, J., Xue, N., Xia, G.S.: ReDet: a rotation-equivariant detector for aerial object detection. In: Proceedings of the IEEE International Conference on Computer Vision and Pattern Recognition, pp. 2786–2795 (2021) 14. He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 770–778 (2016) 15. Hou, L., Lu, K., Xue, J., Li, Y.: Shape-adaptive selection and measurement for oriented object detection. In: Proceedings of the AAAI Conference on Artificial Intelligence (2022) 16. Li, C., Cheng, G., Wang, G., Zhou, P., Han, J.: Instance-aware distillation for efficient object detection in remote sensing images. IEEE Trans. Geosci. Remote Sens. 61, 1–11 (2023) 17. Lin, T.Y., Doll´ ar, P., Girshick, R., He, K., Hariharan, B., Belongie, S.: Feature pyramid networks for object detection. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2117–2125 (2017) 18. Lin, T.Y., Goyal, P., Girshick, R., He, K., Doll´ ar, P.: Focal loss for dense object detection. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 2980–2988 (2017) 19. Ma, J., et al.: Arbitrary-oriented scene text detection via rotation proposals. IEEE Trans. Multimedia 20, 3111–3122 (2018) 20. Ming, Q., Zhou, Z., Miao, L., Zhang, H., Li, L.: Dynamic anchor learning for arbitrary-oriented object detection. In: Proceedings of the AAAI Conference on Artificial Intelligence, pp. 2355–2363 (2021) 21. Qian, W., Yang, X., Peng, S., Yan, J., Guo, Y.: Learning modulated loss for rotated object detection. In: Proceedings of the AAAI Conference on Artificial Intelligence, pp. 2458–2466 (2021) 22. Qian, X., Wu, B., Cheng, G., Yao, X., Wang, W., Han, J.: Building a bridge of bounding box regression between oriented and horizontal object detection in remote sensing images. IEEE Trans. Geosci. Remote Sens. 61, 1–9 (2023) 23. Rao, C., Wang, J., Cheng, G., Xie, X., Han, J.: Learning orientation-aware distances for oriented object detection. IEEE Trans. Geosci. Remote Sens. 1 (2023). https://doi.org/10.1109/TGRS.2023.3278933 24. Redmon, J., Divvala, S., Girshick, R., Farhadi, A.: You only look once: unified, real-time object detection. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 779–788 (2016) 25. Ren, S., He, K., Girshick, R., Sun, J.: Faster R-CNN: towards real-time object detection with region proposal networks. In: Proceedings of Conference on Advances in Neural Information Processing Systems, pp. 91–99 (2015) 26. Tian, Z., Shen, C., Chen, H., He, T.: FCOS: fully convolutional one-stage object detection. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 9627–9636 (2019) 27. Xia, G.S., et al.: DOTA: a large-scale dataset for object detection in aerial images. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 3974–3983 (2018) 28. Xie, X., Cheng, G., Li, Q., Miao, S., Li, K., Han, J.: Fewer is more: efficient object detection in large aerial images. Sci. China Inf. Sci. (2023)

A Shape-Based Quadrangle Detector for Aerial Images

379

29. Xie, X., Cheng, G., Wang, J., Yao, X., Han, J.: Oriented R-CNN for object detection. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 3520–3529 (2021) 30. Xu, Y., et al.: Gliding vertex on the horizontal bounding box for multi-oriented object detection. IEEE Trans. Pattern Anal. Mach. Intell. 43(4), 1452–1459 (2020) 31. Yang, X., Yan, J.: Arbitrary-oriented object detection with circular smooth label. In: Vedaldi, A., Bischof, H., Brox, T., Frahm, J.-M. (eds.) ECCV 2020. LNCS, vol. 12353, pp. 677–694. Springer, Cham (2020). https://doi.org/10.1007/978-3-03058598-3 40 32. Yang, X., Yan, J., Feng, Z., He, T.: R3Det: refined single-stage detector with feature refinement for rotating object. In: Proceedings of the AAAI Conference on Artificial Intelligence, pp. 3163–3171 (2021) 33. Yang, X., Yan, J., Ming, Q., Wang, W., Zhang, X., Tian, Q.: Rethinking rotated object detection with gaussian Wasserstein distance loss. In: Proceedings of IEEE International Conference on Machine Learning, pp. 11830–11841 (2021) 34. Yang, X., et al.: SCRDet: towards more robust detection for small, cluttered and rotated objects. In: Proceedings of the IEEE International Conference on Computer Vision and Pattern Recognition, pp. 8232–8241 (2019) 35. Yang, X., et al.: Learning high-precision bounding box for rotated object detection via kullback-leibler divergence. In: Proceedings of Conference on Advances in Neural Information Processing Systems, pp. 18381–18394 (2021) 36. Yang, X., et al.: The KFIoU loss for rotated object detection. In: Proceedings of International Conference on Learning Representations (2023) 37. Yang, Z., Liu, S., Hu, H., Wang, L., Lin, S.: Reppoints: point set representation for object detection. In: Proceedings of the IEEE International Conference on Computer Vision and Pattern Recognition (2022) 38. Yao, Y., et al.: On improving bounding box representations for oriented object detection. IEEE Trans. Geosci. Remote Sens. 1–11 (2022) 39. Zhang, S., Chi, C., Yao, Y., Lei, Z., Li, S.Z.: Bridging the gap between anchor-based and anchor-free detection via adaptive training sample selection. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 9759–9768 (2020) 40. Zhou, Y., et al.: MMRotate: a rotated object detection benchmark using pytorch. In: Proceedings of ACM International Conference on Multimedia (2022)

End-to-End Unsupervised Style and Resolution Transfer Adaptation Segmentation Model for Remote Sensing Images Zhengwei Li and Xili Wang(B) School of Computer Science, Shaanxi Normal University, Xi’an, China [email protected]

Abstract. In unsupervised domain adaptation for remote sensing images, the source and target domains differ in resolution in addition to feature differences. An end-to-end unsupervised domain adaptation segmentation model for remote sensing images is proposed to reduce the image style and resolution differences between the source and target domains. First, a generative adversarial style transfer network with a residual connection, a scale consistency module, and a perceptual loss with class-balance weights is proposed; it reduces the image style and resolution differences between the two domains while maintaining the original structural information during transfer. Second, the visual attention network (VAN), which considers both spatial and channel attention, is used as the feature extraction backbone to improve the feature extraction capability. Finally, the style transfer and segmentation tasks are unified in an end-to-end network. Experimental results show that the proposed model effectively alleviates the performance degradation caused by different features and resolutions, and the segmentation performance is significantly improved compared with advanced domain adaptation segmentation methods.

Keywords: Image segmentation · Remote sensing image · Unsupervised domain adaptation · Feature and resolution differences

1 Introduction

Deep neural networks trained on large amounts of labeled data have made significant progress in semantic segmentation, but pixel-level labeling is time-consuming and laborious [1]. Unsupervised domain adaptation leverages both labeled source domain data and unlabeled target domain data and aligns the two domains so that the segmentation network performs better on the target domain dataset. Most UDA (unsupervised domain adaptation) segmentation models perform poorly on remote sensing images, since remote sensing images exhibit significant feature and spatial resolution differences (which may come from different sensors, imaging regions, seasons, etc.). Recently, inspired by GANs (generative adversarial networks) [2], GAN-based style transfer methods have been proposed, and some works combine them with UDA to make the images of the two domains closer in appearance through style transfer. Toldo


et al. [3] directly add CycleGAN before the UDA segmentation network for inter-domain style transfer. Zhao et al. [4] add DualGAN with a residual connection before the UDA segmentation network to retain detailed information during style transfer. These methods directly attach a GAN-based generative network to transfer the source domain images into target-domain-style images. They do not use the segmentation results to constrain the generative network, so the generated images are incomplete in semantic and detailed information. To address this problem, methods that use the segmentation results to constrain the GAN-based style-generated images have been proposed. Li et al. [5], Cheng et al. [6], and Yang et al. [7] constrain the generative network via a perceptual loss calculated from the semantic segmentation results of the source and target domain images before and after style transfer. These methods retain more semantic and detailed information during style transfer and confirm that the perceptual loss effectively constrains the generative network. However, the perceptual loss does not pay enough attention to the classes with fewer samples, which makes the transfer results for those classes poor; such an uneven number of samples across classes is common in remote sensing images. In addition, the above methods do not take into account the spatial resolution difference of images coming from different sensors. The resolution difference leads to a significant discrepancy between the structure of the generated images and that of the original images, causing a severe loss of semantic and detailed information in the generated images and further degrading subsequent segmentation performance.

To this end, this paper proposes an end-to-end unsupervised domain adaptation model, EUSRTA (end-to-end unsupervised style and resolution transfer adaptation model), for remote sensing image segmentation. The model reduces the style and resolution differences between the two domains, constrains the style transfer using the segmentation results, fuses multilayer features based on spatial and channel attention, and unifies the style transfer, resolution transfer, and segmentation tasks in an end-to-end model. In summary, our contributions are as follows: 1) We add residual connection and scale consistency modules to the generative adversarial network to retain the semantic and detailed information of remote sensing images. 2) To increase the contribution of less-sampled classes to style transfer, we propose a perceptual loss with class-balanced weights to constrain the style transfer network. 3) The visual attention network [8] is used as the backbone of the segmentation network to capture more detailed and semantic features in space and channels and improve the feature extraction capability of the network. The model is evaluated on the Potsdam_Vaihingen and Tibetan Plateau 2022.2_2020.5 remote sensing image semantic segmentation datasets and compared with other unsupervised domain adaptation methods. The results show that the proposed model outperforms the comparison methods on both datasets.

2 EUSRTA Model

The model consists of three parts (shown in Fig. 1): a style transfer network, a segmentation network, and an output-level discrimination network. X_S and X_T denote the source and target domain images. X_ST denotes a source domain image transferred to the target domain style, and X_TS the converse. X_SS′ denotes the image reconstructed back to the source domain style


from X_ST, and X_TT′ is defined similarly. RG_S→T and RG_T→S are the generative networks in the style transfer network, equipped with residual connections and scale consistency modules; D_S and D_T are the discrimination networks in the style transfer network. VAN is the backbone of the segmentation network, and ASPP is the atrous spatial pyramid pooling. D_F is the output-level discrimination network. P_S, P_T, P_ST and P_TS are the segmentation predictions for X_S, X_T, X_ST and X_TS.

Fig. 1. The structure of EUSRTA

2.1 Style Transfer Network

Remote sensing images contain more information and categories, their features are richer, and their categories are unevenly distributed, so unconstrained image generation cannot retain semantic and detailed information well. In addition, the resolution often differs between the two domains in remote sensing images, which affects the segmentation performance of the UDA model. Traditional style transfer methods generally ignore the resolution difference, while eliminating the resolution discrepancy benefits the UDA segmentation model. Therefore, we propose modules based on DualGAN [9] to make the appearance and resolution of the two domains similar. The style transfer network adds a residual connection after the generation network, which substantially preserves the semantic and detailed information of the image. To address the scale differences, a scale consistency module is added after the generated images to convert source-domain-scale images to target-domain-scale images by bilinear interpolation, which alleviates the difficulty of domain adaptation caused by the scale differences between the source and target domains and further improves the quality of the generated images. The style transfer process can be expressed as:

X_ST = RG_S→T(X_S) = Resize_S→T(G_S→T(X_S) + X_S)    (1)
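A minimal PyTorch sketch of Eq. (1), a residual connection around the generator followed by a bilinear resize to the target-domain scale, is shown below. The generator G_s2t and the target size are placeholders, not the authors' exact network:

    import torch.nn.functional as F

    def rg_s2t(x_s, G_s2t, target_size):
        # Residual generator with scale-consistency resize, as in Eq. (1).
        generated = G_s2t(x_s) + x_s   # residual connection keeps structural information
        return F.interpolate(generated, size=target_size,
                             mode="bilinear", align_corners=False)

For example, with the crop sizes used later in the experiments, rg_s2t could map a 896 × 896 Potsdam patch to the 512 × 512 Vaihingen scale.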


2.2 Segmentation Network

The segmentation network is based on deeplabV3+ [10], but its ResNet backbone is replaced by VAN. VAN imposes spatial and channel attention on features by decomposing a large-kernel convolution, which effectively improves the feature extraction ability of the segmentation network. VAN consists of four similar hierarchical stages; each stage is characterized by the down-sampling ratio R of the image, the number of output channels C, and the number of repetitions L of the module. The structure of one stage is shown in Fig. 2. There are seven versions of VAN depending on the number of network parameters. In the experiments, deeplabV3+ with a ResNet101 backbone (44.7M parameters) is selected as the comparison method; for a fair comparison, the B3 version of VAN, whose parameter count is of the same order of magnitude as ResNet101, is chosen.
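For intuition, the sketch below follows the large-kernel-attention decomposition described in the VAN paper [8]: a depth-wise convolution, a depth-wise dilated convolution and a point-wise convolution, whose output re-weights the input. It is an illustrative approximation, not the exact B3 configuration used here:

    import torch.nn as nn

    class LargeKernelAttention(nn.Module):
        # Decomposed large-kernel attention in the spirit of VAN.
        def __init__(self, channels):
            super().__init__()
            self.dw = nn.Conv2d(channels, channels, 5, padding=2, groups=channels)
            self.dw_dilated = nn.Conv2d(channels, channels, 7, padding=9,
                                        dilation=3, groups=channels)
            self.pw = nn.Conv2d(channels, channels, 1)

        def forward(self, x):
            attn = self.pw(self.dw_dilated(self.dw(x)))
            return attn * x   # attention map re-weights spatial and channel responses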

Fig. 2. The structure of VAN

2.3 Output-Level Discrimination Network

The output-level discrimination network aims to distinguish which domain a segmentation result comes from, thus forcing the internal representation of the model to be aligned between domains. The discrimination network consists of five convolution layers with a kernel size of 4 × 4, a stride of 2, and a padding of 1. Except for the last layer, each convolution layer is followed by a LeakyReLU activation with a negative slope of 0.2. The numbers of channels in the layers are 64, 128, 256, 512, and 1.
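Under this description, the discriminator can be written down directly; the following PyTorch sketch (assuming the input is the class-probability segmentation output, one channel per class) mirrors the stated layer configuration:

    import torch.nn as nn

    def build_output_discriminator(in_channels):
        # Five 4x4 convolutions, stride 2, padding 1; LeakyReLU(0.2) after all but the last.
        dims = [64, 128, 256, 512, 1]
        layers, prev = [], in_channels
        for i, dim in enumerate(dims):
            layers.append(nn.Conv2d(prev, dim, kernel_size=4, stride=2, padding=1))
            if i < len(dims) - 1:
                layers.append(nn.LeakyReLU(0.2, inplace=True))
            prev = dim
        return nn.Sequential(*layers)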


2.4 Loss Function

The overall loss function of EUSRTA contains five terms: the style transfer loss L_GAN, the segmentation loss L_seg, and the discrimination losses L_DS, L_DT and L_DF produced by the discrimination networks D_S, D_T and D_F, respectively. The overall loss function is:

L(X_S, X_T, X_ST) = L_GAN(X_S, X_T) + L_seg(X_ST, X_T) + L_DS(X_S) + L_DT(X_S) + L_DF(P_T, P_ST)    (2)

The style transfer network is designed to transfer the image styles of the two domains to each other while retaining the structure of each image before and after the transfer. L_GAN consists of three parts: L^S_cyc and L^T_cyc are the cycle-consistency losses; L^S_adv and L^T_adv are the adversarial losses of the two discrimination networks in the style transfer network, which constrain the style transfer networks to generate high-quality images; and L^S_per and L^T_per are the perceptual losses that keep the segmentation results of an image unchanged before and after style transfer, where the segmentation results are used to calculate the perceptual loss. L_GAN is expressed as:

L_GAN(X_S, X_T) = λ_cyc (L^S_cyc(X_S, X_SS′) + L^T_cyc(X_T, X_TT′)) + λ_adv (L^S_adv(X_T) + L^T_adv(X_S)) + λ_per (L^S_per(X_S, X_ST) + L^T_per(X_T, X_TS))    (3)

λ_cyc, λ_adv and λ_per are weights. L^S_cyc and L^T_cyc aim to ensure that X_S is identical to X_SS′ and that X_T is identical to X_TT′. They are calculated as:

L^S_cyc(X_S) = ||X_SS′ − X_S||_1 = ||RG_T→S(RG_S→T(X_S)) − X_S||_1    (4)

L^T_cyc(X_T) = ||X_TT′ − X_T||_1 = ||RG_S→T(RG_T→S(X_T)) − X_T||_1    (5)

where ||·||_1 is the ℓ1 norm. D_S and D_T distinguish whether an image is real. The adversarial losses are calculated as:

L^S_adv(X_T) = −D_S(RG_T→S(X_T))    (6)

L^T_adv(X_S) = −D_T(RG_S→T(X_S))    (7)

To avoid gradient vanishing, the Wasserstein GAN loss [11] is used to calculate the discrimination loss. The perceptual losses L^S_per and L^T_per make the style transfer network retain semantic and detailed information better and can be calculated from P_S, P_T, P_ST and P_TS. However, the perceptual loss is dominated by the classes with more samples, while the classes with fewer samples have little effect on it. The source domain has labels, so its class weights can be calculated directly, but the labels of the target domain are unknown. We therefore segment the target domain images with the segmentation network pre-trained on the source domain


and obtain pseudo-labels, from which the weights are calculated. This newly defined perceptual loss balances the roles played by all classes:

L^S_per(X_S) = (1/HW) Σ_{i=1}^{c} (λ_max/λ_i) ||P_S,i − P_ST,i||²_2    (8)

L^T_per(X_T) = (1/HW) Σ_{i=1}^{c} (λ_max/λ_i) ||P_T,i − P_TS,i||²_2    (9)

where ||·||²_2 is the squared ℓ2 norm, λ_i is the number of samples of class i, and λ_max is the number of samples of the class with the most samples.

L_seg consists of the segmentation loss of the source domain image, L^S_seg, and the adversarial loss L^F_adv that adapts the segmentation network to the changes in the target domain features:

L_seg(X_ST, X_T) = L^S_seg(X_ST) + λ_Fadv L^F_adv(X_T) = −Y_S log(P_ST) − λ_Fadv log(D_F(P_T))    (10)

λ_Fadv is the loss weight. The segmentation loss of the source domain image is calculated with the cross-entropy loss function, while the adversarial term for the target domain is computed from the predicted result of the target domain image and the updated discriminator. The discrimination losses L_DS and L_DT are calculated as:

L_DS(X_T) = D_S(X_S) − D_S(X_TS)    (11)

L_DT(X_S) = D_T(X_T) − D_T(X_ST)    (12)

where D_S treats X_S as a positive sample and X_TS as a negative sample, and D_T treats X_T as a positive sample and X_ST as a negative sample. L_DF is calculated as:

L_DF(P_T, P_ST) = log(D_F(P_ST)) − log(D_F(P_T))    (13)

D_F treats P_ST as a positive sample and P_T as a negative sample. Each of the three discrimination networks is updated with its own discrimination loss.
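As a concrete reading of Eqs. (8)–(9), the sketch below computes the class-balanced perceptual term from two segmentation outputs and per-class pixel counts. It is a simplified interpretation, not the authors' code; probs_a and probs_b stand for P_S and P_ST (or P_T and P_TS), and class_pixel_counts is a length-c tensor of per-class sample counts:

    import torch

    def class_balanced_perceptual_loss(probs_a, probs_b, class_pixel_counts):
        # Eq. (8)/(9): per-class squared L2 difference weighted by lambda_max / lambda_i.
        n, c, h, w = probs_a.shape
        weights = class_pixel_counts.max() / class_pixel_counts.clamp(min=1)
        loss = 0.0
        for i in range(c):
            diff = probs_a[:, i] - probs_b[:, i]
            loss = loss + weights[i] * (diff ** 2).sum()
        return loss / (h * w)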

3 Experiment Results

3.1 Dataset

EUSRTA is evaluated on two sets of remote sensing image semantic segmentation datasets: Potsdam_Vaihingen [12] and Tibetan Plateau 2022.2_2020.5. Potsdam and Vaihingen have spatial resolutions of 5 cm and 9 cm, respectively; thus, Potsdam is cropped into 896 × 896 pixel patches and Vaihingen into 512 × 512 pixel patches. The two Tibetan Plateau 2022.2_2020.5 images have the same resolution and are both cropped into 512 × 512 pixel patches. The training set contains all source domain images and target domain images, and the validation and test sets occupy 30% and 70% of the target domain, respectively. The class distributions of the two dataset pairs are shown in Tables 1 and 2; the classes are clearly unevenly distributed.

Table 1. Class distribution of labels in the Potsdam and Vaihingen datasets

Class       Clu    Imp     Car    Tree    Low veg   Building
Potsdam     4.7%   31.4%   1.7%   15.3%   22.0%     24.8%
Vaihingen   0.8%   22.7%   1.2%   23.1%   21.3%     25.9%

Table 2. Class distribution of labels in the Tibetan Plateau 2022.2 and 2020.5 datasets

Class    road    farmland   snow    construction   building   greenbelt   mountainous   water
2022.2   4.33%   3.62%      7.43%   11.98%         5.47%      2.54%       62.57%        2.05%
2020.5   4.33%   3.33%      1.46%   12.30%         5.39%      3.93%       66.32%        2.93%

3.2 Implementation Details

To train the style transfer network, the generative network is optimized using Adam with an initial learning rate of 5 × 10^-4, and the discrimination network is optimized using RMSProp with an initial learning rate of 5 × 10^-4. The segmentation network is optimized using SGD with an initial learning rate of 2 × 10^-4 and a weight decay of 2 × 10^-4. The output-level discrimination network is optimized using Adam with an initial learning rate of 2 × 10^-4 and a weight decay of 2 × 10^-4. For the hyper-parameters, λ_cyc is set to 10, λ_adv to 5, λ_per to 0.1, and λ_Fadv to 0.02. Two images are read randomly for each training batch, and the number of iterations is set to 100.
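Collected in one place, these settings amount to a configuration of roughly the following shape (a plain reference summary; the key names are chosen here for illustration and are not part of the original code):

    train_config = {
        "style_generator":       {"optimizer": "Adam",    "lr": 5e-4},
        "style_discriminators":  {"optimizer": "RMSprop", "lr": 5e-4},
        "segmentation_net":      {"optimizer": "SGD",     "lr": 2e-4, "weight_decay": 2e-4},
        "output_discriminator":  {"optimizer": "Adam",    "lr": 2e-4, "weight_decay": 2e-4},
        "loss_weights": {"lambda_cyc": 10, "lambda_adv": 5,
                         "lambda_per": 0.1, "lambda_Fadv": 0.02},
        "batch_size": 2,
        "iterations": 100,
    }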

3.3 State-of-the-Art Comparison

EUSRTA is compared with AdaptSegNet [13], SEANet [14], SAA [15], SDSCA-Net, MemoryAdaptNet [16], CCDA_LGFA [17], and ResiDualGAN [4]. The evaluation metrics are IoU, mIoU and F1-score. The dataset partition and parameter settings of the comparison methods are the same as those of EUSRTA.

As seen in Table 3, compared with the other UDA methods on the P_V dataset, the proposed EUSRTA achieves the highest mIoU for clutter, impervious surfaces, trees, low vegetation, buildings, and overall. Benefiting from the improved style transfer network and VAN, the mIoU and F1-score of EUSRTA are improved by 19.35% and 19.67% compared with the traditional adversarial-discrimination-based UDA segmentation network AdaptSegNet. In addition, both the mIoU and F1-score of EUSRTA are higher than those of the other UDA methods; only the results for car are lower than those of ResiDualGAN. ResNet-based ResiDualGAN down-samples the input images by 16 times in the encoding phase of the segmentation network to extract features, while VAN-based EUSRTA down-samples the input images by 32 times, which may result in an inferior feature representation of EUSRTA for smaller objects (i.e., cars). However, ResiDualGAN is not an end-to-end model, so its style transfer process cannot be constrained by the segmentation result and the parameters of its style transfer network cannot be adjusted according to each iterated segmentation result; the semantic and detailed information retained in its generated images is therefore incomplete. In summary, EUSRTA shows better performance than the other advanced methods. The segmentation comparison results are given in Fig. 3.

Table 3. Comparison of domain adaptation indices of different methods on P_V

Method           Clu (IoU/F1)   Imp (IoU/F1)   Car (IoU/F1)   Tree (IoU/F1)   Low veg (IoU/F1)   Building (IoU/F1)   mIoU    F1
AdaptSegNet      9.01/16.53     54.76/70.77    25.17/40.21    55.52/71.4      14.88/25.90        56.48/72.19         35.97   49.50
SEANet           16.91/28.93    49.69/66.39    34.02/50.77    55.28/71.20     16.90/28.91        66.13/79.61         39.82   54.30
SAA              14.83/25.83    58.38/73.72    38.43/55.53    58.37/73.72     29.03/44.99        59.99/74.99         43.17   58.13
SDSCA-Net        17.21/29.36    59.77/74.82    34.53/51.33    60.11/75.09     38.77/55.87        66.93/80.19         46.22   61.11
MemoryAdaptNet   21.26/35.07    65.97/79.49    41.66/58.51    53.55/69.75     44.43/61.52        68.99/81.65         49.31   64.38
CCDA_LGFA        17.83/30.26    67.55/80.63    39.72/56.85    58.56/73.86     43.30/60.53        76.00/86.37         50.51   64.75
ResiDualGAN      15.20/26.38    70.86/82.94    46.99/63.94    53.05/69.33     47.50/64.41        78.26/87.81         51.98   65.80
EUSRTA           23.75/38.39    72.27/83.90    41.88/59.03    65.09/78.85     49.76/66.45        79.17/88.37         55.32   69.17

Fig. 3. Adaptation experiment results of each model domain from P_V. (a) target images; (b) ground truth; (c) AdaptSegNet; (d) SEANet; (e) SAA; (f) SDSCA-Net; (g) MemoryAdaptNet; (h) CCDA_LGFA; (i) ResiDualGAN; (j) EUSRTA

Table 4 shows the results of V_P adaptation; the model proposed in this paper still achieves the best overall mIoU and F1-score. The resolution of the Vaihingen dataset is lower than that of the Potsdam dataset, and images with higher resolution contain more detailed features, so compared with Table 3 the evaluation metrics decrease, but the overall mIoU and F1-score remain better than those of the other models. The segmentation comparison results are given in Fig. 4. Tables 5 and 6 show the 22_20 and 20_22 domain adaptation results. Since SAA and SDSCA-Net can only be applied to datasets with different resolutions, they are not included in these comparisons. Compared with the other methods, EUSRTA still obtains the best overall mIoU and F1-score.

Table 4. Comparison of domain adaptation indices of different methods on V_P

Method           Clu (IoU/F1)   Imp (IoU/F1)   Car (IoU/F1)   Tree (IoU/F1)   Low veg (IoU/F1)   Building (IoU/F1)   mIoU    F1
AdaptSegNet      13.63/23.99    49.94/66.62    4.17/8.00      40.47/57.63     38.33/55.42        49.80/66.49         32.73   46.36
SEANet           11.77/21.07    52.16/68.56    35.48/52.38    8.42/15.53      21.89/35.92        38.15/55.23         27.98   41.45
SAA              17.72/30.10    52.07/68.49    43.89/61.01    33.72/50.44     43.44/60.57        50.93/67.49         40.30   56.35
SDSCA-Net        16.25/27.95    54.16/70.27    48.42/65.25    42.56/59.70     47.45/64.36        53.45/69.67         43.71   59.53
MemoryAdaptNet   7.28/13.57     65.29/79.00    55.90/71.72    42.34/59.50     38.01/55.09        72.30/83.93         46.86   60.47
CCDA_LGFA        13.67/24.05    68.36/81.20    48.72/65.52    26.33/41.68     42.55/59.70        76.89/86.93         46.08   59.85
ResiDualGAN      15.75/27.21    63.15/77.41    62.47/76.89    40.85/58.01     45.07/62.13        69.05/81.69         49.39   63.89
EUSRTA           17.83/30.27    65.36/79.05    51.48/67.97    56.22/71.97     53.11/69.37        73.04/84.42         52.84   67.18

Fig. 4. Adaptation experiment results of each model domain from Vaihingen IR-R-G to Potsdam IR-R-G. (a) target images; (b) ground truth; (c) AdaptSegNet; (d) SEANet; (e) SAA; (f) SDSCANet; (g) MemoryAdaptNet; (h) CCDA_LGFA; (i) ResiDualGAN; (j) EUSRTA

3.4 Ablation Study

To demonstrate the effectiveness of EUSRTA, we present an ablation analysis of VAN, the residual connection, the scale consistency module, the perceptual loss, and the class balance weights. The settings of the ablation experiments are shown in Table 7 and the results in Table 8. Comparing Experiments 1 and 2, replacing the backbone from ResNet101 with VAN improves the mIoU and F1-score by 5.5% and 6.47%, which indicates that the attention mechanism helps select the features shared by the two domains and reduces the influence of irrelevant features. Comparing Experiments 2 and 3, Experiment 3 uses DualGAN as the base style transfer network; because remote sensing images contain more information and the two datasets have different scales, the generative adversarial network retains semantic and detailed information less well and the quality of the generated images is poor, so the mIoU decreases by 4.66% and the F1-score by 5.72%. Comparing Experiments 3 and 4, a scale consistency module is added to the generative network of DualGAN; because convolutional neural networks are sensitive to image scale, aligning the scale difference between the two domains effectively improves segmentation accuracy. The mIoU

10.27

23.96

28.51

24.13

24.22

SEANet

CCDA_LGFA

MemoryAdaptNet

ResiDualGAN

EUSRTA

39.00

38.88

44.37

38.66

18.63

40.56

36.35

40.79

26.22

9.75

31.12

57.72

53.32

57.95

41.55

17.77

47.47

F1

IoU

23.79

F1

IoU

13.50

farmland

Road

AdaptSegNet

Method

68.85

72.84

62.23

75.39

56.52

46.59

IoU

snow

81.55

84.28

76.72

86.62

72.22

63.56

F1

31.16

32.10

34.60

35.86

35.64

23.75

IoU

47.52

48.59

51.41

52.78

52.55

38.39

F1

construction

47.24

47.74

49.96

45.19

41.76

38.76

IoU

64.16

64.66

66.63

62.25

58.92

55.86

F1

building

20.22

2.08

14.75

24.84

0.14

0.89

IoU

33.64

4.09

25.70

39.80

0.29

1.77

F1

greenbelt

83.01

83.12

82.65

82.14

82.61

77.81

IoU

90.72

90.78

90.50

90.19

90.48

87.52

F1

mountainous

Table 5. Comparison of domain adaptation indices of different methods in 22_20

70.31

57.93

61.08

58.35

59.26

60.91

IoU

water

82.57

73.36

75.83

73.70

74.42

75.71

F1

48.20

44.54

46.82

46.62

36.99

36.67

mIoU

Overall

62.11

57.25

61.14

60.69

48.16

49.26

F1


16.21

29.96

18.66

MemoryAdaptNet

ResiDualGAN

EUSRTA

20.08

2.90

19.86

31.45

46.11

27.90

33.45

5.63

33.13

28.50

37.66

30.74

24.25

5.76

23.71

IoU

44.36

54.72

47.03

39.03

10.90

38.33

F1

farmland

IoU

F1

Road

CCDA_LGFA

SEANet

AdaptSegNet

Method

46.31

19.12

56.94

19.38

61.45

56.22

IoU

Snow

63.31

32.10

72.56

32.47

76.12

71.98

F1

36.07

35.80

35.76

25.71

26.05

30.96

IoU

53.02

52.73

52.68

40.91

41.33

47.29

F1

construction

48.62

55.08

44.11

48.76

30.32

45.13

IoU

65.43

71.03

61.21

65.55

46.53

62.19

F1

building

6.29

4.87

14.88

3.41

16.14

8.47

IoU

11.84

9.29

25.92

6.60

27.78

15.62

F1

greenbelt

76.66

77.12

69.26

77.24

69.37

64.69

IoU

86.78

87.08

81.84

87.16

81.92

78.56

F1

mountainous

Table 6. Comparison of domain adaptation indices of different methods in 22_20 water

59.95

54.71

25.72

70.92

13.09

12.72

IoU

74.96

70.72

40.91

82.99

23.15

22.57

F1

Overall

40.14

39.29

37.70

36.22

28.14

32.72

mIoU

53.89

52.97

51.26

48.52

39.17

46.21

F1



Table 7. Setup of ablation experiments from Potsdam to Vaihingen

Number   ResNet101   VAN   DualGAN   resize module   residual connection   perceptual loss   class balance weight
1        √           ×     ×         ×               ×                     ×                 ×
2        ×           √     ×         ×               ×                     ×                 ×
3        ×           √     √         ×               ×                     ×                 ×
4        ×           √     √         √               ×                     ×                 ×
5        ×           √     √         √               √                     ×                 ×
6        ×           √     √         √               √                     √                 ×
7        ×           √     √         √               √                     √                 √

Table 8. Results of ablation experiments from Potsdam to Vaihingen

Number   Clu (IoU/F1)   Imp (IoU/F1)   Car (IoU/F1)   Tree (IoU/F1)   Low veg (IoU/F1)   Building (IoU/F1)   mIoU    F1
1        6.34/11.93     48.95/65.73    17.97/30.47    56.21/71.96     21.62/35.56        56.04/71.83         34.52   47.91
2        9.58/17.49     51.92/68.35    27.48/43.55    59.35/74.49     30.35/46.57        61.07/75.83         40.02   54.38
3        2.45/4.79      44.03/61.14    33.90/50.64    41.04/58.20     19.78/33.02        70.94/82.29         35.36   48.46
4        13.79/24.24    61.12/75.87    37.83/54.90    55.51/71.39     39.76/56.89        66.31/79.74         45.72   60.51
5        10.97/19.78    65.83/79.40    45.20/62.26    53.21/69.46     44.50/61.59        76.52/86.70         49.37   63.20
6        15.10/26.24    72.34/83.95    38.83/55.94    65.55/79.19     48.72/65.52        80.85/89.41         53.57   66.71
7        23.75/38.39    72.27/83.90    41.88/59.03    65.09/78.85     49.76/66.45        79.17/88.37         55.32   69.17

improves by 10.36% and the F1-score by 12.05%. Comparing Experiments 4 and 5, Experiment 5 adds the residual connection to retain semantic and detailed information in the generative network; the mIoU improves by 3.65% and the F1-score by 2.69%. Comparing Experiments 5 and 6, Experiment 6 adds the perceptual loss to Experiment 5; although the overall mIoU improves by 4.20% and the F1-score by 3.51%, the accuracy on the car class decreases, because the perceptual loss is biased toward correcting the categories with more samples and cannot correct the car class well. Comparing Experiments 6 and 7, Experiment 7 adds the class balance weights to increase the contribution of less-sampled categories to the perceptual loss. The evaluation metrics of more-sampled categories such as impervious surfaces, trees, and buildings decrease slightly, but those of less-sampled categories such as background, cars, and low vegetation increase significantly, and the overall mIoU improves by 1.75% and the F1-score by 2.46%. The segmentation results are given in Fig. 5.


Fig. 5. Results of cross-domain semantic segmentation tasks from Potsdam to Vaihingen in ablation experiments. (a) The original image; (b) ground truth; (c) deeplabV3+ with backbone network of ResNet101; (d) deeplabV3+ with backbone network of VAN; (e) Add DualGAN to (d); (f) Add scale consistency modules to (e); (g) Add residual connection to (f); (h) Add perceptual loss to (g); (i) Add class balance weight to (h);

4 Conclusion

This paper proposes an end-to-end unsupervised style and resolution transfer adaptation segmentation model for remote sensing images. First, residual connections, a scale consistency module, and a perceptual loss with class balance weights are added to the generative adversarial style transfer network to improve its ability to retain semantic and detailed information. Second, VAN is used as the backbone network to bring attention to feature extraction. The experimental results show that EUSRTA copes better with the transfer challenges caused by the different feature distributions and resolutions of the source and target domain images, and demonstrate the superiority of the proposed method over other state-of-the-art methods.

Acknowledgements. This work was supported by the Second Tibetan Plateau Scientific Expedition and Research (2019QZKK0405).

References 1. Liang, M., Wang, X.L.: Domain adaptation and super-resolution based bi-directional semantic segmentation method for remote sensing images. In: International Geoscience and Remote Sensing Symposium (IGARSS), pp. 3500–3503. IEEE (2022) 2. Ian, G., Jean, P., Mehdi, M.: Generative adversarial nets. In: Neural Information Processing Systems (NIPS), pp. 2672–2680. MIT Press (2014) 3. Toldo, M., Michieli, U., Agresti, G.: Unsupervised domain adaptation for mobile semantic segmentation based on cycle consistency and feature alignment. Image Vis. Comput. 95, 103889 (2020) 4. Zhao, Y., Gao, H., Guo, P.: ResiDualGAN: Resize-Residual DualGAN for cross-domain remote sensing images semantic segmentation. Remote Sens. 15(5), 1428 (2023) 5. Li, Y., Yuan, L., Vasconcelos, N.: Bidirectional learning for domain adaptation of semantic segmentation. In: International Conference on Computer Vision and Pattern Recognition (CVPR), pp. 6936–6945. IEEE (2019)


6. Cheng, Y., Wei, F., Bao, J.: Dual path learning for domain adaptation of semantic segmentation. In: International Conference on Computer Vision (CVPR), pp. 9082–9091. IEEE (2021) 7. Yang, J., An, W., Wang, S., Zhu, X., Yan, C., Huang, J.: Label-driven reconstruction for domain adaptation in semantic segmentation. In: Vedaldi, A., Bischof, H., Brox, T., Frahm, J.-M. (eds.) ECCV 2020. LNCS, vol. 12372, pp. 480–498. Springer, Cham (2020). https:// doi.org/10.1007/978-3-030-58583-9_29 8. Guo, M.H., Lu, C.Z., Liu, Z.M.: Visual attention network. arXiv preprint arXiv:2202.09741 (2022) 9. Yi, Z., Zhang, H., Tai, P.: DualGAN: unsupervised dual learning for image-to-image translation. In: International Conference on Computer Vision (ICCV), pp.2849–2857. IEEE (2017) 10. Chen, L.-C., Zhu, Y., Papandreou, G., Schroff, F., Adam, H.: Encoder-decoder with atrous separable convolution for semantic image segmentation. In: Ferrari, V., Hebert, M., Sminchisescu, C., Weiss, Y. (eds.) ECCV 2018. LNCS, vol. 11211, pp. 833–851. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-01234-2_49 11. Arjovsky, M., Chintala, S., Bottou, L.: Wasserstein generative adversarial networks. In: International Conference on Machine Learning (ICML), pp. 214–223. ACM (2017) 12. Gerke, M.: Use of the stair vision library within the ISPRS 2D semantic labeling benchmark (Vaihingen) (2015). https://doi.org/10.13140/2.1.5015.9683 13. Tsai, Y.H., Hung, W.C., Samuel, S.: Learning to adapt structured output space for semantic segmentation. In: International Conference on Computer Vision and Pattern Recognition (CVPR), pp. 7472–7481. IEEE (2018) 14. Xu, Y., Du, B., Zhang, L.: Self-ensembling attention networks: addressing domain shift for semantic segmentation. In: the AAAI Conference on Artificial Intelligence (AAAI), pp. 5581– 5588. AAAI (2019) 15. Deng, X.Q., Zhu, Y., Tian, Y.X.: Scale aware adaptation for land-cover classification in remote sensing imagery. In: Winter Conference on Applications of Computer Vision (WACV), pp. 2160–2169. IEEE (2021) 16. Zhu, J., Guo, Y., Sun, J., Yang, L., Deng, M., Chen, J.: Unsupervised domain adaptation semantic segmentation of high-resolution remote sensing imagery with invariant domain-level prototype memory. IEEE Trans. Geosci. Remote Sens. 61, 1–18 (2023) 17. Zhang, B., Chen, T., Wang, B.: Curriculum-style local-to-global adaptation for cross-domain remote sensing image segmentation. IEEE Trans. Geosci. Remote Sens. 60, 1–12 (2021)

A Physically Feasible Counter-Attack Method for Remote Sensing Imaging Point Clouds Bo Wei1 , Huanchun Wei2(B) , Cong Cao3 , Teng Huang3(B) , Huagang Xiong1 , Aobo Lang4 , Xiqiu Zhang4 , and Haiqing Zhang4(B) 1

2

School of Electronic and Information Engineering, Beihang University, Beijing, China College of Beidou, Guangxi University of Information Engineering Guangxi, Nanning, China [email protected] 3 Institute of Artificial Intelligence and Blockchain, Guangzhou University, Guangzhou, China [email protected] 4 School of Arts, Sun Yat-sen University, Guangzhou, China [email protected]

Abstract. This research introduces an innovative approach that targets the vulnerability of deep-learning-based point cloud target recognition systems to adversarial sample attacks. Instead of directly tampering with the spatial location information of the point cloud data, the approach modifies specific attribute information, and the modified point cloud data is then associated with a simulated physical environment for comprehensive attack testing. The experimental results show that the adversarial object samples generated by modulating the signal amplitude of the point cloud data are highly aggressive and effectively mislead deep-learning-based target recognition algorithms. The approach also ensures good stealthiness and practicality, making it a viable attack method in physical scenarios.

Keywords: Point Cloud Attacks · Target Recognition · Counter-sample

Supported by the National Natural Science Foundation of China under Grant 62002074.

1 Introduction

Point cloud data refers to a set of 3D point information that represents the surface of an object. It differs from traditional 3D data by having minimal


redundancy and being less influenced by lighting conditions and image quality. This unique characteristic has led to its wide application in various domains, including 3D matching [1,2], target recognition [3–5], and semantic segmentation [6,7]. However, recent investigations have exposed the susceptibility of deep-learning-based point cloud target recognition methods to adversarial attacks, in which adversaries generate apparently innocuous point cloud samples designed to deceive deep learning models. Some existing attack methods perturb the point cloud data directly, for example by modifying the coordinates of some points [8] or adding interfering points [9,10]. However, these methods are mainly limited to attack tests in the digital world and are difficult to migrate to real scenarios in the physical world. It therefore remains a challenging problem to map physical-world entities into point cloud data and construct effective adversarial samples that interfere with the recognition of the model. To tackle these challenges, this study introduces a novel approach for generating adversarial samples for point cloud data. Instead of directly modifying the coordinates of the point cloud or adding additional interference points, this method carries out the attack by modifying specific attribute information and correlating the data with a simulated physical environment. As depicted in Fig. 1, the approach comprises two essential components, the RMD (Real Mapping to Digital) module and the DPR (Digital Project Back to Real) module, which are used to link the digital world with the physical world. The RMD module converts physical-world entities into sensitive point clouds by modifying the signal amplitude. The DPR module transforms the sensitive point clouds into physical-world target entities and constructs adversarial objects in the emulated physical environment. This paper makes the following significant contributions:
– An adversarial sample generation algorithm for point cloud data based on remote sensing imaging is proposed, extending the attack target from LiDAR to remote sensing radar.
– From a fresh angle, a novel approach is introduced to create impactful point clouds by adjusting the signal amplitude of the point cloud data.
– Adversarial objects are created in the physical world by modulating the diffuse reflectance of object surfaces, which offers a stealthy and controllable attack strategy.

Fig. 1. Introduction diagram. (Ori.: Original. Gene.: Generate. PC.: Point Cloud. Disturb.: Disturbance. Sensi.: Sensitive. Adv.: Adversarial.)


2 Related Work

Point cloud target recognition employs neural networks [11–13] to extract features and enhance recognition efficiency. Voxel-based and point-based neural networks are the two main approaches currently used. Voxel-based methods, such as VoxNet [14] and VoxelNet [15], classify point cloud data by dividing the point cloud space into multiple regular subspaces (voxels). However, this approach faces limitations: low resolution leads to errors and performance restrictions, while high resolution leads to increased computation [16,17]. Point-based approaches employ models with powerful feature expression capabilities to learn features directly from point cloud data, achieving end-to-end target recognition. PointNet [18] and PointNet++ [19] are two popular point-based methods, and this paper uses these two classic models for experimentation: PointNet is used to evaluate the sensitive point cloud generation algorithm, while PointNet++ is used to test the transferability of the generated sensitive point clouds. By manipulating point cloud data, attackers can carry out adversarial attacks on recognition models, generating adversarial samples that deceive them. In physical attacks, adversaries can create adversarial objects with 3D printing techniques based on manipulated point cloud data; such objects can fool LiDAR-based recognition, but they are easily noticed, which often leads to unsuccessful attacks. Another method injects false point cloud data into the adversarial samples; this technique exploits the LiDAR ranging principle and modifies signals during the scattering process to generate false points, but it requires high-precision devices for real-time capture and transmission of LiDAR signals, making it impractical in complex real-world environments.

3 Methods

This article presents a novel method for generating adversarial object samples specifically targeting remote sensing point clouds. As shown in Fig. 1, two key modules (RMD and DPR) are employed to generate adversarial entities suitable for the physical world. The RMD module is gradient-based and uses a binary search algorithm to adaptively adjust a balancing factor; by considering the characteristics of the target entity, it generates sensitive point clouds. The DPR module leverages the principles of ray tracing to establish correspondences between data points in the digital world and real-world target objects, enabling the transfer of adversarial perturbations from the digital domain to the physical world. The specific principles of the two modules are as follows.

3.1 Real Mapping to Digital Module

Point cloud data based on remote sensing imaging is a representation of how radar sensors perceive the physical world through electromagnetic waves. Each


point in the point cloud corresponds to an echo signal received by the sensor, which contains the coordinates and amplitude of the signal. In the domain of remote sensing, a point cloud comprising a set of n points can be denoted as X ⊂ R^{n×4}. Each point x is represented by a vector [x_s, r_s, s_s, A_s], where x_s, r_s, and s_s denote the azimuth, distance, and elevation of the signal, respectively, and A_s is the amplitude of the signal, which captures the characteristics of the imaged target. Building upon this characteristic, this research introduces the approach in Fig. 2 to produce sensitive point clouds by manipulating the signal amplitude of the point cloud. Specifically, an elaborate perturbation p ∈ R^{n×1} is added to the fourth dimension of the original point cloud x to generate a sensitive point cloud x′ such that x′ ← x ⊕ p.
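In array form, the perturbation only touches the amplitude channel; a minimal NumPy sketch (illustrative, with array names chosen here) is:

    import numpy as np

    def perturb_amplitude(points, delta):
        # points: (n, 4) array [azimuth, range, elevation, amplitude]; delta: (n,) perturbation.
        perturbed = points.copy()
        perturbed[:, 3] = perturbed[:, 3] + delta   # x' = x (+) p acts on the amplitude only
        return perturbed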

Fig. 2. The sensitive point clouds generation method diagram.

(i) Point cloud data acquisition. The point cloud data of the target object is obtained through an interferometric technique [20].

(ii) Generation of adversarial perturbations. In this study, the Euclidean distance is used as the perturbation metric in order to generate small yet aggressive perturbations. The size of the perturbation is ||p||_2 = sqrt(Σ_{i=1}^{n} p_i²). The objective for an untargeted attack is given in Eq. 1:

min ||p||_2,  s.t. F(x ⊕ p) ≠ t    (1)

Here, t represents the accurate labels assigned to clean samples, and F (•) represents the classifier that has been trained. Since the optimization problem for this objective function is difficult to solve directly, this paper converts it into a gradient-based optimization algorithm as shown in Eq. 2. min {Ladv (x ⊕ p) + λ ∗ p2 }

(2)

where λ denotes the balancing factor, used to adjust the weights of the adversarial loss and perturbation size so that the perturbation can be optimized towards the targeted attack while being appropriately reduced. Ladv (•) quantifies the difference between the logit output of the classifier and the desired target outcome. This difference becomes smaller when the attack has a higher chance of succeeding. In the targetless attack condition, Ladv (•) denote Eq. 3.   (3) Ladv (x) = M ax Z(x)t − M ax {Z(x)i } , 0 i=t


Here, Z(x)_i represents the i-th component of the logit output produced by the classifier.
(iii) Adaptive adjustment of the balancing factor. The choice of the balancing factor λ directly impacts the effectiveness of the attack and the magnitude of the introduced perturbation. To generate an aggressive perturbation of minimal size, this paper uses a binary search to set the balancing factor adaptively for each input sample. First, the balancing factor λ and its upper and lower bounds (λ_upper and λ_lower) are initialized; second, a fixed number of iterations eps is defined, and the upper and lower bounds of λ are adjusted according to the variation in perturbation magnitude at each iteration; finally, the complete sensitive point cloud of the remote sensing image is generated.
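To make the procedure concrete, the following is a minimal PyTorch-style sketch of the RMD optimization described above. It is only a sketch under stated assumptions: the classifier interface, hyperparameter values, and all function and variable names (e.g. generate_sensitive_pointcloud, adv_loss) are illustrative and are not the authors' released implementation.

import torch

def adv_loss(logits, true_label):
    # Eq. 3: margin between the true-class logit and the best other-class logit.
    z_t = logits[true_label]
    z_other = torch.max(torch.cat([logits[:true_label], logits[true_label + 1:]]))
    return torch.clamp(z_t - z_other, min=0.0)

def generate_sensitive_pointcloud(model, x, true_label, steps=200, eps=10,
                                  lam=1.0, lam_lower=0.0, lam_upper=100.0, lr=0.01):
    """Gradient-based perturbation of the amplitude channel (4th dim) of x,
    with a binary search over the balancing factor lambda (Eq. 2).
    x: (n, 4) tensor; the model is assumed to accept a batch of such point sets."""
    best_pert = None
    for _ in range(eps):                                  # binary-search rounds over lambda
        pert = torch.zeros(x.shape[0], 1, requires_grad=True)   # n x 1 amplitude perturbation
        opt = torch.optim.Adam([pert], lr=lr)
        for _ in range(steps):                            # inner gradient descent on Eq. 2
            x_adv = torch.cat([x[:, :3], x[:, 3:4] + pert], dim=1)
            logits = model(x_adv.unsqueeze(0)).squeeze(0)
            loss = adv_loss(logits, true_label) + lam * pert.norm(p=2)
            opt.zero_grad()
            loss.backward()
            opt.step()
        with torch.no_grad():
            x_adv = torch.cat([x[:, :3], x[:, 3:4] + pert], dim=1)
            success = model(x_adv.unsqueeze(0)).argmax(dim=1).item() != true_label
        if success:
            # Attack works: keep the smallest perturbation and try a larger lambda.
            if best_pert is None or pert.norm() < best_pert.norm():
                best_pert = pert.detach().clone()
            lam_lower = lam
        else:
            # Attack fails: relax the size penalty.
            lam_upper = lam
        lam = 0.5 * (lam_lower + lam_upper)
    return best_pert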

3.2 Digital Project Back to Real Module

In order to apply the sensitive point cloud generation algorithm to physical scenes, this paper introduces the ray-tracing principle and proposes a scheme for constructing an adversarial object: the object is generated by modifying the diffuse reflectance of the target object's surface, as shown in Fig. 3.

Fig. 3. The Adversarial Object Generation Method Diagram.

Ray tracing obtains the reflected signal of a ray after it has interacted with an object by tracing the path of the ray through the scene. By tracing this path, we can obtain the correspondence between each sensitive point and the position on the object's surface. In practice, the electromagnetic rays emitted by radar are difficult to track directly, so the tracking is carried out in a modeled physical coordinate system. The specific procedure consists of the following steps: (i) Modelling of the physical coordinate system. First, a 3D scene of the target is created (as shown in Fig. 3); the x, s, and r axes represent the azimuth, elevation, and distance directions, respectively. Second, an orthographic projection camera is defined to simulate the radar sensor. Finally, the sensor emits multiple parallel rays simultaneously towards the target through a static system to directly obtain the simulated object data.


(ii) Tracking of scene objects. Two spherical objects are placed in the 3D scene and the following operations are performed. First, a primary ray with coordinates (x_p, s_p) is created perpendicular to the sensor plane; second, the coordinates of the ray and of the first reflected signal from object 1 are recorded (x_s = x_p, r_s = r_1, s_s = s_p); next, a shadow test is performed to determine whether the intersection point is in shadow; finally, the signal amplitude is calculated according to the type of reflection at the intersection point. The equations for diffuse and specular reflection are Eq. 4 and Eq. 5, respectively.

I_d = F_d \cdot I_{sig} \cdot (N \cdot L)^{F_b}    (4)

I_s = F_s \cdot (N \cdot H)^{1/F_b}    (5)

In this context, F_d and F_s represent the factors for diffuse and specular reflection, respectively. I_{sig} denotes the intensity of the incident signal, N is the normal vector perpendicular to the surface, L is the normalized vector pointing towards the light source, F_b is the surface brightness factor (with a default value of 1), H is the halfway vector determined by the light-source direction L and the surface normal N, and F_r is the roughness factor that defines the sharpness of the specular highlight. If a secondary reflection is produced at point #1, the direction of the reflection is obtained from the normal and the primary ray at point #1. A fresh ray is then cast in this direction and is assumed to intersect object 2 at point #2, after which a shadow test is executed. To obtain the coordinates of the secondary reflection, a ray parallel to the primary ray (the focal ray) is generated starting from intersection point #2. This ray intersects the sensor plane at the point (x_0, s_0), and from this intersection point the coordinates of the second reflection signal are calculated by Eq. 6:

x_s = \frac{x_0 + x_p}{2}, \qquad r_s = \frac{r_1 + r_2 + r_3}{2}, \qquad s_s = \frac{s_0 + s_p}{2}    (6)

The amplitude of the second reflection signal is obtained by weighting it according to the reflection coefficient of object 1. Continuing in the same manner, the coordinates of the third reflection are pursued until the number of reflections reaches the pre-defined upper threshold. Moreover, since the sensor plane is gridded, each individual point in the point cloud approximately corresponds to a small local patch of the object's surface. Based on the mapping relationship between signal amplitude and diffuse reflectance, Diffuse = f(Amplitude), the diffuse reflectance of the adversarial object surface corresponding to each sensitive point is calculated. By covering the modified area of the object's surface with a material of the appropriate reflectance, the diffuse reflectance of that area is changed to the specified value, and the adversarial object is obtained.
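As a small illustration of how the DPR bookkeeping can be organised, the sketch below computes the second-reflection coordinates of Eq. 6 and applies an amplitude-to-reflectance mapping. The linear mapping function and all names are hypothetical placeholders; the paper only specifies a concrete mapping for its experimental object in Sect. 4.3.

def second_reflection_coords(x_p, s_p, x_0, s_0, r_1, r_2, r_3):
    # Eq. 6: coordinates of the second reflection signal on the sensor plane.
    x_s = (x_0 + x_p) / 2.0
    r_s = (r_1 + r_2 + r_3) / 2.0
    s_s = (s_0 + s_p) / 2.0
    return x_s, r_s, s_s

def amplitude_to_diffuse(amplitude, scale=1.41421369):
    # Diffuse = f(Amplitude); a linear mapping is assumed here, matching the
    # relation reported in Sect. 4.3 for the specific target geometry used there.
    return scale * amplitude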

400

4 Experimental Analysis and Discussion

4.1 Datasets and Classifiers

To generate the experimental dataset, three types of geometric models, denoted Step1, Step2, and Step3, were first generated using RaySAR; they are simulated entities composed of different numbers (1, 2, 3) of cuboids. When rendering a model, the simulated radar sensor is located on a circle centered at the origin (0, 0, 0) with a radius of 60 and at a 45° angle to the horizontal plane. Keeping the sensor position constant, the model is then rotated about the vertical axis through the origin, generating a new view every 1° of rotation. Finally, the model data is collected using the RaySAR rendering operation and the first four dimensions are extracted to produce a point cloud sample. The point cloud samples obtained through the above steps contain information not only from multiple reflections of the radar signal on the target object but also from reflections from the ground. Because multiple reflections carry less valid information in the electromagnetic waves, and in order to generate adversarial samples effectively, this paper keeps only the point cloud data from a single reflection. These point cloud data are pre-processed to give a dataset consisting of 360*3 samples, which is used to train the PointNet classifier. Training the PointNet classifier for target identification and attack involves several steps. Firstly, the dataset is divided into a training set and a test set by randomly selecting 70% of the samples for training and allocating the remaining 30% to testing. Secondly, the PointNet classifier is configured with specific training parameters: the sampling parameter is set to 256, using both random sampling and farthest-point sampling, which helps handle the irregularity of the point cloud data. Lastly, after training with these parameters, the classifier reaches a training accuracy of 96.2% and a test accuracy of 97.4%, indicating its proficiency in correctly identifying and classifying targets. For the experimental evaluation in Sect. 4.2, the samples of each category were sorted by increasing rotation angle and one sample was selected every 4°, giving a total of 90 samples used to generate adversarial samples (the average number of points per class is 2564, 2188, and 1317, respectively).

4.2 Attack Performance

To verify the feasibility of the algorithm, experimental results on attack success rate and perturbation magnitude are presented. The data in the second column of Table 1 reveal that the attack success rate for adversarial samples of the true classes Step1 and Step2 exceeds 96%.


Table 1. Targetless attack success rate and perturbation size assessment. (SR: Success Rate. Dist.: Distance.)

True Label  SR      Ave L2 Dist.  Min L2 Dist.  Max L2 Dist.
Step1       96.6%   0.1012        0.0824        0.1661
Step2       96.9%   0.9886        0.3962        3.024
Step3       55.4%   0.3246        0.0348        2.8241

Additionally, over half of the adversarial samples with the true class Step3 were successfully attacked. This indicates the practicality of the proposed adversarial point cloud generation algorithm in the context of untargeted attacks. Furthermore, the data in the last three columns of Table 1 indicate that the generated adversarial samples exhibit minimal perturbation in untargeted attack scenarios, further highlighting the algorithm's ability to generate covert perturbations.

4.3 Simulation of Physical Scene Attacks

To evaluate the feasibility of the technique put forward in Sect. 3.2, this experiment selects one sample and performs an untargeted attack on it. A clean sample of the true class Step1 at angle 0°, comprising 1775 points, was selected. The corresponding 3D object has two convenient properties: (i) its faces are parallel to the coordinate planes, which simplifies determining the position to modify; and (ii) the rays intersect the object's upper face and sides at a 45° angle, so the received points exhibit the same signal amplitude, which simplifies the relationship between diffuse reflectance and signal amplitude.

Fig. 4. The signal amplitude of the sample.

Next, the generated adversarial samples were subjected to a filtering process. The algorithm described in Sect. 3.1 was employed to generate an adversarial sample for an untargeted attack; this sample is identified by the PointNet classifier as Step2 with 96.84% confidence. In the filtering operation, the perturbation threshold ξ is set to 0.125 and the adversarial sample is filtered. The filtered adversarial sample retains perturbations on only 6 sensitive points, and it is still identified as Step2 with a confidence of 88.68%, indicating that the sample remains strongly adversarial and that the adversarial object can be constructed from it. As illustrated in Fig. 4, the blue line represents the signal amplitude of the clean sample, the green line that of the adversarial sample, and the red line that of the adversarial sample after filtering. It can be observed that most of the added perturbations are small, with only a few points having larger perturbations. Furthermore, after filtering, the adversarial sample has far fewer points whose perturbation absolute values exceed the predefined threshold.

Fig. 5. The signal amplitude of physical adversarial samples.

Finally, the location of the modified area is determined and its diffuse reflectance is modified according to the resulting mapping relation Diffuse = 1.41421369 × Intensity to generate the adversarial object. To confirm the effectiveness of the generated adversarial object, an evaluation was conducted: RaySAR was used to acquire data of the 3D adversarial object and the signal amplitude at each point was visualized. The obtained results are shown in Fig. 5; the blue line in the figure is nearly identical to the red line in Fig. 4. The point cloud data was then identified with the PointNet classifier and classified as Step2 with a confidence of 89.57%. This indicates that the proposed scheme for constructing an adversarial object is feasible and that the adversarial object successfully attacks the classifier.

5 Conclusion

Recent investigations have revealed the vulnerability of deep learning-based methods used for point cloud data target recognition to adversarial sample attacks. In this study, we propose a novel algorithm for generating adversarial samples in the digital domain by modifying the signal amplitude of point cloud data. We subsequently evaluate its effectiveness in compromising the performance of point cloud data target recognition methods. Our experimental results demonstrate the formidable attack potential of adversarial samples generated through signal amplitude modification in point cloud data. Additionally, we present a physically realizable attack strategy by combining ray-tracing techniques. In our experiments, we generate attack samples in the digital world and utilize RaySAR software to construct physical adversarial samples for targeting the recognition algorithm. The experimental findings substantiate the successful deception of the deep learning-based target recognition algorithm by the constructed physical adversarial samples.

References 1. Dai, A., Chang, A.X., Savva, M., Halber, M., Funkhouser, T., Nießner, M.: ScanNet: richly-annotated 3D reconstructions of indoor scenes. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 5828–5839 (2017) 2. Lai, B., et al.: 2D3D-MVPNet: learning cross-domain feature descriptors for 2D3D matching based on multi-view projections of point clouds. Appl. Intell. 52(12), 14178–14193 (2022) 3. Yang, B., Luo, W., Urtasun, R.: PIXOR: real-time 3D object detection from point clouds. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 7652–7660 (2018) 4. Shi, S., Wang, X., Li, H.: PointRCNN: 3D object proposal generation and detection from point cloud. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 770–779 (2019) 5. Fernandes, D., et al.: Point-cloud based 3D object detection and classification methods for self-driving applications: a survey and taxonomy. Inf. Fusion 68, 161– 191 (2021) 6. Lawin, F.J., Danelljan, M., Tosteberg, P., Bhat, G., Khan, F.S., Felsberg, M.: Deep projective 3D semantic segmentation. In: Felsberg, M., Heyden, A., Kr¨ uger, N. (eds.) CAIP 2017. LNCS, vol. 10424, pp. 95–107. Springer, Cham (2017). https:// doi.org/10.1007/978-3-319-64689-3 8 7. Wu, B., Wan, A., Yue, X., Keutzer, K.: SqueezeSeg: convolutional neural nets with recurrent CRF for real-time road-object segmentation from 3D lidar point cloud. In: 2018 IEEE International Conference on Robotics and Automation (ICRA), pp. 1887–1893. IEEE (2018) 8. Liu, D., Yu, R., Su, H.: Extending adversarial attacks and defenses to deep 3D point cloud classifiers. In: 2019 IEEE International Conference on Image Processing (ICIP) (2019) 9. Xiang, C., Qi, C.R., Li, B.: Generating 3D adversarial point clouds. In: 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2019) 10. Guo, Y., Wang, H., Hu, Q., Liu, H., Liu, L., Bennamoun, M.: Deep learning for 3D point clouds: a survey. IEEE Trans. Pattern Anal. Mach. Intell. 43(12), 4338–4364 (2020) 11. Aloysius, N., Geetha, M.: A review on deep convolutional neural networks. In: 2017 International Conference on Communication and Signal Processing (ICCSP), pp. 0588–0592. IEEE (2017)


12. Pang, Y., et al.: Graph decipher: a transparent dual-attention graph neural network to understand the message-passing mechanism for the node classification. Int. J. Intell. Syst. 37(11), 8747–8769 (2022) 13. Pang, Y., et al.: Sparse-Dyn: sparse dynamic graph multirepresentation learning via event-based sparse temporal attention network. Int. J. Intell. Syst. 37(11), 8770–8789 (2022) 14. Maturana, D., Scherer, S.: VoxNet: a 3D convolutional neural network for realtime object recognition. In: 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pp. 922–928. IEEE (2015) 15. Zhou, Y., Tuzel, O.: VoxelNet: end-to-end learning for point cloud based 3D object detection. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 4490–4499 (2018) 16. Kang, Z., Yang, J., Zhong, R., Wu, Y., Shi, Z., Lindenbergh, R.: Voxel-based extraction and classification of 3-D pole-like objects from mobile lidar point cloud data. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 11(11), 4287–4298 (2018) 17. Kuang, H., Wang, B., An, J., Zhang, M., Zhang, Z.: Voxel-FPN: multi-scale voxel feature aggregation for 3D object detection from lidar point clouds. Sensors 20(3), 704 (2020) 18. Qi, C.R., Su, H., Mo, K., Guibas, L.J.: PointNet: deep learning on point sets for 3D classification and segmentation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 652–660 (2017) 19. Qi, C.R., Yi, L., Su, H., Guibas, L.J.: PointNet++: deep hierarchical feature learning on point sets in a metric space, arXiv preprint arXiv:1706.02413 (2017) 20. Dzurisin, D., Dzurisin, D., Lu, Z.: Interferometric synthetic-aperture radar (InSAR). In: Dzurisin, D. (ed.) Volcano Deformation: Geodetic Monitoring Techniques, pp. 153–194. Springer, Heidelberg (2007). https://doi.org/10.1007/978-3540-49302-0 5

Adversarial Robustness via Multi-experts Framework for SAR Recognition with Class Imbalanced

Chuyang Lin, Senlin Cai, Hailiang Huang, Xinghao Ding, and Yue Huang(B)

School of Informatics, Xiamen University, Xiamen, China
[email protected]

Abstract. With the rapid development of deep learning technology, significant progress has been made in the field of synthetic aperture radar (SAR) target recognition algorithms. However, deep neural networks are vulnerable to adversarial attacks in practical applications, inducing learning models to make wrong predictions. Existing works on adversarial robustness usually assume that the datasets are balanced, while in real-world applications SAR datasets often suffer from seriously imbalanced distributions, which brings challenges to target recognition tasks and also affects the adversarial defense of models. So far, few works have been reported on adversarial robustness for imbalanced SAR target recognition. Besides, a single model is easily limited to a specific adversarial sample distribution. Motivated by this, a multi-expert collaborative diagnosis framework based on Contrastive Self-Supervised Aggregation (CS2AME) is proposed. The framework trains multiple personalized expert models precisely for dealing with specific SAR targets by formulating three expert guidance schemes, to better deal with different adversarial samples. In addition, a contrastive self-supervised aggregation strategy is designed to adaptively aggregate the professional expertise of each expert model. Extensive adversarial robustness recognition experiments on three publicly available imbalanced SAR datasets have demonstrated that the proposed CS2AME outperforms existing works in terms of standard performance and robust performance.

Keywords: SAR target recognition · Adversarial robustness · Class imbalanced distribution · Multi-expert

1 Introduction

(The work was supported in part by the National Natural Science Foundation of China under Grant 82172033, U19B2031, 61971369, 52105126, 82272071, 62271430, and the Fundamental Research Funds for the Central Universities 20720230104.)

Synthetic aperture radar (SAR) [14] is an active ground-based observation system capable of generating high-resolution radar images in low visibility and a variety of scenarios. It can overcome the influence of weather conditions and various scene factors, so it has important application value in many fields. The development of neural networks has enabled SAR target recognition technology based on deep learning to achieve notable results [20]. However, deep neural networks have been found to be vulnerable to slight adversarial interference [19], leading to erroneous predictions. The presence of adversarial samples exposes the vulnerability of neural networks and poses a serious security risk to radar target recognition systems. Improving adversarial robustness has therefore become an essential requirement.

Existing research on adversarial robustness [22] typically assumes that the distribution of the dataset is balanced. However, in practical application scenarios, SAR target data can easily be affected by various factors during the complex imaging procedure, resulting in seriously imbalanced class distributions. Categories with sufficient sample sizes are referred to as head classes, while those with insufficient sample sizes are referred to as tail classes. To investigate the challenges posed by adversarial robustness under imbalanced data distribution, we compared the recognition performance of models trained on the publicly available MSTAR [18] dataset under balanced and imbalanced conditions. Figure 1 shows the accuracy of the SAR target recognition model on original clean images and on perturbed images after the PGD attack [12]. Standard accuracy (SA) is used to evaluate the recognition performance on clean images for the normally trained and adversarially trained (AT) models, i.e. Plain-SA and AT-SA, respectively. Robust accuracy (RA) refers to the performance of the AT model on perturbed images, i.e. AT-RA. Figure 1 reveals that: (1) on imbalanced datasets, both SA and RA exhibit a downward trend from head classes to tail classes; (2) when trained with the AT method, the standard and robust accuracy of the AT model decline more severely on the tail classes. Adversarial robustness training is thus also seriously affected by this imbalanced class distribution. Learning the diverse features of SAR targets in the tail classes [7] is inherently challenging: when the features of tail classes are slightly disturbed, the model may misclassify them as features of other categories, leading to poor recognition performance on tail-class SAR targets. However, there remains a lack of solutions for improving the adversarial robustness of SAR target recognition models under class imbalanced distribution.

To improve adversarial robust recognition performance under class imbalanced distribution, while considering the limitations of single models, we designed a multi-expert collaborative diagnosis framework based on contrastive self-supervised aggregation (CS2AME). The framework consists of personalized expert model training and a contrastive self-supervised aggregation strategy. The former trains different expert models to learn features of various categories in a more generalized manner, while the latter adaptively learns expert aggregation weights through contrastive learning and integrates prediction information from different expert models to achieve more accurate recognition results. The proposed CS2AME is tested on the MSTAR dataset using various attack methods, including PGD, C&W, and AA. Adequate experiments are performed to show the effectiveness of our approach, achieving 4–9% accuracy gains overall and 2–12% gains in the tail classes.


Fig. 1. Comparison chart of recognition performance of models under different data distributions.

The main contributions of this work are summarized as follows:
• A multi-expert collaborative framework based on contrastive self-supervised aggregation is proposed for adversarial robustness in SAR recognition with an imbalanced distribution.
• Considering the imbalanced distribution, three diverse expert guidance losses are developed for different adversarial sample distributions.
• A contrastive self-supervised aggregation strategy is designed to adaptively learn the aggregation weights of multiple expert models.

2 Related Work

2.1 Adversarial Robustness

Adversarial defense is a strategy that enables models to correctly identify adversarial samples and reduces the success rate of attacks. This includes model-level defenses such as gradient regularization [17], as well as data-level defenses such as preprocessing against PGD attacks to eliminate adversarial perturbations, hypersphere embedding [15], and vector image conversion [9]. In response to the adversarial robustness issue of SAR target recognition models, Xu et al. [23] introduced an unsupervised adversarial contrastive learning defense method for SAR target recognition.

2.2 Long-Tailed Recognition

To address the long-tailed recognition problem, several families of methods have made progress, e.g., traditional class rebalancing methods, information enhancement methods, and methods that improve the network structure. Traditional class rebalancing methods aim to rebalance the differences in data volume between classes, primarily by using different sampling methods and by weighting the loss function [3,16]. Information enhancement methods enhance model training by introducing additional information and can be divided into transfer learning [8] and data augmentation [26]; by introducing additional information for tail classes, their accuracy can be improved without sacrificing the performance of head classes. Other methods directly improve network modules, such as decoupled learning and ensemble learning: the former separates the training process into representation learning and classifier learning [10], while the latter, including dual-branch networks [28] and multi-expert models [1], improves the overall performance of the model. In addition, to address the class imbalance problem of SAR targets, Yang et al. [24] introduced a dual-branch framework.

3 Methodology

Existing adversarial robustness methods are based on balanced datasets, and general long-tailed methods do not take into account the challenges posed by adversarial attacks. At the same time, a single model is only good at dealing with one class-imbalanced training distribution, which often produces class bias and cannot adapt to a balanced test distribution. In addition, a specific training distribution cannot cover all possible distributions of generated adversarial samples, which makes it difficult for a single model to fit new adversarial samples of the tail classes. The risk that a single model misjudges tail-class SAR targets is thus greatly increased, reducing the safety and reliability of radar target recognition systems. To this end, we propose an adversarial robustness solution for SAR target recognition models under class imbalanced distribution and design a contrastive self-supervised multi-expert collaborative diagnosis framework (namely CS2AME) to address the adversarial robustness recognition problem of imbalanced SAR targets. The structure of the framework is shown in Fig. 2 and can be divided into two parts: personalized expert model training and a contrastive self-supervised aggregation strategy. The former trains different expert models to learn different categories of features in a more generalized way, while the latter learns expert aggregation weights adaptively through contrastive learning and integrates prediction information from different expert models to obtain more accurate recognition results. Specifically, we formulate three expert training schemes by introducing different expert guidance losses, so that multiple personalized expert models can be trained to deal with different adversarial sample distributions. In addition, we design the contrastive self-supervised aggregation strategy, which adaptively aggregates the professional advantages of each expert model while making up for the lack of learning for SAR targets in the tail classes.


Fig. 2. The framework of the proposed CS2AME model: (a) extract features of adversarial samples through the backbone network, use three loss-guided expert models to learn knowledge of different distributions. (b) maximize the agreement between different augmented views.

3.1 Adversarial Robustness

The goal of adversarial attacks is to use adversarial samples to deceive the target model by searching for easily confused samples, thereby maximizing the adversarial loss. Suppose the training set is D = (X, Y), where x ∈ X is a training sample and y ∈ Y is its class label. The deep neural network is defined as f_θ : x → y, where θ denotes its learnable parameters. Adversarial training is one of the most effective defenses against adversarial attacks and is usually described as solving a minimax problem. Its performance depends on the quality of the adversarial samples generated by the inner optimization [12] and is defined as shown in Eq. (1):

\arg\min_{\theta} \mathbb{E}_{(x,y)\sim D}\left[ \max_{\delta \in \Omega} L_{CE}(x+\delta, y; \theta) \right]    (1)

where δ represents the adversarial perturbation and x + δ represents the adversarial sample x_adv obtained after adding the perturbation to the clean sample x. Ω = B(x, ε) denotes an L_∞ norm ball with radius ε around x.
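For reference, a minimal PGD-style sketch of the inner maximization in Eq. (1) is given below. The step size, number of steps, and all names are illustrative assumptions rather than the exact training settings used in the paper, and any data-range clamping is omitted since it depends on the input normalization.

import torch
import torch.nn.functional as F

def pgd_perturb(model, x, y, eps=8/255, alpha=2/255, steps=10):
    """Inner maximization of Eq. (1): find delta in the L-infinity ball of radius eps
    that approximately maximizes the cross-entropy loss."""
    delta = torch.empty_like(x).uniform_(-eps, eps).requires_grad_(True)
    for _ in range(steps):
        loss = F.cross_entropy(model(x + delta), y)
        grad = torch.autograd.grad(loss, delta)[0]
        with torch.no_grad():
            delta += alpha * grad.sign()      # gradient ascent step on the loss
            delta.clamp_(-eps, eps)           # project back into the L-infinity ball
    return (x + delta).detach()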

3.2 Personalized Expert Training

Ensemble learning with multiple expert models has been shown to be effective in addressing the class imbalanced problem [1]. To learn a multi-expert model with diverse skills from imbalanced SAR data, we construct a multi-expert model consisting of two main modules: (1) a shared backbone network f_θ, and (2) independent expert networks E_1, E_2, E_3.

In existing deep models, shallow networks are typically used to extract more general representations, while deep networks are used to extract more specific individual features [25]. Therefore, the multi-expert model uses the first two layers of the ResNet [5] structure as the backbone network and the last two layers plus the fully connected layer as independent components of each expert model. By designing three different expert guidance loss functions, the expert models are guided to handle different data and trained to cope with varying adversarial sample distributions.

Head Expert. The head expert E_1 is primarily responsible for learning the knowledge of the head classes in the imbalanced distribution. Since the imbalanced dataset itself exhibits an exponentially decreasing distribution pattern, the cross-entropy loss is used in combination with the adversarial training framework to train E_1, with the adversarial sample x_adv as the input of the model. The cross-entropy loss is defined as:

L_{CE}(x_i, y_i) = -\log\left( p^{1}_{y_i}(x_i + \delta) \right)    (2)

where p^{1}_{y_i} is the output of the head expert E_1, that is, the predicted value of class y_i in P_1, and δ is the adversarial perturbation. The inner optimization is calculated in the same way as in Eq. (1).

Middle Expert. The middle expert E_2 is used to balance the knowledge of the head and tail classes in the class imbalanced distribution. Inspired by [13], prior distribution information is introduced to compensate for the deviation of the prior distribution in the logits and alleviate the inherent bias caused by category imbalance. The uniform cross-entropy loss is used for optimization and is defined as follows:

L_{UCE}(x_i, y_i) = -\log\left( p^{2}_{y_i}(x_i + \delta) \right) + \log(\rho_{y_i})    (3)

where p^{2}_{y_i} is the output of the middle expert, that is, the predicted value of class y_i in P_2, and ρ_{y_i} is the frequency of class y_i under the exponential distribution.

Tail Expert. The tail expert E_3 focuses on learning the knowledge of the tail classes in the imbalanced distribution. To address the issue of few training samples in the tail classes, the idea of logit adjustment [13] is adopted again, and reverse prior distribution information is incorporated to improve the model's learning of the tail classes. The loss function for the tail expert is the inverse cross-entropy loss, defined as:

L_{ICE}(x_i, y_i) = -\log\left( p^{3}_{y_i}(x_i + \delta) \right) + \log(\rho_{y_i}) - \log(\bar{\rho}_{y_i})    (4)

where p^{3}_{y_i} is the output of the tail expert, that is, the predicted value of class y_i in P_3, and \bar{ρ}_{y_i} is the frequency of class y_i under the inverse exponential distribution.
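A compact sketch of the three expert guidance losses is shown below, following Eqs. (2)–(4). Whether the prior correction is added to the log-probability (as written here) or folded into the logits before the softmax is an implementation choice not fully specified by the excerpt; taking the inverse distribution rho_bar as the flipped class-frequency vector, and all function and variable names, are likewise assumptions.

import torch
import torch.nn.functional as F

def expert_losses(logits1, logits2, logits3, y, class_freq):
    """Eqs. (2)-(4): guidance losses for the head, middle, and tail experts.
    logits_k: (B, C) outputs P_k of expert E_k on adversarial samples x + delta.
    class_freq: (C,) empirical class frequencies rho of the (exponential) training
    distribution; rho_bar is assumed to be its reversed ordering."""
    log_rho = torch.log(class_freq)                  # log rho_y
    log_rho_bar = torch.log(class_freq.flip(0))      # log rho_bar_y (reversed prior)

    log_p1 = F.log_softmax(logits1, dim=1)
    log_p2 = F.log_softmax(logits2, dim=1)
    log_p3 = F.log_softmax(logits3, dim=1)

    # Eq. (2): plain cross-entropy for the head expert.
    l_ce = -log_p1.gather(1, y.unsqueeze(1)).squeeze(1)
    # Eq. (3): uniform cross-entropy with a prior correction term.
    l_uce = -log_p2.gather(1, y.unsqueeze(1)).squeeze(1) + log_rho[y]
    # Eq. (4): inverse cross-entropy emphasising the tail classes.
    l_ice = -log_p3.gather(1, y.unsqueeze(1)).squeeze(1) + log_rho[y] - log_rho_bar[y]

    return l_ce.mean(), l_uce.mean(), l_ice.mean()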

3.3 Contrastive Self-supervised Aggregation Strategy

Different expert models have different strengths. We design a contrastive self-supervised aggregation strategy that assigns learnable weights to each expert and adaptively aggregates their strengths. Assuming that the prediction logits of the multi-expert model are (P_1, P_2, P_3) and the corresponding aggregation weights are ω = (ω_1, ω_2, ω_3), the final prediction ŷ of the model is defined as:

\hat{y} = \omega_1 \cdot P_1 + \omega_2 \cdot P_2 + \omega_3 \cdot P_3    (5)

\text{s.t.}\quad \omega_1 + \omega_2 + \omega_3 = 1    (6)

Using a contrastive learning strategy [11] and applying data augmentation twice (t, t' ∼ T), we maximize the consistency of the multi-expert prediction ŷ based on the cosine similarity between the two augmented views, so that higher aggregation weights ω are learned for stronger expert models. The contrastive self-supervised aggregation loss is therefore defined as follows:

\arg\min_{\omega} L_{CS2A} = -\frac{1}{N} \sum_{x \in D} \cos(\hat{y}_1, \hat{y}_2)    (7)

where D = \{x_i, y_i\}_{i=1}^{N} is the dataset, N is the total number of samples, cos(·) denotes cosine similarity, and ŷ_1 and ŷ_2 are the model predictions for two augmented views x_1 and x_2 of the same instance.
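The following sketch illustrates how the aggregation weights could be learned from two augmented views, in line with Eqs. (5)–(7) and the stated goal of maximizing agreement between the views. The softmax parameterisation used to keep the weights positive and summing to one, as well as all names, are assumptions made for illustration.

import torch
import torch.nn.functional as F

class ExpertAggregator(torch.nn.Module):
    """Learnable aggregation weights omega = (w1, w2, w3) with w1 + w2 + w3 = 1."""
    def __init__(self):
        super().__init__()
        self.w = torch.nn.Parameter(torch.zeros(3))   # softmax(0,0,0) = (1/3, 1/3, 1/3)

    def forward(self, p1, p2, p3):
        omega = torch.softmax(self.w, dim=0)          # enforces the simplex constraint (Eq. 6)
        return omega[0] * p1 + omega[1] * p2 + omega[2] * p3   # Eq. (5)

def cs2a_loss(agg, preds_view1, preds_view2):
    """Eq. (7): encourage agreement between the aggregated predictions of two
    augmented views of the same instances (maximize cosine similarity)."""
    y1 = agg(*preds_view1)    # each preds_view* is a tuple (P1, P2, P3) of (B, C) logits
    y2 = agg(*preds_view2)
    return -F.cosine_similarity(y1, y2, dim=1).mean()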

3.4 Loss Function

Inspired by some classic AT variants [27], an additional regularization term weighted by λ is introduced: the mean square error (MSE) between the paired logits of clean samples and adversarial samples is measured, further improving the model's robustness against adversarial attacks. In summary, the overall loss of the CS2AME algorithm is defined as:

L_{CS2AME} = \omega_1 \cdot L_{CE} + \omega_2 \cdot L_{UCE} + \omega_3 \cdot L_{ICE} + \lambda \cdot L_{MSE}\big( \bar{P}(x + \delta), \bar{P}(x) \big)    (8)

where λ is a regularization parameter and \bar{P} is the average prediction logits output by the multi-expert model.
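Putting the pieces together, a possible per-batch training loss could look as follows. This is a sketch under the assumption that the expert_losses helper and the aggregation weights from the previous sketches are available; lambda_reg = 1 follows the setting reported in Sect. 4.1, and all names are illustrative.

import torch
import torch.nn.functional as F

def cs2ame_loss(logits_adv, logits_clean, y, omega, class_freq, lambda_reg=1.0):
    # logits_adv / logits_clean: tuples (P1, P2, P3) on adversarial and clean inputs.
    l_ce, l_uce, l_ice = expert_losses(*logits_adv, y, class_freq)   # Eqs. (2)-(4)
    p_adv = torch.stack(logits_adv, dim=0).mean(dim=0)      # average expert logits P(x + delta)
    p_clean = torch.stack(logits_clean, dim=0).mean(dim=0)  # average expert logits P(x)
    l_mse = F.mse_loss(p_adv, p_clean)                      # clean/adversarial consistency term
    return omega[0] * l_ce + omega[1] * l_uce + omega[2] * l_ice + lambda_reg * l_mse   # Eq. (8)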

4 Experiments

4.1 Setup

Dataset. The experiments use the open-source MSTAR dataset [18], which contains 10 categories with image size 128 × 128. All samples with a pitch angle of 17° are used as the training set, and those with a pitch angle of 15° are used as the test set; they are processed into the class imbalanced datasets Exp-MSTAR and Step-MSTAR. Exp-MSTAR is built according to N_j^E = N_max^E × (1/η)^{(j−1)/(C−1)}, where N_j^E is the number of samples of the j-th class, C is the total number of classes, and the imbalance rate is η = 100. Step-MSTAR is built according to N_k^S = N_max^S × (1/η)^{k−1}, where η = N_{k−1}^S / N_k^S = 20 and only two steps are set. In addition, we use the naturally class-imbalanced FUSAR-Ship dataset [6] (CI-FUSAR). This dataset contains 15 categories in total, with image size 512 × 512; the 6 categories with more than 100 samples are selected, 100 samples per category are randomly chosen as the test set, and all remaining samples form the training set. To validate the effectiveness of the proposed framework on Exp-MSTAR, Step-MSTAR, and CI-FUSAR, all classes are divided into head and tail classes: the first half of the classes, with rich samples, are taken as the majority classes and the rest, with few samples, as the minority classes.

Implementation Details. We use ResNet-18 as the backbone of the multi-expert model, where the first two layers serve as the shared backbone network and the last two layers plus the fully connected layer are replicated three times to construct the independent components of the experts. The aggregation weights ω are initialized to (1/3, 1/3, 1/3) and updated every 50 epochs, with a regularization coefficient λ of 1. The Baseline method trains the model with the ordinary cross-entropy loss. To verify the defense performance against adversarial attacks, we use three attack methods: PGD [12], C&W [2], and AA [4].

Evaluation Metrics. Accuracy, robust accuracy, and the F1 score are used to evaluate the proposed method. Standard accuracy measures the target recognition performance of the model, while robust accuracy measures its adversarial robustness. The F1 score is introduced as a comprehensive indicator to balance the impact of precision and recall. We define A_all as the overall precision, A_maj as the head-class precision, and A_min as the tail-class precision.
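For illustration, the per-class sample budgets of the two imbalanced MSTAR variants can be computed as below. The function names are hypothetical, n_max is the size of the largest class, and the assignment of classes to the two steps of Step-MSTAR (first half vs. second half) is an assumption.

def exp_mstar_counts(n_max, num_classes=10, eta=100):
    # Exponential profile: N_j = N_max * (1/eta)^((j-1)/(C-1)), j = 1..C.
    return [int(n_max * (1.0 / eta) ** ((j - 1) / (num_classes - 1)))
            for j in range(1, num_classes + 1)]

def step_mstar_counts(n_max, num_classes=10, eta=20):
    # Step profile with two levels: classes in the first step keep N_max,
    # classes in the second step are reduced by the factor eta (N_{k-1}/N_k = eta).
    half = num_classes // 2
    return [n_max if j < half else int(n_max / eta) for j in range(num_classes)]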

4.2 Experimental Results

To verify the robustness of the proposed framework for imbalanced SAR target recognition, several prominent methods were selected for comparison. The relevant experimental results are shown in Table 1 and Table 2. From the tables we can see that the robust recognition accuracy of CS2AME under multiple attack settings is better than that of other advanced methods, indicating that our model has better robustness and yields a higher-quality model. At the same time, the recognition accuracy of the tail classes under attack is higher, which shows that our model has a stronger learning ability for the tail classes.


Table 1. Experimental results on Exp-MSTAR and Step-MSTAR.

Method         Exp-MSTAR                       Step-MSTAR
               Clean  PGD    C&W    AA         Clean  PGD    C&W    AA
Baseline       74.00  0.00   0.00   0.00       89.58  0.00   0.00   0.00
PGD-AT [12]    66.33  44.87  44.92  44.08      79.17  57.46  57.96  56.75
TRADES [27]    66.54  45.96  46.29  45.29      80.17  58.83  59.21  58.13
PGD-HE [15]    67.13  47.50  46.25  45.00      80.75  59.29  58.71  57.37
Robal-N [21]   71.58  49.08  49.63  48.38      80.79  59.42  59.62  58.04
Robal-R [21]   72.50  48.71  48.71  47.42      81.13  59.29  60.00  57.67
CS2AME         71.42  54.04  53.75  53.12      82.50  65.21  65.42  64.67

Table 2. Experimental results on MSTAR and CI-FUSAR. Method

AA

Datasets

Exp-MSTAR Amaj Amin F1

Step-MSTAR Amaj Amin F1

CI-FUSAR Amaj Amin

F1

Baseline PGD-AT [12] TRADES [27] PGD-HE [15] Robal-N [21] Robal-R [21]

0.00 66.78 69.16 66.34 66.08 63.26

0.00 83.35 84.23 83.17 81.50 80.62

0.33 87.33 87.33 89.67 87.67 86.67

0.00 63.45 62.78 64.53 67.29 66.65

CS2AME

71.63 36.52 50.27 86.96 44.66 64.22 88.33


C&W

0.00 23.72 23.87 25.85 32.49 33.20

0.00 37.69 38.61 38.63 44.01 43.74

0.00 32.89 34.70 34.23 37.00 37.08

0.00 54.66 56.48 55.66 56.82 56.57

0.00 44.57 43.12 44.20 51.09 50.72

53.99 69.26

4.3 Ablation Study

This section conducts a series of ablation experiments on the Exp-MSTAR dataset and provides a more detailed analysis of the proposed method. To verify the contribution of the proposed multi-expert model to the overall performance of the framework, we compare the performance of different expert combinations in Table 3. The results show that the different expert models have distinct learning capabilities and that the combination of all three experts achieves good overall recognition performance; the three expert models complement each other and comprehensively improve the adversarial robustness of the model. Figure 3 explores the impact of different aggregation strategies on the learning of the multi-expert model. EW denotes the equal-weight strategy; the model trained with the proposed CS2A strategy performs more prominently in terms of robustness, since it adaptively learns the aggregation weight coefficients and fully exploits the advantages of multi-expert collaborative diagnosis.

414

Table 3. Ablation experiments with different expert combinations. Method

Clean

E1 E2 E3 √ √ √ √ √ √ √ √ √

Aall

Amaj

Amin

F1

Aall

Amaj

Amin

F1

64.96 69.37 67.75 66.67 67.67 70.00

92.25 92.42 86.43 89.96 89.16 87.75

40.47 48.70 50.99 45.77 48.38 54.07

60.47 66.88 66.51 63.78 65.23 69.19

49.83 51.62 46.37 49.21 50.17 49.46

74.63 71.10 60.88 70.93 69.25 62.82

27.59 34.15 33.36 29.72 33.04 37.47

43.66 47.87 44.74 43.88 46.28 48.18

53.44

70.20 51.79 70.84

34.70

49.05







AA

71.63 91.89

Fig. 3. Effect of aggregation weights

Fig. 4. Effect of backbone network.

To explore the impact of backbone network depth on the model's ability to extract general features, Fig. 4 shows the robust accuracy of four backbone structures of different depths under different attack metrics, where F1L–F4L denote using the first 1–4 layers of the ResNet structure as the shared backbone network. The curve area formed by F2L is the largest and the resulting model has better adversarial robustness recognition performance, indicating that this depth is more conducive to extracting general representations in the backbone network.

To evaluate the generalization and promotion performance of the proposed framework, we tested it on datasets with different imbalance rates. The results are shown in Table 4; CS2AME consistently maintains the best robust recognition performance.

Table 4. Effect of imbalance rate.

Imbalanced Rate η  Method        Clean  PGD    C&W    AA
100                Robal-N [21]  71.58  49.08  49.63  48.38
                   Robal-R [21]  72.50  48.71  48.71  47.42
                   CS2AME        71.42  54.04  53.75  53.12
50                 Robal-N [21]  80.75  56.25  56.87  55.25
                   Robal-R [21]  80.33  56.50  56.37  55.46
                   CS2AME        80.79  59.92  59.42  59.04
20                 Robal-N [21]  89.08  64.71  65.29  63.62
                   Robal-R [21]  90.08  65.21  66.13  63.92
                   CS2AME        91.37  71.54  71.21  70.79

5 Conclusion

We explore the problem of class imbalanced distribution and adversarial robustness in SAR recognition and propose a multi-expert collaborative diagnosis framework based on contrastive self-supervised aggregation (CS2AME). We design a multi-expert learning module for adversarial samples of different distributions and adaptively aggregate the expertise of each expert through a self-supervised aggregation strategy. The proposed CS2AME achieves significant performance improvements on MSTAR and CI-FUSAR, with the recognition rate of the key tail classes exceeding other methods and the best performance achieved under multiple attack modes.

References 1. Cai, J., Wang, Y., Hwang, J.N.: ACE: ally complementary experts for solving longtailed recognition in one-shot. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 112–121 (2021) 2. Carlini, N., Wagner, D.: Towards evaluating the robustness of neural networks. In: 2017 IEEE Symposium on Security and Privacy (SP), pp. 39–57. IEEE (2017) 3. Chawla, N.V., Bowyer, K.W., Hall, L.O., Kegelmeyer, W.P.: Smote: synthetic minority over-sampling technique. J. Artif. Intell. Res. 16, 321–357 (2002) 4. Croce, F., Hein, M.: Reliable evaluation of adversarial robustness with an ensemble of diverse parameter-free attacks. In: International Conference on Machine Learning, pp. 2206–2216. PMLR (2020) 5. He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 770–778 (2016) 6. Hou, X., Ao, W., Song, Q., Lai, J., Wang, H., Xu, F.: FUSAR-ship: building a high-resolution SAR-AIS matchup dataset of Gaofen-3 for ship detection and recognition. SCIENCE CHINA Inf. Sci. 63, 1–19 (2020) 7. Jahan, C.S., Savakis, A., Blasch, E.: SAR image classification with knowledge distillation and class balancing for long-tailed distributions. In: 2022 IEEE 14th Image, Video, and Multidimensional Signal Processing Workshop (IVMSP), pp. 1–5. IEEE (2022) 8. Jing, C., et al.: Interclass similarity transfer for imbalanced aerial scene classification. IEEE Geosci. Remote Sens. Lett. 20, 1–5 (2023)


9. Kabilan, V.M., Morris, B., Nguyen, H.P., Nguyen, A.: Vectordefense: vectorization as a defense to adversarial examples. In: Soft Computing for Biomedical Applications and Related Topics, pp. 19–35 (2021) 10. Kang, B., Li, Y., Xie, S., Yuan, Z., Feng, J.: Exploring balanced feature spaces for representation learning. In: International Conference on Learning Representations (2021) 11. Khosla, P., et al.: Supervised contrastive learning. Adv. Neural. Inf. Process. Syst. 33, 18661–18673 (2020) 12. Madry, A., Makelov, A., Schmidt, L., Tsipras, D., Vladu, A.: Towards deep learning models resistant to adversarial attacks. arXiv preprint arXiv:1706.06083 (2017) 13. Menon, A.K., Jayasumana, S., Rawat, A.S., Jain, H., Veit, A., Kumar, S.: Long-tail learning via logit adjustment. arXiv preprint arXiv:2007.07314 (2020) 14. Moreira, A., Prats-Iraola, P., Younis, M., Krieger, G., Hajnsek, I., Papathanassiou, K.P.: A tutorial on synthetic aperture radar. IEEE Geosci. Remote Sens. Mag. 1(1), 6–43 (2013) 15. Pang, T., Yang, X., Dong, Y., Xu, K., Zhu, J., Su, H.: Boosting adversarial training with hypersphere embedding. Adv. Neural. Inf. Process. Syst. 33, 7779–7792 (2020) 16. Ren, J., Yu, C., Ma, X., Zhao, H., Yi, S., et al.: Balanced meta-softmax for longtailed visual recognition. Adv. Neural. Inf. Process. Syst. 33, 4175–4186 (2020) 17. Ross, A., Doshi-Velez, F.: Improving the adversarial robustness and interpretability of deep neural networks by regularizing their input gradients. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 32 (2018) 18. Ross, T.D., Worrell, S.W., Velten, V.J., Mossing, J.C., Bryant, M.L.: Standard SAR ATR evaluation experiments using the MSTAR public release data set. In: Algorithms for Synthetic Aperture Radar Imagery V, vol. 3370, pp. 566–573. SPIE (1998) 19. Szegedy, C., et al.: Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 (2013) 20. Wang, J., Virtue, P., Yu, S.X.: Successive embedding and classification loss for aerial image classification. arXiv preprint arXiv:1712.01511 (2017) 21. Wu, T., Liu, Z., Huang, Q., Wang, Y., Lin, D.: Adversarial robustness under longtailed distribution. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 8659–8668 (2021) 22. Xia, W., Liu, Z., Li, Y.: SAR-PeGA: a generation method of adversarial examples for SAR image target recognition network. IEEE Trans. Aerosp. Electron. Syst. 59(2), 1910–1920 (2022) 23. Xu, Y., Sun, H., Chen, J., Lei, L., Ji, K., Kuang, G.: Adversarial self-supervised learning for robust SAR target recognition. Remote Sens. 13(20), 4158 (2021) 24. Yang, C.Y., Hsu, H.M., Cai, J., Hwang, J.N.: Long-tailed recognition of SAR aerial view objects by cascading and paralleling experts. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 142– 148 (2021) 25. Yosinski, J., Clune, J., Bengio, Y., Lipson, H.: How transferable are features in deep neural networks? In: Advances in Neural Information Processing Systems, vol. 27 (2014) 26. Zang, Y., Huang, C., Loy, C.C.: FASA: feature augmentation and sampling adaptation for long-tailed instance segmentation. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 3457–3466 (2021)


27. Zhang, H., Yu, Y., Jiao, J., Xing, E., El Ghaoui, L., Jordan, M.: Theoretically principled trade-off between robustness and accuracy. In: International Conference on Machine Learning, pp. 7472–7482. PMLR (2019) 28. Zhou, B., Cui, Q., Wei, X.S., Chen, Z.M.: BBN: bilateral-branch network with cumulative learning for long-tailed visual recognition. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 9719– 9728 (2020)

Recognizer Embedding Diffusion Generation for Few-Shot SAR Recognization

Ying Xu, Chuyang Lin, Yijin Zhong, Yue Huang(B), and Xinghao Ding

School of Informatics, Xiamen University, Xiamen, China
[email protected]

Abstract. Synthetic Aperture Radar (SAR) has become a research hotspot due to its ability to identify targets in all weather conditions and at all times. To achieve satisfactory recognition performance in most existing automatic target recognition (ATR) algorithms, sufficient training samples is essentially owned. However, only few data for real-world SAR applications are generally available, arriving at efficient identification with few SAR training samples remains a formidable challenge that needs to be addressed. The success of denoising diffusion probabilistic model (DDPM) has provided a new perspective for few-shot SAR ATR. Consequently, this paper proposes the Recognizer Embedding Diffusion Generation (REDG), a novel approach for few-shot SAR image generation and recognition. REDG mainly consists of a generation module and a recognition module. The former is responsible for generating additional data from a limited SAR dataset by means of DDPM, to enhance the training of the recognizer. The latter is designed specifically to optimize data generation, which further enhances the recognition performance using the generated data. Extensive experiments conducted on three different datasets have demonstrated the effectiveness of proposed REDG, which represents a significant advancement in improving the reliability of SAR recognition systems, especially in scenarios lacking sufficient data. Keywords: Synthetic Aperture Radar · Automatic Target Recognition · Few-shot Learning · Data Generation

1 Introduction

(The work was supported in part by the National Natural Science Foundation of China under Grant 82172033, U19B2031, 61971369, 52105126, 82272071, 62271430, and the Fundamental Research Funds for the Central Universities 20720230104.)

Synthetic Aperture Radar (SAR) is a crucial microwave remote sensing system that excels in target identification under diverse weather conditions. With the advancements in deep learning, SAR automatic target recognition (ATR)


has witnessed significant improvements in recent years, with various Deep Neural Networks (DNNs) achieving remarkable results [2,22]. However, deploying DNNs in real-world scenarios poses challenges due to their high data requirements, often demanding hundreds to thousands of annotated training samples per class for optimal accuracy. Obtaining a sufficient number of annotated SAR images presents difficulties in practical applications, attributed to factors such as cost, labor-intensive labeling, and challenges in collecting observational objects of interest [13]. Therefore, it is important to identify SAR targets with only a few images, which is referred to as few-shot SAR target recognition. SAR image generation offers a potential solution to address the challenge of insufficient training data. While the performance of generative models also affected by the scarcity of training data, resulting in lower quality of the generated images. For instance, in situations where only few samples are available for GAN-based model training, the generated data may not effectively capture the data distribution. Therefore, various GAN-based models [9,15,17] have been proposed for SAR generation, their application in the few-shot scenario still remains limited. Few-shot image generation for SAR is currently a sparsely researched area, and their generated data yielded limited quality. Recently, denoising diffusion probabilistic model (DDPM) [5] have gained attention due to its superior performance in various image processing tasks. DDPM is an unconditional generative model that utilizes a latent variable model to reverse a diffusion process, gradually adding Gaussian noise to perturb the data distribution to the noise distribution. It has been successfully applied to tasks such as image generation, audio processing, as well as conditional generation tasks [7,12]. However, to date, there has been no application of DDPM in the remote sensing field, specifically for generating solutions for few-shot SAR ATR. Motivated by above mentioned, this paper introduces a novel approach called Recognizer Embedding Diffusion Generation (REDG) to address the challenge of SAR ATR with extremely few training samples. REDG serves a dual purpose, Firstly, it includes a generation module based on a simplified version of DDPM, responsible for generating synthetic samples from few of available SAR images. Its aim is to augment the training data and enhance overall data diversity. Secondly, the recognition module of REDG performs SAR target recognition tasks. There introduces a feature layering and reuse algorithm in the recognition module to make better use of limited real samples. Through the integration of the generation and recognition modules, REDG is capable of generating synthetic SAR samples in a few-shot setting and leveraging them to improve SAR ATR. Overall, REDG aims to enhance the recognition capability of models by generating synthetic samples and combining them with real samples, thereby overcoming the limitations of few-shot SAR image recognition. The contributions can be summarized as follows: • A Recognizer Embedding Diffusion Generation (REDG) is proposed to address the few-shot SAR ATR. • This paper design a generation module based on simplified DDPM to generate supplementary from few SAR data for the training of the recognizer.


• In this paper, a feature layering and reuse algorithm is designed for the recognition module, which enables a better balance between real and generated data and achieves improved recognition performance. • The generation and recognition performance of proposed REDG is evaluated through extensive experiments.

2 Related Work

2.1 SAR Few-Shot Learning

SAR Few-Shot Learning methods encompass transfer learning and meta-learning approaches. Transfer learning leverages prior knowledge from a source domain to address the target domain problem. For example, Liu et al. [16] applied specialized transfer learning based on electromagnetic properties to improve performance. Tai et al. [21] introduced a novel few-shot transfer learning method using the connection-free attention module for selective feature transfer. Meta-learning accumulates experience by learning from numerous tasks in the source domain. For instance, Fu et al. [4] proposed the meta-learning framework MSAR and employed a hard task mining method to enhance effectiveness. Sun et al. [19] introduced the few-shot learning framework SCAN, which incorporates explicit supervision to learn the number and distribution of scattering points for each target type.

2.2 SAR Generative Models

Several studies have investigated SAR image generation using different techniques. Oh et al. [17] proposed PeaceGAN, a multi-task learning framework that combines pose and class information. Pu et al. [18] introduced a shuffle GAN with an autoencoder separation method to distinguish moving and stationary targets in SAR imagery. Limited research has focused on SAR generation in the few-shot scenario. Sun et al. [20] introduced an attribute-guided GAN with an enhanced episode training strategy for few-shot SAR image generation. The AGGAN’s attribute labels include category and aspect angle information, and spectral normalization was used to stabilize training and overcome few-shot learning challenges. Cao et al. [1] presented LDGANs, a novel image generation method that provides labeled samples for training recognition models. It incorporates a unique loss function based on Wasserstein distance to address the collapse mode issue and includes label information to prevent the generation of unlabeled target images.

3 Method

3.1 Overall

The overall framework of REDG is illustrated in Fig. 1, consisting of a generation module (highlighted in orange) and a recognition module (highlighted in blue).

Recognizer Embedding Diffusion Generation

421

Fig. 1. Overall of REDG (Color figure online)

The generation module generates synthetic samples from random noise, and the augmented samples are used as training data for the recognition module. To effectively utilize both real and synthetic samples in the recognition module, we design a feature layering and reuse algorithm: feature layering captures both low-level and high-level features of SAR images, while feature reuse enhances the guidance of real samples over the synthetic samples.

3.2 Generation Module

DDPM [5] is a mathematical model based on stochastic differential equations that trains a noise estimation model and iteratively generates images from random noise. Compared to GANs, the training of diffusion models is more stable and capable of generating more diverse samples [3], and it has been successfully applied to many tasks [7,12]. The diffusion model consists of two processes: the forward diffusion process and the reverse generation process. The forward diffusion process gradually adds Gaussian noise to an image until it becomes random noise, while the reverse generation process performs the denoising. Based on the sparsity of SAR image features, we simplify the diffusion process in DDPM.

Forward Diffusion Process. Given an initial data distribution x_0 ∼ q(x_0), the forward process gradually adds Gaussian noise to the data distribution. The noise is added T times, resulting in a series of noisy images x_1, ..., x_T. During the noise addition from x_{t-1} to x_t, the standard deviation of the noise is determined by β_t, which lies in the interval (0, 1), and the mean of the noise is determined by β_t and the current image data x_{t-1}. Specifically, the whole diffusion process is a Markov chain:


q(x_{1:T} \mid x_0) = \prod_{t=1}^{T} q(x_t \mid x_{t-1})

As t increases, the original x_0 gradually loses its distinctive features. Eventually, as t approaches infinity, x_t converges to an independent Gaussian distribution; after adding noise for many steps, the image becomes almost pure noise:

\bar{\alpha}_T \approx 0, \quad x_T \sim \mathcal{N}(0, I), \quad T \to \infty

In addition, x_t at any step t can be sampled directly from x_0 (with \alpha_t^2 + \beta_t^2 = 1):

x_t = \alpha_t x_{t-1} + \beta_t \varepsilon_t
    = \alpha_t (\alpha_{t-1} x_{t-2} + \beta_{t-1}\varepsilon_{t-1}) + \beta_t \varepsilon_t
    = (\alpha_t \cdots \alpha_1) x_0 + (\alpha_t \cdots \alpha_2)\beta_1 \varepsilon_1 + \cdots + \alpha_t \beta_{t-1}\varepsilon_{t-1} + \beta_t \varepsilon_t
    = (\alpha_t \cdots \alpha_1) x_0 + \sqrt{1 - (\alpha_t \cdots \alpha_1)^2}\, \bar{\varepsilon}_t
    = \bar{\alpha}_t x_0 + \sqrt{1 - \bar{\alpha}_t^2}\, \bar{\varepsilon}_t, \quad \bar{\varepsilon}_t \sim \mathcal{N}(0, I)

where \bar{\alpha}_t = \alpha_t \cdots \alpha_1 and \bar{\beta}_t = \sqrt{1 - \bar{\alpha}_t^2}. During the forward process, the data at step t are sampled, x_t is input into the model, and the loss is calculated:

L_g = L_T + L_{t-1}^{simple} + L_0
L_{t-1} = \frac{1}{2\sigma_t^2} \left\| \hat{\mu}_t(x_t, x_0) - \mu_\theta(x_t, t) \right\|^2
L_{t-1}^{simple} = \mathbb{E}_{x_0,\, \varepsilon \sim \mathcal{N}(0, I)} \left[ \left\| \varepsilon - \varepsilon_\theta(\bar{\alpha}_t x_0 + \bar{\beta}_t \varepsilon, t) \right\|^2 \right]
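To make the simplified training objective concrete, the following Python (PyTorch) sketch samples a random step t, forms x_t = \bar{\alpha}_t x_0 + \bar{\beta}_t \varepsilon in closed form, and regresses the predicted noise against \varepsilon. It only illustrates the equations above: the noise schedule, the noise_model interface, and the tensor shapes are assumptions rather than the authors' released implementation.

import torch

T = 500                                   # number of diffusion steps (the paper uses 500)
# alpha_t in (0, 1) with alpha_t^2 + beta_t^2 = 1, following the paper's parameterization
alphas = torch.linspace(0.9999, 0.98, T)  # assumed schedule, not taken from the paper
alpha_bar = torch.cumprod(alphas, dim=0)          # \bar{alpha}_t = alpha_1 * ... * alpha_t
beta_bar = torch.sqrt(1.0 - alpha_bar ** 2)       # \bar{beta}_t = sqrt(1 - \bar{alpha}_t^2)

def diffusion_loss(noise_model, x0):
    """Simplified DDPM loss: predict the noise added at a random step t."""
    b = x0.shape[0]
    t = torch.randint(0, T, (b,), device=x0.device)
    eps = torch.randn_like(x0)
    a = alpha_bar[t].to(x0.device).view(b, 1, 1, 1)
    s = beta_bar[t].to(x0.device).view(b, 1, 1, 1)
    x_t = a * x0 + s * eps                 # x_t = \bar{alpha}_t x_0 + \bar{beta}_t eps
    eps_hat = noise_model(x_t, t)          # eps_theta(x_t, t), e.g. a U-Net backbone
    return ((eps - eps_hat) ** 2).mean()   # L_{t-1}^{simple}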

Reverse Generation Process. Conversely, the reverse process continuously denoises the images starting from x_T; after step t, the denoised sample is obtained as

x_{t-1} = \frac{1}{\alpha_t}\left( x_t - \beta_t \mu_\theta(x_t, t) \right) + \sigma_t \varepsilon_t, \quad \varepsilon_t \sim \mathcal{N}(0, I)

In order to generate the desired data according to the given labels, classifier guidance is incorporated during the sampling process:

p(x_{t-1} \mid x_t, y) = \frac{p(x_{t-1} \mid x_t)\, p(y \mid x_{t-1}, x_t)}{p(y \mid x_t)}
                       = \frac{p(x_{t-1} \mid x_t)\, p(y \mid x_{t-1})}{p(y \mid x_t)}
                       = p(x_{t-1} \mid x_t)\, e^{\log p(y \mid x_{t-1}) - \log p(y \mid x_t)}
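A sketch of the reverse, classifier-guided sampling loop is given below. It follows the printed update x_{t-1} = (x_t - \beta_t \mu_\theta)/\alpha_t + \sigma_t \varepsilon_t, interpreting the network output as the predicted noise, and realizes the guidance term by shifting the denoising mean with the gradient of log p(y | x_t) — one common way to implement classifier guidance, not necessarily the exact update used by the authors. The noise_model, classifier, schedule arrays, and guidance scale are assumed for illustration.

import torch

@torch.no_grad()
def guided_sample(noise_model, classifier, y, shape, alphas, sigmas, guidance_scale=1.0):
    """Reverse generation with classifier guidance (illustrative sketch)."""
    T = len(alphas)
    betas = torch.sqrt(1.0 - alphas ** 2)        # beta_t = sqrt(1 - alpha_t^2)
    x = torch.randn(shape)                       # x_T ~ N(0, I)
    for t in reversed(range(T)):
        t_batch = torch.full((shape[0],), t, dtype=torch.long)
        eps_hat = noise_model(x, t_batch)
        # gradient of log p(y | x_t) w.r.t. x_t, computed with grad enabled
        with torch.enable_grad():
            x_in = x.detach().requires_grad_(True)
            log_prob = torch.log_softmax(classifier(x_in), dim=-1)
            selected = log_prob[range(shape[0]), y].sum()
            grad = torch.autograd.grad(selected, x_in)[0]
        mean = (x - betas[t] * eps_hat) / alphas[t]
        mean = mean + guidance_scale * (sigmas[t] ** 2) * grad   # guidance shift towards label y
        noise = torch.randn_like(x) if t > 0 else torch.zeros_like(x)
        x = mean + sigmas[t] * noise
    return x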

3.3 Recognition Module

The recognition module utilizes a simple CNN and introduces feature layering and reuse to enhance its generalization ability. Firstly, a simple three-layer CNN is used as the encoder to extract features from both real and synthetic images. After obtaining the features, we introduce a novel strategy that concatenates the first half of the real feature and the second half of the synthetic feature, forming a new feature representation. This concatenation scheme aims to enhance the discriminative capability of the model by combining complementary information. Additionally, we introduce feature layering in the recognition module, which feeds the obtained features into a linear network with multiple layers. This strategy enables the model to learn hierarchical representations of the features, capturing both low-level and high-level characteristics of the SAR images. Moreover, we leverage the reuse of features from real images: the features obtained from real images are not only used for training the recognition module but also utilized to refine the features of the synthetic samples. The main process is shown in Algorithm 1.

Algorithm 1. Recognition Module
Input: x_real, x_syn; label: y
1: f_real = Encoder(x_real)
2: f_syn = Encoder(x_syn)
   # Feature concatenation
3: f_1 = f_real[:half] ⊗ f_syn[half:]
4: f_2 = f_syn[:half] ⊗ f_real[half:]
   # Feature layering and reuse
5: F_1 = FC_1(f_1[:half])
6: F_2 = FC_2(F_1 ⊗ f_1[half:])
7: F_3 = FC_3(f_2[half:])
8: F_4 = FC_4(F_3 ⊗ f_2[:half])
9: Pred = FC(F_1 ⊗ F_2 ⊗ F_3 ⊗ F_4)
10: L_r = Loss(Pred, y)
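A PyTorch-style sketch of Algorithm 1 follows. We read ⊗ as channel-wise concatenation and [:half]/[half:] as splitting the feature vector in half; the encoder widths, the classifier head, and the loss are illustrative placeholders rather than the exact configuration used in the paper.

import torch
import torch.nn as nn

class RecognitionModule(nn.Module):
    """Sketch of the feature layering and reuse scheme in Algorithm 1."""

    def __init__(self, feat_dim=256, num_classes=10):
        super().__init__()
        # a simple three-layer CNN encoder, as described in the paper (1-channel SAR input assumed)
        self.encoder = nn.Sequential(
            nn.Conv2d(1, 32, 3, stride=2, padding=1), nn.ReLU(),
            nn.Conv2d(32, 64, 3, stride=2, padding=1), nn.ReLU(),
            nn.Conv2d(64, feat_dim, 3, stride=2, padding=1), nn.ReLU(),
            nn.AdaptiveAvgPool2d(1), nn.Flatten(),
        )
        half = feat_dim // 2
        self.fc1 = nn.Linear(half, half)
        self.fc2 = nn.Linear(2 * half, half)
        self.fc3 = nn.Linear(half, half)
        self.fc4 = nn.Linear(2 * half, half)
        self.head = nn.Linear(4 * half, num_classes)

    def forward(self, x_real, x_syn):
        f_real, f_syn = self.encoder(x_real), self.encoder(x_syn)
        half = f_real.shape[1] // 2
        # feature concatenation: mix halves of the real and synthetic features
        f1 = torch.cat([f_real[:, :half], f_syn[:, half:]], dim=1)
        f2 = torch.cat([f_syn[:, :half], f_real[:, half:]], dim=1)
        # feature layering and reuse
        F1 = self.fc1(f1[:, :half])
        F2 = self.fc2(torch.cat([F1, f1[:, half:]], dim=1))
        F3 = self.fc3(f2[:, half:])
        F4 = self.fc4(torch.cat([F3, f2[:, :half]], dim=1))
        return self.head(torch.cat([F1, F2, F3, F4], dim=1))

# training step (sketch): loss = nn.CrossEntropyLoss()(model(x_real, x_syn), y)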

4 Experiment

4.1 Dataset

The MSTAR dataset [11] is a well-known benchmark for evaluating SAR ATR algorithms. It contains a diverse collection of 0.3 m × 0.3 m SAR images representing ten different classes of ground targets. These targets include tanks, trucks, jeeps, and various military vehicles, exhibiting different orientations, articulations, and operating conditions.

The OpenSARShip dataset [8] is widely used for ship detection and classification tasks using SAR imagery. It consists of ship chips extracted from 41 Sentinel-1 SAR images, providing comprehensive coverage of environmental conditions and various ship types. The dataset includes 11,346 ship chips, representing 17 ship classes across five typical scenes.

The FuSARShip dataset [6] is a comprehensive SAR dataset specifically designed for marine target analysis. It contains 126 slices extracted from high-resolution Gaofen-3 SAR images, with a resolution of 1.124 m × 1.728 m. The dataset encompasses 15 major ship categories and includes diverse non-ship targets. It incorporates the polarimetric modes DH and DV and covers a range of marine, land, coastal, river, and island scenes.

4.2 Implementation

In the generation module, the U-Net architecture is utilized as the backbone, with the number of timesteps set to 500. The Adam optimizer is employed with a learning rate of 4e−4. In the recognition module, a simple three-layer convolutional network is used as the encoder. The network is trained for 200 epochs with a learning rate of 8e−4. To comprehensively evaluate the effectiveness of the generation and recognition modules, we conducted experiments from two aspects: similarity and recognition accuracy. In the similarity experiments, we measured image quality with three metrics: Normalized Cross Correlation (NCC), Structural Similarity Index Measure (SSIM), and Peak Signal-to-Noise Ratio (PSNR); we additionally used two distribution metrics: Earth Mover's Distance (EMD) and Maximum Mean Discrepancy (MMD). In the recognition accuracy experiments, we tested the ATR accuracy of REDG on the test set and compared it with other methods.
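For reference, a minimal NumPy sketch of some of these metrics is shown below. How the paper pairs generated and real images, the PSNR peak value, and the MMD kernel bandwidth are assumptions; SSIM and EMD are usually taken from existing libraries (e.g., scikit-image and SciPy) rather than re-implemented.

import numpy as np

def ncc(a, b):
    """Normalized cross correlation between two images."""
    a, b = a - a.mean(), b - b.mean()
    return float((a * b).sum() / (np.sqrt((a ** 2).sum() * (b ** 2).sum()) + 1e-12))

def psnr(a, b, peak=1.0):
    """Peak signal-to-noise ratio in dB (images assumed normalized to [0, peak])."""
    mse = float(((a - b) ** 2).mean())
    return 10.0 * np.log10(peak ** 2 / (mse + 1e-12))

def mmd_rbf(x, y, sigma=1.0):
    """Maximum mean discrepancy with an RBF kernel between two sample sets (n x d, m x d)."""
    def k(p, q):
        d2 = ((p[:, None, :] - q[None, :, :]) ** 2).sum(-1)
        return np.exp(-d2 / (2 * sigma ** 2))
    return float(k(x, x).mean() + k(y, y).mean() - 2 * k(x, y).mean())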

4.3 Similarity Results

The evaluation of the generated synthetic data in terms of similarity to the training and testing datasets is crucial to assess the performance of the generation module. Table 1 presents the results of experiments conducted under the 1-shot, 5-shot, and 10-shot settings. Higher values of NCC and SSIM indicate a closer resemblance between the generated and real images, while a higher PSNR value signifies better image quality and fidelity. On the other hand, lower values of EMD and MMD indicate a higher similarity between the generated image distribution and the real image distribution. Despite being trained on few data, the generated samples exhibit remarkable similarity to the entire training dataset, demonstrating the effectiveness of the generation module in capturing SAR image features. As the number of available samples increases, the generated images more closely resemble the real dataset in terms of quality and distribution. Furthermore, the generated data maintain similarity with the testing dataset, indicating the generalization capability of the generation module. These results strongly validate the effectiveness and reliability of the generation module in synthesizing SAR images that closely resemble real data. Partial generation samples are shown in Fig. 2.

Table 1. Generation Quality of REDG

Fake vs Real Training:
| Metric | MSTAR 1-shot | MSTAR 5-shot | MSTAR 10-shot | OpenSAR 1-shot | OpenSAR 5-shot | OpenSAR 10-shot | FUSAR 1-shot | FUSAR 5-shot | FUSAR 10-shot |
| NCC  | 0.86  | 0.95  | 0.95  | 0.91  | 0.89  | 0.94  | 0.44  | 0.80  | 0.80  |
| SSIM | 0.71  | 0.87  | 0.86  | 0.80  | 0.78  | 0.88  | 0.71  | 0.78  | 0.84  |
| PSNR | 22.69 | 27.60 | 28.12 | 24.69 | 24.02 | 27.52 | 30.65 | 33.13 | 34.46 |
| EMD  | 0.03  | 0.02  | 0.02  | 0.03  | 0.03  | 0.01  | 0.01  | 0.01  | 0.01  |
| MMD  | 0.42  | 0.20  | 0.11  | 0.57  | 0.53  | 0.27  | 1.46  | 1.12  | 0.69  |

Fake vs Real Test:
| Metric | MSTAR 1-shot | MSTAR 5-shot | MSTAR 10-shot | OpenSAR 1-shot | OpenSAR 5-shot | OpenSAR 10-shot | FUSAR 1-shot | FUSAR 5-shot | FUSAR 10-shot |
| NCC  | 0.83  | 0.93  | 0.94  | 0.90  | 0.88  | 0.93  | 0.34  | 0.61  | 0.63  |
| SSIM | 0.67  | 0.84  | 0.83  | 0.77  | 0.73  | 0.86  | 0.64  | 0.68  | 0.73  |
| PSNR | 21.98 | 26.89 | 27.66 | 24.01 | 22.89 | 26.63 | 28.57 | 29.76 | 30.68 |
| EMD  | 0.03  | 0.02  | 0.02  | 0.03  | 0.03  | 0.02  | 0.01  | 0.01  | 0.01  |
| MMD  | 0.46  | 0.20  | 0.10  | 0.61  | 0.55  | 0.32  | 1.55  | 1.25  | 0.84  |

Fig. 2. REDG Generation Samples

4.4 Recognition Results

To validate the usability of the generated data and the effectiveness of the recognition module, we conducted experiments to evaluate the ATR accuracy, as shown in Tables 2, 3, and 4.

Table 2. Accuracy of REDG on MSTAR SOC

| Method | 10way 1-shot | 10way 5-shot | 10way 10-shot | 5way 1-shot | 5way 5-shot | 5way 10-shot |
| Scratch-Aconv [10] | 20.90 | 47.40 | 65.00 | 31.60 | 55.90 | 72.30 |
| SimCLROE [10] | 28.80 | 60.50 | 75.60 | 39.90 | 69.70 | 83.30 |
| DN4 [14] | 33.25 | 53.48 | 64.88 | 47.71 | 62.78 | 70.64 |
| DeepEMD [23] | 36.19 | 53.14 | 59.64 | 40.99 | 58.04 | 62.72 |
| PPSML [24] | 36.56 | 59.56 | 73.36 | 38.43 | 77.89 | 83.57 |
| REDG (Ours) | 38.79 | 62.66 | 88.22 | 45.73 | 79.45 | 89.97 |

Table 2 presents the recognition accuracy on the MSTAR Standard Operating Conditions (SOC) dataset. From the results, it is evident that REDG achieves impressive recognition accuracy even with a simple CNN encoder. This demonstrates the effectiveness of the generated data from the generation module in complementing the training set, and the efficient utilization of the augmented data by the recognition module during training.

Table 3. Accuracy of REDG on OpenSARShip

| Method | 10way 1-shot | 10way 5-shot | 10way 10-shot | 5way 1-shot | 5way 5-shot | 5way 10-shot |
| DN4 | 17.45 | 35.43 | 57.58 | 27.93 | 45.22 | 67.26 |
| DeepEMD | 18.53 | 34.08 | 55.00 | 29.81 | 47.60 | 66.33 |
| PPSML | 20.33 | 38.11 | 59.19 | 29.97 | 50.06 | 69.36 |
| REDG (Ours) | 58.55 | 67.78 | 72.65 | 64.94 | 72.50 | 78.65 |

The limited research conducted on the OpenSARShip and FUSARShip datasets, which are more complex and representative of real-world applications, caught our attention. To address this gap, extensive experiments were conducted on these datasets, and the results are presented in Tables 3 and 4. These results serve as a valuable reference for future research, demonstrating the effectiveness of the approach even in challenging and realistic scenarios. Overall, the experimental results show that REDG performs well on the simpler dataset and gains a clear advantage when dealing with more complex datasets. This confirms the usefulness of the generated data and the ability of the recognition module to enhance performance through the utilization of augmented data. It also suggests that the proposed method is well suited to complex real-world applications.

Table 4. Accuracy of REDG on FuSARShip

| Method | 10way 1-shot | 10way 5-shot | 10way 10-shot | 5way 1-shot | 5way 5-shot | 5way 10-shot |
| DN4 | 29.29 | 52.80 | 75.82 | 37.21 | 63.52 | 83.15 |
| DeepEMD | 35.13 | 54.32 | 74.79 | 48.21 | 61.84 | 79.85 |
| PPSML | 32.48 | 54.46 | 76.58 | 46.07 | 69.58 | 85.56 |
| REDG (Ours) | 55.03 | 60.78 | 80.23 | 55.73 | 76.34 | 86.73 |

Table 5. Ablation Results, G: Generation Module; R: Recognition Module

MSTAR SOC:
| G | R | 10way 1-shot | 10way 5-shot | 10way 10-shot | 5way 1-shot | 5way 5-shot | 5way 10-shot |
|   |   | 21.08 | 44.75 | 60.25 | 24.83 | 48.03 | 61.31 |
| + |   | 31.28 | 60.25 | 79.21 | 37.70 | 69.30 | 76.98 |
| + | + | 38.79 | 62.66 | 89.22 | 45.73 | 79.45 | 89.97 |

OpenSARShip:
| G | R | 10way 1-shot | 10way 5-shot | 10way 10-shot | 5way 1-shot | 5way 5-shot | 5way 10-shot |
|   |   | 10.31 | 11.42 | 13.67 | 10.46 | 20.65 | 53.35 |
| + |   | 35.16 | 49.36 | 59.54 | 42.99 | 51.24 | 61.55 |
| + | + | 58.55 | 67.78 | 72.65 | 64.94 | 72.50 | 78.65 |

FuSARShip:
| G | R | 10way 1-shot | 10way 5-shot | 10way 10-shot | 5way 1-shot | 5way 5-shot | 5way 10-shot |
|   |   | 10.06 | 24.86 | 26.15 | 29.77 | 37.40 | 47.33 |
| + |   | 32.06 | 51.75 | 65.43 | 45.80 | 65.57 | 76.34 |
| + | + | 55.03 | 60.78 | 80.23 | 55.73 | 76.34 | 86.73 |

4.5 Ablation Studies

To validate the effectiveness of the different modules of the REDG framework, additional experiments were conducted. A baseline CNN model, similar to the recognition module but without the feature layering and reuse algorithm, was used for comparison. The results of these experiments are summarized in Table 5, where G denotes the generation module and R denotes the recognition module. These results clearly demonstrate the effectiveness of the modules in REDG.

5 Conclusion

This paper proposes REDG for few-shot SAR image recognition. It consists of a generation module and a recognition module. The generation module is based on a simplified DDPM and is used to generate synthetic samples. The recognition module utilizes a feature layering and reuse algorithm to enhance the model's recognition ability. Experimental results demonstrate the effectiveness of REDG in addressing few-shot SAR image recognition challenges and achieving superior performance with only a simple CNN model. This work contributes to advancing SAR image recognition with limited training data, offering insights for future research and practical applications.

References
1. Cao, C., Cao, Z., Cui, Z.: LDGAN: a synthetic aperture radar image generation method for automatic target recognition. IEEE Trans. Geosci. Remote Sens. 58(5), 3495–3508 (2019)
2. Deng, J., Bi, H., Zhang, J., Liu, Z., Yu, L.: Amplitude-phase CNN-based SAR target classification via complex-valued sparse image. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 15, 5214–5221 (2022)
3. Dhariwal, P., Nichol, A.: Diffusion models beat GANs on image synthesis. Adv. Neural. Inf. Process. Syst. 34, 8780–8794 (2021)
4. Fu, K., Zhang, T., Zhang, Y., Wang, Z., Sun, X.: Few-shot SAR target classification via metalearning. IEEE Trans. Geosci. Remote Sens. 60, 1–14 (2021)
5. Ho, J., Jain, A., Abbeel, P.: Denoising diffusion probabilistic models. Adv. Neural. Inf. Process. Syst. 33, 6840–6851 (2020)
6. Hou, X., Ao, W., Song, Q., Lai, J., Wang, H., Xu, F.: FUSAR-ship: building a high-resolution SAR-AIS matchup dataset of Gaofen-3 for ship detection and recognition. SCIENCE CHINA Inf. Sci. 63, 1–19 (2020)
7. Hu, M., Wang, Y., Cham, T.J., Yang, J., Suganthan, P.N.: Global context with discrete diffusion in vector quantised modelling for image generation. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 11502–11511 (2022)
8. Huang, L., et al.: OpenSARShip: a dataset dedicated to Sentinel-1 ship interpretation. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 11(1), 195–208 (2017)
9. Huang, Y., Mei, W., Liu, S., Li, T.: Asymmetric training of generative adversarial network for high fidelity SAR image generation. In: IGARSS 2022-2022 IEEE International Geoscience and Remote Sensing Symposium, pp. 1576–1579. IEEE (2022)
10. Inkawhich, N.: A global model approach to robust few-shot SAR automatic target recognition. IEEE Geosci. Remote Sens. Lett. (2023)
11. Keydel, E.R., Lee, S.W., Moore, J.T.: MSTAR extended operating conditions: a tutorial. In: Algorithms for Synthetic Aperture Radar Imagery III, vol. 2757, pp. 228–242 (1996)
12. Leng, Y., et al.: Binauralgrad: a two-stage conditional diffusion probabilistic model for binaural audio synthesis. Adv. Neural. Inf. Process. Syst. 35, 23689–23700 (2022)
13. Li, H., Wang, T., Wang, S.: Few-shot SAR target classification combining both spatial and frequency information. In: GLOBECOM 2022-2022 IEEE Global Communications Conference, pp. 480–485. IEEE (2022)
14. Li, W., Wang, L., Xu, J., Huo, J., Gao, Y., Luo, J.: Revisiting local descriptor based image-to-class measure for few-shot learning. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 7260–7268 (2019)
15. Li, X., Zhang, R., Wang, Q., Duan, X., Sun, Y., Wang, J.: SAR-CGAN: improved generative adversarial network for EIT reconstruction of lung diseases. Biomed. Signal Process. Control 81, 104421 (2023)
16. Liu, J., Xing, M., Yu, H., Sun, G.: EFTL: complex convolutional networks with electromagnetic feature transfer learning for SAR target recognition. IEEE Trans. Geosci. Remote Sens. 60, 1–11 (2021)
17. Oh, J., Kim, M.: PeaceGAN: a GAN-based multi-task learning method for SAR target image generation with a pose estimator and an auxiliary classifier. Remote Sens. 13(19), 3939 (2021)
18. Pu, W.: Shuffle GAN with autoencoder: a deep learning approach to separate moving and stationary targets in SAR imagery. IEEE Trans. Neural Netw. Learn. Syst. 33(9), 4770–4784 (2021)
19. Sun, X., Lv, Y., Wang, Z., Fu, K.: SCAN: scattering characteristics analysis network for few-shot aircraft classification in high-resolution SAR images. IEEE Trans. Geosci. Remote Sens. 60, 1–17 (2022)
20. Sun, Y., et al.: Attribute-guided generative adversarial network with improved episode training strategy for few-shot SAR image generation. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 16, 1785–1801 (2023)
21. Tai, Y., Tan, Y., Xiong, S., Sun, Z., Tian, J.: Few-shot transfer learning for SAR image classification without extra SAR samples. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 15, 2240–2253 (2022)
22. Wang, D., Song, Y., Huang, J., An, D., Chen, L.: SAR target classification based on multiscale attention super-class network. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 15, 9004–9019 (2022)
23. Zhang, C., Cai, Y., Lin, G., Shen, C.: DeepEMD: few-shot image classification with differentiable earth mover's distance and structured classifiers. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 12203–12213 (2020)
24. Zhu, P., Gu, M., Li, W., Zhang, C., Hu, Q.: Progressive point to set metric learning for semi-supervised few-shot classification. In: 2020 IEEE International Conference on Image Processing (ICIP), pp. 196–200. IEEE (2020)

A Two-Stage Federated Learning Framework for Class Imbalance in Aerial Scene Classification

Zhengpeng Lv, Yihong Zhuang, Gang Yang, Yue Huang(B), and Xinghao Ding

School of Informatics, Xiamen University, Xiamen, China
[email protected]

Abstract. Centralized aerial imagery analysis techniques face two challenges. The first is the data silos problem, where data are held separately by different organizations. The second is class imbalance in the overall distribution of aerial scene data, due to the various collection procedures across organizations. Federated learning (FL) is a method that allows multiple organizations to learn collaboratively from their local data without sharing it, which preserves users' privacy and tackles the data silos problem. However, traditional FL methods assume that the datasets are globally balanced, which is not realistic for aerial imagery applications. In this paper, we propose a Two-Stage FL framework (TS-FL), which mitigates the effect of class imbalance in aerial scene classification under FL. In particular, the framework introduces a feature representation method that combines supervised contrastive learning with knowledge distillation to enhance the model's feature representation ability and minimize client drift. Experiments on two public aerial datasets demonstrate that the proposed method outperforms other FL methods and possesses good generalization ability.

Keywords: Aerial scene analysis · Federated learning · Class imbalance · Two-stage framework · Remote sensing

1 Introduction

The success of deep learning in the field of computer vision has greatly contributed to the advancement of aerial imagery analysis, which cannot be achieved without the driving force of large-scale, diverse, and high-quality data. However, in practical application scenarios, aerial scene data are usually collected by remote sensing organizations in various industries. Each organization utilizes its own devices to collect data within the industry, which leads to data silos. The limited quantity and diversity of aerial images in each data silo present challenges in effectively applying deep learning methods. There are some attempts, such as semi-supervised learning [15,18] and few-shot learning [7], that use limited supervision to exploit the underlying patterns in the dataset; however, these methods rely on a data center for training. It is difficult for some organizations to communicate and collaborate directly due to privacy and security concerns [1] about data.

(The work was supported in part by the National Natural Science Foundation of China under Grant 82172033, U19B2031, 61971369, 52105126, 82272071, 62271430, and the Fundamental Research Funds for the Central Universities 20720230104.)

Federated learning (FL) [14] can train a shared prediction model using data from multiple devices while protecting data privacy, which can effectively solve the collaboration problem among remote sensing organizations. Despite these benefits, current FL algorithms still face some challenges. Firstly, traditional FL methods only focus on distributed model training under globally balanced class distributions. This leads to the FL model being biased towards certain samples, making accurate identification more challenging. However, current imbalance-oriented FL methods [17,20] cannot yield satisfactory performance on aerial scene data. Secondly, during the collaboration of organizations, the distributions of aerial scene data may differ, resulting in statistical heterogeneity of non-independent and identically distributed (Non-IID) data across organizations. This poses difficulties for FL in ensuring convergence and effectiveness, particularly in complex real-world scenarios. From the neural network perspective, deep features are easily biased by the commonly used cross-entropy loss function and the labels, and this bias is propagated through the entire model via gradients. Consequently, the bias of local model training on class-imbalanced and Non-IID data will seriously affect the classifier and further affect the feature representation model. Thus, FL methods are highly susceptible to the effects of class imbalance and require targeted solutions.

To this end, we propose a two-stage federated learning framework for class-imbalanced aerial scene classification. This framework decouples representation learning and classifier learning, mitigating the client drift and model bias caused by Non-IID and class-imbalanced distributions. To further address the impact of class imbalance on the representation model, we propose a feature representation method based on federated knowledge distillation. By utilizing global data information, it learns an excellent feature representation model and effectively assists the training of the federated classifier, mitigating the impact of class imbalance on feature representation. The contributions of this work can be summarized as follows:

• We propose a two-stage FL framework for the aerial scene classification task with class-imbalanced distributions. This framework decouples representation learning and classifier learning to mitigate the model bias caused by Non-IID and class-imbalanced distributions during feature representation updates.
• To address the performance loss caused by the traditional cross-entropy loss under imbalanced distributions, we propose a federated feature representation method based on knowledge distillation, which utilizes supervised contrastive learning during the client training process.
• To tackle the client drift problem, we introduce a global representation model and apply global consistent distillation to the client representation training process.

2 Related Work

2.1 Class Imbalance Aerial Image Classification

In practical applications, aerial scene images are commonly imbalanced. To address this issue, some methods try to convert an imbalanced distribution into a balanced one through rebalancing strategies, including oversampling the minority classes [13,22] or reducing the weight of the majority classes and increasing the weight of the minority classes in the loss function [4,5]. However, these strategies can lead to information loss that impairs the learning of feature representations. Recent studies have shown that decoupling representation learning and classifier learning [8,23] can lead to less biased representations, effectively alleviating the impact of class imbalance. However, we believe that further improvements are needed for task-specific federated representation learning.

2.2 Federated Learning

Federated learning provides a potential solution to the problems of privacy leaks and isolated data islands in aerial imagery analysis. The FedAvg algorithm [14], which serves as the basic framework of FL, cannot handle heterogeneity well. Some FL solutions address data heterogeneity through optimization strategies [12] or server-side mechanisms [2]; however, they perform poorly on imbalanced data. Currently, there are some FL methods [17,20] for the class imbalance problem. Wang et al. [20] use balanced auxiliary data on the server to estimate the global class distribution for better local optimization; however, retrieving data from the server could lead to privacy issues. The current methods demonstrate limited effectiveness on imbalanced data and need further exploration.

3 Method

In this section, we first describe the overall framework of the two-stage FL, which combines federated representation learning and federated classifier learning, and then detail each stage of the two-stage FL process.

3.1 A Two-Stage Federated Learning Framework (TS-FL)

Our framework mainly focuses on addressing FL problems under class imbalance and Non-IID settings, and includes a cross-client federated representation learning stage and a federated classifier learning stage based on the global representation model. As shown in Fig. 1, the left side depicts the representation learning stage, where the blue, yellow, and green backgrounds represent different local representation models and the gray one represents the global representation model. In each communication round t, each client trains its local representation model by minimizing the objective function L_n and uploads it to the server after local training. At the server, these local representation models are aggregated into a new global representation model f^t. The optimal global representation model is finally obtained through multiple rounds of communication. On the right side is the federated classifier learning stage. In this stage, clients create local classifier models using the global representation model and update the global classifier model using the FedAvg [14] method.

Fig. 1. The overall framework of TS-FL. (Color figure online)

3.2 Feature Representation Method Based on Federated Knowledge Distillation

In TS-FL, the performance of the global representation model learned in the first stage directly affects the overall effectiveness, and traditional methods using the cross-entropy loss function are vulnerable to class imbalance. For this reason, we utilize a contrastive learning loss function that emphasizes inter-class and intra-class distances to reduce the impact of class imbalance. In addition, the Non-IID problem of FL can also lead to client bias during local training, which affects convergence. To address this issue, we use the idea of knowledge distillation and propose global consistency distillation, which applies the knowledge of the global representation model to local model training by aligning the feature spaces. The two parts are explained in detail below.


Fig. 2. Feature representation method based on federated knowledge distillation.

Local Supervised Contrastive Learning Loss. We use supervised contrastive learning [10] as the loss function to learn meaningful features from local data by implicitly introducing the labels. As shown in Fig. 2, in contrastive representation learning, a local projection head proj_n and a local representation model f_n are introduced to form the model F_n. The output feature of the representation model for a sample x is projected onto a hypersphere, and its projection vector is represented as z = F_n(x) = proj_n(f_n(x)). The local representation model is learned by measuring distances in the projection space. The projection head is defined as two non-linear layers and is removed after representation learning. F represents the global counterpart of the local model F_n. For client p_n, the training set (x, y) ∈ D_n has a sample size of d, and 2d augmented images are randomly generated from it. The local supervised contrastive learning loss is:

L_n^r = \sum_{i \in I} \frac{-1}{|B(i)|} \sum_{b \in B(i)} \log \frac{\exp(z_i \cdot z_b / \tau)}{\sum_{k=1, k \neq i}^{2d} \exp(z_i \cdot z_k / \tau)}    (1)

The set I ≡ {1, ..., 2d} is the index set of the randomly augmented samples, and B(i) ≡ {b ∈ I | y_b = y_i, b ≠ i} is the index set of all positive samples relative to x_i, where |B(i)| is the number of indices in the set. z is the normalized projection vector, z_i · z_b denotes the inner product of z_i and z_b, and τ is the temperature coefficient.
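A minimal PyTorch sketch of Eq. (1) is given below. It assumes the caller has already built the batch of 2d augmented views, their labels, and the projections z; the numerical-stability details are our own additions.

import torch
import torch.nn.functional as F

def supcon_loss(z, labels, tau=0.1):
    """Supervised contrastive loss of Eq. (1) over 2d augmented, L2-normalized projections z."""
    z = F.normalize(z, dim=1)
    sim = z @ z.t() / tau                                   # pairwise similarities z_i . z_k / tau
    n = z.shape[0]
    self_mask = torch.eye(n, dtype=torch.bool, device=z.device)
    sim = sim.masked_fill(self_mask, float('-inf'))         # exclude k = i from the denominator
    log_prob = sim - torch.logsumexp(sim, dim=1, keepdim=True)
    log_prob = log_prob.masked_fill(self_mask, 0.0)         # avoid -inf * 0 on the diagonal
    pos_mask = (labels.unsqueeze(0) == labels.unsqueeze(1)) & ~self_mask
    pos_count = pos_mask.sum(dim=1).clamp(min=1)
    loss_i = -(log_prob * pos_mask.float()).sum(dim=1) / pos_count   # -1/|B(i)| sum_b log(...)
    return loss_i.sum()                                     # sum over i in I, as in Eq. (1)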

Global Consistent Distillation. Considering the impact of Non-IID data, the inconsistent data distributions among clients may bias the client models. In this case, the local representation model f_n may be suboptimal, because local updates only minimize the loss function on the local dataset. The global representation model has powerful global data representation capabilities, which can be used to perform consistency distillation on the local training process, aligning the feature spaces and alleviating the client drift problem. Specifically, a consistency regularization loss reduces the distance between the projection vectors of the local representation model and the global representation model, as shown in Fig. 2. The loss can be described as

L_n^{dis} = \left\| z_i^{global} - z_i \right\|_2    (2)

where z_i^{global} = F(x_i) = proj(f(x_i)) and \|\cdot\|_2 denotes the L2 loss. Therefore, the overall loss function for feature representation based on federated knowledge distillation, which includes the local supervised contrastive learning loss and the global consistency distillation loss, can be described as

L_n = L_n^r + \lambda L_n^{dis}    (3)

where λ is the weighting coefficient that balances the two terms. We use FedAvg [14] to obtain the global representation model during aggregation.
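Putting Eqs. (1)–(3) together, a client's local representation objective can be sketched as follows, reusing the supcon_loss sketch above. Treating Eq. (2) as a per-sample L2 distance averaged over the batch and freezing the global model during local updates are our assumptions for illustration.

import torch
import torch.nn.functional as F

def local_representation_loss(local_model, global_model, x_aug, labels, lam=1.0, tau=0.1):
    """Local objective of Eq. (3): supervised contrastive loss plus global consistent distillation."""
    z_local = F.normalize(local_model(x_aug), dim=1)         # z = proj_n(f_n(x))
    with torch.no_grad():
        z_global = F.normalize(global_model(x_aug), dim=1)   # z^global = proj(f(x)), kept fixed
    l_sup = supcon_loss(z_local, labels, tau)                # Eq. (1), sketch defined earlier
    l_dis = (z_local - z_global).pow(2).sum(dim=1).sqrt().mean()   # Eq. (2), L2 distance per sample
    return l_sup + lam * l_dis                               # Eq. (3)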

3.3 Federated Classifier Learning

After the federated representation learning stage, the learned representation model can be applied to our classification task. In order to evaluate the performance of the federated representation model, we use FedAvg [14] to train the federated classifier, and the empirical loss of client P_n is defined as

L_n^{cls} = - \sum_{i=1}^{|D_n|} p_i \log q_i    (4)

where p_i represents the true label of the sample in one-hot encoding form, and q_i represents the prediction vector of the sample after passing through the representation model and the local classifier, normalized by softmax.
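The classifier stage follows the standard FedAvg recipe: each client minimizes the cross-entropy of Eq. (4) on top of the representation model, and the server averages the resulting parameters weighted by local dataset size. A minimal sketch of the server-side aggregation is shown below; the exact weighting and state handling in FedML may differ.

import copy
import torch

def fedavg_aggregate(client_states, client_sizes):
    """FedAvg [14]: weighted average of client model parameters by local dataset size."""
    total = float(sum(client_sizes))
    avg = copy.deepcopy(client_states[0])
    for key in avg.keys():
        avg[key] = sum(st[key].float() * (n / total) for st, n in zip(client_states, client_sizes))
    return avg

# usage sketch: after each communication round,
# global_state = fedavg_aggregate([m.state_dict() for m in client_models], sizes)
# global_model.load_state_dict(global_state)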

4 Experiment

4.1 Dataset

In this work, we use NWPU-RESISC45 [3] and AID [21] as the datasets to evaluate the proposed method. The NWPU-RESISC45 dataset contains 31,500 remote sensing images in 45 classes, each class having 700 images; the original image size is 256 × 256. AID contains 10,000 remote sensing images in 30 classes, with 300 to 400 aerial images per category; the original image size is 600 × 600. Figure 3 illustrates some example images of the two datasets.


Fig. 3. Scene images of NWPU-RESISC45 [3] and AID [21] datasets.

NWPU-IM10 and NWPU-IM50. We define the first 30 classes of the NWPU-RESISC45 dataset as majority classes and the last 15 classes as minority classes. For the minority classes, the NWPU-IM10 imbalanced dataset is constructed by taking 1/10 of each class's samples, and the NWPU-IM50 dataset by taking 1/50 of each class's samples.

AID-IM10 and AID-IM50. For the AID dataset, the first 20 classes are defined as majority classes and the last 10 classes as minority classes. The AID-IM10 and AID-IM50 datasets are constructed in the same way as for NWPU-RESISC45.

Dataset Non-IID Partition. The training and testing sets are partitioned according to the original number of samples for each class, taking 80% of each class's samples as the training set and the remaining 20% as the testing set, which is roughly balanced in terms of class distribution. We adopt the Non-IID sampling method widely used in previous FL work to sample class-imbalanced data [19]. The Dirichlet distribution is used to generate Non-IID datasets among different clients. Specifically, samples belonging to class k are sampled from p_k ∼ Dir_N(β) and distributed to client P_n based on the proportion p_{k,n}, where Dir(β) denotes the Dirichlet distribution with parameter β; β = 100 approximates IID sampling, and as β decreases, the degree of Non-IID sampling increases. The results of dividing NWPU-IM10 with β = {100, 1, 0.5} are shown in Fig. 4.
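A small NumPy sketch of this Dirichlet-based Non-IID partition is given below; the random seed handling and the exact splitting of leftover samples are implementation details we assume rather than take from the paper.

import numpy as np

def dirichlet_partition(labels, num_clients, beta, seed=0):
    """Split sample indices across clients with a class-wise Dirichlet distribution Dir(beta)."""
    rng = np.random.default_rng(seed)
    labels = np.asarray(labels)
    client_indices = [[] for _ in range(num_clients)]
    for k in np.unique(labels):
        idx_k = rng.permutation(np.where(labels == k)[0])
        p_k = rng.dirichlet(np.full(num_clients, beta))      # p_k ~ Dir_N(beta)
        cuts = (np.cumsum(p_k)[:-1] * len(idx_k)).astype(int)
        for client, part in enumerate(np.split(idx_k, cuts)):
            client_indices[client].extend(part.tolist())
    return client_indices

# beta = 100 approximates IID; smaller beta (e.g. 1 or 0.5) gives a stronger Non-IID skew.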

4.2 Implementation Details

The proposed method is implemented on PyTorch and FedML [6]. Each dataset corresponds to three Non-IID settings of β = {100, 1, 0.5}. We crop the images of all datasets into 224 × 224 patches as the model input, and then apply regular data augmentation to the input patches, such as random crop, random flip, and image normalization.


Fig. 4. Non-IID partition of NWPU-IM10

In FL training, we use ResNet34 as the backbone network and define the projection head as a 2-layer fully connected network with ReLU as the activation function. The local training rounds are uniformly set to 10, and the total number of communication rounds is set to 100. The client participation rate in each round is set to 60%. The temperature coefficient τ for supervised contrastive learning is set to 0.1, the batch size is 128, and the optimizer is Adam with a learning rate of 0.0001 and a weight decay of 0.0001. The number of communication rounds for the representation learning stage is set to 80. In the federated classifier training stage, we use FedAvg [14] to evaluate the performance of TS-FL; the representation model is also fine-tuned during classifier learning, with a batch size of 256 and the same optimizer settings as in the federated feature learning stage, and the number of communication rounds is 20. For all other comparative methods, the number of communication rounds is set to 100, and all other settings are kept consistent with the federated classifier training setup.

4.3 Experimental Results

We compare our proposed TS-FL with several classic FL methods, including FedAvg [14], FedProx [12], FedNova [19], SCAFFOLD [9], FedOpt [16], and MOON [11], as well as class imbalance FL methods including Fed-Focal [17], FedAvg with τ-norm [8], and Ratio Loss [20]. The results are reported in Table 1 and Table 2. As seen, FedNova [19] utilizes the gradient information of the client training process and MOON [11] performs contrastive learning between the global and local models; both achieve good performance among the traditional FL methods. Ratio Loss [20], which requires auxiliary data on the server, achieves performance comparable to our method in some settings, but raises privacy concerns. Our method achieves the best accuracy in almost all settings. When facing Non-IID data, our method maintains high accuracy, which also suggests that it has good robustness.

Table 1. Top-1 accuracy rates (%) on the NWPU-IM10 and NWPU-IM50.

| Family | Method | NWPU-IM10 β=100 | β=1 | β=0.5 | NWPU-IM50 β=100 | β=1 | β=0.5 |
| Traditional FL methods | FedAvg | 54.63 | 57.57 | 50.24 | 48.86 | 46.38 | 45.56 |
| | FedProx | 59.98 | 56.95 | 52.21 | 53.27 | 52.44 | 49.63 |
| | FedNova | 65.78 | 60.83 | 55.67 | 57.95 | 55.84 | 55.57 |
| | SCAFFOLD | 64.49 | 59.92 | 50.46 | 55.59 | 51.92 | 49.75 |
| | FedOpt | 54.97 | 52.97 | 45.92 | 50.30 | 46.38 | 45.63 |
| | MOON | 68.33 | 63.89 | 53.05 | 59.29 | 56.52 | 53.43 |
| Class imbalance FL methods | Fed-Focal | 53.83 | 51.92 | 46.86 | 48.89 | 46.44 | 45.79 |
| | FedAvg with τ-norm | 54.29 | 51.87 | 45.68 | 48.84 | 46.68 | 44.71 |
| | Ratio Loss | 72.59 | 69.05 | 66.11 | 63.08 | 62.08 | 59.29 |
| | TS-FL (Ours) | 81.19 | 75.05 | 61.51 | 67.46 | 62.41 | 60.41 |

Table 2. Top-1 accuracy rates (%) on the AID-IM10 and AID-IM50.

| Family | Method | AID-IM10 β=100 | β=1 | β=0.5 | AID-IM50 β=100 | β=1 | β=0.5 |
| Traditional FL methods | FedAvg | 56.90 | 52.15 | 50.10 | 52.55 | 48.60 | 47.60 |
| | FedProx | 61.15 | 58.00 | 54.50 | 55.75 | 53.15 | 51.30 |
| | FedNova | 66.10 | 65.65 | 64.65 | 58.35 | 57.60 | 56.00 |
| | SCAFFOLD | 64.55 | 59.55 | 52.65 | 54.90 | 53.95 | 50.20 |
| | FedOpt | 56.15 | 50.35 | 50.15 | 52.70 | 49.35 | 48.50 |
| | MOON | 68.00 | 57.85 | 51.80 | 57.40 | 54.45 | 50.70 |
| Class imbalance FL methods | Fed-Focal | 57.60 | 51.65 | 50.60 | 52.35 | 48.80 | 48.45 |
| | FedAvg with τ-norm | 56.60 | 52.55 | 48.30 | 53.20 | 50.45 | 48.85 |
| | Ratio Loss | 71.00 | 68.29 | 64.50 | 61.85 | 59.20 | 55.95 |
| | TS-FL (Ours) | 78.20 | 72.35 | 66.65 | 66.50 | 63.65 | 58.85 |

4.4 Feature Visualization

To further illustrate the strengths and weaknesses of the representation models, we utilize t-SNE to visualize the features learned by the global models. Figure 5 displays the visualizations of the feature maps on the server under β = 1 of AID-IM10. Compared with other methods, our method demonstrates a more compact representation within each class under the global representation model, as well as a greater differentiation between classes. Other methods suffer from data bias, and the minority classes are scattered and embedded in the feature space, which is also the reason for the poor recognition accuracy of these methods on the minority classes.


Fig. 5. Visualization of feature maps of different methods on the server. The large clusters belong to the majority classes, and the point-like small clusters belong to the minority classes.

4.5 Ablation Studies

Ablation Studies on Global Consistent Distillation. We conducted ablation experiments on the global consistency term under β = 1 on NWPU-IM10 and AID-IM10 to verify its effectiveness in addressing client drift. As shown in Table 3, the global consistent distillation loss improves the accuracy of the two-stage federated learning framework.

Table 3. The global consistent distillation impact on results.

| | w/o Global Consistent Distillation | TS-FL |
| NWPU-IM10 | 73.43 | 75.05 |
| AID-IM10 | 71.40 | 72.35 |

Table 4. Generalization improvement of TS-FL under different methods.

| | NWPU-IM50 | AID-IM50 |
| FedAvg | 46.38 | 48.60 |
| +TS-FL (Ours) | 62.41 (+16.03) | 63.65 (+15.05) |
| FedProx | 52.44 | 53.15 |
| +TS-FL (Ours) | 69.32 (+16.88) | 64.45 (+11.3) |
| FedNova | 55.84 | 57.60 |
| +TS-FL (Ours) | 70.3 (+14.46) | 65.25 (+7.65) |
| SCAFFOLD | 51.92 | 53.95 |
| +TS-FL (Ours) | 63.60 (+11.68) | 61.75 (+7.80) |
| FedOpt | 46.38 | 63.65 |
| +TS-FL (Ours) | 63.81 (+17.43) | 63.65 (+14.30) |
| MOON | 56.52 | 54.45 |
| +TS-FL (Ours) | 63.05 (+6.53) | 61.45 (+7.00) |


The Generalization of TS-FL. To verify that our framework learns a good representation model that assists federated classifier training, we compare our method with other methods on the challenging NWPU-IM50 and AID-IM50 datasets under the β = 1 setting for federated classifier training, examining the generalization of TS-FL in comparison to traditional FL methods. As shown in Table 4, our framework provides other methods with a better global representation model, resulting in a significant improvement in accuracy. This also demonstrates that traditional FL methods are biased in both the classifier and the feature representation due to class-imbalanced distributions, whereas our method mitigates this influence on the representation model more effectively.

5 Conclusion

In this study, we propose a two-stage FL framework, called TS-FL, to address the issues of class imbalance and statistical heterogeneity in aerial scene image classification under FL. The negative effects of classifier cross-entropy loss on feature representation are effectively addressed by dividing the FL process into federated representation learning and federated classifier learning. Specifically, during the federated representation learning stage, a feature representation method is introduced that leverages federated knowledge distillation and global consistency distillation loss to mitigate class imbalance and Non-IID problems. In the federated classifier learning stage, we utilize the learned feature representation model to assist in training the federated classifier. Experimental analysis on different aerial scene datasets provides further support for the significant improvement in recognition accuracy achieved by the proposed method, demonstrating its effectiveness and flexibility.

References
1. Alkhelaiwi, M., Boulila, W., Ahmad, J., Koubaa, A., Driss, M.: An efficient approach based on privacy-preserving deep learning for satellite image classification. Remote Sens. 13(11), 2221 (2021)
2. Chen, H.Y., Chao, W.L.: FedBE: making Bayesian model ensemble applicable to federated learning. arXiv preprint arXiv:2009.01974 (2020)
3. Cheng, G., Han, J., Lu, X.: Remote sensing image scene classification: benchmark and state of the art. Proc. IEEE 105(10), 1865–1883 (2017)
4. Cui, Y., Jia, M., Lin, T.Y., Song, Y., Belongie, S.: Class-balanced loss based on effective number of samples. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 9268–9277 (2019)
5. Deng, Z., Liu, H., Wang, Y., Wang, C., Yu, Z., Sun, X.: PML: progressive margin loss for long-tailed age classification. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 10503–10512 (2021)
6. He, C., et al.: FedML: a research library and benchmark for federated machine learning. arXiv preprint arXiv:2007.13518 (2020)
7. Ji, Z., Hou, L., Wang, X., Wang, G., Pang, Y.: Dual contrastive network for few-shot remote sensing image scene classification. IEEE Trans. Geosci. Remote Sens. 61, 1–12 (2023)
8. Kang, B., et al.: Decoupling representation and classifier for long-tailed recognition. arXiv preprint arXiv:1910.09217 (2019)
9. Karimireddy, S.P., Kale, S., Mohri, M., Reddi, S., Stich, S., Suresh, A.T.: SCAFFOLD: stochastic controlled averaging for federated learning. In: International Conference on Machine Learning, pp. 5132–5143. PMLR (2020)
10. Khosla, P., et al.: Supervised contrastive learning. Adv. Neural. Inf. Process. Syst. 33, 18661–18673 (2020)
11. Li, Q., He, B., Song, D.: Model-contrastive federated learning. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 10713–10722 (2021)
12. Li, T., Sahu, A.K., Zaheer, M., Sanjabi, M., Talwalkar, A., Smith, V.: Federated optimization in heterogeneous networks. Proc. Mach. Learn. Syst. 2, 429–450 (2020)
13. Li, Y., Lai, X., Wang, M., Zhang, X.: C-SASO: a clustering-based size-adaptive safer oversampling technique for imbalanced SAR ship classification. IEEE Trans. Geosci. Remote Sens. 60, 1–12 (2022)
14. McMahan, B., Moore, E., Ramage, D., Hampson, S., Arcas, B.A.: Communication-efficient learning of deep networks from decentralized data. In: Artificial Intelligence and Statistics, pp. 1273–1282. PMLR (2017)
15. Miao, W., Geng, J., Jiang, W.: Semi-supervised remote-sensing image scene classification using representation consistency siamese network. IEEE Trans. Geosci. Remote Sens. 60, 1–14 (2022)
16. Reddi, S., et al.: Adaptive federated optimization. arXiv preprint arXiv:2003.00295 (2020)
17. Sarkar, D., Narang, A., Rai, S.: Fed-focal loss for imbalanced data classification in federated learning. arXiv preprint arXiv:2011.06283 (2020)
18. Shi, J., Wu, T., Yu, H., Qin, A., Jeon, G., Lei, Y.: Multi-layer composite autoencoders for semi-supervised change detection in heterogeneous remote sensing images. SCIENCE CHINA Inf. Sci. 66(4), 140308 (2023)
19. Wang, J., Liu, Q., Liang, H., Joshi, G., Poor, H.V.: Tackling the objective inconsistency problem in heterogeneous federated optimization. Adv. Neural. Inf. Process. Syst. 33, 7611–7623 (2020)
20. Wang, L., Xu, S., Wang, X., Zhu, Q.: Addressing class imbalance in federated learning. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 35, pp. 10165–10173 (2021)
21. Xia, G.S., et al.: AID: a benchmark data set for performance evaluation of aerial scene classification. IEEE Trans. Geosci. Remote Sens. 55(7), 3965–3981 (2017)
22. Zhang, Y., Lei, Z., Yu, H., Zhuang, L.: Imbalanced high-resolution SAR ship recognition method based on a lightweight CNN. IEEE Geosci. Remote Sens. Lett. 19, 1–5 (2021)
23. Zhuang, Y., et al.: A hybrid framework based on classifier calibration for imbalanced aerial scene recognition. In: Tanveer, M., Agarwal, S., Ozawa, S., Ekbal, A., Jatowt, A. (eds.) ICONIP 2022. LNCS, pp. 110–121. Springer, Cham (2022). https://doi.org/10.1007/978-3-031-30111-7_10

SAR Image Authentic Assessment with Bayesian Deep Learning and Counterfactual Explanations

Yihan Zhuang and Zhongling Huang(B)

School of Automation, Northwestern Polytechnical University, Xi'an, China
[email protected]

Abstract. Synthetic Aperture Radar (SAR) targets vary significantly with observation angle, which makes deep neural networks over-confident and unreliable under the condition of limited measurements. This issue can be addressed by means of data augmentation, such as simulation or exemplar synthesis for unknown observations; however, the authenticity of the simulated/synthesized data is critical. In this study, we propose a Bayesian convolutional neural network (BayesCNN) for SAR target recognition to attain better generalization and assess predictive uncertainty. Based on the measured uncertainty of the BayesCNN's prediction, we further propose to obtain the counterfactual explanation of an unrealistic SAR target generated by an unreliable simulation method. The experiments preliminarily demonstrate that the proposed BayesCNN outperforms the counterpart frequentist neural network in terms of accuracy and confidence calibration when observation angles are constrained. In addition, the counterfactual explanation can reveal the non-authenticity of the augmented SAR target, which inspires us to filter high-quality data, as well as to understand and improve the fidelity of generated SAR images in future study.

Keywords: Synthetic Aperture Radar · SAR target recognition · Explainable artificial intelligence · Bayesian deep learning · Counterfactual analysis

1 Introduction

SAR automatic target recognition with artificial intelligence has attracted great attention in the past years. We notice that recent deep learning algorithms can achieve high performance on SAR target recognition [7], especially on the MSTAR dataset [8]. However, we believe the strong performance is a result of the dataset's cleanliness, as the MSTAR dataset contains SAR targets at almost full azimuth angles [8]. As is well known, the scattering properties of SAR targets vary considerably with observation angle. In practical application scenarios where the training data are limited to a narrow range of azimuth angles, the performance of a deep learning model will decline significantly, and the prediction will be less confident, particularly for test data with very different imaging conditions. Current deep learning models face the challenges of overfitting as well as over-confident prediction, which makes users unable to trust the model's decisions [11].

(This work was supported by the National Natural Science Foundation of China under Grant 62101459, and the China Postdoctoral Science Foundation under Grant BX2021248.)

To overcome the limited training data problem in SAR target recognition, many data augmentation approaches have been explored, such as image simulation using electromagnetic physical models and image generation with generative deep models (e.g., GAN, VAE, etc.) [4,9]. However, the characteristics of SAR targets are considerably different from those in optical images. For example, in optical remote sensing images, airplanes placed with different orientations can be generated with a simple rotation of the original image, whereas SAR targets illuminated from different angles present diverse scattering centers and shadows, as shown in Fig. 1. SAR image generation is challenging due to the specific electromagnetic scattering properties, and thus the authenticity of the generated SAR image should be carefully considered.

Fig. 1. The target characteristics change considerably in SAR images, in contrast to the rotation-invariant property of optical remote sensing images.

In this study, we propose to introduce Bayesian deep learning [12] and counterfactual analysis [1] to evaluate the authenticity of generated SAR images with respect to model performance. Different from current assessment approaches that compare the generated samples with the ground truth in terms of some metrics (e.g., the structural similarity index measure), our method aims to filter out pseudo data that may prevent training a reliable deep model, and to explain the details of low-quality data.

Bayesian deep learning introduces Bayes' rule into the deep neural network, assigning the model weights a prior distribution and learning the posterior distribution from the observed training data [12]. In this way, inference during the test stage is an integration of model predictions that considers all possible values of the model weights according to the posterior distribution. It has been proved that the Bayesian deep neural network (BNN) has advantages in improving the generalization ability, calibrating the confidence, and assessing the prediction uncertainty [3,6,13]. In addition, some related work applies BNNs to discover out-of-distribution data, which lead to a high degree of uncertainty in the BNN's prediction [12]. If a BNN model for SAR target recognition is trained with real SAR images, it will probably predict highly unrealistic test data incorrectly and with a large uncertainty.

Besides filtering inauthentic SAR images, it is also important to explain the inauthentic details of the pseudo sample, which would reveal the deficiency of the simulation approach and motivate future research. Counterfactual explanation for uncertainty aims to understand which input patterns are responsible for predictive uncertainty [1]. Consequently, we propose to generate the counterfactual explanation for the inauthentic SAR target, which can not only point out the inauthentic details in the image but also supplement missing information. The main contributions are listed as follows:

– We propose a novel pipeline for SAR image authentic assessment based on Bayesian deep neural networks and counterfactual analysis.
– A Bayesian convolutional neural network (BayesCNN) is constructed and optimized with Bayes by Backprop for SAR target recognition, discovering pseudo samples of low quality.
– A variational auto-encoder (VAE) is proposed to generate the counterfactual explanations, revealing the inauthentic details in the SAR image and supplementing the missing scatterings.

Section 2 introduces the proposed BayesCNN and counterfactual analysis method. The experiments and discussions are presented in Sect. 3, and Sect. 4 draws the conclusion.

2 Method

2.1 Overview

The proposed SAR image authentic assessment pipeline based on Bayesian deep neural networks and counterfactual analysis is presented in Fig. 2. Firstly, the BayesCNN is trained with real SAR targets and is able to output trustworthy predictions for authentic images, with correct labels and low uncertainty; inauthentic simulated data result in a high degree of uncertainty and may be predicted with the wrong label. Secondly, a variational auto-encoder (VAE) is trained as a generator. In the third step, the latent counterfactual z is optimized given the expected prediction of the BayesCNN, and the counterfactual explanation is then generated from the decoder of the VAE. Finally, the inauthentic details of the data can be explained.

2.2 Bayesian Convolutional Neural Network

BayesCNN Architecture. We construct a Bayesian convolutional neural network (BayesCNN) based on A-ConvNets [14], a simple all-convolutional neural network designed specifically for SAR target recognition. It comprises five convolutional layers and a Softmax activation layer at the end to obtain the prediction. The Dropout layer in A-ConvNets is removed to construct the BayesCNN. All parameters are replaced with random variables following Gaussian distributions N(θ | μ, σ²).

Fig. 2. The proposed SAR image authentic assessment pipeline based on Bayesian deep neural networks and counterfactual analysis.

Optimization with Bayes by Backprop. Backpropagation and gradient descent are widely applied for neural network training. We apply one of the most popular BayesCNN optimization approaches, Bayes by Backprop (BBB) [2], to learn the posterior distribution of the BayesCNN parameters. BBB is a variational-inference-based method for learning the posterior distribution over the weights of a neural network, from which weights ω ∼ q_θ(ω | D) can be sampled during back propagation. It regularizes the weights by maximizing the Evidence Lower Bound (ELBO). To learn the Gaussian distribution parameters that each weight follows, we need to approximate the intractable true posterior distribution p(ω | D). We use variational inference, minimizing the Kullback–Leibler divergence between the simple variational distribution q_θ(ω | D) and the true posterior p(ω | D), that is,

\arg\min_\theta \mathrm{KL}[q_\theta(\omega) \,\|\, p(\omega \mid D)] = \log p(D) + \arg\min_\theta (-\mathrm{ELBO})    (1)

It can be achieved by maximizing the ELBO as follows:

-\mathrm{ELBO} = \mathrm{KL}[q_\theta(\omega) \,\|\, p(\omega)] - \mathbb{E}_{q_\theta}[\log p(D \mid \omega)]    (2)


We use the Monte Carlo method to obtain the final objective function approximately, that is,

\arg\min_\theta \sum_{i=1}^{n} \log q_\theta(\omega^{(i)} \mid D) - \log p(\omega^{(i)}) - \log p(D \mid \omega^{(i)})    (3)

where n is the number of draws. The local reparameterization trick is applied for optimization. Following Kumar Shridhar's work [12], the reparameterization in a convolutional layer is achieved by two convolutional operations. Denote the random variable ω_i as the convolutional filters in the i-th layer, and the variational posterior distribution as q_θ(ω_i | D) = N(μ_i, α_i μ_i²). The output of the i-th layer can then be reparameterized as

A_{i+1} = A_i * \mu_i + \epsilon \odot \sqrt{A_i^2 * (\alpha_i \odot \mu_i^2)}    (4)

epistemic

where pˆt = Sof tmax(fωt (x∗ )) denotes the frequentist inference with parameter T wt sampled from the obtained posterier distribution, and p = T1 t=1 pˆt denotes the Bayesian average of predictions, and T is the sampling number. 2.3

Counterfactual Explanation Generation

Definition of Counterfactual. If the simulated SAR image x0 is inauthentic, the BayesCNN would output a high uncertainty or even the wrong label y0 . It is expected to find the counterfactuals that the BayesCNN can output accurate and trustworthy prediction. In a word, we aim to generate an xc keeping the smallest change of the original data x0 that makes the BayesCNN output the desired prediction yc .

SAR Image Authentic Assessment with BDL and CE

447

Denote the BayesCNN predictor as PBNN , the counterfactuals xc can be generated by solving the following optimization problem: xc = arg max PBNN (yc |x) − d(x, x0 ) x

s.t. y0 = yc ,

(7)

where $d(\cdot)$ denotes the $L_1$-norm distance metric. The desired output of the BNN, $P_{\mathrm{BNN}}(y_c \mid x)$, should satisfy two conditions: the expected label prediction $y_c$ and a low entropy $H(y_c \mid x)$, i.e., a small uncertainty. As a result, $P_{\mathrm{BNN}}(y_c \mid x)$ can be decomposed as

$\arg\max_x P_{\mathrm{BNN}}(y_c \mid x) = \arg\min_x \Big[-\sum_{i=0}^{c} y_c \log p(y_i \mid x) + H(y_c \mid x)\Big].$   (8)

Counterfactual Generation Based on VAE. Counterfactual Latent Uncertainty Explanations (CLUE) was the first method proposed to explain uncertainty estimates [1]. In this work, we follow the basic idea of CLUE to generate counterfactuals and the inauthentic details of simulated data based on a VAE. Since the naive optimization of Eq. (7) in the high-dimensional input space is difficult, we generate the counterfactuals in the lower-dimensional latent space of a VAE. As shown in Fig. 2, the input SAR image $x_0$ is embedded into the latent space of a random variable $z$ via the encoder $q_\phi(z \mid x)$ of a VAE. The generative output of the VAE is obtained from the decoder $p_\theta(x) = \int p_\theta(x \mid z) p(z)\,dz$, where $\phi$ and $\theta$ are the parameters of the encoder and decoder, respectively. The expectations of the latent space and the output are denoted as $\mathbb{E}_{q_\phi(z \mid x)}[z] = \mu_\phi(z \mid x)$ and $\mathbb{E}_{p_\theta(x \mid z)}[x] = \mu_\theta(x \mid z)$, respectively. The VAE is first trained with abundant data to obtain a representative encoder $q_\phi(z \mid x)$ and decoder $p_\theta(x \mid z)$ for counterfactual explanation generation. We expect the counterfactual of an input $x_0$ to be generated from the decoder, i.e., $\mu_\theta(x \mid z)$. Therefore, the latent counterfactual $z_c$ is optimized with the following objective based on Eq. (7):

$L(z) = -\sum_{i=0}^{c} y_c \log p(y_i \mid \mu_\theta(x \mid z)) + H(y_c \mid \mu_\theta(x \mid z)) + d(\mu_\theta(x \mid z), x_0).$   (9)

The latent counterfactual is then obtained as $z_c = \arg\min_z L(z)$, and the counterfactual explanation in the image domain is generated by the VAE decoder:

$x_c = \mu_\theta(x \mid z_c).$   (10)

The inauthentic details of the test SAR image are reported as the pixel-wise difference between the counterfactual explanation and the original image. For better visualization, the difference $\Delta x = x_c - x_0$ is multiplied by its absolute value $|\Delta x|$ while keeping its sign; positive and negative values are colored red and blue, respectively.
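Under the assumption of a trained VAE decoder and a Monte Carlo averaged BayesCNN predictor (the placeholder callables below), the latent counterfactual search of Eqs. (9)-(10) can be sketched as follows; the iteration count, learning rate, and λ_x follow the settings reported in Sect. 3.1, while the function names are ours.

```python
# Sketch of the latent counterfactual search of Eqs. (9)-(10).
# `encoder_mean`, `decoder`, and `bnn_predict_probs` are placeholders for
# mu_phi(z|x), mu_theta(x|z), and the Bayesian-averaged Softmax output.
import torch

def generate_counterfactual(x0, y_c, encoder_mean, decoder, bnn_predict_probs,
                            steps=150, lr=0.5, lambda_x=0.005):
    z = encoder_mean(x0).detach().clone().requires_grad_(True)   # z0 = mu_phi(z|x0)
    optimizer = torch.optim.Adam([z], lr=lr)
    for _ in range(steps):
        optimizer.zero_grad()
        x = decoder(z)                                  # mu_theta(x|z)
        probs = bnn_predict_probs(x)                    # averaged class probabilities
        nll = -torch.log(probs[:, y_c] + 1e-12).mean()  # push toward the desired label
        entropy = -(probs * torch.log(probs + 1e-12)).sum(dim=1).mean()  # predictive uncertainty
        dist = lambda_x * (x - x0).abs().sum()          # L1 distance d(x, x0)
        loss = nll + entropy + dist                     # Eq. (9)
        loss.backward()
        optimizer.step()
    x_c = decoder(z).detach()                           # Eq. (10)
    delta = (x_c - x0) * (x_c - x0).abs()               # signed, magnitude-enhanced difference
    return x_c, delta
```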

3 Experiments

3.1 Dataset and Settings

In the experiments, we use the MSTAR dataset [8] under the standard operating condition (SOC), where samples with a depression angle of 17° are used for training and those with 15° for testing. It contains ten categories, and each category has images with azimuths ranging from 0° to 360°. To evaluate the efficiency of BayesCNN under limited observation conditions, we set up three training sets, i.e., data with azimuth angles ranging from 0°–30° (trainset 0–30), 0°–90° (trainset 0–90), and 0°–180° (trainset 0–180). In the test stage, performance is also reported by azimuth, i.e., over 12 groups evenly dividing 0°–360°. We compare the proposed BayesCNN with a frequentist CNN and a Monte Carlo Dropout (MCD) approximation. MCD is a straightforward approximation of a Bayesian neural network, where dropout is enabled in both the training and testing stages to approximate the posterior distribution. The hyperparameter settings are given in Table 1.

Table 1. The hyperparameter settings of BayesCNN, CNN, and MCD training.

Epochs: 300
Batch size: 25
Start learning rate: 0.001
Dropout rate for CNN: 0.5
Dropout rate for MCD: 0.1
BNN priors: N(0, 0.1)
BNN posterior initial values: μ ∼ N(0, 0.1), ρ ∼ N(−5, 0.1), σ = log(1 + e^ρ)

The Expected Calibration Error (ECE) [5] is used to assess the reliability of model predictions; it is defined as the expected difference between confidence and accuracy. To estimate the expected accuracy from finite samples, predictions are grouped into M equally sized interval bins (each of width 1/M), and the accuracy and confidence of each bin are computed:

$\mathbb{E}_{\hat{P}}\big[|P(\hat{Y} = Y \mid \hat{P} = p) - p|\big] \approx \sum_{m=1}^{M} \frac{|B_m|}{n} |\mathrm{acc}(B_m) - \mathrm{conf}(B_m)| = \sum_{m=1}^{M} \frac{|B_m|}{n} \Big| \frac{1}{|B_m|}\sum_{i \in B_m} \mathbf{1}(\hat{y}_i = y_i) - \frac{1}{|B_m|}\sum_{i \in B_m} \hat{p}_i \Big|,$   (11)

where $\hat{y}_i$ and $y_i$ are the predicted and true class labels of sample $i$, and $\hat{p}_i$ is the confidence of sample $i$. The closer the ECE is to zero, the more reliable the model predictions. In the experiments we choose M = 10. We center-crop the 128 × 128 images to 88 × 88. The architecture of the VAE is shown in Table 2, where the latent dimension is 40, m is the batch size, and we choose m = 25.
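The ECE of Eq. (11) can be computed as in the following sketch with M equal-width confidence bins; function and variable names are ours.

```python
# Sketch of the Expected Calibration Error of Eq. (11).
import numpy as np

def expected_calibration_error(confidences, predictions, labels, n_bins=10):
    confidences = np.asarray(confidences)
    predictions = np.asarray(predictions)
    labels = np.asarray(labels)
    n = len(labels)
    bins = np.linspace(0.0, 1.0, n_bins + 1)
    ece = 0.0
    for lo, hi in zip(bins[:-1], bins[1:]):
        in_bin = (confidences > lo) & (confidences <= hi)
        if in_bin.sum() == 0:
            continue
        acc = (predictions[in_bin] == labels[in_bin]).mean()   # acc(B_m)
        conf = confidences[in_bin].mean()                      # conf(B_m)
        ece += (in_bin.sum() / n) * abs(acc - conf)
    return ece
```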


Table 2. The architecture of the VAE (m is the batch size; the latent dimension is 40).

Encoder architecture: Conv2d, Resblock×2, Conv2d, Resblock×2, Conv2d, Resblock×2, Conv2d, LeakyReLU, Flatten, BatchNorm, Linear, LeakyReLU, BatchNorm, Linear
Encoder output shapes: (m, 32, 88, 88), (m, 64, 44, 44), (m, 128, 22, 22), (m, 128, 11, 11), (m, 128×11×11), (m, 2×latent dimension), (m, 2×latent dimension), (m, latent dimension)

Decoder architecture: Linear, LeakyReLU, BatchNorm, Linear, LeakyReLU, BatchNorm, unFlatten, ConvTranspose2d, Resblock×2, ConvTranspose2d, Resblock×2, ConvTranspose2d, Resblock, Conv2d
Decoder output shapes: (m, 128×11×11), (m, 128, 11, 11), (m, 128, 22, 22), (m, 64, 44, 44), (m, 32, 88, 88), (m, 1, 88, 88)

Table 3. The recognition evaluation metrics of CNN, BayesCNN, and MCD.

Method   | trainset 0–30 OA(%) / ECE | trainset 0–90 OA(%) / ECE | trainset 0–180 OA(%) / ECE
CNN      | 26.05 / 0.65 | 40.58 / 0.52 | 64.43 / 0.31
MCD      | 28.55 / 0.59 | 41.83 / 0.51 | 65.25 / 0.30
BayesCNN | 31.63 / 0.56 | 44.13 / 0.49 | 66.41 / 0.28

In the optimization process of CLUE, the parameters of the BNN and the VAE remain unchanged and only the latent variable z is optimized; its initial value is chosen as $z_0 = \mu_\phi(z \mid x_0)$. Optimization runs for 150 iterations with a learning rate of 0.5, $d(x, x_0) = \lambda_x \|x - x_0\|_1$, and $\lambda_x = 0.005$.

3.2 Performance of BayesCNN

Table 3 and Fig. 3 present the overall performance of CNN, BayesCNN, and MCD under the three training settings. Figure 3 shows the overall accuracy and ECE of the frequentist A-ConvNets and the Bayesian A-ConvNets under different training conditions. When the training data are limited to a certain azimuth range, the models perform well on test data at the same and nearby observation angles (the test accuracy is higher and the ECE is closer to 0). If the test SAR targets are viewed from a different angle, e.g., 30° away from the known azimuths, the performance declines dramatically. This is due to the large variation of SAR targets with changing azimuth angles.


Fig. 3. The overall accuracy and the expected calibration error of frequentist A-ConvNets and Bayesian A-ConvNets under different training conditions.

CNN performs as well as BayesCNN on test data whose azimuths lie in the same range as the training data. However, for unobserved angles, BayesCNN achieves higher accuracy and a smaller ECE than CNN, which indicates that BayesCNN generalizes better and calibrates its confidence well. Table 3 shows that the overall performance of MCD is improved compared with that of CNN, and MCD can also assess the prediction uncertainty. Among the three, BayesCNN achieves the best performance, which indicates that the MCD approximation is less capable of estimating the posterior distribution of the weights than directly optimizing a Bayesian neural network. Owing to the shallow architecture of BayesCNN, Bayes by Backprop achieves a good optimization result, and its complexity is comparable with that of MCD.

3.3 Inauthentic Assessment Based on BayesCNN's Prediction

To evaluate the proposed inauthentic assessment method, we simulate SAR images by image rotation, as shown in Fig. 1. As is known, rotating a SAR image cannot simulate a real rotation of the imaging angle as it does for optical data; consequently, it may result in a high degree of inauthenticity. We take the data with azimuth angles of 0°–30°, rotate them 90° counterclockwise to simulate data with azimuth angles of 270°–300°, and mix them with the real ones. The corresponding BayesCNN is trained using real data with azimuth angles of 270°–300°. The uncertainties predicted by BayesCNN for the simulated and real SAR images, as well as for the generated counterfactual explanations $x_c$, are recorded in Table 4. The uncertainty of the simulated data is significantly higher than that of the real data, and the uncertainty of the counterfactuals is significantly decreased. Figure 4 also shows that the simulated data have higher uncertainty. In other words, BayesCNN can well evaluate the authenticity of images by predicting their uncertainty.


Table 4. Inauthentic assessment based on BayesCNN's prediction (simulated data: rotated 90°).

Category | Uncertainty of simulated data | Uncertainty of real data | Uncertainty of counterfactuals | ΔH
2S1      | 0.4382  | 0.1468  | 0.0307  | −0.4075
BMP-2    | 0.4215  | 0.2413  | 0.0625  | −0.3590
BRDM-2   | 0.3847  | 0.2316  | 0.0621  | −0.3226
BTR-60   | 0.7237  | 0.1954  | 0.0491  | −0.6746
BTR-70   | 0.4805  | 0.2599  | 0.0333  | −0.4472
D7       | 0.3765  | 0.0226  | 0.0181  | −0.3584
T-62     | 0.2895  | 0.1907  | 0.0622  | −0.2273
T-72     | 0.5625  | 0.1761  | 0.0284  | −0.5341
ZIL-131  | 0.2574  | 0.1479  | 0.0802  | −0.1772
ZSU-234  | 0.3940  | 0.1883  | 0.0217  | −0.3723
Average  | 0.43285 | 0.18006 | 0.04483 | −0.38802

Fig. 4. The uncertainty of simulated data (red frame) and real data (blue frame) in class 2S1 is ranked from greatest to smallest (Color figure online)

Different from traditional structural and pixel-level statistical similarity measures, BayesCNN evaluates the simulated data in terms of its effect on image recognition. Moreover, the counterfactuals yield more realistic images.

3.4 Counterfactual Explanation

To further analyze the validity of the counterfactual results, we compute the averaged images of the simulated data, the real data, and the counterfactual explanations within 30° azimuth ranges to demonstrate the statistical information of the data. As shown in Fig. 5 and Fig. 6, columns (a), (b), and (d) show the simulated data, the counterfactual explanations, and the real data, while columns (c) and (e) present the inauthentic details of the simulated data compared with the obtained counterfactuals and the real data, respectively. The obtained counterfactuals are close to the real images, and the inauthentic details found by our method successfully approximate reality.


Fig. 5. The counterfactual explanation discussion of the simulated data by rotating image 90◦ . (a) Simulated data (b) Counterfactual explanations (c) Inauthentic details of the simulated data compared with the obtained counterfactuals (d) Real data (e) Inauthentic details of the simulated data compared with the real data.

Fig. 6. The counterfactual explanation discussion of the simulated data by rotating image 180◦ . (a) Simulated data (b) Counterfactual explanations (c) Inauthentic details of the simulated data compared with the obtained counterfactuals (d) Real data (e) Inauthentic details of the simulated data compared with the real data.

4 Conclusion

In this paper, we propose a novel inauthentic assessment pipeline for simulated SAR images, composed of a BayesCNN and a VAE-based counterfactual explanation module. Unlike current methods, our approach assesses the simulated data from the perspective of the recognition model. The trained BayesCNN outputs an estimated uncertainty that can filter out inauthentic data with high uncertainty.


Based on this, we further propose to optimize the latent counterfactual embedding and apply the decoder of a pre-trained VAE to generate the counterfactual explanation image as well as the inauthentic details of the simulated data. The experiments demonstrate the effectiveness of the proposed BayesCNN in estimating uncertainty and improving the generalization ability for target recognition, so that inauthentic simulated data can be filtered out. Additionally, the proposed counterfactual explanation method successfully identifies inauthentic details that are consistent with the facts.

References

1. Antorán, J., Bhatt, U., Adel, T., Weller, A., Hernández-Lobato, J.M.: Getting a CLUE: a method for explaining uncertainty estimates. arXiv preprint arXiv:2006.06848 (2020)
2. Blundell, C., Cornebise, J., Kavukcuoglu, K., Wierstra, D.: Weight uncertainty in neural network. In: International Conference on Machine Learning, pp. 1613–1622. PMLR (2015)
3. Datcu, M., Huang, Z., Anghel, A., Zhao, J., Cacoveanu, R.: Explainable, physics-aware, trustworthy artificial intelligence: a paradigm shift for synthetic aperture radar. IEEE Geosci. Remote Sens. Mag. 11(1), 8–25 (2023). https://doi.org/10.1109/MGRS.2023.3237465
4. Goodfellow, I., et al.: Generative adversarial networks. Commun. ACM 63(11), 139–144 (2020)
5. Guo, C., Pleiss, G., Sun, Y., Weinberger, K.Q.: On calibration of modern neural networks. In: International Conference on Machine Learning, pp. 1321–1330. PMLR (2017)
6. Huang, Z., Liu, Y., Yao, X., Ren, J., Han, J.: Uncertainty exploration: toward explainable SAR target detection. IEEE Trans. Geosci. Remote Sens. 61, 1–14 (2023). https://doi.org/10.1109/TGRS.2023.3247898
7. Zhongling, H., Xiwen, Y.: Progress and perspective on physically explainable deep learning for synthetic aperture radar image interpretation. J. Radar 11(1), 107–125 (2021)
8. Keydel, E.R., Lee, S.W., Moore, J.T.: MSTAR extended operating conditions: a tutorial. Algorithms Synth. Aperture Radar Imagery III(2757), 228–242 (1996)
9. Kingma, D.P., Welling, M.: An introduction to variational autoencoders. Found. Trends Mach. Learn. 12(4), 307–392 (2019)
10. Kwon, Y., Won, J.H., Kim, B.J., Paik, M.C.: Uncertainty quantification using Bayesian neural networks in classification: application to biomedical image segmentation. Comput. Stat. Data Anal. 142, 106816 (2020)
11. Liu, B., Ben Ayed, I., Galdran, A., Dolz, J.: The devil is in the margin: margin-based label smoothing for network calibration. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 80–88 (2022)


12. Shridhar, K., Laumann, F., Liwicki, M.: A comprehensive guide to Bayesian convolutional neural network with variational inference. arXiv preprint arXiv:1901.02731 (2019)
13. Wilson, A.G., Izmailov, P.: Bayesian deep learning and a probabilistic perspective of generalization. Adv. Neural Inf. Process. Syst. 33, 4697–4708 (2020)
14. Xu, F., Wang, H., Jin, Y.: Deep learning as applied in SAR target recognition and terrain classification. J. Radar 6(2), 136–148 (2017)

Circle Representation Network for Specific Target Detection in Remote Sensing Images

Xiaoyu Yang1, Yun Ge1,2(B), Haokang Peng1, and Lu Leng1

1 School of Software, Nanchang Hangkong University, Nanchang 330063, China
[email protected]
2 Jiangxi Huihang Engineering Consulting Co., Ltd., Nanchang 330038, China

Abstract. Compared with natural images, remote sensing images have complex backgrounds and a wide variety of targets. Circular and square-like targets are very common in remote sensing images, and for such specific targets the traditional bounding box easily includes background information. To address this issue, we propose a Circle Representation Network (CRNet) to detect circular or square-like targets. We design a dedicated network head to regress the radius, which has fewer regression degrees of freedom, and propose the bounding circle to represent these specific targets. Compared to the bounding box, the bounding circle has natural rotational invariance, so CRNet can accurately locate an object while carrying less background information. To evaluate detection performance fairly, we further propose the circle-IOU to calculate the mAP. Experiments on the NWPU VHR-10 and RSOD datasets show that the proposed method performs excellently when detecting circular and square-like objects; the detection accuracy of storage tanks is improved from 92.1% to 94.4%. CRNet is therefore a simple and efficient detection method for circular and square-like targets.

Keywords: specific target detection · remote sensing image · bounding circles · circle-IOU

1 Introduction

Remote sensing image target detection is a technique to automatically identify and locate targets of interest in remote sensing images. It has wide applications in remote sensing image analysis, resource investigation, urban planning, and other fields. In recent years, with the development of deep learning technology and the rapid growth of remote sensing data, remote sensing image target detection has developed rapidly [1]. Simultaneously, the requirements for specific object detection have gradually increased due to the wide range of target types in remote sensing images [2]. However, it is difficult to represent a specific circular or square-like object well using only a bounding box, so a more detailed model is urgently needed. Early deep-learning methods for remote sensing object detection were usually based on regional feature extraction, such as R-CNN [3].


This method requires many repeated operations for each region, resulting in a large amount of computation and low efficiency. It has gradually been replaced by methods based on fully convolutional neural networks, such as Faster R-CNN [4]. In 2016, with the introduction of YOLO (You Only Look Once) [5], object detection models also transitioned from two-stage to one-stage. For the characteristics of remote sensing images, Deng et al. [6] proposed a classification method with a non-maximum suppression threshold to improve the detection accuracy of YOLOv4 for remote sensing images without affecting speed. Feng et al. [7] used a transformation of the CSP bottleneck to optimize feature extraction, combined YOLO heads with adaptive spatial feature fusion blocks to enhance feature fusion, and used label smoothing to reduce the classification loss. For specific target detection in remote sensing images, Liu et al. [8] proposed a new feature extractor with stronger feature extraction ability for aircraft detection, and Wang et al. [9] proposed a ship-specific detector based on the Gaussian Wasserstein distance with strong performance. These anchor-based detection methods have high model complexity and low flexibility: they must lay prior boxes over the original image, which hinders extending the detection model toward differently shaped detection boxes [10]. Therefore, the focus has shifted to anchor-free detection methods (i.e., without prior anchors), which have simpler network designs, more flexible representations, fewer hyperparameters, and even better performance [11–13]. Among the anchor-free methods, keypoint-based detectors represented by CenterNet [12] detect keypoints in the image and then predict the position and pose of the oriented target. Recently, Pan et al. [14] designed a dynamic refinement network for oriented detection of densely aligned targets based on the anchor-free CenterNet [12]. Cui et al. [15] proposed an oriented ship keypoint detection network by combining orthogonal pooling, soft directional non-maximum suppression, and CenterNet's keypoint estimation. In both anchor-based and anchor-free models for remote sensing object detection, the bounding box limits the detection performance for objects of certain specific shapes, such as storage tanks, chimneys, and baseball fields: it often brings in more background information, and the detection box of the same object changes with the viewing angle [16]. Although annotated boxes with rotation angles can address these problems, annotating datasets with rotation angles requires considerable effort [17], and the model may encounter difficult rotation-angle regression and a large number of parameters during training. Therefore, for objects with specific shapes (e.g., circular or square-like), changing the shape of the detection box can often achieve better results. Compared to the bounding box, the bounding circle performs well in detecting objects of specific shapes, as shown in Fig. 1. There are many circular and square-like objects in remote sensing images, such as storage tanks and chimneys, which are common sources of pollution.


Fig. 1. Comparison of bounding boxes and bounding circles. For targets with specific shapes, bounding circles have less background information and effectively solve the rotation variability of traditional horizontal bounding boxes.

This paper proposes an anchor-free object detection method based on circle representation for detecting such objects. The circle is introduced as the detection shape: after the model detects an object, it surrounds the object with a circle instead of a rectangular box. Unlike traditional bounding boxes, the circle-based detector only needs to regress a radius parameter to locate the object once its center point is detected. In summary, the contributions of this paper are mainly reflected in three aspects:

• We propose a circle representation method named CRNet for specific targets in remote sensing images, which achieves superior detection performance with fewer degrees of freedom; the training process is compatible with ordinary bounding-box labels.
• For circular and square-like targets in remote sensing images, CRNet achieves excellent rotational consistency while carrying less background information.
• Because of the change in shape, the traditional IOU calculation is not suitable for bounding circles. To evaluate CRNet scientifically, we propose the Circle-IOU for circle detectors.

2 Methods

2.1 Anchor-Free Structure

The design of the circle representation network follows CenterNet [12], which achieves high performance while maintaining a simple and clear structure; CRNet likewise follows the principle of a simple and efficient model. The overall architecture is shown in Fig. 2. The feature extraction network can utilize classic backbones such as ResNet50 [18] or 104-Hourglass [19].


Fig. 2. The structure of CRNet. It consists of three main components: the feature extraction network, the upsampling module, and three network heads.

The upsampling module employs bilinear interpolation. Finally, three detection heads are used to obtain the heatmap, offset, and radius. The annotation information for CRNet training differs from traditional labels. To avoid additional labeling work, CRNet uses a label transformation module to automatically convert traditional bounding box labels $(x_1, y_1, x_2, y_2, \text{class name})$ into the bounding circle labels $(x_{center}, y_{center}, \text{radius}, \text{class name})$ required for training. Subsequently, for an input image $I \in \Omega^{W \times H \times 3}$, the output takes the form of a heatmap $\tilde{H} \in [0,1]^{\frac{W}{R} \times \frac{H}{R} \times C}$, where each element represents the confidence of a center point position and its class membership, C is the number of candidate classes, and R is the downsampling rate. The ground truth is modeled in the same form, $H_{xyc} \in [0,1]^{\frac{W}{R} \times \frac{H}{R} \times C}$, using a Gaussian kernel:

$H_{xyc} = \exp\Big(-\frac{(x - \tilde{p}_x)^2 + (y - \tilde{p}_y)^2}{2\sigma_p^2}\Big),$   (1)

where $\tilde{p}_x$ and $\tilde{p}_y$ are the downsampled center point coordinates of the target and $\sigma_p$ is a standard deviation adapted to the size of the current target p.
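As a concrete illustration of the label transformation and of Eq. (1), the sketch below converts a box label to a circle label and splats a Gaussian peak into the class heatmap. The radius rule (half of the longer box side) and the function names are our assumptions; the paper does not state the exact conversion rule.

```python
import numpy as np

def box_to_circle(x1, y1, x2, y2):
    """Convert a bounding-box label to a bounding-circle label.
    Using half of the longer side as the radius is an assumption."""
    cx, cy = (x1 + x2) / 2.0, (y1 + y2) / 2.0
    radius = max(x2 - x1, y2 - y1) / 2.0
    return cx, cy, radius

def gaussian_heatmap(heatmap, center, sigma):
    """Splat one object center into the class heatmap following Eq. (1)."""
    h, w = heatmap.shape
    ys, xs = np.ogrid[:h, :w]
    g = np.exp(-((xs - center[0]) ** 2 + (ys - center[1]) ** 2) / (2 * sigma ** 2))
    np.maximum(heatmap, g, out=heatmap)   # keep the larger value where objects overlap
    return heatmap
```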

The classification loss $L_p$ is formulated as a focal loss [20] to alleviate class imbalance:

$L_p = -\frac{1}{N}\sum_{xyc} \begin{cases} (1 - \tilde{H}_{xyc})^{\alpha} \log(\tilde{H}_{xyc}), & \text{if } H_{xyc} = 1 \\ (1 - H_{xyc})^{\beta} (\tilde{H}_{xyc})^{\alpha} \log(1 - \tilde{H}_{xyc}), & \text{otherwise} \end{cases}$   (2)

where $\alpha$ and $\beta$ are adjustable hyperparameters, set to 2 and 4 in this paper, and N is the number of objects in the image.


Due to the downsampling modules, there are scale differences between the final feature map and the original image, which cause a certain deviation in the predicted center point positions. To solve this problem, the model incorporates a trainable branch for the offset of the center points, with the loss

$L_{off} = \frac{1}{N}\sum_{p} \Big| \tilde{O}_{\tilde{P}} - \big(\tfrac{P}{R} - \tilde{P}\big) \Big|,$   (3)

where $\tilde{O} \in \Omega^{\frac{W}{R} \times \frac{H}{R} \times 2}$ is the learned center point offset, P denotes the ground-truth center point, and $\tilde{P}$ is the predicted center point.
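A minimal PyTorch sketch of the CenterNet-style losses in Eqs. (2)-(3) is given below; variable names are ours and the implementation is illustrative rather than the authors' code.

```python
# Sketch of the focal loss (Eq. 2) and offset loss (Eq. 3); names are ours.
import torch

def focal_loss(pred, gt, alpha=2, beta=4):
    """pred and gt are heatmaps in [0, 1] of identical shape."""
    pos = gt.eq(1).float()
    neg = 1.0 - pos
    eps = 1e-12
    pos_loss = ((1 - pred) ** alpha) * torch.log(pred + eps) * pos
    neg_loss = ((1 - gt) ** beta) * (pred ** alpha) * torch.log(1 - pred + eps) * neg
    num_pos = pos.sum().clamp(min=1)
    return -(pos_loss.sum() + neg_loss.sum()) / num_pos

def offset_loss(pred_offset, gt_offset, mask):
    """L1 offset loss, evaluated only at object centers (mask == 1)."""
    diff = (pred_offset - gt_offset).abs() * mask
    return diff.sum() / mask.sum().clamp(min=1)
```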

2.2 Radius and Adaptive Network Heads

By using the results of the heatmap and offset, the center points of the targets can be accurately determined. Once the center points are determined, the proposed radius network head regresses the radius of each target at these center points:

$L_{radius} = \frac{1}{N}\sum_{k=1}^{N} \big| \tilde{R}_p - r \big|,$   (4)

where $\tilde{R}_p$ is the ground-truth radius of the target and r is the radius predicted by the network. Finally, the overall objective is

$L_{det} = L_p + \lambda_{radius} L_{radius} + \lambda_{off} L_{off}.$   (5)

In object detection, the shape information of objects plays a crucial role in the classification scores produced by the network head [21]. To better capture object shapes, this paper introduces an adaptive perception layer in the network head. This layer allows the receptive field to change and adapt continuously rather than being limited to a fixed region. The offsets of the special convolution kernel in the adaptive perception layer allow the classification head to adapt to the target shape. This form of convolution lets the sampling grid deform freely by adding 2D shifts to the sampling locations of the standard convolution [22]; the offsets are learned from the preceding feature map by an additional convolution layer.

2.3 Circle-IOU

The mean Average Precision (mAP) is one of the most common evaluation metrics in object detection. Its computation is based on the Intersection over Union (IOU) between predicted boxes and ground-truth (GT) boxes. For traditional detectors, the IOU is calculated between two rectangles, which is not suitable for the circular detection boxes proposed in this paper. To address this issue, we propose the Circle-IOU to compute the IOU between two circles.


The Circle-IOU is defined as

$cIOU = \frac{Area(A \cap B)}{Area(A \cup B)},$   (6)

where A and B are two circles whose centers are $(A_x, A_y)$ and $(B_x, B_y)$, respectively. The distance between the two centers is

$d = \sqrt{(B_x - A_x)^2 + (B_y - A_y)^2}.$   (7)

The intersection area of the two circles is calculated by

$Area(A \cap B) = r_A^2 \sin^{-1}\!\Big(\frac{L_y}{r_A}\Big) + r_B^2 \sin^{-1}\!\Big(\frac{L_y}{r_B}\Big) - L_y\Big(L_x + \sqrt{r_B^2 - r_A^2 + L_x^2}\Big),$   (8)

where the parameters $L_x$ and $L_y$, illustrated in Fig. 3, are defined as

$L_x = \frac{r_A^2 - r_B^2 + d^2}{2d}, \quad L_y = \sqrt{r_A^2 - L_x^2}.$   (9)

The union area of the two circles is then

$Area(A \cup B) = \pi r_A^2 + \pi r_B^2 - Area(A \cap B).$   (10)
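The Circle-IOU of Eqs. (6)-(10) can be computed as in the sketch below; the handling of disjoint and fully contained circles is our addition, since the closed form above assumes two partially overlapping circles whose common chord lies between the centers.

```python
import math

def circle_iou(ax, ay, ra, bx, by, rb):
    """Circle-IOU of Eqs. (6)-(10) for circles A=(ax, ay, ra) and B=(bx, by, rb)."""
    d = math.hypot(bx - ax, by - ay)                      # Eq. (7)
    if d >= ra + rb:                                      # disjoint circles
        return 0.0
    if d <= abs(ra - rb):                                 # one circle inside the other
        inter = math.pi * min(ra, rb) ** 2
    else:
        lx = (ra ** 2 - rb ** 2 + d ** 2) / (2 * d)       # Eq. (9)
        ly = math.sqrt(max(ra ** 2 - lx ** 2, 0.0))
        inter = (ra ** 2 * math.asin(ly / ra)             # Eq. (8)
                 + rb ** 2 * math.asin(ly / rb)
                 - ly * (lx + math.sqrt(max(rb ** 2 - ra ** 2 + lx ** 2, 0.0))))
    union = math.pi * ra ** 2 + math.pi * rb ** 2 - inter  # Eq. (10)
    return inter / union                                   # Eq. (6)
```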

Fig. 3. Two circles with intersection area

3 Experiment and Analysis

3.1 Data

We conducted experiments on multiple remote sensing datasets. The NWPU VHR-10 dataset [23] is a collection of high-resolution remote sensing images developed and released by Northwestern Polytechnical University (NWPU). It consists of 800 RGB remote sensing images, with 650 images containing objects and 150 background images.


The images have a resolution of 1 m/pixel, and each image is 650 × 650 pixels. The dataset includes 10 object categories: airplanes, ships, storage tanks, baseball fields, tennis courts, basketball courts, athletic fields, harbors, bridges, and vehicles. The RSOD dataset [24], developed and released by researchers from Wuhan University, is designed for object detection in remote sensing images and comprises four common categories.

3.2 Experimental Environment and Evaluation Indicators

The experiments were conducted on Ubuntu 18.04, using an Intel Xeon(R) Gold [email protected] GHz processor and an NVIDIA GeForce RTX 3090 graphics card with 24 GB of memory. The deep learning framework was PyTorch 3.8, accelerated with CUDA 11.2. We randomly divided NWPU VHR-10 and RSOD into training, validation, and test sets in the ratio 7:2:1. The evaluation metrics are the Average Precision (AP) for individual classes, the mean Average Precision (mAP), and Frames Per Second (FPS). AP denotes the average precision of a single class at an IOU threshold of 0.5, mAP50 denotes the average precision over all classes at an IOU threshold of 0.5, and FPS denotes the model's inference speed.

3.3 Results

To validate the effectiveness of the proposed method, four representative methods were selected for comparison: Faster R-CNN, RetinaNet, CenterNet, and CornerNet. The performance comparison on the NWPU VHR-10 and RSOD datasets is shown in Table 1. The proposed method achieves an mAP50 of 89.7% on NWPU VHR-10 and 92.8% on RSOD, improving on the other methods to varying degrees, and it also outperforms them in inference speed.

Table 1. Accuracy comparison (%) of different methods on the NWPU VHR-10 and RSOD datasets.

Method        | NWPU mAP50 | NWPU FPS | RSOD mAP50 | RSOD FPS
Faster R-CNN  | 87.6       | 47       | 92.0       | 54
CornerNet     | 88.9       | 91       | 90.7       | 92
RetinaNet     | 89.2       | 80       | 91.8       | 87
CenterNet     | 89.6       | 110      | 92.6       | 104
CRNet         | 89.7       | 119      | 92.8       | 121


To further demonstrate the performance of the proposed method on specific objects, experiments on the NWPU VHR-10 dataset report the Average Precision (AP) for individual classes: Airplane, Storage tank, Baseball diamond, Tennis court, Basketball court, and Vehicle. The results are presented in Table 2. Our circle-based detector shows a clear improvement in accuracy on specific classes. In particular, for circular objects such as storage tanks and baseball diamonds, the detection performance is significantly enhanced; for square-like objects, the accuracy improves slightly while the predicted boxes remain rotation invariant.

Table 2. Comparison of Average Precision (AP) (%) for specific classes using different methods on the NWPU VHR-10 dataset.

Class             | Faster R-CNN | RetinaNet | CornerNet | CenterNet | CRNet
Airplane          | 86.5         | 89.5      | 81.6      | 98.5      | 98.6
Storage tank      | 91.0         | 85.7      | 70.3      | 92.2      | 94.4
Baseball diamond  | 94.9         | 99.0      | 95.2      | 96.8      | 99.8
Tennis court      | 78.1         | 78.4      | 80.1      | 81.5      | 83.2
Basketball court  | 86.3         | 92.8      | 99.5      | 87.2      | 86.7
Vehicle           | 57.9         | 66.1      | 75.2      | 86.3      | 90.5
mAP50             | 83.3         | 85.2      | 83.4      | 90.2      | 91.9

Figure 4 shows the visual results of our method on the NWPU VHR-10 dataset for classes such as airplanes, vehicles, baseball diamonds, and storage tanks. The method accurately detects circular objects while capturing fewer background details. Additionally, for square-shaped objects, the bounding circle effectively mitigates the variation of horizontal bounding boxes caused by changes in object angle.

Fig. 4. Example of detection results of this method on NWPU VHR-10 dataset

4 Conclusion

Circular and square-like objects are very common in remote sensing images, e.g., storage tanks and chimneys, which are common sources of pollution. This paper therefore proposes CRNet to detect circular and square-like targets. Firstly, the proposed bounding circle efficiently locates objects by regressing their center points and radii, while possessing the rotational invariance that horizontal bounding boxes lack. Secondly, we propose the Circle-IOU to calculate the intersection over union between circular predicted boxes and the ground truth, measuring model performance. Finally, the proposed method is applicable to horizontal rectangular annotation datasets without requiring additional circular annotations before training. Experimental results show that the proposed method achieves an mAP50 of 89.7% on NWPU VHR-10 and 92.8% on RSOD. For certain categories such as storage tanks and baseball diamonds, it achieves APs of 94.5% and 99.8%, respectively, showing significant improvements over the other methods. In conclusion, CRNet achieves good performance in overall object detection as well as on specific object categories.

Acknowledgements. This work was supported by the National Natural Science Foundation of China under Grant No. 42261070 and Grant No. 41801288.

References

1. Qin, D.D., Wan, L., He, P.E., Zhang, Y., Guo, Y., Chen, J.: Multi-scale object detection in remote sensing image by combining data fusion and feature selection. Nat. Remote Sens. Bull. 26(8), 1662–1673 (2022)
2. Kathiravan, M., Reddy, N.A., Prakash, V., et al.: Ship detection from satellite images using deep learning. In: 2022 7th International Conference on Communication and Electronics Systems (ICCES), pp. 1044–1050. IEEE (2022)
3. Girshick, R., Donahue, J., Darrell, T., et al.: Rich feature hierarchies for accurate object detection and semantic segmentation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 580–587 (2014)
4. Ren, S., He, K., Girshick, R., et al.: Faster R-CNN: towards real-time object detection with region proposal networks. In: Advances in Neural Information Processing Systems, vol. 28 (2015)
5. Redmon, J., Divvala, S., Girshick, R., et al.: You only look once: unified, real-time object detection. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 779–788 (2016)
6. Zakria, Z., Deng, J., Kumar, R., et al.: Multiscale and direction target detecting in remote sensing images via modified YOLO-v4. IEEE J. Sel. Top. Appl. Earth Observations Remote Sens. 15, 1039–1048 (2022)
7. Feng, J., Yi, C.: Lightweight detection network for arbitrary-oriented vehicles in UAV imagery via global attentive relation and multi-path fusion. Drones 6(5), 108 (2022)


8. Liu, Z., Gao, Y., Du, Q., et al.: YOLO-extract: improved YOLOv5 for aircraft object detection in remote sensing images. IEEE Access 11, 1742–1751 (2023)
9. Wang, S.: Gaussian Wasserstein distance based ship target detection algorithm. In: 2023 IEEE 2nd International Conference on Electrical Engineering, Big Data and Algorithms (EEBDA), pp. 286–291. IEEE (2023)
10. Kawazoe, Y., Shimamoto, K., Yamaguchi, R., et al.: Faster R-CNN-based glomerular detection in multistained human whole slide images. J. Imaging 4(7), 91 (2018)
11. Law, H., Deng, J.: CornerNet: detecting objects as paired keypoints. In: Proceedings of the European Conference on Computer Vision (ECCV), pp. 734–750 (2018)
12. Zhou, X., Wang, D., Krähenbühl, P.: Objects as points. arXiv preprint arXiv:1904.07850 (2019)
13. Zhou, X., Zhuo, J., Krähenbühl, P.: Bottom-up object detection by grouping extreme and center points. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 850–859 (2019)
14. Cui, Z., Leng, J., Liu, Y., et al.: SKNet: detecting rotated ships as keypoints in optical remote sensing images. IEEE Trans. Geosci. Remote Sens. 59(10), 8826–8840 (2021)
15. Cui, Z., Liu, Y., Zhao, W., et al.: Learning to transfer attention in multi-level features for rotated ship detection. Neural Comput. Appl. 34(22), 19831–19844 (2022)
16. Yang, H., Deng, R., Lu, Y., et al.: CircleNet: anchor-free detection with circle representation. arXiv preprint arXiv:2006.02474 (2020)
17. Bai, M., Urtasun, R.: Deep watershed transform for instance segmentation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 5221–5229 (2017)
18. He, K., Zhang, X., Ren, S., et al.: Deep residual learning for image recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 770–778 (2016)
19. Newell, A., Yang, K., Deng, J.: Stacked hourglass networks for human pose estimation. In: Leibe, B., Matas, J., Sebe, N., Welling, M. (eds.) ECCV 2016. LNCS, vol. 9912, pp. 483–499. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-46484-8_29
20. Lin, T.Y., Goyal, P., Girshick, R., et al.: Focal loss for dense object detection. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 2980–2988 (2017)
21. Ding, J., Xue, N., Long, Y., et al.: Learning RoI transformer for oriented object detection in aerial images. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 2849–2858 (2019)
22. Dai, J., Qi, H., Xiong, Y., et al.: Deformable convolutional networks. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 764–773 (2017)
23. Cheng, G., Zhou, P., Han, J.: Learning rotation-invariant convolutional neural networks for object detection in VHR optical remote sensing images. IEEE Trans. Geosci. Remote Sens. 54(12), 7405–7415 (2016)
24. Long, Y., Gong, Y.P., Xiao, Z.F., Liu, Q.: Accurate object localization in remote sensing images based on convolutional neural networks. IEEE Trans. Geosci. Remote Sens. 55(5), 2486–2498 (2017)

A Transformer-Based Adaptive Semantic Aggregation Method for UAV Visual Geo-Localization

Shishen Li, Cuiwei Liu(B), Huaijun Qiu, and Zhaokui Li

School of Computer Science, Shenyang Aerospace University, Shenyang, China
[email protected]

Abstract. This paper addresses the task of Unmanned Aerial Vehicle (UAV) visual geo-localization, which aims to match images of the same geographic target taken by different platforms, i.e., UAVs and satellites. In general, the key to accurate UAV-satellite image matching lies in extracting visual features that are robust against viewpoint changes, scale variations, and rotations. Current works have shown that part matching is crucial for UAV visual geo-localization, since part-level representations capture image details and help to understand the semantic information of scenes. However, the importance of preserving semantic characteristics in part-level representations has not been well discussed. In this paper, we introduce a transformer-based adaptive semantic aggregation method that regards parts as the most representative semantics in an image. Correlations of image patches to different parts are learned from the transformer's feature map. Our method then decomposes part-level features into an adaptive sum of all patch features. By doing this, the learned parts are encouraged to focus on patches with typical semantics. Extensive experiments on the University-1652 dataset demonstrate the superiority of our method over current works.

Keywords: UAV visual geo-localization · transformer · part matching

1 Introduction

Unmanned Aerial Vehicle (UAV) visual geo-localization refers to cross-view image retrieval between UAV-view images and geo-tagged satellite-view images. Recently, this technology has been applied in many fields, such as precision agriculture [1], rescue systems [2], and environmental monitoring [3]. Using a UAV-view image as the query, the retrieval system searches for the most relevant satellite-view candidate to determine the geographic location of the UAV-view target.

(This work was supported in part by the National Natural Science Foundation of China (NSFC) under Grant No. 62171295, in part by the Liaoning Provincial Natural Science Foundation of China under Grant No. 2021-MS-266, in part by the Applied Basic Research Project of Liaoning Province under Grant 2023JH2/101300204, in part by the Shenyang Science and Technology Innovation Program for Young and Middle-aged Scientists under Grant No. RC210427, and in part by the High Level Talent Research Start-up Fund of Shenyang Aerospace University under Grant No. 23YB03.)


Fig. 1. An UAV-satellite image pair is shown in column(a). The square-ring partition strategy of LPN [5] is depicted in column(b). Column(c) illustrates the heat map and the part partition of image patches generated by FSRA [6]. Red ellipses mark patches that are similar in features but divided into different parts. Attention maps corresponding to two parts produced by the proposed ASA module are given in column(d).

On the other hand, if a satellite-view image is used as the query, the corresponding UAV-view images can be retrieved from the UAV flight records, enabling UAV navigation. Compared to traditional geo-localization methods that rely on GPS or radar, UAV visual geo-localization does not require the UAV to receive external radio information or emit detection signals, making it possible to achieve UAV positioning and navigation in radio silence. The key to UAV visual geo-localization is to extract discriminative features. Specifically, satellites acquire fixed-scale images from a vertical view, while UAVs take images from various distances and orientations at an oblique view, resulting in large visual and scale variations in UAV-satellite image pairs. Moreover, images of different locations share some local patterns, such as vegetation and similarly styled buildings, which also pose challenges to cross-view image retrieval. These issues make hand-crafted descriptors (e.g., SIFT [4]) perform poorly. Early UAV visual geo-localization methods employ two-branch CNN models [7,8] to achieve cross-view matching between UAV-view and satellite-view images. Benefiting from the availability of multiple UAV-view images of the same location, the models are optimized in a location classification framework to learn view-invariant yet location-dependent features. By doing this, two-branch CNN models are expected to generate similar features for satellite-view and UAV-view images taken at a new location that is unseen during training. However, the learned features focus on the entire image while neglecting fine-grained details that are crucial for distinguishing images of different locations. Upon two-branch CNN models, Local Pattern Network (LPN) [5] divides the feature map into several regions with square rings to extract part-level representations, as shown in Fig. 1(b).


Then part matching is performed to roughly align the geographic targets as well as their surroundings. Another iconic work is a two-branch transformer-based model called FSRA (Feature Segmentation and Region Alignment) [6], which clusters image patches into semantic parts according to the heat distribution of feature maps, as shown in Fig. 1(c). Each part is expected to indicate certain semantics such as the target or its surroundings. Compared to LPN, which adopts a fixed-scale spatial partition, FSRA is more flexible in extracting parts and thus more robust against image shift and scale variations. However, FSRA employs a hard partition strategy in which an image patch belongs to only one part and part-level representations are calculated as the mean of image patches. We argue that there are two issues with such a strategy. First, image patches similar in features may be divided into different parts, as shown in Fig. 1(c). Second, such a strategy cannot extract the most representative semantics, since it neglects the associations between image patches and parts. To cope with these limitations, this paper proposes an Adaptive Semantic Aggregation (ASA) module. Unlike the hard partition strategy of FSRA [6], the ASA module employs a soft partition strategy that considers correlations between parts and all image patches to generate global-aware part-level representations. Specifically, each part is regarded as one semantic and has a distribution of attention over all image patches, as shown in Fig. 1(d). First, the most representative patch is selected as the anchor of a part. Attentions of image patches are allocated by calculating similarities between the patches and the anchor: a high attention expresses strong correlation of an image patch to the semantic part, while a low attention indicates weak correlation. Then all patch-level features are adaptively aggregated into global-aware part-level representations according to the learned attentions. Finally, the ASA module is integrated into the two-branch transformer-based framework [6] and explicitly makes the learned parts focus on distinctive patches, such as a gray roof or a circular road. The remainder of this paper is organized as follows. Section 2 briefly describes current work related to cross-view geo-localization. Section 3 introduces the overall framework of our method and describes the proposed ASA module in detail. Section 4 presents and analyzes the experimental results on the University-1652 dataset. Section 5 summarizes this paper.

2 Related Work

In 2020, Zheng et al. [7] formulated the UAV visual geo-localization problem as bidirectional cross-view image retrieval between UAV-view and satellite-view images. They employed a two-branch CNN model to extract global features from different domains and released the University-1652 dataset for model evaluation. Upon this work, Ding et al. [8] simplified the cross-view image retrieval task as a location classification problem during training, aiming to learn a common location-dependent feature space that scales well to unseen images. They noticed the imbalance between UAV-view and satellite-view images and performed data augmentation to expand the training satellite-view images.


Fig. 2. Overall framework of our method.

Based on the above location classification framework, recent works [5,6,9–12] have made various attempts to improve the discriminative power of the learned feature space. One typical solution is LPN [5], which achieves fine-grained part matching between UAV-view and satellite-view images. LPN applies a square-ring partition strategy to separate global feature maps into multiple parts according to their spatial position; rotation-invariant part-level features are then obtained by average pooling over the points within each part. Tian et al. [9] produce synthesized UAV-view images with a generative model to reduce the gap between the two views and then employ LPN [5] for cross-view image retrieval. Zhuang et al. [10] improved LPN [5] by incorporating global features and adding a KL loss to further close the distance between paired UAV-view and satellite-view images. Lin et al. [11] introduced a Unit Subtraction Attention Module (USAM) that forces the geo-localization model (e.g., LPN [5]) to focus on salient regions by detecting representative key points. Considering that the square-ring partition strategy is not robust to scale variations, Dai et al. [6] extract semantic parts composed of patches scattered throughout the image: small patches are ranked according to the feature heat map and uniformly divided into multiple parts regardless of their spatial position, and a part is represented by the average feature of the patches within it. Zhuang et al. [12] improved this part partition strategy by searching for the optimal split based on the gradient between adjacent positions in the ranking results. However, the above methods [6,12] aggregate image patches equally, which weakens the semantic characteristics of the learned parts. In this paper, we propose an Adaptive Semantic Aggregation module that regards parts as the most representative semantics in images and obtains part-level features by aggregating all patch features based on their correlations to the parts.


Fig. 3. Architecture of Vision Transformer (ViT).

3 Method

Figure 2 depicts the overall framework of our method. Inspired by siamese networks [13,14], two branches are designed to handle images captured by UAVs and satellites, respectively. Following FSRA [6], we adopt ViT (Vision Transformer) [15] as the backbone to extract features from both UAV-view and satellite-view images. The backbones of the two branches share weights to learn a mapping function that projects images from both views into one common embedding space. ViT produces global features of the entire image as well as local features of image patches, which are fed into the proposed ASA module to generate multiple part-level representations. Finally, the global features and part-level representations are sent to the classification module, which regards the geographic locations of training images as semantic categories. The classification module contains additive layers for representation transformation and classification layers for geo-location prediction. In the test stage, our goal is cross-view retrieval between UAV-view and satellite-view images captured at new geographic locations; the classifier cannot infer locations of query or gallery images during testing, since the test data have their own location label space disjoint from the training data. Therefore, we concatenate the transformed representations before the classification layers as the final descriptor of a test image. To measure the correlation between a query image from one view and gallery images from the other view, we compute the Euclidean distance between them, sort the gallery images by their distance to the query, and return the most similar one to achieve cross-view image retrieval.

3.1 Transformer-Based Backbone

The architecture of our backbone is illustrated in Fig. 3. Given an input image $x \in \mathbb{R}^{H \times W \times C}$, it is first divided into fixed-size patches using pre-defined parameters. The patches are then linearly transformed to obtain embedding vectors $x_p \in \mathbb{R}^{N \times D}$, where N and D denote the number of patches and the embedding dimension, respectively.


Additionally, a learnable vector $x_{cls} \in \mathbb{R}^{1 \times D}$ is forwarded along with $x_p$. All these embedding vectors are combined with position embeddings $pos \in \mathbb{R}^{(N+1) \times D}$ to obtain the input $Z_0$ of the Transformer Encoder:

$Z_0 = [x_{cls}; x_p] + pos.$   (1)

To accommodate the varying resolution of input images, we employ learnable position embeddings instead of the parameters pre-trained on ImageNet [16]. As shown in Fig. 3, the Transformer Encoder consists of alternating Multi-Head Self-Attention (MHSA) layers and Layer Norm (LN) operations with residual connections. The Multi-Layer Perceptron (MLP) is a two-layer non-linearity block, each layer of which ends with a GELU activation. The Transformer Encoder is formulated as

$Z_l' = \mathrm{MHSA}(\mathrm{LN}(Z_{l-1})) + Z_{l-1},$   (2)
$Z_l = \mathrm{MLP}(\mathrm{LN}(Z_l')) + Z_l',$   (3)

where $Z_l'$ and $Z_l$ denote the outputs of the $l$-th attention layer and the $l$-th MLP layer, respectively. Finally, the output $Z_L$ consists of a global feature (the class token $\in \mathbb{R}^{1 \times D}$ generated from $x_{cls}$) and patch-level features $\{P_i \in \mathbb{R}^{1 \times D}\}_{i=1:N}$ generated from $x_p$.
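For illustration, a single encoder block following Eqs. (2)-(3) can be sketched in PyTorch as below; the embedding dimension and head count are the common ViT-Base defaults and are our assumption, not the paper's exact configuration.

```python
# Sketch of one Transformer encoder block (Eqs. 2-3); a simplified stand-in for the backbone.
import torch.nn as nn

class EncoderBlock(nn.Module):
    def __init__(self, dim=768, heads=12, mlp_ratio=4.0):
        super().__init__()
        self.norm1 = nn.LayerNorm(dim)
        self.attn = nn.MultiheadAttention(dim, heads, batch_first=True)
        self.norm2 = nn.LayerNorm(dim)
        self.mlp = nn.Sequential(
            nn.Linear(dim, int(dim * mlp_ratio)),
            nn.GELU(),
            nn.Linear(int(dim * mlp_ratio), dim),
        )

    def forward(self, z):
        # Eq. (2): multi-head self-attention with a residual connection.
        h = self.norm1(z)
        z = z + self.attn(h, h, h, need_weights=False)[0]
        # Eq. (3): MLP with a residual connection.
        z = z + self.mlp(self.norm2(z))
        return z
```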

3.2 Adaptive Semantic Aggregation Module

Previous works have shown the effectiveness of part matching between UAV-view and satellite-view images in the UAV visual geo-localization task. In this paper, we propose a soft partition strategy to adaptively aggregate image patches into part-level representations. Different from hard partition strategies [5,6], the proposed soft partition strategy regards a part as a semantic aggregation of image patches according to the correlations between them. Details of the ASA module are illustrated in Fig. 2. An input image x is first fed into ViT to extract global features as well as patch-level features $\{P_i \in \mathbb{R}^{1 \times D}\}_{i=1:N}$. Assuming equal importance of the feature dimensions, we apply average pooling over the features of each patch to obtain 1-dimensional representations $\{Q_i \in \mathbb{R}^{1 \times 1}\}_{i=1:N}$:

$Q_i = \frac{1}{D}\sum_{d=1}^{D} P_i^d,$   (4)

where i and d are the indices of the patch and the feature dimension, respectively. Patches with similar semantic information exhibit similar representations in $\{Q_i\}_{i=1:N}$, as demonstrated in FSRA [6] (see Fig. 1(c)). Accordingly, we perform the k-means algorithm on $\{Q_i\}_{i=1:N}$ to extract representative semantics.


Specifically, the 1-dimensional representations are first sorted in descending order to obtain a sequence S, each item of which indicates a patch index. We introduce a hyper-parameter K denoting the number of semantic categories in the image. The center of the $k$-th category is initialized as the 1-dimensional representation of patch $S_{IC_k}$, where the index $IC_k$ is given by

$IC_k = \frac{(2k-1)N}{2K}, \quad k = 1, 2, \ldots, K.$   (5)

The k-means algorithm updates the semantic categories iteratively and outputs the final category centers. The center of the $k$-th category corresponds to a patch $P_{C_k}$, which is regarded as the anchor of a part. We then use the original patch features produced by ViT to calculate the Euclidean distance $dis_k^i$ between the anchor of a part and all image patches:

$dis_k^i = \|P_i - P_{C_k}\|_2.$   (6)

Given the negative correlation between distance and similarity, we employ the cosine function to obtain the attention matrix:

$A_k^i = \alpha \cdot \cos\Big(\frac{dis_k^i - dis_k^{min}}{dis_k^{max} - dis_k^{min}} \cdot \frac{\pi}{2}\Big) + \beta,$   (7)

where $\alpha$ and $\beta$ are factors for enhancing the robustness of the weight matrix, set to 1 and 0 in our experiments, respectively. Finally, we aggregate the patch features $\{P_i\}_{i=1:N}$ into global-aware part-level features $\{\rho_k\}_{k=1:K}$ according to the customized attention matrix A:

$\rho_k = \frac{\sum_{i=1}^{N} P_i \cdot A_k^i}{\sum_{i=1}^{N} A_k^i}.$   (8)

Classification Module

The classification module takes global features as well as part-level representations as input, aiming at classifying them into different geographic locations. As shown in Fig. 4, the classification module consists of additive layers and classification layers. The former achieve transformation of the input while the latter produce prediction vectors of geographic locations. Separate layers are constructed for global features and representations of each part, considering that they indicate different semantic characteristics. Take representations of the k th part as an example, operations of the classification module can be formulated as k fk = Fadd (ρk ),

(9)

472

S. Li et al.

Fig. 4. Architecture of the classification module. In training, global and part-level features are fed into additive layers followed by classification layers. Suppose that the training data come from 701 locations, so a classification layer predicts a 701-dimensional vector. The model is optimized by CE loss and Triplet loss. Green and purple lines point at positive and negative samples for calculating the Triplet loss, respectively. (Color figure online)

z_k = F_{cls}^{k}(f_k), \qquad (10)

where F_{add}^{k} and F_{cls}^{k} denote the additive layer and the classification layer for the k-th part, respectively, f_k denotes the output features of the additive layer, and z_k indicates the predicted logits over all geographic locations. Following [6], the model is optimized by both the CE loss L_{CE} and the Triplet loss L_{Triplet}. The total objective is defined as

L_{total} = L_{CE} + L_{Triplet}, \qquad (11)

where L_{CE} is the cross entropy between the predicted logits and the ground-truth geo-tags, formulated by

L_{CE} = -\frac{1}{K} \sum_{k=1}^{K} \log \frac{\exp(z_k(y))}{\sum_{c=1}^{C} \exp(z_k(c))}, \qquad (12)

where C indicates the number of geographic locations in the training data and z_k(y) is the logit score of the ground-truth geo-tag y. The Triplet loss is applied to the output features of the additive layers to pull paired UAV-view and satellite-view images together while pushing mismatched image pairs apart. It is formulated by

L_{Triplet} = \frac{1}{K} \sum_{k=1}^{K} \max\big( d(f_k, p) - d(f_k, n) + M,\ 0 \big), \qquad (13)


where M is a hyper-parameter empirically set to 0.3, d(·) denotes the distance function, and p and n denote the positive and negative samples, respectively. As depicted in Fig. 4, both the positive sample (green line) and the negative samples (purple lines) come from the other view. For example, if we take the part-level feature f_k of a UAV-view image as the anchor, then f_k is compared with the corresponding part features of the true-matched and mismatched satellite-view images.
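The following is a hedged PyTorch sketch of the objective in Eqs. (11)–(13) for a single anchor image, assuming the per-part logits z_k and per-part embeddings f_k are already available. The Euclidean distance is used for d(·) and the margin M = 0.3 follows the text; the tensor layout and the function name are illustrative assumptions.

```python
import torch
import torch.nn.functional as F

def asa_total_loss(logits, feats, pos_feats, neg_feats, label, margin=0.3):
    """CE + Triplet objective of Eqs. (11)-(13), sketched for a single anchor image.

    logits:    list of K tensors of shape (C,)   - z_k for each part
    feats:     list of K tensors of shape (Dp,)  - f_k of the anchor view
    pos_feats: list of K tensors of shape (Dp,)  - matching parts of the paired cross-view image
    neg_feats: list of K tensors of shape (Dp,)  - same parts of a mismatched cross-view image
    label:     scalar tensor holding the ground-truth geo-tag y
    """
    K = len(logits)
    # Eq. (12): cross entropy averaged over the K parts.
    ce = sum(F.cross_entropy(z.unsqueeze(0), label.view(1)) for z in logits) / K
    # Eq. (13): triplet loss with Euclidean distance and margin M.
    tri = sum(
        F.relu(torch.dist(f, p) - torch.dist(f, n) + margin)
        for f, p, n in zip(feats, pos_feats, neg_feats)
    ) / K
    # Eq. (11): total objective.
    return ce + tri
```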

4 Experiments

4.1 Dataset

The University-1652 dataset [7] contains images of 1652 buildings from 72 universities in three camera views, i.e., UAV view, satellite view, and ground view. This dataset is employed to evaluate the proposed method on the UAV visual geo-localization task, so only UAV-view and satellite-view images are utilized for bidirectional cross-view image retrieval. For each building, one image was obtained by satellite and 54 images were captured by UAVs at different heights and from different perspectives. Thus, there exist large variations in image scale and viewpoint, which poses great challenges to the UAV visual geo-localization task. The dataset is split into a training set including 38,555 images of 701 buildings from 33 universities and a test set containing 52,306 images of 951 buildings from the remaining 39 universities. During training, we treat the 701 buildings as 701 categories of the classification module and learn an embedding space for UAV-view and satellite-view images. In the UAV-to-satellite image retrieval task, the query set consists of 37,855 UAV-view images of 701 buildings, and the gallery includes 701 true-matched satellite-view images and 250 distractors. In the satellite-to-UAV image retrieval task, 701 satellite-view images constitute the query set and there are 51,355 gallery images, including 37,855 true-matched UAV-view images and 13,500 distractors from the remaining 250 buildings.

4.2 Evaluation Protocol

Following previous works [5,7–10], we employ two evaluation metrics, namely Recall@K and average precision (AP), both of which are widely used in image retrieval. Recall@K refers to the ratio of query images with at least one true-matched image appearing in the top-K ranking list. AP computes the area under the precision-recall curve and considers all true-matched images in the gallery. Recall@K focuses on the position of the first true-matched image in the ranking and is thus suitable for the UAV-to-satellite retrieval task, where the gallery contains only one true-matched satellite-view image for each UAV-view query. In the satellite-to-UAV retrieval task, there are multiple true-matched UAV-view images for one satellite-view query, and AP comprehensively measures the matching results. In this paper, we report the mean Recall@K and mean AP over all queries.
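For reference, a small Python sketch of the two metrics for a single query is given below, assuming a ranked list of gallery identifiers and a set of true-match identifiers; it follows the generic definitions rather than the authors' evaluation script, and the example identifiers are hypothetical.

```python
import numpy as np

def recall_at_k(ranked_gallery, true_matches, k):
    """1.0 if any true-matched gallery item appears in the top-k of the ranking, else 0.0."""
    return float(any(g in true_matches for g in ranked_gallery[:k]))

def average_precision(ranked_gallery, true_matches):
    """Area under the precision-recall curve for a single query."""
    hits, precisions = 0, []
    for rank, g in enumerate(ranked_gallery, start=1):
        if g in true_matches:
            hits += 1
            precisions.append(hits / rank)
    return float(np.mean(precisions)) if precisions else 0.0

# Example: one UAV-view query whose single true match is ranked second.
print(recall_at_k(["b2", "b7", "b9"], {"b7"}, k=1))   # 0.0
print(average_precision(["b2", "b7", "b9"], {"b7"}))  # 0.5
```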

Table 1. Cross-view image retrieval performance (%) of different methods.

Method               Year  Backbone    UAV-to-Satellite     Satellite-to-UAV
                                       Recall@1   AP        Recall@1   AP
Zheng et al. [7]     2020  ResNet-50   58.49      63.31     71.18      58.74
LCM [8]              2020  ResNet-50   66.65      70.82     79.89      65.38
LPN [5]              2021  ResNet-50   75.93      79.14     86.45      74.79
PCL [9]              2021  ResNet-50   79.47      83.63     87.69      78.51
RK-Net (USAM) [11]   2022  ResNet-50   77.60      80.55     86.59      75.96
SGM [12]             2022  Swin-T      82.14      84.72     88.16      81.81
FSRA [6]             2022  ViT-S       84.51      86.71     88.45      83.37
ASA (Ours)           –     ViT-S       85.12      87.21     89.30      84.17

Table 2. Cross-view image retrieval performance (%) with different strategies.

Strategy                          UAV-to-Satellite     Satellite-to-UAV
                                  Recall@1   AP        Recall@1   AP
Uniform hard partition strategy   83.98      86.27     88.59      83.91
k-means hard partition strategy   84.97      87.12     88.16      83.85
k-means soft partition strategy   85.12      87.21     89.30      84.17

4.3 Implementation Details

All input images are resized to 256 × 256 and the number of parts K is set to 2. Due to the imbalance between the numbers of UAV-view and satellite-view images, we perform image augmentation on the satellite-view images during training. A small Vision Transformer (ViT-S) pre-trained on ImageNet [16] is employed as the backbone for feature extraction, while the ASA module and the classification module are trained from scratch. The model is trained with an SGD optimizer with a momentum of 0.9, a weight decay of 0.0005, and a mini-batch size of 8. The learning rate is initialized to 0.003 for ViT-S and 0.01 for the remaining layers. The model is trained for a total of 120 epochs, and the learning rate is decayed by a factor of 0.1 after 70 and 110 epochs.
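A hedged PyTorch sketch of this schedule is given below (SGD with momentum 0.9, weight decay 0.0005, learning rates 0.003/0.01, and a ×0.1 decay after epochs 70 and 110); the module definitions are stand-ins rather than the actual network.

```python
import torch
import torch.nn as nn

# Stand-ins for the real sub-modules (the pre-trained ViT-S backbone and the newly
# initialized ASA/classification layers); only the optimizer wiring is of interest here.
backbone = nn.Linear(384, 384)
new_layers = nn.Sequential(nn.Linear(384, 512), nn.ReLU(), nn.Linear(512, 701))

optimizer = torch.optim.SGD(
    [
        {"params": backbone.parameters(), "lr": 0.003},   # pre-trained ViT-S
        {"params": new_layers.parameters(), "lr": 0.01},  # ASA module + classification module
    ],
    momentum=0.9,
    weight_decay=5e-4,
)
# Decay both learning rates by a factor of 0.1 after epochs 70 and 110.
scheduler = torch.optim.lr_scheduler.MultiStepLR(optimizer, milestones=[70, 110], gamma=0.1)

for epoch in range(120):
    # ... one training epoch over mini-batches of size 8 would run here ...
    scheduler.step()
```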

4.4 Experimental Results

Comparison with Existing Methods. Table 1 presents our model's performance for bidirectional cross-view retrieval between UAV-view and satellite-view images, compared to previous methods [5–9,11,12]. LCM [8] resizes input images to 384 × 384, while the other methods use an image scale of 256 × 256. The methods in [5,7–9,11] all use ResNet-50 [17] as the backbone to extract CNN features. Among them, the methods in [7,8] learn global features, while LPN [5], PCL [9], and RK-Net [11] achieve part matching by using the


Table 3. Effect of the number of parts on cross-view image retrieval performance (%).

Number of parts (K)   UAV-to-Satellite     Satellite-to-UAV
                      Recall@1   AP        Recall@1   AP
1                     72.11      75.59     79.46      71.83
2                     85.12      87.21     89.30      84.17
3                     84.73      86.88     88.45      83.55
4                     84.48      86.74     87.59      83.22

Table 4. Cross-view image retrieval performance (%) with different image sizes.

Image Size   UAV-to-Satellite     Satellite-to-UAV
             Recall@1   AP        Recall@1   AP
224 × 224    82.28      84.83     86.31      81.93
256 × 256    85.12      87.21     89.30      84.17
384 × 384    86.88      88.74     89.44      85.95
512 × 512    88.67      90.29     89.44      87.14

square-ring partition strategy, which is not robust to scale variations. The experimental results demonstrate the inferior performance of these methods [5,7–9,11]. FSRA [6] and SGM [12] adopt transformers as the backbone and cluster patches into semantic parts with hard partition strategies. Our method performs better than FSRA [6] and SGM [12] on both tasks, even though SGM [12] employs a stronger backbone. In comparison with FSRA [6], both methods extract features with a ViT-S backbone and utilize a 3× sampling strategy to expand the satellite-view images during training. The superior results achieved by our method demonstrate that the proposed ASA module reasonably aggregates patches into part-level representations, thus improving the performance of part matching.

Ablation Study. In this section, we investigate the effect of several key factors. First, we explore the effectiveness of the proposed soft partition strategy by comparing it with two baselines in Table 2. The baseline "Uniform hard partition strategy" uniformly groups patches into K parts according to the 1-dimensional semantic representations obtained by Eq. (4). The baseline "k-means hard partition strategy" uses the k-means clustering results for part partition. Both baselines take the average of the patches within each part to generate part-level representations. As shown in Table 2, the proposed "k-means soft partition strategy" is more effective in aggregating part-level features and achieves the best results for UAV visual geo-localization.

Next, we analyze the impact of the number of parts K on the cross-view image retrieval performance. As shown in Table 3, the performance is poor if K is set to 1, since a single learned part cannot express the representative semantics of the image. Our model achieves the best generalization performance when K


is set to 2. We believe that the two learned parts capture the typical semantics of foreground and background. Finally, we discuss the effect of different image sizes. As shown in Table 4, higher-resolution images generally yield better results since they retain more fine-grained details, at the cost of more memory and longer inference time. The performance of our method does not degrade significantly when the image size is reduced from 512 × 512 to 256 × 256. Therefore, we recommend using an image size of 256 × 256 if hardware resources are limited.

5 Conclusion

In this paper, we have presented an adaptive semantic aggregation method to learn global-aware part-level features that express the typical semantics of scenes. Unlike existing hard partition strategies, we have developed a soft partition strategy that searches for the most representative semantics as parts and evaluates the significance of patches with respect to different parts. Our method adaptively aggregates the features of all patches into part-level representations. Compared with current works, the proposed method achieves superior performance on the University-1652 dataset. Ablation studies further verify the effectiveness of the proposed soft partition strategy and investigate several key factors of our method.

References

1. Chivasa, W., Mutanga, O., Biradar, C.: UAV-based multispectral phenotyping for disease resistance to accelerate crop improvement under changing climate conditions. Remote Sensing 12(15), 2445 (2020)
2. Rizk, M., Slim, F., Charara, J.: Toward AI-assisted UAV for human detection in search and rescue missions. In: DASA, pp. 781–786. IEEE (2021)
3. Ecke, S., Dempewolf, J., Frey, J., Schwaller, A., Endres, E., Klemmt, H.J., Tiede, D., Seifert, T.: UAV-based forest health monitoring: a systematic review. Remote Sensing 14(13), 3205–3249 (2022)
4. Chiu, L.C., Chang, T.S., Chen, J.Y., Chang, N.Y.C.: Fast SIFT design for real-time visual feature extraction. TIP 22(8), 3158–3167 (2013)
5. Wang, T., Zheng, Z., Yan, C., Zhang, J., Sun, Y., Zheng, B., Yang, Y.: Each part matters: local patterns facilitate cross-view geo-localization. TCSVT 32(2), 867–879 (2021)
6. Dai, M., Hu, J., Zhuang, J., Zheng, E.: A transformer-based feature segmentation and region alignment method for UAV-view geo-localization. TCSVT 32(7), 4376–4389 (2022)
7. Zheng, Z., Wei, Y., Yang, Y.: University-1652: a multi-view multi-source benchmark for drone-based geo-localization. In: ACM MM, pp. 1395–1403 (2020)
8. Ding, L., Zhou, J., Meng, L., Long, Z.: A practical cross-view image matching method between UAV and satellite for UAV-based geo-localization. Remote Sensing 13(1), 47 (2020)
9. Tian, X., Shao, J., Ouyang, D., Shen, H.T.: UAV-satellite view synthesis for cross-view geo-localization. TCSVT 32(7), 4804–4815 (2021)
10. Zhuang, J., Dai, M., Chen, X., Zheng, E.: A faster and more effective cross-view matching method of UAV and satellite images for UAV geo-localization. Remote Sensing 13(19), 3979 (2021)
11. Lin, J., Zheng, Z., Zhong, Z., Luo, Z., Li, S., Yang, Y., Sebe, N.: Joint representation learning and keypoint detection for cross-view geo-localization. TIP 31, 3780–3792 (2022)
12. Zhuang, J., Chen, X., Dai, M., Lan, W., Cai, Y., Zheng, E.: A semantic guidance and transformer-based matching method for UAVs and satellite images for UAV geo-localization. IEEE Access 10, 34277–34287 (2022)
13. Bromley, J., Guyon, I., LeCun, Y., Säckinger, E., Shah, R.: Signature verification using a "siamese" time delay neural network. NeurIPS 6 (1993)
14. Koch, G., Zemel, R., Salakhutdinov, R., et al.: Siamese neural networks for one-shot image recognition. In: ICML Deep Learning Workshop, vol. 2. Lille (2015)
15. Dosovitskiy, A., et al.: An image is worth 16x16 words: transformers for image recognition at scale. arXiv preprint arXiv:2010.11929 (2020)
16. Deng, J., Dong, W., Socher, R., Li, L.J., Li, K., Fei-Fei, L.: ImageNet: a large-scale hierarchical image database. In: CVPR, pp. 248–255. IEEE (2009)
17. He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: CVPR, pp. 770–778 (2016)

Lightweight Multiview Mask Contrastive Network for Small-Sample Hyperspectral Image Classification

Minghao Zhu1(B), Heng Wang1,2, Yuebo Meng1,2(B), Zhe Shan1,2, and Zongfang Ma1

1 College of Information and Control Engineering, Xi'an University of Architecture and Technology, Xi'an 710055, China
[email protected], [email protected], [email protected], [email protected]
2 Xi'an Key Laboratory of Intelligent Technology for Building Manufacturing, Xi'an 710055, China

Abstract. Deep learning methods have made significant progress in the field of hyperspectral image (HSI) classification. However, these methods often rely on a large number of labeled samples, parameters, and computational resources to achieve state-of-the-art performance, which limits their applicability. To address these issues, this paper proposes a lightweight multiview mask contrastive network (LMCN) for HSI classification under small-sample conditions. Considering the influence of irrelevant bands, we construct two views in an HSI scene using band selection and principal component analysis (PCA). To enhance instance discriminability, we propose a combination of self-supervised mask learning and contrastive learning in the design of LMCN. Specifically, we train corresponding masked autoencoders using the obtained views and utilize the feature extraction part of the autoencoder as an augmentation function, conducting unsupervised training through contrastive learning. To reduce the number of parameters, we employ lightweight Transformer modules to construct the autoencoder. Experimental results demonstrate the superiority of this approach over several advanced supervised learning methods and few-shot learning methods under small-sample conditions. Furthermore, this method exhibits lower computational costs. Our code is available at https://github.com/Winkness/LMCN.git.

Keywords: Hyperspectral image (HSI) classification · Contrastive learning · Mask autoencoder · Lightweight network · Small samples

This work was supported in part by the Key Research and Development Project of Shanxi Province (No. 2021SF-429).

Supplementary Information The online version contains supplementary material available at https://doi.org/10.1007/978-981-99-8462-6_39.

1 Introduction

Hyperspectral image (HSI) contains rich and detailed spectral information as well as spatial information, making it advantageous in object classification and spectral analysis [1]. Currently, HSI has been applied in various fields such as earth monitoring, agricultural management, and medical analysis [2]. While HSI provides abundant information, it also poses several challenges for hyperspectral image classification (HSIC), such as high-dimensional data, sample imbalance, and diversity of objects [3]. To address these challenges, researchers have proposed numerous methods. Supervised deep learning methods typically employ deep learning models consisting of multiple layers to automatically learn features from a large amount of labeled data, eliminating the need for manual feature engineering. Representative methods include convolutional neural networks (CNN) and recurrent neural networks (RNN) [4]. Zhu et al. drew inspiration from human attention and proposed a residual spectral-spatial attention network (RSSAN) [5] to extract more effective features. These methods have achieved excellent performance in HSIC, but they suffer from the disadvantages of requiring a large amount of labeled samples and having overly complex models. To overcome the dependency on annotated samples, many researchers propose using contrastive learning (CL) for unsupervised pretraining of deep networks [6]. Contrastive learning learns by comparing the similarity between samples without relying on label information and has recently been widely applied in HSIC [7]. For example, Guan et al. proposed a cross-domain contrastive learning framework (XDCL) [6] for unsupervised learning of HSI representations. Hou et al. [8] used residual networks (ResNet) as encoders for both the spatial and spectral domains instead of using two separate encoders. Recently, multimodal approaches and graph convolutional neural networks have also been employed in HSIC and achieved promising results [9]. The CL-based HSIC method has partially alleviated the reliance on labeled samples, but there are still several unresolved issues: (1) Due to equal treatment of all spectral bands, most methods are susceptible to the influence of irrelevant bands, leading to a decrease in accuracy. (2) Many existing CL models have high hardware requirements, limiting their practical applications. (3) The features extracted by commonly used encoders are often insufficient. To address these issues, we draw inspiration from the multiview contrastive learning concept [7] and the BERT language model (masked language model (MLM) task) [10]. We propose a lightweight multiview mask contrastive learning network (LMCN) for HSI classification under small-sample conditions. Specifically, we believe that each band within the HSI reflects distinct characteristics of the terrain. However, fundamentally, they collectively represent the same semantic information. Therefore, images from different bands can be regarded as views of the terrain from various perspectives. LMCN consists of two modules: band selection view autoencoder (SVE) and principal component view autoencoder (PVE). Each module consists of view construction and lightweight masked autoencoder components. The difference

480

M. Zhu et al.

lies in the view construction strategy: SVE adopts a band-selection-based view construction strategy, while PVE employs a PCA-based view construction strategy. LMCN takes the original HSI as input, and the two modules construct two views of the same HSI in different ways. Each autoencoder takes its view as input and, after training, its encoding part serves as an effective feature extractor. After extracting features from the views, unsupervised training is performed through contrastive learning.

The main contributions of this paper are as follows: (1) We introduce the self-supervised masked learning method into the remote sensing field, which enhances the model's learning of important features in the data. (2) To enhance instance discriminability, we combine self-supervised masked learning with contrastive learning, demonstrating that the combination of these two methods yields superior results. (3) We develop a lightweight masked autoencoder that allows LMCN to maintain high accuracy while having lower computational costs. Additionally, we incorporate graph convolution-based band selection into view construction, mitigating the impact of irrelevant bands.

The remaining parts of this paper are organized as follows. Section 2 provides a detailed description of the proposed model. Section 3 presents the experiments conducted and analyzes the results. Section 4 concludes the paper and discusses future research directions.

2 Proposed Method

In this section, we describe the two proposed modules (SVE and PVE) separately and then present the overall framework of the network.

2.1 SVE Module

The combination of graph convolution and self-representation methods enables the utilization of structural information from spectral bands [11]. Motivated by this, we consider an HSI cube X ∈ R^{N×B}, where N represents the number of pixels and B represents the number of bands. Our objective is to obtain a representative subset of bands, of size n (n < B), to construct views of the HSI. These views are then input to the masked autoencoder for self-supervised training. The structure of SVE is illustrated in Fig. 1. First, we transform X into a k-nearest neighbor (KNN) graph G = (ν, ε, A). G is used to represent the structural information of the spectral data in the HSI, where each node ν_i represents a band x_i. By selecting a parameter k, we compute the k nearest nodes to each node ν_i to determine ε. A is the adjacency matrix of graph G, defined as

A^{(i,j)} = \begin{cases} 1 & x_i \in N_k(x^{(j)}) \ \text{or} \ x_j \in N_k(x^{(i)}) \\ 0 & \text{otherwise} \end{cases} \qquad (1)

In this context, N_k(x^{(i)}) denotes the set of nearest neighboring nodes of x^{(i)}. Through this transformation, we convert the task of finding a subset of


Fig. 1. Structure of SVE module. SVE takes HSI as input and constructs a spectral graph using graph convolution. The graph embedding is then used in self-expression to solve the self-expression matrix and obtain a subset of bands, which are used to construct a view. The autoencoder is trained with the masked view as input.

bands into the task of distinguishing a subset of significant nodes. Typically, a self-representation model can be defined as X = XC, where C is the coefficient matrix of the self-representation. The L1 norm of each row of C can serve as a metric to measure the contribution of the corresponding band. Usually, C can be obtained by minimizing the L1 norm or applying L0 norm regularization. By employing graph convolution (GCN) [11] to compute the graph embedding of G and combining it with self-representation, the modified self-representation model can be defined as follows:

X = X \tilde{D}^{-1/2} \tilde{A} \tilde{D}^{-1/2} C, \quad \text{s.t.} \ \operatorname{diag}(C) = 0 \qquad (2)

In the above equation, X\tilde{D}^{-1/2}\tilde{A}\tilde{D}^{-1/2} represents the graph embedding matrix of the band graph, \tilde{A} is the adjacency matrix with self-loops, and \tilde{D} is the degree matrix of the nodes. Next, we utilize the Frobenius norm to solve for the self-representation matrix C. The above equation can then be transformed as follows:

C = (W^{T} W + \lambda I_b)^{-1} W^{T} X \qquad (3)

The graph embedding matrix is denoted as W, λ is a scaling factor, and I_b is the identity matrix. After obtaining C, we normalize each of its columns and then calculate the L1 norm of each row of C to measure the contribution of each band to the reconstruction. Subsequently, we select the top-n bands with the highest contribution values as the chosen subset of bands. The selected bands are randomly permuted and then concatenated along the channel dimension to form a new HSI. Next, we perform boundary expansion on the image and divide it into small patches with a neighborhood size of 28 × 28, centered on each land-cover pixel. Each resulting patch image (28 × 28 × n) serves as a view of a land-object instance and is fed into the masked autoencoder after data augmentation. The views are


spatially divided into several non-overlapping image blocks (size: 2 × 2), and most of the image blocks of each view are masked (masking rate: 0.75). The encoder takes the visible image blocks as input and performs feature extraction, while the decoder takes all image blocks as input and reconstructs the original image at the pixel level. The loss function is the mean squared error (MSE), which effectively captures the data distribution and allows the reconstructed values to closely match the original inputs. To reduce computational and memory overhead, we employ lightweight Transformer blocks to construct the masked autoencoder. Specifically, the encoder consists of twelve blocks and the decoder of four blocks. Each block comprises alternating layers of multi-head self-attention (MSA), a multilayer perceptron (MLP), and layer normalization (LN), as shown in Fig. 1, and we use three attention heads in the self-attention mechanism. After self-supervised pretraining, the decoder is discarded, and the encoder serves as an effective image feature extractor, which is then applied to the complete view.
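The band-selection step of Eqs. (1)–(3) can be sketched in NumPy as follows, under the simplifying assumptions that the diag(C) = 0 constraint is dropped (as in the closed form of Eq. (3)) and that squared distances are used for the k-NN graph; the function and parameter names are illustrative, not taken from the released code.

```python
import numpy as np

def select_bands(X, n_bands, k=3, lam=1e-4):
    """Graph-convolutional self-representation band selection (a sketch of Eqs. (1)-(3)).

    X: (N, B) matrix with N pixels and B spectral bands.
    Returns the indices of the n_bands most representative bands.
    """
    _, B = X.shape
    # Eq. (1): k-NN adjacency over bands; each band (node) is described by its N pixel values.
    G = X.T @ X                                              # (B, B) band Gram matrix
    sq = np.diag(G)
    dist2 = sq[:, None] + sq[None, :] - 2.0 * G              # squared band-to-band distances
    np.fill_diagonal(dist2, np.inf)
    knn = np.argsort(dist2, axis=1)[:, :k]
    A = np.zeros((B, B))
    A[np.repeat(np.arange(B), k), knn.ravel()] = 1.0
    A = np.maximum(A, A.T)                                   # x_i in N_k(x_j) or x_j in N_k(x_i)
    # Graph embedding W = X D^{-1/2} (A + I) D^{-1/2} used in Eq. (2).
    A_tilde = A + np.eye(B)
    d_inv_sqrt = 1.0 / np.sqrt(A_tilde.sum(axis=1))
    W = X @ (A_tilde * d_inv_sqrt[:, None] * d_inv_sqrt[None, :])
    # Eq. (3): closed-form self-representation coefficients (diag(C) = 0 is not enforced here).
    C = np.linalg.solve(W.T @ W + lam * np.eye(B), W.T @ X)
    C = C / (np.linalg.norm(C, axis=0, keepdims=True) + 1e-12)  # normalize each column
    scores = np.abs(C).sum(axis=1)                            # row-wise L1 norm = band contribution
    return np.argsort(scores)[::-1][:n_bands]
```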

2.2 PVE Module

Fig. 2. Structure of PVE module. PVE takes HSI as input and constructs a view using PCA. Similar to SVE, the autoencoder takes the masked view as input and undergoes self-supervised training.

The structure of PVE is shown in Fig. 2. The first 100 bands of the HSI data are selected to generate the view. Then, PCA is applied to extract the three principal components, forming a three-channel image X_1 ∈ R^{h×w×3}. Subsequently, similar to SVE, we perform boundary extension and image block segmentation on X_1 with a neighborhood size of 28 × 28. Each segmented image block has a size of 28 × 28 × 3 and is considered as a view for each land


cover instance. Next, the generated views are randomly masked and used as inputs to the masked autoencoder. After self-supervised pretraining, similar to SVE, the decoder is discarded, and the encoder serves as an effective spatial feature extractor that can be applied to the complete views.
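A short sketch of this view construction is shown below, assuming scikit-learn's PCA and reflect padding for the boundary extension (the padding mode is not specified in the text); it materializes all patch views at once purely for clarity, and the masking and autoencoder training steps are not shown.

```python
import numpy as np
from sklearn.decomposition import PCA

def build_pca_views(hsi, num_bands=100, patch=28):
    """PVE view construction: first 100 bands -> 3 principal components -> 28x28x3 patches.

    hsi: (H, W, B) hyperspectral cube. Returns an (H*W, 28, 28, 3) array with one view per pixel.
    """
    H, W, _ = hsi.shape
    flat = hsi[:, :, :num_bands].reshape(-1, num_bands).astype(np.float64)
    comps = PCA(n_components=3).fit_transform(flat).reshape(H, W, 3)    # X_1 in R^{h x w x 3}
    r = patch // 2
    padded = np.pad(comps, ((r, r), (r, r), (0, 0)), mode="reflect")    # boundary extension
    views = [padded[i:i + patch, j:j + patch, :] for i in range(H) for j in range(W)]
    return np.stack(views)
```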

2.3 Lightweight Multiview Mask Contrastive Network

In this study, to further enhance instance discriminability, after extracting the view features separately with SVE and PVE, we utilize contrastive learning to further optimize the feature representation of the model. The loss function of LMCN is the InfoNCE loss. The structure of LMCN is shown in Fig. 3. First, LMCN takes the features extracted by SVE and PVE as input. Second, we attach a projection head consisting of two fully connected layers (Linear), a batch normalization layer (BN), and an activation function (ReLU) to the class tokens of SVE and PVE, respectively, to enhance performance.
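A minimal sketch of such a projection head is given below; the layer ordering (Linear–BN–ReLU–Linear) and the embedding sizes are assumptions, since the text only lists the components.

```python
import torch.nn as nn

def projection_head(in_dim=192, hidden_dim=256, out_dim=128):
    """Projection head applied to a branch's class token: two Linear layers with BN and ReLU."""
    return nn.Sequential(
        nn.Linear(in_dim, hidden_dim),
        nn.BatchNorm1d(hidden_dim),
        nn.ReLU(inplace=True),
        nn.Linear(hidden_dim, out_dim),
    )
```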

Fig. 3. Structure of LMCN. The feature extraction components of SVE and PVE serve as enhancement functions in LMCN. After obtaining features from different views, LMCN employs contrastive learning to enhance instance discriminability.

Let N be the number of samples used in each training iteration. In the unsupervised training process of LMCN, each iteration thus involves 2N views. Among these 2N views, two views of the same land-cover instance form a positive pair, while two views of different land-cover instances form a negative pair. The contrastive loss of LMCN is defined as follows:

\ell(i,j) = -\log \frac{\exp(\mathrm{sim}(Z_i, Z_j))}{\sum_{k=1}^{2N} \mathbb{1}_{[k \neq i]} \exp(\mathrm{sim}(Z_i, Z_k))} \qquad (4)


where sim(Z_i, Z_j) denotes the cosine similarity between the two feature vectors Z_i and Z_j. In the downstream classification stage, the encoder of the trained autoencoder serves as a feature extractor. Subsequently, all samples of the HSI data are input into the encoder, producing the corresponding feature vectors. Finally, we employ an SVM classifier to classify these feature vectors.
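The loss in Eq. (4) can be sketched in PyTorch as follows, assuming the two branches produce N projected vectors each; cosine similarity is implemented by normalizing the vectors, and no temperature is used since Eq. (4) does not include one. This is a generic NT-Xent-style implementation, not the released code.

```python
import torch
import torch.nn.functional as F

def lmcn_contrastive_loss(z1, z2):
    """InfoNCE-style loss over 2N projected views, a sketch of Eq. (4).

    z1, z2: (N, D) projection-head outputs for the two views of the same N instances.
    """
    N = z1.shape[0]
    z = F.normalize(torch.cat([z1, z2], dim=0), dim=1)    # cosine similarity via dot products
    sim = z @ z.t()                                       # (2N, 2N) pairwise similarities
    sim.fill_diagonal_(float("-inf"))                     # exclude k = i from the denominator
    # The positive of view i is view i + N (and vice versa).
    targets = torch.cat([torch.arange(N, 2 * N), torch.arange(N)]).to(z.device)
    return F.cross_entropy(sim, targets)
```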

3 Experimental Results and Analysis

The proposed method is implemented with the PyTorch library. The experiments were conducted on a personal computer equipped with an Intel i5-13400F processor and an Nvidia GeForce RTX 3060 graphics card. Adam optimizers are used throughout, with a batch size of 128. The image neighborhood size is set to 28 × 28, and the image masking rate is uniformly set to 0.75. Pre-training uses 50% of the samples as unlabeled data. The optimal values of k, λ, and the number of selected bands are 3, 0.0001, and 63, respectively. To achieve more stable convergence, a learning rate of 0.00015 and a weight decay of 0.05 are employed in the pre-training phase of the autoencoder. During the training of LMCN, the learning rate is set to 0.001 and the weight decay to 0.000001. Furthermore, to evaluate the proposed method, we use overall accuracy (OA), average accuracy (AA), and the Kappa coefficient as evaluation metrics.
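For completeness, a small NumPy sketch of the three metrics computed from predicted and true labels is given below; it is a generic implementation, not the authors' evaluation code.

```python
import numpy as np

def oa_aa_kappa(y_true, y_pred, num_classes):
    """Overall accuracy, average accuracy and Cohen's kappa from label vectors."""
    cm = np.zeros((num_classes, num_classes), dtype=np.int64)
    for t, p in zip(y_true, y_pred):
        cm[t, p] += 1                                    # confusion matrix
    total = cm.sum()
    oa = np.trace(cm) / total                            # overall accuracy
    aa = np.mean(np.diag(cm) / np.maximum(cm.sum(axis=1), 1))   # mean per-class accuracy
    expected = (cm.sum(axis=1) * cm.sum(axis=0)).sum() / total ** 2
    kappa = (oa - expected) / (1 - expected)             # chance-corrected agreement
    return oa, aa, kappa
```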

3.1 Datasets

(1) Indian Pines (IP): The IP dataset was acquired by the AVIRIS sensor and covers 145 × 145 pixels. It contains 10,249 labeled samples from 16 land cover classes.

(2) University of Pavia (UP): The UP dataset was acquired by the ROSIS sensor. It consists of 103 spectral bands and includes 9 land cover classes, with a total of 42,776 labeled samples.

(3) Salinas (SA): The SA dataset was collected over the Salinas Valley, California, USA, on a grid of 512 × 217 pixels. It contains 204 contiguous spectral bands covering the visible and infrared ranges and encompasses 16 land cover classes.

3.2 Comparative Experiment

In this section, we compare our proposed LMCN with five HSI classification methods: SVM [12], SSRN [13], DFSL [14], DCFSL [15], and CA-CFSL [16]. For these five methods and our LMCN, the same number of supervised samples is used; specifically, only five labeled samples are randomly selected for each class. The reported results are obtained from ten runs.


Quantitative Evaluation: Tables 1, 2, and 3 present the per-class classification accuracy, overall accuracy (OA), average accuracy (AA), and Kappa coefficient (κ) of the six methods on the three HSI datasets, and Table 4 presents their parameter counts. For the Indian Pines dataset, we achieve higher classification accuracy with a lower parameter count (7.35M), with an OA and AA of 74.93% and 81.12%, respectively. Compared to the state-of-the-art few-shot classification method CA-CFSL [16], our LMCN shows an improvement of 4.30% and 4.92% in OA and κ, respectively. On the University of Pavia dataset, LMCN outperforms the best baseline CA-CFSL by 4.67% in OA and 6.00% in κ. On the Salinas dataset, LMCN outperforms CA-CFSL by 3.67% in OA and 2.94% in κ.

Table 1. Classification results of different methods for the Indian Pines dataset

Class    SVM     SSRN    DFSL    DCFSL   CA-CFSL  LMCN
1        70.73   98.29   90.24   97.07   96.59    91.30
2        37.24   36.91   36.73   46.42   59.95    62.11
3        42.99   52.78   37.78   53.47   58.95    70.24
4        47.63   80.56   56.38   79.91   80.52    92.40
5        62.64   74.96   64.27   73.08   80.65    74.53
6        67.49   80.54   71.82   85.93   88.72    76.71
7        81.30   100.0   98.70   100.0   100.0    100.0
8        58.16   81.90   74.97   81.54   88.03    93.70
9        89.33   99.33   99.33   100.0   98.67    90.00
10       40.66   60.89   54.95   67.15   65.78    63.58
11       37.46   54.30   52.23   60.16   60.03    75.80
12       26.05   51.33   28.61   48.04   52.13    55.64
13       90.45   96.90   95.80   98.05   97.15    89.75
14       66.09   80.19   81.70   85.21   90.55    87.50
15       28.82   66.35   51.92   69.37   78.19    87.56
16       89.20   98.64   95.91   99.77   99.20    87.09
OA(%)    47.07   62.15   56.10   66.38   70.63    74.93
AA(%)    58.52   75.87   68.21   77.82   80.94    81.12
κ        40.90   57.62   50.65   62.16   66.91    71.83

Qualitative Evaluation: Figures 4, 5, and 6 show the false-color images, ground-truth annotations, and visual classification results of the six methods on the three HSI datasets. Overall, compared with the other methods, LMCN generates more accurate classification maps with fewer large-scale misclassifications. This is because the masked autoencoder learns more important


Table 2. Classification results of different methods for the University of Pavia dataset

Class    SVM     SSRN    DFSL    DCFSL   CA-CFSL  LMCN
1        64.78   78.31   72.50   80.28   79.95    79.86
2        63.05   73.30   75.36   87.22   90.83    97.23
3        48.91   67.14   46.99   64.34   66.39    90.51
4        80.90   89.83   92.32   92.20   91.62    58.25
5        98.68   92.77   99.34   99.36   99.73    99.33
6        54.06   69.90   64.64   77.80   77.09    96.48
7        69.11   84.92   69.40   76.51   81.82    96.76
8        68.86   76.36   62.78   63.80   75.72    97.41
9        99.73   99.02   96.82   98.83   96.88    46.88
OA(%)    65.46   76.36   73.44   82.56   85.22    89.89
AA(%)    72.01   81.28   75.57   82.26   84.45    84.75
κ        56.70   69.91   66.15   77.42   80.67    86.67

Table 3. Classification results of different methods for the Salinas dataset

Class    SVM     SSRN    DFSL    DCFSL   CA-CFSL  LMCN
1        98.19   99.50   95.40   98.86   99.35    94.67
2        94.77   98.02   95.15   99.36   99.74    94.92
3        82.14   81.37   83.15   91.17   90.78    95.59
4        98.25   99.66   99.55   99.36   99.52    87.01
5        94.96   95.51   88.81   91.99   92.64    89.76
6        98.81   97.76   99.02   99.45   99.23    94.54
7        99.28   99.58   99.08   98.94   98.98    96.20
8        54.28   71.21   73.54   72.67   73.93    94.36
9        97.98   97.87   97.18   99.15   99.61    98.33
10       76.80   77.11   71.97   86.03   89.36    96.67
11       87.80   94.46   94.64   98.11   98.02    89.41
12       93.77   98.66   93.07   99.01   99.18    91.38
13       95.85   98.92   99.37   99.24   99.52    84.06
14       90.93   96.09   98.61   98.15   98.56    81.30
15       52.33   72.93   65.51   75.38   79.47    89.39
16       82.97   87.02   87.27   92.35   93.74    98.56
OA(%)    79.64   86.79   85.20   88.74   89.93    93.60
AA(%)    87.45   91.61   90.08   93.70   94.48    92.25
κ        77.43   85.35   83.57   87.51   88.82    91.76


Table 4. The number of parameters of different methods (Param (M))

Dataset  SVM   SSRN    DFSL   DCFSL   CA-CFSL  LMCN
UP       –     19.91   3.49   42.59   43.04    7.35
IP       –     34.67   3.49   42.69   43.04    7.35
SA       –     35.22   3.49   42.69   43.04    7.35

Fig. 4. Classification maps for IP dataset. (a) False color image, (b) Ground truth, (c) SVM, (d) SSRN, (e) DFSL, (f) DCFSL, (g) CA-CFSL, (h) Proposed LMCN

Fig. 5. Classification maps for UP dataset. (a) False color image, (b) Ground truth, (c) SVM, (d) SSRN, (e) DFSL, (f) DCFSL, (g) CA-CFSL, (h) Proposed LMCN

features through the task of predicting the reconstruction, resulting in more comprehensive and representative extracted features. Additionally, LMCN exhibits fewer large-scale misclassifications for each class and performs better at class boundaries. This is attributed to the ability of the autoencoder to extract more representative features, while the integration of contrastive learning further enhances instance discrimination.


Fig. 6. Classification maps for SA dataset. (a) False color image, (b) Ground truth, (c) SVM, (d) SSRN, (e) DFSL, (f) DCFSL, (g) CA-CFSL, (h) Proposed LMCN

3.3 Ablation Study

To validate the efficacy of the proposed approach, we investigated the roles of the individual modules within LMCN. The experimental outcomes, presented in Table 5, reveal that the classification accuracy of LMCN with contrastive learning surpasses that of the SVE and PVE modules, which operate without contrastive learning, across all three HSI datasets. This is because the primary function of the SVE and PVE modules is to acquire more representative features, so their independent impact on classification performance is not notably pronounced. The results indicate that fusing representation learning with contrastive learning further heightens instance discriminability and thereby yields improved classification outcomes.

Table 5. OA (%) of LMCN with different modules

Module        UP           IP           SA
SVE           55.82±1.33   33.31±1.39   53.27±0.70
PVE           39.80±1.14   38.77±1.01   53.84±1.06
SVE+PVE+CL    89.89±1.74   74.93±2.15   93.60±1.32

4 Conclusion

The scarcity of training samples, the influence of irrelevant bands, and excessive memory consumption pose challenges to deep learning-based HSIC methods. To address the influence of irrelevant bands, we combine band selection based on graph convolution and self-representation with the construction of views. In order to alleviate the dependence on training samples and enhance instance discriminability, we integrate mask learning with contrastive learning, and demonstrate that this approach leads to improved classification results. The proposed LMCN


method performs well under small-sample conditions and has lower computational costs, which widens its applicability. In the future, we will further investigate the discriminability of features in contrastive learning and the classification of unknown categories. Additionally, most HSIC methods based on contrastive learning currently employ a two-stage strategy. Exploring the design of an end-to-end unsupervised classification network is another valuable research direction.

References

1. Wambugu, N., et al.: Hyperspectral image classification on insufficient-sample and feature learning using deep neural networks: a review. Int. J. Appl. Earth Obs. Geoinf. 105, 102603 (2021)
2. Moharram, M.A., Sundaram, D.M.: Land use and land cover classification with hyperspectral data: a comprehensive review of methods, challenges and future directions. Neurocomputing (2023)
3. Su, Y., Li, X., Yao, J., Dong, C., Wang, Y.: A spectral-spatial feature rotation based ensemble method for imbalanced hyperspectral image classification. IEEE Trans. Geosci. Remote Sens. 61, 1–18 (2023)
4. Mou, L., Ghamisi, P., Zhu, X.X.: Deep recurrent neural networks for hyperspectral image classification. IEEE Trans. Geosci. Remote Sens. 55(7), 3639–3655 (2017)
5. Zhu, M., Jiao, L., Liu, F., Yang, S., Wang, J.: Residual spectral-spatial attention network for hyperspectral image classification. IEEE Trans. Geosci. Remote Sens. 59(1), 449–462 (2020)
6. Guan, P., Lam, E.: Cross-domain contrastive learning for hyperspectral image classification. IEEE Trans. Geosci. Remote Sens. 60, 1–13 (2022)
7. Liu, B., Yu, A., Yu, X., Wang, R., Gao, K., Guo, W.: Deep multiview learning for hyperspectral image classification. IEEE Trans. Geosci. Remote Sens. 59(9), 7758–7772 (2020)
8. Hou, S., Shi, H., Cao, X., Zhang, X., Jiao, L.: Hyperspectral imagery classification based on contrastive learning. IEEE Trans. Geosci. Remote Sens. 60, 1–13 (2022)
9. Wang, M., Gao, F., Dong, J., Li, H., Du, Q.: Nearest neighbor-based contrastive learning for hyperspectral and LiDAR data classification. IEEE Trans. Geosci. Remote Sens. 61, 1–16 (2023)
10. He, K., Chen, X., Xie, S., Li, Y., Dollár, P., Girshick, R.B.: Masked autoencoders are scalable vision learners. In: 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 15979–15988 (2022)
11. Cai, Y., Zhang, Z., Liu, X., Cai, Z.: Efficient graph convolutional self-representation for band selection of hyperspectral image. IEEE J. Sel. Top. Appl. Earth Observ. Remote Sens. 13, 4869–4880 (2020)
12. Melgani, F., Bruzzone, L.: Classification of hyperspectral remote sensing images with support vector machines. IEEE Trans. Geosci. Remote Sens. 42(8), 1778–1790 (2004)
13. Zhong, Z., Li, J., Luo, Z., Chapman, M.: Spectral-spatial residual network for hyperspectral image classification: a 3-D deep learning framework. IEEE Trans. Geosci. Remote Sens. 56(2), 847–858 (2017)
14. Liu, B., Yu, X., Yu, A., Zhang, P., Wan, G., Wang, R.: Deep few-shot learning for hyperspectral image classification. IEEE Trans. Geosci. Remote Sens. 57(4), 2290–2304 (2018)
15. Li, Z., Liu, M., Chen, Y., Xu, Y., Li, W., Du, Q.: Deep cross-domain few-shot learning for hyperspectral image classification. IEEE Trans. Geosci. Remote Sens. 60, 1–18 (2022). https://doi.org/10.1109/TGRS.2021.3057066
16. Wang, W., Liu, F., Liu, J., Xiao, L.: Cross-domain few-shot hyperspectral image classification with class-wise attention. IEEE Trans. Geosci. Remote Sens. 61, 1–18 (2023). https://doi.org/10.1109/TGRS.2023.3239411

Dim Moving Target Detection Based on Imaging Uncertainty Analysis and Hybrid Entropy

Erwei Zhao1,2, Zixu Huang1,2,3, and Wei Zheng1(B)

1 Key Laboratory of Electronics and Information Technology for Space System, National Space Science Center, Chinese Academy of Sciences, Beijing 100190, China
[email protected]
2 School of Computer Science and Technology, University of Chinese Academy of Sciences, Beijing 100190, China
3 School of Fundamental Physics and Mathematical Sciences, Hangzhou Institute for Advanced Study, University of Chinese Academy of Sciences, Hangzhou 310024, China

Abstract. The detection of weak moving targets is the fundamental work for target recognition and plays an important role in precise guidance systems, missile warning and defense systems, space target surveillance and detection tasks, satellite remote sensing systems, and other areas. In low signal-to-noise ratio conditions, the target signal is often submerged by strong background and noise signals, making it difficult to detect. Existing detection methods mainly focus on infrared bright target detection, which cannot solve the problem of darker targets compared to the background. Additionally, these methods do not sufficiently consider the temporal signal characteristics of moving targets in video sequences. To address these difficulties, this paper introduces an uncertainty analysis approach, establishes a target confidence model for weak moving target detection based on the principle of uncertainty in the time domain, and proposes a mixed entropy enhancement detection method based on single-pixel temporal signals. This method effectively suppresses background noise and enhances target signals in gaze scenes, enabling the detection of weak moving dark and bright targets. The proposed method demonstrates good performance in real-life scene experiments.

Keywords: Dim moving target detection · Uncertainty Analysis · Entropy

1 Introduction

Dim moving targets refer to small-sized and low-intensity moving targets detected under complex background conditions in long-range detection processes. The detection of faint and weak moving targets is a fundamental task in target recognition and is a core technology in fields such as precise guidance systems, surveillance and early-warning systems, space target surveillance and detection


tasks, and satellite remote sensing systems [13,19]. The methods for detecting moving targets can generally be divided into single-frame image-based methods and multi-frame image-based methods [15]. Single-frame image-based methods for detecting weak moving targets primarily utilize the spatial information of a single frame image to detect small targets in video sequences. With the assistance of filtering and learning classification algorithms in the field of signal processing, the single-frame image is preprocessed to obtain corresponding candidate target points. Currently, these methods are mostly applied in the detection of weak small targets in infrared imaging. Based on the principle of thermal infrared imaging, small targets often exhibit higher energy than the energy of the background and noise in infrared images. A series of spatial domain detection methods have been developed based on this characteristic. Chen [2] introduced a small target detection method based on local contrast measurement (LCM), which incorporates the theoretical principles of the human visual system into weak small target detection. Based on this, many scholars have conducted research and improvements on this type of method, such as improved local contrast measurement (ILCM) [7], novel local contrast measurement (NLCM) [14], weighted local difference measurement (WLDM) [4], multiscale tri-layer LCM (TLLCM) [8], and Weighted Strengthened Local Contrast Measure (WSLCM) [9]. However, most single-frame detection algorithms require high signal-to-noise ratio for the targets, which is not effective for detecting low-noise moving targets in complex backgrounds. Moreover, single-frame detection algorithms are mostly based on the assumption of target saliency, focusing on the detection of bright targets in dim backgrounds, and ignoring the situation where the environment is brighter than the target in real scenes. Considering the limitations of single-frame detection algorithms, there has been extensive research on weak moving target detection using multi-frame spatial filtering or joint spatio-temporal filtering methods, such as frame differencing [10]. With the popularity of human visual mechanisms in single-frame detection, researchers have started to explore the fusion of temporal information. The spatial-temporal local contrast filter (STLCF) [5] integrates spatio-temporal local difference features to construct a simple and effective spatio-temporal filter. Furthermore, the spatial-temporal local difference measure (STLDM) [6] combines direction information on spatial local difference and the differences between adjacent three frames in the pixel’s temporal domain. Although there are many weak moving target detection methods that combine temporal information from multiple frames, the exploration of temporal information in video sequences is still not fully sufficient, resulting in low detection rates and high false alarm rates for moving targets with very low signal-to-noise ratio. During the process of detecting weak and dim moving targets at ultra-long distances, the target information is contaminated as it is transmitted through the signal acquisition channel due to factors such as atmospheric conditions and noise, as well as limitations and accuracy of the detection equipment. The accuracy of the observed information gradually decreases, making it difficult to accurately determine the position and time of the target from the detection data. The existence of the target becomes uncertain. 
Based on this, this paper

Dim Moving Target Detection

493

introduces an analysis of uncertainty in weak moving target detection, discussing the pollution of the target signal by different stages in the observation and detection chain. An uncertainty model for the target signal is established, and a new hybrid entropy detection technique is proposed, which is applied to single-pixel temporal signals with the aim of fully exploring the temporal characteristics of the target signal. Through real-world scenarios verification, this method demonstrates improved detection performance in gaze scenes, enabling the simultaneous detection of both bright and dim targets.

2 2.1

Theoretical and Method Uncertainty Model

For a specific target signal, the collected observation signal is contaminated by factors such as background noise, point spread effects, detector operating conditions, etc. The uncertainty of the target detection result increases as the number of intermediate links increases [3,16]. Figure 1 illustrates the information transmission process of an optical imaging system.

Fig. 1. The information transfer process of the optical imaging system.

In the temporal signal of target observation, the background signal refers to the grayscale value of the background environment. In gaze scenes, the grayscale value of the background signal remains constant. However, the background environment may experience slight fluctuations in practical observations. The fluctuations in the background signal itself result in an increase in information entropy. After the superposition of the target signal and the background signal, the combined signal passes through the atmospheric medium and the optical system to reach the focal plane of the detector. During this process, point spread effects occur, causing energy attenuation of the target signal. The sampling frequency is related to the observation parameters of the detector. A higher sampling frequency can allow the target to stay in the temporal domain for a longer time, increasing the total power of the target signal in the temporal domain. However, a shorter integration time can lead to a decrease in target energy and an increase in random noise. Figure 1 illustrates the transfer model of uncertainty. Each link in the imaging chain increases the uncertainty in target detection. Based on the

494

E. Zhao et al.

principle of information entropy, we establish a target confidence model for weak moving target detection based on uncertainty.  A1 : Q = HBaakground + Hnoise + Htarget (1) A2 : Q = HBackground + Hnoise where the event A1 indicates the presence of a target and A2 indicates the absence of a target. Q is a measure of the likelihood of the existence of the target signal, HBackground denotes the information entropy of background changes; Hnoise denotes the information entropy caused by noise changes, and Hsignal denotes the information entropy caused by changes in the target signal. Therefore, faint motion detection can be considered as a binary classification problem for the presence or absence of a moving target in the detection region. 2.2

Method Procedure

According to the uncertainty model, targets and non-targets can be theoretically separated. However, it is not possible to individually calculate the uncertainties introduced by targets, backgrounds, and noise signals because we cannot identify them. But the influences of each component are already hidden in the original signals, which enables the measurement of the existence of target signals and the improvement of the signal-to-noise ratio based on the uncertainty model. When calculating information entropy, different states of signals are only processed based on their occurrence probabilities, lacking measurement of grayscale values for signal intensities. Furthermore, this leads to a certain signal enhancement capability of information entropy for temporal domain signal data of moving targets with relatively high signal-to-noise ratios, but limited signal enhancement capability for target signals with low signal-to-noise ratios. Here, we introduce the concept of anti-entropy and extend it to single-pixel temporal signals, proposing a new mixed variant entropy computation to enhance weak transient signals. The specific definition of variational entropy VE is as follow. VE = −

I 

M (i) ∗ log(M 2 (i))

(2)

i=1

where I is the number of signal samples on the pixel; M (i) is the target confidence function, which is used to assign the target confidence to each sampling point. Correspondingly, the variational Anti-entropy VAE is defined as: VAE = −

I 

M (i) ∗ log((1 − M (i))2 )

(3)

i=1

Similar to information entropy, variant entropy and variational anti-entropy also satisfy the principles of maximum entropy and minimum entropy. They stretch the differences between the steady state and chaotic state at a larger

Dim Moving Target Detection

495

scale, preserving and expanding the ability of information entropy to measure the degree of signal disorder. Furthermore, the hybrid entropy HE integrates the characteristics of both of the aforementioned measures and is defined as follows. HE = VAE − VE =−

I 

M 2 (i) ∗ log((

i=1

1 − M (i) 2 ) ) M (i)

(4)

Figure 2 illustrates the functional distribution curves of variational entropy, variational anti-entropy, and hybrid entropy for a binary information source. The hybrid entropy is constructed based on the difference between the other two measures, while retaining the characteristic of having a minimal entropy from variational anti-entropy.

Fig. 2. Three proposed entropy function curves.

In practice, by constructing a sliding window and applying the concept of hybrid entropy to the sequential data of individual pixels, the mixed entropy curve can be obtained to enhance the temporal signal of the moving target. Here, the definition of the confidence allocation function M (i) is provided, which is used to assign confidence to each element within a window of length I. f (i) M (i) = I i=1 f (i)

(5)

where f (i) is the signal value sampled by the detector and i is the i-th sample in the window.

3

Experiment and Analysis

To evaluate the effectiveness of the theoretical framework and methods proposed in this study, we selected several sets of real remote sensing data from different scenarios for validation in this section. Six sets of representative baseline algorithms were used for comparison, including different types of single-frame detection algorithms as well as multi-frame detection algorithms.”

496

3.1

E. Zhao et al.

Evaluation Metrics

Unlike the common visual target detection task, the core of weak target detection is to enhance the target signal to make it significant. Detection probability and false alarm rate are used in experiments as the most important performance metrics for measuring detectors. The detection probability indicates the probability that the detected target is a true target, while the false alarm rate indicates the probability that the detected target is a false target. They are defined as follows: PD =

number of detected true targets number of actual targets

(6)

PF =

number of false pixels detected number of images pixels

(7)

Second, the receiver operating characteristic (ROC) curve and the area under the curves(AUC) are introduced to statistically measure the detection probability and false alarm rate of the detector under different segmentation thresholds τ to better evaluate the enhancement effect and detection performance of the proposed method comprehensively. Here we use the extended 3D-ROC to evaluate the detector performance, which is a 3D curve composed of PD ,PF and τ , and can be further decomposed into three sets of 2D-ROC: (PD , PF ), (PD , τ ) and (PF , τ ). Among them, the 2D ROC curve of (PD , PF ) with its corresponding AUC(D, F) is a commonly used method for comprehensive evaluation of detector detection performance; the 2D ROC curve of (PD , τ ) and AUC(D,τ ) can be used for target detection alone, without interference from PF; while the 2D ROC curve of (PF , τ ) and AUC(F,τ ) play a key role in the evaluation of background suppression level. Based on the three AUCs, five evaluation metrics from [1] are presented to quantitatively evaluate the detector from five perspectives: target detection(TD), background suppression(BS), joint evaluation, overall detection probability (ODP) and signal-to-noise probability ratio (SNPR).

3.2

AUCTD = AUC(D,F) + AUC(D,τ )

(8)

AUCBS = AUC(D,F) − AUC(F,τ )

(9)

AUCTDBS = AUC(D,τ ) − AUC(F,τ )

(10)

AUCODP = AUC(D,τ ) − (1 − AUC(F,τ ) )

(11)

AUCSNPR = AUC(D,τ ) /AUC(F,τ )

(12)

Datasets

The experimental data used in this paper consists of two sets of sequence data from a publicly available dataset [11] and one set of sequence data taken by us. Where Seq. 1 and Seq. 2 are from the public dataset and Seq. 3 is private data. It is worth pointing out that the diverse scenarios with many different types of

Dim Moving Target Detection

497

Fig. 3. Example image, target marked by red box. (a) Seq. 1; (b) Seq. 2; (c) Seq. 3. (Color figure online)

elements included in the three datasets verify the adaptability of the algorithm, and Fig. 3 shows the corresponding example plots. From the perspective of target characteristics, all three data sets are faint motion targets with a size less than 7*7 at a low signal-to-noise ratio, among which Seq. 2 and Seq. 3 have signal-to-noise ratios lower than 3, which are typical of very faint targets. Among them, Seq. 1 and Seq. 3 is the sky background, and Seq. 2 is the ground background. From the target-scene interaction characteristics, Seq. 1 and Seq. 2 are infrared data, and the targets are bright targets with generally higher energy than the background scene brightness, while Seq. 3 is visible data, and the targets are dark targets with lower energy than the background scene brightness. Detailed information is shown in Table 1. Table 1. The details of the sequence data. Sequence Frames Image Resolution

SCR Average

Target Size

Seq.1

50

239*256

3.277

5*5

Cloudy sky

Seq.2

33

228*249

2.450

3*3

Ground with vegetation

Seq.3

319

119*149

1.588

7*7

Sky

3.3

Scene

Comparison of Detection Performance

Based on the characteristics of the test data, six baseline methods were introduced in the experiments to evaluate algorithm performance, taking into account the adaptability of the different methods. Four of them are single-frame detection algorithms: absolute directional mean difference (ADMD) [12], WSLCM [9], local gradient and directional curvature (LGDC) [17], and NRAM [18]; the multi-frame detection algorithms are STLCF [5] and STLDM [6].

Figure 4 and Table 2 show the processing results for Seq. 1. Almost all algorithms detect the target well in this infrared dim-target sequence captured in a typical staring scenario. Specifically, the relatively high signal-to-noise ratio of the targets makes all four spatial-domain detection algorithms perform well, especially LGDC, while our method obtains the optimal AUC(D,F), AUC(F,τ) and AUC_SNPR. Although LGDC performs well in target detection, its relatively weak background rejection, reflected in its AUC(F,τ), becomes the key limitation on its performance, as can also be seen in Fig. 4. Most surprisingly, both STLCF and STLDM, which combine spatio-temporal features, perform poorly compared with the other methods in terms of detection and background suppression, leaving a large amount of unsuppressed clutter.

Fig. 4. Ground truth and the corresponding detection results of the seven methods on Seq. 1.

Table 2. AUC values calculated from Seq. 1. (Single-frame methods: ADMD, WSLCM, NRAM, LGDC; multi-frame methods: STLCF, STLDM, Ours.)

Metric      ADMD      WSLCM     NRAM      LGDC      STLCF     STLDM     Ours
AUC(D,F)    0.948     1.000     0.900     1.000     0.976     0.996     1.000
AUC(D,τ)    0.493     0.531     0.766     0.991     0.709     0.354     0.899
AUC(F,τ)    1.09E-03  5.01E-04  5.21E-04  9.03E-04  2.88E-01  6.13E-03  5.00E-04
AUC_TD      1.441     1.531     1.666     1.991     1.684     1.349     1.899
AUC_BS      0.947     0.999     0.899     0.999     0.688     0.989     1.000
AUC_TDBS    0.494     0.532     0.767     0.992     0.997     0.360     0.899
AUC_ODP     1.492     1.531     1.766     1.990     1.421     1.348     1.898
AUC_SNPR    453.7     1060.8    1471.1    1097.6    2.5       57.7      1797.2

The results of the seven methods on Seq. 2 are shown in Fig. 5 and Table 3. Compared with Seq. 1, the target signal-to-noise ratio of Seq. 2 is significantly lower, so only three of the seven algorithms maintain a detection rate worth discussing: NRAM, STLCF, and ours. Among them, STLCF has the highest AUC(F,τ), consistent with the results on Seq. 1, where its poorer background suppression constrains its performance. NRAM, which relies on low-rank matrix decomposition, achieves the best background suppression, but its target detection capability, reflected in AUC(D,F), becomes its performance bottleneck. Overall, our algorithm balances target detection and background suppression and achieves the highest overall performance.

Fig. 5. Ground truth and the corresponding detection results of the seven methods on Seq. 2.

Table 3. AUC values calculated from Seq. 2. (Single-frame methods: ADMD, WSLCM, NRAM, LGDC; multi-frame methods: STLCF, STLDM, Ours.)

Metric      ADMD     WSLCM    NRAM     LGDC    STLCF   STLDM   Ours
AUC(D,F)    0.49     0.77     0.74     0.98    0.97    0.95    1.00
AUC(D,τ)    0.01     0.02     0.33     0.07    0.49    0.10    0.23
AUC(F,τ)    6.1E-03  5.9E-03  5.5E-03  0.009   0.112   0.020   0.007
AUC_TD      0.49     0.79     1.07     1.05    1.46    1.05    1.22
AUC_BS      0.48     0.77     0.73     0.97    0.86    0.93    0.99
AUC_TDBS    0.01     0.02     0.34     0.08    0.60    0.11    0.23
AUC_ODP     1.00     1.01     1.33     1.06    1.38    1.08    1.22
AUC_SNPR    0.82     2.96     60.35    7.69    4.37    4.86    30.83

Finally, tests were performed on Seq. 3; the enhancement results and AUC values are shown in Fig. 6 and Table 4. The target in Seq. 3 is a dark target with an extremely low signal-to-noise ratio, which renders many detection algorithms designed for bright infrared targets completely ineffective, especially the single-frame methods ADMD, WSLCM, LGDC, and NRAM. As seen in Fig. 6, STLCF and our method are better at enhancing the trajectory along which the target passes. Combined with Table 4, the background suppression of STLCF is not much worse than that of the other detectors, owing to the relatively simple background of Seq. 3, but its AUC(F,τ) is still roughly five times that of our proposed method. Taken together with the results on Seq. 1 and Seq. 2, our method consistently maintains a high detection rate and good background suppression.

Fig. 6. Ground truth and the corresponding detection results of the seven methods on Seq. 3.

To better demonstrate the overall detection performance of the seven algorithms on the three sets of test data, the 2D-ROC curves of (PD, PF) are presented in Fig. 7. Intuitively, our method almost always achieves a higher detection rate at the same false alarm rate.

Table 4. AUC values calculated from Seq. 3. (Single-frame methods: ADMD, WSLCM, NRAM, LGDC; multi-frame methods: STLCF, STLDM, Ours.)

Metric      ADMD    WSLCM   NRAM    LGDC    STLCF   STLDM   Ours
AUC(D,F)    0.480   0.504   0.500   0.533   0.992   0.887   0.999
AUC(D,τ)    0.018   0.009   0.005   0.010   0.389   0.185   0.219
AUC(F,τ)    0.016   0.007   0.005   0.008   0.097   0.062   0.020
AUC_TD      0.498   0.513   0.505   0.543   1.380   1.072   1.217
AUC_BS      0.464   0.497   0.495   0.525   0.894   0.825   0.979
AUC_TDBS    0.034   0.016   0.010   0.018   0.486   0.247   0.238
AUC_ODP     1.002   1.002   1.000   1.002   1.291   1.123   1.199
AUC_SNPR    1.135   1.354   1.000   1.211   3.986   2.988   11.187


Fig. 7. (a)–(c) 2D-ROC curves of (PD, PF) for Seq. 1–3.
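For reference, a minimal matplotlib sketch (not the authors' plotting code; method names, score maps and the pixel-level PD/PF are placeholders and assumptions) of how (PD, PF) curves such as those in Fig. 7 could be drawn for several detectors on one sequence:

import numpy as np
import matplotlib.pyplot as plt

def plot_pd_pf(results, gt_mask, num_thresholds=200):
    # results: dict mapping a method name to its 2-D detection score map.
    g = gt_mask.ravel().astype(bool)
    for name, scores in results.items():
        s = scores.ravel().astype(float)
        taus = np.linspace(s.min(), s.max(), num_thresholds)
        pd = [(s[g] >= t).mean() for t in taus]     # detection probability per threshold
        pf = [(s[~g] >= t).mean() for t in taus]    # false alarm rate per threshold
        plt.plot(pf, pd, label=name)
    plt.xlabel("False alarm rate P_F")
    plt.ylabel("Detection probability P_D")
    plt.legend()
    plt.show()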

4 Conclusion

During remote target monitoring, strong background and noise signals make the target signal extremely “weak” and “small” and difficult to detect with traditional computer vision methods. Weak-signal detection techniques therefore seek the target signal by deeply mining the characteristics of target motion in the time domain of video sequences. In this paper we introduce a single-pixel (image-element) time-domain analysis that copes with low spatial-domain signal-to-noise ratios and complex environmental interference under a smooth background, and that can detect both bright and dark targets. However, the detector is assumed by default to receive the target signal, whereas at low signal-to-noise ratios the probability of receiving the photon signal during target observation is uncertain; this situation requires further study and discussion. In addition, the reliance on an in-depth analysis of the temporal behaviour of the signal also limits the applicability of the proposed technique: its core operates on a single image element, which requires the imaging process to take place in a relatively stable scene to remain effective. In future work, the restriction to staring scenes could be compensated by background alignment, and the method could be combined with other spatial-domain algorithms to further extend the reach of time-domain uncertainty analysis.

References

1. Chang, C.I.: An effective evaluation tool for hyperspectral target detection: 3D receiver operating characteristic curve analysis. IEEE Trans. Geosci. Remote Sens. 59(6), 5131–5153 (2020)
2. Chen, C.P., Li, H., Wei, Y., Xia, T., Tang, Y.Y.: A local contrast method for small infrared target detection. IEEE Trans. Geosci. Remote Sens. 52(1), 574–581 (2013)
3. Datla, R.V., Kessel, R., Smith, A.W., Kacker, R.N., Pollock, D.: Uncertainty analysis of remote sensing optical sensor data: guiding principles to achieve metrological consistency. Int. J. Remote Sens. 31(4), 867–880 (2010)
4. Deng, H., Sun, X., Liu, M., Ye, C., Zhou, X.: Small infrared target detection based on weighted local difference measure. IEEE Trans. Geosci. Remote Sens. 54(7), 4204–4214 (2016)
5. Deng, L., Zhu, H., Tao, C., Wei, Y.: Infrared moving point target detection based on spatial-temporal local contrast filter. Infrared Phys. Technol. 76, 168–173 (2016)
6. Du, P., Hamdulla, A.: Infrared moving small-target detection using spatial-temporal local difference measure. IEEE Geosci. Remote Sens. Lett. 17(10), 1817–1821 (2019)
7. Han, J., Ma, Y., Zhou, B., Fan, F., Liang, K., Fang, Y.: A robust infrared small target detection algorithm based on human visual system. IEEE Geosci. Remote Sens. Lett. 11(12), 2168–2172 (2014)
8. Han, J., Moradi, S., Faramarzi, I., Liu, C., Zhang, H., Zhao, Q.: A local contrast method for infrared small-target detection utilizing a tri-layer window. IEEE Geosci. Remote Sens. Lett. 17(10), 1822–1826 (2019)
9. Han, J., et al.: Infrared small target detection based on the weighted strengthened local contrast measure. IEEE Geosci. Remote Sens. Lett. 18(9), 1670–1674 (2020)
10. He, L., Ge, L.: Camshift target tracking based on the combination of inter-frame difference and background difference. In: 2018 37th Chinese Control Conference (CCC), pp. 9461–9465. IEEE (2018)
11. Hui, B., et al.: A dataset for infrared detection and tracking of dim-small aircraft targets under ground/air background. China Sci. Data 5(3), 291–302 (2020)
12. Moradi, S., Moallem, P., Sabahi, M.F.: Fast and robust small infrared target detection using absolute directional mean difference algorithm. Sig. Process. 177, 107727 (2020)
13. Pan, Z., Liu, S., Fu, W.: A review of visual moving target tracking. Multimed. Tools Appl. 76, 16989–17018 (2017)
14. Qin, Y., Li, B.: Effective infrared small target detection utilizing a novel local contrast method. IEEE Geosci. Remote Sens. Lett. 13(12), 1890–1894 (2016)
15. Rawat, S.S., Verma, S.K., Kumar, Y.: Review on recent development in infrared small target detection algorithms. Procedia Comput. Sci. 167, 2496–2505 (2020)
16. Viallefont-Robinet, F., Léger, D.: Improvement of the edge method for on-orbit MTF measurement. Opt. Express 18(4), 3531–3545 (2010)
17. Wan, M., Xu, Y., Huang, Q., Qian, W., Gu, G., Chen, Q.: Single frame infrared small target detection based on local gradient and directional curvature. In: Optoelectronic Imaging and Multimedia Technology VIII, vol. 11897, pp. 99–107. SPIE (2021)
18. Zhang, L., Peng, L., Zhang, T., Cao, S., Peng, Z.: Infrared small target detection via non-convex rank approximation minimization joint l2,1 norm. Remote Sens. 10(11), 1821 (2018)
19. Zhao, M., Li, W., Li, L., Hu, J., Ma, P., Tao, R.: Single-frame infrared small-target detection: a survey. IEEE Geosci. Remote Sens. Mag. 10(2), 87–119 (2022)

Author Index

B Bian, Jiarong 279 C Cai, Senlin 405 Cai, Wei 100 Cai, Yaqi 151 Cao, Cong 394 Cao, Yiwen 267 Cao, Zhe 343 Chan, Sixian 52 Chang, Xiaobin 27 Che, Rui 306 Chen, Jiawei 355 Chen, Jiaxin 255 Chen, Li 52, 64 Chen, Xiaotang 241 Chen, Xiuwei 27 Cheng, Gong 368 Chi, Kaikai 52, 64 D Dang, Qianlong 279 Ding, Xinghao 405, 418, 430 F Feng, Mao 87 Feng, Tian 306 Feng, Xiaokun 241 G Gao, Hua 52, 64 Gao, Jiechao 100 Ge, Dewu 199 Ge, Yun 455 H He, Zhichao 212 Hong, Tingfeng 306 Hu, Cong 163, 187, 212 Hu, Shiyu 241 Huang, Hailiang 405

Huang, Kaiqi 241 Huang, Teng 394 Huang, Xian 330 Huang, Yue 405, 418, 430 Huang, Zhongling 442 Huang, Zixu 491 Huo, Xinyuan 318 J Jia, Jieru 227 Jia, Qi 151 Jia, Shucai 318 Jiang, Nanfeng 267 Jiang, Yong 3, 15 K Kong, Shengjiang 139

L Lang, Aobo 394 Leng, Lu 455 Li, Jibang 87 Li, Kai 113, 126 Li, Min 39 Li, Shengqi 75 Li, Shishen 465 Li, Wenbo 368 Li, Yuanbo 187 Li, Zhaokui 465 Li, Zhengwei 380 Lian, Jiawei 199 Liang, Lu 175 Liang, Xuan 318 Liang, Yixiong 75 Lin, Chuyang 405, 418 Lin, Feng 292 Lin, Weiming 292 Liu, Cuiwei 465 Liu, Lin 163 Liu, Qing 75 Liu, Yu 151 Liu, Zhenyu 199


Lu, Yao 318 Lv, Hui 355 Lv, Zhengpeng 430

M Ma, Xiaowen 306 Ma, Zongfang 478 Meng, Yuebo 478 N Ning, Sa 3, 15 Niu, Pengyuan 318 P Peng, Bo 330 Peng, Haokang 455 Peng, Jinjia 113, 126 Pu, Nan 151 Q Qi, Rundong 3, 15 Qiu, Huaijun 465 R Rao, Chaofan 368 Ren, Junchi 292 Ruan, Qiuqi 227 S Shan, Zhe 478 Shang, Hailong 318 Shen, Fei 292 Sun, Zhenjiang 187 T Tan, Jinyu 343 Tian, Zhongyao 126 W Wang, Da-Han 199, 267 Wang, Heng 478 Wang, Jianlin 330 Wang, Weimin 151 Wang, Weiwei 139 Wang, Xili 380 Wang, Xinyu 306 Wang, Xuechang 355 Wang, Yifan 330 Wei, Bo 394

Wei, Huanchun 394 Weng, Weijie 292 Wu, Jiahua 199 Wu, Ming 255 Wu, Wenhao 139 Wu, Xiao-Jun 163 Wu, Yun 199, 267 X Xie, Binzhu 255 Xie, Jianhong 175 Xie, Xingxing 368 Xiong, Huagang 394 Xu, Lixin 343 Xu, Ying 418 Xue, Mingliang 318 Y Yan, Haotian 255 Yang, Gang 430 Yang, Jing 279 Yang, Meng 87 Yang, Wenyu 3 Yang, Xiaoyu 455 Yao, Xue 39 Z Zhan, Tao 279 Zhang, Chaojun 75 Zhang, Chuang 255 Zhang, Erlei 279 Zhang, Haiqing 394 Zhang, Qi 343 Zhang, Ruiheng 343 Zhang, Shuorui 227 Zhang, Wei 306 Zhang, Xinrui 113 Zhang, Xiqiu 394 Zhang, Yuwei 3 Zhao, Erwei 491 Zheng, Wei 491 Zhong, Yijin 418 Zhou, Tong 330 Zhou, Wenjun 330 Zhou, Yi 52, 64 Zhu, Minghao 478 Zhu, Shunzhi 199, 267 Zhuang, Yihan 442 Zhuang, Yihong 430