Neural Information Processing: 30th International Conference, ICONIP 2023, Changsha, China, November 20–23, 2023, Proceedings, Part V (Lecture Notes in Computer Science) 9819980720, 9789819980727

The six-volume set LNCS 14447–14452 constitutes the refereed proceedings of the 30th International Conference on Neural Information Processing, ICONIP 2023, held in Changsha, China, in November 2023.


Language: English · Pages: 628 [621] · Year: 2023


Table of contents :
Preface
Organization
Contents – Part V
Applications
Text to Image Generation with Conformer-GAN
1 Introduction
2 Related Work
2.1 GAN-Based Text-to-Image Synthesis
2.2 Transformer-Based Text-to-Image Synthesis
3 Method
3.1 Model Overview
3.2 Conformer Block
3.3 Loss Functions
4 Experiments
4.1 Quantitative Results
4.2 Qualitative Results
4.3 Ablation Studies
5 Conclusions
References
MGFNet: A Multi-granularity Feature Fusion and Mining Network for Visible-Infrared Person Re-identification
1 Introduction
2 Related Work
3 Methodology
3.1 Backbone Network
3.2 Local Residual Spatial Attention
3.3 Multi-granularity Feature Fusion and Mining Module
3.4 Loss Function
4 Experiments
4.1 Datasets and Settings
4.2 Comparison with the State-of-the-Art Methods
5 Conclusion
References
Isomorphic Dual-Branch Network for Non-homogeneous Image Dehazing and Super-Resolution
1 Introduction
2 Related Works
2.1 Dual-Branch Network for Dehazing
2.2 Super-Resolution in Image Dehazing
3 Proposed Method
3.1 Network Architecture
3.2 Loss Function
4 Experiment
4.1 Experimental Settings
4.2 Results and Discussion
4.3 Ablation Study
5 Conclusion
References
Hi-Stega: A Hierarchical Linguistic Steganography Framework Combining Retrieval and Generation
1 Introduction
2 Related Works
3 The Proposed Framework
4 Methodology of Our Implementation
4.1 Steganographic Text Generation
4.2 Prompt Learning Module (PL Module)
4.3 Keyword Guide Module (KG Module)
5 Experiments and Analysis
5.1 Dataset
5.2 Experimental Setting
5.3 Imperceptibility Analysis
5.4 Anti-steganalysis Ability
6 Conclusion
References
Effi-Seg: Rethinking EfficientNet Architecture for Real-Time Semantic Segmentation
1 Introduction
2 Related Work
3 Proposed Method
3.1 Encoder Design
3.2 Decoder Design
4 Experiment
4.1 Datasets, Implementation Details, and Performance Metrics
4.2 Ablation Study
4.3 Model Evaluation
5 Conclusion
References
Quantum Autoencoder Frameworks for Network Anomaly Detection
1 Introduction
2 Related Work
3 Autoencoders and Quantum Autoencoders
3.1 Autoencoders
3.2 Quantum Autoencoders
4 Our Novel Quantum Autoencoder Frameworks
4.1 Experimental Setup
4.2 Implementation
5 Experimental Results and Discussion
6 Conclusion
References
Spatially-Aware Human-Object Interaction Detection with Cross-Modal Enhancement
1 Introduction
2 Related Work
2.1 Two-Stage Methods
2.2 One-Stage and DETR-Based Methods
3 Method
3.1 Overview
3.2 Prompt-Enhanced Spatial Modeling
3.3 Spatially-Aware Interaction Recognition
3.4 Training and Inference
4 Experiments
4.1 Experiment Setting
4.2 Comparison to State-of-the-Art
4.3 Ablation Study
4.4 Generalizability of PESM Modules
4.5 Qualitative Analysis
5 Conclusion
References
Intelligent Trajectory Tracking Control of Unmanned Parafoil System Based on SAC Optimized LADRC
1 Introduction
2 A Brief Introduction to an 8-DOF Parafoil Model
3 Controller Design of Trajectory Tracking Control for Parafoil System
3.1 Trajectory Tracking Guidance Law
3.2 LADRC Controller
4 SAC Optimized LADRC
5 Simulation Results
6 Conclusion
References
CATS: Connection-Aware and Interaction-Based Text Steganalysis in Social Networks
1 Introduction
2 Proposed Method
2.1 Task Modeling
2.2 Internal Feature Augmentation: Extracting Linguistic Features
2.3 External Feature Augmentation: Extracting Social Networks Graph Features
2.4 Internal and External Hybrid Augmentation: Feature Interaction and Comprehensive Discriminator
3 Experiments and Analysis
3.1 Experiment Settings
3.2 Evaluation Results and Discussion
4 Conclusion
References
Syntax Tree Constrained Graph Network for Visual Question Answering
1 Introduction
2 Related Work
3 The STCGN Model
3.1 Network Architecture
3.2 Syntax-Aware Tree Convolution
3.3 Phrase-Aware Entity Message Passing
3.4 Top-Down Attention-Based Answer Prediction
3.5 Loss Function
4 Experiments
4.1 Experimental Settings
4.2 Performance Comparison
4.3 Ablation Study
4.4 Parameter Sensitivity
4.5 Attention Visualization
5 Conclusion
A Experimental Settings
A.1 Datasets
A.2 Baselines
A.3 Settings
References
CKR-Calibrator: Convolution Kernel Robustness Evaluation and Calibration
1 Introduction
2 Related Work
2.1 Robustness of Deep Learning Based Model
2.2 Deep Feature Attribution
3 CKR-Calibrator
3.1 Preliminary Analysis
3.2 Convolution Kernel Robustness Evaluation
3.3 Convolution Kernel Robustness Calibration
4 Experiments
4.1 The Effect of CKR-Calibrator for SOTA CNN Classifiers
4.2 The Effect of CKR-Calibrator Combined with SOTA Method
4.3 The Ablation Study
5 Conclusion
References
SGLP-Net: Sparse Graph Label Propagation Network for Weakly-Supervised Temporal Action Localization
1 Introduction
2 Related Work
3 Method
3.1 Problem Definition
3.2 Feature Extractor
3.3 Multi-scale Temporal Feature Embedding
3.4 Action Label Propagation
4 Experiments
4.1 Experimental Setup
4.2 Comparison with the State of the Art
4.3 Ablation Studies on THUMOS'14
4.4 Qualitative Results
5 Conclusion
References
VFIQ: A Novel Model of ViT-FSIMc Hybrid Siamese Network for Image Quality Assessment
1 Introduction
2 Related Work
2.1 FR-IQA Metrics
2.2 Vision Transformer (ViT)
2.3 FSIMc
3 Model and Methodology
3.1 Feature Extraction Module
3.2 Evaluation Module
3.3 Hybrid Module
3.4 Mathematical Formulation
4 Experiments
4.1 Datasets
4.2 Implementation Details
4.3 Results
4.4 Reference Image Recovery
5 Ablation of the ViT-FSIMc Architecture
6 Conclusion
References
Spiking Reinforcement Learning for Weakly-Supervised Anomaly Detection
1 Introduction
2 Related Work
2.1 Weakly-Supervised Tabular Anomaly Detection
2.2 Reinforcement Learning
2.3 Spiking Neural Network
3 The Proposed Approach
3.1 Problem Statement
3.2 Algorithm Framework
3.3 SRL-Based Agent
3.4 Anomaly-Biased Environment
4 Experiments
4.1 Overall Performance
4.2 Anomaly Pollution Test
4.3 Ablation Study
5 Conclusions
References
Resource-Aware DNN Partitioning for Privacy-Sensitive Edge-Cloud Systems
1 Introduction
2 Related Works
3 Design of awareSL
3.1 Resource-Aware Model Partitioning
3.2 Privacy Preservation via Distance Correlation
4 Evaluation
4.1 Experimental Setup
4.2 Resource-Aware Computation Partitioning
4.3 Balancing Model Accuracy and Privacy Preservation
4.4 Inference Data Privacy Protection
5 Conclusion
References
A Frequency Reconfigurable Multi-mode Printed Antenna
1 Introduction
2 Design and Analysis of Antenna
3 Results and Discussion
4 Conclusion
References
Multi-view Contrastive Learning for Knowledge-Aware Recommendation
1 Introduction
2 Related Work
2.1 Knowledge-Aware Recommendation
2.2 Contrastive Learning for Recommender Systems
3 Preliminaries
4 Methodology
4.1 Multi Views Generation
4.2 Multi-view Contrastive Learning
4.3 Model Predict
4.4 Multi-task Training
5 Experiment
5.1 Datasets and Experiment Settings
5.2 Performance Comparison (RQ1)
5.3 Ablation Studies and Sensitivity Analysis (RQ2)
5.4 Benefits of MCKR in Alleviating Data Noise Effect (RQ3)
6 Conclusion
References
PYGC: A PinYin Language Model Guided Correction Model for Chinese Spell Checking
1 Introduction
2 Related Work
3 Methods
3.1 Problem Formulation
3.2 Architecture
3.3 Multitask Learning
3.4 Confusion Set Based Pre-training Strategy
3.5 Predict Inference
4 Experiments
4.1 Experimental Setup
4.2 Main Results
4.3 Isolation Test
4.4 Ablation Study
4.5 Study of the Pinyin Language Model
4.6 Case Study
5 Conclusion
References
Empirical Analysis of Multi-label Classification on GitterCom Using BERT and ML Classifiers
1 Introduction
2 Related Work
3 Dataset Description
4 Research Methodology
4.1 Feature Selection
4.2 Data Balancing
4.3 Machine Learning Classifiers
5 BERT Architecture
6 Comparative Analysis
6.1 Feature Selection
6.2 Effect of Oversampling
6.3 Performance of Classifiers
6.4 BERT Layer Performance Comparison (RQ4)
7 Conclusion
References
A Lightweight Safety Helmet Detection Network Based on Bidirectional Connection Module and Polarized Self-attention
1 Introduction
2 Related Work
3 Approach
3.1 Lightweight Feature Extraction Networks
3.2 Polarized Self-attention
3.3 Structure of BCM-PAN
4 Experiment
4.1 Experimental Details
4.2 Datasets
4.3 Results and Discussions
5 Conclusion
References
Direct Inter-Intra View Association for Light Field Super-Resolution
1 Introduction
2 Related Work
2.1 Traditional Methods for LFSR
2.2 Learning-Based Methods for LFSR
3 Architecture of 4DTNet
3.1 Overall Architecture
3.2 Structure of LF Transformer
4 Experiments
4.1 Implementation Details
4.2 Comparison with State-of-The-Art Methods
4.3 Ablation Study
5 Conclusion
References
Responsive CPG-Based Locomotion Control for Quadruped Robots
1 Introduction
2 Control Framework
2.1 CPG Model Based on Gradient System
2.2 CPG Signal Mapping
2.3 Vestibular Sensory Feedback
2.4 Differential Evolution Algorithm
3 Simulations, Comparisons and Analysis
3.1 Simulation of the Quadruped Robot with RG-CPG
3.2 Slope Experiments
4 Prototype Experiments
4.1 Technical Specifications
4.2 Experiments
5 Conclusion
References
Vessel Behavior Anomaly Detection Using Graph Attention Network
1 Introduction
2 Related Work
2.1 Deep Learning-Based Anomaly Detection
2.2 Vessel Behavior Anomaly Detection
3 Proposed Method
3.1 Problem Definition
3.2 Graph Attention Module
3.3 Joint Detection Module
4 Experiments
4.1 Experimental Setup
4.2 Ablation Study
4.3 Comparative Experiments
5 Conclusion and Future Work
References
TASFormer: Task-Aware Image Segmentation Transformer
1 Introduction
2 Related Works
3 TASFormer Architectures
4 Binary Intersection over Union
4.1 Motivation
4.2 Comparison with Similar Existing Metrics
5 Experimental Results
5.1 Datasets
5.2 Training Setup
5.3 Output Adaptation
5.4 Results on ADE20K
6 Conclusion
References
Unsupervised Joint-Semantics Autoencoder Hashing for Multimedia Retrieval
1 Introduction
2 Related Work
2.1 Supervised Hashing
2.2 Unsupervised Hashing
3 The Proposed Method
3.1 Notations
3.2 Architecture
3.3 Objective Function
3.4 Extensions
3.5 Computational Complexity Analysis
4 Experiments
4.1 Datasets
4.2 Baselines and Evaluation Metric
4.3 Implementation Detail
4.4 Retrieval Accuracy Comparison
4.5 Parameter Sensitivity Analysis
4.6 Ablation Experiments
5 Conclusion
References
TKGR-RHETNE: A New Temporal Knowledge Graph Reasoning Model via Jointly Modeling Relevant Historical Event and Temporal Neighborhood Event Context
1 Introduction
2 Related Work
2.1 Neural Hawkes Process-Based Method
2.2 Graph Neural Network-Based Method
3 The Proposed Model
3.1 Encoder for Relevant Historical Event Context
3.2 Aggregator for Temporal Neighborhood Event Context
3.3 Conditional Intensity
3.4 Model Training and Prediction
4 Experiments and Analysis
4.1 Experimental Settings and Implementation Details
4.2 Experimental Results and Discussion
4.3 Ablation Study
5 Conclusion
References
High-Resolution Self-attention with Fair Loss for Point Cloud Segmentation
1 Introduction
2 Related Work
3 Method
3.1 Segmentation Pipeline
3.2 High Resolution Self-attention Module
3.3 Fair Loss Function
4 Experiments and Evaluation
4.1 Semantic Segmentation in S3DIS
4.2 Ablation Study
5 Conclusion and Future Work
References
Transformer-Based Video Deinterlacing Method
1 Introduction
2 Related Work
3 Method
3.1 Overview
3.2 Feature Extractor (FE)
3.3 De-Transformer (DeT)
3.4 Residual DenseNet (RD)
4 Experiments and Results
4.1 Dataset and Training
4.2 Loss Function
4.3 Results and Comparison
4.4 Ablation Experiments
4.5 More Comparisons
5 Conclusion
References
SCME: A Self-contrastive Method for Data-Free and Query-Limited Model Extraction Attack
1 Introduction
2 Related Work
3 Preliminary
3.1 Adversarial Attack
3.2 Contrastive Learning
4 Methodology
4.1 Overview
4.2 Intra- and Inter-class Diverse
4.3 Model-Independent Boundary Example
4.4 Objective Function
5 Experiments
5.1 Setup
5.2 Attack Performance
5.3 Evaluation on Data Diversity
5.4 Evaluation on Boundary Value
5.5 Ablation Study
6 Conclusion
References
CSEC: A Chinese Semantic Error Correction Dataset for Written Correction
1 Introduction
2 Related Work
3 Creation of Chinese Semantic Error Dataset
4 Methodology
5 Experiments
5.1 Implementation Details
5.2 Experimental Results
5.3 Ablation Study
6 Conclusion
References
Contrastive Kernel Subspace Clustering
1 Introduction
2 Related Work
3 Proposed Methodology
4 Experiments
5 Conclusions
References
UATR: An Uncertainty Aware Two-Stage Refinement Model for Targeted Sentiment Analysis
1 Introduction
2 Related Work
2.1 Targeted Sentiment Analysis
2.2 Model Uncertainty
3 Approach
3.1 Uncertainty Aware Loss
3.2 Uncertainty Aware Two-Stage Refinement Model
4 Experiment
4.1 Dataset
4.2 Baseline Model
4.3 Settings
4.4 Main Results
5 Analysis
5.1 The Distribution of Misclassified Samples
5.2 Ablation Study
6 Conclusion
References
AttIN: Paying More Attention to Neighborhood Information for Entity Typing in Knowledge Graphs
1 Introduction
2 Related Work
2.1 Embedding-Based Methods
2.2 GCN-Based Methods
2.3 Transformer-Based Methods
3 Our Method
3.1 Notations
3.2 Model Structure
3.3 Pooling Method
3.4 Optimization Approach
4 Experiments
4.1 Datasets
4.2 Experiment Setup
4.3 Performance Comparison
4.4 Ablation Study
5 Conclusion and Future Work
References
Text-Based Person re-ID by Saliency Mask and Dynamic Label Smoothing
1 Introduction
2 Approach
2.1 Overview
2.2 Baseline Model
2.3 Saliency Mask Modelling
2.4 Dynamic Label Smoothing
3 Experiments
3.1 Datasets
3.2 Evaluation Metrics and Implementation Details
3.3 Comparison with State-of-the-art Methods
3.4 Ablation Study
3.5 Hyperparameter Exploration Experiments
3.6 Qualitative Results
4 Conclusion
References
Robust Multi-view Spectral Clustering with Auto-encoder for Preserving Information
1 Introduction
2 Related Work
2.1 Notations
2.2 Multi-view Spectral Clustering
3 Methodology
3.1 Graph Construction
3.2 Multi-view Graph Fusion
3.3 Preserving Information by Auto-encoder
3.4 Overall Learning Framework
4 Optimization
4.1 Subproblem of G
4.2 Subproblem of Ev
4.3 Subproblem of Mv
4.4 Subproblem of F
4.5 Complexity Analysis
5 Experiments
5.1 Comparison Algorithms and Evaluation Metrics
5.2 Experimental Setup
5.3 Experimental Results and Analysis
5.4 Ablation Study
5.5 Convergence Analysis
6 Conclusion and Future Work
References
Learnable Color Image Zero-Watermarking Based on Feature Comparison
1 Introduction
2 Related Work
3 Methods
3.1 Generation of Zero-Watermarking
3.2 Extraction of Zero-Watermarking
3.3 Feature Extractor
3.4 Combined Noisy Layer
4 Experiments
4.1 Implementation Details
4.2 Metrics
4.3 Results
4.4 Ablation Study
5 Conclusion
References
P-IoU: Accurate Motion Prediction Based Data Association for Multi-object Tracking
1 Introduction
2 Related Work
2.1 Multi-object Tracking
2.2 Motion Models
2.3 Tracking Clues
3 Method
3.1 P-IoU
3.2 Motion Prediction
3.3 PoseTracker
4 Experiments
4.1 Experimental Settings
4.2 Evaluation Results
4.3 Ablation Studies
5 Conclusion
References
WCA-VFnet: A Dedicated Complex Forest Smoke Fire Detector
1 Introduction
2 Related Work
3 Method
3.1 Dataset and Annotation
3.2 WCA-VFnet
3.3 Improved Offset and Summation
3.4 The Computational Cost of the Weld C-A Module
3.5 Weld C-A and VFnet
4 Experiment
4.1 Comparison with Advanced Models
4.2 Ablation Study on WCA-VFnet
5 Conclusion
References
Label Selection Algorithm Based on Ant Colony Optimization and Reinforcement Learning for Multi-label Classification
1 Introduction
2 Basic Concepts
3 The Proposed Method
3.1 Label Selection Algorithm Based on Ant Colony Optimization and Reinforcement Learning
3.2 Label Recovery Operator Based on Binary Neural Network
4 Experiments
4.1 Benchmark Data Sets and Evaluation Metrics
4.2 Experimental Settings
4.3 Results and Discussion
5 Conclusion
References
Reversible Data Hiding Based on Adaptive Embedding with Local Complexity
1 Introduction
2 Proposed Method
2.1 Image Pre-processing
2.2 Local Complexity Calculation
2.3 Calculation of Prediction Errors
2.4 The Data Embedding Procedure
2.5 The Data Extraction and Recovery Procedure
3 Experimental Results and Analyses
3.1 The Visual Quality of Stego-Image
3.2 Comparison of the ISPs
3.3 Comparison of Embedding Capacity
3.4 Comparison of PSNR
4 Conclusions
References
Generalized Category Discovery with Clustering Assignment Consistency
1 Introduction
2 Related Work
3 Method
3.1 Preliminaries
3.2 Semi-supervised Representation Learning
3.3 Label Assignment with Louvain Algorithm
4 Experiments
4.1 Experimental Setup
4.2 Comparison with State-of-the-Arts
4.3 Ablation Study
5 Conclusion
References
CInvISP: Conditional Invertible Image Signal Processing Pipeline
1 Introduction
2 Methodology
2.1 The Framework of CInvISP
2.2 The Prediction of Conditional Information
2.3 The Conditional Invertible Block
2.4 Loss Function
3 Experimental Results on the Accuracy of Mappings
3.1 Experimental Settings
3.2 Comparison with SOTA Methods
3.3 Ablation Study
4 The Applications of CInvISP in Low-Level Vision Tasks
4.1 Image Denoising
4.2 Image Retouching
5 Conclusion
References
Ignored Details in Eyes: Exposing GAN-Generated Faces by Sclera
1 Introduction
2 Related Works
2.1 GAN-Generated Face Generation
2.2 Physical/Physiological-Based GAN-Generated Face Detection
2.3 Attention Mechanism
3 Method
3.1 Sclera Segmentation
3.2 Residual Attention Network
4 Experiments
4.1 Comparison with the State of the Arts
4.2 How Do Eye Region Features Affect the Proposed Method?
4.3 Robustness Analysis
4.4 Qualitative Result
5 Conclusion
References
A Developer Recommendation Method Based on Disentangled Graph Convolutional Network
1 Introduction
2 Related Work
2.1 Crowdsourcing Software Development
2.2 Graph Neural Network
2.3 Disentangle Representation Learning
3 Method
3.1 Problem Description
3.2 Feature Representation
3.3 Graph Disentangle Module
4 Experiment and Results
4.1 Data Preparation
4.2 Parameter Setting
4.3 Comparison Method
4.4 Evaluation Indicators
4.5 Results Comparison
5 Conclusion
References
Novel Method for Radar Echo Target Detection
1 Introduction
2 Related Work
3 Proposed Method
3.1 Clutter Suppression and Feature Extraction Net
3.2 Target Detection Net
3.3 Loss Function Design
4 Experiments
4.1 Datasets Description
4.2 Performance Metrics
4.3 Experimental Details
4.4 Ablation Experiments
4.5 Comparative Experiment
5 Conclusions
References
Correction to: High-Resolution Self-attention with Fair Loss for Point Cloud Segmentation
Correction to: Chapter 27 in: B. Luo et al. (Eds.): Neural Information Processing, LNCS 14451, https://doi.org/10.1007/978-981-99-8073-4_27
Author Index


LNCS 14451

Biao Luo · Long Cheng · Zheng-Guang Wu · Hongyi Li · Chaojie Li (Eds.)

Neural Information Processing 30th International Conference, ICONIP 2023 Changsha, China, November 20–23, 2023 Proceedings, Part V

Lecture Notes in Computer Science

Founding Editors: Gerhard Goos, Juris Hartmanis

Editorial Board Members:
Elisa Bertino, Purdue University, West Lafayette, IN, USA
Wen Gao, Peking University, Beijing, China
Bernhard Steffen, TU Dortmund University, Dortmund, Germany
Moti Yung, Columbia University, New York, NY, USA


The series Lecture Notes in Computer Science (LNCS), including its subseries Lecture Notes in Artificial Intelligence (LNAI) and Lecture Notes in Bioinformatics (LNBI), has established itself as a medium for the publication of new developments in computer science and information technology research, teaching, and education. LNCS enjoys close cooperation with the computer science R & D community, the series counts many renowned academics among its volume editors and paper authors, and collaborates with prestigious societies. Its mission is to serve this international community by providing an invaluable service, mainly focused on the publication of conference and workshop proceedings and postproceedings. LNCS commenced publication in 1973.

Biao Luo · Long Cheng · Zheng-Guang Wu · Hongyi Li · Chaojie Li Editors

Neural Information Processing 30th International Conference, ICONIP 2023 Changsha, China, November 20–23, 2023 Proceedings, Part V

Editors Biao Luo Central South University Changsha, China

Long Cheng Chinese Academy of Sciences Beijing, China

Zheng-Guang Wu Zhejiang University Hangzhou, China

Hongyi Li Guangdong University of Technology Guangzhou, China

Chaojie Li UNSW Sydney Sydney, NSW, Australia

ISSN 0302-9743 ISSN 1611-3349 (electronic) Lecture Notes in Computer Science ISBN 978-981-99-8072-7 ISBN 978-981-99-8073-4 (eBook) https://doi.org/10.1007/978-981-99-8073-4 © The Editor(s) (if applicable) and The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2024, corrected publication 2024 This work is subject to copyright. All rights are reserved by the Publisher, whether the whole or part of the material is concerned, specifically the rights of translation, reprinting, reuse of illustrations, recitation, broadcasting, reproduction on microfilms or in any other physical way, and transmission or information storage and retrieval, electronic adaptation, computer software, or by similar or dissimilar methodology now known or hereafter developed. The use of general descriptive names, registered names, trademarks, service marks, etc. in this publication does not imply, even in the absence of a specific statement, that such names are exempt from the relevant protective laws and regulations and therefore free for general use. The publisher, the authors, and the editors are safe to assume that the advice and information in this book are believed to be true and accurate at the date of publication. Neither the publisher nor the authors or the editors give a warranty, expressed or implied, with respect to the material contained herein or for any errors or omissions that may have been made. The publisher remains neutral with regard to jurisdictional claims in published maps and institutional affiliations. This Springer imprint is published by the registered company Springer Nature Singapore Pte Ltd. The registered company address is: 152 Beach Road, #21-01/04 Gateway East, Singapore 189721, Singapore Paper in this product is recyclable.

Preface

Welcome to the 30th International Conference on Neural Information Processing (ICONIP 2023) of the Asia-Pacific Neural Network Society (APNNS), held in Changsha, China, November 20–23, 2023. The mission of the Asia-Pacific Neural Network Society is to promote active interactions among researchers, scientists, and industry professionals who are working in neural networks and related fields in the Asia-Pacific region. APNNS has Governing Board Members from 13 countries/regions – Australia, China, Hong Kong, India, Japan, Malaysia, New Zealand, Singapore, South Korea, Qatar, Taiwan, Thailand, and Turkey. The society’s flagship annual conference is the International Conference of Neural Information Processing (ICONIP).

The ICONIP conference aims to provide a leading international forum for researchers, scientists, and industry professionals who are working in neuroscience, neural networks, deep learning, and related fields to share their new ideas, progress, and achievements.

ICONIP 2023 received 1274 papers, of which 256 papers were accepted for publication in Lecture Notes in Computer Science (LNCS), representing an acceptance rate of 20.09% and reflecting the increasingly high quality of research in neural networks and related areas. The conference focused on four main areas, i.e., “Theory and Algorithms”, “Cognitive Neurosciences”, “Human-Centered Computing”, and “Applications”. All the submissions were rigorously reviewed by the conference Program Committee (PC), comprising 258 PC members, and they ensured that every paper had at least two high-quality single-blind reviews. In fact, 5270 reviews were provided by 2145 reviewers. On average, each paper received 4.14 reviews.

We would like to take this opportunity to thank all the authors for submitting their papers to our conference, and our great appreciation goes to the Program Committee members and the reviewers who devoted their time and effort to our rigorous peer-review process; their insightful reviews and timely feedback ensured the high quality of the papers accepted for publication. We hope you enjoyed the research program at the conference.

October 2023

Biao Luo Long Cheng Zheng-Guang Wu Hongyi Li Chaojie Li

Organization

Honorary Chair

Weihua Gui, Central South University, China

Advisory Chairs

Jonathan Chan, King Mongkut’s University of Technology Thonburi, Thailand
Zeng-Guang Hou, Chinese Academy of Sciences, China
Nikola Kasabov, Auckland University of Technology, New Zealand
Derong Liu, Southern University of Science and Technology, China
Seiichi Ozawa, Kobe University, Japan
Kevin Wong, Murdoch University, Australia

General Chairs

Tingwen Huang, Texas A&M University at Qatar, Qatar
Chunhua Yang, Central South University, China

Program Chairs

Biao Luo, Central South University, China
Long Cheng, Chinese Academy of Sciences, China
Zheng-Guang Wu, Zhejiang University, China
Hongyi Li, Guangdong University of Technology, China
Chaojie Li, University of New South Wales, Australia

Technical Chairs

Xing He, Southwest University, China
Keke Huang, Central South University, China
Huaqing Li, Southwest University, China
Qi Zhou, Guangdong University of Technology, China

Local Arrangement Chairs

Wenfeng Hu, Central South University, China
Bei Sun, Central South University, China

Finance Chairs

Fanbiao Li, Central South University, China
Hayaru Shouno, University of Electro-Communications, Japan
Xiaojun Zhou, Central South University, China

Special Session Chairs

Hongjing Liang, University of Electronic Science and Technology, China
Paul S. Pang, Federation University, Australia
Qiankun Song, Chongqing Jiaotong University, China
Lin Xiao, Hunan Normal University, China

Tutorial Chairs

Min Liu, Hunan University, China
M. Tanveer, Indian Institute of Technology Indore, India
Guanghui Wen, Southeast University, China

Publicity Chairs

Sabri Arik, Istanbul University-Cerrahpaşa, Turkey
Sung-Bae Cho, Yonsei University, South Korea
Maryam Doborjeh, Auckland University of Technology, New Zealand
El-Sayed M. El-Alfy, King Fahd University of Petroleum and Minerals, Saudi Arabia
Ashish Ghosh, Indian Statistical Institute, India
Chuandong Li, Southwest University, China
Weng Kin Lai, Tunku Abdul Rahman University of Management & Technology, Malaysia
Chu Kiong Loo, University of Malaya, Malaysia
Qinmin Yang, Zhejiang University, China
Zhigang Zeng, Huazhong University of Science and Technology, China

Publication Chairs

Zhiwen Chen, Central South University, China
Andrew Chi-Sing Leung, City University of Hong Kong, China
Xin Wang, Southwest University, China
Xiaofeng Yuan, Central South University, China

Secretaries

Yun Feng, Hunan University, China
Bingchuan Wang, Central South University, China

Webmasters

Tianmeng Hu, Central South University, China
Xianzhe Liu, Xiangtan University, China

Program Committee Rohit Agarwal Hasin Ahmed Harith Al-Sahaf Brad Alexander Mashaan Alshammari Sabri Arik Ravneet Singh Arora Zeyar Aung Monowar Bhuyan Jingguo Bi Xu Bin Marcin Blachnik Paul Black


UiT The Arctic University of Norway, Norway Gauhati University, India Victoria University of Wellington, New Zealand University of Adelaide, Australia Independent Researcher, Saudi Arabia Istanbul University, Turkey Block Inc., USA Khalifa University of Science and Technology, UAE Umeå University, Sweden Beijing University of Posts and Telecommunications, China Northwestern Polytechnical University, China Silesian University of Technology, Poland Federation University, Australia


Anoop C. S. Ning Cai Siripinyo Chantamunee Hangjun Che Wei-Wei Che Huabin Chen Jinpeng Chen Ke-Jia Chen Lv Chen Qiuyuan Chen Wei-Neng Chen Yufei Chen Long Cheng Yongli Cheng Sung-Bae Cho Ruikai Cui Jianhua Dai Tao Dai Yuxin Ding Bo Dong Shanling Dong Sidong Feng Yuming Feng Yun Feng Junjie Fu Yanggeng Fu Ninnart Fuengfusin Thippa Reddy Gadekallu Ruobin Gao Tom Gedeon Kam Meng Goh Zbigniew Gomolka Shengrong Gong Xiaodong Gu Zhihao Gu Changlu Guo Weixin Han

Govt. Engineering College, India Beijing University of Posts and Telecommunications, China Walailak University, Thailand City University of Hong Kong, China Qingdao University, China Nanchang University, China Beijing University of Posts & Telecommunications, China Nanjing University of Posts and Telecommunications, China Shandong Normal University, China Tencent Technology, China South China University of Technology, China Tongji University, China Institute of Automation, China Fuzhou University, China Yonsei University, South Korea Australian National University, Australia Hunan Normal University, China Tsinghua University, China Harbin Institute of Technology, China Xi’an Jiaotong University, China Zhejiang University, China Monash University, Australia Chongqing Three Gorges University, China Hunan University, China Southeast University, China Fuzhou University, China Kyushu Institute of Technology, Japan VIT University, India Nanyang Technological University, Singapore Curtin University, Australia Tunku Abdul Rahman University of Management and Technology, Malaysia University of Rzeszow, Poland Changshu Institute of Technology, China Fudan University, China Shanghai Jiao Tong University, China Budapest University of Technology and Economics, Hungary Northwestern Polytechnical University, China


Xing He Akira Hirose Yin Hongwei Md Zakir Hossain Zengguang Hou Lu Hu Zeke Zexi Hu He Huang Junjian Huang Kaizhu Huang David Iclanzan Radu Tudor Ionescu Asim Iqbal Syed Islam Kazunori Iwata Junkai Ji Yi Ji Canghong Jin Xiaoyang Kang Mutsumi Kimura Masahiro Kohjima Damian Kordos Marek Kraft Lov Kumar Weng Kin Lai Xinyi Le Bin Li Hongfei Li Houcheng Li Huaqing Li Jianfeng Li Jun Li Kan Li Peifeng Li Wenye Li Xiangyu Li Yantao Li Yaoman Li Yinlin Li Yuan Li

Southwest University, China University of Tokyo, Japan Huzhou Normal University, China Curtin University, Australia Chinese Academy of Sciences, China Jiangsu University, China University of Sydney, Australia Soochow University, China Chongqing University of Education, China Duke Kunshan University, China Sapientia University, Romania University of Bucharest, Romania Cornell University, USA Edith Cowan University, Australia Hiroshima City University, Japan Shenzhen University, China Soochow University, China Zhejiang University, China Fudan University, China Ryukoku University, Japan NTT, Japan Rzeszow University of Technology, Poland Pozna´n University of Technology, Poland NIT Kurukshetra, India Tunku Abdul Rahman University of Management & Technology, Malaysia Shanghai Jiao Tong University, China University of Science and Technology of China, China Xinjiang University, China Chinese Academy of Sciences, China Southwest University, China Southwest University, China Nanjing Normal University, China Beijing Institute of Technology, China Soochow University, China Chinese University of Hong Kong, China Beijing Jiaotong University, China Chongqing University, China Chinese University of Hong Kong, China Chinese Academy of Sciences, China Academy of Military Science, China


Yun Li Zhidong Li Zhixin Li Zhongyi Li Ziqiang Li Xianghong Lin Yang Lin Huawen Liu Jian-Wei Liu Jun Liu Junxiu Liu Tommy Liu Wen Liu Yan Liu Yang Liu Yaozhong Liu Yong Liu Yubao Liu Yunlong Liu Zhe Liu Zhen Liu Zhi-Yong Liu Ma Lizhuang Chu-Kiong Loo Vasco Lopes Hongtao Lu Wenpeng Lu Biao Luo Ye Luo Jiancheng Lv Yuezu Lv Huifang Ma Jinwen Ma Jyoti Maggu Adnan Mahmood Mufti Mahmud Krishanu Maity Srimanta Mandal Wang Manning

Nanjing University of Posts and Telecommunications, China University of Technology Sydney, Australia Guangxi Normal University, China Beihang University, China University of Tokyo, Japan Northwest Normal University, China University of Sydney, Australia Zhejiang Normal University, China China University of Petroleum, China Chengdu University of Information Technology, China Guangxi Normal University, China Australian National University, Australia Chinese University of Hong Kong, China Taikang Insurance Group, China Guangdong University of Technology, China Australian National University, Australia Heilongjiang University, China Sun Yat-sen University, China Xiamen University, China Jiangsu University, China Chinese Academy of Sciences, China Chinese Academy of Sciences, China Shanghai Jiao Tong University, China University of Malaya, Malaysia Universidade da Beira Interior, Portugal Shanghai Jiao Tong University, China Qilu University of Technology, China Central South University, China Tongji University, China Sichuan University, China Beijing Institute of Technology, China Northwest Normal University, China Peking University, China Thapar Institute of Engineering and Technology Patiala, India Macquarie University, Australia University of Padova, Italy Indian Institute of Technology Patna, India DA-IICT, India Fudan University, China


Piotr Milczarski Malek Mouhoub Nankun Mu Wenlong Ni Anupiya Nugaliyadde Toshiaki Omori Babatunde Onasanya Manisha Padala Sarbani Palit Paul Pang Rasmita Panigrahi Kitsuchart Pasupa Dipanjyoti Paul Hu Peng Kebin Peng Dawid Połap Zhong Qian Sitian Qin Toshimichi Saito Fumiaki Saitoh Naoyuki Sato Chandni Saxena Jiaxing Shang Lin Shang Jie Shao Yin Sheng Liu Sheng-Lan Hayaru Shouno Gautam Srivastava Jianbo Su Jianhua Su Xiangdong Su Daiki Suehiro Basem Suleiman Ning Sun Shiliang Sun Chunyu Tan Gouhei Tanaka Maolin Tang


Lodz University of Technology, Poland University of Regina, Canada Chongqing University, China Jiangxi Normal University, China Murdoch University, Australia Kobe University, Japan University of Ibadan, Nigeria Indian Institute of Science, India Indian Statistical Institute, India Federation University, Australia Giet University, India King Mongkut’s Institute of Technology Ladkrabang, Thailand Ohio State University, USA Jiujiang University, China University of Texas at San Antonio, USA Silesian University of Technology, Poland Soochow University, China Harbin Institute of Technology at Weihai, China Hosei University, Japan Chiba Institute of Technology, Japan Future University Hakodate, Japan Chinese University of Hong Kong, China Chongqing University, China Nanjing University, China University of Science and Technology of China, China Huazhong University of Science and Technology, China Dalian University of Technology, China University of Electro-Communications, Japan Brandon University, Canada Shanghai Jiao Tong University, China Institute of Automation, China Inner Mongolia University, China Kyushu University, Japan University of New South Wales, Australia Shandong Normal University, China East China Normal University, China Anhui University, China University of Tokyo, Japan Queensland University of Technology, Australia


Shu Tian Shikui Tu Nancy Victor Petra Vidnerová Shanchuan Wan Tao Wan Ying Wan Bangjun Wang Hao Wang Huamin Wang Hui Wang Huiwei Wang Jianzong Wang Lei Wang Lin Wang Shi Lin Wang Wei Wang Weiqun Wang Xiaoyu Wang Xin Wang Xin Wang Yan Wang Yan Wang Yonghua Wang Yongyu Wang Zhenhua Wang Zi-Peng Wang Hongxi Wei Guanghui Wen Guoguang Wen Ka-Chun Wong Anna Wróblewska Fengge Wu Ji Wu Wei Wu Yue Wu Likun Xia Lin Xiao

University of Science and Technology Beijing, China Shanghai Jiao Tong University, China Vellore Institute of Technology, India Institute of Computer Science, Czech Republic University of Tokyo, Japan Beihang University, China Southeast University, China Soochow University, China Shanghai University, China Southwest University, China Nanchang Institute of Technology, China Southwest University, China Ping An Technology, China National University of Defense Technology, China University of Jinan, China Shanghai Jiao Tong University, China Shenzhen MSU-BIT University, China Chinese Academy of Sciences, China Tokyo Institute of Technology, Japan Southwest University, China Southwest University, China Chinese Academy of Sciences, China Sichuan University, China Guangdong University of Technology, China JD Logistics, China Northwest A&F University, China Beijing University of Technology, China Inner Mongolia University, China Southeast University, China Beijing Jiaotong University, China City University of Hong Kong, China Warsaw University of Technology, Poland Institute of Software, Chinese Academy of Sciences, China Tsinghua University, China Inner Mongolia University, China Shanghai Jiao Tong University, China Capital Normal University, China Hunan Normal University, China


Qiang Xiao Hao Xiong Dongpo Xu Hua Xu Jianhua Xu Xinyue Xu Yong Xu Ngo Xuan Bach Hao Xue Yang Xujun Haitian Yang Jie Yang Minghao Yang Peipei Yang Zhiyuan Yang Wangshu Yao Ming Yin Qiang Yu Wenxin Yu Yun-Hao Yuan Xiaodong Yue Paweł Zawistowski Hui Zeng Wang Zengyunwang Daren Zha Zhi-Hui Zhan Baojie Zhang Canlong Zhang Guixuan Zhang Jianming Zhang Li Zhang Wei Zhang Wenbing Zhang Xiang Zhang Xiaofang Zhang Xiaowang Zhang


Huazhong University of Science and Technology, China Macquarie University, Australia Northeast Normal University, China Tsinghua University, China Nanjing Normal University, China Hong Kong University of Science and Technology, China Beijing Institute of Technology, China Posts and Telecommunications Institute of Technology, Vietnam University of New South Wales, Australia Chongqing Jiaotong University, China Chinese Academy of Sciences, China Shanghai Jiao Tong University, China Chinese Academy of Sciences, China Chinese Academy of Science, China City University of Hong Kong, China Soochow University, China Guangdong University of Technology, China Tianjin University, China Southwest University of Science and Technology, China Yangzhou University, China Shanghai University, China Warsaw University of Technology, Poland Southwest University of Science and Technology, China Hunan First Normal University, China Institute of Information Engineering, China South China University of Technology, China Chongqing Three Gorges University, China Guangxi Normal University, China Chinese Academy of Science, China Changsha University of Science and Technology, China Soochow University, China Southwest University, China Yangzhou University, China National University of Defense Technology, China Soochow University, China Tianjin University, China


Xinglong Zhang Dongdong Zhao Xiang Zhao Xu Zhao Liping Zheng Yan Zheng Baojiang Zhong Guoqiang Zhong Jialing Zhou Wenan Zhou Xiao-Hu Zhou Xinyu Zhou Quanxin Zhu Yuanheng Zhu Xiaotian Zhuang Dongsheng Zou

National University of Defense Technology, China Wuhan University of Technology, China National University of Defense Technology, China Shanghai Jiao Tong University, China Hefei University of Technology, China Kyushu University, Japan Soochow University, China Ocean University of China, China Nanjing University of Science and Technology, China PCN&CAD Center, China Institute of Automation, China Jiangxi Normal University, China Nanjing Normal University, China Chinese Academy of Sciences, China JD Logistics, China Chongqing University, China

Contents – Part V

Applications

Text to Image Generation with Conformer-GAN . . . 3
Zhiyu Deng, Wenxin Yu, Lu Che, Shiyu Chen, Zhiqiang Zhang, Jun Shang, Peng Chen, and Jun Gong

MGFNet: A Multi-granularity Feature Fusion and Mining Network for Visible-Infrared Person Re-identification . . . 15
BaiSheng Xu, HaoHui Ye, and Wei Wu

Isomorphic Dual-Branch Network for Non-homogeneous Image Dehazing and Super-Resolution . . . 29
Wenqing Kuang, Zhan Li, Ruijin Guan, Weijun Yuan, Ruting Deng, and Yanquan Chen

Hi-Stega: A Hierarchical Linguistic Steganography Framework Combining Retrieval and Generation . . . 41
Huili Wang, Zhongliang Yang, Jinshuai Yang, Yue Gao, and Yongfeng Huang

Effi-Seg: Rethinking EfficientNet Architecture for Real-Time Semantic Segmentation . . . 55
Tanmay Singha, Duc-Son Pham, and Aneesh Krishna

Quantum Autoencoder Frameworks for Network Anomaly Detection . . . 69
Moe Hdaib, Sutharshan Rajasegarar, and Lei Pan

Spatially-Aware Human-Object Interaction Detection with Cross-Modal Enhancement . . . 83
Gaowen Liu, Huan Liu, Caixia Yan, Yuyang Guo, Rui Li, and Sizhe Dang

Intelligent Trajectory Tracking Control of Unmanned Parafoil System Based on SAC Optimized LADRC . . . 97
Yuemin Zheng, Jin Tao, Qinglin Sun, Jinshan Yang, Hao Sun, Mingwei Sun, and Zengqiang Chen

CATS: Connection-Aware and Interaction-Based Text Steganalysis in Social Networks . . . 109
Kaiyi Pang, Jinshuai Yang, Yue Gao, Minhao Bai, Zhongliang Yang, Minghu Jiang, and Yongfeng Huang


Syntax Tree Constrained Graph Network for Visual Question Answering . . . 122
Xiangrui Su, Qi Zhang, Chongyang Shi, Jiachang Liu, and Liang Hu

CKR-Calibrator: Convolution Kernel Robustness Evaluation and Calibration . . . 137
Yijun Bei, Jinsong Geng, Erteng Liu, Kewei Gao, Wenqi Huang, and Zunlei Feng

SGLP-Net: Sparse Graph Label Propagation Network for Weakly-Supervised Temporal Action Localization . . . 149
Xiaoyao Wu and Yonghong Song

VFIQ: A Novel Model of ViT-FSIMc Hybrid Siamese Network for Image Quality Assessment . . . 162
Junrong Huang and Chenwei Wang

Spiking Reinforcement Learning for Weakly-Supervised Anomaly Detection . . . 175
Ao Jin, Zhichao Wu, Li Zhu, Qianchen Xia, and Xin Yang

Resource-Aware DNN Partitioning for Privacy-Sensitive Edge-Cloud Systems . . . 188
Aolin Ding, Amin Hass, Matthew Chan, Nader Sehatbakhsh, and Saman Zonouz

A Frequency Reconfigurable Multi-mode Printed Antenna . . . 202
Yanbo Wen, Huiwei Wang, Menggang Chen, Yawei Shi, Huaqing Li, and Chuandong Li

Multi-view Contrastive Learning for Knowledge-Aware Recommendation . . . 211
Ruiguo Yu, Zixuan Li, Mankun Zhao, Wenbin Zhang, Ming Yang, and Jian Yu

PYGC: A PinYin Language Model Guided Correction Model for Chinese Spell Checking . . . 224
Haoping Chen and Xukai Wang

Empirical Analysis of Multi-label Classification on GitterCom Using BERT and ML Classifiers . . . 240
Bathini Sai Akash, Lov Kumar, Vikram Singh, Anoop Kumar Patel, and Aneesh Krishna

A Lightweight Safety Helmet Detection Network Based on Bidirectional Connection Module and Polarized Self-attention . . . 253
Tianyang Li, Hanwen Xu, and Jinxu Bai


Direct Inter-Intra View Association for Light Field Super-Resolution . . . 265
Da Yang, Hao Sheng, Shuai Wang, Rongshan Chen, and Zhang Xiong

Responsive CPG-Based Locomotion Control for Quadruped Robots . . . 279
Yihui Zhang, Cong Hu, Binbin Qiu, and Ning Tan

Vessel Behavior Anomaly Detection Using Graph Attention Network . . . 291
Yuanzhe Zhang, Qiqiang Jin, Maohan Liang, Ruixin Ma, and Ryan Wen Liu

TASFormer: Task-Aware Image Segmentation Transformer . . . 305
Dmitry Yudin, Aleksandr Khorin, Tatiana Zemskova, and Darya Ovchinnikova

Unsupervised Joint-Semantics Autoencoder Hashing for Multimedia Retrieval . . . 318
Yunfei Chen, Jun Long, Yinan Li, Yanrui Wu, and Zhan Yang

TKGR-RHETNE: A New Temporal Knowledge Graph Reasoning Model via Jointly Modeling Relevant Historical Event and Temporal Neighborhood Event Context . . . 331
Jinze Sun, Yongpan Sheng, Ling Zhan, and Lirong He

High-Resolution Self-attention with Fair Loss for Point Cloud Segmentation . . . 344
Qiyuan Liu, Jinzheng Lu, Qiang Li, and Bingsen Huang

Transformer-Based Video Deinterlacing Method . . . 357
Chao Song, Haidong Li, Dong Zheng, Jie Wang, Zhaoyi Jiang, and Bailin Yang

SCME: A Self-contrastive Method for Data-Free and Query-Limited Model Extraction Attack . . . 370
Renyang Liu, Jinhong Zhang, Kwok-Yan Lam, Jun Zhao, and Wei Zhou

CSEC: A Chinese Semantic Error Correction Dataset for Written Correction . . . 383
Wenxin Huang, Xiao Dong, Meng-xiang Wang, Guangya Liu, Jianxing Yu, Huaijie Zhu, and Jian Yin

Contrastive Kernel Subspace Clustering . . . 399
Qian Zhang, Zhao Kang, Zenglin Xu, and Hongguang Fu


UATR: An Uncertainty Aware Two-Stage Refinement Model for Targeted Sentiment Analysis . . . 411
Xiaoting Guo, Qingsong Yin, Wei Yu, Qingbing Ji, Wei Xiao, Tao Chang, and Xiaodong Wang

AttIN: Paying More Attention to Neighborhood Information for Entity Typing in Knowledge Graphs . . . 430
Yingtao Wu, Weiwen Zhang, Hongbin Zhang, Huanlei Chen, and Lianglun Cheng

Text-Based Person re-ID by Saliency Mask and Dynamic Label Smoothing . . . 443
Yonghua Pang, Canlong Zhang, Zhixin Li, and Liaojie Hu

Robust Multi-view Spectral Clustering with Auto-encoder for Preserving Information . . . 455
Xiaojie Wang, Ye Liu, Hongshan Pu, Yuchen Mou, and Chaoxiong Lin

Learnable Color Image Zero-Watermarking Based on Feature Comparison . . . 472
Baowei Wang, Changyu Dai, and Yufeng Wu

P-IoU: Accurate Motion Prediction Based Data Association for Multi-object Tracking . . . 484
Xinya Wu and Jinhua Xu

WCA-VFnet: A Dedicated Complex Forest Smoke Fire Detector . . . 497
Xingran Guo, Haizheng Yu, and Xueying Liao

Label Selection Algorithm Based on Ant Colony Optimization and Reinforcement Learning for Multi-label Classification . . . 509
Yuchen Pan, Yulin Xue, Jun Li, and Jianhua Xu

Reversible Data Hiding Based on Adaptive Embedding with Local Complexity . . . 522
Chao Wang, Yicheng Zou, Yaling Zhang, Ju Zhang, Jichuan Chen, Bin Yang, and Yu Zhang

Generalized Category Discovery with Clustering Assignment Consistency . . . 535
Xiangli Yang, Xinglin Pan, Irwin King, and Zenglin Xu

CInvISP: Conditional Invertible Image Signal Processing Pipeline . . . 548
Duanling Guo, Kan Chang, Yahui Tang, Mingyang Ling, and Minghong Li

Ignored Details in Eyes: Exposing GAN-Generated Faces by Sclera . . . 563
Tong Zhang, Anjie Peng, and Hui Zeng


A Developer Recommendation Method Based on Disentangled Graph Convolutional Network . . . 575
Yan Lu, Junwei Du, Lijun Sun, Jinhuan Liu, Lei Guo, Xu Yu, Daobo Sun, and Haohao Yu

Novel Method for Radar Echo Target Detection . . . 586
Zhiwei Chen, Dechang Pi, Junlong Wang, and Mingtian Ping

Correction to: High-Resolution Self-attention with Fair Loss for Point Cloud Segmentation . . . C1
Qiyuan Liu, Jinzheng Lu, Qiang Li, and Bingsen Huang

Author Index . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 599

Applications

Text to Image Generation with Conformer-GAN

Zhiyu Deng¹, Wenxin Yu¹(B), Lu Che¹(B), Shiyu Chen¹, Zhiqiang Zhang¹, Jun Shang¹, Peng Chen², and Jun Gong³

¹ Southwest University of Science and Technology, Mianyang, China
[email protected]
² Chengdu Hongchengyun Technology Co., Ltd., Chengdu, China
³ Southwest Automation Research Institute, Mianyang, China

Abstract. Text-to-image generation (T2I) has been a popular research field in recent years, and its goal is to generate corresponding photorealistic images through natural language text descriptions. Existing T2I models are mostly based on generative adversarial networks, but it is still very challenging to guarantee the semantic consistency between a given textual description and generated natural images. To address this problem, we propose a concise and practical novel framework, Conformer-GAN. Specifically, we propose the Conformer block, consisting of the Convolutional Neural Network (CNN) and Transformer branches. The CNN branch is used to generate images conditionally from noise. The Transformer branch continuously focuses on the relevant words in natural language descriptions and fuses the sentence and word information to guide the CNN branch for image generation. Our approach can better merge global and local representations to improve the semantic consistency between textual information and synthetic images. Importantly, our Conformer-GAN can generate natural and realistic 512 × 512 images. Extensive experiments on the challenging public benchmark datasets CUB bird and COCO demonstrate that our method outperforms recent state-of-the-art methods both in terms of generated image quality and text-image semantic consistency.

Keywords: Text-to-Image Synthesis · Computer Vision · Deep Learning · Generative Adversarial Networks

1 Introduction

In recent years, text-to-image generation (T2I) has received increasing attention in the field of computer vision [5,17,31]; its primary goal is to generate photorealistic, high-quality images from a given natural-language description.

This research is supported by the National Key Research and Development Program from the Ministry of Science and Technology of the PRC (No. 2021ZD0110600), the Sichuan Science and Technology Program (No. 2022ZYD0116), the Sichuan Provincial M. C. Integration Office Program, and the IEDA Laboratory of SWUST.

© The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2024. B. Luo et al. (Eds.): ICONIP 2023, LNCS 14451, pp. 3–14, 2024. https://doi.org/10.1007/978-981-99-8073-4_1


Fig. 1. Examples of images generated by our Conformer-GAN sized with 512 × 512 on the test set of CUB dataset. (Color figure online)

Since Generative Adversarial Networks (GANs) [3] were introduced to solve the T2I task, existing T2I models have made significant progress [4,7,10]. StackGAN [29] generates photorealistic images by stacking several GANs and provides text information to the Generator by concatenating global text vectors with the input noise. AttnGAN [25] introduces a cross-modal attention mechanism to generate images with more detail by focusing on word-level information in a given text description. DF-GAN [20] proposes a single-stage structure to achieve high-resolution image generation through a generator and discriminator pair.

However, the T2I task is still challenging because (1) it involves cross-modal generation, which makes it inherently difficult, and (2) it must not only synthesize high-quality images but also ensure the semantic consistency between text and image.

In the T2I task, the Generator in most previous works [2,11,15,20,25] applies either a Convolutional Neural Network (CNN) framework or a Transformer framework, but not the strengths of both: CNNs pay more attention to local details, whereas Transformers pay more attention to global information. A good representation should be both global and local, yet each of CNN and Transformer neglects one of the two. Inspired by [14], we combine the advantages of CNN and Transformer and propose Conformer-GAN. The Conformer-GAN model can encode features in both global and local dimensions and use global information, such as the text description, to guide the generation results. The Conformer block consists of a CNN branch and a Transformer branch, which together comprise a comprehensive combination of local convolutional blocks, self-attention modules, and Multilayer Perceptron (MLP) units. We design a linear layer as a bridge that extracts the global features from the Transformer branch and, combined with affine transformations, guides the image synthesis of the CNN branch. Since the CNN and Transformer branches tend to capture different levels of features, the bridge is inserted into each Conformer block to narrow the semantic divergence between them. In addition, the affine transformations continuously act on the image features in the same way. This fusion method can significantly enhance the global perception of the CNN branch's local features.
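The bridge-and-affine fusion described above can be pictured with a short PyTorch-style sketch. This is an illustrative reading of the paragraph rather than the authors' released code: the module sizes, the single pooled conditioning vector, and the exact placement of the affine layer are assumptions.

```python
import torch
import torch.nn as nn


class Affine(nn.Module):
    """Scale-and-shift modulation of CNN features by a global conditioning vector."""

    def __init__(self, cond_dim: int, channels: int):
        super().__init__()
        self.gamma = nn.Linear(cond_dim, channels)
        self.beta = nn.Linear(cond_dim, channels)

    def forward(self, feat: torch.Tensor, cond: torch.Tensor) -> torch.Tensor:
        # feat: (B, C, H, W), cond: (B, cond_dim)
        gamma = self.gamma(cond)[:, :, None, None]
        beta = self.beta(cond)[:, :, None, None]
        return gamma * feat + beta


class ConformerBlock(nn.Module):
    """One generator block: a local CNN branch guided by a global Transformer branch."""

    def __init__(self, channels: int, embed_dim: int, num_heads: int = 4):
        super().__init__()
        # Transformer branch: self-attention + MLP over the global (text + noise) tokens.
        self.transformer = nn.TransformerEncoderLayer(
            embed_dim, num_heads, dim_feedforward=4 * embed_dim, batch_first=True
        )
        # Bridge: a linear layer that turns the global tokens into a conditioning vector.
        self.bridge = nn.Linear(embed_dim, embed_dim)
        self.affine = Affine(embed_dim, channels)
        # CNN branch: local convolutions refining the image feature map.
        self.conv = nn.Sequential(
            nn.Conv2d(channels, channels, 3, padding=1),
            nn.LeakyReLU(0.2, inplace=True),
            nn.Conv2d(channels, channels, 3, padding=1),
        )

    def forward(self, feat: torch.Tensor, tokens: torch.Tensor):
        # feat: (B, C, H, W) local feature map, tokens: (B, T, embed_dim) global tokens.
        tokens = self.transformer(tokens)
        cond = self.bridge(tokens.mean(dim=1))  # pool the tokens into one global vector
        feat = self.affine(feat, cond)          # global information modulates local features
        feat = feat + self.conv(feat)           # residual local refinement
        return feat, tokens
```

A generator in this spirit would stack N such blocks, upsampling the feature map between them so that the noise-derived 7 × 7 map grows to the final 256 × 256 or 512 × 512 output.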


We evaluate the Conformer-GAN model on the challenging benchmark Caltech-UCSD Birds 200 (CUB) dataset [23] and the Microsoft Common Objects in Context (COCO) dataset [13]. The quality of the generated images is measured using CLIPSIM (CS) [24] and the Fréchet Inception Distance (FID) [6] (a minimal sketch of the CS computation follows the contribution list below). Our experiments show that our Conformer-GAN outperforms the state-of-the-art T2I methods. Our model improves CS from 0.3125 to 0.3164 and reduces FID from 13.91 to 10.14 on the CUB dataset. On the COCO dataset, we lower the FID from 8.21 to 6.73.

The contributions of this paper are as follows:

• We propose a novel framework, Conformer-GAN, which can encode features in both global and local dimensions and is capable of generating images with higher quality and text-image semantic consistency.
• Extensive experiments on two challenging benchmark datasets demonstrate the superiority of our approach compared to existing advanced methods. Importantly, our proposed Conformer-GAN can generate 512 × 512 high-resolution images that are semantically consistent with the text.
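For reference, the CLIPSIM (CS) score used above is essentially the CLIP-space cosine similarity between a caption and the image generated from it. The sketch below uses OpenAI's open-source `clip` package; averaging the per-pair scores over the whole test split is an assumption about the exact evaluation protocol.

```python
import torch
import clip  # OpenAI CLIP: https://github.com/openai/CLIP
from PIL import Image

device = "cuda" if torch.cuda.is_available() else "cpu"
model, preprocess = clip.load("ViT-B/32", device=device)


def clip_score(image_path: str, caption: str) -> float:
    """Cosine similarity between a generated image and its caption in CLIP space."""
    image = preprocess(Image.open(image_path)).unsqueeze(0).to(device)
    text = clip.tokenize([caption]).to(device)
    with torch.no_grad():
        img_feat = model.encode_image(image)
        txt_feat = model.encode_text(text)
    img_feat = img_feat / img_feat.norm(dim=-1, keepdim=True)
    txt_feat = txt_feat / txt_feat.norm(dim=-1, keepdim=True)
    return (img_feat * txt_feat).sum().item()


# A CLIPSIM-style number is then the mean of clip_score over all caption/image
# pairs of the test split, comparable in spirit to the CS values reported above.
```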

2 Related Work

2.1 GAN-Based Text-to-Image Synthesis

Reed et al. [18] proposed text-generated images based on GANs in 2016, an extension of Conditional GANs, capable of generating small images with a 64 × 64 resolution. Subsequently, GANs have become one of the most popular methods in text-to-image generation in recent years [20,25,26,31]. Regardless of the application of GANs in any form, the given text is first processed to obtain the text features, and then the text features are used to constrain the subsequent image generation process. For example, Zhang et al. [29] accomplished text-toimage generation in two stages. However, all their GANs are conditioned on the global sentence vector, which lacks word-level information for generating images. Xu et al. [25] generated finer images by exploiting the word-level crossmodal attention mechanism. Next, Zhu et al. [31] proposed a dynamic memory component, which can dynamically select important word information based on the initial image content. Then, Tao et al. [22] proposed a deep fusion method, and their generator consists of a series of UPBlocks with multiple affine layers in a block, which can generate high-quality images. Similar to [22], our Conformer block also uses affine layers. Also, our framework adopts a single-stage structure to avoid the training difficulties in the stacked GANs structure. 2.2

2.2 Transformer-Based Text-to-Image Synthesis

Since Transformer is a sequential model, its performance on image generation tasks is relatively low. In recent years, with the increasing demand in the field of image generation, the development of Transformer in the field of image generation has gradually attracted attention [8,19,27].


Ramesh et al. [17] utilized a Transformer to tokenize text and images into a single data stream for autoregressive modelling. Lin et al. [12] implemented unified pretraining on unimodal and multimodal data. Unlike the Transformer-based approaches described above, we follow [14] and propose Conformer-GAN, which combines the structural advantages of CNN and Transformer. It uses convolution operations and self-attention mechanisms to enhance representation learning, so that local features and global representations are preserved to the greatest extent, resulting in better T2I performance.

3 Method

The proposed method, shown in Fig. 2, mainly consists of a Generator and a Discriminator, where the Generator contains a feature encoding process and N Conformer blocks. The Discriminator predicts the adversarial loss through a CNN branch and a Transformer branch. Given a text description and a noise vector z sampled from a normal distribution as input, Conformer-GAN outputs an RGB image of 256 × 256 or 512 × 512.

3.1 Model Overview

The model's input is a random vector z ∈ R^100 drawn from a Gaussian distribution and a text description that guides the Generator. To serve the central component of the model, namely the learning of global and


Fig. 2. The architecture of the proposed Conformer-GAN model for text-to-image generation. It has one generator-discriminator pair. The generator is mainly composed of N Conformer blocks.


local features by the Conformer block, we need to encode the input into formats that suit the CNN and Transformer branches through different processing. First, we use the pre-trained text encoder provided by CLIP [16] to encode the text and obtain the sentence embedding and word embeddings as global semantic features that guide image generation. We refer to the sentence and word embeddings jointly as the text embedding in the following. To encode local information, we pass z through a fully connected layer, reshape the resulting vector into a 64 × 7 × 7 feature map, and feed it into a 1 × 1 convolutional layer. At the same time, to preserve the semantic consistency between the given text description and the generated image as much as possible and to compensate for the weak global perception of the CNN branch, we combine z with the text embedding as the global feature and input this combination into the Transformer branch. The feature vectors and feature maps obtained in these two ways together form the input of the Conformer block, and the subsequent encoding/decoding is performed from the global and local dimensions, respectively. The Discriminator is also divided into a CNN branch and a Transformer branch. The Transformer branch uses a pre-trained ViT-based CLIP image encoder (CLIP-ViT), as shown in Fig. 2. Specifically, the Transformer branch feeds the image into CLIP-ViT to extract the features of the (2L − 1)-th and 2L-th layers (L = 2, 3). The CNN branch fuses the features provided by CLIP-ViT with the features extracted by the CNN itself. Finally, the output of the CNN branch is concatenated with the sentence embedding, and the authenticity score is obtained after two convolutional layers.
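To make this encoding step concrete, the following is a minimal PyTorch sketch under our own assumptions: the module name, the 384-dimensional global token, and the 512-dimensional CLIP sentence embedding are illustrative choices, not the authors' configuration.

```python
import torch
import torch.nn as nn

class InputEncoder(nn.Module):
    """Hypothetical sketch of the input encoding described above."""
    def __init__(self, z_dim=100, text_dim=512, embed_dim=384):
        super().__init__()
        self.fc = nn.Linear(z_dim, 64 * 7 * 7)            # noise -> local feature map
        self.conv1x1 = nn.Conv2d(64, 64, kernel_size=1)   # 1x1 conv on the reshaped map
        self.global_proj = nn.Linear(z_dim + text_dim, embed_dim)  # z + text -> global token

    def forward(self, z, text_emb):
        # Local (CNN-branch) input: FC, reshape to 64 x 7 x 7, then a 1x1 convolution.
        fmap = self.conv1x1(self.fc(z).view(-1, 64, 7, 7))
        # Global (Transformer-branch) input: z concatenated with the text embedding.
        token = self.global_proj(torch.cat([z, text_emb], dim=1))
        return fmap, token

z = torch.randn(4, 100)          # noise vector z ~ N(0, I)
text_emb = torch.randn(4, 512)   # stand-in for the CLIP sentence embedding
fmap, token = InputEncoder()(z, text_emb)
print(fmap.shape, token.shape)   # torch.Size([4, 64, 7, 7]) torch.Size([4, 384])
```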

3.2 Conformer Block

The Conformer block is the core of Conformer-GAN. As Fig. 3 shows, each Conformer block consists of a CNN branch and a Transformer branch. Note that, unlike the vanilla Conformer [14], which uses convolutions in its convolution blocks to reduce the resolution of the feature map, our generation task requires the feature map to be enlarged between Conformer blocks. The CNN branch starts with an upsample block, followed by two affine blocks and two convolutional layers arranged alternately to encode the features of the CNN branch. An affine transformation applies geometric operations such as scaling, rotation, translation, and offsetting to an image; when applied to image sub-regions or individual blocks of pixels, it affects the image at a fine-grained level. This is why the affine transformation can deepen the text-image fusion process. First, we use an MLP to obtain the feature vector e that combines the global information of the sentence embedding s and the local information of the word embeddings w:

e = MLP_0(s, w)    (1)


Fig. 3. A schematic of the Conformer block in our framework. It contains two branches, the CNN branch and the Transformer branch. The Conformer block improves the semantic consistency between the text information and the synthesized image by combining global and local representations.

Next, we employ two MLPs to build the affine block. One MLP predicts the channel-wise scaling parameter γ under the text condition, and the other predicts the shifting parameter β:

γ = MLP_1(e),   β = MLP_2(e)    (2)

Both γ and β are learned from the encoded feature vector e and perform channel-wise scaling and shifting of the image features. The affine transformation can be formally expressed as:

Affine(x_i | e) = γ_i · x_i + β_i    (3)

where x_i is the i-th channel of the visual feature maps, and γ_i and β_i are the scaling and shifting parameters for the i-th channel. We insert affine blocks into each Conformer block, which enables the CNN branch to receive more text information, so text and visual features are fully integrated. In the Transformer branch, the input is the z-encoded feature vector together with the text embedding, which represents global semantic information. As shown in Fig. 3, we apply LayerNorm [1] before every layer and use residual connections in both the self-attention layer and the MLP block. The multi-head self-attention module establishes a global connection among all text feature vectors.
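The affine block of Eqs. (1)-(3) can be sketched as follows; the hidden sizes and the two-layer MLPs are our illustrative assumptions rather than the paper's exact design.

```python
import torch
import torch.nn as nn

class AffineBlock(nn.Module):
    """Sketch of the text-conditioned affine transformation in Eqs. (1)-(3)."""
    def __init__(self, cond_dim, num_channels):
        super().__init__()
        # MLP_1 predicts the channel-wise scaling gamma, MLP_2 the shifting beta.
        self.gamma_mlp = nn.Sequential(nn.Linear(cond_dim, num_channels), nn.ReLU(),
                                       nn.Linear(num_channels, num_channels))
        self.beta_mlp = nn.Sequential(nn.Linear(cond_dim, num_channels), nn.ReLU(),
                                      nn.Linear(num_channels, num_channels))

    def forward(self, x, e):
        # x: visual feature maps (B, C, H, W); e: fused text vector from MLP_0(s, w).
        gamma = self.gamma_mlp(e).unsqueeze(-1).unsqueeze(-1)   # (B, C, 1, 1)
        beta = self.beta_mlp(e).unsqueeze(-1).unsqueeze(-1)
        return gamma * x + beta   # Affine(x_i | e) = gamma_i * x_i + beta_i

x = torch.randn(2, 64, 16, 16)
e = torch.randn(2, 256)
print(AffineBlock(cond_dim=256, num_channels=64)(x, e).shape)  # torch.Size([2, 64, 16, 16])
```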

3.3 Loss Functions

Discriminator Loss. We train the network with the adversarial loss combined with the MA-GP loss [20], and we also adopt their one-way discriminator. To assist the learning of the discriminator, we adopt the hinge loss [28]. The corresponding


training objective of the discriminator is:

L_D = − E_{x∼P_r}[min(0, −1 + D(x, s))]
      − (1/2) E_{G(z,s)∼P_g}[min(0, −1 − D(G(z, s), s))]
      − (1/2) E_{x∼P_r}[min(0, −1 − D(x, ŝ))]
      + λ_MA E_{x∼P_r}[(‖∇_x D(x, s)‖_2 + ‖∇_s D(x, s)‖_2)^p]    (4)

where s is the given matching text description, ŝ is a mismatched text description, and x is the real image corresponding to s. D(·) is the Discriminator's score of whether the input image matches the input sentence. P_r denotes the real data distribution and P_g the synthetic data distribution. λ_MA and p are the two hyperparameters of the MA-GP loss that balance the gradient penalty.

Generator Loss. The adversarial loss and the cosine similarity between the CLIP-encoded visual and text features constitute the total loss of the Generator:

L_G = − E_{G(z,s)∼P_g}[D(G(z, s), s)] − λ E_{G(z,s)∼P_g}[ (G(z, s) · s) / (‖G(z, s)‖ × ‖s‖) ]    (5)

where λ is the coefficient of the text-image similarity term. This setup allows the network to generate realistic images that are faithful to the text at both the sentence and word levels.
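A hedged PyTorch sketch of how the objectives in Eqs. (4)-(5) could be assembled is given below; the hyperparameter values (lambda_ma, p, lam) are placeholders, and D, the fake images, and the CLIP-encoded features are assumed to be provided by the surrounding training loop.

```python
import torch
import torch.nn.functional as F

def discriminator_loss(D, real, fake, sent, mis_sent, lambda_ma=2.0, p=6.0):
    """Sketch of Eq. (4): hinge terms plus an MA-GP-style gradient penalty."""
    # Hinge terms: -min(0, -1 + D) = relu(1 - D) and -min(0, -1 - D) = relu(1 + D).
    l_real = F.relu(1.0 - D(real, sent)).mean()
    l_fake = 0.5 * F.relu(1.0 + D(fake.detach(), sent)).mean()
    l_mis = 0.5 * F.relu(1.0 + D(real, mis_sent)).mean()

    # Gradient penalty on (real image, matching sentence) pairs.
    real_g = real.detach().requires_grad_(True)
    sent_g = sent.detach().requires_grad_(True)
    out = D(real_g, sent_g)
    grads = torch.autograd.grad(out.sum(), [real_g, sent_g], create_graph=True)
    grad_norm = grads[0].flatten(1).norm(2, dim=1) + grads[1].flatten(1).norm(2, dim=1)
    gp = lambda_ma * (grad_norm ** p).mean()
    return l_real + l_fake + l_mis + gp

def generator_loss(D, fake, sent, img_feat, txt_feat, lam=4.0):
    """Sketch of Eq. (5): adversarial term plus CLIP cosine similarity.
    img_feat/txt_feat are CLIP-encoded features of the fake image and the text."""
    adv = -D(fake, sent).mean()
    sim = F.cosine_similarity(img_feat, txt_feat, dim=-1).mean()
    return adv - lam * sim
```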

4 Experiments

Datasets. We evaluate the proposed model on two commonly used datasets: CUB [23] and COCO [13]. The CUB dataset contains 200 bird species with 10 text descriptions per bird. Following previous work [20,29,31], we divide the images into class-disjoint training (150 species) and test (50 species) sets. The COCO dataset has 82783 training images and 40504 validation images, and each image has 5 text descriptions.

Table 1. Training details related to datasets and image size.

Strategy | Dataset | Image Size | Batch Size | GPU        | Epoch
B1       | CUB     | 256 × 256  | 64         | Tesla V100 | 1020
B2       | CUB     | 512 × 512  | 32         | Tesla V100 | 1180
C1       | COCO    | 256 × 256  | 64         | Tesla V100 | 750


Training and Evaluation Details. We use the Adam optimizer [9] with β1 = 0.0 and β2 = 0.9 to optimize the network. The learning rate is set to 0.0001 for the Generator and 0.0004 for the Discriminator. We choose the ViT-B/32 [16] model as the CLIP model in our Conformer-GAN. In Fig. 2, when the generated image measures 256 × 256, the Generator has 6 Conformer blocks (N = 6); for 512 × 512 images, it has 7 blocks (N = 7). Table 1 lists further training details for each dataset and image size. Following [20,21,24,31], we employ the widely used Fréchet Inception Distance (FID) [6] and CLIPSIM (CS) [24] to evaluate the performance of our network. CS uses the CLIP [16] model to compute the semantic similarity between the text and the generated images; a higher CS means higher text-image semantic consistency. Conversely, a lower FID means a smaller distance between the generated and real-world image distributions.
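The optimizer settings and the CS metric described above can be sketched as follows; the stand-in generator/discriminator modules are placeholders, and encode_image/encode_text are assumed to follow the interface of the open-source CLIP package.

```python
import torch
import torch.nn as nn

generator = nn.Linear(100, 3 * 256 * 256)     # stand-in for the Conformer-GAN generator
discriminator = nn.Linear(3 * 256 * 256, 1)   # stand-in for the discriminator

# Adam settings reported above: beta1 = 0.0, beta2 = 0.9, lr 1e-4 (G) / 4e-4 (D).
g_optim = torch.optim.Adam(generator.parameters(), lr=1e-4, betas=(0.0, 0.9))
d_optim = torch.optim.Adam(discriminator.parameters(), lr=4e-4, betas=(0.0, 0.9))

def clip_score(clip_model, images, texts):
    """Hedged sketch of the CLIPSIM (CS) metric: average cosine similarity
    between normalized CLIP image and text embeddings."""
    with torch.no_grad():
        img_f = clip_model.encode_image(images).float()
        txt_f = clip_model.encode_text(texts).float()
        img_f = img_f / img_f.norm(dim=-1, keepdim=True)
        txt_f = txt_f / txt_f.norm(dim=-1, keepdim=True)
        return (img_f * txt_f).sum(dim=-1).mean()
```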

4.1 Quantitative Results

We quantitatively analyze the performance of the proposed method on the CUB and COCO datasets in Table 2, comparing it with several state-of-the-art text-to-image methods, including AttnGAN [25], DM-GAN [31], DF-GAN [20], SSA-GAN [11], LAFITE [30] and RAT-GAN [26]. On the CUB dataset, our model shows a clear advantage. Compared with the recently proposed RAT-GAN [26], our Conformer-GAN reduces FID from 13.91 to 10.14; compared with LAFITE [30], it improves CS from 0.3125 to 0.3164. In addition, for images of size 512 × 512, the FID is 10.89 and the CS is 0.3076. On the COCO dataset, our model obtains the lowest FID (6.73) among the compared state-of-the-art models.

Table 2. Results (256 × 256) on the CUB and COCO test sets. The results are taken from the authors' papers. Δ denotes the change relative to the best previous method. All results are averaged over 5 trials. The best results are in bold.

Methods            | CUB FID↓ | CUB CS↑ | COCO FID↓ | COCO CS↑
AttnGAN [25]       | 23.98    | –       | 35.49     | 0.2772
DM-GAN [31]        | 16.09    | –       | 32.64     | 0.2838
DF-GAN [20]        | 14.81    | 0.2920  | 21.42     | 0.2972
SSA-GAN [11]       | 15.61    | –       | 19.37     | –
LAFITE [30]        | 14.58    | 0.3125  | 8.21      | 0.3335
RAT-GAN [26]       | 13.91    | –       | 14.60     | –
Ours (256 × 256)   | 10.14    | 0.3164  | 6.73      | 0.3255
Δ                  | −3.77    | +0.0039 | −1.48     | −0.008


Fig. 4. Examples of images generated by DF-GAN [20], SSA-GAN [11], RAT-GAN [26] and our method conditioned on text descriptions from the test set of CUB and COCO datasets. (Color figure online)

Although our CS score on COCO is slightly lower than that of LAFITE (0.3255 vs. 0.3335), it is still much higher than those of other recent methods. These quantitative results demonstrate that the images generated by our Conformer-GAN possess better quality and higher text-image semantic consistency.

4.2 Qualitative Results

We compare images generated by DF-GAN [20], SSA-GAN [11], RAT-GAN [26] and our method for qualitative evaluation. For the CUB dataset, as shown in the 2nd and 4th columns of Fig. 4, our model correctly synthesizes the images described by the text, whereas the other three models fail to render "brown and yellow bird", "white belly" and "blue wings with brown tips" correctly. Furthermore, the bird synthesized by RAT-GAN contains the wrong colors, and the birds generated by SSA-GAN and DF-GAN look unnatural (e.g., 1st and 2nd columns). In contrast, the images generated by our Conformer-GAN have more vivid details and better object shapes. Notably, as shown in Fig. 1, our Conformer-GAN can generate fine, text-consistent images at high resolution (512 × 512). For the COCO dataset, most notably in the 8th column of Fig. 4, given the text "A black and white photo shows buildings along the Brooklyn bridge", our method generates an image with all the mentioned attributes, whereas the images generated by the other three models do not match "black and white photo". As shown in the 6th and 7th columns, DF-GAN and SSA-GAN cannot synthesize the correct shapes of "pink roses" and "man", and there is no "train" at all in the 5th column.


Table 3. Ablation study evaluating the impact of the Conformer block on the test set of the CUB dataset. All results are averaged over 5 trials. CB: number of Conformer blocks.

Case | CB | CS↑    | FID↓
1    | 2  | 0.3162 | 19.66
2    | 4  | 0.3090 | 12.71
3    | 6  | 0.3164 | 10.14

Although RAT-GAN can generate images that roughly match the text description, their quality is relatively low. Overall, the images generated by our method are more realistic. The qualitative advantage of Conformer-GAN is more evident on the COCO dataset because the Transformer branch continuously provides text information to guide the CNN branch during generation, which enables Conformer-GAN to exert more complex control over the synthesized images.

4.3 Ablation Studies

In this subsection, we verify the effectiveness of the Conformer block in Conformer-GAN by conducting ablation studies on the test set of the CUB dataset [23]. We set different numbers of Conformer blocks in Conformer-GAN for comparison. As shown in Table 3, with 2 Conformer blocks the CS is 0.3162 and the FID is 19.66, and the performance of the model improves as the number of Conformer blocks increases. With 6 Conformer blocks, the framework achieves the highest CS (0.3164) and the lowest FID (10.14). These results demonstrate the effectiveness of the Conformer block: its number directly determines the depth of the text-image fusion process, and with more Conformer blocks the CNN branch receives more text information and can generate finer images.

5 Conclusions

In this paper, we combine the advantages of CNN and Transformer, apply them to the T2I field, and propose a novel architecture called Conformer-GAN. The core module of our model is the Conformer block, which contains a CNN branch and a Transformer branch. To improve semantic consistency, we integrate both sentence and word information to guide the image generation of the CNN branch, while the Transformer branch continuously attends to the relationships between relevant words in the natural-language description. Extensive experiments on the CUB dataset and the more challenging COCO dataset show that our model significantly improves the quality of the generated images, and ablation experiments demonstrate the effectiveness of the proposed method.


Importantly, our Conformer-GAN can generate high-resolution 512 × 512 images with more photo-realistic details.

References 1. Ba, J.L., Kiros, J.R., Hinton, G.E.: Layer normalization. arXiv preprint arXiv:1607.06450 (2016) 2. Ding, M., et al.: CogView: mastering text-to-image generation via transformers. In: Advances in Neural Information Processing Systems, vol. 34, pp. 19822–19835 (2021) 3. Goodfellow, I.J., et al.: Generative adversarial nets. In: Proceedings of the 27th International Conference on Neural Information Processing Systems, NIPS 2014, vol. 2, pp. 2672–2680. MIT Press, Cambridge (2014) 4. Gou, Y., Wu, Q., Li, M., Gong, B., Han, M.: SegAttnGAN: text to image generation with segmentation attention. arXiv (2020) 5. Gu, S., et al.: Vector quantized diffusion model for text-to-image synthesis. arXiv e-prints (2021) 6. Heusel, M., Ramsauer, H., Unterthiner, T., Nessler, B., Hochreiter, S.: GANs trained by a two time-scale update rule converge to a local Nash equilibrium. In: Advances in Neural Information Processing Systems, vol. 30 (2017) 7. Hong, S., Yang, D., Choi, J., Lee, H.: Inferring semantic layout for hierarchical text-to-image synthesis. In: 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition (2018) 8. Huang, Y., Xue, H., Liu, B., Lu, Y.: Unifying multimodal transformer for bidirectional image and text generation. In: Proceedings of the 29th ACM International Conference on Multimedia, pp. 1138–1147 (2021) 9. Kingma, D.P., Ba, J.: Adam: a method for stochastic optimization. arXiv preprint arXiv:1412.6980 (2014) 10. Li, B., Qi, X., Lukasiewicz, T., Torr, P.: Controllable text-to-image generation. arXiv (2019) 11. Liao, W., Hu, K., Yang, M.Y., Rosenhahn, B.: Text to image generation with semantic-spatial aware GAN. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 18187–18196 (2022) 12. Lin, J., et al.: M6: a Chinese multimodal pretrainer. arXiv preprint arXiv:2103.00823 (2021) 13. Lin, T.-Y., et al.: Microsoft COCO: common objects in context. In: Fleet, D., Pajdla, T., Schiele, B., Tuytelaars, T. (eds.) ECCV 2014. LNCS, vol. 8693, pp. 740–755. Springer, Cham (2014). https://doi.org/10.1007/978-3-319-10602-1 48 14. Peng, Z., et al.: Conformer: local features coupling global representations for visual recognition. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 367–376 (2021) 15. Qiao, T., Zhang, J., Xu, D., Tao, D.: MirrorGAN: learning text-to-image generation by redescription. IEEE (2019) 16. Radford, A., et al.: Learning transferable visual models from natural language supervision. In: International Conference on Machine Learning, pp. 8748–8763. PMLR (2021) 17. Ramesh, A., et al.: Zero-shot text-to-image generation. In: International Conference on Machine Learning, pp. 8821–8831. PMLR (2021)


18. Reed, S., Akata, Z., Yan, X., Logeswaran, L., Schiele, B., Lee, H.: Generative adversarial text to image synthesis. In: International Conference on Machine Learning, pp. 1060–1069. PMLR (2016) 19. Sortino, R., Palazzo, S., Rundo, F., Spampinato, C.: Transformer-based image generation from scene graphs. Comput. Vis. Image Underst. 233, 103721 (2023) 20. Tao, M., Tang, H., Wu, F., Jing, X.Y., Bao, B.K., Xu, C.: DF-GAN: a simple and effective baseline for text-to-image synthesis. arXiv e-prints (2020) 21. Tao, M., Bao, B.K., Tang, H., Xu, C.: GALIP: generative adversarial clips for text-to-image synthesis. arXiv preprint arXiv:2301.12959 (2023) 22. Tao, M., et al.: Deep fusion generative adversarial networks for text-to-image synthesis. arXiv preprint arXiv:2008.05865 (2020) 23. Wah, C., Branson, S., Welinder, P., Perona, P., Belongie, S.: The Caltech-UCSD Birds-200-2011 dataset (2011) 24. Wu, C., et al.: N¨ uwa: visual synthesis pre-training for neural visual world creation. In: Avidan, S., Brostow, G., Ciss´e, M., Farinella, G.M., Hassner, T. (eds.) ECCV 2022, Part XVI. LNCS, vol. 13676, pp. 720–736. Springer, Cham (2022). https:// doi.org/10.1007/978-3-031-19787-1 41 25. Xu, T., et al.: AttnGAN: fine-grained text to image generation with attentional generative adversarial networks. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 1316–1324 (2018) 26. Ye, S., Wang, H., Tan, M., Liu, F.: Recurrent affine transformation for text-toimage synthesis. IEEE Trans. Multimedia (2023) 27. Zhang, B., et al.: StyleSwin: transformer-based GAN for high-resolution image generation. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 11304–11314 (2022) 28. Zhang, H., Goodfellow, I., Metaxas, D., Odena, A.: Self-attention generative adversarial networks. In: International Conference on Machine Learning, pp. 7354–7363. PMLR (2019) 29. Zhang, H., et al.: StackGAN: text to photo-realistic image synthesis with stacked generative adversarial networks. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 5907–5915 (2017) 30. Zhou, Y., et al.: LAFITE: towards language-free training for text-to-image generation. arXiv e-prints (2021) 31. Zhu, M., Pan, P., Chen, W., Yang, Y.: DM-GAN: dynamic memory generative adversarial networks for text-to-image synthesis. In: 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2020)

MGFNet: A Multi-granularity Feature Fusion and Mining Network for Visible-Infrared Person Re-identification

BaiSheng Xu, HaoHui Ye, and Wei Wu(B)

College of Computer and Information Science, Southwest University, Chongqing, China
{xu20161053,felixhui}@email.swu.edu.cn, [email protected]

Abstract. Visible-infrared person re-identification (VI-ReID) aims to match the same pedestrian across the different modalities captured by visible and infrared cameras. Existing works focus on mining shared feature representations with deep convolutional neural networks; however, single-granularity features are of limited use for identifying target pedestrians in complex VI-ReID tasks. In this study, we propose a new Multi-Granularity Feature Fusion and Mining Network (MGFNet) to fuse and mine the feature-map information of the network. The network includes a Local Residual Spatial Attention (LRSA) module and a Multi-Granularity Feature Fusion and Mining (MGFM) module that jointly extract discriminative features. The LRSA module guides the network to learn fine-grained features that are useful for discrimination and generates more robust feature maps. The MGFM module then extracts and fuses pedestrian features at both the global and local levels; in particular, a new local feature fusion strategy is designed for the MGFM module to identify subtle differences between pedestrian images. Extensive experiments on two mainstream datasets, SYSU-MM01 and RegDB, show that MGFNet outperforms existing techniques.

Keywords: Visible-thermal person re-identification · feature fusion · multi-granularity · cross-modality

1 Introduction

Person re-identification (ReID) aims to match target pedestrians across multiple camera viewpoints and is widely used in automatic tracking [1]. Most existing ReID methods [2,3] focus on retrieving RGB pedestrian images collected from visible cameras, which is also known as single-modality (RGB-RGB) matching. To improve the applicability of ReID in real-world scenarios, scholars have proposed cross-modality visible-infrared person re-identification


(VI-ReID) methods to retrieve pedestrian images collected by infrared (IR) cameras under poor illumination, such as at night. VI-ReID takes a target person's RGB (or IR) image as a query and matches it against the IR (or RGB) images in the gallery. However, VI-ReID has to deal with the intra-modal differences caused by camera views, pose variations, etc., and at the same time handle the inter-modal variations caused by different device spectra and reflectance.

To handle the differences between the two modalities simultaneously, existing solutions focus on learning and mining shared high-level feature representations. This is achieved by mapping cross-modal images to a shared feature space using one-stream [4,5] or two-stream [6,7] networks and learning shared features that are both invariant and highly discriminative. Considering the complexity of pedestrian images, some works horizontally partition the features output by the stream networks to mine fine-grained local features; they argue [8,9] that fine-grained local features provide effective discriminative information for describing complex pedestrian images. Other scholars use Generative Adversarial Networks (GANs) [10,11] or design loss functions better suited to the task [12,13] to deal with modality differences. However, these methods have two drawbacks that prevent them from effectively exploiting the learned pedestrian features. First, previous research usually updates the network by directly supervising the pedestrian features output by the stream network; such single-granularity supervision does not fully utilize the information in the feature map and may cause the network to ignore some discriminative information. Second, the performance of common fine-grained mining methods can be affected by part-level feature misalignment and by the number of blocks: directly aligning and supervising part-level features not only fails to obtain discriminative fine-grained features but also harms network performance and makes convergence more difficult.

To overcome these problems, we propose a new Multi-Granularity Feature Fusion and Mining network (MGFNet) to learn and mine robust multi-granularity pedestrian features. As shown in Fig. 1, a dual-stream network is used as the backbone to extract pedestrian features. The local residual spatial attention (LRSA) module is embedded inside the backbone, and the multi-granularity feature fusion and mining (MGFM) module is placed at the end of the backbone. LRSA enhances the feature representation with an attention mechanism so that the network can learn more local features. MGFM fully mines the global and local granularity information of the feature map: at the global level, two different pooling methods are used to obtain coarse-grained and fine-grained global features, which are further fused into more robust global features; in addition, a new local fine-grained feature fusion strategy is designed to obtain rich and discriminative local features. The main contributions of this study can be summarized as follows: (1) LRSA is designed to obtain more discriminative feature maps with global-local discrimination. (2) MGFM is introduced to perform multi-granularity feature fusion and mining on feature maps from both global and local perspectives.


(3) Experimental results show that our method achieves outstanding performance on two mainstream datasets (SYSU-MM01 [14] and RegDB [15]).

2 Related Work

Cross-modal pedestrian re-identification aims to match pedestrians across different modalities and has attracted much attention due to its effectiveness in low-light environments. To overcome the large differences between modalities, many VI-ReID methods have been proposed. Wu et al. [14] first defined the VI-ReID task, proposed a deep zero-padding network to learn modality-independent features, and open-sourced the first large-scale cross-modal pedestrian retrieval dataset, SYSU-MM01. Ye et al. [1] proposed a hierarchical cross-modal matching model that uses a dual-stream network to map images of different modalities into a consistent feature space to eliminate the impact of modality differences. Wu et al. [5] proposed a joint Modality and Pattern Alignment Network (MPANet) based on a dual-stream network that significantly improved the accuracy of the VI-ReID task by designing a modality difference mitigation module and a modality alignment module. Liang et al. [16] proposed a cross-modal transformation network (CMTR) that makes the generated features more discriminative by introducing learnable modality embeddings (ME). Yang et al. [17] first considered the impact of mislabeling on cross-modal tasks based on a dual-stream network and proposed Dually Robust Training (DART) to correct the impact of noise by estimating confidence. However, these works mostly focus on eliminating the impact of modality differences by designing different network structures rather than mining complex pedestrian features.

Part-level information can improve the accuracy of re-identification. For instance, Wei et al. [18] designed an Adaptive Body Part (ABP) model to distinguish discriminative part-level features. Zhang et al. [19] proposed a Hybrid Mutual Learning (HML) method that eliminates modality differences by learning the relationship between the part and overall features of different modalities. Considering the misalignment of part-level features caused by the variable posture of pedestrians, Wang et al. [6] used a shortest path algorithm to reduce the impact of misalignment errors. However, most of these methods mine part-level features directly and ignore the relationships between them. In addition, attention mechanisms have been used in the VI-ReID field [20,21] due to their outstanding effects; among them, spatial attention [22] is widely used. By generating a spatial probability mask over the feature maps, the network focuses on the important spatial regions, which is very suitable for complex pedestrian images. For example, Wu et al. [23] proposed a Feature Aggregation Module (FAM) based on spatial attention to fully utilize global information and enhance information interaction between the two modalities. Some scholars, such as Zhu et al. [12], designed a heterogeneous center loss that supervises network learning by constraining the intra-class center distance between different modalities. Feng et al. [24] proposed a dual-constrained cross-modality (DCCM) loss to enhance the class and modal


discrimination of features, which effectively reduces the differences within and between modalities.

3 Methodology

In this section, we first introduce the backbone network structure and the details of the designed LRSA and MGFM modules. Finally, we explain the loss functions used to supervise network learning.

3.1 Backbone Network

Figure 1 provides an overview of the proposed Multi-Granularity Feature Fusion and Mining Network (MGFNet), which utilizes a two-stream [1,5] pre-trained ResNet-50 [25] network as the backbone.

Fig. 1. The proposed Multi-Granularity Feature Fusion and Mining network (MGFNet) for VI-ReID. ⊗ and ⊕ indicate element-wise multiplication and addition, respectively. HG stands for horizontal grouping; Gem and Max represent Gem pooling and Max pooling, respectively, used to obtain features of different granularities. BN stands for batch normalization, θ represents a 1 × 1 convolution, and FC stands for a fully connected layer.

The feature extraction process of the network can be divided into two stages. In the first, modality-specific feature learning stage, two residual convolutional blocks (stage 0) with non-shared parameters learn information from the two different modalities. Subsequently, the modality-specific


feature maps are fed into the modality-sharing stage, which is composed of four residual convolutional blocks (stages 1–4) and the LRSA module with shared parameters and learns multi-granularity information that is independent of modality. Finally, the MGFM module performs multi-granularity feature fusion and mining on the feature maps.
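A minimal sketch of this two-stream backbone is shown below, assuming that stage 0 corresponds to the ResNet-50 stem and stages 1-4 to its four residual layers; this split and the module names are our assumptions for illustration.

```python
import copy
import torch
import torch.nn as nn
from torchvision.models import resnet50

class TwoStreamBackbone(nn.Module):
    """Hedged sketch of the dual-stream backbone: a non-shared stem per modality
    (stage 0) followed by shared ResNet-50 stages 1-4, with LRSA after stage 3."""
    def __init__(self, lrsa: nn.Module = None):
        super().__init__()
        base = resnet50(weights=None)
        stem = nn.Sequential(base.conv1, base.bn1, base.relu, base.maxpool)
        self.stem_rgb = stem                   # visible-specific stage 0
        self.stem_ir = copy.deepcopy(stem)     # infrared-specific stage 0
        self.shared = nn.ModuleList([base.layer1, base.layer2, base.layer3, base.layer4])
        self.lrsa = lrsa if lrsa is not None else nn.Identity()

    def forward(self, x, modality="rgb"):
        x = self.stem_rgb(x) if modality == "rgb" else self.stem_ir(x)
        x = self.shared[0](x)
        x = self.shared[1](x)
        x = self.lrsa(self.shared[2](x))       # LRSA placed behind stage 3
        return self.shared[3](x)

backbone = TwoStreamBackbone()
print(backbone(torch.randn(2, 3, 288, 144), modality="ir").shape)
```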

3.2 Local Residual Spatial Attention

As shown in Fig. 1, the LRSA module is placed behind the third residual block (stage 3) of the modality-sharing phase. Given an intermediate feature map F ∈ R^{C×H×W}, where C, H, and W denote the channels, height, and width of the feature map, traditional spatial attention mechanisms [22] aggregate the spatial information of the feature map through average pooling and max pooling to generate global spatial attention weights that guide network learning. However, for the complex VI-ReID task, directly using such a spatial attention mechanism is unstable in our experiments and can even degrade performance compared to the baseline. This may be because the global spatial attention mechanism mistakenly focuses on common occlusions and variable scenes in pedestrian images, ignoring fine-grained information that is useful for discrimination. To solve this problem, our LRSA horizontally divides the feature map F according to body parts, generates local spatial attention weights f_i^mask to guide the network to focus on key areas at the part level, and then concatenates the part-level attention weights to generate the new attention weights f^mask, as in Eqs. (1)-(2):

f_i^mask = σ(COV(Cat(AvgPool(F_i), MaxPool(F_i))))    (1)

f^mask = Cat_{i=1}^{p}(f_i^mask)    (2)

Here, σ is the sigmoid function, COV is a two-dimensional convolution with kernel size 7, Cat is the concatenation operation, and F_i is the i-th part-level feature map. In addition, to ensure training stability, the input feature F is added to the attention feature map generated by the LRSA module through a residual connection. The attention feature map F^new with the local residual can be represented as:

F^new = τ(F ⊗ f^mask) ⊕ F    (3)

where τ is a local attention weight hyperparameter, which is set to 0.5 here.
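The following PyTorch sketch illustrates Eqs. (1)-(3) under our assumptions (p = 6 horizontal parts, channel-wise average/max pooling as in standard spatial attention, and a 7 × 7 convolution); it is not the authors' implementation.

```python
import torch
import torch.nn as nn

class LRSA(nn.Module):
    """Sketch of Local Residual Spatial Attention: per-part spatial masks are
    computed, concatenated along height, and added back with weight tau."""
    def __init__(self, num_parts=6, tau=0.5):
        super().__init__()
        self.num_parts = num_parts
        self.tau = tau
        self.conv = nn.Conv2d(2, 1, kernel_size=7, padding=3)

    def forward(self, f):
        masks = []
        for f_i in torch.chunk(f, self.num_parts, dim=2):        # split along height
            avg = f_i.mean(dim=1, keepdim=True)                   # AvgPool over channels
            mx, _ = f_i.max(dim=1, keepdim=True)                  # MaxPool over channels
            masks.append(torch.sigmoid(self.conv(torch.cat([avg, mx], dim=1))))  # Eq. (1)
        mask = torch.cat(masks, dim=2)                             # Eq. (2)
        return self.tau * (f * mask) + f                           # Eq. (3)

f = torch.randn(2, 1024, 18, 9)
print(LRSA()(f).shape)  # torch.Size([2, 1024, 18, 9])
```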

3.3 Multi-granularity Feature Fusion and Mining Module

As shown in Fig. 1, MGFM fuses and mines the final pedestrian feature map F* ∈ R^{C×H×W} extracted by the backbone network in both the global and local branches.


At the global level, inspired by [7], generalized mean (Gem) pooling [26] and Max pooling are used to obtain global features f_1 and f_2 of different granularities. Batch normalization of f_1 yields a fine-grained feature f_g; f_2 is added element-wise to f_g and then passed through BN to obtain a global feature f_m that fuses coarse- and fine-grained information:

f_g = BN(f_1),   f_m = BN(f_g ⊕ f_2)    (4)
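A minimal sketch of the global branch in Eq. (4) is given below; the GeM exponent p = 3 and the separate BatchNorm1d layers are illustrative assumptions.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class GeM(nn.Module):
    """Generalized-mean pooling [26]; p = 3 is a common default used here."""
    def __init__(self, p=3.0, eps=1e-6):
        super().__init__()
        self.p = nn.Parameter(torch.tensor(p))
        self.eps = eps

    def forward(self, x):
        x = x.clamp(min=self.eps).pow(self.p)
        return F.adaptive_avg_pool2d(x, 1).pow(1.0 / self.p).flatten(1)

def global_branch(feat, gem, bn1, bn2):
    """Sketch of Eq. (4): f_g = BN(GeM(F*)), f_m = BN(f_g + MaxPool(F*))."""
    f1 = gem(feat)                                    # fine-grained global feature
    f2 = F.adaptive_max_pool2d(feat, 1).flatten(1)    # coarse-grained global feature
    f_g = bn1(f1)
    f_m = bn2(f_g + f2)
    return f_g, f_m

feat = torch.randn(2, 2048, 18, 9)
f_g, f_m = global_branch(feat, GeM(), nn.BatchNorm1d(2048), nn.BatchNorm1d(2048))
print(f_g.shape, f_m.shape)
```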

In the local branch, a common practice [8,9] is to directly supervise and mine the body-part local features, but this ignores the connections between part-level features. Considering the misalignment of part-level features caused by differences in camera angle, pedestrian posture, etc., and the correlation between pedestrian part-level features, we propose a new part-level feature fusion strategy (Fig. 1). The feature map is first split horizontally into 6 blocks along the H dimension, and after Gem pooling we obtain six part-level feature maps p_i ∈ R^{C×(H/6)×W}, where i ∈ {0, 1, ..., 5}. Then, according to the upper- and lower-body partition of the pedestrian, p_l1 and p_l2 are constructed by fusing the first and last three part-level features, respectively. In addition, part-level features p_i are fused to form p_l3 and p_l4, which describe pedestrian body parts with respect to the complexity and correlation of pedestrian images. Finally, to explicitly express the correlation between the new part features and the local pedestrian description feature, all part-level features p_i are fused to generate f_lg:

p_l1 = Cat_{i=0}^{2}(p_i),  p_l2 = Cat_{i=3}^{5}(p_i),  p_l3 = Cat_{i=1}^{3}(p_i),  p_l4 = Cat_{i=2}^{4}(p_i),  f_lg = Cat_{i=0}^{5}(p_i)    (5)

Multi-granularity Feature Fusion and Mining Network

are obtained by COV and BN on the new part-level features using, ⎧ ∗ = BN (COV (pl1 )) ⎪ ⎪pl1 ⎪ ⎪ ⎨p∗l2 = BN (COV (pl2 )) p∗l3 = BN (COV (pl3 )) ⎪ ⎪ ⎪p∗l4 = BN (COV (pl4 )) ⎪ ⎩ ∗ flg = BN (COV (flg ))

21

(6)

Here BN and COV do not share parameters. Based on MGFM, we obtain fine-grained features fg and features fm that integrate coarse and fine-grained features at the global level, and part-level fea∗ at the tures {p∗l1 , p∗l2 , p∗l3 , p∗l4 } and part-level pedestrian description features flg local level. 3.4

Loss Function

In order to enhance the discriminative power of the network output features and maximize the multi-granularity mining ability of MGFM, we use ID loss (Lid ), heterogeneous center loss triplet loss (Lhc ), and weighted regularization triplet loss (Lwtr ) to guide network learning. For basic discrimination learning, we treat the re-identification task as a multi-classification problem. For an input pedestrian image x, the calculation of ID loss is: Lid = −

N    1 log P yi f (xi ) ; θt N i=1

(7)

Here, N represents the total number of pedestrian images in each mini-batch, f (xi ) generally refers to the pedestrian features output by the network, and θt represents the parameters of the identity classifier. Weighted regularization triplet loss [1] aims to optimize the distance between all positive and negative samples from both intra-modal and inter-modal perspectives, which is expressed as: ⎞⎞ ⎛ ⎛ N  p p  1 n n ⎠⎠ (8) log ⎝1 + exp ⎝ wij dij − wik dik Lwtr = − N i=1 ij ik   exp dpij exp (− dnik ) p n  p  , wik = wij =  n dn ∈Ni exp (− dik ) dp ∈pi exp dij ij

ik

Here, (i,j and k) respectively represent a triplet for anchor xi in each training mini-batch. Pi represents the positive sample and Ni represents the negative sample. dpij /dnik is the pairwise distance of positive/negative sample pairs, where dij is the Euclidean distance between different sample pairs and calculated by dij = f (xi ) − f (xj )2 . Heterogeneous center triplet loss [12] focuses on optimizing the distance between the cross-modal positive sample center corresponding to the anchor

22

B. Xu et al.

point and the center of the hardest (intra- and inter-modality) negative sample. The calculation of heterogeneous center triplet loss is expressed as: ⎡ ⎤ n   i    ⎣m + fc − fci  − min fci − fcj  ⎦ Lhc = v t 2 v m 2 i=1 n 

+

m∈{v,t} j=i



+



    ⎣m + fci − fci  − min fci − fcj  ⎦ t v 2 t m 2 m∈{v,t} j=i

i=1

K 1

(9)

+

K 1

i i Here, m represents the margin, fciv = K j=1 fv,j , fcit = K j=1 ft,j are the th centers of the i pedestrian identity under the visible or infrared modality. In i i and ft,j represent the features of the j-th visible light image and addition, fv,j infrared image of the i-th unit, respectively. At the global level, considering the difference between fg and fm , Lwtr and Lhc are used to constrain fg and fm respectively,

Lglobal = Lwtr (fg ) + Lid (fg ) + Lhc (fm ) + Lid (fm )

(10)

At the local level, each part-level feature is supervised by Lhc and Lid , where α and β are hyperparameters used to balance different losses. Llocal = Lhc (flg ) + Lid (flg ) +

p  

    αLhc p∗l_i + βLid p∗l_i

(11)

i=1

The total loss of the network can be calculated as: L = Lglobal + Llocal

4 4.1

(12)

Experiments Datasets and Settings

Datasets: SYSU-MM01 [14] and RegDB [15] datasets were used in our experiments. SYSU-MM01 is a large-scale and widely used VI-ReID dataset. The pedestrian images in SYSU-MM01 are captured by six cameras, including four RGB cameras and two infrared cameras, with scenes including indoor and outdoor. Following the evaluation criteria of SYSU-MM01, the training set includes 395 identities, 22258 RGB images, and 11,909 IR images. To test the model, we randomly select 3803 IR images of 96 identities as queries and 301 RGB images as gallery. This dataset includes two test modes: all search and indoor search. The former is more in line with real-world applications and is more challenging. The RegDB dataset is captured by only one visible light and one infrared camera. It contains a total of 412 IDs, each with ten visible light and infrared images. According to the common VI-ReID setting, 206 identities are randomly

Multi-granularity Feature Fusion and Mining Network

23

selected for training, and the remaining 206 identities are used for testing. Training and testing are repeated ten times, and the average of the ten results is taken as the final outcome of the model. Evaluation metrics: Like [27], we use rank-1 matching accuracy, mean average precision (map), and mean inverse negative penalty [27] (minp) as evaluation metrics. During testing, pedestrian features are processed using L2 normalization. Settings: We use Pytorch to implement our method. All pedestrian images are scaled to 288 × 144 × 3 and random erasure [28] and random grayscale transformation [27] are used as data augmentation strategies. In addition, the backbone network is pre-trained on the ImageNet dataset. During optimization, the stochastic gradient descent (SGD) optimizer is used for training, with an initial learning rate of 0.1 and a warm-up strategy [29] used in the first ten epochs. The learning rate decays to 0.1 and 0.01 after 20 and 50 epochs, respectively. The total number of training epochs is set to 100 rounds. In each batch, 8 pedestrians are randomly selected, each containing 4 visible light images and 4 infrared images. 4.2

4.2 Comparison with the State-of-the-Art Methods

In this section, we compare our proposed MGFNet with advanced VI-ReID methods released in recent years. The results on the SYSU-MM01 and RegDB datasets are listed in Tables 1 and 2, respectively, and show that our method outperforms existing solutions under various settings. In the All-search mode of the large-scale SYSU-MM01 dataset, R1, mAP, and mINP

Table 1. Comparison with the state-of-the-art methods on the SYSU-MM01 dataset.

Methods         Time   All search (rank-1 / mAP / mINP)   Indoor search (rank-1 / mAP / mINP)
AlignGAN [10]   2019   42.41 / 40.70 / –                  45.9 / 54.30 / –
JSIA [30]       2020   38.10 / 36.90 / –                  43.80 / 52.90 / –
DDAG [31]       2020   54.75 / 53.02 / –                  61.02 / 67.98 / –
HC [12]         2020   59.96 / 54.95 / –                  59.74 / 64.91 / –
HAT [32]        2020   55.29 / 53.89 / –                  62.10 / 69.37 / –
AGW [1]         2021   47.50 / 47.65 / 35.30              54.17 / 62.49 / 59.23
SFANet [33]     2021   65.74 / 60.83 / –                  71.60 / 80.05 / –
MCLNet [34]     2021   65.40 / 61.98 / 47.39              72.56 / 76.58 / 72.10
CAJ [27]        2021   69.88 / 66.89 / 53.61              76.26 / 80.37 / 76.79
MPANet [5]      2021   70.58 / 68.24 / –                  76.74 / 80.95 / –
MMN [35]        2021   70.60 / 66.90 / –                  76.20 / 79.60 / –
CMTR [36]       2022   62.58 / 61.33 / –                  67.02 / 73.78 / –
TSME [37]       2022   64.23 / 61.21 / –                  64.80 / 71.53 / –
SPOT [38]       2022   65.34 / 62.52 / 48.86              69.42 / 74.63 / 70.48
DART [17]       2022   68.72 / 66.29 / 53.26              78.17 / 73.78 / 74.94
DCLNet [39]     2022   70.80 / 65.30 / –                  73.50 / 76.80 / –
MAUM [40]       2022   71.68 / 68.79 / –                  71.97 / 81.94 / –
TVTR [41]       2023   65.30 / 64.15 / –                  77.21 / 77.94 / –
Ours            –      72.63 / 69.64 / 56.79              77.90 / 82.28 / 79.33


Table 2. Comparison with the state-of-the-art methods on the RegDB dataset.

Methods         Time   Visible-to-Infrared (rank-1 / mAP / mINP)   Infrared-to-Visible (rank-1 / mAP / mINP)
AlignGAN [10]   2019   57.90 / 53.60 / –                           56.30 / 53.40 / –
JSIA [30]       2020   48.10 / 48.90 / –                           48.50 / 49.30 / –
DDAG [31]       2020   69.34 / 63.46 / –                           68.06 / 61.80 / –
HAT [32]        2020   71.83 / 67.56 / –                           70.02 / 66.30 / –
AGW [1]         2021   70.05 / 66.37 / 50.19                       – / – / –
SFANet [33]     2021   76.31 / 68.00 / –                           70.15 / 63.77 / –
CMTR [36]       2021   80.62 / 74.42 / –                           81.06 / 73.75 / –
VSD [42]        2021   73.20 / 71.60 / –                           71.80 / 70.10 / –
MPANet [5]      2021   70.58 / 68.24 / –                           76.74 / 80.95 / –
MCLNet [34]     2021   80.31 / 73.07 / 57.39                       75.93 / 69.49 / 52.63
CAJ [27]        2021   85.03 / 79.14 / 65.33                       84.75 / 77.82 / 61.56
TSME [37]       2022   87.35 / 76.94 / –                           84.41 / 75.70 / –
SPOT [38]       2022   80.35 / 72.46 / 52.19                       79.37 / 72.26 / 56.06
CMTR [36]       2022   80.62 / 74.42 / –                           81.06 / 73.75 / –
DART [17]       2022   83.60 / 75.67 / 60.60                       81.97 / 73.78 / 56.70
DCLNet [39]     2022   81.20 / 74.30 / –                           78.00 / 70.60 / –
TVTR [41]       2023   84.10 / 79.50 / –                           83.70 / 78.00 / –
Ours            –      91.14 / 82.53 / 67.81                       89.06 / 80.50 / 63.99

of MGFNet increase by 2.74%, 2.75%, and 3.18%, respectively, compared to the baseline (CAJ [27]).

Ablation: We conducted detailed ablation experiments on the SYSU-MM01 and RegDB datasets to verify the effectiveness of the proposed modules. We first evaluated each component separately, where B denotes the baseline network that uses the network output f_m as the pedestrian feature; every pedestrian feature is L2-normalized during testing. Table 3 shows that both the baseline plus the LRSA module and the baseline plus the MGFM module outperform the baseline. Moreover, MGFNet, which combines the two modules, gives the best performance, since the local fine-grained features detected by LRSA can be captured by MGFM's multi-granularity mining.

Table 3. Ablation experiment on the SYSU-MM01 and RegDB datasets.

Method    SYSU-MM01 (R1 / mAP / mINP)   RegDB (R1 / mAP / mINP)
B         69.88 / 66.89 / 53.61         85.03 / 79.14 / 65.33
B+LRSA    71.61 / 68.16 / 54.80         88.74 / 80.69 / 66.23
B+MGFM    72.30 / 68.86 / 55.68         90.12 / 81.07 / 66.59
Ours      72.63 / 69.64 / 56.79         91.14 / 82.53 / 67.81

Visualization: We used the Grad-CAM [43] method to locate the important areas of our network output features to further prove the effectiveness of the proposed


method. As shown in Fig. 2, compared to the baseline, the areas of interest in the network output features tend to be more focused on the human body. In complex scenarios such as low light and occlusion, it can also focus well on the human body and is not easily disturbed.

Fig. 2. The results of Grad-CAM visualization.

5 Conclusion

We propose a Multi-Granularity Feature Fusion and Mining Network (MGFNet) for VI-ReID. The innovations of MGFNet are as follows: the LRSA module guides the network to learn fine-grained features useful for discrimination through a local attention mechanism, directly enhancing the pedestrian feature representation, while the MGFM module mines and fuses pedestrian features at both the local and global levels in a multi-granularity manner. At the global level, different pooling methods and a simple fusion make the pedestrian features more discriminative; at the local level, the designed part-level feature fusion strategy enriches part-level features and reduces the impact of part misalignment at a small cost. Experimental results show that our method performs excellently. We hope that the proposed MGFNet can help with mining VI-ReID feature maps.


References 1. Ye, M., Shen, J., Lin, G., Xiang, T., Shao, L., Hoi, S.C.H.: Deep learning for person re-identification: a survey and outlook. IEEE Trans. Pattern Anal. Mach. Intell. 44(6), 2872–2893 (2021) 2. Ning, X., Gong, K., Li, W., Zhang, L., Bai, X., Tian, S.: Feature refinement and filter network for person re-identification. IEEE Trans. Circ. Syst. Video Technol. 31(9), 3391–3402 (2020) 3. Luo, H., Gu, Y., Liao, X., Lai, S., Jiang, W.: Bag of tricks and a strong baseline for deep person re-identification. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (2019) 4. Wang, Z., Wang, Z., Zheng, Y., Chuang, Y.-Y., Satoh, S.: Learning to reduce duallevel discrepancy for infrared-visible person re-identification. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 618–626 (2019) 5. Wu, Q., et al.: Discover cross-modality nuances for visible-infrared person reidentification. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 4330–4339 (2021) 6. Wang, X., Li, C., Ma, X.: Cross-modal local shortest path and global enhancement for visible-thermal person re-identification. arXiv preprint arXiv:2206.04401 (2022) 7. Liu, H., Chai, Y., Tan, X., Li, D., Zhou, X.: Strong but simple baseline with dual-granularity triplet loss for visible-thermal person re-identification. IEEE Sig. Process. Lett. 28, 653–657 (2021) 8. Yuan, Y., Du, G.: Multi-granularity partial and identity-aware global feature learning for RGB-infrared person re-identification. In: 2022 IEEE 5th Advanced Information Management, Communicates, Electronic and Automation Control Conference (IMCEC), vol. 5, pp. 323–329. IEEE (2022) 9. Wen, X., Feng, X., Li, P., Chen, W.: Cross-modality collaborative learning identified pedestrian. Vis. Comput. 39, 4117–4132 (2023). https://doi.org/10.1007/ s00371-022-02579-y 10. Wang, G., Zhang, T., Cheng, J., Liu, S., Yang, Y., Hou, Z.: RGB-infrared crossmodality person re-identification via joint pixel and feature alignment. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 3623– 3632 (2019) 11. Zhang, S., Yang, Y., Wang, P., Liang, G., Zhang, X., Zhang, Y.: Attend to the difference: cross-modality person re-identification via contrastive correlation. IEEE Trans. Image Process. 30, 8861–8872 (2021) 12. Zhu, Y., Yang, Z., Wang, L., Zhao, S., Xiao, H., Tao, D.: Hetero-center loss for cross-modality person re-identification. Neurocomputing 386, 97–109 (2020) 13. Yan, C., et al.: Beyond triplet loss: person re-identification with fine-grained difference-aware pairwise loss. IEEE Trans. Multimedia 24, 1665–1677 (2021) 14. Wu, A., Zheng, W.-S., Yu, H.-X., Gong, S., Lai, J.: RGB-infrared cross-modality person re-identification. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 5380–5389 (2017) 15. Nguyen, D.T., Hong, H.G., Kim, K.W., Park, K.R.: Person recognition system based on a combination of body images from visible light and thermal cameras. Sensors 17(3), 605 (2017) 16. Liang, T., et al.: CMTR: cross-modality transformer for visible-infrared person re-identification. arXiv preprint arXiv:2110.08994 (2021)


17. Yang, M., Huang, Z., Hu, P., Li, T., Lv, J., Peng, X.: Learning with twin noisy labels for visible-infrared person re-identification. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 14308–14317 (2022) 18. Wei, Z., Yang, X., Wang, N., Song, B., Gao, X.: ABP: adaptive body partition model for visible infrared person re-identification. In: 2020 IEEE International Conference on Multimedia and Expo (ICME), pp. 1–6. IEEE (2020) 19. Zhang, Z., Dong, Q., Wang, S., Liu, S., Xiao, B., Durrani, T.S.: Cross-modality person re-identification using hybrid mutual learning. IET Comput. Vis. 17(1), 1–12 (2023) 20. Tan, L., et al.: Exploring invariant representation for visible-infrared person reidentification. arXiv preprint arXiv:2302.00884 (2023) 21. Li, W., Zhu, X., Gong, S.: Harmonious attention network for person reidentification. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2285–2294 (2018) 22. Woo, S., Park, J., Lee, J.-Y., Kweon, I.S.: CBAM: convolutional block attention module. In: Ferrari, V., Hebert, M., Sminchisescu, C., Weiss, Y. (eds.) ECCV 2018. LNCS, vol. 11211, pp. 3–19. Springer, Cham (2018). https://doi.org/10.1007/9783-030-01234-2_1 23. Baotai, W., Feng, Y., Sun, Y., Ji, Y.: Feature aggregation via attention mechanism for visible-thermal person re-identification. IEEE Sig. Process. Lett. 30, 140–144 (2023) 24. Feng, Y., Chen, Z.: Taking both the modality and class information for visible infrared person re-identification. In: 2022 34th Chinese Control and Decision Conference (CCDC), pp. 4338–4342. IEEE (2022) 25. He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 770–778 (2016) 26. Radenović, F., Tolias, G., Chum, O.: Fine-tuning CNN image retrieval with no human annotation. IEEE Trans. Pattern Anal. Mach. Intell. 41(7), 1655–1668 (2018) 27. Ye, M., Ruan, W., Du, B., Shou, M.Z.: Channel augmented joint learning for visible-infrared recognition. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 13567–13576 (2021) 28. Zhong, Z., Zheng, L., Kang, G., Li, S., Yang, Y.: Random erasing data augmentation. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 34, pp. 13001–13008 (2020) 29. Luo, H., et al.: A strong baseline and batch normalization neck for deep person re-identification. IEEE Trans. Multimedia 22(10), 2597–2609 (2019) 30. Wang, G.-A., et al.: Cross-modality paired-images generation for RGB-infrared person re-identification. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 34, pp. 12144–12151 (2020) 31. Ye, M., Shen, J., J. Crandall, D., Shao, L., Luo, J.: Dynamic dual-attentive aggregation learning for visible-infrared person re-identification. In: Vedaldi, A., Bischof, H., Brox, T., Frahm, J.-M. (eds.) ECCV 2020. LNCS, vol. 12362, pp. 229–247. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-58520-4_14 32. Ye, M., Shen, J., Shao, L.: Visible-infrared person re-identification via homogeneous augmented tri-modal learning. IEEE Trans. Inf. Forensics Secur. 16, 728–739 (2020) 33. Liu, H., Ma, S., Xia, D., Li, S.: SFANet: a spectrum-aware feature augmentation network for visible-infrared person reidentification. IEEE Trans. Neural Netw. Learn. Syst. 34(4), 1958–1971 (2023)


34. Hao, X., Zhao, S., Ye, M., Shen, J.: Cross-modality person re-identification via modality confusion and center aggregation. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 16403–16412 (2021) 35. Zhang, Y., Yan, Y., Lu, Y., Wang, H.: Towards a unified middle modality learning for visible-infrared person re-identification. In: Proceedings of the 29th ACM International Conference on Multimedia, pp. 788–796 (2021) 36. Jiang, K., Zhang, T., Liu, X., Qian, B., Zhang, Y., Wu, F.: Cross-modality transformer for visible-infrared person re-identification. In: Avidan, S., Brostow, G., Cissé, M., Farinella, G.M., Hassner, T. (eds.) ECCV 2022, Part XIV. LNCS, vol. 13674, pp. 480–496. Springer, Cham (2022). https://doi.org/10.1007/978-3-03119781-9_28 37. Liu, J., Wang, J., Huang, N., Zhang, Q., Han, J.: Revisiting modality-specific feature compensation for visible-infrared person re-identification. IEEE Trans. Circ. Syst. Video Technol. 32(10), 7226–7240 (2022) 38. Chen, C., Ye, M., Qi, M., Wu, J., Jiang, J., Lin, C.-W.: Structure-aware positional transformer for visible-infrared person re-identification. IEEE Trans. Image Process. 31, 2352–2364 (2022) 39. Sun, H., et al.: Not all pixels are matched: dense contrastive learning for crossmodality person re-identification. In: Proceedings of the 30th ACM International Conference on Multimedia, pp. 5333–5341 (2022) 40. Liu, J., Sun, Y., Zhu, F., Pei, H., Yang, Y., Li, W.: Learning memory-augmented unidirectional metrics for cross-modality person re-identification. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 19366–19375 (2022) 41. Yang, B., Chen, J., Ye, M.: Top-k visual tokens transformer: selecting tokens for visible-infrared person re-identification. In: ICASSP 2023–2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 1–5. IEEE (2023) 42. Tian, X., Zhang, Z., Lin, S., Qu, Y., Xie, Y., Ma, L.: Farewell to mutual information: variational distillation for cross-modal person re-identification. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 1522–1531 (2021) 43. Selvaraju, R.R., Cogswell, M., Das, A., Vedantam, R., Parikh, D., Batra, D.: GradCAM: visual explanations from deep networks via gradient-based localization. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 618– 626 (2017)

Isomorphic Dual-Branch Network for Non-homogeneous Image Dehazing and Super-Resolution

Wenqing Kuang, Zhan Li(B), Ruijin Guan, Weijun Yuan, Ruting Deng, and Yanquan Chen

Department of Computer Science, Jinan University, Guangzhou 510632, China
[email protected]

Abstract. Removing non-homogeneous haze from real-world images is a challenging task. Meanwhile, the popularity of high-definition imaging systems and compute-limited smart mobile devices has resulted in new problems, such as the high computational load caused by haze removal for large-size images, or the severe information loss caused by the degradation of both the haze and image downsampling, when applying existing dehazing methods. To address these issues, we propose an isomorphic dual-branch dehazing and super-resolution network for non-homogeneous dehazing of a downsampled hazy image, which produces dehazed and enlarged images with sharp edges and high color fidelity. We quantitatively and qualitatively compare our network with several state-of-the-art dehazing methods under the condition of different downsampling scales. Extensive experimental results demonstrate that our method achieves superior performance in terms of both the quality of output images and the computational load.

Keywords: isomorphic dual-branch · image dehazing · super-resolution · non-homogeneous · loss attention

1 Introduction

As a common atmospheric phenomenon, haze reduces the transparency of the air and thus inevitably degrades the quality of images captured in the real world. Removing haze from natural images is a challenging task, particularly when the haze is non-homogeneously distributed and the image size is large with high resolution. On the one hand, for real-world non-homogeneous hazy images that contain both densely and thinly hazed regions, uniform processing of dehazing has intrinsic limitations. For instance, edges may be blurred and vulnerable to the remaining haze for densely hazed regions, while over-dehazing of thinly hazed regions frequently leads to color distortions. On the other hand, the widespread use of high-definition imaging equipment and embedded or mobile devices results in new problems related to a contradiction between a large image size and limited computing resources. Removing haze from high-resolution (HR) images is

30

W. Kuang et al.

Fig. 1. For non-homogeneous dehazing of large-size images, we propose dehazing with SR, which produces clear edges and vibrant colors with the computational load significantly reduced.

a computationally intensive task, which requires both high-performance processors and a significant amount of memory. Therefore, feasible solutions are either downsampling the input image to a small size or cropping it into small patches, as shown in Fig. 1. However, dehazing and then stitching frequently results in mosaic effects and artifacts, because the information across patches cannot be transmitted and coordinated. To overcome these limitations, we can alternatively perform dehazing on a low-resolution (LR) hazy image resized from the original HR image. However, most off-the-shelf dehazing methods ignore the degradation and loss of details introduced by the downsampling, which can hardly be recovered by traditional interpolation and will lead to a blurry output. To address these problems, we propose an end-to-end network for recovering details while dehazing. In our method, an isomorphic dual-branch (IDB) is employed to separately learn "easy-" and "hard-" recovered pixels for regions of thin and dense haze in parallel. Moreover, a loss attention (LA) module is designed to differentiate the IDB module by calculating a changing loss mask. Further, we construct a feature fusion super-resolution (FFSR) module to restore abundant details in LR hazy images. Our method achieves state-of-the-art dehazing performance with the computation cost significantly reduced, owing to the use of LR hazy inputs downsampled from the original HR images. Overall, our contributions can be summarized as follows:
– We design an IDB network for both non-homogeneous image dehazing and super-resolution reconstruction in an end-to-end manner.
– We propose an LA module to distinguish pixels that are "easy for learning" from "hard" ones by constructing a dynamic mask, which differentiates the functions of the two branches in the training process.


– We construct an FFSR module for restoring details in hazy images of low resolutions. Therefore, large-size images can be downsampled and dehazed.

2 Related Works

2.1 Dual-Branch Network for Dehazing

A dual-branch structure has been introduced into several dehazing networks [7,9,10,12,16,23], owing to its strong capability of information aggregation. In addition to an encoder-decoder main branch, a branch based on a discrete wavelet transform is constructed in DWGAN [7] to extract high- and low-frequency signals from images. A structure representation defog network (SRDefog) [12] incorporates a grayscale branch and a color branch to extract the background through a consistency loss. A double-branch dehazing network based on self-calibrated attentional convolution (DBDN-SCAC) [26] uses an auxiliary branch to assist the main branch in generating more identifiable information. In addition to the aforementioned networks with two asymmetric branches, the knowledge transfer dehazing network (KTDN) [22] uses two isomorphic networks, which separately accept clear and hazy images during training to learn their distributions, and only one of them is used for dehazing during testing. Unlike these dual-branch models, our network employs two branches with identical structures to learn the non-homogeneous distribution of haze regions only from the hazy images.

2.2 Super-Resolution in Image Dehazing

Super-resolution (SR) techniques reconstruct HR images from LR inputs, aiming at recovery of details [6,25]. It is essential to introduce SR to dehazing, because the details are frequently covered by haze and lost in images. SRKTDN [6] adds a detail refinement branch of a trident dehazing network (TDN) [14] to the KTDN to learn the high gradient in hazy images. A two-branch neural network (TNN) [23] incorporates a residual channel attention network (RCAN) [25] as an SR branch. Similarly, a transformer-based dehazing network (ITBdehaze) [16] uses RCAN as a second branch to provide additional information to the main branch. By incorporating SR techniques, previous dehazing networks enhance details in the original input image size. Considering the degradation of both haze and downsampling, our method produces HR-dehazed and SR-reconstructed images for LR inputs, with preferable results at a low computation cost.

3 Proposed Method

3.1 Network Architecture

Figure 2 illustrates the architecture of the proposed network, which comprises IDB, LA, FFSR modules, and connections of picture or feature outputs.


Fig. 2. Architecture of IDB dehazing and SR network.

Isomorphic Dual-Branch (IDB) Module: As shown in Fig. 2, we employed an encoder-decoder structure for both branches, which learn "easy-" and "hard-" training pixels separately. For the encoder, the first four stages of a Res2Net101 [8] pretrained on the ImageNet dataset [18] are used to introduce natural image priors for the dehazing network and to extract abundant, multi-scale features. For the decoder, four pixel-shuffle [19] layers are applied for upsampling, corresponding to the four downsampling stages in the encoder, each followed by a series connection of a channel attention block and a pixel attention block. Moreover, skip connections are applied between the encoder and the decoder. Notably, the last convolutional layer and a Tanh activation are effective only in the training stage; they are stacked behind the decoder to produce the dehazed images used by the proposed LA module.

Loss Attention (LA) Module: Generally, a dual-branch network obtains better generalization ability by fusing complementary information from the two branches. Because the structures of the two branches in the IDB module are exactly the same, making each branch learn different knowledge is a key point. We designed an effective and efficient LA module to differentiate the dual branch. The outputs of the two branches, I_{dehaze\_e} and I_{dehaze\_h}, are defined as preliminary dehazed images. The average L1 loss of pixels in the LR haze-free image I_{clear\_LR} (downsampled from the ground-truth HR clear image I_{clear\_HR}) and I_{dehaze\_e} is calculated as follows:

Loss_{L1} = \frac{1}{CWH}\sum_{c=1}^{C}\sum_{x=1}^{W}\sum_{y=1}^{H}\left| I^{c}_{clear\_LR}(x,y) - I^{c}_{dehaze\_e}(x,y) \right|,   (1)

where c, x, and y denote the color channel, horizontal pixel coordinate, and vertical pixel coordinate, respectively. Further, the distance (L1 norm) between I_{dehaze\_e} and I_{clear\_LR} for each pixel is computed as follows:

Distance(c, x, y) = \left| I^{c}_{clear\_LR}(x,y) - I^{c}_{dehaze\_e}(x,y) \right|,   (2)

where Distance(c, x, y) denotes the distance between I_{clear\_LR} and I_{dehaze\_e} of the color channel c at pixel coordinate (x, y). To indicate which points are easy to recover, a dynamic mask is estimated by Eq. (3). Essentially, it measures whether the pixel values are more approximate to their counterparts in the clear LR image than the average distance. Using this mask, a composite image I_{dehaze\_LR} is obtained according to Eq. (4).

Mask(c, x, y) = \begin{cases} 1, & \text{if } Distance(c, x, y) < Loss_{L1}; \\ 0, & \text{otherwise.} \end{cases}   (3)

I_{dehaze\_LR} = I_{dehaze\_e} \times Mask + I_{dehaze\_h} \times (1 - Mask).   (4)
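For concreteness, the following is a minimal PyTorch-style sketch of how Eqs. (1)-(4) can be combined into one loss-attention step. The function name and the assumed tensor shape (N, C, H, W) are illustrative choices, not the authors' implementation.

import torch

def loss_attention_composite(dehaze_e, dehaze_h, clear_lr):
    """Sketch of the LA module: per-pixel L1 distance (Eq. 2) is compared with
    the mean L1 loss (Eq. 1) to build the dynamic mask (Eq. 3), and the two
    branch outputs are then composited (Eq. 4). Tensors are (N, C, H, W)."""
    # Eq. (2): absolute distance between the "easy" branch and the LR ground truth
    distance = (clear_lr - dehaze_e).abs()
    # Eq. (1): average L1 loss over channels and pixels (kept per sample here)
    loss_l1 = distance.mean(dim=(1, 2, 3), keepdim=True)
    # Eq. (3): mask is 1 where the "easy" branch is already closer than average
    mask = (distance < loss_l1).float()
    # Eq. (4): easy pixels come from branch_e, hard pixels from branch_h
    dehaze_lr = dehaze_e * mask + dehaze_h * (1.0 - mask)
    return dehaze_lr, mask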

Figure 3(a) shows an example of Mask and its inverse map 1 - Mask. Mask assumes a value of 1 in smooth or thinly hazed areas, and a value of 0 in edges or densely hazed areas. This is because regions with thin haze are generally easier to recover than those with dense haze. Meanwhile, edges are more vulnerable to the effect introduced by haze, making these areas commonly harder to recover than smooth regions.

Fig. 3. Differentiation of two branches. (a) Mask distinguishing pixels that are easy to recover (white: smooth or thin haze regions) from hard ones (black: edges or dense haze regions) in training; (b) Attention heatmaps of two branches. Red indicates more attention. (Color figure online)

With the motivation of guiding the two branches to learn "easy-" or "hard-" recovered regions during training for images containing both dense and thin haze, instead of merely distinguishing edges from smooth regions, we use loss values to calculate a changing mask rather than any edge detection operator. Figure 3(b) displays the pixel attention maps in the last attention layers of the decoders of the two branches as heatmaps. It is observed that branch_h focuses more on densely hazed regions such as the bushes, grass, and parterre in the example images, while branch_e focuses more on thinly hazed areas. As shown in Fig. 3(a) and Fig. 3(b), Mask has the function of indicating the learning ability of pixels at different locations in real-world images. Finally, the two branches in the IDB module are differentiated by using a constraint between I_{clear\_LR} and the I_{dehaze\_LR} estimated by Eq. (4).


Feature Fusion SR (FFSR) Module: We designed an FFSR module to generate the final dehazed and SR-reconstructed image. As shown in Fig. 2, the features output by the two decoders in the IDB are concatenated and fed into a convolutional layer. Next, five residual channel attention blocks (RCABs) [25] are stacked to transfer LR features and restore image details. Finally, a pixel-shuffle layer and several convolutional layers are added to upsample the LR features to HR outputs.
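A minimal PyTorch sketch of such an FFSR head is given below. The channel widths, the simplified RCAB definition, and the upscaling factor are assumptions made for illustration, not the authors' exact configuration.

import torch
import torch.nn as nn

class RCAB(nn.Module):
    """Simplified residual channel attention block in the spirit of RCAN [25]."""
    def __init__(self, ch):
        super().__init__()
        self.body = nn.Sequential(nn.Conv2d(ch, ch, 3, padding=1), nn.ReLU(True),
                                  nn.Conv2d(ch, ch, 3, padding=1))
        self.ca = nn.Sequential(nn.AdaptiveAvgPool2d(1),
                                nn.Conv2d(ch, ch // 4, 1), nn.ReLU(True),
                                nn.Conv2d(ch // 4, ch, 1), nn.Sigmoid())

    def forward(self, x):
        res = self.body(x)
        return x + res * self.ca(res)

class FFSR(nn.Module):
    """Fuse the two decoder feature maps, refine with five RCABs, then upsample."""
    def __init__(self, in_ch=128, ch=64, scale=2, out_ch=3):
        super().__init__()
        self.fuse = nn.Conv2d(2 * in_ch, ch, 3, padding=1)          # concat -> conv
        self.refine = nn.Sequential(*[RCAB(ch) for _ in range(5)])  # five RCABs
        self.up = nn.Sequential(nn.Conv2d(ch, ch * scale ** 2, 3, padding=1),
                                nn.PixelShuffle(scale),             # LR -> HR
                                nn.Conv2d(ch, out_ch, 3, padding=1))

    def forward(self, feat_e, feat_h):
        x = self.fuse(torch.cat([feat_e, feat_h], dim=1))
        return self.up(self.refine(x))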

3.2 Loss Function

To train the proposed network, the LR dehazed image I_{dehaze\_LR} produced by the LA module and the final output HR clear image I_{dehaze\_SR} are respectively constrained by an L1 loss with I_{clear\_LR} (the downsampled haze-free image) and I_{clear\_HR}, as shown in Fig. 2. The total loss is the sum of both the LR and HR losses.
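Written out in the notation above, the objective described in this subsection amounts to the following restatement in LaTeX (no additional terms are introduced here):

L_{total} = \left\| I_{dehaze\_LR} - I_{clear\_LR} \right\|_{1} + \left\| I_{dehaze\_SR} - I_{clear\_HR} \right\|_{1}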

4 Experiment

4.1 Experimental Settings

Datasets: We conducted experiments on the three datasets most commonly used for non-homogeneous dehazing, NH-HAZE [1], NH-HAZE2 [2], and HD-NH-HAZE [3], which were released by the New Trends in Image Restoration and Enhancement (NTIRE) challenge workshops on image and video processing, held in conjunction with the Computer Vision and Pattern Recognition (CVPR) conference in 2020, 2021, and 2023 (no edition was released in 2022). All these datasets comprise real-world hazy and haze-free image pairs. Notably, HD-NH-HAZE, released in 2023, is a high-definition, extremely large-size (6000 × 4000) image set, which can hardly be processed directly by most dehazing methods without any downsampling or cropping. In addition, because the haze-free validation and testing sets were not released in 2021 and 2023, we used the last five image pairs of the official training sets as the testing sets for a full-reference evaluation, as in most papers [6,7,10,16,23]. Training Details: To train our network, images are resized by bicubic downsampling of two or four times in both width and height, and randomly cropped into patches with a size of 256 × 256. Random horizontal and vertical flipping and rotation are applied as data augmentation. The batch size is set to 4. The Adam optimizer (β1 = 0.9, β2 = 0.999) is used. The learning rate is initially set to 1 × 10−4 and reduced linearly after 1500 epochs, with a total of 6000 epochs. An NVIDIA 3090 GPU was utilized for network training.
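As a rough illustration of this schedule, the optimizer and learning-rate decay could be set up as below. The model object is a placeholder, and the linear-decay formulation is one plausible reading of "reduced linearly after 1500 epochs", not the authors' exact code.

import torch

model = torch.nn.Conv2d(3, 3, 3)   # placeholder for the IDB + LA + FFSR network
optimizer = torch.optim.Adam(model.parameters(), lr=1e-4, betas=(0.9, 0.999))

def lr_lambda(epoch, warm=1500, total=6000):
    # constant for the first 1500 epochs, then linear decay towards zero at epoch 6000
    if epoch < warm:
        return 1.0
    return max(0.0, (total - epoch) / (total - warm))

scheduler = torch.optim.lr_scheduler.LambdaLR(optimizer, lr_lambda)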

4.2 Results and Discussion

We adopted the peak signal to noise ratio (PSNR) [21] and structural similarity (SSIM) [20] to evaluate the accuracy of dehazed outputs related to haze-free


Table 1. Quantitative comparison on the NTIRE datasets. The 1st and 2nd winners of each metric are displayed in bold and underlined, respectively. "↓" indicates the smaller the better, and "↑" indicates the larger the better. Each dataset column reports PSNR↑ / SSIM↑ / LPIPS↓ / PI↓.

2× downsampling
Method         | NH-HAZE                       | NH-HAZE2                      | HD-NH-HAZE                    | AVERAGE
DCP '10        | 13.41 / 0.429 / 0.642 / 5.002 | 11.80 / 0.563 / 0.583 / 4.662 | 11.10 / 0.496 / 0.679 / 5.893 | 12.10 / 0.496 / 0.635 / 5.186
AOD '17        | 15.88 / 0.465 / 0.633 / 5.749 | 15.83 / 0.597 / 0.551 / 5.029 | 14.30 / 0.584 / 0.577 / 6.101 | 15.34 / 0.549 / 0.587 / 5.626
FFA '19        | 18.28 / 0.586 / 0.554 / 5.797 | 19.73 / 0.690 / 0.490 / 5.655 | 18.56 / 0.616 / 0.600 / 6.820 | 18.86 / 0.631 / 0.548 / 6.091
TDN '20        | 20.39 / 0.596 / 0.476 / 5.407 | 18.29 / 0.706 / 0.473 / 4.920 | 14.28 / 0.562 / 0.640 / 5.699 | 17.65 / 0.621 / 0.530 / 5.342
KTDN '20       | 19.83 / 0.568 / 0.507 / 5.769 | 19.01 / 0.520 / 0.624 / 6.973 | 18.18 / 0.587 / 0.632 / 7.787 | 19.01 / 0.558 / 0.588 / 6.843
4KDehazing '21 | 17.48 / 0.471 / 0.681 / 5.982 | 18.00 / 0.599 / 0.616 / 5.421 | 14.32 / 0.531 / 0.667 / 7.132 | 16.60 / 0.534 / 0.655 / 6.178
TNN '21        | 20.40 / 0.609 / 0.461 / 5.069 | 20.62 / 0.724 / 0.403 / 4.992 | 19.54 / 0.646 / 0.459 / 6.232 | 20.19 / 0.660 / 0.441 / 5.431
DWGAN '21      | 19.82 / 0.620 / 0.498 / 5.259 | 20.78 / 0.745 / 0.405 / 5.038 | 18.75 / 0.638 / 0.494 / 6.142 | 19.78 / 0.668 / 0.466 / 5.480
SRDefog '22    | 20.49 / 0.452 / 0.602 / 6.443 | 16.92 / 0.482 / 0.724 / 7.005 | 15.62 / 0.499 / 0.764 / 9.564 | 17.68 / 0.478 / 0.697 / 7.671
SCANet '23     | 18.81 / 0.501 / 0.601 / 6.822 | 20.19 / 0.555 / 0.571 / 7.153 | 20.24 / 0.643 / 0.494 / 6.173 | 19.75 / 0.566 / 0.555 / 6.716
Ours           | 21.36 / 0.679 / 0.371 / 3.802 | 21.82 / 0.786 / 0.323 / 4.019 | 21.43 / 0.728 / 0.424 / 5.540 | 21.54 / 0.731 / 0.373 / 4.454

4× downsampling
Method         | NH-HAZE                       | NH-HAZE2                      | HD-NH-HAZE                    | AVERAGE
DCP '10        | 13.21 / 0.337 / 0.794 / 7.060 | 11.56 / 0.417 / 0.761 / 6.999 | 11.14 / 0.470 / 0.755 / 6.958 | 11.97 / 0.408 / 0.770 / 7.006
AOD '17        | 15.60 / 0.362 / 0.787 / 7.583 | 15.36 / 0.441 / 0.756 / 7.331 | 14.15 / 0.543 / 0.675 / 7.837 | 15.04 / 0.449 / 0.739 / 7.584
FFA '19        | 16.95 / 0.431 / 0.766 / 7.591 | 18.52 / 0.493 / 0.747 / 7.585 | 18.02 / 0.558 / 0.712 / 8.407 | 17.83 / 0.494 / 0.742 / 7.861
TDN '20        | 18.37 / 0.431 / 0.714 / 7.151 | 17.89 / 0.497 / 0.731 / 7.025 | 16.95 / 0.559 / 0.665 / 6.923 | 17.74 / 0.496 / 0.703 / 7.033
KTDN '20       | 17.73 / 0.392 / 0.755 / 7.472 | 17.55 / 0.361 / 0.908 / 8.694 | 17.33 / 0.529 / 0.726 / 9.195 | 17.54 / 0.427 / 0.796 / 5.646
4KDehazing '21 | 17.20 / 0.406 / 0.811 / 7.633 | 17.43 / 0.470 / 0.791 / 7.531 | 14.24 / 0.506 / 0.746 / 8.665 | 16.29 / 0.461 / 0.783 / 7.943
TNN '21        | 18.24 / 0.437 / 0.706 / 7.189 | 18.45 / 0.508 / 0.698 / 7.351 | 18.02 / 0.568 / 0.613 / 7.405 | 18.24 / 0.504 / 0.672 / 7.315
DWGAN '21      | 18.50 / 0.440 / 0.718 / 7.402 | 19.00 / 0.520 / 0.678 / 7.521 | 17.61 / 0.565 / 0.670 / 8.324 | 18.37 / 0.508 / 0.689 / 7.749
SRDefog '22    | 20.21 / 0.420 / 0.684 / 7.175 | 16.72 / 0.439 / 0.792 / 7.435 | 15.62 / 0.501 / 0.766 / 9.329 | 17.52 / 0.453 / 0.747 / 7.980
SCANet '23     | 16.97 / 0.340 / 0.833 / 8.369 | 18.27 / 0.373 / 0.868 / 8.551 | 18.77 / 0.565 / 0.650 / 7.712 | 18.00 / 0.426 / 0.784 / 8.211
Ours           | 20.03 / 0.532 / 0.565 / 6.324 | 19.72 / 0.579 / 0.572 / 6.583 | 21.01 / 0.686 / 0.438 / 5.895 | 20.25 / 0.599 / 0.525 / 6.267

Fig. 4. Dehazed images of various methods when the input hazy image is downsampled by a factor of 2 (the first two rows) and 4 (the 3rd to 4th rows).

Fig. 5. Local regions of dehazed images with a 2× downsampling of input images.


images, as well as the perceptual index (PI) [4] and the learned perceptual image patch similarity (LPIPS) [24] as indicators of perceptual quality. Our method is compared with other dehazing methods, including [7,10–14,17,22,23,27]. Table 1 lists the metrics on all datasets. To evaluate the performance of dehazing LR hazy images, bicubic downsampling of 2× or 4× is applied to the input hazy images. Meanwhile, for all the other methods except ours, the output dehazed images are upsampled to the original size using bicubic interpolation. As summarized in Table 1, our method outperforms all other methods in terms of almost all metrics when the input image contains both the non-homogeneous haze and the downsampling degradation. Specifically, with 2× downsampling, our PSNR values are higher than those of the second-best methods by 0.87 dB on NH-HAZE, 1.04 dB on NH-HAZE2, and 1.19 dB on HD-NH-HAZE, respectively. Significant improvement is also observed for the other indices, SSIM, LPIPS, and PI. When processing the worse degradation of 4× downsampling, our method still achieves superior performance, indicated by the best metrics on most datasets and the best average values. Figure 4 shows a qualitative comparison of our method and others at downsampling factors of 2 and 4 on the NTIRE datasets. To further demonstrate the advantages of our method and its capability to recover details, some local regions of the dehazed images are enlarged in Fig. 5. Compared to all the other methods, our approach produces dehazed images with better visual perceptual quality and higher color fidelity, which contain sharper edges, finer textures, and less noise, as shown by the sky and the pavement regions in Fig. 4, as well as the color palette and the purple streetlight in Fig. 5. To evaluate performance on dehazing images of the original size without the degradation introduced by downsampling, we removed the pixel-shuffle layer used for upsampling in the FFSR module of our network, so that it outputs dehazed images with the same size as the input images. Table 2 lists the metrics tested without downsampling on the NH-HAZE and NH-HAZE2 datasets. On both datasets, our method produces competitive results.

Table 2. Quantitative comparison on original size (without downsampling).

Dataset   Metrics | DCP '10 | AOD '17 | FFA '20 | TDN '20 | KTDN '20 | 4KDehazing '21 | DWGAN '21 | TNN '21 | SRDefog '22 | DeHamer '22 | SCANet '23 | ITBdehaze '23 | Ours
NH-HAZE   PSNR ↑  | 12.35   | 14.04   | 18.82   | 21.43   | 20.84    | 17.37          | 21.51     | 21.44   | 20.99       | 20.66       | 19.52      | 21.44         | 21.50
NH-HAZE   SSIM ↑  | 0.448   | 0.445   | 0.645   | 0.709   | 0.695    | 0.468          | 0.711     | 0.704   | 0.488       | 0.684       | 0.649      | 0.710         | 0.708
NH-HAZE2  PSNR ↑  | 10.57   | 14.52   | 20.90   | 21.24   | 21.32    | 18.28          | 21.99     | 21.92   | 16.95       | 19.18       | 21.14      | 21.67         | 22.39
NH-HAZE2  SSIM ↑  | 0.603   | 0.674   | 0.800   | 0.788   | 0.733    | 0.651          | 0.856     | 0.848   | 0.488       | 0.794       | 0.769      | 0.838         | 0.857

To validate the superiority of our model for haze removal on large-size images, we tested the HD-NH-HAZE dataset, which contains images with a resolution of 6000 × 4000. Unlike other methods [5,9,10,13,15–17,23] that directly process the image with the original size, our network accepts input images downsampled 2× (1/4 size) and 4× (1/16 size), and performs dehazing and SR reconstruction. Notably, for testing images with such a large size, most dehazing networks require


high-performance computing resources, which are unaffordable to common users; for instance, the self-paced semi-curricular attention network (SCANet) [10] tests this dataset using an NVIDIA A100 80 GB graphics card. Therefore, all metrics except ours are copied from other papers [10,16] in Table 3. As summarized in Table 3 and Fig. 6, while saving a significant amount of computational resources, our method still outperforms the other dehazing methods in terms of the PSNR and SSIM metrics, even though the other models are fed 4 (2× downsampling) or 16 (4× downsampling) times as much image information as ours. Notably, the two most advanced methods, SCANet [10] and ITBdehaze [16] (the first place of the NTIRE 2023 High Definition Non-Homogeneous Dehazing Challenge), have lower PSNR values than ours by 0.99 and 0.87 dB, respectively. Moreover, although both TNN [23] and ITBdehaze use the RCAN branch to extract fine details, our method performs much better than them while using only 1/10 the number of their RCABs.

Table 3. Quantitative comparison on the HD-NH-HAZE dataset.

Metrics | DCP '10 | AOD '17 | GCANet '19 | GDN '19 | FFA '20 | TNN '21 | DeHamer '22 | SCANet '23 | ITBdehaze '23 | Ours 1/16 size | Ours 1/4 size
PSNR ↑  | 10.98   | 13.75   | 16.36      | 16.85   | 17.85   | 18.19   | 17.61       | 20.44      | 20.56         | 21.01          | 21.43
SSIM ↑  | 0.478   | 0.562   | 0.512      | 0.607   | 0.649   | 0.642   | 0.605       | 0.662      | 0.636         | 0.687          | 0.728

Further, we compared the computational cost, inference time, and memory consumption by testing a real-world color image with the size of 3 × 1600 × 1200. As shown in Fig. 6, both of our models are located in the top left corner, indicating that compared to others, our method achieves the best performance with low computational cost and fast speed. As to GPU memory consumption, by feeding an input image with a size of 1/16 (4× downsampling), the memory is significantly saved up to 45.54% compared to the original input size (from 8589 to 4677 MB).

Fig. 6. Comparison of the computational efficiency and PSNR.

4.3 Ablation Study

In this section, we conducted a series of ablation experiments to demonstrate the effectiveness of each component of the network structure. To verify the effectiveness of the IDB, LA, and FFSR modules, we designed seven network models:
M1: a single branch with picture output ("picture out" in Fig. 2) directly;
M2: IDB with a concatenation of "picture out" and a convolutional layer;
M3: M2 with the LA module;
M4: IDB and LA modules with a concatenation of feature outputs ("feature out" in Fig. 2) and a convolutional layer;
M5: M1 with feature outputs and FFSR comprising five RCABs;
M6: M4 with FFSR comprising five RCABs (the proposed method);
M7: M4 with FFSR comprising 15 RCABs.
Table 4 lists the testing results of these models.

Table 4. Ablation experiments of each module.
Model   | M1     | M2     | M3     | M4     | M5     | M6     | M7
PSNR ↑  | 20.77  | 20.85  | 20.97  | 21.12  | 21.22  | 21.36  | 21.38
SSIM ↑  | 0.6420 | 0.6419 | 0.6421 | 0.6511 | 0.6736 | 0.6789 | 0.6801

We further compared the dehazed outputs of models with (w/) or without (w/o) FFSR in Fig. 7, indicating the FFSR module recovers sharp edges and vibrant colors.

Fig. 7. Output images with (w/) or without (w/o) attaching FFSR module.

5 Conclusion

In this study, we propose a novel IDB network for dehazing and SR enhancement, in which the IDB module effectively learns beneficial complementary information under the guidance of the LA module, and the FFSR module recovers details for the dehazed HR images. Our method is effective and efficient for processing real-world images degraded by both non-homogeneous haze and downsampling blur. Experimental results demonstrate the superiority of our method in both dehazing performance and computation cost. In the future, we will explore variants of the IDB and LA modules, as well as their applications to other visual tasks by replacing the branch backbones. Acknowledgements. This work was financially supported by the National Natural Science Foundation of China (No. 62071201, No. U2031104), and Guangdong Basic and Applied Basic Research Foundation (No. 2022A1515010119).


References 1. Ancuti, C.O., Ancuti, C., Vasluianu, F.A., Timofte, R.: NTIRE 2020 challenge on nonhomogeneous dehazing. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, NJ, pp. 490–491. IEEE (2020) 2. Ancuti, C.O., Ancuti, C., Vasluianu, F.A., Timofte, R.: NTIRE 2021 nonhomogeneous dehazing challenge report. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, NJ, pp. 627–646. IEEE (2021) 3. Ancuti, C.O., et al.: NTIRE 2023 HR nonhomogeneous dehazing challenge report. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, NJ, pp. 1808–1824. IEEE (2023) 4. Blau, Y., Mechrez, R., Timofte, R., Michaeli, T., Zelnik-Manor, L.: The 2018 PIRM challenge on perceptual image super-resolution. In: Leal-Taix´e, L., Roth, S. (eds.) ECCV 2018. LNCS, vol. 11133, pp. 334–355. Springer, Cham (2019). https://doi. org/10.1007/978-3-030-11021-5 21 5. Chen, D., et al.: Gated context aggregation network for image dehazing and deraining. In: 2019 IEEE Winter Conference on Applications of Computer Vision, NJ, pp. 1375–1383. IEEE (2019) 6. Chen, T., Fu, J., Jiang, W., Gao, C., Liu, S.: SRKTDN: applying super resolution method to dehazing task. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, NJ, pp. 487–496. IEEE (2021) 7. Fu, M., Liu, H., Yu, Y., Chen, J., Wang, K.: DW-GAN: a discrete wavelet transform GAN for nonhomogeneous dehazing. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, NJ, pp. 203–212. IEEE (2021) 8. Gao, S.H., Cheng, M.M., Zhao, K., Zhang, X.Y., Yang, M.H., Torr, P.: Res2Net: a new multi-scale backbone architecture. IEEE Trans. Pattern Anal. Mach. Intell. 43(2), 652–662 (2019) 9. Guo, C.L., Yan, Q., Anwar, S., Cong, R., Ren, W., Li, C.: Image dehazing transformer with transmission-aware 3D position embedding. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, NJ, pp. 5812– 5820. IEEE (2022) 10. Guo, Y., et al.: SCANet: self-paced semi-curricular attention network for nonhomogeneous image dehazing. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, NJ, pp. 1884–1893. IEEE (2023) 11. He, K., Sun, J., Tang, X.: Single image haze removal using dark channel prior. IEEE Trans. Pattern Anal. Mach. Intell. 33(12), 2341–2353 (2010) 12. Jin, Y., Yan, W., Yang, W., Tan, R.T.: Structure representation network and uncertainty feedback learning for dense non-uniform fog removal. In: Wang, L., Gall, J., Chin, T.J., Sato, I., Chellappa, R. (eds.) ACCV 2022, Part III. LNCS, vol. 13843, pp. 155–172. Springer, Cham (2023). https://doi.org/10.1007/978-3031-26313-2 10 13. Li, B., Peng, X., Wang, Z., Xu, J., Feng, D.: AOD-Net: all-in-one dehazing network. In: Proceedings of the IEEE International Conference on Computer Vision, NJ, pp. 4770–4778. IEEE (2017) 14. Liu, J., Wu, H., Xie, Y., Qu, Y., Ma, L.: Trident dehazing network. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, NJ, pp. 430–431. IEEE (2020)


15. Liu, X., Ma, Y., Shi, Z., Chen, J.: GridDehazeNet: attention-based multi-scale network for image dehazing. In: Proceedings of the IEEE International Conference on Computer Vision, NJ, pp. 7314–7323. IEEE (2019) 16. Liu, Y., Liu, H., Li, L., Wu, Z., Chen, J.: A data-centric solution to nonhomogeneous dehazing via vision transformer. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, NJ, pp. 1406–1415. IEEE (2023) 17. Qin, X., Wang, Z., Bai, Y., Xie, X., Jia, H.: FFA-Net: feature fusion attention network for single image dehazing. In: Proceedings of the AAAI Conference on Artificial Intelligence, Menlo Park, vol. 34, pp. 11908–11915. AAAI (2020) 18. Russakovsky, O., et al.: ImageNet large scale visual recognition challenge. Int. J. Comput. Vis. 115, 211–252 (2015). https://doi.org/10.1007/s11263-015-0816-y 19. Shi, W., et al.: Real-time single image and video super-resolution using an efficient sub-pixel convolutional neural network. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, NJ, pp. 1874–1883. IEEE (2016) 20. Wang, Z., Bovik, A.C., Sheikh, H.R., Simoncelli, E.P.: Image quality assessment: from error visibility to structural similarity. IEEE Trans. Image Process. 13(4), 600–612 (2004) 21. Wang, Z., Li, Q.: Information content weighting for perceptual image quality assessment. IEEE Trans. Image Process. 20(5), 1185–1198 (2010) 22. Wu, H., Liu, J., Xie, Y., Qu, Y., Ma, L.: Knowledge transfer dehazing network for nonhomogeneous dehazing. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, NJ, pp. 478–479. IEEE (2020) 23. Yu, Y., Liu, H., Fu, M., Chen, J., Wang, X., Wang, K.: A two-branch neural network for non-homogeneous dehazing via ensemble learning. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, NJ, pp. 193–202. IEEE (2021) 24. Zhang, R., Isola, P., Efros, A.A., Shechtman, E., Wang, O.: The unreasonable effectiveness of deep features as a perceptual metric. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, NJ, pp. 586–595. IEEE (2018) 25. Zhang, Y., Li, K., Li, K., Wang, L., Zhong, B., Fu, Y.: Image super-resolution using very deep residual channel attention networks. In: Ferrari, V., Hebert, M., Sminchisescu, C., Weiss, Y. (eds.) ECCV 2018. LNCS, vol. 11211, pp. 294–310. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-01234-2 18 26. Zheng, C., Zhang, J., Hwang, J.N., Huang, B.: Double-branch dehazing network based on self-calibrated attentional convolution. Knowl.-Based Syst. 240, 108148 (2022) 27. Zheng, Z., et al.: Ultra-high-definition image dehazing via multi-guided bilateral learning. In: 2021 IEEE Conference on Computer Vision and Pattern Recognition, NJ, pp. 16180–16189. IEEE (2021)

Hi-Stega: A Hierarchical Linguistic Steganography Framework Combining Retrieval and Generation

Huili Wang1(B), Zhongliang Yang2, Jinshuai Yang3, Yue Gao3, and Yongfeng Huang3,4

1 Institute for Network Sciences and Cyberspace, Tsinghua University, Beijing 100084, China [email protected]
2 School of Cyberspace Security, Beijing University of Posts and Telecommunications, Beijing 100876, China [email protected]
3 Department of Electronic Engineering, Tsinghua University, Beijing 100084, China
4 Zhongguancun Laboratory, Beijing 100094, China

Abstract. Due to the widespread use of social media, linguistic steganography, which embeds secret messages into normal text to protect their security and privacy, has been widely studied and applied. However, existing linguistic steganography methods ignore the correlation between social network texts, resulting in steganographic texts that are isolated units and prone to breakdowns in cognitive imperceptibility. Moreover, the embedding capacity of text is also limited due to the fragmented nature of social network text. In this paper, in order to make linguistic steganography practically applicable in the social network environment, we design a hierarchical linguistic steganography (Hi-Stega) framework. Combining the benefits of retrieval and generation steganography methods, we divide the secret message into data information and control information by taking advantage of the fact that social network contexts are associative. The data information is obtained by retrieving the secret message in a normal network text corpus, and the control information is embedded in the process of comment or reply text generation. The experimental results demonstrate that the proposed approach achieves a higher embedding payload while the imperceptibility and security can also be guaranteed. (All datasets and codes used in this paper are released at https://github.com/wanghl21/Hi-Stega.)

Keywords: Linguistic steganography · Text Generation · Information Security

1 Introduction

With the advancement of technology, social networks have gradually become an integral part of people's lives, and people pay increasing attention to their privacy. Steganography [2], a technology with a long history, is designed

to safeguard the privacy and security of secret message by embedding the message into normal carriers thereby making it difficult for monitors to detect the transmission of secret message. According to the International Data Corporation (IDC) [18], the global data circle will expand to 163 ZB (1 ZB equals 1 trillion GB) by 2025, which is approximately ten times the data generated in 2016. It is worth mentioning that steganography can be employed with various forms of carriers including image [8], text [11], audio [1], video [12] and so on. The vast volume of data available on social media platforms provides numerous carrier options and scenarios for transmitting secret messages using steganography. However, affected by the transmission channel, the carriers of some media forms like image, audio, and video may be damaged during the transmission process, resulting in a decrease in the robustness of secret information. Among the various media carriers, text steganography is less affected by transmission channels, exhibiting higher levels of robustness and practicality, thus rendering it particularly suitable for social network scenarios. The linguistic steganography technique based on text context can be categorized into three primary methods: modification-based method, retrieval-based method, and generation-based method. Modification-based methods [3,19,20] embed information by lexically modifying the text content. These methods generally have a lower embedding rate and tend to alter word frequencies, which can make them more susceptible to steganalysis techniques. Retrieval-based methods [4,21,22,30] first encode the samples in the text corpus, and then select the corresponding sentence for transmission by mapping sample with secret message. These methods need to encode and construct the text corpus, and their capacity is relatively low. However, since the text carrier itself has not changed, it is difficult for the retrieval-based method to be detected by the existing steganalysis methods. In recent years, with the development of Natural Language Processing (NLP) technology, generation-based methods [9,10,24,26,27,33] mainly embed secret messages by encoding the conditional probability distribution of each word reasonably in the process of text generation using neural networks. These methods can improve the embedding payload to some extent, but the generated text is semantically random resulting in ineffective resistance to some steganalysis methods [23,28]. The flexibility of generation-based methods, including the ability to adapt to any context and any scene, has not been fully explored and utilized. There are two significant challenges associated with the application of linguistic steganography methods in social network scenarios. Firstly, in the social network environment, information carriers are no longer isolated semantic expression units. Instead, they are interconnected through social and cognitive relationships. For example, two parties communicate in social network as shown in Fig. 1. Alice says “Hi, long time no see!”. Bob will probably reply that “Yes, how’s it going?”. It will not be that “Eve’s experiment isn’t done.”. Previous text steganography algorithms paid less attention on the relationship between texts in social networks, making them inconsistent in contextual semantics, which may


Fig. 1. Two parties communicate in social network.

bring potential security risks. Secondly, the fragmentation of social networks is serious, and we need steganography with higher embedding payload in order to embed certain secret message in short sentences and avoid the security risks associated with continuous communication. According to the contextual relevance of social network text and the demand for high embedding payload, we propose a new hierarchical linguistic steganography (Hi-Stega) framework that combines the strengths of strong imperceptibility of retrieval method and large capacity of generation method. The secret message embedding process is divided into data information process layer and control information embedding layer according to different processing phases. The data information is the context data carrier obtained by retrieving the normal social network text (e.g. news) according to the secret message, while the control information is the identification signal of the secret message in the context data carrier and is embedded in the process of comment text generation. The proposed framework has the following contributions to application of linguistic steganography in social network. Firstly, it can strengthen the semantic coherence between steganographic text (stegotext) and normal text to improve cognitive imperceptibility [29]. Secondly, the data layer and the control layer are separated, and secret message cannot be decoded only by obtaining a single layer of information, which improves the security of secret message. Thirdly, experimental results demonstrate that by properly designing the language model (LM), the proposed framework has a large embedding payload while the imperceptibility and security can also be guaranteed.

2 Related Works

Steganography is the practice of hiding secret or sensitive information within an ordinary-looking file or message without arousing suspicion. Retrieval-based methodologies initially encode the samples within the textual corpus, subsequently selecting the suitable sentence for transmission by associating the samples with the concealed message through mapping [4,21,22,30]. For example, Chen et al. [4] proposed an efficient method of coverless text steganography based on the Chinese mathematical expression. By exploiting the space mapping concept, Wang et al. [21] initialized a binary search tree based on the texts


from the Internet and searched the corresponding texts according to the secret binary digit string generated from a secret. These methods have the advantage of leveraging existing text resources, which can make the hidden message less conspicuous. However, the payload and effectiveness of retrieval-based steganography heavily depends on the quality and size of the underlying dataset. The advancement of NLP technology has facilitated the development of generation-based linguistic steganography methods that primarily utilize neural networks like Recurrent Neural Network (RNN) [26], variational autoencoder (VAE) [27] and transformers [9,10,24,33] to learn the statistical language model and then employ the encoding algorithm to encode the conditional probability distribution of each word in the generation process to embed secret information. These approaches offer several advantages, including improved quality of steganographic texts and higher embedding payload. Besides, some linguistic steganography algorithms are concerned with the semantics of generating text [9,24,31]. For instance, Zhang et al. [31] introduced a generation-based linguistic steganography method that operates in the latent space, encoding secret messages into implicit attributes (semantemes) present in natural language. Yang et al. [24] involved pivoting the text between two distinct languages and embedded secret data using a strategy that incorporated semantic awareness into the information encoding process. Nevertheless, these approaches overlook the semantic relationship between sentence contexts, rendering them vulnerable to some steganalysis methods [23,28]. Consequently, this paper aims to address these limitations and proposes an improved method combined advantages both of retrieval-based and generation-based methods.

3 The Proposed Framework

As analyzed above, linguistic steganography in the social network environment should not only satisfy perceptual and statistical imperceptibility, but also make the stegotext and its context semantically coherent, and expand the steganographic embedding payload as much as possible. The proposed framework combines retrieval and generation of text and focuses on addressing these challenges to make linguistic steganography practical and applicable to social network scenarios. The core idea is to retrieve the appropriate context data carrier c in the social network text corpus C according to the secret message m, and then perform information embedding when we make comments or replies on it. Therefore, the linguistic steganography adapted to the social network can be expressed as:

Embed(m, c) \rightarrow s, \quad \min D_{SD}(P_S \| P_C) \quad \text{s.t.} \quad |\Delta D_{SC}(\cdot)| = |D_{SC}(s, c) - D_{SC}(c, c)| \le \varepsilon \ (\varepsilon \to 0),   (1)

where Embed(·) is the overall description of the Hi-Stega, and s is the stegotext and also the comment to the context c. DSD (·) is implemented to measure


Fig. 2. The proposed Hi-Stega framework.

differences between the statistical distribution of the stegotext PS and normal text PC . DSC (·) is the function that measures the degree of contextual semantic coherence. Based on an example of the steganographic process, the overall design of the proposed framework is shown in Fig. 2 and the pseudocode of the embedding process of the proposed framework is demonstrated in Algorithm 1. We divide the secret message embedding process into data information process layer and control information embedding layer. It is reasonable to treat social networks as a huge real-time updated corpus C. The main task for the upper layer is to retrieve secret message m in the social network environment to find the appropriate context data carrier m (ensure that as many words in the secret message as possible appear in the context data carrier). The secret message and the context data carrier are processed to obtain control information which includes the indexes i (the positions of the words in the secret message appear in the context data carrier) and the words k (named keyword set) that do not appear. In the lower layer, the information returned from the upper layer needs to be processed and transformed into bit stream b through a format protocol. Bit stream embedding is performed during the process of generating text with the context data carrier as model input. It is worth noting that the process of generation is guided using k to ensure that some of the words in the secret message which may not be covered by the context data carrier will appear in the generated stegotext s. At the same time, the semantics of the generated text is directed in the relevant direction and semantic coherence is enhanced.
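As one possible reading of the "format protocol" mentioned above, the sketch below packs the word indexes i into a fixed-width bit stream. The 8-bit width and the function names are illustrative assumptions, not the protocol actually used by the authors.

def indexes_to_bits(indexes, width=8):
    """Encode each index of a secret-message word inside the context carrier
    as a fixed-width binary string and concatenate them into bit stream b."""
    return "".join(format(i, f"0{width}b") for i in indexes)

def bits_to_indexes(bits, width=8):
    """Inverse mapping used on the receiver side."""
    return [int(bits[k:k + width], 2) for k in range(0, len(bits), width)]

# Example: positions 3, 17 and 42 in the context data carrier
b = indexes_to_bits([3, 17, 42])
assert bits_to_indexes(b) == [3, 17, 42]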


Algorithm 1. Secret Message Embedding Algorithm
Input: Secret message m, social network text corpus C, Language Model LM(·).
Output: Stegotext s, steganography parameters Φ.
1: function DataInformationProcessing(m, C)
2:   Retrieve in C to find c that can cover as many words in m as possible;
3:   Record the indexes i = {i1, i2, . . . , ip} of these words in c;
4:   if mi ∈ m and mi ∉ c then
5:     Add mi to keywords set k;
6:   end if
7:   return Control information k, i, and data carrier c;
8: end function
9: function ControlInformationEmbedding(k, i, c)
10:   Convert i into bit stream b = {b1, b2, . . . , bl} through a format protocol;
11:   while the generation process is not finished do
12:     wj ← LM(c, k, b);
13:     if wj ∈ k then
14:       Remove wj from k;
15:       Add index j into parameters set Φ;
16:     end if
17:   end while
18:   return Stegotext s = {w1, w2, . . . , ws} and Φ.
19: end function
20: (k, i, c) ← DataInformationProcessing(m, C);
21: s, Φ ← ControlInformationEmbedding(k, i, c);
22: Negotiate parameters with the receiver, such as comment time means Φ, etc.
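To make the data-information layer more concrete, here is a simplified Python sketch of the retrieval step (lines 1-8 above). Scoring by word overlap is an assumed heuristic, and the whitespace tokenizer is deliberately naive; neither is claimed to match the authors' implementation.

def data_information_processing(secret, corpus):
    """Pick the corpus text that covers the most secret-message words, then
    split the message into indexes (covered words) and keywords k (words the
    carrier does not contain)."""
    words = secret.lower().split()
    # choose the carrier c with the largest word overlap
    carrier = max(corpus, key=lambda t: len(set(words) & set(t.lower().split())))
    carrier_words = carrier.lower().split()
    indexes, keywords = [], []
    for w in words:
        if w in carrier_words:
            indexes.append(carrier_words.index(w))   # position of the word in c
        else:
            keywords.append(w)                       # must be forced into the comment
    return carrier, indexes, keywords

carrier, i, k = data_information_processing(
    "launch the plan tonight", ["the plan for the city tonight", "weather is nice"])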

The extraction algorithm is the reverse process of the embedding algorithm, where the control information is obtained and then mapped to the context data carrier to recover the secret message. The pseudocode is demonstrated in Algorithm 2.

4

Methodology of Our Implementation

To verify the effectiveness of the proposed Hi-Stega framework, we construct a concrete model for further assessment. 4.1

Steganographic Text Generation

As shown in the right half of Fig. 2, our steganographic text is finally obtained by using a generative steganography algorithm, so in the first step we describe its overall process. Text can be regarded as a token sequence, composed of specific words according to the semantic association and syntactic rules. In Hi-Stega, the context data carrier c retrieved according to the secret message m can be regarded as the input of the language model and the chain rule is used to describe


Algorithm 2. Secret Message Extraction Algorithm
Input: Context c, steganography parameters Φ, stegotext s, Language Model LM(·).
Output: Secret message m = {m1, m2, m3, . . . , mn}.
1: Input c and prompt phase to LM(·).
2: if Φ == φ then
3:   Decode algorithm outputs embedded bits b;
4: else
5:   Recover keywords set k from c and Φ;
6:   Decode algorithm outputs embedded bits b with k;
7: end if
8: m ← (i) ← b;

the probability distribution of word sequences:

p_{\theta}(y) = \prod_{t=1}^{|y|} p_{\theta}\left(y_t \mid c, y_{<t}\right)

Besides, take u_d as

u_d = U\cos(\psi + \chi - \psi_d) + k x_e,   (8)

with k > 0.

Theorem 1. The convergence of the plane trajectory tracking errors x_e and y_e can be guaranteed if the yaw angle ψ is able to accurately track the guidance yaw angle ψ_G.

Proof. Construct a Lyapunov function as V = \frac{1}{2}x_e^2 + \frac{1}{2}y_e^2; then

\dot{V} = x_e\dot{x}_e + y_e\dot{y}_e   (9)

By utilizing ψ = ψ_G and incorporating Eqs. 6, 7, and 8, Eq. 9 can be expressed as

\dot{V} = -k x_e^2 - \frac{\alpha U y_e^2}{1 + \alpha^2 y_e^2}   (10)


Given that k > 0, α > 0, and U ≥ 0, it follows that \dot{V} ≤ 0, which indicates that the plane trajectory tracking errors are globally asymptotically stable.

Remark 1. The yaw angle command in Eq. 7 serves to establish a connection between the yaw angle and the tracking error y_e. This connection allows the lateral tracking error y_e to be reduced by altering the flight direction of the powered parafoil. Besides, according to Eq. 8 and u_d = \dot{(\cdot)}\sqrt{\dot{x}_d^2(\cdot) + \dot{y}_d^2(\cdot)}, the update rate of the path parameter can be obtained, thus helping to overcome the longitudinal error x_e.

3.2 LADRC Controller

The LADRC controller's simple structure makes it easy to implement in real-world applications. In this paper, we employ the LADRC controller to achieve trajectory tracking of the parafoil system. It is necessary to determine the order of the controlled system, specifically the relationship between the output variable ψ and the control variable u. According to Eq. 3, there is

\ddot{\psi} = \frac{\sin\phi}{\cos\theta}\dot{q} + \frac{\cos\phi}{\cos\theta}\dot{r} + \frac{\sin\theta\sin 2\phi}{\cos^2\theta}q^2 - \frac{\sin\theta\sin 2\phi}{\cos^2\theta}r^2 + \frac{\cos\phi}{\cos\theta}pq - \frac{\sin\phi}{\cos\theta}pr + \frac{2\sin\theta\cos 2\phi}{\cos^2\theta}qr   (11)

Then a second-order system can be expressed as

\ddot{\psi} = f_1(\cdot) + f_2(u)   (12)

where f_1(\cdot) represents the disturbance function, including the internal states and external disturbance information, and f_2(u) is a function of the control variable. By defining y = ψ − ψ_G, it becomes apparent that the desired value of y is 0. Additionally, y can be seen as a second-order system with respect to the control variable u. Hence, we can obtain the following expression:

\ddot{y} = f + b_0 u   (13)

where f denotes the total unknown disturbance of the system, and b_0 is an adjustable parameter. Then define the states as x_1 = y, x_2 = \dot{y}, and x_3 = f, and a LESO can be designed to estimate the disturbance:

\dot{\hat{x}} = A\hat{x} + Bu + L(y - \hat{y}), \qquad \hat{y} = C\hat{x},   (14)

where A = \begin{bmatrix} 0 & 1 & 0 \\ 0 & 0 & 1 \\ 0 & 0 & 0 \end{bmatrix}, B = \begin{bmatrix} 0 \\ b_0 \\ 0 \end{bmatrix}, L = \begin{bmatrix} l_1 \\ l_2 \\ l_3 \end{bmatrix}, C = \begin{bmatrix} 1 & 0 & 0 \end{bmatrix}, and \hat{x} = \begin{bmatrix} \hat{x}_1 \\ \hat{x}_2 \\ \hat{x}_3 \end{bmatrix}. The vector L represents the gain of the observer, while \hat{x} denotes the observed states


of x. By adjusting the observer gains, it is possible to get \hat{x} \approx x. After that, the following PD control law is designed to eliminate the estimated disturbance:

u = \frac{-k_p\hat{x}_1 - k_d\hat{x}_2 - \hat{x}_3}{b_0}   (15)

In Eq. 15, k_p and k_d are controller parameters. By substituting Eq. 15 into Eq. 13, it is apparent that the disturbance f is canceled out. In addition, during the controller design process, there are many parameters that need to be adjusted, i.e., b_0, l_1, l_2, l_3, k_p, and k_d. With the help of the pole configuration method in Ref. [18], it can be obtained that l_1 = 3\omega_o, l_2 = 3\omega_o^2, l_3 = \omega_o^3, where \omega_o is positive.
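As an illustration of Eqs. 13-15, the following is a minimal discrete-time sketch of the LESO update and the PD-plus-disturbance-rejection law. The forward-Euler integration, step size, and function signature are assumptions made for readability, not the authors' simulation code.

import numpy as np

def ladrc_step(x_hat, y, u_prev, b0, wo, kp, kd, dt=0.01):
    """One LESO + PD step for the second-order LADRC of Eqs. 13-15.
    x_hat = [y_est, ydot_est, f_est]; gains follow l1 = 3*wo, l2 = 3*wo^2, l3 = wo^3."""
    A = np.array([[0.0, 1.0, 0.0],
                  [0.0, 0.0, 1.0],
                  [0.0, 0.0, 0.0]])
    B = np.array([0.0, b0, 0.0])
    L = np.array([3 * wo, 3 * wo ** 2, wo ** 3])
    # Eq. 14 (forward-Euler discretization of the LESO)
    x_hat = x_hat + dt * (A @ x_hat + B * u_prev + L * (y - x_hat[0]))
    # Eq. 15: PD law that also cancels the estimated disturbance x_hat[2]
    u = (-kp * x_hat[0] - kd * x_hat[1] - x_hat[2]) / b0
    return x_hat, u

x_hat, u = ladrc_step(np.zeros(3), y=0.05, u_prev=0.0, b0=0.5, wo=6, kp=0.01, kd=0.1)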

4 SAC Optimized LADRC

The DRL algorithm exhibits robust autonomous decision-making capabilities. Therefore, this paper employs a SAC algorithm to derive the kp and kd in realtime for the controller, as shown in Fig. 3. This approach facilitates adaptation of the LADRC controller to changing environmental conditions and enables adaptive trajectory tracking.

Fig. 3. Schematic diagram of SAC-optimized LADRC

The SAC algorithm comprises two critic networks Q_{θ1}(s, a) and Q_{θ2}(s, a), two target networks Q_{θ'1}(s, a) and Q_{θ'2}(s, a), and an actor network π_φ(s), as shown in Algorithm 1. The difference between the Q networks in the SAC algorithm and those in other DRL algorithms lies in the addition of the entropy H(π_φ(·|s)) = −log(π_φ(·|s)), which improves the exploration performance of SAC. In this paper, our objective is to utilize the SAC algorithm to train an agent so as to acquire the adaptive controller parameters based on the observed states. The environment is composed of the established parafoil model and the LADRC controller, while the agent primarily refers to the actor network in SAC. To


Algorithm 1. SAC Algorithm
Initialize critic network weights Q_{θ1}(s, a), Q_{θ2}(s, a) and actor network π_φ(s) with random parameters θ1, θ2 and φ.
Initialize target networks Q_{θ'1}(s, a) and Q_{θ'2}(s, a) with weights θ'1 ← θ1 and θ'2 ← θ2.
Initialize replay buffer D.
if t ≤ T then
  Sample a mini-batch of m transitions B = {(s, a, r, s')} from D.
  Compute targets for the Q functions: y(s', r) = r + γ( min_{i=1,2} Q_{θ'i}(s', a') − α log π_φ(a'|s') ).
  Update critics by gradient descent on ∇_{θi} (1/m) Σ_{(s,a,r,s')∈B} (Q_{θi}(s, a) − y(s', r))².
  Update the policy by gradient ascent on ∇_φ (1/m) Σ_{(s,a,r,s')∈B} ( min_{i=1,2} Q_{θi}(s, ·) − α log π_φ(·|s) ).
  Update target networks by the moving average method: θ'i ← τ θi + (1 − τ) θ'i.
end if
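For readers less familiar with SAC, the critic target above can be written in a few lines of PyTorch. The network objects passed in, the fixed entropy coefficient alpha, and the batch layout are illustrative assumptions rather than the exact training setup of this paper.

import torch

def sac_critic_target(reward, next_state, actor, q1_target, q2_target,
                      gamma=0.99, alpha=0.2):
    """y(s', r) = r + gamma * ( min_i Q_target_i(s', a') - alpha * log pi(a'|s') )."""
    with torch.no_grad():
        next_action, log_prob = actor(next_state)            # a' ~ pi(.|s'), with log-prob
        q_min = torch.min(q1_target(next_state, next_action),
                          q2_target(next_state, next_action))
        return reward + gamma * (q_min - alpha * log_prob)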

ensure proper interaction between the environment and the agent, it is necessary to first define the state space, action space, and reward value. Since the parameters to be optimized are k_p and k_d, the action space is expressed as {a_1, a_2 ∈ A | a_1 = k_p, a_2 = k_d}. As for the state space, it is defined as {s_1, s_2 ∈ S | s_1 = y, s_2 = \dot{y}}. Then the following reward function is designed:

r = \begin{cases} -(|y| + |\dot{y}|) + 10\tanh(1/|y|), & \text{if } |y| \le 0.1 \text{ and } |\dot{y}| \le 1 \\ -(|y| + |\dot{y}|), & \text{otherwise} \end{cases}   (16)

Fig. 4. Structure of critic and actor network

The objective behind formulating Eq. 16 is to establish a direct relationship between the state values y, \dot{y} and the magnitude of the reward, whereby the closer the state values approach zero, the higher the corresponding reward value will be. Also, to encourage the powered parafoil to stay in a steady state (|y| ≤ 0.1 and |\dot{y}| ≤ 1), a positive reward 10 tanh(1/|y|) is incorporated.
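A direct transcription of Eq. 16 into Python looks as follows; the small epsilon guarding against division by zero is an added safeguard, not part of the original formula.

import math

def reward(y, y_dot, eps=1e-6):
    """Reward of Eq. 16: penalize |y| + |y_dot|, with a bonus near the steady state."""
    base = -(abs(y) + abs(y_dot))
    if abs(y) <= 0.1 and abs(y_dot) <= 1.0:
        return base + 10.0 * math.tanh(1.0 / (abs(y) + eps))
    return base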

5 Simulation Results

In this section, the effectiveness of the proposed SAC optimized LADRC method is verified by simulations. The main dimensions of the parafoil system studied are shown in Table 1. Moreover, the SAC algorithm employs the following parameters: γ = 0.9 and τ = 0.001. Figure 4 displays the structure diagram of the critic and actor networks.

Table 1. Main dimensions of the powered parafoil system
Parameter description   Value
Wing span               4.5 m
Chord length            1.3 m
Mass of parafoil        1.7 kg
Mass of payload         20 kg
Wing area               6.5 m²
Aspect ratio            3.46
Rope length             3 m
Installation angle      10°

For the control system, the simulation sample step is 0.01 s. The parameters are set as b_0 = 0.5, ω_o = 6, k = 0.1 and α = 0.2, and the action space for SAC is limited as

k_p \in [0.001, 0.02], \quad k_d \in [0.02, 0.5],   (17)

where this range is determined by trial and error on conventional controllers. The parafoil system was initially positioned at (0, 20) m with an initial velocity of V_s = V_w = [14.9, 0, 2.1]^T m/s and W_s = W_w = [0, 0, 0]^T rad/s. And the target trajectory is described as

x_d(\cdot) = 8(\cdot), \quad y_d(\cdot) = 8(\cdot),   (18)

with the update rule \dot{(\cdot)} = u_d / \sqrt{\dot{x}_d^2(\cdot) + \dot{y}_d^2(\cdot)}.
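Under the assumption that the path parameter (whose symbol is not recoverable here) is denoted sigma purely for this sketch, the virtual-target update for the straight-line reference can be integrated as follows.

import math

def advance_path_parameter(sigma, u_d, dt=0.01, slope=8.0):
    """Straight-line reference x_d = slope*sigma, y_d = slope*sigma (Eq. 18).
    The parameter rate follows sigma_dot = u_d / sqrt(x_d'^2 + y_d'^2)."""
    speed = math.hypot(slope, slope)        # norm of d(x_d, y_d)/d sigma
    sigma += dt * u_d / speed               # update rule from the text
    return sigma, (slope * sigma, slope * sigma)

sigma, target = advance_path_parameter(0.0, u_d=14.9)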

Besides, the following two cases are

considered: Case A: The influence of wind on the system is not considered; Case B: The wind of 3 m/s in the X-direction, 2 m/s in the Y-direction is added at 60 s, and lasts until the end of the simulation. The simulation results are shown in Figs. 5, 6 and 7. Figure 5(a) visually depicts the flight trajectory of the parafoil in the plane, with ’LADRC with


fixed parameter’ representing the trajectory obtained using the boundary value kp = 0.001 and kd = 0.02 in windless conditions. Figure 5(b) shows the change process of the control variable u. By observing Fig. 6, it is evident that the proposed method is a highly effective approach for trajectory tracking control. And even under the influence of wind, the tracking error can reach a steady state. Figure 7 shows the controller parameters obtained by the SAC algorithm without wind disturbance. It should be pointed out that the output of the SAC algorithm is that the action is obtained after passing through the Gaussian distribution, so the parameter changes in Fig. 7 are relatively drastic.

Fig. 5. Trajectory tracking and corresponding control variable

Fig. 6. Trajectory tracking errors

Fig. 7. Adaptive parameters: (a) k_p and (b) k_d, plotted against time (s)

6 Conclusion

This paper presents an intelligent trajectory tracking approach for unmanned parafoil systems, based on a SAC optimized LADRC strategy. The parafoil system’s plane trajectory tracking requires the pulling of either the left or right paracord, resulting in fewer control variables than controlled variables. To address this underactuated problem, a guidance law is proposed. Next, a second-order LADRC is designed, and the SAC algorithm is employed to obtain the adaptive parameters of the LADRC. At last, we utilized a straight-line trajectory for conducting the simulation test and observed that the proposed method effectively achieves trajectory tracking control of the parafoil. Acknowledgements. This work was supported by the National Natural Science Foundation of China (Grant Nos. 61973172, 61973175, 62003175, 62003177 and 62073177) and Key Technologies Research and Development Program of Tianjin (Grant No. 19JCZDJC32800).

References 1. Tao, J., Sun, Q., Tan, P., Chen, Z., He, Y.: Active disturbance rejection control (ADRC)-based autonomous homing control of powered parafoils. Nonlinear Dyn. 86(3), 1461–1476 (2016). https://doi.org/10.1007/s11071-016-2972-1 2. Sun, H., Sun, Q., Sun, M., et al.: Accurate modeling and homing control for parafoil delivery system based on wind disturbance rejection. IEEE Trans. Aerosp. Electron. Syst. 58(4), 2916–2934 (2022) 3. Wachlin, J., Ward, M., Costello, M.: In-canopy sensors for state estimation of precision guided airdrop systems. Aerosp. Sci. Technol. 90, 357–367 (2019) 4. Zhu, E., Sun, Q., Tan, P., et al.: Modeling of powered parafoil based on Kirchhoff motion equation. Nonlinear Dyn. 79, 617–629 (2015) 5. Tan, P., Sun, M., Sun, Q., et al.: Dynamic modeling and experimental verification of powered parafoil with two suspending points. IEEE Access 8, 12955–12966 (2020)


6. Guo, Y., Yan, J., Wu, C., et al.: Modeling and practical fixed-time attitude tracking control of a paraglider recovery system. ISA Trans. 128, 391–401 (2022) 7. Lakhekar, G.V., Waghmare, L.M.: Robust self-organising fuzzy sliding mode-based path-following control for autonomous underwater vehicles. J. Mar. Eng. Technol. 22(3), 131–152 (2023) 8. Sun, Q., Yu, L., Zheng, Y., et al.: Trajectory tracking control of powered parafoil system based on sliding mode control in a complex environment. Aerosp. Sci. Technol. 122, 107406 (2022) 9. Luo, S., Sun, Q., Sun, M., Tan, P., Wu, W., Sun, H., Chen, Z.: On decoupling trajectory tracking control of unmanned powered parafoil using ADRC-based coupling analysis and dynamic feedforward compensation. Nonlinear Dyn. 92(4), 1619–1635 (2018). https://doi.org/10.1007/s11071-018-4150-0 10. Li, Y., Zhao, M., Yao, M., et al.: 6-DOF modeling and 3D trajectory tracking control of a powered parafoil system. IEEE Access 8, 151087–151105 (2020) 11. Zheng, Y., Tao, J., Sun, Q., Zeng, X., Sun, H., Sun, M., Chen, Z.: DDPG-based active disturbance rejection 3D path-following control for powered parafoil under wind disturbances. Nonlinear Dyn. 111, 11205–11221 (2023) 12. Zheng, Y., Tao, J., Sun, Q., et al.: Deep-reinforcement-learning-based active disturbance rejection control for lateral path following of parafoil system. Sustainability 15(1), 435 (2022) 13. Gaudet, B., Linares, R., Furfaro, R.: Deep reinforcement learning for six degree-offreedom planetary powered descent and landing. arXiv preprint arXiv:1810.08719. (2018) https://doi.org/10.48550/arXiv.1810.08719 14. Challita, U., Saad, W., Bettstetter, C.: Interference management for cellularconnected UAVs: a deep reinforcement learning approach. IEEE Trans. Wirel. Commun. 18(4), 2125–2140 (2019) 15. Wang, C., Wang, J., Shen, Y., et al.: Autonomous navigation of UAVs in large-scale complex environments: a deep reinforcement learning approach. IEEE Trans. Veh. Technol. 68(3), 2124–2136 (2019) 16. Haarnoja, T., Zhou, A., Hartikainen, K., et al.: Soft actor-critic algorithms and applications. arXiv preprint arXiv:1812.05905. (2018) https://doi.org/10.48550/ arXiv.1812.05905 17. Haarnoja, T., Zhou, A., Abbeel, P., et al.: Soft actor-critic: off-policy maximum entropy deep reinforcement learning with a stochastic actor. In: Proceedings of the 35th International Conference on Machine Learning, PMLR 2018, vol. 80, pp. 1861–1870 (2018) 18. Gao, Z.: Scaling and bandwidth-parameterization based controller tuning. In: Proceedings of the 2003 American Control Conference, pp. 4989–4996. IEEE (2003) https://doi.org/10.1109/ACC.2003.1242516

CATS: Connection-Aware and Interaction-Based Text Steganalysis in Social Networks

Kaiyi Pang1, Jinshuai Yang1, Yue Gao1(B), Minhao Bai1, Zhongliang Yang3, Minghu Jiang1, and Yongfeng Huang1,2

1 Tsinghua University, Beijing 100084, China
{pky22,yjs20,bmh20}@mails.tsinghua.edu.cn, [email protected], {jiang.mh,yfhuang}@tsinghua.edu.cn
2 Zhongguancun Laboratory, Beijing 100094, China
3 School of Cyberspace Security, Beijing University of Posts and Telecommunications, Beijing 100876, China
[email protected]

Abstract. Generative linguistic steganography in social networks poses potentially huge abuse and regulatory risks, with serious implications for information security, especially in the era of large language models. Many works have explored detecting steganographic texts of progressively enhanced imperceptibility, but they achieve only poor performance in real social network scenarios. One key reason is that these methods focus primarily on linguistic features, which are extremely insufficient owing to the fragmentation of social texts. In this paper, we propose a novel method called CATS (Connection-aware and interAction-based Text Steganalysis) to effectively detect potentially malicious steganographic texts. CATS captures social network connection information by graph representation learning, enhances linguistic features by contrastive learning, and fully integrates these features via a novel feature interaction module. Our experimental results demonstrate that CATS outperforms existing methods by exploiting social network graph structure features and their interactions in social network environments.

Keywords: linguistic steganalysis · social networks · graph structure

1 Introduction

Steganography is a technology for embedding secret messages, such as malicious programs or criminal schemes, in seemingly innocuous carriers, making these messages imperceptible to surveillance in public channels. Steganography is therefore vulnerable to exploitation by unscrupulous individuals [1]. To counter such potential abuse, research on detecting steganographic carriers (termed "steganalysis") has emerged in recent years. However,


to escape detection, recent advanced steganography methods try to minimize the statistical difference between steganographic carriers (referred to as "stegos") and normal carriers (referred to as "covers") when embedding secret messages. This evolution of steganography calls for steganalysis to keep designing more powerful feature extractors f(·) that can keenly perceive the statistical differences between stegos and covers.

Many kinds of information carriers can be used by steganography to transfer messages, and some are tremendously harmful. In particular, social platforms are now easily accessible channels for exchanging information, and social networks have become one of the best platforms for information hiding due to their wide user base, convenient dissemination channels, and natural anonymity. Social media carriers such as images [2], audio [3], and text [4] can all be employed for steganography. Among these carriers, popularity, accessibility, and robustness make text one of the most widely used steganographic carriers in social networks, leading to a surge of research on text steganography (also called linguistic steganography) [5,6], especially generative linguistic steganography [4,7–10], owing to its high payload and flexibility. State-of-the-art generative linguistic steganography methods embed secret information by encoding the conditional probabilities of a language model. As more powerful language models [13,14,36] and encoding algorithms [4,7,9,10,15] are developed, the statistical divergence between stegos and covers diminishes, posing a serious challenge for steganalysis. Recent large language models (such as ChatGPT [11] and LLaMA [12]) have shown impressive generation ability, which will further expand the possibilities of generative linguistic steganography and further challenge linguistic steganalysis.

Notwithstanding these challenges, linguistic steganalysis researchers have been working towards identifying statistical differences between stegos and covers. Earlier researchers applied classical text classification models to steganalysis, but these traditional methods could only capture shallow statistical information by constructing manual features such as word frequencies or transition probabilities between words [16–20]. More recently, natural language processing has enabled deeper semantic information mining for steganalysis without manual features. Researchers have explored linguistic features from multiple aspects, ranging from single features [14,21,22] to the fusion of multiple specific-scale features [23–26], from features between adjacent words [21] to graph features [27,28], and from sentence semantic features [29] to combined syntactic structure information [30]. However, these methods suffer a fatal setback in real-world social network scenarios, as pointed out by Yang et al. [31]. One important reason is that texts on social networks are extremely fragmented. On the one hand, they are usually casual short messages with few linguistic feature patterns. On the other hand, they usually cannot express complete semantics without their social context. This extreme fragmentation means that existing methods relying heavily on linguistic features obtain insufficient evidence and make unreliable decisions.
It is easy to notice that social texts are not isolated but are usually interconnected through comments and replies. An intuitive idea is that existing


linguistic-feature-based steganalysis methods can be enhanced by exploiting connection information. One group [31] has already recognized the need for steganalysis methods suited to social network scenarios and proposed a naive framework; to the best of our knowledge, this naive framework is the only work addressing the problem. However, it treats linguistic features and connection features, which are derived from different spaces, as equivalent, and simply concatenates the features from the two aspects without proper fusion, which causes a misalignment problem and leads to limited performance in social network scenarios. To effectively detect stegos in social networks, in this paper we propose CATS, a new steganalysis method that makes full use of the interaction between texts and their associated texts. Specifically, our model is enhanced from both the external and the internal aspects of the texts. We use Graph Neural Networks (GNNs) and graph representation learning to capture the social network context of each text, and introduce a contrastive learning module to further enhance the linguistic information of the text itself. We then introduce a module that facilitates full interaction between graph features and linguistic features to obtain fully integrated fusion features. Finally, CATS incorporates linguistic features, context features, and fusion features to make decisions. Extensive experiments demonstrate that the interaction between texts and their associated texts can significantly enhance detection in social network scenarios.

2 Proposed Method

2.1 Task Modeling

The essential task of steganalysis is to construct a classifier that distinguishes stegos from covers. In general, a text $x_i$ is judged by the classifier to be steganographic if

$p(S \mid f(x_i)) > p(C \mid f(x_i))$,  (1)

where $x_i = \{w_1, w_2, ..., w_n\}$ denotes the $i$-th text with $n$ words, $S$ represents the category of stegos, $C$ represents the category of covers, and $f(\cdot)$ is the feature extractor learned by the steganalysis method. Existing methods, such as [21,23,27,32], have focused on designing better linguistic feature extractors. These methods are limited in social network scenarios, where they can only obtain insufficient information from a single fragmented text. Besides, advanced steganographic algorithms [9] increasingly aim to reduce the differences in statistical distribution between stegos and covers, making detection more difficult. In social network scenarios, texts and their interconnections form a graph $G(V, E)$, where $V$ is the set of nodes consisting of social network texts and $E$ is the set of edges corresponding to inter-textual relations such as comments and replies. When we focus on a particular social text $x_i$ (equivalently, node $v_i$) and its connected relations, we are actually focusing on a subgraph $G_i(V_i, E_i)$, where $V_i \subseteq V$ and $E_i = \{e_{jk} = (v_j, v_k) \mid v_j, v_k \in V_i\}$. To enhance


the analysis of stegos in social networks, which often exhibit fragmentation and sparsity, we propose to utilize the graph structure $G_i(V_i, E_i)$ to the fullest extent. In addition to designing an improved linguistic feature extractor $f_{lin}(\cdot)$, we propose a graph structure feature extractor $f_g(\cdot)$ and an interaction function $I(\cdot)$ that aligns linguistic features and graph structure features within a unified space to facilitate seamless interaction and fusion of the two feature types. Therefore, we propose a new steganalysis model that discriminates a text as a stego if

$p(S \mid I(f_{lin}(x_i), f_g(G_i))) > p(C \mid I(f_{lin}(x_i), f_g(G_i)))$,  (2)

where $f_{lin}(x_i)$ and $f_g(G_i)$ are the linguistic features and graph features of text $x_i$, respectively, and $I(f_{lin}(x_i), f_g(G_i))$ is the integrated feature fed to the discriminator. Ultimately, we design a discriminator that decides whether a text is steganographic based on the probability $p(\cdot \mid I(f_{lin}(x_i), f_g(G_i)))$. In other words, in order to capture more information from social network texts, we enrich and refine the features from the external, internal, and internal-external fusion perspectives, respectively. The overall architecture of our method is shown in Fig. 1.

Fig. 1. The overall architecture of the proposed CATS. Our method includes a linguistic feature extractor $f_{lin}(\cdot)$, a social network graph structure feature extractor $f_g(\cdot)$, a feature interaction module $I(f_{lin}, f_g)$, and a discriminator.

2.2 Internal Feature Augmentation: Extracting Linguistic Features

In the realm of combating increasingly imperceptible steganography algorithms, researchers have proposed a series of potent linguistic feature extractors tailored to individual texts. In CATS, the backbone of the linguistic feature extraction module can actually be any existing method [22,23,27,30]. Specifically,


taking into account the efficiency requirements for detecting massive amounts of text in social networks, we deploy the effective TS-CSW [23] method as the backbone of $f_{lin}$. Furthermore, we augment our model with a contrastive learning module, namely unsupervised SimCSE [33], to improve sentence representation in a self-supervised manner in which dropout acts as minimal data augmentation. We produce two embeddings of the same input sentence $x_i$ as a "positive pair" using different dropout masks, while the other sentences in the same mini-batch are regarded as "negatives"; the module then predicts the positive one among the negatives. The training target is the contrastive loss, which, for a batch of $N$ sentences, is defined as

$loss_{CL} = -\log \frac{e^{sim(h_i, h_i')}}{\sum_{j=1}^{N} e^{sim(h_i, h_j')}}$,  (3)

where $h_i$ and $h_i'$ are the embeddings of sentence $x_i$ produced with different dropout masks, and $sim(\cdot)$ is a similarity measure, which can be the cosine similarity.
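For illustration, the following is a minimal PyTorch sketch of this unsupervised SimCSE-style objective; the encoder interface, the batch construction, and the temperature term are assumptions made for the example rather than details taken from the paper.

```python
import torch
import torch.nn.functional as F

def simcse_contrastive_loss(encoder, input_ids, attention_mask, temperature=0.05):
    """Unsupervised SimCSE-style loss: the same batch is encoded twice so that
    dropout yields two views of each sentence; matching views are positives,
    all other sentences in the batch act as negatives."""
    # Two forward passes with dropout active give two embeddings per sentence.
    h1 = encoder(input_ids, attention_mask)   # (N, d); `encoder` is an assumed callable
    h2 = encoder(input_ids, attention_mask)   # (N, d), different dropout mask
    # Cosine similarity between every pair of views, scaled by a temperature.
    sim = F.cosine_similarity(h1.unsqueeze(1), h2.unsqueeze(0), dim=-1) / temperature  # (N, N)
    # The positive for sentence i is its own second view, i.e. the diagonal entry.
    labels = torch.arange(sim.size(0), device=sim.device)
    return F.cross_entropy(sim, labels)
```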

2.3 External Feature Augmentation: Extracting Social Network Graph Features

The limited applicability of existing methods to social networks is primarily attributed to their neglect of the underlying graph structure. The problem of classifying steganographic texts in social networks can be cast as a node classification problem on a graph. To effectively leverage the graph structure of social network texts, we propose to use gated attention networks (GaAN) [34] as the aggregation function $f_g(\cdot)$. GaAN is a multi-head attention-based aggregator with additional gates on the attention heads, and it captures the representation of $G_i$ by aggregating sub-graph information from neighbor nodes. We employ $L$ GNN layers. For each GNN layer $l$, we use a function $M_l$ to aggregate the neighborhood information and a function $U_l$ to update the state information of node $i$:

$z_i^l = \sum_{j \in N(i)} M_l(h_i^{l-1}, h_j^{l-1}, e_{ij})$,  (4)

$h_i^l = U_l(h_i^{l-1}, z_i^l)$,  (5)

where $z_i^l$ denotes, for the central node $i$, the information aggregated from neighbor nodes within $l$ hops, $N(i)$ is the set of neighbors of node $i$, and $e_{ij}$ is the edge connecting node $i$ to node $j$. The initial state $h_i^0$ is the corresponding linguistic feature $f_{lin}(x_i)$ of node $i$. An attention mechanism is used to assign more weight to the more critical information from different neighbors. For the $k$-th attention head, the attention weight between node $i$ and node $j$ is calculated by

$\alpha_{ji} = \frac{\exp(d(W h_i^{l-1}, W h_j^{l-1}))}{\sum_{u \in N(i)} \exp(d(W h_i^{l-1}, W h_u^{l-1}))}$,  (6)


where $W$ is the shared weight matrix and $d(W h_i^{l-1}, W h_j^{l-1})$ is calculated by

$d(W h_i^{l-1}, W h_j^{l-1}) = \mathrm{LeakyReLU}(\beta [W h_i^{l-1} \,\|\, W h_j^{l-1}])$,  (7)

where $\beta$ is a parameter vector and $[\cdot \| \cdot]$ denotes concatenation. The state information of the center node $i$ is then calculated by

$h_i^{lk} = \sigma\Big(\sum_{j \in N(i)} \alpha_{ji} W h_j^{l-1}\Big)$,  (8)

where $\sigma$ is a non-linear activation function and $h_i^{lk}$ is the information aggregated by the $k$-th head for node $i$. We also compute a soft gate between 0 and 1 to assign different importance to each head by employing a convolutional network $\psi_g$, which takes the features of node $i$ and its neighbors to generate $g_i$:

$g_i = [g_i^{(1)}, ..., g_i^{(K)}] = \psi_g(h_i^{l-1}, z_i^l)$,  (9)

where $g_i^{(k)}$ is the gate value of the $k$-th head at node $i$. We then concatenate $h_i^{lk}$ from all attention heads, weighted by these gates, to obtain the final aggregated information:

$h_i^l = \big\|_{k=1}^{K} g_i^{(k)} h_i^{lk}$.  (10)

Finally, after passing through the $L$-layer GNN, we obtain the feature vector of the graph structure $G_i$, aggregated iteratively from its $L$-hop neighbors.
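For illustration, a simplified PyTorch sketch of one gated multi-head attention aggregation layer in the spirit of Eqs. (6)–(10) is given below; the dense adjacency matrix with self-loops, the MLP gate (in place of the convolutional gate network $\psi_g$), and ELU as $\sigma$ are simplifying assumptions.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class GatedGraphAttention(nn.Module):
    """Simplified gated multi-head graph attention aggregation (GaAN-style)."""
    def __init__(self, dim, num_heads=4):
        super().__init__()
        assert dim % num_heads == 0
        self.h, self.d = num_heads, dim // num_heads
        self.W = nn.Linear(dim, dim, bias=False)                       # shared projection W
        self.beta = nn.Parameter(torch.randn(num_heads, 2 * self.d))   # attention vector per head
        self.gate = nn.Sequential(nn.Linear(2 * dim, num_heads), nn.Sigmoid())

    def forward(self, x, adj):
        # x: (N, dim) node states h^{l-1}; adj: (N, N) 0/1 adjacency assumed to include self-loops
        N = x.size(0)
        z = self.W(x).view(N, self.h, self.d)                          # W h, split into heads
        zi = z.unsqueeze(1).expand(N, N, self.h, self.d)
        zj = z.unsqueeze(0).expand(N, N, self.h, self.d)
        # d(W h_i, W h_j) = LeakyReLU(beta [W h_i || W h_j]) for every pair and every head
        e = F.leaky_relu((torch.cat([zi, zj], dim=-1) * self.beta).sum(-1))   # (N, N, heads)
        e = e.masked_fill(adj.unsqueeze(-1) == 0, float("-inf"))
        alpha = torch.softmax(e, dim=1)                                 # normalize over neighbours j
        head_out = F.elu(torch.einsum("ijh,jhd->ihd", alpha, z))        # per-head aggregation, Eq. (8)
        # Soft gates g_i^(k) from the node state and the mean of its neighbours (Eq. (9))
        neigh = adj @ x / adj.sum(dim=1, keepdim=True).clamp(min=1)
        g = self.gate(torch.cat([x, neigh], dim=-1))                    # (N, heads), values in (0, 1)
        return (g.unsqueeze(-1) * head_out).reshape(N, -1)              # gated concatenation, Eq. (10)
```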

2.4 Internal and External Hybrid Augmentation: Feature Interaction and Comprehensive Discriminator

After extracting linguistic and graph structure features, we obtain $f_{lin}(x_i)$ and $f_g(G_i)$, respectively. However, these features come from different modalities, so we need a suitable fusion module $I(f_{lin}(x_i), f_g(G_i))$ that lets the linguistic features and graph structure features fully interact and projects them into the same space. In this paper, we adopt an attention module to implement this fine-grained fusion, where the linguistic feature $f_{lin}(x_i)$ serves as the key and value, and the graph structure feature $f_g(G_i)$ serves as the query:

$f_{fusion}(x_i, G_i) = att(f_{lin}(x_i), f_g(G_i))$,  (11)

where $f_{fusion}(x_i, G_i)$ is the fused feature of the linguistic and graph structure features, and $att(\cdot)$ can be any existing attention function; we use multi-head attention here. To facilitate classification, we concatenate the linguistic features, graph structure features, and integrated features as the basis for classification, denoted as

$f_{classify}(x_i, G_i) = f_{lin}(x_i) \oplus f_g(G_i) \oplus f_{fusion}(x_i, G_i)$.  (12)

To discriminate whether a text $x_i$ (node $i$) is a stego, we design a discriminator that makes a decision based on $f_{classify}(x_i, G_i)$. We use a simple weight matrix $W_C$ to evaluate the power of each feature for detecting stegos, taking into account the different weights of the features in the final judgment:

$p(\cdot \mid I(f_{lin}(x_i), f_g(G_i))) = W_C f_{classify}(x_i, G_i) + Bias$.  (13)
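A minimal sketch of this interaction and decision stage is given below, assuming token-level linguistic features and a single graph feature per text; the mean pooling, the dimensions, and the two-class output head are illustrative assumptions.

```python
import torch
import torch.nn as nn

class FeatureInteraction(nn.Module):
    """Graph features attend over linguistic token features (query = graph,
    key/value = linguistic); the concatenation of linguistic, graph and fused
    features feeds a linear discriminator."""
    def __init__(self, dim, num_heads=4):
        super().__init__()
        self.att = nn.MultiheadAttention(dim, num_heads, batch_first=True)
        self.classifier = nn.Linear(3 * dim, 2)   # logits for cover vs. stego

    def forward(self, f_lin, f_graph):
        # f_lin: (B, L, dim) token-level linguistic features
        # f_graph: (B, dim) graph structure feature of the target text
        q = f_graph.unsqueeze(1)                     # (B, 1, dim) used as the query
        fused, _ = self.att(q, f_lin, f_lin)         # multi-head attention fusion
        fused = fused.squeeze(1)
        f_lin_pooled = f_lin.mean(dim=1)             # pool tokens into a sentence vector
        feats = torch.cat([f_lin_pooled, f_graph, fused], dim=-1)
        return self.classifier(feats)
```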


3 Experiments and Analysis

3.1 Experiment Settings

In this paper, we evaluate our proposed method using Stego-Sandbox [31], which contains texts and connection information collected from mainstream social networks, including Reddit, Twitter, and Sina Weibo. Our experiments simulate different social network scenarios by varying the linguistic steganography algorithm, the embedding payload, and the sparsity ratio of stegos. We employ the generative linguistic steganography methods Huffman Coding (HC) [14], Arithmetic Coding (AC) [7], and Adaptive Dynamic Grouping (ADG) [9] to obtain steganographic texts. Additionally, we consider the sparsity characteristics of stegos in real social networks and set the sparsity ratio of stegos (SRS) from 10% to 50%; SRS = 10% and SRS = 50% represent two typical scenarios, the sparse scenario and the generally balanced scenario targeted by other baseline methods, respectively. We compare the proposed method with six representative linguistic steganalysis baselines: TS-FCN [21], TS-RNN [32], TS-CSW [23], TS-ATT [22], TS-GNN [27], and TS-FCN+GRAPH [31]. TS-FCN [21] is a deep learning method that uses a single-layer fully connected network to capture word-level correlation information. TS-RNN [32] employs a two-layer bidirectional RNN to model the temporal connections between words. TS-CSW [23] uses multi-kernel CNNs to capture multi-gram statistical features. TS-ATT [22] uses an LSTM to obtain sequential features and an attention mechanism to identify local features that are discordant within a text. TS-GNN [27] employs GNNs to capture the global graph information between words, yet is still limited to a single isolated text. TS-ATT+GRAPH [31] is a typical linguistic steganalysis method that first considers social network connection information. In practice, we chose subgraphs with 1000 random-walk roots and a walk length of 2 for sampling, and then predicted the label for each node in the chosen subgraphs. We evaluated on the validation set after each epoch and selected the best model based on the F1 score, which balances precision and recall, since accuracy is not a reliable indicator in unbalanced cases. To obtain the average F1 score, we repeated the process 10 times. We employed the AdamW [35] optimizer with a learning rate of 0.01 and the cross-entropy loss function. The embedding dimension of the linguistic feature was 128, and we used 2 GNN layers for the graph feature extractor; this choice was based on our observation that too few hops are insufficient for capturing graph information, while too many hops introduce too much noise. A dropout rate of 0.2 was also utilized.
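The stated optimization settings can be mirrored in a few lines; the tiny classifier and the random batch below are placeholders standing in for the full CATS network and the Stego-Sandbox data, and only AdamW with learning rate 0.01, cross-entropy, and the dropout rate of 0.2 are taken from the settings above.

```python
import torch
import torch.nn as nn

# Placeholder classifier and random batch; only the optimizer, learning rate,
# loss, and dropout rate mirror the stated experimental settings.
model = nn.Sequential(nn.Linear(128, 64), nn.ReLU(), nn.Dropout(0.2), nn.Linear(64, 2))
optimizer = torch.optim.AdamW(model.parameters(), lr=0.01)
criterion = nn.CrossEntropyLoss()

features = torch.randn(32, 128)        # stand-in for fused CATS features
labels = torch.randint(0, 2, (32,))    # 0 = cover, 1 = stego

logits = model(features)
loss = criterion(logits, labels)
optimizer.zero_grad()
loss.backward()
optimizer.step()
```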

3.2 Evaluation Results and Discussion

We extensively tested nearly every scenario in Stego-Sandbox [31]. It is clear from Table 1, Table 2 and Table 3 that our approach outperforms existing typical


linguistic steganalysis methods [21–23,27,31,32] in virtually all circumstances, particularly at high embedding rates where our model consistently achieves more significant gains. Specifically, we get three findings: Firstly, introducing and interacting with graph structures in social networks can effectively improve steganalysis detection in social networks. The detection power of the detectors decreases tremendously in the case of high sparsity (SRS = 10%), which reflects the fragmentation and sparsity challenges in social networks. All previous classifiers focusing on linguistic features failed to perform satisfactorily in real, sparse, and fragmented social network scenarios, while CATS can relieve the decreases and bring significant F1 gains. For instance, when dealing with the so-called provably secure steganography algorithm ADG in sparse scenarios, previous models had a meager average F1 score of under 3.5 on the Sina dataset. However, our model can achieve an average F1 gain of 19.21 on the same dataset when facing the ADG challenge by introducing and interacting with graph structures, while still using the same linguistic feature extraction as TS-CSW. Secondly, CATS can obtain a significant improvement over the generally poorly performing baseline methods in scenarios with high embedding rates or in Chinese language environments. As the embedding rate increases, the performance of steganalysis methods decreases, since the statistical difference between stegos and covers decreases. This phenomenon has been previously analyzed as a Psic effect [36]. But our method can get more gains in higher embedding capacity. We attribute this improvement to the critical role that social network graph features play in situations where linguistic statistical differences in a single text are minimal. Our results suggest that the classifier can learn from these graph features to compensate for the lack of linguistic features in the text. Also, baselines performed poorly in the Chinese-based social network Sina compared to Table 1. F1 Result On Sina Encoding Algorithm

HC

BPW

1.00

AC 1.86

ADG

2.59

3.23

3.82

0.24

1.17

2.00

4.52

4.88

5.09

4.13

3.36

SRS = 10% TS-FCN

48.86 37.72

4.60

0.00

49.21 38.29 11.48 0.00

0.00

0.00

0.00

TS-RNN

63.93 57.88

20.87 17.48 7.02

66.58 52.31 35.89 3.75

1.32

2.41

1.95

TS-CSW

63.43 55.16

23.21 10.64 9.12

65.12 52.25 36.00 5.12

5.26

5.97

7.46

TS-ATT

65.78 58.78

21.39 24.13 7.60

64.87 53.77 36.46 4.05

3.11

1.14

1.31

TS-GNN

56.72 44.74

10.44 15.59 5.54

61.86 46.09 28.40 1.07

1.16

0.98

1.61

32.93 34.22 19.25 78.42 61.91 51.19 14.33 11.90 7.79

8.22

TS-CSW+ 71.15 65.32 GRAPH 70.35 64.09

37.75 40.03 27.22 74.49 62.00 47.22 20.07 20.01 20.78 22.63

SRS = 50% TS-FCN

ours

74.23 61.84

50.20 35.77 29.37 82.28 66.34 52.07 2.44

TS-RNN

85.23 74.73

64.81 52.70 46.52 89.32 80.84 68.86 23.20 19.64 16.92 13.26

TS-CSW

84.33 74.39

62.55 48.71 41.06 89.23 80.14 69.45 15.12 19.39 13.16 9.85

TS-ATT

84.76 75.65

63.97 53.95 46.89 89.17 80.38 69.97 21.17 16.93 17.47 12.45

TS-GNN

81.25 68.12

56.23 45.49 41.25 88.00 76.30 60.34 27.16 37.52 23.42 26.88

TS-CSW+ 91.44 86.08 GRAPH

79.53 74.74 70.25 93.05 87.56 83.06 55.24 52.97 47.31 39.46

ours

91.38

3.42

0.80

2.32

87.52 84.39 79.47 77.58 93.21 88.94 85.17 68.59 68.09 68.53 68.48


Table 2. F1 Result On Tweet

Method        | HC (BPW 1.00 / 1.90 / 2.72 / 3.57 / 4.35) | AC (BPW 0.37 / 1.39 / 2.40) | ADG (BPW 6.33 / 6.94 / 7.54 / 8.91)

SRS = 10%
TS-FCN        | 60.59 60.81 60.15 62.97 60.98 | 58.34 59.77 61.49 | 4.98 2.21 0.83 0.21
TS-RNN        | 69.27 68.53 71.20 69.86 70.40 | 62.71 66.08 64.68 | 53.63 47.29 32.71 9.62
TS-CSW        | 70.07 67.94 70.97 68.36 70.77 | 63.42 65.55 64.19 | 54.18 51.19 33.05 13.30
TS-ATT        | 68.41 67.11 69.8 69.14 70.89 | 62.98 64.18 62.44 | 52.56 44.02 29.73 7.85
TS-GNN        | 63.69 62.80 65.03 64.55 66.91 | 60.43 61.99 62.11 | 44.48 39.69 26.09 9.88
TS-CSW+GRAPH  | 71.44 69.96 72.18 69.25 71.17 | 70.50 70.71 69.98 | 54.87 53.39 34.64 14.07
ours          | 73.62 71.91 71.56 69.61 71.17 | 67.65 69.67 67.25 | 55.47 55.81 38.46 19.25

SRS = 50%
TS-FCN        | 88.02 86.89 85.72 80.05 78.47 | 89.83 88.57 86.07 | 63.52 46.30 36.14 43.31
TS-RNN        | 92.83 92.07 91.76 90.76 89.41 | 93.74 92.90 91.41 | 85.18 80.29 73.27 56.99
TS-CSW        | 94.20 92.94 92.14 91.08 89.95 | 94.72 94.33 92.91 | 86.19 80.40 75.37 64.22
TS-ATT        | 93.01 91.94 91.69 90.54 89.43 | 93.88 92.73 91.44 | 85.34 81.04 74.11 50.85
TS-GNN        | 91.72 90.65 90.13 89.13 88.30 | 92.63 91.98 90.19 | 82.90 76.03 72.22 62.41
TS-CSW+GRAPH  | 93.99 92.61 92.08 90.36 89.24 | 94.88 94.04 92.85 | 86.12 82.43 77.81 52.63
ours          | 94.27 93.18 92.56 91.73 90.61 | 95.02 94.35 93.19 | 87.36 84.31 80.18 68.22

Table 3. F1 Result On Reddit

Method        | HC (BPW 1.00 / 1.82 / 2.63 / 3.42 / 4.17) | AC (BPW 0.45 / 1.46 / 2.38) | ADG (BPW 5.82 / 6.28 / 6.71 / 6.93)

SRS = 10%
TS-FCN        | 88.04 82.58 81.06 75.88 67.73 | 86.21 84.93 79.33 | 12.97 3.36 0.00 0.09
TS-RNN        | 90.34 85.70 85.79 81.24 77.25 | 89.21 89.00 84.19 | 54.77 35.02 24.23 5.08
TS-CSW        | 90.77 85.94 86.66 80.32 75.99 | 88.88 89.51 84.86 | 55.01 38.87 27.33 15.13
TS-ATT        | 90.25 86.00 84.79 80.93 76.74 | 88.88 88.92 84.24 | 52.35 34.53 26.11 4.70
TS-GNN        | 88.58 83.50 83.53 76.76 71.28 | 87.72 86.86 81.87 | 45.73 26.82 23.84 7.00
TS-CSW+GRAPH  | 90.90 86.57 85.60 80.95 78.69 | 90.43 89.07 84.20 | 58.59 40.64 29.08 14.94
ours          | 90.03 86.10 85.62 81.16 78.91 | 90.25 89.08 84.93 | 61.22 50.23 39.46 20.13

SRS = 50%
TS-FCN        | 92.49 93.35 90.04 89.13 84.24 | 90.29 90.55 88.70 | 70.56 58.70 36.35 33.10
TS-RNN        | 95.10 95.16 93.13 91.79 88.04 | 95.33 94.70 93.12 | 81.29 74.63 65.55 41.27
TS-CSW        | 95.37 95.21 93.29 91.65 87.93 | 95.50 95.12 93.63 | 83.19 77.62 72.30 49.36
TS-ATT        | 94.97 94.99 92.64 92.07 87.45 | 95.31 94.62 93.28 | 80.77 74.75 64.16 41.01
TS-GNN        | 94.38 94.17 91.57 90.26 85.68 | 95.01 93.84 92.12 | 79.10 74.61 68.81 45.43
TS-CSW+GRAPH  | 95.53 95.46 94.07 92.44 88.73 | 95.59 95.23 93.88 | 83.24 77.86 73.13 54.52
ours          | 95.86 95.70 94.41 92.61 90.86 | 96.19 95.45 94.68 | 86.89 83.15 78.45 69.54

English-based social networks such as Twitter and Reddit, while CATS improves detection even further on Sina, with an average F1 gain of 35.16 against the ADG algorithm, demonstrating its effectiveness. Thirdly, we conducted ablation experiments to explore the effect of each part of the proposed model on detection performance, as presented in Fig. 2. We tested each part of the model under 10% SRS and the maximum payloads of the different steganography algorithms, representing a typically challenging detection scenario in social networks. Four different model


configurations were evaluated: "Linguistic", which uses only the linguistic extractor; "Graph", which uses only the graph structure information; "Linguistic+Graph", which combines both linguistic and graph features through simple concatenation; and "Ours", which exploits the interaction between the linguistic text and the graph structure more deeply on the basis of both linguistic features and graph structure information. Our results indicate that the "Ours" configuration outperforms the other three, while simple concatenation or using only part of the features leads to performance degradation. This finding highlights the importance of mining features and their interrelationships in depth. Interestingly, the performance difference between using only graph features and using only linguistic features is small, and sometimes "Graph" can even exceed the linguistic-only configuration, which indicates that the connectivity of social networks enables a text to learn its own linguistic features to some extent while learning the graph structure.

Fig. 2. The ablation study results. F1 scores in high sparsity (SRS = 10%), high embedding rate scenarios. For HC and AC algorithms, we adopted the maximum payloads. TS-CSW [23] is employed as a linguistic extractor.

4 Conclusion

In this paper, we propose a novel linguistic steganalysis model specifically designed for social network scenarios. Fragmentation and sparsity pose a significant challenge for steganalysis in social networks. Our study shows that taking into account the connections between texts in social networks can improve the detection of powerful generative steganography methods. Therefore, we propose


a method that focuses on extracting text interaction features with social networks from a global perspective. Experimental results show that the proposed method outperforms general steganalysis methods. In the future, we plan to explore more in-depth steganalysis approaches tailored to real social network scenarios. We hope that our research will motivate more researchers to explore steganalysis toward real social network scenarios. Acknowledgements. This work was supported by the National Natural Science Foundation of China under Grant U1936216 and Grant 61862002.

References
1. Bak, P., Bieniasz, J., Krzeminski, M., Szczypiorski, K.: Application of perfectly undetectable network steganography method for malware hidden communication. In: 2018 4th International Conference on Frontiers of Signal Processing (ICFSP), pp. 34–38. IEEE (2018)
2. Li, F., Yu, Z., Qin, C.: GAN-based spatial image steganography with cross feedback mechanism. Signal Process. 190, 108341 (2022)
3. Huang, Y.F., Tang, S., Yuan, J.: Steganography in inactive frames of VoIP streams encoded by source codec. IEEE Trans. Inf. Forensics Secur. 6(2), 296–306 (2011). https://doi.org/10.1109/TIFS.2011.2108649
4. Kaptchuk, G., Jois, T.M., Green, M., Rubin, A.D.: Meteor: cryptographically secure steganography for realistic distributions. In: Proceedings of the 2021 ACM SIGSAC Conference on Computer and Communications Security, CCS 2021, pp. 1529–1548. Association for Computing Machinery, New York (2021). https://doi.org/10.1145/3460120.3484550
5. Wayner, P.C.: Mimic functions. Cryptologia 16, 193–214 (1992)
6. Shirali-Shahreza, M.: Text steganography by changing words spelling. In: 2008 10th International Conference on Advanced Communication Technology, Gangwon, Korea (South), pp. 1912–1913 (2008). https://doi.org/10.1109/ICACT.2008.4494159
7. Ziegler, Z.M., Deng, Y., Rush, A.M.: Neural linguistic steganography. arXiv preprint arXiv:1909.01496 (2019)
8. Zhou, X., Peng, W., Yang, B., Wen, J., Xue, Y., Zhong, P.: Linguistic steganography based on adaptive probability distribution. IEEE Trans. Depend. Secure Comput. 19(5), 2982–2997 (2021)
9. Zhang, S., Yang, Z., Yang, J., Huang, Y.: Provably secure generative linguistic steganography, pp. 3046–3055 (2021). https://aclanthology.org/2021.findings-acl.268
10. de Witt, C.S., Sokota, S., Kolter, J.Z., Foerster, J., Strohmeier, M.: Perfectly secure steganography using minimum entropy coupling. arXiv preprint arXiv:2210.14889 (2022)
11. Ouyang, L., et al.: Training language models to follow instructions with human feedback. arXiv preprint arXiv:2203.02155 (2022)
12. Touvron, H., et al.: LLaMA: open and efficient foundation language models (2023)
13. Dai, W., Yu, Y., Dai, Y., Deng, B.: Text steganography system using Markov chain source model and DES algorithm. JSW 5(7), 785–792 (2010)


14. Yang, Z.L., Guo, X.Q., Chen, Z.M., Huang, Y.F., Zhang, Y.J.: RNN-Stega: linguistic steganography based on recurrent neural networks. IEEE Trans. Inf. Forensics Secur. 14(5), 1280–1295 (2018)
15. Dai, F., Cai, Z.: Towards near-imperceptible steganographic text. In: Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, pp. 4303–4308. Association for Computational Linguistics, Florence (2019). https://aclanthology.org/P19-1422
16. Yang, H., Cao, X.: Linguistic steganalysis based on meta features and immune mechanism. Chin. J. Electron. 19(4), 661–666 (2010)
17. Chen, Z., Huang, L., Miao, H., Yang, W., Meng, P.: Steganalysis against substitution-based linguistic steganography based on context clusters. Comput. Electr. Eng. 37(6), 1071–1081 (2011)
18. Xiang, L., Sun, X., Luo, G., Xia, B.: Linguistic steganalysis using the features derived from synonym frequency. Multimedia Tools Appl. 71(3), 1893–1911 (2014)
19. Meng, P., Hang, L., Chen, Z., Hu, Y., Yang, W.: STBS: a statistical algorithm for steganalysis of translation-based steganography. In: Böhme, R., Fong, P.W.L., Safavi-Naini, R. (eds.) IH 2010. LNCS, vol. 6387, pp. 208–220. Springer, Heidelberg (2010). https://doi.org/10.1007/978-3-642-16435-4_16
20. Meng, P., Hang, L., Yang, W., Chen, Z., Zheng, H.: Linguistic steganography detection algorithm using statistical language model. In: 2009 International Conference on Information Technology and Computer Science, vol. 2, pp. 540–543. IEEE (2009)
21. Yang, Z., Huang, Y., Zhang, Y.J.: A fast and efficient text steganalysis method. IEEE Signal Process. Lett. 26(4), 627–631 (2019)
22. Zou, J., Yang, Z., Zhang, S., Rehman, S., Huang, Y.: High-performance linguistic steganalysis, capacity estimation and steganographic positioning. In: Zhao, X., Shi, Y.-Q., Piva, A., Kim, H.J. (eds.) IWDW 2020. LNCS, vol. 12617, pp. 80–93. Springer, Cham (2021). https://doi.org/10.1007/978-3-030-69449-4_7
23. Yang, Z., Huang, Y., Zhang, Y.J.: TS-CSW: text steganalysis and hidden capacity estimation based on convolutional sliding windows. Multimedia Tools Appl. 79, 18293–18316 (2020)
24. Niu, Y., Wen, J., Zhong, P., Xue, Y.: A hybrid R-BiLSTM-C neural network based text steganalysis. IEEE Signal Process. Lett. 26(12), 1907–1911 (2019)
25. Xu, Q., Zhang, R., Liu, J.: Linguistic steganalysis by enhancing and integrating local and global features. IEEE Signal Process. Lett. 30, 16–20 (2023)
26. Wen, J., Deng, Y., Peng, W., Xue, Y.: Linguistic steganalysis via fusing multi-granularity attentional text features. Chin. J. Electron. 32(1), 76–84 (2023)
27. Wu, H., Yi, B., Ding, F., Feng, G., Zhang, X.: Linguistic steganalysis with graph neural networks. IEEE Signal Process. Lett. 28, 558–562 (2021)
28. Fu, Z., Yu, Q., Wang, F., Ding, C.: HGA: hierarchical feature extraction with graph and attention mechanism for linguistic steganalysis. IEEE Signal Process. Lett. 29, 1734–1738 (2022)
29. Peng, W., Zhang, J., Xue, Y., Yang, Z.: Real-time text steganalysis based on multi-stage transfer learning. IEEE Signal Process. Lett. 28, 1510–1514 (2021)
30. Yang, J., Yang, Z., Zhang, S., Tu, H., Huang, Y.: SeSy: linguistic steganalysis framework integrating semantic and syntactic features. IEEE Signal Process. Lett. 29, 31–35 (2021)
31. Yang, J., Yang, Z., Zou, J., Tu, H., Huang, Y.: Linguistic steganalysis toward social network. IEEE Trans. Inf. Forensics Secur. 18, 859–871 (2023)


32. Yang, Z., Wang, K., Li, J., Huang, Y., Zhang, Y.-J.: TS-RNN: text steganalysis based on recurrent neural networks. IEEE Signal Process. Lett. 26(12), 1743–1747 (2019)
33. Gao, T., Yao, X., Chen, D.: SimCSE: simple contrastive learning of sentence embeddings. arXiv preprint arXiv:2104.08821 (2021)
34. Zhang, J., Shi, X., Xie, J., Ma, H., King, I., Yeung, D.Y.: GaAN: gated attention networks for learning on large and spatiotemporal graphs. In: 34th Conference on Uncertainty in Artificial Intelligence, UAI 2018 (2018)
35. Loshchilov, I., Hutter, F.: Decoupled weight decay regularization. arXiv preprint arXiv:1711.05101 (2017)
36. Yang, Z.L., Zhang, S.Y., Hu, Y.T., Hu, Z.W., Huang, Y.F.: VAE-Stega: linguistic steganography based on variational auto-encoder. IEEE Trans. Inf. Forensics Secur. 16, 880–895 (2020)

Syntax Tree Constrained Graph Network for Visual Question Answering

Xiangrui Su1, Qi Zhang2,3, Chongyang Shi1(B), Jiachang Liu1, and Liang Hu2,3

1 Beijing Institute of Technology, Beijing, China
cy [email protected]
2 Tongji University, Shanghai, China
3 DeepBlue Academy of Sciences, Shanghai, China

X. Su and Q. Zhang—The first two authors contribute equally to this work.

Abstract. Visual Question Answering (VQA) aims to automatically answer natural language questions related to given image content. Existing VQA methods integrate vision modeling and language understanding to explore the deep semantics of the question. However, these methods ignore the significant syntax information of the question, which plays a vital role in understanding its essential semantics and in guiding visual feature refinement. To fill the gap, we propose a novel Syntax Tree Constrained Graph Network (STCGN) for VQA based on entity message passing and syntax trees. The model extracts a syntax tree from each question to obtain more precise syntax information. Specifically, we parse questions and obtain the question syntax tree using the Stanford syntax parsing tool. Syntactic phrase features and question features are then extracted at the word level and phrase level using a hierarchical tree convolutional network. We further design a phrase-aware message-passing mechanism for visual entities that captures entity features according to a given visual context. Extensive experiments on the VQA2.0 dataset demonstrate the superiority of our proposed model.

Keywords: Visual question answering · Syntax tree · Message passing · Tree convolution · Graph neural network

1 Introduction

Visual Question Answering (VQA) aims to automatically answer natural language questions related to given image content. It requires both computer vision technology to understand the visual content of images and natural language processing technology to understand the deep semantics of questions. VQA has various potential applications, including image retrieval, image captioning, and visual dialogue systems, and has therefore become an important research area. Recently, various VQA methods [4,7,9,12] have been proposed to capture significant question semantics and visual features by mining explicit or implicit entity relationships. For example, BAN [9] considers the bilinear interaction


Fig. 1. Examples of a syntax tree and message passing respectively derived from the VQA2.0 dataset.

between two sets of input channels of images and questions by calculating a bilinear attention distribution to fuse visual and textual information. Murel et al. [4] design an atomic inference unit to enrich the interaction between question and image regions and optimize the vision-question interaction using a sequence composed of multiple atomic units. LCGN [7] designs a question-aware message-passing mechanism that uses question features to guide the refinement of entity features over a complete entity graph, integrating entity features with context information. However, these methods typically capture explicit or implicit entity relationships in images while ignoring the syntax relations between words, which contribute to capturing the deep semantics of the question. Intuitively, using a syntax tree in VQA tasks has two major benefits. First, questions are usually short, and adding syntactic information helps in understanding their semantics. Second, the syntax tree hierarchically organizes keywords and context words through a tree structure, which is effective for capturing the key information in the question. As shown in Fig. 1(a), the words "person", "left", and "woman", which are far apart in the original question, are adjacent in the syntax tree. These three words are the core information of the question; the syntax tree can therefore better capture key information during feature extraction. Besides, in VQA, images are the key information source for inferring answers, and understanding the information in images is a core objective of a VQA model. Since images are composed of many visual entities, there are many implicit relationships between these entities. Intuitively, it is necessary to perceive these implicit relationships and achieve effective message passing between entities in order to obtain scene-context-aware entity features. As shown in Fig. 1(b), in the process of answering the question, Entity 1 first transmits its own features to Entity 2 based on the phrase "next to the


students"; Entity 2 and Entity 4 then pass their features to Entity 3 based on the phrases "on the right side of the tent" and "with an orange front", and Entity 3 is accordingly able to integrate information from surrounding entities to answer the question more accurately. Obviously, phrase features can be used to guide entities in targeted message passing, so that the VQA system pays more attention to the regions most relevant to the question. In light of the above observations, we propose a Syntax Tree Constrained Graph Network (STCGN) that models the syntax tree and an entity message-passing mechanism. The main contributions of this paper are summarized as follows:
– We propose a novel VQA model, named STCGN, which is equipped with a tree hierarchical convolutional network that comprehensively learns syntax-aware question and phrase representations, and a phrase-aware entity message-passing network that captures context-aware entity representations of visual scenes.
– Extensive experimental results on the VQA2.0 dataset demonstrate the significant superiority of our STCGN and prove the validity of the phrase-aware message-passing network through visualization experiments.

2 Related Work

Recently, various methods have been proposed to improve the accuracy of visual question answering. The image is typically represented by fixed-size grid features extracted from a pre-trained model such as VGG [17] or AlexNet [22], and the question is typically embedded with an LSTM [6]. The two features are combined by addition or element-wise multiplication [2,21], but such basic fusion is too simple to capture the key parts of the image. Therefore, neural network-based methods [16] have been proposed to fuse the visual and question features. For example, Ren et al. [16] propose an LSTM-based fusion strategy that projects the extracted visual features into the same space as the word embeddings. However, not all regions of the picture are strongly related to the question, so the non-relevant grids should be filtered out. The attention mechanism [1,8,14,19] is used to calculate the importance of each grid region. For example, Bottom-Up and Top-Down attention (BUTD) [1] captures attention in a bottom-up and top-down manner: in the bottom-up stage, Faster R-CNN extracts the features of all objects and their salient regions; in the top-down stage, each attention candidate is weighted using task-specific context to obtain the final visual representation. Besides, bilinear models [3,5,14,20] show a strong ability for cross-modal feature interaction. For example, MFH [20] fully mines the correlations of multi-modal features through factorization and high-order pooling, effectively reducing irrelevant features and obtaining more effective fused multi-modal features. BGN [5] builds on the BAN [9] model by designing a bilinear graph structure to model the contextual relationships between text words and visual entities, fully mining their implicit relationships. However, these


methods do not pay enough attention to the entity relationships in the pictures or the relationships between the words in the questions. Extracting entity features can also be optimized through implicit or explicit relational modeling. To address the above problems, relation-based VQA methods, e.g., MuRel [4], LCGN [7], and ReGAT [12], and more effective attention mechanisms, e.g., UFSCAN [23] and MMMAN [11], have been proposed. MuRel, LCGN, and ReGAT refine visual representations by explicitly or implicitly modeling relationships and enhance feature interactions between modalities. UFSCAN and MMMAN improve attention scores over visual regions through more efficient attention mechanisms. Compared to these prior works, our model proposes a syntax-aware tree hierarchical convolutional network to extract syntax-relation-aware question and phrase representations from questions, and further proposes a phrase-aware message-passing network to capture scene-context-aware entity features based on implicit entity relations.

3 The STCGN Model

Given a question $q$ and a corresponding image $I$, the goal of VQA is to predict the answer $\hat{a} \in A$ that best matches the ground-truth answer $a^*$. Following previous VQA models, this can be defined as a classification task:

$\hat{a} = \arg\max_{a \in A} F_{\theta}(a \mid I, q)$,  (1)

where $F$ is the trained model and $\theta$ are the model parameters. For a given picture $I$, $K$ visual entities are extracted using Faster R-CNN, and the feature representation of the $i$-th entity is denoted $v_i$. Meanwhile, we have a picture-related question consisting of $N$ words, $Q = (q_1, q_2, ..., q_N)$. The features of each word are initialized using the GloVe [15] word embedding model, giving the word sequence features $X = (x_1, x_2, ..., x_N)$. The task of visual question answering is to use the image and text information, extract relevant features, and fuse them to generate an answer probability distribution vector $\hat{p} = (\hat{p}_1, \hat{p}_2, ..., \hat{p}_{N_{ans}})$, where $N_{ans}$ denotes the number of answer categories and $\hat{p}_i$ denotes the probability that the $i$-th answer is the final answer. We take the answer with the highest probability in $\hat{p}$ as the final predicted answer of the visual question answering system.

3.1 Network Architecture

The architecture of STCGN is shown in Fig. 2, which consists of three main modules: (1) Syntax-aware Tree Convolution module utilizes the syntax tree of the question and uses the tree hierarchical convolution model to extract the syntax-aware phrase features and question features. (2) Phrase-aware Entity Message Passing module discovers the implicit connections between visual entities and relevant message features based on the syntax-aware phrase features, question features, and entity complete graphs, and then builds the scene


context-aware visual entity features. (3) Top-down Attention-based Answer Prediction module fuses the question features and visual entity features by using Top-down attention and performs the final answer prediction.

Fig. 2. The network architecture of our STCGN model.
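For orientation, the composition of the three modules can be sketched as follows; the sub-module interfaces are assumptions made for this example and do not reflect a released implementation.

```python
import torch.nn as nn

class STCGNPipeline(nn.Module):
    """High-level sketch of the three-stage flow described above."""
    def __init__(self, tree_conv, message_passing, answer_head):
        super().__init__()
        self.tree_conv = tree_conv              # syntax-aware tree convolution
        self.message_passing = message_passing  # phrase-aware entity message passing
        self.answer_head = answer_head          # top-down attention + classifier

    def forward(self, syntax_tree, word_feats, entity_feats):
        phrase_feats, question_feat = self.tree_conv(syntax_tree, word_feats)
        entity_ctx = self.message_passing(entity_feats, phrase_feats, question_feat)
        return self.answer_head(entity_ctx, question_feat)   # answer score vector
```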

3.2 Syntax-Aware Tree Convolution

In VQA tasks, correctly understanding the essence of the question is the first priority for producing a good answer. Syntax information plays a very important role in the question text because it offers important help in understanding the question: it contains the dependencies between words and the part of speech (POS) of these words. We can therefore construct a syntax tree $T = (Q, E)$ based on the dependencies between words. As Fig. 1(b) shows, the phrases in the question play an important role in instructing visual entities during message passing. We therefore propose a tree-based hierarchical convolutional network to model syntax-aware phrase features. The network consists of two layers: word-level convolution and phrase-level convolution. For word-level convolution, we first construct a syntax subtree from each non-leaf word node and its direct children in the syntax tree. In this way, we decompose the syntax tree into a set of syntax subtrees $F = (f_1, f_2, ..., f_s)$, where each subtree $f_i = (q_i, q_{c_1}, q_{c_2}, ..., q_{c_n})$ serves as a convolution unit and $n$ denotes the number of children of word node $i$. Furthermore, we propose a convolution method based on text convolution [24]. First, we use the GloVe word embedding model to map $f_i$ to high-dimensional dense word features. Then, we obtain learnable POS feature vectors by randomly initializing a POS feature dictionary of length 42.


As a result, we obtain the sequence of POS features in the question, $T_i = (t_i, t_{c_1}, ..., t_{c_n})$. Finally, we concatenate the word features and POS features to obtain the word-level convolutional input features $X_i^{cat}$:

$X_i = \mathrm{Glove}(f_i), \quad T_i = \mathrm{RandomInit}(t_i, t_{c_1}, ..., t_{c_n}), \quad X_i^{cat} = [X_i \oplus T_i]$  (2)

We define a text convolution kernel $G$ for each syntax subtree. First, we convolve the concatenated input features to extract the key information in the text; then we use max pooling for further feature filtering:

$g_i = \max(\hat{g}_i), \quad \text{s.t.} \quad \hat{g}_i = \mathrm{ReLU}(G * [X_i^{cat} \oplus X_{c_1}^{cat} \oplus ... \oplus X_{c_n}^{cat}] + b_i)$  (3)



exp((Uxi ) · Vdir(i,j) gj + bdep(i,j) ) 

j∈Ni



exp((Uxi ) · Vdir(i,j) gj + bdep(i,j)

(5)

where $W_{\{\cdot\}} \in \mathbb{R}^{(d_h/M) \times (d_x + d_t)}$, $V_{\{\cdot\}} \in \mathbb{R}^{(d_h/M) \times (d_x + d_t)}$, and $U \in \mathbb{R}^{(d_h/M) \times (d_x + d_t)}$ are parameter matrices, $b_{\{\cdot\}}$ and $b'_{\{\cdot\}}$ are offset terms, $dir(i,j)$ denotes the direction of each relation, and $dep(i,j)$ denotes the kind of dependency. Then, to capture the sequential correlation between phrase features and to obtain the features of the whole sentence, we perform phrase sequence feature extraction using a bidirectional GRU (biGRU) network:

$h_t = \mathrm{biGRU}([h_{t-1}, h_i^*])$  (6)

where $W_z, W_r, W_h \in \mathbb{R}^{d_h \times (2 d_h)}$ are parameter matrices shared by the network and $\sigma$ denotes the sigmoid function. The final output is $H = (h_1, h_2, ..., h_s) \in \mathbb{R}^{s \times (2 d_h)}$ with $q \in \mathbb{R}^{2 d_h}$, where $h_i = [\overrightarrow{h}_i \oplus \overleftarrow{h}_i]$. $H$ represents the syntax-aware phrase feature sequence, and $q = h_s$ is the output of the last hidden layer, which incorporates the information of all iteration steps and represents the syntax-aware question features.
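For illustration, a minimal PyTorch sketch of the word-level step (Eqs. (2)–(3)) is shown below; the vocabulary size, POS-dictionary size, kernel width, and dimensions are assumptions, and the trainable embedding stands in for the fixed GloVe vectors.

```python
import torch
import torch.nn as nn

class SubtreeConvolution(nn.Module):
    """Each syntax subtree (a head word plus its direct children) is embedded as
    word + POS features, convolved with a text convolution kernel, and
    max-pooled into a single subtree feature g_i."""
    def __init__(self, vocab_size=10000, pos_size=42, word_dim=300, pos_dim=50, out_dim=256):
        super().__init__()
        self.word_emb = nn.Embedding(vocab_size, word_dim)   # stands in for GloVe vectors
        self.pos_emb = nn.Embedding(pos_size, pos_dim)       # learnable POS features
        self.conv = nn.Conv1d(word_dim + pos_dim, out_dim, kernel_size=3, padding=1)

    def forward(self, word_ids, pos_ids):
        # word_ids, pos_ids: (num_subtrees, subtree_len) word/POS indices per subtree
        x = torch.cat([self.word_emb(word_ids), self.pos_emb(pos_ids)], dim=-1)  # (S, T, d)
        x = self.conv(x.transpose(1, 2))             # text convolution over each subtree
        g = torch.relu(x).max(dim=-1).values         # max pooling -> (S, out_dim)
        return g

# The subtree features g_i are then refined by the relation-aware graph attention
# of Eqs. (4)-(5) and ordered into a sequence that the biGRU of Eq. (6) turns into
# phrase features H and the question feature q.
```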

3.3 Phrase-Aware Entity Message Passing

The inputs to the phrase-aware entity message passing module include the syntax-aware question feature q, the syntax-aware phrase feature H = (h1 , h2 , ..., hs ) and the original visual entity features V = (v1 , v2 , ..., vK ). We propose a phrase-aware multi-step instruction calculation method, which sends each word separately with question features into the instruction calculation network at each time step to calculate the contribution of each word to the question.


Then, we weight the words according to their contribution levels to obtain the instruction vector $c_t \in \mathbb{R}^{d_c}$ that guides the visual entities in message passing:

$c_t = \sum_{i=1}^{N} \alpha_{t,i} \cdot h_i, \quad \alpha_{t,i} = \mathrm{softmax}_i\big(W_1 h_i \odot W_2^{(t)} \mathrm{ReLU}(W_3 q)\big)$  (7)

where $W_1$ and $W_3$ are parameter matrices shared by all iteration steps, while $W_2^{(t)}$ is specific to time step $t$ so that different instructions are generated at different iteration steps. We use $w_{j,i}^{(t)}$ to denote the weight of the message delivered from entity $j$ to entity $i$ at time step $t$, and $m_{j,i}^{(t)}$ to denote the message received by the $i$-th entity from entity $j$ at time step $t$. At time step $t$, taking the $i$-th entity as an example, we compute the features of the messages delivered by adjacent entities to the $i$-th entity based on the instruction vector $c_t$:

$\tilde{v}_{i,t} = v_i \oplus v_{i,t-1}^{ctx} \oplus ((W_4 v_i) \odot (W_5 v_{i,t-1}^{ctx}))$  (8)

$w_{j,i}^{(t)} = \mathrm{softmax}\big((W_6 \tilde{v}_{i,t}) \odot (W_7 \tilde{v}_{j,t}) \odot (W_8 c_t)\big)$  (9)

$m_{j,i}^{(t)} = w_{j,i}^{(t)} \cdot ((W_9 \tilde{v}_{j,t}) \odot (W_{10} c_t))$  (10)

We then use a residual network for message aggregation so that the current visual entity recognizes the scene context $v_{i,t}^{ctx}$. Finally, we concatenate the original entity features $v_i$ with the scene context features $v_{i,T}^{ctx}$ to obtain the final scene-context-aware entity features $v_i^{out}$:

$v_i^{out} = W_{12}\big[v_i \oplus v_{i,T}^{ctx}\big], \quad v_{i,t}^{ctx} = W_{11}\Big[v_{i,t-1}^{ctx} \oplus \sum_{j=1}^{K} m_{j,i}^{(t)}\Big]$  (11)

Top-Down Attention-Based Answer Prediction

Given a picture I, we combine the set of visual entity features {vi }K i=1 with syntax-aware question features q and use a top-down attention mechanism for feature fusion. This mechanism first calculates the attention scores between each visual entity feature and the question feature as follows:

(12) βi = sof tmax W13 tanh(W14 viout + W15 q)) i

where W13 , W14 W15 denote the weight matrices. Then, we weigh each visual entity feature using a top-down attention mechanism and perform a nonlinear transformation of the joint features by a two-layer perceptron model. Then, we calculate the probability score for each answer using the sof tmax function:   N  out βi vi ⊕ q (13) p = W16 ReLU W17 i=1

where W16 , W17 are the weight matrices, and ReLU is the activation function.


3.5 Loss Function

Finally, we train our model by minimizing the cross-entropy loss function:

$L = -\frac{1}{N_{ans}} \sum_{i=1}^{N_{ans}} \big[ y_i \cdot \log(\hat{p}(y_i)) + (1 - y_i) \cdot \log(1 - \hat{p}(y_i)) \big]$  (14)

where $N_{ans}$ denotes the number of answer categories and $y_i$ is defined as $y_i = \min\big(\frac{\#\text{humans provided ans}}{3}, 1\big)$, in which $\#\text{humans provided ans}$ denotes the number of times the answer was selected during data annotation. $\hat{p}(y_i)$ denotes the probability that the output belongs to the $i$-th answer class.
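A minimal sketch of this soft-target objective is given below; applying a sigmoid to the scores of Eq. (13) is an assumption made so that the binary cross-entropy is well defined.

```python
import torch
import torch.nn.functional as F

def vqa_soft_target_loss(logits, soft_targets):
    """Binary cross-entropy against soft answer scores, where each target
    y_i = min(#annotators who gave answer i / 3, 1).
    logits: unnormalized answer scores of shape (num_answers,)."""
    probs = torch.sigmoid(logits)   # assumed squashing of the raw answer scores
    return F.binary_cross_entropy(probs, soft_targets, reduction='mean')

# Example: an answer chosen by 2 of the annotators receives the soft score 2/3.
```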

4 Experiments

4.1 Experimental Settings

Datasets: We adopt the MSCOCO-based VQA2.0 dataset [13], with more than 80k images and 444k questions for training, 40k images and 214k questions for testing, and 80k images and 448k questions for validation. Baselines: We select the following 11 state-of-the-art methods as baselines to evaluate the performance of our STCGN: LSTM-VGG [2], SAN [18], DMN [10], MUTAN [3], BUTD [1], BAN [9], LCGN [7], MuRel [4], ReGAT [12], UFSCAN [23], and MMMAN [11]. Refer to Appendix A for more details about the datasets, baselines, and settings.

4.2 Performance Comparison

Table 1 shows the overall performance of all compared methods, with the best results highlighted in boldface. We draw the following conclusions: 1) LSTM+VGG adopts the classic VQA framework with a single model module and uses only a simple vector outer product for feature fusion; its modal interaction is too simple, so its final performance is inferior to the other models. 2) SAN and DMN adopt the typical NMN architecture, decomposing the VQA task into a sequence of independent neural networks executing subtasks. SAN achieves multi-step querying by stacking multiple attention modules and significantly outperforms the LSTM+VGG network. DMN is more modular, using a dynamic memory network module to provide episodic memory, with all modules cooperating to complete the question-answering task; it answers all three types of questions better than the previous two models.


Table 1. Performance comparison on the VQA2.0 dataset.

Method        | Test-std: Overall  Y/N    Num    Other | Test-dev: Overall  Y/N    Num    Other
LSTM+VGG      | 58.2   80.41  37.12  44.21 | 57.13  80.52  36.74  43.18
SAN           | 58.9   80.56  36.12  46.73 | 58.70  79.3   36.6   46.10
DMN           | 61.28  81.64  37.81  48.94 | 60.30  80.50  36.8   48.3
MUTAN         | 63.58  80.35  40.68  53.71 | 62.70  79.74  40.86  53.13
BUTD          | 65.42  82.13  44.25  56.96 | 64.37  81.12  43.29  56.87
BAN           | 70.31  86.21  50.57  60.51 | 69.11  85.24  50.03  60.32
BAN + Counter | 70.89  86.44  55.12  60.67 | 70.02  85.44  54.21  60.25
MuRel         | 68.41  84.22  50.31  58.49 | 68.03  84.77  49.84  57.75
LCGN          | 69.44  85.23  49.78  59.37 | 68.49  84.81  49.02  58.47
ReGAT         | 70.54  86.34  54.14  60.76 | 70.12  86.02  53.12  60.31
UFSCAN        | 70.09  85.51  61.22  51.46 | 69.83  85.21  50.98  60.14
MMMAN         | 70.28  86.69  51.83  60.22 | 70.03  86.32  52.21  60.16
STCGN         | 70.99  85.77  50.46  60.79 | 70.21  85.49  50.11  60.40

3) MUTAN, BUTD, BAN, and BAN+Counter are typical VQA models based on bilinear pooling and attention mechanisms. Compared with the classic framework and the NMN architectures, these four models perform more fine-grained feature extraction and fusion, significantly improving performance. In particular, the bilinear attention mechanism of BAN fully exploits the implicit interaction between the question and the picture. BAN+Counter further integrates Counter's counting module on top of BAN, effectively improving performance on counting questions and overall. 4) MuRel, LCGN, and ReGAT outperform the other comparison models; they focus on extracting entity-relationship information in images, which plays a key role in joint feature learning. LCGN constructs a complete entity graph and implements implicit message passing between entities under question guidance, while MuRel models the implicit relationships between detailed image regions through cross-modal attention, gradually refining the interaction between the picture and the question. ReGAT explicitly extracts entity relationships in images and uses relation-aware graph attention to learn the joint features accurately, so its performance is better than that of the other three models. 5) UFSCAN and MMMAN use more effective attention mechanisms. UFSCAN adopts a feature-wise attention mechanism and obtains better results on counting questions by suppressing irrelevant features and emphasizing informative ones. MMMAN proposes a multi-level mesh mutual attention, fully exploring the information interaction between the visual and language modalities and improving accuracy on Y/N questions.


6) Finally, all of these methods are less accurate than our proposed method. We attribute this to two points: (1) STCGN models the question with a syntax tree, introduces syntactic structure features, and designs a tree hierarchical convolutional network that convolves over the syntax tree, fully extracting the grammatical information of the question and improving performance. (2) STCGN introduces a phrase-aware message-passing module, which uses the different phrases in the question representation to guide message passing between entities over multiple steps and extracts scene context-aware entity features, further improving model performance.

4.3 Ablation Study

Figure 3 shows the performance of the STCGN variants. The results show that removing any module significantly degrades the performance of STCGN on the Test-dev and Test-std subsets of VQA2.0. Removing the SHA module has the greatest impact, indicating that feature fusion affects visual question answering accuracy the most. The TCN module has a larger influence on performance than the MPN module, which may be because the tree convolution module operates on the syntax tree and plays an important role in extracting question features and guiding message passing between entities.

4.4 Parameter Sensitivity

In this section, we analyze the effect of the number of iteration steps T. We fix the other parameters at their optimal values, gradually increase T, and plot the answering accuracy for each question type as T varies, as shown in Fig. 4. As T increases, more rounds of message passing between entities incorporate more scene context information, so the performance on all question types improves progressively and peaks at T = 4. As T continues to increase, performance gradually decreases, because further message passing introduces redundant information into the entity representations and reduces their accuracy. The outliers arise because binary questions place lower demands on understanding the question and the image than other question types, so the model can perform better on binary questions even when its overall performance is inferior. Therefore, we choose T = 4 as the total number of message-passing iteration steps.

Fig. 3. The overall accuracy of STCGN Variants.

Fig. 4. Parameter sensitivity of the iteration steps T.

4.5 Attention Visualization

To better illustrate the effectiveness of the phrase-based message-passing mechanism, we present visualization experiments in this section. We visualize the attention scores between entities and the different words of the question across multiple iteration steps, as shown in Fig. 5. From the attention maps we observe: (1) Entity 2, Entity 4, and Entity 10 have significantly higher attention weights for the phrases "the man", "in orange shoes", and "the other players" than the other entity-word attention blocks. This suggests that how important an entity is for answering a question is closely related to multiple phrases. The syntax-aware phrase features in the message-passing module provide guidance so that the VQA system gradually focuses on the entities that contribute more to the task. (2) The initial attention map can only roughly locate the important entities 2, 4, and 10, and their attention weights are not high; the map is noisy and the contributions of different entities to the answer are barely distinguishable. As the number of message-passing iteration steps increases, the attention map becomes clearer and the attention weight of the key entity increases from 0.03 to 0.35. This results from non-critical entities passing information to critical entities, which also yields scene context-aware entity representations.


Fig. 5. Visualization of attention score.

5 Conclusion

In this work, we propose the Syntax Tree Constrained Graph Network. We design a hierarchical tree convolutional network that combines text convolution with graph attention to extract syntax-aware phrase and question representations from the syntactic tree structure. We also propose a phrase-aware entity message-passing mechanism motivated by our observations of the dataset: over multiple iteration steps, different instruction vectors are computed from the phrase and question features to capture scene context-aware entity features.

A Experimental Settings

A.1 Datasets

The dataset consists of real images from MSCOCO [13] and uses the same split into training, validation, and test sets: more than 80k images and 444k questions for training, 40k images and 214k questions for validation, and 80k images and 448k questions for testing. Each image has at least three questions on average. The questions fall into three categories according to the type of answer: yes/no, number, and other. For each question, 10 answers were collected from human annotators, and the most frequent answer is taken as the correct one. The dataset contains both open-ended and multiple-choice questions. In this paper, we focus on open-ended questions and take the answers that appear more than 9 times in the training set as candidate answers, which yields 3129 candidates.
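The candidate-answer vocabulary described above can be built with a simple frequency filter. The sketch below is illustrative only; the function and variable names (e.g. `build_answer_vocab`, `train_answers`) are ours and not from the paper, and we assume the training annotations are available as a flat list of answer strings.

```python
from collections import Counter

def build_answer_vocab(train_answers, min_count=9):
    """Keep answers appearing more than `min_count` times as candidates."""
    counts = Counter(train_answers)
    # Answers occurring more than 9 times in the training split form the
    # candidate set (3129 answers for VQA2.0 according to the paper).
    candidates = [ans for ans, c in counts.items() if c > min_count]
    return {ans: i for i, ans in enumerate(sorted(candidates))}

# Example usage with toy data:
train_answers = ["yes", "no", "2", "yes"] * 10
vocab = build_answer_vocab(train_answers)
print(len(vocab), vocab)
```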

A.2 Baselines

We use the following state-of-the-art methods as baselines to evaluate the performance of our STCGN:
(1) LSTM-VGG [2] uses LSTM and VGG to extract text and image features respectively, and fuses them through an element-wise cross product.
(2) SAN [18] designs a stacked attention mechanism that computes the most relevant image regions for the question step by step to infer the final answer.
(3) DMN [10] constructs four network modules (input, question, episodic memory, and answer); the cyclic operation of these modules produces an iterative attention process.
(4) MUTAN [3] proposes a tensor-based Tucker decomposition that uses low-rank matrix factorization to reduce the large number of parameters of traditional bilinear models.
(5) BUTD [1] combines bottom-up and top-down attention; the top-down attention computes the correlation between the question and each region to obtain more accurate, question-relevant image features.
(6) BAN [9] considers the bilinear interaction between the two input channels and fuses visual and textual information by computing a bilinear attention distribution.
(7) LCGN [7] designs a question-aware message-passing mechanism that uses question features to guide the refinement of entity features over multiple iteration steps, integrating entity features with context information.
(8) MuRel [4] enriches the interaction between question and image regions with vector representations and a sequence of MuRel units.
(9) ReGAT [12] extracts explicit and implicit relations in each image, constructs the corresponding relation graphs, and uses question-aware graph attention to integrate information from the different relation spaces.
(10) UFSCAN [23] proposes a multimodal feature-wise attention mechanism and models feature-wise co-attention and spatial co-attention between the image and question modalities simultaneously.
(11) MMMAN [11] proposes a multi-level mesh mutual attention model that exploits low- and high-dimensional question information at different levels, providing more feature information for modal interactions.

A.3 Settings

In the experiments, Adamax is selected as the optimizer with an initial learning rate of 0.001. As training proceeds, the learning rate is gradually increased from 0.001 to 0.004 and starts to decay once the model accuracy peaks; the decay rate is set to 0.5 and the model is trained for 30 epochs. For the text, the word embedding dimension of the question is 300 and the part-of-speech embedding dimension is 128. In the tree hierarchical convolution module, the word-level convolution uses a kernel of size 3×428 with the maximum syntax subtree length set to 4, and the phrase-level convolution uses multi-head attention with 8 heads. To extract question features, we use a bidirectional GRU network with a hidden-layer dimension of 1024. For image features, the dimension is set to 2048, each image contains 10–100 visual entity regions, the total number of message-passing steps T is set to 4, and the dimension of the scene context-aware entity features is set to 1024. In our experiments, the VQA score provided by the official VQA competition is used as the evaluation metric:

$\mathrm{acc}(ans) = \min\left(\frac{\#\text{humans that provided } ans}{3},\ 1\right)$   (15)
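As a concrete illustration of Eq. (15), a direct implementation of the official VQA accuracy is sketched below. The function name `vqa_accuracy` and its arguments are our own; the paper only defines the formula.

```python
def vqa_accuracy(predicted_answer, human_answers):
    """VQA score of Eq. (15): min(#annotators giving this answer / 3, 1)."""
    matches = sum(1 for a in human_answers if a == predicted_answer)
    return min(matches / 3.0, 1.0)

# A prediction supported by at least 3 of the 10 annotators receives full credit.
print(vqa_accuracy("red", ["red", "red", "red", "blue"] + ["red"] * 6))   # 1.0
print(vqa_accuracy("blue", ["red", "red", "red", "blue"] + ["red"] * 6))  # ~0.33
```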

References

1. Anderson, P., et al.: Bottom-up and top-down attention for image captioning and visual question answering. In: CVPR, pp. 6077–6086 (2018)
2. Antol, S., et al.: VQA: visual question answering. In: ICCV, pp. 2425–2433 (2015)
3. Ben-Younes, H., Cadène, R., Cord, M., Thome, N.: MUTAN: multimodal tucker fusion for visual question answering. In: ICCV, pp. 2631–2639 (2017)
4. Cadène, R., Ben-Younes, H., Cord, M., Thome, N.: MUREL: multimodal relational reasoning for visual question answering. In: CVPR, pp. 1989–1998 (2019)
5. Guo, D., Xu, C., Tao, D.: Bilinear graph networks for visual question answering. IEEE Trans. Neural Netw. Learn. Syst. 34, 1023–1034 (2021)
6. Hochreiter, S., Schmidhuber, J.: Long short-term memory. Neural Comput. 9(8), 1735–1780 (1997)
7. Hu, R., Rohrbach, A., Darrell, T., Saenko, K.: Language-conditioned graph networks for relational reasoning. In: ICCV, pp. 10293–10302 (2019)
8. Ilievski, I., Feng, J.: Generative attention model with adversarial self-learning for visual question answering. In: Thematic Workshops of ACM Multimedia 2017, pp. 415–423 (2017)
9. Kim, J., Jun, J., Zhang, B.: Bilinear attention networks. In: NeurIPS, pp. 1571–1581 (2018)
10. Kumar, A., et al.: Ask me anything: dynamic memory networks for natural language processing. In: ICML, vol. 48, pp. 1378–1387 (2016)
11. Lei, Z., Zhang, G., Wu, L., Zhang, K., Liang, R.: A multi-level mesh mutual attention model for visual question answering. Data Sci. Eng. 7(4), 339–353 (2022)
12. Li, L., Gan, Z., Cheng, Y., Liu, J.: Relation-aware graph attention network for visual question answering. In: ICCV, pp. 10312–10321 (2019)
13. Lin, T.-Y., et al.: Microsoft COCO: common objects in context. In: Fleet, D., Pajdla, T., Schiele, B., Tuytelaars, T. (eds.) ECCV 2014. LNCS, vol. 8693, pp. 740–755. Springer, Cham (2014). https://doi.org/10.1007/978-3-319-10602-1_48
14. Nguyen, D.K., Okatani, T.: Improved fusion of visual and language representations by dense symmetric co-attention for visual question answering. In: CVPR, pp. 6087–6096 (2018)
15. Pennington, J., Socher, R., Manning, C.D.: GloVe: global vectors for word representation. In: EMNLP, pp. 1532–1543 (2014)
16. Ren, M., Kiros, R., Zemel, R.: Exploring models and data for image question answering. In: NIPS, vol. 28 (2015)
17. Simonyan, K., Zisserman, A.: Very deep convolutional networks for large-scale image recognition. In: ICLR (2015)
18. Yang, Z., He, X., Gao, J., Deng, L., Smola, A.J.: Stacked attention networks for image question answering. In: CVPR, pp. 21–29 (2016)
19. Ye, T., Hu, L., Zhang, Q., Lai, Z.Y., Naseem, U., Liu, D.D.: Show me the best outfit for a certain scene: a scene-aware fashion recommender system. In: WWW, pp. 1172–1180 (2023)
20. Yu, Z., Yu, J., Xiang, C., Fan, J., Tao, D.: Beyond bilinear: generalized multimodal factorized high-order pooling for visual question answering. IEEE Trans. Neural Netw. Learn. Syst. 29(12), 5947–5959 (2018)
21. Zhang, Q., Cao, L., Shi, C., Niu, Z.: Neural time-aware sequential recommendation by jointly modeling preference dynamics and explicit feature couplings. IEEE Trans. Neural Netw. Learn. Syst. 33(10), 5125–5137 (2022)
22. Zhang, Q., Hu, L., Cao, L., Shi, C., Wang, S., Liu, D.D.: A probabilistic code balance constraint with compactness and informativeness enhancement for deep supervised hashing. In: IJCAI, pp. 1651–1657 (2022)
23. Zhang, S., Chen, M., Chen, J., Zou, F., Li, Y., Lu, P.: Multimodal feature-wise co-attention method for visual question answering. Inf. Fusion 73, 1–10 (2021)
24. Zhang, Y., Wallace, B.: A sensitivity analysis of (and practitioners' guide to) convolutional neural networks for sentence classification. arXiv preprint arXiv:1510.03820 (2015)

CKR-Calibrator: Convolution Kernel Robustness Evaluation and Calibration

Yijun Bei1, Jinsong Geng1, Erteng Liu1, Kewei Gao1, Wenqi Huang2, and Zunlei Feng1,3(B)

1 Zhejiang University, Hangzhou, China
{beiyj,22251346,let,gaokw,zunleifeng}@zju.edu.cn
2 Digital Grid Research Institute, China Southern Power Grid, Guangzhou 510670, China
[email protected]
3 Zhejiang University - China Southern Power Grid Joint Research Centre on AI, Hangzhou 310058, China

Abstract. Recently, Convolution Neural Networks (CNN) have achieved excellent performance in some areas of computer vision, including face recognition, character recognition, and autonomous driving. However, many CNN-based models still cannot be deployed in real-world scenarios due to poor robustness. In this paper, focusing on the classification task, we attempt to evaluate and optimize the robustness of CNN-based models from a new perspective: the convolution kernel. Inspired by the discovery that the root cause of model decision errors lies in the wrong responses of convolution kernels, we propose a convolution kernel robustness evaluation metric based on the distribution of convolution kernel responses. We then devise the Convolution Kernel Robustness Calibrator, termed CKR-Calibrator, to optimize key but not robust convolution kernels. Extensive experiments demonstrate that CKR-Calibrator improves the accuracy of existing CNN classifiers by 1%–4% on clean datasets and 1%–5% on corrupted datasets, and improves accuracy by about 2% over SOTA methods. The evaluation and calibration source code is open-sourced at https://github.com/cym-heu/CKR-Calibrator.

Keywords: Robustness · Convolution kernel · Evaluation · Calibration

1 Introduction

With the rise of computing power, Convolution Neural Networks (CNN) have become increasingly effective in various computer vision domains, such as facial recognition [19], autonomous driving [13], healthcare [23], etc. Image classification is a commonly researched topic in computer vision. CNN classifiers have made significant strides in improving classification results.


Fig. 1. The comparison between existing methods and the proposed method. The former evaluate and optimize model robustness from the perspectives of the input data and the output probabilities, while the proposed CKR-Calibrator evaluates and calibrates the robustness of the convolution kernels from a new perspective.

However, numerous CNN models still have insufficient robustness, making their application in real-world scenarios challenging. Model robustness refers to the ability of a machine learning model to maintain its performance and make reliable predictions when faced with various challenges or perturbations in the input data. Usually, the performance of deep learning-based models depends heavily on the quality of the training dataset; insufficient data, unbalanced categories, or domain differences often result in poor robustness on specific tasks. Some works evaluate model robustness with a focus on adversarial attacks and Out-Of-Distribution (OOD) transforms. For adversarial attacks [6,22,30], the robustness metric is based on performance on adversarial samples, which are generated in a single or iterative manner during training. For OOD transforms [8,24], performance on corrupted datasets is used as the robustness metric. Another line of work [4,34] focuses on measuring model decision-making. In this paper, unlike the above methods, we evaluate and optimize model robustness from the perspective of the most basic component of the model: the convolution kernel, since the root cause of a model's decision failure is the wrong response of some of its convolution kernels [5,11]. As shown in Fig. 1, compared to other methods, our approach directly measures the robustness of each convolution kernel and optimizes it, which can fundamentally improve the robustness of the model. Our inspiration comes from a simple observation: convolution kernels exhibit similar responses for correctly predicted samples within the same category, but significantly different responses for incorrectly predicted samples. Therefore, in the robustness evaluation stage, convolution kernels are classified as robust or non-robust based on the variance of their responses. In the robustness optimization stage, a gradient aggregation technique [5] is adopted to find the convolution kernels that play a key role in decision making. To improve the robustness of kernels that are important but not robust, the triplet loss [26] is used to pull the output responses of samples of the same category closer together.


In summary, the main contributions are given as follows:
– A new viewpoint on CNN model evaluation and optimization is explored from the perspective of convolution kernel responses.
– A pluggable convolution kernel robustness evaluation and calibration tool for CNN models, called CKR-Calibrator, is proposed.
– Experimental results show that the proposed method improves the accuracy of mainstream CNN classifiers by 1%–4% on clean datasets and 1%–5% on corrupted datasets, and improves accuracy by about 2% over the SOTA method when combined with data augmentation methods.

2 Related Work

In this section, we briefly review the most closely related techniques: model robustness evaluation, robustness optimization, and deep feature attribution.

2.1 Robustness of Deep Learning Based Models

Most existing methods evaluate model robustness based on its performance on imperfect datasets, which can be divided into adversarial sample datasets and OOD datasets. The concept of adversarial samples was first proposed in [30], which demonstrates that a model can be made to misclassify by adding small perturbations to the original data. FGSM [6] generates adversarial samples by adding perturbations along the gradient direction during training. Madry et al. [22] employ multi-step gradient updates (the PGD attack) but suffer from long training times. OOD datasets such as ImageNet-C [8], Cifar10-C [8], Cifar100-C [8], and MNIST-C [24] are proposed to assess a model's ability to deal with natural corruptions. For the OOD transform task, methods like Cutout [3], Mixup [38], CutMix [35], AutoAug [32], and AugMix [9] augment the training data by cropping and blending. In addition, some researchers measure model robustness from the perspective of decision making. Ding et al. [4] use the margin, defined as the distance from the input to the classifier's decision boundary, to measure robustness. Yang et al. [34] introduce the notion of classifier boundary thickness, demonstrate that robust models possess thick decision boundaries, and enhance boundary thickness through noise-augmented mixup training. Unlike the above methods, our approach performs robustness evaluation and optimization on the basic component of the model, the convolution kernel.

2.2 Deep Feature Attribution

Existing deep feature attribution techniques can be categorized into forward propagation-based, back propagation-based, and activation-based methods. Forward propagation-based methods [21,37,40] perturb the inputs or feature maps, feed them into the original model for forward propagation, and compare the new outputs with the original outputs to find the regions of the sample to which the model is sensitive. Back propagation-based methods [1,17,27] propagate the saliency signal from the output neuron back through each layer to the input neurons, and thus compute the contribution of each neuron to the output. Activation-based methods [2,25,31,39] compute a set of weights on the activation maps produced by intermediate layers and sum them to visualize the important features; these methods can be further classified as grad-based or grad-free depending on whether the weights are computed from gradients.

3 CKR-Calibrator

In this section, we analyze the convolution kernel responses for samples of the same category with different confidence levels. Inspired by the discovery that the root cause of model decision errors lies in the wrong responses of convolution kernels, a pluggable convolution kernel robustness evaluation and calibration tool (CKR-Calibrator) is proposed. The preliminary analysis and the CKR-Calibrator are elaborated as follows.

3.1 Preliminary Analysis

For an input image I, the feature maps of the (r−1)-th layer are denoted as $\{m_{r-1}^1, m_{r-1}^2, m_{r-1}^3, \ldots, m_{r-1}^P\}$ and the convolution kernels of the r-th layer are denoted as $\{c_r^1, c_r^2, c_r^3, \ldots, c_r^K\}$. The feature maps of the r-th layer are calculated as follows:

$m_r^k = f(c_r^k \otimes \{m_{r-1}^1, m_{r-1}^2, m_{r-1}^3, \ldots, m_{r-1}^P\})$,   (1)

where $\otimes$ denotes the convolution operation and f denotes the subsequent operations, such as batch normalization, pooling, and the activation function. The convolution kernel output response is defined as the sum of the values of the corresponding feature map after filtering out negative values. The output response $a_r^k$ of the r-th layer is calculated as follows:

$a_r^k = \sum \mathrm{ReLU}(m_r^k)$,   (2)

where $\sum$ denotes summing the values of the matrix. As shown in Fig. 2(a), we count the convolution kernel responses of 10 same-class samples with confidence higher than 0.9 in the last convolution layer of SqueezeNet [14] on the Cifar10 dataset. We find that high-confidence samples of the same category have similar convolution kernel responses in the same layer. In contrast, as shown in Fig. 2(b), we count the convolution kernel responses for samples with confidence lower than 0.1, i.e., samples that the model incorrectly assigns to other classes, with all other settings unchanged. We find that the convolution kernel responses of low-confidence samples of the same category differ from one another.
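A minimal sketch of how the kernel responses of Eq. (2) could be collected with a forward hook is shown below. The toy model, hook name, and layer choice are our own illustrative assumptions; the paper does not prescribe an implementation.

```python
import torch
import torch.nn as nn

def kernel_responses(feature_map):
    # Eq. (2): a_r^k = sum of ReLU(m_r^k) over spatial positions,
    # one response per convolution kernel (channel) and per sample.
    return torch.relu(feature_map).sum(dim=(2, 3))  # shape: (batch, K)

# Toy conv layer standing in for any CNN classifier (the paper uses e.g. SqueezeNet).
conv = nn.Conv2d(3, 8, kernel_size=3, padding=1)
responses = {}
conv.register_forward_hook(
    lambda module, inp, out: responses.update(last=kernel_responses(out.detach()))
)

with torch.no_grad():
    conv(torch.randn(10, 3, 32, 32))   # 10 samples of one (toy) class
print(responses["last"].shape)          # torch.Size([10, 8])
```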


In conclusion, samples of the same category that are correctly classified by the model have similar convolution kernel responses, whereas misclassified samples have convolution kernel responses that differ from those of correctly classified samples. It can therefore be inferred that the root cause of model decision errors lies in the wrong responses of the convolution kernels.

Fig. 2. The convolution kernel responses of 10 high-confidence samples (a) and 10 low-confidence samples (b) of the same category in the last convolution layer.

3.2 Convolution Kernel Robustness Evaluation

Inspired by the above discovery, we attempt to evaluate the robustness of CNN-based models by measuring the responses of the convolution kernels. Specifically, for a particular convolution kernel $c_r^k$ in the r-th layer, a response set $\{a_{r,1}^k, a_{r,2}^k, \ldots, a_{r,n}^k, \ldots, a_{r,N}^k\}$ over all samples of a specific class is computed according to Eq. (2). The robustness score $s_r^k$ of the convolution kernel $c_r^k$ for this class is calculated as follows:

$s_r^k = \sum_{n=1}^{N} (a_{r,n}^k - \bar{a}_r^k)^2, \quad \bar{a}_r^k = \frac{1}{N}\sum_{n=1}^{N} a_{r,n}^k$,   (3)

where $\bar{a}_r^k$ denotes the mean of the response set. In addition, a normalization operation is applied within the r-th layer; the normalized robustness score $h_r^k$ is calculated as follows:

$h_r^k = \frac{s_r^k - \min(S_r)}{\max(S_r) - \min(S_r) + \varepsilon}, \quad S_r = \{s_r^k\}_{k=1}^{K}$,   (4)

where $\varepsilon$ is a small constant that avoids division by zero and $S_r$ denotes the robustness score set of the r-th layer. For all convolution kernels in the r-th layer, a threshold $\alpha$ is adopted to distinguish robust from non-robust convolution kernels. The indicator $b_r^k$ for convolution kernel $c_r^k$ is then calculated as follows:

$b_r^k = \begin{cases} 0 & h_r^k \geq \alpha \\ 1 & h_r^k < \alpha \end{cases}$,   (5)


where $\alpha > 0$; $b_r^k = 1$ denotes that convolution kernel $c_r^k$ is robust, and $b_r^k = 0$ denotes that it is not robust.
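A compact sketch of the robustness evaluation of Eqs. (3)–(5) is given below, operating on a per-class response matrix collected as in the previous snippet. The array names and the helper `robust_kernel_mask` are our own, and the default α = 0.7 follows the parameter setting reported later in the paper.

```python
import numpy as np

def robust_kernel_mask(responses, alpha=0.7, eps=1e-8):
    """responses: (N samples, K kernels) responses of one layer for one class.

    Returns b_r (Eq. 5): 1 for robust kernels, 0 for non-robust ones.
    """
    s = ((responses - responses.mean(axis=0)) ** 2).sum(axis=0)     # Eq. (3)
    h = (s - s.min()) / (s.max() - s.min() + eps)                   # Eq. (4)
    return (h < alpha).astype(np.int64)                             # Eq. (5)

# Toy example: 16 samples, 8 kernels; kernel 3 responds very inconsistently.
rng = np.random.default_rng(0)
A = rng.random((16, 8))
A[:, 3] *= 10
print(robust_kernel_mask(A))
```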

3.3 Convolution Kernel Robustness Calibration

Some studies have shown that model decisions depend on a subset of key convolution kernels. Combined with the findings of the preliminary analysis, it can be inferred that the reason for the poor robustness of a deep model on some categories is that some of the key convolution kernels are not robust enough for those categories. For the important but not robust kernels, the triplet loss [26] is adopted to constrain them to respond similarly to robust kernels, which aims to improve their robustness.

Firstly, a gradient aggregation-based contribution method [5] is adopted to find the key convolution kernels; it computes the aggregated first-order derivatives of the target category w.r.t. the feature map. The contribution value $g_r^k$ of convolution kernel $c_r^k$ to the predicted probability $\hat{y}$ is calculated as follows:

$g_r^k = \sum \left| \frac{\partial \hat{y}}{\partial m_r^k} \right|$,   (6)

where $\sum$ denotes summing the values of the first-order derivative matrix. For T samples of a specific category with confidence higher than 0.9, a contribution set $\{g_{r,1}^k, g_{r,2}^k, g_{r,3}^k, \ldots, g_{r,T}^k\}$ of convolution kernel $c_r^k$ can be obtained according to Eq. (6). Then, a normalization operation is applied within the r-th layer; the normalized contribution score $u_r^k$ is calculated as follows:

$u_r^k = \frac{g_r^k - \min(G_r)}{\max(G_r) - \min(G_r) + \varepsilon}, \quad G_r = \{g_r^k\}_{k=1}^{K}, \quad g_r^k = \sum_{t=1}^{T} g_{r,t}^k$,   (7)

where $\varepsilon$ is a small constant that avoids division by zero and $G_r$ denotes the contribution set of the r-th layer. A threshold $\beta$ is used to distinguish important from unimportant convolution kernels; the indicator $d_r^k$ is calculated as follows:

$d_r^k = \begin{cases} 0 & u_r^k \geq \beta \\ 1 & u_r^k < \beta \end{cases}$,   (8)

where $\beta > 0$; $d_r^k = 1$ denotes that convolution kernel $c_r^k$ is important, and $d_r^k = 0$ denotes that it is not important.

Secondly, the critical but not robust convolution kernels are filtered out according to Eq. (5) and Eq. (8); the indicator $e_r^k$ is calculated as follows:

$e_r^k = \begin{cases} 0 & b_r^k = 1 \text{ or } d_r^k = 0 \\ 1 & b_r^k = 0 \text{ and } d_r^k = 1 \end{cases}$,   (9)

where $e_r^k = 1$ denotes that convolution kernel $c_r^k$ is important but not robust, and $e_r^k = 0$ denotes the other cases.
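The sketch below illustrates Eqs. (6)–(9) for a single layer, using autograd to aggregate the first-order derivatives. The function and variable names are ours, β = 0.5 follows the later parameter setting, and the case directions mirror the equations as printed; this is a sketch, not the released implementation.

```python
import torch

def kernel_contributions(feature_map, predicted_prob):
    # Eq. (6): g_r^k = sum |d y_hat / d m_r^k| over the feature-map entries.
    grads = torch.autograd.grad(predicted_prob, feature_map, retain_graph=True)[0]
    return grads.abs().sum(dim=(2, 3))            # shape: (batch, K)

def important_not_robust(g_per_sample, b, beta=0.5, eps=1e-8):
    g = g_per_sample.sum(dim=0)                   # Eq. (7): aggregate over T samples
    u = (g - g.min()) / (g.max() - g.min() + eps) # Eq. (7): layer-wise normalization
    d = (u < beta).long()                         # Eq. (8)
    return ((b == 0) & (d == 1)).long()           # Eq. (9): important but not robust

# Toy usage: a conv layer whose output participates in Eq. (6).
conv = torch.nn.Conv2d(3, 8, 3, padding=1)
fmap = conv(torch.randn(4, 3, 16, 16))
y_hat = fmap.mean()                               # stand-in for the class probability
g = kernel_contributions(fmap, y_hat)
b = torch.randint(0, 2, (8,))                     # robustness indicators from Eq. (5)
print(important_not_robust(g, b))
```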


Thirdly, for the important but not robust kernels, the triplet loss [26] is adopted to improve their robustness by constraining their responses to be similar to those of robust kernels. When a batch of samples is fed to the model, for one of them $x_o$, the list $\tilde{A}_r^o$ of its important-but-not-robust convolution kernel responses in the r-th layer is calculated as follows:

$\tilde{A}_r^o = f_C(A_r^o), \quad A_r^o = [a_{r,o}^k]_{k=1}^{K}, \quad C_r^o = [c_{r,o}^k]_{k=1}^{K}$,   (10)

where $A_r^o$ denotes all convolution kernel responses in the r-th layer and $f_C$ denotes the filtering operation with $C_r^o$ as the condition. For the anchor sample $x_o$, in the space of convolution kernel responses, the same-class sample with the farthest distance and the different-class sample with the closest distance are denoted as $x_p$ and $x_n$, and their lists of important-but-not-robust convolution kernel responses as $\tilde{A}_r^p$ and $\tilde{A}_r^n$, respectively. The triplet loss $L_{trip}$ for the convolution kernels of the r-th layer is then calculated as follows:

$L_{trip}^{r,o} = \|\tilde{A}_r^p - \tilde{A}_r^o\|^2 - \|\tilde{A}_r^n - \tilde{A}_r^o\|^2 + \gamma$,   (11)

where $\gamma$ is a margin enforced between positive and negative pairs. The triplet losses of all samples in the current batch and all selected layers are summed and added to the original loss $L_{ori}$. The total loss $L_{all}$ of the current batch is calculated as follows:

$L_{all} = L_{ori} + \sum_{r \in R} \sum_{o=1}^{w} L_{trip}^{r,o}$,   (12)

where w denotes the batch size and R denotes the set of layers.

Experiments

In the experiments, we verify the proposed CKR-Calibrator with nine mainstream CNN classifers (ResNet50 [7], GoogleNet [29], WideResNet [36], Vgg16 [28], AlexNet [16], Squeezenet [14], DenseNet [12], ResNeXt50 [33], MobileNet [10]) on four clean benchmark datasets (Cifar10 [15], Cifar100 [15], MNIST [20], Tiny-Imagenet [18]) and four corrupted benchmark dataset (MNIST-C [24], Cifar10-C [8], Cifar100-C [8], Tiny-Imagenet-C [8]). The corrupted datasets are the original test data poisoned by everyday image corruptions. Each noise has five intensity levels when injected into images. Unless specified, the proposed CKR-Calibrator is inserted into the last convolution layer during the optimization phase. Parameter Setting. Unless stated otherwise, the default experiment settings are given as follows: thresholds α = 0.7, β = 0.5, margin γ = 100. In the training stage, the default optimizer is SGD, the learning rate is 0.01, and the epoch number is 200.

144

Y. Bei et al.

Metric. For clean datasets, we adapt the top-1 accuracy as a metric. For corrupt datasets, we adapt the mean top-1 accuracy over the different corruption types as a metric. 4.1

The Effect of CKR-Calibrator for SOTA CNN Classifiers

In this section, we verify the effectiveness of CKR-Calibrator on four image classification datasets and the corresponding corrupted datasets. All results are reported as an average of three runs to ensure fairness. From Table 1, we can see that CKR-Calibrator helps the CNN classifier improve the accuracy by 1%–4% on the clean datasets and 1%–5% on the corrupted datasets, which verifies that the proposed method can improve the robustness of the model effectively. The accuracy improvement on the corrupted datasets is higher than that on the clean datasets, the possible reason is that the optimized convolution kernel can filter out perturbations and extract the key features. However, CKR-Calibrator has little effect for models trained on the MNIST dataset, the latent reason is that the convolution kernels of the original model are robust enough for handling handwritten digits. Table 1. The calibration performance of CKR-Calibrator for nine mainstream classifiers on four benchmark datasets with clean and corrupted samples. ‘score 1/+score 2’ denotes the base accuracy and incremental accuracy with the proposed CKRCalibrator, respectively. (All score are in %)

clean samples

models

MNIST

Vgg16

99.76/+0.05 93.39/+0.85 72.14/+2.30 63.82/+3.41

Cifar10

Cifar100

Tiny Imagenet

GoogLeNet

99.69/+0.02 93.47/+1.07 70.38/+1.85 64.25/+3.72

ResNet50

99.68/-0.04

95.00/+0.32 77.25/+3.03 65.36/+2.95

WideResNet 99.67/+0.01 90.15/+1.78 66.29/+2.19 74.83/+3.87 SqueezeNet

99.72/-0.02

90.95/+1.98 67.98/+1.49 67.12/+2.91

MobileNet

99.54/-0.09

90.83/+2.06 68.16/+2.23 66.92/+3.25

AlexNet

99.59/+0.03 85.69/+2.46 56.87/+3.77 52.14/+3.82

ResNeXt50

99.74/-0.07

DenseNet

99.87/+0.06 95.04/+1.08 77.96/+1.74 72.19/+2.67

models

MNIST

corrupted samples Vgg16

94.44/+0.76 75.27/+1.07 71.67/+3.12 Cifar10

Cifar100

Tiny Imagenet

78.66/+1.04 73.12/+2.04 47.77/+2.76 35.91/+4.22

GoogLeNet

79.05/+0.53 73.47/+2.96 48.52/+4.27 36.72/+3.76

ResNet50

70.28/+0.63 75.38/+1.47 52.62/+1.96 37.86/+4.75

WideResNet 69.70/+1.16 66.93/+3.41 41.20/+4.73 41.21/+3.58 SqueezeNet

77.50/+0.33 71.55/+4.02 45.11/+1.82 35.73/+5.24

MobileNet

76.29/+1.24 70.84/+2.58 47.81/+2.27 36.96/+4.30

AlexNet

86.05/+0.23 68.93/+3.97 38.27/+4.83 21.17/+5.58

ResNeXt50

67.64/+2.04 72.27/+1.70 49.43/+2.74 39.72/+4.95

DenseNet

75.04/+1.89 74.76/+2.57 51.41/+2.87 40.16/+5.03

CKR-Calibrator: Convolution Kernel Robustness Evaluation and Calibration

145

Fig. 3. The base (blue) and improved (orange) accuracy of SqueezeNet on Cifar10 under different types of perturbation. (Color figure online) Table 2. The base and improved accuracy of CKR-Calibrator combined with different robustness optimization methods on four mainstream CNN-based Classifiers. Coutout [3]

Mixup [38]

CutMix [35] AutoAug [32] AugMix [9]

AlexNet

62.23/+1.22 64.37/+1.52 62.45/+1.19 67.87/+2.12

Vgg16

69.67/+2.78

78.86/+2.07 83.52/+0.45 79.51/+0.92 85.92/+1.44

87.83/+1.62

SqueezeNet 74.61/+0.97 77.53/+1.43 73.79/+0.83 79.55/+1.93

84.46/+2.49

ResNet50

89.14/+1.24

81.26/+0.45 85.78/+1.13 83.26/+0.92 87.15/+1.02

Figure 3 shows the effect of the CKR-Calibrator under different types of perturbations. We can observe that the proposed CKR-Calibrator achieves improvement on various types of perturbations, which verifies that the CKR-Calibrator can comprehensively enhance the ability of the model to cope with disturbances. Moreover, the proposed CKR-Calibrator achieves the highest increase on blur types of perturbations, the possible reason is that the blur samples retain more features, and the convolution kernel optimized by the CKR-Calibrator can effectively extract these features. 4.2

The Effect of CKR-Calibrator Combined with SOTA Method

Due to the proposed CKR-Calibrator optimizing the robustness of the mode from a new perspective, the proposed method can be combined with all existing model robustness optimization methods. In this section, we verify the effectiveness of the CKR-Calibrator combined with five mainstream model robustness optimization strategies for four mainstream CNN classifiers on Cifar10-C. From Table 2, we can see that the CKR-Calibrator can improve the accuracy by about 2% and achieve the best results when combined with AugMix. 4.3

The Ablation Study

In this section, we conduct the ablation study on different layer depths where the CKR-Calibrator is plugged. From Fig. 4, we can see that the model with

146

Y. Bei et al.

Fig. 4. The ablation study on different layer depths on clean samples (a) and corrupted samples (b). Red dash line denotes the original accuracy. (Color figure online)

the CKR-Calibrator achieves better performance along the layer goes deeper. It reveals that the CKR-Calibrator is more suitable for deep layers. The reason behind this observation can be attributed to the nature of the convolution kernels in shallow layers, which primarily capture low-dimensional features of the input samples. In these shallow layers, the low-dimensional features of samples belonging to the same class may not exhibit sufficient similarity. As a result, the presence of dissimilar features can interfere with the CKR-Calibrator’s ability to identify and select robust convolution kernels. In contrast, as the network’s depth increases, the convolution kernels in deeper layers can extract more abstract and high-dimensional features. These higher-level features tend to exhibit greater similarity among samples of the same class. Consequently, the CKR-Calibrator becomes more effective at identifying and screening robust convolution kernels, leading to enhanced performance in deeper layers.

5

Conclusion

In this paper, we put forward a universal and pluggable CKR-Calibrator for evaluating and optimizing the robustness of convolution kernels. Unlike other model robustness measures and optimization methods, we focus on the basic component of the model: the convolution kernel. Extensive experiments demonstrate that the proposed robustness evaluation method can effectively filter out robust convolution kernels, and the proposed CKR-Calibrator can improve the performance of mainstream CNN-based classifiers especially on corrupted datasets. In the future, we will focus on exploring the evaluation and calibration of convolution kernels’ robustness for other vision tasks including image segmentation and object detection. Acknowledgements. This work is funded by Public Welfare Technology Applied Research Projects of Zhejiang Province, China (LGG21F020004), Basic Public Welfare Research Project of Zhejiang Province (LGF21F020020), Ningbo Natural Science Foundation (2022J182), and the Fundamental Research Funds for the Central Universities (2021FZZX001-23, 226-2023-00048).

CKR-Calibrator: Convolution Kernel Robustness Evaluation and Calibration

147

References 1. Bach, S., Binder, A., Montavon, G., Klauschen, F., M¨ uller, K.R., Samek, W.: On pixel-wise explanations for non-linear classifier decisions by layer-wise relevance propagation. PLoS ONE 10(7), e0130140 (2015) 2. Chattopadhay, A., Sarkar, A., Howlader, P., Balasubramanian, V.N.: Gradcam++: generalized gradient-based visual explanations for deep convolutional networks. In: 2018 IEEE Winter Conference on Applications of Computer Vision (WACV), pp. 839–847. IEEE (2018) 3. DeVries, T., Taylor, G.W.: Improved regularization of convolutional neural networks with cutout. arXiv (2017) 4. Ding, G.W., Sharma, Y., Lui, K.Y.C., Huang, R.: Mma training: direct input space margin maximization through adversarial training. arXiv (2018) 5. Feng, Z., Hu, J., Wu, S., Yu, X., Song, J., Song, M.: Model doctor: a simple gradient aggregation strategy for diagnosing and treating CNN classifiers. In: AAAI, vol. 36, pp. 616–624 (2022) 6. Goodfellow, I.J., Shlens, J., Szegedy, C.: Explaining and harnessing adversarial examples. arXiv (2014) 7. He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 770–778 (2016) 8. Hendrycks, D., Dietterich, T.: Benchmarking neural network robustness to common corruptions and perturbations. arXiv preprint arXiv:1903.12261 (2019) 9. Hendrycks, D., Mu, N., Cubuk, E.D., Zoph, B., Gilmer, J., Lakshminarayanan, B.: Augmix: a simple data processing method to improve robustness and uncertainty. arXiv (2019) 10. Howard, A.G., et al.: Mobilenets: efficient convolutional neural networks for mobile vision applications. arXiv (2017) 11. Hu, J., et al.: CNN LEGO: disassembling and assembling convolutional neural network. arXiv (2022) 12. Huang, G., Liu, Z., Laurens, V.D.M., Weinberger, K.Q.: Densely connected convolutional networks. IEEE Computer Society (2016) 13. Huang, Y., Chen, Y.: Autonomous driving with deep learning: a survey of stateof-art technologies. arXiv (2020) 14. Iandola, F.N., Han, S., Moskewicz, M.W., Ashraf, K., Dally, W.J., Keutzer, K.: Squeezenet: alexnet-level accuracy with 50× fewer parameters and¡ 0.5 mb model size. arXiv preprint arXiv:1602.07360 (2016) 15. Krizhevsky, A., Hinton, G., et al.: Learning multiple layers of features from tiny images (2009) 16. Krizhevsky, A., Sutskever, I., Hinton, G.: Imagenet classification with deep convolutional neural networks. In: NeurIPS, vol. 25, no. 2 (2012) 17. Lapuschkin, S., W¨ aldchen, S., Binder, A., Montavon, G., Samek, W., M¨ uller, K.R.: Unmasking clever hans predictors and assessing what machines really learn. Nat. Commun. 10(1), 1096 (2019) 18. Le, Y., Yang, X.: Tiny imagenet visual recognition challenge. CS 231N 7(7), 3 (2015) 19. LeCun, Y., Bengio, Y., Hinton, G.: Deep learning. Nature 521(7553), 436–444 (2015) 20. LeCun, Y., Bottou, L., Bengio, Y., Haffner, P.: Gradient-based learning applied to document recognition. Proc. IEEE 86(11), 2278–2324 (1998)

148

Y. Bei et al.

21. Lengerich, B.J., Konam, S., Xing, E.P., Rosenthal, S., Veloso, M.: Towards visual explanations for convolutional neural networks via input resampling. arXiv (2017) 22. Madry, A., Makelov, A., Schmidt, L., Tsipras, D., Vladu, A.: Towards deep learning models resistant to adversarial attacks. arXiv (2017) 23. Miotto, R., Wang, F., Wang, S., Jiang, X., Dudley, J.T.: Deep learning for healthcare: review, opportunities and challenges. Brief. Bioinform. 19(6), 1236–1246 (2018) 24. Mu, N., Gilmer, J.: Mnist-c: a robustness benchmark for computer vision (2019) 25. Omeiza, D., Speakman, S., Cintas, C., Weldermariam, K.: Smooth grad-cam++: an enhanced inference level visualization technique for deep convolutional neural network models. arXiv (2019) 26. Schroff, F., Kalenichenko, D., Philbin, J.: Facenet: a unified embedding for face recognition and clustering. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 815–823 (2015) 27. Simonyan, K., Vedaldi, A., Zisserman, A.: Deep inside convolutional networks: visualising image classification models and saliency maps. arXiv (2013) 28. Simonyan, K., Zisserman, A.: Very deep convolutional networks for large-scale image recognition. arXiv (2014) 29. Szegedy, C., et al.: Going deeper with convolutions. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 1–9 (2015) 30. Szegedy, C., et al.: Intriguing properties of neural networks. arXiv (2013) 31. Wang, H., et al.: Score-cam: score-weighted visual explanations for convolutional neural networks. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, pp. 24–25 (2020) 32. Wei, W., Zhou, J., Wu, Y.: Beyond empirical risk minimization: local structure preserving regularization for improving adversarial robustness. arXiv (2023) 33. Xie, S., Girshick, R., Doll´ ar, P., Tu, Z., He, K.: Aggregated residual transformations for deep neural networks. IEEE (2016) 34. Yang, Y., et al.: Boundary thickness and robustness in learning models. NeurIPS 33, 6223–6234 (2020) 35. Yun, S., Han, D., Oh, S.J., Chun, S., Choe, J., Yoo, Y.: Cutmix: regularization strategy to train strong classifiers with localizable features. In: ICCV, pp. 6023– 6032 (2019) 36. Zagoruyko, S., Komodakis, N.: Wide residual networks. arXiv (2016) 37. Zeiler, M.D., Fergus, R.: Visualizing and understanding convolutional networks. In: Fleet, D., Pajdla, T., Schiele, B., Tuytelaars, T. (eds.) ECCV 2014. LNCS, vol. 8689, pp. 818–833. Springer, Cham (2014). https://doi.org/10.1007/978-3-31910590-1 53 38. Zhang, H., Cisse, M., Dauphin, Y.N., Lopez-Paz, D.: mixup: beyond empirical risk minimization. arXiv (2017) 39. Zhou, B., Khosla, A., Lapedriza, A., Oliva, A., Torralba, A.: Learning deep features for discriminative localization. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2921–2929 (2016) 40. Zintgraf, L.M., Cohen, T.S., Adel, T., Welling, M.: Visualizing deep neural network decisions: prediction difference analysis. arXiv (2017)

SGLP-Net: Sparse Graph Label Propagation Network for Weakly-Supervised Temporal Action Localization Xiaoyao Wu

and Yonghong Song(B)

Faculty of Electronic and Information Engineering, Xi’an Jiaotong University, Xi’an, China [email protected], [email protected]

Abstract. The present weakly-supervised methods for Temporal Action Localization are primarily responsible for capturing the temporal context. However, these approaches have limitations in capturing semantic context, resulting in the risk of ignoring snippets that are far apart but sharing the same action categories. To address this issue, we propose an action label propagation network utilizing sparse graph networks to effectively explore both temporal and semantic information in videos. The proposed SGLP-Net comprises two key components. One is the multiscale temporal feature embedding module, a novel method that extracts both local and global temporal features of the videos during the initial stage using CNN and self-attention and serves as a generic module. The other is an action label propagation mechanism, which uses graph networks for feature aggregation and label propagation. To avoid the issue of excessive feature completeness, we optimize training using sparse graph convolutions. Extensive experiments are conducted on THUMOS14 and ActivityNet1.3 benchmarks, among which advanced results demonstrate the superiority of the proposed method. Code can be found at https:// github.com/xyao-wu/SGLP-Net. Keywords: Weakly-Supervised Temporal Action Localization · Attention modules · Action label propagation network · Sparse graph networks

1

Introduction

Temporal Action Localization (TAL) aims to locate action instances from untrimmed long videos and provide the start time, end time, and action categories of these actions. TAL is a sub task of video content understanding and has been widely applied in various domains such as video retrieval [6], surveillance, anomaly detection, sports analysis and more. With the rapid development of deep learning, fully-supervised methods [2, 14,23,27,29,30] have achieved significant results. However, they have high annotation costs, need a lot of data, and are subjective. Weakly-Supervised Temporal c The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2024  B. Luo et al. (Eds.): ICONIP 2023, LNCS 14451, pp. 149–161, 2024. https://doi.org/10.1007/978-981-99-8073-4_12

150

X. Wu and Y. Song

Action Localization (WTAL) [5,15,19,20,22,24] only requires video-level action labels, which are inexpensive and simple to get. Also, it lessens subjective errors at source, making it more appropriate to use in the real-world. One of the WTAL methods is to adopt a pre-made action recognition network and a classifier for action localization [15,22]. These methods are simple and easy to understand, but they tend to be less accurate. The other [19,20] often formulates the problem as a multi-instance learning (MIL) [7] task. These approaches divide the video into packages and have an improvement in accuracy. However, it may overlook or make mistakes in detection because it doesn’t accurately record the temporal positions of actions on the timeline. Furthermore, as it is trained just on video-level labels, it is more susceptible to noise and has lower detection accuracy.

Fig. 1. Intuition behind the proposed SGLP-Net. Transparent frames represent background frames. (a) Missed detection of snippets. (b) Missed detection of actions. (c) Over-completeness.

Figure 1 demonstrates an example of a MIL-based network. Analysis has shown that MIL-based approaches consider video snippets as separate instances and ignore their underlying temporal patterns, leading to insufficient utilization of long-term information. In fact, most feature embedding layers in prior pipelines are implemented using one or more stacked conv1d, which are limited by the kernel’s shape and size. While they are effective at collecting short-term dependencies, they fail to catch long-term snippet characteristics, making it difficult to cope with flexible and varied video durations. To overcome these limitations, we propose a novel weakly-supervised feature embedding method by approaching local and global temporal information in videos. Regarding the missing action detection caused by the absence of framelevel annotations in the WTAL task, we introduce a new framework named Sparse Graph Label Propagation Network (SGLP-Net). Our approach aims to explore the inherent temporal and semantic information in the video itself. Specifically, SGLP-Net leverages correlation between snippets, propagates labels across nodes, and infers labels of unlabeled snippets to categorize actions for the entire video. Each snippet is represented as a node in the graph, and connections between them are made using context and similarity edges. Finally, adopting a

Sparse Graph Label Propagation Network for WTAL

151

sparse graph network can help with closely linked actions that can’t be properly separated in the MIL-based environment due of the dense characteristics of the snippets. Layer sampling in Graph Convolutional Networks (GCN) is used in this method to reduce computational complexity. Overall, our proposed method offers encouraging results in improving the performance of action snippet detection in videos by taking a unique perspective on MIL-based video comprehension methods and addressing the identified challenges. We summarize our main contributions as follows: – A multi-scale temporal feature embedding module is designed to capture both local and global temporal relationships using convolution and self-attention in the initial stage. It can be easily integrated into other TAL frameworks. – We propose a novel action label propagation mechanism for temporal action localization of untrimmed videos under weak supervision settings. It is implemented using a sparse graph network and iteratively trained using a pseudo label strategy to automatically transmit snippet feature information to adjacent snippets, in order to better utilize the inherent features and accurately locate the temporal position of each behavior in the video. – We provide extensive experiments and the qualitative visualization results to demonstrate the efficacy of our design. And the quantitative results show that our SGLP-Net outperforms current state-of-the-art (SOTA) methods.

2

Related Work

Graph-based Temporal Action Localization. In recent years, graph-based methods have been proposed for TAL. [27] constructs a graph based on the interaction, aiming to classify and adjust each proposal’s border by using contextual information. [30] proposes video self-stitching and cross-scale graph pyramid network. All of these methods follow the fully-supervised paradigm. In WTAL, [26] supplements and enhances features by mining supplementary data across segments. In contrast, we use sparse graph convolutions for label propagation and consider various kinds of interactions between nodes while building edges. Additionally, we employ pseudo label for training to effectively exploit intrinsic information in videos. Label Propagation Algorithm (LPA). LPA [21] is a popular method for community detection in graph data. It creates communities by spreading the labels of the nodes to their nearby nodes depending on how comparable their attributes are. It has been widely used in various applications. [8] performs label propagation on a large image dataset for few shot learning. Encouraged by these, we attempt to apply sparse graph convolutions and LPA to WTAL. To the best of our knowledge, we are the first to do this, and the experimental results support our approach.

152

3

X. Wu and Y. Song

Method

In this section, we first define the formulation of WTAL, then provide a detailed introduction to our SGLP-Net. Figure 2 displays the design of our model. The network comprises two essential components: (a) Multi-scale Temporal Feature Embedding, which is composed of two parts. The first part is a convolutional block with four convolutional layers for obtaining local features, while the second part, a self-attention module, is used to obtain global features. (b) Action Label Propagation, which takes the embedded features X as input and generates edges based on various associations between nodes to build a graph. The layer sampling [3] is used during training. Overall, the SGLP-Net can efficiently extract temporal features and propagate labels within videos, leading to precise temporal action localization results.

Fig. 2. The framework of our proposed SGLP-Net.

3.1

Problem Definition

Given an untrimmed video V , which contains multiple action instances Nψ {ψi = (tsi , tei , ci )}i=1 , where Nψ is the number of action instances, tsi and e ti denotes the start and end time of action instance ψi , and ci ∈ RC represents the class category. WTAL aims to detect all action instances  M  only using video-level labels, where φˆi denotes the conψˆi = tˆs , tˆe , cˆi , φˆi i

i

i=1

fidence score of action instance ψˆi and M is the number of instances. 3.2

Feature Extractor

Following recent WTAL [15,25] methods, we first divide it into non-overlapping snippets based on a predefined sampling ratio σ, and then apply a Kinetics400 [12] pre-trained I3D [1] model to extract features. The extracted snippetlevel features are denoted by the 12 D-dimensional feature vectors, which can be concatenated to form the video-level representation F ∈ RT ×D .

Sparse Graph Label Propagation Network for WTAL

3.3

153

Multi-scale Temporal Feature Embedding

Since the extracted video features F are not trained for the WTAL task, in order to map F to task-specific feature space, we introduce a novel feature embedding module named Multi-scale Temporal Feature Embedding (MTF). Concretely, following [15,20], the features are then fed into a temporal convolution layer and the ReLU [18] activation for feature modeling in the previous works: X = ReLU (Conv(F )). However, this method is insufficient in utilizing global temporal features. Inspired by the attention [22] used in video understanding, our MTF captures the multi-resolution local temporal dependencies and the global temporal dependencies within an untrimmed video. MTF uses four CNNs to obtain multi-scale features in video. Formally, we can denote the MTF module as following: Xi = ReLU (Conv3×3 (F, θi ))

(1)

XConv = X1 ⊗ X2 ⊗ X3 ⊗ X4

(2)

where i ∈ {1, 2, 3, 4}; θi denotes the trainable convolution parameters of feature embedding layer and ReLU is the non-linear activation function. The output of Conv is formed with a concatenation of Xi . The global temporal dependencies is achieved with a self-attention module. In detail, we aim to produce an attention map M ∈ RT ×T that estimates the pairwise correlation between snippets. F c = Conv1×1 (F, θ5 ),

(3)

F ci = Conv1×1 (F c , θi ), i ∈ {1, 2, 3}

(4)

c2 T

F

c4

M = (F c1 )(F ) ,

(5)

c3

= Conv1×1 (M F , θ9 ),

(6)

XGlobal = F c + F c4

(7)

A skip connection using the original features F produces the final MTF outputs: X = XConv + XGlobal + F . 3.4

Action Label Propagation

We construct SGLP-Net model using feature sequences X. Let ς(υ, ε) be a graph of N nodes with nodes vi ∈ υ and edge eij = (vi , vj ) ∈ ε. Furthermore, let A ∈ RN ×N be the adjacency matrix associated with ς. The weight of an edge represents the strength of the relationship between two connected nodes. A higher weight indicates a stronger association between them. In the subsequent, we introduce how to construct the graph and use it for temporal action localization.

154

X. Wu and Y. Song

Action Graph Construction. Action Similarity Edges. Due to the tendency of video patterns with the same action category to be the same, we believe that snippets with high similarity are more likely to have the same labels. Based on this prior, we establish the similarity edge between snippet si and sj if r(si , sj ) > θsim , where θsim is the similarity threshold and r(si , sj ) is defined as follows:  f (s )·f (s ) i j if r(si , sj ) ≥ θsim S Aij = r(si , sj ) = f (si )f (sj ) (8) 0 if r(si , sj ) < θsim where f (si ) and f (sj ) are the features of the snippet si and sj . Without the threshold, the network would add all edges. Some snippets with low similarity and nothing to contribute to the classification and localization of si are aggregated, which might degrade accuracy and introduce noise into the network. Action Contextual Edges. Considering the possibility of low similarity between two adjacent snippets in time due to the loss of features or action segmentation, but their contribution on the current snippet cannot be ignored, we add the action contextual edges. The contextual edge between snippets is defined as follows:  |i − j| if d(si , sj ) ≤ θdis C (9) Aij = d(si , sj ) = 0 if d(si , sj ) > θdis where d(si , sj ) is the distance of the snippet si and sj ; θdis is the distance threshold. Sparse Graph Convolution for Action Localization Adjacency Matrix. By combining the two matrices, the adjacency matrix is defined as follows: A=

αAS + (1 − α)AC 2

(10)

where the two matrices AS and AC include ASij (8) and AC ij (9) as their (i, j)-th entries respectively, and α is the hyperparameter. Graph Convolution. Given the constructed graph, we apply the GCN to do action localization. We build L-layer graph convolutions in our implementation. For the l-th layer (1 ≤ l ≤ L), the graph convolution is implemented as follows: X (l) = σ(AX (l−1) W (l) )

(11)

where X (l) ∈ RN ×D are the hidden features at layer l; A is the adjacency matrix; W (l) ∈ RD×D is the parameter matrix to be learned of the l-th layer; X (0) ∈ RN ×D are the input features; σ(·) is ReLU. In addition, our experiments find it more effective by combining the output features of the GCN with the input features: X  = X + X (l) , where X  is the obtained features, serves the subsequent localization task.



Algorithm 1: The training process of SGLP-Net.
Input: the embedded features X = {x_i^(0)}_{i=1}^N; the snippet set S = {s_i}_{i=1}^N; the graph depth L; the weight matrices W^(l); the similarity threshold θ_sim; the distance threshold θ_dis.
Output: the trained SGLP-Net.
1.  Initialize the nodes v_i ∈ υ with the snippets s_i ∈ S
2.  Establish edges e_ij = (v_i, v_j) ∈ ε between nodes using Eq. (8) and Eq. (9)
3.  Obtain the action graph ς(υ, ε)
4.  Calculate the adjacency matrix using Eq. (10)
5.  while not converged do
6.      for l = 1 ... L do
7.          for v ∈ υ do
8.              Propagate features using Eq. (12)
9.          end for
10.     end for
11.     Generate pseudo labels
12.     Predict and localize actions
13. end while

Sparse GCN. If we directly use the graph convolution above, the network becomes very large and computationally expensive because of the large number of snippets in an untrimmed video. We therefore use a sparse graph network for optimization and adopt layer sampling. It assumes a potentially infinite graph G′ whose node set V′ contains countless nodes. The current graph G is regarded as a subgraph of G′, and all nodes V of G are independent and identically distributed (IID) samples drawn from V′ according to a probability distribution P. The propagation function is

x_i^(l) = σ( (1 / t_l) Σ_{j=1}^{t_l} A_{i,u_j} x_{u_j}^(l−1) W^(l−1) ),    (12)

where t_l is the number of IID samples and u_j, j ∈ {1, ..., t_l}, are the sampled nodes. For better readability, Algorithm 1 depicts the algorithmic flow of our method.
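A minimal sketch of the layer-sampling propagation in Eq. (12) follows; it uses uniform node sampling for simplicity, whereas FastGCN [3] uses importance sampling, so the sampler here is an assumption.

```python
import torch

def sampled_gcn_layer(a, x, w, t_l=64):
    """Monte-Carlo estimate of A X W over t_l sampled nodes (Eq. 12).

    a: (N, N) adjacency, x: (N, D) features from the previous layer, w: (D, D) weights.
    """
    n = x.shape[0]
    u = torch.randperm(n)[: min(t_l, n)]        # IID node samples u_1, ..., u_tl
    agg = a[:, u] @ (x[u] @ w) / u.numel()      # average over the sampled neighbours
    return torch.relu(agg)
```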

4 Experiments

4.1 Experimental Setup

Datasets. We evaluate our method on two popular TAL datasets. THUMOS’14 contains over 20 h of videos from 20 sports classes. The dataset [11] is challenging due to the diversified video lengths and the large number of action instances per video. The validation and test sets provide 200 and 213 untrimmed videos for action localization respectively. Following previous work, we conduct training on the validation set and perform evaluation on the test set. ActivityNet-v1.3 is a large-scale dataset with 200 complex daily activities [10]. It has 10,024 training videos and 4,926 validation videos. We use the training set to train our model and the validation set for evaluation.



Evaluation Metrics. We adopt the standard metric for performance evaluation, i.e., mean Average Precision (mAP) under different Intersection over Union (IoU) thresholds. In practice, we adopt the official evaluation code provided by ActivityNet. Implementation Details. The Adam optimizer is used with a learning rate of 0.0001 and mini-batch sizes of 16 and 64 for THUMOS'14 and ActivityNet-v1.3, respectively. The number of sampled snippets T is 750 for THUMOS'14 and 150 for ActivityNet-v1.3. We use two graph layers. For the multi-step proposal refinement, 100 and 50 epochs are used, respectively. Action proposals are generated at the last epoch of each refinement step. We train the network on ten GeForce RTX 2080 Ti GPUs.

4.2 Comparison with the State of the Art

In Table 1, we compare our SGLP-Net with other methods on THUMOS'14. Selected fully-supervised methods are presented for reference. We observe that SGLP-Net outperforms all the previous WTAL methods and establishes new SOTA with 46.6% average mAP for IoU thresholds 0.1:0.7. In particular, our approach outperforms ACGNet, which also utilizes GCN to guide the model training but without explicit modeling for long-term temporal information. Even compared with the fully-supervised methods, SGLP-Net outperforms TAL-Net [2] and achieves comparable results with P-GCN when the IoU threshold is low.

Table 1. Comparison with state-of-the-art methods on the THUMOS'14 dataset. mAP@IoU (%); the average mAPs are computed under the IoU thresholds [0.1:0.1:0.7].

Supervision | Method            | Publication | 0.1  | 0.2  | 0.3  | 0.4  | 0.5  | 0.6  | 0.7  | AVG
Fully       | TAL-Net [2]       | CVPR 2018   | 59.8 | 57.1 | 53.2 | 48.5 | 42.8 | 33.8 | 20.8 | 45.1
            | P-GCN [27]        | ICCV 2019   | 69.5 | 67.8 | 63.6 | 57.8 | 49.1 | –    | –    | –
            | VSGN [30]         | ICCV 2021   | –    | –    | 66.7 | 60.4 | 52.4 | 41.0 | 30.4 | –
            | ActionFormer [29] | ECCV 2022   | –    | –    | 82.1 | 77.8 | 71.0 | 59.4 | 43.9 | –
            | TriDet [23]       | CVPR 2023   | –    | –    | 83.6 | 80.1 | 72.9 | 62.4 | 47.4 | –
Weakly      | STPN [19]         | CVPR 2018   | 52.0 | 44.7 | 35.5 | 25.8 | 16.9 | 9.9  | 4.3  | 27.0
            | CMCS [15]         | CVPR 2019   | 57.4 | 50.8 | 41.2 | 32.1 | 23.1 | 15.0 | 7.0  | 32.4
            | DGAM [22]         | CVPR 2020   | 60.0 | 54.2 | 46.8 | 38.2 | 28.8 | 19.8 | 11.4 | 37.0
            | ACGNet [26]       | AAAI 2021   | 68.1 | 62.6 | 53.1 | 44.6 | 34.7 | 22.6 | 12.0 | 42.5
            | ACM-Net [20]      | TIP 2021    | 68.9 | 62.7 | 55.0 | 44.6 | 34.6 | 21.8 | 10.8 | 42.6
            | ASM-Loc [9]       | CVPR 2022   | 71.2 | 65.5 | 57.1 | 46.8 | 36.6 | 25.2 | 13.4 | 45.1
            | DELU [4]          | ECCV 2022   | 71.5 | 66.2 | 56.5 | 47.7 | 40.5 | 27.2 | 15.3 | 46.4
            | F3-Net [17]       | TMM 2023    | 69.4 | 63.6 | 54.2 | 46.0 | 36.5 | –    | –    | –
            | Ours              | –           | 71.8 | 66.0 | 56.8 | 48.6 | 40.1 | 27.5 | 15.7 | 46.6



Table 2. Comparison with state-of-the-art methods on the ActivityNet-v1.3 dataset. mAP@IoU (%); the average mAPs are computed under the IoU thresholds [0.5:0.05:0.95].

Method       | Publication | 0.5   | 0.75 | 0.95 | AVG
STPN [19]    | CVPR 2018   | 29.3  | 16.9 | 2.6  | 16.3
TSCN [28]    | ECCV 2020   | 35.3  | 21.4 | 5.3  | 21.7
Bas-Net [13] | AAAI 2020   | 34.5  | 22.5 | 4.9  | 22.2
AUMN [16]    | CVPR 2021   | 38.35 | 23.5 | 5.2  | 23.5
UGCT [25]    | CVPR 2021   | 39.1  | 22.4 | 5.8  | 23.8
ACM-Net [20] | TIP 2021    | 37.6  | 24.7 | 6.5  | 24.4
ASM-Loc [9]  | CVPR 2022   | 41.0  | 24.9 | 6.2  | 25.1
F3-Net [17]  | TMM 2023    | 39.9  | 25.0 | 6.7  | 25.2
Ours         | –           | 41.7  | 25.3 | 5.8  | 26.3

Table 3. Contribution of each component (mAP@IoU, %).

MTF | ALP | 0.1  | 0.2  | 0.3  | 0.4  | 0.5  | 0.6  | 0.7  | AVG  | Δ
    |     | 68.9 | 62.7 | 55.0 | 44.6 | 34.6 | 21.8 | 10.8 | 42.6 | 0
 ✓  |     | 70.2 | 63.4 | 56.3 | 46.1 | 35.8 | 24.7 | 13.6 | 44.3 | +1.7
    |  ✓  | 70.8 | 64.9 | 56.4 | 47.0 | 36.7 | 25.8 | 14.1 | 45.1 | +2.5
 ✓  |  ✓  | 71.8 | 66.0 | 56.8 | 48.6 | 40.1 | 27.5 | 15.7 | 46.6 | +4.0

Table 4. Numbers of CNN layers in MTF (mAP@IoU, %).

Num | 0.1  | 0.2  | 0.3  | 0.4  | 0.5  | 0.6  | 0.7  | AVG
1   | 69.3 | 62.9 | 55.7 | 45.0 | 35.1 | 23.6 | 11.3 | 44.3
2   | 70.1 | 65.3 | 56.2 | 47.8 | 38.9 | 26.7 | 14.6 | 45.7
3   | 70.6 | 65.6 | 56.5 | 47.9 | 39.2 | 27.0 | 14.8 | 45.9
4   | 71.8 | 66.0 | 56.8 | 48.6 | 40.1 | 27.5 | 15.7 | 46.6
5   | 71.4 | 65.7 | 56.6 | 48.2 | 39.8 | 27.3 | 14.9 | 46.3

Table 5. Validation of the generality of MTF (mAP@IoU, %).

Method             | 0.1  | 0.2  | 0.3  | 0.4  | 0.5  | 0.6  | 0.7  | AVG
DGAM [22]          | 60.0 | 54.2 | 46.8 | 38.2 | 28.8 | 19.8 | 11.4 | 37.0
DGAM [22] + MTF    | 62.3 | 56.4 | 47.6 | 40.1 | 30.2 | 20.7 | 11.0 | 38.3
ACM-Net [20]       | 68.9 | 62.7 | 55.0 | 44.6 | 34.6 | 21.8 | 10.8 | 42.6
ACM-Net [20] + MTF | 71.3 | 64.1 | 55.8 | 45.2 | 35.6 | 22.8 | 11.9 | 43.8
ASM-Loc [9]        | 71.2 | 65.5 | 57.1 | 46.8 | 36.6 | 25.2 | 13.4 | 45.1
ASM-Loc [9] + MTF  | 71.6 | 65.8 | 56.7 | 47.5 | 38.2 | 26.0 | 14.3 | 45.7

Table 6. Efficacy of the sparse GCN (mAP@IoU, %).

Sparse GCN | 0.1  | 0.2  | 0.3  | 0.4  | 0.5  | 0.6  | 0.7  | AVG
           | 71.1 | 65.2 | 55.8 | 47.3 | 37.1 | 26.4 | 13.9 | 45.3
 ✓         | 71.8 | 66.0 | 56.8 | 48.6 | 40.1 | 27.5 | 15.7 | 46.6

We also conduct experiments on ActivityNet-v1.3 and the comparison results are summarized in Table 2. Our SGLP-Net obtains a new SOTA performance of 26.3% average mAP, surpassing the latest works (e.g. ASM-Loc [9], F3-Net [17]). The results on both datasets justify the effectiveness of SGLP-Net.


4.3 Ablation Studies on THUMOS'14

Contribution of Each Component. In Table 3, we conduct an ablation study to investigate the contribution of each component in SGLP-Net. A gain is achieved by adding any of our proposed modules. Specifically, introducing MTF in the feature embedding stage increases the performance by 1.7%, and introducing ALP increases the performance by 2.5%. The two modules are complementary to each other; when incorporating both, our approach boosts the final performance from 42.6% to 46.6%. Validation of the Generality of MTF. In Table 5, we integrate MTF into DGAM, ACM-Net, and ASM-Loc, resulting in performance improvements of 1.3%, 1.2%, and 0.6%, respectively. These results provide empirical evidence for the effectiveness and generalizability of MTF. Number of CNN Layers in MTF. Table 4 shows the results of increasing the number of CNN layers in MTF. The performance improves as the number of layers increases, indicating better localization results. We adopt 4 layers as our default setting since the performance saturates after that. Effectiveness of the Sparse GCN in ALP. Table 6 shows that the results improve when the sparse GCN is adopted in ALP to train the network. Without the sparse GCN, the current snippet aggregates an excessive amount of irrelevant features, which blurs the most discriminative ones and consequently degrades localization. By leveraging FastGCN [3], we alleviate this issue effectively and also accelerate network training.

4.4 Qualitative Results

Fig. 3. Qualitative results on a video of THUMOS’14. The results of ACM-Net (Baseline), SGLP-Net (Ours), and ground truth (GT) are shown in blue, red, and green, respectively. (Color figure online)



In Fig. 3, we visualize some action detection results generated by our SGLP-Net on THUMOS’14. We can find that our model can successfully detect the action instances in these examples with precise action boundaries. For the untrimmed video with multiple action instances, the foreground actions and background scenes can be well distinguished.

5 Conclusion

In this paper, we propose a new WTAL framework, SGLP-Net. By using MTF in the feature embedding stage, local and global temporal features of the video are obtained at an early stage, and MTF can be embedded as a universal module into other networks. By using a sparse graph network to implement the label propagation mechanism, more information about the video itself can be mined, including the relationships between video snippets. Extensive experiments on publicly available datasets demonstrate the effectiveness of our method.

References 1. Carreira, J., Zisserman, A.: Quo vadis, action recognition? A new model and the kinetics dataset. In: 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 4724–4733 (2017). https://doi.org/10.1109/CVPR.2017.502 2. Chao, Y.W., Vijayanarasimhan, S., Seybold, B., Ross, D.A., Deng, J., Sukthankar, R.: Rethinking the faster R-CNN architecture for temporal action localization. In: 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 1130–1139 (2018). https://doi.org/10.1109/CVPR.2018.00124 3. Chen, J., Ma, T., Xiao, C.: FastGCN: fast learning with graph convolutional networks via importance sampling. In: International Conference on Learning Representations (2018). https://openreview.net/forum?id=rytstxWAW 4. Chen, M., Gao, J., Yang, S., Xu, C.: Dual-evidential learning for weakly-supervised temporal action localization. In: Avidan, S., Brostow, G., Cissa, M., Farinella, G.M., Hassner, T. (eds.) European Conference on Computer Vision, vol. 13664, pp. 192–208. Springer, Heidelberg (2022). https://doi.org/10.1007/978-3-031-197727 12 5. Chen, T., Li, B., Tao, Y., Wang, Y., Zhu, Y.: Class-incremental learning with multiscale distillation for weakly supervised temporal action localization. In: Tanveer, M., Agarwal, S., Ozawa, S., Ekbal, A., Jatowt, A. (eds.) ICONIP 2022. LNCS, vol. 13623, pp. 367–378. Springer, Cham (2023). https://doi.org/10.1007/978-3031-19772-7 12 6. Ciptadi, A., Goodwin, M.S., Rehg, J.M.: Movement pattern histogram for action recognition and retrieval. In: Fleet, D., Pajdla, T., Schiele, B., Tuytelaars, T. (eds.) ECCV 2014. LNCS, vol. 8690, pp. 695–710. Springer, Cham (2014). https://doi. org/10.1007/978-3-319-10605-2 45 7. Dietterich, T.G., Lathrop, R.H., Lozano-P´erez, T.: Solving the multiple instance problem with axis-parallel rectangles. Artif. Intell. 89(1), 31–71 (1997). https:// doi.org/10.1016/S0004-3702(96)00034-3. https://www.sciencedirect.com/science/ article/pii/S0004370296000343



8. Douze, M., Szlam, A., Hariharan, B., J´egou, H.: Low-shot learning with largescale diffusion. In: 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 3349–3358 (2018). https://doi.org/10.1109/CVPR.2018.00353 9. He, B., Yang, X., Kang, L., Cheng, Z., Zhou, X., Shrivastava, A.: ASM-Loc: action-aware segment modeling for weakly-supervised temporal action localization. In: 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 13915–13925 (2022). https://doi.org/10.1109/CVPR52688.2022. 01355 10. Heilbron, F.C., Escorcia, V., Ghanem, B., Niebles, J.C.: Activitynet: a large-scale video benchmark for human activity understanding. In: 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 961–970 (2015). https:// doi.org/10.1109/CVPR.2015.7298698 11. Idrees, H., et al.: The thumos challenge on action recognition for videos “in the wild”. Comput. Vision Image Underst. 155, 1–23 (2017). https://doi.org/ 10.1016/j.cviu.2016.10.018. https://www.sciencedirect.com/science/article/pii/ S1077314216301710 12. Kay, W., et al.: The kinetics human action video dataset (2017) 13. Lee, P., Uh, Y., Byun, H.: Background suppression network for weakly-supervised temporal action localization. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 34, 11320–11327 (2020). https://doi.org/10.1609/aaai.v34i07.6793 14. Li, L., Kong, T., Sun, F., Liu, H.: Deep point-wise prediction for action temporal proposal. In: Gedeon, T., Wong, K.W., Lee, M. (eds.) ICONIP 2019. LNCS, vol. 11955, pp. 475–487. Springer, Cham (2019). https://doi.org/10.1007/978-3-03036718-3 40 15. Liu, D., Jiang, T., Wang, Y.: Completeness modeling and context separation for weakly supervised temporal action localization. In: 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1298–1307 (2019). https:// doi.org/10.1109/CVPR.2019.00139 16. Luo, W., et al.: Action unit memory network for weakly supervised temporal action localization. In: 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 9964–9974 (2021). https://doi.org/10.1109/CVPR46437. 2021.00984 17. Moniruzzaman, M., Yin, Z.: Feature weakening, contextualization, and discrimination for weakly supervised temporal action localization. IEEE Trans. Multimedia, 1–13 (2023). https://doi.org/10.1109/TMM.2023.3263965 18. Nair, V., Hinton, G.E.: Rectified linear units improve restricted Boltzmann machines. In: Proceedings of the 27th International Conference on International Conference on Machine Learning, ICML 2010, pp. 807–814. Omnipress, Madison (2010) 19. Nguyen, P., Liu, T., Prasad, G., Han, B.: Weakly supervised action localization by sparse temporal pooling network. In: 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2017). https://doi.org/10.1109/CVPR. 2018.00706 20. Qu, S., Chen, G., Li, Z., Zhang, L., Lu, F., Knoll, A.: Acm-net: action context modeling network for weakly-supervised temporal action localization. arXiv preprint arXiv:2104.02967 (2021) 21. Raghavan, U.N., Albert, R., Kumara, S.: Near linear time algorithm to detect community structures in large-scale networks. Phys. Rev. E 76, 036106 (2007). https://doi.org/10.1103/PhysRevE.76.036106



22. Shi, B., Dai, Q., Mu, Y., Wang, J.: Weakly-supervised action localization by generative attention modeling. In: 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1006–1016 (2020). https://doi.org/10.1109/ CVPR42600.2020.00109 23. Shi, D., Zhong, Y., Cao, Q., Ma, L., Li, J., Tao, D.: Tridet: temporal action detection with relative boundary modeling. arXiv:2303.07347 (2023) 24. Su, H., Zhao, X., Lin, T., Fei, H.: Weakly supervised temporal action detection with shot-based temporal pooling network. In: Cheng, L., Leung, A.C.S., Ozawa, S. (eds.) ICONIP 2018. LNCS, vol. 11304, pp. 426–436. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-04212-7 37 25. Yang, W., Zhang, T., Yu, X., Qi, T., Zhang, Y., Wu, F.: Uncertainty guided collaborative training for weakly supervised temporal action detection. In: 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 53–63 (2021). https://doi.org/10.1109/CVPR46437.2021.00012 26. Yang, Z., Qin, J., Huang, D.: ACGNet: action complement graph network for weakly-supervised temporal action localization. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 36, pp. 3090–3098 (2022) 27. Zeng, R., et al.: Graph convolutional networks for temporal action localization. In: 2019 IEEE/CVF International Conference on Computer Vision (ICCV), pp. 7093–7102 (2019). https://doi.org/10.1109/ICCV.2019.00719 28. Zhai, Y., Wang, L., Tang, W., Zhang, Q., Yuan, J., Hua, G.: Two-stream consensus network for weakly-supervised temporal action localization. In: Vedaldi, A., Bischof, H., Brox, T., Frahm, J.-M. (eds.) ECCV 2020. LNCS, vol. 12351, pp. 37–54. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-58539-6 3 29. Zhang, C.L., Wu, J., Li, Y.: ActionFormer: localizing moments of actions with transformers. In: Avidan, S., Brostow, G., Ciss´e, M., Farinella, G.M., Hassner, T. (eds.) ECCV 2022. LNCS, vol. 13664, pp. 492–510. Springer Nature Switzerland, Cham (2022). https://doi.org/10.1007/978-3-031-19772-7 29 30. Zhao, C., Thabet, A., Ghanem, B.: Video self-stitching graph network for temporal action localization. In: 2021 IEEE/CVF International Conference on Computer Vision (ICCV), pp. 13638–13647 (2021). https://doi.org/10.1109/ICCV48922. 2021.01340

VFIQ: A Novel Model of ViT-FSIMc Hybrid Siamese Network for Image Quality Assessment

Junrong Huang1(B) and Chenwei Wang2

1 Department of Computer Science and Technology, Nanjing University, Nanjing 210093, Jiangsu, China
[email protected]
2 Google LLC, Mountain View, CA 94043, USA

Abstract. Image Quality Assessment (IQA) measures how humans perceive the quality of images. In this paper, we propose a new model named VFIQ – a ViT-FSIMc Hybrid Siamese Network for Full-Reference IQA – that combines signal-processing and learning-based approaches, the two categories of IQA algorithms. Specifically, we design a hybrid Siamese network that leverages the Vision Transformer (ViT) and the feature similarity index measure with chrominance (FSIMc). To evaluate the performance of the proposed VFIQ model, we first pre-train the ViT module on the PIPAL dataset, and then evaluate our VFIQ model on several popular benchmark datasets including TID2008, TID2013, and LIVE. The experimental results show that our VFIQ model outperforms state-of-the-art IQA models in the commonly used correlation metrics PLCC, KRCC, and SRCC. We also demonstrate the usefulness of our VFIQ model in different vision tasks, such as image recovery and generative model evaluation.

Keywords: FR-IQA · ViT · Siamese Network

1 Introduction

How to quantitatively assess the perceptual similarity between two images? For example, as shown in Fig. 1, under different distortion circumstances, how can we quantify the distortions? This question has been recently investigated in the field of image quality assessment (IQA). Broadly, IQA can be classified into three categories: Full-Reference, No-Reference, and Aesthetic IQA. In particular, the goal of Full-Reference IQA (FR-IQA) models is to design a reasonable evaluation metric that measures the human perceptual similarity between a reference image and a distorted image. Among these three categories, we focus on the FR-IQA models in this paper. There are two main-stream approaches for the FR-IQA algorithms: signal processing techniques and learning-based metrics [1]. Signal processing techniques such as PSNR, SSIM [17], and MS-SSIM [18] compare the low-level texture and stripe features, while learning-based metrics such as LPIPS [21] and


Fig. 1. The images under different distortion circumstances: (a) ground truth, (b) noise, (c) compression, (d) contrast change.

AHIQ [9] use powerful backbones such as inception-v3, VGG, and ResNet50 to extract feature maps for comparison and evaluation. Generative models such as Generative Adversarial Networks [6] have significantly advanced the image processing field [7,19]. Yet, they also pose new challenges to IQA, as they can produce images and videos that are difficult for the human vision system (HVS) to distinguish from real ones. In fact, it is widely observed that most generative models can create realistic-like images on global semantic features, but they often fail to accurately capture the details and textures [8]. Since the HVS is not very sensitive to subtle variations in details, current IQA algorithms that use pixel-wise comparison could be weak in reflecting the real performance of generative models on global semantic features. We expect generative models to achieve a good balance of capturing global semantic information and detailed low-level pixel-wise information. To achieve this goal, we can use a patch-level [1] comparison based on the Vision Transformer (ViT) to extract high-level global semantic information. This is because the ViT [4] can effectively capture long-range dependencies and extract highlevel semantic features with its global receptive field and multi-head attention mechanism. Moreover, instead of using Convolutional Neural Networks (CNN) as the traditional detailed spatial texture extractor for learning-based IQA models [1], we would use the Feature Similarity Index Measure with Chrominance (FSIMc) [20] as our low-level information evaluation metric. The phase congruency (PC) model of FSIMc can fine-grainedly represent local detailed spatial structure, and the gradient magnitude (GM) can depict the texture similarity in image. To this end, in this paper we combine the two main-stream IQA algorithm ideas above and propose a ViT-FSIMc Hybrid Siamese Network for FR-IQA, abbreviated as VFIQ. In particular, we split the image information that is strongly correlated with the HVS into two domains: the abstract high-level semantic feature domain and the low-level detailed texture domain. The ViT extracts the high-level abstract information, and the FSIMc [20] measures the low-level texture. Finally, we design the VFIQ score as the weighted average of the outputs from a fully-connected layer whose inputs are the two contributions described above. To compare the performance of different FR-IQA models, in this paper we use PyTorch to develop a FR-IQA framework as a fair benchmark. Then we retrain



various FR-IQA models on the PIPAL [8] dataset and test them on cross-domain datasets including TID2008 [12], TID2013 [11], and LIVE [14]. Finally, to assess the performance of our model, we consider three widely used metrics: Pearson's linear correlation coefficient (PLCC), Spearman's rank-order correlation coefficient (SRCC), and Kendall's rank-order correlation coefficient (KRCC). While PLCC assesses the linear correlation between the ground truth and the IQA model's output predictions, SRCC and KRCC describe the monotonic correlation similarity. For all of them, the higher the score, the better the model. To avoid the risk of overfitting bias from the training dataset, we also apply VFIQ to gradient-based perceptual optimization tasks to show its generalization ability.
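For reference, the three correlation metrics can be computed with SciPy as in the short sketch below (`predicted` and `mos` are illustrative array names).

```python
import numpy as np
from scipy import stats

def correlation_metrics(predicted, mos):
    """Return (PLCC, SRCC, KRCC) between predicted quality scores and MOS labels."""
    predicted, mos = np.asarray(predicted), np.asarray(mos)
    plcc, _ = stats.pearsonr(predicted, mos)     # linear correlation
    srcc, _ = stats.spearmanr(predicted, mos)    # monotonic (rank-order) correlation
    krcc, _ = stats.kendalltau(predicted, mos)   # pairwise rank agreement
    return plcc, srcc, krcc
```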

2 Related Work

2.1 FR-IQA Metrics

The FR-IQA algorithms take a pair of two images – one distorted image and one reference image – as the input to measure their perceptual similarity. Traditionally, the error-visibility methods apply a direct distance metric to the pixels or to transformed representations of the images, e.g., the Mean Squared Error (MSE). The MSE has been a standard FR method for assessing signal fidelity and quality for more than half a century, and it still plays a fundamental role in the development of signal and image processing algorithms. Currently, the most popular FR-IQA metrics are PSNR and SSIM, as they are simple and convenient to optimize. Specifically, the SSIM [17] evaluates the similarity of local image structures with correlation measures. On the other hand, more and more recent FR-IQA models are learning-based, which removes the limitations of conventional IQA metrics. These models typically learn a measure from a training set of images and corresponding perceptual distances by using supervised machine learning technologies. However, due to the high dimensionality of the input space (formed by millions of pixels), learning-based models are prone to overfit the limited data.

2.2 Vision Transformer (ViT)

Based upon the self-attention mechanism, the Transformer architecture was first proposed in [16] and achieved remarkable improvements in many tasks of natural language processing. Inspired by the Transformer, the ViT proposed by Dosovitskiy [4] divides the image into patches as the input substitution of word sequences, which becomes the latest popular feature extraction backbone. The ViT has a global receptive field and can capture global information, as it takes patches from the whole image as the input. As a comparison, the conventional CNN backbone concentrates on local features and has difficulties in extracting long-range features as the convolutional process loses some important feature information [8,15].


2.3 FSIMc

The success of SSIM [17] showed that the HVS is adapted to the structural information in images, but the visual information is often redundant for perceptual similarity. In fact, low-level features such as edges and zero crossings provide significant information for the HVS to understand and interpret a scene. As a result, an FR-IQA metric named FSIM was proposed in [20]. This metric assumes that highly informative features can be extracted at points of high phase congruency (PC), where the Fourier waves at different frequencies have congruent phases. Thus, the PC maps are used as the main feature in computing FSIM. The other complementary component of the FSIM metric is the gradient magnitude (GM), which can be calculated with different gradient operators: the Sobel operator, the Prewitt operator, or the Scharr operator. Combining PC and GM makes the FSIM model efficient in evaluating low-level HVS perceptual similarity. We denote by G_x = ∂f(x, y)/∂x and G_y = ∂f(x, y)/∂y the partial derivatives of the image f(x, y) along the horizontal and vertical directions, and by G = sqrt(G_x² + G_y²) the GM of f(x, y). The native FSIM index is designed for gray-scale images or the luminance components of color images. To extend the FSIM algorithm to chrominance information, FSIMc adopts a color conversion from the RGB space to the YIQ space, in which luminance is separated from chrominance. As a result, the final FSIMc can be formulated as

FSIMc = Σ_{x∈Ω} S_L(x) · [S_C(x)]^λ · PC_m(x) / Σ_{x∈Ω} PC_m(x),    (1)

where S_L(x) is the product of S_PC(x) and S_G(x) in the RGB space, and S_C(x) is the product of S_I(x) and S_Q(x) in the YIQ space.
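As an illustration of the GM component, the sketch below computes the Scharr gradient magnitude and an FSIM-style gradient similarity map; the kernel normalization and the stability constant are illustrative choices, not necessarily the exact values of [20].

```python
import numpy as np
from scipy.signal import convolve2d

# Scharr kernels for the horizontal and vertical partial derivatives.
SCHARR_X = np.array([[3, 0, -3], [10, 0, -10], [3, 0, -3]], dtype=float) / 16.0
SCHARR_Y = SCHARR_X.T

def gradient_magnitude(img):
    """G = sqrt(Gx^2 + Gy^2) of a grayscale image."""
    gx = convolve2d(img, SCHARR_X, mode="same", boundary="symm")
    gy = convolve2d(img, SCHARR_Y, mode="same", boundary="symm")
    return np.sqrt(gx ** 2 + gy ** 2)

def gradient_similarity(ref, dist, c=160.0):
    """SSIM-style similarity map between the gradient magnitudes of two images."""
    g1, g2 = gradient_magnitude(ref), gradient_magnitude(dist)
    return (2.0 * g1 * g2 + c) / (g1 ** 2 + g2 ** 2 + c)
```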

3 Model and Methodology

In this section, we introduce the architecture of the ViT-FSIMc Hybrid Siamese Network, abbreviated as VFIQ, proposed in this paper. Our model takes a pair of a distorted image and a reference image as the input and outputs the VFIQ score. As shown in Fig. 2, it consists of three modules – a feature extraction module, an evaluation module, and a hybrid module. In the following, we introduce each module in detail, followed by the computation process.

3.1 Feature Extraction Module

The feature extraction module extracts a high-level semantic vector for comparison. To process the input, this module includes a Siamese neural network which has two branches. For the backbone, we choose the ViT owing to its efficiency



Fig. 2. VFIQ architecture

in extracting high-level semantic representations. The ViT also includes self-attention modules, which enhance its ability to capture long-distance features. The Siamese neural network is constructed from two ViTs that share the same weights to extract the high-level semantic vectors from the pair of images. During the forward pass, the distorted and reference images are fed into the Siamese neural network. To enhance the feature extraction ability, we first take three different feature maps from intermediate layers of the transformer encoder and then fuse these feature vectors in an average pooling layer to output the high-level semantic vector. See the ViT architecture in Fig. 3.

3.2 Evaluation Module

In the evaluation module, we calculate the high-level semantic and the low-level texture similarity scores, respectively. After the ViT extracts the high-level semantic vectors from the patch-wise distorted image and reference image, the similarity (or difference) between the distortion semantic vector and the reference semantic vector can be characterized by their high-dimensional distributions. Assuming they follow Gaussian distributions, we use the Fréchet Inception Distance (FID) to compute the high-level semantic similarity score between the two images. Specifically, let φ denote the feature function constructed in the feature extraction module, and let φ(P_r) and φ(P_g) be two Gaussian random vectors with empirical means μ_r, μ_g and covariances C_r, C_g, respectively. Then the FID between φ(P_r) and φ(P_g) is calculated as

FID(φ(P_r), φ(P_g)) = ‖μ_r − μ_g‖² + Tr(C_r + C_g − 2(C_r C_g)^{1/2}).    (2)

On the other hand, for the low-level detailed texture similarity comparison, we adopt the FSIMc [20] algorithm and the equation in (1) to calculate the score. The original distortion image and reference image are fed into the FSIMc module to produce the result.
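A minimal NumPy/SciPy sketch of the Fréchet distance in Eq. (2) is given below, assuming the extracted features have already been flattened into (n_samples, d) arrays; the array names are illustrative.

```python
import numpy as np
from scipy import linalg

def fid(feats_ref, feats_dist):
    """Fréchet distance between two sets of feature vectors (Eq. 2)."""
    mu_r, mu_g = feats_ref.mean(axis=0), feats_dist.mean(axis=0)
    c_r = np.cov(feats_ref, rowvar=False)
    c_g = np.cov(feats_dist, rowvar=False)
    covmean, _ = linalg.sqrtm(c_r @ c_g, disp=False)
    covmean = covmean.real  # drop tiny imaginary parts caused by numerical error
    return float(np.sum((mu_r - mu_g) ** 2) + np.trace(c_r + c_g - 2.0 * covmean))
```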



Fig. 3. Vision Transformer architecture

3.3 Hybrid Module

In the hybrid module, we apply a simple MLP (2 × 8 × 1), i.e., a small fully-connected network, to predict the final score. The MLP fuses the high-level and low-level scores calculated in the evaluation module and outputs the final score. To train the model, we use the official Mean Opinion Score (MOS) in the dataset as the target label and apply the MSE between the predicted score and the MOS as the loss function. This is therefore standard supervised learning, and the hybrid weights of the two parts are learned accordingly.
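The hybrid module is small enough to sketch in full; the code below shows a 2 × 8 × 1 MLP trained with the MSE loss against MOS labels. The optimizer choice and learning rate here are placeholders, not the paper's settings.

```python
import torch
import torch.nn as nn

hybrid = nn.Sequential(nn.Linear(2, 8), nn.ReLU(), nn.Linear(8, 1))   # the 2 x 8 x 1 MLP
loss_fn = nn.MSELoss()
optimizer = torch.optim.Adam(hybrid.parameters(), lr=1e-3)            # illustrative settings

def train_step(score_high, score_low, mos):
    """One supervised step: fuse the two scores and regress the MOS label."""
    pred = hybrid(torch.stack([score_high, score_low], dim=1)).squeeze(1)
    loss = loss_fn(pred, mos)
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return loss.item()
```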

3.4 Mathematical Formulation

In this section, we formulate the entire computation process of the VFIQ model. Denote by x and y the input distorted image and the reference image, respectively, where x, y ∈ R^{H×W×C}. In the feature extraction module, they are transformed into 224 × 224 and the image patches are fed into the Siamese network. We denote the extraction process as a function φ, and thus φ(x), φ(y) are the high-level semantic feature maps of the distorted and the reference images. Here, φ(x), φ(y) ∈ R^{p×p×3c}. Then, in the evaluation module, the high-level semantic score is computed as

Score_high = FID(φ(x), φ(y)) = ‖μ_y − μ_x‖² + Tr(C_y + C_x − 2(C_y C_x)^{1/2}),    (3)

and the low-level texture score is computed as

Score_low = FSIMc(x, y).    (4)



In the last hybrid module, Score_high and Score_low are fed into the MLP layer, denoted by a function f_MLP. Thus, the final VFIQ score is given by

Score_VFIQ = f_MLP(Score_high, Score_low).    (5)

4 Experiments

4.1 Datasets

In the FR-IQA field, there are three commonly used datasets for evaluating the performance of IQA models: LIVE [14], TID2008 [12], and TID2013 [11]. These datasets cover common distortion types. Besides those datasets, the PIPAL dataset [8], which includes numerous images generated by GANs [6], has recently become popular. For a fair comparison, we use the official models from their original papers and retrain them on the PIPAL dataset. Then we test their performance on the three cross-domain datasets LIVE, TID2008, and TID2013 to compare the generalization ability. Moreover, to quantify the performance of the FR-IQA models, we use the three metrics – PLCC, SRCC, and KRCC – introduced in Sect. 1.

4.2 Implementation Details

The input images are normalized and randomly cropped to 224 × 224, as the ViT we employ is pre-trained on the ImageNet dataset. Both ViT-B/8 and ViT-B/16 are tested as the backbone. In the ViT, to enhance the feature extraction ability, we extract two more intermediate blocks to provide supplementary information. The inner blocks consist of a self-attention module and a feed-forward network (FFN). Hence, the feature maps we extract from the blocks belong to R^{p×p×768}, where p = 14 or 28. In the validation and testing stages, we randomly crop each pair of images twenty times and calculate the final score over the twenty random samples. In addition, we use AdamW as the optimizer and set the initial learning rate to 10^{-4} and the weight decay to 10^{-5}. The final VFIQ results use ViT-B/8 as the backbone. We run the entire experiment on a single NVIDIA GeForce RTX 3090.
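A sketch of the training configuration described above is shown below; the ImageNet normalization statistics are an assumption (the paper only states that the inputs are normalized).

```python
import torch
from torchvision import transforms

# Training-time preprocessing: random 224 x 224 crops of normalized images.
preprocess = transforms.Compose([
    transforms.RandomCrop(224),
    transforms.ToTensor(),
    transforms.Normalize(mean=[0.485, 0.456, 0.406], std=[0.229, 0.224, 0.225]),
])

def make_optimizer(model: torch.nn.Module) -> torch.optim.Optimizer:
    """AdamW with the learning rate and weight decay reported above."""
    return torch.optim.AdamW(model.parameters(), lr=1e-4, weight_decay=1e-5)
```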

4.3 Results

To visualize the prediction scores of different IQA models and compare them against the actual MOS scores, we show the scatter plots in Fig. 4. Compared to the PSNR, SSIM, and AHIQ scatter plots, the proposed VFIQ predictions appear more concentrated around the MOS. This implicitly indicates that VFIQ performs the best. We also assess the performance of our model with PLCC, SRCC, and KRCC. For all of them, as mentioned in Sect. 1, the higher the score, the better. The results are shown in Table 1, Table 2, and Table 3. It can be seen that our model



Fig. 4. Scatter plots of the prediction scores of different IQA models against the MOS score.

outperforms all the others in 7 out of the 9 comparisons (3 datasets × 3 metrics per dataset), and the performance gains on 5 of them are significant. Specifically, on the LIVE dataset, except for ranking 5th out of 10 models on PLCC, our model exceeds the second best by 0.64% in SRCC and 8.1% in KRCC; on the TID2008 dataset, except for ranking 2nd out of 10 models on SRCC, our model exceeds the second best by 2.1% in PLCC and 0.04% in KRCC; on the TID2013 dataset, our model exceeds the second best by 3.5%, 2.7%, and 3.87% in PLCC, SRCC, and KRCC, respectively. Thus, we conclude that our proposed VFIQ model achieves state-of-the-art performance in general.

4.4 Reference Image Recovery

In this section, we investigate a vision task that avoids the risk of over-fitting bias brought by the IQA datasets. In particular, given a reference image x and an initial image y_0, the goal is to recover the image x by numerically optimizing y* = arg min_y D(x, y), where D denotes an FR-IQA measure for which a higher predicted quality gives a lower score. When D is the MSE metric, the solution is clearly y* = x. Most current IQA algorithms are continuous and differentiable, so we can use a gradient-based optimization process to recover the image and estimate the performance of an IQA algorithm by perceptually inspecting the recovered results.
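The recovery procedure can be sketched as a plain gradient-descent loop over the pixels; `metric` stands for any differentiable FR-IQA measure D(x, y), and the step count and learning rate are illustrative.

```python
import torch

def recover(reference, y0, metric, steps=200, lr=0.05):
    """Recover the reference by minimizing D(x, y) with respect to the pixels of y.

    reference, y0: (1, 3, H, W) tensors in [0, 1]; metric(x, y) returns a scalar
    whose lower values mean higher predicted quality.
    """
    y = y0.clone().requires_grad_(True)
    optimizer = torch.optim.Adam([y], lr=lr)
    for _ in range(steps):
        optimizer.zero_grad()
        loss = metric(reference, y)
        loss.backward()
        optimizer.step()
        with torch.no_grad():
            y.clamp_(0.0, 1.0)      # keep the image in a valid range
    return y.detach()
```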


Table 1. Evaluations of different metrics on the LIVE dataset.

Models       | PLCC   | SRCC   | KRCC
PSNR         | 0.7633 | 0.8013 | 0.5964
SSIM [17]    | 0.7369 | 0.8509 | 0.6547
MS-SSIM [18] | 0.679  | 0.9027 | 0.7227
CW-SSIM [5]  | 0.5714 | 0.7681 | 0.5673
FSIMc [20]   | 0.7747 | 0.9204 | 0.7515
LPIPS [21]   | 0.7672 | 0.869  | 0.6768
DISTS [3]    | 0.8392 | 0.9051 | 0.7283
PIEAPP [13]  | 0.8577 | 0.9182 | 0.7491
AHIQ [9]     | 0.8039 | 0.8967 | 0.7066
VFIQ (Ours)  | 0.7732 | 0.9236 | 0.7873

Table 2. Evaluations of different metrics on the TID2008 dataset.

Models       | PLCC   | SRCC   | KRCC
PSNR         | 0.489  | 0.5245 | 0.3696
SSIM [17]    | 0.6003 | 0.6242 | 0.4521
MS-SSIM [18] | 0.7894 | 0.8531 | 0.6555
CW-SSIM [5]  | 0.5965 | 0.6473 | 0.4625
FSIMc [20]   | 0.8341 | 0.884  | 0.6991
LPIPS [21]   | 0.711  | 0.7151 | 0.5221
DISTS [3]    | 0.7032 | 0.6648 | 0.4861
PIEAPP [13]  | 0.6443 | 0.7971 | 0.6089
AHIQ [9]     | 0.6772 | 0.6807 | 0.4842
VFIQ (Ours)  | 0.8512 | 0.8808 | 0.6994

Table 3. Evaluations of different metrics on the TID2013 dataset.

Models       | PLCC   | SRCC   | KRCC
PSNR         | 0.6601 | 0.6869 | 0.4958
SSIM [17]    | 0.6558 | 0.6269 | 0.4550
MS-SSIM [18] | 0.7814 | 0.7852 | 0.6033
CW-SSIM [5]  | 0.5815 | 0.6533 | 0.4715
FSIMc [20]   | 0.8322 | 0.8509 | 0.6665
LPIPS [21]   | 0.7529 | 0.7445 | 0.5477
DISTS [3]    | 0.7538 | 0.7077 | 0.5212
PIEAPP [13]  | 0.7195 | 0.8438 | 0.6571
AHIQ [9]     | 0.7379 | 0.7075 | 0.5127
VFIQ (Ours)  | 0.8613 | 0.8739 | 0.6923



Fig. 5. The performance of different IQA algorithms for recovering the reference image

Following the IQA algorithms used in the last section for comparison, we visually compare the recovered images of PSNR, SSIM [17], MS-SSIM [18], FSIM [20], AHIQ [9], VIF [2], LPIPS [21], DISTS [3], NLPD [10], DeepIQA [1], PieAPP [13], and our VFIQ. The images recovered from a JPEG-compressed version of a reference image are shown in Fig. 5. It can be seen that the recovered images of many IQA algorithms, such as sub-figures (c), (e), (i), (l), and (m), are significantly worse than the initial image, whereas our VFIQ model appears to recover the image efficiently.

5 Ablation of the ViT-FSIMc Architecture

To verify the effectiveness of the ViT and FSIMc [20] hybrid architecture, we ablate them and substitute the backbone with CNN-based neural networks such as Inception-v3, VGG-16, and ResNet50 to compare their performance. Specifically, we first perform experiments using a single ViT with the FID, the single FSIMc, and our proposed VFIQ, respectively. Then we replace the backbone with a CNN-FSIMc hybrid architecture where the CNN is Inception-v3, VGG-16, or ResNet50, respectively. For a fair evaluation, we sample three intermediate feature maps from the CNN backbone, re-train the hybrid module for each backbone, and compute the weighted-average results of 50 predictions. Again, we calculate the PLCC, SRCC, and KRCC



Table 4. The (PLCC, SRCC, KRCC) scores of the ViT, FSIMc, and VFIQ backbones.

Dataset | ViT                      | FSIMc                    | VFIQ (Ours)
LIVE    | (0.6842, 0.7091, 0.6994) | (0.7747, 0.9204, 0.7515) | (0.7732, 0.9236, 0.7573)
TID2008 | (0.6883, 0.7762, 0.5829) | (0.8341, 0.884, 0.6991)  | (0.8512, 0.8808, 0.6994)
TID2013 | (0.7212, 0.7839, 0.5893) | (0.8322, 0.8509, 0.6665) | (0.8613, 0.8739, 0.6993)

Table 5. The (PLCC, SRCC, KRCC) scores of Inception-v3, VGG-16, and ResNet50 combined with FSIMc.

Dataset | Inception-v3 + FSIMc     | VGG-16 + FSIMc           | ResNet50 + FSIMc
LIVE    | (0.7245, 0.8813, 0.7541) | (0.7397, 0.8742, 0.748)  | (0.7918, 0.8905, 0.7873)
TID2008 | (0.7692, 0.8459, 0.6256) | (0.7812, 0.8210, 0.6521) | (0.8041, 0.8491, 0.6873)
TID2013 | (0.7721, 0.8213, 0.6815) | (0.7903, 0.8188, 0.6734) | (0.8012, 0.8348, 0.6948)

scores for the datasets LIVE [14], TID2008 [12], and TID2013 [11] to examine their contributions to the performance. The experimental results are shown in Table 4 and Table 5. Compared to the single ViT and the single FSIMc, Table 4 shows that our proposed VFIQ hybrid architecture achieves the best scores in 6 out of 9 cases (3 datasets × 3 metrics per dataset). Moreover, comparing Table 5 to the VFIQ column in Table 4, we observe that the CNN-FSIMc hybrid architectures perform worse than the VFIQ model, except for the PLCC and KRCC scores on the LIVE dataset only. Therefore, the effectiveness of our proposed VFIQ model is validated.

6 Conclusion

In this paper, we propose a new model called VFIQ, which is a ViT-FSIMc hybrid Siamese network for Full-Reference Image Quality Assessment. This new model leverages the latest Vision Transformer as the backbone to extract the high-level semantic feature and combines the use of a conventional FR-IQA algorithm FSIMc. The VFIQ model outperforms many state-of-the-art IQA models. We also apply our VFIQ model in the task of reference image recovery, which highlights its potential in optimizing low-level vision tasks. Several topics are worthy of further exploration in the future work. For example, during our investigation, we found that the proposed VFIQ model can be used for inferring the potential mode collapse, which is a general and widely known stability problem in GAN. Thus, it would be of interest to develop a methodology of mode collapse inference as an application of VFIQ. In addition, besides the three commonly used datasets for evaluation, one can look for other datasets, particularly those datasets in specific domains, to further evaluate or improve the performance of the VFIQ model.



References 1. Bosse, S., Maniry, D., M¨ uller, K.R., Wiegand, T., Samek, W.: Deep neural networks for no-reference and full-reference image quality assessment. IEEE Trans. Image Process. 27(1), 206–219 (2017) 2. Bovik, A.C.: A visual information fidelity approach to video quality assessment (2005) 3. Ding, K., Ma, K., Wang, S., Simoncelli, E.P.: Image quality assessment: unifying structure and texture similarity. IEEE Trans. Pattern Anal. Mach. Intell. 44(5), 2567–2581 (2022). https://doi.org/10.1109/TPAMI.2020.3045810 4. Dosovitskiy, A., et al.: An image is worth 16 × 16 words: transformers for image recognition at scale. arXiv preprint arXiv:2010.11929 (2020) 5. Gao, Y., Rehman, A., Wang, Z.: CW-SSIM based image classification. In: 2011 18th IEEE International Conference on Image Processing, pp. 1249–1252. IEEE (2011) 6. Goodfellow, I., et al.: Generative adversarial nets. In: Neural Information Processing Systems (2014) 7. Gu, J., Shen, Y., Zhou, B.: Image processing using multi-code GAN prior. In: 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 3009–3018 (2020). https://doi.org/10.1109/CVPR42600.2020.00308 8. Jinjin, G., Haoming, C., Haoyu, C., Xiaoxing, Y., Ren, J.S., Chao, D.: PIPAL: a large-scale image quality assessment dataset for perceptual image restoration. In: Vedaldi, A., Bischof, H., Brox, T., Frahm, J.-M. (eds.) ECCV 2020. LNCS, vol. 12356, pp. 633–651. Springer, Cham (2020). https://doi.org/10.1007/978-3-03058621-8 37 9. Lao, S., et al.: Attentions help CNNs see better: attention-based hybrid image quality assessment network. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 1140–1149 (2022) 10. Laparra, V., Ball´e, J., Berardino, A., Simoncelii, E.P.: Perceptual image quality assessment using a normalized Laplacian pyramid. In: Human Vision and Electronic Imaging 2016, HVEI 2016, pp. 43–48. Society for Imaging Science and Technology (2016) 11. Ponomarenko, N., et al.: Image database TID2013: peculiarities, results and perspectives. Signal Process. Image Commun. 30, 57–77 (2015) 12. Ponomarenko, N., Lukin, V., Zelensky, A., Egiazarian, K., Carli, M., Battisti, F.: TID2008-a database for evaluation of full-reference visual quality assessment metrics. Adv. Mod. Radioelectron. 10(4), 30–45 (2009) 13. Prashnani, E., Cai, H., Mostofi, Y., Sen, P.: PieAPP: perceptual image-error assessment through pairwise preference. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 1808–1817 (2018) 14. Sheikh, H.R., Sabir, M.F., Bovik, A.C.: A statistical evaluation of recent full reference image quality assessment algorithms. IEEE Trans. Image Process. 15(11), 3440–3451 (2006) 15. Shi, S., et al.: Region-adaptive deformable network for image quality assessment. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 324–333 (2021) 16. Vaswani, A., et al.: Attention is all you need. arXiv (2017) 17. Wang, Z., Bovik, A.C., Sheikh, H.R., Simoncelli, E.P.: Image quality assessment: from error visibility to structural similarity. IEEE Trans. Image Process. 13(4), 600–612 (2004)



18. Wang, Z., Simoncelli, E.P., Bovik, A.C.: Multiscale structural similarity for image quality assessment. In: The Thirty-Seventh Asilomar Conference on Signals, Systems & Computers, vol. 2, pp. 1398–1402. IEEE (2003) 19. Xia, W., Yang, Y., Xue, J.H., Wu, B.: TediGAN: text-guided diverse face image generation and manipulation. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 2256–2265 (2021) 20. Zhang, L., Zhang, L., Mou, X., Zhang, D.: FSIM: a feature similarity index for image quality assessment. IEEE Trans. Image Process. 20(8), 2378–2386 (2011) 21. Zhang, R., Isola, P., Efros, A.A., Shechtman, E., Wang, O.: The unreasonable effectiveness of deep features as a perceptual metric. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 586–595 (2018)

Spiking Reinforcement Learning for Weakly-Supervised Anomaly Detection Ao Jin1 , Zhichao Wu1 , Li Zhu1 , Qianchen Xia2(B) , and Xin Yang1 1

Dalian University of Technology, Dalian, China {dllsja,wuzhch}@mail.dlut.edu.cn, {zhuli,xinyang}@dlut.edu.cn 2 Tsinghua University, Beijing, China [email protected]

Abstract. Weakly-supervised Anomaly Detection (AD) has achieved significant performance improvement compared to unsupervised methods by harnessing very little additional labeling information. However, most existing methods ignore anomalies in unlabeled data by simply treating the whole unlabeled set as normal; that is, they fail to resist such noise that may considerably disturb the learning process, and more importantly, they cannot extract key anomaly features from these unlabeled anomalies, which are complementary to those labeled ones. To solve this problem, a spiking reinforcement learning framework for weaklysupervised AD is proposed, named ADSD. Compared with artificial neural networks, the spiking neural network can effectively resist input perturbations due to its unique coding methods and neuronal characteristics. From this point of view, by using spiking neurons with noise filtering and threshold adaptation, as well as a multi-weight evaluation method to discover the most suspicious anomalies in unlabeled data, ADSD achieves end-to-end optimization for the utilization of a few labeled anomaly data and rare unlabeled anomalies in complex environments. The agent in ADSD has robustness and adaptability when exploring potential anomalies in the unknown space. Extensive experiments show that our method ADSD significantly outperforms four popular baselines in various environments while maintaining good robustness and generalization performance. Keywords: Reinforcement Learning · Robustness and Generalization Spiking Neural Network · Weakly-supervised Anomaly Detection

1

·

Introduction

Anomaly Detection (AD), also known as outlier detection, is concerned with identifying rare events that significantly deviate from normal behaviors, such This work was supported in part by National Key Research and Development Program of China (2022ZD0210500), the National Natural Science Foundation of China under Grants U21A20491/62332019/61972067, and the Distinguished Young Scholars Funding of Dalian (No. 2022RJ01). c The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2024  B. Luo et al. (Eds.): ICONIP 2023, LNCS 14451, pp. 175–187, 2024. https://doi.org/10.1007/978-981-99-8073-4_14

176

A. Jin et al.

as network attacks in cybersecurity, equipment malfunctions in industrial processes, and diseases in healthcare [2,3,15,27,30]. Timely identification of anomalies can prevent severe security breaches, reduce maintenance costs, and avoid catastrophic failures. However, since abnormal data is rare and unpredictable, obtaining all anomaly information is almost impossible in real-world scenarios [21]. One of the recent hotspots in research is how to use a small portion of labeled data in addition to large unlabeled data to learn basic patterns of normal behavior and detect anomalies. Compared to unsupervised and fully supervised methods [1], weakly-supervised methods [19,22,25,36] can significantly reduce labeling costs while maintaining high detection performance. Currently, weakly-supervised methods focus on training model parameters using underlying information within labeled anomalies to explore latent anomaly features [23]. Although these methods show good performance in relevant fields, their performance often depends on the quality of a small subset of labeled data. When facing situations where the known abnormal features cannot cover all the data feature characteristics, their performance and generalization ability are significantly downgraded. Moreover, these methods exhibit particularly poor performance when handling noisy and contamination-polluted scene information [31]. These problems hinder the deployment of weakly-supervised methods in practical applications to some extent. To address these issues, some researchers have shifted their attention to Reinforcement Learning (RL) [12]. RL is an emerging learning paradigm, in which an agent interacts with the environment by learning policies to maximize rewards or achieve specific goals. The recently proposed method DPLAN [20] has achieved good performance in multiple anomaly detection tasks with autonomous learning and adaptive capacity. However, DPLAN is still unable to effectively detect anomalies when applied in production scenarios for the following reasons: 1. The distance-based sampling method may result in falling into local optima; 2. The reward setting in the method ignores the positive rewards of the agent actively exploring unknown environment anomalies, which leads to poor performance of the method in complex datasets; 3. We empirically found that DPLAN has weak robustness w.r.t. unlabelled data pollution. Nevertheless, RL has great potential in anomaly detection tasks due to its unique ability to automatically and interactively fit the given anomaly data. In this paper, A Spiking Reinforcement Learning (SRL) framework for weakly-supervised anomaly detection is proposed, which focuses on balancing exploration-exploitation in labeled and unlabeled data space to improve the robustness and cope with the practical anomaly detection problems in highly polluted environments. Given the weak robustness of unlabeled data, the Spiking Neural Network (SNN) is introduced into the field of AD by taking the advantage of spiking neurons in the network, which can mitigate the effects of noise due to their collective effects and architectural connectivity [14]. To address the sampling problem, the sampling method based on multiple angles is designed, which combines two unsupervised methods with cosine sim-

Spiking Reinforcement Learning for Weakly-Supervised Anomaly Detection

177

ilarity metric to enhance sampling diversity to approximate the global optimal solution. In addition, the reward function is improved to encourage the agent to actively explore the unlabeled space for discovering potential anomalies, thereby improving the detection performance. Our contributions can be summarized as follows: 1. A novel SRL framework for tabular anomaly detection is proposed to improve adaptability to unusual data tasks with complex structures, which is the first attempt for our knowledge in anomaly detection. 2. An anomaly-biased simulated environment is improved to increase the diversity and accuracy of the exploration space, which provides important support for exploring unknown anomalies in complex scenes. 3. The extensive experiments over benchmark datasets are conducted in various scenarios with different levels of pollution coverage. Our experimental results show that our proposed method outperforms previous state-of-the-art methods in terms of not only accuracy, but also adapting to complex unknown scenarios and possessing excellent generalization and stability.

2 2.1

Related Work Weakly-Supervised Tabular Anomaly Detection

Tabular anomaly detection has been widely studied in machine learning, but most of the existing methods assume the data is fully-labeled or fully-unlabeled, which is a strong assumption and limits their practical applicability. Weaklysupervised anomaly detection has emerged as an alternative approach to address this challenge. In this setting, only a limited number of anomalies are labeled while the majority of data are unlabeled. Early attempts in this research line leverage PU-learning, self-training, and active learning as their main methodology. With the burgeoning of deep learning, deep methods (e.g., DevNet [22], Deep SAD [23], FeaWAD [36], and DPLAN [20]) are proposed by the community, showing considerable performance leaps over these techniques. Despite these advancements, there is still much to be done to further improve the effectiveness of deep weakly supervised anomaly detection in the tabular setting. Specifically, developing new methods that can effectively handle the ambiguity and uncertainty of unlabeled data space remains an area of active research. 2.2

Reinforcement Learning

RL is a type of machine learning method in which an agent learns to interact with an environment to achieve a specific goal through trial-and-error experience. The agent learns to make decisions by receiving feedback in the form of rewards or punishments in response to its actions. This feedback enables the agent to learn from its mistakes and improve its decision-making abilities over time. RL

178

A. Jin et al.

has been successfully applied in a wide range of fields, such as gaming, finance, and robotics. DQN [10,18,28] is a classic deep reinforcement learning method that combines deep learning and Q-learning and uses a neural network to estimate Qvalues. By integrating the DQN algorithm, DPLAN [20] introduces reinforcement learning to tabular anomaly detection tasks, actively exploring unknown anomalies and resulting in joint optimization of both known and unknown anomaly detection. 2.3

Spiking Neural Network

SNN has garnered considerable attention recently due to its potential to better model biological neural networks and improve robustness compared to ANN [7]. It has been extensively employed in various fields [6,33,35]. The robustness and the low-power-consumption characteristic of SNN have found diverse applications in combination with RL, such as in robots and network architecture searches [13,26,32,34]. However, in the anomaly detection domain, the existing methods only focus on time-series AD [11] and have not incorporated SRL to develop anomaly detection methods for tabular data.

3 3.1

The Proposed Approach Problem Statement

The proposed ADSD method focuses on tabular data and integrates a smaller set of labeled samples with a larger set of unlabeled samples to enhance the detection of anomalies in complex real-world scenarios. Our objective is to develop an anomaly score function that is fueled by a small fraction of supervised signals and a large proportion of unsupervised signals. Specifically, the dataset D is defined into two types Da and Du (with Da ∩ Du = ∅) from various scenarios in a unified manner, where Da stores the smallscale anomalous datasets composed of N anomalous instances, and Du comprises large-scale unlabeled datasets, primarily consisting of normal instances, with a minor fraction of unlabeled anomalous instances. Such a setup fits the actual situation and it is expected that the proposed method fully learns the exceptional features in Da and explores anomalies as much as possible in the space of Du . Anomaly score detection function £ is defined as D → R, where 0 ≤ £(si ) < £(sj ) ≤ 1, si , sj ∈ D, sj represents abnormal data, and si represents normal data. 3.2

Algorithm Framework

In this section, we introduce our method named Anomaly Detection via Spiking Deep Q-learning (ADSD) in detail. As shown in Fig. 1, our SRL-based anomaly detection approach comprises the following key modules: an SRL-based agent A,

Spiking Reinforcement Learning for Weakly-Supervised Anomaly Detection

179

and an anomaly-biased simulation environment E. The simulation environment E encompasses an observation sampling function G and a reward function rt . The agent A interacts with the simulation environment E using a combined reward derived from the rt functions, to jointly learn from Da and Du . The proposed ADSD framework can be described as follow: • Observation space consists of entire training datasets D and provides the current state s that the agent requires for each decision. • Action space is defined to be {a0 , a1 }, a0 , and a1 represent the binary classification labels of the agent for the current state, indicating normal and abnormal, respectively. • SRL-based agent A is based on SNN, which aims to find the optimal execution actions under multiple environmental settings to maximize the missionspecific reward. • Environment E includes the initial state, the state transition after an action is performed, the reward function, and the agent’s policy. To enable agents to effectively explore unlabeled anomalies in Du , we design a new multi-angle anomaly-biased sampling function in this part. 3.3

3.3 SRL-Based Agent

Our target is to transform the anomaly detection problem into a reinforcement learning task. In this work, the construction of the Q-network relies on an SNN, with the Q-function and loss function consistent with the classical DQN algorithm, which are presented as:

Q^*(s, a) = \max_{\pi} \mathbb{E}\left[ r_t + \gamma r_{t+1} + \gamma^2 r_{t+2} + \cdots \mid s_t = s, a_t = a, \pi \right]   (1)

L_i(\theta_i) = \mathbb{E}_{s,a \sim \rho(\cdot);\, s' \sim E}\left[ \left( r + \gamma \max_{a'} Q(s', a'; \theta_{i-1}) - Q(s, a; \theta_i) \right)^2 \right]   (2)

where Q^*(s, a) is obtained by following the optimal policy π after taking action a in state s, r_t represents the immediate reward obtained at time step t, and γ is the discount factor (0 ≤ γ < 1). The loss function evaluates the performance of the current Q-network parameters θ_i. Here, ρ(·) is the state-action distribution in the experience replay buffer, and E is the experience replay buffer of the environment. The goal of this objective is to gradually drive the Q-network toward the optimal Q-function during training and ultimately lead to optimal decision-making.

The integrate-and-fire (IF) [5] and leaky integrate-and-fire (LIF) spiking neurons [8] are chosen for the SNN. The IF neuron is used as the encoder: it receives signal input from other neurons and, once the accumulated input exceeds the threshold, produces a single output spike. Its mathematical model can be expressed as:

\sum_{j} w_j x_j \geq \theta   (3)

Fig. 1. The framework of the proposed ADSD method. The optimal action is determined by the combined evaluation of the sampling function and the reward function, with the former providing diverse and anomalous samples and the latter assigning appropriate rewards.

where w_j represents the weight of the input signal, x_j represents the magnitude of the input signal, and θ represents the threshold. In addition, the LIF neuron is used as the decision-maker. When a LIF neuron receives spikes from other neurons, the spikes are scaled according to the learned synaptic weights, depolarization is achieved by summing over all the scaled spikes, and a decay over time drives the membrane potential back toward hyperpolarization. The formal definition of the LIF neuron can be given as:

\tau_m \frac{dV}{dt} = -(V - E_L) + I(t) R_m, \qquad V_{th} = \infty   (4)

where V represents the membrane voltage, E_L is the resting potential, I(t) is the input current received by the neuron, R_m is the membrane resistance, and τ_m is the membrane time constant. In particular, V_th is set to infinity so that the neuron never generates a spike output, which ensures that the output of the Q-function is a continuous value.
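The two neuron models can be illustrated with a short NumPy sketch of Eq. (3) and an Euler discretization of Eq. (4); the parameter values and function names here are assumptions for illustration only.

```python
# Illustrative NumPy sketch of the IF encoder (Eq. 3) and the non-firing LIF
# readout (Eq. 4, V_th = infinity); parameter values are assumptions.
import numpy as np

def if_encoder(x, w, theta=1.0):
    """IF neuron: emit a spike (1) when the weighted input sum reaches theta."""
    return float(np.dot(w, x) >= theta)

def lif_readout(spike_train, w, tau_m=10.0, E_L=0.0, R_m=1.0, dt=1.0):
    """LIF neuron with V_th = inf: integrates weighted spikes without firing,
    so the final membrane potential can serve as a continuous Q-value."""
    V = E_L
    for s in spike_train:                  # s: vector of input spikes at one time step
        I = np.dot(w, s)                   # synaptic current from the scaled spikes
        dV = (-(V - E_L) + I * R_m) / tau_m
        V = V + dV * dt                    # Euler step of tau_m dV/dt = -(V - E_L) + I R_m
    return V
```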

3.4 Anomaly-Biased Environment

This environment can be divided into the following two parts:

• Multi-perspective sampling function, composed of two unsupervised anomaly score functions and one distance-based function, which is used to achieve a balanced exploitation and exploration of the environment.
• Composite reward function, which yields a reward based on the agent's performance in detecting known and unknown anomalies.

Multi-perspective Sampling Function. In general, distance-based sampling methods can only find the most obvious anomalies, so they tend to fall into local optima. To mitigate this problem, a multi-angle function representing the potential of a group of outliers is used for sampling, which is of great significance for exploring the anomaly space. The sampling functions are defined as:

s_{t+1} = \begin{cases} \phi(s_t) & \text{with probability } p \\ \psi(s_t) & \text{with probability } 1 - p \end{cases}   (5)

where φ(s_t) is a function that randomly samples from Da and ψ(s_t) is an anomaly-biased function that searches for potential anomalous states in Du. To balance the exploitation and exploration of the environment, p = 0.5 is set by default in this paper.

\mathrm{score}_{dis}(s_t) = \begin{cases} 1 - \mathrm{norm}_{cos\_sim}(s, s_t) & \text{if } a_t = a_1 \\ \mathrm{norm}_{cos\_sim}(s, s_t) & \text{if } a_t = a_0 \end{cases}   (6)

S_{dis} = \{ i_{1:5} \} \;\text{where}\; \mathrm{score}_{dis}(i_1) \geq \cdots \geq \mathrm{score}_{dis}(i_5)   (7)

S_{LOF} = \{ j_{1:5} \} \;\text{where}\; \mathrm{score}_{LOF}(j_1) \geq \cdots \geq \mathrm{score}_{LOF}(j_5)   (8)

S_{iForest} = \{ k_{1:5} \} \;\text{where}\; \mathrm{score}_{iForest}(k_1) \geq \cdots \geq \mathrm{score}_{iForest}(k_5)   (9)

S_{INT} = S_{dis} \cap S_{LOF} \cap S_{iForest}   (10)

\psi(s_t) = \begin{cases} \arg\max_n \big( \mathrm{score}_{dis}(n) + \mathrm{score}_{LOF}(n) + \mathrm{score}_{iForest}(n) \big) & \text{if } S_{INT} \neq \emptyset \\ \arg\max_n \mathrm{score}_{dis}(n) & \text{otherwise} \end{cases}   (11)

The combination of three scores, namely score_dis based on the normalized cosine-similarity function norm_cos_sim, score_LOF based on LOF [4], and score_iForest based on the iForest algorithm, is used to comprehensively measure potential anomalies; each score is normalized as a preprocessing step. For each algorithm, the top five exceptional states (i, j, k) and their corresponding score sets (S_dis, S_LOF, S_iForest) are computed. These sets are then intersected. If the intersection is not empty, the state with the largest sum of anomaly scores is selected as s_{t+1}; otherwise, the state with the highest distance-based anomaly score is selected as s_{t+1}. This design helps the agent to visit as many types of anomalies as possible in the state space.
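The sampling function ψ of Eqs. (6)-(11) can be sketched as follows, using scikit-learn's LOF and Isolation Forest implementations and min-max normalization as one plausible realization; the helper names and the neighborhood size are assumptions.

```python
# Minimal sketch of the multi-perspective sampling function psi (Eqs. 6-11);
# normalization details and helper names are illustrative assumptions.
import numpy as np
from sklearn.ensemble import IsolationForest
from sklearn.neighbors import LocalOutlierFactor
from sklearn.metrics.pairwise import cosine_similarity

def minmax(v):
    return (v - v.min()) / (v.max() - v.min() + 1e-12)

def psi(s_t, a_t, D_u):
    # distance-based score (Eq. 6): flip the direction depending on the last action
    cos = cosine_similarity(s_t.reshape(1, -1), D_u).ravel()
    score_dis = minmax(1.0 - cos if a_t == 1 else cos)

    # LOF and iForest scores (signs flipped so that larger = more anomalous)
    lof = LocalOutlierFactor(n_neighbors=20).fit(D_u)
    score_lof = minmax(-lof.negative_outlier_factor_)
    score_if = minmax(-IsolationForest(random_state=0).fit(D_u).score_samples(D_u))

    # top-5 candidates per view (Eqs. 7-9) and their intersection (Eq. 10)
    top = lambda s: set(np.argsort(s)[-5:])
    S_int = top(score_dis) & top(score_lof) & top(score_if)

    if S_int:                                   # Eq. 11, first case
        n = max(S_int, key=lambda i: score_dis[i] + score_lof[i] + score_if[i])
    else:                                       # Eq. 11, fallback case
        n = int(np.argmax(score_dis))
    return D_u[n]
```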


Composite Reward Function. The reward function considers both external and intrinsic rewards. The external reward is designed for the labeled dataset Da, while the intrinsic reward is designed for the unlabeled dataset Du. Specifically, for the intrinsic reward, the unsupervised Copula-Based Outlier Detection (COPOD) algorithm [16], inspired by the curiosity mechanism [24], is introduced to guide the agent in exploring the environment; COPOD is used here because it is adaptable and robust to various data types and distributions. Our reward function is defined as:

r_t = \begin{cases} 1 & \text{if } a_t = a_1 \text{ and } s_t \in D_a \\ 1 & \text{if } a_t = a_1,\; s_t \in D_u,\; \text{and } \mathrm{score}(s_t) > \text{threshold} \\ 0.2 & \text{if } a_t = a_0 \text{ and } s_t \in D_u \\ -1 & \text{otherwise,} \end{cases}   (12)

where the external reward assigned to each action depends on the current state and action. If the agent chooses action a_1 at step t and the state is in Da, it receives a large reward, indicating that the agent correctly identified a known anomaly. For the intrinsic reward, the threshold adjusts the influence of the unsupervised method in Du. Here, score(s_t) denotes the anomaly score of the current state s_t, and the threshold is typically adjusted dynamically based on the score(·) function. If the anomaly score exceeds the threshold, the current state s_t is likely a potential anomaly, and the agent receives a large reward when it flags it correctly. Conversely, if the anomaly score is below the threshold, the current state s_t is considered normal, and the agent receives a small reward to encourage exploratory behavior.
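A sketch of the composite reward of Eq. (12) is given below; the COPOD detector from the PyOD library is used as one possible implementation of score(·), and the quantile-based threshold rule is an assumption.

```python
# Sketch of the composite reward in Eq. (12); PyOD's COPOD serves as one possible
# choice of score(s_t), and the quantile-based threshold is an assumption.
import numpy as np
from pyod.models.copod import COPOD

class CompositeReward:
    def __init__(self, D_u, quantile=0.95):
        self.copod = COPOD().fit(D_u)
        scores = self.copod.decision_function(D_u)
        self.threshold = np.quantile(scores, quantile)   # dynamically chosen threshold

    def __call__(self, s_t, a_t, in_D_a):
        if a_t == 1 and in_D_a:                       # known anomaly correctly flagged
            return 1.0
        score = self.copod.decision_function(s_t.reshape(1, -1))[0]
        if a_t == 1 and score > self.threshold:       # likely unknown anomaly
            return 1.0
        if a_t == 0 and not in_D_a:                   # treated as normal: small reward
            return 0.2
        return -1.0
```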

4 Experiments

In this section, the proposed ADSD method is compared with four state-of-the-art anomaly detectors: DPLAN [20], FeaWAD [36], DevNet [22], and iForest [17]. Six real-world datasets are used in our experiments. Five datasets (fault, InternetAds, landsat, skin, and Waveform) are from ADBench [9], and DMC is a chemical industry dataset containing 38 features including temperature, pressure, flow rate, etc. Following [20,29], we employ two evaluation metrics, the area under the ROC curve (AUC-ROC) and the area under the PR curve (AUC-PR), to impartially measure the ranking quality of the anomaly scores produced by different detectors. Considering the weakly-supervised learning paradigm, we assume the number of labeled anomalies is 60. As anomalies are rare events in practical applications, the contamination rate is first fixed to 2% in Sect. 4.1, which illustrates the detection performance. We further investigate the robustness w.r.t. different contamination ratios in Sect. 4.2, in which a wide range of contamination levels is considered.
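For reference, both metrics can be computed directly from anomaly scores with scikit-learn, using average precision as the usual estimator of the area under the PR curve:

```python
# Sketch of the two evaluation metrics; y_true marks anomalies as 1 and
# anomaly_scores can come from any detector.
from sklearn.metrics import roc_auc_score, average_precision_score

def evaluate(y_true, anomaly_scores):
    auc_roc = roc_auc_score(y_true, anomaly_scores)           # area under the ROC curve
    auc_pr = average_precision_score(y_true, anomaly_scores)  # area under the PR curve
    return auc_roc, auc_pr
```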

4.1 Overall Performance

According to the results in Table 1 and Table 2, the performance of the proposed ADSD algorithm and the four competing algorithms is analyzed, and the following conclusions are drawn:

– ADSD shows the best AUC-PR performance on five datasets (all except Waveform) and the best AUC-ROC performance on five datasets (all except skin, where it is still the second-best performer). The results demonstrate that the SNN-based agent is capable of effectively utilizing known anomalies while retaining the ability to explore an unlabeled environment, thereby discovering potential anomalies even when only a minimal amount of labeled data is available.
– For the DMC dataset, the known anomalies may not cover all anomaly types, so unknown anomalies remain in the data; therefore the DPLAN and ADSD algorithms, which have active exploration capabilities, performed better.

Table 1. AUC-ROC performance of the proposed ADSD and four competing methods

Dataset     | iForest | DevNet | FeaWAD | DPLAN  | ADSD (ours)
fault       | 0.6266  | 0.5365 | 0.5070 | 0.6092 | 0.6828
InternetAds | 0.6193  | 0.7689 | 0.7596 | 0.7025 | 0.8114
landsat     | 0.7428  | 0.5946 | 0.4770 | 0.5941 | 0.7730
skin        | 0.8623  | 0.8393 | 0.7853 | 0.9572 | 0.9222
Waveform    | 0.6785  | 0.6423 | 0.6094 | 0.8345 | 0.9491
DMC         | 0.7406  | 0.7282 | 0.8929 | 0.9134 | 0.9334

Table 2. AUC-PR performance of the proposed ADSD and four competing methods

Dataset     | iForest | DevNet | FeaWAD | DPLAN  | ADSD (ours)
fault       | 0.4079  | 0.3343 | 0.3826 | 0.4370 | 0.5887
InternetAds | 0.3817  | 0.4294 | 0.4856 | 0.6026 | 0.6486
landsat     | 0.3082  | 0.1897 | 0.4433 | 0.3019 | 0.4783
skin        | 0.4325  | 0.4075 | 0.3674 | 0.5977 | 0.6080
Waveform    | 0.0489  | 0.0377 | 0.0344 | 0.5255 | 0.4544
DMC         | 0.4660  | 0.6031 | 0.6116 | 0.6829 | 0.8401

4.2 Anomaly Pollution Test

We designed experiments to verify the pollution robustness and adaptability of the methods mentioned above. Starting from the original 2% pollution, the impact of six pollution rates ranging from 2% to 12% on each algorithm is assessed. The results are shown in Fig. 2.
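A simple way to construct such polluted unlabeled pools is sketched below; the sampling scheme and function name are illustrative assumptions.

```python
# Illustrative helper for the pollution test: contaminate the unlabeled pool with
# a given fraction of hidden anomalies (names and sampling scheme are assumptions).
import numpy as np

def contaminate(X_normal, X_anomalies, rate=0.02, seed=0):
    rng = np.random.default_rng(seed)
    # choose n_anom so that anomalies make up `rate` of the final pool
    n_anom = int(rate * len(X_normal) / (1.0 - rate))
    picked = X_anomalies[rng.choice(len(X_anomalies), size=n_anom, replace=True)]
    return np.vstack([X_normal, picked])           # unlabeled pool D_u with pollution
```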

Fig. 2. Performance of methods under different pollution rate settings.

The following three remarks can be made from the results. (i) Similar to the results in Table 1 and Table 2, ADSD maintains good performance under different pollution settings, which indicates that the spiking neural network used in ADSD provides a degree of robustness for anomaly detection tasks. (ii) Compared with DPLAN, the sampling mode of ADSD avoids the dilemma of local optima and can therefore better explore the state space to obtain good performance. (iii) The majority of the compared methods exhibit clear fluctuations on fault and InternetAds. This can be attributed to the insufficient amount of data in both datasets, leading to overfitting and unstable model performance.

4.3 Ablation Study

The effect of each part of our model is investigated further. DPLAN is used as the comparison base (named INIT), and we then examine the effects of the neural network type and the environment improvements on the accuracy of the algorithm through two experiments, EXP1 and EXP2. EXP1 measures the performance change after converting the ANN to an SNN, and EXP2 measures the performance change after improving the sampling function and reward function. Detailed results can be found in Table 3.


Table 3. AUC-ROC performance of the ablation study

Dataset     | INIT   | EXP1   | EXP2
fault       | 0.6092 | 0.6193 | 0.5419
InternetAds | 0.7025 | 0.7339 | 0.7768
landsat     | 0.5941 | 0.6350 | 0.5979
Waveform    | 0.8345 | 0.8917 | 0.9453
DMC         | 0.9134 | 0.9150 | 0.9298

As can be seen from Table 3, on the vast majority of datasets EXP1 and EXP2 achieve higher AUC with limited labeled anomaly data, indicating the effectiveness of both contributions of our algorithm. However, on the fault dataset our improvement EXP2 did not perform well, which may be because INIT's Euclidean-distance-based sampling function is better at detecting anomalies in environments with limited data and low data dimensionality.

5 Conclusions

This paper proposed ADSD, a novel spiking reinforcement learning framework for weakly-supervised anomaly detection. The AD agent guided by SRL greatly enhances the robustness of the model when exploring a contaminated environment, while preserving the autonomous learning ability and adaptivity of the agent. In complex scenarios, the ability to recognize anomalies is continuously improved through a reward mechanism driven by unsupervised information and a few labeled anomalies. Furthermore, the multi-angle environmental sampling method significantly enhances the performance of ADSD in finding potential anomalies in unlabeled datasets. Experiments under different settings show that ADSD achieves significant performance improvements over existing methods, with particular advantages when handling unknown anomaly types.

References

1. Aggarwal, C.C.: An Introduction to Outlier Analysis. In: Outlier Analysis, pp. 1–34. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-47578-3_1 2. Asrori, S.S., Wang, L., Ozawa, S.: Permissioned blockchain-based XGBoost for multi banks fraud detection. In: Tanveer, M., Agarwal, S., Ozawa, S., Ekbal, A., Jatowt, A. (eds.) Neural Information Processing: 29th International Conference, ICONIP 2022, Virtual Event, 22–26 November 2022, Proceedings, Part III. LNCS, vol. 13625, pp. 683–692. Springer, Cham (2023). https://doi.org/10.1007/978-3031-30111-7_57 3. Bhuyan, M.H., Bhattacharyya, D.K., Kalita, J.K.: Network anomaly detection: methods, systems and tools. IEEE Commun. Surv. Tutorials 16(1), 303–336 (2013)


4. Breunig, M.M., Kriegel, H.P., Ng, R.T., Sander, J.: LOF: identifying density-based local outliers. In: Proceedings of the 2000 ACM SIGMOD International Conference on Management of Data, pp. 93–104 (2000) 5. Burkitt, A.N.: A review of the integrate-and-fire neuron model: I. homogeneous synaptic input. Biol. Cybern. 95, 1–19 (2006) 6. Ding, J., et al.: Biologically inspired dynamic thresholds for spiking neural networks. Adv. Neural. Inf. Process. Syst. 35, 6090–6103 (2022) 7. Friedrich, J., Urbanczik, R., Senn, W.: Spatio-temporal credit assignment in neuronal population learning. PLoS Comput. Biol. 7(6), e1002092 (2011) 8. Gerstner, W., Kistler, W.M., Naud, R., Paninski, L.: Neuronal Dynamics: From Single Neurons to Networks and Models of Cognition. Cambridge University Press, Cambridge (2014) 9. Han, S., Hu, X., Huang, H., Jiang, M., Zhao, Y.: ADBench: anomaly detection benchmark. Adv. Neural. Inf. Process. Syst. 35, 32142–32159 (2022) 10. Hessel, M., et al.: Rainbow: combining improvements in deep reinforcement learning. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 32 (2018) 11. Hu, L., Liu, Y., Qiu, W.: A deep spiking neural network anomaly detection method. Comput. Intell. Neurosci. CIN 2022, 6391750 (2022) 12. Kaelbling, L.P., Littman, M.L., Moore, A.W.: An introduction to reinforcement learning. In: The Biology and Technology of Intelligent Autonomous Agents, pp. 90–127 (1995) 13. Kim, Y., Li, Y., Park, H., Venkatesha, Y., Panda, P.: Neural architecture search for spiking neural networks. In: Avidan, S., Brostow, G., Cissé, M., Farinella, G.M., Hassner, T. (eds.) Computer Vision-ECCV 2022: 17th European Conference, Tel Aviv, Israel, 23–27 October 2022, Proceedings, Part XXIV. LNCS, vol. 13684, pp. 36–56. Springer, Cham (2022). https://doi.org/10.1007/978-3-031-20053-3_3 14. Kozma, R., Pino, R.E., Pazienza, G.E.: Advances in Neuromorphic Memristor Science and Applications, vol. 4. Springer, Cham (2012). https://doi.org/10.1007/ 978-94-007-4491-2 15. Lavin, A., Ahmad, S.: Evaluating real-time anomaly detection algorithms-the Numenta anomaly benchmark. In: 2015 IEEE 14th International Conference on Machine Learning and Applications (ICMLA), pp. 38–44. IEEE (2015) 16. Li, Z., Zhao, Y., Botta, N., Ionescu, C., Hu, X.: COPOD: copula-based outlier detection. In: 2020 IEEE International Conference on Data Mining (ICDM), pp. 1118–1123. IEEE (2020) 17. Liu, F.T., Ting, K.M., Zhou, Z.H.: Isolation forest. In: 2008 Eighth IEEE International Conference on Data Mining, pp. 413–422. IEEE (2008) 18. Mnih, V., et al.: Human-level control through deep reinforcement learning. Nature 518(7540), 529–533 (2015) 19. Pang, G., Cao, L., Chen, L., Liu, H.: Learning representations of ultrahighdimensional data for random distance-based outlier detection. In: Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp. 2041–2050 (2018) 20. Pang, G., van den Hengel, A., Shen, C., Cao, L.: Toward deep supervised anomaly detection: reinforcement learning from partially labeled anomaly data. In: Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining, pp. 1298–1308 (2021) 21. Pang, G., Shen, C., Cao, L., Hengel, A.V.D.: Deep learning for anomaly detection: a review. ACM Comput. Surv. (CSUR) 54(2), 1–38 (2021)


22. Pang, G., Shen, C., van den Hengel, A.: Deep anomaly detection with deviation networks. In: Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp. 353–362 (2019) 23. Ruff, L., et al.: Deep semi-supervised anomaly detection. arXiv preprint arXiv:1906.02694 (2019) 24. Savinov, N., et al.: Episodic curiosity through reachability. arXiv preprint arXiv:1810.02274 (2018) 25. Tamersoy, A., Roundy, K., Chau, D.H.: Guilt by association: large scale malware detection by mining file-relation graphs. In: Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 1524–1533 (2014) 26. Tang, G., Shah, A., Michmizos, K.P.: Spiking neural network on neuromorphic hardware for energy-efficient unidimensional SLAM. In: 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pp. 4176–4181. IEEE (2019) 27. Ukil, A., Bandyoapdhyay, S., Puri, C., Pal, A.: IoT healthcare analytics: the importance of anomaly detection. In: 2016 IEEE 30th International Conference on Advanced Information Networking and Applications (AINA), pp. 994–997. IEEE (2016) 28. Van Hasselt, H., Guez, A., Silver, D.: Deep reinforcement learning with double Q-learning. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 30 (2016) 29. Xu, H., Pang, G., Wang, Y., Wang, Y.: Deep isolation forest for anomaly detection. IEEE Trans. Knowl. Data Eng. 1–14 (2023). https://doi.org/10.1109/TKDE.2023. 3270293 30. Xu, H., Wang, Y., Wei, J., Jian, S., Li, Y., Liu, N.: Fascinating supervisory signals and where to find them: deep anomaly detection with scale learning. In: Proceedings of the International Conference on Machine Learning (2023) 31. Yoon, J., Sohn, K., Li, C.L., Arik, S.O., Lee, C.Y., Pfister, T.: Self-trained one-class classification for unsupervised anomaly detection. arXiv e-prints pp. arXiv-2106 (2021) 32. Zhang, D., Zhang, T., Jia, S., Xu, B.: Multi-scale dynamic coding improved spiking actor network for reinforcement learning. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 36, pp. 59–67 (2022) 33. Zhang, H., et al.: In the blink of an eye: event-based emotion recognition. In: ACM SIGGRAPH 2023 Conference Proceedings, pp. 1–11 (2023) 34. Zhang, J., et al.: Spiking transformers for event-based single object tracking. In: Proceedings of the IEEE/CVF conference on Computer Vision and Pattern Recognition, pp. 8801–8810 (2022) 35. Zhang, J., et al.: Frame-event alignment and fusion network for high frame rate tracking. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 9781–9790, June 2023 36. Zhou, Y., Song, X., Zhang, Y., Liu, F., Zhu, C., Liu, L.: Feature encoding with autoencoders for weakly supervised anomaly detection. IEEE Trans. Neural Networks Learn. Syst. 33(6), 2454–2465 (2021)

Resource-Aware DNN Partitioning for Privacy-Sensitive Edge-Cloud Systems

Aolin Ding1, Amin Hass1, Matthew Chan2, Nader Sehatbakhsh3, and Saman Zonouz4(B)

1 Security R&D, Accenture Labs, Accenture, Washington, DC, USA
2 Rutgers University, New Brunswick, NJ, USA
3 University of California, Los Angeles (UCLA), Los Angeles, CA, USA
4 Georgia Institute of Technology, Atlanta, GA, USA
[email protected]

Abstract. With recent advances in deep neural networks (DNNs), there is a significant increase in IoT applications leveraging AI with edge-cloud infrastructures. Nevertheless, deploying large DNN models on resource-constrained edge devices is still challenging due to limitations in computation, power, and application-specific privacy requirements. Existing model partitioning methods, which deploy a partial DNN on an edge device while processing the remaining portion of the DNN on the cloud, mainly emphasize communication and power efficiency. However, DNN partitioning based on the privacy requirements and resource budgets of edge devices has not been sufficiently explored in the literature. In this paper, we propose awareSL, a model partitioning framework that splits DNN models based on the computational resources available on edge devices, preserving the privacy of input samples while maintaining high accuracy. In our evaluation of multiple DNN architectures, awareSL effectively identifies the split points that adapt to resource budgets of edge devices. Meanwhile, we demonstrate the privacy-preserving capability of awareSL against existing input reconstruction attacks without sacrificing inference accuracy in image classification tasks.

Keywords: Edge Computing · Privacy-Preserving Machine Learning · Data Privacy · Split Learning

1 Introduction

Coupled with recent advances in deep learning (DL), a noticeable trend is that machine learning (ML) models used for robotic vehicles [4] and edge computing applications [19] have become increasingly complex with the growing size of model parameters. As a result, it is challenging to deploy complex DNNs on edge infrastructures due to their restricted computational capacity and limited resources, as illustrated in Fig. 1 (a). One alternative is to deploy the entire DNN model on a cloud server, to which edge devices can send queries for inference. However, as shown in Fig. 1 (b), transferring raw data to the cloud server poses privacy concerns for input samples in privacy-sensitive scenarios.

(Supported by National Science Foundation (NSF), Accenture, and Department of Energy (DoE) Award DE-OE0000780, Cyber Resilient Energy Delivery Consortium.)

Fig. 1. The challenges of the DNN deployment for an edge-cloud system.

Recently, model partitioning techniques [3,9,11,12,16,17] such as split learning (SL) have been proposed to resolve the resource constraints in edge deployment. Specifically, SL approaches [2,5,14,21] split one complete DNN model into two sub-networks, uploading and training them correspondingly on the edge and cloud sides. Prior works primarily focus on reducing communication costs [11,16] and power consumption [9], thus partially addressing the resource constraints, but comparatively little attention has been given to the imperative of privacy preservation. SL methods are conventionally believed to provide private inference of a DNN in untrusted scenarios [6], since they do not require the two entities to share model structures or raw training data. However, recent works [7,13,22] have reported that existing SL frameworks are not as effective as intended. As shown in Fig. 1 (c), attackers are still able to recover sensitive data such as raw inputs, since prior privacy metrics [1,7,10,20] only focus on the input data in the inference process. Therefore, attaining a better privacy-utility balance under computational constraints in both the training and inference stages is needed to securely deploy complex DNNs on edge devices.

In this paper, we present awareSL, a resource-aware DNN partitioning framework that offloads computation from edge devices while preserving input data privacy. awareSL investigates the resource limitations and privacy requirements of the target edge platform to determine the split point for a DNN model, and thus fulfills the computational and privacy needs without loss of accuracy. We achieve this by designing an adaptive DNN partitioning strategy and integrating a distance correlation-based privacy metric into the model training. Compared to the conventional distance correlation work [20], our work can be easily extended to different types of neural network models, including both sequential and non-sequential architectures. We evaluate awareSL with different DNN models on multiple image datasets.


The results show that awareSL can effectively calculate the resource requirements and minimize the privacy leakage for each partitioning strategy of a DNN model. We also conduct experiments to demonstrate that awareSL effectively protects the inference data against input reconstruction attacks. The contributions of this paper are as follows:

– We present a resource-aware DNN partitioning framework that adaptively splits DNN models based on the resource budgets of edge devices.
– We utilize a quantitative privacy metric in training to minimize the privacy leakage of input samples without considerable accuracy reduction.
– We evaluate awareSL on various DNN models to demonstrate its effectiveness in offloading computations and preserving data privacy against input reconstruction attacks.

The structure of this paper is as follows: Sect. 2 reviews related works. In Sect. 3, we discuss the design of our framework and the privacy metrics. We present our evaluation results in Sect. 4 and conclude in Sect. 5.

2 Related Works

Privacy Attacks on Edge-Cloud Systems. Depending on the victim that attackers exploit, privacy attacks on SL can be categorized as model-oriented attacks [7], which aim to extract an equivalent model and duplicate the functionality of the ML model, or data-oriented attacks [13,22], where the attackers attempt to recover sensitive data such as raw inputs and membership information. Our work focuses on data-oriented attacks, more specifically the data reconstruction attacks presented in [7,13,22], where an attacker can either query the model or recover the input samples from intermediate outputs. Data reconstruction attacks show that an adversary can reconstruct private input samples under a variety of threat models (e.g., attacker capabilities, query access), given knowledge of the edge model (i.e., white-box or black-box) or knowledge about the training set. This is done by minimizing the error between the original input and the reconstructed sample using likelihood estimation (rMLE) [7] or the mean squared error (MSE) [13] with a shadow model.

Privacy Preservation in SL. Research efforts [1,7,8,15,20] have been made to reduce the privacy leakage of inference data in SL through homomorphic encryption and measurable privacy metrics. For instance, homomorphic encryption [8,15] allows the ML model to perform inference directly on encrypted data without intermediate decryption or prior knowledge of the private key, which prevents the reconstruction of sensitive information. However, these cryptography-based methods introduce extra communication costs and computation overhead, which is not practical in realistic deployment scenarios. Meanwhile, most measurable privacy metrics [1,7,10,20] only focus on input data privacy during the model inference process. We argue that the way the model is partitioned and the privacy measurement during the training process also play vital roles in improving the robustness and effectiveness


Fig. 2. The overview of awareSL. awareSL analyzes the memory usage and computation load for each layer as well as for two partial models, then quantifying data privacy measurements during the training process. Thus, we identify split points that achieve a balance between data privacy, resource utilization, and model accuracy.

against input reconstruction attacks. Therefore, in our paper, we discuss how to split a DNN based on the computational resources on the edge devices with a focus on privacy preservation, which is insufficiently investigated by prior works.

3 Design of awareSL

3.1 Resource-Aware Model Partitioning

As shown in Fig. 2, we design awareSL to examine the potential split points based on resource constraints and also to minimize the privacy leakage of input samples. For an edge-cloud system, splitting a DNN topology at different layers results in different computation offloading, communication latency, and resource usage. In addition, data privacy in edge-cloud communication demands attention in model training and inference, since data exchanges between DNN sub-networks might expose sensitive information of the input data. Therefore, we regard resource-aware DNN partitioning as a search problem to identify the optimal strategy that achieves high data privacy, high model accuracy, and low computational resource usage. This optimization focuses on the following four factors:

1. Computation Complexity: DNN partitioning should offload the computation based on the model complexity and the edge device's computational budget. The latter partially depends on its power consumption, which should also be considered in practice.
2. Memory Usage: The edge device hosts the first portion of the DNN in memory and completes the partial model executions during both training and inference. Memory usage and requirements depend on the DNN architecture and concrete layer types.


3. Model Accuracy: DNN partitioning should not reduce the model performance. Our objective is to improve input data privacy without considerable accuracy loss.
4. Input Data Privacy: The risk of input data reconstruction mainly arises from the intermediate data exchanges that occur during edge-cloud communication. Therefore, it is preferable to measure the privacy leakage in the edge-cloud system quantitatively.

Algorithm 1. Resource-aware DNN Partitioning Under Device Constraints
Input: DNN model F with N layers; layer information {Li | i = 1 · · · N}; hardware specification of the target platform: memory M_platform and computation power C_platform
Output: Split point SP

1:  function PerLayerAnalysis(F)
2:      for each i ∈ 1, 2, · · · , N do
3:          LC_i ← GetFLOPS(L_i)                        ▷ Calculate per-layer computation cost
4:          LM_i ← GetMemorySize(L_i)                   ▷ Calculate per-layer memory usage
5:          C_edge = Σ_{j=1}^{i} LC_j                   ▷ Total computation cost on the edge side
6:          M_edge = Σ_{j=1}^{i} LM_j                   ▷ Total memory usage on the edge side
7:          if C_edge < C_platform and M_edge < M_platform then   ▷ Check resource budgets
8:              ValidPartitions ← Append(i)             ▷ Collect all valid split points
9:      return ValidPartitions
10: function PartitionDecision(F)                       ▷ Main function
11:     SplitPoints ← PerLayerAnalysis(F)
12:     Initialize lists PL, ACC                        ▷ Privacy leakage, inference accuracy
13:     for each k ∈ SplitPoints do                     ▷ Split the model at every valid split point
14:         F_E, F_C ← SplitModel(F, k)                 ▷ Deploy the split portions respectively
15:         TrainSplitModels(F_E, F_C)                  ▷ Training on tasks
16:         PL_k ← PrivacyMeasure(F_E)                  ▷ Examine the privacy metric after training
17:         ACC_k ← GetAccuracy(F_E, F_C)               ▷ Measure the inference accuracy
18:     SP ← FindMinMax(PL, ACC)                        ▷ Determine the optimal split point
19:     return SP

We present our method for determining optimal split points given a set of edge constraints in Algorithm 1. Consider a DNN model F with N layers {Li | i = 1 · · · N} that is split into two sub-networks FE and FC. We first apply per-layer analysis to calculate the computational resource consumption of each layer in terms of floating-point operations (FLOPs) and memory usage (Lines 3–4). Then, we accumulate the FLOPs and the memory usage of the sub-network FE to be deployed on the edge device (Lines 5–6). We compare the total required computation cost and memory usage of the edge device's model portion with the resource capability of the target edge platform (Line 7) and mark the split point as valid if it satisfies the resource requirements (Line 8). After collecting all valid split points through per-layer analysis, we split the DNN F into FE and FC and deploy them onto the edge device and the cloud server for each valid split point k (Lines 13–14), followed by the data privacy measurement and optimization during the training process. Specifically, we integrate the privacy metric into the loss function and minimize the privacy leakage PLk of input samples during the collaborative training (Line 16), which is discussed in detail in Sect. 3.2. Meanwhile, we also evaluate the model inference accuracy


ACCk in comparison with conventional model training to ensure no significant accuracy drop (Line 17). Therefore, we leverage a min-max strategy to determine the optimal split point with the highest inference accuracy and minimum input data privacy leakage (Line 18). In other words, we optimize privacy protection against reconstruction attacks on raw input samples.
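A minimal sketch of this search procedure, under the assumption that per-layer FLOPs and memory figures as well as the measured leakage and accuracy values are already available, could look as follows; the tie-breaking rule in the final selection is a simplification of the min-max step.

```python
# Minimal sketch of the split-point search in Algorithm 1; the per-layer profiling
# inputs and the scoring rule in the final selection are illustrative assumptions.
def valid_split_points(layer_flops, layer_mem, c_platform, m_platform):
    """layer_flops / layer_mem: per-layer FLOPs and memory of layers 1..N."""
    valid = []
    c_edge = m_edge = 0.0
    for i, (fl, mem) in enumerate(zip(layer_flops, layer_mem), start=1):
        c_edge += fl                       # cumulative cost of the edge portion
        m_edge += mem
        if c_edge < c_platform and m_edge < m_platform:
            valid.append(i)
    return valid

def choose_split(valid, privacy_leakage, accuracy):
    """privacy_leakage / accuracy: dicts keyed by split point.
    Pick the split with minimum leakage, breaking ties by higher accuracy."""
    return min(valid, key=lambda k: (privacy_leakage[k], -accuracy[k]))
```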

3.2 Privacy Preservation via Distance Correlation

As shown in Fig. 3, a complete DNN model is split into two portions at the split point and distributed to the edge device and cloud server respectively. This scenario creates a privacy vulnerability in the form of exposed intermediate layer data. Previous works [7,22] study dynamic partitioning by calculating the privacy only during the inference process. We extend those works by investigating the impacts of incorporating privacy awareness into the training process.

Fig. 3. Privacy metric and task loss function calculation in awareSL.

Specifically, we apply the distance correlation (DCOR) techniques introduced in [18,20] to quantitatively measure the ease of reconstructing input samples from the intermediate activation outputs at the split layer. Distance correlation is the normalized version of distance covariance; both measure the dependence between two vectors of arbitrary (but not necessarily equal) dimension. Unlike Pearson's correlation, distance correlation can capture both linear and non-linear dependencies. We compute DCOR between the raw inputs X and the intermediate activation outputs FE(X) at the split layer for each batch as:

DCOV(X, F_E(X)) = n^2 \, \mathrm{Tr}\big( \mathrm{Cov}(X) \cdot \mathrm{Cov}(F_E(X)) \big)   (1)

DCOR(X, F_E(X)) = \frac{DCOV^2(X, F_E(X))}{\sqrt{DCOV^2(X) \, DCOV^2(F_E(X))}}   (2)

During our learning protocol, the DNN model parameters are updated not only by the prediction loss values but also by the data privacy measurements, which can be represented by the following loss function:

L_{total} = \alpha_1 \cdot DCOR(X, F_E(X)) + \alpha_2 \cdot L_{tk}(F_E(X), y)   (3)


where DCOR is the distance correlation metric, Ltk is the task loss of the distributed model (e.g., cross-entropy for a classification task), and y is a suitable label for the target task (if any). The hyper-parameters α1 and α2 define the relevance of distance correlation in the final loss function, creating and managing a tradeoff between data privacy (i.e., how much information an attacker can recover from the smashed data) and the model's utility on the target task (e.g., inference accuracy). For attackers, this optimization increases the difficulty of reconstructing the original input samples from intermediate activation data. Note that the distance correlation term depends only on the edge's network FE and the private data X. Thus, it can be computed and applied locally on the edge side without any influence from the cloud.

Table 1. Edge device examples with constrained computational resources and memory size, including the computation power as GFLOPS per watt, on-chip RAM (OCRAM), and synchronous DRAM (SDRAM).

Edge Device         | Computation Power (GFLOPS/W) | OCRAM (KB) | SDRAM (MB)
Google Coral SoM    | 24.6                         | 160        | 1024
Intel Myriad 2      | 100                          | 2048       | 1024
Intel NCS 2         | 667                          | 2560       | 4096
GAP 8 (a)           | 20                           | 512        | 15
Raspberry Pi Zero W | 0.238                        | 288        | 512
Nvidia Jetson Nano  | 47.2                         | 262        | 4096
(a) GAP 8 only consumes milliwatts of power, so we use its maximum computation power.
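As a concrete illustration of Eqs. (1)–(3), the following PyTorch sketch computes a batch-wise distance correlation term with the standard sample estimator and combines it with a cross-entropy task loss; the exact normalization used in the paper may differ, so this is a sketch rather than the paper's implementation.

```python
# Sketch of the distance-correlation privacy term and the combined loss of Eq. (3),
# using the standard sample estimator (double-centered distance matrices).
import torch
import torch.nn.functional as F

def _centered_dist(x):
    d = torch.cdist(x, x)                        # pairwise Euclidean distances
    return d - d.mean(0, keepdim=True) - d.mean(1, keepdim=True) + d.mean()

def distance_correlation(x, z, eps=1e-9):
    # returns the squared sample distance correlation, which works equally well
    # as a penalty term in the loss
    x = x.flatten(1); z = z.flatten(1)           # (batch, features)
    A, B = _centered_dist(x), _centered_dist(z)
    dcov2_xz = (A * B).mean()
    dcov2_xx = (A * A).mean()
    dcov2_zz = (B * B).mean()
    return dcov2_xz / (torch.sqrt(dcov2_xx * dcov2_zz) + eps)

def total_loss(x, smashed, logits, y, alpha1=0.5, alpha2=1.0):
    # Eq. (3): weighted sum of the privacy term and the task loss
    return alpha1 * distance_correlation(x, smashed) + alpha2 * F.cross_entropy(logits, y)
```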

4 Evaluation

4.1 Experimental Setup

Datasets and Models. We evaluate awareSL on image classification tasks with multiple neural networks, VGGNet and ResNet. We intentionally chose these networks because they include both sequential and non-sequential architectures. Non-sequential networks such as ResNet, built by stacking residual blocks, contain skip connections that limit the choice of split points, since the model can only be split between residual blocks, while sequential networks such as VGGNet can be split at any single layer. The datasets we evaluate include FashionMNIST, CIFAR-10/100, and ImageNet.

Edge Devices and Hardware Specifications. To evaluate awareSL under varying computational budgets, we list commercial edge device examples from different vendors in Table 1. We calculate the computational power of these


devices per billion floating-point operations (GFLOPS) executed within a nominal 1-watt power envelope. To define the memory budgets, we consider the on-chip SRAM memory, including L1 and L2 caches. Knowing the edge hardware specifications, awareSL optimizes the split point based on an edge device's available resources to achieve high accuracy and privacy.

4.2 Resource-Aware Computation Partitioning

To ease the deployment of a split DNN, we first investigate each layer's memory usage and computation characteristics. This information provides the exact resource consumption of the two DNN portions and identifies the valid split points under the resource budgets of the target edge device.

Per-Layer Analysis. Figure 4 shows the network architecture and the per-layer analysis results for each convolutional layer in the VGG16 model. Figure 4a presents the VGG16 network architecture as an example and shows the different split points for this model. It contains 16 weight layers: 13 convolutional layers that perform 3 × 3 convolutions in a homogeneous architecture, and 3 fully-connected layers, the first two of which contain 4096 channels each, while the third performs 1000-way ILSVRC classification with the final soft-max layer. We go through each layer and explore its parameters, such as the kernel sizes, kernel numbers, and activation functions.

Fig. 4. Resource analysis of a single layer and the corresponding partial model on the edge device. (a) VGG16 architecture. (b) Per-layer analysis of memory usage in MB (layer weights and layer activations). (c) Computational cost in MFLOPs of each single layer and the total cost of the edge device's partial model.



Memory Usage. We consider the memory usage of each convolutional layer in the following aspects: (1) intermediate layer outputs including the activations and their gradients. These values depend on the input image size and layer type, and they are retained for the back-propagation process. In Fig. 4b, we show the memory usage results for a single 224 × 224 image sampled from ImageNet as the input volume. For instance, the size of layer outputs at conv1-1 is 224 × 224 × 64, which turns out to be around 12.8 MB if we assume floating point numbers are stored in four bytes. Since this is for only one image, the total memory size for each layer’s output should be multiplied by the batch size in practice; (2) neuron parameters of weights. We do not consider the biases since there are comparatively very few. The weights of each convolutional layer rely on the filter size and number of features. For instance, the total number of weights for layer conv3-1 is (3 × 3 × 128) × 256. Therefore, we observe that the early convolutional layers have a large number of activation calculations and the later convolutional layers have intensive weight updates. Besides, pooling layers are inserted in VGG16 to reduce the number of parameters (i.e., computation) in Table 2. The DCOR measured for the ease of input reconstruction (lower is better) before and after the training of VGG16 on CIFAR-10 dataset when partitioning it on different split layers. Split Layer DCOR-before DCOR-after Δa

Accuracyb (%)

conv1-1

0.721

0.49

0.231 93.29

conv1-2

0.626

0.415

0.211 93.33

pooling1

0.567

0.427

0.140 93.53

conv2-1

0.554

0.381

0.173 93.9

conv2-2

0.528

0.351

0.177 93.99

pooling2

0.472

0.353

0.119 94.36

conv3-1

0.494

0.337

0.157 93.87

conv3-2

0.424

0.325

0.099 93.41

conv3-3

0.478

0.288

0.190 93.75

pooling3

0.449

0.315

0.134 92.08

conv4-1

0.43

0.277

0.153 93.84

conv4-2

0.427

0.251

0.176 93.57

conv4-3

0.369

0.249

0.120 94.22

pooling4

0.38

0.257

0.123 93.85

conv5-1

0.358

0.239

0.119 93.76

conv5-2

0.375

0.252

0.123 93.56

conv5-3 0.366 0.287 0.079 93.43 a DCOR difference before and after training. b This is the final inference accuracy after model training and privacy optimizing. Note that the baseline accuracy of VGG16 on CIFAR-10 dataset is 92.95%.

Resource-Aware DNN Partitioning for Privacy-Sensitive Edge-Cloud Systems

197

the network, and thus they have zero weights (i.e., no parameter updates) and we only calculate their activation sizes as memory usage. Computation Complexity. Figure 4c shows the computation costs of training VGG16 on CIFAR-10 dataset. We describe the computation complexity using the sum of floating-point operations (FLOPs). We calculate the FLOPs for each split layer and the total FLOPs for the edge device’s model portion after splitting, which indicates the requirement of computational resources to be compared with the computation capability of edge devices (e.g., instances in Table 1). The total computation operations depend on image sizes, layer types, and back-propagation processes, and the results of Fig. 4c are calculated for the feed-forward process with the input volume as one single 224 × 224 image. 4.3

Balancing Model Accuracy and Privacy Preservation

Higher Privacy

Varying Split Points. Changing the split location on a DNN model affects its resource requirements, and also impacts data privacy and inference accuracy. In Table 2, we present the effectiveness of awareSL with the training results of VGG16 on CIFAR-10 dataset when partitioning the model at different layers. We calculate the DCOR before and after training, and the final inference accuracy on the testing dataset. We observe that: (1) The different split points change the initial DCOR of a split model, with the initial DCOR decreasing as more layers are split to the edge device. This is intuitive since additional layers between the intermediate outputs and raw input samples result in reduced statistical dependence; (2) Despite each partitioning strategy having a distinct computation profile and resource consumption, inference accuracy is stable when splitting at different layers, and many split points have higher accuracy than the baseline accuracy of conventional non-split settings.

Higher Accuracy

pooling3

conv1-1 pooling1 conv1-2 conv2-1 conv2-2 pooling2 conv3-2 conv3-1 conv5-3 conv5-2 conv4-2

conv3-3 conv4-1 pooling4 conv5-1 conv4-3

Fig. 5. Scatter plot result between inference accuracy and distance correlation.

198

A. Ding et al.

(a) Average DCOR

(b) Inference Accuracy

Fig. 6. Average DCOR among training batches and inference accuracy results when taking different importance scores α1 at the same split point.

In Fig. 5, we present the training results of VGG16 on CIFAR-10 by different split points, which provide a visual representation that helps users determine the optimal split point by examining the trade-off between privacy protection and inference accuracy. By assessing their specific accuracy and privacy requirements, users can effectively select the optimal split point in practice. Varying Importance Scores. In Eq. 3, the final loss function is calculated by the task loss and distance correlation with different importance scores. In Fig. 6, we present the training results obtained by splitting ResNet50 at the layer blocks 3–6 and training on CIFAR-10 dataset, while varying the distance correlation’s importance score α1 . We observe the following: (1) The varying distance correlation importance scores in the final loss function lead to distinct DCOR measurements. The final DCOR after model training decreases as α1 increases as a greater emphasis is placed on optimizing the DCOR calculation during training; (2) The distance correlation importance scores do not significantly impact the inference accuracy after the split model training, but they do impact the number of epochs needed for the model convergence. Compared to the conventional non-split case (setting α1 = 0), awareSL achieves comparable inference accuracy results for different importance scores. 4.4

Inference Data Privacy Protection

In this section, we explore the defense capability of awareSL against input reconstruction attacks [7,13,22]. We show awareSL can protect against these attacks with a negligible impact on the system’s performance and functionalities. Defense Against Reconstruction Attacks. Figure 7 shows the reconstruction attack that recovers raw input samples from intermediate outputs at the

Resource-Aware DNN Partitioning for Privacy-Sensitive Edge-Cloud Systems

199

Fig. 7. Input reconstruction attack results and awareSL defense results on FashionMNIST dataset with different setups of split points and importance scores. The coefficient of task loss α2 is set as 1.

split layer and awareSL’s defense results. The first row shows the original inference samples, and the second row shows the recovered images by the reconstruction attack where we observe that the adversary can accurately recover the images with high fidelity. The remaining rows show that different split points and importance scores yield different reconstruction results. Generally, we observe that the quality of recovered images decreases when the split layer becomes deeper. This is straightforward as the relationship between input and output becomes more complex and harder to revert when there are more layers. Besides, we also observe that the image quality drops significantly with the increasing importance score α1 (i.e., coefficient of the DCOR term) in the training loss function, indicating that involving and optimizing DCOR during the training process is effective in defending input reconstruction attacks. The reason is that DNN model parameters are tuned not only based on the prediction loss values but also on the privacy metric of input samples during the training process. Robustness Across DNN Architectures. We compare the Top-1 accuracy of awareSL with baseline setup as conventional split learning [5,21] without privacy protection on different testing sets, as shown in Table 3. We observe that when the adversary knows either the training data or its distribution, the recovered images from Inverse-Network [7] maintain higher quality. However, our work significantly reduces the image quality reconstructed by the adversary. Meanwhile, awareSL does not reduce the model accuracy in the inference stage compared to conventional split learning. Our experiments reveal the robustness of our privacy-preserving model partitioning approach across various DNN models and image datasets, demonstrating the feasibility of awareSL in privacy protection and highlighting the need for further research into privacy analysis in an edge-cloud collaborative system.

200

A. Ding et al.

Table 3. Comparison of the classification accuracy for conventional DNN training, split learning baseline [5, 21] and awareSL on different datasets. Model VGG16

5

FashionMNIST CIFAR-10 CIFAR-100 ImageNet Non-split 94.71 SL baseline 95.03 awareSL 94.32

92.95 92.64 94.03

74.20 74.25 74.25

72.88 72.60 72.64

ResNet18 Non-split 94.80 SL baseline 95.29 awareSL 95.43

93.42 93.19 94.07

72.49 72.69 72.47

69.06 69.30 68.98

ResNet50 Non-split 95.10 SL baseline 95.34 awareSL 95.50

94.31 94.67 94.55

73.83 73.66 73.81

75.35 75.80 75.44

Conclusion

We propose awareSL, a DNN partitioning framework designed to address resource constraints and privacy concerns in edge-cloud systems. Compared with prior works, awareSL optimizes privacy metrics during the training process. Our experiments show that awareSL effectively preserves the privacy of input samples without compromising inference accuracy in practical deployments.

References 1. Abuadbba, S., et al.: Can we use split learning on 1D CNN models for privacy preserving training? In: Proceedings of the 15th ACM Asia Conference on Computer and Communications Security, pp. 305–318 (2020) 2. Banitalebi-Dehkordi, A., Vedula, N., Pei, J., Xia, F., Wang, L., Zhang, Y.: Autosplit: a general framework of collaborative edge-cloud AI. In: Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining (2021) 3. Ding, A.: Trustworthy cyber-physical systems via physics-aware and AI-powered security. Ph.D. thesis, Rutgers The State University of New Jersey, School of Graduate Studies (2022) 4. Ding, A., Murthy, P., Garcia, L., Sun, P., Chan, M., Zonouz, S.: Mini-Me, you complete me! data-driven drone security via DNN-based approximate computing. In: 24th International Symposium on Research in Attacks, Intrusions and Defenses, pp. 428–441 (2021) 5. Gupta, O., Raskar, R.: Distributed learning of deep neural network over multiple agents. J. Netw. Comput. Appl. 116, 1–8 (2018) 6. Hassanzadeh, A., Liberman, N.H., Ding, A., Salem, M.B.: Privacy-preserving collaborative machine learning training using distributed executable file packages in an untrusted environment, 29 December 2022. US Patent App. 17/356,447 7. He, Z., Zhang, T., Lee, R.B.: Model inversion attacks against collaborative inference. In: Proceedings of the 35th Annual Computer Security Applications Conference, pp. 148–162 (2019)

Resource-Aware DNN Partitioning for Privacy-Sensitive Edge-Cloud Systems

201

8. Juvekar, C., Vaikuntanathan, V., Chandrakasan, A.: {GAZELLE}: a low latency framework for secure neural network inference. In: 27th USENIX Security Symposium (USENIX Security 2018), pp. 1651–1669 (2018) 9. Kang, Y., et al.: Neurosurgeon: collaborative intelligence between the cloud and mobile edge. ACM SIGARCH Comput. Archit. News 45(1), 615–629 (2017) 10. Khalili, H., Chien, H.J., Hass, A., Sehatbakhsh, N.: Context-aware hybrid encoding for privacy-preserving computation in IoT devices. IEEE Internet Things J. (2023) 11. Mehta, R., Shorey, R.: DeepSplit: dynamic splitting of collaborative edge-cloud convolutional neural networks. In: 2020 International Conference on COMmunication Systems & NETworkS (COMSNETS), pp. 720–725. IEEE (2020) 12. Mohammed, T., Joe-Wong, C., Babbar, R., Di Francesco, M.: Distributed inference acceleration with adaptive DNN partitioning and offloading. In: IEEE INFOCOM 2020-IEEE Conference on Computer Communications, pp. 854–863. IEEE (2020) 13. Pasquini, D., Ateniese, G., Bernaschi, M.: Unleashing the tiger: inference attacks on split learning. In: Proceedings of the 2021 ACM SIGSAC Conference on Computer and Communications Security, pp. 2113–2129 (2021) 14. Pham, N.D., Abuadbba, A., Gao, Y., Phan, T.K., Chilamkurti, N.: Binarizing split learning for data privacy enhancement and computation reduction. IEEE Trans. Inf. Forensics Secur. 18, 3088–3100 (2023) 15. Rathee, D., et al.: CrypTFlow2: practical 2-party secure inference. In: Proceedings of the 2020 ACM SIGSAC Conference on Computer and Communications Security (2020) 16. Shao, J., Zhang, J.: Communication-computation trade-off in resource-constrained edge inference. IEEE Commun. Mag. 58(12), 20–26 (2020) 17. Shi, C., Chen, L., Shen, C., Song, L., Xu, J.: Privacy-aware edge computing based on adaptive DNN partitioning. In: 2019 IEEE Global Communications Conference (GLOBECOM), pp. 1–6. IEEE (2019) 18. Székely, G.J., Rizzo, M.L., Bakirov, N.K.: Measuring and testing dependence by correlation of distances. Ann. Stat. 35(6), 2769–2794 (2007) 19. Tang, M., et al.: FADE: enabling large-scale federated adversarial training on resource-constrained edge devices. arXiv preprint arXiv:2209.03839 (2022) 20. Vepakomma, P., Gupta, O., Dubey, A., Raskar, R.: Reducing leakage in distributed deep learning for sensitive health data. arXiv preprint arXiv:1812.00564 (2019) 21. Vepakomma, P., Gupta, O., Swedish, T., Raskar, R.: Split learning for health: distributed deep learning without sharing raw patient data. arXiv preprint arXiv:1812.00564 (2018) 22. Yang, Z., Zhang, J., Chang, E.C., Liang, Z.: Neural network inversion in adversarial setting via background knowledge alignment. In: Proceedings of the 2019 ACM SIGSAC Conference on Computer and Communications Security (2019)

A Frequency Reconfigurable Multi-mode Printed Antenna Yanbo Wen1 , Huiwei Wang1,2,3(B) , Menggang Chen1 , Yawei Shi1 , Huaqing Li1 , and Chuandong Li1 1

2

College of Electronic and Information Engineering, Southwest University, Chongqing 400715, China [email protected] Key Laboratory of Intelligent Information Processing, Chongqing Three Gorges University, Chongqing 404100, China 3 Chongqing Innovation Center, Beijing Institute of Technology, Chongqing 401120, China

Abstract. A multi-frequency reconfigurable antenna is proposed. The designed antenna can be electronically tuned to achieve the tuning operation in the 2.4 GHz band defined by the IEEE 802.11b standard and the ultra-wideband (UWB) low frequency. An L-shaped branch and a polygon patch are used as the main radiators of the antenna. Two varactor diodes are mounted on the slots and one PIN diode is mounted on the L-shaped branch to vary the effective electrical length of the antenna. The simulation and measurement results match well. With high frequency reconfiguration stability under guaranteed miniaturization, good impedance matching (S11 > −10 dB) is obtained in several operating bands, and the overall impedance bandwidth covers 2.34–2.58 GHz and 3.11–5.14 GHz. It provides solutions for operation within WiMax, WLAN, and 5G-sub6 GHz. Keywords: miniaturization patch · pin diodes

1

· frequency reconfigurable · polygonal

Introduction

The rapid development of 5G communication technology in recent years has led to a shortage of spectrum resources, while modern wireless communication systems usually have multiple standards, and antenna modules, as an important component, undertake the task of receiving and transmitting electromagnetic waves, thus requiring antennas with the ability to be compatible with multiple communication standards. In contrast to broadband antennas [1], multi-band antennas can provide multiple frequencies while improving the isolation between Supported in part by the China Postdoctoral Science Foundation 2022M720453, and in part by the Science and Technology Research Program of Chongqing Municipal Education Commission under Grant KJZD-M202201204. c The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2024  B. Luo et al. (Eds.): ICONIP 2023, LNCS 14451, pp. 202–210, 2024. https://doi.org/10.1007/978-981-99-8073-4_16

A Frequency Reconfigurable Multi-mode Printed Antenna

203

bands and reducing interference. Antennas with reconfigurable capabilities show good promise and research potential [2,3]. Frequency reconfigurable antennas can dynamically adjust the frequency allocation and flexibly control the electromagnetic waves in different application scenarios, which effectively solves the problem of spectrum resource constraint. In [4], the actual electrical length of the antenna radiator was changed by using placing a PIN diode switch on the radiator of the antenna, thus enabling the operation of reconfigurable frequencies. However, this approach can only perform discrete reconfiguration of the frequency and cannot perform continuous tuning. In [5], PIN diodes loaded on two semicircular slits were implemented to switch between three modes: low-band, high-band, and dual-band. However, the number of PIN diodes loaded by the antenna reaches eight respectively, which causes the design of the bias circuit to become complicated and also increases the interference to the radiation.

Fig. 1. Geometry of the proposed antenna. (a) (b) Top layer. (c) Sectional view.

In [6], continuous tuning of the antenna frequency is achieved by loading varactor diodes on the antenna radiating element. In [7], a frequency reconfigurable 1 × 4 patch array antenna is proposed, achieving continuous tuning by loading a parasitic patch and adjusting the DC voltage of the varactor diodes. However, the tunable range of the antennas in [6,7] is usually narrow.

Table 1. The states of the diodes and the corresponding bias voltages.

State  Bias voltages (V)           Group I  Group II  Group III
       V0     V1    V2   V3        D1       D2        C
I      0      0     0    1.4       ON       OFF       –
II     1.4    1.4   0    0         OFF      ON        3.2 pF
       21.4   1.4   0    0         OFF      ON        0.35 pF
III    1.4    1.4   0    1.4       ON       ON        3.2 pF
       21.4   1.4   0    1.4       ON       ON        0.35 pF

In [8], a multistate frequency reconfigurable monopole antenna was realized using fluid channels: changing the number of fluid-filled channels switches the antenna among four operating modes with stable radiation characteristics. However, the structure of that antenna is more complicated, and it is difficult to control the velocity of the fluid. The novel multi-band reconfigurable antenna proposed in this paper addresses the problems of the inability to tune continuously, complex bias circuits, narrow operating bandwidth, and complex structure. By controlling the PIN diodes and varactor diodes on the radiator, the antenna can operate at 2.38–2.52 GHz or 3.15–4.91 GHz in single-band operation and at 2.34–2.58 GHz and 3.11–5.14 GHz in dual-band operation. The overall reconfiguration behavior of this antenna is stable, and it can be effectively applied to wireless communication systems such as wireless local area networks (WLAN) and satellite communication.

2 Design and Analysis of Antenna

Figure 1 depicts the structure of the multi-frequency reconfigurable antenna proposed in this paper. The main radiating part of the antenna contains an L-shaped branch and a polygonal patch. The antenna is printed on an FR4 dielectric substrate of size 30 × 26 mm2 with a thickness of 1.6 mm, a dielectric constant of 4.4, and a loss tangent of 0.02. Two PIN diodes are installed as switches on the L-shaped branch and the polygonal patch. Two varactor diodes are connected across a slot inside the polygonal patch, with the positive terminal connected to the upper half of the patch. The states of the PIN diodes and varactor diodes are controlled by a bias circuit, which changes the effective electrical length of the antenna, shifting its resonant frequency points and varying its impedance bandwidth. The antenna is fed by a coplanar waveguide (CPW), which integrates the radiator and the ground plane into one plane to facilitate processing and manufacturing.

Fig. 2. Frequency reconfiguration antenna design process. (a) Antenna I. (b) Antenna II. (c) Antenna III.

Figure 2 depicts the whole design process of the multi-frequency reconfigurable antenna. Antenna I is an L-shaped branch antenna, which can be regarded as a monopole antenna. Antenna II is a hexagonal microstrip patch antenna. The impedance of the hexagonal patch changes slowly as the current passes over its edges, so it resonates more easily than other shapes and has advantages in return loss and operating bandwidth. Antenna III is obtained by combining and deforming Antenna I and Antenna II. The end of the L-shaped branch is folded to reduce the overall size of the antenna. A PIN diode and a varactor diode are installed on the slot. A triangular patch is added to the top of the patch to extend the effective current path, which shifts the operating bandwidth to lower frequencies and improves impedance matching. The PIN diode connects the L-shaped branch to the polygonal patch. Discrete frequency reconfiguration is achieved by adjusting the voltages V1, V2, and V3 to switch the PIN diodes between different states. The voltages V0 and V1 across the varactor diodes are controlled by the DC bias circuit. As the voltage difference changes, the capacitance of the varactor diodes changes, further affecting the impedance bandwidth, which enables continuous tuning of the antenna frequency. The bias voltages V1, V2, and V3 control the switching states of D1 and D2, and the voltage difference between V0 and V1 changes the capacitance of D3 and D4, which switches the operating mode of the antenna. When D1 is on and D2 is off, the current cannot pass through D2; the surface current is concentrated on the L-shaped branch, which radiates, and the antenna produces a resonant point at 2.45 GHz. When D1 is off and D2 is on, the surface current flows along the edge of the patch through D2, making the patch radiate; the current is concentrated on the horizontal branch and the patch, and the antenna produces a resonant point at 4.65 GHz. When both D1 and D2 are on, the current flows through both diodes, making the L-shaped branch and the patch radiate, and the antenna produces resonant points at 2.45 GHz and 4.56 GHz. Figure 3 depicts the antenna surface current distribution in different states.


Fig. 3. Antenna surface current distribution. (a) State 1 at 2.45 GHz. (b) State 2 at 4.65 GHz. (c) State 3 at 2.45 GHz. (d) State 3 at 4.56 GHz.

The PIN diodes and varactor diodes are divided into three groups: D1 as the first group, D2 as the second group, and D3 and D4 as the third group. The diode states corresponding to each set of bias voltages are summarized in Table 1. The simulated S11 results in the different states are shown in Fig. 4. When the antenna is in state I, the antenna current is concentrated on the L-shaped branch and is not affected by the varactor diodes. When the antenna is in states II and III, the capacitance C of the varactor diodes varies from 3.2 pF to 0.35 pF as the voltage difference between V0 and V1 changes from 0 to 20 V, which shifts the operating bandwidth and thus tunes the frequency continuously. In state II, when C = 3.2 pF the operating bandwidth of the antenna covers 3.14–4.72 GHz, and when C = 0.35 pF it covers 3.61–5.01 GHz. In state III, as the voltage difference across the varactor diodes varies from 0 to 20 V, the overall operating bandwidth of the antenna covers 2.34–2.58 GHz and 3.12–5.14 GHz.

Table 2. Parameters of the proposed antenna.

Parameter    sub l  gnd l  l    sub w  gnd w  line w  rl1  rl2
Value (mm)   30     5.6    13   26     11.24  3       2.8  6.9
Parameter    sub h  slot   dw   g
Value (mm)   1.6    1      1.3  0.26

Fig. 4. Simulated S11 in different states.

3 Results and Discussion

The proposed multi-frequency reconfigurable antenna was fabricated and measured; the fabricated prototype is shown in Fig. 5. The impedance matching of the antenna varies across the different states, and the optimized parameters of the proposed antenna are collected in Table 2. The chosen PIN diode is the BAR64-02V from Infineon, which according to the datasheet can be modeled simply as a 2.1 Ω series resistor in the on state and as a parallel circuit of a 0.17 pF capacitor and a 3 kΩ resistor in the off state. The varactor diode selected for frequency tuning is the SMV2020-079LF from Skyworks, whose capacitance changes from 3.2 pF to 0.35 pF as the reverse bias voltage varies from 0 to 20 V, accompanied by a 0.7 nH series inductor and a 2.5 Ω series resistor.

Fig. 5. Fabricated prototype of the proposed antenna.

In order to minimize the interference of the bias circuit with the radiation performance, the bias lines are placed perpendicular to the radiating part, and four 100 nH inductors are connected to the bias circuit as RF chokes. As shown in Fig. 6, the antenna in state III resonates at 2.45 GHz and 4.55 GHz (C = 3.2 pF)/4.56 GHz (C = 0.35 pF), which are the resonant frequencies of the designed L-shaped branch and the designed patch. The antenna in state I, acting as a monopole antenna, resonates at 2.45 GHz. In state II, the antenna resonates at 4.15 GHz (C = 3.2 pF)/4.65 GHz (C = 0.35 pF). The measured results match the simulated results well, but the return loss values are lower. This difference is mainly due to the simple equivalent circuits used in the simulation to model the different states of the diodes, which introduce some error; it is also influenced by the manufacturing process and the losses of the devices themselves. According to the measurement results, the 10 dB bandwidth covers 2.38–2.52 GHz in state I, 3.15–4.91 GHz in state II, and 2.34–2.58 GHz and 3.11–5.14 GHz in state III. Comparing the three states, the ranges covered by the impedance bandwidths largely overlap, which indicates that the reconfiguration performance of this frequency reconfigurable antenna is relatively stable.

Fig. 6. Simulated and measured S11. (a) State I. (b) State II. (c) State III.

4 Conclusion

A planar printed antenna with a frequency reconfigurable function was designed and fabricated. The proposed antenna combines an L-shaped branch and a patch and covers 2.34–2.58 GHz and 3.11–5.14 GHz. By controlling the PIN diodes and varactor diodes, both discrete and continuous frequency reconfiguration is possible. The proposed antenna provides a simple-structure, miniaturized, wide-bandwidth, and low-cost solution that can be used in multi-scenario standard wireless applications such as Bluetooth and satellite broadcasting.

References

1. Liu, W., Wang, T., Gao, D., Liu, Y., Zhang, X.: Low-profile broadband magnetoelectric dipole antenna with dual-complementary source. IEEE Antennas Wirel. Propag. Lett. 19(12), 2447–2451 (2020)
2. Yang, D.D., et al.: Frequency reconfigurable hexagonal microstrip antenna for 5G applications. In: 2021 International Conference on Microwave and Millimeter Wave Technology (ICMMT), pp. 1–3. IEEE (2021)
3. Mirzaei, H., Eleftheriades, G.V.: A compact frequency-reconfigurable metamaterial-inspired antenna. IEEE Antennas Wirel. Propag. Lett. 10, 1154–1157 (2011)
4. Sun, M., Zhang, Z., Zhang, F., Chen, A.: L/S multiband frequency-reconfigurable antenna for satellite applications. IEEE Antennas Wirel. Propag. Lett. 18(12), 2617–2621 (2019)
5. Gao, C., Lu, Z.L., Liu, M.: Design of frequency and pattern reconfigurable wideband semi-circular slot ring antenna. In: 2019 4th International Conference on Mechanical, Control and Computer Engineering (ICMCCE), pp. 13–134. IEEE (2019)
6. Nguyen-Trong, N., Piotrowski, A., Fumeaux, C.: A frequency-reconfigurable dual-band low-profile monopolar antenna. IEEE Trans. Antennas Propag. 65(7), 3336–3343 (2017)
7. Hu, J., Yang, X., Ge, L., Guo, Z., Hao, Z.C., Wong, H.: A reconfigurable 1 × 4 circularly polarized patch array antenna with frequency, radiation pattern, and polarization agility. IEEE Trans. Antennas Propag. 69(8), 5124–5129 (2021)
8. Singh, A., Goode, I., Saavedra, C.E.: A multistate frequency reconfigurable monopole antenna using fluidic channels. IEEE Antennas Wirel. Propag. Lett. 18(5), 856–860 (2019)

Multi-view Contrastive Learning for Knowledge-Aware Recommendation

Ruiguo Yu1,2,3, Zixuan Li2,3,4, Mankun Zhao1,2,3, Wenbin Zhang1,3,5, Ming Yang6, and Jian Yu1,2,3(B)

1 College of Intelligence and Computing, Tianjin University, Tianjin, China
{rgyu,zmk,zhangwenbin,yujian}@tju.edu.cn
2 Tianjin Key Laboratory of Cognitive Computing and Application, Tianjin, China
lzx @tju.edu.cn
3 Tianjin Key Laboratory of Advanced Networking, Tianjin, China
4 Tianjin International Engineering Institute, Tianjin University, Tianjin, China
5 Tianjin University Information and Network Center, Tianjin, China
6 College of Computing and Software Engineering, Kennesaw State University, Kennesaw, USA
[email protected]

Abstract. Knowledge-aware recommendation has attracted increasing attention due to its wide application in alleviating data sparsity and cold start, but real-world knowledge graphs (KGs) contain much noise from irrelevant entities. Recently, contrastive learning, a self-supervised learning (SSL) method, has shown excellent anti-noise performance in recommendation tasks. However, the inconsistency between the use of noisy embeddings in SSL tasks and the original embeddings in recommendation tasks limits the model's ability. We propose a Multi-view Contrastive learning for Knowledge-aware Recommendation framework (MCKR) to solve the above problems. To remove inconsistencies, MCKR unifies the inputs of the SSL and recommendation tasks and learns more representations through the contrastive learning method. To alleviate the noise from irrelevant entities, MCKR preprocesses the KG triples according to their type and randomly perturbs the graph structure with different weights. Then, a novel distance-based graph convolutional network is proposed to learn more reliable entity information in the KG. Extensive experiments on three popular benchmark datasets show that our approach achieves state-of-the-art performance. Further analysis shows that MCKR also performs well in reducing data noise.

Keywords: Recommender System · Knowledge Graph · Contrastive Learning · Self-Supervised Learning · Graph Convolutional Network

1 Introduction

Knowledge-aware recommendation aims to learn latent relations among items by exploiting the entity information of a knowledge graph (KG) and the rich connections between entities. In recent years, many studies [10,12] have shown the influential role of KGs in alleviating data sparsity and cold start in recommender systems (RS). However, real-world KGs are sparse and contain many irrelevant entities [7,14]. This problem brings noise that can affect the performance of RS. Specifically, when a GNN-based knowledge-aware recommendation model aggregates neighbor representations, irrelevant entities introduce noise into the item representations. Recently, contrastive learning, a self-supervised learning (SSL) method, has shown excellent anti-noise performance in recommendation tasks [15,19]. SSL tasks are employed as auxiliary tasks to enhance recommendation effectiveness. The objective is to differentiate the representations between nodes, typically achieved through two steps: (1) data augmentation, generating representations for the same node by adding noise at the data or structural level, and (2) contrastive learning, maximizing the consistency of representations for the same node. This forces the model to learn the more beneficial structure and reduces the dependence on any particular edge. The SSL model therefore first adds noise and then uses contrastive learning to counteract it, so it can learn reliable information from the graph. However, while the auxiliary task operates on the embeddings after adding noise, the recommendation task employs the original embeddings. This inconsistency prevents the model from fully leveraging the noise resistance offered by the auxiliary task and limits its ability to effectively capture the information in the data, ultimately impacting the quality of recommendations (see an empirical study in Sect. 5.3).

Based on the aforementioned problems, we propose a general Multi-view Contrastive learning for Knowledge-aware Recommendation (MCKR) framework. Specifically, to address the inconsistency between the recommendation task and the SSL task, MCKR uses the more reliable user and item embeddings learned through contrastive learning as the embeddings for the recommendation task. To alleviate the noise from irrelevant entities on the KG, we design a distance-based augmentation method that preserves more reliable structures during the data augmentation phase. Additionally, we design a degree-based graph convolutional network (GCN) module to obtain more reliable item information. Finally, we leverage the anti-noise ability of contrastive learning to learn more robust embedding representations. We summarize the contributions of this work as follows:
1. We propose a novel framework, MCKR, which eliminates the inconsistency between the SSL task and the recommendation task, thereby enhancing the effectiveness of knowledge-aware recommendation.
2. We design a data augmentation and neighborhood information aggregation method tailored to the structure of the KG.
3. We conduct extensive experiments on three real-world datasets from different domains. The results demonstrate the superiority of MCKR.

2 Related Work

2.1 Knowledge-Aware Recommendation

Embedding-Based Methods. Embedding-based methods [11,13,21] use KG embeddings [5,6] to preprocess the entities. For example, CKE [21] and KGAT [13] use TransR [6] to obtain entity features associated with the user's historical interaction data, and DKN [11] uses TransD [5] to obtain entity representations. The main problem with embedding-based methods is that their main optimization goal is the link prediction task rather than the recommendation task.

GNN-Based Methods. GNN-based methods [12–14] focus on modeling remote connectivity by aggregating the node representations of multi-hop neighbors through graph neural networks (GNNs) to obtain node information and graph structure information within several hops. KGAT [13] combines the user-item graph with the KG into a heterogeneous graph, on which a GCN is then applied for aggregation. KGCN [12] uses a GCN to obtain item embeddings by iteratively aggregating the neighborhood information of items. KGIN [14] models user-item interactions at the intent level and performs GNN aggregation on user-item-entity connections in combination with the KG triples.

Path-Based Methods. Path-based methods [4,9] typically construct meta-paths of information propagation to explore the potential connections of items on the KG. RKGE [9] uses a recurrent neural network for path modeling, and MCRec [4] uses a convolutional neural network. High-quality meta-paths can capture rich KG information, but building these paths takes much time, and it is difficult to make reasonable use of the graph information on a large-scale KG through only a few meta-paths.

2.2 Contrastive Learning for Recommender Systems

Contrastive learning [2,17,18] is designed for learning representations by distinguishing between pairs of positive and negative samples. Recently, integrating contrastive learning into RS [15,19,20,22] has demonstrated the potential to enhance RS performance. SGL [15] performs random perturbation for data augmentation at the data level, while SimGCL [20] does so at the representation level. MCCLK [22] considers user and item representation learning from global, local, and semantic views and then uses contrastive learning to explore feature and structural information. KGCL [19] uses contrastive learning to suppress KG noise during information aggregation.

3 Preliminaries

We denote the user set and item set by U = {u1, u2, ..., un} and I = {i1, i2, ..., im}, respectively. O+ = {yui | u ∈ U, i ∈ I} indicates the observed user-item interactions, where yui indicates an interaction between user u and item i. Based on O+, we construct the user-item graph Gu = {V, E}, where the node set is V = U ∪ I and the edge set is E = O+. Gk = {(h, r, t) | h, t ∈ E, r ∈ R} represents the KG, where E and R are the sets of entities and relations, respectively. Each KG triplet (h, r, t) characterizes the semantic correlation between the head entity h and the tail entity t under relation r. As in other knowledge-aware recommendation work, we establish a set of item-entity alignments A = {(i, e) | i ∈ I, e ∈ E}, where (i, e) indicates that item i can be aligned with an entity e in the KG. To be specific, the inputs are the user-item graph Gu and the KG Gk, and the output is a learned function F(u, i | Gu, Gk, θ) that predicts the items i (i ∈ I) that user u (u ∈ U) would like to interact with, where θ denotes the model parameters.
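For concreteness, the inputs defined above can be held in simple containers. The sketch below is only an illustration of the notation; the class and field names are ours, not part of MCKR.

```python
from dataclasses import dataclass
from typing import Dict, List, Set, Tuple

@dataclass
class RecInputs:
    """Inputs of knowledge-aware recommendation: O+, G_k, and the alignment A."""
    interactions: Set[Tuple[int, int]]        # O+ as (user u, item i) pairs
    kg_triples: List[Tuple[int, int, int]]    # G_k as (head h, relation r, tail t)
    item_to_entity: Dict[int, int]            # A: item i -> aligned entity e

    def user_item_graph(self) -> Dict[int, List[int]]:
        """Adjacency lists of the bipartite user-item graph G_u."""
        adj: Dict[int, List[int]] = {}
        for u, i in self.interactions:
            adj.setdefault(u, []).append(i)
        return adj
```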

Fig. 1. The overall architecture of MCKR. The $\mathcal{L}_{cl}^{local}$ and $\mathcal{L}_{cl}^{global}$ SSL tasks share embeddings with the recommendation task and are jointly optimized by multi-task training. KG subgraphs 1 and 2 are obtained by applying the distance-based augmentation method twice. Best viewed in color.

4 Methodology

We design MCKR as shown in Fig. 1. The framework consists of three parts: (1) multi-view generation, (2) local-view contrastive learning, and (3) global-view contrastive learning. MCKR effectively utilizes KG and user-item graph information to enhance recommendation accuracy. The recommendation task of MCKR uses the data-augmented KG as input and adds noise to the embeddings learned on the user-item graph. This is similar to the effect of dropout, where an appropriate amount of noise provides better generalization while preserving most of the semantics. Furthermore, we only add noise during training. During testing, the recommendation task of MCKR uses the original graph and embeddings without noise.

4.1 Multi Views Generation

To preserve the key structural and semantic information of the different graph structures, we employ different data augmentation methods for the user-item graph and the KG.

User-Item View Generation. The user-item graph Gu is a bipartite graph, with edges indicating the user's interest in an item. If the graph structure is augmented with random perturbation, the user embedding no longer represents the user's true interests. To avoid this problem, we use SimGCL [20] to learn the user-item view representations $(z_u^1, z_{i_{ui}}^1)$ and $(z_u^2, z_{i_{ui}}^2)$ from the user-item graph, where $z_u$ denotes the embedding of the user and $z_{i_{ui}}$ denotes the embedding of the item in the user-item graph. SimGCL [20] first utilizes LightGCN [3] to obtain the user-item graph embeddings and then applies random noise for data augmentation. More technical details can be found in the original papers.
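As a rough illustration of this representation-level augmentation, the snippet below adds SimGCL-style random noise to a batch of embeddings. It follows our reading of SimGCL (unit-norm uniform noise scaled by a small ε and sign-aligned with the embedding) and is not the authors' implementation.

```python
import torch
import torch.nn.functional as F

def perturb(emb: torch.Tensor, eps: float = 0.1) -> torch.Tensor:
    """Add SimGCL-style random noise with fixed L2 norm eps to node embeddings."""
    noise = torch.rand_like(emb)                 # uniform noise in [0, 1)
    noise = F.normalize(noise, dim=-1) * eps     # scale noise to norm eps
    return emb + noise * torch.sign(emb)         # keep the sign pattern of emb

# Two perturbed views of the same LightGCN output give the pair (z^1, z^2)
# used for the user-item-view contrastive objective.
emb = torch.randn(8, 64)                         # toy batch of 8 node embeddings
z1, z2 = perturb(emb), perturb(emb)
```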


Knowledge Graph View Generation. We explore two ways to improve the reliability of item representations from the KG: data augmentation and the design of the MCKR framework. (1) Distance-based augmentation: to keep the computational pattern of each batch fixed and efficient, and to handle large-scale KGs, GNN-based knowledge-aware recommendation [10,12] usually samples a fixed-size set of neighbors uniformly at random for each entity instead of using all of its neighbors. This is similar to data augmentation by random perturbation of the KG graph structure. However, different KG triples have different importance, and random edge dropout cannot guarantee the preservation of important information. Therefore, we classify the KG triples and apply different dropout probabilities to different types of triples. (2) Degree-based GCN: in real-world KGs, there is a significant difference in the number of neighbors between unpopular and popular entities, which cannot be eliminated by data augmentation or a fixed neighborhood size. This difference affects the representation of entities. Popular entities cannot accurately represent the items they are connected to, as they are associated with many irrelevant entities. In contrast, unpopular entities can be considered more indicative of the characteristics of their neighboring items. Therefore, we design a degree-based GCN, which considers the degree of the connected tail entities in its weight calculation, thereby assigning higher weights to unpopular entities. We next present the implementation of these two parts in detail.

Distance-Based Augmentation. To retain more reliable KG information before the structure is perturbed, we preprocess the KG and divide the KG triples into item-entity and entity-entity types. Specifically, we set $D_i = 0$ for any item $i$ and use the Breadth-First-Search (BFS) algorithm to precompute the distance $D_e$ of every entity to its nearest item after linking. For a KG triple $T = (h, r, t)$, the distance to the closest item is

$\hat{D}_T = \min(D_h, D_t)$ (1)

where $\hat{D}_T$ is the distance to the closest item for KG triple $T$, and $D_h$, $D_t$ are the distances to the closest item for entities $h$ and $t$. The triple is of the item-entity type if $\hat{D}_T = 0$; otherwise, it is of the entity-entity type. We perform random edge dropout perturbations with different probabilities $\epsilon_1$, $\epsilon_2$ for the two types of triples. Following this method, we perform two data augmentations on the KG and then uniformly and randomly sample a fixed-size set of neighbors for each entity.
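A minimal sketch of the distance-based augmentation described above, assuming the KG is given as a list of (head, relation, tail) triples and the items form a known set of entity IDs; the function names and dropout bookkeeping are our own illustration.

```python
import random
from collections import deque
from typing import Dict, List, Set, Tuple

Triple = Tuple[int, int, int]  # (head, relation, tail)

def entity_distances(triples: List[Triple], items: Set[int]) -> Dict[int, int]:
    """Multi-source BFS from all items: D_e = hop distance to the nearest item."""
    adj: Dict[int, List[int]] = {}
    for h, _, t in triples:
        adj.setdefault(h, []).append(t)
        adj.setdefault(t, []).append(h)
    dist = {i: 0 for i in items}                      # D_i = 0 for items
    queue = deque(items)
    while queue:
        node = queue.popleft()
        for nb in adj.get(node, []):
            if nb not in dist:
                dist[nb] = dist[node] + 1
                queue.append(nb)
    return dist

def distance_based_dropout(triples: List[Triple], dist: Dict[int, int],
                           eps_item: float, eps_entity: float) -> List[Triple]:
    """Drop item-entity triples (D_T = 0) with prob eps_item, others with eps_entity."""
    kept = []
    for h, r, t in triples:
        d_T = min(dist.get(h, 1 << 30), dist.get(t, 1 << 30))   # Eq. (1)
        p_drop = eps_item if d_T == 0 else eps_entity
        if random.random() >= p_drop:
            kept.append((h, r, t))
    return kept
```

Running `distance_based_dropout` twice with the same probabilities yields the two augmented KG subgraphs used as contrastive views.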

Degree-Based GCN. We start by introducing how the degree-based GCN captures first-order neighbor information. Formally, we denote an item as $i$ $(i \in \mathcal{I})$ and represent the set of neighbors $v$ and their corresponding relations $r$ directly connected to $i$ as $N_i = \{(r, v) \,|\, (i, r, v) \in \mathcal{G}_k\}$. We employ the function $c(i, r)$ to compute the score of this relation with the item:

$c(i, r) = \frac{e_i \cdot e_r}{|N_v|}$ (2)

where $e_i$ and $e_r$ represent the original embeddings of the entities with IDs $i$ and $r$, respectively, $\cdot$ denotes the inner product of two embeddings, and $|N_v|$ represents the number of neighbors connected to entity $v$. To mitigate the impact of irrelevant entities introduced by popular entities, when calculating the score of relation $r$ with the central item $i$, the score $c(i, r)$ decays for triples with higher tail entity degrees.


We use the function $\tilde{c}$ to normalize the scores of item-relation pairs:

$\tilde{c}(i, r) = \frac{\exp(c(i, r))}{\sum_{(r,v) \in N_i} \exp(c(i, r))}$ (3)

To obtain the structural information of item $i$, we compute a linear combination of its neighborhood:

$e_{N_i} = \sum_{(r,v) \in N_i} \tilde{c}(i, r) \cdot e_v$ (4)

where $e_{N_i}$ represents the aggregated information from the neighborhood of item $i$, and $e_v$ denotes the original embedding of the entity with ID $v$. By computing the score between item $i$ and relation $r$, we assign different weights to the neighbors $v$. Finally, we combine the current item embedding $e_i$ with the neighborhood embedding $e_{N_i}$ to update the representation of the item, $e_i'$:

$e_i' = \sigma(W \cdot (e_i + e_{N_i}) + b)$ (5)

$e_i'$ aggregates the neighbor information $e_{N_i}$ through a multi-layer perceptron, where $W$ and $b$ are trainable parameters and $\sigma$ is an activation function similar to ReLU. The above process only models first-order neighbors. In order to capture complex semantic relationships and high-order connectivity on the KG, we extend it to multiple hops. Specifically, at the $l$-th order, we expand Eq. (5) to include entities within $l$ hops. The $l$-th order embeddings of item $i$ and entity $v$ are computed as follows:

$e_i^{(l)} = \sigma(W^{(l)} \cdot (e_i^{(l-1)} + e_{N_i}^{(l-1)}) + b^{(l)})$
$e_v^{(l)} = \sigma(W^{(l)} \cdot (e_v^{(l-1)} + e_{N_v}^{(l-1)}) + b^{(l)})$ (6)

where $e_i^{(l-1)}$ and $e_v^{(l-1)}$ represent the $(l-1)$-th order embeddings of item $i$ and entity $v$, respectively, and $W^{(l)}$ and $b^{(l)}$ are trainable parameters. These embeddings can be seen as a mixture of the initial embeddings of the nodes themselves and of their neighbors. We take the $l$-th order embedding $e_i^{(l)}$ of the item as the final representation $z_{i_{kg}}$ learned for the item. Finally, through training on the two augmented KGs, we obtain the representations of the item in the KG view, $(z_{i_{kg}}^1, z_{i_{kg}}^2)$.
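A single-hop PyTorch sketch of Eqs. (2)–(5) for one item and its sampled neighbors; the shapes, names, and choice of ReLU are our simplifications rather than the authors' code.

```python
import torch
import torch.nn as nn

class DegreeBasedAggregator(nn.Module):
    """One hop of Eqs. (2)-(5): score relations, normalize, aggregate, update."""
    def __init__(self, dim: int):
        super().__init__()
        self.linear = nn.Linear(dim, dim)

    def forward(self, e_i, e_r, e_v, tail_degree):
        # e_i: (d,) item embedding; e_r, e_v: (n, d) relation/tail embeddings
        # tail_degree: (n,) number of neighbors |N_v| of each tail entity
        score = (e_r @ e_i) / tail_degree               # Eq. (2): decay by tail degree
        weight = torch.softmax(score, dim=0)            # Eq. (3)
        e_neigh = (weight.unsqueeze(-1) * e_v).sum(0)   # Eq. (4)
        return torch.relu(self.linear(e_i + e_neigh))   # Eq. (5)

# Toy usage: one item with 4 sampled (relation, tail) neighbors.
agg = DegreeBasedAggregator(dim=64)
e_i = torch.randn(64)
e_r, e_v = torch.randn(4, 64), torch.randn(4, 64)
deg = torch.tensor([3., 120., 7., 15.])    # a popular tail entity gets a small weight
h_i = agg(e_i, e_r, e_v, deg)
```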

4.2 Multi-view Contrastive Learning

We use contrastive learning at the local view and the global view to capture features at different granularities.

Local-View Contrastive Learning. After the above training, we obtain three pairs of representations: one pair of user representations and two pairs of item representations. All pairs are processed in the same way; here we take the item pair $(z_{i_{ui}}^1, z_{i_{ui}}^2)$ from the user-item graph view as an example. As the representations of the same item in both contrastive views lie in the same space, we treat the two representations $z_{i_{ui}}^1$ and $z_{i_{ui}}^2$ as a positive sample pair, while the representations of other items paired with this item representation are considered negative pairs. To optimize this contrastive learning objective, we employ the InfoNCE [1] loss to maximize the agreement between positive pairs and minimize the agreement between negative pairs:

$\mathcal{L}_{cl}^{i_{ui}} = -\log \frac{\exp\left(s\left(z_{i_{ui}}^1, z_{i_{ui}}^2\right)/\tau\right)}{\sum_{k \in \mathcal{I}, k \neq i} \exp\left(s\left(z_{i_{ui}}^1, z_{k_{ui}}^2\right)/\tau\right)}$ (7)

where $s$ denotes the cosine similarity between two representations and $\tau$ is a controllable temperature hyperparameter. Similarly, by replacing the item pair with the user pair $(z_u^1, z_u^2)$ or with $(z_{i_{kg}}^1, z_{i_{kg}}^2)$ in Eq. (7), we obtain $\mathcal{L}_{cl}^{u}$ and $\mathcal{L}_{cl}^{i_{kg}}$. Finally, the local-view contrastive learning loss is

$\mathcal{L}_{cl}^{local} = \alpha\left(\mathcal{L}_{cl}^{u} + \mathcal{L}_{cl}^{i_{ui}}\right) + \beta \mathcal{L}_{cl}^{i_{kg}}$ (8)

$\alpha$ and $\beta$ are hyperparameters that control the weights of the contrastive learning losses for the different views. By tuning them, we can adjust the relative contributions of the user-item graph view and the KG view to the overall loss function, which controls the degree of regularization during training.

Global-View Contrastive Learning. Following Sect. 4.1, we obtain two pairs of item representations, $(z_{i_{ui}}^1, z_{i_{ui}}^2)$ and $(z_{i_{kg}}^1, z_{i_{kg}}^2)$. Since the graph structure is modified on the KG, each of $(z_{i_{kg}}^1, z_{i_{kg}}^2)$ represents part of the entity information. By adding the two entity representations, the information of the common nodes is reinforced while each independent piece of information is retained. We use the Hadamard product to fuse the representations from the different views:

$z_{i_{kg}} = z_{i_{kg}}^1 + z_{i_{kg}}^2, \quad z_{i_{global}}^1 = z_{i_{kg}} \odot z_{i_{ui}}^1, \quad z_{i_{global}}^2 = z_{i_{kg}} \odot z_{i_{ui}}^2$ (9)

We also apply the InfoNCE [1] loss to optimize this global-view contrastive learning objective $\mathcal{L}_{cl}^{global}$, with $(z_{i_{global}}^1, z_{i_{global}}^2)$ as a positive pair and different items serving as negative pairs. The specific formula is similar to Eq. (7).

4.3 Model Predict

We combine the user representations $(z_u^1, z_u^2)$ from the user-item view generation and the two item representations $(z_{i_{global}}^1, z_{i_{global}}^2)$ to obtain the final user and item representations $(z_u^*, z_i^*)$. We then use the inner product as the prediction score $\hat{y}(u, i)$:

$z_u^* = z_u^1 \odot z_u^2, \quad z_i^* = z_{i_{global}}^1 \odot z_{i_{global}}^2, \quad \hat{y}(u, i) = {z_u^*}^{\top} z_i^*$ (10)

4.4 Multi-task Training

To integrate the recommendation task with the SSL task, we employ multi-task training to optimize the entire framework. For the recommendation task, we adopt the Bayesian Personalized Ranking (BPR) loss, which aims to maximize the margin between the observed user-item interaction scores and the unobserved ones:

$\mathcal{L}_{main} = \sum_{(u,i,j) \in O} -\log \sigma\left(\hat{y}_{ui} - \hat{y}_{uj}\right)$ (11)

where $O = \{(u, i, j) \,|\, (u, i) \in O^+, (u, j) \in O^-\}$ and $O^- = \mathcal{U} \times \mathcal{I} \setminus O^+$ is the set of unobserved interactions. The loss function for the SSL task comprises the local-view loss $\mathcal{L}_{cl}^{local}$ and the global-view loss $\mathcal{L}_{cl}^{global}$. The overall optimization loss of MCKR is

$\mathcal{L} = \mathcal{L}_{main} + \mathcal{L}_{cl}^{local} + \gamma \mathcal{L}_{cl}^{global} + \lambda \|\theta\|_2^2$ (12)

where $\gamma$ controls the weight of the global-view SSL task, $\theta$ represents the framework parameters for the recommendation task, and $\lambda$ modulates the strength of the L2 regularization.
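To make the training objectives concrete, the sketch below implements an in-batch InfoNCE loss in the spirit of Eq. (7) (for simplicity the positive pair is kept in the denominator, which differs slightly from the equation), the BPR loss of Eq. (11), and the combined loss of Eq. (12); all names and default values are illustrative.

```python
import torch
import torch.nn.functional as F

def info_nce(z1, z2, tau=0.2):
    """In-batch InfoNCE with cosine similarity and temperature tau (cf. Eq. (7))."""
    z1, z2 = F.normalize(z1, dim=-1), F.normalize(z2, dim=-1)
    sim = z1 @ z2.t() / tau                       # (B, B) pairwise similarities
    labels = torch.arange(z1.size(0), device=z1.device)
    return F.cross_entropy(sim, labels)           # positives lie on the diagonal

def bpr_loss(pos_scores, neg_scores):
    """Bayesian Personalized Ranking loss of Eq. (11)."""
    return -torch.log(torch.sigmoid(pos_scores - neg_scores)).mean()

def total_loss(l_main, l_local, l_global, params, gamma=0.05, lam=1e-5):
    """Eq. (12): main loss + local SSL + weighted global SSL + L2 regularization."""
    l2 = sum(p.pow(2).sum() for p in params)
    return l_main + l_local + gamma * l_global + lam * l2
```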

5 Experiment

To evaluate the performance of the MCKR framework, we conduct extensive experiments and provide detailed analyses to answer the following research questions:
1. How does MCKR compare to other state-of-the-art models?
2. How do the framework and its key modules impact performance?
3. How effective is MCKR in recommendation under KG noise?

5.1 Datasets and Experiment Settings

As shown in Table 1, we use the following three datasets in our experiments: Book-Crossing, MovieLens-1M, and Alibaba-iFashion [10,14]. The datasets are publicly available and come from different domains with different sizes and sparsity, strengthening the reliability and robustness of our experimental results. To evaluate top-K recommendation, we use two commonly used metrics [13,14], Recall@N and NDCG@N, with N set to 20 as the number of items to be ranked. The evaluation is carried out for all users in the test set, and the final result is the average metric. To verify the superiority of our framework, we compare the proposed MCKR with state-of-the-art baselines. As shown in Table 2, they are BPRMF [8], CKE [21], RippleNet [10], KGCN [12], KGAT [13], KGIN [14], and MCCLK [22]. We conduct our experiments on the unified benchmarking platform RecBole [16], an open-source framework. To ensure a fair comparison, we optimize all methods using the Adam optimizer with a batch size of 4096 and a learning rate of 0.001. All parameters are initialized using the default Xavier distribution, and the embedding size is set to 64. We use an early stopping strategy of 10 epochs to prevent overfitting and use Recall@20 as the validation metric. The hyperparameters of all baselines are set through empirical studies or by following the original papers.

Table 1. Summary of the recommendation datasets used in the experiments.

                                       Book-Crossing  MovieLens-1M  Alibaba-iFashion
User-Item Interaction  # Users             17,860         6,036        114,737
                       # Items             14,967         2,445         30,040
                       # Interactions     139,746       753,772      1,781,093
Knowledge Graph        # Entities          77,903       182,011         59,156
                       # Relations             25            12             51
                       # Triples           151,500     1,241,995        279,155

5.2 Performance Comparison (RQ1)

We report the overall performance evaluation of all methods in Table 2, and our observations are summarized below.

Table 2. Overall performance comparison.

           Book-Crossing        MovieLens-1M         Alibaba-iFashion
           Recall@20  NDCG@20   Recall@20  NDCG@20   Recall@20  NDCG@20
BPR        0.0388     0.0212    0.1456     0.1384    0.0997     0.0398
CKE        0.0441     0.0376    0.1481     0.1449    0.1058     0.0411
RippleNet  0.0613     0.0428    0.1535     0.1457    0.1124     0.0507
KGCN       0.0642     0.0463    0.1528     0.1486    0.1152     0.0513
KGAT       0.0608     0.0448    0.1521     0.1473    0.1149     0.0507
KGIN       0.0692     0.0508    0.1542     0.1488    0.1258     0.0546
MCCLK      0.0714     0.0531    0.1598     0.1513    0.1294     0.0574
MCKR       0.0759     0.0545    0.1619     0.1548    0.1299     0.0580

– Compared with the strongest baseline MCCLK, MCKR improves the Recall metric by 6.30%, 1.31%, and 0.39% on the three datasets, respectively. These improvements demonstrate the effectiveness of the MCKR framework.
– MCKR exhibits the most remarkable improvement on the Book-Crossing dataset. We attribute this to a possible factor: the KG of this dataset is sparser, and MCKR's contrastive learning helps mitigate this sparsity and capture more reliable information.
– The contrastive learning methods, MCKR and MCCLK [22], outperform GNN-based knowledge-aware recommendation, which shows that contrastive learning helps improve the performance of RS.
– GNN-based knowledge-aware recommendation performs better than CKE [21], indicating the importance of high-order connectivity on the KG; multi-hop information on the KG greatly improves RS performance.

5.3 Ablation Studies and Sensitivity Analysis (RQ2)

In order to evaluate the effectiveness of the framework and the contributions of its main components to the final performance, we design several variants to compare with MCKR, and the specific results are shown in Table 3.

Table 3. Effect of ablation study.

                Book-Crossing        MovieLens-1M         Alibaba-iFashion
                Recall@20  NDCG@20   Recall@20  NDCG@20   Recall@20  NDCG@20
MCKR w/o noise  0.0728     0.0516    0.1551     0.1487    0.1255     0.0563
MCKR w/o GC     0.0744     0.0528    0.1608     0.1539    0.1265     0.0571
MCKR w/o KGC    0.0703     0.0514    0.1582     0.1516    0.1238     0.0553
MCKR w/o UIC    0.0732     0.0524    0.1557     0.1491    0.1189     0.0532
MCKR            0.0759     0.0545    0.1619     0.1548    0.1299     0.0580

– MCKR w/o noise: As introduced in Sect. 1, this variant has an inconsistency between tasks. Specifically, it discards the noise in the recommendation task while the SSL task remains unchanged: the input of the recommendation task is replaced by user-item view embeddings trained with LightGCN [3] and KG view embeddings trained on the KG without data augmentation.
– MCKR w/o GC: Discards the global-view contrastive learning module; the information from the two graphs is simply aggregated for prediction.
– MCKR w/o KGC: Discards the local-view contrastive learning on the KG and uses a plain GCN encoder to obtain neighborhood information.
– MCKR w/o UIC: Discards the local-view contrastive learning on the user-item graph and uses LightGCN [3] to obtain user and item representations.

From the results in Table 3, MCKR outperforms the MCKR w/o noise variant, demonstrating that the inconsistency between the recommendation task and the SSL task limits the framework's ability to generalize and to effectively capture the underlying information. Additionally, MCKR performs better than the variants with critical modules removed, indicating that the excellent performance of MCKR is closely related to each module and that our proposed distance-based augmentation method, degree-based GCN, and multi-view SSL tasks are all suitable for recommender systems.

We further investigate the impact of different hyperparameters on framework performance, as shown in Fig. 2. We study the hyperparameters $\alpha$, $\beta$ in Eq. (8) and $\gamma$ in Eq. (12) by altering their values within {0.1, 0.05, 0.01, 0.005, 0.001, 0.005} on the MovieLens-1M dataset. The results reveal that the optimal weights differ among the SSL tasks and the main task loss: $\alpha$ performs best at 0.1, while $\beta$ and $\gamma$ peak at 5e−3 and 5e−2, respectively. We attribute this to a possible factor: the user-item graph contrastive learning method simultaneously enhances both user and item representations.


Fig. 2. Impact of hyperparameters α, β, and γ on the MovieLens-1M dataset.

Fig. 3. Impact of the random edge dropout probabilities $\epsilon_1$, $\epsilon_2$ for the two KG triple types on the MovieLens-1M dataset.

We also explore random edge dropout probabilities $\epsilon_1$, $\epsilon_2$ (as introduced in Sect. 4.1) in {0.1, 0.2, 0.3, 0.4, 0.5} for the two types of triples during the KG data augmentation phase on the MovieLens-1M dataset. As shown in Fig. 3, the KG triples connecting items and entities are more important: $\epsilon_1$ achieves optimal performance at 0.2, while $\epsilon_2$ achieves optimal performance at 0.3.

5.4 Benefits of MCKR in Alleviating Data Noise Effect (RQ3)

To evaluate the impact of KG noise on different models, we generate a set of noise triples by randomly selecting head entities, tail entities, and relations, and add them to the test set of each dataset. The number of noise triples is set to 10% of the total number of triples in each dataset. The experimental results are presented in Table 4. MCKR outperforms all baselines in all settings, demonstrating its robustness against noise and its ability to utilize KG information effectively. Compared with MCCLK [22], the distance-based augmentation and degree-based GCN enable MCKR to retain valuable information and preserve high-order connectivity. In contrast, a model that relies only on attention for aggregating neighbor information introduces more noise into the final item representation, thereby affecting RS performance.

Table 4. Benefits of MCKR in alleviating the data noise effect.

           Book-Crossing        MovieLens-1M         Alibaba-iFashion      Avg. Dec.
           Recall@20  NDCG@20   Recall@20  NDCG@20   Recall@20  NDCG@20
RippleNet  0.0539     0.0375    0.1384     0.1319    0.1010     0.0452    −10.79%
KGCN       0.0588     0.0423    0.1416     0.1382    0.1065     0.0470    −7.88%
KGAT       0.0546     0.0407    0.1405     0.1356    0.1054     0.0461    −8.73%
KGIN       0.0665     0.0488    0.1509     0.1445    0.1213     0.0525    −3.37%
MCCLK      0.0690     0.0512    0.1559     0.1478    0.1265     0.0558    −2.78%
MCKR       0.0738     0.0529    0.1594     0.1527    0.1274     0.0567    −2.12%

6 Conclusion

In this paper, we propose a novel knowledge-aware recommendation framework, MCKR, and investigate its ability to improve the performance of RS and reduce the noise in the KG. We adopt distance-based augmentation and design a degree-based GCN to capture more reliable information for the items. We evaluate MCKR on several real-world public datasets against state-of-the-art methods and demonstrate its superiority.

Acknowledgments. This work is jointly supported by the National Natural Science Foundation of China (61877043) and the National Natural Science Foundation of China (61877044).

References

1. Chen, T., Kornblith, S., Norouzi, M., Hinton, G.: A simple framework for contrastive learning of visual representations. In: International Conference on Machine Learning, pp. 1597–1607. PMLR (2020)
2. Chen, X., Xie, S., He, K.: An empirical study of training self-supervised vision transformers. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 9640–9649 (2021)
3. He, X., Deng, K., Wang, X., Li, Y., Zhang, Y., Wang, M.: LightGCN: simplifying and powering graph convolution network for recommendation. In: Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 639–648 (2020)
4. Hu, B., Shi, C., Zhao, W.X., Yu, P.S.: Leveraging meta-path based context for top-n recommendation with a neural co-attention model. In: Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp. 1531–1540 (2018)
5. Ji, G., He, S., Xu, L., Liu, K., Zhao, J.: Knowledge graph embedding via dynamic mapping matrix. In: Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 1: Long Papers), pp. 687–696 (2015)
6. Lin, Y., Liu, Z., Sun, M., Liu, Y., Zhu, X.: Learning entity and relation embeddings for knowledge graph completion. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 29 (2015)
7. Pujara, J., Augustine, E., Getoor, L.: Sparsity and noise: where knowledge graph embeddings fall short. In: Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, pp. 1751–1756 (2017)
8. Rendle, S., Freudenthaler, C., Gantner, Z., Schmidt-Thieme, L.: BPR: Bayesian personalized ranking from implicit feedback. arXiv preprint arXiv:1205.2618 (2012)
9. Sun, Z., Yang, J., Zhang, J., Bozzon, A., Huang, L.K., Xu, C.: Recurrent knowledge graph embedding for effective recommendation. In: Proceedings of the 12th ACM Conference on Recommender Systems, pp. 297–305 (2018)
10. Wang, H., et al.: RippleNet: propagating user preferences on the knowledge graph for recommender systems. In: Proceedings of the 27th ACM International Conference on Information and Knowledge Management, pp. 417–426 (2018)
11. Wang, H., Zhang, F., Xie, X., Guo, M.: DKN: deep knowledge-aware network for news recommendation. In: Proceedings of the 2018 World Wide Web Conference, pp. 1835–1844 (2018)
12. Wang, H., Zhao, M., Xie, X., Li, W., Guo, M.: Knowledge graph convolutional networks for recommender systems. In: The World Wide Web Conference, pp. 3307–3313 (2019)
13. Wang, X., He, X., Cao, Y., Liu, M., Chua, T.S.: KGAT: knowledge graph attention network for recommendation. In: Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp. 950–958 (2019)
14. Wang, X., et al.: Learning intents behind interactions with knowledge graph for recommendation. In: Proceedings of the Web Conference 2021, pp. 878–887 (2021)
15. Wu, J., et al.: Self-supervised graph learning for recommendation. In: Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 726–735 (2021)
16. Xu, L., et al.: Recent advances in RecBole: extensions with more practical considerations (2022)
17. Yang, X., Zhang, X., Zhang, Z., Zhao, Y., Cui, R.: DTWSSE: data augmentation with a Siamese encoder for time series. In: U, L.H., Spaniol, M., Sakurai, Y., Chen, J. (eds.) Web and Big Data: 5th International Joint Conference, APWeb-WAIM 2021, Guangzhou, China, 23–25 August 2021, Proceedings, Part I. LNCS, vol. 12858, pp. 435–449. Springer, Cham (2021). https://doi.org/10.1007/978-3-030-85896-4_34
18. Yang, X., Zhang, Z., Cui, R.: TimeCLR: a self-supervised contrastive learning framework for univariate time series representation. Knowl.-Based Syst. 245, 108606 (2022)
19. Yang, Y., Huang, C., Xia, L., Li, C.: Knowledge graph contrastive learning for recommendation. In: Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 1434–1443 (2022)
20. Yu, J., Yin, H., Xia, X., Chen, T., Cui, L., Nguyen, Q.V.H.: Are graph augmentations necessary? Simple graph contrastive learning for recommendation. In: Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 1294–1303 (2022)
21. Zhang, F., Yuan, N.J., Lian, D., Xie, X., Ma, W.Y.: Collaborative knowledge base embedding for recommender systems. In: Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 353–362 (2016)
22. Zou, D., et al.: Multi-level cross-view contrastive learning for knowledge-aware recommender system. In: Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 1358–1368 (2022)

PYGC: A PinYin Language Model Guided Correction Model for Chinese Spell Checking

Haoping Chen(B) and Xukai Wang

Shanghai Jiao Tong University, Shanghai, China
{apple chen,wangxukai}@sjtu.edu.cn

Abstract. Chinese Spell Checking (CSC) is an NLP task that detects and corrects erroneous characters in Chinese texts. Since people often use pinyin (pronunciation of Chinese characters) input methods or speech recognition to type text, most of these errors are misuse of phonetically or semantically similar characters. Previous attempts fuse pinyin information into the embedding layer of pre-trained language models. However, although they can learn from phonetic knowledge, they can not make good use of this knowledge for error correction. In this paper, we propose a PinYin language model Guided Correction model (PYGC), which regards the Chinese pinyin sequence as an independent language model. Our model builds on two parallel transformer encoders to capture pinyin and semantic features respectively, with a late fusion module to fuse these two hidden representations to generate the final prediction. Besides, we perform an additional pronunciation prediction task on pinyin embeddings to ensure the reliability of the pinyin language model. Experiments on the widely used SIGHAN benchmark and a newly released CSCD-IME dataset with mainly pinyin-related errors show that our method outperforms current state-of-the-art approaches by a remarkable margin. Furthermore, isolation tests demonstrate that our model has the best generalization ability on unseen spelling errors. (Code available in https://github.com/Imposingapple/PYGC)

Keywords: Natural Language Understanding · Chinese Spell Checking · Pinyin Language Model

1 Introduction

Chinese Spell Checking (CSC) is a fundamental NLP task that aims to detect spelling errors in Chinese texts and correct them. Spelling errors mainly come from human-generated errors, e.g., handwriting or typing errors, and machine-generated errors, e.g., errors caused by optical character recognition (OCR) or automatic speech recognition (ASR) systems [9,11]. In alphabetic languages such as English, where word boundaries are obvious, spelling errors are mainly due to character recognition errors, resulting in "out of dictionary" errors. Chinese spelling errors occur at the level of Chinese characters, where individual characters are miswritten as other phonologically or visually similar characters. Previous work has shown that 83% of Chinese spelling mistakes are phonetic [12]. Thus, the error correction ability of a language model can be improved effectively by using the information of pinyin (the pronunciation of Chinese characters). The latest CSC approaches [1,6,9,14,24] consider integrating phonetic information into large pre-trained language models [4,16,18]. However, although they can learn from phonetic knowledge, they cannot make good use of this knowledge for error correction [33], and they also lack generalization ability in real scenarios [8]. Table 1 shows two examples. In the first example, the CSC model utilizes phonetic information to output "高", which is phonetically similar to the input character "诰", and "高贵" constitutes a common Chinese word. However, the output does not match the semantics of the whole sentence. In the second example, the misspelling "持刀" is a valid Chinese word and has similar pinyin to the ground truth "迟到". The model cannot utilize semantics for error correction in this case. The second example is very close to real-world typing with pinyin input methods, where most spelling errors show up as replacing one word with another word with similar pronunciation.

Table 1. Examples of CSC results. Pinyin for each Chinese character is composed of three units: initial, final, and tone. The wrong/predicted/correct characters with their pinyin are in red/blue/green. "Model Output" means the prediction results of PLOME, a strong CSC model which incorporates pinyin information.

Example 1 (pinyin units)
Input: g,ao,4   Model Output: g,ao,1   Gold: g,ao,4
Translation: We need to sign a contract to sue your factory. Please bring a lawyer to help us handle this.

Example 2 (pinyin units)
Input: ch,i,2; d,ao,1   Model Output: ch,i,2; d,ao,1   Gold: ch,i,2; d,ao,4
Translation: ...Don't be late again! If you do that again, I won't let you in the classroom...

In this paper, we propose PYGC, a PinYin language model Guided Correction model for CSC. Figure 1 provides an overview of PYGC. Our core idea is that the pinyin sequence corresponding to a Chinese character sequence is itself a language model; that is, the pinyin at the current position can be inferred from the pinyin context. Therefore, we model the pinyin information as a language model as well. Our model has two parallel encoders, one for encoding pinyin sequences and the other for encoding Chinese character sequences. To make better use of pinyin information to improve the final CSC result, we introduce a fusion module to mix the Chinese character representation at each position with the hidden representations of the three corresponding pinyin units. We decode the mixed representations to obtain the final output. Moreover, a confusion set based masking mechanism [14,30] adapted to our model is used in the pre-training stage, and an iterative prediction strategy is also adopted to improve CSC results [11]. We conduct experiments on the widely used SIGHAN benchmark dataset [19] and the newly released CSCD-IME dataset [8]. Experimental results show that our model establishes new state-of-the-art performance on the SIGHAN and CSCD-IME benchmarks. An isolation test on the SIGHAN dataset is conducted to measure the generalization ability of CSC models, and our model outperforms the currently best-performing models by over 5%. Ablation studies further demonstrate the effectiveness of our fusion module and pinyin language model modeling. The main contributions of our paper are summarized below:
(1) We innovatively model pinyin information separately as a language model to enhance the results of the CSC model.
(2) We propose our model PYGC, which has two parallel encoders, respectively encoding the pinyin sequence and the Chinese characters, and a fusion module to combine the two features for prediction. The performance is further improved by adding a pinyin prediction task, adopting a confusion set based masking pre-training strategy, and using an iterative decoding strategy.
(3) We establish state-of-the-art performance on the widely used SIGHAN and newly released CSCD-IME datasets. The isolation test demonstrates that our model has strong generalization ability.

2 Related Work

The task of Chinese Spelling Checking consists of two sub-tasks: error detection, which locates the erroneous positions in a sentence, and error correction, which corrects the errors based on the detection. Traditional CSC methods can be unified under one framework with three steps: error detection, candidate generation, and evaluation of the candidates. Error detection can be done by maximum entropy classification [5] or by statistical methods such as a bidirectional character-level N-gram language model [28]. After the erroneous positions are found, characters in the confusion set of the wrong character (Chinese characters similar to it [10,23]) are regarded as the most likely options. With the help of a confusion set, rules [15] or word dictionaries and statistical models are then applied to generate reasonable candidates [2,7,26]. Finally, language models [2,15,26,27] or classifiers [15,32] are generally used to evaluate the correction candidates.

With the development of deep learning, transformer-based models have achieved remarkable improvements on CSC tasks compared with traditional methods. A mainstream line of CSC work considers integrating phonetic (pinyin) and glyph information into deep neural models to improve spelling error correction, which can be divided into three types: (1) modification of the masking strategy in pre-training, e.g., using a character from the masked token's confusion set to replace the original [MASK] token in the pre-training task [11,14,30]; (2) modeling at the embedding level, e.g., phonetic and glyph information is first extracted by a feature extractor and then added to the word embedding in the embedding layer [9,14,24,25]; (3) modeling at the decoding level, e.g., restricting the candidate generated by the decoder to the confusion set of the input character [22], selecting Chinese characters by considering phonetic and glyph information comprehensively [6], or using a GRU to fuse the features of each character with those of visually or phonetically similar characters and using the fused embedding in the decoder [1]. To better utilize pinyin information, we propose PYGC with two parallel encoders that capture pinyin and semantic features, respectively. Multi-task learning and a modified masking strategy in pre-training are also used to further improve CSC performance.

3 Methods

This section presents the CSC problem formulation and then describes our PYGC method in detail.

3.1 Problem Formulation

The Chinese spelling check (CSC) task aims to detect and correct spelling errors in Chinese texts. Given a text sequence $X = \{x_1, x_2, \ldots, x_n\}$ containing $n$ characters, a CSC model is expected to detect and correct potential error characters in $X$ and output a sequence $Y = \{y_1, y_2, \ldots, y_n\}$ of the same length. The task can be formulated as a conditional generation problem that maximizes the conditional probability $P(Y|X)$.

3.2 Architecture

In addition to the original semantic encoder, a parallel pinyin encoder is constructed to model the pinyin language model. A fusion module is then used to combine the features of the two language models for the final prediction. Confusion set based pre-training and multi-task learning are utilized to further improve model performance. Figure 1 shows the overall architecture of PYGC.

Semantic Encoder. We use BERT [4] as the backbone of our semantic encoder. The BERT model is a stack of Transformer layers, each containing a multi-head attention module and a feed-forward layer with residual connections. It is pre-trained on large corpora and can provide rich contextual features for the input tokens. Given input tokens $X = \{x_1, x_2, \ldots, x_n\}$, they are first projected into $H_0^s$ by the embedding layer. Then, the computation of each Transformer layer can be formulated as follows:

$H_l^s = \mathrm{Transformer}_l\left(H_{l-1}^s\right), \quad l \in [1, L]$ (1)


Fig. 1. Overview of the proposed PYGC. The input sentence has a spelling error “天” (trans: day, pinyin: tian1), which should be “点” (trans: o’clock, pinyin: dian3) in ground truth. The error token and its corresponding error pinyin units are marked in red, while the ground truth is marked in green. Left: Input characters are processed by the semantic encoder, and pinyin of the input characters are processed by the pinyin encoder. Right: representations obtained by the two encoders are fused by a fusion module to make final prediction.

where L is the total number of Transformer layers. The output of the final layer is used as the semantic representation of each token: H^s = H^s_L = {h^s_1, h^s_2, ..., h^s_n}.

Pinyin Encoder. Hanyu Pinyin (pinyin) is the internationally recognized standard for romanizing Chinese, used to spell Chinese characters. The pinyin of each Chinese character is composed of three units: initial, final, and tone. Initials and finals are composed of Latin letters, while there are five kinds of tones, mapped to the numbers {1, 2, 3, 4, 5}. Take the character "天" (pinyin: "tian1") as an example: its initial, final, and tone are "t", "ian", and "1", respectively. In our paper, we obtain the pinyin sequence corresponding to the input token sequence X through pypinyin (https://pypi.org/project/pypinyin):

X^p = {x_{1,initial}, x_{1,final}, x_{1,tone}, x_{2,initial}, ..., x_{n,initial}, x_{n,final}, x_{n,tone}}

Similar to the semantic encoder, X^p is first mapped to H^p_0 by the embedding layer. It then passes through a stack of Transformer layers to obtain the representation of each pinyin unit: H^p = {h^p_1, h^p_2, ..., h^p_{3n}}. Since each character corresponds to three pinyin units, the length of H^p is 3n.
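A minimal sketch (not the authors' code) of how characters can be decomposed into initial/final/tone units with the pypinyin package mentioned above; the exact decomposition rules used in the paper may differ, and the tone-5 fallback for neutral tones is an assumption.

```python
# Illustrative decomposition of Chinese characters into (initial, final, tone)
# pinyin units using pypinyin; not the authors' implementation.
from pypinyin import pinyin, Style

def to_pinyin_units(text):
    """Return a flat list of initial, final, tone units, three per character."""
    initials = pinyin(text, style=Style.INITIALS, strict=False)
    finals = pinyin(text, style=Style.FINALS, strict=False)
    tones = pinyin(text, style=Style.TONE3)          # e.g. 'tian1'
    units = []
    for ini, fin, ton in zip(initials, finals, tones):
        last = ton[0][-1]
        tone = last if last.isdigit() else "5"        # assume 5 = neutral tone
        units.extend([ini[0], fin[0], tone])
    return units

print(to_pinyin_units("天"))  # expected: ['t', 'ian', '1']
```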


Fusion Module. With the semantic encoder and the pinyin encoder, we obtain the semantic representation H^s and the pinyin representation H^p of the input sequence, respectively. We use a fusion module to mix the information of the two parts. Specifically, the representations of the initial/final/tone pinyin units corresponding to the i-th token in X are the (3i)-th, (3i+1)-th, and (3i+2)-th entries of H^p. We first concatenate these three representations in H^p and map them into the mixed pinyin representation of the i-th token:

h^{mp}_i = W^p · [h^p_{3i}, h^p_{3i+1}, h^p_{3i+2}] + b^p    (2)

We then concatenate the mixed pinyin representation with the semantic representation h^s_i and project it into the fused representation of each input token:

h^f_i = LN(W^f · [h^{mp}_i, h^s_i] + b^f)    (3)

Here W^p ∈ R^{D×3D^p}, W^f ∈ R^{D×2D}, b^p ∈ R^D, b^f ∈ R^D are learnable parameters, D and D^p are the dimensions of the semantic and pinyin representations respectively, [·] denotes vector concatenation, and LN(·) denotes layer normalization.

Output Layer. As shown in Fig. 1, our model makes two predictions.

Prediction for CSC. We take the fused representation of each position from the fusion module and make predictions for the ground truth sequence Y. Specifically, for the i-th position, we first transform the fused embedding h^f_i into the transformed feature space:

h^t_i = LN(GeLU(W^t · h^f_i + b^t))    (4)

Then, after a linear mapping and a Softmax operation, the corresponding target token is predicted:

p(ŷ_i | X) = Softmax(W^o · h^t_i + b^o)    (5)

Here W^t ∈ R^{D×D}, W^o ∈ R^{V×D}, b^t ∈ R^D, b^o ∈ R^V are learnable parameters, D is the dimension of the feature space, V is the vocabulary size, and ŷ_i is the CSC prediction for the i-th token.

Prediction for Pronunciation. To learn information from the erroneous input sequence, as shown in the middle part of Fig. 1, we make predictions for each pinyin unit on top of the pinyin encoder. Similar to the prediction for CSC, for each pinyin unit representation h^p_i we also apply a transformation and a classification operation:

h^{pt}_i = LN(GeLU(W^{pt} · h^p_i + b^{pt}))    (6)

p(ŷ^p_i | X^p) = Softmax(W^{op} · h^{pt}_i + b^{op})    (7)

Here W^{pt} ∈ R^{D^p×D^p}, W^{op} ∈ R^{V^p×D^p}, b^{pt} ∈ R^{D^p}, b^{op} ∈ R^{V^p} are learnable parameters, D^p is the dimension of the pinyin representations, V^p is the total number of pinyin units (69 in our experiment), and ŷ^p_i is the pinyin prediction for the i-th pinyin unit.

3.3 Multitask Learning

To train our model, we simultaneously optimize the CSC prediction loss L_c and the pinyin prediction loss L_p:

L_c = -Σ_{i}^{n} p(ŷ_i = y_i | X)    (8)


Table 2. Examples of different masking strategies. The chosen token to be masked is green, while the replaced token is red. "[M]" is the Mask token.

Masking strategy  | Pinyin units
Original Sentence | sh,eng,4
Bert Masking      | [M],[M],[M]
Complete Masking  | ch,eng,2
Semantic Masking  | sh,eng,4
Pinyin Masking    | ch,eng,2
Random Masking    | f,ei,1

L_p = -Σ_{i}^{3n} p(ŷ^p_i = y^p_i | X^p)    (9)

Here n is the length of the input sentence X, so the length of the input pinyin sequence X^p is 3n, and y_i and y^p_i are the ground-truth CSC label and pinyin unit, respectively. We linearly combine the two losses as the objective function:

L = λL_c + (1 − λ)L_p    (10)

where λ ∈ [0, 1] is the coefficient that balances the CSC prediction loss and the pinyin prediction loss.
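A hedged sketch of the joint objective in Eq. (10), using standard cross-entropy as the per-token loss surrogate; the tensor names and shapes are assumptions about the two prediction heads, not the authors' code.

```python
# Combine the CSC loss and the pinyin loss as in Eq. (10).
import torch.nn.functional as F

def multitask_loss(csc_logits, csc_labels, pinyin_logits, pinyin_labels, lam=0.5):
    # csc_logits: (batch, n, vocab); pinyin_logits: (batch, 3n, num_pinyin_units)
    l_c = F.cross_entropy(csc_logits.flatten(0, 1), csc_labels.flatten())
    l_p = F.cross_entropy(pinyin_logits.flatten(0, 1), pinyin_labels.flatten())
    return lam * l_c + (1.0 - lam) * l_p
```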

3.4 Confusion Set Based Pre-training Strategy

In order to obtain a better initialization for downstream tasks, BERT introduced the MLM pre-training task: 15% of the tokens in the input sentences are randomly selected, replaced with the "[MASK]" token, and a classifier then predicts their original tokens. In the CSC task, however, the input sentence does not contain "[MASK]"; it contains characters that are visually or phonetically close to the ground-truth characters, which is inconsistent with the pre-training task. Previous works have shown that replacing the selected tokens with Chinese characters from the confusion set, instead of the "[MASK]" token, during pre-training yields better initialization points [14,30]. Since our PYGC model has an additional pinyin language model, we correspondingly propose a fine-grained masking strategy. As shown in Table 2, for each selected Chinese character we perform: (1) in 40% of cases, complete masking: replace both its semantic and pinyin tokens with those of another character from its confusion set; (2) in 20% of cases, semantic masking: leave the pinyin tokens unchanged, while the semantic token is replaced by another character from its confusion set; (3) in 20% of cases, pinyin masking: the semantic token remains unchanged, and the pinyin tokens are replaced by the pinyin units of another character from its confusion set; (4) in 10% of cases, random masking: the semantic and pinyin tokens are replaced with those of another random Chinese character; (5) the remaining 10% of cases are left unchanged.
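An illustrative sketch of the 40/20/20/10/10 corruption scheme above. The helper arguments (confusion_set, pinyin_of, random_char) are assumed utilities, not part of the paper.

```python
# Fine-grained confusion-set masking for one selected character; hedged sketch.
import random

def corrupt(char, pinyin_units, confusion_set, pinyin_of, random_char):
    """confusion_set: dict char -> list of similar chars; pinyin_of: char -> units."""
    r = random.random()
    if r < 0.40:                                    # complete masking
        c = random.choice(confusion_set[char])
        return c, pinyin_of(c)
    elif r < 0.60:                                  # semantic masking: keep pinyin
        return random.choice(confusion_set[char]), pinyin_units
    elif r < 0.80:                                  # pinyin masking: keep character
        return char, pinyin_of(random.choice(confusion_set[char]))
    elif r < 0.90:                                  # random masking
        c = random_char()
        return c, pinyin_of(c)
    return char, pinyin_units                       # unchanged
```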


Table 3. The statistics of the SIGHAN datasets and the CSCD-IME dataset, including the number of train/dev/test sentences, the average length per sentence, and the average number of spelling errors per sentence.

Dataset   | #Train  | #Dev | #Test | Avg Len. | Avg #Err.
SIGHAN13  | 700     | /    | /     | 41.8     | 0.49
SIGHAN14  | 3437    | /    | /     | 49.6     | 1.49
SIGHAN15  | 2339    | /    | 1100  | 31.1     | 1.09
Wang271K  | 271,329 | /    | /     | 42.6     | 1.41
CSCD-IME  | 30000   | 5000 | 5000  | 57.4     | 0.50

3.5 Predict Inference

According to [13,20], current state-of-the-art CSC models are non-autoregressive and assume that the predicted labels are generated independently at all positions. However, this assumption breaks down in sentences with multiple errors, especially consecutive errors, leading to correction failures. We therefore adopt the iterative correction method used in [11], which, in each iteration, takes the previous iteration's output as the input, and obtain better results.
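A minimal sketch of the iterative correction loop described above; model_correct is an assumed single-pass correction function, and the stopping criterion (output no longer changes, capped by max_rounds) is an illustrative choice.

```python
# Iterative decoding: feed the previous round's output back into the model.
def iterative_correct(sentence, model_correct, max_rounds=3):
    current = sentence
    for _ in range(max_rounds):
        corrected = model_correct(current)
        if corrected == current:       # converged: no further edits proposed
            break
        current = corrected
    return current
```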

4 Experiments

4.1 Experimental Setup

Datasets. We fine-tune on the SIGHAN datasets and the CSCD-IME dataset respectively:
1. SIGHAN: Following previous works [1,14,24], the training data contains manually annotated samples from SIGHAN13 [23], SIGHAN14 [29], and SIGHAN15 [19], as well as 271K samples automatically generated by OCR or ASR methods [21]. We use the SIGHAN15 test set for evaluation. As in previous works [1,24], we use OpenCC (https://github.com/BYVoid/OpenCC) to convert the SIGHAN datasets into simplified Chinese.
2. CSCD-IME: We adopt the manually labeled splits of the newly released CSCD-IME dataset [8]. The CSCD-IME dataset is of higher quality, and its errors come primarily from pinyin input methods.
Table 3 shows the statistics of the datasets used.

Evaluation Metrics. We report sentence-level precision, recall, and F1 scores to evaluate the different models. The metrics are reported for both the detection and the correction task. For SIGHAN, we use the widely used evaluation scripts from [1]; for CSCD-IME, we apply the modified version proposed by the authors of the dataset [8].

Baselines. We compare our method with the following strong baselines:
– Bert [4] directly fine-tunes the BERT model for the CSC task.
– Soft-Masked Bert [31] uses a detection module to predict the error probability of each token, which is used to perform soft masking for the correction model.


– SpellGCN [1] fuses the features of each character with its visually or phonetically similar characters and uses the fused embedding in the decoder.
– REALISE [24] uses three encoders to capture semantic, phonetic, and graphic information respectively, with an additional fusion module to fuse this information.
– PLOME [14] incorporates sound and glyph information in the embedding layer and introduces a confusion set based masking strategy.
– SCOPE [11] uses ChineseBert [18] as the backbone and introduces a fine-grained Chinese Pronunciation Prediction (CPP) task in parallel to the CSC task.
– ECGM [17] proposes an error-guided correction model, which attends to potential errors better through a zero-shot error detection method.

Implementation Details. The semantic encoder of our PYGC model is a stack of 12 Transformer layers, initialized with the weights of the chinese-roberta-wwm-ext model [3]. For the pinyin encoder, we adopt the same architecture as the chinese-electra-180g-small-discriminator model, with a vocabulary of 69 initials, finals, and tones. We pre-train the pinyin encoder on the wiki2019zh dataset with a maximum sequence length of 512, a batch size of 384, and an MLM task for 30 epochs. At the pre-training stage, we use the initialization described above and further pre-train on wiki2019zh (https://github.com/brightmart/nlp_chinese_corpus) for 5 epochs with a batch size of 512 and a learning rate of 5e-4; the warmup ratio is set to 0.1. The dataset contains 1 million well-structured Chinese entries, which we processed into 9 million sentences. As in BERT, we randomly mask 15% of the tokens, using the masking strategy described in Sect. 3.4. The pre-training loss is the same as described in Sect. 3.3, except that losses are computed only on the selected characters. At the fine-tuning stage on CSC tasks, we use a batch size of 32, a learning rate of 5e-5, and a warmup ratio of 0.1 on both the SIGHAN and CSCD-IME datasets. The maximum input sequence length is set to 170 in both the pre-training and fine-tuning stages, since the pinyin unit sequence is three times as long as the input sequence and must remain shorter than 512. λ is set to 0.5 in all experiments.

4.2 Main Results

Table 4 shows the sentence-level detection and correction scores on the SIGHAN15 test set. The metrics in the first group directly take the output of the Transformer model as the prediction result. SpellGCN performs better than Bert by using a decoder that fuses phonetic and glyph information. REALISE and PLOME achieve better results by fusing visual and sound embeddings at the embedding layer. SCOPE puts forward the fine-grained CPP task and achieves the best recall. The ECGM model achieves the best detection and correction precision through a zero-shot error detection method. Moreover, the comparison between the two groups of results shows that iterative decoding can improve the model's performance.


Table 4. Sentence-level performance on the SIGHAN15 test set. The best results are in bold, and the second-best ones are underlined. Methods in the second group use an iterative decoding strategy on model outputs (denoted with "†").

Method           | Detection Acc/Pre/Rec/F1  | Correction Acc/Pre/Rec/F1
Bert             | 82.4 / 74.2 / 78.0 / 76.1 | 81.0 / 71.6 / 75.3 / 73.4
Soft-Masked Bert | 80.9 / 73.7 / 73.2 / 73.5 | 77.4 / 66.7 / 66.2 / 66.4
SpellGCN         |   –  / 74.8 / 80.7 / 77.7 |   –  / 72.1 / 77.7 / 75.9
Realise          | 84.7 / 77.3 / 81.3 / 79.3 | 84.0 / 75.9 / 79.9 / 77.8
PLOME            |   –  / 77.4 / 81.5 / 79.4 |   –  / 75.3 / 79.3 / 77.4
SCOPE            |   –  / 78.2 / 83.5 / 80.8 |   –  / 75.8 / 81.0 / 78.3
PYGC             | 86.2 / 80.3 / 83.0 / 81.6 | 85.3 / 78.5 / 81.1 / 79.8
SCOPE†           |   –  / 81.1 / 84.3 / 82.7 |   –  / 79.2 / 82.3 / 80.7
EGCM†            | 87.2 / 83.4 / 79.8 / 81.6 | 86.3 / 81.4 / 78.4 / 79.9
PYGC (ours)†     | 87.2 / 82.1 / 83.9 / 83.0 | 86.2 / 80.1 / 81.9 / 81.0

Table 5. The performance of our model and current SOTA methods on CSCD-IME. The best results are in bold.

Method           | Sentence-level Detection Pre/Rec/F1 | Sentence-level Correction Pre/Rec/F1 | Character-level Detection Pre/Rec/F1 | Character-level Correction Pre/Rec/F1
Bert             | 79.16 / 65.83 / 71.88 | 70.55 / 58.66 / 64.06 | 83.00 / 67.01 / 74.15 | 73.59 / 59.41 / 65.75
Soft-masked Bert | 80.87 / 64.78 / 71.94 | 74.42 / 59.62 / 66.20 | 84.46 / 65.35 / 73.68 | 77.50 / 59.97 / 67.62
PLOME            | 79.78 / 57.23 / 66.65 | 78.09 / 56.01 / 65.23 | 83.48 / 57.99 / 68.44 | 81.49 / 56.61 / 66.81
PYGC (ours)      | 81.72 / 68.34 / 74.43 | 79.23 / 66.26 / 72.17 | 83.88 / 69.78 / 76.18 | 81.22 / 67.56 / 73.76

Compared with all previous models, our model achieves a balance between precision and recall and establishes new state-of-the-art overall performance (0.8%/0.3% on detection and 1.5%/0.3% on correction, without/with iterative decoding), proving the effectiveness of our method. Besides, Table 5 compares our model with the benchmark models on the CSCD-IME dataset, which was collected from pinyin input methods and manually labeled by humans [8]. We find that the PLOME model cannot beat BERT even though it incorporates visual and sound information at the embedding level and adopts confusion set based pre-training, which shows that PLOME has room for improvement on real-world spelling errors generated by pinyin input methods. Our PYGC model achieves the best results in this scenario: for the detection/correction tasks, we obtain 2.55%/5.97% improvement at the sentence level and 2.03%/6.14% at the character level.


Table 6. Sentence-level correction results on SIGHAN-Isolation. Best results are in bold. Left: Statistics of SIGHAN-Isolation. Right: Sentence-level correction performance of different models on SIGHAN-Isolation.

Left:
                     | #Pairs | #Sents
Training Set         | 21801  | 277804
Test Set             | 1496   | 3162
Train Set ∩ Test Set | 1443   | -
Isolation Train Set  | 20358  | 198050
Isolation Test Set   | 1496   | 3162

Right:
Method      | Pre  | Rec  | F1
Bert        | 43.7 | 26.9 | 33.3
PLOME       | 48.6 | 38.8 | 43.2
SCOPE       | 54.8 | 47.9 | 51.1
PYGC (ours) | 64.3 | 49.0 | 56.2

4.3 Isolation Test

It has been pointed out that existing spelling correction models achieve good results on test sets by "remembering" the errors in the training sets, while their generalization ability is poor [33]. To measure the generalization ability of CSC models, following [33], we delete from the training set the sentences containing error pairs that appear in the SIGHAN test set, obtaining the SIGHAN-Isolation dataset; its statistics are shown in Table 6 (Left). We compare the sentence-level correction metrics of Bert, the current SOTA models PLOME and SCOPE, and our PYGC model on SIGHAN-Isolation. Table 6 (Right) shows that our model has the best generalization ability. Specifically, it achieves 9.1%/1.1%/5.1% improvements in Precision/Recall/F1 over the current best-performing model.

4.4 Ablation Study

This section investigates the importance of the different components of our PYGC model. As shown in Table 7, when the pinyin loss is removed, the performance of our model decreases by more than 3%, which indicates that the pinyin loss enables the pinyin language model to better capture the ground-truth pinyin information. When the fusion module is removed, the semantic language model and the pinyin language model become independent, and performance drops by more than 1%, indicating that fusing semantic and pinyin information helps the final prediction. Likewise, removing the confusion set based pre-training or the iterative decoding decreases performance by more than 1%, confirming the effectiveness of these strategies.

4.5 Study of the Pinyin Language Model

The motivation of our model is that the pinyin sequence itself behaves like a language model. To test whether this hypothesis holds, we visualize the pre-training of the pinyin language model. Figure 2(a) shows the loss and accuracy curves.


Table 7. Ablation study on the SIGHAN dataset. "-pinyin loss" means only using the character correction loss at fine-tuning, "-fusion module" means removing the fusion module, "-pretrain" means removing the confusion set based pre-training, and "-iterative" means removing iterative decoding.

Method          | Detection Pre/Rec/F1 | Correction Pre/Rec/F1
Bert            | 74.2 / 78.0 / 76.1   | 71.6 / 75.3 / 73.4
PYGC            | 82.1 / 83.9 / 83.0   | 80.1 / 81.9 / 81.0
 -pinyin loss   | 78.2 / 80.0 / 79.1   | 76.9 / 78.7 / 77.8
 -fusion module | 81.3 / 82.6 / 81.9   | 79.1 / 80.4 / 79.7
 -pretrain      | 80.0 / 82.3 / 81.1   | 78.2 / 80.4 / 79.3
 -iterative     | 80.3 / 83.0 / 81.6   | 78.5 / 81.1 / 79.8

It can be observed that the pinyin language model starts to learn useful knowledge of pinyin after about 10k pre-training steps. After that, the loss continues to decline, and the accuracy exceeds 70% and keeps increasing. This indicates that the sequence of pinyin units has the characteristics of a language model, i.e., masked pinyin units can be inferred from the surrounding pinyin units. Figure 2(b) shows the latent representations of pinyin units after pre-training. The representations of initials, finals, and tones each form their own cluster, indicating that the model captures the structure of the pinyin unit sequence.

Fig. 2. Visualization of the characteristics of the pinyin language model. (a) Train/eval loss and eval accuracy when pre-training the pinyin language model. (b) Latent space representations of pinyin units after pre-training.


4.6 Case Study

Table 8 shows two real correction results of PYGC. In the first example, "很" (h,en,3), "赶" (g,an,3), and "们" (m,en,2) are all semantically valid corrections. However, "很" (h,en,3) shares the pinyin final unit "en" with the input character, and the pinyin language model therefore predicts the corresponding initial, final, and tone units with high confidence. We can also see that the prediction probabilities of the correct pinyin units are higher for the ground-truth output token, which means the pinyin language model has a "guiding" effect on the final prediction. The second example shows a sentence with consecutive spelling errors. Although the pinyin language model fails to correct the tone error ("3" for the input character "摆"), our PYGC model still gives the correct final prediction "拜托" based on the fused pinyin and semantic information. These two examples demonstrate the effectiveness of our PYGC model.

Table 8. Examples of PYGC prediction results; the incorrect input characters are marked in red and the correct predictions in blue. For each incorrect character, we make predictions on its output token, pinyin initial unit, pinyin final unit, and tone. For all candidate predictions, the number in parentheses is the prediction probability.

Example 1 (Translation: I'll come to your house soon after I return.)
  Input pinyin: (g,en,1)    Predicted pinyin: (h,en,3)
  Top-3 token candidates (probabilities): (0.678), (0.322), (9.32e-05)
  Initial: h (0.999), g (8.10e-05), m (7.20e-05)
  Final:   en (0.999), h (3.97e-04), eng (3.20e-04)
  Tone:    3 (0.999), 4 (1.37e-04), 5 (1.13e-04)

Example 2 (Translation: The neighbors asked me to be their representative and wrote this letter to Director Li.)
  Input pinyin: (b,ai,3) (t,uo,1)    Predicted pinyin: (b,ai,4) (t,uo,1)
  Top-3 token candidates (probabilities): (0.995), (1.68e-03), (8.51e-04) and (0.997), (1.83e-03), (5.26e-05)
  Initial: b (0.999), s (1.39e-05), p (1.07e-05) and t (0.999), d (8.48e-05), sh (3.51e-05)
  Final:   ai (0.999), o (2.30e-04), a (6.03e-05) and uo (0.999), ou (2.78e-06), i (1.40e-06)
  Tone:    3 (0.998), 4 (1.66e-03), 2 (1.35e-04) and 1 (0.999), 4 (8.76e-06), 3 (4.38e-06)

5 Conclusion

We propose PYGC, a PinYin language model Guided Correction model for Chinese Spell Checking. PYGC captures the inherent nature of the pronunciation of Chinese sequences (Chinese pinyin) and models the pinyin sequence as an independent language model to guide the spelling error correction task. A fusion module that leverages information from the textual and acoustic modalities is used to integrate the semantic and pinyin representations. In addition, PYGC utilizes confusion set based pre-training and iterative decoding to further improve performance. Experimental results demonstrate that our model establishes new


state-of-the-art results on the SIGHAN and CSCD-IME datasets. Furthermore, the metrics on the SIGHAN-Isolation dataset show that our model has the best generalization ability, which verifies the importance of pinyin in the CSC task and the effectiveness of our approach to modeling pinyin.

References 1. Cheng, X., et al.: SpellGCN: incorporating phonological and visual similarities into language models for Chinese spelling check. arXiv preprint arXiv:2004.14166 (2020) 2. Chiu, H.W., Wu, J.C., Chang, J.S.: Chinese spelling checker based on statistical machine translation. In: Proceedings of the Seventh SIGHAN Workshop on Chinese Language Processing, pp. 49–53 (2013) 3. Cui, Y., Che, W., Liu, T., Qin, B., Wang, S., Hu, G.: Revisiting pre-trained models for Chinese natural language processing. arXiv preprint arXiv:2004.13922 (2020) 4. Devlin, J., Chang, M.W., Lee, K., Toutanova, K.: Bert: pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805 (2018) 5. Han, D., Chang, B.: A maximum entropy approach to Chinese spelling check. In: Proceedings of the Seventh SIGHAN Workshop on Chinese Language Processing, pp. 74–78 (2013) 6. Hong, Y., Yu, X., He, N., Liu, N., Liu, J.: Faspell: a fast, adaptable, simple, powerful Chinese spell checker based on DAE-decoder paradigm. In: Proceedings of the 5th Workshop on Noisy User-Generated Text (W-NUT 2019), pp. 160–169 (2019) 7. Hsieh, Y.M., Bai, M.H., Chen, K.J.: Introduction to CKIP Chinese spelling check system for SIGHAN bakeoff 2013 evaluation. In: Proceedings of the Seventh SIGHAN Workshop on Chinese Language Processing, pp. 59–63 (2013) 8. Hu, Y., Meng, F., Zhou, J.: CSCD-IME: correcting spelling errors generated by pinyin IME. arXiv preprint arXiv:2211.08788 (2022) 9. Huang, L., et al.: Phmospell: phonological and morphological knowledge guided Chinese spelling check. In: Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers), pp. 5958–5967 (2021) 10. Lee, L.H., Wu, W.S., Li, J.H., Lin, Y.C., Tseng, Y.H.: Building a confused character set for Chinese spell checking. In: 27th International Conference on Computers in Education (ICCE 2019), pp. 703–705. Asia-Pacific Society for Computers in Education (2019) 11. Li, J., Wang, Q., Mao, Z., Guo, J., Yang, Y., Zhang, Y.: Improving Chinese spelling check by character pronunciation prediction: the effects of adaptivity and granularity. arXiv preprint arXiv:2210.10996 (2022) 12. Liu, C.L., Lai, M.H., Tien, K.W., Chuang, Y.H., Wu, S.H., Lee, C.Y.: Visually and phonologically similar characters in incorrect Chinese words: analyses, identification, and applications. ACM Trans. Asian Lang. Inf. Process. 10(2), 1–39 (2011) 13. Liu, S., et al.: Craspell: a contextual typo robust approach to improve Chinese spelling correction. In: Findings of the Association for Computational Linguistics (ACL 2022), pp. 3008–3018 (2022)


14. Liu, S., Yang, T., Yue, T., Zhang, F., Wang, D.: Plome: pre-training with misspelled knowledge for Chinese spelling correction. In: Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers), pp. 2991–3000 (2021) 15. Liu, X., Cheng, F., Duh, K., Matsumoto, Y.: A hybrid ranking approach to Chinese spelling check. ACM Trans. Asian Low Resour. Lang. Inf. Process. 14(4), 1–17 (2015) 16. Liu, Y., et al.: Roberta: a robustly optimized bert pretraining approach. arXiv preprint arXiv:1907.11692 (2019) 17. Sun, R., Wu, X., Wu, Y.: An error-guided correction model for Chinese spelling error correction. arXiv preprint arXiv:2301.06323 (2023) 18. Sun, Z., et al.: Chinesebert: chinese pretraining enhanced by glyph and pinyin information. arXiv preprint arXiv:2106.16038 (2021) 19. Tseng, Y.H., Lee, L.H., Chang, L.P., Chen, H.H.: Introduction to SIGHAN 2015 bake-off for Chinese spelling check. In: Proceedings of the Eighth SIGHAN Workshop on Chinese Language Processing, pp. 32–37 (2015) 20. Wang, B., Che, W., Wu, D., Wang, S., Hu, G., Liu, T.: Dynamic connected networks for Chinese spelling check. In: Findings of the Association for Computational Linguistics (ACL-IJCNLP 2021), pp. 2437–2446 (2021) 21. Wang, D., Song, Y., Li, J., Han, J., Zhang, H.: A hybrid approach to automatic corpus generation for Chinese spelling check. In: Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, pp. 2517–2527 (2018) 22. Wang, D., Tay, Y., Zhong, L.: Confusionset-guided pointer networks for Chinese spelling check. In: Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, pp. 5780–5785 (2019) 23. Wu, S.H., Liu, C.L., Lee, L.H.: Chinese spelling check evaluation at SIGHAN bakeoff 2013. In: Proceedings of the Seventh SIGHAN Workshop on Chinese Language Processing, pp. 35–42 (2013) 24. Xu, H.D., et al.: Read, listen, and see: leveraging multimodal information helps Chinese spell checking. arXiv preprint arXiv:2105.12306 (2021) 25. Yang, S., Yu, L.: COSPA: an improved masked language model with copy mechanism for Chinese spelling correction. In: Uncertainty in Artificial Intelligence, pp. 2225–2234. PMLR (2022) 26. Yang, T.H., Hsieh, Y.L., Chen, Y.H., Tsang, M., Shih, C.W., Hsu, W.L.: SinicaIASL Chinese spelling check system at SIGHAN-7. In: Proceedings of the Seventh SIGHAN Workshop on Chinese Language Processing, pp. 93–96 (2013) 27. Yeh, J.F., Li, S.F., Wu, M.R., Chen, W.Y., Su, M.C.: Chinese word spelling correction based on n-gram ranked inverted index list. In: Proceedings of the Seventh SIGHAN Workshop on Chinese Language Processing, pp. 43–48 (2013) 28. Yu, J., Li, Z.: Chinese spelling error detection and correction based on language model, pronunciation, and shape. In: Proceedings of the Third CIPS-SIGHAN Joint Conference on Chinese Language Processing, pp. 220–223 (2014) 29. Yu, L.C., Lee, L.H., Tseng, Y.H., Chen, H.H.: Overview of SIGHAN 2014 bakeoff for Chinese spelling check. In: Proceedings of the Third CIPS-SIGHAN Joint Conference on Chinese Language Processing, pp. 126–132 (2014) 30. Zhang, R., et al.: Correcting Chinese spelling errors with phonetic pre-training. In: Findings of the Association for Computational Linguistics (ACL-IJCNLP 2021), pp. 2250–2261 (2021) 31. Zhang, S., Huang, H., Liu, J., Li, H.: Spelling error correction with soft-masked bert. arXiv preprint arXiv:2005.07421 (2020)


32. Zhang, S., Xiong, J., Hou, J., Zhang, Q., Cheng, X.: Hanspeller++: a unified framework for Chinese spelling correction. In: Proceedings of the Eighth SIGHAN Workshop on Chinese Language Processing (2015) 33. Zhang, X., Zheng, Y., Yan, H., Qiu, X.: Investigating glyph phonetic information for Chinese spell checking: what works and what’s next. arXiv preprint arXiv:2212.04068 (2022)

Empirical Analysis of Multi-label Classification on GitterCom Using BERT and ML Classifiers Bathini Sai Akash1 , Lov Kumar2 , Vikram Singh2(B) , Anoop Kumar Patel2 , and Aneesh Krishna3 1

2

BITS-Pilani Hyderabad, Hyderabad, India Department of Computer Engineering, NIT, Kurukshetra, India {lovkumar,viks,akp}@nitkkr.ac.in 3 Curtin University Perth, Bentley, Australia [email protected]

Abstract. To maintain development consciousness, simplify project coordination, and prevent misinterpretation, communication is essential for software development teams. Instant private messaging, group chats, and sharing code are just a few of the capabilities that chat rooms provide to assist and meet the communication demands of software development teams. All of this is capacitated to happen in real-time. Consequently, chat rooms have gained popularity among developers. Gitter is one of these platforms that has gained popularity, and the conversations it contains may be a treasure trove of data for academics researching open-source software systems. This research made use of the GitterCom dataset, The largest collection of Gitter developer messages that have been carefully labelled and curated and perform multi-label classification for the ’Purpose’ category in the dataset. An extensive empirical analysis is performed on 6 feature selection techniques, 14 machine learning classifiers, and BERT transformer layer architecture with layer-by-layer comparison. Consequently, we achieve proficient results through our research pipeline involving Extra Trees Classifier and Random Forest classifiers with AUC (OvR) median performance of 0.94 and 0.92 respectively. Furthermore, The research proposed research pipeline could be utilized for generic multi-label text classification on software developer forum text data. Keywords: GitterCom · BERT analysis · Data Imbalance Methods Feature Selection · Ensemble models · Sentence Embedding

1

·

Introduction

The developer chatroom has two types: “public” and “private” varieties. Open source projects typically employ public chat rooms and a lot of value is placed on the creation of a welcoming community that converses and shares expertise. GitteCom, also known as Gitter, is a communication platform designed for c The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2024  B. Luo et al. (Eds.): ICONIP 2023, LNCS 14451, pp. 240–252, 2024. https://doi.org/10.1007/978-981-99-8073-4_19

Classification on GitterCom Using BERT and ML Classifiers

241

developers and communities, that offers both public and private chat rooms. Public chat rooms allow developers to connect with like-minded individuals, share knowledge, and contribute to discussions. Private chat rooms provide a secure space for teams and organizations to communicate internally. Several factors contribute to Gitter’s popularity among open-sourced developers. First off, Gitter allows anybody to browse user-generated content. GitterCom [1] contains information on 10,000 messages gathered from 10 Gitter communities for open-source development projects. It provides valuable data for examining developer behaviors. However, the ’Purpose’ category for each message in the dataset was manually labelled by experts. Our research primarily focuses on generating a pipeline that performs comprehensive comparative analysis on feature selection, machine learning classification performance, and BERT in-depth layer performance on the aforementioned dataset. Extrapolating our approach could save prime resources and time for developer teams to categorize discussions into their valid roles and purpose groups. The research would consequentially contribute to the research community giving exposure and standpoint from feature selection to BERT performance for text classification in the field of Software development forum discussion. The approach in the research can be further applied in a generic sense for text classification in the domain and use cases analogous to ours. The research questions (RQs) posed and answered in our work are as follows: – RQ1: Is there statistical equality between the results of the feature selection? – RQ2: Does the application of the Synthetic Minority Oversampling Technique (SMOTE) significantly contribute to an increase in prediction performance? – RQ3: What is the proficiency in the performance of Machine Learning classifiers on the text classification problem? – RQ4: What are the statistical variability and impact in the performance of the 13 BERT transformer layers on multi-label classification respectively? To the best of our knowledge, we find our work to be original and perform meticulous checks on the validity of the information provided in the research.

2

Related Work

The following section explains the related work concerned with the research. Text classification has been widely performed using supervised learning time and again. From deep learning approaches, Markov-based language models, and context-based language models to pre-trained language models, it’s a widely spread field of study [2–4]. However, it is vital that an integration of these techniques is cast light upon. Gitter has been in notable studies, particularly in text and empirical analysis. The study in [5], discovers when developers connect, comprehend the community dynamics on Gitter, pinpoint the subjects covered, and examine developer interactions. The study aims to close the awareness and comprehension gap in

242

B. S. Akash et al.

this field. Esteban et al. [6] in 2022 worked on contrasting the communication behaviors of Gitter with those of other platforms, developing automated techniques for message intent assessment, and, based on the results, recommended potential areas for further study. In order to fully realize their potential as efficient communication platforms for developers, research in [7] seeks to offer insights into how Slack and Gitter’s forums are used in software development teams, comprehend their influence on projects, and suggest possibilities for improvement. However, there has been little to no research on the integration of BERT embeddings with compatible feature selection techniques and classification models on the GitterCom dataset. Our research bridges this gap by generating an effective pipeline that not only performs comprehensive comparative analysis but also proposes a proficient design for the automatic labeling of The Purpose Category on the dataset.

3

Dataset Description

GitterCom [1] contains information on 10,000 messages gathered from 10 Gitter1 communities for open-source development projects (each community comprises 1000 messages). Each communication (message) in the dataset was labeled manually to categorize the purpose of communication according to the categories distinguished by Lin et al. [8]. There are 7 features for each message in the dataset: channel name, a unique messageID, Date-time of the message postage, message author, Purpose of the message, Purpose category, and Purpose subcategory. The last two features were manually labeled by the authors of the research. The distribution of messages per category is given in detail in [1]. We made use of messages with only the Purpose Category for our research. The distribution of the dataset by ’Purpose’ category as: Communication with 5274 Message counts, Dev-ops with 2776 Message counts, Customer support with 1431 Message counts, and others with a 519 Message counts.

4

Research Methodology

The following section presents an exposition of the various methodologies employed in the research. This includes feature selection techniques, sampling algorithms, and machine learning classifiers. 4.1

Feature Selection

The study made use of the feature selection methods [9] include Information Gain (FR-2), Gain Ratio (FR-3), OneR (FR-4), Chi-squared (FR-5), One-way ANOVA (Fr-6), and PCA extraction (FR-7). All the preceding techniques apart from PCA are ranking feature selection techniques, PCA is a feature extraction technique. OneR is a rules-driven technique that develops rules for the dependent 1

https://gitter.im/.

Classification on GitterCom Using BERT and ML Classifiers

243

variables and orders the features according to classification rates. The Gain Ratio is a type of information gain strategy that aids in the elimination of bias. This method takes into consideration the split’s inherent characteristics. Information gain is determined by contrasting the dataset’s entropy before and after the modifications, and it represents the reduction in entropy. The relevance of the characteristic may be ranked using the chi-square statistic value. 4.2

Data Balancing

The data used for training machine learning models is prime for performance enhancement. As shown in Sect. 3, there is a class imbalance in the Gittercom dataset after multi-labeling is performed manually. If the categorization categories are not approximately evenly represented in a dataset, it is unbalanced. We made use of vanilla SMOTE (Synthetic Minority Over-sampling Technique) [10] for the removal of class imbalance in the dataset. SMOTE is an oversampling technique that balances the minority class on par with the majority. 4.3

Machine Learning Classifiers

In order to classify BERT embeddings into the multi-labels of the dataset we made use of 14 established machine-learning models. The models along with the naming convention are given in Table 1. Aiming to suitably analyze the embeddings in conjunction with SMOTE and feature selection, the models applied are diverse in approach and architecture. Table 1. Naming convention of adapted ML classifiers Abr

5

Classification Model

Abr

Classification Model

MBC Multinomial Naive Bayes

BBC Bernoulli Naive Bayes

GBC Gaussian Naive Bayes

CBC Complement Naive Bayes

DT

Decision Tree Classifier

LSV

PSV

Support Vector Classifier (Poly) KNN K Neighbors Classifier

RSV

Support Vector Classifier (RBF) EXT Extra Trees Classifier

Support Vector Classifier (Linear)

RFC Random Forest Classifier

BAC Bagging Classifier

GRB Gradient Boosting Classifier

ADB AdaBoost Classifier

BERT Architecture

We made use of BERTbase architecture with 12 Layers, 12 attention heads with total parameters in the architecture of 110 M [11]. It is based on the encoderdecoder model introduced in [12]. The architecture is shown in Fig. 1. It is to be noted that the initial layer represented by the Encoder Layer in Fig. 1 is not

244

B. S. Akash et al.

part of the ‘12’ transformers layers in BERT. This layer is also labelled as LR-1 in Sect. 6.4. Hence we have 13 layers including the initial encoder layer that are analyzed in the aforementioned Section.

Fig. 1. Scheme of BERT architecture

6

Comparative Analysis

The following section gives comprehensive descriptive statistics and analysis of the various feature selection techniques, BERT layer performances, and Classification techniques. We made use of 6 feature ranking techniques and 14 classification techniques respectively. Our extensive comparison was performed on metrics Area Under curve (AUC) with two variants: One-vs-One and One-vsRest respectively. We also made use of SMOTE to overcome the class imbalance problem existing in the dataset. In total, we generated 182 datasets in our data warehouse. Each dataset corresponds to a set of techniques used for analysis. To employ statistical studies apart from AUC studies, two tests were employed, Wilcoxon Signed Rank [13] Test and Friedman Test [14]. This would evaluate the importance and dependability of the various methods used. 6.1

Feature Selection

To draw a conclusion, the 6 feature selection were compared comprehensively in this section for impact on prediction performance. RQ1: Is There Statistical Equality Between the Results of the Feature Selection? AUC Performance Analysis of Feature Ranking Techniques Using Descriptive Statistics and Boxplots: The AUC values for the 6 Feature selection technique used are detailed in Fig. 2. As shown in Fig. 2, it is to be

Classification on GitterCom Using BERT and ML Classifiers

245

noted that feature extraction and selection were performed over on unbalanced dataset and balanced data using SMOTE was applied after feature selection. The Original dataset (OD) without feature selection is referred to as FR-1 and consecutive FS techniques are labeled from 2 to 6 in numerical order. The data presented in box plots Fig. 2 leads to subsequent observations: – All the feature selection techniques perform relatively competent with median AUC values in the range of 0.94 to 0.95. – Chi Value Ranking (FR-5) under-performs relatively, even in comparison to the original dataset with all features. It – Feature selection technique PCA (FR-7) seems to perform on par with the original dataset, even though it comprises of notably fewer features.

ONEONE

ONEREST 0.95

0.9

0.9 0.85

0.8

AUC

AUC

0.8 0.7

0.6

0.75 0.7 0.65 0.6

0.5 0.55

-7

FR -6

FR

FR -5

-3

-4 FR

-2 FR

FR

-7

-6

FR

FR

-4

FR -5

-3

FR

FR -2

FR

FR -1

FR -1

0.5

0.4

Fig. 2. Box Plots for Feature selection technique performance

Statistical Hypothesis Testing for Comparison of Feature Selection Performance: As mentioned above, we used two techniques for hypothesis testing, the Wilcoxon Signed Rank Test and the Friedman Test. The resulting p-values are presented in Table 2. On inspection of Table 2, we see that FR-6 and FR-7 have p ≥ 0.05 against FR-1, so the null hypothesis is not rejected and there is no significant difference in performance over the original dataset. FR-5 is significantly different from all techniques under consideration, with minimal p-values. Further, we employ the Friedman Test to quantify the performance of the techniques; the lower the mean rank, the higher the performance. From Table 2, we note that FR-6 has the lowest rank and performs slightly better than the original dataset without feature selection, while FR-7 performs on par with the original data. Furthermore, the mean-rank values of all techniques under consideration, barring FR-5, are fairly good.
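A hedged sketch of the two significance tests with SciPy; each row of `scores` is assumed to be one dataset/configuration and each column one feature selection technique, which mirrors but does not reproduce the paper's exact setup.

```python
# Friedman test across all techniques plus pairwise Wilcoxon signed-rank tests.
from scipy.stats import wilcoxon, friedmanchisquare

def compare_techniques(scores):                 # scores: (n_datasets, n_techniques)
    _, p_friedman = friedmanchisquare(*[scores[:, j] for j in range(scores.shape[1])])
    pairwise = {}
    for i in range(scores.shape[1]):
        for j in range(i + 1, scores.shape[1]):
            pairwise[(i, j)] = wilcoxon(scores[:, i], scores[:, j]).pvalue
    return p_friedman, pairwise
```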


Table 2. Feature selection p-values (AUC, One-vs-One)

     | FR1  | FR2  | FR3  | FR4  | FR5 | FR6  | FR7  | Mean-Rank
FR-1 | 1.0  | 0.0  | 0.01 | 0.0  | 0.0 | 0.47 | 0.6  | 2.87
FR-2 | 0.0  | 1.0  | 0.71 | 0.86 | 0.0 | 0.0  | 0.0  | 4.31
FR-3 | 0.01 | 0.71 | 1.0  | 0.9  | 0.0 | 0.0  | 0.0  | 4.34
FR-4 | 0.0  | 0.86 | 0.9  | 1.0  | 0.0 | 0.0  | 0.0  | 4.14
FR-5 | 0.0  | 0.0  | 0.0  | 0.0  | 1.0 | 0.0  | 0.0  | 6.94
FR-6 | 0.47 | 0.0  | 0.0  | 0.0  | 0.0 | 1.0  | 0.95 | 2.55
FR-7 | 0.6  | 0.0  | 0.0  | 0.0  | 0.0 | 0.95 | 1.0  | 2.85

6.2 Effect of Oversampling

A comprehensive analysis of performance enhancement following SMOTE is performed to answer the research question in this section. RQ2: Does the Application of the Synthetic Minority Oversampling Technique (SMOTE) Significantly Contribute to an Increase in Pediction Performance? AUC Performance Analysis of SMOTE Using Descriptive Statistics and Boxplots: The AUC values on the application of SMOTE is shown in Fig. 3. The performance evaluation takes the equivalent cumulative for all datasets including the original dataset without feature selection. The dataset without SMOTE is referred to as OD in this section. SMD refers to SMOTE. The data presented in box plots as shown in Fig. 3 leads to subsequent observations: – There seems to be a clear improvement in median AUC performance after the application of SMOTE. – The median AUC value notably improved from 0.69 to 0.95 in the case of One-vs-One.

ONEONE

ONEREST 0.95

0.9

0.9 0.85

0.8

AUC

AUC

0.8 0.7

0.6

0.75 0.7 0.65 0.6

0.5 0.55

SM D

SM D

OD

OD

0.5

0.4

Fig. 3. Box Plot for SMOTE performance

Statistical Hypothesis Testing to Check for SMOTE Enhancement: The p-values from hypothesis testing are presented in Table 3. Table 3 corresponds to hypothesis testing performed with AUC (OvO) as the metric; the research also computed p-values with AUC (OvR), which gave similar results. Further, we employ the Friedman Test to quantify the performance enhancement from applying SMOTE. From Table 3, we note a clear drop in mean rank from OD to SMD. Therefore, we deduce that the application of SMOTE significantly improved classification performance, and we conclude that applying SMOTE to BERT embeddings of the text dataset significantly enhanced multi-class prediction performance with machine learning models.

Table 3. SMOTE p-values (AUC, One-vs-One)

    | OD   | SMD  | Mean-Rank
OD  | 1.00 | 0.00 | 1.885
SMD | 0.00 | 1.00 | 1.115

6.3 Performance of Classifiers

The 14 machine learning models used in our classification pipeline are compared comprehensively in this section for their impact on prediction performance.

RQ3: What is the Proficiency in the Performance of Machine Learning Classifiers on the Text Classification Problem?

AUC Performance Analysis of Classification Models Using Descriptive Statistics and Boxplots: The AUC values for the 14 machine learning models are detailed in Fig. 4. As stated before, for descriptive comparison of performance we use median values rather than means. The box plots in Fig. 4 lead to the following observations:
– All the classification models perform relatively competently, with median AUC values ranging from 0.80 to 0.95.
– LSV and BBC underperform relative to the others, with AUC values close to 0.8.
– EXT and RFC appear to outperform all other models, attaining median AUC values close to 0.95.

Statistical Hypothesis Testing for Comparison of Model Classification Performance: The p-values from hypothesis testing are presented in Table 4 for AUC (OvO). On inspection of Table 4, we see that MBC, BBC, GBC, CBC, and DT largely have p > 0.05, for which the null hypothesis is not rejected and there is no significant difference in performance among these models; this could be because the first four are variants of Naive Bayes and perhaps perform alike [15]. Further, we employ the Friedman Test to quantify the performance of the models.


Fig. 4. Performance parameter box plot for the classification models (AUC, One-vs-One and One-vs-Rest)

From Table 4, we note that EXT and RFC achieve the lowest ranks and perform significantly better than all other models. Hence, we conclude that the Extra Trees Classifier (EXT) and the Random Forest Classifier (RFC) significantly outperform all other models in our case of text multi-label classification using BERT sentence embeddings.

Table 4. ML classifier p-values (AUC, One-vs-One)

    | MBC  | BBC  | GBC  | CBC  | DT   | KNN  | LSV  | PSV  | RSV  | EXT | RFC  | BAC  | GRB  | ADB  | Mean-Rank
MBC | 1.0  | 0.42 | 0.12 | 1.0  | 0.41 | 0.0  | 0.0  | 0.0  | 0.0  | 0.0 | 0.0  | 0.0  | 0.0  | 0.01 | 10.88
BBC | 0.42 | 1.0  | 0.52 | 0.33 | 0.52 | 0.0  | 0.0  | 0.0  | 0.0  | 0.0 | 0.0  | 0.0  | 0.0  | 0.02 | 10.75
GBC | 0.12 | 0.52 | 1.0  | 0.11 | 0.49 | 0.0  | 0.0  | 0.0  | 0.0  | 0.0 | 0.0  | 0.01 | 0.0  | 0.02 | 9.7
CBC | 1.0  | 0.33 | 0.11 | 1.0  | 0.49 | 0.0  | 0.0  | 0.0  | 0.0  | 0.0 | 0.0  | 0.0  | 0.0  | 0.0  | 11.02
DT  | 0.41 | 0.52 | 0.49 | 0.49 | 1.0  | 0.0  | 0.0  | 0.0  | 0.0  | 0.0 | 0.0  | 0.0  | 0.0  | 0.0  | 10.93
KNN | 0.0  | 0.0  | 0.0  | 0.0  | 0.0  | 1.0  | 0.21 | 0.33 | 0.27 | 0.0 | 0.0  | 0.02 | 0.0  | 0.13 | 7.96
LSV | 0.0  | 0.0  | 0.0  | 0.0  | 0.0  | 0.21 | 1.0  | 0.41 | 0.65 | 0.0 | 0.01 | 0.74 | 0.06 | 0.07 | 5.69
PSV | 0.0  | 0.0  | 0.0  | 0.0  | 0.0  | 0.33 | 0.41 | 1.0  | 0.85 | 0.0 | 0.0  | 0.46 | 0.03 | 0.15 | 5.54
RSV | 0.0  | 0.0  | 0.0  | 0.0  | 0.0  | 0.27 | 0.65 | 0.85 | 1.0  | 0.0 | 0.01 | 0.7  | 0.12 | 0.17 | 5.35
EXT | 0.0  | 0.0  | 0.0  | 0.0  | 0.0  | 0.0  | 0.0  | 0.0  | 0.0  | 1.0 | 0.0  | 0.0  | 0.0  | 0.0  | 2.44
RFC | 0.0  | 0.0  | 0.0  | 0.0  | 0.0  | 0.0  | 0.01 | 0.0  | 0.01 | 0.0 | 1.0  | 0.0  | 0.0  | 0.0  | 3.89
BAC | 0.0  | 0.0  | 0.01 | 0.0  | 0.0  | 0.02 | 0.74 | 0.46 | 0.7  | 0.0 | 0.0  | 1.0  | 0.09 | 0.0  | 7.1
GRB | 0.0  | 0.0  | 0.0  | 0.0  | 0.0  | 0.0  | 0.06 | 0.03 | 0.12 | 0.0 | 0.0  | 0.09 | 1.0  | 0.0  | 5.03
ADB | 0.01 | 0.02 | 0.02 | 0.0  | 0.0  | 0.13 | 0.07 | 0.15 | 0.17 | 0.0 | 0.0  | 0.0  | 0.0  | 1.0  | 8.7

6.4 BERT Layer Performance Comparison (RQ4)

The performance of the 12 BERT transformer layers and the input encoding layer is compared comprehensively in this section for its impact on prediction performance.


RQ4: What is the Statistical Variability and Impact in the Performance of the 13 BERT Transformer Layers? Which Layer Performs Best for Classification?

AUC Performance Analysis of BERT Layers Using Descriptive Statistics and Boxplots: The AUC values for the BERT layers are detailed in Fig. 5. LR-1 corresponds to the input encoding layer, and each consecutive label represents the next BERT layer. The box plots in Fig. 5 lead to the following observations:
– No clear insight can be derived on variability in the performance of the BERT layers.
– All the layers perform with a median AUC in the range of 0.68 to 0.71.

Fig. 5. Performance parameter box plot for the BERT layers, LR-1 to LR-13 (AUC, One-vs-One and One-vs-Rest)

1.0

0.0

0.0

LR-2

0.0

1.0

0.99 0.91 0.99 0.46 0.4

0.0

0.0

LR-3

0.0

0.99 1.0

LR-4

0.0

0.91 0.87 1.0

LR-5

0.0

0.99 1.0

0.93 1.0

LR-6

0.0

0.46 0.4

0.34 0.42 1.0

LR-7

0.0

0.4

LR-8

0.0

0.23 0.24 0.18 0.22 0.71 0.76 1.0

LR-9

0.0

0.67 0.79 0.63 0.73 0.69 0.6

0.87 1.0

0.0 0.4

0.0

0.0

0.0

0.0

0.0

0.23 0.67 0.49

0.0

0.0

0.65

0.84

0.11

7.49

0.39 0.24 0.79 0.58

0.66

0.74

0.1

7.63

0.93 0.34 0.32 0.18 0.63 0.42 0.42 0.37 0.22 0.73 0.54 0.91 0.71 0.69 0.84

0.39 0.32 0.37 0.91 1.0

0.76 0.6

0.78

0.56

0.71

0.08

7.12

0.66

0.77

0.1

7.86

0.73

0.68

0.39

6.11

0.66

0.58

0.43

5.66

0.45

0.43

0.59

5.51

0.36 1.0

0.8

0.92

0.95

0.18

7.42

LR-10 0.0

0.49 0.58 0.42 0.54 0.84 0.78 0.56 0.8

1.0

0.87

0.81

0.29

7.53

LR-11 0.0

0.65 0.66 0.56 0.66 0.73 0.66 0.45 0.92 0.87

1.0

0.88

0.21

6.97

LR-12 0.0

0.84 0.74 0.71 0.77 0.68 0.58 0.43 0.95 0.81

0.88

1.0

0.22

7.04

LR-13 0.0

0.11 0.1

0.21

0.22

1.0

5.24

0.08 0.1

0.36 0.56

9.42

0.39 0.43 0.59 0.18 0.29


Statistical Hypothesis Testing for Comparison of BERT Layer Performance: The p-values from hypothesis testing are presented in Table 5 for AUC (OvO). On inspection of Table 5, we see that only LR-1 (the input encoding layer) differs significantly in performance from the BERT layers, with p ≤ 0.05; the 12 BERT layers show no significant intra-layer performance variation according to the p-values. The research also computed p-values with AUC (OvR), which gave similar results. Further, we employ the Friedman Test to quantify the performance of the layers. From Table 5, we see that LR-7, LR-8, and LR-13 perform best, with ranks close to 5.5. While we analyze the 12 layers of BERT, there is no guarantee that the final layer collects the most pertinent information for the classification objective. Hence, we conclude that although BERT layer 12 (LR-13) outperforms all other layers, layers 5 (LR-6) and 6 (LR-7) perform similarly in our case of text multi-label classification using BERT sentence embeddings; there is no significant difference in the performance of the 12 BERT layers.

7 Conclusion

By effectively addressing the class imbalance and feature redundancy issues, this research offers a substantial advancement in GitterCom text classification and, more generally, in multi-label text classification on software developer forum data. The effectiveness of the models built with 14 distinct classifiers was evaluated by creating 182 datasets with varying feature selection techniques, SMOTE, and BERT layers. To address data coherence, reliability, feature irrelevance, and unbalanced data, the study uses SMOTE and 6 feature selection techniques. An extensive comparative analysis using the Wilcoxon signed-rank test and the Friedman test answers the 4 research questions and leads to the following conclusions:
– There is no significant difference in the performance of the 12 BERT layers. However, layer 12 outperforms all other layers, and layers 5 (LR-6) and 6 (LR-7) perform similarly in our case of text multi-label classification.
– In our scenario of text multi-label classification utilizing BERT sentence embeddings, the Extra Trees Classifier (EXT) and Random Forest Classifier (RFC) significantly beat all other models.
– The performance of multi-class prediction using machine learning models was significantly improved by employing SMOTE over BERT embeddings of the text dataset.
– Even though there is no significant improvement in performance, one-way ANOVA (FR-6) and PCA feature extraction (FR-7) could be employed on BERT embeddings for performance enhancement.


The aforementioned conclusions can support informed decision-making in the research community on text classification using BERT and on analyzing the performance of individual BERT transformer layers. Finally, with median AUC (OvR) performances of 0.94 and 0.92, respectively, our research pipeline using the Extra Trees and Random Forest classifiers yields competent outcomes. Additionally, the research pipeline and design could be used for general multi-label text categorization on text data from software development forums.

References 1. Parra, E., Ellis, A., Haiduc, S.: Gittercom: a dataset of open source developer communications in gitter. In: Proceedings of the 17th International Conference on Mining Software Repositories, pp. 563–567 (2020) 2. Sun, C., Qiu, X., Xu, Y., Huang, X.: How to fine-tune BERT for text classification? In: Sun, M., Huang, X., Ji, H., Liu, Z., Liu, Y. (eds.) CCL 2019. LNCS (LNAI), vol. 11856, pp. 194–206. Springer, Cham (2019). https://doi.org/10.1007/978-3030-32381-3 16 3. Lu, Z., Du, P., Nie, J.-Y.: VGCN-BERT: augmenting BERT with graph embedding for text classification. In: Jose, J.M., et al. (eds.) ECIR 2020. LNCS, vol. 12035, pp. 369–382. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-45439-5 25 4. Li, W., Gao, S., Zhou, H., Huang, Z., Zhang, K., Li, W.: The automatic text classification method based on bert and feature union. In: 2019 IEEE 25th International Conference on Parallel and Distributed Systems (ICPADS), pp. 774–777. IEEE (2019) 5. Shi, L., et al.: A first look at developers’ live chat on gitter. In: Proceedings of the 29th ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering, pp. 391–403 (2021) 6. Parra, E., Alahmadi, M., Ellis, A., Haiduc, S.: A comparative study and analysis of developer communications on slack and gitter. Empir. Softw. Eng. 27(2), 40 (2022) 7. El Mezouar, M., da Costa, D.A., German, D.M., Zou, Y.: Exploring the use of chatrooms by developers: an empirical study on slack and gitter. IEEE Trans. Softw. Eng. 48(10), 3988–4001 (2021) 8. Lin, B., Zagalsky, A., Storey, M.-A., Serebrenik, A.: Why developers are slacking off: understanding how software teams use slack. In: Proceedings of the 19th ACM Conference on Computer Supported Cooperative Work and Social Computing Companion, pp. 333–336 (2016) 9. Dash, M., Liu, H.: Feature selection for classification. Intell. Data Anal. 1(1–4), 131–156 (1997) 10. Chawla, N.V., Bowyer, K.W., Hall, L.O., Kegelmeyer, W.P.: Smote: synthetic minority over-sampling technique. J. Artif. Intell. Res. 16, 321–357 (2002) 11. Devlin, J., Chang, M.-W., Lee, K., Toutanova, K.: Bert: pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805 (2018) 12. Vaswani, A., et al.: Attention is all you need. Adv. Neural Inf. Process. Syst. 30 (2017) 13. Woolson, R.F.: Wilcoxon signed-rank test. In: Wiley Encyclopedia of Clinical Trials, pp. 1–3 (2007)


14. Zimmerman, D.W., Zumbo, B.D.: Relative power of the Wilcoxon test, the Friedman test, and repeated-measures Anova on ranks. J. Exp. Educ. 62(1), 75–86 (1993) 15. Al-Aidaroos, K.M., Bakar, A.A., Othman, Z.: Naive bayes variants in classification learning. In: 2010 International Conference on Information Retrieval and Knowledge Management (CAMP), pp. 276–281. IEEE (2010)

A Lightweight Safety Helmet Detection Network Based on Bidirectional Connection Module and Polarized Self-attention Tianyang Li1,2 , Hanwen Xu1(B) , and Jinxu Bai1 1

1 Computer Science, Northeast Electric Power University, Jilin 132012, Jilin, China
[email protected], [email protected]
2 Jiangxi New Energy Technology Institute, Xinyu, Jiangxi, China

Abstract. Safety helmets worn by construction workers in substations can reduce the accident rate in construction operations. With the mature development of smart grids and target detection technology, automatic monitoring of helmet wearing using a cloud-edge collaborative approach is of great significance for power construction safety management. However, existing target detectors perform a large number of redundant calculations during multi-scale feature fusion, resulting in additional computational overhead. To solve this problem, we propose a lightweight target detection model, PFBDet. First, we design the cross-stage local bottleneck module FNCSP and, combining it with Polarized Self-Attention, propose the efficient lightweight feature extraction network PFNet, which optimizes computational complexity while obtaining more feature information. Second, to address the redundancy overhead brought by multi-scale feature fusion, we design the BCM (bidirectional connection module) based on GSConv and the lightweight upsampling operator CARAFE, and on this basis, combined with a single aggregation cross-layer network module, propose the efficient multi-scale feature fusion structure BCM-PAN. To verify the effectiveness of the method, we conducted extensive experiments on helmet image datasets such as HelmetDet, Elec-hat and SHWD. The experimental results show that the proposed method achieves better recognition accuracy with less computation, exceeds most high-performance target detectors, and can meet the real-time detection needs for construction personnel wearing helmets in power system construction scenarios.

Keywords: Helmet · Self-attention · Object detection · BCM-PAN · FNCSP

1 Introduction

During daily power inspection and maintenance processes, power construction personnel frequently come into contact with high-voltage power equipment. Failure to wear helmets increases the likelihood of safety accidents, posing significant
security risks. To ensure that electric power construction personnel wear helmets and to enhance on-site safety, monitoring of helmet usage among construction personnel is crucial. However, due to the expansive working areas within substations and constraints such as personnel fatigue, the effectiveness of manual supervision is often inadequate [1]. In recent years, target detection technology in computer vision has made great progress, and a series of detection algorithms with superior performance have emerged, making automatic helmet detection feasible. Automatic detection of helmets worn by substation construction personnel with a monitoring system has the advantages of low cost [2] and full-time operation [3]. However, the high computational complexity and poor real-time performance of existing models make helmet detection difficult in practice, and these factors make it one of the challenging tasks in computer vision. To address these problems, we propose a lightweight helmet-wearing detection model, PFBDet. The main contributions are summarized as follows:

(1) We present a novel cross-stage local bottleneck module, FNCSP, and integrate it with Polarized Self-Attention to design a lightweight feature extraction network, PFNet.
(2) Leveraging the benefits of GSConv and a lightweight upsampling operator, we introduce a bidirectional connection module that extracts more precise localization information without introducing unnecessary computational complexity.
(3) To achieve efficient multi-scale feature fusion, we propose BCM-PAN, a structure that combines the bidirectional connection module with a single aggregation cross-layer network module.

2 Related Work

Girshick et al. [4] successfully introduced convolutional neural networks into target detection. Subsequently, researchers made continuous improvements on this basis and proposed detection methods with better accuracy and speed, bringing target detection algorithms ever closer to the requirements of practical deployment [5]. [6] further refined the feature information of the helmet target region by using a non-parametric attention mechanism in the SSD model; however, the high-level features were downsampled multiple times, resulting in the loss of some detailed information. In addition, [7] added deformable convolution with switchable atrous convolution, [8] added a residual network module, [9] replaced the original depth-feature-based network model with a depth cascade network model, and [10] added a lightweight ACNet module. [11] introduced a squeeze-and-excitation layer and efficient channel attention. [12] used depthwise separable convolution to reduce the number of parameters in the network and the Hswish loss function to
improve the helmet detection performance. However, these methods suffer from computationally redundant structures in the helmet wearing detection task and still need improvement in real-time detection. After analyzing the aforementioned literature, it is observed that several improvement schemes have demonstrated varying degrees of enhancement in terms of accuracy and recall. However, it is worth noting that these improvements often come at the cost of compromised detection speed and real-time performance of the models, resulting in an increase in computational overhead for the detector. Therefore, in order to address this issue, this paper presents a lightweight method for detecting helmet wearing. The primary objective of this method is to enable real-time detection of helmet usage among construction personnel in power system construction scenarios while minimizing computational burden.

3 Approach

The backbone of our model borrows the design principles of CSPDarknet [13] and is named PFNet. As shown in Fig. 1, this feature extraction network follows two main modular designs: the FNCSP residual structure and Polarized Self-Attention.

3.1 Lightweight Feature Extraction Networks

Most target detection models use a residual structure. The main reason is that the detection backbone is a deep network; adding residual connections strengthens the gradients back-propagated between layers and thus allows finer features to be extracted.

Fig. 1. PFNet adopts Polarized Self-Attention and FNCSP structure.

Therefore, leveraging the benefits of the residual structure, we devise a novel design called FNCSP, which combines the lightweight convolution PConv [14] from FasterNet. As illustrated in Fig. 1, PConv is used in the residual structure because it reduces computational redundancy and memory access. As can be seen in Fig. 1, PConv only applies a regular convolution to a portion of the input channels for spatial feature extraction and leaves the remaining channels unchanged. For contiguous or regular memory access, the first or last consecutive c_p channels are taken as representative of the entire feature map for computation. Without loss of generality, the input and output feature maps are considered to have the same number of channels. For a typical ratio r = 1/4 (the fraction of channels that are convolved), the computation of PConv is only 1/16 of that of a regular convolution; the FLOPs of PConv are

h × w × k^2 × c_p^2.

In addition, PConv has a smaller number of memory accesses:

h × w × 2c_p + k^2 × c_p^2 ≈ h × w × 2c_p.
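To make the channel-split idea concrete, the following is a minimal PyTorch sketch of a partial convolution with r = 1/4; the class name, the fixed split ratio and the use of torch.split are illustrative assumptions, not the authors' FNCSP implementation.

```python
import torch
import torch.nn as nn

class PConv(nn.Module):
    """Partial convolution sketch: apply a regular k x k conv to only the first
    c_p = channels * ratio channels; the remaining channels pass through untouched."""
    def __init__(self, channels: int, kernel_size: int = 3, ratio: float = 0.25):
        super().__init__()
        self.cp = max(1, int(channels * ratio))          # channels actually convolved
        self.conv = nn.Conv2d(self.cp, self.cp, kernel_size,
                              padding=kernel_size // 2, bias=False)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        x1, x2 = torch.split(x, [self.cp, x.size(1) - self.cp], dim=1)
        return torch.cat([self.conv(x1), x2], dim=1)     # untouched channels are concatenated back
```

Only the c_p convolved channels incur FLOPs, which is where the h × w × k^2 × c_p^2 count above comes from.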

3.2 Polarized Self-Attention

Compared with other attention mechanisms, the effectiveness of Polarized Self-Attention (PSA) [15] on fine-grained pixel-level tasks is attributed to its limited compression in both the spatial and channel dimensions, which minimizes information loss and allows the mechanism to sustain relatively high performance. As shown in Fig. 2, PSA is divided into two branches, a channel branch and a spatial branch, where the weights of the channel branch are calculated as

A^{ch}(X) = F_SG[ W_{z|θ_1}( σ_1(W_v(X)) × F_SM(σ_2(W_q(X))) ) ].

As can be seen from the channel branch, the input feature X is first converted into q and v using 1 × 1 convolutions, where the channel dimension of q is completely compressed while that of v remains at a relatively high level (C/2). Since the channel dimension of q is compressed, its information needs to be enhanced by HDR, so softmax is used to augment q. Then q and v are matrix-multiplied, followed by a 1 × 1 convolution and LN to change the channel dimension from C/2 back to C. Finally, a Sigmoid function keeps all weights between 0 and 1. The weights of the spatial branch are calculated as

A^{sp}(X) = F_SG[ σ_3( F_SM(σ_1(F_GP(W_q(X)))) × σ_2(W_v(X)) ) ].

Similar to the channel branch, the spatial branch first converts the input feature X into q and v using 1 × 1 convolutions. For the q feature, the spatial dimension is further compressed to 1 × 1 with global pooling, while the spatial dimension of v is kept at a relatively large H × W level. Since the spatial dimension of q is compressed, q is augmented with softmax. Matrix multiplication is then performed, and finally a reshape and a Sigmoid keep all weights between 0 and 1.
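As a concrete illustration of the channel branch described above, here is a minimal PyTorch sketch; the class name and exact layer arrangement are assumptions for exposition rather than the official PSA code.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class PSAChannelBranch(nn.Module):
    # Channel-only branch of Polarized Self-Attention (sketch).
    def __init__(self, channels: int):
        super().__init__()
        self.wq = nn.Conv2d(channels, 1, kernel_size=1)              # q: channel fully compressed
        self.wv = nn.Conv2d(channels, channels // 2, kernel_size=1)  # v: kept at C/2 channels
        self.wz = nn.Conv2d(channels // 2, channels, kernel_size=1)  # expand C/2 back to C
        self.ln = nn.LayerNorm(channels)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        b, c, h, w = x.shape
        q = F.softmax(self.wq(x).view(b, 1, h * w), dim=-1)   # softmax augments the compressed q
        v = self.wv(x).view(b, c // 2, h * w)                 # (B, C/2, HW)
        z = torch.matmul(v, q.transpose(1, 2))                # (B, C/2, 1): q-v matrix product
        z = self.wz(z.view(b, c // 2, 1, 1))                  # (B, C, 1, 1)
        z = self.ln(z.view(b, c)).view(b, c, 1, 1)            # LN over the channel dimension
        return x * torch.sigmoid(z)                           # channel weights kept in (0, 1)
```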


Fig. 2. Polarized Self-Attention.

3.3 Structure of BCM-PAN

In target detection tasks, the integration of multi-scale features has proven crucial for achieving accurate results. PANet [16] introduces a bottom-up pathway in addition to top-down feature fusion. This establishes a more direct information flow between different feature levels, enabling better integration of features across scales and enhancing overall detection performance. Inspired by this work, we design a modified feature pyramid network based on PANet as the neck of our detector. This improved PANet mainly consists of a feature pyramid and the bidirectional connection module. In the helmet-wearing detection task, feature integration is an important step, but we find that many initial features, which are beneficial for enhancing the feature representation, are lost in this process across the different feature layers. Therefore, to enhance the localization signal without adding too much computation, we combine the lightweight convolution block GSConv with the lightweight upsampling [18-20] operator CARAFE [17] and propose the BCM to integrate the feature maps of three adjacent layers. The bidirectional connection module is shown in Fig. 3. This module fuses an additional low-level feature output C_{i-1} from the backbone into P_i and provides initial feedback information for each adjacent feature map. In this way, a more accurate localization signal can be retained, which is important for localizing small-resolution targets.


Fig. 3. Multi-scale feature fusion structure BCM-PAN.

In order to make the output of depthwise separable convolution as close as possible to that of ordinary convolution, GSConv was proposed, a method that mixes ordinary convolution, depthwise separable convolution, and channel shuffle. We apply this convolution in the BCM module to minimize the computational cost. Based on GSConv, [21] designed the single aggregation cross-layer network module VoVGSCSP on top of the GSbottleneck of GSConv. As shown in Fig. 3, we build the efficient multi-scale feature fusion structure BCM-PAN from the bidirectional connection module and the single aggregation cross-layer network module.
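A minimal PyTorch sketch of a GSConv-style block is given below, assuming the common formulation of half standard convolution, half depthwise convolution and a two-group channel shuffle; kernel sizes and the class interface are illustrative assumptions rather than the authors' code.

```python
import torch
import torch.nn as nn

class GSConv(nn.Module):
    # Sketch of GSConv: mix a standard conv with a depthwise conv, then shuffle channels.
    def __init__(self, c_in: int, c_out: int, k: int = 3, s: int = 1):
        super().__init__()
        c_half = c_out // 2
        self.conv = nn.Sequential(                       # standard conv to half the output channels
            nn.Conv2d(c_in, c_half, k, s, k // 2, bias=False),
            nn.BatchNorm2d(c_half), nn.SiLU())
        self.dwconv = nn.Sequential(                     # depthwise conv on that half
            nn.Conv2d(c_half, c_half, 5, 1, 2, groups=c_half, bias=False),
            nn.BatchNorm2d(c_half), nn.SiLU())

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        x1 = self.conv(x)
        x2 = self.dwconv(x1)
        y = torch.cat([x1, x2], dim=1)                   # (B, c_out, H, W)
        b, c, h, w = y.shape                             # two-group channel shuffle
        return y.view(b, 2, c // 2, h, w).transpose(1, 2).reshape(b, c, h, w)
```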

4 Experiment

To validate the effectiveness of PFBDet, we compare our method with several high-performance target detectors on two substation helmet datasets and a public helmet detection dataset. In addition, to verify the robustness and generalizability of the proposed method, we conduct a series of comparative experiments on the public safety helmet dataset and analyze ablation experiments on the different components of the proposed method.

4.1 Experimental Details

We used an RTX 3090 graphics card (PyTorch 1.11.0, CUDA 11.3) for training and evaluating our method on the three datasets. We employ adaptive momentum estimation (Adam) to optimize the models and perform online data augmentation such as random scaling, flipping, random contrast changes, and random brightness changes. Finally, we train these models with a learning rate of 10^-3 for 200 epochs with the batch size set to 32.

4.2 Datasets

We conducted experiments on three helmet datasets, briefly described below.
HelmetDet [22]: a public helmet dataset with 2114 images from the electric power industry and a total of 9619 labeled samples, of which 8550 are helmet-wearing labels and 1069 are non-wearing labels.
Elec-hat [31]: a self-collected dataset containing 7000 helmet images, all of construction workers wearing helmets in the power industry. The dataset contains two category labels (hat and no-hat), with heavy mutual overlap and occlusion.
SHWD [23]: a publicly available dataset for helmet-wearing detection. It contains 7581 images with 9044 helmet-wearing (positive) annotations and 111,514 normal head (unworn or negative) annotations.
Evaluation metrics: following the commonly used evaluation metrics for target detection, we chose mAP(0.5) and mAP(0.5:0.95) to evaluate our method, the baseline and the other high-performance target detectors. In addition, we removed the % symbol from the experimental results for concise presentation.
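For reference, mAP(0.5) and mAP(0.5:0.95) can be computed with the torchmetrics library as sketched below; this is only one possible setup with dummy boxes (the import path may vary slightly across torchmetrics versions) and is not the evaluation code used in this paper.

```python
import torch
from torchmetrics.detection.mean_ap import MeanAveragePrecision

# mAP(0.5:0.95) is reported under 'map' and mAP(0.5) under 'map_50'.
metric = MeanAveragePrecision()
preds = [dict(boxes=torch.tensor([[25., 30., 80., 120.]]),   # dummy prediction
              scores=torch.tensor([0.87]),
              labels=torch.tensor([0]))]
targets = [dict(boxes=torch.tensor([[27., 32., 78., 118.]]), # dummy ground truth
                labels=torch.tensor([0]))]
metric.update(preds, targets)
results = metric.compute()
print(float(results['map_50']), float(results['map']))
```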

4.3 Results and Discussions

Ablation Study: The results of the ablation experiments for PFBDet are shown in Table 1. All experiments were performed on the validation set of SHWD.
PFNet. We first built a base model similar to YOLOv5-lite-s, which uses ShuffleNetV2-1x [32] as the backbone and a PAN structure as the neck, yielding the baseline results in Table 1 (83.8 mAP(0.5) at 86.9 FPS). Further, we replaced the original ShuffleNetV2-1x with the PFNet backbone; the mAP(0.5) was 83.3 while the FPS increased to 99.5. The results are shown in Table 1.
PSA. We then add Polarized Self-Attention to the end of the PFNet backbone. The number of parameters increases by less than 50 K, and mAP(0.5) further increases to 84.3. The results are shown in Table 1.

Table 1. Different configurations of ablation study on PFBDet.

Model                                  | Params | mAP(0.5) | FPS
Base                                   | 1.64M  | 83.8     | 86.9
+PFNet                                 | 1.20M  | 83.3     | 99.5
+PSA                                   | 1.24M  | 84.3     | 93.7
+BCM-PAN                               | 0.92M  | 85.7     | 103.9
+Replace BCM with BCM (GSConv+CARAFE)  | 0.92M  | 87.4     | 110.2

BCM-PAN. In the previous part, we compared the backbone networks ShuffleNetV2-1x and PFNet under the same configuration; the inference speed of PFNet is slightly better than that of ShuffleNetV2-1x. In this part, we add the multi-scale feature fusion structure BCM-PAN on top of the PFNet backbone, replacing the traditional PAN with BCM-PAN: mAP(0.5) improves from 84.3 to 85.7, and the FPS reaches 103.9. Further, we replace the BCM in the neck with the BCM built on GSConv and the lightweight upsampling operator CARAFE, and both accuracy and speed reach a higher level. The results are shown in Table 1.
We verified the effectiveness of the BCM and PSA through ablation experiments. As can be seen from Table 2, both modules bring clear gains on the dataset.

Table 2. Effectiveness of the BCM and PSA on PFBDet.

Model  | PSA | BCM | Params | mAP(0.5) | mAP(0.5:0.95)
PFBDet | ✗   | ✗   | 0.85M  | 85.7     | 49.7
PFBDet | ✔   | ✗   | 0.90M  | 86.8     | 51.5
PFBDet | ✗   | ✔   | 0.89M  | 86.4     | 51.2
PFBDet | ✔   | ✔   | 0.92M  | 87.4     | 52.4

We verify the effectiveness of using GSConv and the lightweight upsampling operator in the BCM through ablation experiments. As can be seen from Table 3, CARAFE and GSConv bring clear gains on our dataset with only a small amount of additional parameters and computation. This also shows that the proposed BCM can provide initial feedback information for each adjacent feature map with few parameters while retaining more accurate localization signals.

Table 3. Ablation study about CARAFE and GSConv on BCM.

Model  | GSConv | CARAFE | Params | mAP(0.5) | mAP(0.5:0.95)
PFBDet | ✗      | ✗      | 0.921M | 85.7     | 50.6
PFBDet | ✔      | ✗      | 0.91M  | 86.8     | 51.9
PFBDet | ✗      | ✔      | 0.93M  | 86.5     | 51.7
PFBDet | ✔      | ✔      | 0.924M | 87.4     | 52.4

Comparison with Other Methods: We tested the speed of all official models at fp16 precision on the same RTX 3090 GPU. Based on the results, the proposed model outperforms all the other methods.

Table 4. The performance of our method on the Elec-hat and HelmetDet datasets (mAP(0.5) / mAP(0.5:0.95) / FPS).

Method              | Elec-hat            | HelmetDet
YOLOv5lite-c        | 89.9 / 68.0 / 86.2  | 82.6 / 65.4 / 85.9
YOLOv5lite-e        | 89.1 / 63.6 / 93.0  | 80.3 / 60.7 / 84.3
YOLOv5lite-s        | 89.7 / 65.2 / 81.9  | 81.7 / 62.4 / 81.3
Nanodet-plus-m      | 85.9 / 58.3 / -     | 75.9 / 45.9 / -
PP-PicoDet-S(320)   | 85.4 / 58.2 / 91.5  | 71.7 / 46.0 / 79.5
PP-PicoDet-S(416)   | 88.9 / 62.3 / 87.9  | 76.1 / 50.3 / 74.1
PP-PicoDet-XS(320)  | 83.7 / 54.8 / 91.7  | 63.9 / 38.8 / 69.8
PP-PicoDet-XS(416)  | 86.2 / 58.6 / 55.7  | 70.9 / 45.2 / 72.2
PP-YOLO-Tiny(320)   | 83.6 / 49.8 / 70.4  | 75.9 / 43.9 / 76.1
PP-YOLO-Tiny(416)   | 87.2 / 53.8 / 70.5  | 78.4 / 45.6 / 58.0
YOLOX-Nano          | 88.3 / 62.1 / 62.5  | 72.4 / 44.6 / 58.5
YOLOX-Tiny          | 89.7 / 61.6 / 80.5  | 80.1 / 45.9 / 54.9
PFBDet              | 91.0 / 62.4 / 96.2  | 84.9 / 69.8 / 88.2

On the publicly available dataset HelmetDet, we compared PFBDet with YOLOv5-lite [24], YOLOX-Tiny [25], YOLOX-Nano [26], PP-YOLO-Tiny [27], PP-PicoDet [28] and Nanodet-plus [29]; the model achieved 84.9% mAP and 88.2 FPS, which are 1.1% and 2.3 higher than the best competing method, respectively. The results are shown in Table 4. Meanwhile, as also shown in Table 4, on the private dataset Elec-hat the model achieved 91.0% mAP and 96.2 FPS, which are 0.7% and 3.2 higher than the best competing method, respectively. As can be seen in Fig. 4, PFBDet produces good detections and can effectively match the criteria of manual inspection.

Fig. 4. The PFBDet detection results: (a) detection results on HelmetDet; (b) detection results on Elec-hat.


Comparison with the SOTA: In addition, to test the robustness and generality of our proposed method, we evaluated the SOTA models on the validation set of the public dataset SHWD, focusing on the accuracy and speed of our method on this public dataset. The evaluation used the same equipment and environment as the experiments on the previous two datasets. We compared PFBDet with YOLOv5-lite, YOLOX-Tiny, YOLOX-Nano, PP-YOLO-Tiny, PP-PicoDet, and YOLOv6-lite [30] in Table 5.

Table 5. The performance of our method on the SHWD dataset.

Model          | Input Size | mAP(0.5) | mAP(0.5:0.95) | FPS   | Params | FLOPs
YOLOv5-lite-c  | 512 × 512  | 84.5     | 52.3          | 87.7  | 4.57M  | 5.92G
YOLOv5-lite-e  | 320 × 320  | 83.0     | 51.2          | 93.4  | 0.78M  | 0.73G
YOLOv5-lite-g  | 640 × 640  | 85.9     | 53.7          | 78.7  | 5.39M  | 15.6G
YOLOv5-lite-s  | 416 × 416  | 83.8     | 52.6          | 86.9  | 1.64M  | 1.66G
PP-PicoDet-S   | 320 × 320  | 77.2     | 46.8          | 97.0  | 0.99M  | 0.73G
PP-PicoDet-S   | 416 × 416  | 84.9     | 52.3          | 107.9 | 0.99M  | 1.24G
PP-PicoDet-XS  | 320 × 320  | 73.1     | 44.0          | 105.4 | 0.70M  | 0.67G
PP-PicoDet-XS  | 416 × 416  | 81.4     | 48.9          | 104.7 | 0.70M  | 1.13G
PP-YOLO-Tiny   | 320 × 320  | 75.2     | 38.8          | 97.2  | 1.08M  | 0.58G
PP-YOLO-Tiny   | 416 × 416  | 82.3     | 43.0          | 101.1 | 1.08M  | 1.02G
YOLOX-Nano     | 640 × 640  | 79.3     | 46.4          | 79.1  | 0.91M  | 1.08G
YOLOX-Tiny     | 640 × 640  | 84.9     | 48.3          | 68.6  | 5.06M  | 6.45G
YOLOv6-lite-s  | 320 × 320  | 72.2     | 44.5          | 60.1  | 0.55M  | 0.56G
YOLOv6-lite-m  | 320 × 320  | 72.9     | 45.3          | 61.7  | 0.79M  | 0.67G
YOLOv6-lite-l  | 224 × 128  | 73.5     | 45.8          | 63.1  | 1.09M  | 0.24G
PFBDet         | 640 × 640  | 87.4     | 52.4          | 110.2 | 0.92M  | 1.32G

5 Conclusion

To address the issue of redundant computations in existing target detectors, this paper proposes a lightweight single-stage target detector called PFBDet. The main contributions of this work are as follows: 1) We design a cross-stage local bottleneck module called FNCSP, which serves as the basis of the efficient lightweight feature extraction network PFNet. By incorporating Polarized Self-Attention, the computational complexity is optimized while capturing more feature information, thereby enhancing the detection performance of the model. 2) We introduce a bidirectional connection module, BCM, based on GSConv and the lightweight upsampling operator CARAFE. Building upon this, an efficient multi-scale feature fusion structure named BCM-PAN is proposed, combined with a single aggregation cross-layer network module. This structure reduces the redundancy overhead associated with multi-scale feature fusion and improves the learning capability of the network. The effectiveness of the proposed method is validated on safety helmet datasets. Experimental results demonstrate that our detector achieves faster inference while obtaining more feature information, outperforming other detectors. In summary, the method achieves real-time and accurate identification of workers wearing helmets, playing a vital role in the safety monitoring of power systems. It should be noted that in helmet detection for power construction scenes, detection under haze, uneven lighting, and irregularly shaped targets is also crucial. Therefore, in future work, we plan to investigate these aspects by incorporating image datasets with data augmentation techniques tailored to different scenes.

Acknowledgement. This work was supported by the Scientific Research Funds of Northeast Electric Power University (No. BSZT07202107).

References 1. Wang, Z., Wu, Y., Yang, L., Thirunavukarasu, A., Evison, C., Zhao, Y.: Fast personal protective equipment detection for real construction sites using deep learning approaches. Sensors 21(10), 3478 (2021). https://doi.org/10.3390/s21103478 2. Foudeh, H., Luk, P., Whidborne, J.: An advanced unmanned aerial vehicle (UAV) approach via learning-based control for overhead power line monitoring: a comprehensive review. IEEE Access 2021(9), 130410–130433 (2021) 3. Liang, W., et al.: Research on detection algorithm of helmet wearing state in electric construction. In: 14th National Conference on Signal and Intelligent Information Processing and Application, pp. 508–512 (2021) 4. Girshick, R.: Fast r-cnn. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 1440–1448 (2020) 5. Wang, H., Hu, Z., Guo, Y., Yang, Z., Zhou, F., Xu, P.: A real-time safety helmet wearing detection approach based on CSYOLOv3. Appl. Sci. 10(19), 6732 (2020). https://doi.org/10.3390/app10196732 6. Han, G., et al.: Method based on the cross-layer attention mechanism and multiscale perception for safety helmet-wearing detection. Comput. Electric. Eng. 95, 107–128 (2021) 7. Li, P.: Research and Implementation of Key Technology of On-site Safety Early Warning Based on Object Detection and Depth Estimation. University of Electronic Science and Technology (2021) 8. Wang, Y.S., Gu, Y.W., Feng, X.C.: Research on detection method of helmet wearing based on attitude estimation. Appl. Res. Comput. 38(3), 937–940 (2021) 9. Yang, Z., et al.: Industrial safety helmet detection algorithm based on depth cascade model. Comput. Modern. 1, 91–97 (2022) 10. Wang, D., et al.: MusiteDeep: a deep-learning based webserver for protein post translational modification site prediction and visualization. Nucl. Acids Res. 48, 140–146 (2020)


11. Yue, H., et al.: Helmet- wearing detection based on improved YOLOv5. Comput. Modern. 6, 104–108 (2022) 12. Zhang, M., Han, Y., Liu, Z.F.: Detection method of high altitude safety protective equipment for construction workers based on deep learning. China Saf. Sci. J. 32(5), 140–146 (2022) 13. Glenn, J.: YOLOv5 release v6.1 (2022). https://github.com/ultralytics/yolov5/ releases/tag/v6.1 14. Chen, J., et al.: Run, don’t walk: chasing higher FLOPS for faster neural networks. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR2023) (2023). https://doi.org/10.48550/arXiv.2303.03667 15. Liu, H.J., et al.: Polarized self attention: towards high-quality pixel-wise regression. arXiv preprint arXiv:2107.00782 (2021) 16. Liu, S., et al.: Path aggregation network for instance segmentation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 8759– 8768 (2018) 17. Wang, J., et al.: CARAFE: content-aware reassembly of features. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 3007–3016 (2019) 18. Zoph, B., et al.: Learning transferable architectures for scalable image recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 8697–8710. IEEE, New Jersey (2018) 19. Ji, X., et al.: Dead zone compensation for proportional directional valve based on bilinear interpolation control strategy. Chin. Hydraul. Pneumat. 45(6), 56–62 (2021) 20. Long, J., Shelhamer, E., Darrell, T.: Fully convolutional networks for semantic segmentation. In: 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 3431–3440 (2015) 21. Li, H., Li, J., Wei, H., Liu, Z., Zhan, Z., Ren, Q.: Slim-neck by GSCONY: a better design paradigm of detector architectures for autonomous vehicles. arXiv preprint arXiv:2206.02424 (2022) 22. https://aistudio.baidu.com/aistudio/datasetdetail/183436 23. https://github.com/njvisionpower/Safety-Helmet-Wearing-Dataset 24. Glenn jocher et al.yolov5 (2021). https://github.com/ultralytics/yolov5 25. Tiny, Y., Ge, Z., Liu, S., Wang, F., Li, Z., Sun, J.: YOLOX: exceeding yolo series in 2021. arXiv preprint arXiv:2107.08430 (2021) 26. Ge, Z., Liu, S., Wang, F., Li, Z., Sun, J.: YOLOX: exceeding yolo series in 2021. arXiv preprint arXiv:2107.08430 (2021) 27. PaddlePaddle Authors. PaddleDetection, object detection and instance segmentation toolkit based on paddlepaddle (2021). https://github.com/PaddlePaddle/ PaddleDetection 28. Yu, G., et al.: PP-PicoDet: a better real-time object detector on mobile devices. arXiv preprint arXiv:2111.00902 (2021) 29. NanoDet Authors. NanoDet (2021). https://github.com/RangiLyu/nanodet 30. Li, C., et al.: YOLOV6: a single-stage object detection framework for industrial applications. arXiv preprint arXiv:2209.02976 (2022) 31. https://aistudio.baidu.com/aistudio/datasetdetail/96283 32. Ye, L.: AugShuffleNet: Improve ShuffleNetV2 via More Information Communication (2022). https://doi.org/10.48550/arXiv.2203.06589

Direct Inter-Intra View Association for Light Field Super-Resolution

Da Yang1,2,3, Hao Sheng1,2,3(B), Shuai Wang1,2,3, Rongshan Chen1,2,3, and Zhang Xiong1,2,3

1 State Key Laboratory of Virtual Reality Technology and Systems, School of Computer Science and Engineering, Beihang University, Beijing 100191, People's Republic of China
{da.yang,shenghao,shuaiwang,rongshan,xiongz}@buaa.edu.cn
2 Zhongfa Aviation Institute, Beihang University, 166 Shuanghongqiao Street, Pingyao Town, Yuhang District, Hangzhou 311115, People's Republic of China
3 Faculty of Applied Sciences, Macao Polytechnic University, Macao, SAR 999078, People's Republic of China

Abstract. Light field (LF) cameras record both intensity and directions of light rays in a scene with a single exposure. However, due to the inevitable trade-off between spatial and angular dimensions, the spatial resolution of LF images is limited, which makes LF super-resolution (LFSR) a research hotspot. The key of LFSR is the complementation across views and the extraction of high-frequency information inside each view. Due to the high-dimensionality of LF data, previous methods usually model these two processes separately, which results in insufficient inter-view information fusion. In this paper, the LF Transformer is proposed for comprehensive perception of 4D LF data. Necessary inter-intra view correlations can be directly established inside each LF Transformer block, so it can handle the complex disparity variations of LF. Then, based on LF Transformers, 4DTNet is designed, which comprehensively performs inter-intra view high-frequency information extraction. Extensive experiments on public datasets demonstrate that 4DTNet outperforms the current state-of-the-art methods both numerically and visually.

Keywords: Light field · Super-resolution · Transformer

1 Introduction

Light field (LF) imaging captures both intensity and directions of light rays in the scene and has recently gained considerable attention as a promising technology in a wide range of applications, such as depth estimation [21,25], semantic segmentation [2,19,42], saliency detection [20,22], de-occlusion [32], rain removal [41], object tracking [23,24,29,30], virtual reality [8,45], etc. By inserting a microlens array between the sensor and the main lens, commercial hand-held LF cameras [1,14,16] capture 4D LF images in a single exposure with the size of an ordinary camera. However, the restricted sensor resolution results in a trade-off between the spatial and angular dimensions. To ensure sufficient sampling rate in angular
domain, spatial resolution is sacrificed in LF cameras, which poses difficulties in potential applications. Therefore light field super-resolution (LFSR), a task to improve the spatial resolution of LF images, has become a research hotspot. LFSR pays great attention to mixing valid complementary information across sub-aperture images (SAIs). Due to the complexity caused by high dimensionality, previous methods usually decouple LF into lower-dimensional data for easier complementary information extraction. Some prior works achieve this by stacking SAIs along specific directions, like the horizontal/vertical SAI stacks in LFNet [33], the star-like input of resLF [46] and MEG-Net [48], etc. Without full access to all the SAIs, these methods easily reach their limit. Others try direct computations in specific dimensions, like the spatial-angular separable convolution (SAS) in LFSSR [43], the spatial-angular interaction in LF-InterNet [35], the alternate angular-spatial Transformers in LFT [12], etc. The alternate spatial-angular operations model the LF structure indirectly, and hence it is hard to fully exploit valid information across views under complex disparity variations. In this paper, we first propose LF Transformers for direct modeling of 4D LF. To be specific, spatial and angular dimensions are processed simultaneously, guided by 4D positional encoding. Any inter-intra view correlation can be constructed directly inside each LF Transformer block. Therefore valid complementary information can be extracted regardless of the complexity in the disparity pattern of the LF. Then 4DTNet is designed based on LF Transformers. In 4DTNet, the details neglected by alternate angular and spatial operations are successfully recovered. As proved in the experiments, 4DTNet extracts and complements valid inter-intra view high-frequency information better than previous methods. In summary, the main contributions of this paper are concluded as follows:
1) LF Transformer is proposed to unite information complementation across SAIs and high-frequency information extraction inside SAIs. LF Transformer is able to establish any necessary inter-view correlation inside one Transformer block and thus can better handle the complex disparity variations of LF.
2) Based on LF Transformers, 4DTNet is proposed, which effectively extracts valid high-frequency information from 4D LF. With LF Transformers, the network handles LFs with various disparity ranges well.
3) Extensive experiments demonstrate the state-of-the-art performance of 4DTNet on various LF datasets and the effectiveness of LF Transformers in inter-intra view high-frequency information extraction.

2 Related Work

In this section, LFSR methods are reviewed from early traditional works to recent learning-based methods.

2.1 Traditional Methods for LFSR

Traditional methods usually perform LFSR through disparity estimation and warping. Wanner and Goldluecke [38,39] first deduced disparity maps from epipolar plane images (EPIs); a super-resolved novel view is then synthesized with a variational model by representing the existing views with the novel view according to the disparity maps. Mitra and Veeraraghavan [15] characterized the pixels in a patch with a single disparity value and proposed to reconstruct patches with higher spatial resolution based on a Gaussian mixture model. All the SAIs are warped to the center view by Heber and Pock [6]; with the assumption that the resulting matrix is of low rank, a robust PCA model is proposed to super-resolve the entire LF image simultaneously. Zhang et al. [47] proposed a micro-lens-based matching term for depth estimation and super-resolution, where the subpixel information is recovered by matching micro-lens images with the target view image. Rossi and Frossard [18] put forward a square constraint for complementing information across SAIs and a graph-based regularizer for maintaining geometric structure. In this way, they cast LFSR into a global optimization problem which super-resolves all SAIs simultaneously. Their algorithm only requires an approximate disparity range rather than precise disparity estimation; however, when the input LF image exceeds the preset disparity range, it performs inferiorly. Traditional methods are easier to implement and less data-hungry. However, they fail to fully exploit the complementary information across SAIs due to disparity estimation of limited precision and the limited manually designed priors.

2.2 Learning-Based Methods for LFSR

Recently, with the superiority of neural networks in feature extraction, learning-based methods have brought LFSR to a new era. The pioneering work LFCNN was proposed by Yoon et al. [44], which performed spatial and angular super-resolution sequentially with a structure similar to SRCNN [3]. Zhang et al. [46] proposed a residual network, resLF, to extract complementary details from other SAIs. In resLF, SAIs with consistent pixel offsets in four directions were grouped into image stacks to promote the learning of inherent geometric relations. However, the star-like input structure of resLF failed to exploit all SAIs. The authors alleviated the problem in MEG-Net [48]: for each target view, all the available SAIs in four directions are exploited for subpixel information extraction. Yeung et al. [43] adopted 4D convolution and SAS to characterize the complicated structure within and across SAIs, and demonstrated that simultaneous processing of the spatial and angular domains was more effective than the alternate scheme. Jin et al. [9] proposed to extract complementary information for the target view through combinatorial geometry embedding, and put forward a structure-aware loss defined on EPIs for further regularization of structural consistency. LF-InterNet [35] first decoupled and extracted spatial and angular information separately; spatial and angular features were then interacted repetitively before the final fusion for super-resolution. It was further improved in DistgSSR [34] with feature extraction on EPIs. The repetitive interaction between the angular and spatial domains was also adopted by LF-IINet [13], where two special modules, IntraFUM and InterFUM, were designed for intra-view feature updating with the assistance of the inter-view feature and vice versa. LF-DFNet [36] was proposed by Wang et al. based on deformable convolution, with a collect-and-distribute scheme designed to align the center-view feature and each side-view feature bidirectionally. Recently, Vision Transformer [4] has proved its effectiveness in modeling long-range correlations, which is critical in LFSR. DPT [31] treated LF as horizontal and vertical sequences; SA-LSA layers were proposed to extract global contextual information across SAIs from the original LF and its gradient maps, and a cross-attention fusion Transformer was then introduced to integrate the two features for the final SR. LFT [12] proposed an angular Transformer and a spatial Transformer to model dependencies across and inside SAIs, respectively; angular and spatial information is incorporated by alternating angular and spatial Transformers. By decoupling LF into subspaces, the high-frequency information across views underneath can be discovered more easily. However, the artificial fragmentation of LF data also destroys part of the internal correlations and therefore limits the performance of these methods.

3 Architecture of 4DTNet

A 4D LF is denoted as L ∈ R^{U×V×X×Y}, where U × V represents the angular resolution and X × Y the spatial resolution. With α denoting the upscaling factor, the task of LFSR is to recover high-resolution (HR) SAIs {L^{HR}_{(u,v)} ∈ R^{αX×αY}} from their low-resolution (LR) counterparts {L^{LR}_{(u,v)} ∈ R^{X×Y}}.
The key of LFSR mainly lies in two aspects: high-frequency information complementation across SAIs and high-frequency information extraction inside each SAI. Previous methods tend to model the above processes separately. However, the strict division of the spatial and angular domains increases the difficulty in building effective inter-view correlations. Hence, in this paper, the LF Transformer is proposed for simultaneous modeling of the two processes. In the LF Transformer, any correlation between SAIs can be constructed directly, which both reduces the loss of angular information and improves the efficiency of inter-view high-frequency information complementation. Based on LF Transformers, 4DTNet is designed for high-quality LFSR. In this section, we first illustrate the overall architecture of our 4DTNet, and then the detailed implementation of the LF Transformer is introduced.

3.1 Overall Architecture

As shown in Fig. 1, 4DTNet is designed in a global residual structure. The network focuses on learning the high-frequency details absent in the LR LF.

Fig. 1. The overall architecture of 4DTNet. Initial features are first extracted with cascaded 3 × 3 convolutions. LF Transformers are sequentially placed for comprehensive inter-intra view high-frequency information extraction. Combining the outputs of the LF Transformer sequence and bicubic interpolation, the final super-resolved LF is reconstructed.

Initial Feature Extraction. The network first processes the LR LF L^{LR} with cascaded 3 × 3 convolutions on each of its SAIs for local spatial embedding:

F_init = f_init(L^{LR}),    (1)

where F_init ∈ R^{U×V×X×Y×d} and d = 64.

Inter-Intra View High-Frequency Information Extraction. The initial features are then fed into sequential LF Transformers for inter-intra view high-frequency information extraction:

F_LF = Trans_LF(··· (Trans_LF(F_init))),    (2)

where F_LF ∈ R^{U×V×X×Y×d}. With full access to all the pixels from all the SAIs, LF Transformers can model the correlations across and inside views more comprehensively. Different from previous methods, which adopt an indirect inter-view information association scheme, LF Transformers can well handle complex disparity variation. The details of the LF Transformer are introduced in Sect. 3.2. The number of LF Transformers is set to 8 in our experiments.

Super-Resolved LF Reconstruction. The output of the sequence of LF Transformers is first up-sampled based on PixelShuffle [26]:

F^{SR}_LF = Up(F_LF),    (3)

where F^{SR}_LF ∈ R^{U×V×αX×αY}. The final super-resolved LF is obtained through direct combination of the up-sampled feature and the bicubic interpolation of the LR LF:

L^{SR} = F^{SR}_LF + Bicubic(L^{LR}),    (4)

where L^{SR} ∈ R^{U×V×αX×αY}.
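A compact sketch of how Eqs. (1)-(4) fit together is given below. The LF Transformer blocks are stubbed out with identity placeholders (their structure is detailed in Sect. 3.2), and all module names and layer choices are illustrative assumptions rather than the released implementation.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class FourDTNetSkeleton(nn.Module):
    # Skeleton of Eqs. (1)-(4); LF Transformer blocks are stand-ins (nn.Identity).
    def __init__(self, d=64, n_blocks=8, scale=2):
        super().__init__()
        self.scale = scale
        self.f_init = nn.Sequential(                  # Eq. (1): cascaded 3x3 convs per SAI
            nn.Conv2d(1, d, 3, padding=1), nn.LeakyReLU(0.1, inplace=True),
            nn.Conv2d(d, d, 3, padding=1))
        self.lf_blocks = nn.ModuleList([nn.Identity() for _ in range(n_blocks)])  # Sect. 3.2
        self.up = nn.Sequential(                      # Eq. (3): PixelShuffle upsampling
            nn.Conv2d(d, scale ** 2, 3, padding=1), nn.PixelShuffle(scale))

    def forward(self, lf_lr):                         # lf_lr: (B, U, V, X, Y), Y channel only
        b, u, v, x, y = lf_lr.shape
        sai = lf_lr.reshape(b * u * v, 1, x, y)
        feat = self.f_init(sai)                       # (B*U*V, d, X, Y)
        for blk in self.lf_blocks:                    # Eq. (2): sequential LF Transformers
            feat = blk(feat)
        sr = self.up(feat).reshape(b, u, v, x * self.scale, y * self.scale)
        bic = F.interpolate(sai, scale_factor=self.scale, mode='bicubic',
                            align_corners=False)
        bic = bic.reshape(b, u, v, x * self.scale, y * self.scale)
        return sr + bic                               # Eq. (4): global residual with bicubic
```

For a 5 × 5 LF, the input is a tensor of shape (B, 5, 5, X, Y) holding the Y channel, and the output has shape (B, 5, 5, αX, αY).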

3.2 Structure of LF Transformer

In this subsection, the detailed architecture of LF Transformer is illustrated. Previous works usually decouple 4D LF into lower-dimensional data for easier information extraction. However, the manual separation on 4D LF results in unnecessary loss in angular information. In this paper, we adapt Transformers directly to 4D LF, named LF Transformer. LF Transformer has two stages. In the first stage, inter-view complementary information is extracted. Different from previous works, LF Transformer can build direct correlation inside the 4D LF which is much more effective than gradual interaction based on alternate spatial-angular operations. Then in the second stage, the inter-view complementary information is integrated into each SAI and intra-view high-frequency information is further excavated. With LF Transformers, inter-intra view correlations can be well established and hence better performance is achieved as shown in Sect. 4. In the first stage, the input Fin ∈ RU ×V ×X×Y ×d of LF Transformer is first reshaped into pixel-level LF tokens TLF ∈ RU V XY ×d . Then 4D positional encoding is performed to embed accurate positional information both angularly and spatially:     PLF (pu , pv , px , py , 2i) = sin pu /α2i/d + sin pv /α2i/d     + sin px /α2i/d + sin py /α2i/d ,     (5) PLF (pu , pv , px , py , 2i + 1) = cos pu /α2i/d + cos pv /α2i/d     + cos px /α2i/d + cos py /α2i/d , where pu = 1, · · · , U , pv = 1, · · · , V , px = 1, · · · , X, py = 1, · · · , Y , and i indicates potision in the embedding dimension. Following [28], α is set to 10000. Query QLF and key KLF are generated by layer-normalizing the summation of tokens TLF and positional encoding vectors PLF , and TLF is directly used as value VLF : QLF = KLF = LN (TLF + PLF ), (6) VLF = TLF . Multi-head self-attention (MHSA) is used to explore the correlation among LF tokens, i.e., the correlation among all the pixels recorded by the 4D LF: M HSA TLF = M HSA(QLF , KLF , VLF ) = Concat(H1 , ..., HNH )WO ,

(7)

where WO ∈ Rd×d . Any necessary inter-view correlation can be found here for complementary information extraction. The hth head Hh is computed as:  ⎞ ⎛ T QLF,h WQ,h KLF,h WK,h ⎠ VLF WV,h ,  (8) Hh = Sof tmax ⎝ d/NH

Direct Inter-Intra View Association for Light Field Super-Resolution

271

where h = 1, ..., NH denotes the index of heads. WQ,h , WK,h and WV,h ∈ Rd/NH ×d/NH controls the linear projection of query QLF , key KLF and value VLF , respectively. M HSA is added with the original LF tokens TLF : Then TLF

M HSA  TLF (9) = TLF + TLF . Another layer-normalization is performed before it is fed into a multi-layer perceptron: ∗   TLF = M LP (LN (TLF )) + TLF , (10) ∗ where TLF is of same shape as TLF . In the second stage, to further integrate the extracted inter-view complementary information and excavate intra-view high-frequency details, LF Transformer ∗ was reshaped into SAI-level: performs information fusion inside each SAI. TLF ∗ TSAI = Reshape(TLF ),

(11)

where TSAI ∈ RU V ×XY ×d . Then the same structure as the first stage is applied on each SAI, other than that 2D positional encoding is imposed here:     PSAI (px , py , 2i) = sin px /α2i/d + sin py /α2i/d ,     (12) PSAI (px , py , 2i + 1) = cos px /α2i/d + cos py /α2i/d , where px = 1, · · · , X, py = 1, · · · , Y , and i indicates potision in the embedding dimension. α is also set to 10000. In the end, untonkenization is performed and the feature maps are reshaped back to Fout ∈ RU ×V ×X×Y ×d . In the first stage of LF Transformer, inter-view correlations with complex disparity variations are established, benefiting from the full access to the 4D LF data and 4D positional encoding. Then in the second stage, the inter-view complementary information is integrated and the intra-view high-frequency details are further excavated. Different from previous indirect inter-view information association schemes, LF Transformer handles complex LF data with complicated disparity variations more adequately. The effectiveness of LF Transformer and 4DTNet is discussed in Sect. 4.
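The first stage can be sketched in PyTorch as follows. nn.MultiheadAttention is used for MHSA, so its internal query/key/value projections stand in for W_{Q,h}, W_{K,h} and W_{V,h}; the function and class names are illustrative, and the sketch approximates rather than reproduces Eqs. (5)-(10).

```python
import torch
import torch.nn as nn

def lf_positional_encoding(U, V, X, Y, d, alpha=10000.0):
    # 4D sinusoidal encoding of Eq. (5): each token gets the sum of per-axis encodings.
    i = torch.arange(d // 2, dtype=torch.float32)
    freq = alpha ** (2 * i / d)                                    # (d/2,)
    pu, pv, px, py = torch.meshgrid(
        torch.arange(1, U + 1.), torch.arange(1, V + 1.),
        torch.arange(1, X + 1.), torch.arange(1, Y + 1.), indexing='ij')
    pos = torch.stack([pu, pv, px, py], dim=-1).reshape(-1, 4, 1)  # (UVXY, 4, 1)
    angles = pos / freq                                            # (UVXY, 4, d/2)
    pe = torch.zeros(U * V * X * Y, d)
    pe[:, 0::2] = torch.sin(angles).sum(dim=1)                     # even dims: summed sines
    pe[:, 1::2] = torch.cos(angles).sum(dim=1)                     # odd dims: summed cosines
    return pe                                                      # (UVXY, d)

class LFTransformerStage1(nn.Module):
    # First stage: MHSA over all U*V*X*Y pixel-level tokens (Eqs. (5)-(10), approximated).
    def __init__(self, d=64, n_heads=8):
        super().__init__()
        self.ln1, self.ln2 = nn.LayerNorm(d), nn.LayerNorm(d)
        self.mhsa = nn.MultiheadAttention(d, n_heads, batch_first=True)
        self.mlp = nn.Sequential(nn.Linear(d, 2 * d), nn.GELU(), nn.Linear(2 * d, d))

    def forward(self, feat):                                       # feat: (B, U, V, X, Y, d)
        b, u, v, x, y, d = feat.shape
        tokens = feat.reshape(b, u * v * x * y, d)                 # pixel-level LF tokens T_LF
        pe = lf_positional_encoding(u, v, x, y, d).to(tokens)      # Eq. (5)
        qk = self.ln1(tokens + pe)                                 # Eq. (6): Q = K = LN(T + P)
        attn, _ = self.mhsa(qk, qk, tokens)                        # Eqs. (7)-(8), V = T_LF
        tokens = tokens + attn                                     # Eq. (9): residual add
        tokens = tokens + self.mlp(self.ln2(tokens))               # Eq. (10): MLP branch
        return tokens.reshape(b, u, v, x, y, d)
```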

4 Experiments

In this section, we first introduce the implementation details of the experiments. Then comparison with the state-of-the-art methods is conducted. Our 4DTNet outperforms other methods on various datasets. An ablation study is performed in the end to validate the contribution of our designs.

4.1 Implementation Details

We adopt 5 public datasets [7,11,17,27,40] in our experiments. The separation of training set and test set is the same as previous works [12,34]. For fair comparison, we also use the 5 × 5 angular resolution. During training and testing,
5 × 5 × 32 × 32 / 5 × 5 × 64 × 64 patches are cropped from LF images for 2× / 4× LFSR respectively, and 5 × 5 × 16 × 16 input patches are generated with bicubic interpolation. The training data is augmented by 8 times with random transposition and random flipping (left-right, up-down). Following previous works, super-resolution is performed on the Y channel only. Both peak signal-to-noise ratio (PSNR) and structural similarity (SSIM) [37] are adopted for quantitative evaluation. The network is implemented in PyTorch (1.13.0) and trained with one NVIDIA RTX A6000 GPU. The weights of the model are initialized with He initialization [5]. Our network is trained with the L1 loss and the Adam optimizer [10]. The learning rate is initialized to 8e-4 and decreased by a factor of 0.5 every 20 epochs, and training stops at 80 epochs. A minibatch size of 8 is used to accelerate training and smooth the training curve.
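The optimizer and learning-rate schedule described above correspond to the following sketch; the placeholder module and the empty loop body are illustrative, not the actual training script.

```python
import torch.nn as nn
import torch.optim as optim

model = nn.Conv2d(1, 64, 3, padding=1)        # placeholder standing in for 4DTNet
criterion = nn.L1Loss()                       # L1 training loss
optimizer = optim.Adam(model.parameters(), lr=8e-4)
scheduler = optim.lr_scheduler.StepLR(optimizer, step_size=20, gamma=0.5)

for epoch in range(80):                       # training stops at 80 epochs
    # ... iterate over mini-batches of size 8 and minimize criterion(sr, hr) ...
    scheduler.step()                          # halve the learning rate every 20 epochs
```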

4.2 Comparison with State-of-the-Art Methods

10 state-of-the-art LFSR methods [9,12,13,31,34–36,43,46,48] are used for comparison. For all the methods, we use their official published models and weights. Both upscaling factors of ×2 and ×4 are adopted in the comparisons.

Table 1. Comparison with the state-of-the-art methods on 5 public datasets (PSNR/SSIM). In the original table, red and blue mark the best and second-best results. 4DTNet outperforms other methods on all datasets in both metrics.

Method           | Scale | EPFL [17]   | HCInew [7]  | HCIold [40] | INRIA [11]  | STFgantry [27] | Average
resLF [46]       | ×2    | 33.62/0.971 | 36.69/0.974 | 43.42/0.993 | 35.40/0.980 | 38.35/0.990    | 37.49/0.982
LFSSR [43]       | ×2    | 33.67/0.974 | 36.80/0.975 | 43.81/0.994 | 35.28/0.983 | 37.94/0.990    | 37.50/0.983
LF-ATO [9]       | ×2    | 34.27/0.976 | 37.24/0.977 | 44.21/0.994 | 36.17/0.984 | 39.64/0.993    | 38.31/0.985
LF-InterNet [35] | ×2    | 34.11/0.976 | 37.17/0.976 | 44.57/0.995 | 35.83/0.984 | 38.44/0.991    | 38.02/0.984
LF-DFNet [36]    | ×2    | 34.51/0.976 | 37.42/0.977 | 44.20/0.994 | 36.42/0.984 | 39.43/0.993    | 38.39/0.985
MEG-Net [48]     | ×2    | 34.31/0.977 | 37.42/0.978 | 44.10/0.994 | 36.10/0.985 | 38.77/0.992    | 38.14/0.985
LF-IINet [13]    | ×2    | 34.73/0.977 | 37.77/0.979 | 44.85/0.995 | 36.57/0.985 | 39.89/0.994    | 38.76/0.986
DPT [31]         | ×2    | 34.49/0.976 | 37.36/0.977 | 44.30/0.994 | 36.41/0.984 | 39.43/0.993    | 38.40/0.985
LFT [12]         | ×2    | 34.80/0.978 | 37.84/0.979 | 44.52/0.995 | 36.59/0.986 | 40.51/0.994    | 38.85/0.986
DistgSSR [34]    | ×2    | 34.81/0.979 | 37.96/0.980 | 44.94/0.995 | 36.59/0.986 | 40.40/0.994    | 38.94/0.987
4DTNet (Ours)    | ×2    | 34.93/0.979 | 38.00/0.981 | 44.99/0.995 | 36.85/0.987 | 40.65/0.995    | 39.08/0.987
resLF [46]       | ×4    | 28.26/0.904 | 30.72/0.911 | 36.71/0.968 | 30.34/0.941 | 30.19/0.937    | 31.24/0.932
LFSSR [43]       | ×4    | 28.60/0.912 | 30.93/0.915 | 36.91/0.970 | 30.59/0.947 | 30.57/0.943    | 31.52/0.937
LF-ATO [9]       | ×4    | 28.51/0.912 | 30.88/0.914 | 37.00/0.970 | 30.71/0.948 | 30.61/0.943    | 31.54/0.937
LF-InterNet [35] | ×4    | 28.81/0.916 | 30.96/0.916 | 37.15/0.972 | 30.78/0.949 | 30.37/0.941    | 31.61/0.939
LF-DFNet [36]    | ×4    | 28.77/0.917 | 31.23/0.920 | 37.32/0.972 | 30.83/0.950 | 31.15/0.949    | 31.86/0.942
MEG-Net [48]     | ×4    | 28.75/0.916 | 31.10/0.918 | 37.29/0.972 | 30.67/0.949 | 30.77/0.945    | 31.72/0.940
LF-IINet [13]    | ×4    | 29.04/0.919 | 31.33/0.921 | 37.62/0.973 | 31.03/0.952 | 31.26/0.950    | 32.06/0.943
DPT [31]         | ×4    | 28.94/0.917 | 31.20/0.919 | 37.41/0.972 | 30.96/0.950 | 31.15/0.949    | 31.93/0.941
LFT [12]         | ×4    | 29.26/0.921 | 31.46/0.922 | 37.63/0.974 | 31.21/0.952 | 31.86/0.955    | 32.28/0.945
DistgSSR [34]    | ×4    | 28.99/0.920 | 31.38/0.922 | 37.56/0.973 | 30.99/0.952 | 31.65/0.954    | 32.12/0.944
4DTNet (Ours)    | ×4    | 29.42/0.922 | 31.51/0.923 | 37.71/0.974 | 31.49/0.953 | 32.21/0.957    | 32.47/0.946


Quantitative Results. Table 1 lists quantitative comparisons with the state-of-the-art methods on 5 datasets. For both ×2 and ×4, 4DTNet achieves the highest PSNR and SSIM scores on all the 5 datasets. As shown in Table 1, the performance improvements are larger on real-world datasets (EPFL, INRIA and STFgantry) than on synthetic datasets (HCInew and HCIold). For example, in ×4 tasks, the average gain in PSNR on real-world datasets is 0.27 dB (vs 0.06 dB on synthetic datasets). In general, the real-world datasets are much more complicated, with complex scenes, varying imaging quality and complex disparity ranges. However, benefiting from the direct inter-view correlation construction of the LF Transformer, our method achieves better performance regardless of the complexity of the LF data. Note that although 4DTNet only uses 16 × 16 patches, it still surpasses other methods which use 32 × 32 patches on STFgantry with its larger disparity range. This indicates that 4DTNet can recognize and organize useful complementary information across SAIs more effectively.

Fig. 2. The ×4 SR results of small-disparity ISO Chart 1 in EPFL. Two typical hard areas are enlarged for better observation. 4DTNet is the only one that clearly reconstructs the thin stripes in the green box. It is also the only one that achieves partial success in reconstructing the repetitive patterns in the red box.

Qualitative Results. Visual results on the small-disparity dataset EPFL and the large-disparity dataset STFgantry are shown in Fig. 2 and Fig. 3, respectively. The harder ×4 task is chosen for demonstration. In the classic ISO Chart 1, our method is the only one that manages to reconstruct the thin stripes in the green box, and it also approximately recovers the repetitive patterns in the red box, on which the other methods totally fail. As for the scene Lego Knights in STFgantry, the relatively large disparity hinders the other methods from extracting useful complementary details, while 4DTNet better reconstructs fine structures like the brick joints in the green box in Fig. 3. The large disparity also makes the area in the red box invisible in most SAIs as it moves out of the field of view. Compared with other methods, the global perception of the LF frees 4DTNet from the influence of this invisibility; it extracts only reliable complementary information and successfully reconstructs the shadows on the toy bricks.


Fig. 3. The ×4 SR results of large-disparity Lego Knights in STFgantry. Two typical hard areas are enlarged for better observation. In spite of the large disparity of the scene, 4DTNet clearly reconstructs the thin stripes in the green box. It is also the only one that successfully reconstructs the shadows in the red box which is invisible in most SAIs.

4.3 Ablation Study

In this subsection, we compare 4DTNet with several variations to investigate the benefits of our designs. The experiments are performed under upscaling factor ×4. Bicubic interpolation is used as the baseline.
Structure of LF Transformer. We first investigate the contribution of the two stages in the LF Transformer. With only the second stage, 4DTNet degrades into a single-image super-resolution method. As shown in Table 2, it still performs significantly better than bicubic interpolation, but inferior to the variation with only the first stage. This shows that inter-view complementation plays an important role in LFSR. Note that without either stage, the model performs far inferior to the original design. This indicates that although the first stage can partially model intra-view information extraction, it mainly focuses on inter-view information complementation compared with the second stage. The experiments prove the complementarity between the two stages and the effectiveness of the structure of the LF Transformer.
Effectiveness of Positional Encoding. Then we investigate the effectiveness of positional encoding in the LF Transformer. As shown in Table 2, the average PSNR drops by 0.08 dB and 0.13 dB when working without 2D positional encoding and 4D positional encoding, respectively. On the one hand, this proves the effectiveness of positional encoding in guiding the perception of inter-intra view information. On the other hand, compared with the 2D positional encoding functioning inside the SAI space, the 4D positional encoding contributes more to the overall performance due to its guidance role in the comprehensive perception of the LF.


Table 2. Performance of variants of 4DTNet with factor ×4 (PSNR/SSIM).

Method         | Scale | EPFL [17]   | HCInew [7]  | HCIold [40] | INRIA [11]  | STFgantry [27] | Average
Bicubic        | ×4    | 25.26/0.832 | 27.71/0.852 | 32.58/0.934 | 26.95/0.887 | 26.09/0.845    | 27.72/0.870
only 1st stage | ×4    | 28.72/0.910 | 30.85/0.913 | 36.85/0.969 | 30.90/0.946 | 31.04/0.946    | 31.67/0.937
only 2nd stage | ×4    | 27.82/0.886 | 29.64/0.887 | 35.20/0.954 | 29.72/0.926 | 28.98/0.911    | 30.27/0.913
w/o 2D Pos     | ×4    | 29.38/0.921 | 31.47/0.923 | 37.70/0.973 | 31.39/0.952 | 31.99/0.956    | 32.39/0.945
w/o 4D Pos     | ×4    | 29.35/0.921 | 31.45/0.922 | 37.68/0.973 | 31.33/0.952 | 31.87/0.955    | 32.34/0.945
4DTNet (Ours)  | ×4    | 29.42/0.922 | 31.51/0.923 | 37.71/0.974 | 31.49/0.953 | 32.21/0.957    | 32.47/0.946

5 Conclusion

In this paper, LF Transformer is proposed for comprehensive inter-intra view high-frequency information extraction. With full access to all pixels in 4D LF, LF Transformer is able to build any necessary inter-intra view correlation. Based on LF Transformers, 4DTNet is proposed and achieves state-of-the-art performance, which is demonstrated by extensive experiments. As LF Transformer is proved to be effective in inter-intra view correlation extraction, in the future, we will further dig into other tasks that rely on comprehensive perception of LF such as semantic segmentation. Acknowledgements. This study is partially supported by the National Key R&D Program of China (No. 2022YFC3803600), the National Natural Science Foundation of China (No.62372023), and the Open Fund of the State Key Laboratory of Software Development Environment (No. SKLSDE-2023ZX-11). Thank you for the support from HAWKEYE Group.

References 1. Adelson, E.H., Wang, J.Y.A.: Single lens stereo with a plenoptic camera. IEEE Trans. Pattern Anal. Mach. Intell. 14(2), 99–106 (1992). https://doi.org/10.1109/ 34.121783 2. Cong, R., Yang, D., Chen, R., Wang, S., Cui, Z., Sheng, H.: Combining implicitexplicit view correlation for light field semantic segmentation. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 9172–9181 (June 2023) 3. Dong, C., Loy, C.C., He, K., Tang, X.: Learning a deep convolutional network for image super-resolution. In: Fleet, D., Pajdla, T., Schiele, B., Tuytelaars, T. (eds.) ECCV 2014. LNCS, vol. 8692, pp. 184–199. Springer, Cham (2014). https://doi. org/10.1007/978-3-319-10593-2_13 4. Dosovitskiy, A., et al.: An image is worth 16x16 words: transformers for image recognition at scale. In: International Conference on Learning Representations (ICLR) (2021). https://openreview.net/forum?id=YicbFdNTTy 5. He, K., Zhang, X., Ren, S., Sun, J.: Delving deep into rectifiers: surpassing humanlevel performance on imagenet classification. In: Proceedings of the 2015 IEEE International Conference on Computer Vision (ICCV), pp. 1026–1034 (2015). https://doi.org/10.1109/ICCV.2015.123


6. Heber, S., Pock, T.: Shape from light field meets robust PCA. In: 13th European Conference on Computer Vision (ECCV), pp. 751–767 (2014) 7. Honauer, K., Johannsen, O., Kondermann, D., Goldluecke, B.: A dataset and evaluation methodology for depth estimation on 4d light fields. In: 13th Asian Conference on Computer Vision (ACCV), pp. 19–34 (2016) 8. Huang, F.C., Chen, K., Wetzstein, G.: The light field stereoscope: immersive computer graphics via factored near-eye light field displays with focus cues. ACM Trans. Graph. 34(4), 60:1–60:12 (2015). https://doi.org/10.1145/2766922 9. Jin, J., Hou, J., Chen, J., Kwong, S.: Light field spatial super-resolution via deep combinatorial geometry embedding and structural consistency regularization. In: 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 2257–2266 (2020). https://doi.org/10.1109/CVPR42600.2020.00233 10. Kingma, D.P., Ba, J.: Adam: a method for stochastic optimization. In: 3rd International Conference on Learning Representations (ICLR), pp. 1–15 (2015) 11. Le Pendu, M., Jiang, X., Guillemot, C.: Light field inpainting propagation via low rank matrix completion. IEEE Trans. Image Process. 27(4), 1981–1993 (2018). https://doi.org/10.1109/TIP.2018.2791864 12. Liang, Z., Wang, Y., Wang, L., Yang, J., Zhou, S.: Light field image super-resolution with transformers. IEEE Signal Process. Lett. 29, 563–567 (2022). https://doi.org/ 10.1109/LSP.2022.3146798 13. Liu, G., Yue, H., Wu, J., Yang, J.: Intra-inter view interaction network for light field image super-resolution. IEEE Trans. Multim. 25, 256–266 (2023). https:// doi.org/10.1109/TMM.2021.3124385 14. Lumsdaine, A., Georgiev, T.: The focused plenoptic camera. In: 2009 IEEE International Conference on Computational Photography (ICCP), pp. 1–8 (April 2009). https://doi.org/10.1109/ICCPHOT.2009.5559008 15. Mitra, K., Veeraraghavan, A.: Light field denoising, light field superresolution and stereo camera based refocussing using a gmm light field patch prior. In: 2012 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), pp. 22–28 (2012). https://doi.org/10.1109/CVPRW. 2012.6239346 16. Ng, R., Levoy, M., Brédif, M., Duval, G., Horowitz, M., Hanrahan, P.: Light field photography with a hand-held plenoptic camera. Comput. Sci. Tech. Rep. 2(11), 1–11 (2005) 17. Rerabek, M., Ebrahimi, T.: New light field image dataset (2016). https:// infoscience.epfl.ch/record/218363 18. Rossi, M., Frossard, P.: Geometry-consistent light field super-resolution via graphbased regularization. IEEE Trans. Image Process. 27(9), 4207–4218 (2018). https://doi.org/10.1109/TIP.2018.2828983 19. Sheng, H., Cong, R., Yang, D., Chen, R., Wang, S., Cui, Z.: Urbanlf: a comprehensive light field dataset for semantic segmentation of urban scenes. IEEE Trans. Circuits Syst. Video Technol. 32(11), 7880–7893 (2022). https://doi.org/10.1109/ TCSVT.2022.3187664 20. Sheng, H., Liu, X., Zhang, S.: Saliency analysis based on depth contrast increased. In: 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 1347–1351 (2016). https://doi.org/10.1109/ICASSP.2016.7471896 21. Sheng, H., Zhang, S., Cao, X., Fang, Y., Xiong, Z.: Geometric occlusion analysis in depth estimation using integral guided filter for light-field image. IEEE Trans. Image Process. 26(12), 5758–5771 (2017). https://doi.org/10.1109/TIP. 2017.2745100

Direct Inter-Intra View Association for Light Field Super-Resolution

277

22. Sheng, H., Zhang, S., Liu, X., Xiong, Z.: Relative location for light field saliency detection. In: 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 1631–1635 (2016). https://doi.org/10.1109/ICASSP. 2016.7471953 23. Sheng, H., Zhang, Y., Chen, J., Xiong, Z., Zhang, J.: Heterogeneous association graph fusion for target association in multiple object tracking. IEEE Trans. Circuits Syst. Video Technol. 29(11), 3269–3280 (2019). https://doi.org/10.1109/TCSVT. 2018.2882192 24. Sheng, H., et al.: Hypothesis testing based tracking with spatio-temporal joint interaction modeling. IEEE Trans. Circuits Syst. Video Technol. 30(9), 2971–2983 (2020). https://doi.org/10.1109/TCSVT.2020.2988649 25. Sheng, H., Zhao, P., Zhang, S., Zhang, J., Yang, D.: Occlusion-aware depth estimation for light field using multi-orientation epis. Pattern Recogn. 74, 587–599 (2018). https://doi.org/10.1016/j.patcog.2017.09.010 26. Shi, W., et al.: Real-time single image and video super-resolution using an efficient sub-pixel convolutional neural network. In: 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1874–1883 (2016). https://doi.org/ 10.1109/CVPR.2016.207 27. Vaish, V., Adams, A.: The (New) Stanford Light Field Archive. Stanford University, Computer Graphics Laboratory (2008) 28. Vaswani, A., et al.: Attention is all you need. In: Proceedings of the 31st International Conference on Neural Information Processing Systems (NIPS 2017), pp. 6000–6010. Curran Associates Inc., Red Hook (2017) 29. Wang, S., Sheng, H., Yang, D., Zhang, Y., Wu, Y., Wang, S.: Extendable multiple nodes recurrent tracking framework with rtu++. IEEE Trans. Image Process. 31, 5257–5271 (2022). https://doi.org/10.1109/TIP.2022.3192706 30. Wang, S., Sheng, H., Zhang, Y., Yang, D., Shen, J., Chen, R.: Blockchainempowered distributed multi-camera multi-target tracking in edge computing. In: IEEE Transactions on Industrial Informatics, pp. 1–10 (2023). https://doi.org/10. 1109/TII.2023.3261890 31. Wang, S., Zhou, T., Lu, Y., Di, H.: Detail-preserving transformer for light field image super-resolution. Proc. AAAI Conf. Artif. Intell. 3, 2522–2530 (2022). https://doi.org/10.1609/aaai.v36i3.20153 32. Wang, X., Liu, J., Chen, S., Wei, G.: Effective light field de-occlusion network based on swin transformer. In: IEEE Transactions on Circuits and Systems for Video Technology, p. 1 (2022). https://doi.org/10.1109/TCSVT.2022.3226227 33. Wang, Y., Liu, F., Zhang, K., Hou, G., Sun, Z., Tan, T.: Lfnet: a novel bidirectional recurrent convolutional neural network for light-field image super-resolution. IEEE Trans. Image Process. 27(9), 4274–4286 (2018). https://doi.org/10.1109/TIP.2018. 2834819 34. Wang, Y., et al.: Disentangling light fields for super-resolution and disparity estimation. In: IEEE Transactions on Pattern Analysis and Machine Intelligence, p. 1 (2022). https://doi.org/10.1109/TPAMI.2022.3152488 35. Wang, Y., Wang, L., Yang, J., An, W., Yu, J., Guo, Y.: Spatial-angular interaction for light field image super-resolution. In: 16th European Conference on Computer Vision (ECCV), pp. 290–308 (2020) 36. Wang, Y., Yang, J., Wang, L., Ying, X., Wu, T., An, W., Guo, Y.: Light field image super-resolution using deformable convolution. IEEE Trans. Image Process. 30, 1057–1071 (2021). https://doi.org/10.1109/TIP.2020.3042059

278

D. Yang et al.

37. Wang, Z., Bovik, A., Sheikh, H., Simoncelli, E.: Image quality assessment: from error visibility to structural similarity. IEEE Trans. Image Process. 13(4), 600–612 (2004). https://doi.org/10.1109/TIP.2003.819861 38. Wanner, S., Goldluecke, B.: Variational light field analysis for disparity estimation and super-resolution. IEEE Trans. Pattern Anal. Mach. Intell. 36(3), 606–619 (2014). https://doi.org/10.1109/TPAMI.2013.147 39. Wanner, S., Goldluecke, B.: Spatial and angular variational super-resolution of 4d light fields. In: 12th European Conference on Computer Vision (ECCV), pp. 608–621 (2012) 40. Wanner, S., Meister, S., Goldlücke, B.: Datasets and benchmarks for densely sampled 4d light fields. In: Vision, Modeling & Visualization, pp. 225–226 (2013) 41. Yan, T., Li, M., Li, B., Yang, Y., Lau, R.W.H.: Rain removal from light field images with 4d convolution and multi-scale gaussian process. IEEE Trans. Image Process. 32, 921–936 (2023). https://doi.org/10.1109/TIP.2023.3234692 42. Yang, D., Zhu, T., Wang, S., Wang, S., Xiong, Z.: Lfrsnet: a robust light field semantic segmentation network combining contextual and geometric features. Front. Environ. Sci. 10 (2022). https://doi.org/10.3389/fenvs.2022.996513 43. Yeung, H.W.F., Hou, J., Chen, X., Chen, J., Chen, Z., Chung, Y.Y.: Light field spatial super-resolution using deep efficient spatial-angular separable convolution. IEEE Trans. Image Process. 28(5), 2319–2330 (2019). https://doi.org/10.1109/ TIP.2018.2885236 44. Yoon, Y., Jeon, H., Yoo, D., Lee, J., Kweon, I.S.: Learning a deep convolutional network for light-field image super-resolution. In: 2015 IEEE International Conference on Computer Vision Workshop (ICCVW), pp. 57–65 (2015). https://doi. org/10.1109/ICCVW.2015.17 45. Yu, J.: A light-field journey to virtual reality. IEEE Multimedia 24(2), 104–112 (2017). https://doi.org/10.1109/MMUL.2017.24 46. Zhang, S., Lin, Y., Sheng, H.: Residual networks for light field image superresolution. In: 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 11038–11047 (2019). https://doi.org/10.1109/CVPR. 2019.01130 47. Zhang, S., Sheng, H., Yang, D., Zhang, J., Xiong, Z.: Micro-lens-based matching for scene recovery in lenslet cameras. IEEE Trans. Image Process. 27(3), 1060–1075 (2018). https://doi.org/10.1109/TIP.2017.2763823 48. Zhang, S., Chang, S., Lin, Y.: End-to-end light field spatial super-resolution network using multiple epipolar geometry. IEEE Trans. Image Process. 30, 5956–5968 (2021). https://doi.org/10.1109/TIP.2021.3079805

Responsive CPG-Based Locomotion Control for Quadruped Robots

Yihui Zhang1, Cong Hu2, Binbin Qiu3, and Ning Tan1(B)

1 School of Computer Science and Engineering, Sun Yat-sen University, Guangzhou, China
[email protected]
2 Guangxi Key Laboratory of Automatic Detecting Technology and Instruments, Guilin University of Electronic Technology, Guilin, China
3 School of Intelligent Systems Engineering, Sun Yat-sen University, Shenzhen, China

Abstract. Quadruped robots with flexible movement are gradually replacing traditional mobile robots in many settings. To improve the motion stability and speed of the quadruped robot, this paper presents a responsive gradient CPG (RG-CPG) approach. Specifically, the method introduces a vestibular sensory feedback mechanism into the gradient CPG (central pattern generator) model and uses a differential evolution algorithm to optimize the vestibular sensory feedback parameters. Simulation results show that the movement stability and linear movement velocity of the quadruped robot controlled by RG-CPG are effectively improved, and that the quadruped robot can cope with complex terrains. Prototype experiments demonstrate that RG-CPG works for real quadruped robots.

Keywords: Gradient CPG · Vestibular sensory feedback · Differential evolution algorithm · Quadruped robots

1 Introduction

Quadruped robots are a class of robots that imitate the movement of animals and use legs to move, offering advantages such as flexible movement, active vibration isolation, and low energy consumption. They are increasingly being used to replace traditional mobile robots (such as wheeled and tracked robots) because of their flexibility and adaptability [16,22,23]. It is important for animals to move flexibly in complex environments, and the same is true for robots [3]. Thus, the study of motion control should be closely integrated with robotics and biology [7]. (This work is partially supported by the National Natural Science Foundation of China (62173352, 62006254), the Guangxi Key Laboratory of Automatic Detecting Technology and Instruments (YQ23207), and the Guangdong Basic and Applied Basic Research Foundation (2021A1515012314).) Organisms with fixed forms of motion exhibit widespread periodic movements, such as walking, running, swimming, jumping, flying, and other physical movements, as well as physiological


behaviors, such as chewing, breathing, heartbeat, and gastrointestinal peristalsis, which are characterized by periodic movements. Biologists typically assume that the rhythmic motion of animals is self-produced behavior of low-level nerve centers regulated by the CPG (central pattern generators) located at the thoracic-ventral ganglion of vertebrates or invertebrates [4,21]. CPGs are distributed oscillatory networks made up of intermediate neurons, which generate multiple or single periodic signals with stable phase-interlocking relationships through mutual suppression among neurons to regulate the rhythmic movements of the limbs or relevant parts of the body. The synaptic connections between the neurons in the CPG are adaptable and can generate different output patterns, allowing the animal to show a variety of rhythmic motor behaviors [17]. Using CPG models as motion controllers for various types of robots has therefore become the preferred choice of many researchers [13,15].

Many CPG models have been proposed previously, such as the Van der Pol oscillator model [14], the Matsuoka oscillator model [18], the Hopf oscillator model [5], and the Kuramoto oscillator model [9,10]. Using an integrated CPG model as the core of the control system also has limitations. One is that driving the robot to the desired motion pattern requires too many parameters in a wide search space, and comprehending the connections between the parameters and the output characteristics, such as waveform, phase lag, and frequency, is challenging. The trial-and-error approach is one solution: the parameters are chosen from intuitive principles and then refined with the help of simulations and prototype experiments. Evolutionary computation methods [12,20] are another common solution. In recent years, supervised and unsupervised learning have been increasingly applied to parameter searches in CPG networks. Hu et al. [8] suggested a numerical approach to synthesize the parameters of a CPG network to achieve the required movement modes. This approach transforms the CPG parameters into dynamic systems that are integrated into the CPG network dynamics. The CPG network with the suggested learning rules can encode the frequency, amplitude, and phase relations of sample signals. Farzaneh et al. [6] suggested a supervised learning approach named Fourier-based automated learning central pattern generators (FAL-CPG). For learning rhythmic signals, Fourier analysis is used to analyze the signal, and the CPG parameters are established by comparison with the Fourier series.

So far, the gradient CPG [2] performs well in terms of execution efficiency, but it has only been applied to snake robots and does not introduce any feedback. To apply it to a quadruped robot and maximize its effectiveness, we propose a responsive gradient CPG (RG-CPG) control system. Specifically, the main contributions of this paper are summarized as follows.

• A method based on a gradient CPG model is proposed for the motion control problem of quadruped robots.
• To improve the motion stability and velocity of the quadruped robot, a vestibular sensory feedback mechanism is integrated into the gradient CPG model.


• A DE (differential evolution) algorithm is used to adjust the feedback parameters automatically.

Fig. 1. Framework of RG-CPG control scheme.

2 Control Framework

In this section, we present the framework of RG-CPG, as shown in Fig. 1. First, Sect. 2.1 describes the CPG oscillator model we use, whose convergence behavior is based on a gradient system. The thigh-knee mapping function of the quadruped robot is presented in Sect. 2.2. To make the movements of the quadruped robot more stable and faster, vestibular sensory feedback is introduced in Sect. 2.3. Finally, Sect. 2.4 describes the differential evolution algorithm used to tune the vestibular sensory feedback parameters quickly.

2.1 CPG Model Based on Gradient System

The gradient CPG model we use was proposed by Bing et al. [1]. The oscillator can be described by the following set of differential equations:

\begin{cases}
\dot{\varphi}_i = \omega_i + M_i \cdot \Phi + N_i \cdot \tilde{\Theta} \\
\ddot{r}_i = a_i \left( \frac{a_i}{4}\,(R_i - r_i) - \dot{r}_i \right) \\
\ddot{c}_i = a_i \left( \frac{a_i}{4}\,(C_i - c_i) - \dot{c}_i \right) \\
x_i = c_i + r_i \sin(\varphi_i)
\end{cases}    (1)

Here, the subscript i denotes the i-th oscillator. ϕ_i is the phase of the CPG, and ω_i is the frequency of the CPG signal. M_i and N_i are convergence matrices. Φ denotes the phase vector of the CPG neurons, and Θ̃ is the phase-difference vector of the CPG neurons. The positive coefficient a_i is employed to modify the


Fig. 2. (a) Structure of gradient CPG. (b) Quadruped robot and leg number.

convergence rate of the amplitude. In addition, the stable amplitude of the CPG curve is regulated by the parameter R_i, and C_i decides the bias value of the curve. r_i and c_i represent the i-th oscillator's amplitude and bias, respectively. The variable x_i is the rhythmic CPG output curve. Figure 2(a) illustrates the architecture of the oscillator network of a quadruped robot (Fig. 2(b)). The network consists of four oscillators that control the four thigh joints of the quadruped robot. The output curve of each thigh joint can be mapped to the corresponding knee joint; this mapping is introduced in Sect. 2.2.
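To make the oscillator dynamics concrete, the sketch below integrates Eq. (1) with a simple forward-Euler scheme. It is a minimal illustration under our own assumptions (the function name, integration step, and zero initial conditions are not from the paper), not the authors' implementation.

```python
import numpy as np

def simulate_gradient_cpg(omega, M, N, theta_tilde, R, C, a_gain=2.0,
                          dt=0.001, t_end=10.0):
    """Forward-Euler integration of the gradient-CPG oscillators of Eq. (1).

    omega, R, C : per-oscillator frequency, target amplitude, and bias (length-n arrays).
    M, N        : convergence matrices; theta_tilde holds the desired phase differences.
    Returns the time vector and the rhythmic outputs x_i(t), shape (steps, n).
    """
    omega = np.asarray(omega, dtype=float)
    n = len(omega)
    a = np.full(n, a_gain, dtype=float)
    phi = np.zeros(n)
    r = np.zeros(n); r_dot = np.zeros(n)
    c = np.zeros(n); c_dot = np.zeros(n)

    steps = int(t_end / dt)
    t = np.arange(steps) * dt
    x = np.zeros((steps, n))
    for k in range(steps):
        phi_dot = omega + M @ phi + N @ theta_tilde      # phase dynamics
        r_ddot = a * (a / 4.0 * (R - r) - r_dot)         # amplitude converges to R_i
        c_ddot = a * (a / 4.0 * (C - c) - c_dot)         # bias converges to C_i
        phi += dt * phi_dot
        r_dot += dt * r_ddot; r += dt * r_dot
        c_dot += dt * c_ddot; c += dt * c_dot
        x[k] = c + r * np.sin(phi)                       # output x_i of Eq. (1)
    return t, x
```

For the quadruped network of Fig. 2(a), n = 4 and θ̃ would encode the inter-leg phase offsets (the ϕ_1 ... ϕ_4 values listed in Table 1).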

2.2 CPG Signal Mapping

For a quadruped animal, the thigh and knee joints of the same leg have a fixed phase relationship under normal walking conditions. This ensures that the swing-leg height is always smaller than that of the support leg and that the foot end of the swing leg is always higher than the ground, so the body's movement is not disturbed. According to this fixed thigh-knee phase relationship, the thigh position curve is flipped and translated to obtain the knee position curve, realizing the thigh-knee coupling, and the knee joint function can be written as:

\theta_k(t) =
\begin{cases}
k(t)\,\big(A_h - |\theta_h(t)|\big), & \dot{\theta}_h \ge 0 \ \text{(swing phase)} \\
0, & \dot{\theta}_h < 0 \ \text{(support phase)}
\end{cases},
\qquad
k(t) = k_0 \left( 1 + \frac{|\theta_h(t)|}{A_h} \right),
\qquad
k_0 = \frac{A_k}{A_h}    (2)

where θ_h and θ_k are the thigh and knee CPG control signals, k(t) is a variable gain, A_h and A_k are the thigh and knee angle amplitudes, and k_0 is the knee-to-thigh amplitude ratio. After mapping, the CPG motion curves of the thigh and knee joints are shown in Fig. 3. This function produces a knee control curve with motion in the swing phase and no motion in the support phase, forming a half-wave. Therefore, it is called the half-wave function.
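The sketch below implements this half-wave mapping as a standalone function; it is a minimal reading of Eq. (2), and the variable names are ours.

```python
import numpy as np

def thigh_to_knee(theta_h, theta_h_dot, A_h, A_k):
    """Half-wave thigh-to-knee mapping of Eq. (2).

    theta_h, theta_h_dot : thigh CPG angle and its time derivative.
    A_h, A_k             : thigh and knee angle amplitudes.
    Returns the knee control angle theta_k.
    """
    k0 = A_k / A_h                                  # knee-to-thigh amplitude ratio
    k = k0 * (1.0 + np.abs(theta_h) / A_h)          # variable gain k(t)
    swing = theta_h_dot >= 0.0                      # swing phase: thigh angle increasing
    return np.where(swing, k * (A_h - np.abs(theta_h)), 0.0)
```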


Fig. 3. Thigh-Knee mapping output signals.

2.3 Vestibular Sensory Feedback

In nature, when a quadruped animal moves on an uneven surface, it alters the rotation range of its legs to maintain balance. Take a full-elbow quadruped as an example (as shown in Fig. 2(b)). When it goes upslope (the body tilts back), the front legs flex more than the hind legs (as shown in Fig. 4). When it goes downslope (the body leans forward), the front legs flex less than the hind legs. In this work, the vestibular sensory feedback mechanism is simulated and connected to the CPG network to produce adaptive control signals.

Fig. 4. Body posture adjustment when going upslope.

Combined with the sensory feedback from the interaction between the robot and the environment, the CPG network can produce adaptive walking control signals. We designed a feedback mechanism to regulate the walking posture of the quadruped robot; in this work, a vestibular sensory feedback term is added to the gradient CPG model for the first time. We rewrite (1) as follows:

\begin{cases}
\dot{\varphi}_i = \omega_i + M\{i,:\} \cdot \Phi + N\{i,:\} \cdot \tilde{\Theta} \\
\ddot{r}_i = a_i \left( \frac{a_i}{4}\,(R_i - r_i) - \dot{r}_i \right) \\
\ddot{c}_i = a_i \left( \frac{a_i}{4}\,(C_i - c_i) - \dot{c}_i \right) \\
x_i = c_i + r_i \sin(\varphi_i) + feed_i
\end{cases}    (3)

where feed_i is a feedback term determined by the three-axis attitude angles obtained from IMU (inertial measurement unit) measurements. It can be expanded as:

feed_i = \sigma_p(leg) \cdot k_p\,\theta_{pitch} + \sigma_r(leg) \cdot k_r\,\theta_{roll} + \sigma_y(leg) \cdot k_y\,\theta_{yaw}    (4)


where

\sigma_p(leg) = \begin{cases} 1, & leg = \text{RL or RR} \\ -1, & leg = \text{FL or FR} \end{cases}
\qquad
\sigma_r(leg) = \begin{cases} 1, & leg = \text{FR or RR} \\ -1, & leg = \text{FL or RL} \end{cases}
\qquad
\sigma_y(leg) = \begin{cases} 1, & leg = \text{FL or RL} \\ -1, & leg = \text{FR or RR} \end{cases}

k_p, k_r and k_y denote the feedback gains, and θ_pitch, θ_roll and θ_yaw indicate the pitch, roll, and yaw angles of the robot body.
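As a concrete reading of Eqs. (3)-(4), the sketch below computes the per-leg feedback term from IMU attitude angles. The helper name is ours, and the default gains are only illustrative placeholders; Table 1 lists the gait-specific values actually used.

```python
# Sign patterns of Eq. (4); leg labels follow Fig. 2(b): FL, FR, RL, RR.
SIGMA_P = {"FL": -1, "FR": -1, "RL": 1, "RR": 1}   # pitch
SIGMA_R = {"FL": -1, "FR": 1, "RL": -1, "RR": 1}   # roll
SIGMA_Y = {"FL": 1, "FR": -1, "RL": 1, "RR": -1}   # yaw

def vestibular_feedback(leg, pitch, roll, yaw, k_p=0.6, k_r=0.3, k_y=0.1):
    """Feedback term feed_i of Eqs. (3)-(4) for one leg.

    pitch, roll, yaw : body attitude angles from the IMU (rad).
    k_p, k_r, k_y    : feedback gains (illustrative defaults; see Table 1).
    """
    return (SIGMA_P[leg] * k_p * pitch
            + SIGMA_R[leg] * k_r * roll
            + SIGMA_Y[leg] * k_y * yaw)

# The result is added to the oscillator output of that leg, e.g.
# x_i = c_i + r_i * sin(phi_i) + vestibular_feedback("RR", 0.05, -0.02, 0.0)
```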

2.4 Differential Evolution Algorithm

The parameters of the vestibular sensory feedback are difficult to adjust manually because the relationship between the feedback parameters k_p, k_r, k_y and complex environments is unclear. Therefore, we use a differential evolution algorithm to adjust the parameters. The differential evolution (DE) algorithm was presented by R. Storn and K. Price in 1995 [11]. It retains a population-based global search strategy with real-number encoding and a simple difference-based mutation operation, which reduces the complexity of genetic operations, while its memory capability allows it to dynamically track the current search situation and adjust the search strategy, providing strong global convergence and robustness. The model we use consists of four steps, as follows.

Generate Initial Population. M individuals satisfying the constraints are randomly generated in the n-dimensional space,

y_{ij}(0) = rand_{ij}(0,1)\,\big(y_{ij}^{U} - y_{ij}^{L}\big) + y_{ij}^{L}    (5)

where y_{ij}^{U} and y_{ij}^{L} are the upper and lower limits of the j-th chromosome, and rand_{ij}(0,1) is a random number in [0, 1].

Mutation. Three individuals y_{p1}, y_{p2} and y_{p3} are randomly chosen from the population, with i ≠ p1 ≠ p2 ≠ p3,

s_{ij}(t+1) = y_{p_1 j}(t) + F\,\big(y_{p_2 j}(t) - y_{p_3 j}(t)\big)    (6)

where y_{p_2 j}(t) − y_{p_3 j}(t) is the difference vector; this difference operation is the key to the differential evolution algorithm. F is the scaling factor, and p_1, p_2, p_3 are random integers indicating the indices of individuals in the population.


Crossover. Crossover increases the diversity of the population,

u_{ij}(t+1) = \begin{cases} s_{ij}(t+1), & rand\,l_{ij} \le CR \\ y_{ij}(t), & \text{otherwise} \end{cases}    (7)

where rand l_{ij} is a random number in [0, 1] and CR ∈ [0, 1] is the crossover probability.

Selection. To decide whether y_i(t) will be part of the following generation, the trial vector u_i(t+1) and the target vector y_i(t) are evaluated using the fitness function,

y_i(t+1) = \begin{cases} u_i(t+1), & f\big(u_i(t+1)\big) > f\big(y_i(t)\big) \\ y_i(t), & \text{otherwise} \end{cases}    (8)

Steps 2) to 4) are repeated until the maximum number of evolutionary generations G is reached. We want to improve the motion stability and velocity of the quadruped robot, so we write the fitness function according to S. Gay et al. [19] as:

Fitness = \bar{V} \cdot \Theta_P \cdot \Theta_R    (9)

\Theta_P = \frac{1}{1 + \frac{\beta_P}{\tau} \sum_{t=0}^{\tau} |\theta_P(t)|}    (10)

\Theta_R = \frac{1}{1 + \frac{\beta_R}{\tau} \sum_{t=0}^{\tau} |\theta_R(t)|}    (11)

where V¯ is the average velocity, θP (t) and θR (t) are the pitch and roll angle of the body at time t, τ is the total simulation time. βP and βR are coefficients used to prioritize the minimization of the angles over the maximization of the velocity. In this paper, we usually set βP = βR .
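A compact sketch of this optimization loop is given below. It follows Eqs. (5)-(11), but the population size, number of generations, F, and CR values are our own illustrative choices (the paper does not report them), and in practice the fitness evaluation would wrap a full gait simulation of the robot.

```python
import numpy as np

def differential_evolution(fitness, bounds, M=20, G=50, F=0.5, CR=0.9, seed=0):
    """Minimal DE loop following Eqs. (5)-(8); 'fitness' is maximized.

    bounds : array of shape (n, 2) with the lower/upper limits of each parameter
             (here n = 3 for the feedback gains k_p, k_r, k_y).
    """
    rng = np.random.default_rng(seed)
    bounds = np.asarray(bounds, dtype=float)
    low, high = bounds[:, 0], bounds[:, 1]
    n = len(bounds)
    pop = rng.uniform(low, high, size=(M, n))           # Eq. (5): initial population
    fit = np.array([fitness(ind) for ind in pop])

    for _ in range(G):
        for i in range(M):
            p1, p2, p3 = rng.choice([k for k in range(M) if k != i], 3, replace=False)
            mutant = pop[p1] + F * (pop[p2] - pop[p3])  # Eq. (6): mutation
            mutant = np.clip(mutant, low, high)
            cross = rng.random(n) <= CR                 # Eq. (7): crossover mask
            trial = np.where(cross, mutant, pop[i])
            f_trial = fitness(trial)
            if f_trial > fit[i]:                        # Eq. (8): greedy selection
                pop[i], fit[i] = trial, f_trial
    return pop[np.argmax(fit)], fit.max()

def gait_fitness(mean_velocity, pitch_angles, roll_angles, beta_p=1.0, beta_r=1.0):
    """Fitness of Eqs. (9)-(11): reward average speed, penalize body oscillation."""
    tau = len(pitch_angles)
    theta_p = 1.0 / (1.0 + beta_p / tau * np.sum(np.abs(pitch_angles)))
    theta_r = 1.0 / (1.0 + beta_r / tau * np.sum(np.abs(roll_angles)))
    return mean_velocity * theta_p * theta_r
```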

3 Simulations, Comparisons and Analysis

In this section, simulations are conducted to validate the effectiveness of RG-CPG. We show comparison experiments in Sect. 3.1 for the quadruped robot with RG-CPG and the quadruped robot without RG-CPG; the comparison considers the quadruped robot's motion stability and straight-line travel speed under the two models. Then, we present the up- and downslope experiments in Sect. 3.2. The parameters of RG-CPG are shown in Table 1.

Table 1. Parameter settings of RG-CPG.

Gait   | a   | ω      | R    | C   | ϕ1 | ϕ2 | ϕ3   | ϕ4  | kp  | kr  | ky
Walk   | 2.0 | π/20   | 1.05 | 0.0 | 0  | π  | 3π/2 | π/2 | 0.8 | 0.3 | 0.1
Trot   | 2.0 | π/22.5 | 2.55 | 0.0 | 0  | π  | 0    | π   | 0.6 | 0.3 | 0.1
Gallop | 2.0 | π/22.5 | 1.35 | 0.0 | 0  | 0  | π    | π   | 0.6 | 0.3 | 0.1

3.1 Simulation of the Quadruped Robot with RG-CPG

We let the quadruped robot move on a flat surface in two gaits, walking and galloping, along the positive direction of the X-axis, and we plot the change in body tilt during the motion. As shown in Fig. 5, the body tilt of the quadruped robot without RG-CPG during walking ranges from −0.028 rad to 0.012 rad in pitch and from −0.07 rad to 0.07 rad in roll. With RG-CPG, the body tilt during walking ranges from −0.011 rad to 0.01 rad in pitch and from −0.03 rad to 0.03 rad in roll. The situation is similar for the galloping gait, which indicates that the quadruped robot with RG-CPG has higher stability during motion. Readers may notice that the body roll angle becomes larger during galloping for the quadruped robot with RG-CPG; this is because, in the galloping gait, the two front legs and the two hind legs of the quadruped robot are each in phase, so the optimization of stability is mainly reflected in the convergence of the pitch angle.

Fig. 5. Pitch and roll angles of the body of the quadruped robot in motion. (a) Walking without RG-CPG. (b) Walking with RG-CPG. (c) Galloping without RG-CPG. (d) Galloping with RG-CPG.

Fig. 6. The linear velocity of the quadruped robot along the positive direction of the X-axis for walking and galloping gaits. The solid red line represents the motion velocity of the quadruped robot with RG-CPG, and the blue dotted line represents the motion velocity of the quadruped robot without RG-CPG. (Color figure online)


We also measured the quadruped robot's linear velocity along the positive direction of the X-axis during its motion. From Fig. 6(a) we can observe that, in the walking gait, the velocity of the quadruped robot with RG-CPG along the positive X-axis is higher than that of the quadruped robot without RG-CPG. This difference is even more evident in the galloping gait (as shown in Fig. 6(b)). These results suggest that the quadruped robot with RG-CPG has stronger linear motion ability.

3.2 Slope Experiments

To test the performance of our proposed method on different terrains, we designed a slope experiment. In the experiment, the quadruped robot trots and gallops through a slope device that includes a 10° uphill, a platform, and a 10° downhill. Figure 7 shows snapshots of the quadruped robot passing the slope device. From Fig. 8(a), it can be observed that the robot is on the upslope from 14 to 19.5 s, when the robot's body is tilted back (the pitch angle decreases). The left-foreleg (FL) and right-foreleg (FR) thigh joint angles increase and those legs become flexed, while the left-hindleg (RL) and right-hindleg (RR) thigh joint angles decrease and those legs become extended; the robot body is in a low-front, high-rear posture, as shown in Fig. 4. The robot moves on the platform from 19.5 to 22.3 s. Then, between 22.3 and 25 s, the robot is on the downslope, and the body posture is the opposite of that on the upslope. In the trot gait (Fig. 8(b)), the robot goes up the slope from 83 to 86.2 s, moves on the platform from 86.2 to 87.6 s, and goes down the slope from 87.6 to 89.5 s; the state of the body and joints during this period is similar to that of the galloping gait.

4 Prototype Experiments

4.1 Technical Specifications

The quadruped robot system mainly consists of a control board, servos, brackets, power supplies, sensors, and software; Table 2 summarizes the technical specifications of the quadruped robot.

4.2 Experiments

To confirm the applicability of RG-CPG to real quadruped robots, we use the quadruped robot to execute the walk and trot gaits on flat ground (Fig. 9) and record the change in the thigh joint angles. From Fig. 10(a), we can see that the four thighs swing in the order FR-RL-FL-RR, which is a typical walking gait. As shown in Fig. 10(b), the four thighs are divided into two groups (FR and RL in phase, FL and RR in phase) that swing alternately, which is a typical trot gait. The experiment demonstrates that RG-CPG allows a quadruped robot to execute specific gaits in the real world.


Fig. 7. Snapshots of the robot going upslope (left), walking on the platform (middle), and going downslope (right).

Fig. 8. Experimental data of the slope experiment in two gait states. The top curves show the change of the thigh joint angles; the bottom curves show the body tilt.

Table 2. Overview of the quadruped robot.

Items                                   | Descriptions
Size                                    | 1126 · 467 · 636 mm (standing)
Weight                                  | 55 kg
Battery voltage rating                  | 51.8 V
Battery power rating                    | 932.4 Wh
Hip rotation range                      | −0.75 ∼ 0.75 rad
Thigh rotation range                    | −1.0 ∼ 3.5 rad
Calf rotation range                     | −2.6 ∼ −0.6 rad
Maximum instantaneous torque of joints  | 210 N·m

Fig. 9. Snapshots of the quadruped robot performing two gaits.


Fig. 10. Variation curve of thigh joint angle of the quadruped robot.

5 Conclusion

In this paper, we present a responsive gradient CPG (RG-CPG) control method for quadruped robots. Introducing a vestibular sensory feedback term in the model and optimizing the vestibular sensory feedback parameters using a differential evolution algorithm can improve the quadruped robot’s motion stability and linear motion velocity. In addition, RG-CPG enables the quadruped robot to cope with different terrains, such as slopes. Simulation results validated the effectiveness and practicality of RG-CPG. Prototype experiments demonstrate the feasibility of the method. Future work will focus on using RG-CPG for different types of legged robots and combining the method with reinforcement learning.

References 1. Bing, Z., Cheng, L., Chen, G., Röhrbein, F., Huang, K., Knoll, A.: Towards autonomous locomotion: CPG-based control of smooth 3d slithering gait transition of a snake-like robot. Bioinspir. Biomimet. 12(3), 035001 (2017) 2. Bing, Z., Cheng, L., Huang, K., Zhou, M., Knoll, A.: CPG-based control of smooth transition for body shape and locomotion speed of a snake-like robot. In: 2017 IEEE International Conference on Robotics and Automation (ICRA), pp. 4146– 4153 (2017) 3. Bruzzone, L., Quaglia, G.: Review article: locomotion systems for ground mobile robots in unstructured environments. Mech. Sci. 3(2), 49–62 (2012) 4. Delcomyn, F.: Neural basis of rhythmic behavior in animals. Science 210(4469), 492–498 (1980) 5. Du, S., Wu, Z., Wang, J., Qi, S., Yu, J.: Design and control of a two-motor-actuated tuna-inspired robot system. IEEE Trans. Syst. Man Cybernet. Syst. 51(8), 4670– 4680 (2021) 6. Farzaneh, Y., Akbarzadeh, A., Akbari, A.A.: New automated learning CPG for rhythmic patterns. Intel. Serv. Robot. 5(3), 169–177 (2012) 7. Hogan, N., Sternad, D.: On rhythmic and discrete movements: reflections, definitions and implications for motor control. Exp. Brain Res. 181(1), 13–30 (2007) 8. Hu, Y., Liang, J., Wang, T.: Parameter synthesis of coupled nonlinear oscillators for CPG-based robotic locomotion. IEEE Trans. Indust. Electron. 61(11), 6183–6191 (2014) 9. Mao, Y., Zhang, Z.: Asymptotic frequency synchronization of Kuramoto model by step force. IEEE Trans. Syst. Man Cybernet. Syst. 50(8), 2768–2778 (2020)


10. Pan, J., Zhou, Z., Wang, J., Zhang, P., Yu, J.: Development of a penguin-inspired swimming robot with air lubrication system. IEEE Trans. Indust. Electron. 70(3), 2780–2789 (2023) 11. Storn, R., Price, K.: Differential evolution-a simple and efficient heuristic for global optimization over continuous spaces. J. Global Optim. 11(4), 341–359 (1997) 12. Wang, M., Dong, H., Li, X., Zhang, Y., Yu, J.: Control and optimization of a bionic robotic fish through a combination of CPG model and pso. Neurocomputing 337, 144–152 (2019) 13. Wang, M., Zhang, Y., Yu, J.: An SNN-CPG hybrid locomotion control for biomimetic robotic fish. J. Intell. Robot. Syst. 105(2), 1–25 (2022) 14. Yu, H., Gao, H., Ding, L., Li, M., Deng, Z., Liu, G.: Gait generation with smooth transition using CPG-based locomotion control for hexapod walking robot. IEEE Trans. Indust. Electron. 63(9), 5488–5500 (2016) 15. Yu, J., Wu, Z., Yang, X., Yang, Y., Zhang, P.: Underwater target tracking control of an untethered robotic fish with a camera stabilizer. IEEE Trans. Syst. Man Cybernet. Syst. 51(10), 6523–6534 (2021) 16. Zhao, Y., Chai, X., Gao, F., Qi, C.: Obstacle avoidance and motion planning scheme for a hexapod robot octopus-III. Robot. Auton. Syst. 103, 199–212 (2018) 17. Zheng, H., Zhang, X., Li, T., Guanghong, D.: Robot motion control method based on CPG principle. Chin. High Technol. Lett. 13(7), 64–68 (2003) 18. Zhong, G., Chen, L., Jiao, Z., Li, J., Deng, H.: Locomotion control and gait planning of a novel hexapod robot using biomimetic neurons. IEEE Trans. Control Syst. Technol. 26(2), 624–636 (2018) 19. Gay, S., Santos-Victor, J., Ijspeert, A.: Learning robot gait stability using neural networks as sensory feedback function for central pattern generators. In: 2013 IEEE/RSJ International Conference on Intelligent Robots and Systems, pp. 194– 201 (2013) 20. Yu, J., Wu, Z., Wang, M., Tan, M.: CPG network optimization for a biomimetic robotic fish via PSO. IEEE Trans. Neural Netw. Learn. Syst. 27(9), 1962–1968 (2016) 21. Yu, J., Tan, M., Chen, J., Zhang, J.: A survey on CPG-inspired control models and system implementation. IEEE Trans. Neural Netw. Learn. Syst. 25(3), 441– 456 (2014) 22. Wang, L., et al.: Design and dynamic locomotion control of quadruped robot with perception-less terrain adaptation. In: Cyborg and Bionic Systems (2022) 23. Huang, H.W., et al.: Mobile robotic platform for contactless vital sign monitoring. In: Cyborg and Bionic Systems (2022)

Vessel Behavior Anomaly Detection Using Graph Attention Network

Yuanzhe Zhang1, Qiqiang Jin2, Maohan Liang2, Ruixin Ma3, and Ryan Wen Liu2(B)

1 School of Computer and Artificial Intelligence, Wuhan University of Technology, Wuhan 430070, China
2 School of Navigation, Wuhan University of Technology, Wuhan 430063, China
[email protected]
3 Tianjin Research Institute for Water Transport Engineering, M.O.T., Tianjin 300456, China

Abstract. Vessel behavior anomaly detection is of great significance for ensuring navigation safety, combating maritime crimes, and maritime management. Unfortunately, most current research ignores the temporal dependencies and correlations between ship features. We propose a novel vessel behavior anomaly detection using graph attention network (i.e., VBAD-GAT) framework, which characterizes these complicated relationships and dependencies through a graph attention module that consists of a time graph attention module and a feature graph attention module. We also adopt a graph structure learning process to obtain a correct feature graph structure. Moreover, we propose a joint detection strategy combining reconstruction and prediction modules to capture the local ship features and the long-term relationships between ship features. We demonstrate the effectiveness of the graph attention module and the joint detection strategy through an ablation study. In addition, comparative experiments against three baselines, including quantitative analysis and visualization, show that VBAD-GAT outperforms all other baselines.

Keywords: Anomaly detection · Graph attention network · AIS data mining

1 Introduction

As the number of vessels worldwide increases, law enforcement agencies face new challenges in ensuring navigation safety, combating maritime crimes, and maritime management. The abnormal ship behavior often hides rich semantic information. For example, an irregular change in a ship’s course and a sudden decrease in speed may mean that the ship is out of control due to a malfunction; a ship’s deviation from the original route may indicate that the ship is engaged in illegal fishing or smuggling. Therefore, vessel behavior anomaly detection can c The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2024  B. Luo et al. (Eds.): ICONIP 2023, LNCS 14451, pp. 291–304, 2024. https://doi.org/10.1007/978-981-99-8073-4_23

292

Y. Zhang et al.

assist law enforcement agencies in promptly discovering lost, malfunctioning, and out-of-control ships and vessels engaged in illegal activities such as smuggling, overloading, and illegal fishing. The Automatic Identification System (AIS) was developed in 1990 and originally aimed to avoid collisions of ships to ensure the safety of ship navigation [22]. Compared with radars, AIS signals have wider coverage and more stable signal quality. An AIS message usually consists of static ship data (Maritime Mobile Service Identity [MMSI], ship name, ship type, ship size, etc.) and ship motion data (position, speed, course, etc.). With the popularization of AIS equipment on various types of ships, AIS data mining [13–15], especially ship behavior anomaly detection using AIS data, has attracted widespread attention in the academic circle. We select the research object of this work as AIS data based on the above reasons. Motivated by the definition of outliers by Hawkins et al. [6], we define an abnormal ship behavior as follows: a ship feature that differs significantly from that of other ships. The ship features studied in this work contain six types: latitude, longitude, course over ground (COG), speed over ground (SOG), turning rate, and acceleration. The latitude, longitude, COG, and SOG can be directly obtained from AIS messages. The turning rate tr = (dir − dirpre )/(t − tpre ) and acceleration acc = (spd−spdpre )/(t−tpre ) characterize the change rates of COG and SOG respectively. Here, tr, dir, t, acc, and spd denote the current turning rate, COG, timestamp, acceleration, and SOG; ∗pre represents the previous value of feature ∗. There are complicated temporal dependencies and correlations between ship features. For instance, a large turning rate of a ship indicates that this ship is changing its course, and sailors generally reduce the ship’s speed to control it better; a ship’s position is usually close to where the ship was at the last few moments. In addition, the features of a ship consist of the local features generated at the latest moments and the long-term features formed since the ship set sail. The local features are more significantly associated with current ship behavior than long-term features. However, the influence of early ship features also cannot be ignored. Unfortunately, current studies do not take both factors into account. To solve the above problems, we propose a novel Vessel Behavior Anomaly Detection using Graph Attention Network (i.e., VBAD-GAT) framework, combining graph attention and joint detection modules. The graph attention module consists of two sub-modules: a temporal graph attention module and a feature graph attention module. The temporal graph attention module captures the temporal dependencies of ship features; the feature graph attention module aims to characterize the correlations between ship features. In particular, the feature graph attention module includes a graph structure learning process to obtain a correct feature graph structure. The joint detection module consists of a reconstruction module and a prediction module. The reconstruction module is used to capture the local ship features; the prediction module aims to characterize long-term features. In summary, the contributions of this paper are as follows:

Vessel Behavior Anomaly Detection Using Graph Attention Network

293

• In our model, we introduce a graph attention module consisting of a time graph attention module and a feature graph attention module to capture temporal dependencies and correlations between ship features. • We propose a joint detection strategy employing reconstruction and prediction modules to capture the local and long-term ship features. • Through the ablation study on two AIS datasets, we demonstrate the effectiveness of the graph attention module and joint detection module of VBADGAT. In addition, the comparative experiments show that VBAD-GAT outperforms all other baselines.

2 2.1

Related Work Deep Learning-Based Anomaly Detection

Deep learning-based anomaly detection, a popular research direction, comprises reconstruction- and forecast-based models. The two models will be reviewed in this subsection separately. Reconstruction-based models detect anomalies based on an assumption that normal instances can be better reconstructed than anomalies. The autoencoder networks (AE) [7], one of the most common reconstruction-based models, play an essential role in anomaly detection [4,9,24]. An AE usually consists of an encoder and a decoder. The encoder extracts features from the data, while the decoder restores the features to the original input. To solve the problem of low interpretability of abnormal time series, Kieu et al. [10] proposed an autoencoder framework that divides time series into clean time series without abnormal data and abnormal time series. In addition, as an application of generative adversarial networks (GAN) [5], GAN-based anomaly detection received widespread attention [1,20,21,25]. The GAN can learn the latent space of the generative network and capture the distribution of normal data. It computes an anomaly score by the difference between generated and actual data, and anomalies usually have greater differences. A model that deals with the complicated distributions of time series is designed by Zhao et al. [27]. They employed and trained the GAN to learn the pattern of task series. Forecast-based models work on an assumption that anomalies can generate predicted data with a greater difference between actual data than normal data. This approach works well for sequential data such as frame streams. Liu et al. [16] calculated the differences between predicted and actual frames to detect an abnormal frame. Ye et al. [23] computed anomaly scores by training a convolutional network to generate future frames given an input sequence. However, unlike VBAD-GAT, the above methods either employ reconstruction-based or prediction-based models; therefore, the advantages of both cannot be combined. Besides, the graph attention network is introduced in our framework to capture the temporal dependencies and correlations between ship features.

294

2.2

Y. Zhang et al.

Vessel Behavior Anomaly Detection

The existing studies of vessel behavior anomaly detection consist of three categories: statistical methods, clustering methods, and neural network methods. In this subsection, the three methods will be reviewed separately. Statistical methods aim to develop models based on probabilistic statistics that reveal vessel behavior modes served as the foundations for identifying abnormal behavior. d’Afflisio et al. [3] developed an approach employing the Generalized Likelihood Ratio Test (GLRT) and the Model Order Selection (MOS) in order to judge whether a vessel is reporting false position information by AIS messages. Laxhammar et al. [12] and Ristic et al. [19] employed techniques of kernel density estimation and the Gaussian mixture model (GMM), respectively, to perform the vessel behavior anomaly detection. Most statistical methods assume AIS data follow a specific distribution, which is often incorrect. As one of the traditional techniques of machine learning, clustering technology has been widely adopted in vessel behavior anomaly detection. Zhen et al. [28] designed a novel distance measure between tracks. They incorporated hierarchical clustering and Naive Bayes classifier algorithms to identify abnormal vessel behaviors. To reconstruct the information in original vessel trajectories affected by noises from AIS data, Liu et al. [15] utilized the density-based clustering method DBSCAN and a bidirectional long short-term memory (BLSTM)-based technique. Unfortunately, clustering-based methods have some limitations: nonincremental clustering techniques will spend too much time processing AIS data; some clustering algorithms lack robustness to noise abound in AIS data. The deep learning-based technique is widely exploited in vessel behavior anomaly detection due to neural networks’ powerful fitting capabilities. Nguyen et al. [17] proposed a novel embedding of vessel data. They adopted a variational recurrent neural network to identify abnormal vessel trajectories. He et al. [8] proposed a transfer learning-based trajectory anomaly detection strategy combing a variational self-encoder with a graph variational autoencoder. Zhao et al. [26] developed a long short-term memory network (LSTM) to forecast vessel trajectories using vessel behavior modes extracted by DBSCAN. If an actual position significantly differs from the projected positions, the vessel trajectories are regarded as abnormal trajectories.

3

Proposed Method

This section presents our problem definition and explains the proposed model VBAD-GAT. Figure 1 gives an overview of our model. 3.1

Problem Definition

A vessel trajectory is denoted by a matrix X ∈ R6×N = {x1 , x2 , ..., xN }, where N is the number of timestamped points on X, and xi ∈ R6 is the i-th timestamped point that contains six features: latitude, longitude, COG, SOG, turning

Vessel Behavior Anomaly Detection Using Graph Attention Network

295

Fig. 1. An overview of VBAD-GAT.

rate and acceleration of the vessel. For a point xt ∈ X, a sequence of points Xt ∈ R6×n = {xt−n+1 , xt−n+2 , ..., xt } can be generated through a sliding window with length n, where xi ∈ X(t − n + 1 ≤ i ≤ t). A point xt is an anomaly for a vessel trajectory dataset X if and only if there exist some points in Xt whose features deviate significantly from those of other points in X . Given a vessel trajectory dataset X and Xt = {xt−n+1 , xt−n+2 , ..., xt }, the goal of vessel behavior anomaly detection is to determine whether xt is an anomaly by outputting yˆt ∈ {0, 1}, where yˆt = 1 means xt is an anomaly. 3.2

Graph Attention Module

Time Graph Attention Module. The time graph attention module utilizes the graph attention networks (GAT) to capture temporal dependencies of ship features. This graph contains n nodes, and each node is a representation of a timestamp; each edge represents the relationship between corresponding representations of two timestamps. Since we lack prior knowledge of temporal dependencies of vessel behaviors, the time graph is regarded as a complete graph. That is, any node is adjacent to other n-1 nodes: (t)

Ni (t)

= {j|1 ≤ j ≤ n} ,

(1)

where Ni is a set containing node i’s neighbors and itself in the time graph. We denote the input of the time graph attention module as H (t) ∈ R6×n =

296

Y. Zhang et al.

  (t) (t) (t) (t) h1 , h2 , · · · , hn , where hi ∈ R6 = xt−n+i (1 ≤ i ≤ n) represents the (t)

embedding of node i in the time graph attention module. We call eij the time attention coefficient, which is used to measure the importance of node j to node i:    (t) (t) (t) eij = LeakyReLU a(t)T W (t) hi ||W (t) hj , (2) where operators || and ∗T represent the concatenation and transposition, respectively; trainable parameters a(t) ∈ R12 and W (t) ∈ R6×6 denote a shared weight vector and a shared weight matrix, respectively; LeakyReLU represents a linear (t) activation function. αij denotes the normalized attention score of node j to i and is calculated based on the softmax function: (t)

exp(eij )

(t)

αij = 

(t)

k∈Ni

exp(eik )

.

(3) 

(t) The output of the time graph ∈  attention module can be expressed as H  (t) (t) (t) (t) 6×n 6 = h1 , h2 , · · · , hn . Here, hi ∈ R denotes the output of node i R in the time graph attention module: ⎞ ⎛ ⎜ (t) (t) (t) ⎟ (t) hi = σ ⎝ αij W hj ⎠ , (4) (t)

j∈Ni

where σ is a non-linear activation function. Feature Graph Attention Module. The feature graph attention module captures the correlations between vessel features. This module is a GAT that contains six nodes. Each node in the graph represents a vessel feature. A node’s (f ) T embedding vector is represented as hi ∈ Rn = (xi,1 , xi,2 , · · · , xi,n ) , where xi,j denotes the element at row i and column j in Xt ; each edge is used to characterize the relationship between corresponding two features. Unlike temporal dependencies, there is actually no correlation between some ship features. Therefore, we need to learn a feature graph structure and apply (f ) it to the graph attention module, that is, to determine the neighbors Ni (1 ≤ (f ) i ≤ 6) of node i in the feature graph. We define Ni as nodes with a similarity greater than β to node i: (f )

Ni

= {j|simij > β} ,

(5)

where simij is the similarity between representations of node i and j. We calculate simij by a Gaussian kernel function:  (f ) (f ) (f ) (f ) (f ) (f ) φ(hi , hj ) = (hi − hj )T M (hi − hj ), (6)

Vessel Behavior Anomaly Detection Using Graph Attention Network

 ⎞ (f ) (f ) φ hi , hj ⎠, simij = exp ⎝− 2κ2

297



(7)

  (f ) (f ) where κ denotes the size of the Gaussian kernel function; φ hi , hj repre(f )

(f )

sents the generalized mahalanobis distance between hi and hj ; M ∈ Rn×n = Ws WsT and Ws ∈ Rn×n is a weight matrix. Once the feature graph structure is learned, we can follow a similar method to time graph attention module to  (f ) (f ) (f ) of the feature graph obtain the output H (f ) ∈ Rn×6 = h1 , h2 , · · · , h6 (f )

attention module, where hi ∈ Rn is the ouput of node i. Based on the Eq. (f ) (4), a masked attention mechanism is adopted when computing the hi , that is, only the contribution of node i’s neighbors Ni to node i is considered. t ∈ R18n of the graph attention layer is generated by Finally, the output x  T (t) concatenating Xt , H , H (f ) T and vectorization:     t = vec XtT ||H (t) ||H (f ) T . (8) x Here, vec represents vectorization and vec(A) = (a11 a21 · · · a1m a12 · · · amn )T , where aij denotes the element in the i-th row and j-th column of A ∈ Rm×n . 3.3

Joint Detection Module

Reconstruction Module. The reconstruction module utilizes a variational autoencoder network (VAE) [11] to characterize the local features of ship behavt as the input to the VAE. The VAE consists of two components: iors. We take x an interference network and a generative network. The optimization objective of t ; θ, φ), that is: both networks is to maximize evidence lower bound ELBO(q, x   p ( xt |zt ; θ) p (zt ; θ) t ; θ, φ) = max Ez t ∼q(z t ;φ) log max ELBO (q, x (9) θ,φ θ,φ q (zt ; φ) xt |zt ; θ)] − KL (q(zt | xt ; φ), p(zt ; θ)) . = max Ez t ∼q(z t | x t ;φ) [logp( θ,φ

(10)

Here, φ and θ respectively represent the parameters of the interference network and the generative network, and these two networks are three-layer fully connected networks in our model; p (zt ; θ) is a prior distribution, and it is assumed to follow a standard Gaussian distribution N (0, I); KL represents the KL divergence; E denotes the expectation. Prediction Module. The prediction module utilizes a gated recurrent unit network (GRU) [2] to characterize long-term correlations of ship behavior. The GRU is a variant of recurrent neural network, which can effectively solve the longterm dependency problem of simple recurrent neural networks. In this module, t−1 ∈ R18n of graph attention network at the previous moment is the output x

298

Y. Zhang et al.

considered as the current input of the GRU. The label yt of the GRU is set to the latent variable zt generated by the VAE. The GRU uses the reset gate  t depends rt ∈ [0, 1]18n to control the degree to which current candidate state h  on previous candidate state ht−1 : t + Ur ht−1 + br ), rt = σ(Wr x

(11)

 t = tanh(Wh x t + Uh (rt  ht−1 ) + bh ), h

(12)

where tanh is a non-linear activation function. Moreover, the GRU uses an update gate st ∈ [0, 1]18n to control how much information the current state needs to retain from the historical state ht−1 and how much new information  t: needs to be accepted from the candidate state h t + Us ht−1 + bs ), st = σ(Ws x

(13)

 t. ht = st  ht−1 + (1 − st )  h

(14)

18n×18n

In Eqs. 11, 12 and 13, W∗ , U∗ and b∗ ∈ R are learnable parameters, where ∗ ∈ {r, h, s}. Finally, we obtain yˆt ∈ Rd from the hidden state ht through a fully connected layer: yˆt = V ht + c, (15) where V ∈ Rd×18n , c ∈ Rd are the parameters of output layers. The prediction module defines the optimization objective using the quadratic loss function: 1 min ||yˆt − yt ||2 . (16) 2 The joint detection module optimizes parameters by jointly training the reconstruction module and prediction module. The loss function Loss of the joint detection module is as follows: t ; θ, φ) , Lossr = −ELBO (q, x 1 ||yˆt − yt ||2 , 2 Loss = Lossr + λLossp , Lossp =

(17) (18) (19)

where Lossr represents the loss function of the reconstruction module, Lossp denotes the loss function of the prediction module, and λ is used to adjust the weights of two loss functions. For a given test point xt and a trained model, the anomaly score of xt is calculated by the following formula: 2 ˆt − µG ||2 , score(xt ) = ||yˆt − y t || + η||x

(20)

where η is used to measure the contributions of the reconstruction and predicˆt represents samplings from the generative tion modules to the abnormal score, x network in the reconstruction module. Finally, we can obtain the anomaly detection result of xt by judging whether score(xt ) is greater than a score threshold γ, that is: yˆt = I(score(xt ) > γ), where I(·) outputs 1 if and only if equation · holds.

Vessel Behavior Anomaly Detection Using Graph Attention Network

4 4.1

299

Experiments Experimental Setup

Data Description. We conducted all experiments on two AIS datasets: the Yangtze River Delta dataset (YZRD) and the Florida Strait dataset (FS). Our experiments used ship trajectories instead of trajectory points for performance evaluation. This is because trajectory points are too numerous to label and analyze. We obtained 2634 and 3709 vessel trajectories through trajectory extraction from YZRD and FS, respectively. Three volunteers manually labeled all ship trajectories by MMSI. Implementation Details. All experiments are conducted using PyTorch. Our model is trained and tested on a single NVIDIA RTX Titan/24GB. The dimension d of the latent variable zt of the reconstruction module is set to 6, and the number n of columns of input Xt equals 8. An Adam optimization strategy with initial learning rate β1 = 0.9 and β2 = 0.99 is adopted for training. The batch size for the training phase is set to 64. In addition, in terms of graph structure learning, the node similarity threshold β is set to 0.25, and the size κ of the Gaussian kernel function is set to 1. A vessel trajectory is considered abnormal if its abnormal parts formed by abnormal points exceed 10% in length or 5% in time. Table 1. Performance comparison for vessel behavior anomaly detection. Datasets

YZRD

FS

Evaluation Metrics Precision Recall F1 Score Precision Recall F1 Score

4.2

GAT-NT GAT-NF GAT-NL GAT-NR GAT-NP

0.813 0.763 0.904 0.832 0.838

0.798 0.774 0.893 0.824 0.840

0.805 0.769 0.898 0.827 0.839

0.824 0.795 0.915 0.845 0.816

0.801 0.812 0.898 0.837 0.811

0.812 0.803 0.906 0.841 0.813

GeoTracknet [17] MADVCN [28] TREAD [18] VBAD-GAT

0.757 0.724 0.746 0.973

0.765 0.721 0.742 0.978

0.761 0.722 0.739 0.975

0.769 0.756 0.783 0.965

0.779 0.750 0.761 0.966

0.774 0.753 0.772 0.965

Ablation Study

Graph Attention Module. This part utilizes two degenerated versions, GATNT and GAT-NF, of VBAD-GAT to validate the effectiveness of the graph

300

Y. Zhang et al.

attention module in VBAD-GAT. The GAT-NT and GAT-NF are respectively obtained by removing the time graph attention module and the feature graph attention module from VBAD-GAT. In addition, we conducted experiments that treat the feature graph structure as a complete graph to validate the effectiveness of the graph structure learning, and its corresponding model is called GAT-NL. The precision, recall, and F1 scores obtained by four models for two datasets are shown in Table 1. It can be seen from Table 1 that the GAT-NT that does not consider the temporal dependencies of ship features is lower than VBAD-GAT in all three indicators and two datasets. For instance, for YZRD, the GAT-NT is 16.4% and 14.6% lower in precision than VBAD-GAT and 18.4% and 17.1% lower in recall than VBAD-GAT. This is because, compared to GAT-NT, VBAD-GAT can capture the temporal dependencies between ship features in the sliding window through the time graph attention module. Also, VBAD-GAT outperforms GATNF that does not consider relationships between ship features significantly on both datasets. For instance, the recall of VBAD-GAT is 21.6% and 19.0% higher than that of GAT-NF on YZRD and FS, respectively. This is in line with our expectations. Unlike VBAD-GAT, GAT-NF, which removes the feature graph attention module, cannot capture the correlations between ship features. Finally, it can be seen from Table 1 that compared with GAT-NF, the performance of GAT-NL has a particular improvement on both datasets but still perform worse than VBAD-GAT. For example, the F1 Score obtained by GAT-NL is 7.9% and 6.6% lower than that of VBAD-GAT on YZRD and FS, respectively. Unlike time, there is no correlation between certain ship features. On one hand, GATNL simply regards the ship feature graph structure as a complete graph. This wrong graph structure incorrectly characterizes the correlations between ship features, thus degrading the detection performance. On the other hand, VBADGAT learns the feature graph structure by calculating the similarity between node representations through the Gaussian kernel function and the Mahalanobis distance. Therefore, VBAD-GAT can achieve better performance than GAT-NL. Joint Detection Strategy. This part aims to validate the effectiveness of the joint detection strategy in VBAD-GAT by the degenerated versions GATNR and GAT-NP of VBAD-GAT. The GAT-NR and GAT-NP are obtained by removing the reconstruction and prediction modules from the VBAD-GAT. The t of the graph attention label yt of the GRU units in GAT-NR is the output x module. The precision, recall, and F1 score of GAT-NR, GAT-NP, and VBADGAT on two datasets are shown in Table 1. It can be seen from the experimental results that three indicators of GAT-NR are lower than those of VBAD-GAT. For example, the F1 score of GAT-NR is 15.2% lower than that of VBAD-GAT on YZRD. This is because GAT-NR does not utilize VAE to capture the local features of ship behavior. On one hand, although the GRU unit uses a gating mechanism to memorize the long-term historical information of ships, this will weaken the ability of GAT-NR to capture the local features of ships. On the other hand, VBAD-GAT, which utilizes both

Vessel Behavior Anomaly Detection Using Graph Attention Network

301

the prediction and reconstruction modules, can accurately characterize the local features of ships while capturing the long-term trend of ship features. As a result, the performance of VBAD-GAT is significantly improved compared with GATNR. Moreover, we can find that VBAD-GAT performs better than GAT-NP. Especially in terms of recall, VBAD-GAT exceeds GAT-NP by 19.1% in FS. A voyage of a ship usually lasts several hours to several days. The historical information on ship behavior is wealthy. However, GAT-NP, which only uses the prediction module to capture the local ship features, does not characterize the long-term trend of ship features. Therefore, it has a much lower recall than VBAD-GAT. In addition, it can be seen that GAT-NP performs worse on FS than on YZRD. This is because the ships in FS have a longer voyage time, so the long-term trend of ship features is more diverse and complex, and the influence of lacking the prediction module is even more significant. 4.3

Comparative Experiments

We demonstrate in this subsection the effectiveness of the VBAD-GAT by three methods: GeoTracknet [17], MADVCN [28], and TREAD [18]. The GeoTracknet uses a variational recurrent neural network to model ship trajectories and introduces a contrario algorithm to solve the problem caused by the geographical correlation of ship behavior; the MADVCN first clusters vessel trajectories and then employs a Naive Bayes Classifier to perform ship behavior anomaly detection; the TREAD detects anomalies by using DBSCAN to extract routes. Quantitative Comparison. This part demonstrates the effectiveness of VBAD-GAT by the quantitative analysis with three baselines. The precision, recall, and F1 score of GeoTracknet, MADVCN, and TREAD on two datasets are shown in Table 1. It can be seen from Table 1 that VBAD-GAT is superior to three baselines in three indicators and two datasets. This is in line with our expectations. The current features of a ship are more dependent on the features in the recent period than the early historical features. The GeoTracknet, which uses variational recurrent neural networks to model ship behaviors, does not focus on capturing the local features of ship behavior. Therefore, compared with VBAD-GAT, the detection accuracy of GeoTracknet is lower. For instance, the precision of VBAD-GAT on the YZRD exceeds that of GeoTracknet by 28.5%. The MADVCN clusters ship trajectories by employing a novel trajectory similarity measure method. Although MADVCN considers the relationships between the features of different ships, it ignores the correlations between the features of a single ship. On the contrary, with the feature graph attention module, VBADGAT can effectively characterize the relationships between features of a single ship. In addition, MADVCN performs worse in YZRD than in FS. This is because the ship behavior in YZRD is more diverse and changes more frequently, so the influence of the lack of the feature graph attention module is more pronounced. Similar to MADVCN, the TREAD using the DBSCAN does not consider the dependencies between ship features of a single ship, so its performance on three indicators and two datasets is inferior to that of VBAD-GAT.


Fig. 2. Visualization of vessel behavior detection results in FS and the turning rate and speed of an abnormal vessel identified by VBAD-GAT: (a) Detection result of GeoTracknet, (b) Detection result of MADVCN, (c) Detection result of TREAD, (d) Detection result of VBAD-GAT, (e) Turning rate and speed of an abnormal vessel identified by VBAD-GAT.

Visual Comparison. This part demonstrates the effectiveness of VBAD-GAT by visualizing the detection results of VBAD-GAT and the other baselines on FS. The visualization results of GeoTracknet, MADVCN, TREAD, and VBAD-GAT are shown in Figs. 2a, 2b, 2c, and 2d, respectively. It can be seen from the visualization results that the detection results of VBAD-GAT are more accurate than those of all other baselines. For instance, a ship corresponding to the abnormal vessel trajectory marked in purple in Fig. 2d exhibits the abnormal behavior of route switching, i.e., switching from the original route to a new route. On the contrary, from Fig. 2b, we find that MADVCN fails to recognize this vessel trajectory as an anomaly. In addition, we show the turning rate and speed of a vessel corresponding to an abnormal trajectory marked in red identified by VBAD-GAT in Fig. 2e. We can see that even though this ship's turning rate increases, the ship's speed does not decrease, which is clearly abnormal behavior.

5 Conclusion and Future Work

This paper proposes a model VBAD-GAT for vessel behavior anomaly detection. The graph attention module is introduced to capture the temporal dependencies of vessel features and relationships between vessel features. It consists of two essential components: the time graph attention module and the feature graph attention module. We also employ the joint detection strategy combining the prediction and reconstruction modules. Finally, the ablation study demonstrates the effectiveness of the graph attention module and joint detection strategy, and


the comparison with the baselines shows that VBAD-GAT outperforms all of them. In the future, we will explore how to use the self-attention mechanism to capture ship characteristics and strive to obtain a model with better performance.

Acknowledgements. This work was supported by the National Natural Science Foundation of China (No. 52271365).

References 1. Akcay, S., Atapour-Abarghouei, A., Breckon, T.P.: Ganomaly: semi-supervised anomaly detection via adversarial training. In: Jawahar, C.V., Li, H., Mori, G., Schindler, K. (eds.) Computer Vision - ACCV 2018, pp. 622–637. Springer International Publishing, Cham (2019) 2. Chung, J., Gulcehre, C., Cho, K., Bengio, Y.: Empirical evaluation of gated recurrent neural networks on sequence modeling (2014) 3. d’Afflisio, E., Braca, P., Willett, P.: Malicious AIS spoofing and abnormal stealth deviations: a comprehensive statistical framework for maritime anomaly detection. IEEE Trans. Aerosp. Electron. Syst. 57(4), 2093–2108 (2021) 4. Ding, K., Li, J., Bhanushali, R., Liu, H.: Deep anomaly detection on attributed networks, pp. 594–602. https://doi.org/10.1137/1.9781611975673.67 5. Goodfellow, I.J., et al.: Generative adversarial networks (2014) 6. Hawkins, D.: Identification of Outliers. Chapman and Hall (1980) 7. Hinton, G.E., Zemel, R.: Autoencoders, minimum description length and helmholtz free energy. In: Cowan, J., Tesauro, G., Alspector, J. (eds.) Advances in Neural Information Processing Systems. vol. 6. Morgan-Kaufmann (1993) 8. Hu, J., et al.: Intelligent anomaly detection of trajectories for IoT empowered maritime transportation systems. IEEE Trans. Intell. Transp. Syst. 24(2), 2382– 2391 (2023) 9. Kieu, T., Yang, B., Guo, C., Jensen, C.S.: Outlier detection for time series with recurrent autoencoder ensembles. In: Proceedings of the 28th International Joint Conference on Artificial Intelligence, pp. 2725–2732. IJCAI’19, AAAI Press, Macao, China (2019) 10. Kieu, T., et al.: Robust and explainable autoencoders for unsupervised time series outlier detection. In: 2022 IEEE 38th International Conference on Data Engineering (ICDE), pp. 3038–3050 (2022). https://doi.org/10.1109/ICDE53745.2022. 00273 11. Kingma, D.P., Welling, M.: Auto-encoding variational bayes (2022) 12. Laxhammar, R.: Anomaly detection for sea surveillance. In: 2008 11th International Conference on Information Fusion, pp. 1–8 (2008) 13. Liu, R.W., Hu, K., Liang, M., Li, Y., Liu, X., Yang, D.: QSD-LSTM: vessel trajectory prediction using long short-term memory with quaternion ship domain. Appl. Ocean Res. 136, 103592 (2023) 10.1016/j.apor.2023.103592, https://www. sciencedirect.com/science/article/pii/S0141118723001335 14. Liu, R.W., et al.: STMGCN: mobile edge computing-empowered vessel trajectory prediction using spatio-temporal multigraph convolutional network. IEEE Trans. Industr. Inf. 18(11), 7977–7987 (2022). https://doi.org/10.1109/TII.2022.3165886 15. Liu, R.W., Nie, J., Garg, S., Xiong, Z., Zhang, Y., Hossain, M.S.: Data-driven trajectory quality improvement for promoting intelligent vessel traffic services in 6g-enabled maritime iot systems. IEEE Internet Things J. 8(7), 5374–5385 (2021)


16. Liu, X., van de Weijer, J., Bagdanov, A.D.: Leveraging unlabeled data for crowd counting by learning to rank. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (June 2018) 17. Nguyen, D., Vadaine, R., Hajduch, G., Garello, R., Fablet, R.: Geotracknet-a maritime anomaly detector using probabilistic neural network representation of AIS tracks and a contrario detection. IEEE Trans. Intell. Transp. Syst. 23(6), 5655– 5667 (2022) 18. Pallotta, G., Vespe, M., Bryan, K.: Vessel pattern knowledge discovery from AIS data: a framework for anomaly detection and route prediction. Entropy 15(6), 2218–2245 (2013) 19. Ristic, B., La Scala, B., Morelande, M., Gordon, N.: Statistical analysis of motion patterns in ais data: Anomaly detection and motion prediction. In: 2008 11th International Conference on Information Fusion, pp. 1–7 (2008) 20. Schlegl, T., Seeböck, P., Waldstein, S.M., Langs, G., Schmidt-Erfurth, U.: f-anogan: fast unsupervised anomaly detection with generative adversarial networks. Med. Image Anal. 54, 30–44 (2019) 21. She, R., Fan, P.: From MIM-based GAN to anomaly detection: Event probability influence on generative adversarial networks. IEEE Internet Things J. 9(19), 18589–18606 (2022). https://doi.org/10.1109/jiot.2022.3161630 22. Yang, D., Wu, L., Wang, S., Jia, H., Li, K.X.: How big data enriches maritime research - a critical review of automatic identification system (AIS) data applications. Transp. Rev. 39(6), 755–773 (2019) 23. Ye, M., Peng, X., Gan, W., Wu, W., Qiao, Y.: Anopcn: video anomaly detection via deep predictive coding network. In: Proceedings of the 27th ACM International Conference on Multimedia, pp. 1805–1813. MM ’19, Association for Computing Machinery, New York, NY, USA (2019). https://doi.org/10.1145/3343031.3350899 24. Zhang, C., et al.: A deep neural network for unsupervised anomaly detection and diagnosis in multivariate time series data. Proc. AAAI Conf. Artif. Intell. 33(01), 1409–1416 (2019). https://doi.org/10.1609/aaai.v33i01.33011409 25. Zhang, Z., et al.: STAD-GAN: unsupervised anomaly detection on multivariate time series with self-training generative adversarial networks. ACM Trans. Knowl. Disc. Data 17(5), 1–18 (2023). https://doi.org/10.1145/3572780 26. Zhao, L., Shi, G.: Maritime anomaly detection using density-based clustering and recurrent neural network. J. Navig. 72(4), 894–916 (2019) 27. Zhao, Y., et al.: Outlier detection for streaming task assignment in crowdsourcing. In: Proceedings of the ACM Web Conference 2022, pp. 1933–1943. WWW ’22, Association for Computing Machinery, New York, NY, USA (2022). https://doi. org/10.1145/3485447.3512067 28. Zhen, R., Jin, Y., Hu, Q., Shao, Z., Nikitakos, N.: Maritime anomaly detection within coastal waters based on vessel trajectory clustering and naïve bayes classifier. J. Navig. 70(3), 648–670 (2017)

TASFormer: Task-Aware Image Segmentation Transformer Dmitry Yudin1,2(B) , Aleksandr Khorin2 , Tatiana Zemskova2 , and Darya Ovchinnikova2 1

AIRI (Artificial Intelligence Research Institute), Moscow, Russia [email protected] 2 Moscow Institute of Physics and Technology, Moscow, Russia {khorin.an,zemskova.ts,ovchinnikova.da}@phystech.edu

Abstract. In image segmentation tasks for real-world applications, the number of semantic categories can be very large, and the number of objects in them can vary greatly. In this case, the multi-channel representation of the output mask for the segmentation model is inefficient. In this paper we explore approaches to overcome such a problem by using a single-channel output mask and additional input information about the desired class for segmentation. We call this information task embedding and we learn it in the process of the neural network model training. In our case, the number of tasks is equal to the number of segmentation categories. This approach allows us to build universal models that can be conveniently extended to an arbitrary number of categories without changing the architecture of the neural network. To investigate this idea we developed a transformer neural network segmentation model named TASFormer. We demonstrated that the highest quality results for task-aware segmentation are obtained using adapter technology as part of the model. To evaluate the quality of segmentation, we introduce a binary intersection over union (bIoU) metric, which is an adaptation of the standard mIoU for the models with a single-channel output. We analyze its distinguishing properties and use it to compare modern neural network methods. The experiments were carried out on the universal ADE20K dataset. The proposed TASFormer-based approach demonstrated state-of-the-art segmentation quality on it. The software implementation of the TASFormer method and the bIoU metric is publicly available at www.github.com/subake/TASFormer.

Keywords: Image segmentation · Task embedding · Segmentation quality metric · Transformer

1 Introduction

As a rule, the output of an image segmentation neural network is a multi-channel mask, where the channel count is equal to the number of semantic categories [28,34,35] (see Fig. 1a). This is a problem for models that need to recognize


Fig. 1. A simplified scheme of the considered approaches: (a) conventional neural network for image segmentation, which generates masks for all object categories at the output at once, and (b) task-aware segmentation model, which allows, taking into account task embedding, to issue a mask of only the desired category.

real world objects, which can have a huge number of varieties. For example, the modern ADE20K dataset [41] originally contained 3,688 categories and for practical use this number was reduced to 150 to get a more realistic benchmark for analyzing scenes in images. In this paper, we explore an approach to solving this problem based on performing segmentation taking into account the embeddings of object categories. We call them task embeddings. The selection of a separate embedding under a separate category allows us to form a single-channel mask at the output of the neural network model (see Fig. 1b). This approach is similar to the usage of text embeddings in multi-modal transformer models [1,12], but in our work we are focused on methods that work with only one modality of images. Such task-aware models make it possible to quickly segment only the desired categories, and not all at once. This can be useful for practical applications, for example, in robotics, when a robot needs to interact with an object of only one or several types at a particular time. The first and foremost contribution of this paper is the new transformer neural network architecture using adapter technology to solve the task-aware semantic segmentation problem. Secondly, our approach named TASFormer achieved state-of-the-art segmentation quality during experiments on popular ADE20K [41] dataset both for indoor and outdoor scenes. Finally, we propose a new evaluation metric for semantic segmentation binary Intersection over Union (bIoU). This metric represents a natural adaptation of standard mean Intersection over Union for the models using a task embedding during inferences and having the output in the form of binary mask. bIoU has the following distinguishing properties: 1) predicted masks contribute equally to its value regardless the area of union between ground truth and predicted masks, 2) classes with higher occurrence in a dataset have greater contribution to the final value of bIoU, 3) bIoU is computed only for ground truth categories present in a image. We discuss why these properties represent an advantage for model evaluation.

2 Related Works

The idea of using transformer architectures (VIT [8], SWIN [19], PVT [30], BEiT [31]) for image segmentation based on encoding them using universal tokens made it possible to surpass the quality of convolutional neural networks with attention modules (e.g. HRNet+OCR [39]), which dominated this area for a long time. However, convolutional models are still actively used in scene segmentation tasks in real time (e.g. PIDNet [36], DDRNet [23]). It should be noted separately the SegFormer [35] architecture, which combines the speed of convolutional models and the accuracy of transformer methods due to the generation of more informative multi-scale feature maps. The quality of basic neural network models of image segmentation significantly depends on the data domain and dataset on which they were trained. One of the groups of approaches that allow solving this problem is the use of adapters (HyperFormer [13], VIT-Adapter [3]). By training them with a frozen base model, it is possible to ensure that the model is adjusted to the required data domain. Another approach to training specialized encoders and decoders with a frozen central foundation model is proposed in the Frozen Pretrained Transformer (FPT) method [20]. Significant progress in increasing the universality of segmentation models was achieved when using large multimodal data sets for model training (RefCOCO [38], PhraseCut [32], Refer-YouTube-VOS [26], A2D-Sentences, JHMDBSentences [9], etc.). In this case, the transformer model generates segmentation masks at the output only for the categories specified in the input text query. Similar models exist for both single image segmentation (MDETR [12], X-Decoder [42], CLIPSeg [21], VLT [6]) and video sequence segmentation (MTTR [1], ReferFormer [33], VLT [6]). Separately, it is worth highlighting a class of methods that use a language defined label of categories for openvocabulary segmentation (ViLD [10], OvSeg [16], X-Decoder [42], SAN [37], SAM [15]). These models are trained on datasets with a fixed number of categories (e.g. COCO Stuff [2]), but the text embedding of categories allows segmentation of unseen categories. Such approaches can solve zero-shot segmentation problems, but are still inferior in quality to single-modal state-of-the-art methods that work only with images and a fixed number of categories, pre-trained on a specific dataset (CityScapes [5], ADE20K [41], SceneNet [22] and SUN RGB-D [29], etc.).

3 TASFormer Architectures

To solve the problem of task-aware semantic segmentation, we have developed several variants of a transformer model that takes as input a three-channel color image and an embedding tensor of the task as the desired category. We named our approach TASFormer. The studied options for its construction are shown in Fig. 2. The high-speed SegFormer [35] approach was chosen as the basic neural network model, which provides the hierarchical formation of feature maps of different scales using a small number of transformer blocks.


Fig. 2. TASFormer architectures: (a) with learnable task embeddings (emb) or constant embedding calculating with Vector Symbolic Analysis (VSA emb), (b) with adapterbased approach for learnable task embedding integration in transformer model.

Let's consider the proposed architectures in more detail.

TASFormer (emb). First, we considered a simple concatenation of the learnable task embedding tensor with the feature maps generated in the SegFormer (MiT-B0) transformer decoder, as shown in Fig. 2a. This approach is based on the assumption that the feature map before the last fully connected layer (MLP) contains a sufficient amount of information about all possible categories of objects. Thus an additional tensor can be the condition that helps to correctly obtain the output projection of the desired category in the form of a single-channel mask. For simplicity and convenience, the embedding spatial resolution is set to the same as the feature map, 80 × 80 (for an input image size of 320 × 320). In general, its size is about 1% of the number of elements of the feature tensor to which concatenation is performed. The output mask of the model is also small, only 80 × 80 pixels. This choice is made based on the desired trade-off between performance and quality of the segmentation model. The use of other, larger dimensions is a possible subject for further research.

TASFormer (VSA emb). This approach of using an additional task embedding is very similar to the previous one and is also shown in Fig. 2a. Its difference is the use of constant rather than learnable embeddings of semantic categories, which have the property of pairwise orthogonality of hyperdimensional vectors from the theory of Vector Symbolic Analysis (VSA) [14,25]. In practice, this property is achieved by generating a high-dimensional embedding Te, each element of which is filled with random floats sampled from a univariate Gaussian ("normal") distribution with mean 0 and variance 1:

T_e = \frac{E}{\lVert E \rVert}, \quad E = \mathrm{RandomNormal}(K),   (1)


where K is the embedding length. In our case K equals 64,000 (80 × 80 × 10). For such long vectors, the pairwise dot product is close to zero.

TASFormer (HF Adapter). This is a more complex approach to data fusion, shown in Fig. 2b. In this case, information from the learnable embedding Te (with 512 elements) is taken into account using a neural network adapter, which is integrated into each transformer block. At the same time, unlike the original implementation of the adapter based on HyperFormer [13], we train both the transformer blocks themselves and the adapter at the same time. It should be noted that different adapter module weights are learned for different transformer blocks (shown in Fig. 2b with index i), except for the fully connected layer h, which acts as a task projector network for the entire model encoder and converts task embeddings into a more compact representation I. The fully connected layers h_i^A and h_i^LN calculate the weight matrices for the i-th adapter layers responsible for the feedforward down projection and Layer Normalization, respectively.

TASFormer (HF adapter++). In order to reduce the number of model parameters while maintaining the properties of the adapter layers, we share the weights of the adapter layers and additionally take into account the position of the adapter, as it was implemented in HyperFormer [13]. In this case, we concatenate the learnable task embedding Te and the adapter position embedding PosAd++ and feed them into a fully connected layer h, as shown in Fig. 2b. Since the size of the input tensor is different for each transformer block, we cannot use one adapter for the entire network architecture, so the weight sharing occurs only inside the transformer blocks.
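To make the adapter-based fusion concrete, the sketch below shows one way such a task-conditioned adapter block could look in PyTorch. It is a minimal illustration under our own assumptions — the class name TaskAdapter, the chosen dimensions, and the hypernetwork-style generation of the down-projection are ours, not the authors' released implementation.

import torch
import torch.nn as nn

class TaskAdapter(nn.Module):
    """Minimal HyperFormer-style adapter sketch: a shared projector h maps the
    task embedding T_e to a compact code I, which generates a down-projection
    that conditions the per-block bottleneck (illustrative only)."""
    def __init__(self, hidden_dim, task_dim=512, code_dim=64, bottleneck=64):
        super().__init__()
        self.h = nn.Linear(task_dim, code_dim)                        # shared task projector
        self.down_gen = nn.Linear(code_dim, hidden_dim * bottleneck)  # generates the down-projection weights
        self.up = nn.Linear(bottleneck, hidden_dim)                   # block-specific up-projection
        self.norm = nn.LayerNorm(hidden_dim)
        self.act = nn.GELU()
        self.hidden_dim, self.bottleneck = hidden_dim, bottleneck

    def forward(self, x, task_emb):
        # x: (B, N, hidden_dim) tokens of one transformer block
        # task_emb: (B, task_dim) embedding of the queried category
        code = self.h(task_emb)                                       # (B, code_dim)
        w_down = self.down_gen(code).view(-1, self.hidden_dim, self.bottleneck)
        h = torch.einsum('bnd,bdk->bnk', self.norm(x), w_down)        # task-conditioned down-projection
        return x + self.up(self.act(h))                               # residual connection

In the HF adapter++ variant described above, the task embedding would additionally be concatenated with an adapter position embedding before the shared projector h, and the adapter weights would be shared inside each transformer block.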

4 Binary Intersection over Union

4.1 Motivation

The use of category embeddings at inference time and the growing number of categories that can be segmented inevitably make the use of the standard mean IoU problematic. Firstly, the model output changes compared to conventional neural networks: for each image it does not consist of a single mask with all object categories, but of a set of binary masks for the categories that were queried by embedding. This is the case for our TASFormer. In addition, it is worth noting that many state-of-the-art models for semantic segmentation use the paradigm of mask classification instead of pixel classification [4,11] and, therefore, also output a set of masks. Secondly, in a large set of language-defined categories, some categories may not be completely mutually exclusive. For instance, such classes are present in the ADE20K dataset (e.g., "skyscraper", "building", "house"). Mean IoU takes into account false positive predictions of all categories. Accordingly, if the model predicts a mask for a category that does not correspond to the annotation, although it is present in reality in the image, the value of mean IoU decreases [16].

4.2 Comparison with Similar Existing Metrics

An adaptation of the standard mean IoU for binary masks, mIoU_{Shaban}, is described by Shaban et al. in [27] for the few-shot segmentation problem. Consider a set I of N images in which N_{cl} classes are present. For a semantic class c in an image i, the intersection I_i^c and the union U_i^c are defined with respect to the predicted and ground truth binary masks corresponding to the image i and the class c. Therefore, the definition of mIoU_{Shaban} is the following:

mIoU_{Shaban} = \frac{1}{N_{cl}} \sum_{c=1}^{N_{cl}} \frac{\sum_{i=1}^{N} I_i^c}{\sum_{i=1}^{N} U_i^c}.   (2)

Another version of the IoU adaptation for the case of binary masks, used in [7,18,24], is defined as follows [18]:

FBIoU = \frac{1}{2}\left(IoU_{fg} + IoU_{bg}\right),   (3)

IoU_{fg} = \frac{\sum_{i=1}^{N}\sum_{c=1}^{N_{cl}} I_i^c}{\sum_{i=1}^{N}\sum_{c=1}^{N_{cl}} U_i^c}.   (4)

IoU_{bg} is calculated in the same way but with the foreground and background reversed. Both mIoU_{Shaban} and FBIoU have a bias towards masks having a larger union between predicted and ground-truth masks. However, the area of union between masks depends not only on the segmentation quality of a model, but also on the area of the ground truth and predicted masks. In our belief, the value of a metric should not depend on the size of a mask. Therefore, to evaluate our model we propose a modification of the standard Intersection over Union metric. The definition of binary IoU is equivalent to the definition of overall IoU, where the IoU values corresponding to different masks are averaged uniformly regardless of the size of the union between predicted and ground truth masks. Binary Intersection over Union is defined as:

bIoU = \frac{1}{\sum_{i=1}^{N} N_{class}(i)} \sum_{i=1}^{N} \sum_{c \in i} \frac{I_i^c}{U_i^c}.   (5)

Note that \sum_{i=1}^{N} N_{class}(i) = \sum_{c=1}^{N_{cl}} N_{im}(c), where N_{im}(c) is the number of images where class c occurs. Therefore the definition of bIoU can be rewritten as:

bIoU = \sum_{c=1}^{N_{cl}} \frac{N_{im}(c)}{\sum_{c'=1}^{N_{cl}} N_{im}(c')} \cdot \frac{1}{N_{im}(c)} \sum_{i=1}^{N} \mathbb{1}_{c \in i} \frac{I_i^c}{U_i^c}.   (6)

Looking at this definition one can conclude that bIoU has three properties that distinguish it from mIoUShaban and FBIoU : 1) the classes that have larger occurrence in image set have greater contribution to final value of bIoU , 2) all masks for one class have equal contribution to final value of bIoU regardless their size, 3) bIoU is computed only for categories that are present in ground

[Fig. 3 panels: left — mIoU_{Shaban} = 66.4%, FBIoU = 78.9%, bIoU = 51.0%, with per-mask IoU values of 0.8, 0.2, and 0.7 for groups of 2, 8, and 10 images; right — mIoU_{Shaban} = 45.0%, FBIoU = 83.3%, bIoU = 60.0%, with per-mask IoU values of 0.2 and 0.7 for groups of 2 and 8 images.]

Fig. 3. An illustration of the influence of the size and frequency of occurrence of a class on the final value of various metrics for two classes: green circles and blue squares. Left: Size imbalance in the distribution of green circles leads to a higher value of mIoUShaban and FBIoU compared to bIoU . Right: Class imbalance between green circles and blue squares leads to a higher value of bIoU compared to mIoUShaban , the small masks of green circles leads to a high value of FBIoU . (Color figure online)

truth image annotation. The latter fact helps to better quantify the quality of models in the case where different language defined categories refer to the same object in an image. The design of bIoU allows evaluating the interaction of a model with a certain set of categories. Though, the bIoU can not be used to evaluate the performance of a model in the cases where the task embedding doesn’t correspond to any pixels in the image. The differences in averaging methods influence the response of metrics to imbalances of mask sizes and class occurrences in image set. Figure 3 shows an example of how the imbalances could create cases where a marked difference between metrics can be observed. Consider two classes: green circles and blue squares. Let there be a set of images sized 100 × 100 pixels, each image contains only one class from described above. Then, a case of size imbalance from the Fig. 3 (left) may occur. Let the set of images consists of: 1) two images with a large green circle that have good predicted masks (intersection area (I) equals to 6400, union area (U ) equals to 8000), 2) eight images with a small green circle that is poorly predicted (I = 160, U = 800), 3) ten images with a blue square of medium size (I = 1400, U = 2000). In this case mIoUShaban and FBIoU have a larger value compared to bIoU due to size imbalance in ground truth masks of green circles. Small segmentation masks of green circles additionally contribute to high value of IoUbg in FBIoU . The Fig. 3 (right) illustrates a case of class imbalance. If the set of images consists of: 1) two images with a small green circle that is poorly predicted (I = 160, U = 800), 2) eight images with a blue square of medium size (I = 1400, U = 2000). The poor performance on green circles is better captured by mIoUShaban than by bIoU . Again FBIoU has high value in the presence of small prediction and ground truth masks even if they have a little intersection. However, a large set of categories may contain some natural imbalance between categories. Then, the bIoU metric would characterize the performance of a model in natural conditions for all categories at once.
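The three metrics can also be written compactly in code. The following sketch is our own minimal implementation for the toy scenario above (per-mask intersection and union areas are given directly; the background IoU of FBIoU is aggregated over all images, which is one possible reading of Eqs. (3)-(4), and each image is assumed to contain a single annotated class). With these assumptions it reproduces the left-panel values of Fig. 3.

def miou_shaban(I, U):
    # Eq. (2): per-class ratio of summed intersections to summed unions, then mean over classes
    return sum(sum(I[c]) / sum(U[c]) for c in I) / len(I)

def fb_iou(I, U, img_area, n_images):
    # Eqs. (3)-(4): foreground IoU micro-averaged over all masks; background derived per image
    i_fg = sum(sum(I[c]) for c in I)
    u_fg = sum(sum(U[c]) for c in U)
    iou_bg = (n_images * img_area - u_fg) / (n_images * img_area - i_fg)
    return 0.5 * (i_fg / u_fg + iou_bg)

def b_iou(I, U):
    # Eq. (5): each (image, class) pair contributes its own IoU with equal weight
    # (assumes one annotated class per image, as in the toy example)
    ious = [i / u for c in I for i, u in zip(I[c], U[c])]
    return sum(ious) / len(ious)

# Toy data matching the left panel of Fig. 3: 20 images of 100x100 pixels.
I = {"circle": [6400] * 2 + [160] * 8, "square": [1400] * 10}
U = {"circle": [8000] * 2 + [800] * 8, "square": [2000] * 10}
print(round(miou_shaban(I, U), 3))             # 0.664
print(round(fb_iou(I, U, 100 * 100, 20), 3))   # 0.789
print(round(b_iou(I, U), 3))                   # 0.51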

5 Experimental Results

5.1 Datasets

We used ADE20K [41] as a more complex dataset to evaluate the developed approach. It is a diverse scene parsing dataset covering 150 fine-grained semantic categories, consisting of 20,210 indoor and outdoor images for training and 2,000 for validation. It is currently one of the most popular benchmarks for comparing image semantic segmentation methods.

5.2 Training Setup

We train neural network models on a server with 2×Nvidia Tesla V100. We use the pre-trained MiT-B0 encoder on the Imagenet-1K dataset and randomly initialize the decoder. During training, we applied data augmentation through random brightness contrast, random horizontal flipping, and random affine transformation. We set crop size to 320 × 320 and a batch size of 10 on ADE20K for ablation study of our models. We trained the models using AdamW optimizer for 140K iterations on ADE20K. The learning rate was set to an initial value of 0.00005 and then used a “poly” learning rate schedule with factor 1.0 by default. For simplicity, we followed the ideas of the SegFormer [35] and did not adopt widely-used tricks such as OHEM, auxiliary losses or class balance loss. During evaluation, we scale the short side of the image to training cropping size and keep the aspect ratio for ADE20K. For comparison of state-of-the-art methods on ADE20K we additionally trained TASFormer with crop image size of 640 × 640 and 896 × 896. We fine-tune ReferFormer [33] and VLT [6] on ADE20K in order to be able to compare them with TASFormer. We use the code provided in the corresponding repositories. We use the pre-trained ReferFormer on RefCOCO/+/g [38] with Swin-T backbone. We set a batch size of 1 and followed the fine-tuning procedure proposed by Wu et al. [33] for A2D-Sentences [9]. The fine-tuning process was extended for two additional epochs (resulting in 8 total epochs instead of 6) without drop of the learning rate. To fine-tune VLT we use the pre-trained VLT on RefCOCO with Darknet53 visual backbone and bi-GRU text encoder. We set a batch size of 8 and fine-tuned the model during 50 epochs using the same training settings as Ding et al. [6] used for original VLT training. As input text queries we used the language defined categories of objects in ADE20K. Since a category in ADE20K is described by a list of words, we chose the first words in categories definitions provided by Detectron 2 [34]. 5.3

Output Adaptation

We use the following adaptation of standard output of the SegFormer model in order to maintain binary masks for each class and be able to compute the bIoU metric. The output tensor of the SegFormer consists of a tensor with shape C × H × W , where C is the number of classes, H and W are height and width

TASFormer: Task-Aware Image Segmentation Transformer

313

Table 1. Segmentation results on different numbers of categories from the ADE20K dataset, bIoU, %. All models used pre-trained MiT-B0 encoder weights and data preprocessing for mask segmentation.

Method                    | Params | ADE20K (2) | ADE20K (12) | ADE20K (150)
SegFormer (B0)            | 3.8M   | 63.2       | 52.4        | 37.9
TASFormer (emb)           | 4–14M  | 60.8       | 6.1         | 14.1
TASFormer (VSA emb)       | 4.1M   | 48.6       | 0.1         | 0.1
TASFormer (HF adapter)    | 7.3M   | 67.9       | 59.4        | 48.3
TASFormer (HF adapter++)  | 5.7M   | 67.9       | 58.4        | 47.6

of an input image. This output tensor is normalized via min-max normalization. Next, for each class i we keep all pixels that have a predicted value greater than a threshold th. We vary the threshold th in order to maximize the final value of bIoU. We choose this strategy since it provided the highest value of bIoU compared to other adaptations. To compute the standard mIoU metric we query all possible categories for each image and combine them into a multi-channel mask. Then we follow the standard procedure for the mIoU computation.
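A minimal sketch of this output adaptation is given below (our own illustrative NumPy version; the function names, the candidate threshold grid, and the per-image dictionaries of ground-truth masks are assumptions, not the released evaluation code).

import numpy as np

def to_binary_masks(logits, th):
    """logits: (C, H, W) raw segmentation output; returns C binary masks."""
    lo, hi = logits.min(), logits.max()
    norm = (logits - lo) / (hi - lo + 1e-8)    # min-max normalization of the whole tensor
    return norm > th                           # keep pixels above the threshold

def pick_threshold(logits_list, gt_masks_list, candidates=np.linspace(0.1, 0.9, 17)):
    # choose the threshold that maximizes bIoU on the validation predictions
    def biou(th):
        ious = []
        for logits, gts in zip(logits_list, gt_masks_list):
            masks = to_binary_masks(logits, th)
            for c in gts:                      # only classes present in the annotation
                inter = np.logical_and(masks[c], gts[c]).sum()
                union = np.logical_or(masks[c], gts[c]).sum()
                ious.append(inter / max(union, 1))
        return float(np.mean(ious))
    return max(candidates, key=biou)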

5.4 Results on ADE20K

The TASFormer (HF adapter) approach also significantly outperformed both the SegFormer (B0) base model and the TASFormer (emb) and TASFormer (VSA emb) methods on the ADE20K dataset (see Table 1). The TASFormer (HF adapter++) demonstrates the second best result and has fewer parameters; it is inferior to the TASFormer (HF adapter) by 1–2%. It should be noted that the simple embedding concatenation technique (Fig. 2a) significantly underperforms for a large number of classes and scores high on 2-class segmentation. The use of learnable embeddings gives higher quality of semantic segmentation than the use of random VSA embeddings. The SegFormer metric for 150 classes is significantly lower compared to the other methods, since its architecture is not designed for binary mask generation.

A comparison with state-of-the-art methods on ADE20K is shown in Table 2. For this purpose, we have chosen popular high-precision transformer-based models. During the experiment, we used the SegFormer pre-trained on the ADE20K dataset. We trained VLT [6] and ReferFormer [33] as described in Sect. 5.2. To calculate the bIoU metric, the approach described in Sect. 5.3 was used. Our TASFormer (HF adapter) outperforms powerful vision-language transformer models such as VLT and ReferFormer and shows an increase of 8% in the bIoU metric compared to the baseline model SegFormer (B0). We also compared the quality of the models using the standard mIoU metric. As can be seen from Table 2, models using task embedding demonstrate lower values of the mIoU metric compared to the SegFormer (B0) baseline. The reason


Table 2. Comparison of state-of-the-art methods on the ADE20K dataset (150 categories).

Method                                      | Params | Crop Size   | mIoU, % | bIoU, %
SegFormer (B0) [35]                         | 3.8M   | 512 × 512   | 36.8    | 40.0
VLT [6]                                     | 89M    | 416 × 416   | 12.2    | 50.3
ReferFormer [33]                            | 176M   | 360 × 640   | 8.7     | 51.08
Mobile SAM (segmentation module only) [40]  | 9.66M  | 1024 × 1024 | 4.5     | 48.7
SAM (segmentation module only) [15]         | 615M   | 1024 × 1024 | 5.3     | 52.2
TASFormer (HF adapter)                      | 7.3M   | 320 × 320   | 14.6    | 48.3
TASFormer (HF adapter)                      | 7.3M   | 640 × 640   | 14.7    | 51.1
TASFormer (HF adapter)                      | 7.3M   | 896 × 896   | 14.4    | 52.0

Fig. 4. Visualized segmentation results on ADE20K val set. The columns left-to-right refer to the input image, the ground truth, the outputs of various semantic segmentation models and our TASFormer (HF adapter).

behind this is that methods that use task embedding predict similar semantic categories with equal probability. Despite considering this fact, the TASFormer model outperforms VLT and ReferFormer methods. Additionally, we assessed the segmentation quality of TASFormer compared to segmentation foundation models: SAM [15] and its real-time version Mobile SAM [40]. It should be noted that direct comparison of these models with the TASFormer is difficult, because the segmentation quality of SAM methods depends on the given box prompts. Following the pipeline of Grounded SAM we use the Grounding DINO method [17] in order to generate box prompts from text in a zero-shot manner. The TASFormer method outperforms Mobile SAM


and has quality comparable to SAM despite a significantly smaller model size (see Table 2 and Fig. 4). Figure 4 shows the visualized results of TASFormer (HF adapter) compared to other segmentation methods in different scenes. The SegFormer binary masks are noisy; however, like TASFormer, it was able to find the "plant" in the first image. The quality of masks from TASFormer generally exceeds that of the other methods. However, it sometimes has difficulty segmenting small objects, such as the legs of the "person" in the second image.

6 Conclusion

In this paper, we proposed and analyzed the capabilities of the transformer model for image segmentation named TASFormer with several ways to use additional task embeddings. Based on the results of experiments on the ADE20K we can conclude that the use of additional task embeddings not only provides additional opportunities for segmenting the desired semantic categories, but also has a significant positive effect on improving the quality of segmentation of the base model. In addition, we proposed a metric bIoU that enables comparing models with output in form of a set of binary masks with the state-of-the-art approaches for semantic segmentation. As a topic for further research can be considered a deeper study of possible sizes of task embeddings and their impact on the final quality of the model. The adapter technique has shown itself to be very promising, and experiments with other adapter architectures can also yield valuable new results. In our future work, we also plan to train TASFormer to perform segmentation in the case where a task embedding refers to a category absent in the image. A study of the applicability of the proposed approach to other datasets, especially outdoor, is also worthwhile. A separate important promising area is also testing the possibility of integrating task-aware segmentation into robotic applications, such as visual navigation.

References 1. Botach, A., Zheltonozhskii, E., Baskin, C.: End-to-end referring video object segmentation with multimodal transformers. In: Proceedings of the IEEE/CVF CVPR (2022) 2. Caesar, H., Uijlings, J., Ferrari, V.: Coco-stuff: thing and stuff classes in context. In: Proceedings of the IEEE/CVF CVPR, pp. 1209–1218 (2018) 3. Chen, Z., et al.: Vision transformer adapter for dense predictions. In: The Eleventh International Conference on Learning Representations (2023). https://openreview. net/forum?id=plKu2GByCNW 4. Cheng, B., Misra, I., Schwing, A.G., Kirillov, A., Girdhar, R.: Masked-attention mask transformer for universal image segmentation. In: Proceedings of the IEEE/CVF CVPR. pp, 1290–1299 (2022)


5. Cordts, M., et al.: The cityscapes dataset for semantic urban scene understanding. In: Proceedings of the IEEE/CVF CVPR, pp. 3213–3223 (2016) 6. Ding, H., Liu, C., Wang, S., Jiang, X.: Vlt: vision-language transformer and query generation for referring segmentation. IEEE Trans. Patt. Anal. Mach. Intell. (2022) 7. Dong, N., Xing, E.P.: Few-shot semantic segmentation with prototype learning. In: BMVC. vol. 3 (2018) 8. Dosovitskiy, A., et al.: An image is worth 16x16 words: Transformers for image recognition at scale. In: International Conference on Learning Representations (2021), https://openreview.net/forum?id=YicbFdNTTy 9. Gavrilyuk, K., Ghodrati, A., Li, Z., Snoek, C.G.: Actor and action video segmentation from a sentence. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 5958–5966 (2018) 10. Gu, X., Lin, T.Y., Kuo, W., Cui, Y.: Open-vocabulary object detection via vision and language knowledge distillation. In: International Conference on Learning Representations (2022). https://openreview.net/forum?id=lL3lnMbR4WU 11. Jain, J., Li, J., Chiu, M.T., Hassani, A., Orlov, N., Shi, H.: Oneformer: one transformer to rule universal image segmentation. In: Proceedings of the IEEE/CVF CVPR, pp. 2989–2998 (2023) 12. Kamath, A., Singh, M., LeCun, Y., Synnaeve, G., Misra, I., Carion, N.: Mdetrmodulated detection for end-to-end multi-modal understanding. In: Proceedings of the IEEE/CVF ICCV, pp. 1780–1790 (2021) 13. Karimi Mahabadi, R., Ruder, S., Dehghani, M., Henderson, J.: Parameter-efficient multi-task fine-tuning for transformers via shared hypernetworks. In: Annual Meeting of the Association for Computational Linguistics (2021) 14. Kirilenko, D., Kovalev, A.K., Solomentsev, Y., Melekhin, A., Yudin, D.A., Panov, A.I.: Vector symbolic scene representation for semantic place recognition. In: 2022 International Joint Conference on Neural Networks (IJCNN), pp. 1–8. IEEE (2022) 15. Kirillov, A., et al.: Segment anything. arXiv:2304.02643 (2023) 16. Liang, F., et al.: Open-vocabulary semantic segmentation with mask-adapted clip. In: Proceedings of the IEEE/CVF CVPR, pp. 7061–7070 (2023) 17. Liu, S., et al.: Grounding dino: Marrying dino with grounded pre-training for openset object detection. arXiv:2303.05499 (2023) 18. Liu, W., Zhang, C., Lin, G., Liu, F.: CRCNet: few-shot segmentation with crossreference and region-global conditional networks. Int. J. Comput. Vision 130(12), 3140–3157 (2022) 19. Liu, Z., et al..: Swin transformer: hierarchical vision transformer using shifted windows. In: Proceedings of the IEEE/CVF ICCV, pp. 10012–10022 (2021) 20. Lu, K., Grover, A., Abbeel, P., Mordatch, I.: Pretrained transformers as universal computation engines. Proc. of the AAAI Conf. Artif. Intell. 36, 7628–7636 (06 2022). https://doi.org/10.1609/aaai.v36i7.20729 21. Lüddecke, T., Ecker, A.: Image segmentation using text and image prompts. In: Proceedings of the IEEE/CVF CVPR, pp. 7086–7096 (2022) 22. McCormac, J., Handa, A., Leutenegger, S., Davison, A.J.: Scenenet RGB-D: Can 5m synthetic images beat generic imagenet pre-training on indoor segmentation? In: Proceedings. of the IEEE/CVF ICCV, pp. 2697–2706 (2017). https://doi.org/ 10.1109/ICCV.2017.292 23. Pan, H., Hong, Y., Sun, W., Jia, Y.: Deep dual-resolution networks for real-time and accurate semantic segmentation of traffic scenes. IEEE Trans. Intell. Transp. Syst. 24(3), 3448–3460 (2022) 24. Rakelly, K., Shelhamer, E., Darrell, T., Efros, A., Levine, S.: Conditional networks for few-shot semantic segmentation. 
ICLR Workshop (2018)


25. Schlegel, K., Neubert, P., Protzel, P.: A comparison of vector symbolic architectures. Artif. Intell. Rev. 55, 4523–4555 (2021). https://doi.org/10.1007/s10462021-10110-3 26. Seo, S., Lee, J.-Y., Han, B.: URVOS: unified referring video object segmentation network with a large-scale benchmark. In: Vedaldi, A., Bischof, H., Brox, T., Frahm, J.-M. (eds.) Computer Vision – ECCV 2020: 16th European Conference, Glasgow, UK, August 23–28, 2020, Proceedings, Part XV, pp. 208–223. Springer International Publishing, Cham (2020). https://doi.org/10.1007/978-3-030-585556_13 27. Shaban, A., Bansal, S., Liu, Z., Essa, I., Boots, B.: One-shot learning for semantic segmentation. In: British Machine Vision Conference (2017) 28. Shepel, I., Adeshkin, V., Belkin, I., Yudin, D.A.: Occupancy grid generation with dynamic obstacle segmentation in stereo images. IEEE Trans. Intell. Transp. Syst. 23(9), 14779–14789 (2021) 29. Song, S., P., S., Lichtenberg, Xiao, J.: Sun RGB-D: A RGB-D scene understanding benchmark suite. Proceedings of the IEEE CVPR (2015). https://rgbd.cs. princeton.edu/ 30. Wang, W., et al.: Pvt v2: improved baselines with pyramid vision transformer. Comput. Vis. Media 8(3), 415–424 (2022) 31. Wang, W., et al.: Image as a foreign language: beit pretraining for vision and vision-language tasks. In: CVPR, pp. 19175–19186 (2023) 32. Wu, C., Lin, Z., Cohen, S., Bui, T., Maji, S.: Phrasecut: language-based image segmentation in the wild. In: Proceedings of the IEEE/CVF CVPR, pp. 10216– 10225 (2020) 33. Wu, J., Jiang, Y., Sun, P., Yuan, Z., Luo, P.: Language as queries for referring video object segmentation. In: Proceedings of the IEEE/CVF CVPR, pp. 4974– 4984 (2022) 34. Wu, Y., Kirillov, A., Massa, F., Lo, W.Y., Girshick, R.: Detectron2. https://github. com/facebookresearch/detectron2 (2019) 35. Xie, E., Wang, W., Yu, Z., Anandkumar, A., Alvarez, J.M., Luo, P.: Segformer: simple and efficient design for semantic segmentation with transformers. Adv. Neural. Inf. Process. Syst. 34, 12077–12090 (2021) 36. Xu, J., Xiong, Z., Bhattacharyya, S.P.: Pidnet: a real-time semantic segmentation network inspired by PID controllers. In: CVPR, pp. 19529–19539 (2023) 37. Xu, M., Zhang, Z., Wei, F., Hu, H., Bai, X.: Side adapter network for openvocabulary semantic segmentation. In: Proceedings of the IEEE/CVF CVP, pp. 2945–2954 (2023) 38. Yu, L., Poirson, P., Yang, S., Berg, A.C., Berg, T.L.: Modeling context in referring expressions. In: Leibe, B., Matas, J., Sebe, N., Welling, M. (eds.) Computer Vision – ECCV 2016: 14th European Conference, Amsterdam, The Netherlands, October 11-14, 2016, Proceedings, Part II, pp. 69–85. Springer International Publishing, Cham (2016). https://doi.org/10.1007/978-3-319-46475-6_5 39. Yuan, Y., Chen, X., Wang, J.: Object-contextual representations for semantic segmentation. In: ECCV (2020) 40. Zhang, C., et al.: Faster segment anything: towards lightweight SAM for mobile applications (2023) 41. Zhou, B., et al.: Semantic understanding of scenes through the ade20k dataset. Int. J. Comput. Vis. 127(3), 302–321 (2019) 42. Zou, X., et al.: Generalized decoding for pixel, image, and language. In: Proceedings of the IEEE/CVF CVPR, pp. 15116–15127 (2023)

Unsupervised Joint-Semantics Autoencoder Hashing for Multimedia Retrieval Yunfei Chen, Jun Long(B) , Yinan Li, Yanrui Wu, and Zhan Yang(B) Big Data Institute, School of Computer Science, Central South University, Changsha Hunan, China {yunfeichen,junlong,liyinan,8209200520,zyang22}@csu.edu.cn

Abstract. Cross-modal hashing has emerged as a prominent approach for large-scale multimedia information retrieval, offering advantages in computational speed and storage efficiency over traditional methods. However, unsupervised cross-modal hashing methods still face challenges in the lack of practical semantic labeling guidance and handling of cross-modal heterogeneity. In this paper, we propose a new unsupervised cross-modal hashing method called Unsupervised Joint-Semantics Autoencoder Hashing(UJSAH) for multimedia retrieval. First, we introduce a joint-semantics similarity matrix that effectively preserves the semantic information in multimodal data. This matrix integrates the original neighborhood structure information of the data, allowing it to better capture the associations between different modalities. This ensures that the similarity matrix can accurately mine the underlying relationships within the data. Second, we design a dual prediction network-based autoencoder, which implements the interconversion of semantic information from different modalities and ensures that the generated binary hash codes maintain the semantic information of different modalities. Experimental results on several classical datasets show a significant improvement in the performance of UJSAH in multimodal retrieval tasks relative to existing methods. The experimental code is published at https:// github.com/YunfeiChenMY/UJSAH.

Keywords: Cross-modal Hashing Joint-Semantics · Dual Prediction

1 Introduction

Introduction

With the continuous advancements in science and technology, network data size and variety are rapidly expanding. Traditional information retrieval methods that use original data for computation suffer from high computational complexity. Therefore, achieving high retrieval efficiency while requiring minimal storage space has become a crucial research direction. Hash learning has gained significant attention in large-scale multimodal data retrieval due to its efficient c The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2024  B. Luo et al. (Eds.): ICONIP 2023, LNCS 14451, pp. 318–330, 2024. https://doi.org/10.1007/978-981-99-8073-4_25

Unsupervised Joint-Semantics Autoencoder Hashing

319

computation speed and low storage requirements [19]. Hash learning involves a hash function to map high-dimensional raw data into a low-dimensional binary hash code. The mainstream hashing methods primarily rely on classical hash functions based on features. At the same time, some recent works [21] have integrated semantic information extraction, hash function training, and hash code generation into the same framework. Given the inherent heterogeneity among different modal data, directly calculating their similarity becomes difficult. Cross-modal hashing methods aim to map the original data from different modalities into a shared binary space, which enables the similarity calculations between different modalities using the Hamming distance. The data of real application scenarios are mostly unlabeled, severely hindering supervised hashing development. The unsupervised deep cross-modal hashing method utilizes deep networks’ powerful feature extraction capability to fully extract deep semantic features from the raw data, enabling the generation of binary hash codes rich in semantic information. It [4] mainly integrates features of different modalities’ raw data to construct similarity matrices and employs deep neural networks to construct hash functions for the generation of hash codes. Although unsupervised deep cross-modal hashing has gone well, there is still significant room for progress in constructing the similarity matrix of raw data and cross-modal feature heterogeneity. Traditional autoencoder hashing methods only consider the decoding and reconstruction of intra-modal semantic information and lack the mining of cross-modal semantic information. We propose a new unsupervised multimedia hashing method called Unsupervised Joint-Semantics Autoencoder Hashing to address the aforementioned challenge. First, the UJSAH method design joint-semantics similarity matrices to comprehensively explore similarity relationships between multimodal data. Second, the UJSAH method uses an encoding module to generate hash codes for a given data, a decoding module to reconstruct the raw data to ensure that the resulting hash codes preserve the complete semantic information contained in the raw data, and a dual prediction module is designed in the autoencoder that explores the deep correlation relationships between different modal data. The core work of UJSAH is as follows: 1. The joint-semantic similarity matrix is constructed to explore multi-modal similarity relations and improve the hash function training guidance. The similarity matrix is constructed considering the similarity within each modality and the similarity between different modalities. 2. An autoencoder established on a dual prediction network is designed to generate hash codes. The method employs an autoencoder to ensure that the resulting hash codes preserve the complete semantic information contained in the raw data and uses a dual prediction network to achieve the exploration of semantic association relationships between multi-modal data. 3. Comprehensive experiments conducted on the MIRFlickr and NUS-WIDE datasets verify that the UJSAH method significantly exceeds the mainstream baseline methods.

320

2

Y. Chen et al.

Related Work

Current hashing can be broadly categorized into two categories: supervised hashing and unsupervised hashing. 2.1

Supervised Hashing

In supervised hashing, the semantic information present in the labels is utilized to guide the learning of semantic information in the hash codes. SASH [14] adaptively learns the similarity matrix and saves the association information in the labels to the data features to extract the label relevance for optimizing the previously mentioned matrix. [25] proposes incorporating probabilistic code balance constraints into deep supervised hashing, which enforces a discrete uniform distribution for each hash code. To guarantee that the binary hash codes generated by the model align with the semantic information classification think in the original data, DSDH [9] proposes a deeply supervised discrete hashing algorithm. Literature [17] presents an end-to-end model that effectively extracts key features and generates hash codes with precise semantic information. DPN [3] applies differentiable bit hinge-like losses to the network’s output channels, ensuring their values deviate from zero. SHDCH [22] accepts hash codes by explicitly exploring hierarchical tags. DSH [11] introduces a novel approach to deep supervised hashing, aiming to retain compact hash codes that maintain similarity for large-scale image data. Although supervised hashing methods have made significant progress in information retrieval, most data is unlabeled in real-world scenarios. In contrast, unsupervised hashing methods are more suitable for handling real-world application scenario data and reduce the expensive cost of the manual labeling process due to its property of not relying on labeled information. 2.2

Unsupervised Hashing

Unsupervised hashing fully uses the semantic information between the raw data to guarantee its maintenance in the binary hash code. To address the issue of ignoring neighboring instances and label granularity, DCH-SCR [12] digs deeper into the semantic similarity information within multimedia data. CAGAN [10] proposes an adaptive attention network model to retrieve massive multimodal data efficiently. In order to protect the privacy and security of data, [23] proposes a data-centric multimedia hash learning approach. To efficiently retrieve crossmodal remote sensing images, DACH [5] uses generative adversarial network hashing to extract fine-grained feature information in remote sensing images. To alleviate the limitations in similarity supervision and optimization strategies, DAEH [15] uses discriminative similarity matrix and adaptive self-updating optimization strategies to generate hash codes and train hash functions. Existing cross-modal hash retrieval methods have significantly progressed around data feature extraction and cross-modal association mining. However, there is still much room for improvement in dealing with cross-modal heterogeneity and deep data feature association extraction.

Unsupervised Joint-Semantics Autoencoder Hashing

321

Fig. 1. The basic framework of the Unsupervised Joint-Semantics Autoencoder Hashing.

3

The Proposed Method

This section describes UJSAH method, including notations, architecture, objective function, and extensions. The framework consists of an encoding network, a dual prediction network, and a decoding network. The encoding network maintains consistency in multimedia information, the dual prediction network focuses on reconstructing the different modal data, and the decoding network combines the original data with the generated hash codes. 3.1

Notations

This paper uses bold uppercase and lowercase letters to represent matrices and m = vectors. Given a dataset X = {X1 , ..., Xm }|M m=1 of M modalities, where X dm ×n , dm is the dimensionality of data modality X, n represents {x1 , ..., xn } ∈ R the size of the dataset. We use two modalities, image and text, to verify the effectiveness of the proposed method, and we set X = {Xv , Xt }, where Xv and Xt denote the image and text feature matrices. The proposed UJSAH method is to generate compact hash code B ∈ {−1, 1}k×n , where k represents the length of the hash code. 3.2

Architecture

As shown in Fig. 1, the proposed method UJSAH is an end-to-end framework with three main components, i.e., Encoding Network, Decoding Network, and Prediction Network, to process image and text data. Similarity Construction Efficient extraction of the underlying neighborhood structure and maintaining consistent hash code relationships with the original data are crucial in unsupervised cross-modal retrieval tasks. We employ cosine similarity to estimate the similarity between different data to achieve this. In this paper, we adopt the idea

322

Y. Chen et al.

of similarity cross-modal computation and combine the fusion computation of different modal data with improving the deep mining of the original data similarity. To fully explore the semantic relationships in the raw data, we normalize the raw data information of all modalities and integrate them, then calculate the similarity between the raw data. ˜ v, X ˜ v ) and St = We can compute the cosine similarity matrices Sv = c(X t t ˜ ) to represent the semantic association information for the images and ˜ ,X c(X texts. We first integrate Sv and St summed by weights as follows: ˜ = μSv + (1 − μ)St , s.t.μ ∈ [0, 1]. S

(1)

˜ as a similarity relation between multimodal instances. Next, we consider S The unified characterization of multimodal semantic relations can be achieved  by computing Sv St to dig deeply into the semantic associations between different modal data to achieve a comprehensive representation of the associative relations between Sv and St . We propose a joint-semantics similarity matrix S = J(Sv , St ) ∈ [−1, +1]n×n to construct the semantic similarity between input instances Xv and Xt . To introduce the hybrid function J, we finally define the joint-semantics similarity matrix S as follows: 

v t ˜ + ηS S , S = J(S , S ) = (1 − η)S n v

t

(2)

where η is the trade-off parameter that regulates the similarity description. Encoding Network: Hash retrieval techniques mainly map the high-dimensional original feature into a low-dimensional information space while effectively preserving the original data’s semantic information and semantic relationships. In the encoding stage, we use Image Encoder to transform image data Xv ∈ Rdv × into feature Zv ∈ Rdev × denote the dev −dimensional image feature vector with  instances and further input hash layer to generate binary hash code Bv ∈ {−1, 1}k× . Text Encoder extracts feature Zt ∈ Rdet × represents the det −dimensional text feature vector with  instances from the original text data Xt ∈ Rdt × and generates binary hash code Bt ∈ {−1, 1}k× . The function of developing binary hash code is as follows: Zm = f (Xm ; θm ), Bm = sign(Zm ),

(3)

where θv , θt denotes the parameter weights of the corresponding neural network. Dual Prediction Network: In the dual prediction stage, since different modalities of the same data have similar semantic information, we use the Image Prediction Network to convert ¯ v . The Text Prediction Network the text information Zt into image information Z v extracts the information in image feature Z to generate the corresponding text ¯ t . The dual prediction network can explore the semantic association of feature Z different modal information of the data. The process of dual prediction is defined as follows: ¯ t = f (Zv ; θpt ), ¯ v = f (Zt ; θpv ), Z Z (4)

Unsupervised Joint-Semantics Autoencoder Hashing

323

where θpv , θpt denote the parameter weights of the corresponding network. Decoding Network: ¯ v which are generated by In the decoding stage, we use the potential features Z the dual prediction network as the input of the decoding network to generate the ¯ t to generate X ¯ t to achieve the decoding ¯ v , and input Z original data instances X ¯ m generated of different modalities’ data, ensuring that the potential features Z by the dual prediction network contain the comprehensive semantic information in the original data. Moreover, construct the similarity matrix between the original data instances to constrain the validity of the data features generated by the dual prediction network. ¯ t ) = f (Z ¯ t ; θdt ), X ¯ v = G(Z ¯ v ) = f (Z ¯ v ; θdv ), ¯ t = G(Z X

(5)

where θdv and θdt denote the parameter weights of the corresponding networks. 3.3

Objective Function

To guarantee the quality of the resulting hash codes, we fully consider that the generated hash codes Bv and Bt maintain a similar relationship intra-modal and inter-modal of the original instances to improve the performance of retrieval further. Ultimately, the hash code generation loss Lh is defined as follows: min Lh = α  S − Bv Bt



θv ,θt

+β  S − B B t

2F +β  S − Bv Bv  2F t

2F

+γ  B − B v

t

(6)

2F ,

where θv , θt are the parameters of encoding network, and α, β, γ are the weight¯m ing factors. In order to ensure that the predicted generated data instances X strictly maintain the similarity relationship between the raw data, the objective function is defined as follows:  ¯ m − Xm 2 , min Lpre = δ X F (7) θdv ,θdt

m∈{v,t}

where δ is the weighting factor. The definition of the final objective function is given founded on the above several modular loss functions as follows: min L = Lh + Lpre = α  S − Bv B

t



2F +β

θm

 S − Bm Bm  2F

m∈{v,t}

+γ  Bv − Bt 2F +δ



(8)

¯ m − Xm 2 , X F

m∈{v,t}

where θm ∈ {θv , θt , θpv , θpt , θdv , θdt } denote the parameter weights of network.

324

3.4

Y. Chen et al.

Extensions

More modalities: The UJSAH method can accomplish the task of multimodal scenarios, and when there are multiple modalities, a new network model can be added for each modality with appropriate modifications to the objection function Eq. 9 is shown below: min L = Lh + Lpre =α



S−B

m1

B

m2 

2F



m1 ,m2 ∈G







θm

 S − Bm Bm  2F

m∈G

B

m1

−B

m2

2F

m1 ,m2 ∈G





¯ m − Xm 2 . X F

(9)

m∈G

s.t.G = {1, ..., M}, m1 = m2 , θm ∈ {θ1 , ..., θM , θp1 , ..., θpM , θd1 , ..., θdM }. Out-of-Sample: After the modal is fully trained, we can employ the trained model to develop binary hash codes for any sample of a new query. In detail, give a query data x = Xm ∈ Rdm ×1 , we can obtain the hash code as follows: b = sign(f (x; θm )), s.t.m ∈ {1, ..., M}. 3.5

(10)

Computational Complexity Analysis

This section examines the computational complexity of the UJSAH method, as shown in Algorithm 1. During the experiments, the primary time cost lies in Eq.(8). In each iteration of the model training, we calculate the function Eq.(8), which has a time complexity of O(n/nb (n2b di + n2b dt )) = O(n(nb di + nb dt )). Generally, the computational complexity of the algorithm for each iteration is O((n(nb di + nb dt ))t), where nb , di , dt , k, t  n and t means the number of iterations required for the model training. This time complexity can be simplified to O(n), linearly correlated to the dataset size.

4

Experiments

To validate the usefulness of our UJSAH method, we have executed comprehensive experiments on MIRFlickr [7] and NUS-WIDE [1] datasets.

Unsupervised Joint-Semantics Autoencoder Hashing

325

Algorithm 1. Unsupervised Joint-Semantics Autoencoder Hashing Input: The training data: {Xv , Xt },max training epoch E min-batch size: , hash code length: c, balance parameters: α, β, γ, δ. Output: Parameters of the network: θv , θt . Procedure: Random initialization of the neural network parameters θm ; Extraction of image and text features from the dataset and construct similarity matrix; Repeat: 1: Select  image-text pairs from the dataset in turn for training; 2: Construct a similarity matrix S for the selected data according to Eq.2; 3: Compute Zv = f (Xv ; θv ), Zt = f (Xt ; θt ) for samples by forward-propagation; 4: Generate binary hash codes ¯v, Z ¯ t, X ¯ v, X ¯ t according to Eq.3, and Eq.4, Eq.5; 5: Compute Bv , Bt , Z 6: Calculate the loss L with the Eq.8; 7: Update the network parameter θv , θt , θpv , θpt , θdv , θdt by using backpropagation; Until convergent. Return: θv , θt .

4.1

Datasets

MIRFlickr [7] contains 20,015 instances of image and text pairs and its semantic information can be classified into 24 label classes. The dataset is split into 18015 training data pairs and 2000 test data pairs, and we utilize all the available data for our experiments. NUS-WIDE [1] contains 270k image and text instance pairs, and this experiment selects 186,577 instance pairs from 10 of these labeled categories and 1867 image and text pairs as queries. 4.2

Baselines and Evaluation Metric

In our experiments, we experimentally analyze the proposed UJSAH method with advanced unsupervised cross-modal hash retrieval approaches, including CVH [8], IMH [16], LCMH [27], CMFH [2], LSSH [26], DBRC [6], RFDH [20], DJRH [18], AGCH [24], and DUCH [13]. All of these methods are evaluated for cross-modal retrieval, which includes retrieval of textual data by visual image information (Image-to-Text) and retrieval of visual image data by textual data (Text-to-Image). The evaluation metrics employed to estimate the retrieval accuracy of our UJSAH method and the baselines are mean average precision (mAP) and top-K accuracy. In our experiments, we set K = 50 as the value for top-K accuracy. 4.3

Implementation Detail

For the proposed UJSAH method, the parameters α, β, γ, and δ are used to balance the weights of different data items. In our experiments, when we set {α = 0.08, β = 18, γ = 200, δ = 0.12}, {α = 1, β = 5, γ = 200, δ = 1.2} for MIRFlickr and NUS-WIDE datasets respectively. The network architecture is

326

Y. Chen et al. Table 1. The mAP results for all methods on two datasets.

Method

I→T T→I MIRFlickr-25K NUS-WIDE MIRFlickr-25K NUS-WIDE 16bits 32bits 64bits 128bits 16bits 32bits 64bits 128bits 16bits 32bits 64bits 128bits 16bits 32bits 64bits 128bits

CVH IMH LCMH CMFH LSSH DBRC RFDH UDCMH DJSRH AGCH DUCH UJSAH

0.606 0.612 0.559 0.621 0.584 0.617 0.632 0.689 0.810 0.865 0.850 0.884

0.599 0.601 0.569 0.624 0.599 0.619 0.636 0.698 0.843 0.887 0.863 0.913

0.596 0.592 0.585 0.625 0.602 0.620 0.641 0.714 0.862 0.892 0.873 0.927

0.598 0.579 0.593 0.627 0.614 0.621 0.652 0.717 0.876 0.912 0.893 0.936

0.372 0.470 0.354 0.455 0.481 0.424 0.488 0.511 0.724 0.809 0.753 0.812

0.362 0.473 0.361 0.459 0.489 0.459 0.492 0.519 0.773 0.830 0.775 0.835

0.406 0.476 0.389 0.465 0.507 0.447 0.494 0.524 0.798 0.831 0.814 0.858

0.390 0.459 0.383 0.467 0.507 0.447 0.508 0.558 0.817 0.852 0.827 0.867

0.591 0.603 0.561 0.642 0.637 0.618 0.681 0.692 0.786 0.829 0.826 0.853

0.583 0.595 0.569 0.662 0.659 0.626 0.693 0.704 0.822 0.849 0.855 0.879

0.576 0.589 0.582 0.676 0.659 0.626 0.698 0.718 0.835 0.852 0.864 0.881

0.576 0.580 0.582 0.685 0.672 0.628 0.702 0.733 0.847 0.880 0.877 0.893

0.401 0.478 0.376 0.529 0.577 0.455 0.612 0.637 0.712 0.769 0.726 0.765

0.384 0.483 0.387 0.577 0.617 0.459 0.641 0.653 0.744 0.780 0.758 0.790

0.442 0.472 0.408 0.614 0.642 0.468 0.658 0.695 0.771 0.798 0.781 0.803

0.432 0.462 0.419 0.645 0.663 0.473 0.680 0.716 0.789 0.802 0.795 0.813

designed as follows: The image encoder (dv → 4096 → relu → k → tanh), the text encoder (dt → 2048 → relu → k → tanh), the dual prediction network (k → 1024 → relu → k → tanh), the image decoder (k → 4096 → relu → 4096 → relu → dv → relu), the text decoder (k → 2048 → relu → 2048 → relu → dt → relu). We conducted all experiments with the same experimental setting to ensure validity and accuracy. 4.4

Retrieval Accuracy Comparison

In this subsection, Table 1 manifests the mAP scores of our UJSAH compared to baselines in the “Image-to-Text (I→T)” and “Text-to-Image (T→I)” retrieval studies, with hash code lengths ranging from 16 to 128 bits. By analyzing Table 1, we can obtain the following conclusions: 1) Our UJSAH method reaches a satisfactory result compared to baselines with various hash code lengths and verifies the validity of the method. In particular, on the I→T task, the mean mAP scores of the proposed UJSAH are 2.9% and 1.5% higher compared to the AGCH in the MIRFlickr and NUSWIDE datasets, respectively. On the T→I task, the mean mAP scores of the proposed UJSAH are 2.3% and 1.1% higher compared to the second highest baseline in the MIRFlickr and NUS-WIDE datasets. We propose that UJSAH outperforms baseline methods in all datasets in the cross-modal retrieval. 2) Data analysis indicates that the performance of all baseline methods shows significant improvement as the hash code length increases. Longer hash codes have the potential to capture and represent richer semantic information. However, it is essential to note that some baseline methods may experience a degradation in retrieval performance as the hash code length increases. This can be attributed to adding redundant information and introducing potential noise in more extended hash codes. The top-K precision curves for a hash code length of 128 bits on the two datasets are depicted in Fig. 2. Experimental results show that the UJSAH

Unsupervised Joint-Semantics Autoencoder Hashing

327

Fig. 2. The top-K precision cures of UJSAH with 128-bit on two datasets.

method outperforms the baseline hashing method at various return numbers. The experimental analysis further demonstrates the effectiveness and excellence of UJSAH method in the field of unsupervised large-scale multimedia information retrieval. 4.5

Parameter Sensitivity Analysis

Figure 3 shows the sensitivity analysis of parameters α, β, γ, and δ set from 1e−5 to 1e4 , and parameters μ and η set from 0.1 to 1.0. 1) From Fig. 3 (a) and (b), it can be seen that parameter a is set from 0.01 to 100, and β can achieve good results regardless of the value of the model. 2) From the analysis of Fig. 3 (c) and (d), we can have the conclusion that the parameter γ is less than 1e4 , and parameter δ is set at any value, the model effect is very stable, and the retrieval results are very satisfactory. 3) From the analysis of Fig. 3 (e) and (f), it can be concluded that the I2T of the model is greater than 0.9 and T2I is greater than 0.75 for any value of parameters μ, η. The comprehensive performance of the UJSAH method is stable and not sensitive to the changes of parameters μ, η. 4.6

Ablation Experiments

To assess the effectiveness of each component and confirm their usefulness, we conduct ablation experiments by designing various variants for each component. UJSAH -1: We modify the similarity matrix S as S = μSv + (1 − μ)St , using the traditional similarity matrix fusion as semantic constraint information, and have verified the validity of the joint-semantics similarity matrix.

328

Y. Chen et al.

Fig. 3. The effects of the parameters with 128-bit on MIRFlickr-25K. Table 2. The ablation experiments for different variants of UJSAH on MIRFlickr-25K.

Method

I→T T→I 16 bits 32 bits 64 bits 128 bits 16 bits 32 bits 64 bits 128 bits

UJSAH 0.884 UJSAH-1 0.763 UJSAH-2 0.695

0.913 0.880 0.844

0.927 0.909 0.922

0.936 0.920 0.934

0.853 0.785 0.690

0.879 0.842 0.798

0.881 0.865 0.875

0.893 0.881 0.886

UJSAH -2: To verify the effectiveness of our designed dual prediction autoen¯ m ) of the decoding module in UJSAH framework coder, we modify the input (Z m to input (Z ) in the traditional way to experiment the importance of dual prediction network. As the analysis in Table 2 can be concluded, the joint-semantics similarity matrix and dual prediction autoencoder in the proposed UJSAH model can effectively improve the retrieval accuracy.

5

Conclusion

In this paper, we present a Unsupervised Joint-Semantics Autoencoder Hashingmethod for multimedia retrieval. The generated hash codes are guaranteed to retain more information about the similarity of the raw data by autoencoder. The design of the joint-semantics similarity matrix achieves efficient mining of multimedia data similarity matrix by using a mixture of similarity matrix within each modality and the cross-modal similarity matrix construction proposed in this paper. An autoencoder established on a dual prediction network is proposed to realize the association of semantic information of cross-modal data by

Unsupervised Joint-Semantics Autoencoder Hashing

329

converting hash codes of different modal data to each other. Finally, we execute extensive experiments on the widely used datasets to prove the significance and sophistication of the UJSAH method. In future research, we plan to study fine-grained correlations, capture more complex relationships between different modalities in multimodal data, and extend the proposed architecture and similarity relation mining to shallow hash models. Acknowledgements. This work is supported in part by the National Natural Science Foundation of China under the Grant No.62202501 and No.U2003208, in part by the National Key R&D Program of China under Grant No.2021YFB3900 902 and in part by the Science and Technology Plan of Hunan Province under Grant No.2022JJ40638.

References 1. Chua, T.S., Tang, J., Hong, R., Li, H., Luo, Z., Zheng, Y.: Nus-wide: a real-world web image database from national university of Singapore. In: Proceedings of the ACM International Conference on Image And Video Retrieval, pp. 1–9 (2009) 2. Ding, G., Guo, Y., Zhou, J.: Collective matrix factorization hashing for multimodal data. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2075–2082 (2014) 3. Fan, L., Ng, K.W., Ju, C., Zhang, T., Chan, C.S.: Deep polarized network for supervised learning of accurate binary hashing codes. In: IJCAI, pp. 825–831 (2020) 4. Fan, W., Zhang, C., Li, H., Jia, X., Wang, G.: Three-stage semisupervised crossmodal hashing with pairwise relations exploitation. IEEE Trans. Neural Netw. Learn. Syst. Early Access, 1–14 (2023) 5. Guo, J., Guan, X.: Deep adversarial cascaded hashing for cross-modal vessel image retrieval. IEEE J. Select. Topics Appl. Earth Observ. Remote Sens. 16, 2205–2220 (2023) 6. Di, H., Nie, F., Li, X.: Deep binary reconstruction for cross-modal hashing. IEEE Trans. Multimedia 21(4), 973–985 (2018) 7. Huiskes, M.J., Lew, M.S.: The mir flickr retrieval evaluation. In: Proceedings of the 1st ACM International Conference on Multimedia Information Retrieval, pp. 39–43 (2008) 8. Kumar, S., Udupa, R.: Learning hash functions for cross-view similarity search. In: Twenty-Second International Joint Conference on Artificial Intelligence (2011) 9. Li, Q., Sun, Z., He, R., Tan, T.: A general framework for deep supervised discrete hashing. Int. J. Comput. Vision 128(8), 2204–2222 (2020) 10. Li, Y., Ge, M., Li, M., Li, T., Xiang, S.: Clip-based adaptive graph attention network for large-scale unsupervised multi-modal hashing retrieval. Sensors 23(7), 3439 (2023) 11. Liu, H., Wang, R., Shan, S., Chen, X. Deep supervised hashing for fast image retrieval. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2064–2072 (2016) 12. Liu, X., Zeng, H., Shi, Y., Zhu, J., Hsia, C.H., Ma, K.K.: Deep cross-modal hashing based on semantic consistent ranking. IEEE Trans. Multimed. Early Access, 1–12 (2023) 13. Mikriukov, G., Ravanbakhsh, M., Demir, B.: Deep unsupervised contrastive hashing for large-scale cross-modal text-image retrieval in remote sensing. arXiv preprint arXiv:2201.08125 (2022)

330

Y. Chen et al.

14. Shi, Y., Nie, X., Liu, X., Zou, L., Yin, Y.: Supervised adaptive similarity matrix hashing. IEEE Trans. Image Process. 31, 2755–2766 (2022) 15. Shi, Y., et al.: Deep adaptively-enhanced hashing with discriminative similarity guidance for unsupervised cross-modal retrieval. IEEE Trans. Circuits Syst. Video Technol. 32(10), 7255–7268 (2022) 16. Song, J., Yang, Y., Yang, Y., Huang, Z., Shen, H.T.: Inter-media hashing for largescale retrieval from heterogeneous data sources. In Proceedings of the 2013 ACM SIGMOD International Conference on Management of Data, pp. 785–796 (2013) 17. Su, H., Han, M., Liang, J., Liang, J., Yu, S.: Deep supervised hashing with hard example pairs optimization for image retrieval. Vis. Comput. 39, 5405–5420 (2022) 18. Su, S., Zhong, Z., Zhang, C.: Deep joint-semantics reconstructing hashing for largescale unsupervised cross-modal retrieval. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 3027–3035 (2019) 19. Tu, R.C., et al.: Unsupervised cross-modal hashing with modality-interaction. IEEE Trans. Circ. Syst. Video Technol. (2023) 20. Wang, D., Wang, Q., Gao, X.: Robust and flexible discrete hashing for cross-modal similarity search. IEEE Trans. Circuits Syst. Video Technol. 28(10), 2703–2715 (2017) 21. Zeng, X., Xu, K., Xie, Y.: Pseudo-label driven deep hashing for unsupervised crossmodal retrieval. Int. J. Mach. Learn. Cybern. 11, 1–20 (2023) 22. Zhan, Y.W., Luo, X., Wang, Y., Xu, X.S.: Supervised hierarchical deep hashing for cross-modal retrieval. In: Proceedings of the 28th ACM International Conference on Multimedia, pp. 3386–3394 (2020) 23. Zhang, P.-F., Bai, G., Yin, H., Huang, Z.: Proactive privacy-preserving learning for cross-modal retrieval. ACM Trans. Inform. Syst. 41(2), 1–23 (2023) 24. Zhang, P.-F., Li, Y., Huang, Z., Xin-Shun, X.: Aggregation-based graph convolutional hashing for unsupervised cross-modal retrieval. IEEE Trans. Multimedia 24, 466–479 (2021) 25. Zhang, Q., Hu, L., Cao, L., Shi, C., Wang, S., Liu, D.D.: A probabilistic code balance constraint with compactness and informativeness enhancement for deep supervised hashing. In: Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence. International Joint Conferences on Artificial Intelligence Organization (2022) 26. Zhou, J., Ding, G., Guo, Y.: Latent semantic sparse hashing for cross-modal similarity search. In: Proceedings of the 37th International ACM SIGIR Conference on Research Development in Information Retrieval, pp. 415–424 (2014) 27. Zhu, X., Huang, Z., Shen, H.T., Zhao, X.: Linear cross-modal hashing for efficient multimedia search. In: Proceedings of the 21st ACM International Conference on Multimedia, pp. 143–152 (2013)

TKGR-RHETNE: A New Temporal Knowledge Graph Reasoning Model via Jointly Modeling Relevant Historical Event and Temporal Neighborhood Event Context Jinze Sun1 , Yongpan Sheng1,2(B) , Ling Zhan1 , and Lirong He3 1

College of Computer and Information Science, Southwest University, Chongqing 400715, China [email protected], [email protected], [email protected] 2 School of Big Data & Software Engineering, Chongqing University, Chongqing 401331, China 3 School of Information Science and Engineering, Chongqing Jiaotong University, Chongqing 400074, China [email protected]

Abstract. Temporal knowledge graph reasoning (TKGR) has been of great interest for its role in enriching the naturally incomplete temporal knowledge graph (TKG) by uncovering new events from existing ones with temporal information. At present, the majority of existing TKGR methods have attained commendable performance. Nevertheless, they still suffer from several problems, specifically their limited ability to adeptly capture intricate long-term event dependencies within the context of pertinent historical events, as well as to address the occurrence of an event with insufficient historical information or be influenced by other events. To alleviate such issues, we propose a novel TKGR method named TKGR-RHETNE, which jointly models the context of relevant historical events and temporal neighborhood events. In terms of the historical event view, we introduce an encoder based on the transformer Hawkes process and self-attention mechanism to effectively capture long-term event dependencies, thus modeling the event evolution process continuously. In terms of the neighborhood event view, we propose a neighborhood aggregator to model the potential influence between events with insufficient historical information and other events, which is implemented by integrating the random walk strategy with the TKG topological structure. Comprehensive experiments on five benchmark datasets demonstrate the superior performance of our proposed model (Code is publicly available at https://github.com/wanwano/TKGR-RHETNE).

1 Introduction Knowledge graphs (KGs) provide significant real-world factual information using a multi-relational graph structure and have achieved remarkable promise in NLP or knowledge engineering-aware tasks. However, factual knowledge in real is continuously evolving, in the form of event knowledge, which has led to the development c The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2024  B. Luo et al. (Eds.): ICONIP 2023, LNCS 14451, pp. 331–343, 2024. https://doi.org/10.1007/978-981-99-8073-4_26

332

J. Sun et al.

and application of temporal knowledge graphs (TKGs). TKG offers fresh perspectives and valuable insights for many time-sensitive downstream applications, including timesensitive semantic search [1], policymaking [4], and stock prediction [5]. With the advancement of the TKG, more and more researchers are starting to pay their attention to the TKG reasoning (TKGR) task to infer new event knowledge based on existing ones for filling in natural incomplete TKG and checking these knowledge consistencies. On the benefit of their representation learning ability, various learningbased methods associate temporal information with the facts, undoubtedly, this severely limits the ability to express temporal information. While several existing embeddingbased methods [9, 18, 23] attempt to incorporate temporal point processes such as neural Hawkes process into their models, being capable of modeling the graph-wide link formation process in a temporal graph, such as Know-Evolve [22] models the occurrence of a fact as a multi-dimensional temporal point process for calculation in a multirelation space. Mei et al. [18] present extensions to the multivariate Hawkes process using continuous-time LSTM to model the asynchronous event sequence. DyRep [23] presents an inductive framework based on the temporal point process, it addresses a different problem setting of a two-time scale with both association and communication events. As an event consequence, these methods cannot fully capture the continuous event evolving dynamics and prioritize different parts of the consequence, especially for complicated long-term event dependencies in the relevant historical event context, which limits the model performance. Meanwhile, there has been a growing number of works on graph neural networks (GNNs) to explore the intrinsic topology relevance between entities or between entities and relations in TKG, so as to obtain high-quality embeddings, such as RE-NET [11] uses a multi-relational graph aggregator to capture the graph structural information. RE-GCN [15] employs RGCN to aggregate messages from neighboring entities and utilizes an auto-regressive GRU to model the temporal dependency among events. CRNet [26] leverages concurrent events from both historical and the future for TKGR. TGAP [12] explores query-relevant substructure in the TKG for path-based inference. These methods achieved good performance to known events in historical, recurrence, or periodicity dependency of relevant historical events to forecast events happening in the future. However, some events may have insufficient historical information and even be affected by other events during the whole timeline. As a result, it is a challenge to effectively model the hidden impact between these types of events and other events on TKG embedding learning. The temporal neighborhood event context naturally offers a viable solution for enhancing the comprehension of event occurrences in temporal graph learning models addressing the aforementioned problem. Based on the aforementioned analysis, we should make use of both relevant historical events and temporal neighborhood event contexts. To obtain information from context adequately, in this paper, we proposed a new temporal knowledge graph reasoning model via jointly modeling relevant historical event and temporal neighborhood event context, (short-handed as TKGR-RHETNE). 
It contains an encoder and an aggregator to capture both the event evolution process and latent factors influencing the occurrence of an event in the relevant historical and temporal neighborhood context of events in the TKG, respectively. Finally, the model uses the above two parts of information to calcu-

TKGR-RHETNE

333

late the conditional intensity and obtains the candidate entity score or time probability density of the next event. In summary, our contributions are 3-fold as follows. – We propose a new temporal knowledge graph reasoning model, called TKGRRHETNE, which jointly models the relevant historical event and temporal neighborhood event context of events in the temporal knowledge graph. – In the framework of the TKGR-RHETNE, we propose an encoder built on the transformer Hawkes process and self-attention mechanism to capture complex long-term event dependencies in relevant historical event context, thereby enabling continuous modeling of event evolution. We also present an aggregator that integrates the random walk strategy for modeling the potential influence of events with insufficient historical formation or are influenced by other events within their temporal neighborhood event context. The random walk strategy encompasses both the breadth-first random walk search, which aims to capture homophily information, and a depth-first search, designed to collect heterophily information. – We carry out extensive experiments on five benchmark datasets to demonstrate the effectiveness of the TKGR-RHTNE in entity and timestamp forecasting tasks.

2 Related Work 2.1 Neural Hawkes Process-Based Method Since the Hawkes process has been proven to model the exciting effects between events to capture the influence of historical events holistically, it is well suited for modeling the graph-wide link formation process in a temporal graph [22]. Neural Hawkes process [20] uses neural networks to parameterize the intensity function and exhibits high model capacity in complex real-life scenarios, e.g., Mei et al. [18] develop a neural Hawkes process based on continuous-time LSTM to model the asynchronous event sequence such that previous events are allowed to influence future events in complex and realistic ways. Know-Evolve [22] models the occurrence of a fact as a multidimensional temporal point process, it mainly learns non-linearly evolving entity representations by considering their interactions with other entities in multi-relational space, where the conditional intensity function of the process is adjusted based on the relationship score of the event. Zuo and Zhang et al. [28, 30] apply the attention mechanism to the neural Hawkes process. However, these approaches fail to capture complicated dependencies well in the long-term event sequences. 2.2 Graph Neural Network-Based Method Graph Neural Network (GNN)-based methods generally apply GNN to explore the intrinsic topology relevance between entities or between entities and relations in TKG. CyGNet [29], RE-NET [11], and RE-GCN [15] attempt to solve the entity prediction in the TKG from the perspective of each given query, which encodes the historical facts related to the subject entity in each query. These methods achieved good performance in entity prediction, but can not have the capacity of predicting timestamps in the

334

J. Sun et al.

TKG. Some researchers employ temporal random walks to capture the continuous-time network dynamics, including CTDNE [19] based on time-respect random walks, and CAW-N [16] based on causal anonymous walks, however, they cannot collect homogeneous and heterogeneous information from the entire dynamic graph, which is implicitly subject to the homophily assumption.

3 The Proposed Model The overall architecture of our proposed model is illustrated in Fig. 1. In the following, we provide a detailed introduction to three components and elaborate on the steps within each component.

Fig. 1. The overall architecture of our proposed model.

3.1

Encoder for Relevant Historical Event Context

A TKG G can be formalized as a sequence of graph slices {G1 , G2 , . . . ., GT }, where Gt = {(es , ep , eo , t) ∈ G} denotes a graph slice that consists of facts that occurred at the timestamp t, i.e., events. Here, es and eo represent the subject and object entities, respectively, and ep denotes the predicate as a relation type. Taking a query (esi , epi , ?, ti ) in the TKG for an example, we assume an object forming relationship epi with a given subject esi at a timestamp ti depends in part on past events involving esi and epi , this historical event encoder regards the past events and time sequence th,sp : and timestamps as relevant historical sequence eh,sp i i ⎧ ⎫ ⎧ ⎫ ⎨  ⎬ ⎨  ⎬ h,sp eh,sp = O (e , e ) , t = t , (1) t s p j j i i i ⎩ ⎭ i ⎩ ⎭ 0≤tj 1 and q < 1, the random walk tends to generate a DNS. Step 2. Aggregate event sequence information. Unlike the existing GNN-based methods that treat all neighboring entity nodes as an unordered set, the event sequence retains the ordered connections between nodes. Therefore, when aggregating information across all entity nodes in the sequence, we also take into account the order of the

TKGR-RHETNE

337

entity nodes to preserve more relationship information between connected entities. The aggregation operation can be formulated as: hS = GRU θ ({he1 , he2 , . . . , heK }) ,

(7)

(l)

where hS ∈ Rd is the embedding vector of sequence S, hek ∈ Rd is the feature of node ek and θ represents all the learnable parameters in our message function GRU [3]. Step 3. Combine event sequence embeddings. Different sequences may have varying impacts on the target entity, therefore, we utilize an attention mechanism to determine the significance of each sequence embedding based on its relevance to the target entity. (8) hA = M ultiHeadAttentionθ (hS ) . We use concatenation instead of sum to merge the sequential information embeddings from different strategies and feed them into a linear layer to obtain hes ∈ Rd . Concatenation helps preserve unique information for each strategy. Step 4. Obtain the final embeddings. Concatenating the representation of the historical information of the previous subsection with the representation of the neighborhood yields the final embedding used for the intensity function: ei = (h (ti ) ⊕ hes ) .

(9)

3.3 Conditional Intensity After generating final embedding of ei at time ti , we compute three parameters of the intensity function based on the esi , epi and historical and neighbor hidden embedding ei : μi = GELU ((ei ⊕ esi ⊕ epi ) Wμ ) ηi = GELU ((ei ⊕ esi ⊕ epi ) Wη ) , (10) γi = Softplus ((ei ⊕ esi ⊕ epi ) Wγ ) where ηi represents the initial intensity at time ti , and μi represents the intensity when time tends to infinity. (ηi − μi ) represents the rate at which the intensity changes, which can be both positive and negative. The function GELU represents the Gaussian Error Linear Unit for nonlinear activations. The Softplus is used for the decaying parameter since γ needs to be constrained to strictly positive values. Finally, we express the intensity function of an object eoi forming relationship epi with a given subject esi at a timestamp ti , as follows: λ(esi , epi , ti ) = Softplus ((μi + (ηi − μi ) exp (−γi (t − ti ))) · eo ) ,

(11)

for t ∈ (ti , ti+1 ], where the Softplus is employed to constrain the intensity function to be positive.

338

3.4

J. Sun et al.

Model Training and Prediction

Entity Prediction Task. Because all object candidates share the same survival term λ (esi , epi , t) and the same value of tL , we can directly compare their intensity function λ (esi , epi , ti ). We use the cross-entropy loss for learning the entity prediction: Lentity = −

Ne N  

yc log (p (eoi = c | esi , epi , ti )) .

(12)

i=1 c=1

Time Prediction Task. We use the mean square error as the timestamp prediction loss: Ltime =

N  

ti − tˆi

2

.

(13)

i=1

Joint Learning. Different types of loss functions are linearly combined to jointly learn two tasks in an end-to-end manner: L = λe Le + λt Lt + λl ||Θ||22 ,

(14)

where λe , λt , and λl are weights that balance the importance of two losses. Θ contains all model parameters.

4 Experiments and Analysis 4.1

Experimental Settings and Implementation Details

Datasets. We evaluate our model and baselines on five datasets, including ICEWS14 [6], ICEWS18 [11], ICEWS05-15 [6], YAGO [17] and WIKI [14]. To ensure a fair comparison, we use the split manner provided by Sun et al. [21]. Baselines. For the entity prediction task, we compare our model with three categories of KG embedding models: (1) static KG embedding models, including TransE [2]1 , DistMult [27]1 , and ComplEx [24]1 . We apply these models in static KGs that ignore timestamp information. (2) TKGR models, including TTransE [14]2 , TA-DistMult [6], DE-SimplE [7], TNTComplEx [13], RE-NET [11], CyGNet [29], RE-GCN [15] and TANGO [8]. (3) TKGR models with Hawkes process are most related to our model, including GHNN [9] and GHT [21]. For the timestamp prediction task, we compare our model with one temporal point process model for TKGR, i.e., GHNN [9]. Evaluation Metrics. The mean reciprocal rank (MRR) and Hits@k are standard metrics for the entity prediction task. MRR is the average reciprocal of the correct query answer rank. Hits@k indicates the proportion of correct answers among the top k candidates. For the timestamp prediction task, we use the mean absolute error (MAE) between the predicted timestamp and ground truth timestamp as the metric. 1 2

The code for TransE, DistMult, and ComplEx is from https://github.com/thunlp/OpenKE. The code for TTransE is from https://github.com/INK-USC/RE-Net/tree/master/baselines.

TKGR-RHETNE

339

Details of Baselines. We directly adopt the results of static KG embedding models, interpolated reasoning models, and CyGNet reported in earlier works [8]. For TANGO3 , RE-NET4 , RE-GCN5 , GHNN6 , and GHT7 , we use their released source codes with default hyperparameters. We rerun these models on four benchmark datasets. To ensure fairness, for RE-GCN, we don’t use the module that encodes the node type information. Details of TKGR-RHETNE. We set the dimension of all embeddings and hidden states to 200. The relevant historical sequence length for the encoder is limited to 10, and the attention head number is set to 3. For the aggregator for temporal neighborhood event context, the number and length of sequences for each strategy are 8 and 6, respectively, the GRU layer number is 1, and the attention head number is 2. In total, we train our model for 200 epochs and use the Adam optimizer with parameters, with a learning rate of 0.003 and a weight decay of 1e-5. The batch size is set to 256. 4.2 Experimental Results and Discussion Entity Prediction. Table 1 and Table 2 report the experimental results of the entity prediction task on five TKGR datasets. We highlight the best results in bold. On the three ICEWS datasets, static KG embedding methods fell far behind our model due to their inability to capture temporal dynamics. Our model is superior to the heuristic evolutionary methods of RE-NET and RE-GCN in predicting events because their learning effect in the later time is highly dependent on the learning effect in the earlier time and will forget the previous knowledge in the process of evolution. By contrast, we introduce an encoder based on the transformer Hawkes process when encoding relevant historical event context, and it can learn the influence of the current event from the event occurring at any distance from the current time in parallel, which helps the model capture complex long-term event dependencies for modeling the event evolution continuously. Unlike existing GNN-based TKGR models, which often imply homogeneity assumptions, our model considers different aggregation strategies to collect latent factors influencing the occurrence of an event in temporal neighborhood events, which is the reason why it performs better than them. On the YAGO and WIKI datasets, it can be observed that there is a significant improvement in the performance of our model, with a boost of 13.85% and 7.47% on Hits@1. The reason for this phenomenon is that for WIKI and YAGO, a large number of quadruples contain unseen entities in the test set. This prevents many models from learning from historical information when inferencing entities, but the encoder for temporal neighborhood event context with different aggregation strategies in our model can effectively alleviate this problem, and the probability of selecting irrelevant entities is greatly reduced.

3 4 5 6 7

The code for TANGO is from https://github.com/TemporalKGTeam/TANGO. The code for RE-NET is from https://github.com/INK-USC/RE-Net. The code for RE-GCN is from https://github.com/Lee-zix/RE-GCN. The code for GHNN is from https://github.com/Jeff20100601/GHNN clean. The code for GHT is from https://github.com/JHL-HUST/GHT.

340

J. Sun et al.

Table 1. Experimental results of entity prediction on ICEWS-style dataset. The best result in each column is boldfaced. Models

ICEWS14 ICEWS18 ICEWS05-15 MRR Hits@1 Hits@3 Hits@10 MRR Hits@1 Hits@3 Hits@10 MRR Hits@1 Hits@3 Hits@10

TransE Distmult ComplEx

22.48 27.67 30.84

13.36 18.16 21.51

25.63 31.15 34.48

41.23 46.96 49.59

12.24 10.17 21.01

5.84 4.52 11.87

12.81 10.33 23.47

25.10 21.25 39.97

22.55 28.73 31.69

13.05 19.33 21.44

25.61 32.19 35.74

42.05 47.54 52.04

TTransE TA-DistMult DE-SimplE TNTComplEx CyGNet TANGO-TuckER TANGO-Distmult RE-NET RE-GCN

13.43 26.47 32.67 32.12 32.73 26.25 24.70 35.71 36.08

3.11 17.09 24.43 23.35 23.69 17.30 16.36 26.79 27.30

17.32 30.22 35.69 36.03 36.31 29.07 27.26 39.57 39.89

34.55 45.41 49.11 49.13 50.67 44.18 41.35 52.81 53.01

8.31 16.75 19.30 21.23 24.93 28.97 27.56 26.88 27.34

1.92 8.61 11.53 13.28 15.90 19.51 18.68 17.70 18.44

8.56 18.41 21,86 24.02 28.28 32.61 30.86 30.35 30.48

21.89 33.59 34.80 36.91 42.61 47.51 44.94 45.04 44.85

15.71 24.31 35.02 27.54 34.97 42.86 40.71 39.11 39.89

5.00 14.58 25.91 19.52 25.67 32.73 31.23 29.04 30.10

19.72 27.92 38.99 30.80 39.09 48.14 45.33 44.10 44.63

38.02 44.21 52.75 42.86 52.94 62.34 58.95 58.90 56.98

GHNN GHT

28.68 37.40

19.74 27.77

31.64 41.66

46.47 56.19

28.12 27.40

19.05 18.08

31.62 30.76

45.72 45.76

41.41 41.50

32.48 30.79

45.63 46.85

58.55 62.73

TKGR-RHETNE 37.50

28.23

41.95

55.09

29.26

19.80

33.03

47.66

43.30

32.48

48.08

64.83

Table 2. Experimental results of entity prediction on YAGO and WIKI datasets. The best result in each column is boldfaced.

Models

YAGO WIKI MRR Hits@1 Hits@3 Hits@10 MRR Hits@1 Hits@3 Hits@10

TransE Distmult ComplEx

48.97 54.84 61.29

36.19 47.39 54.88

62.45 59.81 62.28

66.05 68.52 66.82

46.68 49.66 47.84

46.23 46.17 38.15

49.71 52.81 50.08

51.71 54.13 51.39

TTransE TA-DistMult DE-SimplE TNTComplEx CyGNet TANGO-TuckER TANGO-Distmult RE-NET RE-GCN

31.19 54.92 54.91 57.98 52.07 62.50 63.34 58.02 58.27

18.12 48.15 51.64 52.92 45.36 58.77 60.04 53.06 59.98

40.91 59.61 57.30 61.33 56.12 64.73 65.19 61.08 65.62

51.21 66.71 60.17 66.69 63.77 68.63 68.79 66.29 75.94

29.27 44.53 45.43 45.03 33.89 51.60 53.04 49.66 39.84

21.67 39.92 42.60 40.04 29.01 49.61 51.52 46.88 39.82

34.43 48.73 47.71 49.31 36.10 52.45 53.84 51.19 44.43

42.39 51.71 49.55 52.03 41.86 54.87 55.46 53.48 53.88

GHNN GHT

62.79 64.60

60.29 57.67

63.93 69.39

66.85 76.95

58.85

55.22

61.53

64.72

TKGR-RHETNE 79.19

74.14

83.71

85.76

66.17

62.69

68.03

72.22

TKGR-RHETNE

341

Timestamp Prediction. In the timestamp prediction task, our model exhibits mean absolute errors of 1.76, 14.88, and 4.88 on the ICEWS14, ICEWS18, and ICEWS0515 datasets, respectively. These results surpass those of GHNN by margins of 4.33, 6.67, and 2.86, underscoring the enhanced capabilities of our model, which leverages the transformer for constructing the Hawkes process. Conversely, GHNN employs a continuous-time LSTM to gauge the intensity function. However, this approach neglects to directly integrate timestamp information and falls short in capturing intricate longterm relationships. Consequently, our model consistently outshines GHNN in the timestamp prediction task. 4.3 Ablation Study In this subsection, we choose YAGO to investigate the effectiveness of different aggregation strategies for entity prediction in the aggregator for temporal neighborhood event context. Specifically, we separately experiment with the impact of using only the deep aggregation strategy, and the breadth aggregation strategy, and not using the aggregator for temporal neighborhood event context on the model performance. Table 3 shows the ablation results. As can be seen from the results in the table, using only the depth strategy to aggregate neighbor information results in a decrease of 3.07% on Hits@1. This is because the model lacks the capture of concurrent events in the temporal neighborhood event context or the potential impact of entities with similar behavior on the current entity. If only the breadth strategy is used to aggregate neighbor information, the model loses its ability to capture complex information for longer event sequences, resulting in a decrease of 1.68% in the results on Hits@1. Therefore, it is necessary to use both aggregation strategies to ensure the best model performance. Table 3. Ablation study of our model on YAGO dataset. Models

YAGO MRR Hits@1 Hits@3 Hits@10

TKGR-RHTNE-w/o - DNS & BNS 76.44 77.74 TKGR-RHTNE-w/o - DNS 76.28 TKGR-RHTNE-w/o - BNS TKGR-RHTNE

79.19

70.75 72.46 71.07

81.20 82.48 81.11

85.03 84.91 84.79

74.14

83.71

85.76

5 Conclusion In this paper, we introduce a novel TKGR model, named TKGR-RHETNE, considering the modeling in the context of relevant historical events and temporal neighborhood events. Utilizing an encoder built on the transformer Hawkes process and self-attention mechanism, this model can effectively grasp complex long-term event dependencies in the relevant historical event context, enabling continuous modeling of event evolution.

342

J. Sun et al.

Additionally, employing a neighborhood aggregator designed to integrate the random walk strategy within the TKG topological structure, the model can learn the potential influence between events with insufficient historical information and other events for modeling the temporal neighborhood event context. Our experimental results on five popular datasets demonstrate the superior performance of our model on both entity prediction and timestamp prediction tasks. Future work includes incorporating logical rules into the proposed model to enhance its interpretability in the TKGR procedure. Acknowledgements. This work was supported by the National Natural Science Foundation of China (Grant No. 62202075, No. 62171111, No. 62376058, No. 62376043, No. 62002052). the Natural Science Foundation of Chongqing, China (2022NSCQ-MSX3749), Sichuan Science and Technology Program (Grant No. 2022YFG0189), China Postdoctoral Science Foundation (Grant No. 2022M710614), Key Laboratory of Data Science and Smart Education, Hainan Normal University, Ministry of Education (Grant No. 2022NSCQ-MSX3749), Anhui Provincial Engineering Laboratory for Beidou Precision Agriculture Information (Grant No. BDSY2023004).

References 1. Barbosa, D., Wang, H., Yu, C.: Shallow information extraction for the knowledge web. In: ICDE, pp. 1264–1267 (2013) 2. Bordes, A., Usunier, N., Garcia-Duran, A., Weston, J., Yakhnenko, O.: Translating embeddings for modeling multi-relational data. In: NIPS, pp. 2787–2795 (2013) 3. Cho, K., et al.: Learning phrase representations using rnn encoder-decoder for statistical machine translation. arXiv preprint arXiv:1406.1078 (2014) 4. Deng, S., Rangwala, H., Ning, Y.: Dynamic knowledge graph based multi-event forecasting. In: SIGKDD, pp. 1585–1595 (2020) 5. Feng, F., He, X., Wang, X., Luo, C., Liu, Y., Chua, T.S.: Temporal relational ranking for stock prediction. ACM Trans. Inform. Syst. (TOIS) 37(2), 1–30 (2019) 6. Garc´ıa-Dur´an, A., Dumanˇci´c, S., Niepert, M.: Learning sequence encoders for temporal knowledge graph completion. arXiv preprint arXiv:1809.03202 (2018) 7. Goel, R., Kazemi, S.M., Brubaker, M., Poupart, P.: Diachronic embedding for temporal knowledge graph completion. In: AAAI, vol. 34, pp. 3988–3995 (2020) 8. Han, Z., Ding, Z., Ma, Y., Gu, Y., Tresp, V.: learning neural ordinary equations for forecasting future links on temporal knowledge graphs. In: EMNLP, pp. 8352–8364 (2021) 9. Han, Z., Ma, Y., Wang, Y., G¨unnemann, S., Tresp, V.: Graph hawkes neural network for forecasting on temporal knowledge graphs. arXiv preprint arXiv:2003.13432 (2020) 10. Jin, D., et al.: Raw-gnn: Random walk aggregation based graph neural network. arXiv preprint arXiv:2206.13953 (2022) 11. Jin, W., Qu, M., Jin, X., Ren, X.: Recurrent event network: Autoregressive structure inference over temporal knowledge graphs. arXiv preprint arXiv:1904.05530 (2019) 12. Jung, J., Jung, J., Kang, U.: Learning to walk across time for interpretable temporal knowledge graph completion. In: SIGKDD, pp. 786–795 (2021) 13. Lacroix, T., Obozinski, G., Usunier, N.: Tensor decompositions for temporal knowledge base completion. arXiv preprint arXiv:2004.04926 (2020) 14. Leblay, J., Chekol, M.W.: Deriving validity time in knowledge graph. In: The Web Conference, pp. 1771–1776 (2018) 15. Li, Z., et al.: Temporal knowledge graph reasoning based on evolutional representation learning. In: SIGIR, pp. 408–417 (2021)

TKGR-RHETNE

343

16. Liu, M., Liu, Y.: Inductive representation learning in temporal networks via mining neighborhood and community influences. In: SIGIR, pp. 2202–2206 (2021) 17. Mahdisoltani, F., Biega, J., Suchanek, F.: Yago3: a knowledge base from multilingual wikipedias. In: 7th Biennial Conference on Innovative Data Systems Research. CIDR Conference (2014) 18. Mei, H., Eisner, J.M.: The neural hawkes process: a neurally self-modulating multivariate point process. In: NeurIPS 30 (2017) 19. Nguyen, G.H., Lee, J.B., Rossi, R.A., Ahmed, N.K., Koh, E., Kim, S.: Continuous-time dynamic network embeddings. In: The Web Conference, pp. 969–976 (2018) 20. Shchur, O., T¨urkmen, A.C., Januschowski, T., G¨unnemann, S.: Neural temporal point processes: a review. arXiv preprint arXiv:2104.03528 (2021) 21. Sun, H., Geng, S., Zhong, J., Hu, H., He, K.: Graph hawkes transformer for extrapolated reasoning on temporal knowledge graphs. In: EMNLP, pp. 7481–7493 (2022) 22. Trivedi, R., Dai, H., Wang, Y., Song, L.: Know-evolve: deep temporal reasoning for dynamic knowledge graphs. In: ICML, pp. 3462–3471. PMLR (2017) 23. Trivedi, R., Farajtabar, M., Biswal, P., Zha, H.: Dyrep: learning representations over dynamic graphs. In: ICLR (2019) ´ Bouchard, G.: Complex embeddings for 24. Trouillon, T., Welbl, J., Riedel, S., Gaussier, E., simple link prediction. In: ICML, pp. 2071–2080 (2016) 25. Vaswani, A., et al.: Attention is all you need. In: NeurIPS, vol. 30 (2017) 26. Wang, S., Cai, X., Zhang, Y., Yuan, X.: CRNet: modeling concurrent events over temporal knowledge graph. In: Sattler, U., et al. (eds.) The Semantic Web – ISWC 2022: 21st International Semantic Web Conference, Virtual Event, October 23–27, 2022, Proceedings, pp. 516–533. Springer International Publishing, Cham (2022). https://doi.org/10.1007/9783-031-19433-7 30 27. Yang, B., Yih, W.t., He, X., Gao, J., Deng, L.: Embedding entities and relations for learning and inference in knowledge bases. https://arxiv.org/abs/1412.6575 (2014) 28. Zhang, Q., Lipani, A., Kirnap, O., Yilmaz, E.: Self-attentive hawkes process. In: ICML, pp. 11183–11193. PMLR (2020) 29. Zhu, C., Chen, M., Fan, C., Cheng, G., Zhang, Y.: Learning from history: modeling temporal knowledge graphs with sequential copy-generation networks. In: AAAI. vol. 35, pp. 4732– 4740 (2021) 30. Zuo, S., Jiang, H., Li, Z., Zhao, T., Zha, H.: Transformer hawkes process. In: ICML, pp. 11692–11702. PMLR (2020)

High-Resolution Self-attention with Fair Loss for Point Cloud Segmentation Qiyuan Liu1 , Jinzheng Lu2 , Qiang Li2(B) , and Bingsen Huang1 1

School of Computer Science and Technology, Southwest University of Science and Technology, Mianyang, China 2 School of Information Engineering, Southwest University of Science and Technology, Mianyang, China [email protected] Abstract. Applying deep learning techniques to analyze point cloud data has emerged as a prominent research direction. However, the insufficient spatial and feature information integration within point cloud and unbalanced classes in real-world datasets have hindered the advancement of research. Given the success of self-attention mechanisms in numerous domains, we apply the High-Resolution Self-Attention (HRSA) module as a plug-and-play solution for point cloud segmentation. The proposed HRSA module preserve high-resolution internal representations in both spatial and feature dimensions. Additionally, by affecting the gradient of dominant and weak classes, we introduce the Fair Loss to address the problem of unbalanced class distribution on a real-world dataset to improve the network’s inference capabilities. The introduced modules are seamlessly integrated into an MLP-based architecture tailored for largescale point cloud processing, resulting in a new segmentation network called PointHR. PointHR achieves impressive performance with mIoU scores of 69.8% and 74.5% on S3DIS Area-5 and 6-fold cross-validation. With a significantly smaller number of parameters, these performances make PointHR highly competitive in point cloud semantic segmentation. Keywords: 3D point cloud Loss function

1

· Semantic segmentation · Self-attention ·

Introduction

As the 3D data sensors have grown by leaps and bounds over the past few years, point cloud understanding tasks, especially semantic segmentation, have recently become a primary focus of computer vision research. Unlike the regular arrangement of pixels in images, the point cloud independently expresses each point’s spatial and feature information in the metric space without a significant connection [1]. Therefore, exploring advanced 3D point cloud processing strategies is urgent. PointNet [2], the first study directly handled the original point cloud, introduced MLPs and symmetric functions to process point clouds, while PointNeXt The original version of this chapter was revised: In the PDF version of this chapter, both Fig. 4 and Fig. 5 erroneously showed the same figure. A correction to this chapter can be found at https://doi.org/10.1007/978-981-99-8073-4_46 c The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2024, corrected publication 2024  B. Luo et al. (Eds.): ICONIP 2023, LNCS 14451, pp. 344–356, 2024. https://doi.org/10.1007/978-981-99-8073-4_27

High-Resolution Self-attention with Fair Loss for Point Cloud Segmentation

345

[3] improved upon PointNet’s [4] effectiveness and achieved top performance using diverse techniques. Moreover, plenty of networks [5–10] deliver remarkable results. However, these methods have shown limited capability in segmenting small objects, also fail to balance the concentration between global features and local features. Additionally, our empirical investigation reveals significant differences in the distribution of the six categories, representing distinct parts of rooms, in the Stanford Large-Scale 3D Indoor Spaces (S3DIS) dataset [11]. The categories of floor and ceiling contain a large number of points, whereas the other classes have a smaller number of points. This imbalanced class distribution poses substantial challenges during model training and evaluation, necessitating the use of an appropriate loss function to account for this problem. To handle the above problems, we introduce a novel segmentation network, PointHR. First, we design a plug-and-play module named High Resolution SelfAttention (HRSA) to optimize the spatial and feature branches extraction. To alleviate the potential loss of detailed information due to pooling and downsampling, the HRSA preserves high-resolution internal representations in both spatial and feature dimensions during attention computation. Each branch of HRSA can maintain complete spatial and feature information while compressing the other dimension efficiently. Besides, we introduce Fair Loss with two complementary factors to help the network better dynamically balance the weights between dominant and weak classes in datasets. The experimental results and ablation study on real-world dataset S3DIS [11] demonstrate that our method achieves substantial and consistent improvements compared to several baseline approaches. Notably, PointHR’s performance is almost on par with the state-ofthe-art (SOTA) method PointNeXt-XL [3], which has a much larger network size (41.6M Params., 84.8G FLOPS), whereas PointHR only requires a significantly smaller network size (8.5M Params., 15.7G FLOPS). In summary, our contributions are as follows: – We introduce a plug-and-play HRSA module to boost baselines, which preserves high-resolution internal representations in both spatial and feature channels without adding more parameters and computational complexity. – We design the improved loss function named Fair Loss by adding stabilization and enhancement factors to fine-tune sample weights for dominant and weak classes. – We conduct extensive experiments and ablation studies to demonstrate the performance boosted by the modification.

2

Related Work

Many pioneers have made bold attempts to analyze and process raw point cloud data. Currently, three existing 3D point cloud segmentation learning-based paradigms exist: intermediate representation-based methods, point-based methods, and Self-Attention or Transfomer-based methods.

346

Q. Liu et al.

Intermediate representation-based methods [12–16] mostly convert the point cloud into images or voxels for processing, which will inevitably lose detailed information on a point cloud structure. PointNet [2], a pioneer in learning features directly from point clouds, takes raw point data as input. Qi et al. [4] proposed PointNet++ with Farthest Point Sampling (FPS) module to sample the input point cloud and divide the areas. Based on PointNet++, PointNeXt [3] outperformed SOTA point-based method through data enhancement, optimization technology, and network structure improvement. To further improve the performance of the segmentation network, Guo et al. [8] presented a Transformer based architecture, which sharpens the attention weights and reduces the influence of noise [17]. Point Transformer [9] achieves more comparative results using a vector self-attention operator to aggregate local features. Stratified Transformer [10] processes long and short-range relationships among points. Nevertheless, the vast number of parameters and high model complexity hinder the development of transformer-based methods. In summary, existing networks inspire progress in point cloud segmentation but lose detailed spatial and feature information during encoder compression and decoder upsampling. Our study presents a solution by promoting the comprehensive integration of spatial and feature information while simultaneously optimizing the learning of under-segmented classes.

3 3.1

Method Segmentation Pipeline

As shown in the Fig. 1, the general pipeline for point cloud semantic segmentation involves utilizing DNN [18] to generate feature maps by learning point clouds’ position and feature information. Subsequently, the decoder conducts interpolation to increase the resolution of the feature maps, followed by assigning the predicted semantic labels to each point. The unordered point set P = {pi ∈ RN ×D , i = 1, . . . , N } contains N points, each with D features describing its properties. At each local area centered at point pi , we identify the K nearest points {pi,1 , pi,2 , . . . , pi,k } within a specified radius. To describe the integration of the subsampling layer, grouping layer, MLPs, and reduction layer, we define a function that maps the k nearest neighbors around a center point pi to a vector, as follows: K    σ(si,k , si , fi,k , fi ) , F = Γ MLP

(1)

k=1

where F is the resulting feature map. Γ is the reduction layer that integrates features in different local areas. The function of σ () consists of a sampling and grouping operation for efficient local feature extraction of point clouds. fi and fi,k are the features of point pi and its neighbor, while si and si,k denote their corresponding spatial coordinates.

High-Resolution Self-attention with Fair Loss for Point Cloud Segmentation

347

Fig. 1. The segmentation pipeline of our PointHR.

The resulting feature map Fin is inputted into the HRSA module, which adaptively captures the localized geometric and feature related attributes. HRSA generates a new feature map Fout with the exact dimensions. Section 3.2 provides comprehensive information about the HRSA module. We adopt a symmetric decoder similar to the one utilized in PointNet++ [4]. 3.2

High Resolution Self-attention Module

As illustrated in Fig. 2, the HRSA aims to dynamically create a strong correlation between spatial and feature information, and squeeze out the information of the two branches as much as possible. The HRSA module takes an input tensor Fin from the preceding layer to adaptively enhance or suppress features, which resembles the image capture process. Inspired by photography’s compensation for underexposed regions, we introduce HRSA, a novel self-attention mechanism that includes two parallel high-resolution feature-only and spatial-only branches. In general, HRSA: (i) fully preserves high-resolution information from a specific channel (e.g., spatial channel in this scenario) while folding all information from the other channel (i.e., feature channel), respectively, thus keeping the high internal resolution of this specific channel (i.e., spatial channel) and reducing information loss, (ii) assigns the channel-specific attention distribution to the entire scene, facilitating better representation in point cloud segmentation, (iii) utilizes a Sof tmax − Sigmoid combination on both channels to focus on a broader range of features and enable non-linear mapping. Formally, given an input point features Fin ∈ RN ×(s+d) , which can be partitioned into two sub-tensors Ff t and Fsp , as illustrated in Fig. 2. Subsequently, we pass Ff t and Fsp into corresponding branch. Feature-only branch:





Mf t = fsg WZ (Ff t WV ) × fsm (Ff t WQ )



 Fin ,

(2)

where WQ , WV , and WZ are 1*1 convolution layers. fsm () is a Sof tmax operator, fsg () is a Sigmoid function, × is the matrix multiplication operator.

348

Q. Liu et al.

Fig. 2. Detailed structure design of HRSA module. 



 is a matrix dot-product operator. Mf t ∈ RN ×(s +d ) is the output of featureonly branch. Within the branch, we first use two 1 × 1 convolutions to convert the input feature Ff t into Q and V , respectively. The channel of V is entirely compressed, whereas the channel dimension of Q remains a high internal resolution of N ×d /2 with nearly full of feature information. Afterward, we use Sof tmax normalization to enhance the information of Q. Then Q and V are matrix multiplied, followed by 1 × 1 convolution and layer norm to increase the dimension of feature channel d /2 to d . Finally, we use the Sigmoid function to normalize the parameters and multiply them with the original input feature channel to complete the attention distribution. Spatial-only branch:

    Msp = fsg fsm fgp (Fsp WQ ) × Fsp WV  Fin ,

(3)

where WQ and WV are 1 × 1 convolution layers. fgp () is the global pooling   function. Msp ∈ RN ×(s +d ) is the output of spatial-only branch with spatial attention distribution. The computational flow of the spatial-only attention mechanism is similar to feature-only attention. However, there is a crucial difference in which we incorporate global pooling to compress the spatial dimension. The spatial dimension of V is kept at a higher resolution to ensure that fine spatial details are preserved. This operation indicates that the similarity between any two points is determined exclusively by their spatial distance. Integration: We exploit +, an element-wise addition operator, to integrate two attention branches. The integration is defined as such: HRSA(F) = Mf t + Msp 3.3

(4)

Fair Loss Function

Most existing frameworks use the Cross-Entropy (CE) Loss function for semantic segmentation of 3D point clouds. However, the uneven distribution of class

High-Resolution Self-attention with Fair Loss for Point Cloud Segmentation

349

labels in the S3DIS [11] dataset causes these networks to perform poorly in segmentation tasks. Specifically, instances of structural elements such as ceilings, floors, walls, and doors, which we refer to as dominant classes, are prevalent in the S3DIS dataset. Instances of objects and furniture such as chairs, sofas, and bookshelves, which we refer to as weak classes, are relatively rare. The dominant class usually has high predicted probabilities, resulting in small gradients and thus delaying the learning of the minority class. To tackle the problem, we propose an improved loss function named Fair Loss by introducing two balancing factors applying to the gradient and loss function computation: stabilization factor Stc,d and enhancement factor Enc,d . The stabilization factor is defined as follows: Stc,d =

χd χc

h if χc > χd ,

(5)

where χc and χd represents the number of points belonging to category c and d in the dataset, respectively. h is a hyper-parameter used to adjust the magnitude of stability. The Stabilization Factor balances class distribution by adjusting the weights between classes, which are dynamically calculated by accumulating the number of points in each category. This factor reduces the penalty through hyper-parameter h for weaker classes when another category has significantly more number of samples, mitigating unfair punishment during optimization. The enhancement factor shares a similar design idea, which is defined according to the probability ratio of each point’s prediction: Enc,d =

C c=1

φd

φc

h2 if

C 

φc > φd ,

(6)

c=1

where c and d represent two different categories, with d being the ground Ctruth label. φd represents the probability that the prediction is true, and c=1 φc represents the sum of the probabilities predicted to be other classes. We design an enhancement factor to raise the penalties for a specific class. We do so because when the stabilization factor balances the weight between dominant and weaker classes, it may reduce the penalty for the mis-segmentation of weaker classes. Through comprehensive ablation studies, we found the optimal solution of the hyper-parameter h2 to adjusts the degree of enhancement. We classify points who labeled with a specific semantic category as positive samples and those not labeled with that class as negative samples. From the perspective of the weaker class, (8) demonstrates that the gradient from the dominant class is considerable.

350

Q. Liu et al.

We propose the Fair Loss, which incorporates two factors into the CE Loss function to modify the gradient computation. We define the formula as follows: Lfair = −

N C 1  yi,c log( qi,c ) N i=1 c=1

with qi,c = C

d=c

exp(si,c ) Ti,c exp(si,d ) + exp(si,c )

(7) ,

where C is the semantic label. yi,c is the one-hot ground truth label to indicate if class label c is the correct classification for point i. qi,c is the predicted probability made by the classifier. si,c represents the score of point i belonging to category c. Besides, Ti,c = Stc,d · Enc,d . In this way, for a point i belonging to class c, the gradient of Fair Loss value Lfair to the score of its negative sample si,d is: exp(si,c ) ∂Lfair = ∂si,d Ti,c (exp(si,c ) + exp(si,d )) qi,c · exp(si,d ) . = Ti,c · exp(si,c )

(8)

Ti,c as the product of stabilization factor and enhancement factor, not only incorporating global category distribution and local prediction rate comparison but also addressing the misclassification of underrepresented categories by modifying the gradient of negative samples. Our experiments confirm that the Fair Loss consistently improves accuracy compared to the CE Loss.

4

Experiments and Evaluation

This section presents a comprehensive evaluation of our proposed network PointHR on the S3DIS [11] dataset, demonstrating improved performance with only a slight increase in computation (Floating-Point Operations Per Second, FLOPS) and the number of parameters (Params.). We implement our network using Pytorch [19] framework (version 1.10.1) with CUDA version 11.3. We use a single NVIDIA RTX 3090 (24GB) to complete our experiments. Our implementation is based on the work of Guo et al. [3] and follows the same data preprocessing steps. We train the PointHR for 100 epochs with an initial learning rate of 0.01, using a batch size of 8 with a fixed number of points (24,000) per batch. Average results in three random runs are recorded.

High-Resolution Self-attention with Fair Loss for Point Cloud Segmentation

351

Fig. 3. Visualization of semantic segmentation results on the S3DIS ch27b16.

4.1

Semantic Segmentation in S3DIS

Table 1. Results on S3DIS, evaluated on Area-5. Evaluation metrics including OA (%), mACC (%), mIoU (%), and each individual class IoU (%) Method

OA (%) mAcc (%) mIoU (%) ceiling floor wall beam column window door table chair sofa bookcase board clutter

PointNet [2] SegCloud [15] PointCNN [20] PAT [21] KPConv [22] PoitnNet++ [3] PointNeXt-L [3]

85.9 87.5 90.0

49.0 57.4 63.9 70.8 72.8 70.1 75.1

41.1 48.9 57.3 60.1 67.1 63.2 69.0

88.8 90.1 92.3 93.0 92.8 92.3 94.4

97.3 96.1 98.2 98.5 97.3 98.2 98.4

0.1 0.0 0.0 1.0 0.0 0.0 0.0

3.9 18.4 17.6 41.5 23.9 28.0 31.7

46.3 38.4 22.8 85.1 58.0 47.8 60.3

10.8 23.1 62.1 38.2 69.0 62.1 72.1

59.0 70.4 74.4 57.7 81.5 78.9 82.2

52.6 75.9 80.6 83.6 91.0 86.1 90.1

5.9 40.9 31.7 48.1 75.4 60.0 75.8

40.3 58.4 66.7 67.0 75.3 69.8 73.9

26.4 13.0 62.1 61.3 66.7 65.6 75.9

33.2 41.6 56.7 33.6 58.9 51.7 58.2

PointHR (ours) 90.3

75.6

69.8

94.5

98.5 84.0 0.0

32.9

57.9

74.1 82.6

91.3

79.8 76.1

76.2

59.2

69.8 69.9 79.4 72.3 82.4 80.4 83.9

PointHR Comparision. S3DIS [11] is a vast collection of indoor 3D point clouds created by Stanford University. We employ 6-fold cross-validation on the six regions. For validation, we exclusively use Area-5. We utilize mean classwise Intersection over Union (mIoU), mean of classwise Accuracy (mAcc), and pointwise Overall Accuracy (OA) as our evaluation metrics. Table 1 presents the performance comparison results of various segmentation methods, where PointHR outperforms several networks. PointHR attains an OA/mAcc/mIoU of 90.3%/75.6%/69.8% on Area 5, closing to the 70% mIoU benchmark. Table 2 shows the segmentation results and the computational complexity index in the 6-fold cross-validation on the S3DIS Dataset. PointHR achieves an impressive accuracy of 74.5%, which is very close to that of PointNeXt-XL (41.6M Params., 84.8G FLOPS) while with significantly smaller network size (8.5M Params., 15.7G FLOPS).

352

Q. Liu et al. Table 2. Results of 6-fold cross-validation experiment on S3DIS

Method

OA (%) mIoU (%) Params. (M) FLOPS (G) Throughput (ins./sec.)

PointNet [2]

78.5

47.6

3.6

35.5

162

PointNet++ [4]

81.0

54.5

1.0

7.2

186

PointCNN [20]

88.1

65.4

0.6

-

-

KPConv [22]

-

70.6

15.0

-

30

RandLA-Net [7]

88.0

70.0

1.3

5.8

159

Point Transformer [9] 90.2

73.5

7.8

5.6

34

PointNeXt-L [3]

89.8

73.9

7.1

15.2

115

PointNeXt-XL [3]

90.3

74.9

41.6

84.8

46

PointHR (ours)

90.3

74.5

8.5

15.7

116

Figure 3 depicts the predictions made by PointHR, which indicate great proximity to the ground truth. Figure 4 shows that PointHR can better segment object boundary regions than the baseline network. The network demonstrates an outstanding ability to determine intricate semantic features in complex 3D environments, such as the intersection of walls and doorways, the leg of the tables, and chairs. Table 3. HRSA vs. baselines for semantic segmentation on the S3DIS dataset, evaluated on Area-5 Method

OA (%)

mIoU (%)

Params. (M) FLOPS (G)

PointNet++ [4] 87.5 89.0(+1.5) + HRSA

63.2 64.7(+1.5)

1.0 1.6

7.2 7.3

PointNeXt-B [3] 89.4 90.0(+0.6) + HRSA

67.3 67.8(+0.5)

3.8 4.5

8.9 9.2

PointNeXt-L [3] 90.0 69.0 7.1 90.2(+0.2) 69.7 (+0.7) 8.5 + HRSA

15.2 15.7

Plug-and-Play HRSA. We evaluate the effectiveness and robustness of the proposed HRSA module by adding it to several benchmark networks, including modernized PointNet++ [3,4], PointNeXt-B [3], and PointNeXt-L [3]. All experiments are performed on S3DIS Area-5. As shown in Table 3, HRSA consistently improves the baseline networks by 0.7% to 1.5% mIoU while maintaining roughly the same computational complexity.

High-Resolution Self-attention with Fair Loss for Point Cloud Segmentation

Fig. 4. Visual comparison. The yellow circles highlight where our network performs better. (Color figure online)

353

Fig. 5. Segmentation results for various loss functions on the S3DIS dataset during the training phase.

Loss Function Comparison. To assess the effectiveness of our proposed Fair Loss, we conduct a comparative study with two commonly employed loss functions, namely CE Loss and CE Loss with label smoothing [23]. The benchmarking network employed for our study is the modernized PointNet++ [4]– [3]. The results are illustrated in Fig. 5, where the solid line denotes the mIoU performance, and the dashed line denotes the loss value. The results indicate that Fair Loss exhibits robust and sustained learning capability. The dynamic adjustment of the gradient through the stabilization and enhancement factors accelerates the convergence rate, leading to continued improvement in mIoU even in the later stages of training. Ultimately, our proposed Fair Loss outperforms other commonly used loss functions. 4.2

Ablation Study

We conduct ablation studies on Area-5 S3DIS [11] to assess the efficacy of individual components in the PointHR architecture. The outcomes of these experiments shed light on the extent to which each component contributes towards the overall performance of PointHR. Individual Components. The contribution of each individual module in the PointHR network is presented in Table 4. To evaluate their impact, we perform ablation studies by adding these modules incrementally to the original PointNeXt-L [3] network. Method A only adds the HRSA module, and Method B replaces CE Loss with label smoothing to Fair Loss. Method C is the PointHR proposed in this paper, which combines two modules simultaneously. The results show that both the HRSA module and Fair Loss can improve the segmentation performance of the network. Despite the competitive performance of the baseline network, which ranks among the top three networks in terms of mIoU on the

354

Q. Liu et al.

Table 4. The effect of HRSA and Fair Loss Table 5. Additive study on self-attention branches of HRSA on S3DIS Area-5 Method

HRSA Fair Loss OA (%)

PointNeXt- ✘ L (Strong Baseline)



90.0

mIoU (%)

Method

OA (%)

mAcc (%)

mIoU (%)

69.0

Improved PointNet++ [4]– [3] (Simple Baseline)

87.5

70.6

63.2

A





90.2(+0.2)

69.7(+0.7)

B





90.1(+0.1)

69.5(+0.5)

C





90.3(+0.3) 69.8(+0.8)

+Msp

88.2(+0.7)

71.6(+1.0)

64.3(+1.2)

+Mf t

89.0(+1.5)

71.6(+1.0)

64.7(+1.5)

+(Msp + Mf t )

89.1(+1.6) 71.7(+1.1) 65.4(+2.2)

S3DIS, PointHR further enhances the network to near 70% mIoU and approaches the performance of the computationally intensive PointNeXt-XL [3] network. HRSA Ablation. We then conduct experiments to evaluate the effectiveness of spatial-only attention Msp and feature-only attention Mf t on the baseline network, as well as the performance of the network with both HRSA and residual connection. The results are presented in Table 5. We find that both Msp and Mf t improved the baseline network, but the feature-only attention mechanism has a more significant impact on performance. Additionally, we attempt to add a residual connection at the output of the HRSA module to preserve more original information, but the improvement is negligible. Table 6. Ablation Study of the hyper-parameter h in Fair Loss on S3DIS Area-5 h

2 h

OA (%)

mAcc (%) mIoU (%)

0.3 0.4 0.6 0.8 1.0 2.0

6.5 5.0 3.0 2.5 2.0 1.0

88.0(+0.5) 88.5(+1.0) 88.3(+0.8) 88.8(+1.3) 88.8(+1.3) 88.3(+0.8)

70.1 72.4 71.5 72.4 71.6 73.4

63.6(+0.4) 65.0(+1.8) 64.7(+1.5) 65.3(+2.1) 64.6(+1.4) 64.2(+1.0)

Hyper-parameter h in Fair Loss. As shown in Table 6, we proceed with an ablation study on the impact of the hyper-parameter h in Fair Loss on the category-imbalanced dataset S3DIS Area-5, intending to find an optimal value. In Fair Loss, h balances the category distribution by controlling the stabilization factor, thereby reducing the penalty for weaker classes. Additionally, h2 is a complementary hyper-parameter that enhances the punishment for mis-segmentation of weak classes by controlling the enhancement factor. Thus, finding an appropriate value for h is crucial. We use improved PointNet++ proposed by Guo

High-Resolution Self-attention with Fair Loss for Point Cloud Segmentation

355

et al. [3] as the baseline network. Our experimental findings indicate that the optimal performance is attained when the hyper-parameter h is set to 0.8, resulting in a 2.1% improvement in mIoU performance compared to the baseline network.

5

Conclusion and Future Work

In this paper, we propose two novel techniques to enhance the performance of point cloud semantic segmentation. The first contribution is the introduction of a HRSA module, which preserves high-resolution internal representations in both spatial and feature dimensions during attention computation without adding more parameters and computational complexity. The second contribution is the development of an improved Fair Loss function, which leverages the idea of penalty balancing and enhancement to address the class imbalance distribution in point cloud segmentation. Experimental results on the challenging S3DIS datasets demonstrate that our proposed PointHR architecture achieves comparative performance in terms of mIoU metric. Specifically, the accuracy of 74.5% mIoU is obtained in the 6fold cross-validation on S3DIS. This performance is close to the state-of-the-art methods, while offering a significant computational advantage in terms of efficiency and computational complexity. In future work, we plan to extend these two modules to other areas to verify their generalization, such as shape classification or part segmentation. We also plan to optimize under-segmented regions’ performance, such as object boundary regions. Acknowledgement. This work was supported in part by the Heilongjiang Provincial Science and Technology Program under Grant 2022ZX01A16, and in part by the Sichuan Science and Technology Program under Grant 2022YFG0148.

References 1. Guo, Y., Wang, H., Hu, Q., Liu, H., Liu, L., Bennamoun, M.: Deep learning for 3D point clouds: a survey. IEEE Trans. Pattern Anal. Mach. Intell. 43(12), 4338–4364 (2021) 2. Qi, C.R., Su, H., Mo, K., Guibas, L.J.: PointNet: deep learning on point sets for 3D classification and segmentation. In: Proceedings of the IEEE Conference Computer Vision Pattern Recognition (CVPR), vol. 1, p. 4 (2017) 3. Qian, G., et al.: PointNeXt: revisiting PointNet++ with improved training and scaling strategies(2022). arXiv:2206.04670 4. Qi, C.R., Yi, L., Su, H., Guibas, L.J.: "PointNet++: deep hierarchical feature learning on point sets in a metric space. In: Advances in neural information processing systems , pp. 5099–5108 (2017) 5. Zhao, H., Jiang, L., Fu, C.W., Jia, J.: Pointweb: enhancing local neighborhood features for point cloud processing. In: Proeedings of the IEEE Conference Computer Vision Pattern Recognition (CVPR), pp. 5565–5573 (2019) 6. Zhang, Y., Hu, Q., Xu, G., Ma, Y., Wan, J., Guo, Y.: Not all points are equal: learning highly efficient point-based detectors for 3D LiDAR point clouds. In: IEEE/CVF Conference Computer Vision Pattern Recognition (CVPR), New Orleans, LA, USA, 2022, pp. 18931–18940 (2022)

356

Q. Liu et al.

7. Hu, Q., et al.: RandLA-net: efficient semantic segmentation of large-scale point clouds. In: Proceedings of IEEE/CVF Conference Computer Vision Pattern Recognition (CVPR), pp. 11108–11117 (2020) 8. Guo, M.-H., et al.: Point cloud transformer. Comput. Vis. Media 7(2), 187–199 (2021) 9. Zhao, H., Jiang, L., Jia, J., Torr, P.H., Koltun, V., et al.: Point transformer. In: Proceedings International Conference Computer Vision, pp. 16259–16268 (2021) 10. Lai, X., et al.: Stratified transformer for 3D point cloud segmentation. In: Proceedings of the IEEE/CVF International Conference on Computer Vision (CVPR), pp. 8500–8509 (2022) 11. Armeni, I., et al.: 3D semantic parsing of large-scale indoor spaces. In: Proceedings IEEE Conerence Computer Vision Pattern Recognition, pp. 1534–1543 (2016) 12. Boulch, A., Guerry, J., Le Saux, B., Audebert, N.: SnapNet: 3D point cloud semantic labeling with 2D deep segmentation networks. Comput. Graph. 71, 189–198 (2018) 13. Wu, B., Wan, A., Yue, X., Keutzer, K.: SqueezeSeg: convolutional neural nets with recurrent CRF for real-time road-object segmentation from 3D LiDAR point cloud. In: Proceedings IEEE International Conference on Robotics Automation (ICRA), Brisbane, QLD, Australia, 2018, pp. 1887–1893 (2018) 14. Wu, B., Zhou, X., Zhao, S., Yue, X., Keutzer, K.: SqueezeSegV2: improved model structure and unsupervised domain adaptation for road-object segmentation from a lidar point cloud. In: Proceedings IEEE International Conference on Robotics and Automation (ICRA), Montreal, QC, Canada, 2019, pp. 4376–4382 (2019) 15. Tchapmi, L., Choy, C., Armeni, I., Gwak, J., Savarese, S.: SEGCloud: semantic segmentation of 3d point clouds. In: Proceedings Inernational Conference 3D Vision (3DV), Qingdao, China, 2017, pp. 537–547 (2017) 16. Liu, Z., Tang, H., Lin, Y., Han, S.: Point-voxel CNN for efficient 3D deep learning. In: Advances in Neural Information Processing Systems, Cambridge, MA, USA:MIT Press, 2019, pp. 965–975 (2019) 17. Vaswani, A., et al.: Attention is all you need. In: Proceedings of the 31st International Conference Neural Information Processing System, pp. 5998–6008 (2017) 18. LeCun, Y., Bengio, Y., Hinton, G.: Deep learning. Nature 521(7553), 436–444 (2015) 19. Paszke, A., et al.: PyTorch: an imperative style high-performance deep learning library. In: Proceedings Advance Neural Information Processing System, pp. 8026– 8037 (2019) 20. Lin, Y., et al.: PointCNN: Convolution on χ -transformed points. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 826–836 (2018) 21. Yang, J., et al.: Modeling point clouds with self-attention and gumbel subset sampling. In: Proceedings of the IEEE/CVF Conference Computer Vision Pattern Recognition, pp. 3323–3332 (2019) 22. Thomas, H., Qi, C.R., Deschaud, J.E., Marcotegui, B., Goulette, F., Guibas, L.J.: KPConv: flexible and Deformable Convolution for Point Clouds. In: Proceedings of IEEE International Conference Computer Vision (ICCV), 2019, pp. 6411–6420 (2019) 23. Szegedy, C., Vanhoucke, V., Ioffe, S., Shlens, J., Wojna, Z.: Rethinking the inception architecture for computer vision. In: Proceedings of the IEEE Conference Computer Vision Pattern Recognition, pp. 2818–2826. (2016)

Transformer-Based Video Deinterlacing Method Chao Song1 , Haidong Li1 , Dong Zheng2 , Jie Wang1 , Zhaoyi Jiang1 , and Bailin Yang1(B) 1

School of Computer Science and Technology, Zhejiang Gongshang University, Hangzhou 310018, China [email protected] 2 UNIUBI Research, Qingdao, China [email protected]

Abstract. Deinterlacing is a classical issue in video processing, aimed at generating progressive video from interlaced content. There are precious videos that are difficult to reshoot and still contain interlaced content. Previous methods have primarily focused on simple interlaced mechanisms and have struggled to handle the complex artifacts present in real-world early videos. Therefore, we propose a Transformer-based method for deinterlacing, which consists of a Feature Extractor, a DeTransformer, and a Residual DenseNet module. By incorporating selfattention in Transformer, our proposed method is able to better utilize the inter-frame movement correlation. Additionally, we combine a properly designed loss function and residual blocks to train an end-toend deinterlacing model. Extensive experimental results on various video sequences demonstrate that our proposed method outperforms state-ofthe-art methods in different tasks by up to 1.41∼2.64dB. Furthermore, we also discuss several related issues, such as the rationality of the network structure. The code for our proposed method is available at https:// github.com/Anonymous2022-cv/DeT.git.

Keywords: Deinterlacing

1

· Transfomer · Spatial information

Introduction

Deinterlacing is a common issue in old video processing caused by the use of interlaced scan equipment. Many valuable videos are already interlaced and cannot The authors are grateful to Zhejiang Gongshang University for their valuable computing resources and outstanding laboratory facilities, as well as the support from the Zhejiang Provincial Natural Science Foundation of China(Grant No. LY22F020013), National Natural Science Foundation of China (Grant No. 62172366), “Pioneer” and “Leading Goose” R & D Program of Zhejiang Province (2023C01150), and “Digital+” Discipline Construction Project of Zhejiang Gongshang University (Grant No. SZJ2022B009). c The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2024  B. Luo et al. (Eds.): ICONIP 2023, LNCS 14451, pp. 357–369, 2024. https://doi.org/10.1007/978-981-99-8073-4_28

358

C. Song et al.

Fig. 1. De-Transformer network structure.

be reshot, and there are still interlaced scanning devices in use today, making it necessary to study deinterlacing techniques. In an interlaced video frame, the frame is divided into two fields, with one containing all odd rows and the other containing all even rows. While interlaced videos can be displayed correctly on interlaced display devices such as CRT monitors [14], they often exhibit artifacts such as horizontal stripes or water stripes when displayed on progressive devices like modern consumer LCD/LED monitors [2]. There have been various approaches proposed to address the issue of deinterlacing, such as Inter-Field Spatial Deinterlacing (IFSD) [4], Line Averaging (LA) [17], Motion Compensated Deinterlacing (MCD) [16], and others. With the success of deep learning methods in various fields, including super-resolution [7], video denoising [11], and sharpening [15], researchers have started exploring the application of deep learning to deinterlacing, and significant improvements have been achieved. Moreover, our main contributions are: 1) We first propose a Transformer-based framework for deinterlacing, which combines feature extraction, self-attention motion compensation, and dense residual processing. 2) The designed fusion module enables better utilization of inter-frame motion correlation in interlaced videos, leading to improved deinterlacing effects. 3) The proposed method is end-to-end, and achieves superior deinterlacing results compared to previous methods through proper loss function design and dense residual block design.

Transformer-Based Video Deinterlacing Method

2

359

Related Work

Traditional Methods for Video Deinterlacing. The traditional deinterlacing method is mainly used for single interlaced scanning process. However, in early video in the real world, interlacing was often mixed with many other unnatural human factors, such as blocking and noise in compression, transmission, and stored procedures, so that the performance of deinterlacing methods was often significantly reduced. Specifically, the traditional interleaved scanning mechanism can be expressed as Y = S(X1 · X2 ), where Y represents interlaced frames, S(·) represents interlaced scanning functions, and X1 and X2 represent odd and even fields. Deep learning-based methods for video deinterlacing have shown promising results. However, there is still limited research in deep learning-based deinterlacing, and challenges such as interlaced artifacts caused by large motion vectors need further investigation. In recent years, deep learning technology has been developed rapidly and has shown satisfactory results in other computer vision tasks. Therefore, some researchers have tried to apply deep learning methods to solve the deinterlacing problem, and these works have achieved good results on the whole, solving most of the interlacing problems.

3 3.1

Method Overview

This section describes the proposed Transformer-based video deinterlacing method. As shown in Fig. 1, our method consists of a feature extractor, a Transformer-based feature fusion network, and a smooth gradient network. Firstly, given a sequence of video frames, we use a stack of residual blocks to extract local features [10] from the frames. Then, the Transformer-based feature fusion network further extracts global features from spatial and temporal dimensions, which are applied to the current frame. Lastly, the smooth gradient [5] network stabilizes parameters and reconstructs the frame structure. A progressive frame Ik consists of its odd and even rows, denoted as Ik+ and − + + as , Ik− , Ik+1 Ik , respectively. Our network take three consecutive frames Ik−1 input, the output is the recovered result Ik which is reconstructed from the Ik− . Firstly the feature extraction part can be expressed as follows: + + Fk−1 , Fk , Fk+1 = F e(Ik−1 , Ik− , Ik+1 )

(1)

where F e(·) denotes the Feature Extractor operation which leverages the residual block to extract the local features {Fk−1 , Fk , Fk+1 } from the input frames. Then the results are then fed into the De-Transformer module: Ikde = DeT (Fk−1 , Fk , Fk+1 )

(2)

360

C. Song et al.

where DeT (·) denotes the De-Transformer module, which further extracts global information from the local features {Fk−1 , Fk , Fk+1 } and fuses spatial-temporal information into the Fk . Finally, we add the even rows in the middle frame and use the Residual Densenet module to reconstruct the deinterlacing result. 3.2

Feature Extractor (FE)

It consists of five residual blocks [6], each of which performs the following operations: first, a 3 × 3 convolutional layer is used to extract the input feature map, followed by a ReLU activation function. Then, another 3 × 3 convolutional layer is used to restore the channel dimension. Finally, the feature map input to the first convolutional layer is added to the output of the second convolutional layer. The specific network structure of the module is shown in Fig. 1, and the operation process of a residual block can be represented as: + + X = Ik−1 , Ik− , Ik+1

(3)

fk−1 , fk , fk+1 = ( Conv3,m ( ReLu ( Conv3,m (X ) ) ) + X)

(4)

where Conv3,m (·) denotes a convolution neural network with input channels of 3 and output channels of m, and Convm,m (·) is a convolution neural network with input channels of m and output channels of m. {fk−1 , fk , fk+1 } is the output of a residual block, and the module ultimately outputs three groups of features{Fk−1 , Fk , Fk+1 }.

Fig. 2. Architecture of the DeT.

3.3

De-Transformer (DeT)

Self-attention [8] does not capture the position of the corresponding pixels. In our experiment, there is a poorer visual effect when the position information is

Transformer-Based Video Deinterlacing Method

361

missing. Therefore, overcome this, we use 3D fixed position encoding [13] and add them to the input of the self-attention layer. The position encoding contains two spatial position messages (i.e. horizontal and vertical) and one temporal position message. The spatial-temporal position encoding (PE) as P E(pos, 2k) = sin(pos/100002k/(d/3) ) and P E(pos, 2k + 1) = cos(pos/100002k/(d/3) ), where k is an integer in [0, d/6), pos is the position in the corresponding dimension, and d is the size of the channel dimension, which needs to be divisible by 3. The De-Transformer module’s architecture is shown in Fig. 2.

Fig. 3. Architecture of the De-Fusion module.

Spatial-Temporal Convolutional Self-attention (STCS). The SpatialTemporal Convolutional Self-Attention (STCS) module captures the spatial and temporal location features of the interlaced frame sequences. Inspired by VSR [3], It uses three independent convolutional neural networks {Wq , Wk , Wv }to extract the spatial information of each frame separately. Instead of taking a complete video frame as input and using a linear projection to extract patches like in ViT [8], we obtain three sets of 3D patches, each with N = T W H/(Wp Hp ) patches with dimension d = C × Wp × Hp , where T, W, H denote the number of frames, width and height of the frame, C, Wp , Hp are the image channel, the width of the patch, and the height of the patch. Then the operation generates the matrix of query, key, and value, respectively Q = k1 (Wq ∗ X) , K = k1 (Wk ∗ X) , V = k1 (Wv ∗ X)

(5)

362

C. Song et al.

Fig. 4. Architecture of the FusionNet.

where k1 (·) represents an unfolding operation. Next, we reshape each patch into a new query matrix Q = δ (Q) and key matrix K = δ (K) with dimensions of d×N . Here,δ (·) denotes an integratedunfolding  and reshaping operation. Then, we calculate the similarity matrix σ1 QT K by aggregating it with the values matrix V . It is worth noting that the similarity matrix measures the relevance between all embedding tokens of the entire video frames, indicating that the layer has successfully captured spatial and temporal information. Finally, we use the folding operation k2 (·) to combine these updated sliding local patches into a feature map of size C × T × W × H, which is then processed by the output layer W0 to obtain the final feature map f (X) = X +

h

σ1 ( k1τ ( 

i i=1 W0 ∗ k2 ( T Wqi ∗ X ) k1τ (

 Q



Wki ∗ X ) ) k1 ( Wvi ∗ X ) ) 

 

K

(6)

V

where k1τ (·) is a composition of the reshape operation τ and the unfold operation k1 . De-Fusion Module (DeF). The feature map, which includes spatial-temporal information from the output of the STCS module, is fused by the De-Fusion module. The De-Fusion module is composed of six fusion blocks, each of which consists of a splicing fusion module and six convolutional neural networks, as shown in Fig. 3. The feature mapping f (X)obtained from the spatiotemporal self-attention convolutional module, which includes the spatiotemporal features of three video frames, is input into the anti-interference fusion module to achieve the best visual effects. The output of the anti-interference fusion module is a feature mapping with complete and continuous information for odd and even rows of pixels in a frame (Fig. 4). Specifically, the three features {Fk−1 , Fk , Fk+1 } obtained from the previous step are input into the anti-interference fusion module, and the final output

Transformer-Based Video Deinterlacing Method

363

is obtained by combining the three features {De_f6,k−1 , De_f6,k , De_f6,k+1 } through the channel-based fusion method after passing through six fusion blocks. The specific operation of the deinterlacing fusion module is as follows: De_f1,k−1 , De_f1,k , De_f1,k+1 = F B1 (Fk−1 , Fk , Fk+1 ) De_fi,k−1 , De_fi,k , De_fi,k+1 =

(7)

F Bi (De_fi−1,k−1 , De_fi−1,k , De_fi−1,k+1 )

(8)

fDe = Concat (De_f6,k−1 , De_f6,k , De_f6,k+1 )

(9)

where F B1 (·) represents the operation of the first fusion module, F Bi (·) represents the operation of the i-th fusion block, Concat (·) represents the fusion operation, f _De represents the output of the deinterlacing fusion module, and k represents the frame number of the video frame to be repaired. 3.4

Residual Densnet(RD)

The deinterlacing self-attention network module can extract a large amount of feature information, but because the information contains many redundant and complex contents. The residual dense network (RD) module is used to further process the feature information obtained from the deinterlacing fusion module. This is done to remove redundant and complex information from the output and improve the overall generation effect of the network, resulting in deinterlaced video frames. The RD network consists of N residual blocks, and in this paper, it is experimentally verified that the best results are achieved when N is set to 6. The operation of a single residual block can be expressed as: f (X)RB = Concat ( Convm,m ( LeakyReLU ( Convm,m ( X ) ) ) , X )

(10)

where Convm,m denotes a convolutional layer with m channels, and LeakyReLU is the leaky rectified linear unit activation function. f (X)RB denotes the output of a single residual block. f (X)RD = LeakyReLU (Convm,m (N × f (X)RB )) + X

(11)

where f (X)RD denotes the output of the RD module, and X is the input from the deinterlacing fusion module. The output of the RD module is obtained by passing the input through N residual blocks and then adding it back to the input, which helps to remove redundant information and retain important features for generating deinterlaced video frames.

364

4 4.1

C. Song et al.

Experiments and Results Dataset and Training

The dataset used for training the deinterlacing model is created by following a similar approach as the paper [1]. Specifically, odd rows of the (k-1)-th frame and even rows of the k -th frame are combined to synthesize one frame, and then the odd rows of the k -th frame and the even rows of the (k+1)-th frame are combined to synthesize the next frame, and so on, resulting in interlaced frames. The Vimeo90K [18] dataset, which is a large-scale, high-quality video dataset covering a variety of scenes and actions, is used for training. The progressive sequences (ground truth) from the Vimeo90K dataset are used to create the interlaced input frames for training. The training/testing split provided with the dataset is used. 4.2

Loss Function

The network is trained to minimize the loss L on the dataset D. For the image loss, we utilize the 2-norm [12] of pixel differences, defined as follows:     + , Ik θ∗ = arg min EIk ∈ D L Fθ Ik − 1+ , Ik− , Ik+1

(12)

θ

    + where L Fθ Ik − 1+ , Ik− , Ik+1 , Ik represents the 2-norm of the difference   + between the output of the method Fθ Ik − 1+ , Ik− , Ik+1 and the ground-truth Ik . In the above equations, Fθ (·) denote our method. 4.3

Results and Comparison

Through a comparison of the deep learning-based video deinterlacing methods Real-time Deep Video Deinterlacing (RDVD) [19] and Deep Deinterlacing (DD) [1], the effectiveness of the model proposed in this paper was proven in terms of both performance and experimental results. In addition, ablation experiments were conducted to demonstrate the importance of the self-attention module and residual dense module in this algorithm. The performance of the models was compared using computational PSNR and SSIM indices in the experimental section. (1) The Experimental Results on the Vimeo90K Dataset After training using the Vimeo90K dataset, several different video sequences of different types were randomly selected from the dataset’s validation set to test the performance of the testing model. Out of all the experimental results obtained, a random selection of 8 groups of videos were chosen and their PSNR and SSIM indices were measured, as well as the average PSNR and SSIM indices for the methods on the dataset. The testing PSNR and SSIM indices are shown in Table 1. From Tables 1, it can be seen that the indicators achieved by the method designed in this paper on the validation set are higher by about 10% than those

Transformer-Based Video Deinterlacing Method

365

Table 1. Comparison of PSNR (dB) between our method and existing methods on the validation set. Methonds Man

Reading Board Car

Pandas Tiger

Train Plane PSNR

BICUBIC SRCNN RDVD DD

20.23 22.10 28.70 30.67

27.40 31.26 45.32 48.27

30.65 30.14 41.12 41.86

24.91 24.81 33.61 35.43

Ours

32.36 48.56

18.88 21.33 28.24 28.66

21.23 24.85 33.61 35.43

30.22 35.77 42.85

26.95 26.92 36.18 36.88

25.39 25.04 36.68 37.07

24.46 25.81 35.43 36.78

38.97 35.77 37.90 37.80

Table 2. Comparison of PSNR (dB) between our method and existing methods on the test set. Methonds Women Running Writer Jump Apple Draw Talking PSNR BICUBIC SRCNN RDVD DD

31.25 31.26 45.32 48.27

23.55 23.51 32.04 35.48

28.26 28.38 36.79 39.09

22.48 22.50 41.15 41.59

26.76 27.88 36.64 40.73

25.75 26.46 35.50 35.09

27.91 28.01 40.68 43.18

Ours

48.56

36.38

39.22 42.82 42.14 36.40 45.81

26.19 26.42 37.75 39.64 40.75

Fig. 5. Qualitative comparison on the Vimeo 90K synthetic testing set.

366

C. Song et al.

achieved by the RDVD [19] and DD [1] models. Similarly, to verify the effectiveness of the algorithm proposed in this paper, comparisons were made between four different algorithms and the model proposed in this paper using the testing set of the dataset. A series of randomly selected video frame images were used for verification. From the experimental testing results, it can be seen that the model designed in this paper has achieved significant improvements in performance both on the validation set of the Vimeo90K dataset and the testing set. (2) Visual Results Comparison From the visualization results in Fig. 5, it can be clearly seen that the performance of the algorithm proposed in this paper in terms of video deinterlacing has been significantly improved compared with the previous methods. In addition, through partial zoom-in observation, it can be found that whether in ordinary video images or in relatively complex text images, the visualization results obtained from the model algorithm experiments in this paper are more perfect in detail processing. To assess the performance of our method in handling different interlaced video sequences, we randomly selected a range of video frames and conducted experiments using four methods, including our proposed approach. It is evident from the figure that our method yields the best results, indicating its superior performance in handling diverse interlaced video sequences. 4.4

Ablation Experiments

We conducted ablation experiments to evaluate the contribution of the spatiotemporal self-attention convolutional module and the residual dense network module in our algorithm. The Vimeo90K dataset was used for testing, and the importance of each module was assessed by measuring the PSNR metric after retraining the model. Spatio-Temporal Self-attention. In the ablation experiment for the spatiotemporal self-attention convolutional module, we focused on evaluating its impact on the overall network performance. Two experimental models were compared: one with the spatio-temporal self-attention convolutional module retained in the network structure, and the other with the module removed, directly inputting the results obtained by the feature extraction module to the antialiasing fusion module. Table 3. The ablation experiment of spatial-temporal self-attention convolution module. Self-attention Man

Reading Board Car

Pandas Tiger



34.23

34.27

41.23



36.42 35.87

35.7

28.66

36.61 30.21 42.64

30.66 32.36

Transformer-Based Video Deinterlacing Method

367

Residual Blocks. During the experiments, we found that the residual dense network module can improve the PSNR metric of the experimental results. This section discusses the importance of the residual dense network module for network performance and verifies through experiments that the number of residual blocks (RBs) used in the module can balance computational cost and model performance, achieving the best experimental results. We selected 0, 4, 6, and 8 residual blocks as experimental objects and retrained the model. The PSNR metric of the test results is shown in Table 4. Multiple sets of video frames were tested to avoid experimental errors. It can be observed from Table 4 that the experimental results with 6 RBs are better than those without using RBs or using a different number of RBs. In this work, we considered the balance between experimental effectiveness and efficiency, and setting the number of RBs to six was found to be the most suitable for our experiments. Table 4. Effect of different residual blocks on results.

4.5

PSNR

Ours-n_RB = 6 n_RB = 0 n_RB = 4 n_RB = 8

Frame_1 Frame_2 Frame_3 Frame_4 Frame_5

48.40 36.80 39.24 42.84 42.46

48.32 36.80 39.24 42.79 42.12

48.28 36.85 39.10 42.87 42.33

48.32 36.75 39.18 42.85 42.38

AVERAGE 41.95

41.86

41.88

41.47

More Comparision

To further verify the practicability of the algorithm. Therefore, the algorithm proposed in this chapter and some existing de-interlacing methods are applied to process real-world videos with interlacing artifacts, and the results are obtained and compared. The ultimate goal is to verify the effectiveness of the proposed algorithm in practical applications. In collaboration with the cartoon company, we compared the algorithm model proposed in this chapter with a relatively new DD model and two traditional deinterlacing filter methods (Yadif and Bwdif) in FFmpeg [9], using the Tianyan animation provided by them. We calculated the PSNR and SSIM between the deinterlaced video sequences generated by the above-mentioned methods and the ground truth values of the corresponding video frames provided by the cartoon company.

368

C. Song et al. Table 5. Comparison four different methods in tianyan. yadif bwdif DD

ours

PSNR(dB) 32.03 32.09 37.71 37.96 SSIM(%)

87.27 87.88 96.87 97.27

We also de-interlaced a continuous sequence of 40 frames from the Tianyan video images using four different methods.

5

Conclusion

This paper proposes a video deinterlacing algorithm based on the self-attention mechanism, which addresses the challenges faced by current video deinterlacing methods. The algorithm utilizes spatio-temporal self-attention convolution and anti-interlacing fusion modules to effectively remove interlacing artifacts in videos. Overall, the proposed algorithm shows promise in practical applications for video deinterlacing and contributes to the advancement of the field. The findings of this study provide insights for further research and development of video deinterlacing algorithms with improved computational efficiency and performance.

References 1. Bernasconi, M., Djelouah, A., Hattori, S., Schroers, C.: Deep deinterlacing. In: SMPTE Annual Technical Conference and Exhibition, pp. 1–12 (2020) 2. Bhakar, V., Agur, A., Digalwar, A., Sangwan, K.S.: Life cycle assessment of crt, lcd and led monitors. Procedia CIRP 29, 432–437 (2015) 3. Caballero, J., et al.: Real-time video super-resolution with spatio-temporal networks and motion compensation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 4778–4787 (2017) 4. Chen, M.J., Huang, C.H., Hsu, C.T.: Efficient de-interlacing technique by interfield information. IEEE Trans. Consum. Electron. 50(4), 1202–1208 (2004) 5. d’Aspremont, A.: Smooth optimization with approximate gradient. SIAM J. Optim. 19(3), 1171–1183 (2008) 6. De, S., Smith, S.: Batch normalization biases residual blocks towards the identity function in deep networks. Adv. Neural. Inf. Process. Syst. 33, 19964–19975 (2020) 7. Dong, C., Loy, C.C., He, K., Tang, X.: Image super-resolution using deep convolutional networks. IEEE Trans. Pattern Anal. Mach. Intell. 38(2), 295–307 (2015) 8. Dosovitskiy, A., et al.: An image is worth 16x16 words: transformers for image recognition at scale. arXiv preprint arXiv:2010.11929 (2020) 9. Lei, X., Jiang, X., Wang, C.: Design and implementation of a real-time video stream analysis system based on ffmpeg. In: 2013 Fourth World Congress on Software Engineering, pp. 212–216. IEEE (2013) 10. Li, J., Allinson, N.M.: A comprehensive review of current local features for computer vision. Neurocomputing 71(10–12), 1771–1787 (2008)

Transformer-Based Video Deinterlacing Method

369

11. Mahmoudi, M., Sapiro, G.: Fast image and video denoising via nonlocal means of similar neighborhoods. IEEE Signal Process. Lett. 12(12), 839–842 (2005) 12. Mao, X., Li, Q., Xie, H., Lau, R.Y., Wang, Z.: Multi-class generative adversarial networks with the l2 loss function, vol. 5, p. 00102. arXiv preprint arXiv:1611.04076 (2016) 13. Martın, J., Jiménez, A., Seco, F., Calderón, L., Pons, J.L., Ceres, R.: Estimating the 3d-position from time delay data of us-waves: experimental analysis and a new processing algorithm. Sens. Actuators, A 101(3), 311–321 (2002) 14. Post, D.L., Calhoun, C.S.: An evaluation of methods for producing desired colors on crt monitors. Color Res. Appli. 14(4), 172–186 (1989) 15. Rieder, P., Scheffler, G.: New concepts on denoising and sharpening of video signals. IEEE Trans. Consum. Electron. 47(3), 666–671 (2001) 16. Serrano, R.S.: Deinterlacing algorithms. Albalá Ingenieros SA (2016) 17. Wang, J., Jeon, G., Jeong, J.: Efficient adaptive intra-field deinterlacing algorithm using bilateral filter. In: 2012 3rd IEEE International Conference on Network Infrastructure and Digital Content, pp. 468–472. IEEE (2012) 18. Xue, T., Chen, B., Wu, J., Wei, D., Freeman, W.T.: Video enhancement with task-oriented flow. Int. J. Comput. Vision 127(8), 1106–1125 (2019) 19. Zhu, H., Liu, X., Mao, X., Wong, T.T.: Real-time deep video deinterlacing. arXiv preprint arXiv:1708.00187 (2017)

SCME: A Self-contrastive Method for Data-Free and Query-Limited Model Extraction Attack Renyang Liu1 , Jinhong Zhang1 , Kwok-Yan Lam2 , Jun Zhao2 , and Wei Zhou1(B)

2

1 Yunnan University, Kunming, China {ryliu,jhnova}@mail.ynu.edu.cn, [email protected] Nanyang Technological University, Singapore, Singapore {kwokyan.lam,junzhao}@ntu.edu.sg

Abstract. Previous studies have revealed that artificial intelligence (AI) systems are vulnerable to adversarial attacks. Among them, model extraction attacks fool the target model by generating adversarial examples on a substitute model. The core of such an attack is training a substitute model as similar to the target model as possible, where the simulation process can be categorized in a data-dependent and data-free manner. Compared with the data-dependent method, the data-free one has been proven to be more practical in the real world since it trains the substitute model with synthesized data. However, the distribution of these fake data lacks diversity and cannot detect the decision boundary of the target model well, resulting in the dissatisfactory simulation effect. Besides, these data-free techniques need a vast number of queries to train the substitute model, increasing the time and computing consumption and the risk of exposure. To solve the aforementioned problems, in this paper, we propose a novel data-free model extraction method named SCME (Self-Contrastive Model Extraction), which considers both the inter- and intra-class diversity in synthesizing fake data. In addition, SCME introduces the Mixup operation to augment the fake data, which can explore the target model’s decision boundary effectively and improve the simulating capacity. Extensive experiments show that the proposed method can yield diversified fake data. Moreover, our method has shown superiority in many different attack settings under the query-limited scenario, especially for untargeted attacks, the SCME outperforms SOTA methods by 11.43% on average for five baseline datasets. Keywords: Adversarial Attacks · Model Extraction Attacks · Black-Box Attacks · Model Robustness · Information security

1

Introduction

Recently, Trusted AI, which contains fairness, trustworthiness and explainability, has received increasing attention and plays an essential role in the AI c The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2024  B. Luo et al. (Eds.): ICONIP 2023, LNCS 14451, pp. 370–382, 2024. https://doi.org/10.1007/978-981-99-8073-4_29

SCME

371

development process. The security of the AI models, however, is being doubted and has bought concerns in academia and industry. A lot of research has shown that AI models (including Machine Learning (ML) models and Deep Learning (DL) models) are vulnerable to adversarial examples [2], which are crafted by adding a virtually imperceptible perturbation to the benign input but can lead the well-trained AI model to make wrong decisions. For example, in the physical world, the attackers can Fig. 1. Synthetic example distribution, maliciously alter traffic signs by stick- decision boundaries and whether the attack ing a small patch [12], changing the is successful. Top left: bad synthetic example distribution failed to fit the target content style [4] and shooting a laser model decision boundary. Bottom left: on it [5]. Although these modifica- unfitting of the decision boundary leads tions do not affect human senses, but to attack failure. Top right: good examcan easily trick autonomous vehicles. ple distribution and decision boundary fit. Therefore, it is imperative to devise Bottom right: good decision boundary. effective attack techniques to identify the deficiencies of AI models beforehand in security-sensitive applications [13]. Existing adversarial attack methods on DL models can be categorized into white-box attacks and black-box attacks. In the white-box settings, the attackers can access the whole information of the target model, including weights, inner structures and gradients. In contrast, in the black-box one, the attackers have no permission to access the models’ details but the final output [3,9]. With such rules, it is clear that black-box attacks are more challenging but practical in the physical world, where the attacker lacks details of the target models. To attack DL models in black-box settings more effectively, model extraction attacks have been proposed [17], which is implemented by training a substitute model and generating adversarial examples on such model to attack the target model successfully. Most of the previous model extraction attacks [14,15] concentrates on training the substitute model by querying the target model with real data, called data-dependent model extraction. However, it is infeasible to the physical world, where the adversary can not access the models’ training data. As the counterpart, the data-free model extraction attack solved this problem by synthesizing fake data [20,23]. In this scenario, the attackers use generators to synthesize the fake data to train substitute models. For launching attacks with a high success rate, as shown in Fig. 1, the decision boundary of the substitute model should maintain a very high similarity to the target black-box model. Besides, training the substitute model with synthetic data is challenging to the problem of How to generate valuable synthetic data for the substitute model training? Generally

372

R. Liu et al.

speaking, the synthetic examples should have the following two properties: 1) inter-class diversity and 2) intra-class diversity. The inter-class diversity means that the synthetic examples’ categories classified by the target model should contain all the expected classes, while the intra-class diversity indicates that the examples should differ from each other, even they belong to the same category. However, existing methods [18,20,22,23] still suffer from the following two challenges: 1) They only consider inter-class diversity but ignore intra-class diversity, resulting in synthetic data not serving as well as real data. 2) The other is that the query data are generated in the substitute model’s training process. However, once the substitute model is not well-trained, it can hardly provide the effective target model’s decision information. To solve these challenges mentioned above, in this paper, we propose a novel data-free model extraction method, named Self-Contrastive Model Extraction (SCME for short). SCME introduces the idea of contrastive learning [1] and proposes a self-contrastive mechanism to guide the training of the generator. Specifically, we design a self-contrastive loss to enlarge the distance of the substitute model’s latent representation. Benefiting from this, the generator will be encouraged to synthesize more diversified fake data. Furthermore, SCME introduces the Mixup operation to interpolate two random images into a single one to build the query examples, which can improve the efficiency of the substitute model in learning the target model’s decision boundaries in a model-independent manner. Extensive experiments illustrate SCME can synthesize fake data with diversity and improve the attack performance. Our contributions are summarized as follows: – We propose a novel data-free model extraction method, called SCME, to generate efficient fake data for the substitute model training under querylimited settings. – We use a self-contrastive mechanism to guide the generator to synthesize the fake data with inter- and intra-class diversity to help the substitute model imitate the target model efficiently. – We introduce the Mixup into SCME, which can build fake data in a modelindependent manner, to detect decision boundaries of the target model effectiveness and further help the imitating processing. – Extensive empirical results show the SCME’s superiority in the synthetic diversified fake data and the adversarial examples’ attack performance in query-limited situations.

2 Related Work

Previous research contends that DL models are sensitive to adversarial attacks, which can be classified into white-box and black-box attacks. In the white-box setting, the attackers can generate adversarial examples with a nearly 100% attack success rate because they can access the target model. The black-box attack, however, is more threatening to DL models in realistic applications because it does not need the model's details. Among black-box attacks, the model extraction


attack [21] has received much attention recently due to its high attack performance. The success of model extraction attacks relies heavily on the transferability of adversarial examples, which means that adversarial examples generated on model A can also attack model B successfully. To implement such an attack, the attacker first trains a surrogate local model by simulating the target model's output. When the surrogate model is well-trained, it will have the same decision boundary as the target model, i.e., it outputs the same results for the same input; this imitation process is called model extraction. However, due to the data bias between the query data used for surrogate model training and the real data used for target model training, creating a valid query dataset is the crucial point of model extraction attacks. Papernot et al. [15] first used adversarial examples to query the target model for model extraction. However, because the surrogate model is not well-trained, the adversarial examples generated on it do not perform well in the imitation process. Orekondy et al. [14] proposed Knockoff, which tries to find valid query examples in a huge dataset, e.g., ImageNet [8], and adopts an adaptive strategy in the extraction process. Zhou et al. [23] proposed DaST, the first work to use a generator-based data-free distillation technique from knowledge distillation for model extraction. Later, subsequent studies improved this approach to achieve better results [18]. However, the generator-based approach cannot obtain supervised information as rich as in white-box knowledge distillation, leading to a huge number of queries and low attack results. Therefore, black-box attacks that rely on the transferability of adversarial examples require the local model to be highly similar to the target model. To achieve this goal, we know from previous studies that model extraction can steal the target model from a decision-boundary perspective, even in a data-independent way. However, previous data-free works cannot guarantee the diversity of the synthetic data and need a massive number of queries to the target model. Hence, we are well-motivated to develop a better model extraction strategy adapted to data-free settings for carrying out attacks with high performance; moreover, it should improve the diversity of the generated fake data to be suitable for query-limited settings.

3 Preliminary

3.1 Adversarial Attack

Given a classifier F(·) and an input x with its corresponding label y, we have F(x) = y. The adversarial attack aims to find a small perturbation δ added to x, so that the generated input x′ misleads the classifier's output. The perturbation δ is usually constrained by an Lp-norm (p = 1, 2, ..., ∞), i.e., ‖δ‖_p ≤ ε. Then, an adversarial example x′ can be defined as:

F(x′) ≠ y_true,  s.t. ‖x′ − x‖_p ≤ ε,   (1)

where ε is the noise budget and y_true is the ground-truth label of example x.
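To make the constraint in (1) concrete, the following is a minimal PyTorch-style sketch (not from the paper) of a single-step attack, FGSM, that perturbs an input within an L∞ budget ε; the model, input and label tensors are placeholders.

```python
import torch
import torch.nn.functional as F

def fgsm_attack(model, x, y, eps=8/255):
    """One-step FGSM: x' = x + eps * sign(grad_x CE(model(x), y)),
    which keeps the perturbation inside the L-infinity ball of radius eps."""
    x_adv = x.clone().detach().requires_grad_(True)
    loss = F.cross_entropy(model(x_adv), y)
    loss.backward()
    x_adv = x_adv + eps * x_adv.grad.sign()   # step along the gradient sign
    return x_adv.clamp(0.0, 1.0).detach()     # keep pixels in a valid range
```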


Fig. 2. Framework of SCME, where LG consists of the inter- and intra-class diversity losses.

3.2 Contrastive Learning

Contrastive learning models usually consist of two portions: self-supervised training in the upstream network and supervised fine-tuning in the downstream network. The upstream network f(·) aims to maximize the agreement between paired instances obtained by augmenting the same data in different ways in the learned latent space, while minimizing the agreement between different instances. Given a batch of examples {x_N} without ground-truth labels, a random data transformation T maps each example x in {x_N} to a pair of augmented copies x_i and x_j, resulting in 2N augmented examples. The trained upstream network f(·) encodes the paired copies into latent vectors z_i and z_j. In SimCLR [1], the contrastive loss can be formulated as:

ℓ_{i,j} = − log [ exp(sim(z_i, z_j)/τ) / Σ_{k=1}^{2N} 1[k≠i] exp(sim(z_i, z_k)/τ) ],   (2)

where z_i and z_j are the latent vectors of the positive augmented pair, and z_k denotes the latent vector of a negative (non-paired) example. sim(·) is a similarity function, such as cosine similarity, 1[·] is the indicator function, and τ is the temperature coefficient. A well-trained upstream network f(·) can extract effective features for use in the downstream network, which is usually a simple MLP that maps the latent vectors to different classes through supervised learning.
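For illustration only, a compact PyTorch sketch of the SimCLR-style loss in (2), assuming the 2N latent vectors are arranged so that rows 2t and 2t+1 form a positive pair; the function name and layout are assumptions, not the paper's code.

```python
import torch
import torch.nn.functional as F

def nt_xent(z, tau=0.5):
    """z: (2N, d) latent vectors; rows 2t and 2t+1 form a positive pair."""
    z = F.normalize(z, dim=1)                 # cosine similarity via dot products
    sim = z @ z.t() / tau                     # (2N, 2N) similarity matrix
    sim.fill_diagonal_(float('-inf'))         # implements the 1[k != i] mask
    pos = torch.arange(z.size(0)) ^ 1         # index of each row's positive partner
    return F.cross_entropy(sim, pos)          # -log softmax at the positive entry
```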

4 Methodology

4.1 Overview

In this part, we illustrate the framework of our proposed data-free SCME in Fig. 2, which contains the following steps: 1) Synthesised Examples Generation and 2) Model Extraction. For step 1), we use a generator G(·) to generate the


Fig. 3. The calculation process of intra-class diverse loss. C is the number of classes.

fake data X. In step 2), we input X into both the substitute model F_sub(·) and the target model F_tgt(·) to minimize the difference between their outputs. Notably, F_sub(·) in SCME consists of an upstream encoder network F_up(·), i.e., a feature extraction network, and a downstream projector network F_down(·), i.e., a classifier. Mathematically, F_sub(·) can be written as:

F_sub(x) = F_down(F_up(x)),   (3)

where x is an arbitrary input example. Based on the two steps mentioned above, F_sub can imitate F_tgt in a data-free manner. Finally, we can generate adversarial examples by attacking F_sub, and these further attack the target model F_tgt successfully.

4.2 Intra- and Inter-Class Diversity

As mentioned above, X should have both inter-class diversity and intra-class diversity to support the surrogate model training. To this end, as Fig. 3 shows, we propose a self-contrastive loss to guide G(·) in generating X. Inspired by the self-supervised loss in contrastive learning, we design a self-contrastive loss in SCME. Firstly, SCME uses the generator G(·) on a batch of random noise N = {n_1, n_2, ..., n_B} to generate the corresponding synthesized examples X = {x_1, x_2, ..., x_B}. SCME feeds X into the feature extraction network F_up(·) and obtains the latent vectors z. Then, SCME calculates the self-contrastive intra-class diversity loss L_intra by expanding the distance between the hidden vectors z_i in z. The self-contrastive loss can be formulated as:

L_intra = log Σ_{i=1}^{B} Σ_{j=1}^{B} 1[i≠j] · exp(sim(z_i, z_j)),   (4)

where B is the batch size, 1[·] is the indicator function, and sim(·) is a similarity function. For the synthesized example generation, the loss function L_G of the generator G(·) contains the inter-class diversity loss L_inter and the intra-class diversity loss L_intra. To generate inter-class diverse examples, we use the inter-class information entropy to guide the generator G(·). That is, SCME randomly sets a batch of target labels y_tgt and reduces the entropy between y_tgt and the substitute model's outputs on the generated examples X. Mathematically, the inter-class loss function is:

L_inter = Σ_{i=1}^{B} F_sub(X_i) log[F_sub(X_i)],   (5)

where B is the batch size.
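As an illustration (the paper provides no code), a PyTorch-style sketch of how the two generator losses in (4) and (5) could be computed from a batch of latent vectors z and substitute-model output probabilities p; all names are assumptions.

```python
import torch
import torch.nn.functional as F

def intra_class_diversity_loss(z):
    """Eq. (4): log-sum of pairwise similarities between distinct latent vectors.
    Minimizing it pushes the latent vectors of generated samples apart."""
    z = F.normalize(z, dim=1)
    sim = z @ z.t()                                  # pairwise cosine similarities
    mask = ~torch.eye(z.size(0), dtype=torch.bool)   # the 1[i != j] mask
    return torch.log(torch.exp(sim[mask]).sum())

def inter_class_diversity_loss(p):
    """Eq. (5): sum of p * log p over the substitute model's output distributions."""
    return (p * torch.log(p.clamp_min(1e-12))).sum()
```

The generator objective L_G then combines the two terms with a weighting factor, as described in Sect. 4.4.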

4.3 Model-Independent Boundary Example

Although synthetic examples can be generated in this way, they still struggle to probe the target model's decision boundary adequately for substitute model training, resulting in low attack performance. To solve this problem, we further modify the boundary examples by Mixup augmentation to improve the substitute model's training efficiency. Specifically, SCME first randomly selects two synthesized examples X_i and X_j, and then uses Mixup to fuse them into new boundary examples X̂; the process can be written as follows:

X̂ = λX_i + (1 − λ)X_j,   (6)

where the mixing weight λ ∈ [0, 1] is randomly sampled from a Beta distribution.
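A minimal sketch of the Mixup step in (6); the Beta concentration value is an assumption.

```python
import torch

def mixup_pair(x_i, x_j, alpha=1.0):
    """Eq. (6): blend two synthesized batches with a Beta-distributed weight."""
    lam = torch.distributions.Beta(alpha, alpha).sample()
    return lam * x_i + (1.0 - lam) * x_j
```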

4.4 Objective Function

By combining the above inter-class diversity loss L_inter and intra-class diversity loss L_intra, we obtain the generator loss L_G as the objective function for training the generator:

L_G = L_intra + αL_inter,   (7)

where α is a hyperparameter that adjusts the weight of each loss. Once the intra- and inter-class diverse examples are generated, we input them into both the substitute model F_sub(·) and the target model F_tgt(·) to minimize the distance between their outputs. To craft more suitable examples for training the substitute model and to make its decision boundary close to the target model's during training, we first process the generated examples with the Mixup operation to obtain the boundary examples X̂. Mathematically, the objective loss function L_train for training the substitute model is:

L_train = Σ_{i=1}^{B} d(F_sub(X̂_i), F_tgt(X̂_i)),   (8)

where the distance function d(·) is the Cross-Entropy loss in the hard label scenario and is the Mean Square Error loss in the soft label scenario. Once the surrogate model is well-trained, we are able to generate adversarial examples on the substitute model and further attack the target black-box model.
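For concreteness, a hedged sketch of the distance term in (8) under the two query-feedback scenarios described above (hard labels vs. soft probability vectors); the helper name and tensors are illustrative, not the authors' code.

```python
import torch
import torch.nn.functional as F

def extraction_distance(sub_logits, tgt_output, soft_label=True):
    """Eq. (8): MSE against the target's probability vector (soft label),
    or cross-entropy against its predicted class (hard label)."""
    if soft_label:
        return F.mse_loss(F.softmax(sub_logits, dim=1), tgt_output)
    hard = tgt_output.argmax(dim=1)          # target model's predicted class id
    return F.cross_entropy(sub_logits, hard)
```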

5 Experiments

5.1 Setup

Datasets: We consider five benchmark datasets, namely MNIST [11], Fashion-MNIST [19], CIFAR-10 [7], CIFAR-100 [7], and Tiny-ImageNet [10], for comprehensive experiments.


Table 1. Attack performance (ASR, %) on the MNIST and Fashion-MNIST datasets. Each group of three values is FGSM / BIM / PGD.

MNIST
Method   | Targeted, Hard Label | Untargeted, Hard Label | Targeted, Soft Label  | Untargeted, Soft Label
JPBA     | 3.89 / 6.89 / 5.31   | 18.14 / 23.56 / 20.18  | 4.29 / 7.02 / 5.49    | 18.98 / 25.14 / 21.98
Knockoff | 4.18 / 6.03 / 4.66   | 19.55 / 27.32 / 22.18  | 4.67 / 6.86 / 5.26    | 21.35 / 28.56 / 23.34
DaST     | 4.33 / 6.49 / 5.17   | 20.15 / 27.45 / 27.13  | 4.57 / 6.41 / 5.34    | 25.36 / 29.56 / 29.14
Del      | 6.45 / 9.14 / 6.13   | 22.13 / 25.69 / 23.18  | 6.97 / 9.67 / 6.24    | 24.56 / 25.35 / 25.28
EBFA     | 14.45 / 28.71 / 9.86 | 39.73 / 57.54 / 52.73  | 16.99 / 36.82 / 14.55 | 36.45 / 58.48 / 48.46
SCME     | 9.98 / 9.96 / 10.04  | 63.45 / 74.51 / 78.47  | 10.05 / 10.00 / 10.04 | 72.46 / 78.54 / 82.54

Fashion-MNIST
Method   | Targeted, Hard Label  | Untargeted, Hard Label | Targeted, Soft Label  | Untargeted, Soft Label
JPBA     | 6.45 / 8.46 / 7.57    | 24.22 / 30.56 / 30.11  | 6.89 / 8.56 / 7.56    | 26.23 / 31.35 / 31.11
Knockoff | 6.34 / 8.35 / 7.32    | 28.19 / 36.88 / 35.92  | 6.65 / 8.98 / 8.23    | 30.21 / 36.94 / 36.22
DaST     | 5.38 / 7.18 / 6.53    | 30.45 / 36.17 / 34.23  | 5.33 / 7.46 / 7.84    | 32.14 / 37.34 / 34.91
Del      | 3.89 / 8.19 / 7.47    | 28.14 / 34.14 / 32.45  | 3.23 / 8.59 / 8.11    | 31.43 / 36.26 / 33.87
EBFA     | 30.08 / 76.46 / 32.42 | 84.85 / 80.93 / 89.30  | 29.11 / 66.02 / 43.56 | 75.19 / 79.94 / 79.30
SCME     | 31.82 / 70.79 / 70.01 | 82.26 / 84.76 / 85.11  | 32.14 / 72.07 / 72.07 | 82.58 / 85.46 / 85.86

Table 2. Attack performance (ASR, %) on the CIFAR-10 and CIFAR-100 datasets. Each group of three values is FGSM / BIM / PGD.

CIFAR-10
Method   | Targeted, Hard Label  | Untargeted, Hard Label | Targeted, Soft Label  | Untargeted, Soft Label
JPBA     | 6.32 / 7.70 / 7.92    | 27.82 / 33.23 / 31.70  | 7.28 / 8.56 / 7.64    | 28.77 / 33.38 / 31.96
Knockoff | 6.26 / 7.02 / 7.04    | 29.61 / 31.86 / 30.68  | 6.46 / 8.27 / 7.35    | 30.02 / 31.98 / 30.35
DaST     | 6.54 / 7.81 / 7.41    | 27.61 / 34.43 / 26.99  | 8.15 / 8.40 / 8.26    | 27.58 / 34.75 / 27.47
Del      | 7.14 / 7.44 / 6.95    | 25.33 / 30.45 / 30.34  | 7.86 / 8.29 / 7.17    | 26.38 / 31.53 / 31.47
EBFA     | 14.57 / 16.95 / 12.27 | 86.13 / 87.02 / 84.32  | 31.54 / 13.93 / 69.14 | 83.89 / 87.68 / 85.11
SCME     | 16.53 / 14.76 / 14.22 | 91.01 / 91.56 / 91.33  | 16.22 / 15.14 / 15.31 | 91.23 / 91.62 / 91.66

CIFAR-100
Method   | Targeted, Hard Label  | Untargeted, Hard Label | Targeted, Soft Label  | Untargeted, Soft Label
JPBA     | 4.35 / 6.20 / 6.17    | 33.58 / 38.54 / 37.08  | 5.73 / 7.50 / 6.41    | 34.21 / 39.12 / 37.31
Knockoff | 4.40 / 5.86 / 5.25    | 34.84 / 36.92 / 36.34  | 4.88 / 7.05 / 6.18    | 36.01 / 37.61 / 35.47
DaST     | 4.97 / 6.19 / 5.92    | 33.57 / 39.86 / 32.71  | 6.38 / 7.04 / 7.01    | 32.80 / 40.34 / 32.78
Del      | 5.38 / 5.72 / 5.69    | 30.80 / 35.63 / 36.15  | 6.30 / 6.53 / 5.23    | 31.64 / 36.63 / 37.44
EBFA     | 16.64 / 16.88 / 12.77 | 78.61 / 91.31 / 91.21  | 7.91 / 16.15 / 12.54  | 83.69 / 94.53 / 94.14
SCME     | 18.46 / 14.23 / 13.13 | 94.81 / 95.40 / 95.32  | 10.50 / 16.02 / 15.49 | 94.72 / 95.09 / 94.95

Models: For the MNIST and Fashion-MNIST datasets, we use a simple network as the target model, which has four convolution layers with pooling and two fully-connected layers. For CIFAR-10 and CIFAR-100, we use ResNet-18 [6] as the target model. For Tiny-ImageNet, we use ResNet-50 [6] as the target model. The substitute model for all datasets is VGG-16 [16]. Baselines: To evaluate the performance of SCME, we compare it with the data-dependent methods JPBA [15] and Knockoff [14], and the data-free methods DaST [23], Del [18], and EBFA [22]. Training Details: SCME and the baseline methods are trained with the Adam optimizer with batch size 256. For the generator in SCME, we use an initial learning rate of 0.001 and a momentum of 0.9; for the substitute model, we set the initial learning rate to 0.01 and the momentum to 0.9. Furthermore, we set the maximal number of queries to 20K, 100K and 250K for the MNIST datasets, the CIFAR datasets and the Tiny-ImageNet dataset, respectively. Metrics: We utilize three classical attack methods, FGSM, BIM and PGD, to generate adversarial examples on the surrogate model. For MNIST and Fashion-MNIST, we set the perturbation budget ε = 32/255; for CIFAR-10, CIFAR-100 and Tiny-ImageNet, we set ε = 8/255. In the untargeted attack

Table 3. Attack performance (ASR, %) on the Tiny-ImageNet dataset. Each group of three values is FGSM / BIM / PGD.

Method   | Hard Label            | Soft Label
JPBA     | 14.23 / 26.54 / 26.83 | 15.37 / 25.16 / 28.91
Knockoff | 11.26 / 29.99 / 26.17 | 22.33 / 21.39 / 27.64
DaST     | 15.86 / 28.81 / 26.51 | 16.23 / 18.26 / 29.37
Del      | 29.73 / 34.28 / 36.72 | 28.31 / 32.54 / 38.49
EBFA     | 78.23 / 80.26 / 78.29 | 78.29 / 81.12 / 85.32
SCME     | 89.72 / 96.44 / 96.32 | 90.16 / 90.25 / 96.29

Fig. 4. Synthesised examples without (left) and with (right) data augmentation.

scenario, we only generate adversarial examples for images that are classified correctly by the victim model, while in targeted attacks, we only generate adversarial examples for images that are not already classified as the specific wrong label. The attack success rate (ASR) is calculated by:

ASR = (1/N) Σ_{i=1}^{N} 1[f(x_i^adv) ≠ y_i]  for untargeted attacks,
ASR = (1/N) Σ_{i=1}^{N} 1[f(x_i^adv) = y_t]  for targeted attacks,   (9)

where N is the total number of generated adversarial examples. Besides, given a batch of query examples X, we input them to the target model F_tgt to get the output for each example and calculate its Boundary Value (BV) to verify whether the query samples are close to the decision boundary of the target model. The proposed BV is calculated as follows:

BV = Σ_{i=1}^{B} ( p(F_tgt(X_i))_top1 − p(F_tgt(X_i))_top2 ),   (10)

where p is the softmax function, top1 and top2 are the maximum and second-largest values in the output probability vector, and B is the total number of examples.
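A short illustrative sketch (not the authors' code) of how the ASR in (9) and the boundary value in (10) could be computed from model outputs; tensor names are assumptions.

```python
import torch
import torch.nn.functional as F

def attack_success_rate(adv_logits, y, y_target=None):
    """Eq. (9): fraction misclassified (untargeted),
    or fraction classified as the target label (targeted)."""
    pred = adv_logits.argmax(dim=1)
    hits = pred.eq(y_target) if y_target is not None else pred.ne(y)
    return hits.float().mean()

def boundary_value(tgt_logits):
    """Eq. (10): sum of (top-1 minus top-2) softmax probabilities over the batch;
    smaller values mean the queries lie closer to the decision boundary."""
    probs = F.softmax(tgt_logits, dim=1)
    top2 = probs.topk(2, dim=1).values
    return (top2[:, 0] - top2[:, 1]).sum()
```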

5.2 Attack Performance

Experiments on MNIST and Fashion-MNIST: We report the ASR under targeted and untargeted attacks for both label-only and probability-only scenarios. As shown in Table 1, the ASR of SCME is much higher than the SOTA baselines on MNIST and Fashion-MNIST datasets. Obviously, our method can obtain higher ASR than other baselines in most cases with a small number of queries (here is 20K). This phenomenon shows that the proposed method is more applicable to the real world than the baselines.

Fig. 5. The t-SNE of the original CIFAR-10 data (left), synthetic data by EBFA (middle) and synthetic data by SCME (right).

Experiments on CIFAR-10, CIFAR-100 and Tiny-ImageNet: We further investigate the performance of our method on more complex datasets. From the results shown in Tables 2 and 3, our method achieves the best attack performance in both the probability-only and label-only scenarios on all datasets. In addition, compared to the strong baseline EBFA, our method still outperforms it significantly. Although the number of categories directly affects the training of the substitute model, our method still achieves a very high ASR on the CIFAR-100 and Tiny-ImageNet datasets, which have 100 and 200 categories, respectively. On the Tiny-ImageNet dataset, our method even achieves the highest ASR of 96.44% in the soft label setting. These improvements effectively demonstrate the superiority of the proposed SCME.

Table 4. Boundary values of EBFA and SCME.

Method       | Boundary Value
EBFA w/o aug | 9150.8699
EBFA w/ aug  | 9010.6072
SCME w/o aug | 9469.6909
SCME w/ aug  | 8717.2107

Fig. 6. The ablation results of model accuracy (left) and model ASR (right), where “- L sc” means without self-contrastive loss, and “- Mixup” means without Mixup operation.

5.3 Evaluation on Data Diversity

To evaluate the diversity of the data generated by the strong baseline EBFA and by the proposed SCME, we generated 10,000 examples with each and fed them into the same model trained on the CIFAR-10 dataset to obtain the predicted labels. Figure 4 shows the generated data with and without data augmentation. The results show that most of the examples generated by EBFA were classified as "deer", while the synthetic examples produced by SCME have preferable inter-class diversity. Further, we plot the t-SNE of the real data and of the synthetic examples generated by EBFA and SCME, respectively, in Fig. 5. The results illustrate that our method generates examples similar to the real data, i.e., with more intra-class diversity. These phenomena strongly support that our method can generate data with high inter- and intra-class diversity.

5.4 Evaluation on Boundary Value

To verify whether the query examples are closer to the decision boundary of the target model, we compared the BV of 10,000 examples generated by EBFA and by SCME. The results in Table 4 show that although EBFA achieves a smaller BV without data augmentation, SCME achieves a substantially lower BV with data augmentation. This further demonstrates the effectiveness of the Mixup operation in SCME for generating query examples close to the decision boundary.

5.5 Ablation Study

To investigate the contributions of the self-contrastive loss L_G (described in Sect. 4.2) and the Mixup operation, we plot the model classification accuracy (ACC) and the model ASR during the training process. The results in Fig. 6 show that using both L_G and the Mixup augmentation performs best on both ACC and ASR, and the model training also converges faster. For instance, the standard SCME is close to convergence with 6K queries, and its ASR is already beyond 80%.

6 Conclusion

In this paper, we proposed a novel data-free model extraction attack, namely SCME, to boost the attack performance under query-limited settings. Specifically, we first design a self-contrastive loss to guide the generator to synthesize the query data with high inter- and intra-class diversity. Besides, we introduce the Mixup augmentation to combine two generated query samples as the final query input to obtain effective decision boundaries and further help the simulation process of the substitute model. Extensive empirical results show that the proposed SCME framework can achieve SOTA attack performance. Acknowledgements. This work is supported in part by Yunnan Province Education Department Foundation under Grant No.2022j0008, in part by the National Natural Science Foundation of China under Grant 62162067 and 62101480, Research and Application of Object Detection based on Artificial Intelligence, in part by the Yunnan Province expert workstations under Grant 202205AF150145.

References
1. Chen, T., Kornblith, S., Norouzi, M., Hinton, G.E.: A simple framework for contrastive learning of visual representations. In: ICML, vol. 119, pp. 1597–1607 (2020)
2. Demuynck, K., Triefenbach, F.: Porting concepts from DNNs back to GMMs. In: ASRU, pp. 356–361 (2013)
3. Dong, Y., Pang, T., Su, H., Zhu, J.: Evading defenses to transferable adversarial examples by translation-invariant attacks. In: CVPR, pp. 4312–4321 (2019)
4. Duan, R., Ma, X., Wang, Y., Bailey, J., Qin, A.K., Yang, Y.: Adversarial camouflage: hiding physical-world attacks with natural styles. In: CVPR, pp. 997–1005 (2020)
5. Duan, R., et al.: Adversarial laser beam: effective physical-world attack to DNNs in a blink. In: CVPR, pp. 16062–16071 (2021)
6. He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: CVPR, pp. 770–778 (2016)
7. Krizhevsky, A., Hinton, G., et al.: Learning multiple layers of features from tiny images (2009)
8. Krizhevsky, A., Sutskever, I., Hinton, G.E.: ImageNet classification with deep convolutional neural networks. In: NIPS, pp. 1106–1114 (2012)
9. Kurakin, A., Goodfellow, I.J., Bengio, S.: Adversarial examples in the physical world. In: ICLR (2017)
10. Le, Y., Yang, X.S.: Tiny ImageNet visual recognition challenge (2015)
11. LeCun, Y., Cortes, C., Burges, C.: MNIST handwritten digit database (1998)
12. Liu, A., et al.: Perceptual-sensitive GAN for generating adversarial patches. In: AAAI, pp. 1028–1035 (2019)
13. Liu, J., Park, J.: Seeing is not always believing: detecting perception error attacks against autonomous vehicles. IEEE Trans. Dependable Secure Comput. 18(5), 2209–2223 (2021)
14. Orekondy, T., Schiele, B., Fritz, M.: Knockoff nets: stealing functionality of black-box models. In: CVPR, pp. 4954–4963 (2019)
15. Papernot, N., McDaniel, P.D., Goodfellow, I.J.: Practical black-box attacks against machine learning. In: AsiaCCS, pp. 506–519 (2017)
16. Simonyan, K., Zisserman, A.: Very deep convolutional networks for large-scale image recognition. In: ICLR (2015)
17. Tramèr, F., Zhang, F., Juels, A., Reiter, M.K., Ristenpart, T.: Stealing machine learning models via prediction APIs. In: USENIX Security, pp. 601–618 (2016)
18. Wang, W., et al.: Delving into data: effectively substitute training for black-box attack. In: CVPR, pp. 4761–4770 (2021)
19. Xiao, H., Rasul, K., Vollgraf, R.: Fashion-MNIST: a novel image dataset for benchmarking machine learning algorithms (2017)
20. Yu, M., Sun, S.: FE-DaST: fast and effective data-free substitute training for black-box adversarial attacks. Comput. Secur. 113, 102555 (2022)
21. Yuan, X., Ding, L., Zhang, L., Li, X., Wu, D.O.: ES attack: model stealing against deep neural networks without data hurdles. IEEE Trans. Emerging Topics Comput. Intell. 6(5), 1258–1270 (2022)
22. Zhang, J., et al.: Towards efficient data free blackbox adversarial attack. In: CVPR, pp. 15094–15104 (2022)
23. Zhou, M., Wu, J., Liu, Y., Liu, S., Zhu, C.: DaST: data-free substitute training for adversarial attacks. In: CVPR, pp. 231–240 (2020)

CSEC: A Chinese Semantic Error Correction Dataset for Written Correction

Wenxin Huang1,2, Xiao Dong1,2, Meng-xiang Wang3, Guangya Liu1,2, Jianxing Yu1,2,4(B), Huaijie Zhu1,2, and Jian Yin1,2

1 School of Artificial Intelligence, Sun Yat-sen University, 519082 Zhuhai, China
{huangwx67,dongx55,liugy28}@mail2.sysu.edu.cn, {yujx26,zhuhuaijie,issjyin}@mail.sysu.edu.cn
2 Guangdong Key Laboratory of Big Data Analysis and Processing, 510006 Guangzhou, China
3 China National Institute of Standardization, 100088 Beijing, China
[email protected]
4 Pazhou Lab, Guangzhou 510330, China

Abstract. Existing research primarily focuses on spelling and grammatical errors in English, such as missing or wrongly adding characters. This kind of shallow error has been well-studied. Instead, there are many unsolved deep-level errors in real applications, especially in Chinese, among which semantic errors are one of them. Semantic errors are mainly caused by an inaccurate understanding of the meanings and usage of words. Few studies have investigated these errors. We thus focus on semantic error correction and propose a new dataset, called CSEC, which includes 17,116 sentences and six types of errors. Semantic errors are often found according to the dependency relations of sentences. We thus propose a novel method called Desket (Dependency Syntax Knowledge Enhanced Transformer). Desket solves the CSEC task by (1) capturing the syntax of the sentence, including dependency relations and part-of-speech tagging, and (2) using dependency to guide the generation of the correct output. Experiments on the CSEC dataset demonstrate the superior performance of our model against existing methods.

Keywords: Writing Correction · Semantic Errors · Error Correction

1 Introduction

Textual errors are prevalent in written communication across different languages and can be broadly categorized into distinct types based on their attributes, including spelling, grammar, and semantic errors. These errors are primarily caused by human negligence or biases inherent in language models. Regardless of their specific type, these errors have significant implications for comprehension and interpretation, often resulting in miscommunication and misunderstandings of the intended message. Therefore, it is crucial to address these errors through


Fig. 1. The left part shows examples of spelling and grammar errors. The right part shows examples of semantic errors. The words in red are errors and in blue are corresponding corrections. (Color figure online)

effective writing correction practices in various contexts, such as academic articles, publications, and business documentation [9]. Spelling errors and grammatical errors are considered shallow-level mistakes, such as missing or extra characters, incorrect word choice, or transposed word order; for example, ‘一平酒’ should be ‘一瓶酒’ (one bottle of wine). Instead, there are many unsolved deep-level errors in real applications, and semantic errors are one of them. Such errors are mainly caused by an inaccurate understanding of the meanings and usage of words. For example, in the sentence ‘老师纠正并且指出了我的错误’ (The teacher corrected and pointed out my mistake), ‘纠正并且指出’ (correct and point out) has an incorrect logical order: ‘指出’ (point out) should come before ‘纠正’ (correct). The correction should be ‘指出并且纠正’ (point out and correct). Previous research on Chinese writing correction has predominantly focused on addressing spelling and grammar errors, often neglecting semantic errors. To bridge this research gap, we propose to focus on this new task. We observe that dependency relations help to find semantic errors. As illustrated in Fig. 2, the parsing tree reveals syntactic collocational connections that hold semantic relevance. By leveraging the information provided by dependency relations and POS, we detect the erroneous components based on their context. That can also facilitate the generation of potential correction options, including modifying POS, adjusting phrase order, or adding/removing keywords. Motivated by the above observation, we introduce a novel method called Desket (Dependency Syntax Knowledge Enhanced Transformer) to address semantic errors in Chinese sentences. We first encode the input by Bert [4], and then employ a syntactic analysis module to extract the dependency relations and POS information from the sentence. To align the character-level semantic


Fig. 2. The dependency syntax structure of a sentence. The words in red are errors, in blue are part-of-speech tags, and in green are dependency relations. For example, the label ‘COO’ indicates a coordination relationship, suggesting that the words ‘correct’ and ‘point out’ have the same status. (Color figure online)

information, we combine the syntactic structure and part-of-speech information to construct a character-level dependency syntax graph. The knowledge derived from this graph is captured by a Graph Transformer. We then integrate the dependency syntax knowledge with the semantic information to identify the error category of the sentence. Finally, the dependency syntax knowledge guides the generator in generating the correct sentence. To evaluate our model, we create a large-scale dataset called CSEC. Extensive experiments are conducted on the dataset. The results demonstrate the effectiveness of our model. In summary, our contributions in this paper include:
– We introduce a novel dataset called CSEC, which comprises 17,116 sentences and encompasses six entirely new types of Chinese semantic errors that are not covered in existing Chinese writing correction datasets;
– We propose a novel approach to address semantic errors by leveraging dependency relations, part-of-speech information, and textual semantics of words;
– Experiments demonstrate that Desket achieves the best results on the CSEC dataset.

2 Related Work

The current Chinese writing correction datasets primarily consist of Chinese Spelling Correction (CSC) and Grammatical Error Correction (CGEC). CSC is a fixed-length writing correction task, and there are several datasets available for CSC: SIGHAN [10,14,16]: SIGHAN datasets serve as the first official benchmark datasets in the field of CSC. Wang271K [13]: This dataset comprises 271,009 training samples and is currently the largest CSC dataset. FA-OCR [5]: In this dataset, the authors utilized OCR technology to convert Chinese subtitles from a video into Chinese text. CGEC, on the other hand, is a variable-length writing correction task, and there are several datasets associated with it: NLPCC 2018 [21]: The training


data for this dataset is sourced from lang-8 [17] and consists of over 700,000 data points. The test data is collected from the PKU Chinese Learner Corpus. CGED [7,8]: It is the first official benchmark dataset for Chinese grammatical error correction and remains the most significant evaluation dataset in the CGEC field. MuCGEC [19]: The author selected 7,063 sentences from the NLPCC2018, CGED2018, CGED2020 datasets, and lang-8 for reannotation, creating a multi-source CGEC dataset. CTC2021 [20]: The authors artificially introduced errors in texts by randomly inserting, deleting, replacing, and transposing words. CCTC [12]: CCTC is a cross-sentence error dataset manually annotated by the authors using 1,500 texts written by native speakers. Unlike these datasets, our CSEC dataset focuses on identifying and correcting Chinese semantic errors.

3 Creation of Chinese Semantic Error Dataset

Data Collection and Annotation. We first manually collected and annotated the data, transforming it into Chinese semantic error correction data suitable for deep learning models. In detail, we gathered 687 publicly available documents from Baidu Wenku, Doc88, and Xueke Net. These documents include middle and high school entrance examination papers, regular examination papers, specialized training exercises, and PPT presentations. A statistical overview of the collected data is illustrated in Fig. 3(a). Subsequently, we carefully filtered and screened the documents to identify exercises containing Chinese semantic errors. The exercises are presented in the format shown in Fig. 3(b). To ensure annotation quality, we recruited 20 undergraduate students and divided them into two groups to conduct mutual checking.

Fig. 3. Collected documents and data

As shown in Fig. 1, semantic errors in Chinese can be categorized into six main categories.


Table 1. Data examples. Each example contains the erroneous wording together with its correction (marked in red and blue, respectively, in the original).

Type                  | Wrong Sentence / Correction
Inadequate Collocation | Mount Lu in summer is a good season place to avoid summer heat.
Incorrect Ordering     | He pointed out and corrected corrected and pointed out my mistakes.
Redundant Elements     | There are about 50 people around in this class.
Missing Elements       | Through his explanation, make I was greatly inspired.
Confused Structure     | The teacher teaches us because the reason it is her responsibility.
Illogical Expressions  | To prevent not accidents, we have done many drills.

(1) Inadequate Collocation (IC): IC refers to the phenomenon of combining words with conflicting semantics in a forced manner. For example, the sentence ‘夏日的庐山是人们避暑的好季节’ demonstrates an inappropriate collocation between the subject and object, as the subject ‘庐山 (Mount Lu)’ is erroneously combined with the object ‘季节 (season).’ Mount Lu cannot be considered a season; instead, it should be referred to as a specific location or place. (2) Incorrect Ordering (IO): IO occurs when the arrangement of words is disorganized, leading to a lack of coherence and logical consistency in the text. (3) Redundant Elements (RE): RE refers to superfluous words or phrases that lead to repetition and verbosity in a sentence. For example, the sentence ‘参加此活动的大约有50人左右’ contains both ‘大约’ (approximately) and ‘左右’ (around), which serve the same purpose, resulting redundancy. (4) Missing Elements (ME): ME occurs when a sentence lacks a specific element. For example, in the sentence ‘王老师荣获校先进工作者。’ (Teacher Wang was awarded as an advanced worker by the school). The collocation between ‘荣 获’ (awarded) and ‘先进工作者’ (advanced worker) is incorrect as it lacks the object ‘称号’ (title) for the verb ‘荣获’ (awarded). (5) Confused Structure (CS): CS happens when multiple modes of expression are arbitrarily mixed together within a single sentence. For instance, in


the sentence ‘这些蔬菜长得这么好,是由于社员们精心管理的结果。’ (These vegetables grow so well due to the careful management of the members.), two expressions are used: ‘由于...’ (due to) and ‘...的结果’ (as a result of), and it suffices to retain one of them. (6) Illogical Expressions (IE): IE denotes a condition in which the structure of a sentence fails to conform to logical reasoning. For instance, in the sentence ‘为了防止不再发生意外,我们制定出具体的改进措施’ both ‘防止’ (prevent) and ‘不’ (not) express negation. The presence of a double negative construction transforms the sentence into an affirmative statement, thereby deviating from its intended meaning. Hence, it is necessary to remove one of these negations.

Table 2. Comparison with natural error datasets. CSL means Chinese as Second Language learners; NCS means Native Chinese Speakers. For NLPCC 2018, the statistics are for its test set.

Dataset          | Task | Sent/Unit | Err. Sent. (%) | Source
SIGHAN 2013 [14] | CSC  | 1,700     | 76.1           | CSL
SIGHAN 2014 [16] | CSC  | 4,497     | 88.1           | CSL
SIGHAN 2015 [10] | CSC  | 3,439     | 84.0           | CSL
CGED [7]         | CGEC | 16,055    | 54.2           | CSL
NLPCC2018 [21]   | CGEC | 2,000     | 99.2           | CSL
CCTC [12]        | CGEC | 1,500     | 9.8            | NCS
MuCGEC [19]      | CGEC | 7,063     | 92.7           | CSL
CSEC             | CSEC | 17,116    | 77.5           | NCS

Data Analysis. Table 2 presents the comparison between our dataset with other ones. Our CSEC is the first large-scale dataset for Chinese semantic error correction. Figure 4 presents the statistics of the CSEC dataset. From Fig. 4(b), it can be observed that the average sentence length of the Chinese semantic error dataset is 42. According to the study by [20], the average sentence length in Chinese writing is approximately 15 characters. This indicates that Chinese semantic errors are more likely to occur in longer sentences, requiring stronger contextual awareness for error detection and correction. From Fig. 4(a) and Fig. 4(c), it can be deduced that the most prominent types of errors in Chinese semantic errors are inadequate collocations and missing elements, accounting for 19% and 17% respectively. Unlike English, Chinese does not have explicit spaces between words, and Chinese words do not have clear part-of-speech attributes. This increases the difficulty in analyzing sentence constituents. Figure 4(d) illustrates the distribution of answer counts for incorrect sentences in the CSEC dataset, showing that the majority of incorrect sentences have only one correct answer, while a smaller portion has two or even three alternative modification answers.


Fig. 4. Statistics of CSEC Dataset. The abbreviation ‘IC’ refers to Inadequate Collocation, ‘IO’ represents Incorrect Ordering, ‘RE’ signifies Redundant Elements, ‘ME’ denotes Missing Elements, ‘CS’ stands for Confused Structure, ‘IE’ indicates Illogical Expressions, and ‘EF’ means Error Free. The subsequent paper will employ these abbreviations accordingly, and further elaboration will be omitted.

4 Methodology

As illustrated in Fig. 5, we introduce the main architecture of our model. Semantic Encoder. We use Bert [4] as the foundational architecture for our semantic encoder. Given an input sentence Sen = (s_1, ..., s_n) containing n tokens, we pass it through the encoder layers of the Bert Transformer [11] to learn feature representations in the l-th layer and get the textual features H^sen_l = Transformer_l(H^sen_{l−1}), l ∈ [1, L]. Within the Bert transformer, each layer consists of a multi-head attention module and a feed-forward network with residual connections and layer normalization. The output of the final layer, H^sen = H^sen_L, is used as the semantic information of the input text. Syntax Encoder. We first use a syntax parser to extract the Dependency Syntax Tree (DST) of the sentence Sen = (s_1, ..., s_n). This can be represented as:

DST = SyntaxParser(Sen),   (1)


Fig. 5. Structure of Desket. Firstly, the semantic encoder and syntactic encoder capture the semantic information and syntactic knowledge of the sentence separately. The category identification module identifies incorrect categories of sentences by integrating semantic information and syntactic knowledge. Finally, syntactic knowledge will serve as a guiding signal to guide the answer decoder in generating correct sentences.

where DST consists of three components: 1) the word segmentation result; 2) the part-of-speech tags corresponding to each segmented word; and 3) the syntactic dependency relations. Dependency Syntax Graph. To ensure that the model comprehends the meaning of each word, we address the disparity between the word-level syntax tree extracted by the syntax parser and the character-level semantic information. This involves transforming the word-level syntax tree into a character-level Dependency Syntax Graph (DSG), aligning the semantic and syntactic knowledge of the sentence. In this process, we represent each Chinese character in the sentence as a node within the graph. The graph construction consists of two main steps. Firstly, we establish edges to incorporate part-of-speech information: when two characters can form a complete word, we introduce an edge between them to signify the corresponding part-of-speech. Secondly, we add edges to represent dependency relations: whenever a dependency relationship exists between two words, we create an edge connecting the last character of the first word to the first character of the second word, and the weight of this edge corresponds to the syntactic relation between the two words. After obtaining the dependency syntax graph, we employ a Graph Transformer [1] to model the DSG, allowing the model to acquire syntactic knowledge of the sentence. In our DSG, the edges encapsulate the dependency relations and part-of-speech information of the sentence. As a result, the Graph Transformer can effectively learn the dependency relations and part-of-speech information by assigning significant importance to these edges during attention score calculation. This can be expressed as:

H^syn = GraphTransformer(DSG),   (2)

where H^syn is the syntactic representation of the input sentence containing the dependency relation and part-of-speech information. Error Category Classifier. After obtaining the semantic and syntactic representations of the input sentence, we concatenate them along the feature dimension. These concatenated features are then fed into a multi-layer perceptron (MLP) to classify the sentence's error type. This process is formulated as follows:

t = argmax(Softmax(MLP(H^sen ⊕ H^syn))),   (3)

where t represents the error type of the input sentence and ⊕ denotes vector concatenation. Answer Decoder. It is constructed on a standard Transformer decoder comprising N_dec identical blocks. In our proposed model, we introduce two specialized cross-attention modules within the decoder to ensure attention to both the original sentence and its syntactic representation. One cross-attention module focuses on the interaction between semantic information and syntactic knowledge, while the other emphasizes the relationship between semantic information and error categories. The decoder takes inputs from three sources: the semantic information, the syntactic knowledge, and the error category obtained from the aforementioned modules. The error category is encoded as a special token added at the beginning of the input sequence. After passing the inputs through a self-attention layer, the decoder first computes attention between the syntactic knowledge and the input using the first cross-attention mechanism; subsequently, attention is calculated between the semantic information and the input using the second cross-attention mechanism. This design enables the model to concentrate on words that possess dependency relations and to make predictions based on the combined semantic information. The learning process is as follows, where y represents the decoder input:

y = LN(y + SelfAttn(y)),            (4)
y = LN(y + CrossAttn(y, H^syn)),    (5)
y = LN(y + CrossAttn(y, H^sen)),    (6)
y = LN(y + FeedForward(y)),         (7)
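For illustration, a hedged PyTorch sketch of one decoder block implementing (4)–(7): self-attention followed by cross-attention over the syntactic representation and then over the semantic representation. Module names and dimensions are assumptions, not the authors' Desket implementation.

```python
import torch.nn as nn

class DualCrossAttnBlock(nn.Module):
    """One decoder block of Eqs. (4)-(7): self-attention, cross-attention to the
    syntactic representation H_syn, cross-attention to the semantic representation
    H_sen, then a feed-forward layer, each with a residual + LayerNorm."""
    def __init__(self, d_model=768, n_heads=8):
        super().__init__()
        self.self_attn = nn.MultiheadAttention(d_model, n_heads, batch_first=True)
        self.syn_attn = nn.MultiheadAttention(d_model, n_heads, batch_first=True)
        self.sen_attn = nn.MultiheadAttention(d_model, n_heads, batch_first=True)
        self.ffn = nn.Sequential(nn.Linear(d_model, 4 * d_model), nn.GELU(),
                                 nn.Linear(4 * d_model, d_model))
        self.norms = nn.ModuleList([nn.LayerNorm(d_model) for _ in range(4)])

    def forward(self, y, h_syn, h_sen):
        y = self.norms[0](y + self.self_attn(y, y, y)[0])         # Eq. (4)
        y = self.norms[1](y + self.syn_attn(y, h_syn, h_syn)[0])  # Eq. (5)
        y = self.norms[2](y + self.sen_attn(y, h_sen, h_sen)[0])  # Eq. (6)
        return self.norms[3](y + self.ffn(y))                     # Eq. (7)
```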

Loss Function. In our framework, we have two distinct sub-tasks: the identification task and the correction task. The identification task involves multi-class classification, while the correction task entails generation. The multi-task loss functions are defined as follows:

L_cls = − Σ_{i=1}^{K} y_i log(p_i),
L_gen = − Σ_{j=1}^{n} log p(y_j | y_{<j}, ...),

L(predicted wrongly) > L(predicted correctly)   (9)

Under the guidance of UAL, the relation of the loss function caused by different prediction results is given by the following inequality (formula 10):

L(predicted wrongly with small uncertainty) > L(predicted wrongly with large uncertainty) > L(predicted correctly with large uncertainty) > L(predicted correctly with small uncertainty)   (10)

3.2 Uncertainty Aware Two-Stage Refinement Model

Uncertainty-Aware Two-stage Refinement model UATR is introduced to further improve multi-classification performance in this section. The overall framework of the model is shown in Fig. 2. The uncertainty estimation model guided by UAL is in the first stage. The variational fully connected layer implemented by the MC Dropout method is used to obtain predictions and corresponding uncertainty. The goal of this stage is to determine a suitable uncertainty threshold

Fig. 2. An Overview of UATR Model. It consists of two phases. Bert pooled output is used to obtain prediction results in the first phase. The uncertainty is estimated through a model based on MC Dropout with UAL guidance as well. According to the distribution difference of samples predicted correctly and wrongly in the uncertainty interval, an uncertainty threshold suitable for the secondary correction is obtained. The samples with uncertainty greater than the threshold are sent to the second phase for prediction correction.

An Uncertainty Aware Two-Stage Refinement Model for TSA

419

for refinement prediction in the second stage. It is determined by the distribution differences between the correct samples and the wrong samples in the uncertainty interval. So wrong predictions in the first stage have a greater probability of being sent to the second correction. While most of the samples with correct predictions are retained with an uncertainty below the threshold, effectively avoiding a large number of correct predictions being incorrectly corrected. The refinement model of the second stage is based on a graph neural network. Compared with the model of the first stage, it focuses more on modeling the dependency between components of the sentence. Uncertainty Estimation Model Guided by UAL. The uncertainty estimation model guided by UAL is realized by a variational fully connected layer based on MC Ddropout in the first stage. The purpose of the variational fully connected layer is to transform the fully connected layer into a Bayesian-neuralnetwork-based layer. Given the data set D, the simple distribution qθ∗ (W) closest to the posterior distribution p(W | D) of the parameters is supposed to find. Multiple rounds of sampling are performed on the parameter distribution, and the average of the sampling results is used as the final prediction result of the model. Corresponding uncertainty can be obtained by calculating information entropy, as can be seen in (4). So the uncertainty of model outputs is modeled and samples with greater uncertainty are selected for correction. The sentence vector bert pool output of the input sentence is used as the input of the variational Dense layer. Keep the dropout settings of the training phase and testing phase exactly the same. The loss according to UAL, as can be seen in (7) is calculated and used to guide model training, so that model can predict correctly with small uncertainty and incorrectly with great uncertainty. When training is completed, the Bernoulli distribution closest to the posterior distribution of the parameters is obtained. In the testing phase, Monte Carlo sampling is performed on the Bernoulli distribution of parameters obtained by the training phase(take T times as an example). The final prediction result is obtained by averaging multiple sampling results on that distribution, as can be seen in (11). T 1 p (y∗ | Wj , x∗ ) qθ∗ (Wj ) (11) p (y∗ | x∗ , D) ≈ T j=1 Strategies for Choosing Uncertainty Threshold. In a certain training batch, the corresponding uncertainty value is obtained by the prediction result. The distribution statistic of samples with correct predictions and wrong predictions in uncertainty intervals can be realized by combining sentiment classification labels. The distribution difference between wrong predictions and correct predictions is calculated by using each candidate uncertainty threshold as lower bounds. The uncertainty value corresponding to the largest difference is taken as the uncertainty threshold to guide the second refinement prediction of that batch of samples. The threshold here is not fixed but dynamically changes in each training or testing batch. It is dynamically determined by the uncertainty distribution difference between wrong and correct samples predicted by the current

420

X. Guo et al.

model. Such an adaptive adjustment strategy for determining the uncertainty threshold can make the threshold more accurate, scientific, and targeted to guide the second correction in different batches. Figure 3 shows the difference in the distribution of correct predictions and incorrect predictions with uncertainty values between 0.7 and 0.9 as the threshold after a certain batch. Specifically, when the uncertainty threshold is selected as 0.83, samples with incorrect predictions have the largest difference with samples with correct predictions, in the uncertainty interval [0.83,1). In this case, taking the uncertainty threshold 0.83 as the lower bound, correctly predicted samples are sent to the secondary prediction with the smallest probability, while wrongly predicted samples are sent to the secondary prediction with the highest probability. So that model can achieve better results with the aid of the second refinement prediction.

Fig. 3. Distribution difference between wrong predictions and correct predictions for uncertainty ranging from 0.7 to 0.9

A similar work on refinement based on uncertainty is the UAN model proposed in paper [10] for sequence labeling task. In addition to different models and tasks, the key difference in the use of uncertainty between the two works is that the choice of uncertainty threshold in this paper is adaptively determined by the model during the training process, not determined by historical experience. Throughout the training process, uncertainty thresholds of different training batches in different training stages are all adaptive, not fixed. This is conducive to the more targeted use of the uncertainty threshold as the criteria for sample choosing at each batch. Modified Prediction Model Based on GNN. In the second stage of UATR, the graph neural network model based on attention mechanism (GAT) is chosen as the refinement model. GAT is a variant of the graph neural network, which uses the attention mechanism to aggregate neighbor nodes to realize the adaptive distribution of the weights of different nodes. And thereby greatly improves the expressive ability of the graph neural network model.

An Uncertainty Aware Two-Stage Refinement Model for TSA

421

The GAT network constructs an unlabeled grammar graph (such as a syntactic dependency tree) G = (V, A), V represents all word nodes, and A is the adjacency matrix that reflects the dependency relationship between words in a sentence. The principle of the adjacency matrix is as follows: 1. There is an interdependence among all the words constituting the aspect word, and the label of the corresponding position adjacency matrix is set to one. 2. According to the dependency analysis, the position label of the adjacency matrix pointed to by two sentence components with the dependency relationship is set to 1, regardless of its directivity, that is, we construct an undirected graph between word nodes. 3. In self-circulation, the dependence of the word pointing to itself is supposed to exist. So the label value of the diagonal of the adjacency matrix is set to one. The GAT layer is based on a transformer encoder as the basic framework and is mainly composed of a multi-head attention mechanism, a forward network, and a residual connection. On the basis of the transformer encoder, an adjacency matrix containing information about each word node and its semantic neighbor nodes is obtained according to the dependency relationship between the sentence components, so that it can act on the multi-head attention mechanism to explicitly represent the dependence between the target word and other words. Inspired by Bai et al. [1], we also choose to perform feature fusion between the graph encoding output obtained by the graph attention model and the Bert output to get richer features.
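A hedged sketch of the three adjacency-matrix rules listed above (aspect-word interconnection, undirected dependency edges, and self-loops on the diagonal); the dependency-parse input format is an assumption.

```python
import numpy as np

def build_adjacency(n_words, aspect_idx, dep_edges):
    """A: n x n matrix with 1s for (i) all pairs of aspect-word tokens,
    (ii) undirected dependency edges, and (iii) self-loops on the diagonal."""
    A = np.zeros((n_words, n_words), dtype=np.float32)
    for i in aspect_idx:                 # rule 1: aspect-word tokens depend on each other
        for j in aspect_idx:
            A[i, j] = 1.0
    for head, dep in dep_edges:          # rule 2: dependency relations, undirected
        A[head, dep] = A[dep, head] = 1.0
    np.fill_diagonal(A, 1.0)             # rule 3: self-loops
    return A
```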

4

Experiment

This paper focuses on the benchmark dataset SemEval14, which has a relatively small data scale and three sentiment polarity categories. In this paper, research is done on targeted text sentiment analysis based on model uncertainty. 4.1

Dataset

The dataset used in the experiment is the Laptop and Restaurant three classification datasets in SemEval141 , which are review data on computers and restaurants, respectively. The object of sentiment classification is the aspect words that appear in the review sentence. The ratio of positive and negative neutral samples is about 2:2:1 in the laptop and 3:1:1 in the restaurant. The imbalance of samples in the restaurant dataset is particularly obvious.

1

https://alt.qcri.org/semeval2014/task4/.

422

4.2

X. Guo et al.

Baseline Model

In the experiment, several classic baseline models are selected in targeted multiclassification sentiment analysis tasks, bert-based, and non-bert based, including the latest SOTA model RGAT in 2020 (without extra training data). For the fairness of results, alignment experiments on the above dataset for all baseline models are conducted. By changing the baseline model to the uncertainty model based on the Bayesian neural network and applying UAL to training, three sets of comparative experiments are performed for each model to investigate the effects of model uncertainty. The proportion of opposite polarity prediction opp is used as the evaluation of the effectiveness of the UAL method. Experiments are carried out on the datasets laptop and restaurant. The principle of each model is introduced as follows.: TD-LSTM* [23]: Respectively model the former context and latter context of a given target word (including the target word), then concatenate the hidden representation of the context, and judge by the softmax output. ATAE-LSTM* [25]: Learn the embedding representation of aspect words, use this representation and context words to do attention calculations to get the final weighted sum for sentiment classification; in the input part of the model, and combine aspect word embedding and context word embedding together. RAM* [4]: The method of multiple attention mechanism synthesizes the important features in the difficult sentence structure so that the distant information can also be contained. The result of multiple attention is combined with the GRU network, and different behaviors are inherited from the RNN. The results of attention are combined in a non-linear method. Besides, the memory module is added between the input and attention layer to store information. BERT-SPC [21]: It is a basic model for applying BERT to sentiment analysis problems. The process is relatively simple. First, the Bert pre-training model is used to encode the sentence and the target word, and then the vector bert pool output representing the sentence feature output by Bert is used for classification prediction. RGAT-BERT [1]: On the basis of the graph attention network, the relationship label is introduced to further model the dependency relationship between sentence components. The attention of relationship perception and the attention of word perception are merged, and finally, the graph encoding vector and sentence encoding vector are feature fused to obtain the final polarity label. In the above baseline models, the non-bert model (with * indicator) is based on AlexYangLi’s released code.2 4.3

Settings

In the experiment to verify the effectiveness of the UAL method, the setting of the comparison experiment of each baseline model variant is consistent with the baseline model. Regarding the setting of uncertainty-aware two-stage refinement model UATR, the following instructions are given: 2

https://github.com/AlexYangLi/ABSA Keras.

An Uncertainty Aware Two-Stage Refinement Model for TSA

423

The Bert pre-trained model used in the experiment selects a 12-layer basic model with an embedding dimension of 768. After word embedding, the dropout rate of the variational Dense layer is set to 0.1, which is consistent with the training phase. The Monte Carlo sampling number is set to 30. Optimizer is Adam optimizer, the initial learning rate is 2 ∗ 10−5 , and the regular term is 1 ∗ 10−5 . In the GAT part of the graph attention network, two transformer encoding layers are stacked, and 4-head is set in multi-head attention. The hardware environment that the experiment relies on includes a GTX 1080Ti GPU and an Ubuntu operating system. 4.4

Main Results

Tabel 2 shows the results of all baseline models with their variant models. Specifically, ori represents the baseline model itself, unc represents the uncertainty model variant without UAL guidance, and mix represents the uncertainty model variant with UAL guidance. The threshold t selected by the loss function in the UAL function is set to 0.01. Table 2. Preformance of all baseline’s variants on SemEval14, ori represents the baseline model itself, unc represents the uncertainty model variant without UAL guidance, and mix represents the uncertainty model variant with UAL guidance, opp represents the proportion of samples with opposite polarity predictions in the total predicted samples. laptop

restaurant

baseline

variant accuracy macro-F1 opp

TD-LSTM

ori unc mix

70.85 70.85 72.73

65.43 64.52 67.47

7.84 ori 7.68 unc 7.05 mix

79.11 79.55 79.64

68.51 68.68 69.78

6.70 5.80 5.89

ATAE-LSTM ori unc mix

71.47 68.81 71.32

66.28 62.55 66.42

6.43 ori 7.05 unc 6.43 mix

77.41 78.66 78.66

66.55 66.82 66.82

7.86 6.25 6.79

RAM

ori unc mix

72.41 72.57 72.73

67.32 66.59 67.73

6.27 ori 8.31 unc 6.11 mix

80.71 80.98 80.54

71.18 71.32 71.18

5.27 4.91 4.91

BERT-SPC

ori unc mix

77.43 77.90 78.95

74.03 73.15 75.02

4.08 ori 4.86 unc 4.08 mix

84.82 84.73 85.54

77.95 77.21 78.66

3.84 2.86 2.68

RGAT-BERT ori unc mix

79.17 78.55 78.06

74.55 73.47 74.49

3.45 ori 3.29 unc 2.82 mix

85.89 85.45 85.09

79.83 79.63 79.33

2.95 2.77 2.41

UATR

81.83

78.74

2.82 -

87.59

82.03

2.05

-

variant accuracy macro-F1 opp

424

X. Guo et al.

Table 3. Samples with opposite polarity predictions corrected by the RGAT-BERT mix model variant guided by UAL.

content                                          | label | RGAT-BERT ori | RGAT-BERT mix
The main draw of this place is the price         | 2     | 0             | 2
The decor is what initially got me in the door   | 2     | 0             | 2
Save room for deserts - they 're to die for      | 0     | 2             | 2

The multi-class accuracy, the macro-F1 value, and the proportion opp of samples with opposite polarity predictions among all predicted samples are used as evaluation metrics. Note that opp is computed over all predicted samples, not only over the misclassified ones; this makes the results of different model variants directly comparable. On the laptop dataset, the UATR model achieves the best accuracy and macro-F1, and its proportion of opposite polarity predictions opp is also the lowest. Compared with the SOTA model RGAT-BERT, UATR improves accuracy by 2.66% and macro-F1 by 4.19%. Since macro-F1 better reflects multi-class performance on imbalanced data, the larger gain in macro-F1 confirms the effectiveness of UATR in handling the imbalance problem. Similarly, on the restaurant dataset our model achieves the best results on all metrics, surpassing the SOTA model RGAT-BERT without introducing additional training data: accuracy increases by 1.7 points and macro-F1 by 2.2 points. The improvement is again more pronounced in macro-F1, which confirms the advantage of UATR in dealing with imbalanced category samples. In addition, the proportion of opposite polarity predictions opp on the restaurant dataset reaches a new minimum of 2.05. It can be inferred that the positive and negative samples of the restaurant dataset are more imbalanced, so UATR has more room to correct samples with opposite polarity predictions. Overall, with the combination of UAL and the Bayesian neural network proposed in this paper (the mix rows in the table), the change in the opposite polarity prediction ratio opp is consistent across all baseline models: the opp value of the mix variant is always less than or equal to that of the baseline model, whereas the uncertainty variant without UAL guidance does not have this property. Figure 4 visualizes the effect of training with the uncertainty-aware loss function UAL. Compared with the baseline models, the UAL-guided uncertainty variant mix achieves the smallest opp value on both datasets, i.e., the proportion of samples with opposite polarity predictions among all predicted samples is the lowest.
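To make the definition of opp concrete, the short sketch below computes it as described here: opposite predictions divided by the total number of predictions, not by the number of errors. The label encoding (0 = negative, 1 = neutral, 2 = positive) is an assumption inferred from the examples in this section.

```python
import numpy as np

def opposite_polarity_rate(y_true, y_pred, pos=2, neg=0):
    """Proportion of ALL predicted samples whose predicted polarity is the
    opposite of the gold polarity (positive predicted as negative or vice
    versa); neutral predictions never count as 'opposite'."""
    y_true, y_pred = np.asarray(y_true), np.asarray(y_pred)
    opposite = ((y_true == pos) & (y_pred == neg)) | ((y_true == neg) & (y_pred == pos))
    return opposite.sum() / len(y_true)  # divide by total predictions, not by errors
```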



(a) Opp comparison chart of baseline model and mix variant model on laptop. (b) Opp comparison chart of baseline model and mix variant model on restaurant.

Fig. 4. A comparison chart of the opposite polarity prediction between the baseline model and the UAL guided variant model mix.

Take the performance of the SOTA model RGAT-BERT on the restaurant dataset as an example. Among the predictions of RGAT-BERT, 33 samples receive a completely opposite polarity prediction. After prediction with the UAL-guided uncertainty variant, 8 of these samples are predicted correctly; some of them are shown in Table 3. For 5 further samples the error span between the predicted polarity and the label polarity is reduced to 1, and the total number of samples with opposite polarity predictions drops to 27.

Table 4. UATR model instance description.

sample 194: the wait staff was loud and inconsiderate.
label | pred1 | uncertainty | threshold | whether to refine | pred2 | final pred | changes
0     | 0     | 0.229       | 0.4       | no                | 2     | 0          | \

sample 264: my favs here are the tacos pastor and the tostada de tinga...
label | pred1 | uncertainty | threshold | whether to refine | pred2 | final pred | changes
2     | 2     | 0.433       | 0.5       | no                | 1     | 2          | \

sample 215: Service is known for bending over backwards to make everyone happy.
label | pred1 | uncertainty | threshold | whether to refine | pred2 | final pred | changes
2     | 0     | 0.633       | 0.35      | yes               | 2     | 2          | ✕→✓

sample 262: food portion was small and below average.
label | pred1 | uncertainty | threshold | whether to refine | pred2 | final pred | changes
0     | 1     | 0.845       | 0.5       | yes               | 0     | 0          | ✕→✓

Table 4 illustrates the refinement process of the uncertainty-aware two-stage refinement model UATR with concrete examples.



The label column gives the gold sentiment polarity of the aspect word in the example sentence, and pred1 is the sentiment prediction obtained by the UAL-guided uncertainty estimation model in the first stage. Uncertainty is the uncertainty value associated with that result, and threshold is the current uncertainty threshold obtained in the batch. Whether to refine indicates whether the second-stage correction is required according to the current uncertainty threshold. Pred2 is the prediction of the second-stage refinement model, final pred is the final prediction of the two-stage refinement model UATR, and changes records the change from the first-stage prediction to the final prediction. (The bold words in the table correspond to the aspect words in the sentences.)
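The decision rule illustrated in Table 4 can be summarized in a few lines of Python; the function and argument names below are ours, not from the paper, and `second_stage_model` stands in for the GAT-based refinement model.

```python
def two_stage_refine(pred1, uncertainty, threshold, second_stage_model, sample):
    """Sketch of the UATR two-stage decision: if the first-stage prediction is
    too uncertain (uncertainty above the batch-level threshold), replace it
    with the second-stage refinement prediction; otherwise keep it."""
    if uncertainty > threshold:          # "whether to refine" = yes
        pred2 = second_stage_model(sample)
        return pred2                     # final prediction comes from stage 2
    return pred1                         # final prediction keeps the stage-1 result
```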

5 Analysis

5.1 The Distribution of Misclassified Samples

Figure 5(a) and Fig. 5(b) compare, in the form of line graphs, the cumulative distribution of misclassified samples of the RGAT-BERT model and the UATR model over the uncertainty intervals; Fig. 5(a) is for the laptop dataset and Fig. 5(b) for the restaurant dataset. Compared with RGAT-BERT, the curve of the UATR model consistently shows a lower starting point, a gentle start, and a steep increase only in the later stages. In addition, the number of misclassified samples of UATR is lower than that of RGAT-BERT over the entire uncertainty interval [0, 1), which provides an uncertainty explanation for the model predictions and confirms the effectiveness of UATR on the fine-grained multi-classification task.

(a) Comparison of the cumulative distribution of misclassified samples of RGAT-BERT and UATR on the laptop dataset. (b) Comparison of the cumulative distribution of misclassified samples of RGAT-BERT and UATR on the restaurant dataset.

Fig. 5. A Comparison of the Cumulative Distribution of Misclassified Samples of RGAT-BERT and UATR.



Table 5. Ablation study of UAL and UATR.

model     | laptop                  | restaurant
          | accuracy   macro-F1     | accuracy   macro-F1
STAGE1    | 78.95      75.02        | 85.54      78.66
STAGE2    | 78.75      74.45        | 84.91      77.74
RGAT-BERT | 79.17      74.55        | 85.89      79.83
UATR−UAL  | 81.39      78.34        | 86.25      79.77
UATR      | 81.83      78.74        | 87.59      82.03

5.2 Ablation Study

In this part, the effectiveness of the UAL method and of the UATR two-stage refinement method are investigated separately. Five comparative settings are used: the first-stage uncertainty estimation model (denoted STAGE1), the GAT-based second-stage correction model of UATR (denoted STAGE2), the current SOTA model RGAT-BERT, the UATR model without UAL guidance (UATR−UAL), and the UATR model with UAL guidance. Table 5 reports the performance of the five models on the laptop and restaurant datasets. The UATR−UAL model without UAL guidance is second only to the full UATR model and also exceeds the current SOTA model RGAT-BERT. It follows that the uncertainty-aware two-stage refinement method proposed in this paper is reasonable: it combines the uncertainty interpretation with the ability of the graph attention network to capture the dependencies between components in a sentence and, while keeping the overall complexity of the model under control, further improves the classification accuracy and the macro-F1 value. Comparing UATR−UAL and UATR on the two datasets, the UAL-guided UATR performs better, which demonstrates the effectiveness of the UAL method. Moreover, compared with the laptop dataset, the improvement of the UAL-guided UATR over UATR−UAL is more significant on the restaurant dataset, indicating that UAL is more effective than the standard loss function for datasets with imbalanced categories.

6 Conclusion

In this paper, the use of model uncertainty to improve fine-grained sentiment classification is explored. The work addresses the common phenomenon that a considerable proportion of prediction samples in multi-classification tasks receive the opposite polarity, and introduces model uncertainty to tackle it, providing a new research direction for targeted sentiment analysis tasks.



Uncertainty is modeled with a Bayesian neural network, integrating multiple sets of neural network predictions to weaken the impact of the small data size and the imbalanced category distribution. On this basis, reasonable and effective strategies and models are designed to deal with uncertainty, providing a reliable uncertainty interpretation of the model predictions while improving the overall classification accuracy. The experiments show that UATR achieves better results than the current SOTA model.

References

1. Bai, X., Liu, P., Zhang, Y.: Investigating typed syntactic dependencies for targeted sentiment classification using graph attention neural network. IEEE/ACM Trans. Audio Speech Lang. Process. 29, 503–514 (2021)
2. Baziotis, C., Pelekis, N., Doulkeridis, C.: DataStories at SemEval-2017 task 4: deep LSTM with attention for message-level and topic-based sentiment analysis. In: SemEval-2017, pp. 747–754 (2017)
3. Blundell, C., Cornebise, J., Kavukcuoglu, K., Wierstra, D.: Weight uncertainty in neural network. In: International Conference on Machine Learning, pp. 1613–1622. PMLR (2015)
4. Chen, P., Sun, Z., Bing, L., Yang, W.: Recurrent attention network on memory for aspect sentiment analysis. In: Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, pp. 452–461 (2017)
5. Devlin, J., Chang, M., Lee, K., Toutanova, K.: BERT: pre-training of deep bidirectional transformers for language understanding. In: NAACL-HLT 2019, MN, USA, 2–7 June 2019, Volume 1 (Long and Short Papers), pp. 4171–4186 (2019)
6. Fan, Z., Wu, Z., Dai, X., Huang, S., Chen, J.: Target-oriented opinion words extraction with target-fused neural sequence labeling. In: Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, vol. 1, pp. 2509–2518 (2019)
7. Gal, Y.: Uncertainty in Deep Learning. Ph.D. thesis, University of Cambridge (2016)
8. Gal, Y., Ghahramani, Z.: Bayesian convolutional neural networks with Bernoulli approximate variational inference. arXiv preprint arXiv:1506.02158 (2015)
9. Goan, E., Fookes, C.: Bayesian neural networks: an introduction and survey. In: Mengersen, K.L., Pudlo, P., Robert, C.P. (eds.) Case Studies in Applied Bayesian Data Science. LNM, vol. 2259, pp. 45–87. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-42553-1_3
10. Gui, T., et al.: Uncertainty-aware label refinement for sequence labeling. In: Webber, B., Cohn, T., He, Y., Liu, Y. (eds.) EMNLP 2020, Online, 16–20 November 2020, pp. 2316–2326 (2020)
11. Huang, B., Carley, K.M.: Syntax-aware aspect level sentiment classification with graph attention networks. In: EMNLP-IJCNLP 2019, Hong Kong, China, 3–7 November 2019, pp. 5468–5476 (2019)
12. Jordan, M.I., Ghahramani, Z., Jaakkola, T.S., Saul, L.K.: An introduction to variational methods for graphical models. Mach. Learn. 37(2), 183–233 (1999)
13. Li, X., Bing, L., Zhang, W., Lam, W.: Exploiting BERT for end-to-end aspect-based sentiment analysis. In: W-NUT@EMNLP 2019, Hong Kong, China, 4 November 2019, pp. 34–41. Association for Computational Linguistics (2019)



14. Ma, D., Li, S., Zhang, X., Wang, H.: Interactive attention networks for aspect-level sentiment classification. In: Sierra, C. (ed.) IJCAI 2017, Melbourne, Australia, 19–25 August 2017, pp. 4068–4074 (2017)
15. Mikolov, T., Chen, K., Corrado, G., Dean, J.: Efficient estimation of word representations in vector space. In: ICLR 2013, Scottsdale, Arizona, USA, 2–4 May 2013 (2013)
16. Mitchell, M., Aguilar, J., Wilson, T., Van Durme, B.: Open domain targeted sentiment. In: EMNLP 2013, pp. 1643–1654 (2013)
17. Pennington, J., Socher, R., Manning, C.D.: GloVe: global vectors for word representation. In: EMNLP 2014, pp. 1532–1543 (2014)
18. Pontiki, M., Galanis, D., Pavlopoulos, J., Papageorgiou, H., Androutsopoulos, I., Manandhar, S.: SemEval-2014 task 4: aspect based sentiment analysis. In: SemEval 2014, pp. 27–35. Association for Computational Linguistics, Dublin, Ireland (2014)
19. Scarselli, F., Gori, M., Tsoi, A.C., Hagenbuchner, M., Monfardini, G.: The graph neural network model. IEEE Trans. Neural Netw. 20(1), 61–80 (2008)
20. Shapiro, A.: Monte Carlo sampling methods. Handbooks Oper. Res. Management Sci. 10, 353–425 (2003)
21. Song, Y., Wang, J., Jiang, T., Liu, Z., Rao, Y.: Attentional encoder network for targeted sentiment classification. arXiv preprint arXiv:1902.09314 (2019)
22. Sun, K., Zhang, R., Mensah, S., Liu, X.: Aspect-level sentiment analysis via convolution over dependency tree. In: EMNLP-IJCNLP 2019, pp. 5683–5692 (2019)
23. Tang, D., Qin, B., Feng, X., Liu, T.: Effective LSTMs for target-dependent sentiment classification. In: Calzolari, N., Matsumoto, Y., Prasad, R. (eds.) COLING 2016, 11–16 December 2016, Osaka, Japan, pp. 3298–3307 (2016)
24. Velickovic, P., Cucurull, G., Casanova, A., Romero, A., Liò, P., Bengio, Y.: Graph attention networks. In: ICLR 2018, Vancouver, BC, Canada (2018)
25. Wang, Y., Huang, M., Zhu, X., Zhao, L.: Attention-based LSTM for aspect-level sentiment classification. In: EMNLP 2016, pp. 606–615 (2016)
26. Zhu, X., Sobihani, P., Guo, H.: Long short-term memory over recursive structures. In: International Conference on Machine Learning, pp. 1604–1612. PMLR (2015)

AttIN: Paying More Attention to Neighborhood Information for Entity Typing in Knowledge Graphs

Yingtao Wu, Weiwen Zhang(B), Hongbin Zhang(B), Huanlei Chen, and Lianglun Cheng

School of Computer Science and Technology, Guangdong University of Technology, Guangzhou, China
{zhangww,llcheng}@gdut.edu.cn, [email protected]

Abstract. Entity types in knowledge graph (KG) have been employed extensively in downstream tasks of natural language processing (NLP). Currently, knowledge graph entity typing is usually inferred by embeddings, but a single embedding approach ignores interactions between neighbor entities and relations. In this paper, we propose an AttIN model that pays more attention to entity neighborhood information. More specifically, AttIN contains three independent inference modules, including a BERT module that uses the target entity neighbor to infer the entity type individually, a context transformer that aggregates information based on different contributions from the neighbor, and an interaction information aggregator (IIAgg) module that aggregates the entity neighborhood information into a long sequence. In addition, we use exponentially weighted pooling to process these predictions. Experiments on the FB15kET and YAGO43kET datasets show that AttIN outperforms existing competitive baselines while it does not need extra semantic information in the sparse knowledge graph.

Keywords: Knowledge Graphs · Entity Typing · Information Aggregation

1 Introduction

Knowledge graph (KG) is a semantic network that displays the relations between entities. Generally, a complete KG is composed of a collection of (h, r, t) triples [1]. In the real world, the information in a KG is often incomplete [2]. There has been a lot of research on the completion of entity relations [3,4]; nevertheless, there are relatively few studies on entity type completion. Entity type information in the KG has been used in a variety of natural language processing (NLP) tasks, including entity alignment [5,6], text generation [7], entity linking [8,9] and relation extraction [10].



At present, three groups of approaches exist for the knowledge graph entity typing (KGET) task. First, as a representative of the embedding-based methods, CET [11] designs two inference mechanisms for entity type prediction, but it ignores the differences between neighbors. Second, GCN-based methods mainly infer possible entity types through the rich semantics of entity neighbors in graph convolutional networks [12]; however, they are limited by the graph structure, so the neighbors of the entity are underutilized. Third, in the category of transformer-based methods, the TET model [13] uses three different transformers to predict entity types; in sparser knowledge graphs (such as the YAGO43kET dataset), TET enhances the input relations with additional semantic information, which reduces the generality of the model. A further problem shared by these three approaches is that they do not consider enough aspects of the neighbor information and do not sufficiently exploit the known information, especially in sparse knowledge graphs. To deal with this issue, we propose an innovative model called AttIN. The inference part of AttIN consists of an independently encoding BERT module, a context transformer and an interactive information aggregator (IIAgg). The first inference module is a separate encoding module based on BERT, which encodes each neighbor of the target entity individually through a pre-trained model; it makes efficient use of the known type and relation information. The second inference module aggregates neighbors via a context transformer, which weighs the encoded neighbors according to their contributions. The third inference module is IIAgg, which aggregates the encoded neighborhood information into a long sequence, so that all known information can be considered when inferring the entity type. The three inference modules output multiple predicted types after a shared linear layer, which are then aggregated by exponentially weighted pooling. We summarize the contributions of this work in the following three points:

– We propose the AttIN model consisting of three inference modules to infer the missing entity types in the knowledge graph, which pays more attention to the neighborhood information of the target entity.
– AttIN can fully consider type neighbors and relational neighbors in the sparse knowledge graph.
– Experiments on two real-world datasets show that AttIN is superior to the existing competitive baselines.

The remainder of this paper is organized as follows. In Sect. 2, we describe the work related to existing KGET methodology. In Sect. 3 we introduce AttIN in detail. In Sect. 4, we conduct experiments to demonstrate the validity and excellence of AttIN. Finally, we conclude our work and outline our future work plans in Sect. 5.

2 Related Work

The purpose of knowledge graph completion (KGC) is usually to fill in the missing entities or relations in the KG; consequently, KGET can be viewed as a KGC subtask [13].



Existing approaches for the KGET task can be divided into three types: embedding-based, GCN-based, and transformer-based.

2.1 Embedding-Based Methods

Knowledge graph embedding methods aim to acquire entity embeddings and type embeddings [14]. Moon et al. [2] treated entities and types in the KG as a special kind of triple whose head is the target entity, whose tail is the entity type, and whose relation is uniformly defined as "has type". To this end, they proposed the ETE model [15], which uses CONTE to obtain entity and relation embeddings in the KG and completes the triple (entity, has type, ?) to predict missing entity types. Zhao et al. [16] improved upon ETE by embedding entities and types in separate spaces and subsequently utilizing two distinct inference mechanisms to predict entity types. Ge et al. [17] proposed the CORE model, which embeds entities and types in the KG into two distinct spaces. Pan et al. [11] developed a model called CET, which utilizes two distinct inference mechanisms to address the issue of disregarding known entity types. Zhuo et al. [18] proposed the AttEt model, which employs an attention mechanism to aggregate the neighborhood information of target entities. In these methods, there is only a single relation, namely "has type", between the target entity and the known type; this singularity leaves the neighborhood information underutilized during training.

2.2 GCN-Based Methods

Graph convolutional networks (GCNs) have been widely utilized in the modeling of graph structures [19], but directly applying GCNs to a KG produces poor results. To address this issue, Schlichtkrull et al. [12] extended GCNs to relational graphs and designed the R-GCN model, which uses relation-specific filters in the aggregation process. Jin et al. [20] proposed a model named HMGCN that is capable of embedding multiple semantic correlations between entities. Vashishth et al. [21] performed multi-label classification on the entities obtained by GCN embedding and then inferred the missing types of the target entity. Zhao et al. [22] learned heterogeneous relational graphs via a multi-graph attention network and then used the ConnectE method to predict types. Zou et al. [23] used a relational aggregation graph attention network to predict entity types. However, some neighborhood information can negatively affect entity type prediction.

2.3 Transformer-Based Methods

TET [13] is an entity typing model consisting of three transformer modules. The first module encodes each neighbor of an entity independently. The second module is a context transformer that aggregates the neighbors of an entity according to their contributions. The third module uses a global transformer to aggregate all neighbors into a long sequence.



To increase the difference between target entity types, TET also designs clustering information. However, TET requires additional semantic information to infer the types of entities in sparse knowledge graphs, which severely limits its versatility. To reduce the noise caused by irrelevant neighborhood information, our model uses a BERT-based inference module to encode each neighbor of the target entity separately. To avoid requiring additional semantics on sparse knowledge graphs, we propose a more efficient interaction information aggregator, IIAgg, which pays more attention to the type neighbors and relational neighbors of the target entity.

3 Our Method

In this section, we introduce AttIN. As shown in Fig. 1, AttIN consists of three independent type prediction modules, a linear layer and a pooling module. In the first step, BERT is used to encode each type neighbor and relational neighbor of the target entity; the encoded results are sent to the remaining two inference modules. The context transformer module aggregates the encoded neighbors based on their contributions, while the IIAgg module is an interaction information aggregator that aggregates the encoded neighborhood information into a long sequence. The Gelu function is used to activate the outputs of the three inference modules, which are then sent to a linear layer to obtain multiple predicted type embeddings; these are aggregated through exponentially weighted pooling to obtain the final prediction. The three inference modules of AttIN consider the known information from three different perspectives, which allows the neighbor information to be fully utilized. In Sect. 3.1 we introduce the notations used in AttIN, and in Sect. 3.2 we describe the three embedding modules. Finally, we explain the pooling method and the optimization strategy in Sect. 3.3 and Sect. 3.4.

Fig. 1. The overall architecture of AttIN. The Gelu function is used as a nonlinear activation function before the linear layer to process the output of its predecessor.


3.1 Notations

In our work, a knowledge graph (KG) G can be regarded as a tuple (E, R, C, T) [24], where E is the collection of entities in the KG, R the collection of relations, C the collection of types of all entities, and T the collection of all triples. The triples in T fall into two categories. The first is the relational triple (h, r, t), where h, t ∈ E are the head and tail entities and r ∈ R is the edge joining h and t. The second is the entity type triple (e, has type, c), where e ∈ E, c ∈ C, and has type denotes the connection between the target entity and the type. For every e ∈ E, the neighbors of e are defined as the union of its relational neighbors and type neighbors, where the relational neighbors form the collection {(r, f) | (e, r, f) ∈ T} and the type neighbors form the collection {(has type, c) | (e, has type, c) ∈ T}. The objective of our work is to complete the missing type c of entity type triples in the KG. In addition, all entities and their types are uniformly connected in the KG by a single relation, has type. The singularity of this relation makes it impossible to capture the differences between the different entity types of an entity. To address this issue, this work constructs the inputs for type neighbors based on the clustering information of the TET model [13]. Specifically, the neighborhood information of the entity type is used to replace the single relation (has type), for example using musical to replace the has type relation between the target entity and the type songwriters.
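A minimal sketch of how these neighbor collections can be materialized from the two kinds of triples is shown below; the input formats (plain tuples, with a cluster label replacing the single has type relation) are assumptions for illustration only.

```python
from collections import defaultdict

def build_neighbors(relational_triples, type_triples):
    """Collect, for every entity e, its relational neighbors {(r, f)} and its
    type neighbors {(cluster, c)} as described in Sect. 3.1."""
    rel_nb, type_nb = defaultdict(set), defaultdict(set)
    for h, r, t in relational_triples:
        rel_nb[h].add((r, t))
    for e, cluster, c in type_triples:
        type_nb[e].add((cluster, c))   # cluster label replaces the single has-type relation
    return rel_nb, type_nb
```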

3.2 Model Structure

BERT for Independent Encoding. The neighbors of the target entity can be useful when inferring the entity type. For example, from the type triple (Tina Arena, musical, songwriters) one can reasonably infer that Tina Arena has the type musicians; likewise, the relational triple (Tina Arena, wasBornIn, Melbourne) suggests that Tina Arena has the type Australian. For each type neighbor of the target entity, we build a sequence H = ([CLS], r_c, c) as input to BERT [25], where [CLS] is a special token that aggregates the representation of the entire sequence. Each element h_i in H is constructed as an input vector representation as follows:

h_i = h_i^{word} + h_i^{pos}        (1)

where h_i^{word} and h_i^{pos} are the randomly initialized word embedding and position embedding of the type neighbor. The resulting h_i is used as the input of BERT to reflect the interaction between the entity and its type. The output embedding of BERT is denoted H^{cls} ∈ R^{d×1}, where d is the embedding dimension. Each type neighbor of the target entity corresponds to one output embedding, so a target entity with n entity types is represented as {H_1^{cls}, H_2^{cls}, ..., H_n^{cls}}.



Similarly, each relational neighbor of the target entity is constructed as an input sequence Q = ([CLS], r, f), and the output embedding is denoted Q^{cls} ∈ R^{d×1}; a target entity with m relational neighbors is represented as {Q_1^{cls}, Q_2^{cls}, ..., Q_m^{cls}}. The BERT embedding module mainly focuses on each existing neighbor of the target entity, so as to decrease the interference of unrelated neighbors on the inference result. Finally, we apply the Gelu function to H^{cls} and Q^{cls} for non-linear activation and feed the result into the linear layer:

S_i^{cls} = W Gelu(H_i^{cls}) + b        (2)

S_j^{cls} = W Gelu(Q_j^{cls}) + b        (3)

where W ∈ R^{L×d} and b ∈ R^{L} are trainable parameters and L is the total number of types. H_i^{cls} ∈ R^{d×1} and Q_j^{cls} ∈ R^{d×1} are the embeddings of the i-th type neighbor and the j-th relational neighbor, and S_i^{cls} and S_j^{cls} are their final output scores.

Context Transformer. Entities in a knowledge graph sometimes have composite types, and the contribution of each piece of neighborhood information to type inference differs. To differentiate the neighbors of an entity, we use a context transformer [26] to aggregate them. Specifically, the context transformer receives the H^{cls} embeddings, the Q^{cls} embeddings and the [CLS] embedding, and contextualizes the target entity with all neighborhood information to obtain the embedding C^{cls}. The obtained C^{cls} is then non-linearly activated with the Gelu function and sent to the linear layer:

S^{ctx} = W Gelu(C^{cls}) + b        (4)

where S^{ctx} is the final score of the correlation between the target entity and all types.

IIAgg. It is difficult to infer a fine-grained entity type from a single neighbor. In particular, in a knowledge graph with sparse relational neighbors such as YAGO43kET, the context transformer easily ignores the influence of relational neighbors during training. Taking inspiration from Agg2T [11], we propose an interaction information aggregator, IIAgg, to infer the missing types. IIAgg aggregates the neighborhood information of the target entity equally, so it ensures that both relational neighbors and type neighbors contribute to entity type inference. Finally, the aggregation result of IIAgg is activated by the Gelu function and sent to the linear layer:

h_u = (1/(n+m)) Σ ( H_1^{cls}, ..., H_n^{cls}, Q_1^{cls}, ..., Q_m^{cls} )        (5)

S_u^{IIAgg} = W Gelu(h_u) + b        (6)



where h_u is the average aggregation of all neighbors of the target entity u, and S_u^{IIAgg} ∈ R^{L} is the prediction result, i.e., the score of the correlation between the target entity and all types.
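A minimal sketch of Eqs. (5)–(6) is given below, assuming the neighbor embeddings are available as lists of d-dimensional tensors and that the shared linear layer is a `torch.nn.Linear(d, L)`; names and shapes are assumptions.

```python
import torch

def iiagg(h_cls_list, q_cls_list, linear):
    """Average all type-neighbor embeddings H^cls and relational-neighbor
    embeddings Q^cls into h_u, then score all L types with the shared linear
    layer after a GELU activation."""
    all_neighbors = torch.stack(h_cls_list + q_cls_list, dim=0)  # (n+m, d)
    h_u = all_neighbors.mean(dim=0)                              # Eq. (5): equal aggregation
    return linear(torch.nn.functional.gelu(h_u))                 # Eq. (6): S_u^IIAgg
```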

3.3 Pooling Method

The modules above produce multiple predictions for the target entity. The BERT module infers the entity type individually from each neighbor, so for a target entity with m relational neighbors and n type neighbors, its output after the linear layer yields m + n entity type predictions. Combining these with the predictions of the context transformer and IIAgg gives m + n + 2 embeddings. We denote these type predictions by S_i, where i indexes the i-th prediction. We use exponentially weighted pooling [27] to aggregate them:

S_e = pool([S_0, S_1, ..., S_{m+n}, S_{m+n+1}]) = Σ_{i=0}^{m+n+1} w_i S_i        (7)

w_i = exp(α S_i) / Σ_{j=0}^{m+n+1} exp(α S_j)        (8)

where α is a hyperparameter used to control the temperature of the pooling. After obtaining S_e, we use the sigmoid function to map it to scores between 0 and 1:

s_{e,k} = σ(S_e)        (9)

where s_{e,k} is the correlation score between the target entity e and the k-th type: the greater s_{e,k}, the more likely the target entity e has type k.
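The sketch below shows one possible reading of Eqs. (7)–(8): a softmax over the stacked per-module predictions with temperature α, applied per type column. The tensor layout and the exact weighting are assumptions based on the reconstruction above, not a verbatim transcription of the authors' code.

```python
import torch

def exp_weighted_pool(preds, alpha=0.5):
    """`preds` stacks all per-module type predictions with shape (m+n+2, L);
    alpha is the pooling temperature. Returns the aggregated score vector S_e."""
    weights = torch.softmax(alpha * preds, dim=0)   # w_i, computed per type column
    return (weights * preds).sum(dim=0)             # S_e = sum_i w_i * S_i
```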

3.4 Optimization Approach

During the training of AttIN, a positive sample score s_{e,k} indicates that the target entity e has type k in the KG, while a negative sample score indicates that it does not. Using the plain binary cross-entropy (BCE) loss for AttIN suffers from a serious false negative problem [11]: some triples (e, has type, k) are valid but simply absent from the existing KG. A false-negative-aware loss function [11] is therefore used to alleviate this problem. It is defined as:

L = − Σ_{(e,k)∉G} β (s_{e,k} − (s_{e,k})²) log(1 − s_{e,k}) − Σ_{(e,k)∈G} log(s_{e,k})        (10)

where β is a hyperparameter that controls the total weight of the negative samples; when the correlation score of a negative sample is too large or too small, its weight is reduced.
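A minimal PyTorch-style sketch of Eq. (10), assuming the sigmoid scores and a 0/1 type-membership mask are available as dense tensors (the data layout is our assumption):

```python
import torch

def false_negative_aware_loss(scores, labels, beta=1.0, eps=1e-12):
    """`scores` holds sigmoid outputs s_{e,k} with shape (num_entities, num_types);
    `labels` is a 0/1 mask of the same shape marking which (e, k) pairs exist in
    the KG. `beta` weights the negative-sample term of Eq. (10)."""
    s = scores.clamp(eps, 1.0 - eps)
    pos_term = -(labels * s.log()).sum()                   # -sum_{(e,k) in G} log s
    neg_weight = beta * (s - s ** 2)                       # down-weights confident negatives
    neg_term = -((1 - labels) * neg_weight * (1 - s).log()).sum()
    return pos_term + neg_term
```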


4 Experiments

In this section, we perform experiments on the FB15kET and YAGO43kET knowledge graph datasets to verify the effectiveness of AttIN.

4.1 Datasets

We evaluate AttIN on the FB15kET and YAGO43kET datasets. Moon et al. [15] mapped entities in subgraphs of Freebase [28] and YAGO [29] to the corresponding entity types, thereby constructing these two datasets. We partition the data in the same way as the baselines. Table 1 displays the basic statistics of the datasets.

Table 1. Basic statistics for all datasets.

Dataset          | FB15kET | YAGO43kET
#Entities        | 14951   | 42334
#Relations       | 1345    | 37
#Types           | 3584    | 45182
#Clusters        | 1081    | 6583
#Train. triples  | 483142  | 331686
#Train. tuples   | 136618  | 375853
#Valid           | 15848   | 3111
#Test            | 15847   | 43119

4.2 Experiment Setup

Evaluation Protocol. For each test sample we calculate the relevance scores between the target entity e and all types, rank these scores from largest to smallest, and finally remove the known types of the target entity e from the ranking. As evaluation metrics we use the mean reciprocal rank (MRR) and Hits@1/3/10.

Parameter Settings. AttIN is trained on random mini-batches of the dataset with Adam [30] as the optimizer. The batch size is fixed at 128, and the model is evaluated on the test set every 25 epochs for at most 1000 epochs. After extensive experiments, the best hyperparameters are: learning rate 0.001, pooling temperature 0.5, and embedding dimension 100.
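For clarity, the filtered ranking protocol described above can be sketched as follows; the data layout (a score vector per test pair, a gold type id, and a set of already-known type ids) is our assumption.

```python
import numpy as np

def filtered_rank_metrics(score_rows, gold_types, known_types, ks=(1, 3, 10)):
    """For each test pair, rank all types by score, drop the entity's other
    known types, and record the rank of the gold type. Returns MRR and Hits@k."""
    ranks = []
    for scores, gold, known in zip(score_rows, gold_types, known_types):
        order = np.argsort(-np.asarray(scores))                 # best first
        filtered = [t for t in order if t == gold or t not in known]
        ranks.append(filtered.index(gold) + 1)
    ranks = np.asarray(ranks, dtype=float)
    metrics = {"MRR": float((1.0 / ranks).mean())}
    for k in ks:
        metrics[f"Hits@{k}"] = float((ranks <= k).mean())
    return metrics
```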



Baselines. AttIN is compared with eight state-of-the-art models, which fall into three main categories. The first category comprises the embedding-based models ETE [15], ConnectE [16], CET [11] and AttEt [18]. The second category comprises the GCN-based models CompGCN [21], RGCN [12] and RACE2T [23]. The third category is the transformer-based model TET [13].

Table 2. Evaluation of individual models on the FB15kET and YAGO43kET datasets.

Model     | FB15kET                        | YAGO43kET
          | MRR    Hit@1   Hit@3   Hit@10  | MRR    Hit@1   Hit@3   Hit@10
ETE       | 0.500  0.385   0.553   0.719   | 0.230  0.137   0.263   0.422
ConnectE  | 0.590  0.496   0.643   0.799   | 0.280  0.160   0.309   0.479
CET       | 0.697  0.613   0.745   0.856   | 0.503  0.398   0.567   0.696
AttEt     | 0.620  0.517   0.677   0.821   | 0.350  0.244   0.413   0.565
CompGCN   | 0.657  0.568   0.704   0.833   | 0.357  0.274   0.384   0.520
RGCN      | 0.662  0.571   0.711   0.836   | 0.357  0.266   0.392   0.533
RACE2T    | 0.640  0.561   0.689   0.817   | 0.340  0.248   0.376   0.523
TET       | 0.717  0.638   0.762   0.872   | 0.510  0.408   0.571   0.695
AttIN     | 0.721  0.646   0.762   0.868   | 0.527  0.423   0.591   0.716

4.3 Performance Comparison

Table 2 shows the type prediction results on the FB15kET and YAGO43kET datasets. Our proposed AttIN is better than the baselines on almost all metrics, which shows its superiority. Specifically, AttIN integrates three inference modules that together better distinguish the contributions of the different neighbors of the target entity, thereby reducing the negative influence of neighbors whose contributions to type prediction differ. Compared with the most competitive baseline TET, the MRR on FB15kET and YAGO43kET increases by 0.6% and 3.3%, and the Hit@1 and Hit@3 results also improve. The Hit@10 of TET on the FB15kET dataset is slightly better than that of AttIN; this is because TET uses three transformers, which encode neighborhood information at three different granularities. In AttIN, we instead use the IIAgg inference module, which focuses more on type neighbors and relational neighbors and aggregates all neighbors of the target entity equally into a long sequence, enabling the model to better aggregate the interactions between neighbors. In addition, on the sparse knowledge graph YAGO43kET, the TET model needs additional semantic information to enrich the relational neighbors and alleviate the problem of there being too few of them, whereas AttIN obtains better predictions in sparse knowledge graphs without additional semantic information.


4.4 Ablation Study

We conduct an ablation study on the FB15kET and YAGO43kET datasets to investigate the impact of the interaction information aggregator IIAgg when aggregating different kinds of neighborhood information. The experimental results are shown in Table 3. It can be seen that aggregating both type neighbors and relational neighbors (first row) is superior to the other ablation variants on all evaluation metrics.

Table 3. Results of ablation experiments with different combinations of the IIAgg module (✓ marks the configuration used in each row).

Exclude-IIAgg | Type | Relation | FB15kET                        | YAGO43kET
              |      |          | MRR    Hit@1   Hit@3   Hit@10  | MRR    Hit@1   Hit@3   Hit@10
              | ✓    | ✓        | 0.721  0.646   0.762   0.868   | 0.527  0.423   0.591   0.716
✓             |      |          | 0.713  0.633   0.760   0.867   | 0.512  0.405   0.577   0.706
              | ✓    |          | 0.718  0.642   0.761   0.869   | 0.523  0.419   0.588   0.711
              |      | ✓        | 0.716  0.638   0.760   0.869   | 0.509  0.410   0.570   0.684

We find that aggregating type neighbors or relational neighbors alone already yields considerable prediction results on the FB15kET dataset. On the YAGO43kET dataset, the ablation results show that aggregating both kinds of neighborhood information improves the results more than on FB15kET. The reason is that the context transformer aggregates neighborhood information according to the contribution of each neighbor of the target entity; in a knowledge graph where relational neighbors are far fewer than type neighbors, the context transformer is likely to ignore the contribution of the relational neighbors during training. IIAgg aggregates the neighborhood information into one sequence, ensuring that all neighborhood information of the target entity is taken into account in type prediction. Furthermore, we find that simply aggregating relational neighbors alone into a long sequence leads to worse results on the YAGO43kET dataset, because its relational neighbors are too few and overfitting is hard to avoid during training; aggregating type neighbors together with relational neighbors addresses this problem.

5 Conclusion and Future Work

This paper proposes AttIN, an entity typing inference model that pays more attention to the neighborhood information of the target entity. We use an interactive information aggregator, IIAgg, to aggregate neighborhood information, which ensures that no neighborhood information is lost during training. Compared with the best baseline results, the MRR on the FB15kET and YAGO43kET datasets improves by 0.6% and 3.3%, respectively, which indicates the superior performance of AttIN.



Compared to the sparse knowledge graph YAGO43kET, AttIN has less improvement in experimental results on the FB15kET dataset. In the future, we will try more knowledge graph embedding methods or use graph convolutional networks to improve our model. Acknowledgements. Our work is supported by multiple funds in China, including Guangzhou Science and Technology Planning Project (2023B01J0001, 202201011835), the Key Program of NSFC-Guangdong Joint Funds (U2001201, U1801263), Industrial core and key technology plan of ZhuHai City (ZH22044702190034-HJL). Our work is also supported by Guangdong Provincial Key Laboratory of Cyber-Physical System (2020B1212060069).

References

1. Zhang, H., Li, A., Guo, J., Guo, Y.: Hybrid models for open set recognition. In: Vedaldi, A., Bischof, H., Brox, T., Frahm, J.-M. (eds.) ECCV 2020. LNCS, vol. 12348, pp. 102–117. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-58580-8_7
2. Moon, C., Harenberg, S., Slankas, J., Samatova, N.: Learning contextual embeddings for knowledge graph completion. In: Pacific Asia Conference on Information Systems (PACIS), vol. 10 (2017)
3. Chen, K., Wang, Y., Li, Y., Li, A., Zhao, X.: Contextualise entities and relations: an interaction method for knowledge graph completion. In: Farkaš, I., Masulli, P., Otte, S., Wermter, S. (eds.) ICANN 2021. LNCS, vol. 12893, pp. 179–191. Springer, Cham (2021). https://doi.org/10.1007/978-3-030-86365-4_15
4. Yang, C., Zhang, W.: Private and shared feature extractors based on hierarchical neighbor encoder for adaptive few-shot knowledge graph completion. In: Proceedings of the 34th International Conference on Tools with Artificial Intelligence, pp. 409–416 (2022)
5. Wang, Z., Yang, J., Ye, X.: Knowledge graph alignment with entity-pair embedding. In: Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP), pp. 1672–1680 (2020)
6. Song, X., Zhang, H., Bai, L.: Entity alignment between knowledge graphs using entity type matching. In: Qiu, H., Zhang, C., Fei, Z., Qiu, M., Kung, S.-Y. (eds.) KSEM 2021. LNCS (LNAI), vol. 12815, pp. 578–589. Springer, Cham (2021). https://doi.org/10.1007/978-3-030-82136-4_47
7. Dong, X., Yu, W., Zhu, C., Jiang, M.: Injecting entity types into entity-guided text generation. In: EMNLP, vol. 1, pp. 734–741. Association for Computational Linguistics (2021)
8. Gupta, N., Singh, S., Roth, D.: Entity linking via joint encoding of types, descriptions, and context. In: Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, pp. 2681–2690 (2017)
9. Le, T., Huynh, N., Le, B.: Link prediction on knowledge graph by rotation embedding on the hyperplane in the complex vector space. In: Farkaš, I., Masulli, P., Otte, S., Wermter, S. (eds.) ICANN 2021. LNCS, vol. 12893, pp. 164–175. Springer, Cham (2021). https://doi.org/10.1007/978-3-030-86365-4_14
10. Wu, C., Chen, L.: Utber: utilizing fine-grained entity types to relation extraction with distant supervision. In: 2020 IEEE International Conference on Smart Data Services (SMDS), pp. 63–71. IEEE (2020)



11. Pan, W., Wei, W., Mao, X.L.: Context-aware entity typing in knowledge graphs. In: Findings of the Association for Computational Linguistics: EMNLP 2021, pp. 2240–2250 (2021)
12. Schlichtkrull, M., Kipf, T.N., Bloem, P., van den Berg, R., Titov, I., Welling, M.: Modeling relational data with graph convolutional networks. In: Gangemi, A., et al. (eds.) ESWC 2018. LNCS, vol. 10843, pp. 593–607. Springer, Cham (2018). https://doi.org/10.1007/978-3-319-93417-4_38
13. Hu, Z., Gutiérrez-Basulto, V., Xiang, Z., Li, R., Pan, J.Z.: Transformer-based entity typing in knowledge graphs. In: EMNLP, pp. 5988–6001. Association for Computational Linguistics (2022)
14. Balažević, I., Allen, C., Hospedales, T.M.: Hypernetwork knowledge graph embeddings. In: Tetko, I.V., Kůrková, V., Karpov, P., Theis, F. (eds.) ICANN 2019. LNCS, vol. 11731, pp. 553–565. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-30493-5_52
15. Moon, C., Jones, P., Samatova, N.F.: Learning entity type embeddings for knowledge graph completion. In: Proceedings of the 2017 ACM on Conference on Information and Knowledge Management, pp. 2215–2218 (2017)
16. Zhao, Y., Zhang, A., Xie, R., Liu, K., Wang, X.: Connecting embeddings for knowledge graph entity typing. In: ACL, pp. 6419–6428. Association for Computational Linguistics (2020)
17. Ge, X., Wang, Y.C., Wang, B., Kuo, C.J.: CORE: a knowledge graph entity type prediction method via complex space regression and embedding. Pattern Recogn. Lett. 157, 97–103 (2022)
18. Zhuo, J., Zhu, Q., Yue, Y., Zhao, Y., Han, W.: A neighborhood-attention fine-grained entity typing for knowledge graph completion. In: Proceedings of the Fifteenth ACM International Conference on Web Search and Data Mining, pp. 1525–1533 (2022)
19. Zhang, Z., Zhang, Y., Wang, Y., Ma, M., Xu, J.: Complex exponential graph convolutional networks. Inf. Sci. 640, 119041 (2023)
20. Jin, H., Hou, L., Li, J., Dong, T.: Fine-grained entity typing via hierarchical multi graph convolutional networks. In: Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP), pp. 4969–4978 (2019)
21. Vashishth, S., Sanyal, S., Nitin, V., Talukdar, P.P.: Composition-based multi-relational graph convolutional networks. In: ICLR. OpenReview.net (2020)
22. Zhao, Y., et al.: Connecting embeddings based on multiplex relational graph attention networks for knowledge graph entity typing. IEEE Trans. Knowl. Data Eng. 35, 4608–4620 (2022)
23. Zou, C., An, J., Li, G.: Knowledge graph entity type prediction with relational aggregation graph attention network. In: Groth, P., et al. (eds.) The Semantic Web, ESWC 2022. LNCS, vol. 13261. Springer, Cham (2022). https://doi.org/10.1007/978-3-031-06981-9_3
24. Pan, J.Z., Vetere, G., Gomez-Perez, J.M., Wu, H.: Exploiting Linked Data and Knowledge Graphs in Large Organisations. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-45654-6
25. Devlin, J., Chang, M., Lee, K., Toutanova, K.: BERT: pre-training of deep bidirectional transformers for language understanding. In: NAACL-HLT, vol. 1, pp. 4171–4186. Association for Computational Linguistics (2019)
26. Chen, S., Liu, X., Gao, J., Jiao, J., Zhang, R., Ji, Y.: HittER: hierarchical transformers for knowledge graph embeddings. In: EMNLP, vol. 1, pp. 10395–10407. Association for Computational Linguistics (2021)



27. Stergiou, A., Poppe, R., Kalliatakis, G.: Refining activation downsampling with SoftPool. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 10357–10366 (2021)
28. Bollacker, K., Evans, C., Paritosh, P., Sturge, T., Taylor, J.: Freebase: a collaboratively created graph database for structuring human knowledge. In: Proceedings of the 2008 ACM SIGMOD International Conference on Management of Data, pp. 1247–1250 (2008)
29. Suchanek, F.M., Kasneci, G., Weikum, G.: YAGO: a core of semantic knowledge. In: Proceedings of the 16th International Conference on World Wide Web, pp. 697–706 (2007)
30. Kingma, D.P., Ba, J.: Adam: a method for stochastic optimization. In: ICLR (Poster) (2015)

Text-Based Person re-ID by Saliency Mask and Dynamic Label Smoothing

Yonghua Pang1, Canlong Zhang1,2(B), Zhixin Li1,2, and Liaojie Hu3

1 Key Lab of Education Blockchain and Intelligent Technology, Ministry of Education, Guangxi Normal University, Guilin 541004, China
[email protected]
2 Guangxi Key Lab of Multi-source Information Mining & Security, Guangxi Normal University, Guilin, China
[email protected]
3 The Experimental High School Attached to Beijing Normal University, Beijing, China

Abstract. The current text-based person re-identification (re-ID) models tend to learn salient features of image and text, which, however, are prone to failure in identifying persons with very similar dress, because image contents with observable but indescribable differences may share an identical textual description. To address this problem, we propose a saliency-mask-based re-ID model that learns non-salient but highly discriminative features, which can work together with the salient features to provide more robust pedestrian identification. To further improve the performance of the model, a dynamic label smoothing based cross-modal projection matching loss (named CMPM-DS) is proposed to train our model; CMPM-DS can adaptively adjust the smoothing degree of the true distribution. We conduct extensive ablation and comparison experiments on two popular re-ID benchmarks to demonstrate the efficiency of our model and loss function, improving the existing best R@1 by 0.33% on CUHK-PEDES and 4.45% on RSTPReid.

Keywords: Person Re-identification · Cross-Modal Retrieval · Feature Extraction

1 Introduction

For text-based person re-ID, the key is how to learn more discriminative modal features. The current text-based person re-identification (re-ID) models [1,3,5,10,12] tend to learn salient features of image and text, which, however, are prone to failure in identifying persons with very similar dress. As shown in Fig. 1, a dissimilar negative sample pair (b and c) can be well distinguished with salient features, but a similar negative sample pair (b and a) is usually difficult to distinguish by salient features alone. Besides, due to the limitation of textual description, some image contents with observable but indescribable differences may have identical textual descriptions.



Therefore, it is very difficult to identify image-text pairs with similar content but different IDs by using only their salient features. However, we notice that some non-salient but highly discriminative information can effectively distinguish such difficult samples. For example, the regions highlighted in red boxes in Fig. 1(a) and (b), such as the differently coloured shoes and the glasses, show more significant differences, although they are not salient overall.

Fig. 1. (a) and (b) form a similar negative sample pair, while (b) and (c) form a dissimilar negative sample pair. The dissimilar negative sample pair can be well distinguished by the information in the salient regions, such as tops and trousers. The similar negative sample pair, however, cannot be effectively distinguished because the salient regions are similar; in contrast, the non-salient regions carry highly discriminative information, such as the shoes and glasses in the red boxes. (Color figure online)

However, the popular baseline models VIT [4] and BERT [2] are both based on the Transformer, so they are good at capturing salient information in images and texts but are not effective at learning the non-salient yet highly discriminative information that is useful for fine-grained retrieval. To enable them to learn this discriminative information, we propose a new alignment paradigm called Saliency Mask Modelling (SMM): we mask the most salient regions of the image and text to force the models to attend to the non-salient regions, thus effectively learning non-salient but highly discriminative features. Specifically, we first perform one VIT and BERT computation on the original image and text, and then find the top-k tokens (i.e. salient region features) most relevant to the global token based on their similarity with the global token. After that, we mask the original image patches and words corresponding to the top-k tokens and perform a second VIT and BERT computation on the remaining image patches and words, thereby forcing the model to learn non-salient but highly discriminative features.
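The core of the salient-region selection step described above can be sketched as follows; tensor shapes, the use of cosine similarity here, and the function name are assumptions for illustration rather than the authors' exact implementation.

```python
import torch

def saliency_mask_indices(global_token, local_tokens, k):
    """Rank local tokens by cosine similarity to the global token and return
    the indices of the top-k (to be masked) and of the remaining non-salient
    tokens. Shapes assumed: global_token (d,), local_tokens (N, d)."""
    sims = torch.nn.functional.cosine_similarity(
        local_tokens, global_token.unsqueeze(0), dim=-1)       # (N,)
    topk = sims.topk(k).indices                                 # most salient regions
    keep_mask = torch.ones(local_tokens.size(0), dtype=torch.bool)
    keep_mask[topk] = False                                     # mask out salient regions
    return topk, keep_mask.nonzero(as_tuple=True)[0]
```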



A good loss function can often lead the network to learn better features. Traditional label smoothing is an amendment to the cross-entropy loss: [7] argues that encouraging the largest possible gap between positive and negative samples during training makes the model overconfident in the true labels, which leads to overfitting. They therefore propose a label smoothing strategy that reduces the confidence assigned to the true labels and narrows the gap between positive and negative samples. The cross-modal projection matching (CMPM) loss is widely used for text-based person re-identification; it performs feature projection matching between the two modalities and uses the KL divergence to pull the predicted distribution towards the true distribution. However, the CMPM loss suffers from a problem similar to that of the traditional cross-entropy loss: its true distribution expects the difference between positive and negative sample pairs to be as large as possible, which easily causes overfitting. Inspired by [7], we propose a dynamic label smoothing based CMPM loss. Unlike traditional label smoothing, which smooths all negative samples uniformly, our dynamic label smoothing applies a different amount of smoothing to different negative samples. With this dynamic smoothing, the model can adjust the true distribution of the data during training so as to avoid excessive differences between positive and negative sample pairs; we refer to the modified CMPM as CMPM-Dynamic-Smooth (CMPM-DS). Figure 2 illustrates the difference between the two losses, the biggest one being the distance between positive and negative sample pairs. Our contributions are as follows:

– We propose a saliency mask mechanism, by which our re-ID model can focus on the non-salient but highly discriminative information of image and text, so as to learn more discriminative features.
– To better train our model, we propose a dynamic label smoothing based cross-modal projection matching loss function, which performs a dynamic smoothing operation between positive and negative sample pairs to reduce their difference.
– We conduct extensive experiments on two popular re-ID benchmarks, and our R@1 on CUHK-PEDES and RSTPReid outperforms the existing best one by 0.33% and 4.45%, respectively.

2 Approach

2.1 Overview

A general overview of our model is shown in Fig. 3. Our baseline model uses the VIT pre-training model to extract visual features and the BERT pre-training

446

Y. Pang et al.

Fig. 2. (a) represents the CMPM view; (b) represents CMPM-DS view, where green represents positive sample pairs, orange represents negative sample pairs, p is the model prediction probability, q is the true distribution probability, and KL divergence is used to reduce the difference between the two probability distributions. Compared with (a), the true distribution probability of positive sample pairs in (b) is slightly lower, and the true distribution probability value of negative sample pairs is a non-zero smaller value. Specifically, the values of different negative sample pairs in (b) are different, and their values are positively correlated with (1 − pij ), where pi j represents the matching degree between xi and tj predicted by the model. For negative sample pairs, the setting of this value is conducive to the model to increase the punishment of difficult negative sample pairs. (Color figure online)

model to extract textual features respectively. The features extracted by the pre-training model have initial semantics, so we use these features to perform saliency mask modelling (SMM) to complete the masking of salient regions. We use the CMPM-Dynamic-Smooth (CMPM-DS) to compute loss on both the global token and the SMM global token. Each part of the model is described in detail separately below eventually. 2.2

Baseline Model

For text-based person re-identification, Our baseline model uses two pre-trained models for feature extraction of information from both modalities, where we use the pre-trained VIT model [4] for feature extraction of visual information and we select the pre-trained BERT model [2] for feature extraction of textual information. The model structure is shown in Fig. 3. Given image xi and text ti as the ith image and the ith text respectively, Encoder img denotes the visual feature extraction model and Encoder txt denotes the text feature extraction model. The process of feature extraction is defined as follows. {Xf 1 , Xf 2 , . . . , Xf 211 } = Encoder img(xi )

(1)

Text-Based Person re-ID by Saliency Mask and Dynamic Label Smoothing

447

Fig. 3. Baseline model extracts visual and textual features using the VIT pre-training model and the BERT pre-training model respectively. Top-k features are selected as salient regions by the SMM module based on the similarity between the extracted global and local features. We mask the salient regions and complete global modelling using the remaining non-salient regions. Finally, we use CMPM-DS to calculate loss for both the global token and the SMM global token.

{Tf 1 , Tf 2 , . . . , Tf 64 } = Encoder txt(ti )

(2)

Xglobal token = Xf 1

(3)

Tglobal token = Tf 1

(4)

Where {Xf 1 , Xf 2 , . . . , Xf 211 } denote the 211 visual features extracted, corresponding to 210 image blocks and a learnable token respectively. Similarly, {Tf 1 , Tf 2 , . . . , Tf 64 } denote 64 text features extracted, since the maximum length of a sentence is defined as 64. We select the first feature of the two modal feature sequences to represent the global features, then utilize them to compute the CMPM-DS loss. 2.3
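The following is a minimal sketch of the baseline feature extraction in Eqs. (1)–(4), assuming standard Hugging Face ViT/BERT checkpoints; the paper's exact patch grid (210 patches) and the 64-token text length are configuration details, and the checkpoint names here are only placeholders.

```python
# Sketch: extract local token sequences and global tokens from ViT and BERT.
import torch
from transformers import ViTModel, BertModel, BertTokenizer

vit = ViTModel.from_pretrained("google/vit-base-patch16-224-in21k")
bert = BertModel.from_pretrained("bert-base-uncased")
tok = BertTokenizer.from_pretrained("bert-base-uncased")

def extract_global_tokens(pixel_values, captions):
    # Visual tokens: [CLS] + patch tokens; the first token is the global feature (Eq. 3).
    x_feats = vit(pixel_values=pixel_values).last_hidden_state          # (B, 1+P, D)
    # Text tokens: [CLS] + word tokens, padded/truncated to a fixed length (Eq. 4).
    enc = tok(captions, padding="max_length", truncation=True,
              max_length=64, return_tensors="pt")
    t_feats = bert(**enc).last_hidden_state                              # (B, 64, D)
    return x_feats, t_feats, x_feats[:, 0], t_feats[:, 0]
```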

2.3 Saliency Mask Modelling

As a fine-grained image-text retrieval problem, text-based person re-identification needs to uncover the highly discriminative information that distinguishes two similar negative sample pairs. Although a model based on the Transformer encoder architecture is good at global modelling of salient information, as shown in Fig. 3, for similar negative sample pairs the non-salient but highly discriminative information is better suited to telling them apart. To make full use of the highly discriminative non-salient information, we propose a new alignment paradigm, called Saliency Mask Modelling (SMM), which masks the salient regions and makes the model rely on the non-salient regions for modelling, thus encouraging the model to learn more non-salient but highly discriminative information.

As shown in Fig. 3, we first perform one VIT and BERT forward pass on the original image and text, and then find the top-k tokens (i.e. salient region features) that are most relevant to the global token based on their cosine similarity to the global token. After that, we mask the original image patches and words corresponding to the top-k tokens and perform a second VIT and BERT pass using the remaining image patches and words to obtain the SMM global token. The similarity score of a local region token represents the relevance of the region to the whole picture, so the higher the similarity score, the more salient the region. We select the top-k local regions as salient regions based on the similarity score and treat the remaining regions as non-salient. As Fig. 3 shows, for images the body of a person tends to be the salient region, while the head and feet tend to be regarded as non-salient regions. For text, words describing body clothing tend to be the most relevant, such as “black shorts” and “white T-shirt”. We mask the salient regions and globally model the remaining non-salient regions to obtain the SMM global token. The SMM global token focuses more on the non-salient regions and can improve the model's ability to extract non-salient but highly discriminative information. Unlike Bidirectional Random Mask Modelling (BMM) proposed by [6], which uses a random mask to increase data diversity, our SMM focuses on the non-salient regions of images and text, aiming to explore non-salient but highly discriminative information.
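The masking step can be sketched as follows; the function and variable names are ours, and the routine simply ranks local tokens by cosine similarity to the global token and marks the top-k as salient.

```python
# Sketch: select top-k salient positions and build a keep-mask for the second pass.
import torch
import torch.nn.functional as F

def saliency_mask(local_tokens, global_token, k):
    """local_tokens: (B, N, D) patch/word features; global_token: (B, D)."""
    sim = F.cosine_similarity(local_tokens, global_token.unsqueeze(1), dim=-1)  # (B, N)
    topk_idx = sim.topk(k, dim=1).indices                 # indices of salient regions
    keep_mask = torch.ones_like(sim, dtype=torch.bool)
    keep_mask.scatter_(1, topk_idx, False)                # False = masked (salient) position
    return keep_mask

# The second ViT/BERT pass is run only on the kept (non-salient) patches/words,
# and its first output token is taken as the SMM global token.
```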

2.4 Dynamic Label Smoothing

What Is CMPM. Let us first review how the CMPM loss function implements cross-modal feature projection to enable the model to extract more discriminative features. Given a mini-batch containing $n$ image-text pairs, the image-text pairs for each image feature $x_i$ in the mini-batch are constructed as $\{(x_i, t_j), y_{i,j}\}_{j=1}^{n}$. When $(x_i, t_j)$ is a positive sample pair, $y_{i,j} = 1$, indicating that the true distribution of the sample pair is true; when $(x_i, t_j)$ is a negative sample pair, $y_{i,j} = 0$, indicating that the true distribution of the sample pair is false. The matching probability of image feature $x_i$ and text feature $t_j$ is defined as

$$p_{i,j} = \frac{\exp(x_i^{\top} \bar{t}_j)}{\sum_{k=1}^{n} \exp(x_i^{\top} \bar{t}_k)}, \quad \text{s.t. } \bar{t}_j = \frac{t_j}{\|t_j\|} \quad (5)$$

where $\bar{t}_j$ denotes the text feature after normalization. Geometrically, $x_i^{\top}\bar{t}_j$ denotes the scalar projection of the image feature onto the normalised text feature, and the probability $p_{i,j}$ can be seen as the percentage that the $j$-th scalar projection takes among all image-text pairs in the mini-batch.

For the setting of the true distribution $y$, [11] take into account the practical situation that a mini-batch may contain multiple text features paired with $x_i$, so they normalize the true distribution $y$. The mini-batch normalised true distribution $q$ is defined as

$$q_{i,j} = \frac{y_{i,j}}{\sum_{k=1}^{n} y_{i,k}} \quad (6)$$

[11] then use the KL divergence to relate the matching distribution $p$ of sample pairs to the mini-batch normalised true distribution $q$, obtaining the CMPM loss

$$L_i^{cmpm} = \sum_{j=1}^{n} p_{i,j} \log \frac{p_{i,j}}{q_{i,j} + \epsilon} \quad (7)$$

where $\epsilon$ is a small value used to avoid the function value going to negative infinity. The KL divergence measures the difference between two probability distributions; the smaller its value, the smaller the difference between the two distributions.

How to Achieve Dynamic Label Smoothing. Inspired by label smoothing [7], we consider that label smoothing is also needed in the image-text retrieval domain to avoid the overfitting caused by hard labels. Reviewing Eq. 6, it can be seen that the true distribution $y$ of the CMPM loss has the common hard-label problem: all negative samples are absolutely negated, which easily causes the model to overfit. Therefore, we propose a dynamic smoothing strategy to modify the CMPM loss and alleviate the overfitting caused by the hard true distribution $y$.

How can dynamic label smoothing be implemented? Our method differs from standard label smoothing, which only considers how to avoid the absolute negation of the true probability of negative samples and describes the negative samples with a uniform distribution, without considering the discrepancies between different negative samples and the same positive sample. Our proposed dynamic label smoothing not only avoids the absolute negation of the true probability of negative samples, but also takes these discrepancies into account; specifically, it imposes a more stringent penalty on difficult negative samples. For the cross-modal task of image-text retrieval, each mismatched image-text pair differs to a different extent: some texts describe content that is quite similar to the visual content of an image although their true labels do not match, and this type of text is a difficult negative sample. Therefore, smoothing according to how well the visual features match the text features of the negative samples is more realistic. At the same time, dynamic label smoothing is a dynamic adaptive process: the smoothing distribution differs at each step and dynamically adjusts the degree of smoothing of the true distribution as the model learns.


We refer to the smoothed, modified CMPM as CMPM-Dynamic-Smooth (CMPM-DS), which is implemented as follows:

$$\hat{q}_{i,j} = \frac{\hat{y}_{i,j}}{\sum_{k=1}^{n} \hat{y}_{i,k}} \quad (8)$$

$$\hat{y}_{i,j} = \begin{cases} \alpha, & y_{i,j} = \text{True} \\ 1 - p_{i,j}, & y_{i,j} = \text{False} \end{cases} \quad (9)$$

$$L_i^{cmpm\text{-}ds} = \sum_{j=1}^{n} p_{i,j} \log \frac{p_{i,j}}{\hat{q}_{i,j} + \epsilon} \quad (10)$$

As Eq. 9 shows, we divide the true distribution into two cases. To distinguish it from the original true distribution $y$, the true distribution after dynamic smoothing is denoted $\hat{y}$, and its normalization is denoted $\hat{q}$. We consider $\hat{y}$ in two cases.

1. Processing of the true distribution $\hat{y}$ for matched image-text pairs. For matched pairs we set $\hat{y}$ to the hyperparameter $\alpha$, which regulates the true-distribution probability of matched image-text pairs. The larger the value of $\alpha$, the closer $\hat{q}$ of a matched image-text pair is to 1 and the closer $\hat{q}$ of a mismatched image-text pair is to 0.

2. Processing of the true distribution $\hat{y}$ for mismatched image-text pairs. For mismatched pairs we set $\hat{y}$ to $(1 - p_{i,j})$, where $p_{i,j}$ is given by Eq. 5 and reflects the degree to which the two modal features match. The value of $p_{i,j}$ ranges from 0 to 1, and the closer it is to 1, the better the two features match; for a mismatched image-text pair, the closer $p_{i,j}$ is to 1, the more difficult the negative pair is. Setting the mismatch value to $(1 - p_{i,j})$ therefore imposes a greater penalty on difficult negative pairs. It is easy to see that $p_{i,j}$ changes as the model parameters change, so the dynamic label smoothing of the true distribution $\hat{y}$ is a dynamic and adaptive process.
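A minimal PyTorch sketch of the CMPM-DS loss (Eqs. 5, 8–10) is given below, assuming image and text features of one mini-batch and a binary match matrix y; treating the smoothed labels as constants (the detach call) is our assumption, and alpha and eps follow the paper's notation.

```python
# Sketch: CMPM-DS loss for one mini-batch of image/text features.
import torch
import torch.nn.functional as F

def cmpm_ds_loss(x, t, y, alpha=500.0, eps=1e-8):
    """x: (n, d) image features, t: (n, d) text features, y: (n, n), 1 for matched pairs."""
    t_bar = F.normalize(t, dim=1)                            # Eq. (5): normalise text features
    p = torch.softmax(x @ t_bar.t(), dim=1)                  # matching probabilities p_ij
    y_hat = torch.where(y > 0,
                        alpha * torch.ones_like(p),          # Eq. (9): matched pairs -> alpha
                        1.0 - p.detach())                    # mismatched pairs -> 1 - p_ij
    q_hat = y_hat / y_hat.sum(dim=1, keepdim=True)           # Eq. (8): mini-batch normalisation
    loss = (p * torch.log(p / (q_hat + eps))).sum(dim=1)     # Eq. (10): KL-style divergence
    return loss.mean()
```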

3 Experiments

3.1 Datasets

We conduct extensive experiments on two public datasets. The CUHK-PEDES [1] dataset contains 40,206 pedestrian images with 13,003 pedestrian IDs and two text descriptions per image. The dataset is divided into 11,003 training IDs with 34,054 images, 1,000 validation IDs with 3,078 images, and 1,000 test IDs with 3,074 images. The RSTPReid [12] dataset contains 20,505 pedestrian images with 4,101 IDs, each image with two sentences. The training, validation, and test sets contain 3,701, 200, and 200 IDs, respectively.


Table 1. Comparison of our method with SOTA methods on CUHK-PEDES (left) and RSTPReid (right)

CUHK-PEDES                               RSTPReid
Methods        R@1    R@5    R@10        Methods        R@1    R@5    R@10
ViTAA [5]      55.97  75.84  83.52       IMG-Net [10]   37.60  61.15  73.55
IMG-Net [10]   56.48  76.89  85.01       AMEN [8]       38.45  62.40  73.80
DSSL [12]      59.98  80.41  87.56       DSSL [12]      39.05  62.60  73.95
SSAN [3]       61.37  80.15  86.73       SSAN [3]       43.50  67.80  77.15
TIPCB [1]      63.63  82.82  89.01       IVT [6]        46.70  70.00  78.80
IVT [6]        65.59  83.11  89.21       CAIBC [9]      47.35  69.55  79.00
Ours           65.92  83.72  90.01       Ours           51.80  73.20  81.25

3.2 Evaluation Metrics and Implementation Details

Evaluation Metrics. In this paper, we use Recall@K (K = 1, 5, 10), a common evaluation metric for retrieval tasks. Recall@K (R@K) indicates the rate at which a correct match appears among the first K retrieved images when querying images with the specified text; mAP indicates the average retrieval accuracy, i.e. the average of all retrieval accuracies.

Implementation Details. Models are trained on a single NVIDIA GeForce RTX 3090 GPU. Our baseline model is trained with the Adam optimizer for 20 epochs at a learning rate of 0.00003, followed by 60 epochs at a learning rate of 0.000003.
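For reference, the R@K metric used above can be computed as sketched below; this is an illustrative implementation under the standard protocol (a query counts as a hit if any of its top-K ranked images shares the query's identity), and all names are ours.

```python
# Sketch: Recall@K for text-to-image retrieval given a similarity matrix.
import numpy as np

def recall_at_k(sim, text_ids, image_ids, k):
    """sim: (num_texts, num_images) similarity matrix; *_ids: identity labels."""
    ranks = np.argsort(-sim, axis=1)[:, :k]               # top-K image indices per query
    hits = [np.any(image_ids[ranks[i]] == text_ids[i]) for i in range(sim.shape[0])]
    return float(np.mean(hits))

# Example: the three values reported in Table 1.
# r1, r5, r10 = (recall_at_k(sim, tid, iid, k) for k in (1, 5, 10))
```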

3.3 Comparison with State-of-the-Art Methods

Table 1 presents our method compared to existing methods on CUHK-PEDES and RSTPReid. We can see that our method achieves SOTA results on both pedestrian datasets. The results of our method outperform the state-of-the-art methods by 0.33% in terms of R@1 on the CUHK-PEDES dataset and by 4.45% on the RSTPReid dataset.

3.4 Ablation Study

In Table 2, serial numbers 1 and 5 indicate the results of the baseline model without SMM and CMPM-DS. By observing serial numbers 2 and 6, we can find that after using SMM, R@1 increases by 1.29% on average in the results of the two datasets. By observing serial numbers 3 and 7, we can see that the results of the two datasets are significantly improved after the use of CMPM-DS, and the average R@1 increase is 3.39%. The serial numbers 4 and 8 show the results with the simultaneous use of SMM and CMPM-DS, indicating they can be used effectively in combination.

Table 2. Ablation experiments on two pedestrian datasets

Serial No.  Dataset      Baseline  SMM  CMPM-DS  R@1    R@5    R@10
1           CUHK-PEDES   ✓         ✕    ✕        61.58  81.08  87.17
2           CUHK-PEDES   ✓         ✓    ✕        62.30  82.21  88.55
3           CUHK-PEDES   ✓         ✕    ✓        64.25  82.60  88.56
4           CUHK-PEDES   ✓         ✓    ✓        65.92  83.72  90.01
5           RSTPReid     ✓         ✕    ✕        45.90  71.70  80.70
6           RSTPReid     ✓         ✓    ✕        47.75  72.85  82.30
7           RSTPReid     ✓         ✕    ✓        50.00  74.50  81.30
8           RSTPReid     ✓         ✓    ✓        51.80  73.20  81.25

Fig. 4. Exploring the effect of different α values on performance at a batch size of 48. Different alpha values can have a significant impact on performance. When the alpha value is too large, it can lead to performance degradation.

3.5 Hyperparameter Exploration Experiments

In this section, we perform a hyperparameter comparison experiment on the CMPM-DS function to explore the effect of the hyperparameter α on performance. α is used as a hyperparameter to adjust the real probability distribution of positive matching pairs, and its value affects the degree of label smoothing on the whole. Since Eq. 8 is normalised over a mini-batch, the size of the alpha value is closely related to the size of the mini-batch. Figure 4 reports the experimental results for baseline using different alpha values on the CUHK-PEDES dataset and for a batch-size of 48. From the results in Fig. 4, it can be seen that the model performs best when the α value is 500. Also, it is not the case that using dynamic label smoothing necessarily avoids overfitting, as the performance of the model is highly dependent on the degree of smoothing.


Fig. 5. Results of R@5 are shown, where the green boxes are correct results and the red boxes are incorrect results. (Color figure online)

3.6 Qualitative Results

Three examples of R@5 results are reported, as shown in Fig. 5. In general, all retrieved images have a high correlation with the description text, even for incorrect results. Our method is effective in improving retrieval results: as shown in Fig. 5(a), even when the baseline already retrieves well, we can still further improve the results. Figure 5(b) shows the ability of our method to capture highly discriminative details, such as “a grey hand bag”. For Fig. 5(c), where discriminative information is lacking, our method still provides better overall modelling and achieves correct retrieval.

4 Conclusion

This paper proposes a new alignment paradigm, called Saliency Mask Modelling, for solving the similar-negative-sample problem. It masks the salient regions to improve the model's ability to capture non-salient but highly discriminative information. Meanwhile, this paper draws on the idea of label smoothing and applies dynamic label smoothing to the CMPM loss function to alleviate model overfitting. Unlike traditional label smoothing, dynamic label smoothing takes into account the differences between different negative samples and adaptively adjusts the degree of smoothing at each step.

Acknowledgements. This work is supported by National Natural Science Foundation of China (Nos. 62266009, 61866004, 62276073, 61966004, 61962007), Guangxi Natural Science Foundation (Nos. 2019GXNSFDA245018), Guangxi Collaborative Innovation Center of Multi-source Information Integration and Intelligent Processing, and Guangxi “Bagui Scholar” Teams for Innovation and Research Project.


References
1. Chen, Y., Zhang, G., Lu, Y., Wang, Z., Zheng, Y.: TIPCB: a simple but effective part-based convolutional baseline for text-based person search. Neurocomputing 494, 171–181 (2022)
2. Devlin, J., Chang, M.W., Lee, K., Toutanova, K.: BERT: pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805 (2018)
3. Ding, Z., Ding, C., Shao, Z., Tao, D.: Semantically self-aligned network for text-to-image part-aware person re-identification. arXiv preprint arXiv:2107.12666 (2021)
4. Dosovitskiy, A., et al.: An image is worth 16×16 words: transformers for image recognition at scale. arXiv preprint arXiv:2010.11929 (2020)
5. Li, S., Xiao, T., Li, H., Yang, W., Wang, X.: Identity-aware textual-visual matching with latent co-attention. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 1890–1899 (2017)
6. Shu, X., et al.: See finer, see more: implicit modality alignment for text-based person retrieval. arXiv preprint arXiv:2208.08608 (2022)
7. Szegedy, C., Vanhoucke, V., Ioffe, S., Shlens, J., Wojna, Z.: Rethinking the inception architecture for computer vision. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2818–2826 (2016)
8. Wang, Z., Xue, J., Zhu, A., Li, Y., Zhang, M., Zhong, C.: AMEN: adversarial multi-space embedding network for text-based person re-identification. In: Ma, H., et al. (eds.) PRCV 2021. LNCS, vol. 13020, pp. 462–473. Springer, Cham (2021). https://doi.org/10.1007/978-3-030-88007-1_38
9. Wang, Z., et al.: CAIBC: capturing all-round information beyond color for text-based person retrieval. In: Proceedings of the 30th ACM International Conference on Multimedia, pp. 5314–5322 (2022)
10. Wang, Z., Zhu, A., Zheng, Z., Jin, J., Xue, Z., Hua, G.: IMG-Net: inner-cross-modal attentional multigranular network for description-based person re-identification. J. Electron. Imaging 29(4), 043028 (2020)
11. Zhang, Y., Lu, H.: Deep cross-modal projection learning for image-text matching. In: Ferrari, V., Hebert, M., Sminchisescu, C., Weiss, Y. (eds.) ECCV 2018. LNCS, vol. 11205, pp. 707–723. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-01246-5_42
12. Zhu, A., et al.: DSSL: deep surroundings-person separation learning for text-based person retrieval. In: Proceedings of the 29th ACM International Conference on Multimedia, pp. 209–217 (2021)

Robust Multi-view Spectral Clustering with Auto-encoder for Preserving Information

Xiaojie Wang, Ye Liu(B), Hongshan Pu, Yuchen Mou, and Chaoxiong Lin

The School of Future Technology, South China University of Technology, Guangzhou 511442, China
[email protected], [email protected]

Abstract. Multi-view clustering is a prominent research topic in machine learning that leverages consistent and complementary information from multiple views to improve clustering performance. Graph-based multi-view clustering methods learn a consistent graph whose edges encode the pairwise similarity between data points and generate sample representations using spectral clustering. However, most existing methods seldom consider recreating the input data from the encoded representation during representation learning, which results in information loss. To address this limitation, we propose a robust multi-view spectral clustering with auto-encoder for preserving information (RMVSC-AE) that minimizes the reconstruction error between the input data and the reconstructed representation to preserve knowledge. Specifically, we discover a graph representation by jointly optimizing the graph Laplacian and auto-encoder reconstruction terms. Moreover, we introduce a sparse noise term to further enhance the quality of the learned consistent graph. Extensive experiments on six multi-view datasets are conducted to verify the efficacy of the proposed method.

Keywords: Multi-view Clustering · Spectral Clustering · Auto-encoder · Robust Graph Learning

1 Introduction

With the rapid development of information technology, data can be collected from various sources, resulting in the generation of multi-view data. Multi-view data describes the characteristics of a target object from multiple perspectives, which constitute complementary information. Therefore, multi-view data contains more comprehensive and richer information than single-view data. For example, a picture can be described by scale-invariant feature transform [14], histograms [3], and other features. In order to mine the consistent information in multi-view data, multi-view clustering has emerged.

Existing multi-view clustering methods can be roughly divided into non-negative matrix factorization, subspace learning, and graph learning. Multi-view clustering based on non-negative matrix factorization decomposes the feature matrix into two matrices: a non-negative basis matrix and a representation matrix. Multi-view subspace clustering assumes that high-dimensional data has a low-dimensional subspace structure. These methods generally establish an affinity matrix through self-expression and then apply spectral clustering on the affinity matrix to obtain the final clustering result. However, real-world data does not always follow linear distributions, and multi-view subspace clustering cannot capture the nonlinear structure of the data well. Graph-based multi-view clustering first constructs a similarity graph based on a similarity measurement, then learns a consistent graph across multiple graphs using a fusion strategy, and finally outputs clustering results by applying spectral clustering on the fused graph.

The core idea of multi-view graph-based clustering is to integrate heterogeneous information from different views into a consensus embedding or graph. Existing graph-based methods only utilize the similarity graph to generate the graph embedding, so that the manifold structure within the data can be captured. However, these methods seldom consider recreating the input data from the graph embedding to measure the information loss, and hence they may not fully exploit the important information in the original input data. Furthermore, real data always contains noise, which may lead to a noisy similarity graph and then degrade the final clustering results.

Motivated by the above observations, in this paper we present a novel approach called robust multi-view spectral clustering with auto-encoder for preserving information (RMVSC-AE). Our approach learns a robust consistent similarity graph and a graph representation with preserved information in a unified framework. To be specific, graph Laplacian terms and auto-encoder reconstruction terms are incorporated to capture the local structure and retain significant information from the original input data. Moreover, we use L1 regularization to capture sparse noise within the consistent graph, so that a robust graph can be obtained. Mathematically, the proposed model is formulated as a minimization problem, which can be solved by an alternating minimization method. The convergence and computational complexity are also analysed. Extensive experiments on six datasets are conducted to illustrate the effectiveness of our proposed method. Our proposed approach achieves significant improvements over state-of-the-art multi-view clustering methods.

2 Related Work

2.1 Notations

In this paper, matrices are denoted by uppercase bold letters (e.g., $\mathbf{X}$), lowercase bold letters (e.g., $\mathbf{x}$) denote vectors, and non-bold lowercase letters (e.g., $x$) represent scalars. The sample size and the dimension of the $v$-th view are denoted as $n$ and $d_v$ respectively. We use $m$ to denote the number of views and $c$ to represent the number of classes. Real numbers are represented by the real space symbol $\mathbb{R}$. The trace and transpose of a square matrix $\mathbf{X}$ are represented by $Tr(\mathbf{X})$ and $\mathbf{X}^{T}$ respectively. The Frobenius norm and $L_1$ norm of a matrix $\mathbf{X}$ are represented as $\|\mathbf{X}\|_F$ and $\|\mathbf{X}\|_1$ respectively. $\mathbf{X} \geq 0$ means that each element of the matrix $\mathbf{X}$ is greater than or equal to zero. $sgn(x)$ is the element-wise sign function: it returns 1 if $x$ is greater than 0, 0 if $x$ equals 0, and $-1$ if $x$ is less than 0. $|\mathbf{X}|$ takes the absolute value of each element of the matrix, and $\max\{\mathbf{A}, \mathbf{B}\}$ returns the element-wise maximum of the two matrices.

2.2 Multi-view Spectral Clustering

In multi-view spectral clustering, the first step is to construct similarity matrices $\{\mathbf{W}^v \in \mathbb{R}^{n \times n}\}_{v=1}^{m}$ from the given multi-view data $\{\mathbf{X}^v \in \mathbb{R}^{n \times d_v}\}_{v=1}^{m}$ using various similarity metrics [19,29]. Next, different approaches can be employed to fuse the multiple graphs. There are usually two kinds of fusion strategies. One is pre-fusion: a consensus similarity matrix $\mathbf{W}$ is learned and spectral clustering is then performed on $\mathbf{W}$ to obtain the embedding $\mathbf{F}$. The other is post-fusion, which learns the consensus embedding matrix from multiple graph representations $\mathbf{F}^v$ obtained by applying spectral clustering on each $\mathbf{W}^v$ separately. In both cases, the final clustering results are obtained through k-means on the graph representation.

Multi-view spectral clustering has shown great performance. The Co-reg algorithm [11] implements a co-regularization strategy to enforce clustering consistency across multiple views. [17] proposed an automatic weighting method to assign different importance weights to different views; however, it has high computational complexity, which limits its application to large-scale data. To address this limitation, [12] proposed to use bipartite graphs to approximate complete similarity graphs, which reduces the computational complexity. [30] proposed a graph-learning based method to improve the quality of graphs. [18] proposed an adaptive graph learning method that fuses multiple graphs and automatically assigns reasonable weights to each graph. [1] proposed to learn a latent embedding representation from the original input data. [24] proposed to learn the data graph matrix of each view and the unified graph matrix in a mutually reinforcing manner. [21] combined adaptive graph learning with a projection matrix to handle out-of-sample problems. [28] learn a consistent orthogonal embedding from multiple feature matrices and multiple similarity matrices simultaneously via disturbed probabilistic subspace modeling and approximation. [26] proposed a multi-view binary clustering method that learns a unified binary code based on auto-encoders, jointly optimizing the auto-encoder and an affinity graph with low-rank constraints. Although the aforementioned methods have demonstrated remarkable performance, they encounter challenges when confronted with the noise that frequently exists in real-world data. Moreover, these techniques only use the graph to compute representations and do not fully utilize the information embedded in the original data.

3 Methodology

In this section, we introduce our proposed RMVSC-AE method in detail. Specifically, we adopt a Gaussian kernel function to construct the similarity matrices of the multi-view data. Then we adaptively learn a robust consensus graph from multiple views and optimize the graph embedding matrix with spectral clustering and an auto-encoder. Finally, we obtain the clustering results with a simple clustering method such as k-means. Besides, the computational complexity is discussed.

3.1 Graph Construction

Given a dataset $\mathcal{X}$ with $m$ views, denoted $\{\mathbf{X}^v \in \mathbb{R}^{n \times d_v}\}_{v=1}^{m}$, where $d_v$ is the feature dimension of the $v$-th view and $n$ represents the number of data samples, we use the common Gaussian kernel function defined as follows to construct the similarity matrix:

$$K(x_i, x_j) = \Phi(x_i) \cdot \Phi(x_j) = \exp\!\left(-\frac{\|x_i - x_j\|^2}{2\sigma^2}\right) \quad (1)$$

where $\sigma$ is the bandwidth parameter. According to the literature [12,13,15,25], a K-nearest-neighbor (KNN) graph tends to perform better than the complete graph, as it builds sparse connections that help capture the local manifold structure. Therefore, we use KNN to construct the KNN graph matrix as follows:

$$W_{ij} = \begin{cases} \dfrac{1}{2}\!\left(\dfrac{K(x_i, x_j)}{\sum_{k \in \Phi_i} K(x_i, x_k)} + \dfrac{K(x_i, x_j)}{\sum_{k \in \Phi_j} K(x_k, x_j)}\right), & x_i \in \Phi_j \text{ or } x_j \in \Phi_i \\ 0, & \text{otherwise} \end{cases} \quad (2)$$

where $\Phi_i$ is the KNN set of $x_i$. The similarity matrix $\mathbf{W}^v$ of each view is then constructed independently.
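A NumPy sketch of this construction (Eqs. 1–2) is shown below; the bandwidth sigma and the neighbourhood size are hyperparameters, and the function name is ours.

```python
# Sketch: Gaussian-kernel KNN similarity graph of a single view.
import numpy as np

def knn_graph(X, n_neighbors=14, sigma=1.0):
    d2 = np.square(X[:, None, :] - X[None, :, :]).sum(-1)    # pairwise squared distances
    K = np.exp(-d2 / (2.0 * sigma ** 2))                     # Eq. (1)
    np.fill_diagonal(K, 0.0)
    nn = np.argsort(-K, axis=1)[:, :n_neighbors]             # KNN set Phi_i of each sample
    in_knn = np.zeros_like(K, dtype=bool)
    np.put_along_axis(in_knn, nn, True, axis=1)              # in_knn[i, j]: x_j in Phi_i
    denom = (K * in_knn).sum(axis=1, keepdims=True)          # sum_{k in Phi_i} K(x_i, x_k)
    W = 0.5 * (K / denom + K / denom.T)                      # Eq. (2): both normalisations
    W[~(in_knn | in_knn.T)] = 0.0                            # keep only KNN connections
    return W
```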

3.2 Multi-view Graph Fusion

The graph matrices of each view $\{\mathbf{W}^v\}_{v=1}^{m}$ can be constructed according to Eq. (2) individually. To integrate multiple graphs into a consensus graph, [21] proposed an adaptive view fusion strategy to determine the weight of each view; however, they ignored the impact of noise on the graph structure. In order to learn a more robust consistent graph, we learn the global graph $\mathbf{G}$ by

$$\min_{\mathbf{G}} \sum_{v=1}^{m} \alpha^v \|\mathbf{G} + \mathbf{E}^v - \mathbf{W}^v\|_F^2 + \mu \|\mathbf{E}^v\|_1 \quad \text{s.t. } \mathbf{G}\mathbf{1} = \mathbf{1},\ \mathbf{G} \geq 0,\ rank(\mathbf{L}_G) = n - c \quad (3)$$

where $\mathbf{E}^v \in \mathbb{R}^{n \times n}$ is the error matrix of each view, used to capture the noise. The constraint $\mathbf{G} \geq 0$ restricts the graph to be non-negative, and $\mathbf{G}\mathbf{1} = \mathbf{1}$ constrains each row to sum to 1. $\mu$ is a trade-off parameter. $\mathbf{L}_G = \mathbf{D} - \mathbf{G}$ is the Laplacian matrix of the graph, where $\mathbf{D}$ is a diagonal matrix whose $i$-th diagonal element is $\sum_j G_{ij}$, and $c$ is the number of clusters. According to Theorem 1, the weights $\alpha$ are determined automatically.


Theorem 1. If the weights $\alpha$ are fixed, solving the following two problems is equivalent:

$$\min_{\mathbf{G}} \sum_{v=1}^{m} \|\mathbf{G} - \mathbf{B}^v\|_F + \Theta(\varsigma, \mathbf{G}) \quad (4)$$

$$\min_{\mathbf{G}} \sum_{v=1}^{m} \alpha^v \|\mathbf{G} - \mathbf{B}^v\|_F^2 + \Theta(\varsigma, \mathbf{G}) \quad (5)$$

where $\mathbf{B}^v = \mathbf{W}^v - \mathbf{E}^v$, $\varsigma$ is the Lagrange multiplier, and $\Theta(\varsigma, \mathbf{G})$ is the formalized term derived from the constraints.

Proof. Taking the derivative of Eq. (4) with respect to $\mathbf{G}$ and setting it to zero, we have

$$\frac{\partial \sum_{v=1}^{m} \|\mathbf{G} - \mathbf{B}^v\|_F}{\partial \mathbf{G}} + \frac{\partial \Theta(\varsigma, \mathbf{G})}{\partial \mathbf{G}} = 0
\;\Leftrightarrow\; \sum_{v=1}^{m} \frac{1}{2\|\mathbf{G} - \mathbf{B}^v\|_F} \cdot \frac{\partial \|\mathbf{G} - \mathbf{B}^v\|_F^2}{\partial \mathbf{G}} + \frac{\partial \Theta(\varsigma, \mathbf{G})}{\partial \mathbf{G}} = 0
\;\Leftrightarrow\; \sum_{v=1}^{m} \alpha^v \frac{\partial \|\mathbf{G} - \mathbf{B}^v\|_F^2}{\partial \mathbf{G}} + \frac{\partial \Theta(\varsigma, \mathbf{G})}{\partial \mathbf{G}} = 0 \quad (6)$$

where

$$\alpha^v = \frac{1}{2\|\mathbf{G} - \mathbf{B}^v\|_F} = \frac{1}{2\|\mathbf{G} + \mathbf{E}^v - \mathbf{W}^v\|_F} \quad (7)$$

If $\alpha$ is fixed, the derivative of the Lagrange function of Eq. (4) is equal to that of Eq. (5); thus Eq. (4) is equivalent to Eq. (5). The weights $\alpha$ are also determined by Eq. (7).

Theorem 2. The number of connected components in the graph associated with the similarity matrix $\mathbf{G}$ is equal to the multiplicity of the eigenvalue zero of the Laplacian matrix $\mathbf{L}_G$.

According to Theorem 2 [2,16], if the rank constraint $rank(\mathbf{L}_G) = n - c$ on the Laplacian matrix is satisfied, then the graph associated with $\mathbf{G}$ has exactly $c$ connected components. Directly solving the rank-constrained problem is NP-hard; according to Ky Fan's Theorem [6], we can solve the following instead:

$$\min_{\mathbf{F}} Tr(\mathbf{F}^T \mathbf{L}_G \mathbf{F}) \quad \text{s.t. } \mathbf{F} \in \mathbb{R}^{n \times c},\ \mathbf{F}^T \mathbf{F} = \mathbf{I} \quad (8)$$

3.3 Preserving Information by Auto-encoder

In general, a single-layer linear auto-encoder is composed of one encoder and one decoder. The encoder projects the input data onto the hidden layer, while the decoder projects the data from the hidden layer back into the feature space. Specifically, given a data matrix $\mathbf{X} \in \mathbb{R}^{n \times d}$ composed of $n$ feature row vectors of dimension $d$, the input data is projected with a projection matrix $\mathbf{M} \in \mathbb{R}^{d \times k}$, leading to its representation in a latent space of dimensionality $k$. Subsequently, the latent representation is projected back to the feature space via another projection matrix $\mathbf{M}^* \in \mathbb{R}^{k \times d}$, resulting in the reconstructed data $\hat{\mathbf{X}} \in \mathbb{R}^{n \times d}$. Moreover, $k < d$, which means the latent representation reduces the dimensionality of the original data. The general form of the objective function for minimizing the reconstruction error can be expressed as

$$\min_{\mathbf{M}, \mathbf{M}^*} \|\mathbf{X} - \mathbf{X}\mathbf{M}\mathbf{M}^*\|_F^2 \quad (9)$$

Existing methods based on matrix decomposition learn a low-rank projection matrix to project data from the original feature space into a low-dimensional embedding space. In this way, the discriminative information in the original feature space is retained and used for clustering. However, methods based on a one-direction projection cannot guarantee that the information in the original input data is fully utilized [10]. To make the embedding space semantically meaningful, we force the embedding space to be the representation space of the linear auto-encoder. We therefore propose to learn the projection between the input data and the graph representation using a linear auto-encoder as follows:

$$\min_{\mathbf{M}, \mathbf{M}^*} \|\mathbf{X} - \mathbf{X}\mathbf{M}\mathbf{M}^*\|_F^2 \quad \text{s.t. } \mathbf{X}\mathbf{M} = \mathbf{F} \quad (10)$$

To further simplify the model, we tie the weights as $\mathbf{M}^* = \mathbf{M}^T$ [20] and relax the constraint into a soft one. The objective can then be rewritten as

$$\min_{\mathbf{M}^v} \sum_{v=1}^{m} \|\mathbf{X}^v \mathbf{M}^v - \mathbf{F}\|_F^2 + \lambda \|\mathbf{X}^v - \mathbf{F}\mathbf{M}^{vT}\|_F^2 \quad (11)$$

where $\mathbf{M}^v \in \mathbb{R}^{d_v \times c}$ is the projection matrix of each view and $\lambda$ is a weighting coefficient that controls the importance of the first and second terms, which correspond to the losses of the encoder and decoder respectively. In this way, the input matrix can not only be projected into the graph representation, but also be reconstructed using the learned projection matrix, so that the optimized graph representation $\mathbf{F}$ maintains both the local structure in the similarity graph and the main information in the original data.
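The per-view encoder/decoder terms in Eq. (11) can be written compactly as below; this is a plain NumPy sketch with tied weights (M* = M^T), and the function name is ours.

```python
# Sketch: encoder and decoder losses of Eq. (11) for one view.
import numpy as np

def autoencoder_terms(Xv, Mv, F, lam):
    encoder_loss = np.linalg.norm(Xv @ Mv - F, 'fro') ** 2      # ||X^v M^v - F||_F^2
    decoder_loss = np.linalg.norm(Xv - F @ Mv.T, 'fro') ** 2    # ||X^v - F M^{vT}||_F^2
    return encoder_loss + lam * decoder_loss
```

Tying the weights keeps the model lightweight and makes the decoder reuse the encoder's projection, which is what allows the closed-form update of M^v derived later in Eq. (20).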

3.4 Overall Learning Framework

Combining Eq. (3), Eq. (8) and Eq. (11), we derive the following optimization function:

$$\min_{\mathbf{G}, \mathbf{E}^v, \mathbf{F}, \mathbf{M}^v} \sum_{v=1}^{m} \left(\alpha^v \|\mathbf{G} + \mathbf{E}^v - \mathbf{W}^v\|_F^2 + \mu \|\mathbf{E}^v\|_1\right) + \beta\, Tr(\mathbf{F}^T \mathbf{L}_G \mathbf{F}) + \sum_{v=1}^{m} \gamma \left(\|\mathbf{X}^v \mathbf{M}^v - \mathbf{F}\|_F^2 + \lambda \|\mathbf{X}^v - \mathbf{F}\mathbf{M}^{vT}\|_F^2\right)$$
$$\text{s.t. } \mathbf{G}\mathbf{1} = \mathbf{1},\ \mathbf{G} \geq 0,\ \mathbf{F}^T \mathbf{F} = \mathbf{I} \quad (12)$$

where $\beta$ and $\gamma$ are trade-off parameters.

4 Optimization

In this section, we describe the optimization process for Eq. (12) in detail. The objective function is a non-convex optimization problem, so we use an alternating optimization method to solve it: when one variable is updated, the other variables remain fixed.

4.1 Subproblem of G

Fixing $\mathbf{E}^v$, $\mathbf{F}$ and $\mathbf{M}^v$, the optimization problem for $\mathbf{G}$ becomes

$$\min_{\mathbf{G}} \sum_{v=1}^{m} \alpha^v \|\mathbf{G} + \mathbf{E}^v - \mathbf{W}^v\|_F^2 + \beta\, Tr(\mathbf{F}^T \mathbf{L}_G \mathbf{F}) \quad \text{s.t. } \mathbf{G}\mathbf{1} = \mathbf{1},\ \mathbf{G} \geq 0 \quad (13)$$

According to the properties of the Laplacian matrix [22], the trace term is further equivalent to

$$Tr(\mathbf{F}^T \mathbf{L}_G \mathbf{F}) = \frac{1}{2} \sum_{i,j=1}^{n} G_{ij} \|f_i - f_j\|_2^2 = \sum_{i,j=1}^{n} G_{ij}\, \eta_{ij} \quad (14)$$

where $\eta_{ij} = \frac{1}{2}\|f_i - f_j\|_2^2$. Problem (13) can then be solved for each row independently. For convenience, we denote $\omega_i^v$ as the $i$-th row of $\mathbf{W}^v - \mathbf{E}^v$ and rewrite Eq. (13) in vector form:

$$\min_{g_i} \left\| g_i - \frac{\sum_{v=1}^{m} \alpha^v \omega_i^v - \frac{\beta}{2}\eta_i}{\sum_{v=1}^{m} \alpha^v} \right\|_2^2 \quad \text{s.t. } \|g_i\|_1 = 1,\ g_i \geq 0 \quad (15)$$

This problem can be solved efficiently by the algorithm proposed in [5]. Here we give the general form of Eq. (15) and its corresponding procedure in Algorithm 1:

$$\min_{g} \frac{1}{2}\|g - z\|_2^2 \quad \text{s.t. } \sum_i g_i = 1,\ g \geq 0 \quad (16)$$


Algorithm 1. Procedure for solving Eq. (16)
Input: a vector z
Output: g
1: sort z into y: $y_1 \geq y_2 \geq \cdots \geq y_n$
2: find $\rho = \max\{ j \in [n] : y_j - \frac{1}{j}(\sum_{r=1}^{j} y_r - 1) > 0 \}$
3: define $\theta = \frac{1}{\rho}(\sum_{i=1}^{\rho} y_i - 1)$
4: $g_i = \max\{z_i - \theta, 0\}$
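A NumPy sketch of Algorithm 1 (Euclidean projection onto the probability simplex, following the sort-and-threshold procedure of [5]) is given below; the function name is ours.

```python
# Sketch: project a vector onto the probability simplex (Algorithm 1).
import numpy as np

def project_to_simplex(z):
    y = np.sort(z)[::-1]                                     # step 1: sort in descending order
    css = np.cumsum(y) - 1.0
    rho = np.nonzero(y - css / np.arange(1, len(z) + 1) > 0)[0][-1] + 1   # step 2
    theta = css[rho - 1] / rho                               # step 3
    return np.maximum(z - theta, 0.0)                        # step 4

# Each row g_i of G in Eq. (15) is obtained by projecting its target vector
# onto the simplex with this routine.
```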

4.2 Subproblem of $\mathbf{E}^v$

As for solving $\mathbf{E}^v$, the problem can be formulated as

$$\min_{\mathbf{E}^v} \sum_{v=1}^{m} \left(\alpha^v \|\mathbf{G} + \mathbf{E}^v - \mathbf{W}^v\|_F^2 + \mu \|\mathbf{E}^v\|_1\right) \quad (17)$$

It can be solved separately for each view, and it has a closed-form solution given by soft thresholding [4]:

$$\mathbf{E}^{v*} = sgn(\mathbf{W}^v - \mathbf{G})\, \max\!\left\{|\mathbf{W}^v - \mathbf{G}| - \frac{\mu}{2\alpha^v},\ 0\right\} \quad (18)$$
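The closed-form update in Eq. (18) is a one-line element-wise soft-thresholding operation, as sketched below in NumPy (the function name is ours).

```python
# Sketch: soft-thresholding update of the error matrix E^v (Eq. 18).
import numpy as np

def update_E(Wv, G, mu, alpha_v):
    R = Wv - G
    return np.sign(R) * np.maximum(np.abs(R) - mu / (2.0 * alpha_v), 0.0)
```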

4.3 Subproblem of $\mathbf{M}^v$

By fixing the other variables, the optimization for $\mathbf{M}^v$ can be derived as

$$\min_{\mathbf{M}^v} \sum_{v=1}^{m} \|\mathbf{X}^v \mathbf{M}^v - \mathbf{F}\|_F^2 + \lambda \|\mathbf{X}^v - \mathbf{F}\mathbf{M}^{vT}\|_F^2 \quad (19)$$

Setting the derivative of Eq. (19) with respect to $\mathbf{M}^v$ to 0, we obtain the solution

$$\mathbf{M}^v = (\mathbf{X}^{vT}\mathbf{X}^v + \lambda \mathbf{I}_{d_v})^{-1}(1 + \lambda)\mathbf{X}^{vT}\mathbf{F} = \mathbf{H}^v \mathbf{F} \quad (20)$$

where $\mathbf{I}_{d_v}$ is the identity matrix of size $d_v \times d_v$.

Subproblem of F

By substituting Eq. (20) into the objective function of F, we have:  T v min T r(F (βLG + μ V v=1 K )F) F

K

v

vT

=H

vT

X

v

v

T

s.t. F F = I vT

X H − 2(1 + λ)H

vT

X

vT

+ λH

v

(21)

H

As Kv and Hv depend on Xv and λ , they can be calculated just for once. F can be solved by simple eigenvalue decomposition. In summary, the complete optimization process is listed in Algorithm 2.

Robust Multi-view Spectral Clustering with Auto-encoder

463

Algorithm 2. Procedure for solving Eq. (12) Input: data matrices {Xv }m v=1 , parameters β, μ, γ, λ Output: Embedding matrix results m F, clustering v v v 1 1: Initialization: G = m v=1 W , E = 0, random initialize M , F 2: repeat 3: Update αv based on Eq. (7) 4: Update G based on Eq. (13) 5: Update Ev based on Eq. (18) 6: Update Mv based on Eq. (20) 7: Update F based on Eq. (21) T T 8: until Ft Ft − Ft−1 Ft−1 F < 10−4 9: Run K-means on F

4.5

Complexity Analysis

The optimization process of the proposed algorithm is achieved through alternating optimization methods. Assuming the number of iterations until convergence is T , each iteration of the algorithm involves updating five variables. The computational complexity of updating weighting coefficients αv is O(n2 ). The computational complexity of updating G is O(n2 ). The computational complexity of updating Ev is O(n2 V ). The computational complexity of updating Mv and F is O(cndV ) and O(n3 ). The complexity of the algorithm mainly comes from the matrix eigenvalue decomposition, which is required in the Spectral clustering algorithm. Therefore, the total time complexity of the proposed method is O(T n3 ), which is comparable to state-of-the-art multi-view clustering methods [1,9,17,21,28,30]. Table 1. Benchmark datasets

5

Dataset

Samples Clusters Views Feature numbers

BBCSport

116

5

4

1991,2063,2113,2158

3Sources

169

6

3

3560,3631,3068

MSRC

210

7

5

24,576,512,256,254

NGs

500

5

3

2000,2000,2000

Caltech101-20 2386

7

6

48,40,254,1984,512,928

Hdigit

10

2

784,256

10000

Experiments

In this section, we evaluate our proposed method on six widely-used datasets, whose detailed information is introduced in Table 1. Moreover, convergence and parameter sensitivity analysis of our proposed method is also tested on benchmark datasets.

464

5.1

X. Wang et al.

Comparison Algorithms and Evaluation Metrics

In our experiments, we evaluate the proposed method by comparing with eight state-of-the-art multi-view clustering methods: Parameter-Free AutoWeighted Multiple Graph Learning (AMGL) [17] can automatically learn the weights of each view and achieve weighted fusion. Graph Learning for Multi-view Clustering (MVGL) [30] proposes an optimization framework using Laplacian rank constraints to learn graph matrices from multiple views. Self-weighted Multi-view Clustering(SwMC) [18] learns a graph with exactly c connected components. Large-Scale Multi-View Spectral Clustering(MVSC) [12] use bipartite graphs to solve large-scale problems. Multi-View Clustering in Latent Embedding Space(MCLES) [1] cluster the multi-view data in a learned latent embedding space. Graph-Based Multi-View Clustering (GMC) [24] learns a unified graph matrix and in turn improves the data graph matrix of each view. Flexible Multi-view Spectral Clustering With Self-Adaptation (FMSCS) [21] combines the adaptive graph learning and feature forward projection method. Bidirectional Probabilistic Subspaces Approximation for Multi-view Clustering (BPSA) [28] proposes to learn consistent low-dimensional embeddings from both similarity matrix and feature matrix simultaneously. To demonstrate the effectiveness of information preserving with auto-encoder, we removed auto-encoder term from the objective function (12) as another comparison method named RMVSC for comparison, the objective function of RMVSC is: min v

G,E

,F,Mv

m 

v=1

(αv G + Ev − Wv 2F + μ Ev 1 ) + βT r(FT LG F)

(22)

s.t.&G1 = 1, G ≥ 0, FT F = I

Equation (22) is also solved by alternating minimization method. 5.2

Experimental Setup

In our experiment, we set up the parameter in a grid searching manner. In our method and comparison methods, the number of KNN is fixed as 14. For a fair comparison, all the parameters in comparison methods will be tuned when optimal results are achieved. In this paper, we use accuracy (ACC), normalized mutual information (NMI), and purity [23,27] to comprehensively evaluate the performance. Higher value indicates better performance. 5.3

Experimental Results and Analysis

Table 2 shows the performance of different clustering evaluation measurements on six datasets for our proposed method and the eight SOTA methods. As indicated by Table 2, our proposed method demonstrates superior clustering performance over other methods on all datasets. Specifically, RMVSC-AE outperforms all comparison models in ACC, NMI, and Purity metrics in majority of cases. Remarkably, for bbcsport dataset, our method exhibits about 6.89%,

Robust Multi-view Spectral Clustering with Auto-encoder

465

Table 2. Clustering Performance AMGL MVGL SwMC MVSC MCLES GMC

FMSCS BPSA

RMVSC RMVSC-AE

NMI 3Sources

0.5908

0.3661 0.0766 0.0780 0.5921

0.6216 0.6195

0.6960

0.6748

BBCsport

0.6772

0.3811 0.0805 0.4092 0.5180

0.4771 0.6987

0.7186

0.7120

0.7240 0.8267

MSRC

0.7386

0.5942 0.5314 0.3143 0.7915

0.7709 0.8112

0.7392

0.7981

0.8268

NGs

0.9722

0.9165

0.3279 0.1330 0.4859 0.8802

0.9392 0.9530

0.5532

0.9218

Caltech101-20 0.5445

0.4349 0.5000 0.2672 0.5950

0.4809 0.6587 0.6499

0.5051

0.6585

Hdigit

0.9479

0.9857 0.5416 0.9001 −1

0.9939 0.9308

0.7427

0.9279

0.9943

3Sources

0.6805

0.0045 0.3456 0.3846 0.6864

0.6923 0.6941

0.7746

0.7811

0.7870

BBCsport

0.7069

0.0323 0.3259 0.5345 0.6293

0.5603 0.7250

0.7845

0.8190

0.8534

MSRC

0.7362

0.5136 0.5871 0.2762 0.8810

0.7476 0.8433

0.8286

0.8762

0.8905

NGs

0.9760

0.9920

ACC

0.9740

0.0075 0.2694 0.4040 0.9600

0.9820 0.9860

0.6240

Caltech101-20 0.5337

0.2751 0.5167 0.3759 0.4520

0.4564 0.6263

0.6388 0.5348

0.5796

Hdigit

0.9155

0.9954 0.6695 0.9001 -

0.9981 0.9735

0.6681

0.9724

0.9982

3Sources

0.7337

0.1383 0.3645 0.3965 0.7396

0.7456 0.7402

0.8112

0.8166

0.8580

BBCsport

0.8190

0.2261 0.3405 0.5517 0.6638

0.5862 0.7750

0.8793

0.8879

0.9310

MSRC

0.7648

0.7087 0.6167 0.3286 0.8810

0.7905 0.8433

0.8286

0.8762

0.8905

NGs

0.9920

Purity

0.9740

0.1201 0.2774 0.4080 0.9600

0.9820 0.9860

0.6660

0.9760

Caltech101-20 0.6539

0.5653 0.6236 0.3831 0.7310

0.5549 0.7578

0.7683

0.6513

0.7871

Hdigit

0.9954 0.5908 0.9001 -

0.9981 0.9735

0.7103

0.9724

0.9982

0.9327

The symbol ‘-’ indicates an unobtainable result owing to a running time exceeding one day.

Remarkably, for the BBCsport dataset, our method exhibits about 6.89%, 10.81% and 5.17% improvement in ACC, NMI, and Purity respectively compared to BPSA. Similarly, for the MSRC dataset, our method achieves improvements of about 0.95%, 1.56%, and 0.95% respectively over MCLES. On the NGs dataset, we observe a similar trend, with improvements of about 0.6%, 1.92%, and 0.6% in ACC, NMI, and Purity respectively compared to MCLES. The benchmark methods already achieve high performance on the Hdigit dataset, but our method still improves it further. For the Caltech101-20 dataset, although our method does not always achieve the optimal performance under the three metrics, the results are still comparable to the other methods. It should be noted that the clustering results are enhanced significantly compared with RMVSC on all datasets; such results indicate that important information can indeed be preserved with the auto-encoder.

5.4 Ablation Study

Parameter Analysis: In this section, we study the effect of the parameters $\gamma$, $\lambda$, $\beta$ and $\mu$ on the proposed method. $\beta$ is varied in $\{10^{-6}, 10^{-4}, 10^{-2}, 1, 10^{2}, 10^{4}, 10^{6}\}$ and the range of $\mu$ is $\{10^{-2}, 10^{-1}, 1, 10^{1}, 10^{2}\}$. Both $\gamma$ and $\lambda$ vary in $\{10^{-4}, 10^{-2}, 1, 10^{2}, 10^{4}\}$. We first fix $\gamma$ and $\lambda$ to 1 and analyze the impact of $\beta$ and $\mu$. Figure 1 shows the influence of the parameters $\beta$ and $\mu$ in terms of NMI. For the parameter $\beta$, we can see that when its value is too large, the Laplacian matrix has too many zero eigenvalues and the correct graph cannot be learned, resulting in performance degradation. The best NMI values are achieved when $\beta$ is in $[0.01, 1]$. The parameter $\mu$ performs relatively stably over a wide range, and the optimal NMI value is achieved with a different $\mu$ for each dataset, because different datasets have different levels of noise.

Fig. 1. Parameter sensitivity analysis of $\beta$ and $\mu$ on the six datasets ((a) 3Sources, (b) BBCSport, (c) MSRC, (d) NGs, (e) Caltech101-20, (f) Hdigit) with $\gamma$ and $\lambda$ fixed to 1; the z-axis is the value of normalized mutual information (NMI).

Then, we fix $\beta$ and $\mu$ to 1 and analyze the impact of $\gamma$ and $\lambda$; the results are shown in Fig. 2. It is apparent that the impact of the two parameters differs across datasets because of the diversity of the data information. On the MSRC and Hdigit datasets, the performance is better when the value of $\lambda$ is large and the value of $\gamma$ is small. On the Caltech101-20 dataset, a large value of $\gamma$ gives better performance, while on the 3Sources dataset the opposite is true.

Fig. 2. Parameter sensitivity analysis of $\lambda$ and $\gamma$ on the six datasets ((a) 3Sources, (b) BBCSport, (c) MSRC, (d) NGs, (e) Caltech101-20, (f) Hdigit) with $\beta$ and $\mu$ fixed to 1; the z-axis is the value of normalized mutual information (NMI).


Graph Visualization: To demonstrate the influence of the sparse regularization $\|\cdot\|_1$ in RMVSC-AE, we compare the consistent similarity graphs learned by RMVSC-AE with and without the $L_1$ regularization. To make the effect of the sparse regularization clearly visible, we perform the comparison on the four small-scale datasets. When the data samples are sorted by class label, an ideal similarity graph matrix should exhibit a block-diagonal structure. As indicated by Fig. 4, the similarity graph matrix generated by RMVSC-AE with the $L_1$ term has less noise and a clearer block structure. These results illustrate that the $L_1$ norm is helpful in mitigating sparse noise in consensus graph learning.

Fig. 3. Convergence curves of the proposed method on the six datasets ((a) 3Sources, (b) BBCSport, (c) MSRC, (d) Caltech101-20, (e) NGs, (f) Hdigit); the x-axis is the number of iterations and the y-axis is the stop criterion.

5.5 Convergence Analysis

To verify that the proposed algorithm converges, we conduct a convergence analysis on all datasets. Figure 3 shows the convergence curves of RMVSC-AE; the x and y axes represent the number of iterations and the value of $\|\mathbf{F}_t\mathbf{F}_t^T - \mathbf{F}_{t-1}\mathbf{F}_{t-1}^T\|_F$ respectively. It is apparent that the solution of $\mathbf{F}$ decreases smoothly as the number of iterations increases. Furthermore, our algorithm converges within 20 iterations on all datasets, which also guarantees the efficiency of the proposed method.

Fig. 4. Visualization of the similarity graph matrix G learned by RMVSC-AE with (w.) and without (w.o.) $L_1$ regularization on the benchmark datasets: (a) 3Sources w., (b) 3Sources w.o., (c) BBCsport w., (d) BBCsport w.o., (e) MSRC w., (f) MSRC w.o., (g) NGs w., (h) NGs w.o.

6 Conclusion and Future Work

In this article, we propose a robust multi-view spectral clustering with auto-encoder for preserving information (RMVSC-AE). RMVSC-AE constructs graphs with a Gaussian kernel to capture the non-linear data structure. By introducing an $L_1$ regularization term, sparse noise in the consensus graph is removed and the robustness of the learned similarity graph is enhanced. Moreover, incorporating an auto-encoder between the feature space and the graph embedding space preserves the discriminative information in the input data and further improves the clustering performance. We derive an efficient alternating minimization algorithm to solve the proposed optimization function. Extensive experiments on real-world datasets illustrate the effectiveness of our method.

Additionally, since we have established a projection between the feature space and the embedding space, the learned projection can be used to address the out-of-sample issue [7,8]. When a new sample arrives, the projection can be applied to transform the new sample's features into a low-dimensional representation, and its class label can then be obtained by computing the distances between this representation and the class centers. In future work, we will investigate how to extend our proposed method to deal with the out-of-sample issue and incremental multi-view datasets.

Acknowledgements. This work is supported by Basic and Applied Basic Research Foundation of Guangzhou (2023A04J1682) and the Guangdong Provincial Key Laboratory of Human Digital Twin (2022B1212010004).

References
1. Chen, M.S., Huang, L., Wang, C.D., Huang, D.: Multi-view clustering in latent embedding space. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 34, pp. 3513–3520 (2020)
2. Chung, F.R.: Spectral Graph Theory, vol. 92. American Mathematical Soc., Providence (1997)
3. Dalal, N., Triggs, B.: Histograms of oriented gradients for human detection. In: 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2005), vol. 1, pp. 886–893. IEEE (2005)
4. Donoho, D.L.: De-noising by soft-thresholding. IEEE Trans. Inf. Theory 41(3), 613–627 (1995)
5. Duchi, J., Shalev-Shwartz, S., Singer, Y., Chandra, T.: Efficient projections onto the l1-ball for learning in high dimensions. In: Proceedings of the 25th International Conference on Machine Learning, pp. 272–279 (2008)
6. Fan, K.: On a theorem of Weyl concerning eigenvalues of linear transformations I. Proc. Natl. Acad. Sci. 35(11), 652–655 (1949)
7. Huang, S., Ota, K., Dong, M., Li, F.: MultiSpectralNet: spectral clustering using deep neural network for multi-view data. IEEE Trans. Comput. Soc. Syst. 6(4), 749–760 (2019)
8. Kang, Z., Lin, Z., Zhu, X., Xu, W.: Structured graph learning for scalable subspace clustering: from single view to multiview. IEEE Trans. Cybernetics 52(9), 8976–8986 (2021)
9. Kang, Z., et al.: Partition level multiview subspace clustering. Neural Netw. 122, 279–288 (2020)
10. Kodirov, E., Xiang, T., Gong, S.: Semantic autoencoder for zero-shot learning. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 3174–3183 (2017)
11. Kumar, A., Rai, P., Daume, H.: Co-regularized multi-view spectral clustering. In: Advances in Neural Information Processing Systems, vol. 24 (2011)
12. Li, Y., Nie, F., Huang, H., Huang, J.: Large-scale multi-view spectral clustering via bipartite graph. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 29 (2015)
13. Liu, J., et al.: MPC: multi-view probabilistic clustering. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 9509–9518 (2022)
14. Lowe, D.G.: Distinctive image features from scale-invariant keypoints. Int. J. Comput. Vis. 60, 91–110 (2004)
15. Maier, M., Luxburg, U., Hein, M.: Influence of graph construction on graph-based clustering measures. In: Advances in Neural Information Processing Systems, vol. 21 (2008)
16. Mohar, B., Alavi, Y., Chartrand, G., Oellermann, O.: The Laplacian spectrum of graphs. Graph Theory Comb. Appl. 2(871–898), 12 (1991)
17. Nie, F., Li, J., Li, X.: Parameter-free auto-weighted multiple graph learning: a framework for multiview clustering and semi-supervised classification. In: IJCAI, pp. 1881–1887 (2016)
18. Nie, F., Li, J., Li, X.: Self-weighted multiview clustering with multiple graphs. In: IJCAI, pp. 2564–2570 (2017)
19. Nie, F., Wang, X., Jordan, M., Huang, H.: The constrained Laplacian rank algorithm for graph-based clustering. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 30 (2016)
20. Ranzato, M., Boureau, Y.L., Cun, Y.: Sparse feature learning for deep belief networks. In: Advances in Neural Information Processing Systems, vol. 20 (2007)
21. Shi, D., Zhu, L., Li, J., Cheng, Z., Zhang, Z.: Flexible multiview spectral clustering with self-adaptation. IEEE Trans. Cybern. 53, 2586–2599 (2021)
22. Von Luxburg, U.: A tutorial on spectral clustering. Stat. Comput. 17, 395–416 (2007)
23. Wang, C.D., Lai, J.H., Philip, S.Y.: Multi-view clustering based on belief propagation. IEEE Trans. Knowl. Data Eng. 28(4), 1007–1021 (2015)
24. Wang, H., Yang, Y., Liu, B.: GMC: graph-based multi-view clustering. IEEE Trans. Knowl. Data Eng. 32(6), 1116–1129 (2019)
25. Wang, H., Yang, Y., Liu, B., Fujita, H.: A study of graph-based system for multi-view clustering. Knowl.-Based Syst. 163, 1009–1019 (2019)
26. Wang, H., Yao, M., Jiang, G., Mi, Z., Fu, X.: Graph-collaborated auto-encoder hashing for multi-view binary clustering. arXiv preprint arXiv:2301.02484 (2023)
27. Wang, Q., Tao, Z., Xia, W., Gao, Q., Cao, X., Jiao, L.: Adversarial multiview clustering networks with adaptive fusion. IEEE Trans. Neural Netw. Learn. Syst. 34, 7635–7647 (2022)
28. Wu, D., Dong, X., Cao, J., Wang, R., Nie, F., Li, X.: Bidirectional probabilistic subspaces approximation for multiview clustering. IEEE Trans. Neural Netw. Learn. Syst. (2022)
29. Zelnik-Manor, L., Perona, P.: Self-tuning spectral clustering. In: Advances in Neural Information Processing Systems, vol. 17 (2004)
30. Zhan, K., Zhang, C., Guan, J., Wang, J.: Graph learning for multiview clustering. IEEE Trans. Cybernet. 48(10), 2887–2895 (2017)

Learnable Color Image Zero-Watermarking Based on Feature Comparison

Baowei Wang1,2,3(B), Changyu Dai4, and Yufeng Wu2

1 Engineering Research Center of Digital Forensics Ministry of Education, Nanjing University of Information Science and Technology, 210044 Nanjing, China
2 School of Computer Science, Nanjing University of Information Science and Technology, 210044 Nanjing, China
3 Jiangsu Collaborative Innovation Center of Atmospheric Environment and Equipment Technology (CICAEET), Nanjing University of Information Science and Technology, 210044 Nanjing, China
4 School of Software, Nanjing University of Information Science and Technology, 210044 Nanjing, China
[email protected], {20211221006,20211220033}@nuist.edu.cn

Abstract. Zero-watermarking is one of the solutions for protecting the copyright of color images without tampering with them. Existing zero-watermarking algorithms either rely on static classical techniques or employ pre-trained deep learning models, which limits the adaptability of zero-watermarking to complex and dynamic environments; these algorithms are prone to fail when encountering novel or complex noise. To address this issue, we propose a self-supervised anti-noise learning color image zero-watermarking method that leverages feature matching to achieve lossless protection of images. In our method, we use a learnable feature extractor and a baseline feature extractor and compare the features extracted by both. Moreover, we introduce a combined weighted noise layer to enhance the robustness against combined noise attacks. Extensive experiments show that our method outperforms other methods in terms of effectiveness and efficiency.

Keywords: Zero-watermarking · Learnable · Feature comparison · Self-supervised

1 Introduction

In the era of web technology, digital multimedia is the main information carrier and is widely disseminated through various social media platforms. However, this also brings challenges to the protection of intellectual property rights and the prevention of unauthorized modifications. For instance, some malicious actors may tamper with the photos of political figures and spread them online [1]. Therefore, robust watermarking techniques are needed to provide a secure way of delivering information in an open network environment.


Nevertheless, traditional robust watermarking methods are not suitable for all images. Some images, such as remote sensing images, medical images, professional photography images, etc., have high requirements for image quality and details. Embedding a large amount of watermark information into these images may cause distortion and affect their usability. For example, a distorted medical image may lead to a wrong diagnosis by the doctor and cause serious consequences. Zero-watermarking is a special watermarking technique that creates a logical association between the image and the watermark without modifying the original image. Figure 1 illustrates the generation process of classical zero-watermarking.

Fig. 1. The generation process of classical zero-watermarking.
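To make the pipeline in Fig. 1 concrete, the following is a minimal sketch of one classical variant, assuming a DCT low-frequency feature and a binary watermark; the feature choice, block size, and all names here are illustrative rather than the specific construction of any cited method.

```python
# Sketch: classical zero-watermark generation (feature hash XOR watermark bits).
import numpy as np
from scipy.fftpack import dct

def generate_zero_watermark(gray_image, watermark_bits):
    coeffs = dct(dct(gray_image.astype(float), axis=0, norm='ortho'),
                 axis=1, norm='ortho')
    feat = coeffs[:8, :8].flatten()[:watermark_bits.size]        # low-frequency block as feature
    feature_hash = (feat > np.median(feat)).astype(np.uint8)     # binarise around the median
    return np.bitwise_xor(feature_hash, watermark_bits)          # registered zero-watermark

# Verification XORs the stored zero-watermark with the hash of a suspect image to
# recover an estimate of the watermark, without ever modifying the host image.
```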

Existing zero-watermarking methods have poor robustness to large distortions and cannot adapt to complex scenarios with multiple attacks. Therefore, we propose a zero-watermarking algorithm that is “learnable”. This means that one can simply retrain the model to achieve robustness against a new type of noise, instead of designing a new algorithm from scratch.

2 Related Work

Wen et al. [2] proposed a zero-watermarking algorithm based on the DCT transform in 2001. After that, many others have developed different traditional zerowatermarking algorithms based on the DCT transform [3–10]. However, all these conventional methods rely heavily on shallow hand-crafted image features, which suggests that they do not fully use the original image and therefore have many limitations in terms of robustness. Shao et al. [11] proposed a robust watermarking scheme based on orthogonal Fourier-Melling moments and chaotic mapping


in 2016 to achieve simultaneous copyright authentication of dual images. In 2022, to address the problems of leakage of patients’ personal information, theft, and tampering of cloud-transmitted medical images, Fang et al. [12] proposed a bandelet and discrete cosine transform (bandelet - dct) zero-watermarking algorithm for medical images. In 2023, Lang et al. [13] proposed a zero-watermarking algorithm based on compression-aware salient features. The algorithm segments the original image into non-overlapping blocks and then constructs a dislocated block Hadamard ensemble (SBHE) matrix as the sensing matrix for each block. These heuristics are effective in the domains for which they are designed, but they are fundamentally static. Deep learning has revolutionized many fields of computer science, including image processing and watermarking. In recent years, researchers have explored the use of deep learning methods to enhance the performance and robustness of zero-watermarking algorithms [14,15]. For example, Fierro et al. [14] proposed a robust zero-watermarking algorithm based on convolutional neural networks (CNNs), where the CNN generates robust intrinsic features of an image and combines them with the owner’s watermark sequence using XOR operation. Han et al. [15] extracted the deep feature maps of medical images using pretrained VGG19 and fused them into feature images. Then, they used an average perceptual hashing algorithm to generate a set of 64-bit binary perceptual hash values based on the low-frequency part of the medical image feature matrix. Finally, they used the scrambled watermarked image and the hash values to obtain a robust zero-watermarking. However, these methods tend to use pretrained models as feature extractors, rather than exploit the learning ability of neural networks to adapt to different types of images. Moreover, they are still vulnerable to some geometric attacks such as cropping, JPEG compression, and rotation. Existing zero-watermarking methods only conduct robust experiments for single distortion, but the transmission of images often faces many complex scenarios, and images often suffer from multiple distortions. These methods have poor robustness in dealing with scenarios with multiple distortions. To address the downstream task of color image zero-watermarking, we design a new model using a baseline feature extractor and a learnable feature extractor for feature comparison has been designed, which is a relatively lightweight network that accepts images from the noise layer after the attack. This model aims to maximize the robustness by making the output of this lightweight network as close as possible to the baseline feature extractor. This method is the first zero-watermarking algorithm that can dynamically adjust to different distortions, and also the first one that can handle combined noise attacks and novel noise attacks. HiDDeN [16] first introduced the concept of noise layers and applied it to an end-to-end watermarking system with good results. Jia et al. used an improved noise layer in MBRS [17] that greatly enhanced the robustness of JPEG compression. We introduced this method to combined noise and weighted it according to the effect of the noise, obtaining good robustness to combined noise.


In summary, the contributions are listed as follows:
1. We designed a "learnable" zero-watermarking algorithm to cope with multiple complex scenarios. This method can easily adapt to new requirements, since we directly optimize for the objectives of interest. For zero-watermarking, one can simply retrain the model to obtain robustness against new types of noise when facing new complex scenarios, rather than inventing a new algorithm.
2. A weighted mini-batch combined noise layer is proposed to combat multiple attacks and improve the model's generality.
3. Various experimental results show that the scheme has excellent performance in resisting large distortion and multiple distortions.

Fig. 2. An overview of the general framework and the process of feature comparison.

3 Methods

We aim to develop a learnable and easily portable model for color image zero-watermarking. To enhance the robustness of the image against new types of noise attacks, the only requirement is to incorporate that type of noise into the training process. To this end, our model consists of four main components: a baseline feature extractor E, a parameter-free noise layer N, a learnable feature extractor Eθ, and a feature comparator C. Additionally, we use a zero-watermarking scrambling encoder Ekey and a zero-watermarking decoder Dkey to encrypt and decrypt the watermarking information. θ is a parameter that can be trained, and the key is used to encrypt and decrypt the watermarking information.

The baseline feature extractor E receives the host image Xho of shape C × H × W and extracts the host image feature Fi. The noise layer N receives the host image Xho, adds noise to it, and outputs a noisy image Xno. The learnable feature extractor Eθ receives Xno and extracts the image feature Fi' from the noisy image. The zero-watermarking scrambler Ekey receives Fi and a binary cipher Min of length L or shape h × w to generate a zero-watermarking W. Dkey receives W and restores the previously input binary cipher or secret image Mout according to the key. The simplest and most direct way to achieve this is to make Eθ and E output features that are as identical as possible. To do this, we train Eθ to minimize the following loss over the same distribution:

$$\mathbb{E}_{(X_{ho},\, M_{in})}\left[ L_F(F_i, F_i') \right] = \left\| F_i - F_i' \right\|^2 / L \qquad (1)$$
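To make the training objective in Eq. (1) concrete, the following is a minimal PyTorch sketch of one training step under our reading of the paper; the names baseline_E, learnable_E, and noise_layer are placeholders rather than the authors' actual modules, and the mean-squared error stands in for the normalized squared-L2 loss.

```python
import torch
import torch.nn.functional as F

def train_step(host_batch, baseline_E, learnable_E, noise_layer, optimizer):
    """One step of the feature-comparison objective (Eq. 1):
    make E_theta(noisy image) match E(host image)."""
    with torch.no_grad():
        target_feat = baseline_E(host_batch)      # F_i from the frozen baseline E
    noisy_batch = noise_layer(host_batch)         # N(X_ho): attacked images
    pred_feat = learnable_E(noisy_batch)          # F_i' from the learnable E_theta
    loss = F.mse_loss(pred_feat, target_feat)     # ~ ||F_i - F_i'||^2 / L
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return loss.item()
```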

Figure 2 shows the process of feature comparison. Classical zero-watermarking algorithms are effective against low-intensity and conventional distortion, but they are fundamentally static. Therefore, this paper uses a neural network to extract the feature maps of the images, which are more stable and robust than the general high- and low-order continuous orthogonal moments (FoRhFMs/loRhFMs). In this paper, it is assumed that the host image Xho = {f(x, y), 0 ≤ x, y < N} and that Min = {m(p, q), 0 ≤ p < P, 0 ≤ q < Q} is a Logo image. Our proposed zero-watermarking algorithm is divided into two parts: zero-watermarking generation and extraction. Figure 2 shows an overview of the general algorithm.

3.1 Generation of Zero-Watermarking

The process of generating a zero-watermarking is to use a neural network to extract features of the host image and construct a zero-watermarking from them. The method is as follows.

Generation of Chaotic Sequences. Our algorithm uses the cat mapping [18] to encrypt the Logo image. In this paper, Min = {m(p, q), 0 ≤ p < P, 0 ≤ q < Q} is a Logo image, which is cat-mapped to construct a chaotic sequence Cn+1 of length P × Q:

$$C_{n+1} = \begin{bmatrix} p_{n+1} \\ q_{n+1} \end{bmatrix} = \begin{bmatrix} 1 & b \\ a & ab+1 \end{bmatrix} \begin{bmatrix} p_n \\ q_n \end{bmatrix} \bmod (N) \qquad (2)$$

where a, b, and N are positive integers; the values of a and b can be randomly generated or specified, and they are the keys of the zero-watermarking scrambling codec. When P = Q, N is the width of the matrix, and mod is the remainder function.

Binarization of Chaotic Matrices. The chaotic sequence C is binarized to obtain the binary chaotic matrix Bn:

$$B_n = \begin{cases} 1, & C_{n+1} \ge T_1 \\ 0, & C_{n+1} < T_1 \end{cases} \qquad (3)$$

where T1 is the binarization threshold; this paper takes the average value of the elements of Cn+1 as T1.

Construction of Feature Matrix. The host image Xho is fed into the feature extractor for feature extraction. A two-dimensional discrete Fourier transform is performed to obtain a feature matrix An of dimension P × Q. For a detailed description of the feature extractor and the neural network, see Sect. 3.3.

Binarization of the Feature Matrix. The obtained feature matrix An is binarized to get the binary feature matrix Zn:

$$Z_n = \begin{cases} 1, & A_n \ge T_2 \\ 0, & A_n < T_2 \end{cases} \qquad (4)$$

where T2 is the binarization threshold; this paper takes the average value of the elements of An as T2.

Generation of Zero-Watermarking. An XOR operation is performed on the chaotic matrix B and the feature matrix Z to obtain the final zero-watermarking W = B ⊕ Z.
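As an illustration of the generation steps above, here is a minimal NumPy sketch assuming a square Logo image (so that the cat map of Eq. (2) applies directly); the key values and the single scrambling iteration are illustrative assumptions, not the authors' settings.

```python
import numpy as np

def arnold_cat_scramble(logo, a, b, iterations=1):
    """Scramble a square binary logo with the cat map of Eq. (2)."""
    n = logo.shape[0]
    assert logo.shape[0] == logo.shape[1], "the cat map sketch assumes a square logo"
    out = logo.copy()
    for _ in range(iterations):
        scrambled = np.empty_like(out)
        for p in range(n):
            for q in range(n):
                p_new = (p + b * q) % n
                q_new = (a * p + (a * b + 1) * q) % n
                scrambled[p_new, q_new] = out[p, q]
        out = scrambled
    return out

def binarize(m, thr=None):
    """Threshold at the matrix mean, as in Eqs. (3) and (4)."""
    thr = m.mean() if thr is None else thr
    return (m >= thr).astype(np.uint8)

def generate_zero_watermark(feature_map, logo, a=1, b=1):
    B = binarize(arnold_cat_scramble(logo, a, b))   # binary chaotic matrix
    Z = binarize(feature_map)                       # binary feature matrix
    return np.bitwise_xor(B, Z)                     # W = B xor Z
```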

3.2 Extraction of Zero-Watermarking

The extraction process of zero-watermarking refers to retrieving the Logo image Mout from the noise-affected image Xno = {fno(ẋ, ẏ), 0 ≤ ẋ, ẏ < N} to determine the copyright attribution of Xno. The detailed extraction process is as follows:

Resize the Image. As the noise we encounter may include cropping or panning, the size and position of Xno may differ from Xho. The image is pre-processed in three steps:

1) Equal scale scaling:

$$\mathrm{scale} = \min\left( \frac{x}{\dot{x}},\ \frac{y}{\dot{y}} \right) \qquad (5)$$

2) Pan the center of the image to the coordinate origin in the upper left corner:

$$M = \begin{bmatrix} \mathrm{scale} & 0 & -\mathrm{scale} \times \frac{\dot{x}}{2} + \frac{x}{2} \\ 0 & \mathrm{scale} & -\mathrm{scale} \times \frac{\dot{y}}{2} + \frac{y}{2} \end{bmatrix} \qquad (6)$$

3) Pan the picture to the center of gravity at the target location:

$$X_{no}' = M \begin{bmatrix} X_{no} \\ 1 \end{bmatrix} \qquad (7)$$

The obtained X'no is the equal-scale scaling of Xno, then centered, and the excess is filled in the image.
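A possible OpenCV realization of the three pre-processing steps (Eqs. (5)-(7)) is sketched below; mapping x, y to image width and height and filling the border with zeros are assumptions, not details stated in the paper.

```python
import cv2
import numpy as np

def resize_and_center(noisy_img, target_w, target_h):
    """Rescale the attacked image and center it on a canvas of the host size (Eqs. 5-7)."""
    h_dot, w_dot = noisy_img.shape[:2]                  # size of the attacked image
    scale = min(target_w / w_dot, target_h / h_dot)     # Eq. (5)
    # Affine matrix: scale, then translate so the scaled image is centered (Eq. 6)
    M = np.float32([
        [scale, 0.0, -scale * w_dot / 2 + target_w / 2],
        [0.0, scale, -scale * h_dot / 2 + target_h / 2],
    ])
    # Eq. (7): apply M; the excess border is filled (here with zeros)
    return cv2.warpAffine(noisy_img, M, (target_w, target_h))
```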

Generation of Feature Matrix. Input X'no into the feature extractor for feature extraction and get the feature matrix A*n with dimension P × Q.

Binarization of Feature Matrix. The feature matrix A*n is binarized into Z*n. Binarization follows the same process as in zero-watermarking generation.

Generation of Chaotic Matrix. An XOR operation is performed on the zero-watermarking W and the binarized feature matrix Z*n to extract the chaotic matrix:

$$C_n^{*} = W \oplus Z_n^{*} \qquad (8)$$

Disruption Reduction. Using the previously generated keys a and b, construct the original matrix of dimension P × Q:

$$C_n^{*} = \begin{bmatrix} p_n \\ q_n \end{bmatrix} = \begin{bmatrix} 1 & b \\ a & ab+1 \end{bmatrix}^{-1} \begin{bmatrix} p_{n+1} \\ q_{n+1} \end{bmatrix} \bmod (N) \qquad (9)$$

Extracting the Binary Logo Image. Binarize C*n and export it to the binary Logo image Mout, then compare Min and Mout to determine the ownership of Xno. The binarization process is the same as in the zero-watermarking generation process.

3.3 Feature Extractor

The features of the image are extracted using a neural network. However, there are two problems in this process: 1) Baseline: how do we measure whether the features extracted by a neural network are good? 2) Robustness: how do we make the extracted features robust to noise?


Fig. 3. Structure of baseline feature extractor and learnable feature extractor.

To address these two problems, we propose a two-step solution. First, we combine a baseline feature extractor with a learnable feature extractor. Second, we introduce a noise layer into the training process and train the model to resist noise. As shown in Fig. 3, we use DenseNet121 [19] as the backbone network to build our baseline feature extractor E, and MobileNet V3 small [20] as the backbone network to build our learnable feature extractor Eθ. We change the fully connected layer in the last layer to output features and drop the SoftMax function. The reason for choosing the MobileNet network as the learnable feature extractor is that we hope the smaller model can learn the feature extraction ability of the larger model through this feature comparison learning method, achieving better results. The loss function is shown in Eq. 1. Our experiments confirm that this method is effective.
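The two backbones described above could be wired up roughly as follows; this is a sketch assuming torchvision >= 0.13 for the pretrained-weight enums, and the 1024-dimensional output head on the MobileNet branch is an assumption chosen to match the DenseNet feature width.

```python
import torch
import torch.nn as nn
from torchvision import models

class BaselineExtractor(nn.Module):
    """Frozen DenseNet121 trunk used as the baseline feature extractor E."""
    def __init__(self):
        super().__init__()
        backbone = models.densenet121(weights=models.DenseNet121_Weights.DEFAULT)
        self.features = backbone.features            # conv trunk, 1024 output channels
        for p in self.parameters():
            p.requires_grad = False                   # baseline stays fixed during training
    def forward(self, x):
        f = torch.relu(self.features(x))
        return torch.flatten(nn.functional.adaptive_avg_pool2d(f, 1), 1)   # (B, 1024)

class LearnableExtractor(nn.Module):
    """MobileNetV3-small trunk E_theta; the classifier is replaced by a feature head, no softmax."""
    def __init__(self, feat_dim=1024):
        super().__init__()
        backbone = models.mobilenet_v3_small(weights=models.MobileNet_V3_Small_Weights.DEFAULT)
        self.features = backbone.features
        self.avgpool = backbone.avgpool
        self.head = nn.Linear(576, feat_dim)          # 576 = channels of the last conv stage
    def forward(self, x):
        f = self.avgpool(self.features(x))
        return self.head(torch.flatten(f, 1))         # (B, feat_dim)
```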

3.4 Combined Noisy Layer

In the actual scene, Xno is likely to be attacked by several different noises simultaneously, so our model needs to resist both single and combined noise. For this we design a weighted mini-batch combined noise layer. In each mini-batch, we give different selection weights to the different noises (the higher the weight, the higher the probability of selection), and the weights are chosen according to how strongly each noise affects the model. For example, we give a weight of 0.1 to the noiseless layer (called the identity layer), 0.4 to the differentiably simulated JPEG layer, and 0.5 to the Crop layer in each mini-batch. Because the impact of the Crop attack is significantly greater than that of the JPEG attack, such a design helps the model find a good global solution and achieve strong resistance to hybrid attacks. A differentiable JPEG compression layer is needed because the DCT quantization steps of JPEG compression are not differentiable. Considering the portability of model training, we use the differentiable JPEG compression [17] function Jpeg-Mask.

Note that all non-identity noise layers have a scalar hyperparameter that controls the intensity of the distortion: Crop has a scaling p; Gaussian Filtering (GF) has a kernel width σ² and a window size ksize; JPEG compression (JPEG) has a quality factor Q; Median Filtering (MF) has a window size ksize; Rotation noise has a rotation angle R.
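A minimal sketch of the weighted mini-batch selection is given below; the crop and JPEG attacks are simplified placeholders (a central crop and an identity stand-in for the differentiable Jpeg-Mask), so only the weighted sampling itself reflects the paper's description.

```python
import random
import torchvision.transforms.functional as TF

def identity(x):
    return x

def crop_attack(x, p=0.5):
    # keep a p-fraction central crop, then resize back (illustrative crop noise)
    _, h, w = x.shape[-3:]
    ch, cw = int(h * p), int(w * p)
    top, left = (h - ch) // 2, (w - cw) // 2
    return TF.resize(TF.crop(x, top, left, ch, cw), [h, w], antialias=True)

def jpeg_mask_attack(x):
    # placeholder for the differentiable Jpeg-Mask simulation used in the paper
    return x

NOISES = [identity, jpeg_mask_attack, crop_attack]
WEIGHTS = [0.1, 0.4, 0.5]        # example selection weights from the text

def sample_noise_layer():
    """Pick one noise per mini-batch, with probability proportional to its weight."""
    return random.choices(NOISES, weights=WEIGHTS, k=1)[0]
```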

4 Experiments

4.1 Implementation Details

We conduct our experiments on the COCO dataset [21], which contains a large number of natural images with various objects and scenes. We randomly select 10,000 224 × 224 images from the COCO dataset for model training, 5,000 224 × 224 images for model validation, and another 5,000 224 × 224 images for model testing. The model is implemented using PyTorch and has been trained and verified on an NVIDIA RTX 4000. We use the Adam optimizer throughout the experiments, set the learning rate to 10⁻³, and train for a total of 100 epochs.

Table 1. Algorithm robustness comparative experimental results (Average NCC from 5,000 224 × 224 test images).

Common attacks   Intensity of attacks                              Shao [11]  Fang [12]  Lang [13]  Fierro [14]  Han [15]  Ours
JPEG (Q)         30                                                0.9725     0.9631     0.9455     0.9486       0.9433    0.9758
                 50                                                0.9756     0.9726     0.9642     0.9696       0.9781    0.9896
                 70                                                0.9766     0.9874     0.9784     0.9873       0.9862    0.9931
MF (ksize)       3                                                 0.9849     0.9969     0.9987     0.9737       0.9872    0.9985
                 5                                                 0.9837     0.9731     0.9894     0.9712       0.9645    0.9921
                 7                                                 0.9796     0.9624     0.9732     0.9848       0.9359    0.9899
Rotation (R)     30                                                0.9565     0.9754     0.9636     0.9091       0.7598    0.9712
                 45                                                0.9387     0.9436     0.9431     0.8772       0.7128    0.9693
GN (σ²)          0.001                                             0.9736     0.9917     0.9983     0.9696       0.9891    0.9891
                 0.01                                              0.9674     0.9726     0.9776     0.9242       0.9749    0.9829
Crop (p)         0.9                                               0.9768     0.9091     0.9873     0.8394       0.9425    0.9936
                 0.7                                               0.9627     0.8726     0.9754     0.8242       0.9179    0.9896
                 0.5                                               0.8706     0.7903     0.9621     0.8091       0.8819    0.9841
                 Upper left corner cropping 1/4                    0.8329     0.8775     0.9626     0.5644       0.7631    0.9703
GF (σ² = 0.2)    ksize = 1                                         0.9754     0.9747     0.9793     0.9631       0.9894    0.9903
                 ksize = 2                                         0.9660     0.9649     0.9616     0.9606       0.9721    0.9864
                 ksize = 4                                         0.9472     0.8927     0.9594     0.9416       0.9732    0.9807
RGB to Gray      –                                                 0.8106     0.7175     0.7382     0.8014       0.8919    0.9663
Combined Noise   Crop (p = 0.5), JPEG (Q = 50), MF (ksize = 7)     0.7146     0.7063     0.6661     0.7705       0.8369    0.9156
                 GF (ksize = 5), MF (ksize = 5), JPEG (Q = 80)     0.6086     0.5686     0.7011     0.7389       0.7208    0.8578
                 GN (σ² = 0.001), MF (ksize = 7), Rotation (R = 30) 0.7681    0.6624     0.6452     0.7628       0.7422    0.8968

4.2 Metrics

We use the most commonly used Normalized Cross-Correlation (NCC) as the evaluation indicator: the higher the NCC, the higher the extraction accuracy. To make our experimental results more reliable, we compare our method with Shao [11], Fang [12], Lang [13], Fierro [14], and Han [15], because their goal is also to explore better zero-watermarking models. All the methods use the same 5,000 224 × 224 images from the COCO dataset for comparison, and the Logo image is a 64 × 64 binary image.
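For reference, one common definition of NCC that matches this usage is sketched below; the paper does not spell out its exact NCC variant, so this formulation is an assumption.

```python
import numpy as np

def ncc(w_ref, w_ext):
    """Normalized cross-correlation between the original and extracted watermarks."""
    a = w_ref.astype(np.float64).ravel()
    b = w_ext.astype(np.float64).ravel()
    denom = np.sqrt((a * a).sum()) * np.sqrt((b * b).sum())
    return float((a * b).sum() / denom) if denom > 0 else 0.0
```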

4.3 Results

Table 1 compares our method with the five methods above. From Table 1, we can see that our model has excellent robustness. The other methods did not consider these combined noises when they were designed, so their robustness against them is very poor. The existing zero-watermarking methods also did not consider the RGB-to-Gray noise in their experiments, although this attack is very common in reality. As Table 1 shows, these methods are not very robust to it because they did not take the change in the number of channels into account in their design. In contrast, our method can achieve good robustness by simply adding the corresponding noise layer during training. With simple training, our method can handle complex and novel distortions and achieve remarkable robustness. By learning feature comparison and leveraging the powerful capability of neural networks, our method can effectively resist rotation, cropping, and combined noise attacks. Moreover, our model can maintain or even enhance its performance under other types of noise attacks.

Table 2. Robust performance of combined noise layer.

Metric  No noise  JPEG (Q = 70)  Rotation (R = 3)  Crop (p = 0.7)  GF (σ² = 0.2, ksize = 7)  MF (ksize = 5)
NCC     1.0       0.98964        0.99804           0.99844         0.96347                   0.99782

Table 2 shows the robustness to single noise attacks after training with the combined noise layer. We apply Crop (p = 0.5), JPEG (Q = 50), and MF (ksize = 7) as the combined noise layer. Our method is especially robust even to attacks it was not trained on: as Table 2 shows, we did not include Rotation or GF distortion in the combined noise layer, but our method still resisted these distortions.

4.4 Ablation Study

To ensure the reliability and integrity of the experimental results, we conducted ablation experiments on the model. We added Crop (p = 0.7) to the evaluation for better observation. The experimental results in Table 3 show that our model achieves the best performance when all components are combined. Using only the baseline feature extractor or only the learnable feature extractor is equivalent to using a pre-trained model for feature extraction, which is less effective than using dual feature extractors for feature comparison. We train our model with a combined noise layer of Crop (p = 0.5), JPEG (Q = 50), and MF (ksize = 7). When we use the weighted mini-batch combined noise layer, we assign Crop a weight of 0.6, JPEG a weight of 0.2, and MF a weight of 0.2. The results demonstrate that the model's robustness to Crop is significantly enhanced after weighting.

Table 3. Ablation experiment results of the proposed method.

Baseline feature extractor  Learnable feature extractor  Noise layer  Weighted mini-batch  NCC
✓                                                                                          0.93076
                            ✓                                                              0.83776
✓                           ✓                            ✓                                 0.97732
✓                           ✓                            ✓            ✓                    0.99782

5 Conclusion

We developed a deep learning zero-watermarking framework for color images. Compared with the classical zero-watermarking algorithms, this work can flexibly cope with different types of noise by changing the parameters or the noise layer during training. Instead of merely using pre-trained models to extract features, the proposed method truly exploits the powerful learning ability of neural networks. Ultimately, it enables us to deal with complex and novel noise by directly incorporating new distortions into the training process, without the need to design new specialized algorithms. In future work, we hope to see robustness against more types of image distortions (such as other lossy compression schemes) and apply the framework to other input domains (such as audio and video).

References 1. Zhou, Z., Wang, Y., Wu, Q.J., Yang, C.N., Sun, X.: Effective and efficient global context verification for image copy detection. IEEE Trans. Inf. Forensics Secur. 12(1), 48–63 (2016) 2. Wen, Q., Sun, T., Wang, S.: Based zero-watermark digital watermarking technology. In: Proceedings of the 3rd National Conference in Information Hiding, vol. 109. Xidian University Press, Xian (2001) 3. Chen, T.H., Horng, G., Lee, W.B.: A publicly verifiable copyright-proving scheme resistant to malicious attacks. IEEE Trans. Industr. Electron. 52(1), 327–334 (2005) 4. Chang, C.C., Lin, P.Y.: Adaptive watermark mechanism for rightful ownership protection. J. Syst. Softw. 81(7), 1118–1129 (2008) 5. Tsai, H.H., Tseng, H.C., Lai, Y.S.: Robust lossless image watermarking based on α-trimmed mean algorithm and support vector machine. J. Syst. Softw. 83(6), 1015–1028 (2010) 6. Tsai, H.H., Lai, Y.S., Lo, S.C.: A zero-watermark scheme with geometrical invariants using SVM and PSO against geometrical attacks for image protection. J. Syst. Softw. 86(2), 335–348 (2013) 7. Rawat, S., Raman, B.: A blind watermarking algorithm based on fractional Fourier transform and visual cryptography. Sig. Process. 92(6), 1480–1491 (2012) 8. Thanh, T.M., Tanaka, K.: An image zero-watermarking algorithm based on the encryption of visual map feature with watermark information. Multimedia Tools Appl. 76(11), 13455–13471 (2017)


9. Zou, B., Du, J., Liu, X., Wang, Y.: Distinguishable zero-watermarking scheme with similarity-based retrieval for digital rights management of fundus image. Multimedia Tools Appl. 77(21), 28685–28708 (2018) 10. Kang, X., Lin, G., Chen, Y., Zhao, F., Zhang, E., Jing, C.: Robust and secure zero-watermarking algorithm for color images based on majority voting pattern and hyper-chaotic encryption. Multimedia Tools Appl. 79(1), 1169–1202 (2020) 11. Shao, Z., Shang, Y., Zhang, Y., Liu, X., Guo, G.: Robust watermarking using orthogonal Fourier-Mellin moments and chaotic map for double images. Sig. Process. 120, 522–531 (2016) 12. Fang, Y., et al.: Robust zero-watermarking algorithm for medical images based on SIFT and Bandelet-DCT. Multimedia Tools Appl. 81(12), 16863–16879 (2022). https://doi.org/10.1007/s11042-022-12592-x 13. Lang, J., Ma, C.: Novel zero-watermarking method using the compressed sensing significant feature. Multimedia Tools Appl. 82(3), 4551–4567 (2023) 14. Fierro-Radilla, A., Nakano-Miyatake, M., Cedillo-Hernandez, M., Cleofas-Sanchez, L., Perez-Meana, H.: A robust image zero-watermarking using convolutional neural networks. In: 2019 7th International Workshop on Biometrics and Forensics (IWBF), pp. 1–5. IEEE (2019) 15. Han, B., Du, J., Jia, Y., Zhu, H.: Zero-watermarking algorithm for medical image based on VGG19 deep convolution neural network. J. Healthcare Eng. 2021, 1–12 (2021) 16. Zhu, J., Kaplan, R., Johnson, J., Fei-Fei, L.: HiDDeN: hiding data with deep networks. In: Ferrari, V., Hebert, M., Sminchisescu, C., Weiss, Y. (eds.) ECCV 2018. LNCS, vol. 11219, pp. 682–697. Springer, Cham (2018). https://doi.org/10. 1007/978-3-030-01267-0 40 17. Jia, Z., Fang, H., Zhang, W.: MBRS: enhancing robustness of DNN-based watermarking by mini-batch of real and simulated JPEG compression. In: Proceedings of the 29th ACM International Conference on Multimedia, pp. 41–49 (2021) 18. Dyson, F.J., Falk, H.: Period of a discrete cat mapping. Am. Math. Mon. 99(7), 603–614 (1992) 19. Huang, G., Liu, Z., Van Der Maaten, L., Weinberger, K.Q.: Densely connected convolutional networks. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 4700–4708 (2017) 20. Howard, A., et al.: Searching for MobileNetV3. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 1314–1324 (2019) 21. Lin, T.-Y., et al.: Microsoft COCO: common objects in context. In: Fleet, D., Pajdla, T., Schiele, B., Tuytelaars, T. (eds.) ECCV 2014. LNCS, vol. 8693, pp. 740–755. Springer, Cham (2014). https://doi.org/10.1007/978-3-319-10602-1 48

P-IoU: Accurate Motion Prediction Based Data Association for Multi-object Tracking

Xinya Wu and Jinhua Xu(B)

School of Computer Science and Technology, East China Normal University, Shanghai, China
[email protected]

Abstract. Multi-object tracking in complex scenarios remains a challenging task due to objects' irregular motions and indistinguishable appearances. Traditional methods often approximate the motion direction of objects solely based on their bounding box information, leading to cumulative noise and incorrect association. Furthermore, the lack of depth information in these methods can result in failed discrimination between foreground and background objects due to the perspective projection of the camera. To address these limitations, we propose a Pose Intersection over Union (P-IoU) method to predict the true motion direction of objects by incorporating body pose information, specifically the motion of the human torso. Based on P-IoU, we propose PoseTracker, a novel approach that combines bounding box IoU and P-IoU effectively during association to improve tracking performance. Exploiting the relative stability of the human torso and the confidence of keypoints, our method effectively captures the genuine motion cues, reducing identity switches caused by irregular movements. Experiments on the DanceTrack and MOT17 datasets demonstrate that the proposed PoseTracker outperforms existing methods. Our method highlights the importance of accurate motion prediction of objects for data association in MOT and provides a new perspective for addressing the challenges posed by irregular object motion.

Keywords: Multi-object tracking · Intersection over Union (IoU) · Tracking by Detection · Motion Prediction

1 Introduction

Multi-object tracking is a fundamental task in computer vision that aims to locate and track object instances across a sequence of frames in a video. This task has widespread applications including video surveillance, traffic monitoring, and autonomous driving. Recent years have witnessed significant progress in multi-object tracking research, particularly in the tracking-by-detection paradigm (TBD) [2-4,27,32,34], which has achieved remarkable results on the traditional MOT17 dataset [17]. However, multi-object tracking in complex scenarios remains a challenging task due to occlusion, irregular object motions, and indistinguishable appearances, especially on the DanceTrack dataset [21].

In the presence of well-performing object detectors, data association becomes the most critical step in the tracking process. Many methods rely on cues such as bounding boxes and appearance features for data association. However, when faced with objects that exhibit similar appearances, the motion cue becomes more important. Merely relying on changes in the bounding boxes of the objects is insufficient to accurately determine their true motion. In complex scenarios, such as dance performances and sports activities, where intricate movements and limb extensions are prevalent, the bounding box of an object is susceptible to variations in direction and size. In such scenarios, the traditional motion prediction of the Kalman filter [26], which is solely updated based on the object's bounding box from an object detector, fails to accurately reflect the true motion of the object. Consequently, the predicted position obtained from this approach deviates significantly from the true position, leading to failures in data association.

Furthermore, the lack of depth information in 2D images poses challenges for general tracking algorithms, which often consider objects as the same if the overlap between their bounding boxes is significant. For instance, a person standing upright in the foreground and another person in the background raising their arms may have bounding boxes of the same sizes in the 2D image. This overlap could potentially lead to identity switches.

To address the aforementioned challenges, we notice that the human body exhibits a certain level of stability, particularly in the torso region. Despite the complexities in limb movements, the central body region remains relatively stable and less affected by motion variations. This stability allows us to leverage the robustness of the human torso, where the keypoints identified by human pose estimation provide valuable clues to identify the true motion direction of the object. Therefore, we propose a Pose Intersection over Union (P-IoU) method which utilizes the motion of the human body's torso as the true motion of the object. Due to the relative stability of the human body's torso, the movements and variations in the limbs do not affect the size and orientation of the body-pose box. It also addresses the front-back relationship among multiple objects by considering the body pose rather than the whole bounding box only. Based on P-IoU, we propose PoseTracker, a tracking model which combines the intersection over union (IoU) of the object's bounding boxes and P-IoU effectively during association. By leveraging the proposed methods, we effectively address the challenges posed by indistinguishable appearances and irregular motions in multi-object tracking tasks of complex visual scenarios.

In summary, our work makes two key contributions. Firstly, we address the challenge of determining the true motion direction of objects by proposing the P-IoU. Secondly, we propose PoseTracker, which combines bounding box and P-IoU effectively to enhance data association accuracy, reducing identity switches caused by irregular motions. Extensive experiments on multiple datasets demonstrate that our proposed method outperforms the previous methods. These contributions significantly improve multi-object tracking performance in complex scenarios with indistinguishable appearances and irregular motions.

2 Related Work

2.1 Multi-object Tracking

There are several traditional paradigms in multi-object tracking, including tracking-by-detection [2-4,27,32,34], tracking-by-attention [19], and tracking-by-regression [24,36]. SORT [3] employs the Kalman filter [26] for motion-based multi-object tracking using observations from a given object detector. DeepSORT [27] further integrates deep visual features [11,20] into the object association process within the SORT framework. Recently, the transformer [23] has been introduced into MOT [16,22,33,37] to learn deep representations from visual information and object trajectories, enabling end-to-end integration of both detection and tracking tasks. However, these transformer-based methods still have substantial gaps in terms of accuracy and time efficiency compared to state-of-the-art detection and tracking methods.

2.2 Motion Models

Many MOT algorithms [3,5,10,34,35] employ motion models. Some methods use motion priors, such as the Kalman filter [3,5], optical flow [29], and displacement regression [8,12], to ensure accurate distance estimations. Typically, these motion models utilize Bayesian estimation [13] to predict the next state by maximizing the posterior estimate. The Kalman filter (KF) [26] is one of the most classical motion models, a recursive Bayesian filter following the typical predict-update cycle. These MOT algorithms simply treat the object's bounding box as the prediction target for linear motion without considering whether the motion direction truly aligns with the bounding box representation. In this case, the predicted position can accumulate noise due to the changes in the bounding box caused by the object's limbs and irregular motion, resulting in significant errors from the actual position under the influence of the Kalman filter.

2.3 Tracking Clues

Nearly all MOT methods utilize motion cues as the primary basis for association, and many approaches achieve impressive results solely based on motion information. ByteTrack [34], built upon the traditional SORT [3] algorithm, achieves breakthrough results in crowded scenes by performing secondary associations on low-confidence detections. In addition to motion-based association methods, appearance embedding [11,20] has also been widely used in MOT methods, as exemplified by DeepSORT [27]. Appearance cues stored in memory banks facilitate the recovery of long-term lost and re-appeared trajectories. However, in cases where appearance information is similar and lacks discriminative features, these methods fail to perform well on DanceTrack [21]. As a fine-grained task in human detection, human pose estimation has been addressed by PoseTrack [1], which proposes a joint evaluation dataset for human pose recognition and tracking to enhance the synergy between body pose and tracking in videos. Nonetheless, most methods on this dataset primarily focus on human pose recognition. A representative method with outstanding performance, Simple baseline [29], enlarges the object's bounding box using optical flow but still emphasizes the overall motion of the bounding box, without considering the object's true motion direction.

3 Method

In this section, we introduce our P-IoU and PoseTracker for multi-object tracking. Our tracking pipeline follows the tracking-by-detection paradigm [2], where object detection is performed at each frame, and the detection results are then associated to perform object tracking.

Fig. 1. The body-pose box generation process. We obtained the detection results and keypoints of the human whole body, and selected 4 points of both shoulders and both hips as the representatives of the human torso. The smallest rectangle containing the four points is used as the body-pose box of the object. (Color figure online)

3.1 P-IoU

P-IoU stands for Pose Intersection over Union and is our main contribution in this work. To obtain the true motion of the object, we use the torso motion of the object to predict its position in the next frame, which improves the accuracy of data association by following the real motion direction of the object. We apply a well-trained object detector (i.e., YOLOX [9]) to obtain detection results at each frame, B = {b1, b2, ..., bM}, with bi = {xb, yb, wb, hb} representing the bounding box in image coordinates. These detections are then fed into the human pose recognition network Simple baseline [29] to get the N keypoints of each object, Ki = {k1, ..., kN} (Fig. 1(b)). To represent the direction of human torso motion, we choose 4 points on both shoulders and both hips, {Sl, Sr, Hl, Hr}, and connect them to obtain a quadrilateral (Fig. 1(c)). Then we normalize the quadrilateral to a rectangle by taking the smallest rectangle that includes this quadrilateral, with edges parallel to the image borders, and get the body-pose box pi = {xp, yp, wp, hp} representing the torso of each object (Fig. 1(d), red). This representation facilitates the calculation of the P-IoU and simplifies the matching process. During the association, we calculate the IoU of the object bounding box bi and the IoU of the body-pose box pi separately, then fuse the two IoU distances between trajectories and detection results:

$$\mathrm{IoU}(A, B) = \frac{A \cap B}{A \cup B} \qquad (1)$$
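A small NumPy sketch of the body-pose box construction and the IoU of Eq. (1) follows; it uses corner-format (x1, y1, x2, y2) boxes rather than the paper's (x, y, w, h) parameterization, which is an implementation choice.

```python
import numpy as np

def body_pose_box(keypoints):
    """Smallest axis-aligned rectangle covering both shoulders and both hips.
    keypoints: array of shape (4, 2) with (x, y) for [L-shoulder, R-shoulder, L-hip, R-hip]."""
    xs, ys = keypoints[:, 0], keypoints[:, 1]
    return np.array([xs.min(), ys.min(), xs.max(), ys.max()])   # (x1, y1, x2, y2)

def iou(box_a, box_b):
    """Intersection over Union of two (x1, y1, x2, y2) boxes (Eq. 1)."""
    x1, y1 = max(box_a[0], box_b[0]), max(box_a[1], box_b[1])
    x2, y2 = min(box_a[2], box_b[2]), min(box_a[3], box_b[3])
    inter = max(0.0, x2 - x1) * max(0.0, y2 - y1)
    area_a = (box_a[2] - box_a[0]) * (box_a[3] - box_a[1])
    area_b = (box_b[2] - box_b[0]) * (box_b[3] - box_b[1])
    return inter / (area_a + area_b - inter + 1e-9)
```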

Considering the case of severe occlusion, we remove detections whose body-pose boxes overlap too much in the same frame. Specifically, we calculate the IoU of all detected body-pose boxes in the current frame and set one of them to None if the IoU is above a threshold r. The box set to None can be chosen either by deleting the object with the lower confidence or by deleting the one with the smaller detection box. In our experiments, deleting the lower-confidence object gives better results.
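The occlusion filtering could look roughly like the sketch below, with r = 0.6 taken from the implementation details; the pairwise loop and the inlined IoU are illustrative, not the authors' code.

```python
def suppress_overlapping_pose_boxes(pose_boxes, scores, r=0.6):
    """Set the lower-confidence body-pose box to None when two boxes overlap more than r.
    pose_boxes: list of (x1, y1, x2, y2) tuples; scores: matching detection confidences."""
    def _iou(a, b):
        ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
        ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
        inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
        union = (a[2] - a[0]) * (a[3] - a[1]) + (b[2] - b[0]) * (b[3] - b[1]) - inter
        return inter / (union + 1e-9)

    boxes = list(pose_boxes)
    for i in range(len(boxes)):
        for j in range(i + 1, len(boxes)):
            if boxes[i] is None or boxes[j] is None:
                continue
            if _iou(boxes[i], boxes[j]) > r:
                boxes[i if scores[i] < scores[j] else j] = None   # drop the lower-confidence one
    return boxes
```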

Fig. 2. Framework of our PoseTracker. The blue arrow represents the postprocessing of the previous frame body-pose box of the trajectory on the match with the detection result. Green arrows indicate updating the trajectory or creating a new one. The grey box indicates the deletion of multiple unmatched trajectories.

3.2 Motion Prediction

We approximate the motion of objects between adjacent frames as linear motion and predict the position of objects in the next frame. The Kalman filter [26] is a recursive Bayesian filter, providing an optimal estimation of the state of a dynamic system. The filter maintains two variables, the state x and the state covariance P. In multi-object tracking, we describe the Kalman filter process with the state transition matrix F, the observation matrix H, the process noise covariance Q, and the observation noise covariance R. At each step t, given observations zt, the Kalman filter works in an alternation of predict and update stages:

$$\text{predict:} \quad \begin{cases} \hat{x}_{t|t-1} = F \hat{x}_{t-1|t-1} \\ P_{t|t-1} = F P_{t-1|t-1} F^{\top} + Q \end{cases} \qquad (2)$$

$$\text{update:} \quad \begin{cases} K_t = P_{t|t-1} H^{\top} \left( H P_{t|t-1} H^{\top} + R \right)^{-1} \\ \hat{x}_{t|t} = \hat{x}_{t|t-1} + K_t \left( z_t - H \hat{x}_{t|t-1} \right) \\ P_{t|t} = \left( I - K_t H \right) P_{t|t-1} \end{cases} \qquad (3)$$

Here $\hat{x}_{t|t-1}$ and $\hat{x}_{t|t}$ are the prior and posterior state estimates, respectively. In our work, the state is defined as the bounding box or body-pose box parameters [x, y, w, h] and their derivatives, that is,

$$x = [x, y, w, h, \dot{x}, \dot{y}, \dot{w}, \dot{h}] \qquad (4)$$

Similar to most tracking algorithms, we utilize the Kalman filter [26] to predict the coordinate box location of the trajectory in the current frame and update the Kalman filter parameters with the matched detection results, rather than directly editing the tracking results. Then we calculate the IoU of the predicted box and the detected bounding box for association.
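A minimal constant-velocity Kalman filter over the state of Eq. (4) is sketched below; the noise covariances Q and R are illustrative values, not the ones used in the paper.

```python
import numpy as np

class SimpleKalmanBoxFilter:
    """Constant-velocity Kalman filter over [x, y, w, h] and their velocities (Eqs. 2-4)."""
    def __init__(self):
        self.F = np.eye(8)                 # state transition: position += velocity each frame
        self.F[:4, 4:] = np.eye(4)
        self.H = np.eye(4, 8)              # only the box is observed, not the velocities
        self.Q = np.eye(8) * 1e-2          # process noise (illustrative)
        self.R = np.eye(4) * 1e-1          # observation noise (illustrative)
        self.x = np.zeros(8)
        self.P = np.eye(8)

    def predict(self):
        self.x = self.F @ self.x
        self.P = self.F @ self.P @ self.F.T + self.Q
        return self.x[:4]                  # predicted box used for association

    def update(self, z):
        S = self.H @ self.P @ self.H.T + self.R
        K = self.P @ self.H.T @ np.linalg.inv(S)
        self.x = self.x + K @ (z - self.H @ self.x)
        self.P = (np.eye(8) - K @ self.H) @ self.P
```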

3.3 PoseTracker

Based on P-IoU, we propose a tracking method, PoseTracker, shown in Fig. 2. PoseTracker focuses on the true direction of object motion by finding the regular motion of the human torso among the irregular motion of the object. For every frame, we acquire the bounding boxes and the body-pose boxes of all objects. Utilizing the Kalman filter, we perform motion prediction on each trajectory, enabling us to forecast its future position in the next frame. Subsequently, we associate the trajectories with the objects present in the current frame by calculating their B-IoU and P-IoU distances, where a distance means 1 − IoU, to obtain the cost matrix. During the association, only B-IoU is used in SORT [3], that is,

$$C_{i,j} = d_{i,j}^{BIoU} \qquad (5)$$

Traditional methods combine different clues by taking the element-wise minimum of the distance matrices in the final cost matrix:

$$C_{i,j} = \min\left( d_{i,j}^{BIoU},\ d_{i,j}^{clue} \right) \qquad (6)$$


Here the clue can be an appearance embedding or a motion cue such as the proposed P-IoU. We develop a new method for combining B-IoU and P-IoU information. First, as a pre-processing step, boxes that are too far apart are rejected and the weighting of the two IoUs is balanced:

$$\hat{d}_{i,j}^{PIoU} = \begin{cases} \alpha_p \cdot d_{i,j}^{PIoU}, & d_{i,j}^{PIoU} < \tau_p \wedge d_{i,j}^{BIoU} < \tau_b, \\ 1, & \text{otherwise.} \end{cases} \qquad (7)$$

Here $d_{i,j}^{BIoU}$ is the object bounding box distance between the ith predicted trajectory bounding box and the jth detection bounding box, and $d_{i,j}^{PIoU}$ is the body-pose box distance between the ith predicted trajectory body-pose box and the jth detection body-pose box, representing the true body motion cost. Due to the relatively small area occupied by the body-pose box in the image frame, a weight factor $\alpha_p$ is introduced to combine P-IoU and B-IoU. The cost matrix is re-defined as follows:

$$C_{i,j} = \min\left( d_{i,j}^{BIoU},\ \hat{d}_{i,j}^{PIoU} \right) \qquad (8)$$

To get the optimal match, we apply the Hungarian algorithm to the cost matrix C to match the trajectories and the detections. If the ith trajectory is matched with the jth detection, we set M(i, j) = 1, otherwise M(i, j) = 0. In addition, when the body-pose box of the previous frame of the trajectory is not None, we post-process the matched pairs to make sure that the detection result intersects with the previous body-pose box of the matching trajectory:

$$\hat{M}(i, j) = \begin{cases} M(i, j), & PIoU(i, j) > 0 \\ 0, & \text{otherwise} \end{cases} \qquad (9)$$

Here $\hat{M}(i, j)$ is the new matched pair after the post-processing filtering. When adjacent objects exchange positions, the post-processing $\hat{M}$ effectively reduces ID switches.

As shown in Fig. 2, two associations are conducted in PoseTracker. After the first association, some trajectories and detections may remain unmatched because, under large irregular motion, the post-processing rejects their pairing. We therefore disable post-processing and add some lower-confidence detections to the detection set for the second matching. After the second matching, we create new trajectories from the set of unmatched detections. The trajectory age is incremented for unmatched trajectories and reset for matched trajectories. Trajectories with an age above a maximum threshold are terminated to filter out false positives and inconsistent trajectories. The state of each trajectory is updated using the Kalman filter [26], which fuses the current state with the estimated motion. This process is repeated for all frames to obtain a complete set of trajectories for all moving objects.
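The gated fusion of Eqs. (7)-(8) and the Hungarian matching could be sketched as follows; α_p, τ_b, and τ_p are the values reported in the implementation details, while the 0.7 cost gate simply mirrors the stated 0.3 IoU association threshold and is otherwise an assumption.

```python
import numpy as np
from scipy.optimize import linear_sum_assignment

def fuse_costs(d_biou, d_piou, alpha_p=0.6, tau_b=0.5, tau_p=0.3):
    """Gated fusion of B-IoU and P-IoU distances (Eqs. 7-8); distances are 1 - IoU."""
    gate = (d_piou < tau_p) & (d_biou < tau_b)
    d_p_hat = np.where(gate, alpha_p * d_piou, 1.0)   # Eq. (7)
    return np.minimum(d_biou, d_p_hat)                # Eq. (8)

def associate(d_biou, d_piou, max_cost=0.7):
    """Hungarian matching on the fused cost matrix; returns (track, det) index pairs."""
    cost = fuse_costs(d_biou, d_piou)
    rows, cols = linear_sum_assignment(cost)
    return [(i, j) for i, j in zip(rows, cols) if cost[i, j] < max_cost]
```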


4 Experiments

4.1 Experimental Settings

Datasets. To evaluate the performance of our proposed object tracking method, we conducted experiments on the DanceTrack dataset [21] and the MOT17 dataset [17]. The DanceTrack dataset is a new dataset for object tracking that consists of video sequences of dancers performing various dance routines in a studio setting. The dataset contains 20 video sequences, with each sequence ranging from 30 to 60 s in length, and a total of 1,000 frames. The DanceTrack dataset is challenging due to similar appearances and irregular motions of objects. The MOT17 dataset contains 14 video sequences captured in diverse real-world settings, with challenges such as occlusions and rapid motion changes. Due to the dense flow of people, MOT17 relies more on the accuracy of detection.

Metrics. We evaluate the performance of our proposed object tracking method using the Higher Order Tracking Accuracy (HOTA) metric [15]. HOTA is a recently proposed evaluation metric that provides a comprehensive measure of object tracking performance by combining multiple performance metrics into a single score. Specifically, HOTA combines the metrics of Localization, False Positive, Missed Detection, and Identity Switch to provide a holistic measure of tracking accuracy that takes into account both the temporal and spatial accuracy of the tracked objects. In our experiments, we report the HOTA score as the primary evaluation metric and compare it against other state-of-the-art tracking methods. We also report MOTA and the individual component metrics that make up the HOTA score, which provide insights into the strengths and weaknesses of our method compared to other methods.

Implementation Details. Our method was implemented using PyTorch, and all the experiments ran on a desktop with an 11th Gen Intel(R) Core(TM) i5-11400 @ 2.60 GHz and an NVIDIA GeForce RTX 3090 GPU. To ensure a fair comparison, we used the publicly available YOLOX [9] detector, which was trained by [5,34] on DanceTrack and MOT17 separately. For human pose estimation, we utilized the MMpose library from OpenMMLab and leveraged the pre-trained weights provided by the library. We selected the Simple baseline [29] model with a Swin Transformer [14] backbone for keypoints. For the elimination of overlapping body-pose boxes in the same frame, we set the IoU threshold for deletion to 0.6. Following the common practice of SORT [3], we set the detection confidence threshold to 0.6. The IoU threshold during association is 0.3. In computing the fusion matching matrix, we set the weight αp to 0.6, and the thresholds are τb = 0.5 and τp = 0.3, respectively.

4.2 Evaluation Results

In this section, we present the experimental results of our PoseTracker and compare its performance to mainstream MOT methods on the test sets of DanceTrack [21] and MOT17 (private detections) [17], as shown in Table 1 and Table 2. Each score is obtained from previous studies or by submitting the corresponding results to official evaluation servers to ensure accurate and reliable comparisons. It is worth noting that the quality of detections significantly impacts the overall tracking performance. To ensure fairness, methods in the bottom block utilize detections generated by YOLOX [9], where the YOLOX weights for the MOT17 and DanceTrack datasets are provided by ByteTrack [34] and OC-SORT [5], respectively. On the other hand, methods in the top block may employ different detection methods for their evaluations.

On the DanceTrack test set, our method shows a significant improvement in the HOTA and AssA scores compared to other methods. Notably, DeepSORT [27], SORT [3], and ByteTrack [34] yield comparable results on the MOT17 test set. However, their tracking performance experiences a considerable drop on the DanceTrack test set due to the inclusion of more complex object movements and similar bounding box scales. For multi-object tracking with irregular motion and undistinguished appearance, the results on DanceTrack are strong evidence of the effectiveness of PoseTracker.

Table 1. Results on DanceTrack test set [21]. Methods in the bottom block share the same detections.

Tracker              HOTA↑  DetA↑  AssA↑  MOTA↑  IDF1↑
CenterTrack [36]     41.8   78.1   22.6   86.8   35.7
FairMOT [35]         39.7   66.7   23.8   82.2   40.8
QDTrack [18]         45.7   72.1   29.2   83.0   44.8
TransTrack [22]      45.5   75.9   27.5   88.4   45.2
TraDes [28]          43.3   74.5   25.4   86.2   41.2
MOTR [33]            54.2   73.5   40.2   79.7   51.5
SORT [3]             47.9   72.0   31.2   91.8   50.8
DeepSORT [27]        45.6   71.0   29.7   87.8   47.9
ByteTrack [34]       47.3   71.6   31.4   89.5   52.5
OC-SORT [5]          55.1   80.3   38.3   92.0   54.6
*StrongSORT++ [7]    55.6   80.7   38.6   91.1   55.2
C-BIoU [31]          60.6   81.3   45.4   91.6   61.6
PoseTracker (Ours)   61.3   81.9   46.7   92.1   63.5

Table 2. Results on MOT17 test set [17]. Methods in the bottom block share the same detections.

Tracker              HOTA↑  MOTA↑  IDF1↑  IDs↓   AssA↑  AssR↑
FairMOT [35]         59.3   73.7   72.3   3,303  58.0   63.6
TransCenter [30]     54.5   73.2   62.2   4,614  49.7   54.2
TransTrack [22]      54.1   75.2   63.5   3,603  47.9   57.1
GRTU [25]            62.0   74.9   75.0   1,812  62.1   65.8
QDTrack [18]         53.9   68.7   66.3   3,378  52.7   57.2
MOTR [33]            57.2   71.9   68.4   2,115  55.8   59.2
TransMOT [6]         61.7   76.7   75.1   2,346  59.9   66.5
ByteTrack [34]       63.1   80.3   77.3   2,196  62.0   68.2
OC-SORT [5]          63.2   78.0   77.5   1,950  63.2   67.5
StrongSORT [7]       63.5   78.3   78.5   1,446  63.7   68.8
PoseTracker (Ours)   64.1   79.2   79.3   1,117  63.9

Table 3. Ablation experiments on the DanceTrack validation set [21].

Experiments   B-IoU  P-IoU  PreP  PostP  HOTA↑  MOTA↑  IDF1↑
(a)           ✓                          47.8   88.2   48.3
(b)                  ✓                   51.9   86.3   50.9
(c)           ✓      ✓                   55.4   90.9   54.5
(d)           ✓      ✓            ✓      57.1   91.4   58.2
(e)           ✓      ✓      ✓            59.6   91.8   60.2
PoseTracker   ✓      ✓      ✓     ✓      60.9   91.9   63.3

Ablation Studies

We conducted ablation experiments on the DanceTrack [21] validation set to analyze the individual components of our proposed method. Specifically, we evaluated the impact of P-IoU, pre-processing and post-processing on tracking performance. In Table 3, P-IoU refers to associate objects by using it, PreP represents pre-processing in Eq. (7), and PostP denotes the post-processing step in Eq. (9) for reconfirming matched pairs using the previous frame’s P-IoU. As shown in Table 3, experiment (a) is the baseline method SORT [3], in which only B-IoU is used as tracking clue (Eq. (5)). In experiment (b), when we replace B-IoU with P-IoU, there is a significant improvement in the performance, especially in HOTA. In experiment (c), we combine B-IoU and P-IoU using the

494

X. Wu and J. Xu

traditional method in Eq. (6), the HOTA and MOTA metric is improved further. Then we add post-processing in experiment (d), and identity switches are reduced significantly. In experiment (e), we apply the pre-processing to combine the BIoU and P-IoU using Eq. (8), and the performance is better than that in (c) without pre-processing. Finally, when we use all components in our PoseTracker, the best results are achieved in all metrics. Experimental results demonstrate the effectiveness of each component in the data association process. Notably, the use of P-IoU as a matching criterion highlights the importance of capturing the true direction of object motion, particularly in scenarios with irregular motion. Furthermore, the pre-processing which removes the boxes apart before fusion of the B-IoU and P-IoU, along with the weighting scheme, enables accurate association based on the true motion direction of the objects while also allowing the bounding box to assist with the association. Moreover, the post-processing step applied to matched trajectories and detections effectively reduces the number of identity switches, thereby further enhancing tracking accuracy.

5

Conclusion

In this paper, we have observed that many existing methods simply represent the motion of objects as the motion of their bounding boxes, which leads to the accumulation of noise over time. To address this limitation and utilize the true motion of objects as a matching criterion, we propose P-IoU to derive the true motion of humans from their torsos. Building upon this concept, we propose PoseTracker, a tracking method that combines the strengths of various elements to guide data association. PoseTracker effectively addresses the challenges associated with tracking objects exhibiting irregular motion and similar appearances. Our experimental results demonstrate the effectiveness of PoseTracker on multiple datasets, showcasing its superior performance in tracking objects with irregular motion. This research opens up new avenues for the development of tracking methods tailored to handle irregular motion, providing valuable insights for future advancements in the field.

References 1. Andriluka, M., et al.: PoseTrack: a benchmark for human pose estimation and tracking. In: CVPR (2018) 2. Andriluka, M., Roth, S., Schiele, B.: People-tracking-by-detection and peopledetection-by-tracking. In: CVPR (2008) 3. Bewley, A., Ge, Z., Ott, L., Ramos, F., Upcroft, B.: Simple online and realtime tracking. In: ICIP (2016) 4. Bochinski, E., Eiselein, V., Sikora, T.: High-speed tracking-by-detection without using image information. In: AVSS (2017) 5. Cao, J., Pang, J., Weng, X., Khirodkar, R., Kitani, K.: Observation-centric sort: rethinking sort for robust multi-object tracking. In: CVPR (2023)

P-IoU: Accurate Motion Prediction

495

6. Chu, P., Wang, J., You, Q., Ling, H., Liu, Z.: TransMOT: spatial-temporal graph transformer for multiple object tracking. In: WACV (2023) 7. Du, Y., et al.: StrongSORT: make DeepSORT great again. IEEE Trans. Multimedia (2023). https://doi.org/10.1109/TMM.2023.3240881 8. Feichtenhofer, C., Pinz, A., Zisserman, A.: Detect to track and track to detect. In: ICCV (2017) 9. Ge, Z., Liu, S., Wang, F., Li, Z., Sun, J.: YOLOX: exceeding yolo series in 2021. arXiv preprint arXiv:2107.08430 (2021) 10. Han, S., Huang, P., Wang, H., Yu, E., Liu, D., Pan, X.: MAT: motion-aware multiobject tracking. Neurocomputing 473, 75–86 (2022) 11. He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: CVPR (2016) 12. Held, D., Thrun, S., Savarese, S.: Learning to track at 100 FPS with deep regression networks. In: Leibe, B., Matas, J., Sebe, N., Welling, M. (eds.) ECCV 2016. LNCS, vol. 9905, pp. 749–765. Springer, Cham (2016). https://doi.org/10.1007/978-3-31946448-0 45 13. Lehmann, E.L., Casella, G.: Theory of Point Estimation. Springer, New York (2006). https://doi.org/10.1007/b98854 14. Liu, Z., et al.: Swin transformer: hierarchical vision transformer using shifted windows. In: ICCV (2021) 15. Luiten, J.: HOTA: a higher order metric for evaluating multi-object tracking. Int. J. Comput. Vis. 129, 548–578 (2020). https://doi.org/10.1007/s11263-020-013752 16. Meinhardt, T., Kirillov, A., Leal-Taixe, L., Feichtenhofer, C.: TrackFormer: multiobject tracking with transformers. In: CVPR (2022) 17. Milan, A., Leal-Taix´e, L., Reid, I., Roth, S., Schindler, K.: MOT16: a benchmark for multi-object tracking. arXiv preprint arXiv:1603.00831 (2016) 18. Pang, J., et al.: Quasi-dense similarity learning for multiple object tracking. In: CVPR (2021) 19. Saribas, H., Cevikalp, H., K¨ op¨ ukl¨ u, O., Uzun, B.: TRAT: tracking by attention using spatio-temporal features. Neurocomputing 492, 150–161 (2022) 20. Simonyan, K., Zisserman, A.: Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556 (2014) 21. Sun, P., et al.: DanceTrack: multi-object tracking in uniform appearance and diverse motion. In: CVPR (2022) 22. Sun, P., et al.: TransTrack: multiple object tracking with transformer. arXiv preprint arXiv:2012.15460 (2020) 23. Vaswani, A., et al.: Attention is all you need. In: NeurIPS (2017) 24. Wan, X., Cao, J., Zhou, S., Wang, J., Zheng, N.: Tracking beyond detection: learning a global response map for end-to-end multi-object tracking. IEEE Trans. Image Process. 30, 8222–8235 (2021) 25. Wang, S., Sheng, H., Zhang, Y., Wu, Y., Xiong, Z.: A general recurrent tracking framework without real data. In: ICCV (2021) 26. Welch, G., Bishop, G., et al.: An introduction to the Kalman filter (1995) 27. Wojke, N., Bewley, A., Paulus, D.: Simple online and realtime tracking with a deep association metric. In: ICIP (2017) 28. Wu, J., Cao, J., Song, L., Wang, Y., Yang, M., Yuan, J.: Track to detect and segment: an online multi-object tracker. In: CVPR (2021)

496

X. Wu and J. Xu

29. Xiao, B., Wu, H., Wei, Y.: Simple baselines for human pose estimation and tracking. In: Ferrari, V., Hebert, M., Sminchisescu, C., Weiss, Y. (eds.) ECCV 2018. LNCS, vol. 11210, pp. 472–487. Springer, Cham (2018). https://doi.org/10.1007/978-3030-01231-1 29 30. Xu, Y., Ban, Y., Delorme, G., Gan, C., Rus, D., Alameda-Pineda, X.: TransCenter: transformers with dense queries for multiple-object tracking. arXiv e-prints, pp. arXiv-2103 (2021) 31. Yang, F., Odashima, S., Masui, S., Jiang, S.: Hard to track objects with irregular motions and similar appearances? Make it easier by buffering the matching space. In: WACV (2023) 32. Yu, F., Li, W., Li, Q., Liu, Yu., Shi, X., Yan, J.: POI: multiple object tracking with high performance detection and appearance feature. In: Hua, G., J´egou, H. (eds.) ECCV 2016. LNCS, vol. 9914, pp. 36–42. Springer, Cham (2016). https:// doi.org/10.1007/978-3-319-48881-3 3 33. Zeng, F., Dong, B., Zhang, Y., Wang, T., Zhang, X., Wei, Y.: MOTR: end-toend multiple-object tracking with transformer. In: Avidan, S., Brostow, G., Ciss´e, M., Farinella, G.M., Hassner, T. (eds.) Computer Vision, ECCV 2022. LNCS, vol. 13687, pp. 659–675. Springer, Cham (2022). https://doi.org/10.1007/978-3-03119812-0 38 34. Zhang, Y., et al.: ByteTrack: multi-object tracking by associating every detection box. In: Avidan, S., Brostow, G., Ciss´e, M., Farinella, G.M., Hassner, T. (eds) Computer Vision, ECCV 2022. LNCS, vol. 13682, pp. 1–21. Springer, Cham (2022). https://doi.org/10.1007/978-3-031-20047-2 1 35. Zhang, Y., Wang, C., Wang, X., Zeng, W., Liu, W.: FairMOT: the fairness of detection and re-identification in multiple object tracking. IJCV 129, 1–19 (2021) 36. Zhou, X., Koltun, V., Kr¨ ahenb¨ uhl, P.: Tracking objects as points. In: Vedaldi, A., Bischof, H., Brox, T., Frahm, J.-M. (eds.) ECCV 2020. LNCS, vol. 12349, pp. 474–490. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-58548-8 28 37. Zhou, X., Yin, T., Koltun, V., Kr¨ ahenb¨ uhl, P.: Global tracking transformers. In: CVPR (2022)

WCA-VFnet: A Dedicated Complex Forest Smoke Fire Detector Xingran Guo , Haizheng Yu(B) , and Xueying Liao ¨ umqi, China College of Mathematics and System Sciences, Xinjiang University, Ur¨ [email protected], [email protected]

Abstract. Forest fires pose a significant threat to ecosystems, causing extensive damage. The use of low-resolution forest fire imagery introduces high complexity due to its multi-scene, multi-environment, multitemporal, and multi-angle nature. This approach aims to enhance the model’s generalizability across diverse and intricate fire detection scenarios. While state-of-the-art detection algorithms like YoloX, Deformable DETR, and VarifocalNet have demonstrated remarkable performance in the field of object detection, their effectiveness in detecting forest smoke fires, especially in complex scenarios with small smoke and flame targets, remains limited. To address this issue, we propose WCA-VFnet, an innovative approach that incorporates the Weld C-A component-a method featuring shared convolution and fusion attention. Furthermore, we have curated a distinctive dataset called T-SMOKE, specifically tailored for detecting small-scale, low-resolution forest smoke fires. Our experimental results show that WCA-VFnet achieves a significant improvement of approximately 35% in average precision (AP) for detecting small flame targets compared to Deformable DETR.

Keywords: Forest fire smoke detection · Small fire objects · WCA-VFnet · Low resolution images

1 Introduction

Fires are a phenomenon that extends beyond human control in terms of duration and scale, and forest fires, in particular, exhibit a highly destructive and rapidly spreading nature. Understanding the origin of forest fires forms the basis for effective fire prevention. Once the cause of a fire is clearly identified, it becomes easier to implement a range of measures aimed at controlling and preventing fires [1]. Consequently, there is an urgent need for a robust forest smoke and fire detection tool that enables efficient and rapid localization of fire-prone areas. Such a tool is crucial in minimizing the substantial ecological and societal losses caused by forest fires. To address the issue of insufficient training data, synthetic smoke images were generated by inserting real or simulated smoke into forest backgrounds [2]. The Faster R-CNN algorithm was then employed for detection. However, this


approach incurred an increased data processing cost. The method in [3] improved the core region proposal network (RPN) of Faster R-CNN based on the RGB color model. It further utilized an ILSTM network with a color-guided anchor dropping technique to reduce the number of anchor boxes and mitigate false positive detections; however, it led to a decrease in recall rate. The approach in [4] combined Faster R-CNN with a 3D CNN and achieved higher accuracy in smoke localization and recognition. The method in [5] introduced α and β parameters to adjust the dynamic foreground regions based on an optical flow algorithm. The extracted dynamic regions were then passed to the YOLOv3 detector for smoke detection. It showed less sensitivity to wind speed during smoke detection, but faced challenges in extracting dynamic foreground regions when the smoke was far from the camera. In [6], ELASTIC blocks were integrated into the backbone network of YOLOv3 for fire detection. BoF histograms were generated for the target fire regions and passed to a random forest classifier to determine whether the region contained fire. This approach effectively detected urban fires during nighttime, reducing false detections caused by streetlights and car lights. In [7], max pooling was utilized instead of convolution, and the feature maps after pooling were fused with feature maps of the same size before pooling using deconvolution. This approach effectively leveraged shallow-level information for multi-scale feature fusion; however, shallow-level feature maps may lack sufficient representational power and robustness for detecting small objects. Overall, these approaches demonstrate significant progress in smoke and fire detection algorithms. However, challenges such as limited training data, varying environmental conditions, and the detection of small-scale smoke or fire instances still need to be addressed.

Based on the findings of previous studies, there are two primary challenges in the current detection of smoke and fire in forest environments. Firstly, the locations of fire detection can vary significantly, encompassing urban areas, buildings, residences, and forests, resulting in diverse smoke patterns and substantial variations in characteristics. Secondly, the detection of small fire incidents presents difficulties as their distinct features are less prominent, making early identification problematic. To address these issues, this paper proposes the integration of the Weld C-A component with the VarifocalNet backbone, presenting a novel method specifically designed for forest smoke and fire detection, namely WCA-VFnet. This approach offers exceptional detection speed and accuracy. Additionally, a custom dataset called T-SMOKE is introduced, comprising low-resolution forest fire data collected from diverse scenarios. By leveraging WCA-VFnet, it becomes possible to effectively detect and provide early warnings for small fire sources in complex forest scenes with limited resolution.


Fig. 1. Ours WCA-VFnet Model (The Weld C-A module consists of two stages. In the first stage, the input feature map is projected using three 1 × 1 convolutions. In the second stage, the intermediate features are employed according to two different paradigms. The features from both pathways are merged and serve as the final output. The computational complexity of each operation block is indicated in the top right corner.)

2 Related Work

Frizzi et al. [8] proposed a 9-layer convolutional neural network for classifying video frames into three categories: smoke, fire, and normal. They achieved a classification accuracy of 97.9% on a test set consisting of 1,427 fire images, 1,758 smoke images, and 2,399 normal images. However, this model was limited in detecting only red flames and exhibited suboptimal performance in detecting flames of different colors and smoke. Furthermore, it processed video frames individually without considering the motion characteristics of smoke and fire between frames. In another study, a 6-layer CNN was employed to extract static smoke texture information from original images, while the optical flow sequence was utilized to capture dynamic smoke texture information [9]. By integrating static and dynamic features of smoke texture, the detection performance in complex environments was enhanced, resulting in reduced false detection rates. This approach was specifically applied to smoke detection in complex scenes. Similarly, in [10], a motion object detection algorithm was utilized, which incorporated background dynamic updating and dark channel prior to identify suspicious smoke regions. The algorithm expanded small regions and performed feature extraction using CNN, thereby reducing the number of candidate regions and improving detection effectiveness while minimizing false positives. This method proved effective for detecting small smoke instances in multiple video scenarios. Another approach presented in [11] involved using a Gaussian mixture model to extract motion objects, followed by classification using a CNN. This method achieved higher detection accuracy compared to classical classification networks. In reference [12], HSV channels and the Harris corner detector were employed for preprocessing flame images. The preprocessed images were then fed into an


Inceptionv3 network for feature extraction, aiming to reduce false positives and false negatives. The work in [13] combined deep convolutional long short-term memory (LSTM) networks with optical flow methods for real-time fire detection; using optical flow as input improved detection performance compared to RGB models. The method in [14] utilized the VGG16 network and modified it by replacing three fully connected layers with two and adding dropout layers to prevent overfitting; removing one fully connected layer reduced computational complexity. In [15], the authors proposed the Deep Normalization Convolutional Neural Network (DNCNN), which replaced traditional convolutional layers with batch-normalized convolutional layers. This approach addressed internal covariate shift, prevented gradient vanishing, and accelerated the training process. However, batch normalization was not always effective and sometimes negatively affected performance, especially with small batch sizes. The study in [16] applied Deep Convolutional Generative Adversarial Networks (DCGAN) to fire detection to overcome issues such as overfitting, high false positive rates, and high false negative rates caused by limited datasets; however, the generated fire images were unstable, initially appearing good but becoming blurry over time. The method in [17] proposed a fire detection approach based on VGG-16, where features extracted from video frames were fed into Integration LSTM (ILSTM) units for feature fusion; the ILSTM units reduced the dimensionality of the input sequence and learned different feature representations. A forest fire detection algorithm combining two streams was presented in [18]: the TRPCA motion detection algorithm suppressed background interference, while the VGG16 network and a BLSTM network extracted spatial and temporal features, taking both foreground and background factors into account. Additionally, [19] introduced an attention module and an FPN feature fusion module based on VGG16 to enhance the detection of small smoke and smoke-like objects; however, this approach focused solely on classification and did not provide precise smoke localization information.

3 Method

3.1 Dataset and Annotation

Quality and size of datasets play a crucial role in determining the performance of deep learning models in the field of image detection. However, there is a limited availability of publicly accessible datasets specifically focused on forest smoke fires or smoke datasets suitable for forest environments. Consequently, we addressed this issue by collecting forest smoke fire images through web scraping open data, resulting in the creation of the Forest Smoke dataset (T-SMOKE dataset). Our self-constructed dataset comprises a variety of forest smoke images captured from different perspectives and scales. We manually annotated the smoke regions in the images and transformed them into COCO format [30]. In total, the dataset consists of 2036 images, randomly divided in an 8.5:1.5 ratio, with 85% of the dataset allocated for training and 15% for validation.
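For concreteness, the following is a minimal sketch of how such an 8.5:1.5 random split into COCO-style training and validation annotation files could be carried out; the file names and the helper function are illustrative assumptions rather than part of the released dataset tooling.

```python
# A hedged sketch of splitting a COCO-format annotation file 85/15 into train/val.
import json
import random

def split_coco(ann_file, train_out, val_out, train_ratio=0.85, seed=0):
    with open(ann_file) as f:
        coco = json.load(f)
    images = list(coco["images"])
    random.Random(seed).shuffle(images)          # reproducible random split
    n_train = int(round(train_ratio * len(images)))
    for subset, path in ((images[:n_train], train_out), (images[n_train:], val_out)):
        ids = {img["id"] for img in subset}
        with open(path, "w") as f:
            json.dump({"images": subset,
                       "annotations": [a for a in coco["annotations"] if a["image_id"] in ids],
                       "categories": coco["categories"]}, f)

# e.g. split_coco("t_smoke_all.json", "t_smoke_train.json", "t_smoke_val.json")
```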

3.2 WCA-VFnet

Fig. 2. Improvements in Position Shift Operations: (I) Preliminary Implementation using Tensor Shifts, (II) Efficient Implementation of Group Convolution, (III) Incorporating Learnable Kernels and Further Adaptation of Group Convolution.

Similarities Between Convolution and Self-attention. The decomposition and fusion of Convolution and Self-Attention demonstrate powerful detection performance. The core focus of this article is how to decompose and fuse the two methods. Firstly, when applying Convolution and Self-Attention to feature maps, we can observe their similarities. The first stage involves a feature learning module, where both methods share the same operation by projecting features to a deeper space using 1 × 1 convolutions. On the other hand, the second stage corresponds to the process of feature aggregation, although their learning paradigms differ. From a computational perspective, both the 1 × 1 convolution in the first stage and the self-attention module require theoretical FLOP and parameters with quadratic complexity relative to the channel size C. In contrast, both modules in the second stage are lightweight or nearly computation-free. Based on the analysis, it is evident that Convolution and Self-Attention can share the same computational cost by using shared 1 × 1 convolutions to map the input feature maps. Although extracting semantic features is crucial, the aggregation stage of Self-Attention is lightweight and does not require additional learning parameters that could cause memory issues. Fusion of Convolution and Self-attention. The fusion of Convolution and Self-Attention emerges as a compelling approach, driven by their inherent similarities. By leveraging shared 1 × 1 convolutions for both operations, we can effectively combine these two methods. To achieve this, we need to perform a single mapping and exploit the reuse of intermediate feature maps across different aggregation operations. In this study, we propose the Weld C-A module, as


illustrated in Fig. 1. The Weld C-A module consists of two stages. In the first stage, the input features are mapped using three identical 1 × 1 convolutions and subsequently reshaped into N blocks. Consequently, we obtain a set of intermediate features with 3 × N feature maps, significantly enriching the representation. Regarding Self-Attention, we group the intermediate features into N sets, with each set containing three features corresponding to queries, keys, and values, respectively. These sets follow the conventional multi-head self-attention module. For the convolution path with a kernel size of k, we utilize lightweight fully connected layers to generate k^2 feature maps. Consequently, by shifting and aggregating the generated features, we process the input features through convolution and gather information from local receptive fields, similar to conventional approaches. Finally, the outputs of both paths are combined, and the intensity of the combination is controlled by the learned coefficients α and β:

\[ F_{out} = \alpha F_{att} + \beta F_{conv} \quad (1) \]
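To make the two-path design concrete, the following PyTorch sketch mirrors the structure described above: three shared 1 × 1 projections feed a multi-head self-attention path and a lightweight convolution path, whose outputs are mixed by the learnable coefficients α and β of Eq. (1). The module name, head count and layer shapes are our assumptions for illustration, not the authors' released implementation.

```python
# Minimal sketch of the two-stage fusion: shared 1x1 projections reused by an
# attention path and a grouped-convolution path, combined with learnable weights.
import torch
import torch.nn as nn


class WeldCA(nn.Module):
    def __init__(self, channels: int, heads: int = 4, kernel_size: int = 3):
        super().__init__()
        self.heads = heads
        # Stage 1: three shared 1x1 convolutions (queries / keys / values).
        self.proj_q = nn.Conv2d(channels, channels, 1)
        self.proj_k = nn.Conv2d(channels, channels, 1)
        self.proj_v = nn.Conv2d(channels, channels, 1)
        # Convolution path: a light FC layer expands the reused projections to
        # k*k maps per channel, then a grouped conv aggregates them (Fig. 2 spirit).
        self.fc = nn.Conv2d(3 * channels, kernel_size * kernel_size * channels, 1)
        self.dw = nn.Conv2d(kernel_size * kernel_size * channels, channels,
                            kernel_size, padding=kernel_size // 2, groups=channels)
        # Learnable mixing coefficients alpha and beta of Eq. (1).
        self.alpha = nn.Parameter(torch.tensor(1.0))
        self.beta = nn.Parameter(torch.tensor(1.0))

    def forward(self, x):
        b, c, h, w = x.shape
        q, k, v = self.proj_q(x), self.proj_k(x), self.proj_v(x)
        # Attention path: standard multi-head self-attention over all positions.
        d = c // self.heads
        qh = q.view(b, self.heads, d, h * w).transpose(-1, -2)   # (b, heads, hw, d)
        kh = k.view(b, self.heads, d, h * w)                      # (b, heads, d, hw)
        vh = v.view(b, self.heads, d, h * w).transpose(-1, -2)
        attn = torch.softmax(qh @ kh / d ** 0.5, dim=-1)
        f_att = (attn @ vh).transpose(-1, -2).reshape(b, c, h, w)
        # Convolution path: reuses the same projections instead of new convolutions.
        f_conv = self.dw(self.fc(torch.cat([q, k, v], dim=1)))
        return self.alpha * f_att + self.beta * f_conv            # Eq. (1)


if __name__ == "__main__":
    print(WeldCA(64)(torch.randn(2, 64, 32, 32)).shape)  # torch.Size([2, 64, 32, 32])
```

Because the expensive 1 × 1 projections are computed once and shared, the extra cost of the second stage stays small, which is the argument made above about the two stages' computational budgets.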

3.3 Improved Offset and Summation

According to the analysis in the previous section and Fig. 1, the intermediate features in the convolution path follow the shift and summation operations performed in traditional convolution modules. Although these operations are theoretically lightweight, moving tensors in various directions can disrupt data locality and hinder vectorization. This could significantly impact the actual efficiency of our module during inference. To address this issue, we replace the inefficient tensor shifting with a depthwise convolution operation using fixed kernels, as depicted in Fig. 2 (II). Taking Shift(f, −1, −1) as an example, the computation of the shifted features is as follows:

\[ \tilde{f}_{c,i,j} = f_{c,i-1,j-1}, \quad \forall c, i, j, \quad (2) \]

where c denotes each channel of the input. We represent the 3 × 3 convolution kernel as follows:

\[ K_c = \begin{bmatrix} 1 & 0 & 0 \\ 0 & 0 & 0 \\ 0 & 0 & 0 \end{bmatrix}, \quad \forall c. \quad (3) \]

The resulting output is represented as follows:

\[ f^{(dwc)}_{c,i,j} = \sum_{p,q \in \{0,1,2\}} K_{c,p,q}\, f_{c,\, i+p-\lfloor k/2 \rfloor,\, j+q-\lfloor k/2 \rfloor} = f_{c,i-1,j-1} = \tilde{f}_{c,i,j}, \quad \forall c, i, j. \quad (4) \]
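The equivalence in Eqs. (2)–(4) can be checked numerically: a depthwise convolution with the fixed kernel of Eq. (3) reproduces the (−1, −1) tensor shift on all interior positions. The snippet below is a small sketch under that assumption, not part of the original codebase.

```python
# Verify that a fixed-kernel depthwise convolution equals a (-1, -1) shift.
import torch
import torch.nn.functional as F

def shift_via_depthwise_conv(x: torch.Tensor) -> torch.Tensor:
    """Shift every channel of x by (-1, -1) using a fixed 3x3 depthwise kernel."""
    c = x.shape[1]
    kernel = torch.zeros(c, 1, 3, 3, dtype=x.dtype)
    kernel[:, 0, 0, 0] = 1.0          # K_c[0, 0] = 1, all other entries 0 (Eq. 3)
    return F.conv2d(x, kernel, padding=1, groups=c)

x = torch.randn(1, 4, 8, 8)
shifted = torch.roll(x, shifts=(1, 1), dims=(2, 3))   # naive shift for comparison
# Interior positions agree; only the (undefined) border handling differs.
print(torch.allclose(shift_via_depthwise_conv(x)[..., 1:, 1:], shifted[..., 1:, 1:]))
```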

Therefore, by carefully designing kernel weights specific to the desired shift direction, the convolution output is equivalent to a simple tensor shift (Eq. (2)). To further integrate the features from different directions, we concatenate all the input features and convolution kernels and represent the shift operation as a


group-wise convolution, as shown in Fig. 2 (c.a). This modification enhances the computational efficiency of our module. Building upon this, we have introduced some modifications to enhance the flexibility of the module. As depicted in Fig. 2 (c.b), we relax the convolution kernels to be learnable weights, initialized with the shift kernels. This preserves the capability of the original shift operation while increasing the model's capacity. Additionally, we utilize multiple sets of convolution kernels to match the output channel dimensions of the convolution and self-attention paths, as shown in Fig. 2 (c.c).

3.4 The Computational Cost of the Weld C-A Module

In the first stage, the computational cost and trainable parameters are the same as self-attention and lighter compared to traditional convolutions (e.g., 3 × 3 convolutions). In the second stage, the Weld C-A introduces additional computational overhead with lightweight fully connected layers and group convolutions. The computational complexity is linearly dependent on the channel size C and is smaller compared to the first stage. The actual cost in the ResNet-50 model exhibits a similar trend as the theoretical analysis suggests.

3.5 Weld C-A and VFnet

The Weld C-A component we propose exhibits broad applicability in two stages. We integrate this component into the feature extraction stage of VFnet [23], followed by the FPN [22] neck stage. The final classification and regression results are produced by the VFnet-heads, as illustrated in Fig. 1. One of the key advantages lies in the continuous evolution of self-attention mechanisms, with significant research efforts focused on exploring different attention operators to further enhance model performance. For instance, Patch Attention, as proposed in [31], replaces the original softmax operation by merging information from all features within a local region as attention weights. Swin-Transformer [32] introduces a window attention mechanism that maintains a consistent receptive field within the same local window, resulting in computational cost savings and faster inference speed. It is important to note that our Weld C-A component operates independently of the self-attention mechanism, as expressed by the following formulas:

\[ A^{(Patchwise)}(q_{ij}, k_{ab}) = \phi\!\left(q_{ij}, [k_{ab}]_{a,b \in \mathcal{N}_k(i,j)}\right), \quad (5) \]

\[ A^{(Window)}(q_{ij}, k_{ab}) = \mathrm{softmax}_{a,b \in \mathcal{W}_k(i,j)}\!\left(q_{ij}^{T} k_{ab} / \sqrt{d}\right), \quad (6) \]

\[ A^{(Global)}(q_{ij}, k_{ab}) = \mathrm{softmax}_{a,b \in \mathcal{W}}\!\left(q_{ij}^{T} k_{ab} / \sqrt{d}\right), \quad (7) \]

where [·] denotes feature concatenation, φ(·) represents two linear projection layers with intermediate non-linear activation,


Wk (i, j) indicates dedicated receptive fields for each query token, and W represents the entire feature map. The resulting attention weights can be applied to the original q, k, and v calculations. For our newly established T-SMOKE dataset, which presents challenges such as low resolution, small targets, and real-time detection, the integration of the Weld C-A module into the original VFnet network ensures improved performance. By incorporating attention operators into the base model, Weld C-A enhances both attention capabilities and lightweight performance through the two-stage shared convolution kernel operation. As a result, our model exhibits outstanding detection performance on the T-SMOKE dataset.
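Conceptually, the integration point is simple: each backbone stage output is refined by a Weld C-A style block before it enters the FPN neck and the VFNet heads. The wrapper below is a hedged illustration of that wiring; the class name and the factory argument are assumptions, and any attention block with a matching channel count could be plugged in.

```python
# A hedged sketch of refining multi-scale backbone features before the FPN neck.
import torch
import torch.nn as nn
from typing import Callable, Sequence


class RefinedBackbone(nn.Module):
    """Wraps a multi-scale backbone and refines each stage output with a block
    (e.g. a Weld C-A style module) before the features are passed to the neck."""

    def __init__(self, backbone: nn.Module, stage_channels: Sequence[int],
                 block_factory: Callable[[int], nn.Module]):
        super().__init__()
        self.backbone = backbone
        self.blocks = nn.ModuleList(block_factory(c) for c in stage_channels)

    def forward(self, images: torch.Tensor):
        feats = self.backbone(images)                 # e.g. (C2, C3, C4, C5)
        return tuple(blk(f) for blk, f in zip(self.blocks, feats))
```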

4 Experiment

Dataset. In the experiments, the T-SMOKE forest smoke and fire dataset mentioned in Sect. 3 was utilized. Our model underwent training on the training set and subsequent evaluation on both the validation and test sets to assess its performance.

Implementation Details. In the experiments, we use the ImageNet-pretrained ResNet-50 [20,21] as the backbone network for ablation. The FPN layer is included to extract multi-scale feature maps [22]. The parameter initialization follows the original settings from VarifocalNet. The backbone layer is initialized as ResNet-50 with Num Stages set to 4, Out Indices as (0, 1, 2, 3), Frozen Stages as 1, and the neck layer as FPN to extract multi-scale feature maps [23]. In Channels are set to [256, 512, 1024, 2048], and Out Channels are set to 256. Start Level is set to 1, and Num Outs is set to 5. Feat Channels in the Bbox Head are set to 256. For Loss Cls, α is set to 0.75 and β is set to 2.0, with Loss Weight set to 1.0. GIoULoss is used as the Loss Bbox, with Loss Weight set to 1.5. Additionally, we conduct fair comparison experiments with current state-of-the-art models. The default settings include Max Epochs of 100, evaluation on the validation set every 30 intervals, with the final evaluation after the 90th epoch. SGD [24] is chosen as the base optimizer with an initial learning rate of 0.01, a momentum of 0.9, and a weight decay of 0.0001. Furthermore, we compare the results with the Adam optimizer [25] in the ablation experiments. The experiments are evaluated on an NVIDIA RTX 3090 GPU.
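For readability, the reported settings can be summarized as a plain Python dictionary; this is only a paraphrase of the values listed above, and the key names are assumptions rather than the authors' actual configuration file.

```python
# A hedged paraphrase of the reported training settings (key names assumed).
wca_vfnet_settings = dict(
    backbone=dict(type="ResNet", depth=50, num_stages=4,
                  out_indices=(0, 1, 2, 3), frozen_stages=1),
    neck=dict(type="FPN", in_channels=[256, 512, 1024, 2048],
              out_channels=256, start_level=1, num_outs=5),
    bbox_head=dict(type="VFNetHead", feat_channels=256,
                   loss_cls=dict(alpha=0.75, beta=2.0, loss_weight=1.0),
                   loss_bbox=dict(type="GIoULoss", loss_weight=1.5)),
    optimizer=dict(type="SGD", lr=0.01, momentum=0.9, weight_decay=0.0001),
    max_epochs=100,
)
```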

4.1 Comparison with Advanced Models

In Table 1, we compare WCA-VFnet with several advanced models, namely CenterNet [26], YOLOv3 [27], Deformable DETR [28], VarifocalNet [23], YOLOX [29], on the T-SMOKE validation set. Our findings indicate that WCA-VFnet exhibits a slower convergence speed compared to the other models. Furthermore, it is observed that the other models generally display lower detection performance for small objects when compared to Deformable DETR. Figure 3

provides a visualization of the classification loss and bounding box loss of our proposed WCA-VFnet.

Table 1. WCA-VFnet was compared with state-of-the-art models on the T-SMOKE validation set.

Method | Epochs | bbox mAP | bbox mAP50 | bbox mAP75 | bbox mAPs | AR | Params | FLOPs | Training GPU hours | Inference FPS
CenterNet | 90 | 0.222 | 0.432 | 0.214 | 0.303 | 0.343 | 14.21M | 51.02G | 3.22 | 12
YOLOv3 | 90 | 0.244 | 0.495 | 0.230 | 0.286 | 0.353 | 61.53M | 193.87G | 1.42 | 28
Deformable DETR | 90 | 0.280 | 0.550 | 0.255 | 0.342 | 0.470 | 39.82M | 143.69G | 6.45 | 20
VarifocalNet | 90 | 0.417 | 0.632 | 0.411 | 0.467 | 0.582 | 32.49M | 189.02G | 2.42 | 24
VarifocalNet | 100 | 0.524 | 0.773 | 0.535 | 0.576 | 0.662 | 32.49M | 189.02G | 2.68 | 24
YOLOX | 90 | 0.490 | 0.874 | 0.466 | 0.535 | 0.595 | 99.00M | 140.76G | 0.80 | 19
YOLOv8 | 90 | 0.601 | 0.887 | 0.532 | 0.617 | 0.628 | 87.35M | 152.76G | 0.92 | 23
WCA-VFnet | 90 | 0.646 | 0.902 | 0.696 | 0.680 | 0.728 | 23.61M | 165.95G | 0.85 | 17

Fig. 3. WCA-VFnet’s classification loss and bbox loss on the T-SMOKE validation set.

Our proposed WCA-VFnet exhibits significantly lower FLOPs compared to VarifocalNet. This is achieved through the incorporation of our specially designed Weld C-A module, which utilizes shared 1 × 1 convolutions to minimize computational overhead associated with traditional convolutions. It is evident that WCA-VFnet achieves nearly 1.65 times faster speeds than Deformable DETR, while also showing noticeable improvements in AP values compared to other state-of-the-art models. This performance advantage can be attributed to the presence of Transformer Attention modules in Deformable DETR, which introduces some computational overhead compared to traditional convolutions. On the other hand, other advanced models rely on traditional convolutions as their basis, leading to suboptimal detection results for small objects.


4.2 Ablation Study on WCA-VFnet

Table 2 presents a comprehensive comparison of the ablative experiments conducted on our proposed WCA-VFnet. In terms of the model, we compare the baseline VarifocalNet with our proposed WCA-VFnet. The backbones used for investigation are ResNet-50 and ResNet-101. We conduct ablative experiments to assess the advantages of the Weld C-A module and verify its superiority. By modifying the optimizer, learning rate (LR), momentum, and weight decay, we compare the experimental results. WCA-VFnet achieves optimal detection performance under the configuration of ResNet-101 with Weld C-A, SGD optimizer, and LR = 0.01. Figure 4 illustrates the real-time detection performance on the test set of the T-SMOKE dataset. It is evident that the small target detection performance for forest smoke fire is excellent.

Table 2. Ablation study of WCA-VFnet under different backbones and optimizer settings on the T-SMOKE test set.

Method | Backbone | Optimizer | LR | Momentum | Weight decay | bbox mAP | bbox mAP50 | bbox mAP75 | bbox mAPs | AR
VFnet | ResNet-50 | SGD | 0.001 | 0.8 | 0.001 | 0.134 | 0.234 | 0.140 | 0.201 | 0.260
VFnet | ResNet-50 | SGD | 0.005 | 0.9 | 0.0001 | 0.162 | 0.254 | 0.179 | 0.231 | 0.295
WCA-VFnet | ResNet-50+WCA | AdamW | 0.01 | – | 0.0001 | 0.229 | 0.420 | 0.230 | 0.283 | 0.410
WCA-VFnet | ResNet-50+WCA | SGD | 0.01 | 0.9 | 0.0001 | 0.678 | 0.922 | 0.738 | 0.707 | 0.746
WCA-VFnet | ResNet-101+WCA | SGD | 0.05 | 0.9 | 0.0001 | 0.260 | 0.458 | 0.278 | 0.307 | 0.447
WCA-VFnet | ResNet-101+WCA | SGD | 0.0001 | 0.9 | 0.0001 | 0.334 | 0.559 | 0.336 | 0.400 | 0.537
WCA-VFnet | ResNet-101+WCA | SGD | 0.001 | 0.9 | 0.0001 | 0.724 | 0.951 | 0.796 | 0.749 | 0.779
WCA-VFnet | ResNet-101+WCA | SGD | 0.01 | 0.9 | 0.0001 | 0.786 | 0.977 | 0.875 | 0.803 | 0.826

Fig. 4. WCA-VFnet smoke and fire detection results on the T-SMOKE test set.

5 Conclusion

WCA-VFnet is an object detector specifically developed for the detection of smoke fires, showcasing remarkable performance in terms of efficiency, real-time


processing, and rapid training. It particularly excels in detecting smoke fires in forest environments, demonstrating superior capabilities in handling low-resolution fire images and detecting small smoke and flame targets. The core innovation of WCA-VFnet lies in the effective integration of Convolution and Self-Attention, enabling efficient fusion of fire-related features. Our research aims to make a valuable contribution to the prevention and mitigation of forest fires. While we have achieved the results described in this article, further complementary research is necessary due to the dataset's limited diversity and the lightweight nature of the model. Acknowledgements. This work is supported in part by the Xinjiang Natural Science Foundation of China (2021D01C078).

References 1. Yuan, C., Zhang, Y., Liu, Z.: A survey on technologies for automatic forest fire monitoring, detection, and fighting using unmanned aerial vehicles and remote sensing techniques. Can. J. For. Res. 45(7), 783–792 (2015) 2. Zhang, Q., Lin, G., Zhang, Y., et al.: Wildland forest fire smoke detection based on faster R-CNN using synthetic smoke images. Procedia Eng. 211, 441–446 (2018) 3. Tingting, S.: Fire detection model based on static characteristics and dynamic behavior. J. Saf. Sci. Technol. 17(1), 96–101 (2021) 4. Lin, G., Zhang, Y., Xu, G., et al.: Smoke detection on video sequences using 3D convolutional neural networks. Fire Technol. 55, 1827–1847 (2019) 5. Li, P., Zhang, J., Li, W.: Smoke detection method based on optical flow improvement and YOLOv3. J. Zhejiang Univ. Technol. 49, 9–15 (2021) 6. Park, M.J., Ko, B.C.: Two-step real-time night-time fire detection in an urban environment using Static ELASTIC-YOLOv3 and Temporal Fire-Tube. Sensors 20(8), 2202 (2020) 7. Lijuan, L.I.U., Songnan, C.: Real-time smoke detection model based on improved SSD. J. Xinyang Normal Univ. (Nat. Sci. Ed.) 33(2), 305–311 (2020) 8. Frizzi, S., Kaabi, R., Bouchouicha, M., et al.: Convolutional neural network for video fire and smoke detection. In: IECON 2016–42nd Annual Conference of the IEEE Industrial Electronics Society, pp. 877–882. IEEE (2016) 9. Chen, J.Z., Wang, Z.J., Chen, H.H., et al.: Dynamic smoke detection using cascaded convolutional neural network for surveillance videos. J. Univ. Electron. Sci. Technol. China 46(6), 992–996 (2016) 10. Luo, Y., Zhao, L., Liu, P., et al.: Fire smoke detection algorithm based on motion characteristic and convolutional neural networks. Multimedia Tools Appl. 77, 15075–15092 (2018) 11. Sun, X., Sun, L., Huang, Y.: Forest fire smoke recognition based on convolutional neural network. J. Forest. Res. 32(5), 1921–1927 (2021) 12. Ryu, J., Kwak, D.: Flame detection using appearance-based pre-processing and Convolutional Neural Network. Appl. Sci. 11(11), 5138 (2021) 13. Hu, C., Tang, P., Jin, W.D., et al.: Real-time fire detection based on deep convolutional long-recurrent networks and optical flow method. In: 2018 37th Chinese Control Conference (CCC), pp. 9061–9066. IEEE (2018)


14. Zhen-cun, J., Xiao-Jing, W.E.N., Zheng-xin, D., et al.: Research on fire detection of improved VGG16 image recognition based on deep learning. Fire Sci. Technol. 40(3), 375 (2021) 15. Yin, Z., Wan, B., Yuan, F., et al.: A deep normalization and convolutional neural network for image smoke detection. IEEE Access 5, 18429–18438 (2017) 16. Xu, Z., Guo, Y., Saleh, J.H.: Tackling small data challenges in visual fire detection: a deep convolutional generative adversarial network approach. IEEE Access 9, 3936–3946 (2020) 17. Wei, X., Wu, S., Wang, Y.: Forest fire smoke detection model based on deep convolution long short-term memory network. J. Comput. Appl. 39(10), 2883 (2019) 18. Qiang, X., Zhou, G., Chen, A., et al.: Forest fire smoke detection under complex backgrounds using TRPCA and TSVB. Int. J. Wildland Fire 30(5), 329–350 (2021) 19. He, L., Gong, X., Zhang, S., et al.: Efficient attention based deep fusion CNN for smoke detection in fog environment. Neurocomputing 434, 224–238 (2021) 20. Deng, J., Dong, W., Socher, R., et al.: ImageNet: a large-scale hierarchical image database. In: 2009 IEEE Conference on Computer Vision and Pattern Recognition, pp. 248–255. IEEE (2009) 21. He, K., Zhang, X., Ren, S., et al.: Deep residual learning for image recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 770–778 (2016) 22. Lin, T.Y., Doll´ ar, P., Girshick, R., et al.: Feature pyramid networks for object detection. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2117–2125 (2017) 23. Zhang, H., Wang, Y., Dayoub, F., et al.: VarifocalNet: an IoU-aware dense object detector. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 8514–8523 (2021) 24. Bottou, L., Curtis, F.E., Nocedal, J.: Optimization methods for large-scale machine learning. SIAM Rev. 60(2), 223–311 (2018) 25. Kingma, D.P.: A method for stochastic optimization. arXiv Preprint (2014) 26. Zhou, X., Wang, D., Kr¨ ahenb¨ uhl, P.: Objects as points. arXiv preprint arXiv:1904.07850 (2019) 27. Redmon, J., Farhadi, A.: YOLOv3: an incremental improvement. arXiv preprint arXiv:1804.02767 (2018) 28. Zhu, X., Su, W., Lu, L., et al.: Deformable DETR: deformable transformers for end-to-end object detection. arXiv preprint arXiv:2010.04159 (2020) 29. Ge, Z., Liu, S., Wang, F., et al.: YOLOX: exceeding YOLO series in 2021. arXiv preprint arXiv:2107.08430 (2021) 30. Lin, T.-Y., et al.: Microsoft COCO: common objects in context. In: Fleet, D., Pajdla, T., Schiele, B., Tuytelaars, T. (eds.) ECCV 2014. LNCS, vol. 8693, pp. 740–755. Springer, Cham (2014). https://doi.org/10.1007/978-3-319-10602-1 48 31. Zhao, H., Jia, J., Koltun, V.: Exploring self-attention for image recognition. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 10076–10085 (2020) 32. Liu, Z., Lin, Y., Cao, Y., et al.: Swin transformer: hierarchical vision transformer using shifted windows. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 10012–10022 (2021)

Label Selection Algorithm Based on Ant Colony Optimization and Reinforcement Learning for Multi-label Classification Yuchen Pan, Yulin Xue, Jun Li, and Jianhua Xu(B) School of Computer and Electronic Information, School of Artificial Intelligence, Nanjing Normal University, Nanjing 210023, Jiangsu, China {202243024,212243039}@stu.njnu.edu.cn, {lijuncst,xujianhua}@njnu.edu.cn

Abstract. Multi-label classification handles scenarios where an instance can be annotated with multiple non-exclusive but semantically related labels simultaneously. Despite significant progress, multi-label classification is still challenging due to the emergence of multiple applications leading to high-dimensional label spaces. Researchers have generalized feature dimensionality reduction techniques to label space by using label correlation information, and obtained two techniques: label embedding and label selection. There have been many successful algorithms in label embedding, but less attention has been paid to label selection. In this paper, we propose a label selection algorithm for multi-label classification: LS-AntRL, which combines ant colony optimization (ACO) and reinforcement learning (RL). This method helps ant colony algorithms search better in the search space by using temporal difference (TD) RL algorithm to learn directly from the experience of ants. For heuristic learning, we need to model the ACO problem as a RL problem, that is, to model label selection as a Markov decision process (MDP), where the label represents the state, and each ant selecting unvisited labels represents a set of actions. The state transition rules of the ACO algorithm constitute the transition function in the MDP, and the state value function is updated by TD formula to form a heuristic function in ACO. After performing label selection, we train a binary weighted neural network to recover low-dimensional label space back to the original label space. We apply the above model to five benchmark datasets with more than 100 labels. Experimental results show that our method achieves better classification performance than other advanced methods in terms of two performance evaluation metrics (Precision@n and DCG@n). Keywords: Multi-label learning · Label selection · Ant colony algorithm · Reinforcement learning · Temporal difference

Supported by the Natural Science Foundation of China (NSFC) under grants 62076134 and 62173186.

1 Introduction

Traditional supervised learning mainly deals with single label classification [19], where each instance has only one predefined label. However, many real-world classification issues involve multiple labels for many instances simultaneously [5,16], which induces multi-label classification in machine learning. Its practical applications come from image annotations with multiple labels [4] (for example, an image can be depicted by tree, sky, and grass), text classification [12] (a document involves multiple topics), and gene function prediction [20] (a gene is typically associated with multiple functions). Nowadays, besides high-dimensional feature space and large instance size, we often encounter high-dimensional problems from label space. When processing multi-label data, the number of labels can often reach hundreds of thousands [5]. This difficulty is similar to high-dimensional feature space, which means that more computational time and internal storage are required to train classifiers, and typically results in unsatisfactory prediction performance. Currently, the main methods to solve the high-dimensional problem of labels include label embedding and label selection [22], similar to feature extraction and feature selection. However, it is not possible to simply remove labels from multi-label sets as feature extraction and feature selection methods in feature space, since all labels must exist in the predictions provided by the final classifier. Therefore, any label dimension reduction methods must have the ability to recover or reconstruct the whole label set to maintain its original structure. Label embedding method maps the label space to a low dimensional latent space, and can get final predictions by recovering the latent label space to the original label space. Hsu et al. [10] presented the first label embedding algorithm ML-CS. This method requires a significant level of sparsity in the label space. PLST [9], similar to principal component analysis (PCA), perceives the label space of the multi-label classification problem in a geometric way, and uses the low dimensional linear subspace in the high-dimensional space to learn the key linear correlation between labels. Boolean matrix decomposition (BMD) [17] decomposes a binary matrix into two low-rank binary matrix boolean multiplication and could be applied to label embedding method. MLC-BMaD [21] applies BMD to construct a binary reduced label space. These latent labels identify and represent the dependencies between the original labels. It is worth noting that the label space dimensionality reduction method can be improved by introducing feature information. In CPLST [6], it combines PLST and canonical correlation analysis [24], while utilizing feature information and label parts. LEML [23] models the multi-label classification problem as a empirical risk minimization problem with low-rank constraints, which not only generalizes the label and feature dimensionality reduction, but also brings the ability to support various loss function, and allows strict generalization error analysis. MoRE [15] learns low-rank projections in label space by analyzing the residual structure between features and label space. Although label embedding method can save computational costs, make the embedding space smaller and easier to handle. However, there are still some limitations. Their methods may


lead to information loss and label correlation being implicitly encoded, resulting in a lack of interpretability. Label selection is another label dimensionality reduction method. Its basic idea is to select a subset of labels rich in information from the original labels, and the complete label space can be recovered from the selected labels. MOPLMS [1] select a subset of labels based on structural sparse optimization and singular value decomposition, and then use Bayesian criteria to perform label recovery. In ML-CSSP [3], the label selection task is treated as a column subset selection problem. According to a random sampling process, k representative labels are accurately selected, and then k classifiers are learned for these selected labels. MLC-EBMD [14] obtains two low-rank matrices through precise BMD, which preserves the meaning of the lower dimensional label space. By simply using the recovery matrix, the original label space can be obtained. In LS-BaBID [11], after performing EBMD, a sequential backward selection (SBS) strategy widely used in feature selection is applied to remove some labels with less information one by one to detect fixed size column subsets. The above label selection methods are all unsupervised methods. It is worth noting that some researchers have introduced feature information to improve the performance of label selection methods. Elham [2] proposed a submodule maximization framework with linear costs to find informative labels that are most relevant to other labels but are least redundant to each other. Its framework includes label-label and label-feature dependencies, aiming to find labels with the greatest representation and prediction capabilities. SPL-MLL [13] is different from the two-step methods. It integrates label selection, label prediction, and label recovery into a unified framework, uses a neural network to achieve feature embedding, and puts the neural network into the final unified optimization objective function. In this paper, we propose a two-step manner method to select the most informative label subset. This method use ant colony optimization and reinforcement learning to select the most important label subset. Once the label subset is selected, we have to design a binary weighted neural network to recover the original label space from the predicted labels. The experiments illustrate that our proposed method is more effective according to two metrics (Precision@n and DCG@n, n = 1, 3 and 5), compared with four state-of-the-art techniques (LS-BaBID [11], MLC-EBMD [14], MLC-BMaD [21] and ML-CSSP [3]) on five datasets with more than 100 labels. Summarily, there are three-fold contributions in this study: (a) We propose a novel label selection algorithm based on ant colony optimization and reinforcement learning. (b) To the best of our knowledge, our method is the first label selection approach to apply reinforcement learning in multi-label classification. (c) Our proposed method is validated on five benchmark data sets with 100+ labels, compared with four state-of-the-art methods. This paper is organized as follows. Section 2 provides some basic concepts for this paper. In Sect. 3, we detail our label selection and label recovery methods. Section 4 validates our method. Finally, the paper draws some conclusions.

2 Basic Concepts

Here we give the description of notations used throughout this paper. Given training data in the form of instance-label pairs {x_i, y_i}_{i=1}^{N} where, for the i-th instance, x_i ∈ R^D and y_i ∈ {0, 1}^C denote the feature vector and label vector, N is the number of instances, and D and C are the dimensionality of the feature and label space, respectively. Accordingly, the feature matrix X and the label matrix Y can be represented as

\[ \mathbf{X} = [\mathbf{x}_1, \ldots, \mathbf{x}_i, \ldots, \mathbf{x}_N]^T \in \mathbb{R}^{N \times D}, \quad \mathbf{Y} = [\mathbf{y}_1, \ldots, \mathbf{y}_i, \ldots, \mathbf{y}_N]^T = [\mathbf{y}_1, \ldots, \mathbf{y}_j, \ldots, \mathbf{y}_C] \in \{0,1\}^{N \times C} \quad (1) \]

where the column vector y_j indicates the j-th label in Y. Our goal is to find the label subset Y_s, which is depicted as a low-dimensional label subset matrix:

\[ \mathbf{Y}_s = [\tilde{\mathbf{y}}_1, \ldots, \tilde{\mathbf{y}}_i, \ldots, \tilde{\mathbf{y}}_N]^T = [\tilde{\mathbf{y}}_1, \ldots, \tilde{\mathbf{y}}_j, \ldots, \tilde{\mathbf{y}}_c] \in \{0,1\}^{N \times c} \quad (2) \]

where c is the number of selected labels (c < C). The ant colony optimization algorithm is used to find a good balance among different factors in order to limit the search space when deriving the final solution. It borrows from the foraging strategies that ants use to find food sources, and is a swarm-based meta-heuristic tool [8]. The main objective of reinforcement learning is to maximize the reward in a complex and changeable environment. In the process of reinforcement learning, agents constantly interact with the environment to obtain the state of the environment, and then feed their decisions back into the environment [18]. Generally speaking, reinforcement learning problems are expressed as Markov decision processes. An MDP consists of four parts: a set of states (S), a set of actions (A), a state transition function (T), and a reward function (R).

3 The Proposed Method

In this section, based on existing research on label selection, we propose a novel ant colony optimization (ACO) with reinforcement learning (RL) for label selection method (LS-AntRL), which includes two key phases: to select a label subset which contains the most information, and to design a binary weighted neural network for recovering complete label space. We will discuss the ACO properties and how to model the label selection search space into a Markov Decision Process (MDP). The required properties of an MDP can be satisfied as follows: the labels represent the states (S), and selecting the unvisited labels by each ant represents a set of actions (A). The state transition rule of the ant colony optimization forms the transition function (T ). The reward function (R) is composed of two criteria. Next, we will provide a detailed description of the algorithm proposed in this paper.

3.1 Label Selection Algorithm Based on Ant Colony Optimization and Reinforcement Learning

In general, artificial ants in an ant colony algorithm traverse a completely connected undirected graph, where graph nodes are labels, and edges represent connections between labels. Each ant deposits pheromone values on the nodes (labels) of the graph and forms a d-dimensional vector τ called the pheromone trail. We initialize the pheromone vector by calculating the maximum mutual information (MI) value between each label and the features:

\[ \tau_i = \max\big(MI(Y_i, \mathbf{X})\big), \quad \forall i = 1, \ldots, C. \quad (3) \]
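A minimal sketch of this initialization, assuming scikit-learn's mutual information estimator as a stand-in for the MI computation, is given below; the function name is illustrative.

```python
# Hedged sketch of Eq. (3): each label's pheromone starts at its maximum mutual
# information with any single feature.
import numpy as np
from sklearn.feature_selection import mutual_info_classif

def init_pheromones(X: np.ndarray, Y: np.ndarray) -> np.ndarray:
    """X: (N, D) feature matrix, Y: (N, C) binary label matrix -> tau of shape (C,)."""
    C = Y.shape[1]
    tau = np.empty(C)
    for i in range(C):
        # MI between every feature column and the i-th label; keep the largest value.
        tau[i] = mutual_info_classif(X, Y[:, i], random_state=0).max()
    return tau
```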

Initially, ants are placed randomly on the labels, and each ant creates a solution utilizing greedy algorithms and probabilistic state transition rules while visiting different labels in the search space. At each stage, ant k visits the next label i by using either the probabilistic or the greedy action selection rule, as follows:

\[ P_i^k(t) = \begin{cases} \dfrac{[\tau_i(t)]\,[V_i(t)]^{\beta}}{\sum_{u \in N^k} [\tau_u(t)]\,[V_u(t)]^{\beta}}, & \forall i \in N^k, \ \text{if } q > q_0 \\[4pt] 0, & \text{otherwise} \end{cases} \qquad i = \arg\max_{u \in N^k}\{[\tau_u][V_u]^{\beta}\}, \ \text{if } q \le q_0 \quad (4) \]

where τ_i is the pheromone value of label i, V_i is the state-value function associated with label i (described in detail in the next section), and N^k is the set of unseen labels accessible to ant k. The β ∈ [0, 1] balances the importance between pheromone values and heuristic information. The parameter q is a random variable in the range [0, 1], and q_0 is a constant in the range [0, 1]. The two parameters q and q_0 indicate the relative importance of exploitation and exploration. We expect ants to explore more in the initial iterations and to exploit more at the end of the iterations. Coefficient q is set by (5), which controls the relative importance between exploration and exploitation through the number of iterations, in the following form:

\[ q = \left(1 - \frac{t}{SL}\right)^{speed} \quad (5) \]

where speed controls the decay speed, SL is the number of labels each ant should visit in each iteration, and parameter t is the current iteration number. The heuristic information used by the ACO algorithm usually does not change during the search process. Here, we introduce RL to try to learn the heuristic from the experience of all ants in each iteration. As mentioned earlier, the ACO search process is modeled as an MDP. Many techniques can be used to solve an MDP. In this paper, the TD(0) technique is used, which only estimates the value of the state one step ahead:

\[ V(s_t) = V(s_t) + \alpha\big[r_{t+1} + \gamma V(s_{t+1}) - V(s_t)\big] \quad (6) \]


where α ∈ [0, 1] is the learning rate, specifying how much error to accept in each step, and γ ∈ [0, 1] is the discount factor, which specifies the degree of influence of the next state. We initialize the state-value vector V, whose value is the maximum cosine similarity between each label and the features, namely:

\[ V_i = \max_{j}(\phi_{i,j}), \quad \forall i = 1, \ldots, C \quad (7) \]

φ is the reward function described below. The reward function consists of two metrics. The first function calculates the cosine similarity between a label vector y and a feature vector x, and is represented as φ:

\[ \phi(\mathbf{y}, \mathbf{x}) = \frac{|\mathbf{y}^{T}\mathbf{x}|}{\|\mathbf{y}\|_2 \, \|\mathbf{x}\|_2}. \quad (8) \]

The second function calculates the redundancy between labels using the Pearson coefficient, and is indicated by ϕ:

\[ \varphi(\bar{\mathbf{y}}_i, \bar{\mathbf{y}}_j) = \frac{|\bar{\mathbf{y}}_i^{T}\bar{\mathbf{y}}_j|}{\|\bar{\mathbf{y}}_i\|_2 \, \|\bar{\mathbf{y}}_j\|_2} \quad (9) \]

in which ȳ_i and ȳ_j are two centered label vectors. According to (6), when the agent goes from state s_t to s_{t+1} by taking action a, the reward r_{t+1} is calculated as follows:

\[ r_{t+1} = \frac{\max\big(\phi_{s_{t+1}, D}\big)}{1 + \varphi_{s_t, s_{t+1}}} \quad (10) \]

where φ_{s_{t+1},D} is the label-feature correlation values of the next label s_{t+1} and all its corresponding features, and ϕ_{s_t,s_{t+1}} is the label redundancy between the current label s_t and the next label s_{t+1}. In every iteration, the ants update the state-value vector V locally as they build the solution. Then, at the end of each iteration, the V vector of each ant k is averaged to update the state-value vector V_{s_t} globally:

\[ V_{s_t} = \frac{1}{n_{ant}} \sum_{k=1}^{n_{ant}} V_{s_t}^{k} \quad (11) \]

where n_{ant} is the count of ants that construct solutions. We define a d-dimensional vector called LC whose entry for a label is incremented by one each time an ant visits that label. After the ants have fully completed their solutions, the following global pheromone update rule is applied to update the pheromone value of each label i:

\[ \tau_i(t+1) = (1-\rho)\,\tau_i(t) + \frac{LC(i)}{\sum_{j=1}^{d} LC(j)} \quad (12) \]


Algorithm 1. Label subset selection algorithm
Input: Feature matrix X, label matrix Y.
Output: Pheromone values for each label.
Process:
1: Calculate φ and ϕ according to (8) and (9)
2: Initialize pheromones τ_i and state-value vector V_i according to (3) and (7)
3: for m = 1 to nCycle do
4:   Initialize LC for each label
5:   Generate ants and place them randomly on the labels
6:   for i = 1 to nAnt do
7:     Generate an integer between C/8 and C/6 and assign it to SL
8:     for j = 1 to SL do
9:       a = Select the next label according to the state transition rule (4)
10:      Take action a; observe the reward r and the next state s_{t+1}
11:      Each ant locally updates the state-value vector V by (6)
12:      Increment the LC entry for the visited label y_i
13:    end for
14:   end for
15:  Update the state-value vector V globally by applying (11)
16:  Update the pheromone vector according to (12)
17: end for
18: Sort labels in descending order according to their pheromone values

Fig. 1. The architecture of label reconstruction binary neural network.

where ρ is the pheromone decay rate. Labels with a large amount of pheromone are informative labels, so the top c (c < C) labels are selected according to their pheromone values to determine the optimal subset of labels. Please refer to Algorithm 1 for a sketch of our approach.
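To summarize how the pieces of Eqs. (3)–(12) fit together, the following is a condensed, hedged sketch of the search loop in Algorithm 1. It assumes the φ and ϕ matrices of Eqs. (8)–(9) are precomputed, treats q_0 as a fixed constant (its value is not reported), and uses the hyperparameter values listed in Sect. 4.2 as defaults; all names are illustrative rather than taken from the authors' code.

```python
# Hedged sketch of the LS-AntRL search loop (Algorithm 1).
import numpy as np

def select_labels(phi, varphi, tau, V, n_cycles=40, n_ants=10, q0=0.5,
                  alpha=0.5, gamma=0.8, beta=1.0, rho=0.2, speed=1.0, rng=None):
    """phi: (C, D) label-feature cosine similarities (Eq. 8), varphi: (C, C) label
    redundancies (Eq. 9), tau: (C,) initial pheromones (Eq. 3), V: (C,) initial
    state values (Eq. 7). Returns the updated pheromone vector."""
    rng = np.random.default_rng(rng)
    C = tau.shape[0]
    for _ in range(n_cycles):
        LC = np.zeros(C)                                   # per-label visit counter
        V_ants = []
        for _ in range(n_ants):
            V_k = V.copy()
            SL = int(rng.integers(C // 8, max(C // 6, C // 8 + 1)))  # tour length in [C/8, C/6]
            state = int(rng.integers(C))                   # random starting label
            visited = {state}
            LC[state] += 1
            for t in range(1, SL + 1):
                q = (1.0 - t / SL) ** speed                # schedule of Eq. (5)
                cand = np.array([u for u in range(C) if u not in visited])
                score = tau[cand] * V_k[cand] ** beta + 1e-12
                if q > q0:                                 # probabilistic rule of Eq. (4)
                    nxt = int(rng.choice(cand, p=score / score.sum()))
                else:                                      # greedy rule of Eq. (4)
                    nxt = int(cand[np.argmax(score)])
                reward = phi[nxt].max() / (1.0 + varphi[state, nxt])            # Eq. (10)
                V_k[state] += alpha * (reward + gamma * V_k[nxt] - V_k[state])  # TD(0), Eq. (6)
                visited.add(nxt)
                LC[nxt] += 1
                state = nxt
            V_ants.append(V_k)
        V = np.mean(V_ants, axis=0)                        # global averaging, Eq. (11)
        tau = (1.0 - rho) * tau + LC / LC.sum()            # pheromone update, Eq. (12)
    return tau                                             # rank labels by tau, keep the top c
```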

3.2 Label Recovery Operator Based on Binary Neural Network

One of the important steps of our proposed framework is to recover the original space by propagating the predicted value of the selected subset of labels to the entire label set. Therefore, our goal is to find a neural network to obtain the complete predictive label space. This paper attempts to binarize real weights used in forward propagation and backward propagation calculations in neural


networks [7], so as to change these multiplications into addition and subtraction operations. Figure 1 shows the architecture of our network. The binarization operation converts the real-valued weights into two possible values; a deterministic binarization operation is based on the sign function:

\[ w_b = \begin{cases} +1, & \text{if } w \ge 0 \\ -1, & \text{otherwise} \end{cases} \quad (13) \]

where w_b is the binary weight and w is the real weight. We define a binary weighted neural network f(·; Θ) used for label recovery (parameterized by Θ), and our loss function is defined as follows:

\[ Q(\Theta) = \big\| f(\mathbf{Y}_s; \Theta) - \mathbf{Y} \big\|_F^2. \quad (14) \]

Please refer to [7] to see the details of neural network parameter updates. Once we obtain the trained binary weighted neural network, we can easily obtain a complete prediction label space.
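As a rough illustration of such a recovery network, the sketch below follows the BinaryConnect-style recipe of [7]: the forward pass uses sign-binarized weights while gradients update the underlying real-valued weights (a straight-through estimator). The layer sizes and class names are assumptions, not the authors' architecture.

```python
# Hedged PyTorch sketch of a binary-weighted label recovery network.
import torch
import torch.nn as nn


class BinaryLinear(nn.Linear):
    def forward(self, x):
        w_b = torch.sign(self.weight)
        # Straight-through estimator: binary weights in the forward pass,
        # gradients flow to the real-valued weights.
        w = self.weight + (w_b - self.weight).detach()
        return nn.functional.linear(x, w, self.bias)


class LabelRecoveryNet(nn.Module):
    def __init__(self, c_selected: int, c_full: int, hidden: int = 256):
        super().__init__()
        self.net = nn.Sequential(BinaryLinear(c_selected, hidden), nn.ReLU(),
                                 BinaryLinear(hidden, c_full))

    def forward(self, y_s):              # y_s: predictions on the selected labels
        return self.net(y_s)             # scores for all C original labels


def recovery_loss(model, Ys, Y):
    """Squared Frobenius reconstruction error, corresponding to Eq. (14)."""
    return ((model(Ys) - Y) ** 2).sum()
```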

4 Experiments

In this section, we evaluate the proposed method against four existing methods: MLC-BMaD [21], ML-CSSP [3], MLC-EBMD [14] and LS-BaBID [11] on five benchmark multi-label datasets.

Benchmark Data Sets and Evaluation Metrics

For evaluating the performance of our algorithm, we choose five benchmark multi-label datasets, including Mediamill, Bibtex, CAL500, Chess and Corel5k from 1 . Table 1 summarizes the detailed statistics of the datasets. Note that the number of labels in all datasets is more than 100. We evaluated all compared algorithms using two metrics that fit the high-dimensional label applications. Precision@n, and (DisCounted Gain) DCG@n (n = 1, 2, 3, ...) are utilized in our experiments. Their detailed definitions are:  yi Precision@n := C1 y)  l∈rankny(ˆ (15) i DCG@n := log(i+1) l∈rankn (y)

where rankn (y) returns the top n label indexes of y. Finally, their average values are calculated via averaging them over all testing instances. Additionally, the higher these two metric values are, the better the label selection techniques perform.

1

https://mulan.sourceforge.net/datasets-mlc.html.

Label Selection Algorithm Based on ACO and RL

517

Table 1. Dataset statistics. Dataset

Domain #Train #Test #Feature #Label #Cardinality

Mediamill Video

30993

4.376

Text

4880

2515

1836

159

2.402

CAL500

Music

300

202

68

174

26.044

Chess

Text

1508

168

585

277

2.418

Corel5k

Image

4500

500

499

374

3.522

0.69

0.72 0.68

0.63 0.6 0.57

0.6

0.54

5 10 15 20 25 30 35 40 45 50 Subset Size(%)

(i)n=1

0.52

0.8

0.5

0.76

0.48

DCG@n

0.84

0.72 0.68

0.53

0.66

0.64

0.51 0.49 0.47 0.45 0.43 5 10 15 20 25 30 35 40 45 50 Subset Size(%)

5 10 15 20 25 30 35 40 45 50 Subset Size(%)

(ii)n=3

(iii)n=5 0.36

0.46 0.44 0.42

0.64

(c)n=5

0.55 Precision@n

Ours LS-BaBID MLC-EBMD ML-CSSP MLC-BMaD

Precision@n

0.8 0.76

(b)n=3

DCG@n

(a)n=1

Precision@n

101

Bibtex

0.84

DCG@n

12914 120

0.34 0.32 0.3

0.4

0.6 5 10 15 20 25 30 35 40 45 50 Subset Size(%)

5 10 15 20 25 30 35 40 45 50 Subset Size(%)

5 10 15 20 25 30 35 40 45 50 Subset Size(%)

Fig. 2. Two metrics (at n = 1, 3 and 5) from five methods on Mediamill.

4.2

Experimental Settings

In our experiments, random forest with 100 trees is considered as our baseline classifier. We reduce label dimensionality with the step 5% from 5% to 50% of the number original labels (i.e., C) to verify and compare how the number of reduced labels (i.e., c) would affect classification performance after label reduction (i.e., c/C). For MLC-BMaD, its threshold for constructing label association matrix is tuned to be 0.7. For two metrics (15), we set n = 1, 3, and 5. For our method, learning rate α and discount rate γ of TD (0) are set to 0.5 and 0.8. Pheromone decay rate ρ is set to 0.2. The number of iteration that the algorithm should repeat is set to 40 and the number of ants that search the features space is 10. The range of labels traversed by each ant SL is limited to [ C8 , C6 ]. For the speed of decay in (5), we set speed = 1. The trade off between pheromone and heuristic information β is tuned to be 1. 4.3

Results and Discussion

Figures 2, 3, 4, 5 and 6 show the experimental results on five datasets, Mediamill, Bibtex, CAL500, Chess and Corel5k. From these five figures, the proposed approach in most of label proportions works best compared with four other methods.

Y. Pan et al. (a)n=1

(b)n=3

0.42 0.32

Ours LS-BaBID MLC-EBMD ML-CSSP MLC-BMaD

0.22

0.62

0.3 0.25 0.2 0.15 0.1

5 10 15 20 25 30 35 40 45 50 Subset Size(%)

(i)n=1

0.3

0.21 0.18 0.15 0.12 0.09

5 10 15 20 25 30 35 40 45 50 Subset Size(%)

5 10 15 20 25 30 35 40 45 50 Subset Size(%)

(ii)n=3

(iii)n=5 0.19

0.26 DCG@n

0.52 DCG@n

0.24 Precision@n

0.52

0.12

(c)n=5

0.35

0.42 0.32

0.22

DCG@n

Precision@n

0.62

Precision@n

518

0.18 0.14

0.22

0.1

0.12

0.06 5 10 15 20 25 30 35 40 45 50 Subset Size(%)

0.15 0.11 0.07

5 10 15 20 25 30 35 40 45 50 Subset Size(%)

5 10 15 20 25 30 35 40 45 50 Subset Size(%)

Fig. 3. Two metrics (at n = 1, 3 and 5) from five methods on Bibtex. (a)n=1

(b)n=3

Ours LS-BaBID MLC-EBMD ML-CSSP MLC-BMaD

0.68 0.6 0.52 0.44

0.68

0.7 0.66 0.62 0.58 0.5

5 10 15 20 25 30 35 40 45 50 Subset Size(%)

(i)n=1

DCG@n

0.76 0.68 0.6 0.52 0.44 0.36 5 10 15 20 25 30 35 40 45 50 Subset Size(%)

0.58 0.55 0.52 0.49 0.46 0.43 0.4 0.37 0.34

0.65 0.62 0.59 0.56

0.54

0.84 DCG@n

0.71

0.74

0.53

5 10 15 20 25 30 35 40 45 50 Subset Size(%)

(ii)n=3

0.45

5 10 15 20 25 30 35 40 45 50 Subset Size(%)

(iii)n=5

0.42 DCG@n

0.36

(c)n=5

0.78 Precision@n

0.76

Precision@n

Precision@n

0.84

0.39 0.36 0.33 0.3

5 10 15 20 25 30 35 40 45 50 Subset Size(%)

5 10 15 20 25 30 35 40 45 50 Subset Size(%)

Fig. 4. Two metrics (at n = 1, 3 and 5) from five methods on CAL500.

Specifically, the performance of ML-CSSP and MLC-BMaD is unstable. Overall, MLC-EBMD perform better than MLC-CSSP and ML-BMaD. Both LS-BaBID achieved better results than MLC-EBMD, mainly due to the fact that LS-BaBID not only achieved precise approximation by removing some uninformative labels, but also applied a sequential backward selection (SBS) strategy to remove some less informative labels one by one. At most of label proportions, our method works best, compared with four existing methods, whose main reason is that we use heuristic learning to significantly improve the quality of the selected labels. In order to compare the five methods more accurately, we use the “win” index to represent the number of times each technique achieved the best metric value. As shown in Table 2, our LS-AntRL reaches the best value of 293 times for two metrics over five datasets while the other four methods achieved a total of 7 wins.

Label Selection Algorithm Based on ACO and RL (b)n=3

0.41

0.28

0.34 0.27 Ours LS-BaBID MLC-EBMD ML-CSSP MLC-BMaD

0.2 0.13 0.06

(c)n=5 0.22 Precision@n

0.32 Precision@n

Precision@n

(a)n=1 0.48

0.24 0.2 0.16

5 10 15 20 25 30 35 40 45 50 Subset Size(%)

0.14

0.06

5 10 15 20 25 30 35 40 45 50 Subset Size(%)

(i)n=1

5 10 15 20 25 30 35 40 45 50 Subset Size(%)

(ii)n=3

(iii)n=5

0.48

0.16

DCG@n

0.34 0.27 0.2

DCG@n

0.22

0.41 DCG@n

0.18

0.1

0.12 0.08

0.18 0.14

0.06

0.06 5 10 15 20 25 30 35 40 45 50 Subset Size(%)

0.13 0.1 0.07

0.1

0.13

519

0.04 5 10 15 20 25 30 35 40 45 50 Subset Size(%)

5 10 15 20 25 30 35 40 45 50 Subset Size(%)

Fig. 5. Two metrics (at n = 1, 3 and 5) from five methods on Chess. (b)n=3

0.2

0.22

Precision@n

Ours LS-BaBID MLC-EBMD ML-CSSP MLC-BMaD

0.19 0.16 0.13

0.14 0.11

0.05

5 10 15 20 25 30 35 40 45 50 Subset Size(%)

(i)n=1

5 10 15 20 25 30 35 40 45 50 Subset Size(%)

(ii)n=3

(iii)n=5

0.19

0.14

0.17

0.12 DCG@n

0.34 0.31 0.28 0.25 0.22 0.19 0.16 0.13 0.1

DCG@n

DCG@n

0.17

0.08

0.1

5 10 15 20 25 30 35 40 45 50 Subset Size(%)

(c)n=5

0.23

0.25 Precision@n

Precision@n

(a)n=1 0.34 0.31 0.28 0.25 0.22 0.19 0.16 0.13 0.1

0.15 0.13 0.11

0.06

0.09 0.07 5 10 15 20 25 30 35 40 45 50 Subset Size(%)

0.1 0.08

0.04 5 10 15 20 25 30 35 40 45 50 Subset Size(%)

5 10 15 20 25 30 35 40 45 50 Subset Size(%)

Fig. 6. Two metrics (at n = 1, 3 and 5) from five methods on Corel5k.

Table 2. The number of wins for each method and metric across five datasets.

Metric        LS-BaBID  ML-CSSP  MLC-BMaD  MLC-EBMD  LS-AntRL (ours)
Precision@1   1         0        0         0         59
Precision@3   2         0        0         0         58
Precision@5   0         0        0         0         60
DCG@1         1         0        0         0         59
DCG@3         1         1        0         0         58
DCG@5         0         1        0         0         59
Total wins    5         2        0         0         293


Based on the above experimental analysis, it can be concluded that our proposed method performs best compared to the four existing methods.

5 Conclusion

In this paper, we propose a new label selection method that benefits from heuristic learning. The method uses the TD(0) algorithm to learn heuristic functions from the experience of the ants, and combines the ant colony algorithm with heuristic learning. To enable heuristic learning, the ant colony optimization problem must be modeled as a reinforcement learning problem. We therefore cast the label selection search space as a Markov decision process, in which each label represents a state (S) and an ant selecting an unvisited label constitutes an action (A). We then use a binary weighted neural network to recover the low-dimensional labels back to the original high-dimensional label space. Experimental results comparing five different label dimension reduction methods show that the proposed method is statistically superior to the other methods. In future work, we will conduct more comparative experiments to further demonstrate the effectiveness of the proposed method and apply more sophisticated reinforcement learning techniques to improve it.
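To make the interaction between the TD(0) update and the ant colony search concrete, the following minimal sketch shows one possible way the learned heuristic could guide label selection. It is an illustration only, not the authors' implementation: the reward signal R, the preference rule combining pheromone and the learned value, and all function and parameter names are assumptions.

```python
import numpy as np

def td0_aco_label_selection(R, n_labels, k, n_ants=20, n_iters=50,
                            alpha=0.1, gamma=0.9, rho=0.1, beta=1.0):
    """Illustrative sketch: select k labels with ACO guided by a TD(0)-learned
    heuristic V over label 'states'. R[i] is a task-specific reward for visiting
    label i (hypothetical stand-in for the true subset evaluation)."""
    V = np.zeros(n_labels)            # heuristic value learned by TD(0)
    tau = np.ones(n_labels)           # pheromone per label
    best_subset, best_score = None, -np.inf
    for _ in range(n_iters):
        for _ in range(n_ants):
            visited = []
            for _ in range(k):
                mask = np.ones(n_labels, dtype=bool)
                mask[visited] = False
                # action preference combines pheromone and the learned heuristic
                pref = tau[mask] * np.exp(beta * V[mask])
                probs = pref / pref.sum()
                nxt = np.flatnonzero(mask)[np.random.choice(mask.sum(), p=probs)]
                if visited:
                    s = visited[-1]   # TD(0): V(s) <- V(s) + a*(r + g*V(s') - V(s))
                    V[s] += alpha * (R[nxt] + gamma * V[nxt] - V[s])
                visited.append(nxt)
            score = R[visited].sum()  # stand-in for the true subset quality measure
            if score > best_score:
                best_subset, best_score = list(visited), score
        tau *= (1 - rho)              # pheromone evaporation
        tau[best_subset] += 1.0       # reinforce the best subset found so far
    return best_subset, V
```

In this sketch each ant's walk over labels supplies the (state, next state, reward) transitions that TD(0) uses to refine the heuristic, which in turn biases the next ants' selections.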


Reversible Data Hiding Based on Adaptive Embedding with Local Complexity

Chao Wang, Yicheng Zou, Yaling Zhang, Ju Zhang, Jichuan Chen, Bin Yang, and Yu Zhang(B)

College of Computer and Information Science, Southwest University, Chongqing 400715, China
[email protected] http://www.swu.edu.cn/

Abstract. In recent years, most reversible data hiding (RDH) algorithms have considered the impact of texture information on embedding performance. The distortion caused by embedding secret data in an image's smooth region is much less than in its non-smooth region, because embedding in the smooth region corresponds to fewer invalid shifting pixels (ISPs) in histogram shifting. However, though effective, the local complexity is not calculated precisely enough, which results in inaccurate texture division and does not considerably reduce distortion. Therefore, a new RDH scheme based on adaptive embedding with local complexity (AELC) is proposed to improve the embedding performance effectively. Specifically, the cover image is divided into two subsets by a checkerboard pattern. Then the local complexity of each pixel is computed from the correlation between adjacent pixels (CBAP). Finally, secret data are adaptively and preferentially embedded into the regions with lower local complexity in each subset. Experimental results show that the proposed algorithm performs best regarding invalid shifting pixels, maximum embedding capacity (EC), and peak signal-to-noise ratio (PSNR) compared to some state-of-the-art RDH methods.

Keywords: Reversible data hiding · Invalid shifting pixels · Local complexity · Histogram shifting

1 Introduction

With the rapid development of the internet, digital communication has become one of the most important ways for people to share and obtain information. At the same time, the issue of protecting private information security has garnered attention. Therefore, scholars have proposed several different information hiding techniques [13] to ensure the security of private information, such as digital watermarking [12] and steganography [14]. However, these techniques introduce distortion to the cover image, which cannot be recovered losslessly. Even minor distortions are unacceptable in the military, medical and judicial fields, where

the accuracy of the carrier images is crucial. Nevertheless, RDH [15] can achieve error-free secret data extraction and lossless restoration of the original cover image. RDH can be classified into two categories based on whether the cover image is encrypted: RDH in the encrypted domain [5,11,25], and RDH in plaintext domain [3,4,16]. In the past two decades, RDH methods have been extensively studied in the plaintext domain. The lossless compression algorithm [24] is the earliest method of RDH. Afterwards, Tian [19] proposed the difference expansion (DE) technique, which expands the difference between a pair of pixels to embed secret data reversibly. Compared to RDH based on lossless compression, Tian’s method provides higher EC and improves PSNR. Ni et al. [9] proposed the histogram shifting (HS) method in 2006, which embeds secret data into the peak point pixels of the image gray histogram and shifts others. However, the embedding capacity of this method is relatively lower. As the combination of HS and DE, prediction error expansion (PEE) was proposed by Thodi and Rodriguez in [17,18]. PEE utilizes the median edge detection (MED) predictor [21] to calculate the prediction error value of pixels. These values are then used to generate the prediction error histogram (PEH) and achieve reversible embedding through histogram shifting. PEE utilizes the local correlation between multiple adjacent pixels. Thus a better performance can be expected. Li et al. [8] improved the conventional PEE scheme by introducing complexity measurement which sorts pixels based on the correlation between the target pixel and its adjacent pixels. Weng et al. [22] effectively distinguished different texture regions of images by two well-designed features. Fan et al. [2] proposed an average-difference-based complexity computation to cooperate with flexible patch moving to improve the embedding performance further. Peng et al. proposed an improved PVO-based method in [10], which enables more smoothing blocks to be used for embedding secret data. Wang et al. [20] designed a dynamic blocking strategy to preferentially divide the smooth region into smaller blocks to retain high EC, whereas non-smooth areas are divided into larger blocks to avoid decreasing PSNR. With the above analysis, the impact of image texture information on embedding performance has become the focus of researchers’ attention. Jia et al. [6] showed that the image distortion generated by embedding secret data based on HS could be split into two segments: the distortion caused by both valid shifting pixels (VSPs) and invalid shifting pixels (ISPs), which shown in Fig. 1. For given secret data, the distortion caused by VSPs is a fixed value. ISPs cause the real reason for image distortion. Based on this, they preferentially select smooth pixels to carry the secret data, thereby reducing the ISPs. However, Jia et al. [6] used only four adjacent pixels to calculate the target pixel’s local complexity and the fluctuation value. Chu et al. [1] used the sum of the differences between four adjacent pixels to calculate the fluctuation values. After our investigation and analysis of existing literature, We found that most scholars did not sufficiently consider the CBAP, which resulted in inaccurate measurement of image texture and insignificant reduction of ISPs, thus not


Fig. 1. An instance of division of VSPs and ISPs

substantially reducing the image distortion. This paper proposed a new RDH scheme based on adaptive embedding with local complexity to solve the above problems. Specifically, first, the cover image is divided into two subsets by the checkerboard pattern. Then we designed an effective local complexity calculation scheme to improve the local complexity’s accuracy. Finally, we designed a predictor to improve the EC. Moreover, combined with the local complexity of pixels, the secret data are preferentially embedded in pixels with small local complexity to reduce distortion. The main contributions of this paper are summarized as follows. (1) A novel variance-based local complexity calculation scheme is proposed that adopts the CBAP of cover images, which more precisely delineates pixels with lower local complexity and reduce the distortion caused by data embedding. (2) We design a new predictor to improve the prediction performance by calculating the weighted geometric mean of neighboring pixels to reduce the effect of extreme values. (3) To make it easier for other scholars to verify the work, we have uploaded all the code and other materials at https://github.com/WangChaoer/AELC. The rest of this paper is organized as follows. The details of the proposed method are described in Sect. 2. Section 3 provides the experimental details and results of AELC. Finally, a conclusion is given in Sect. 4.

2 Proposed Method

In this section, we provide the implementation details of the proposed method. Figure 2 illustrates a brief flowchart of the RDH scheme proposed in this paper. In the data embedding procedure, the cover image is first processed with overflow/underflow, and then divided into two subsets by the checkerboard pattern. Next, secret data are embedded into each subset to obtain the stego-image. In the


data extraction and recovery procedure, first, the stego-image is divided into two subsets by the checkerboard pattern, then secret data are extracted, and pixel values are recovered for each subset separately. Finally, the overflow/underflow pixels are recovered to obtain the original cover image.


Fig. 2. The brief flowchart of the proposed AELC algorithm in this paper

2.1 Image Pre-processing

To avoid the overflow/underflow issue during embedding, we modify the pixel with a value of 0 or 255. The specific operation involves changing 0 to 1 and changing 255 to 254. Additionally, we mark these modified pixels as 1 in the location map. For the remaining pixels, we mark them as 0 in the same location map. Subsequently, the location map is losslessly compressed to reduce its size and embedded in the cover image as a part of the secret data. As shown in Fig. 3, the cover image is partitioned into two subsets, M and N, using a checkerboard pattern. Subset M is indicated in blue, while subset N is represented in white.
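A minimal sketch of the pre-processing step just described is shown below, assuming a NumPy grayscale image. The helper name and the parity convention used to define subsets M and N are assumptions for illustration, not the authors' released code.

```python
import numpy as np

def preprocess_and_split(cover):
    """Clip 0/255 pixels to 1/254, record them in a location map, and split the
    image into two checkerboard subsets M and N (sketch under assumptions)."""
    img = cover.astype(np.int32).copy()
    location_map = np.zeros_like(img, dtype=np.uint8)
    location_map[(img == 0) | (img == 255)] = 1   # mark modified pixels
    img[img == 0] = 1
    img[img == 255] = 254
    rows, cols = np.indices(img.shape)
    mask_M = (rows + cols) % 2 == 0               # checkerboard: subset M
    mask_N = ~mask_M                              # subset N
    return img, location_map, mask_M, mask_N
```

The compressed location map would then be appended to the payload so the receiver can undo the 0/255 adjustment during recovery.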


Fig. 3. An example of allocating pixels with checkerboard (Color figure online)


Fig. 4. Adjacent pixels of P

2.2 Local Complexity Calculation

Figure 3 and Fig. 4 depict the precise division of the target pixel P and its adjacent pixels {x_1, x_2, x_3, x_4}. In Fig. 3, we focus solely on calculating the fluctuation value of the reference pixels without calculating their local complexity. Since the calculation methods for local complexity in subsets M and N are the same, we present the specific scheme for calculating the local complexity using subset M as an example. First, we gather the adjacent pixels of the target pixel P as a set and calculate the variance of this set to obtain the fluctuation value of P. Subsequently, we utilize the fluctuation values of the adjacent pixels to determine the local complexity of P. When calculating the local complexity, we categorize each pixel's position into three types based on the number of pixels adjacent to the target pixel. The specific calculation process is as follows:

Case 1: We calculate the fluctuation value of the target pixel P at position P1 using the following formula:

F_{P_1} = \frac{1}{2}\left[(x_3 - \bar{x})^2 + (x_4 - \bar{x})^2\right]   (1)

Case 2: The fluctuation value of the target pixel P at position P2 is computed as follows:

F_{P_2} = \frac{1}{3}\left[(x_2 - \bar{x})^2 + (x_3 - \bar{x})^2 + (x_4 - \bar{x})^2\right]   (2)

Case 3: The fluctuation value of the target pixel P at position P4 is determined by the following calculation:

F_{P_4} = \frac{1}{4}\left[(x_1 - \bar{x})^2 + (x_2 - \bar{x})^2 + (x_3 - \bar{x})^2 + (x_4 - \bar{x})^2\right]   (3)

where \bar{x} denotes the average value of the pixels surrounding the target pixel P. In order to accurately measure the local complexity of a pixel, we utilize the fluctuation values of the adjacent pixels of the target pixel to calculate its local complexity. Taking the pixel values in Fig. 3 as an example, the local complexity Ω of the pixel point P4 is computed as:

\Omega_{P_4} = \frac{F_{P_4} + F_{P_1} + F_{P_2} + F_{P_6} + F_{P_7}}{5}   (4)
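The sketch below illustrates the fluctuation and local-complexity computations of Eqs. (1)-(4) for a single pixel. The helper names, the border handling, and the offsets used to pick the nearby same-subset pixels are assumptions made for illustration; the exact neighbourhood in Eq. (4) follows Fig. 3.

```python
import numpy as np

def fluctuation(img, i, j):
    """Variance of the existing 4-neighbours of pixel (i, j), as in Eqs. (1)-(3)."""
    h, w = img.shape
    neigh = [img[r, c] for r, c in ((i - 1, j), (i, j - 1), (i, j + 1), (i + 1, j))
             if 0 <= r < h and 0 <= c < w]
    neigh = np.asarray(neigh, dtype=np.float64)
    return float(np.mean((neigh - neigh.mean()) ** 2))

def local_complexity(img, i, j, offsets=((-2, 0), (0, -2), (0, 2), (2, 0))):
    """Average the fluctuation values of (i, j) and nearby same-subset pixels,
    in the spirit of Eq. (4); the offsets here are a simplifying assumption."""
    h, w = img.shape
    vals = [fluctuation(img, i, j)]
    for di, dj in offsets:
        r, c = i + di, j + dj
        if 0 <= r < h and 0 <= c < w:
            vals.append(fluctuation(img, r, c))
    return float(np.mean(vals))
```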

2.3 Calculation of Prediction Errors

Since the processes for subsets M and N are similar, we again take subset M as an instance to illustrate how to calculate the prediction error. We use each pixel's four nearest neighbors to make a prediction. As shown in Fig. 3, we calculate the prediction error for all pixels in each subset, excluding the reference pixels. Taking the target pixel P in Fig. 4 as an example, we calculate its predicted pixel value as follows:

P' = x_1^{f_1} x_2^{f_2} x_3^{f_3} x_4^{f_4}   (5)


where P' is the predicted value of pixel P, and f_i represents the weight assigned to the corresponding adjacent pixel x_i, with \sum_{i=1}^{4} f_i = 1. The prediction error is calculated by

P_e = P - P'   (6)

The solution for the four weights f_i in Eq. (5) is as follows. First, we calculate the average value of the four pixels adjacent to the target pixel P:

ave = \left\lfloor \frac{x_1 + x_2 + x_3 + x_4}{4} \right\rfloor   (7)

where \lfloor \cdot \rfloor denotes the floor function. Then, we calculate the distance between each pixel x_i and the average value ave:

e_i = |ave - x_i|   (8)

Finally, the corresponding weights are calculated as follows:

f_i' = \begin{cases} \frac{1}{4}, & \text{if } \sum_{i=1}^{4} |e_i| = 0 \\ \frac{\sum_{i=1}^{4} |e_i|}{1 + |e_i|}, & \text{otherwise} \end{cases}   (9)

where i ∈ [1, 4]. We then normalize f_1', f_2', f_3' and f_4' to obtain f_1, f_2, f_3 and f_4, which represent the weights for the upper, left, right, and lower pixels, respectively.
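A compact sketch of the weighted geometric-mean predictor of Eqs. (5) and (7)-(9) follows. The weight handling matches the equations as reconstructed above, which involves an assumption wherever the extracted text was ambiguous.

```python
import numpy as np

def predict_pixel(x_up, x_left, x_right, x_down):
    """Predict a pixel from its four neighbours (sketch of Eqs. (5), (7)-(9))."""
    x = np.array([x_up, x_left, x_right, x_down], dtype=np.float64)
    ave = np.floor(x.sum() / 4.0)                 # Eq. (7)
    e = np.abs(ave - x)                           # Eq. (8)
    if e.sum() == 0:                              # Eq. (9), first branch
        f = np.full(4, 0.25)
    else:
        f_raw = e.sum() / (1.0 + e)               # Eq. (9), second branch
        f = f_raw / f_raw.sum()                   # normalise so weights sum to 1
    return float(np.prod(x ** f))                 # Eq. (5): weighted geometric mean
```

Note that pixels closer to the local average receive larger weights, which is what reduces the influence of extreme neighbour values.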

2.4 The Data Embedding Procedure

In this procedure, the secret data is divided into two halves, with one half embedded in subset M and the other half embedded in subset N. The detailed embedding process is as follows:

Step 1: The cover image is preprocessed according to the method described in Sect. 2.1. Subsequently, using the calculation methods described in Sects. 2.2 and 2.3, the local complexity and prediction error values of all pixels in subset M, excluding the reference pixels, are calculated.

Step 2: We sort the local complexity sequence in ascending order to obtain {Ω_{M_1}, Ω_{M_2}, Ω_{M_3}, ..., Ω_{M_i}}, and the sequence {Pe_{M_1}, Pe_{M_2}, Pe_{M_3}, ..., Pe_{M_i}} is obtained by sorting the prediction errors in ascending order of the local complexity values.

Step 3: The peak points PK_{M1} and PK_{M2}, as well as their corresponding zero points Z_{M1} and Z_{M2}, are determined from the prediction error histogram of subset M.

Step 4: We sequentially modify the prediction error values in the sequence {Pe_{M_1}, Pe_{M_2}, Pe_{M_3}, ..., Pe_{M_i}} to embed secret data. The detailed modification of Pe is as follows:

Pe_i' = \begin{cases} Pe_i + b, & \text{if } Pe_i = \max(PK_{M1}, PK_{M2}) \\ Pe_i - b, & \text{if } Pe_i = \min(PK_{M1}, PK_{M2}) \\ Pe_i + 1, & \text{if } \max(PK_{M1}, PK_{M2}) < Pe_i < \max(Z_{M1}, Z_{M2}) \\ Pe_i - 1, & \text{if } \min(Z_{M1}, Z_{M2}) < Pe_i < \min(PK_{M1}, PK_{M2}) \\ Pe_i, & \text{otherwise} \end{cases}   (10)


where Pe_i represents the original prediction error value, Pe_i' is the modified prediction error value, and b ∈ {0, 1} is a bit of the secret data.

Step 5: The stego-image pixel values are obtained by modifying the corresponding original pixel values in the following way:

\tilde{P}_i = P_i' + Pe_i'   (11)

where \tilde{P}_i is the modified (stego) pixel value and P_i' is the predicted value. Step 6: According to Eq. (11), the stego-image of subset M is obtained; on this basis, we implement the embedding of subset N in the same manner.
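The following sketch shows the modification rule of Eq. (10) applied to prediction errors that have already been sorted by ascending local complexity. It is illustrative only; capacity checks and the handling of the location map are omitted, and the function name is hypothetical.

```python
def embed_bits(pred_errors, pk1, pk2, z1, z2, bits):
    """Embed bits into prediction errors via Eq. (10); returns the modified
    errors and the number of bits actually embedded."""
    lo_pk, hi_pk = min(pk1, pk2), max(pk1, pk2)
    lo_z, hi_z = min(z1, z2), max(z1, z2)
    out, k = [], 0
    for pe in pred_errors:
        if k < len(bits) and pe == hi_pk:
            out.append(pe + bits[k]); k += 1          # embed one bit by +b
        elif k < len(bits) and pe == lo_pk:
            out.append(pe - bits[k]); k += 1          # embed one bit by -b
        elif hi_pk < pe < hi_z:
            out.append(pe + 1)                        # invalid shift to the right
        elif lo_z < pe < lo_pk:
            out.append(pe - 1)                        # invalid shift to the left
        else:
            out.append(pe)
    return out, k
```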

2.5 The Data Extraction and Recovery Procedure

The data extraction and recovery procedure is the reverse process: we first extract the secret data from subset N and recover subset N, and then do the same for subset M. The detailed process is as follows:

Step 1: Calculate the local complexity and the modified prediction errors of all pixels in subset N based on the methods described in Sects. 2.2 and 2.3.

Step 2: The local complexity sequence is arranged in ascending order to obtain {Ω_{N_1}, Ω_{N_2}, Ω_{N_3}, ..., Ω_{N_i}}, and the modified prediction error sequence {Pe_{N_1}', Pe_{N_2}', Pe_{N_3}', ..., Pe_{N_i}'} is obtained by sorting the prediction errors in ascending order of the local complexity.

Step 3: The secret data in the sequence {Pe_{N_1}', Pe_{N_2}', Pe_{N_3}', ..., Pe_{N_i}'} is extracted sequentially until the extraction is complete. The secret data is extracted as follows:

b = \begin{cases} 0, & \text{if } Pe_i' = PK_{N1} \text{ or } Pe_i' = PK_{N2} \\ 1, & \text{if } Pe_i' = \min(PK_{N1}, PK_{N2}) - 1 \text{ or } Pe_i' = \max(PK_{N1}, PK_{N2}) + 1 \end{cases}   (12)

where PK_{N1}, PK_{N2}, Z_{N1} and Z_{N2} are the same as in the embedding procedure, and b is the extracted secret bit.

Step 4: While extracting the secret data and recovering the cover image, the prediction error is recovered as follows:

Pe_i = \begin{cases} Pe_i' - 1, & \text{if } \max(PK_{N1}, PK_{N2}) < Pe_i' \le \max(Z_{N1}, Z_{N2}) \\ Pe_i' + 1, & \text{if } \min(Z_{N1}, Z_{N2}) \le Pe_i' < \min(PK_{N1}, PK_{N2}) \\ Pe_i', & \text{otherwise} \end{cases}   (13)

where Pe_i is the recovered prediction error.

Step 5: Correspondingly, the original pixel values are recovered as:

P_i = P_i' + Pe_i   (14)

where P_i is the recovered original pixel value and P_i' is the predicted pixel value. Step 6: After the extraction and recovery of subset N are completed, the extraction and recovery of subset M are performed similarly. In addition, the


pixel values are recovered based on the decompressed location map. When the location map is marked as 1, the value 254 is changed back to 255, and the value 1 is changed back to 0. Finally, the entire image is fully recovered.
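As a counterpart to the embedding sketch above, the following illustration follows Eqs. (12)-(13) to recover bits and original prediction errors when the modified errors are visited in the same complexity order as during embedding. It is a sketch under the same simplifying assumptions.

```python
def extract_and_recover(mod_errors, pk1, pk2, z1, z2, n_bits):
    """Recover secret bits (Eq. (12)) and original prediction errors (Eq. (13))."""
    lo_pk, hi_pk = min(pk1, pk2), max(pk1, pk2)
    lo_z, hi_z = min(z1, z2), max(z1, z2)
    bits, recovered, k = [], [], 0
    for pe in mod_errors:
        if k < n_bits and pe in (pk1, pk2):
            bits.append(0); k += 1; recovered.append(pe)           # Eq. (12), b = 0
        elif k < n_bits and pe in (lo_pk - 1, hi_pk + 1):
            bits.append(1); k += 1                                  # Eq. (12), b = 1
            recovered.append(pe - 1 if pe == hi_pk + 1 else pe + 1)
        elif hi_pk < pe <= hi_z:
            recovered.append(pe - 1)                                # Eq. (13)
        elif lo_z <= pe < lo_pk:
            recovered.append(pe + 1)                                # Eq. (13)
        else:
            recovered.append(pe)
    return bits, recovered
```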

3 Experimental Results and Analyses

In this section, the efficiency of the proposed method is examined utilizing six grayscale images of the size 512 × 512, including Lena, Baboon, F-16, Boat, Pepper and Elaine from USC-SIPI database shown in Fig. 5. In the following sections, we present the experimental results of our proposed method and compare them with the results of some state-of-the-art schemes.

Fig. 5. Six 512 * 512 test images (Lena, Baboon, F-16, Boat, Pepper and Elaine)

3.1 The Visual Quality of Stego-Image

RDH in the plaintext domain typically uses visual quality (VQ) and EC as important metrics for evaluating an algorithm. The visual security of the stego-image is commonly assessed objectively using the PSNR, which is calculated as follows:

MSE = \frac{1}{L \times W} \sum_{i=1}^{L} \sum_{j=1}^{W} (g_{ij} - g_{ij}')^2, \quad PSNR = 10 \times \log_{10} \frac{255^2}{MSE}   (15)

Fig. 6. Six stego-images when the embedding capacity is maximum

where PSNR is measured in dB, L and W represent the length and width of the image, g_{ij} represents the pixel value at position (i, j) of the cover image, and g_{ij}' represents the pixel value at position (i, j) of the stego-image.
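A direct implementation of Eq. (15) for two equally sized grayscale images might look as follows (a small utility sketch, not part of the authors' code).

```python
import numpy as np

def psnr(cover, stego):
    """PSNR between cover and stego images, following Eq. (15)."""
    cover = cover.astype(np.float64)
    stego = stego.astype(np.float64)
    mse = np.mean((cover - stego) ** 2)
    if mse == 0:
        return float("inf")           # identical images
    return 10.0 * np.log10(255.0 ** 2 / mse)
```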

Table 1. The maximum EC and PSNR of six test images.

Cover image   Lena    Baboon  F-16    Boat    Pepper  Elaine  Average
EC (bits)     76483   25146   96876   40001   48640   35779   53821
PSNR (dB)     48.85   48.38   49.05   48.51   48.59   48.48   48.64

In this experiment, we set the number of secret data bits to the maximum EC of each image. Table 1 shows the maximum EC and PSNR of our scheme, and the corresponding stego-images are shown in Fig. 6. The results clearly demonstrate that it is difficult for the human visual system to detect the difference between the cover images and the stego-images. Hiding data within an image to prevent detection by attackers is precisely the objective of reversible data hiding techniques.

3.2 Comparison of the ISPs

Table 2 compares the number of ISPs between our proposed method and four other methods [1,6,7,23] on six test images when the secret data consists of 10000 bits. For the smooth F-16 image, the number of ISPs is 2823 bits in our proposed method, whereas the other four methods require 18726 bits, 3181 bits, 4461 bits and 3517 bits, respectively. As for the complex Baboon image, the number of ISPs is 41549 bits in our proposed method, while the other four methods require 98471 bits, 42914 bits, 47760 bits and 58332 bits, respectively. Table 2 shows that the number of ISPs in our proposed method is significantly lower than that of the other four methods for all six test images.

Table 2. The ISPs (bits) of different methods when 10000 bits are embedded.

Method            Lena    Baboon  F-16    Boat    Pepper  Elaine
Jung K H [7]      21377   98471   18726   53832   33282   53037
Jia et al. [6]    14088   42914   3181    32127   23359   23067
Chu et al. [1]    15604   47760   4461    33058   25286   29175
Wu H Z [23]       14932   58332   3517    30139   24346   26305
AELC (Proposed)   13341   41549   2823    28658   22560   20916

3.3 Comparison of Embedding Capacity

RDH Based on Adaptive Embedding with Local Complexity

531

to the three schemes, the proposed method exhibits an increase of 54.54%, 0.34% and 1.82%, respectively. Similarly, for the image Elaine, the proposed method surpasses the other three schemes by 49.37%, 0.41%, and 4.32%, respectively. This improvement is attributed to the proposed method utilization of a double peak selection method and a new predictor. The new predictor enhances the accuracy of predicted pixel values by calculating the weighted geometric mean of adjacent pixels, thereby reducing the impact of extreme values and improving the embedding capacity. Table 3. Embedding capability for each image by using four schemes. Method

Lena

Baboon F-16

Boat

Pepper Elaine

Jung K H [7]

38084

16271

45222

25475

32225

23953

Jia et al. [6]

76437

25060

96688

39856

48586

35635

Chu et al. [1]

74064

24696

94304

38947

47970

34296

AELC (Proposed) 76483 25146

3.4

96876 40001 48640 35779

Comparison of PSNR

To evaluate the performance of our method, we compare it with the advanced schemes proposed by Jung K H [7], Jia et al. [6], Chu et al. [1] and WU H Z [23]. The relationship between EC and PSNR is depicted in Fig. 7. Our proposed method outperforms the other methods in terms of payloads. For instance, when embedding 10000 bits into Lena, the PSNR of our proposed method is 59.68 dB, while the PSNR of the other methods is 58.16 dB, 59.48 dB, 58.84 dB and 59.16 dB. The improvement in PSNR is 1.52 dB, 0.2 dB, 0.84 dB and 0.52 dB. Similarly, When embedding 10000 bits into the texture image Baboon, our scheme outperforms [1,6,7] and [23] with an increase in PSNR by 3.51 dB, 0.17 dB, 0.61 dB and 1.41 dB. Through comparisons with four state-of-the-art strategies, we conclude that our proposed method can effectively reduce distortion compared to the other methods, whether applied to smooth or unsmooth images.

532

C. Wang et al.

Fig. 7. Comparison of the performance with other schemes.

4

Conclusions

In this paper, we proposed a new RDH scheme called AELC, which aims to reduce the number of ISPs and increase the EC by using image texture information and CBAP. The core idea of AELC method is to partition the carrier image into two subsets using a checkerboard pattern. The local complexity of each pixel in each subset is calculated to determine the degree of smoothness, and the local complexity values are sorted in ascending order. Furthermore, the AELC method combined the prediction error with local complexity to selectively embedded the secret data in the position of the smooth pixel. It effectively reduces the number of ISPs and increases the EC of the method. A large number of experiments prove that the method proposed in this paper has very good capacity-distortion performance.

RDH Based on Adaptive Embedding with Local Complexity

533

Acknowledgements. This work is supported in part by the National Natural Science Foundation of China (62032019), and the Capacity Development Grant of Southwest University (SWU116007); in part by the Chongqing Industrial Control System Information Security Technology Support Center; in part by the Chongqing Intelligent Instrument and Control Equipment Engineering Technology Research Center.

References 1. Chu, L., Wu, H., Zeng, Y., Tang, X.: Improved weighted average-based rhombus predictor in reversible data hiding using prediction error expansion. In: 2021 IEEE 6th International Conference on Computer and Communication Systems (ICCCS), pp. 6–11. IEEE (2021) 2. Fan, G., Pan, Z., Zhou, Q., Zhang, X.: Flexible patch moving modes for pixel-valueordering based reversible data hiding methods. Expert Syst. Appl. 214, 119154 (2023) 3. Fu, Z., Gong, M., Long, G., Gan, Z., Chai, X., Lu, Y.: Efficient capacity-distortion reversible data hiding based on combining multipeak embedding with local complexity. Appl. Intell. 52(11), 13006–13026 (2022). https://doi.org/10.1007/s10489022-03323-8 4. Hu, R., Xiang, S.: CNN prediction based reversible data hiding. IEEE Sig. Process. Lett. 28, 464–468 (2021) 5. Hua, Z., Wang, Y., Yi, S., Zhou, Y., Jia, X.: Reversible data hiding in encrypted images using cipher-feedback secret sharing. IEEE Trans. Circ. Syst. Video Technol. 32(8), 4968–4982 (2022) 6. Jia, Y., Yin, Z., Zhang, X., Luo, Y.: Reversible data hiding based on reducing invalid shifting of pixels in histogram shifting. Sig. Process. 163, 238–246 (2019) 7. Jung, K.H.: A high-capacity reversible data hiding scheme based on sorting and prediction in digital images. Multimedia Tools Appl. 76, 13127–13137 (2017). https://doi.org/10.1007/s11042-016-3739-x 8. Li, X., Yang, B., Zeng, T.: Efficient reversible watermarking based on adaptive prediction-error expansion and pixel selection. IEEE Trans. Image Process. 20(12), 3524–3533 (2011) 9. Ni, Z., Shi, Y.Q., Ansari, N., Su, W.: Reversible data hiding. IEEE Trans. Circ. Syst. Video Technol. 16(3), 354–362 (2006) 10. Peng, F., Li, X., Yang, B.: Improved PVO-based reversible data hiding. Digit. Sig. Process. 25, 255–265 (2014) 11. Puteaux, P., Ong, S., Wong, K., Puech, W.: A survey of reversible data hiding in encrypted images-the first 12 years. J. Vis. Commun. Image Represent. 77, 103085 (2021) 12. Quan, Y., Teng, H., Chen, Y., Ji, H.: Watermarking deep neural networks in image processing. IEEE Trans. Neural Netw. Learn. Syst. 32(5), 1852–1865 (2020) 13. Rustad, S., Andono, P.N., Shidik, G.F., et al.: Digital image steganography survey and investigation (goal, assessment, method, development, and dataset). Sig. Process. 206, 108908 (2022) 14. Sharafi, J., Khedmati, Y., Shabani, M.: Image steganography based on a new hybrid chaos map and discrete transforms. Optik 226, 165492 (2021) 15. Shi, Y.Q., Li, X., Zhang, X., Wu, H.T., Ma, B.: Reversible data hiding: advances in the past two decades. IEEE Access 4, 3210–3237 (2016)


16. Sun, J., Yang, Z., Zhang, Y., Li, T., Wang, S.: High-capacity data hiding method based on two subgroup pixels-value adjustment using encoding function. Secur. Commun. Netw. 2022 (2022) 17. Thodi, D.M., Rodriguez, J.J.: Prediction-error based reversible watermarking. In: 2004 International Conference on Image Processing, ICIP 2004, vol. 3, pp. 1549– 1552. IEEE (2004) 18. Thodi, D.M., Rodr´ıguez, J.J.: Expansion embedding techniques for reversible watermarking. IEEE Trans. Image Process. 16(3), 721–730 (2007) 19. Tian, J.: Reversible data embedding using a difference expansion. IEEE Trans. Circ. Syst. Video Technol. 13(8), 890–896 (2003) 20. Wang, X., Ding, J., Pei, Q.: A novel reversible image data hiding scheme based on pixel value ordering and dynamic pixel block partition. Inf. Sci. 310, 16–35 (2015) 21. Weinberger, M.J., Seroussi, G., Sapiro, G.: The LOCO-I lossless image compression algorithm: principles and standardization into JPEG-LS. IEEE Trans. Image Process. 9(8), 1309–1324 (2000) 22. Weng, S., Tan, W., Ou, B., Pan, J.S.: Reversible data hiding method for multihistogram point selection based on improved crisscross optimization algorithm. Inf. Sci. 549, 13–33 (2021) 23. Wu, H.: Efficient reversible data hiding simultaneously exploiting adjacent pixels. IEEE Access 8, 119501–119510 (2020) 24. Xuan, G., Chen, J., Zhu, J., Shi, Y.Q., Ni, Z., Su, W.: Lossless data hiding based on integer wavelet transform. In: 2002 IEEE Workshop on Multimedia Signal Processing, pp. 312–315. IEEE (2002) 25. Yu, C., Zhang, X., Zhang, X., Li, G., Tang, Z.: Reversible data hiding with hierarchical embedding for encrypted images. IEEE Trans. Circ. Syst. Video Technol. 32(2), 451–466 (2021)

Generalized Category Discovery with Clustering Assignment Consistency

Xiangli Yang1, Xinglin Pan2, Irwin King3, and Zenglin Xu4(B)

1 School of Computer Science and Engineering, University of Electronic Science and Technology of China, Chengdu, China [email protected]
2 Department of Computer Science, Hong Kong Baptist University, Hong Kong, China [email protected]
3 Department of Computer Science and Engineering, The Chinese University of Hong Kong, Hong Kong, China [email protected]
4 School of Computer Science and Technology, Harbin Institute of Technology, Shenzhen, China [email protected]

Abstract. Generalized category discovery (GCD) is an important open-world task: given a dataset consisting of labeled and unlabeled instances, the goal is to automatically cluster the unlabeled samples using information transferred from the labeled dataset. A major challenge in GCD is that unlabeled novel-class samples and unlabeled known-class samples are mixed together in the unlabeled dataset. To conduct GCD without knowing the number of classes in the unlabeled dataset, we propose a co-training-based framework that encourages clustering consistency. Specifically, we first introduce weak and strong augmentation transformations to generate two sufficiently different views of the same sample. Then, based on the co-training assumption, we propose a consistency representation learning strategy, which encourages consistency between feature-prototype similarity and clustering assignment. Finally, we use the discriminative embeddings learned during the semi-supervised representation learning process to construct an original sparse network and apply a community detection method to obtain the clustering results and the number of categories simultaneously. Extensive experiments show that our method achieves state-of-the-art performance on three generic benchmarks and three fine-grained visual recognition datasets.

Keywords: Generalized category discovery · Open-world problem · Novel category discovery

1 Introduction

Fig. 1. An illustration of the generalized category discovery task. Given a set of labeled images of some known categories and a set of unlabeled images that contain both known and novel classes, the objective is to automatically identify known and novel classes in the test dataset.

Deep learning has proven to be remarkably successful in image classification tasks that heavily rely on vast amounts of high-quality labeled data [25,31]. However, acquiring annotations for all data is often unfeasible, which requires learning from unlabeled data. Semi-supervised learning [33,44] and few-shot learning [12,18] are two examples of such learning paradigms. Most of these learning algorithms operate in closed set settings, where known categories are predetermined. In recent years, Novel Category Discovery (NCD) [21,23] has received significant attention for the discovery of novel classes by transferring knowledge learned from known classes. The implementation of NCD is based on the assumption that all unlabeled samples pertain to novel categories whose number is predetermined, which is excessively idealistic and impractical in the wild. [39] relaxes these assumptions by recognizing instances of both known and novel classes in the unlabeled dataset. This new setting is named Generalized Category Discovery (GCD) [4,16,39]. To illustrate the setting of GCD, we consider the scenario in Fig. 1, where the training set consists of labeled instances of known classes (e.g., cane and cavallo) and unlabeled instances comprising instances of known classes (i.e., cane and cavallo) and novel classes (i.e., gallina and elephant). An ideal model should not only be able to classify known classes (i.e., cane, and cavallo), but also be able to discover novel classes (i.e., gallina, and elephant). Despite the promising aspects, GCD faces significant obstacles, namely: a) the lack of supervised prior knowledge for novel classes and b) the absence of clear distinctions between samples belonging to known and novel categories. To address these problems, [39] incorporates contrastive loss to learn discriminative representations of the unlabeled dataset. Furthermore, semi-supervised k-means [2] is applied to implement clustering. However, we find that using all unlabeled instances as negative samples in contrastive learning will treat unlabeled instances of the same category as negative pairs, which will cause the category collision problem [46]. Moreover, in real-world problems, prior information


regarding the number of clustering assignments is rarely available, which makes it inappropriate to use semi-supervised k-means. To overcome these limitations, we present a co-training consistency strategy for clustering assignments to uncover latent representations among unlabeled dataset. For the final clustering goal, we leverage community detection techniques to assign labels to unlabeled instances and automatically determine the optimal number of clustering categories based on the learned representations. In particular, our approach consists of two stages: semi-supervised representation learning and community detection. In semi-supervised representation learning, we adopt the supervised contrastive loss to derive the labeled information fully. Additionally, since the contrastive learning approach aligns with the co-training assumption, we introduce weak and strong augmentations of the same sample to extract two distinct views. We subsequently deploy the co-training framework to enforce the consistency of feature-prototype similarity and clustering assignment between the two views. In community detection, we construct the primary graph utilizing feature embeddings learned in semi-supervised representation learning, then apply a community detection method to obtain the outcomes. Through experimentation on six benchmarks, our approach demonstrates stateof-the-art performance and confirms the efficacy of our method. Especially in the ImageNet-100 data set, our method significantly exceeds [39] by 15.5% and 7.0% on the Novel and All classes, respectively.

2 Related Work

Novel Category Discovery. The objective of Novel Category Discovery (NCD) is to cluster unlabeled instances using transferable knowledge derived from the labeled dataset of known categories, with all unlabeled instances regarded as novel categories [38]. KCL [27] addresses the problem of transfer learning across domains, especially cross-task transfer learning, which is similar with NCD. DTC [23] utilizes a cross-entropy classifier to initialize representations on the labeled dataset. Then it maintains a list of class prototypes that represent cluster centers, and assigns instances to the closest prototype. In this way, a good clustering representation is learned. RankStats [21,22] applies RotNet [19] to perform an initialization of the encoder. Afterward, it generates pseudo-labels through dualranking statistics and enforces consistency between the predictions of the two branch networks. Unlike the methods that separately design loss functions for labeled and unlabeled samples, UNO [17] introduces a unified objective function to discover new classes that effectively supports the collaboration between supervised and unsupervised learning. MEDI [9] proposes a new approach using MAML [18] to solve the NCD problem and provided a solid theoretical analysis of the underlying assumptions in the NCD field. ComEx [43] modifies UNO [17] by designing two groups of experts to learn the entire dataset in a complementary way, thus mitigating the limitations of previous NCD methods. Recently, GCD [39] extends the NCD task to include the recognition of known classes in the unlabeled dataset. It leverages a well-trained visual transformer model [13] to improve visual representation.


Self-supervised Learning. Self-supervised learning [1] can simultaneously utilize supervised and unsupervised learning during the fine-tuning stage of training without the need for manual annotations. Furthermore, self-supervised learning can be divided into two types: auxiliary pretext learning and contrastive learning. When manual annotations are not available, auxiliary pretext tasks are used as supervised information to learn representations [26]. Some commonly used pretext tasks include exemplar-based methods [14], rotation [19], predicting missing pixels [35], and grayscale images [45]. However, designing a suitable auxiliary pretext task requires domain-specific knowledge in order to better serve downstream tasks. Contrastive learning minimizes the latent embedding distance between positive pairs and maximizes the distance between negative pairs [42]. SimCLR [8] learned useful representations based on contrastive loss by maximizing the similarity between the original sample and augmented views of it. MoCo [24] utilized a momentum contrast to calculate the similarity. BYOL [20] proposed a new framework for learning feature representations without the help of contrasting negative pairs. This was achieved by using a Siamese architecture, where the query branch is equipped with a predictor architecture in addition to the encoder and projector.

3 Method

3.1 Preliminaries

GCD aims to automatically classify unlabeled instances containing both known and unknown categories [39]. This is a more realistic open-world setting than the common closed-set classification, which assumes that labeled and unlabeled data belong to the same categories. Our settings follows [39]. Formally, let the train dataset be D = DL ∪ DU , where DL = {(xi , yi )|yi ∈ Yknown } and DU = {xi |yi ∈ YU } and YU = {Yknown , Ynovel }. Here, YU , Yknown and Ynovel denote the label set of All, Known, and Novel classes, respectively. This formalization enables us to easily distinguish between the NCD setting and GCD setting. In NCD, it is assumed that Yknown ∩ YU = ∅. To tackle the issue of GCD, we leverage co-trained clustering assignment consistency for acquiring discriminative representations of the unlabeled data. We then employ a customized community detection technique to automatically obtain the clustering results and the number of categories. Specifically, we use supervised contrastive learning to make maximum use of the supervised information for labeled instances. As depicted in Fig. 2, we apply a co-training framework to unlabeled data to ensure the consistency between the clustering assignment and the feature-prototype similarities of the same image. Following the semisupervised representation learning, we utilize a modified community detection strategy to allocate category labels for each unlabeled instance, regardless if it is a known one or a novel one, and to determine the number of categories.

3.2 Semi-supervised Representation Learning

Supervised Contrastive Learning. We apply supervised contrastive learning to maximize the similarity between embeddings of the same class while simultaneously minimizing the similarity between embeddings from different classes. Utilizing label information only in the supervised contrastive loss, instead of in a cross-entropy loss, results in equal treatment of labelled and unlabeled data. The supervised contrastive component nudges the network towards a semantically meaningful representation and reduces overfitting on the labeled classes. The supervised contrastive loss, as in [28,39], is defined as,

Fig. 2. Diagram of clustering assignment consistency. For an original image x, we apply two transformations Tw and Ts to obtain two different views xw and xs. They then pass through a backbone network, resulting in two embedded representations zw and zs. In addition, C is the matrix whose columns are the K trainable prototype vectors {c1, c2, · · · , cK}. ⊗ denotes the dot products of zi with all prototypes in C. We can then compute the codes (clustering assignments) qw and qs by matching these representations to the prototype vectors.

L_{sup} = -\frac{1}{|N(i)|} \sum_{p \in N(i)} \log \frac{\exp(sim(z_i, z_p)/\tau_{sup})}{\sum_{n} \mathbb{1}_{[n \neq i]} \exp(sim(z_i, z_n)/\tau_{sup})}   (1)

where z_i = h(f(x_i)), f(·) is the feature extractor, and h(·) is a multi-layer perceptron (MLP) projection head. sim(z_i, z_p) denotes the cosine similarity between z_i and z_p, τ_{sup} is a temperature value, N(i) is the set of indices of other images having the same label as x_i, and \mathbb{1}_{[n \neq i]} is an indicator function evaluating to 1 iff n ≠ i.

Unsupervised Contrastive Learning. We found that the unsupervised contrastive learning in [39] treats different unlabeled samples from the same semantic category as false negatives, resulting in a class collision problem [46]. Motivated by this, we introduce a soft approach to mitigate this problem. In contrastive learning [8], two different augmentations generate two different views, and this scenario satisfies the co-training assumption, that is,

f_1(v_1) = f_2(v_2), \quad \forall x = (v_1, v_2) \quad \text{(Co-Training Assumption)}   (2)


where f(·), f_1(·), f_2(·) refer to different prediction functions, and v_1 and v_2 are two different views of x. In this paper, weak augmentation and strong augmentation are used to generate the two separate views. We rethink both the co-training assumption and the swapped prediction problem discussed in [6]. We consider the probability of feature similarity to the prototypes, and promote the minimization of the divergence between the soft clustering assignment and this probability in the weakly augmented case. We ignore the strongly augmented case, since the induced distortions could significantly alter the image structure and make it difficult to preserve the identity of the original instance. Additionally, we measure the fit between the feature embeddings and their soft assignments. Formally, let x_w and x_s denote the weakly and strongly augmented views of the same image x, respectively, and let {c_1, c_2, · · · , c_K} be a set of K learnable prototypes. The feature is projected onto the unit sphere, i.e., z_w = \frac{f(x_w)}{\|f(x_w)\|_2}. We consider the feature-prototype similarity probability and the clustering assignment as two different predictions as in Eq. (2). Following [36], we use the Jensen-Shannon divergence between the feature-prototype similarity probability p_w and the clustering assignment q_w; the consistency loss is written as

L(z_w, q_w) = H\!\left(\tfrac{1}{2}(p_w + q_w)\right) - \tfrac{1}{2}\left(H(p_w) + H(q_w)\right)   (3)

where H(p) is the entropy of p and the k-th component of p_w is p_w^{(k)} = \frac{\exp(sim(z_w, c_k)/\tau_u)}{\sum_{n} \exp(sim(z_w, c_n)/\tau_u)}. We encourage the consistency of clustering assignments between the weakly and strongly augmented views of the same image, and the loss function is written as follows,

L(z_w, z_s) = L(z_w, q_s) + L(z_s, q_w)   (4)
           = H(p_w, q_s) + H(p_s, q_w)   (5)

where H(·, ·) represents the cross-entropy loss, p_s^{(k)} = \frac{\exp(sim(z_s, c_k)/\tau_u)}{\sum_{n} \exp(sim(z_s, c_n)/\tau_u)}, and sim(·, ·) denotes the cosine similarity as in Eq. (1). Similar to [5,6], we compute the clustering assignment matrix Q = [q_1, q_2, · · · , q_B] from B feature embeddings Z = [z_1, z_2, · · · , z_B] and the prototypes C = [c_1, c_2, · · · , c_K] by solving

\max_{Q \in \mathcal{Q}} \; \mathrm{Tr}(Q^T C^T Z) + \varepsilon H(Q)   (6)

\mathcal{Q} = \left\{ Q \in \mathbb{R}_{+}^{K \times B} \;\middle|\; Q \mathbf{1}_B = \tfrac{1}{K}\mathbf{1}_K, \; Q^T \mathbf{1}_K = \tfrac{1}{B}\mathbf{1}_B \right\}   (7)

(6) (7)

where  is a weight coefficient. Tr(·) denotes the trace function, and Q is the transportation polytope. The solution to Eq. (6) is obtaining using the SinkhornKnopp algorithm [10]. Thus, the overall loss to optimize the model can be formulated as follows, L = E(x,y)∈DL Lsup + αEx∈D L(zw , qw ) + (1 − α)Ex∈D L(zw , zs ), where α is a weight coefficient.

(8)

Generalized Category Discovery with Clustering Assignment Consistency

3.3

541

Label Assignment with Louvain Algorithm

Semi-supervised k-means [2] is applied in [39] for assigning cluster labels. However, this approach necessitates prior knowledge of the number of clusters, a requirement that is not realistic because such information is typically not available beforehand. The learned embeddings can be used to assign cluster labels to unlabeled instances and determine the number of clusters adaptively through a community detection method, Louvain algorithm [3,15]. The representation embeddings can be represented as an embedding graph G = (V, E), where V = {zi }N i=1 , N is the number of embeddings, and E consists of edges that connect vertices in V. We define a matrix W as the adjacency matrix of G, and the entry Wi,j between vertices i and j is given by, ⎧ ⎪ if xi , xj ∈ DL , and yi = yj ⎨1, Wij = sim(zi , zj ), if xi or xj ∈ DU , and zj ∈ Neighbor(zi ) , (9) ⎪ ⎩ 0, otherwise where Neighbor(zi ) = arg topM ({sim(zi , zj )|zj ∈ V}) denotes the neighbors of zi ,i.e., the M embeddings with the greatest similarity to zi . In this way, we construct a graph G, which represents the possible connection relationships among all instances. We can now assign category labels for all instances in the training dataset, either from the known classes or novel ones. Using the Louvain algorithm, we automatically obtain the clustering assignments and the number of categories on the constructed graph.

4 4.1

Experiments Experimental Setup

Datasets. We evaluate our framework on three generic object recognition datasets, namely CIFAR-10 [30], CIFAR-100 [30] and ImageNet-100 [11]. These standard image recognition datasets establish the performance of different methods. We further evaluate our method on Semantic Shift Benchmark (SSB) [40], including CUB-200 [41], Stanford Cars [29] and Herbarium19 [37]. The dataset splits are described in Table 1. We follow [39] sample a subset of half the classes as Known categories. 50% of instances of each labeled class are drawn to form the Table 1. Dataset splits in the experiments. Dataset Labelled

CIFAR10 CIFAR100 ImageNet-100 CUB-200 Stanford Cars Herbarium19

Classes 5 Images 12.5k

80 20k

50 31.9k

100 1498

98 2000

341 8.9k

Unlabelled Classes 10 Images 37.5k

100 30k

100 95.3k

200 4496

196 6144

683 25.4k

542

X. Yang et al.

labeled set, and all the remaining data constitute the unlabeled set. For evaluation, we measure the clustering accuracy by comparing the predicted label assignment with the ground truth, following the protocol in [39]. The accuracy scores for All, Known, and Novel categories are reported. Implementation Details. We follow the implementations and learn schedules in [39] as far as possible. Specifically, we take the ViT-B-16 pre-trained by DINO [7] on ImageNet [31] as our backbone model and we use the [CLS] token as the feature representation. We train semi-supervised contrastive learning with τsup = 0.07, τu = 0.05 and α = 0.3. For image augmentation, we use ResizedCrop, ColorJitter, Grayscale, HorizontalFlip as the weak image augmentation. The strong transformation strategy is composed of five randomly selected from RandAugment. For model optimization, we use the AdamW provided by [32]. The initial learning rate is 0.1. The batch size are chosen based on available GPU memory. All the experiments are conducted on a single RTX-2080 and averaged over 5 different seeds. 4.2

Comparison with State-of-the-Arts

We summarize the baselines compared in our experiments, including k-means [2], RankStats+ [21], UNO+ [17] and GCD [39]. Table 2. Results on three generic datasets. Accuracy scores are reported. †denotes adapted methods. CIFAR-10

CIFAR-100

ImageNet-100

Method

All

Known Novel All

Known Novel All

Known Novel

KMeans [2]

83.6

85.7

82.5

52.0

52.2

50.8

72.7

75.5

71.3

RankStats† [21] 46.8

19.2

60.5

58.2

77.6

19.3

37.1

61.6

24.8

UNO† [17]

68.6

98.3

53.8

69.5

80.6

47.2

70.3

95.0

57.9

GCD [39]

91.5

97.9

88.2

73.0

76.2

66.5

74.1

89.8

66.3

Ours

92.3 91.4

94.4

78.5 81.4

75.6

81.1 80.3

81.8

Evaluation on Generic Datasets. The results on generic benchmarks are shown in Table 2. As we can see that our method achieves state-of-the art performance on All and Novel tested on all generic datasets, especially on ImageNet100. Our method also achieves comparable results with other methods on Known. Specifically, for the All classes, our method beats the GCD method by 0.8%, 5.5%, and 7.0% on CIFAR-10, CIFAR-100, and ImageNet-100, respectively. For the Novel class, it is 6.2% higher on CIFAR-10, 9.1% higher on CIFAR-100, and 15.5% higher on ImageNet-100. These results experimentally show that our method learns a more compact representation on the unlabeled dataset. In addition, UNO+ uses a linear classifier, which shows strong accuracy on the Known classes, but leads to poor performance on the Novel classes.

Generalized Category Discovery with Clustering Assignment Consistency

543

Table 3. Results on three fine-grained datasets. Accuracy scores are reported. †denotes adapted methods. CUB-200

Stanford-Cars

Herbarium19

Method

All

Known Novel All

Known Novel All

Known Novel

k-means [2]

34.3

38.9

32.1

12.8

10.6

13.8

12.9

12.9

RankStats+ [21] 33.3

51.6

24.2

28.3

61.8

12.1

27.9

55.8

12.8

UNO+ [17]

35.1

49.0

28.1

35.5

70.5

18.6

28.3

53.7

14.7

GCD [39]

51.3

56.6

48.7

39.0

57.6

29.9

35.4

51.0

27.0

Ours

58.0 65.0

43.9

47.6 70.6

33.8

36.3 53.1

30.7

12.8

Evaluation on Fine-Grained Datasets. We report the results on three fine-grained datasets (as in [39] in Table 3. Our method shows optimal performance on the All classes for the three datasets tested, and achieves comparable results on the Known and Novel classes, demonstrating the effectiveness of our method for fine-grained category discovery. Specifically, on the CUB-200, Stanford-Scars, and Herbarium19 datasets, our method achieves 6.7%, 8.6%, and 0.9% improvement over the state-of-the-art method on the All classes, respectively. For the Novel classes, our method outperforms GCD by 3.9% and 3.7% on Stanford-cars and Herbarium19, respectively. Meanwhile, we find that due to the low variability between fine-grained datasets, which makes it more difficult to discover novel classes, the precision on the Novel classes is generally low in terms of results. 4.3

4.3 Ablation Study

Effectiveness of Each Component. We conduct extensive ablation experiments, performing four experiments on the CIFAR-100 and CUB-200 datasets. Table 4 shows the contribution of introducing different components of the objective loss function, including Lsup, L(zw, qw) and L(zw, zs). We can observe from the results that all components contribute significantly to our proposed approach: according to the results of experiments (1) and (2), we find that L(zw, zs) is the most important component, which proves that the co-training consistency can pull intra-class embeddings closer and push the inter-class boundaries apart. Moreover, compared to the original DINO features, the features of our approach show more favorable clustering results on the CUB-200 dataset.

Table 4. Ablation study on the components of the loss function (Lsup, L(zw, qw) and L(zw, zs)).

Index | CIFAR-100           | CUB-200
      | All   Known  Novel  | All   Known  Novel
(1)   | 34.9  36.1   33.6   | 15.0  13.5   21.6
(2)   | 69.7  71.4   68.0   | 56.7  62.5   40.8
(3)   | 75.7  79.2   72.5   | 57.5  63.9   42.8
(4)   | 78.5  81.4   75.5   | 58.0  65.0   43.9


Comparing experiments (3) and (4), we can find that supervised contrastive learning further improves the performance on both known and novel categories, demonstrating the importance of supervised information.

Table 5. Ablation study on the neighborhood size M.

M   | CIFAR-100           | CUB-200
    | All   Known  Novel  | All   Known  Novel
5   | 78.5  81.4   75.5   | 58.0  65.0   43.9
10  | 69.6  70.4   68.7   | 53.5  60.8   35.1
15  | 64.9  65.0   64.8   | 46.8  54.5   26.8
20  | 63.5  62.3   64.8   | 44.5  49.8   33.7
25  | 59.7  60.3   59.1   | 39.7  44.6   30.5
30  | 59.8  60.4   59.2   | 27.3  29.3   29.7

Effectiveness of the Neighborhood Size M. Table 5 illustrates the effect of the neighborhood size of each vertex zi ∈ G(V, E) on the final clustering results. We select M = 5, 10, 15, 20, 25, 30 for the ablation experiments on both the CIFAR-100 and CUB-200 datasets. We find that the final performance varies greatly with the number of neighbors: when M = 5, the method reaches the optimum on the All, Known, and Novel classes. We conjecture that too many links negatively affect the community detection.
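The clustering itself is cast as community detection over the M-nearest-neighbour graph G(V, E) (cf. the Louvain method of [3]). The sketch below shows one plausible way to build such a graph and run Louvain on it; the use of networkx and the python-louvain package, the cosine-similarity weighting, and the omission of the label constraints used in the actual semi-supervised setting are all our assumptions rather than the authors' implementation:

```python
import numpy as np
import networkx as nx
import community as community_louvain  # pip install python-louvain


def louvain_clusters(embeddings, m=5):
    """Build an M-nearest-neighbour graph over L2-normalised embeddings
    and cluster its vertices with Louvain community detection."""
    z = embeddings / np.linalg.norm(embeddings, axis=1, keepdims=True)
    sim = z @ z.T                    # cosine similarity matrix
    np.fill_diagonal(sim, -np.inf)   # exclude self-links

    g = nx.Graph()
    g.add_nodes_from(range(len(z)))
    for i in range(len(z)):
        for j in np.argsort(-sim[i])[:m]:            # M most similar vertices
            g.add_edge(i, int(j),
                       weight=float(max(sim[i, j], 0.0)))  # keep weights non-negative

    partition = community_louvain.best_partition(g)  # node id -> community id
    return np.array([partition[i] for i in range(len(z))])
```

Larger M densifies the graph, which is consistent with the observation above that too many links hurt the detected communities.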

Fig. 3. T-SNE visualization of the embeddings on CIFAR-10. The embedding clustering shows that our proposed method encourages the expansion of the distance between different clusters.

Visualization. To explore the clustering behavior of different methods more intuitively, we further visualize the features extracted by DINO, GCD, and our method using T-SNE [34] on CIFAR-10. As shown in Fig. 3, compared with DINO and GCD, our method obtains clearer boundaries between different groups and more compact clusters.
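For reference, a t-SNE plot like the ones in Fig. 3 can be produced with scikit-learn; this is only a generic sketch (the perplexity, colouring, and figure settings are ours), not the authors' plotting script:

```python
import matplotlib.pyplot as plt
from sklearn.manifold import TSNE


def plot_tsne(features, labels, title, path):
    """Project high-dimensional embeddings to 2-D with t-SNE and colour
    the points by their class label."""
    xy = TSNE(n_components=2, init="pca", perplexity=30).fit_transform(features)
    plt.figure(figsize=(5, 5))
    plt.scatter(xy[:, 0], xy[:, 1], c=labels, s=3, cmap="tab10")
    plt.title(title)
    plt.axis("off")
    plt.savefig(path, dpi=300, bbox_inches="tight")
    plt.close()
```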

5 Conclusion

In this paper, we propose a co-training strategy for GCD. Specifically, we introduce a clustering assignment consistency framework that alternately learns discriminative representations. Additionally, we propose a community detection method to address the semi-supervised clustering problem in GCD. Experimental results demonstrate that our method achieves state-of-the-art performance on both generic and fine-grained tasks. Acknowledgement. This work was partially supported by the National Key Research and Development Program of China (No. 2018AAA0100204), a key program of fundamental research from Shenzhen Science and Technology Innovation Commission (No. JCYJ20200109113403826), the Major Key Project of PCL (No. 2022ZD0115301), and an Open Research Project of Zhejiang Lab (No. 2022RC0AB04).

References 1. Albelwi, S.: Survey on self-supervised learning: auxiliary pretext tasks and contrastive learning methods in imaging. Entropy 24(4), 551 (2022) 2. Arthur, D., Vassilvitskii, S.: k-means++: the advantages of careful seeding. In: SODA, pp. 1027–1035. SIAM (2007) 3. Blondel, V.D., Guillaume, J.L., Lambiotte, R., Lefebvre, E.: Fast unfolding of communities in large networks. J. Stat. Mech: Theory Exp. 2008, P10008 (2008) 4. Cao, K., Brbic, M., Leskovec, J.: Open-world semi-supervised learning. In: ICLR. OpenReview.net (2022) 5. Caron, M., Bojanowski, P., Joulin, A., Douze, M.: Deep clustering for unsupervised learning of visual features. In: Ferrari, V., Hebert, M., Sminchisescu, C., Weiss, Y. (eds.) Computer Vision – ECCV 2018. LNCS, vol. 11218, pp. 139–156. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-01264-9_9 6. Caron, M., Misra, I., Mairal, J., Goyal, P., Bojanowski, P., Joulin, A.: Unsupervised learning of visual features by contrasting cluster assignments. In: NeurIPS (2020) 7. Caron, M., et al.: Emerging properties in self-supervised vision transformers. In: ICCV, pp. 9630–9640. IEEE (2021) 8. Chen, T., Kornblith, S., Norouzi, M., Hinton, G.E.: A simple framework for contrastive learning of visual representations. In: ICML. Proceedings of Machine Learning Research, vol. 119, pp. 1597–1607. PMLR (2020) 9. Chi, H., et al.: Meta discovery: learning to discover novel classes given very limited data. In: ICLR. OpenReview.net (2022) 10. Cuturi, M.: Sinkhorn distances: lightspeed computation of optimal transport. In: NIPS, pp. 2292–2300 (2013) 11. Deng, J., Dong, W., Socher, R., Li, L., Li, K., Fei-Fei, L.: ImageNet: a largescale hierarchical image database. In: CVPR, pp. 248–255. IEEE Computer Society (2009) 12. Dhillon, G.S., Chaudhari, P., Ravichandran, A., Soatto, S.: A baseline for few-shot image classification. In: ICLR. OpenReview.net (2020) 13. Dosovitskiy, A., et al.: An image is worth 16x16 words: transformers for image recognition at scale. In: ICLR. OpenReview.net (2021)


14. Dosovitskiy, A., Fischer, P., Springenberg, J.T., Riedmiller, M.A., Brox, T.: Discriminative unsupervised feature learning with exemplar convolutional neural networks. IEEE Trans. Pattern Anal. Mach. Intell. 38(9), 1734–1747 (2016) 15. Dugué, N., Perez, A.: Directed Louvain : maximizing modularity in directed networks (2015) 16. Fei, Y., Zhao, Z., Yang, S., Zhao, B.: XCon: learning with experts for fine-grained category discovery. In: BMVC, p. 96. BMVA Press (2022) 17. Fini, E., Sangineto, E., Lathuilière, S., Zhong, Z., Nabi, M., Ricci, E.: A unified objective for novel class discovery. In: ICCV, pp. 9264–9272. IEEE (2021) 18. Finn, C., Abbeel, P., Levine, S.: Model-agnostic meta-learning for fast adaptation of deep networks. In: ICML. Proceedings of Machine Learning Research, vol. 70, pp. 1126–1135. PMLR (2017) 19. Gidaris, S., Singh, P., Komodakis, N.: Unsupervised representation learning by predicting image rotations. In: ICLR (Poster). OpenReview.net (2018) 20. Grill, J., et al.: Bootstrap your own latent - a new approach to self-supervised learning. In: NeurIPS (2020) 21. Han, K., Rebuffi, S., Ehrhardt, S., Vedaldi, A., Zisserman, A.: Automatically discovering and learning new visual categories with ranking statistics. In: ICLR. OpenReview.net (2020) 22. Han, K., Rebuffi, S., Ehrhardt, S., Vedaldi, A., Zisserman, A.: AutoNovel: automatically discovering and learning novel visual categories. IEEE Trans. Pattern Anal. Mach. Intell. 44(10), 6767–6781 (2022) 23. Han, K., Vedaldi, A., Zisserman, A.: Learning to discover novel visual categories via deep transfer clustering. In: ICCV, pp. 8400–8408. IEEE (2019) 24. He, K., Fan, H., Wu, Y., Xie, S., Girshick, R.B.: Momentum contrast for unsupervised visual representation learning. In: CVPR, pp. 9726–9735. Computer Vision Foundation/IEEE (2020) 25. He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: CVPR, pp. 770–778. IEEE Computer Society (2016) 26. Holmberg, O.G., et al.: Self-supervised retinal thickness prediction enables deep learning from unlabelled data to boost classification of diabetic retinopathy. Nat. Mach. Intell. 2(11), 719–726 (2020) 27. Hsu, Y., Lv, Z., Kira, Z.: Learning to cluster in order to transfer across domains and tasks. In: ICLR (Poster). OpenReview.net (2018) 28. Khosla, P., et al.: Supervised contrastive learning. In: NeurIPS (2020) 29. Krause, J., Stark, M., Deng, J., Fei-Fei, L.: 3D object representations for finegrained categorization. In: Proceedings of the IEEE International Conference on Computer Vision Workshops, pp. 554–561 (2013) 30. Krizhevsky, A., Hinton, G.: Learning multiple layers of features from tiny images (2009) 31. Krizhevsky, A., Sutskever, I., Hinton, G.E.: ImageNet classification with deep convolutional neural networks. Commun. ACM 60(6), 84–90 (2017) 32. Loshchilov, I., Hutter, F.: Decoupled weight decay regularization. In: ICLR (Poster). OpenReview.net (2019) 33. Lv, S., Wei, L., Zhang, Q., Liu, B., Xu, Z.: Improved inference for imputationbased semisupervised learning under misspecified setting. IEEE Trans. Neural Netw. Learn. Syst. 33(11), 6346–6359 (2022) 34. van der Maaten, L.: Accelerating t-SNE using tree-based algorithms. J. Mach. Learn. Res. 15(1), 3221–3245 (2014)


35. Pathak, D., Krähenbühl, P., Donahue, J., Darrell, T., Efros, A.A.: Context encoders: feature learning by inpainting. In: CVPR, pp. 2536–2544. IEEE Computer Society (2016) 36. Qiao, S., Shen, W., Zhang, Z., Wang, B., Yuille, A.: Deep co-training for semisupervised image recognition. In: Ferrari, V., Hebert, M., Sminchisescu, C., Weiss, Y. (eds.) ECCV 2018. LNCS, vol. 11219, pp. 142–159. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-01267-0_9 37. Tan, K.C., Liu, Y., Ambrose, B., Tulig, M., Belongie, S.J.: The herbarium challenge 2019 dataset. CoRR abs/1906.05372 (2019) 38. Troisemaine, C., et al.: Novel class discovery: an introduction and key concepts. CoRR abs/2302.12028 (2023) 39. Vaze, S., Han, K., Vedaldi, A., Zisserman, A.: Generalized category discovery. In: CVPR, pp. 7482–7491. IEEE (2022) 40. Vaze, S., Han, K., Vedaldi, A., Zisserman, A.: Open-set recognition: a good closedset classifier is all you need. In: ICLR. OpenReview.net (2022) 41. Wah, C., Branson, S., Welinder, P., Perona, P., Belongie, S.: The Caltech-UCSD Birds-200-2011 dataset (2011) 42. Yang, C., An, Z., Cai, L., Xu, Y.: Mutual contrastive learning for visual representation learning. In: AAAI. pp. 3045–3053. AAAI Press (2022) 43. Yang, M., Zhu, Y., Yu, J., Wu, A., Deng, C.: Divide and conquer: compositional experts for generalized novel class discovery. In: CVPR, pp. 14248–14257. IEEE (2022) 44. Yang, X., Song, Z., King, I., Xu, Z.: A survey on deep semi-supervised learning. IEEE Trans. Knowl. Data Eng. 35(9), 8934–8954 (2023) 45. Zhang, R., Isola, P., Efros, A.A.: Colorful image colorization. In: Leibe, B., Matas, J., Sebe, N., Welling, M. (eds.) ECCV 2016. LNCS, vol. 9907, pp. 649–666. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-46487-9_40 46. Zheng, M., et al.: Weakly supervised contrastive learning. In: ICCV, pp. 10022– 10031. IEEE (2021)

CInvISP: Conditional Invertible Image Signal Processing Pipeline

Duanling Guo1, Kan Chang1(B), Yahui Tang1, Mingyang Ling1, and Minghong Li2

1 School of Computer and Electronic Information, Guangxi University, Nanning 530004, China
{duanlingguo,lingmy}@st.gxu.edu.cn, [email protected], [email protected]
2 School of Automation, Central South University, Changsha 410083, China
[email protected]

Abstract. Standard RGB (sRGB) images processed by the image signal processing (ISP) pipeline of digital cameras have a nonlinear relationship with the scene irradiance. Therefore, the low-level vision tasks which work best in a linear color space are not suitable to be carried out in the sRGB color space. To address this issue, this paper proposes an approach called CInvISP to provide a bidirectional mapping between the nonlinear sRGB and linear CIE XYZ color spaces. To ensure a fully invertible ISP, the basic building blocks in our framework adopt the structure of invertible neural network. As camera-style information is embedded in sRGB images, it is necessary to completely remove it during backward mapping, and properly incorporate it during forward mapping. To this end, a conditional vector is extracted from the sRGB input and inserted into each invertible building block. Experiments show that compared to other mapping approaches, CInvISP achieves a more accurate bidirectional mapping between the two color spaces. Moreover, it is also verified that such a precise bidirectional mapping facilitates low-level vision tasks including image denoising and retouching well. Keywords: Image Signal Processing Pipeline · CIE XYZ Color Space · Invertible Neural Networks · Convolutional Neural Networks

1 Introduction

In the traditional in-camera image signal processing (ISP) pipeline [1], RAW images are first aligned to the CIE XYZ color space, which is independent of camera sensors, and then transformed to the standard RGB (sRGB) color space, which is perceptually pleasant for the human visual system [2]. When converting the

CIE XYZ color space to the sRGB color space, nonlinear operations including tone curve, color manipulation, and gamma correction are carried out in the ISP pipeline [1,2]. Therefore, the linear relationship between the captured light intensity and the realistic scene irradiance is broken. On the other hand, as RAW images are more expensive to collect and access, most low-level vision tasks (e.g., denoising [3,4], high-dynamic-range image reconstruction [5], super-resolution [6,7]) are conducted on the nonlinear, processed sRGB images. As a result, the performance of these tasks are largely limited. To address this issue, reversing the images from the sRGB color space back to an unprocessed, linear color space becomes an attractive solution. Afterwards, we can perform low-level vision tasks in this linear color space, and then forwardly map the enhanced/restored images to the sRGB space. To this end, Brooks et al. [8] utilized the prior knowledge of cameras to unprocess each ISP step. However, due to the fact that the specific parameters of ISP are held by camera manufactures, the realistic ISP pipeline can be considered as a black box. As a result, it is hard to precisely find the inverse function for each ISP step by applying conventional methods such as curve fitting. Several approaches have tried to model the entire ISP pipeline by using convolutional neural networks (CNN), e.g., U-net [9], DeepISP [10], PyNET [11], VisionISP [12] and Deep-FlexISP [13]. However, these approaches only model the forward mapping of the ISP pipeline, i.e., from the RAW space to the sRGB space. Recently, the reversed ISP challenge has been held [14], which focuses on the backward mapping of specific ISP pipelines on smartphones. Zamir et al. proposed CycleISP [15], which employs two branches of CNN to model both the backward and forward mappings, respectively. In contrast to modeling the two directions of mappings by two separate CNN networks, Xing et al. [16] introduced invertible neural network [17–19] such that completely-invertible bidirectional mapping between the two color spaces can be obtained. Although some methods have developed the two directions of mappings between the RAW space and the sRGB space, it should be pointed out that RAW images are specific to physical color filter arrays [20]. Such a sensor-specific characteristic limits the application of CNN-based mapping approaches, as these approaches are hard to be generalized from one camera sensor to another. Similar to the RAW space, light intensity in the CIE XYZ color space is also linear to scene irradiance. More importantly, the CIE XYZ color space is deviceindependent [2], and thus reversing sRGB images to this space is a more suitable choice. Therefore, Afifi et al. [2] proposed a framework called CIE XYZ net to provide a pair of mappings between the CIE XYZ and sRGB spaces. Another important issue ignored by most methods is the variety of camera styles embedded in sRGB images. For a satisfied artistic effect, different types of cameras have their own settings and profiles, thus leading to sRGB images with multiple camera styles [21,22]. These camera styles make the backward mapping a many-to-one problem, and the forward mapping an one-to-many problem. Thus to build an accurate pair of mappings, it is necessary to fully remove the camera-style information during backward mapping, and properly


insert it during forward mapping. Based on the above consideration, we propose a new framework called CInvISP (conditional invertible ISP pipeline) to establish a bidirectional mapping between the sRGB and CIE XYZ spaces. Similar to InvISP [16], CInvISP also adopts the structure of invertible neural networks to deliver a faithful bidirectional mapping. Nevertheless, CInvISP additionally extracts conditional information from the input sRGB image, and incorporates it into both the backward and forward mappings, leading to obviously better performance. In summary, the main contributions of this paper are three-fold: (1) To effectively solve both the many-to-one and one-to-many problems, we propose to extract conditional information from the input sRGB image, and then incorporate it into the invertible CNN structure. This strategy greatly helps to eliminate the camera-style information embedded in sRGB images for backward mapping, while largely promotes learning the unique camera style for forward mapping. (2) To effectively extract conditional information, we design two parallel CNN modules to extract the features that represent sRGB image and CIE XYZ image, respectively, and then feed them both to the condition generator module (CGM) to predict the conditional vector. Besides, a paired-feature decoder is established to provide external supervision for the extraction of the two types of features. (3) Experiments demonstrate that our CInvISP is superior to other state-of-theart (SOTA) algorithms in providing a reliable bidirectional mapping between the sRGB and CIE XYZ spaces. Moreover, the applications of CInvISP in image denoising and image retouching have also been investigated. To facilitate further study, our source code and pre-trained model will be released on http://github.com/Duanling-Guo/CInvISP.

2 Methodology

2.1 The Framework of CInvISP

The overall structure of our network is shown in Fig. 1, which mainly includes a conditional vector extraction (CVE) sub-network, an invertible rendering (IR) sub-network, and a paired-feature decoder. As can be seen, the IR sub-network consists of several cascaded conditional invertible blocks (CIBs). To achieve a fully invertible mapping for the ISP pipeline, similar to [16], each CIB applies an invertible CNN structure. However, as the sRGB images might be captured by different cameras, it is hard to deal with various types of camera-style information by using the normal invertible neural networks. Therefore, our CIB additionally incorporates the conditional vector Ψ extracted by the CVE sub-network, so that the camera-style information can be completely removed during backward mapping, and properly inserted during forward mapping. As a result, the backward and forward mappings of our CInvISP can be respectively represented as

Î_xyz = G_μ^{-1}(I_srgb | Ψ)   (1)

Fig. 1. The overall network structure of CInvISP. The CVE sub-network produces the conditional vector Ψ with the help of the paired-feature decoder which only exists in the training phase. Ψ is fed to the CIBs in the IR sub-network, so that the camera-style information can be completely removed during backward mapping, and properly inserted during forward mapping. To achieve a fully invertible mapping, each CIB has an invertible CNN structure.

Î_srgb = G_μ(I_xyz | Ψ)   (2)

where G_μ^{-1}(·) and G_μ(·) denote the functions of backward and forward mappings, respectively; I_srgb and I_xyz are the input sRGB and CIE XYZ images, and Î_srgb and Î_xyz are the reconstructed sRGB and CIE XYZ images, respectively. As G_μ^{-1}(·) and G_μ(·) are implemented with invertible neural networks, both of them are parametrized by the same μ. Note that the forward mapping is applied after the backward mapping. Therefore, though the conditional vector Ψ is extracted from the sRGB image, it is available during forward mapping. To reduce computational complexity, I_srgb is down-sampled to a lower resolution. Since the conditional vector Ψ represents the input sRGB image globally, down-sampling the input image has little impact on the accuracy of predicting Ψ. Therefore, Ψ is obtained by

Ψ = C_θ(D(I_srgb))   (3)

where D(·) denotes the down-sampling operation; function Cθ (·) is used to predict Ψ , which is implemented by the CVE sub-network and parametrized by θ. To help the CVE sub-network learn suitable features for predicting the conditional vector Ψ , the paired-feature decoder decodes the learned features to the sRGB and CIE XYZ images, respectively. Note that such a decoder only exists in the training phase, and will be removed during testing. As a result, no additional computational burden is required.
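Read together, Eqs. (1)–(3) amount to a simple inference interface. The sketch below illustrates the data flow under our own naming conventions; cve_net, ir_net, the down-sampling factor, and the reverse flag are assumptions for illustration, not the released API:

```python
import torch.nn.functional as F


def backward_forward(srgb, cve_net, ir_net, scale=0.25):
    """Sketch of Eqs. (1)-(3): predict the conditional vector from a
    down-sampled sRGB image, invert the ISP to CIE XYZ, then render back."""
    # Eq. (3): psi = C_theta(D(I_srgb)) -- the conditional vector.
    srgb_small = F.interpolate(srgb, scale_factor=scale, mode="bilinear",
                               align_corners=False)
    psi = cve_net(srgb_small)

    # Eq. (1): backward mapping sRGB -> CIE XYZ (inverse pass of the IR net).
    xyz = ir_net(srgb, psi, reverse=True)

    # Eq. (2): forward mapping CIE XYZ -> sRGB, reusing the same psi.
    srgb_rec = ir_net(xyz, psi, reverse=False)
    return xyz, srgb_rec
```

Because the same Ψ is reused in both directions, the conditional-vector cost is paid only once per image.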


Fig. 2. The structure of the CGM, which generates the conditional vector Ψ from the input feature Fconcat . The numbers on lines indicate the number of feature channels.

2.2 The Prediction of Conditional Information

As a suitable guidance, the conditional vector Ψ is expected to contain camerastyle information and global semantic representation of an image. To produce an optimal conditional vector Ψ , we propose to extract features Frgb and Fxyz that represent the sRGB and CIE XYZ images, respectively, and then feed them both to the CGM. As a result, the CGM can easily recognize the differences between Frgb and Fxyz , and thus the difficulty of accurately predicting the conditional vector Ψ can be largely reduced. To this end, as shown in Fig. 1, in the CVE sub-network, a common CNN encoder with 3 convolution layers is applied to extract shallow features Fs from the input, followed by an sRGB feature generator (RGB-FG) and a CIE XYZ feature generator (XYZ-FG) refining Fs in a parallel way. Both of the two feature generators consist of 5 cascaded residual blocks [6]. Due to the reason that the CIE XYZ image does not contain camera-style information, it is necessary to remove such information from Fxyz . Therefore, inspired by the task of style transfer [23], we further incorporate the instance normalization (IN) [24] operation into each residual block in XYZ-FG, so that the embedded camera-style information can be effectively removed by the XYZ-FG. After obtaining Fxyz and Frgb , the paired-feature decoder is applied on them to reconstruct the down-sampled CIE XYZ image ˆIdxyz and the down-sampled sRGB image ˆIdsrgb , respectively. It should be noted that when decoding Fxyz and Frgb , the same CNN decoder which consists of 3 deconvolution layers is used. Such a setting makes Fxyz and Frgb share similar semantic representation of an image. The differences between Fxyz and Frgb lie in that Fxyz is trained to exclude the camera-style information, while Frgb is required to preserve such information. These characteristics of the paired-feature decoder coincide with the objectives of the CVE sub-network, and therefore it facilitates the CVE sub-network well. The CGM which accepts both Frgb and Fxyz as input is responsible for predicting Ψ . The detailed structure of CGM is given in Fig. 2. As can be seen,


Fig. 3. The structure of the CIB. f n denotes the features produced by the nth CIB, which contains 3 channels. F1 and F2 are features split from the output of the 1 × 1 invertible convolution layer. The three operators R(·), S(·) and T (·) share the same structure, where the input conditional vector Ψ is used to learn parameters γ and β to modulate the features.

Frgb and Fxyz are first concatenated and sequentially refined by 4 residual blocks. Then two 3 × 3 convolution layers with a stride of 2 are utilized to adjust the feature dimension. Finally, Ψ is created by a global average pooling (GAP) layer followed by two fully connected layers.

2.3 The Conditional Invertible Block

Figure 3 shows the detailed structure of a CIB. As can be seen, it contains an invertible 1 × 1 convolution layer and an affine coupling layer [18,19]. Similar to the traditional invertible neural networks [16], there are also three operators R(·), S(·) and T (·) in the affine coupling layer of CIB. However, different from [16], the conditional vector Ψ is additionally incorporated into the R(·), S(·) and T (·) in CIB. By doing so, the CIB is able to notice the camera-style information and global semantic representation of an sRGB image, which greatly helps to build a more precise bidirectional mapping. To effectively incorporate Ψ into the three operators, affine transformation is used. The conditional vector is first processed by two fully connected layers, and then split into 4 sub-vectors to form two groups of transformation parameters.


Let F be the input feature; the affine transformation can be represented as

M(F | γ, β) = γ ⊙ F + β   (4)

where γ and β are scale and shift parameters, respectively, and ⊙ denotes element-wise multiplication. Note that in the ISP pipeline, the conversion from the CIE XYZ space to the sRGB space is carried out on individual pixels. Therefore, compared with a normal 3 × 3 convolution, which explores spatial correlation among neighbouring pixels, it is more reasonable to use 1 × 1 convolutions in the three operators R(·), S(·) and T(·) in our CIB. Moreover, benefiting from the utilization of 1 × 1 convolutions, the computational complexity of a CIB can also be largely reduced.
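Equation (4) is a feature-wise affine modulation. A hedged PyTorch sketch of how one of the R(·)/S(·)/T(·) operators could combine 1 × 1 convolutions with this modulation is given below; the layer widths and the two-layer design are our assumptions, not the authors' exact architecture:

```python
import torch
import torch.nn as nn


class ConditionalOperator(nn.Module):
    """One R/S/T-style operator: 1x1 convolutions whose intermediate features
    are modulated by gamma/beta predicted from psi, i.e.
    M(F | gamma, beta) = gamma * F + beta (Eq. (4))."""

    def __init__(self, in_ch, out_ch, cond_dim, hidden=64):
        super().__init__()
        self.conv1 = nn.Conv2d(in_ch, hidden, kernel_size=1)
        self.conv2 = nn.Conv2d(hidden, out_ch, kernel_size=1)
        self.to_gamma_beta = nn.Linear(cond_dim, 2 * hidden)

    def forward(self, f, psi):
        h = torch.relu(self.conv1(f))
        gamma, beta = self.to_gamma_beta(psi).chunk(2, dim=1)
        # Broadcast the per-channel scale/shift over the spatial dimensions.
        h = gamma[:, :, None, None] * h + beta[:, :, None, None]
        return self.conv2(h)
```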

2.4 Loss Function

During training, we measure the L1 distance between: 1) the CIE XYZ image Î_xyz produced by the backward mapping and the ground-truth (GT) CIE XYZ image I*_xyz; 2) the sRGB image Î_srgb reconstructed by the forward mapping and the GT sRGB image I*_srgb; 3) the CIE XYZ image Î_dxyz decoded by the paired-feature decoder and its down-sampled GT, i.e., D(I*_xyz); 4) the sRGB image Î_dsrgb predicted by the paired-feature decoder and D(I*_srgb). Therefore, the loss function can be formulated as

L = L_rs + λ1 L_rx + L_ms + λ2 L_mx   (5)

where λ1 and λ2 are trade-off parameters, and

L_rs = ‖Î_dsrgb − D(I*_srgb)‖_1   (6)
L_rx = ‖Î_dxyz − D(I*_xyz)‖_1   (7)
L_ms = ‖Î_srgb − I*_srgb‖_1   (8)
L_mx = ‖Î_xyz − I*_xyz‖_1   (9)

Similar to [2], both λ1 and λ2 are set as 1.5 due to the reason that CIE XYZ images generally have intensity lower than sRGB images.
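Since every term in Eqs. (6)–(9) is an L1 distance, the total loss of Eq. (5) is straightforward to express in PyTorch; the following sketch only mirrors the equations (the argument names are ours):

```python
import torch.nn.functional as F


def cinvisp_loss(srgb_hat, srgb_gt, xyz_hat, xyz_gt,
                 d_srgb_hat, d_srgb_gt, d_xyz_hat, d_xyz_gt,
                 lam1=1.5, lam2=1.5):
    """L = L_rs + lam1 * L_rx + L_ms + lam2 * L_mx (Eqs. (5)-(9)),
    where every term is an L1 distance."""
    l_rs = F.l1_loss(d_srgb_hat, d_srgb_gt)  # decoder sRGB vs. down-sampled GT
    l_rx = F.l1_loss(d_xyz_hat, d_xyz_gt)    # decoder XYZ vs. down-sampled GT
    l_ms = F.l1_loss(srgb_hat, srgb_gt)      # forward-mapped sRGB vs. GT
    l_mx = F.l1_loss(xyz_hat, xyz_gt)        # backward-mapped XYZ vs. GT
    return l_rs + lam1 * l_rx + l_ms + lam2 * l_mx
```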

3 Experimental Results on the Accuracy of Mappings

3.1 Experimental Settings

The XYZ-sRGB dataset [2] which consists of 1265 pairs of compressed 8 bit sRGB images and uncompressed 16 bit CIE XYZ images is used for evaluation in this experiment. The images are generated from the RAW images in the MIT-Adobe FiveK dataset [25] by using the Adobe DNG Development Kit, and they are divided into 971, 244 and 50 XYZ-sRGB pairs for training, testing and validation, respectively.


Table 1. Quantitative comparison. sRGB→XYZ and XYZ→sRGB represent the backward and forward mappings, respectively. Rec. XYZ→sRGB denotes the forward mapping which takes the reconstructed CIE XYZ images as input. Note that the invertible CNN structures in InvISP and CInvISP ensure a perfect forward mapping from the reconstructed CIE XYZ images to the sRGB images. The best results are highlighted.

Mapping Type   | Metric    | U-net [9] | CIE XYZ net [2] | InvISP [16] | CInvISP
sRGB→XYZ       | PSNR (dB) | 29.30     | 30.03           | 30.83       | 31.22
               | SSIM      | 0.9229    | 0.9268          | 0.9222      | 0.9363
               | ΔE        | 6.5558    | 7.3757          | 8.8974      | 5.7247
XYZ→sRGB       | PSNR (dB) | 27.19     | 27.68           | 27.62       | 28.10
               | SSIM      | 0.8799    | 0.9053          | 0.8949      | 0.9079
               | ΔE        | 16.2596   | 6.4072          | 6.6371      | 6.3245
Rec. XYZ→sRGB  | PSNR (dB) | 29.53     | 43.65           | 100.00ᵃ     | 100.00ᵃ
               | SSIM      | 0.8923    | 0.9948          | 1.0000      | 1.0000
               | ΔE        | 16.0512   | 2.7032          | 0.0000      | 0.0000

ᵃ When computing PSNR, the minimal MSE is set as 10⁻¹⁰ to avoid division by zero.

To demonstrate the effectiveness of CInvISP, it is compared with three representative algorithms, including CIE XYZ net [2], InvISP [16], and U-net [9]. The number of CIBs in CInvISP is set as n = 8 in our experiments. All the models except CIE XYZ net are retrained on the XYZ-sRGB dataset for 300 epochs by using the Adam optimizer with β1 = 0.9 and β2 = 0.999. During training, the batch size, the patch size and the learning rate are set as 5, 512 × 512, and 10⁻⁴, respectively. We leverage random geometric operations (i.e., crop, flip and rotation) as data augmentation. As CIE XYZ net is trained on the XYZ-sRGB dataset, we directly use the pre-trained model released by its authors for testing. As quality assessment metrics, peak signal-to-noise ratio (PSNR), structural similarity index (SSIM) [26], and ΔE [27] are used. Note that ΔE is a metric to evaluate the color difference between two images. All the experiments are conducted on a single NVIDIA GeForce RTX 2080Ti GPU, and all the methods are implemented with the PyTorch framework.
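The three quality metrics can be computed with scikit-image; the sketch below uses the simple CIE76 variant of ΔE, which may differ from the exact definition used in [27], and assumes float RGB images in [0, 1]:

```python
import numpy as np
from skimage.metrics import peak_signal_noise_ratio, structural_similarity
from skimage.color import rgb2lab, deltaE_cie76


def quality_metrics(pred, gt):
    """PSNR, SSIM and mean CIE-Lab colour difference between two float
    images of shape (H, W, 3) with values in [0, 1]."""
    psnr = peak_signal_noise_ratio(gt, pred, data_range=1.0)
    ssim = structural_similarity(gt, pred, channel_axis=-1, data_range=1.0)
    delta_e = float(np.mean(deltaE_cie76(rgb2lab(gt), rgb2lab(pred))))
    return psnr, ssim, delta_e
```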

3.2 Comparison with SOTA Methods

The objective results of different methods are provided in Table 1. As can be seen, for both the backward and forward mappings, our CInvISP is superior to the other competing methods in all quality assessment metrics. Compared with InvISP [16], CInvISP achieves more accurate mappings, which well demonstrates the effectiveness of incorporating conditional information. Moreover, benefiting from the invertible CNN structure, if the CIE XYZ images reconstructed by the backward mapping are used as the input of forward mapping, both CInvISP and InvISP are able to deliver exact reconstruction.


Table 2. Comparisons of model complexity. No additional parameters are required for the forward mapping of InvISP and CInvISP as their structures are invertible. The FLOPs are measured on images with a resolution of 512 × 512.

Mapping    | Metric       | U-net [9] | CIE XYZ net [2] | InvISP [16] | CInvISP
sRGB→XYZ   | Parameters   | 2.46M     | 1.35M           | 1.41M       | 2.34M
           | Runtime (ms) | 12.72     | 13.42           | 320.18      | 98.02
           | FLOPs        | 31.6G     | 69.0G           | 739.5G      | 19.0G
XYZ→sRGB   | Parameters   | 2.46M     | 1.35M           | /           | /
           | Runtime (ms) | 12.72     | 13.40           | 320.18      | 80.66
           | FLOPs        | 31.6G     | 69.0G           | 739.5G      | 3.5G

Such a characteristic makes the invertible-CNN-based methods more attractive than other approaches for the bidirectional mapping task. Model complexity of different methods is compared in Table 2. It can be concluded that: 1) although CInvISP has the second largest model size for backward mapping, it does not consume additional parameters for forward mapping due to the invertible structure of the IR sub-network; 2) CInvISP is faster than InvISP as the IR sub-network only consists of 1 × 1 convolutions; 3) since the sRGB input is down-sampled before being fed to the computation-intensive CVE sub-network, the floating point operations (FLOPs) of CInvISP are lower than those of the other methods; 4) as the conditional vector of CInvISP has already been obtained during backward mapping, it can be directly reused in forward mapping, thus leading to a significant reduction in computational complexity.

3.3 Ablation Study

To verify the effectiveness of the proposed modules, ablation study is carried out, and the results of different variants of CInvISP can be found in Table 3. We have the following observations: (1) “w/o CGM” indicates that the IR sub-network works without the guidance of the conditional vector Ψ . This result shows that removing the conditional information leads to significant performance degradation, which well demonstrates the importance of conditional information. (2) It is obvious that the paired-feature decoder also contributes greatly to the performance, which suggests that it is hard to properly extract the conditional information without the external supervision. (3) The results of “w/o XYZ-FG” and “w/o RGB-FG” show that combining both RGB and XYZ features is necessary. Compared to “w/o RGB-FG”, “w/o XYZ-FG” results in a more obvious decrease in performance, which implies that the branch of XYZ-FG is more crucial to the extraction of camera-style information.


Table 3. Ablation study. PF decoder is short for paired-feature decoder. The best results are highlighted.

Mapping Type | Metric    | w/o CGM | w/o PF decoder | w/o XYZ-FG | w/o RGB-FG | Full Model
sRGB→XYZ     | PSNR (dB) | 29.37   | 29.46          | 29.74      | 30.96      | 31.22
             | SSIM      | 0.8874  | 0.8912         | 0.8926     | 0.9147     | 0.9363
             | ΔE        | 13.2987 | 10.1152        | 9.3912     | 8.5613     | 5.7247
XYZ→sRGB     | PSNR (dB) | 27.18   | 27.11          | 27.36      | 27.44      | 28.10
             | SSIM      | 0.8768  | 0.8824         | 0.8993     | 0.8879     | 0.9079
             | ΔE        | 10.9677 | 10.7368        | 9.0614     | 8.9332     | 6.3245

Table 4. The application of bidirectional mapping methods in the denoising task. The pre-trained SwinIR model [4] is used for blind denoising. The best results are highlighted.

Methods                                             | PSNR (dB) | SSIM
Denoising in the sRGB space                         | 31.30     | 0.6956
Denoising in the CIE XYZ space (by U-net [9])       | 28.81     | 0.8324
Denoising in the CIE XYZ space (by CIE XYZ net [2]) | 32.65     | 0.8332
Denoising in the CIE XYZ space (by InvISP [16])     | 28.43     | 0.7612
Denoising in the CIE XYZ space (by CInvISP)         | 33.38     | 0.8507

4 The Applications of CInvISP in Low-Level Vision Tasks

In this section, we evaluate the performance of CInvISP in two low-level vision tasks: image denoising and image retouching.

4.1 Image Denoising

The validation set of the SIDD dataset [28] which consists of 1280 pairs of noisy and clean images is used as the testing dataset for the denoising task. The images containing realistic noises are captured by various smartphones with different sensitivity and shutter speeds. In this experiment, we first apply backward mapping to transfer the input sRGB images back to the CIE XYZ color space, and then utilize the pre-trained SwinIR model [4] to perform blind denoising. Finally, the denoised results are converted to the sRGB color space by using the forward mapping. The objective results of different approaches are shown in Table 4, and the visual comparisons of two examples can be found in Figs. 4 and 5, respectively. In Table 4, we can see that denoising in the sRGB space is significantly inferior to our approach. The reason lies in that the realistic noises have a distribution much more complex than the ideal Gaussian distribution. By accurately converting the sRGB images to the CIE XYZ space, the noise distribution becomes


Fig. 4. The denoising results of image 0045_NOISY_SRGB_010 from the SIDD dataset [28]. From left to right and from top to bottom: noisy image, denoising on the sRGB image, denoising on the CIE XYZ images reconstructed by U-net [9], CIE XYZ net [2], InvISP [16] and CInvISP, respectively.

Fig. 5. The denoising results of image 0055_NOISY_SRGB_010 from the SIDD dataset [28]. From left to right and from top to bottom: noisy image, denoising on the sRGB image, denoising on the CIE XYZ images reconstructed by U-net [9], CIE XYZ net [2], InvISP [16] and CInvISP, respectively.

closer to the Gaussian distribution, and thus the SwinIR model [4] trained under the Gaussian assumption obtains better results. However, compared to denoising in the sRGB space, using U-net [9] and InvISP [16] to conduct bidirectional mapping achieves lower PSNR, which indicate that an inaccurate bidirectional mapping between the two color spaces could degrade the denoising performance. From Figs. 4 and 5, we can see that: denoising in the sRGB cannot completely remove noises in images; the U-net-based method [9] generates obvious color distortion; both CIE XYZ net [2] and InvISP [16] produce blurred results;


Fig. 6. The retouching results of image a0304-dgw_137 from the MIT-Adobe FiveK dataset [25]. From left to right and from top to bottom: Expert RAW rendering, Retouching on the CIE XYZ images reconstructed by applying standard 2.2 gamma tone curve, U-net [9], CIE XYZ net [2], InvISP [16] and CInvISP, respectively.

compared with other methods, CInvISP can effectively remove noise while successfully preserving sharp edges and fine details.
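For clarity, the denoising experiment simply wraps the pre-trained blind denoiser between the backward and forward mappings; a sketch of that wrapper under our naming assumptions (cve_net, ir_net, the reverse flag and the down-sampling factor are ours) is:

```python
import torch
import torch.nn.functional as F


@torch.no_grad()
def denoise_in_xyz(noisy_srgb, cve_net, ir_net, denoiser, scale=0.25):
    """Map the noisy sRGB image to CIE XYZ, run a pre-trained blind
    denoiser (e.g. SwinIR) in the linear space, then map back to sRGB."""
    psi = cve_net(F.interpolate(noisy_srgb, scale_factor=scale,
                                mode="bilinear", align_corners=False))
    xyz = ir_net(noisy_srgb, psi, reverse=True)    # backward mapping, Eq. (1)
    xyz_clean = denoiser(xyz)                      # blind denoising in CIE XYZ
    return ir_net(xyz_clean, psi, reverse=False)   # forward mapping, Eq. (2)
```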

4.2 Image Retouching

Usually, professional photographers prefer to retouch the images in the RAW space rather than the sRGB space [29]. The reason lies in that RAW photos have a wider range of tonal values and do not contain camera-style information. As the CIE XYZ space also has the same advantages as the RAW space, it is reasonable to convert an sRGB image back to the CIE XYZ space for image retouching. Therefore, we first apply different backward mapping methods to obtain the CIE XYZ version of images, and then perform the retouching method called Exposure [30] in the CIE XYZ space. As the pre-trained model of Exposure [30] provided by its authors was also trained by using the MIT-Adobe FiveK dataset [25], we directly utilize it to evaluate the effectiveness of different bidirectional mapping approaches in this experiment. The subjective results obtained by different methods are provided in Fig. 6. Obviously, the backward mapping of our CInvISP produces retouched image much closer to human expert than the other approaches. Only unnoticeable color distortion is generated by our method.

5 Conclusion

This paper proposed a method called CInvISP to provide a bidirectional mapping between the sRGB and CIE XYZ spaces. To achieve a fully invertible mapping for the ISP pipeline, the invertible CNN structure is used in CInvISP. In addition, the feature representations of sRGB and CIE XYZ images are extracted from the sRGB input, and then combined and compressed into a conditional vector to guide the bidirectional mapping. Experimental results have demonstrated that the proposed algorithm is able to achieve the most accurate bidirectional mapping. Moreover, the applications of CInvISP in image denoising and retouching have also been verified.

Acknowledgement. This work was supported by the National Natural Science Foundation of China (NSFC) [grant number 62171145], and by Guangxi Key Laboratory of Multimedia Communications and Network Technology.

References 1. Karaimer, H.C., Brown, M.S.: A software platform for manipulating the camera imaging pipeline. In: Leibe, B., Matas, J., Sebe, N., Welling, M. (eds.) ECCV 2016. LNCS, vol. 9905, pp. 429–444. Springer, Cham (2016). https://doi.org/10. 1007/978-3-319-46448-0_26 2. Afifi, M., Abdelhamed, A., Abuolaim, A., Punnappurath, A., Brown, M.S.: CIE XYZ Net: unprocessing images for low-level computer vision tasks. IEEE Trans. Pattern Anal. Mach. Intell. 44, 4688–4700 (2022) 3. Zhang, K., Zuo, W., Chen, Y., Meng, D., Zhang, L.: Beyond a Gaussian denoiser: residual learning of deep CNN for image denoising. IEEE Trans. Image Process. 26(7), 3142–3155 (2017) 4. Liang, J., Cao, J., Sun, G., Zhang, K., Gool, L.V., Timofte, R.: SwinIR: image restoration using swin transformer. In: Proceedings of the IEEE International Conference on Computer Vision (ICCV), Montreal, BC, Canada, pp. 1833–1844 (2021) 5. Eilertsen, G., Kronander, J., Denes, G., Mantiuk, R.K., Unger, J.: HDR image reconstruction from a single exposure using deep CNNs. ACM Trans. Graph. 36(6), 1–15 (2017) 6. Zhang, Y., Tian, Y., Kong, Y., Zhong, B., Fu, Y.: Residual dense network for image restoration. IEEE Trans. Pattern Anal. Mach. Intell. 43(7), 2480–2495 (2021) 7. Chang, K., Li, H., Tan, Y., Ding, P.L.K., Li, B.: A two-stage convolutional neural network for joint demosaicking and super-resolution. IEEE Trans. Circ. Syst. Video Technol. 32(7), 4238–4254 (2022) 8. Brooks, T., Mildenhall, B., Xue, T., Chen, J., Sharlet, D., Barron, J.T.: Unprocessing images for learned raw denoising. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Long Beach, CA, USA, pp. 11036–11045 (2019) 9. Chen, C., Chen, Q., Xu, J., Koltun, V.: Learning to see in the dark. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Salt Lake City, UT, USA, pp. 3291–3300 (2018) 10. Schwartz, E., Giryes, R., Bronstein, A.M.: DeepISP: towards learning an end-toend image processing pipeline. IEEE Trans. Image Process. 28(2), 912–923 (2018) 11. Ignatov, A., Gool, L.V., Timofte, R.: Replacing mobile camera ISP with a single deep learning model. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), Seattle, WA, USA, pp. 2275–2285 (2020) 12. Liu, S., et al.: VisionISP: repurposing the image signal processor for computer vision applications. In: Proceedings of the IEEE International Conference on Image Processing (ICIP), Taipei, Taiwan, pp. 4624–4628 (2019) 13. Liu, S., et al.: Deep-FlexISP: a three-stage framework for night photography rendering. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), New Orleans, LA, USA, pp. 1210–1219 (2022)


14. Conde, M.V., et al.: Reversed image signal processing and RAW reconstruction. AIM 2022 challenge report. In: Karlinsky, L., Michaeli, T., Nishino, K. (eds.) ECCV 2022. LNCS, vol. 13803, pp. 3–26. Springer, Cham (2023). https://doi.org/10.1007/ 978-3-031-25066-8_1 15. Zamir, S.W., et al.: CycleISP: real image restoration via improved data synthesis. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, WA, USA, pp. 2696–2705 (2020) 16. Xing, Y., Qian, Z., Chen, Q.: Invertible image signal processing. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR). Virtual Conference, pp. 6287–6296 (2021) 17. Kingma, D.P., Dhariwal, P.: Glow: generative flow with invertible 1 × 1 convolutions. In: Proceedings of the Neural Information Processing Systems (NIPS), Montreal, Canada, pp. 2722–2730 (2019) 18. Ho, J., Chen, X., Srinivas, A., Duan, Y., Abbeel, P.: Flow++: improving flowbased generative models with variational dequantization and architecture design. In: Proceedings of the International Conference on Machine Learning (ICML), Long Beach, USA, pp. 2722–2730 (2019) 19. Xiao, M., et al.: Invertible image rescaling. In: Vedaldi, A., Bischof, H., Brox, T., Frahm, J.-M. (eds.) ECCV 2020. LNCS, vol. 12346, pp. 126–144. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-58452-8_8 20. Rang, N.H.M., Prasad, D.K., Brown, M.S.: Raw-to-raw: mapping between image sensor color responses. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Columbus, OH, USA, pp. 3398–3405 (2014) 21. Liu, C., Chang, X., Shen, Y.D.: Unity style transfer for person re-identification. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR). Virtual Conference, pp. 6886–6895 (2020) 22. Zhong, Z., Zheng, L., Zheng, Z., Li, S., Yang, Y.: Camera style adaptation for person re-identification. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Salt Lake City, UT, USA, pp. 5157–5166 (2018) 23. Huang, X., Belongie, S.J.: Arbitrary style transfer in real-time with adaptive instance normalization. In: Proceedings of the IEEE International Conference on Computer Vision (ICCV), Venice, Italy, pp. 4688–4700 (2017) 24. Ulyanov, D., Vedaldi, A., Lempitsky, V.: Improved texture networks: maximizing quality and diversity in feed-forward stylization and texture synthesis. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, Hawaii, USA, pp. 4105–4113 (2017) 25. Bychkovsky, V., Paris, S., Chan, E., Durand, F.: Learning photographic global tonal adjustment with a database of input/output image pairs. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Colorado Springs, CO, USA, pp. 97–104 (2011) 26. Wang, Z., Bovik, A.C., Sheikh, H.R., Simoncelli, E.P.: Image quality assessment: from error visibility to structural similarity. IEEE Trans. Image Process. 13(4), 600–612 (2004) 27. Backhaus, W., Kliegl, R., Werner, J.S.: Color vision: perspectives from different disciplines. Optom. Vis. Sci. 76 (1999) 28. Abdelhamed, A., Lin, S., Brown, M.S.: A high-quality denoising dataset for smartphone cameras. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Salt Lake City, UT, USA, pp. 1692–1700 (2018)


29. Zeng, H., Cai, J., Li, L., Cao, Z., Zhang, L.: Learning image-adaptive 3D lookup tables for high performance photo enhancement in real-time. IEEE Trans. Pattern Anal. Mach. Intell. 44(4), 2058–2073 (2022) 30. Hu, Y., He, H., Xu, C., Wang, B., Lin, S.: Exposure: a white-box photo postprocessing framework. ACM Trans. Graph. 37(2), 1–17 (2018)

Ignored Details in Eyes: Exposing GAN-Generated Faces by Sclera

Tong Zhang, Anjie Peng, and Hui Zeng(B)

School of Computer Science and Technology, Southwest University of Science and Technology, Mianyang, China
[email protected]

Abstract. Advances in Generative adversarial networks (GAN) have significantly improved the quality of synthetic facial images, posing threats to many vital areas. Thus, identifying whether a presented facial image is synthesized is of forensic importance. Our fundamental discovery is the lack of capillaries in the sclera of GAN-generated faces, which is caused by the lack of physical/physiological constraints in the GAN model. Because there are more or fewer capillaries in people's eyes, one can distinguish real faces from GAN-generated ones by carefully examining the sclera area. Following this idea, we first extract the sclera area from a probe image, then feed it into a residual attention network to distinguish GAN-generated faces from real ones. The proposed method is validated on the Flickr-Faces-HQ and StyleGAN2/StyleGAN3-generated face datasets. Experiments demonstrate that the capillaries in the sclera are a very effective feature for identifying GAN-generated faces. Our code is available at: https://github.com/10961020/Deepfake-detector-based-on-blood-vessels.

Keywords: Image Forensics · Generative adversarial networks · GAN-generated faces detection · Physical/physiological constraints · Capillaries

1 Introduction

With the development of Generative Adversarial Networks (GAN) [1], the quality of GAN-generated faces has been greatly improved, and it is difficult for humans to distinguish these generated faces from real ones. Such easily synthesized faces can be directly leveraged for disinformation, causing profound social and ethical concerns. To avoid potential security issues caused by GAN-generated faces [2, 3], researchers have developed various novel methods to detect them [4, 5]. Existing detection methods can be roughly divided into three categories. Deep learning-based detectors [6–9] directly learn features from the raw image, alleviating the burden of constructing handcrafted features. For example, Barni et al. compute the co-occurrence of images to train a deep neural network for identifying synthesized faces [7], and Wang et al. observe that the neurons in the network 'react' differently when processing authentic and generated images [8]. The second category is based on the so-called 'GAN fingerprint' [10–13]. These methods focus on the traces left by the deconvolution


and upsampling processes within the GAN model. Despite the high detection rate of these two types of detectors, lacking interpretability hinders their application in serious scenarios. Moreover, the artifacts in the synthesized faces on which these methods are based are prone to image post-processing operations.

Fig. 1. The structure of the human eye, where the sclera area is highlighted (top). Below it, we show three samples of the eye area. From top to bottom, there are a real face, a face generated by ProGAN, and a face generated by StyleGAN. The details of GAN-generated faces in the sclera are not as rich as the real face.

This paper focuses on the third class of methods, exploring the differences in physical/physiological features between GAN-generated and real faces [14–18]. These methods use the defects of GAN-generated faces in the physiological characteristics, which are usually more interpretable. Specifically, we discover the common defect of lack of capillaries in the sclera of the generated faces. Capillaries in the sclera are manifestations of telangiectasia and congestion in the eyeball, which are common in humans, especially adults. However, as shown in Fig. 1, state-of-the-art GAN models often ignore such an inconspicuous yet intrinsic feature, based on which we can design a detector to recognize GAN-generated faces.


Our proposed GAN-generated face detector consists of several steps. We first segment the sclera region of the eyes. Then, a residual attention network consisting of attention blocks inspired by [19] is used to learn the distinction between real and generated faces. This residual attention network can effectively reveal artifacts of GANgenerated faces, including but not limited to capillaries. To sum up, we exploit the absence of capillaries in the GAN-generated faces as a clue for distinguishing them from real ones. According to our experiments on the Flickr-Faces-HQ dataset [20] and StyleGAN2/StyleGAN3-generated face datasets, this method is highly interpretable and effective.

2 Related Works 2.1 GAN-Generated Face Generation GAN is one of the most popular architectures for generating realistic-looking faces [20– 26]. A thorough survey of the GAN models is out of our scope, and we only focus on some classical GAN models: DCGAN [23], ProGAN [24], and StyleGAN [20, 25, 26]. As a pioneer in face generation, DCGAN is limited by the low resolution of the output images. ProGAN improves the resolution of face images and optimizes facial details by continuously adding convolutional kernels during the training process. StyleGAN decouples the input vector, feeding it into the intermediate generator layer and control the generated style. Although the faces they generate are becoming more realistic, these generators inevitably leave their own traces, such as the inconsistent color or highlights between two eyes [16, 17] and generator-specific fingerprints [10]. These artifacts are valuable clues for revealing generated faces. 2.2 Physical/Physiological-Based GAN-Generated Face Detection Approaches based on physical/physiological features are particularly notable among various GAN-generated face detectors. These methods offer intuitive interpretability and demonstrate excellent robustness even with image post-processing. Yang et al. [14] distinguish GAN-generated faces by exploiting the distributions of facial landmarks. In [15], GAN-generated faces are detected by analyzing artifacts on the iris. Matern et al. [16] propose distinguishing GAN-generated faces by analyzing the inconsistency of the binocular color of generated faces. However, the artifact of inconsistent iris color mostly appears in ProGAN, rarely in StyleGAN or more advanced models. Hu et al. [17] spot GAN-generated faces by the inconsistency of the highlight on both eyes. Guo et al. [18] exploit the irregularity of the pupil shape in the GAN faces. However, its performance heavily depends on the accuracy of pupil contour extraction, which is challenging for dark-iris faces. 2.3 Attention Mechanism The human visual system can quickly locate essential objects from complex scenes, which inspires the attention mechanism in various computer vision systems [27]. SENet


obtains each channel's importance by squeezing, then promotes or suppresses the corresponding channels according to their importance [28]. Jaderberg et al. propose spatial transformer networks to focus on highly correlated regions [29]. Wang et al. combine attention mechanisms with residuals [19]; specifically, a deep residual attention network is obtained by combining channel and spatial attention.

3 Method Despite the rapid improvement of GAN models, it is difficult for them to learn the subtle features of human faces thoroughly. An obvious example is the absence of capillaries in the sclera of GAN-generated faces (See Fig. 1). Inspired by this discovery, we propose a GAN-generated face detection algorithm by examining the capillaries in the sclera. First, we mount a face detector to locate the eyes and extract the sclera regions from them. Then, a residual attention network [19] determines whether the extracted sclera regions are real or generated. Figure 2 shows the overall pipeline of the proposed GAN-generated face detector, which is detailed in the subsequent subsections.

Fig. 2. The overall diagram of the proposed GAN-generated face detector, which includes a sclera segmentation module and a residual attention network.


3.1 Sclera Segmentation Our detector begins with facial landmark extraction using Dlib [30]. Using the landmark points of the eye (The red dots on the eye image in Fig. 3(a)), an eye image (Fig. 3(a)) can be cropped from the probe face image. At the same time, an eye mask is obtained by calculating the convex hull image1 based on the landmark points. Then, the Otsu algorithm [31] is adopted to binarize the eye image. The binarized image (Fig. 3(b)) is intersected with the eye mask to obtain the potential sclera mask (Fig. 3(c)). We take the maximum connected region of the potential sclera mask as the sclera mask to avoid interference from highlights. The morphological opening is applied to the sclera mask to represent the sclera region better. Finally, sclera regions of both eyes are extracted from eye images based on the sclera masks and stacked horizontally for future use.
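A rough OpenCV/dlib sketch of these segmentation steps is given below; the 68-landmark model file and a few simplifications (e.g., Otsu thresholding is applied to the whole grayscale image rather than to the cropped eye, and only the masks are returned) are ours:

```python
import cv2
import dlib
import numpy as np

detector = dlib.get_frontal_face_detector()
predictor = dlib.shape_predictor("shape_predictor_68_face_landmarks.dat")  # assumed model path


def sclera_mask(gray, eye_points):
    """One eye: convex-hull eye mask, Otsu binarisation, largest connected
    component, then morphological opening."""
    eye_mask = np.zeros_like(gray)
    cv2.fillConvexPoly(eye_mask, cv2.convexHull(eye_points), 255)

    # Otsu thresholding keeps the bright sclera; restrict it to the eye region.
    _, binary = cv2.threshold(gray, 0, 255, cv2.THRESH_BINARY + cv2.THRESH_OTSU)
    candidate = cv2.bitwise_and(binary, eye_mask)

    # Keep the largest connected component to discard specular highlights.
    n, labels, stats, _ = cv2.connectedComponentsWithStats(candidate)
    if n <= 1:
        return candidate
    largest = 1 + np.argmax(stats[1:, cv2.CC_STAT_AREA])
    mask = np.where(labels == largest, 255, 0).astype(np.uint8)

    # Morphological opening cleans up the sclera region.
    kernel = cv2.getStructuringElement(cv2.MORPH_ELLIPSE, (5, 5))
    return cv2.morphologyEx(mask, cv2.MORPH_OPEN, kernel)


def extract_sclera_masks(image_bgr):
    gray = cv2.cvtColor(image_bgr, cv2.COLOR_BGR2GRAY)
    masks = []
    for face in detector(gray):
        pts = np.array([[p.x, p.y] for p in predictor(gray, face).parts()],
                       dtype=np.int32)
        for sl in (slice(36, 42), slice(42, 48)):  # dlib eye landmark indices
            masks.append(sclera_mask(gray, pts[sl]))
    return masks
```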

Fig. 3. Scleral segmentation process. (a) The eye image. (b) The binarized image of (a). (c) The potential sclera mask. (d) The largest connected component of (c). (e) The sclera mask.

3.2 Residual Attention Network Our residual attention network is constructed by stacking multiple residual blocks [32] and attention modules. As shown in Fig. 4, an attention module comprises two branches: a trunk branch, which is a residual block, and a soft mask branch. The soft mask branch collects the global information of the input feature map through downsampling, and then restores the output to the size of the input through upsampling. It works as a feature selector that enhances significant features and suppresses noises of the trunk. The output of our attention module can be represented as O(x) = (1 + M (x)) × T (x).

(1)

where T (x) and M (x) (ranges from [0, 1]) represent the outputs of the trunk branch and the soft mask branch, respectively. A Sigmoid function is used at the end of the residual attention network to complete the classification. The structure of the residual attention network is detailed in Table 1.
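A minimal PyTorch sketch of the attention-module output O(x) = (1 + M(x)) × T(x) is shown below; the actual mask branch in [19] and Table 1 is deeper than this single down/up-sampling stage, so treat it only as an illustration (it also assumes the trunk block preserves the channel count and an even spatial size):

```python
import torch.nn as nn


class AttentionModule(nn.Module):
    """Sketch: trunk T(x) is a residual block; soft mask M(x) in [0, 1]
    re-weights it as O(x) = (1 + M(x)) * T(x)."""

    def __init__(self, channels, trunk_block):
        super().__init__()
        self.trunk = trunk_block
        self.mask = nn.Sequential(
            nn.MaxPool2d(2),                       # gather wider context
            nn.Conv2d(channels, channels, 3, padding=1),
            nn.ReLU(inplace=True),
            nn.Upsample(scale_factor=2, mode="bilinear", align_corners=False),
            nn.Conv2d(channels, channels, 1),
            nn.Sigmoid(),                          # M(x) in [0, 1]
        )

    def forward(self, x):
        return (1.0 + self.mask(x)) * self.trunk(x)
```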

1 https://blogs.mathworks.com/steve/2011/10/04/binary-image-convex-hull-algorithm-notes/.


Fig. 4. The structure of an attention module. The black part represents soft mask branches and the orange part represents the trunk branch.

4 Experiments

The real faces are from the Flickr-Faces-HQ (FFHQ) dataset, and the GAN-generated faces are created by StyleGAN2 (see footnote 2) and StyleGAN3. All images have a resolution of 1024 × 1024. As in [16–18], images with sunglasses and closed eyes are excluded. Five hundred images from StyleGAN2 and 500 images from FFHQ, respectively, are used for training. Since a large volume of high-quality StyleGAN3-generated images is unavailable, the training dataset does not include StyleGAN3-generated images. The size of the extracted sclera region is adjusted to 96 × 96 for training and testing. The residual attention network consists of three attention modules. The SGD optimizer with a batch size of 4 is used to train the residual attention network. We use a weight decay of 0.0002 with a momentum of 0.9 and set the initial learning rate to 0.001. The total number of training epochs is 100. In evaluating the robustness to image post-processing operations, we select six common transformations, namely median filtering, resizing, noise, blur, rotation, and JPEG compression.

4.1 Comparison with the State of the Arts

The proposed method is compared with three physical/physiological-based detectors [16–18] and a data-driven-based detector [7]. In evaluating the performance of [7], we trained two detectors with different volumes of data. One is trained with the same dataset as the proposed method, 500 StyleGAN2-generated images and 500 real ones. The other one is trained with a much larger dataset, 5000 StyleGAN2-generated images and 5000



Table 1. Details of the residual attention network.

Layer            | Output Size | Network
Conv1            | 96 × 96     | 5 × 5, 64
Max pooling      | 48 × 48     | 2 × 2, stride 2
Residual block   | 48 × 48     | [1 × 1, 16; 3 × 3, 16; 1 × 1, 64] × 1
Attention module | 48 × 48     | 1 × Attention
Residual block   | 24 × 24     | [1 × 1, 32; 3 × 3, 32, stride 2; 1 × 1, 128] × 1
Attention module | 24 × 24     | 1 × Attention
Residual block   | 12 × 12     | [1 × 1, 64; 3 × 3, 64, stride 2; 1 × 1, 256] × 1
Attention module | 12 × 12     | 1 × Attention
Residual block   | 12 × 12     | [1 × 1, 128; 3 × 3, 128; 1 × 1, 512] × 3
Average pooling  | 1 × 1       | 12 × 12, stride 1
FC, Sigmoid      | 1           | –

The other detector is trained with a much larger dataset of 5000 StyleGAN2-generated images and 5000 real ones, similar to that used in the original paper. We denote the first detector as [7]a and the second as [7]b. Although comparing physical/physiological-based detectors with a data-driven one may be unfair, we are interested in how large the performance margin between them is. Figure 5(a) compares the ROC curves of the evaluated methods on StyleGAN2. Here, 500 real images and 500 StyleGAN2-generated images are evaluated. Both [7]a and [7]b achieve nearly perfect performance when the training set and the test set are from the same source (StyleGAN2). Among the physical/physiological-based detectors, the proposed method outperforms the competitors by a large margin. The area under the curve (AUC) of our method is 0.94, whereas that of the second-best physical/physiological-based detector [16] is only 0.63. The defect of inconsistent color and highlights in the two eyes has been largely eliminated in recent StyleGAN2, so detectors based on the corresponding artifacts [16, 17] perform unsatisfactorily on the test set used. The poor performance of [18] arises because we do not exclude faces with dark irises, as done in [18]. As analyzed in Sect. 2.2, [18] only performs well for faces with light irises.

Fig. 5. ROC curves of different detectors when the synthesized images are generated with StyleGAN2 (a) and StyleGAN3 (b).

We use the StyleGAN3-generated images provided by [33] to evaluate the detectors' generalizability. Here, there are 250 StyleGAN3-generated images and 250 real images. As shown in Fig. 5(b), although both [7] and the proposed method suffer from the mismatch between the training set (StyleGAN2) and the test set (StyleGAN3), the degree of performance degradation is distinct. The proposed method achieves acceptable performance thanks to its better interpretability, whereas [18] reduces to a random guess.

Fig. 6. The influence of eye region feature on the proposed method. (a) Faces of different eye region features. From left to right, eyes with significant capillaries, without significant capillaries, and with disturbances. (b) ROC curves on images with different eye region features.



4.2 How Do Eye Region Features Affect the Proposed Method?

To study the influence of eye region features on the proposed method, we manually divide the test set into three subsets according to the features of the eye region. The faces in the first subset have noticeable capillaries in the sclera, whereas the second subset has fewer capillaries. The faces in the third subset have significant distractions in the eye area, e.g., wearing glasses. Figure 6(a) gives a sample of each subset. Figure 6(b) plots the ROC curves of the proposed method under the above three scenarios. When there are apparent capillaries in the sclera, the distinguishability of the proposed method is greatly improved over the case without capillaries (AUC = 0.96 vs. AUC = 0.89), which verifies that capillaries are a critical clue for our detector. Figure 6(b) also shows that the proposed method is robust to disturbances in the eye area.

4.3 Robustness Analysis

Last, we evaluate the robustness of the detectors against a wide variety of image post-processing operations. For the competitors, we only report the result of [7] on StyleGAN2, since the other detectors do not perform well even on non-processed images, as reported in Sect. 4.1. The AUC values after six conventional post-processing operations are reported in Table 2. Overall, our method is reasonably robust against various post-processing operations. The most severe performance degradation, from AUC = 0.94 to AUC = 0.88, is observed for JPEG compression (QF = 85). In contrast, [7] is only robust to Gaussian blur, no matter how many images are involved in training. Such results confirm that physiological characteristics are not easily affected by image post-processing [5].

4.4 Qualitative Result

Figure 7 shows heatmap visualizations (extracted from the output of the last residual block) of both real and GAN-generated sclera examples. Notably, there is a discernible contrast between the heatmap of the GAN-generated sclera and that of the real sclera. Specifically, the residual attention network predominantly focuses on the capillary regions for real sclera and emphasizes edge regions for GAN-generated sclera. This observation demonstrates the residual attention network's efficacy in discerning GAN-generated faces.
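A minimal sketch, using Pillow and NumPy, of how post-processing operations like those in Table 2 below (median filtering, Gaussian noise, Gaussian blur, rotation, resizing, JPEG compression) could be applied when probing robustness; the parameter values mirror the table, but the exact implementation used by the authors is not specified in the paper.

import io
import numpy as np
from PIL import Image, ImageFilter

def add_gaussian_noise(img, sigma):
    arr = np.asarray(img).astype(np.float32) / 255.0
    arr = np.clip(arr + np.random.normal(0.0, sigma, arr.shape), 0.0, 1.0)
    return Image.fromarray((arr * 255).astype(np.uint8))

def jpeg_compress(img, quality):
    buf = io.BytesIO()
    img.save(buf, format="JPEG", quality=quality)
    buf.seek(0)
    return Image.open(buf).convert("RGB")

def post_process(img):
    """Yield (name, transformed image) pairs for the six operations."""
    w, h = img.size
    yield "median_3x3", img.filter(ImageFilter.MedianFilter(3))
    yield "noise_0.05", add_gaussian_noise(img, 0.05)
    yield "blur_0.1",   img.filter(ImageFilter.GaussianBlur(0.1))
    yield "rotate_5",   img.rotate(5)
    yield "resize_0.8", img.resize((int(w * 0.8), int(h * 0.8)))
    yield "jpeg_85",    jpeg_compress(img, 85)

face = Image.open("probe_face.png").convert("RGB")   # placeholder path
processed = dict(post_process(face))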



Table 2. Robustness (in terms of AUC score) of the GAN-face detectors in the presence of post-processing.

Processing       | Parameter              | StyleGAN2 (proposed/[7]a/[7]b) | StyleGAN3 (proposed)
Median filtering | window size: 3 × 3     | 0.92/0.50/0.52                 | 0.83
                 | window size: 5 × 5     | 0.89/0.41/0.41                 | 0.81
Gaussian noise   | σ = 0.05               | 0.94/0.75/0.90                 | 0.83
                 | σ = 0.1                | 0.94/0.58/0.65                 | 0.82
                 | σ = 0.2                | 0.94/0.49/0.45                 | 0.82
Gaussian blur    | σ = 0.05               | 0.94/0.99/1.00                 | 0.83
                 | σ = 0.1                | 0.94/0.99/1.00                 | 0.83
                 | σ = 0.2                | 0.94/0.99/1.00                 | 0.83
Rotation         | 5°                     | 0.93/0.50/0.63                 | 0.84
                 | 10°                    | 0.92/0.53/0.68                 | 0.83
                 | 15°                    | 0.92/0.54/0.69                 | 0.82
Resize           | scale factor: r = 0.8  | 0.92/0.44/0.43                 | 0.86
                 | r = 0.9                | 0.93/0.44/0.43                 | 0.84
                 | r = 1.1                | 0.93/0.44/0.44                 | 0.82
                 | r = 1.2                | 0.92/0.44/0.44                 | 0.84
JPEG compression | quality factor: 95     | 0.92/0.67/0.73                 | 0.79
                 | 90                     | 0.90/0.65/0.70                 | 0.76
                 | 85                     | 0.88/0.64/0.68                 | 0.79

Fig. 7. Visualization of the heatmaps derived from an intermediate layer of our residual attention network. (a) and (b) represent real and StyleGAN2-generated images, respectively. This visualization offers an intuitive method for human observers to identify GAN-generated faces by examining their sclera regions.



5 Conclusion

In this work, we point out that the sclera of GAN-generated faces often lacks the capillaries common in real faces. Based on this observation, a novel GAN-generated face detector is proposed. We first extract the sclera region from a probe face and then feed it into a residual attention network to determine whether the face is generated or authentic. Experiments on the popular FFHQ dataset and on synthesized images generated with two recent GANs demonstrate that our proposed method achieves state-of-the-art performance in identifying GAN-generated faces. We hope the proposed method can enrich the arsenal of GAN-generated face detection.

Acknowledgements. The work is supported by the Network Emergency Management Research Special Topic (No. WLYJGL2023ZD003) and the Opening Project of the Guangdong Province Key Laboratory of Information Security Technology (Grant No. 2020B1212060078).

References
1. Goodfellow, I., et al.: Generative adversarial nets. In: Neural Information Processing Systems (2014)
2. O'Sullivan, D.: A high school student created a fake 2020 US candidate. Twitter verified it. CNN Business (2020). https://cnn.it/3HpHfzz
3. Sganga, N.: Is that Facebook account real? Meta reports "rapid rise" in AI-generated profile pictures. CBS News (2022)
4. Wang, X., Guo, H., Hu, S., Chang, M.C., Lyu, S.: GAN-generated faces detection: a survey and new perspectives. arXiv:2202.07145 (2022)
5. Verdoliva, L.: Media forensics and DeepFakes: an overview. arXiv:2001.06564 (2020)
6. Chen, B., Tan, W., Wang, Y., Zhao, G.: Distinguishing between natural and GAN-generated face images by combining global and local features. Chin. J. Electron. 31, 59–67 (2022)
7. Barni, M., Kallas, K., Nowroozi, E., Tondi, B.: CNN detection of GAN-generated face images based on cross-band co-occurrences analysis. In: 2020 IEEE International Workshop on Information Forensics and Security, pp. 1–6 (2020)
8. Wang, R., et al.: FakeSpotter: a simple yet robust baseline for spotting AI-synthesized fake faces. In: Twenty-Ninth International Joint Conference on Artificial Intelligence and Seventeenth Pacific Rim International Conference on Artificial Intelligence (2019)
9. Wang, S.Y., Wang, O., Zhang, R., Owens, A., Efros, A.A.: CNN-generated images are surprisingly easy to spot... for now. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 8695–8704 (2020)
10. Marra, F., Gragnaniello, D., Verdoliva, L., Poggi, G.: Do GANs leave artificial fingerprints? In: IEEE Conference on Multimedia Information Processing and Retrieval, pp. 506–511 (2019)
11. Yu, N., Davis, L.S., Fritz, M.: Attributing fake images to GANs: learning and analyzing GAN fingerprints. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 7556–7566 (2019)
12. Pu, J., Mangaokar, N., Wang, B., Reddy, C.K., Viswanath, B.: NoiseScope: detecting deepfake images in a blind setting. In: Annual Computer Security Applications Conference, pp. 913–927 (2020)
13. Frank, J., Eisenhofer, T., Schönherr, L., Fischer, A., Kolossa, D., Holz, T.: Leveraging frequency analysis for deep fake image recognition. In: International Conference on Machine Learning, pp. 3247–3258 (2020)
14. Yang, X., Li, Y., Qi, H., Lyu, S.: Exposing GAN-synthesized faces using landmark locations. In: Proceedings of the ACM Workshop on Information Hiding and Multimedia Security, pp. 113–118 (2019)
15. Guo, H., Hu, S., Wang, X., Chang, M.C., Lyu, S.: Robust attentive deep neural network for exposing GAN-generated faces. IEEE Access 10, 32574–32583 (2022)
16. Matern, F., Riess, C., Stamminger, M.: Exploiting visual artifacts to expose deepfakes and face manipulations. In: IEEE Winter Applications of Computer Vision Workshops, pp. 83–92 (2019)
17. Hu, S., Li, Y., Lyu, S.: Exposing GAN-generated faces using inconsistent corneal specular highlights. In: IEEE International Conference on Acoustics, Speech and Signal Processing, pp. 2500–2504 (2021)
18. Guo, H., Hu, S., Wang, X., Chang, M.C., Lyu, S.: Eyes tell all: irregular pupil shapes reveal GAN-generated faces. In: IEEE International Conference on Acoustics, Speech and Signal Processing, pp. 2904–2908 (2022)
19. Wang, F., et al.: Residual attention network for image classification. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 3156–3164 (2017)
20. Karras, T., Laine, S., Aila, T.: A style-based generator architecture for generative adversarial networks. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 9243–9252 (2019)
21. Brock, A., Donahue, J., Simonyan, K.: Large scale GAN training for high fidelity natural image synthesis. arXiv:1809.11096 (2018)
22. Goetschalckx, L., Andonian, A., Oliva, A., Isola, P.: GANalyze: toward visual definitions of cognitive image properties. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 5744–5753 (2019)
23. Radford, A., Metz, L., Chintala, S.: Unsupervised representation learning with deep convolutional generative adversarial networks. In: International Conference on Learning Representations (2015)
24. Karras, T., Aila, T., Laine, S., Lehtinen, J.: Progressive growing of GANs for improved quality, stability, and variation. In: International Conference on Learning Representations (2017)
25. Karras, T., Laine, S., Aittala, M., Hellsten, J., Lehtinen, J., Aila, T.: Analyzing and improving the image quality of StyleGAN. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 8110–8119 (2020)
26. Karras, T., et al.: Alias-free generative adversarial networks. In: Advances in Neural Information Processing Systems, pp. 852–863 (2021)
27. Guo, M.H., et al.: Attention mechanisms in computer vision: a survey. Comput. Vis. Media 8, 331–368 (2022)
28. Hu, J., Shen, L., Sun, G.: Squeeze-and-excitation networks. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 7132–7141 (2018)
29. Jaderberg, M., Simonyan, K., Zisserman, A.: Spatial transformer networks. In: Advances in Neural Information Processing Systems (2015)
30. King, D.E.: Dlib-ml: a machine learning toolkit. J. Mach. Learn. Res. 10, 1755–1758 (2009)
31. Otsu, N.: A threshold selection method from gray-level histograms. IEEE Trans. Syst. Man Cybern. 9, 62–66 (1979)
32. He, K., Zhang, X., Ren, S., Sun, J.: Identity mappings in deep residual networks. In: Leibe, B., Matas, J., Sebe, N., Welling, M. (eds.) ECCV 2016. LNCS, vol. 9908, pp. 630–645. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-46493-0_38
33. Corvi, R., Cozzolino, D., Zingarini, G., Poggi, G., Nagano, K., Verdoliva, L.: On the detection of synthetic images generated by diffusion models. In: IEEE International Conference on Acoustics, Speech and Signal Processing, pp. 1–5 (2023)

A Developer Recommendation Method Based on Disentangled Graph Convolutional Network

Yan Lu1, Junwei Du1, Lijun Sun1, Jinhuan Liu1, Lei Guo2, Xu Yu1,3,4(B), Daobo Sun5, and Haohao Yu6

1 School of Information Science and Technology, Qingdao University of Science and Technology, Qingdao 266061, China
[email protected], [email protected]
2 School of Information Science and Engineering, Shandong Normal University, Jinan 250014, China
3 Qingdao Institute of Software, China University of Petroleum, Qingdao 266580, China
[email protected]
4 Key Laboratory of Symbol Computation and Knowledge Engineering, Jilin University, Changchun 130012, China
5 College of Computer Science and Technology, Harbin Engineering University, Harbin 150001, China
6 QingDao Innovation And Development Base, Harbin Engineering University, Harbin, China

Abstract. Crowdsourcing Software Development (CSD) solves software development tasks by integrating resources from global developers. With more and more companies and developers moving onto CSD platforms, the information overload problem of the platform makes it difficult to recommend suitable developers for a software development task. The interaction behavior between developers and tasks is often the result of complex latent factors. Existing developer recommendation methods are mostly based on deep learning, where the feature representations ignore the influence of latent factors on interaction behavior, leading to learned feature representations that lack robustness and interpretability. To solve the above problems, we present a Developer Recommendation method based on Disentangled Graph Convolution (DRDGC). Specifically, we use a disentangled graph convolutional network to separate the latent factors within the original features. Each latent factor contains specific information and is independent of the others, which makes the features constructed from the latent factors exhibit stronger robustness and interpretability. Extensive experimental results show that DRDGC can effectively recommend the right developer for a task and outperforms the baseline methods.

Keywords: Crowdsourcing Software Development · Developer Recommendation · Disentangle Representation Learning · Graph Representation Learning



Fig. 1. The crowdsourcing model usually involves three different types of roles.

1 Introduction

Crowdsourcing Software Development is a new software development mode that outsources the software development tasks of enterprises to individuals with development capabilities. Compared with traditional software development, CSD can make greater use of scattered developer resources around the world to complete complex tasks through group collaboration. To date, a variety of crowdsourcing platforms supporting software tasks exist on the Internet, such as Topcoder, Upwork, and GitHub. In the crowdsourcing scenario, the relationship between employers, platforms, and developers is shown in Fig. 1. For example, the Topcoder platform has more than 10 million developers who have collaborated on close to 4 million software development tasks. While bringing huge benefits, the huge number of developers and tasks inevitably creates the problem of information overload, making it difficult to find the right developers for tasks. Against this background, developer recommendation methods for CSD platforms have important theoretical and applied research value and have attracted the attention of many researchers in recent years.

Recommendation systems are an effective means to alleviate information overload. Ref. [9] recommends developers for tasks through similarity relations of explicit features such as interaction records and descriptive information. Ref. [10] enhances developer recommendation by introducing additional information, such as social networks. Developer-task interaction behavior is typically driven by various latent factors. For instance, in crowdsourcing platforms, a developer may accept a task for reasons such as technical ability, compensation, and timeliness. However, existing graph convolution-based developer recommendation methods overlook the latent factors that determine the graph structure when learning node representations, because they treat a node's neighborhood as a perceptual whole. As a result, the learned node representations lack robustness to noisy data and interpretability. The explicit features of CSD platforms contain many programming languages and skills with high dimensionality and sparsity. Disentangled representation learning can map high-dimensional sparse explicit features to a smaller number of latent factors (each latent factor consists of high-dimensional implicit features). The novel features consisting of latent factors have better robustness and interpretability [6].



Specifically, we model the task-developer interaction as a bipartite graph and propose a Developer Recommendation method based on Disentangled Graph Convolution (DRDGC). First, we dynamically mine the latent factors affecting task-developer interactions with a disentangled graph convolutional module that combines information from the graph structure and neighboring nodes. The novel features re-modeled from latent factors have better robustness and interpretability. Further, a solution to the cold-start problem is proposed for newly released tasks and newly registered developers on the bipartite graph.

2 Related Work

2.1 Crowdsourcing Software Development

In the last decade, the research and application of recommendation systems in e-commerce, social media, music, video, books, and services have achieved great results. However, research on recommender systems for resource allocation and collaborative relationships in the field of software engineering has not progressed very much. Because the numbers of tasks and developers are sufficiently large, the achievement matrix between developers and tasks is very sparse: only a few developers and a few tasks have achievement relationships with each other, which leads to poor performance of traditional recommendation algorithms. To address this sparsity problem, researchers have conducted a lot of research [5].

2.2 Graph Neural Network

Recently, graph neural networks (GNN) [12] have received increasing attention in the field of recommender systems because of their powerful characterization capabilities based on node features and graph structure. Wang et al. [8] explored GNNs to capture higher-order connectivity information in bipartite graphs by propagating node features over them, which led to better performance of graph-based recommender systems. However, existing graph recommendation methods focus on and rely heavily on the descriptive features of the nodes. Hu et al. [4] modeled user-news interactions as a graph for news recommendation and proposed a graph convolution model combining long-term and short-term interests, demonstrating the effectiveness of using the graph structure of user-news interactions.

2.3 Disentangle Representation Learning

Disentangle representation learning aims to isolate the latent factors hidden in interaction behavior [1]. For example, Higgins et al. [3] proposed a constrained variational auto-encoder based disentangled representation for machine vision. Recently, learning disentangled representations of interaction behaviors has attracted increasing attention. Ma et al. [6] proposed a disentangled graph convolutional network to learn node representations on graphs, which uses a neighbor routing mechanism to identify latent factors and aggregates the features specific to each latent factor by convolution.

Fig. 2. DRDGC Framework Diagram.

3 Method

In this paper, we propose a joint algorithm based on disentangled graph convolution and Fuzzy C-Means (FCM) for developer recommendation. Figure 2 shows a high-level overview of this method.

3.1 Problem Description

In software crowdsourcing platforms, three kinds of objects exist: tasks, developers, and the crowdsourcing platform. Their interaction is that a developer takes a task through the crowdsourcing platform, and after completing the task, this interaction receives a corresponding grade. Tasks include existing tasks and newly released tasks (cold-start tasks). Developers include existing developers and newly registered developers (cold-start developers). Figure 3 visualizes the task-developer interaction in the crowdsourcing platform as an achievement matrix, where the size of the number indicates the achievement of the developer on the task and the number 0 indicates that the developer is not involved in the task. The entire rows and columns of cold-start tasks and developers contain no grades since there is no record of any interaction. Our goal is to fill the gaps of the cold-start rows and columns in the grade matrix by assigning the most suitable developer to each task using the information already available in the crowdsourcing platform.

3.2 Feature Representation

The software crowdsourcing platform provides a variety of feature types, from text to numeric. Text features of the developer include the self-description, skills, and the type of developer. Numeric features of the developer include the date of registration, etc.

Fig. 3. Task-developer interaction grade matrix.

The text features of the task include the title, the description of the task, and the required skills. The numerical features of the task include the date of posting, the date of submission, the task rewards, etc. Although the title and description contain a large amount of text, most of the redundant content is not helpful for model training. The set of keywords includes available information such as title, skill, type, etc. In addition, to represent unstructured data such as keywords, we use the one-hot model to encode the keyword set.
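A minimal sketch of one-hot (multi-hot) encoding of a keyword set, assuming a small illustrative vocabulary; the actual keyword extraction and vocabulary of the platform data are not specified here.

import numpy as np

# Illustrative keyword vocabulary built from titles, skills, types, etc.
vocab = ["java", "python", "web", "testing", "design", "data-science"]
index = {kw: i for i, kw in enumerate(vocab)}

def one_hot_keywords(keywords):
    """Multi-hot vector over the keyword vocabulary (unknown keywords ignored)."""
    vec = np.zeros(len(vocab), dtype=np.float32)
    for kw in keywords:
        if kw in index:
            vec[index[kw]] = 1.0
    return vec

task_features = one_hot_keywords(["java", "web", "testing"])
developer_features = one_hot_keywords(["java", "data-science"])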

3.3 Graph Disentangle Module

Data Balancing Process. In developer recommendation systems, historical task-developer interaction is usually used as input. In this paper, we represent the historical interaction data as a bipartite graph G = {(d, t) | d ∈ D, t ∈ T}, where T = T_old ∪ T_new and D = D_old ∪ D_new represent the task set and the developer set, respectively.

Disentangled Graph Convolution. For most graph convolutional networks, given the features of a node and its neighbors, the output of the node at a layer is represented as y_t = f(x_t, {x_d : (t, d) ∈ G}), where f(·) denotes the convolution operation and y_t is the output feature representation of the task node after the convolution layer aggregates the features of neighboring nodes. The idea is that the non-identical neighbors of a node contain rich information that can be used to describe the node more comprehensively. In addition, y_t is a disentangled representation. In detail, the task feature y_t consists of K components, corresponding to K latent factors to be disentangled from the task-developer interaction, i.e., y_t = [r_1, r_2, ..., r_K], where r_k ∈ R^(d_out/K) (1 ≤ k ≤ K), and the k-th component r_k characterizes the node in the k-th latent aspect. The key challenge of this process is to divide the non-identical neighbor nodes connected by edges in the graph into K subsets, which are used to describe each of the latent factors.

We assume that in a bipartite graph, multiple latent factors influence the interaction between developers and tasks; these latent factors may be the type of task, the skills acquired by developers, etc. There are usually many developers involved in a task, and developers sharing the same latent factor are clustered in the graph. Based on the above assumptions, we take the initialized features of the task node x_t ∈ R^(d_in) in the bipartite graph and its developer neighbor nodes {x_d ∈ R^(d_in) : (t, d) ∈ G} as input, and the convolutional disentangling layer outputs y_t = [r_1, r_2, ..., r_K] ∈ R^(d_out), where r_k represents the feature of the k-th latent factor affecting the task-developer interaction. We assume the disentangling layer has K latent factors and map the task node and its developer neighbor nodes i ∈ {t} ∪ {d : (t, d) ∈ G} into K different subspaces to obtain a representation z_{i,k} of each node in each subspace:

z_{i,k} = σ(W_k^T x_i + b_k) / ||σ(W_k^T x_i + b_k)||_2    (1)

where W_k ∈ R^(d_in × d_out/K) and b_k ∈ R^(d_out/K) are the parameters of each channel and σ is a nonlinear activation function. We use L2 normalization to ensure numerical stability and to prevent neighbors with overly rich features (e.g., multiple keywords) from dominating the prediction.

p_{d,k}^(t) = exp(z_{d,k}^T r_k^(t) / τ) / Σ_{k'=1}^{K} exp(z_{d,k'}^T r_{k'}^(t) / τ)    (2)

We let p_{d,k} denote the probability that neighbor node d is partitioned into the k-th subset because of latent factor k, where p_{d,k} > 0, Σ_{k=1}^{K} p_{d,k} = 1, and τ is a hyperparameter controlling the hardness of the assignment. It is worth noting that each neighbor node can be assigned to only one subset, so as to find the most influential latent factor in the task-developer interaction process. By iteratively dividing the neighbors into subsets and reconstructing r_k, under the assumption that neighboring nodes with the same latent factor cluster together, we can extract the features of that latent factor from the clusters; r_k can be considered the center of each subspace cluster.

r_k^(t) = (z_{t,k} + Σ_{d:(t,d)∈G} p_{d,k}^(t−1) z_{d,k}) / ||z_{t,k} + Σ_{d:(t,d)∈G} p_{d,k}^(t−1) z_{d,k}||_2    (3)
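A minimal PyTorch sketch of this neighbor-routing step for one task node, under the assumptions above (K channels, a fixed number of routing iterations, and τ as a temperature); it is an illustration of Eqs. (1)–(3), not the authors' implementation.

import torch
import torch.nn.functional as F

def disentangled_conv(x_t, x_neighbors, W, b, K, tau=1.0, iterations=3):
    """x_t: (d_in,), x_neighbors: (n, d_in), W: (K, d_in, d_out//K), b: (K, d_out//K)."""
    nodes = torch.cat([x_t.unsqueeze(0), x_neighbors], dim=0)        # (1 + n, d_in)
    # Eq. (1): project every node into K subspaces and L2-normalize per channel.
    z = F.normalize(torch.relu(torch.einsum("nd,kde->nke", nodes, W) + b), dim=-1)
    z_t, z_d = z[0], z[1:]                                           # task / neighbors
    r = z_t.clone()                                                  # initial centers, (K, d_out//K)
    for _ in range(iterations):
        # Eq. (2): soft assignment of each neighbor to one latent factor.
        logits = torch.einsum("nke,ke->nk", z_d, r) / tau
        p = torch.softmax(logits, dim=1)                             # (n, K)
        # Eq. (3): re-estimate the channel centers and L2-normalize.
        r = F.normalize(z_t + torch.einsum("nk,nke->ke", p, z_d), dim=-1)
    return r.reshape(-1)                                             # y_t = [r_1, ..., r_K]

d_in, d_out, K, n = 16, 12, 3, 5
y_t = disentangled_conv(torch.randn(d_in), torch.randn(n, d_in),
                        torch.randn(K, d_in, d_out // K),
                        torch.randn(K, d_out // K), K)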

Recommendations Based on Reconstructed Individual Characteristics. The disentangled graph convolutional module stacks L convolutional layers, with f^l(·) denoting the l-th convolutional layer. For the task node, the output of the l-th convolutional layer is y_t^(l) ∈ R^(K^l × Δd), where K^l is the number of channels in the l-th convolutional layer and Δd is the fixed output dimension of each channel. We also add the constraint K^1 ≥ K^2 ≥ ... ≥ K^L.

y_t^(l) = dropout(f^l(y_t^(l−1), {y_d^(l−1) : (t, d) ∈ G}))    (4)

where 1 ≤ l ≤ L and y_t^(0) = x_t. The last layer of the disentangling network is a fully connected layer, denoted as y^(L+1) = W^(L+1)T y^(L) + b^(L+1), where W^(L+1) ∈ R^(K^L × Δd × C) and b^(L+1) ∈ R^C. The outputs y_t^(L+1) and y_d^(L+1) of the fully connected layer represent the individual characteristics of the task and the developer with the latent factors fused, respectively; they are used as the input of the FCM module, and their inner product is used as the result of individual matching recommendation:

R̂_1 = y_t y_d    (5)

Cold Start Problem. Although the disentangled graph convolutional module and the FCM module achieve comprehensive modeling of individuals and groups for the developer recommendation problem, the model at this stage cannot yet solve the cold-start problem. This is because cold-start nodes have no edges in the bipartite graph, and reconstructing the bipartite graph for training would consume a lot of computational resources. Since only explicit features exist for cold-start nodes, we link the cold-start nodes into the bipartite graph based on their explicit features, as shown in Fig. 4. First, we initialize the features of a cold-start node in the same way as in Subsect. 3.2, so that the features of the cold-start node and the history nodes lie in the same feature space. Next, the similarity between the cold-start node and same-type nodes is calculated; we choose the commonly used cosine similarity as the similarity measure:

s = (a × b) / (||a|| × ||b||)    (6)

We use S_T = {s_1, s_2, ..., s_T} to denote the similarities of a cold-start task node to the task nodes in the graph, further processed by softmax as S_T = softmax(S_T), and the Top-k most similar nodes are taken as the similar node set N = Topk(softmax(S_T)). Finally, a random walk is performed over the edges of the similar nodes, and the Top-k (non-similar) nodes with the largest weights among the nodes connected to them are selected as the link objects of the cold-start node.

R̂_2^cs = D_rw R I    (7)

where D_rw is the indication matrix of the random walk, R is the matrix of raw scores, and I is the indication matrix of the raw scores. This is a relatively conservative strategy but can effectively improve recommendation results. In addition, it is relatively easy to deploy since it does not require additional training data.
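A minimal sketch of the similarity-based part of this linking strategy (cosine similarity, softmax, Top-k) using NumPy; the random-walk weighting over the selected neighbors is only hinted at, and all array names are illustrative.

import numpy as np

def topk_similar(cold_feat, node_feats, k=3):
    """Cosine similarity of a cold-start node to existing nodes, softmax, Top-k."""
    a = cold_feat / (np.linalg.norm(cold_feat) + 1e-12)
    b = node_feats / (np.linalg.norm(node_feats, axis=1, keepdims=True) + 1e-12)
    sims = b @ a                                   # Eq. (6) for every existing node
    probs = np.exp(sims - sims.max())
    probs /= probs.sum()                           # softmax over similarities
    return np.argsort(-probs)[:k], probs

# Illustrative data: 10 existing task nodes with 6-dim keyword features.
rng = np.random.default_rng(0)
node_feats = rng.random((10, 6))
cold_feat = rng.random(6)
similar_ids, probs = topk_similar(cold_feat, node_feats, k=3)
# The cold-start node would then be linked to high-weight neighbors of
# `similar_ids`, e.g. chosen by random walks over their edges.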



Fig. 4. Cold start node linking strategy.

4 Experiment and Results

4.1 Data Preparation

Most of the tasks on the Topcoder platform are published as competitions; more than 10 million developers have registered with Topcoder and completed 4 million tasks. In this experiment, we collect data on competition tasks, including design, development programming, module testing, and data science. Since the Topcoder platform has adopted an anti-crawler strategy in recent years, we collected data from 2015 to 2020; although the number of tasks and developers is lower than we expected, the dataset can still support our experiment. We randomly select 80%, 60%, 40%, and 20% of the tasks and the developers corresponding to each task as the training datasets, denoted as TR80, TR60, TR40, and TR20. Next, the remaining 20%, 40%, 60%, and 80% of tasks are filtered to ensure that the developers in the test dataset are ones already present in the training dataset. The filtered test datasets are denoted as TE20, TE40, TE60, and TE80. The statistics of the processed dataset are shown in Table 1.

Table 1. Statistical information of the dataset.

Dataset | Task  | Developer | Interaction | Sparsity
TR80    | 10890 | 2586      | 273507      | 98.028%
TR70    | 7174  | 1951      | 124296      | 98.433%
TR60    | 6116  | 1534      | 62505       | 98.471%
TR50    | 5058  | 1017      | 37155       | 98.327%

4.2 Parameter Setting

For modeling the disentangled graph convolution module from explicit to implicit features, we use a uniform feature dimension; the hyperparameters are fine-tuned on the test dataset and finally evaluated on the validation dataset. We give the optimal parameters for the method in this paper: task feature dimension 100; developer feature dimension 150; learning rate 2e-05; training batch size 16; evaluation batch size 16; optimizer Adam with betas (0.9, 0.999); number of epochs 8 × 1000.

Fig. 5. Results of DRDGC compared with other methods.

4.3 Comparison Method

To fully validate the effectiveness of DRDGC, five baseline methods, namely FMRec, LightGCN, PM, CRFRec, and IR+CN [2, 7, 10, 11, 13], are used to demonstrate the validity of our approach.

4.4 Evaluation Indicators

MAE and RMSE are often used to evaluate the accuracy of score prediction. Since the score prediction is derived from feature operations, they can serve as evaluation metrics that measure the feature extraction ability of the disentangled graph convolutional module. However, since developer recommendation is a Top-k recommendation scenario, we cannot measure the performance of the model only in terms of overall rating prediction accuracy; we also use Recall and NDCG as evaluation metrics for the Top-k recommendation scenario.

4.5 Results Comparison

The comparison between DRDGC and the state-of-the-art algorithms on the TE20 test dataset is shown in Fig. 5. From the experimental results, it can be seen that DRDGC largely outperforms the compared baseline methods in the three evaluation indicators NDCG, ACC-Top, and Recall, with improvements of 2.64%, 1.37%, and 2.3% on TE20 over the current best-performing method, respectively. The MAE and RMSE comparison results of DRDGC with the CRFRec and LightGCN algorithms on the test datasets are shown in Table 2, and Table 3 shows the MAE and RMSE comparison of the three algorithms on the test datasets in the cold-start case.



From the above results, it can be seen that among the three compared algorithms, for overall performance prediction the disentangled graph convolution module has an advantage over the CRFRec and LightGCN models. Compared with the two baselines, the disentangled graph convolution module shows a clear advantage on the different evaluation indicators, and in the cold-start scenarios all evaluation metrics improve even more significantly: MAE and RMSE are reduced by 0.038 and 0.102 on average compared to the other methods on the different test sets, and by 0.052 and 0.113 on average in the tests under the cold-start scenarios. The disentangled graph convolution achieves more significant results than the other two methods in the cold-start scenarios, further indicating that the features extracted by the disentangled graph convolution have stronger representation ability and robustness.

Table 2. The MAE and RMSE comparison results of the three algorithms.

Methods  | MAE (TE80 / TE60 / TE40 / TE20)    | RMSE (TE80 / TE60 / TE40 / TE20)
CRFRec   | 0.9525 / 0.8927 / 0.8494 / 0.8154  | 1.2636 / 1.1873 / 1.1272 / 1.0763
LightGCN | 0.9158 / 0.8736 / 0.8315 / 0.8068  | 1.2051 / 1.1342 / 1.0934 / 1.0990
DRDGC    | 0.8933 / 0.8648 / 0.8205 / 0.7603  | 1.1598 / 1.0827 / 1.0497 / 0.9716

Table 3. The MAE and RMSE comparison results of the three methods in the cold start case.

Methods  | MAE (TE80 / TE60 / TE40 / TE20)    | RMSE (TE80 / TE60 / TE40 / TE20)
CRFRec   | 1.0023 / 0.9088 / 0.8603 / 0.8356  | 1.4149 / 1.2178 / 1.1352 / 1.1030
LightGCN | 0.9548 / 0.8807 / 0.8412 / 0.8162  | 1.2367 / 1.1442 / 1.1265 / 1.0597
DRDGC    | 0.9437 / 0.8725 / 0.8359 / 0.7866  | 1.1854 / 1.1280 / 1.0716 / 0.9937

5 Conclusion

Aiming at the developer recommendation problem of software crowdsourcing platforms, this paper proposes a developer recommendation method based on disentangled graph convolution. The method uses the latent factors that influence developer-task interactions as features obtained by disentangled graph convolution, which gives stronger robustness and interpretability. Further, a node-linking strategy is proposed for newly released tasks and newly registered developers, which can effectively solve the cold-start problem. Extensive experiments on the Topcoder platform dataset fully demonstrate the effectiveness of the developer recommendation method. In the future, we will extend this model to further explore the potential value of latent factors.

Acknowledgments. This work is jointly sponsored by the National Natural Science Foundation of China (Nos. 62172249, 62202253), the Natural Science Foundation of Shandong Province (Nos. ZR2021MF092, ZR2021QF074), and the Fundamental Research Funds for the Central Universities, JLU (No. 93K172022K01).

References
1. Cao, J., Lin, X., Cong, X., Ya, J., Liu, T., Wang, B.: DisenCDR: learning disentangled representations for cross-domain recommendation. In: International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 267–277 (2022)
2. He, X., Deng, K., Wang, X., Li, Y., Zhang, Y., Wang, M.: LightGCN: simplifying and powering graph convolution network for recommendation. In: International Conference on Research and Development in Information Retrieval, pp. 639–648 (2020)
3. Higgins, I., et al.: Towards a definition of disentangled representations. arXiv preprint arXiv:1812.02230 (2018)
4. Hu, L., Li, C., Shi, C., Yang, C., Shao, C.: Graph neural news recommendation with long-term and short-term interest modeling. Inf. Process. Manage. 57(2), 102142 (2020)
5. Jiang, H., et al.: DupHunter: detecting duplicate pull requests in fork-based development. IEEE Trans. Software Eng. 49(4), 2920–2940 (2023)
6. Ma, J., Cui, P., Kuang, K., Wang, X., Zhu, W.: Disentangled graph convolutional networks. In: International Conference on Machine Learning, pp. 4212–4221 (2019)
7. Rendle, S.: Factorization machines. In: International Conference on Data Mining, pp. 995–1000 (2010)
8. Wang, X., He, X., Wang, M., Feng, F., Chua, T.S.: Neural graph collaborative filtering. In: International Conference on Research and Development in Information Retrieval, pp. 165–174 (2019)
9. Xia, X., Lo, D., Wang, X., Zhou, B.: Accurate developer recommendation for bug resolution. In: Working Conference on Reverse Engineering, pp. 72–81 (2013)
10. Yu, Y., Wang, H., Yin, G., Wang, T.: Reviewer recommendation for pull-requests in GitHub: what can we learn from code review and bug assignment? Inf. Softw. Technol. 74, 204–218 (2016)
11. Zhang, Z., Sun, H., Zhang, H.: Developer recommendation for Topcoder through a meta-learning based policy model. Empir. Softw. Eng. 25, 859–889 (2020)
12. Zhou, J., et al.: Graph neural networks: a review of methods and applications. AI Open 1, 57–81 (2020)
13. Zhu, J., Shen, B., Hu, F.: A learning to rank framework for developer recommendation in software crowdsourcing. In: Asia-Pacific Software Engineering Conference, pp. 285–292 (2015)

Novel Method for Radar Echo Target Detection

Zhiwei Chen(B), Dechang Pi, Junlong Wang, and Mingtian Ping

Nanjing University of Aeronautics and Astronautics, Nanjing 211106, China
[email protected]

Abstract. Radar target detection, as one of the pivotal techniques in radar systems, aims to extract valuable information such as target distance and velocity from the received energy echo signals. However, with the advancements in aviation and electronic information technology, there have been profound transformations in the radar detection targets, scenarios, and environments. The majority of conventional radar target detection methods are primarily based on Constant False Alarm Rate (CFAR) techniques, which rely on certain distribution assumptions. However, when the detection scenarios become intricate or dynamic, the performance of these detectors is significantly influenced. Therefore, ensuring the robust performance of radar target detection models in complex task scenarios has emerged as a crucial concern. In this paper, we propose a radar target detection method based on a hybrid architecture of convolutional neural networks and autoencoder networks. This approach comprises clutter suppression and target detection modules. We conducted ablation experiments and comparative experiments using publicly available radar echo datasets and simulated radar echo datasets. The ablation experiments validated the effectiveness of the clutter suppression module, while the comparative experiments demonstrated the superior performance of our proposed method compared to the alternative approaches in complex background scenarios. Keywords: Radar Target Detection · Convolutional Neural Network · AutoEncoder

1 Introduction

Radar target detection is a technology that utilizes radar signals to detect, identify, and track targets. In radar target detection, the radar emits signals that are reflected back by the targets, generating echo signals. By processing and analyzing these echo signals, valuable target information can be extracted from radar echoes that are contaminated with noise, clutter, and interference signals, enabling target detection and identification [1]. Furthermore, radar systems offer advantages over image-based data acquisition methods as they are not affected by weather conditions and lighting, providing higher accuracy and reliability. As a result, radar target detection techniques have wide-ranging applications in military, civilian, and remote sensing domains [2–4]. Currently, radar target detection methods can be categorized into two main categories: traditional radar signal processing techniques and artificial intelligence methods.



Traditional radar target detection methods based on signal processing employ techniques such as matched filtering, Doppler processing, and clutter suppression to enhance the signal-to-noise ratio. After preprocessing, radar target detection is performed using Constant False Alarm Rate (CFAR) detectors. Generally, traditional signal-processing-based radar target detection methods perform well in distinguishing targets in medium-to-high signal-to-noise ratio radar data. However, the performance of these models depends not only on the signal-to-noise ratio but also on the pre-defined parameters of the models.

In recent years, advancements in electronic information technology and aviation have led to increasingly complex radar target detection scenarios. Notably, the detection of small, slow-moving targets such as unmanned aerial vehicles (UAVs) poses significant challenges due to the complexity of the scenes and the accompanying clutter and multipath interference. These factors result in a sharp decrease in the signal-to-noise ratio of target echoes, making radar target detection in complex scenarios highly challenging. Traditional CFAR-based radar target detection methods, which are based on statistical theories and treat the environment or targets as random processes [5], face difficulties in predefining model parameters for complex scenes. This leads to significant performance variations of radar target detection models in different environmental backgrounds, indicating a lack of generality and robustness in the models.

On the other hand, with the rapid development of artificial intelligence technology, there has been increasing exploration of related techniques in the field of radar target detection [6–8]. Machine learning and deep learning models possess strong capabilities to abstract and extract features from large datasets, and they can accomplish complex learning tasks in an end-to-end manner. Currently, machine learning and deep learning models are mainly applied in the downstream classification and detection stages of radar target detection tasks [9]. However, the suppression of clutter still relies on traditional radar signal processing methods, leading to a significant dependency of model accuracy on upstream modules, particularly in complex detection tasks.

In this paper, we propose a deep learning-based radar target detection neural network model to address the challenges of predefining parameters for complex environments and the performance coupling issue that arises when combining traditional signal processing with artificial intelligence models. The proposed model directly uses target radar echo data for target distance and velocity estimation. Comparative experiments are conducted using publicly available radar echo datasets and simulated radar echo datasets. The proposed model demonstrates superior detection performance compared to the state-of-the-art models.

2 Related Work

Research on radar target detection methods can be divided into two categories from a theoretical perspective: traditional signal processing-based methods and machine learning/deep learning-based methods. Additionally, from the perspective of the input data format, it can be categorized into radar echo data, range-Doppler spectra, pulse range profiles, SAR images, PPI images, and time-frequency images.

In reference [10], Sciotti et al. validated that cell-averaging constant false alarm rate (CA-CFAR) detectors exhibit the highest detectability under a uniform background. Gandhi et al. in reference [11] demonstrated that CFAR detectors can suffer severe performance degradation in the presence of interfering targets or clutter background transitions. As a result, they proposed the Greatest-of (GO)-CFAR and Smallest-of (SO)-CFAR detectors based on the maximum and minimum selection of CFAR thresholds, respectively. S. Blake in [12] proposed Ordered Statistics CFAR (OS-CFAR) for multi-target scenarios.

As for artificial intelligence-based techniques, artificial intelligence approaches have also validated their potential for radar target detection tasks. S. M. D. Rizvi et al. in the literature [18] used an ANN for radar signal detection in a K-distribution environment. Two training algorithms were tested for the MLP architecture, back propagation (BP) and a genetic algorithm (GA). Simulation results show that the MLP architecture outperforms the classical CA-CFAR detector. Callaghan et al. explored new uses of machine learning techniques to address clutter suppression in the literature [35]. Akhtar et al. in reference [5] combined artificial neural network detectors with the GO-CFAR detector, developing and extending the use of neural networks. Through training, the neural network could identify targets in specified environments and satisfy the conditions proposed by traditional CFAR detectors, and the trained neural network provided better detection performance. Callaghan et al. explored new applications of machine learning techniques for clutter suppression in radar in reference [19]. They compared the results obtained from two machine learning techniques for clutter suppression in a maritime clutter environment. The experimental data was collected using an experimental S-band radar system called NetRAD. The radar system was observing coastal scenes, and the collected data was classified as targets or clutter using the K-Nearest Neighbors (KNN) and Support Vector Machine (SVM) algorithms. In reference [20], Wang et al. analyzed the potential application of Convolutional Neural Networks (CNNs) for target detection in radar. They designed a CNN-based detector and demonstrated its performance through comparisons with traditional signal processing techniques applied to the data. Yavuz et al. proposed a deep learning-based radar target detection technique to replace conventional radar signal processing techniques in reference [24]. The proposed technique involves a Convolutional Neural Network (CNN) whose input is the range-Doppler ambiguity function. The method was compared with the classical CA-CFAR detector, and the results showed that the proposed method outperformed CFAR by approximately a factor of 2 in detection probability (PD) and by approximately a factor of 10 in false alarm probability (PFA) with minimal computational cost. References [26, 27], and [28] focus on preprocessing radar data or converting SAR images and PPI images to different data formats for target detection using deep learning algorithms, specifically the Faster R-CNN object detection algorithm, which relies on deep learning for noise separation and target recognition in images. In reference [32], Jiang et al. proposed a novel CNN-based radar target detector that directly locates targets in multi-dimensional space, including range, velocity, azimuth, and elevation, using radar echo data. Compared to classical radar signal processing methods, the proposed approach does not require preprocessing time for radar signal processing and provides improved detection performance.

3 Proposed Method

In this paper, we propose a model architecture based on deep learning methods to address the limitations of traditional radar target detection methods and the significant degradation of detection performance in complex scenarios. Our approach directly separates the real and imaginary parts of the 2D target radar echo sequence and treats them as input channels of a multi-channel data model. Through two sub-network modules for feature extraction and target detection, the model is capable of accurately estimating the target's velocity and distance. The feature extraction network, constructed with a noise-reducing autoencoder and convolutional neural networks, aims to effectively suppress clutter noise in the radar echo data and greatly improve the performance of the detection module. The model architecture is illustrated in Fig. 1.

Fig. 1. Model Architecture (clutter suppression and feature extraction net; target detection net).

3.1 Clutter Suppression and Feature Extraction Net

Radar targets are often immersed in complex clutter, so effective suppression of clutter is key to improving detection performance. We split the input two-dimensional radar echo data into real and imaginary parts and treat them as multi-channel two-dimensional data similar to image data, so we use a convolutional neural network as the main feature extraction module. The clutter suppression and feature extraction network consists of two parallel sub-modules. One is a feature extraction structure based on a convolutional neural network, which uses a five-layer convolutional structure, chooses max pooling as the pooling method, and uses the ReLU activation function:

ReLU(x) = x, if x ≥ 0;  0, if x < 0    (1)
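A minimal PyTorch sketch of such a five-layer convolutional branch with max pooling and ReLU, assuming 2-channel input (real and imaginary parts of an echo window); the channel widths and kernel sizes are illustrative, not the paper's exact configuration.

import torch
import torch.nn as nn

def conv_block(in_ch, out_ch):
    return nn.Sequential(
        nn.Conv2d(in_ch, out_ch, kernel_size=3, padding=1),
        nn.ReLU(inplace=True),
        nn.MaxPool2d(kernel_size=2),
    )

# Five-layer convolutional feature extraction branch.
cnn_branch = nn.Sequential(
    conv_block(2, 16),    # input: real/imaginary channels
    conv_block(16, 32),
    conv_block(32, 64),
    conv_block(64, 64),
    conv_block(64, 64),
)

echo_window = torch.randn(8, 2, 64, 64)   # batch of 2-channel echo windows
features = cnn_branch(echo_window)        # shape (8, 64, 2, 2)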

Fig. 2. Dropout Network

(l)

rj ∼ Bernoulli(p)

(2)

y˜ l = r (l) × y(l)

(3)

Novel Method for Radar Echo Target Detection (l+1)

zi

(l+1) l

= wi

(l+1)

yi

(l+1)

591

y˜ + bi

(4)

  (l+1) = f zi

(5)

3.3 Loss Function Design The model tasks are two specific tasks, target velocity detection and target range detection, and are therefore identified as multi-task regression models. In multi-task learning it is desired that multiple related tasks are trained together, and it is desired that different tasks can promote each other to obtain better results on a single task. For the regression task, the loss functions L1-Loss and L2-Loss with Smooth L1-Loss [34] are usually used, as shown in formulas 6–8. 1 |yi − f (xi )| n

(6)

1 (yi − f (x1 ))2 n

(7)

n

L1 (x, y) =

i=1 n

L2 (x, y) =

i=1

n  1  0.5 × (yi − f (x))2 , |yi − f (x)| < 1 Smooth L1 (x, y) = |yi − f (x)| − 0.5, |yi − f (x)| ≥ 1 n

(8)

i=1

L1-Loss does not suffer from gradient explosion, but it cannot be differentiated at the center point and is often used for simple models. L2-Loss is continuously smooth at each point, making it easy to differentiate and converge faster than L1-Loss, but it may cause gradient explosion and is often used for problems with low-dimensional numerical features. Smooth L1-Loss combines the advantages of L1-Loss and L2-Loss while cleverly avoiding the problems caused by both, so it is chosen as the single-task loss function. On top of that, the corresponding loss function weights were dynamically set using the Cov-Weighting method [33]. Cov-Weighting calculates the weight of each task’s loss function L by the mean and standard deviation of the single loss, μL , σL , assuming that the optimization goal of a task is achieved when the variance of its loss tends to zero. The specific formulas for calculating the corresponding task loss function weights are shown in formula 9–11. σL μL

(9)

Lt μLt−1

(10)

cL = rt = wi,t =

σri,t n  i=1

cri,t μri,t

(11)

592

Z. Chen et al.

where CL represents the relative standard deviation, rt represents the rate of change of loss, and t is the number of training rounds to obtain the final loss function, which is given in formula 12. ⎧ ⎨ Losstotal = w1,t Lossrange + w2,t Loss  velocity 2 n (12) 1  0.5 × (yi − f (xi )) , |yi − f (xi )| < 1 ⎩ Lossrange = Lossvelocity = n |y |y − f (x )| − 0.5, − f (x )| ≥ 1 i i i i i=1

4 Experiments 4.1 Datasets Description Due to the special nature of radar data, currently available radar echo data for radar target detection is scarce. Therefore, in this paper, software simulation was used to construct a radar echo dataset, along with publicly available radar echo sequence data for dim target detection and tracking datasets [13], for model validation and evaluation with multiple types of data and evaluation metrics. Among them, the parameters of the public dataset are shown in Table 1 of relevant parameters of the dataset, and the object of data collection is fixed-wing unmanned aerial vehicles in the air. The turntable and the target unmanned aerial vehicle are equipped with GPS communication stations to guide the computer to receive and calculate the GPS information of the two, and real-time sending of the calculated results such as angles and distances is used to guide the radar servo rotation to track the unmanned aerial vehicle target to maintain it in the field of view of the radar. At the same time, the data recording equipment synchronously collects and saves the echoes. This paper will use the background sub-dataset of the data set for experiments, with a data signal-to-noise ratio of 3 dB, 7 dB, 15 dB, and 20 dB. For the simulated data set, a linear frequency modulated pulse signal is used as the transmitted waveform. The data are simulated by setting the operating parameters, waveform parameters and target characteristics of the radar, with the clutter type as background terrain, to simulate targets with different signal-to-noise ratios. Table 1. Relevant Parameters of the Public Dataset Information

Parameters

Radar carrier frequency

35 GHz

Pulse repetition rate

32 kHz

Clutter background

Land

Modulation

Linear Frequency Modulation

Novel Method for Radar Echo Target Detection

593

4.2 Performance Metrics For the radar target detection problem, the DR ndis generally used for the performance evaluation of the model, whose if the target to be detected in the data set is N , the number of model output detection targets is Nd , Rd denotes the model output distance, Rgt denotes the true distance of the target, Rerror denotes the distance error, Vd denotes the model output velocity, Vgt denotes the true velocity of the target, Verror denotes the velocity error, Vthreshold denotes the acceptable velocity error threshold, Rthreshold denotes the acceptable distance error threshold. Considering that the difficulty of the model’s target detection task is naturally affected when the input signal-to-noise ratio changes, we set the acceptable distance and speed error values as variables related to the signal-to-noise ratio, and the distance and speed measurement errors can be considered as valid detection targets only when they are within the acceptable error range Nerror , and the relevant metrics are defined as shown in formulas 13–16. (13) Rerror = Rd − Rgt Verror = Vd − Vgt Ncorr

(14)



Rthreshold Vthreshold & Verror ≤ √ = Rerror ≤ √ SNR SNR

(15)

Ncorr N

(16)

DR =
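The metric can be computed directly from formulas (13)-(16); the sketch below assumes detections are already matched one-to-one with ground-truth targets, converts the SNR from dB to a linear value before taking the square root, and uses illustrative threshold defaults that are not from the paper.

```python
import numpy as np

def detection_rate(r_d, r_gt, v_d, v_gt, snr_db,
                   r_threshold=5.0, v_threshold=1.0):
    """Formulas (13)-(16): a detection is correct when both the distance and the
    velocity error fall within thresholds shrunk by sqrt(SNR).
    Assumes detections are paired one-to-one with ground-truth targets;
    threshold defaults are illustrative."""
    snr = 10 ** (np.asarray(snr_db) / 10)                   # dB to linear SNR
    r_error = np.abs(np.asarray(r_d) - np.asarray(r_gt))    # formula (13)
    v_error = np.abs(np.asarray(v_d) - np.asarray(v_gt))    # formula (14)
    correct = (r_error <= r_threshold / np.sqrt(snr)) & \
              (v_error <= v_threshold / np.sqrt(snr))       # formula (15)
    return correct.sum() / correct.size                     # DR = N_corr / N, formula (16)
```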

4.3 Experimental Details

The model input is two-dimensional radar echo data partitioned into time windows: each input has size equal to the number of sampling points per pulse multiplied by the number of pulses in the window (the window length divided by the pulse repetition interval); a minimal windowing sketch is given after the ablation discussion below. The model is trained with the Adam optimizer.

4.4 Ablation Experiments

To validate the effectiveness of the clutter suppression and feature extraction modules in our proposed model, we conducted ablation experiments with three configurations: the target detection network alone (TDN), traditional signal processing combined with the target detection network (TDN with TSP), and our complete model. The experiments were run on the simulated data, and the results are presented in Fig. 3. They show that the clutter suppression and feature extraction module we designed is effective in our method and, when combined with the target detection network, has an advantage over traditional signal processing methods.
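Returning to the input preparation of Sect. 4.3, the following is a minimal sketch of slicing a complex echo matrix into fixed time windows and stacking real and imaginary parts as two image-like channels. The window length, array shapes, and tensor layout are assumptions for illustration, not the paper's exact settings.

```python
import numpy as np
import torch

def echoes_to_windows(echo: np.ndarray, pulses_per_window: int) -> torch.Tensor:
    """echo: complex array of shape (num_pulses, samples_per_pulse).
    Returns a float tensor of shape
    (num_windows, 2, pulses_per_window, samples_per_pulse)
    with real and imaginary parts as separate channels (layout is an assumption)."""
    num_windows = echo.shape[0] // pulses_per_window
    echo = echo[:num_windows * pulses_per_window]
    windows = echo.reshape(num_windows, pulses_per_window, echo.shape[1])
    stacked = np.stack([windows.real, windows.imag], axis=1)   # channel dim = 2
    return torch.from_numpy(stacked).float()

# Example: a 0.1 s window at a 32 kHz PRF contains 3200 pulses (values illustrative).
prf, window_seconds = 32e3, 0.1
pulses_per_window = int(window_seconds * prf)
dummy_echo = np.random.randn(6400, 512) + 1j * np.random.randn(6400, 512)
batch = echoes_to_windows(dummy_echo, pulses_per_window)   # shape (2, 2, 3200, 512)
```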

Fig. 3. Ablation Experiment Results

4.5 Comparative Experiment

We compare against the classical CFAR-based detectors CA-CFAR [10] and GO-CFAR [11], which remain the most popular and common CFAR-based radar target detection methods, as well as the convolutional neural network-based radar target detection model of [32]. Because the labels of the public dataset differ from those used in [32], we modified that model and removed its azimuth and elevation detection modules for the comparison experiments.

We first conducted experiments on the real radar echo dataset, which contains echoes at high, medium, and low signal-to-noise ratios. Since deep learning models require separate training and test data, we drew training and test groups of the same data types from the complete dataset. The experimental results are shown in Fig. 4.

The simulated dataset provides a more comprehensive distribution of signal-to-noise ratios. We followed the same methodology as for the real dataset and obtained the performance of the models under different signal-to-noise ratios, shown in Fig. 5; the trends are similar to those observed on the real dataset.

In the real dataset, the target signal-to-noise ratios were around 3 dB, 7 dB, 15 dB, and 20 dB. The results show that our proposed model outperforms the comparison methods, and the advantage is more pronounced at lower signal-to-noise ratios: the maximum difference in detection rate reaches 17.5%, with an average performance difference of 6.84%. In the simulated dataset, the target signal-to-noise ratios were −5 dB, −2 dB, 0 dB, 7 dB, 12 dB, and 15 dB.

Fig. 4. Detection Rate on Real dataset

Fig. 5. Detection Rate on Simulated dataset

Similar conclusions can be drawn as for the real dataset: although the performance of the different models converges at high signal-to-noise ratios, our method still achieves the best average performance, with a maximum detection-rate difference of 10.1% and an average difference of 3.89%. Overall, our proposed model outperforms the comparison methods on the radar target detection task; the gap over the traditional methods is most evident under strong clutter and low signal-to-noise ratio in the real dataset, and our model also maintains a clear lead on the simulated dataset.
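For context on the CA-CFAR baseline [10], a minimal one-dimensional cell-averaging CFAR sketch is given below. The training/guard window sizes and false-alarm rate are illustrative, and the baseline used in the experiments operates on two-dimensional range-Doppler data, so its details differ from this sketch.

```python
import numpy as np

def ca_cfar_1d(power, num_train=16, num_guard=4, pfa=1e-4):
    """Classical cell-averaging CFAR over a 1-D power profile.
    Returns a boolean detection mask; parameter values are illustrative."""
    n = 2 * num_train                                  # total training cells
    alpha = n * (pfa ** (-1.0 / n) - 1.0)              # CA-CFAR threshold factor
    half = num_train + num_guard
    detections = np.zeros_like(power, dtype=bool)
    for i in range(half, len(power) - half):
        leading = power[i - half:i - num_guard]        # training cells before the CUT
        trailing = power[i + num_guard + 1:i + half + 1]  # training cells after the CUT
        noise_level = np.mean(np.concatenate([leading, trailing]))
        detections[i] = power[i] > alpha * noise_level
    return detections
```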

5 Conclusions

In this paper, we propose a radar target detection network based on a convolutional neural network and a convolutional autoencoder architecture. By separating the real and imaginary parts of the two-dimensional radar echo data and treating them as image-like data, we construct a clutter suppression and feature extraction network in which a convolutional neural network and a convolutional autoencoder extract features in parallel: the convolutional neural network extracts direct features from the data, while the convolutional autoencoder reconstructs the input to suppress clutter. The resulting features are fused and passed to the target detection network for radar target distance and velocity detection. Experimental results on real and simulated datasets show that our proposed method achieves better detection performance than the comparison methods and exhibits better robustness under strong clutter and low signal-to-noise ratio conditions.
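To summarize the data flow described above, a highly simplified PyTorch skeleton of the parallel design is sketched below. The layer sizes, the concatenation-based fusion, and the regression head are assumptions made for illustration, not the authors' exact architecture.

```python
import torch
import torch.nn as nn

class ParallelEchoNet(nn.Module):
    """Skeleton of the described pipeline: a CNN branch for direct features, a
    convolutional autoencoder branch for clutter-suppressed features, fusion,
    then range/velocity regression. All layer sizes are assumptions."""

    def __init__(self):
        super().__init__()
        self.cnn_branch = nn.Sequential(
            nn.Conv2d(2, 16, 3, padding=1), nn.ReLU(),
            nn.Conv2d(16, 32, 3, padding=1), nn.ReLU())
        self.encoder = nn.Sequential(
            nn.Conv2d(2, 16, 3, stride=2, padding=1), nn.ReLU(),
            nn.Conv2d(16, 32, 3, stride=2, padding=1), nn.ReLU())
        self.decoder = nn.Sequential(
            nn.ConvTranspose2d(32, 16, 4, stride=2, padding=1), nn.ReLU(),
            nn.ConvTranspose2d(16, 32, 4, stride=2, padding=1), nn.ReLU())
        self.head = nn.Sequential(
            nn.AdaptiveAvgPool2d(1), nn.Flatten(), nn.Linear(64, 2))  # [range, velocity]

    def forward(self, x):                        # x: (batch, 2, pulses, samples)
        direct = self.cnn_branch(x)              # direct features
        suppressed = self.decoder(self.encoder(x))  # autoencoder-style features
        fused = torch.cat([direct, suppressed], dim=1)  # simple fusion by concatenation
        return self.head(fused)                  # predicted range and velocity
```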

References

1. Richards, M.A.: Fundamentals of Radar Signal Processing, 1st edn. McGraw-Hill, New York (2005)
2. Pieraccini, M., Miccinesi, L., Rojhani, N.: RCS measurements and ISAR images of small UAVs. IEEE Aerosp. Electron. Syst. Mag. 32(9), 28–32 (2017)
3. Liang, C., et al.: UAV detection using continuous wave radar. In: Proceedings of the IEEE International Conference on Information Communication and Signal Processing (ICICSP), pp. 1–5. IEEE (2018)
4. Miao, Y., et al.: Efficient multipath clutter cancellation for UAV monitoring using DAB satellite-based PBR. Remote Sens. 13(17), 3429 (2021)
5. Melvin, W.L., Scheer, J.A. (eds.): Principles of Modern Radar. SciTech Publishing, Raleigh (2013)
6. Short, R., Fukunaga, K.: The optimal distance measure for nearest neighbor classification. IEEE Trans. Inf. Theory 5, 622–627 (1981)
7. Mountrakis, G., et al.: Support vector machines in remote sensing: a review. ISPRS J. Photogramm. Remote Sens. 66, 247–259 (2011)
8. Youngwook, K.: Application of machine learning to antenna design and radar signal processing: a review. In: Proceedings of the 2018 International Symposium on Antennas and Propagation (ISAP), Busan, Korea (2018)
9. del Rey-Maestre, N., Jarabo-Amores, M.P., Mata-Moya, D., Humanes, J., Hoyo, P.: Machine learning techniques for coherent CFAR detection based on statistical modeling of UHF passive ground clutter. IEEE J. Sel. Top. Signal Process. 12, 104–118 (2018)
10. Sciotti, M., Lombardo, P.: Performance evaluation of radar detection schemes based on CA-CFAR against K-distributed clutter. In: Proceedings of the 2001 CIE International Conference on Radar, Beijing, China, pp. 345–349 (2001)
11. Gandhi, P., Kassam, S.: Optimality of the cell averaging CFAR detector. IEEE Trans. Inf. Theory 40, 1226–1228 (1994)
12. Blake, S.: OS-CFAR theory for multiple targets and nonuniform clutter. IEEE Trans. Aerosp. Electron. Syst. 24(6), 785–790 (1988)
13. Song, Z.Y., Hui, B.W., Fan, H.Q., et al.: A dataset for detection and tracking of dim aircraft targets through radar echo sequences. Chinese Scientific Data, 277–290 (2020)
14. Bao, Z., Jiang, Q., Liu, F., et al.: Optimization of track before detect algorithm based on particle filter. Modern Radar, 21–27 (2020)
15. Ito, N., Godsill, S.: A multi-target track-before-detect particle filter using superpositional data in non-Gaussian noise. IEEE Signal Process. Lett. 27, 1075–1079 (2020)
16. Wang, L.L., Zhou, G.J.: A unified M pseudo-spectrum for track-before-detect of targets with motion model uncertainty. Digital Signal Process. 114, 103078 (2021)
17. Yin, L., Zhang, Y., Wang, S., et al.: A review of histogram probabilistic multi-hypothesis tracking method techniques. In: System Engineering and Electronic Technology, pp. 3118–3125 (2021)
18. Rohling, H.: New CFAR-processor based on an ordered statistic. In: Proceedings of the IEEE 1985 International Radar Conference, Arlington, VA, USA (1985)
19. Callaghan, D., Burger, J., Mishra, A.K.: A machine learning approach to radar sea clutter suppression. In: Proceedings of the IEEE Radar Conference, Seattle, WA, USA (2017)
20. Wang, L., Tang, J., Liao, Q.: A study on radar target detection based on deep neural networks. IEEE Sens. Lett. 3, 1–4 (2019)
21. Gouri, A., Mezache, A., Oudira, H.: Radar CFAR detection in Weibull clutter based on zlog(z) estimator. Remote Sens. Lett. 11, 581–589 (2020)
22. Zhang, B.Q., Zhou, J., Xie, J.H., et al.: Weighted likelihood CFAR detection for Weibull background. Digital Signal Process. 115, 103079 (2021)
23. Zebiri, K., Mezache, A.: Radar CFAR detection for multiple-target situations for Weibull and log-normal distributed clutter. Signal Image Video Process., 1671 (2021)
24. Yavuz, F., Kalfa, M.: Radar target detection via deep learning. In: 2020 28th Signal Processing and Communications Applications Conference (SIU), pp. 1–4 (2020)
25. Pan, M., Chen, J., Wang, S., Dong, Z.: A novel approach for marine small target detection based on deep learning. In: Proceedings of the IEEE 4th International Conference on Signal and Image Processing, Wuxi, China, pp. 395–399 (2019)
26. Kang, M., Leng, X., Lin, Z., Ji, K.: A modified faster R-CNN based on CFAR algorithm for SAR ship detection. In: Proceedings of the 2017 International Workshop on Remote Sensing with Intelligent Processing (RSIP), Shanghai, China (2017)
27. Mou, X., Chen, X., Guan, J.: Marine target detection based on improved faster R-CNN for navigation radar PPI images. In: Proceedings of the 2019 International Conference on Control, Automation and Information Sciences, Chengdu, China (2019)
28. Su, N., Chen, X., Guan, J., Huang, Y.: Maritime target detection based on radar graph data and graph convolutional network. IEEE Geosci. Remote Sens. Lett. 19, 1–5 (2022)
29. Chen, L., Guan, Q., Feng, B., Yue, H., Wang, J., Zhang, F.: A multi-convolutional autoencoder approach to multivariate geochemical anomaly recognition. Minerals 9, 270 (2019)
30. Hinton, G.E., et al.: Improving neural networks by preventing co-adaptation of feature detectors. arXiv preprint arXiv:1207.0580 (2012)
31. Liu, B.Y., Song, C.Y., Fan, H.Q.: Focus, detect, track, refocus: real-time small target detection technology based on classical pre-tracking detection framework. Air Weapon, 10–16 (2019)
32. Jiang, W., Ren, Y., Liu, Y., et al.: A method of radar target detection based on convolutional neural network. Neural Comput. Appl. (2021)
33. Groenendijk, R., Karaoglu, S., Gevers, T., et al.: Multi-loss weighting with coefficient of variations. In: Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, pp. 1469–1478 (2021)
34. Girshick, R.: Fast R-CNN. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 1440–1448 (2015)
35. Cheikh, K., Faozi, S.: Application of neural networks to radar signal detection in K-distributed clutter. In: First International Symposium on Control, Communications and Signal Processing, pp. 295–298 (2004)

Correction to: High-Resolution Self-attention with Fair Loss for Point Cloud Segmentation

Qiyuan Liu, Jinzheng Lu, Qiang Li, and Bingsen Huang

Correction to: Chapter 27 in: B. Luo et al. (Eds.): Neural Information Processing, LNCS 14451, https://doi.org/10.1007/978-981-99-8073-4_27

In the originally published printed version of chapter 27, Fig. 4 and Fig. 5 both showed the same figure. This has been corrected.

The updated version of this chapter can be found at https://doi.org/10.1007/978-981-99-8073-4_27

© The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2024
B. Luo et al. (Eds.): ICONIP 2023, LNCS 14451, p. C1, 2024. https://doi.org/10.1007/978-981-99-8073-4_46

Author Index

A Akash, Bathini Sai 240

B Bai, Jinxu 253 Bai, Minhao 109 Bei, Yijun 137 C Chan, Matthew 188 Chang, Kan 548 Chang, Tao 411 Che, Lu 3 Chen, Haoping 224 Chen, Huanlei 430 Chen, Jichuan 522 Chen, Menggang 202 Chen, Peng 3 Chen, Rongshan 265 Chen, Shiyu 3 Chen, Yanquan 29 Chen, Yunfei 318 Chen, Zengqiang 97 Chen, Zhiwei 586 Cheng, Lianglun 430 D Dai, Changyu 472 Dang, Sizhe 83 Deng, Ruting 29 Deng, Zhiyu 3 Ding, Aolin 188 Dong, Xiao 383 Du, Junwei 575 F Feng, Zunlei 137 Fu, Hongguang 399

G Gao, Kewei 137 Gao, Yue 41, 109 Geng, Jinsong 137 Gong, Jun 3 Guan, Ruijin 29 Guo, Duanling 548 Guo, Lei 575 Guo, Xiaoting 411 Guo, Xingran 497 Guo, Yuyang 83 H Hass, Amin 188 Hdaib, Moe 69 He, Lirong 331 Hu, Cong 279 Hu, Liang 122 Hu, Liaojie 443 Huang, Bingsen 344 Huang, Junrong 162 Huang, Wenqi 137 Huang, Wenxin 383 Huang, Yongfeng 41, 109 J Ji, Qingbing 411 Jiang, Minghu 109 Jiang, Zhaoyi 357 Jin, Ao 175 Jin, Qiqiang 291 K Kang, Zhao 399 Khorin, Aleksandr 305 King, Irwin 535 Krishna, Aneesh 55, 240 Kuang, Wenqing 29 Kumar, Lov 240

© The Editor(s) (if applicable) and The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2024 B. Luo et al. (Eds.): ICONIP 2023, LNCS 14451, pp. 599–601, 2024. https://doi.org/10.1007/978-981-99-8073-4

L Lam, Kwok-Yan 370 Li, Chuandong 202 Li, Haidong 357 Li, Huaqing 202 Li, Jun 509 Li, Minghong 548 Li, Qiang 344 Li, Rui 83 Li, Tianyang 253 Li, Yinan 318 Li, Zhan 29 Li, Zhixin 443 Li, Zixuan 211 Liang, Maohan 291 Liao, Xueying 497 Lin, Chaoxiong 455 Ling, Mingyang 548 Liu, Erteng 137 Liu, Gaowen 83 Liu, Guangya 383 Liu, Huan 83 Liu, Jiachang 122 Liu, Jinhuan 575 Liu, Qiyuan 344 Liu, Renyang 370 Liu, Ryan Wen 291 Liu, Ye 455 Long, Jun 318 Lu, Jinzheng 344 Lu, Yan 575 M Ma, Ruixin 291 Mou, Yuchen 455 O Ovchinnikova, Darya 305 P Pan, Lei 69 Pan, Xinglin 535 Pan, Yuchen 509 Pang, Kaiyi 109 Pang, Yonghua 443 Patel, Anoop Kumar 240 Peng, Anjie 563 Pham, Duc-Son 55 Pi, Dechang 586

Ping, Mingtian 586 Pu, Hongshan 455 Q Qiu, Binbin 279

R Rajasegarar, Sutharshan 69 S Sehatbakhsh, Nader 188 Shang, Jun 3 Sheng, Hao 265 Sheng, Yongpan 331 Shi, Chongyang 122 Shi, Yawei 202 Singh, Vikram 240 Singha, Tanmay 55 Song, Chao 357 Song, Yonghong 149 Su, Xiangrui 122 Sun, Daobo 575 Sun, Hao 97 Sun, Jinze 331 Sun, Lijun 575 Sun, Mingwei 97 Sun, Qinglin 97 T Tan, Ning 279 Tang, Yahui 548 Tao, Jin 97 W Wang, Baowei 472 Wang, Chao 522 Wang, Chenwei 162 Wang, Huili 41 Wang, Huiwei 202 Wang, Jie 357 Wang, Junlong 586 Wang, Meng-xiang 383 Wang, Shuai 265 Wang, Xiaodong 411 Wang, Xiaojie 455 Wang, Xukai 224 Wen, Yanbo 202 Wu, Wei 15 Wu, Xiaoyao 149

Wu, Xinya 484 Wu, Yanrui 318 Wu, Yingtao 430 Wu, Yufeng 472 Wu, Zhichao 175 X Xia, Qianchen 175 Xiao, Wei 411 Xiong, Zhang 265 Xu, BaiSheng 15 Xu, Hanwen 253 Xu, Jianhua 509 Xu, Jinhua 484 Xu, Zenglin 399, 535 Xue, Yulin 509 Y Yan, Caixia 83 Yang, Bailin 357 Yang, Bin 522 Yang, Da 265 Yang, Jinshan 97 Yang, Jinshuai 41, 109 Yang, Ming 211 Yang, Xiangli 535 Yang, Xin 175 Yang, Zhan 318 Yang, Zhongliang 41, 109 Ye, HaoHui 15 Yin, Jian 383 Yin, Qingsong 411 Yu, Haizheng 497 Yu, Haohao 575 Yu, Jian 211

Yu, Jianxing 383 Yu, Ruiguo 211 Yu, Wei 411 Yu, Wenxin 3 Yu, Xu 575 Yuan, Weijun 29 Yudin, Dmitry 305 Z Zemskova, Tatiana 305 Zeng, Hui 563 Zhan, Ling 331 Zhang, Canlong 443 Zhang, Hongbin 430 Zhang, Jinhong 370 Zhang, Ju 522 Zhang, Qi 122 Zhang, Qian 399 Zhang, Tong 563 Zhang, Weiwen 430 Zhang, Wenbin 211 Zhang, Yaling 522 Zhang, Yihui 279 Zhang, Yu 522 Zhang, Yuanzhe 291 Zhang, Zhiqiang 3 Zhao, Jun 370 Zhao, Mankun 211 Zheng, Dong 357 Zheng, Yuemin 97 Zhou, Wei 370 Zhu, Huaijie 383 Zhu, Li 175 Zonouz, Saman 188 Zou, Yicheng 522