Neural Information Processing: 30th International Conference, ICONIP 2023, Changsha, China, November 20–23, 2023, Proceedings, Part VI (Lecture Notes in Computer Science, 14452) 9819980755, 9789819980758

The six-volume set LNCS 14447–14452 constitutes the refereed proceedings of the 30th International Conference on Neural Information Processing, ICONIP 2023, held in Changsha, China, in November 2023.


English Pages 523 [521] Year 2023


Table of contents :
Preface
Organization
Contents – Part VI
Applications
MIC: An Effective Defense Against Word-Level Textual Backdoor Attacks
1 Introduction
2 Related Work
2.1 Backdoor Attack
2.2 Backdoor Defense
3 Methodology
3.1 Triple Metric Learning
3.2 Contrastive Learning Based on Semantic Negative Examples
4 Experiments
4.1 Experimental Settings
4.2 Main Results
4.3 Ablation Experiment
4.4 Hyper-parameter Study
5 Conclusion
References
Active Learning for Open-Set Annotation Using Contrastive Query Strategy
1 Introduction
2 Related Work
3 Methodology
3.1 Problem Definition
3.2 Algorithm Detail
4 Experiments
4.1 Baselines
4.2 Experiment Settings
4.3 Performance Evaluation
4.4 Information Gain Analysis
5 Conclusion
References
Cross-Domain Bearing Fault Diagnosis Method Using Hierarchical Pseudo Labels
1 Introduction
2 Related Work
3 Proposed Method
3.1 Problem Formulation
3.2 Network Architecture
3.3 Domain Level Confusion Training
3.4 Class Level Confusion Training
4 Experiments
4.1 Datasets Description
4.2 Comparison Methods
4.3 Implementation Details
4.4 Results and Analysis
5 Conclusions
References
Differentiable Topics Guided New Paper Recommendation
1 Introduction
2 Related Work
3 New Paper Recommendation Method
3.1 Problem Definition
3.2 Overall Framework
3.3 NTM-Based Subspace Representation
3.4 GCN-Based Asymmetric Topic Propagation
3.5 User Interest Prediction
4 Experiments
4.1 Experimental Settings
4.2 Results
5 Conclusion
References
IIHT: Medical Report Generation with Image-to-Indicator Hierarchical Transformer
1 Introduction
2 Related Work
3 Method
3.1 Classifier Module
3.2 Indicator Expansion Module
3.3 Generator Module
4 Experimental Setup
4.1 Data
4.2 Implementation
4.3 Metrics
5 Experimental Results
5.1 Report Generation
5.2 Ablation Study
6 Conclusion
References
OD-Enhanced Dynamic Spatial-Temporal Graph Convolutional Network for Metro Passenger Flow Prediction
1 Introduction
2 Related Work
3 Problem Formulation
4 Methodology
4.1 Overview
4.2 Static Spatial Correlation Module
4.3 Dynamic Spatial Correlations Module
4.4 Multi-resolution Temporal Dependency Module
4.5 Multi-component Fusion
5 Experiments
5.1 Datasets
5.2 Experimental Setups
5.3 Experimental Results
6 Conclusion
References
Enhancing Heterogeneous Graph Contrastive Learning with Strongly Correlated Subgraphs
1 Introduction
2 Related Work
2.1 Graph Representation Learning
2.2 Graph Contrastive Learning
3 Preliminaries
4 Method
4.1 Overall Framework
4.2 Strongly Correlated Subgraph Selection and Encoding
4.3 Feature Shifting and Loss Calculation
5 Experiments
5.1 Experiment Setup
5.2 Results and Analysis (RQ1)
5.3 Ablation Study (RQ2)
6 Conclusions
References
DRPDDet: Dynamic Rotated Proposals Decoder for Oriented Object Detection
1 Introduction
2 Proposed Method
2.1 The Overall Network Framework of DRPDDet
2.2 Harmony RPN
2.3 Dynamic Rotated Proposals Decoder (DRPD)
3 Experiments
3.1 Datasets
3.2 Experimental Details
3.3 Ablation Study
3.4 Contrast Test
4 Conclusion
References
MFSFFuse: Multi-receptive Field Feature Extraction for Infrared and Visible Image Fusion Using Self-supervised Learning
1 Introduction
2 Proposed Method
2.1 Overall Framework
2.2 Feature Extraction and Fusion Module
2.3 Decomposition Module
2.4 Loss Function
3 Experiments
3.1 Datasets and Training Details
3.2 Fusion Metrics
3.3 Comparative Experiment
3.4 Ablation Study
3.5 Execution Time
4 Conclusion
References
Progressive Temporal Transformer for Bird's-Eye-View Camera Pose Estimation
1 Introduction
2 Related Work
2.1 Dataset for Camera Relocalization
2.2 Camera Relocalization
3 Method
3.1 Network Architecture
3.2 Loss Function
3.3 Implementation Details
4 Aerial Visual Localization Dataset
4.1 Sensor Setup
4.2 BEV Dataset
5 Experiments
6 Conclusion
References
Adaptive Focal Inverse Distance Transform Maps for Cell Recognition
1 Introduction
2 Related Work
3 Methods
3.1 Overall Framework
3.2 AFIDT Module
3.3 Match Module
3.4 NMS Module
3.5 Loss Function
4 Experiments
4.1 Dataset
4.2 Implementation Details
4.3 Experimental Results
4.4 Visualization Analysis
5 Conclusion
References
Stereo Visual Mesh for Generating Sparse Semantic Maps at High Frame Rates
1 Introduction
2 Related Work
3 The Visual Mesh
4 The Stereo Visual Mesh
4.1 Linking Two Visual Meshes
4.2 Binary Space Partition
4.3 Visual Mesh Graph Traversal
5 CNN Architecture
6 Dataset
7 Linked Mono Network
8 Training
9 Results
10 Limitations
11 Conclusion
References
Micro-expression Recognition Based on PCB-PCANet+
1 Introduction
2 Proposed Method
2.1 PCANet+ Feature Extraction
2.2 Multi Branch LSTMs Based on Facial ROIs
2.3 Multi-branch Fusion and Classification
3 Experimental Results and Analysis
3.1 Dataset
3.2 Performance Metric
3.3 Different Methods of Feature Tensor Segmentation
3.4 The Influence of Feature Weight
3.5 Model Parameter Optimization
3.6 Comparison with Other Methods
4 Conclusion
References
Exploring Adaptive Regression Loss and Feature Focusing in Industrial Scenarios
1 Introduction
2 Related Work
2.1 Universal Object Detection
2.2 Regression Loss Function for Object Detection
3 Method
3.1 Overview
3.2 Encoder
3.3 Decoder
3.4 Loss Function
4 Experiments and Analysis
4.1 Parameter Settings and Experimental Environment
4.2 Experimental Datasets
4.3 Comparative Experiment
4.4 Ablation Experiment
5 Conclusions
References
Optimal Task Grouping Approach in Multitask Learning
1 Introduction
2 Related Work
3 Data Representation
4 Problem Formulation
5 Proposed Methodology
5.1 Data Preparation
5.2 Multitask Network
5.3 Training Multi-task Network
5.4 Individual Task Transference Quantification into Group
6 Experimental Evaluation and Results
6.1 RQ1: Vehicle Task Grouping Results
6.2 RQ2: Vehicle Behavior Transference Quantification Results
7 Discussion and Conclusion
References
Effective Guidance in Zero-Shot Multilingual Translation via Multiple Language Prototypes
1 Introduction
2 Background and Motivation
3 Method
3.1 Baseline Strategies
3.2 Language Representative Prototypes
3.3 Integration of Multiple TLP
4 Experiments and Analysis
4.1 Experimental Settings
4.2 Overall Performance
4.3 Ablation Study
5 Conclusion
References
Extending DenseHMM with Continuous Emission
1 Introduction
2 Gaussian Dense Hidden Markov Model
3 Basic Learning Algorithm
4 Region-Based Co-occurrence Learning Algorithm
4.1 Properties Study
4.2 Recommendation Saturation Simulation
5 Conclusions
References
An Efficient Enhanced-YOLOv5 Algorithm for Multi-scale Ship Detection
1 Introduction
2 Method
2.1 Overall Framework
2.2 MetaAconC-Inspired Dynamic Spatial-Channel Attention Module
2.3 Gradient-Refined Bounding Box Regression Module
2.4 Taylor Expansion-Based Classification Module
3 Experiments
3.1 Experimental Settings
3.2 Quantitative Analysis
3.3 Ablation Studies
3.4 Qualitative Analysis
4 Conclusion
References
Double-Layer Blockchain-Based Decentralized Integrity Verification for Multi-chain Cross-Chain Data
1 Introduction
2 Preliminaries
2.1 Bilinear Pairing
2.2 BLS Signature
2.3 Verifiable Random Function
3 Problem Statement
3.1 System Model
3.2 Threat Models
3.3 Design Goals
4 Double-Layer Blockchain-Based Decentralized Integrity Verification Scheme
4.1 Overview
4.2 Integrity Consensus
4.3 Block Consensus
5 Security Analysis
6 Performance Evaluation
6.1 Experimental Settings
6.2 Experimental Results and Analysis
7 Conclusion
References
Inter-modal Fusion Network with Graph Structure Preserving for Fake News Detection
1 Introduction
2 Related Works
2.1 Fake News Detection
2.2 Graph Neural Networks
2.3 Contrastive Learning
3 Our Approach
3.1 Inter-modal Cross-Layer Fusion (ICF) Module
3.2 Graph Structure Preserving (GSP) Module
3.3 Multi-modal Fusion (MMF) Module
4 Experiment
4.1 Datasets
4.2 Baseline
4.3 Implementation Details
4.4 Results
4.5 Ablation Experiments
5 Conclusion
References
Learning to Match Features with Geometry-Aware Pooling
1 Introduction
2 Related Works
3 The Proposed Method
3.1 Feature Encoder
3.2 Geometry-Aware Attention Module
3.3 Matching Determination
3.4 Loss Formulation
4 Experiments
4.1 Homography Estimation
4.2 Relative Pose Estimation
4.3 Visual Localization
4.4 Ablation Study
5 Conclusion
References
PnP: Integrated Prediction and Planning for Interactive Lane Change in Dense Traffic
1 Introduction
2 Background
2.1 Markov Decision Process (MDP)
2.2 TD Search
2.3 Trajectory Prediction
3 Method
3.1 Reactive Trajectory Prediction
3.2 Integrated Prediction and Planning
4 Experiments
4.1 Experiment Setup
4.2 Reactive Trajectory Prediction Model Evaluation
4.3 Lane Change Decision Performance
5 Conclusion
References
Towards Analyzing the Efficacy of Multi-task Learning in Hate Speech Detection
1 Introduction
2 Related Work
2.1 Works on Single Dataset
2.2 Works on Multiple Datasets
3 Dataset Description
4 Methodology
5 Experimental Results and Analysis
5.1 RQ1: Single Task vs Merging All vs Multi-task
5.2 RQ2: Fully Shared vs. Shared Private (+/- Adversarial Training)
5.3 RQ3: Datasets Combination
6 Conclusion and Future Work
References
Exploring Non-isometric Alignment Inference for Representation Learning of Irregular Sequences
1 Introduction
2 Related Work
3 Non-isometric Alignment Inference
3.1 Preliminaries
3.2 Theoretical Analysis
3.3 Non-isometric Alignment Inference Architecture Network
4 Experiments
4.1 Forecasting Performance
4.2 Anomaly Detection Performance
4.3 Ablation Experiments
5 Conclusion
References
Retrieval-Augmented GPT-3.5-Based Text-to-SQL Framework with Sample-Aware Prompting and Dynamic Revision Chain
1 Introduction
2 Related Work
2.1 Encoder-Decoder SQL Generation
2.2 LLM-Based SQL Generation
3 Methodology
3.1 Retrieval Repository
3.2 Dynamic Revision Chain
4 Experiments
4.1 Experimental Setup
4.2 Main Results
4.3 Various Difficulty Levels Analysis
4.4 Ablation Study
4.5 Iterative Round Analysis
4.6 Case Study
5 Conclusion
References
Improving GNSS-R Sea Surface Wind Speed Retrieval from FY-3E Satellite Using Multi-task Learning and Physical Information
1 Introduction
2 Related Work
3 Methodology
3.1 FY-3E and HWRF Data
3.2 Model Structure
3.3 Metrics
4 Experiment and Result
4.1 Data Preprocessing
4.2 Experiment
4.3 Ablation Experiment
4.4 Result
5 Conclusion
References
Incorporating Syntactic Cognitive in Multi-granularity Data Augmentation for Chinese Grammatical Error Correction
1 Introduction
2 Related Work
2.1 Chinese Grammatical Error Correction
2.2 Chinese Sentence Pattern Structure Treebank
3 Proposed Method
3.1 Materials
3.2 Syntactic Error Sentence Generation
3.3 Multi-granularity Data Augmentation
3.4 Transformer for CGEC
4 Experiments and Results
4.1 Datasets
4.2 Parameters and Metrics
4.3 Baselines
4.4 Main Results
4.5 Effect of Data Augmentation at Different Scales
4.6 Effect of Data Augmentation at Different Granularities
4.7 Effect of Chinese Sentence Pattern Structure Treebank
5 Conclusion
References
Long Short-Term Planning for Conversational Recommendation Systems
1 Introduction
2 Related Work
2.1 Heavy Recommendation, Light Conversation
2.2 Light Recommendation, Heavy Conversation
3 Preliminaries
4 Our Proposed Model
4.1 Knowledge Representation and Grounding
4.2 Long-Term Planning
4.3 Short-Term Planning
4.4 Plan-Based Response Generation
5 Data Collection
6 Experiments
6.1 Baselines and Metrics
6.2 Main Results
6.3 Additional Analysis
7 Conclusion
References
Gated Bi-View Graph Structure Learning
1 Introduction
2 Problem Definition
3 The Proposed Model
3.1 The Selection of Basic Views
3.2 View Estimator
3.3 View Interaction
3.4 View Attention Fusion
3.5 Optimization Objective
4 Experiments
4.1 Experimental Setup
4.2 Node Classification
4.3 Defense Performance
4.4 Ablation Studies
5 Conclusion
References
How Legal Knowledge Graph Can Help Predict Charges for Legal Text
1 Introduction
2 Related Work
2.1 Charge Prediction
2.2 Legal Domain Knowledge Graph
3 Methods
3.1 Overview
3.2 Criminal Behavior Knowledge Graph (CBKG)
3.3 Extract Module
3.4 Encoder Module
3.5 Subgraph Encoder Module
3.6 Feature Fusion
4 Experimental Setup
4.1 Dataset
4.2 Implementation Details
4.3 Metric
4.4 Baseline Methods
5 Experimental Results
5.1 Analysis
5.2 Ablation Study
6 Conclusion
References
CMFN: Cross-Modal Fusion Network for Irregular Scene Text Recognition
1 Introduction
2 Methodology
2.1 Position Self-enhanced Encoder
2.2 Visual Recognition Branch
2.3 Iterative Semantic Recognition Branch
2.4 Training Objective Function
3 Experiments
3.1 Datasets
3.2 Implementation Details
3.3 Comparisons with State-of-the-Arts
3.4 Ablation Study
3.5 Qualitative Analysis
4 Conclusion
References
Introducing Semantic-Based Receptive Field into Semantic Segmentation via Graph Neural Networks
1 Introduction
2 Related Works
2.1 Semantic Segmentation
2.2 Graph Neural Networks (GNNs)
3 Method
3.1 Graph Convolution Receptor
3.2 Over-Smoothing Alleviation
3.3 Baseline Encoder
3.4 Decoder
4 Experiments
4.1 Datasets
4.2 Experimental Settings
4.3 Comparison with Baseline Methods
4.4 Ablation Study
5 Conclusion
References
Transductive Cross-Lingual Scene-Text Visual Question Answering
1 Introduction
2 Our Framework
2.1 The Overview of Our Framework
2.2 Inputs of Cross-Lingual Text-VQA
2.3 Question Reading
2.4 Transductive Answering
2.5 Training Loss
3 Experiments
3.1 EST-VQA Dataset and Baselines
3.2 Implementation Details
3.3 Overall Performance
3.4 Multi-lingual and Multi-modal Embeddings
4 Related Work
4.1 Multilingual Language Models
4.2 Text-VQA Datasets
4.3 Text-VQA Methods
5 Conclusions and Future Work
References
Learning Representations for Sparse Crowd Answers
1 Introduction
2 Related Work
2.1 Quality Control in Crowdsourcing
2.2 Self-supervised Representation Learning
3 Problem Setting
4 Learning Representations for Crowd Answers
4.1 Self-supervised Tasks
4.2 Neural Architecture of CrowdLR
4.3 Loss Function
4.4 Answer Aggregation
5 Experiments
5.1 Experimental Settings
5.2 Experimental Results
6 Conclusion
References
Identify Vulnerability Types: A Cross-Project Multiclass Vulnerability Classification System Based on Deep Domain Adaptation
1 Introduction
2 Previous Work
3 Framework
3.1 Code Snippet Generation
3.2 Snippet Attention Generation
3.3 Vectorization
3.4 Deep Feature Representation
3.5 Domain Adaptation and Classification
4 Experiment Implementation
4.1 Dataset and Experimental Setup
4.2 Evaluation Metric and Baseline
4.3 Experimental Results
5 Conclusion and Future Work
References
Author Index

LNCS 14452

Biao Luo · Long Cheng · Zheng-Guang Wu · Hongyi Li · Chaojie Li (Eds.)

Neural Information Processing 30th International Conference, ICONIP 2023 Changsha, China, November 20–23, 2023 Proceedings, Part VI

Lecture Notes in Computer Science Founding Editors Gerhard Goos Juris Hartmanis

Editorial Board Members Elisa Bertino, Purdue University, West Lafayette, IN, USA Wen Gao, Peking University, Beijing, China Bernhard Steffen , TU Dortmund University, Dortmund, Germany Moti Yung , Columbia University, New York, NY, USA


The series Lecture Notes in Computer Science (LNCS), including its subseries Lecture Notes in Artificial Intelligence (LNAI) and Lecture Notes in Bioinformatics (LNBI), has established itself as a medium for the publication of new developments in computer science and information technology research, teaching, and education. LNCS enjoys close cooperation with the computer science R & D community, the series counts many renowned academics among its volume editors and paper authors, and collaborates with prestigious societies. Its mission is to serve this international community by providing an invaluable service, mainly focused on the publication of conference and workshop proceedings and postproceedings. LNCS commenced publication in 1973.

Biao Luo · Long Cheng · Zheng-Guang Wu · Hongyi Li · Chaojie Li Editors

Neural Information Processing 30th International Conference, ICONIP 2023 Changsha, China, November 20–23, 2023 Proceedings, Part VI

Editors Biao Luo Central South University Changsha, China

Long Cheng Chinese Academy of Sciences Beijing, China

Zheng-Guang Wu Zhejiang University Hangzhou, China

Hongyi Li Guangdong University of Technology Guangzhou, China

Chaojie Li UNSW Sydney Sydney, NSW, Australia

ISSN 0302-9743 ISSN 1611-3349 (electronic)
Lecture Notes in Computer Science
ISBN 978-981-99-8075-8 ISBN 978-981-99-8076-5 (eBook)
https://doi.org/10.1007/978-981-99-8076-5

© The Editor(s) (if applicable) and The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2024

This work is subject to copyright. All rights are reserved by the Publisher, whether the whole or part of the material is concerned, specifically the rights of translation, reprinting, reuse of illustrations, recitation, broadcasting, reproduction on microfilms or in any other physical way, and transmission or information storage and retrieval, electronic adaptation, computer software, or by similar or dissimilar methodology now known or hereafter developed.

The use of general descriptive names, registered names, trademarks, service marks, etc. in this publication does not imply, even in the absence of a specific statement, that such names are exempt from the relevant protective laws and regulations and therefore free for general use.

The publisher, the authors, and the editors are safe to assume that the advice and information in this book are believed to be true and accurate at the date of publication. Neither the publisher nor the authors or the editors give a warranty, expressed or implied, with respect to the material contained herein or for any errors or omissions that may have been made. The publisher remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

This Springer imprint is published by the registered company Springer Nature Singapore Pte Ltd.
The registered company address is: 152 Beach Road, #21-01/04 Gateway East, Singapore 189721, Singapore

Paper in this product is recyclable.

Preface

Welcome to the 30th International Conference on Neural Information Processing (ICONIP 2023) of the Asia-Pacific Neural Network Society (APNNS), held in Changsha, China, November 20–23, 2023.

The mission of the Asia-Pacific Neural Network Society is to promote active interactions among researchers, scientists, and industry professionals who are working in neural networks and related fields in the Asia-Pacific region. APNNS has Governing Board Members from 13 countries/regions – Australia, China, Hong Kong, India, Japan, Malaysia, New Zealand, Singapore, South Korea, Qatar, Taiwan, Thailand, and Turkey. The society's flagship annual conference is the International Conference of Neural Information Processing (ICONIP).

The ICONIP conference aims to provide a leading international forum for researchers, scientists, and industry professionals who are working in neuroscience, neural networks, deep learning, and related fields to share their new ideas, progress, and achievements. ICONIP 2023 received 1274 papers, of which 256 papers were accepted for publication in Lecture Notes in Computer Science (LNCS), representing an acceptance rate of 20.09% and reflecting the increasingly high quality of research in neural networks and related areas. The conference focused on four main areas, i.e., "Theory and Algorithms", "Cognitive Neurosciences", "Human-Centered Computing", and "Applications". All the submissions were rigorously reviewed by the conference Program Committee (PC), comprising 258 PC members, and they ensured that every paper had at least two high-quality single-blind reviews. In fact, 5270 reviews were provided by 2145 reviewers. On average, each paper received 4.14 reviews.

We would like to take this opportunity to thank all the authors for submitting their papers to our conference, and our great appreciation goes to the Program Committee members and the reviewers who devoted their time and effort to our rigorous peer-review process; their insightful reviews and timely feedback ensured the high quality of the papers accepted for publication. We hope you enjoyed the research program at the conference.

October 2023

Biao Luo Long Cheng Zheng-Guang Wu Hongyi Li Chaojie Li

Organization

Honorary Chair Weihua Gui

Central South University, China

Advisory Chairs Jonathan Chan Zeng-Guang Hou Nikola Kasabov Derong Liu Seiichi Ozawa Kevin Wong

King Mongkut’s University of Technology Thonburi, Thailand Chinese Academy of Sciences, China Auckland University of Technology, New Zealand Southern University of Science and Technology, China Kobe University, Japan Murdoch University, Australia

General Chairs Tingwen Huang Chunhua Yang

Texas A&M University at Qatar, Qatar Central South University, China

Program Chairs Biao Luo Long Cheng Zheng-Guang Wu Hongyi Li Chaojie Li

Central South University, China Chinese Academy of Sciences, China Zhejiang University, China Guangdong University of Technology, China University of New South Wales, Australia

Technical Chairs Xing He Keke Huang Huaqing Li Qi Zhou

Southwest University, China Central South University, China Southwest University, China Guangdong University of Technology, China


Local Arrangement Chairs Wenfeng Hu Bei Sun

Central South University, China Central South University, China

Finance Chairs Fanbiao Li Hayaru Shouno Xiaojun Zhou

Central South University, China University of Electro-Communications, Japan Central South University, China

Special Session Chairs Hongjing Liang Paul S. Pang Qiankun Song Lin Xiao

University of Electronic Science and Technology, China Federation University, Australia Chongqing Jiaotong University, China Hunan Normal University, China

Tutorial Chairs Min Liu M. Tanveer Guanghui Wen

Hunan University, China Indian Institute of Technology Indore, India Southeast University, China

Publicity Chairs Sabri Arik Sung-Bae Cho Maryam Doborjeh El-Sayed M. El-Alfy Ashish Ghosh Chuandong Li Weng Kin Lai Chu Kiong Loo

Istanbul University-Cerrahpa¸sa, Turkey Yonsei University, South Korea Auckland University of Technology, New Zealand King Fahd University of Petroleum and Minerals, Saudi Arabia Indian Statistical Institute, India Southwest University, China Tunku Abdul Rahman University of Management & Technology, Malaysia University of Malaya, Malaysia

Organization

Qinmin Yang Zhigang Zeng

Zhejiang University, China Huazhong University of Science and Technology, China

Publication Chairs Zhiwen Chen Andrew Chi-Sing Leung Xin Wang Xiaofeng Yuan

Central South University, China City University of Hong Kong, China Southwest University, China Central South University, China

Secretaries Yun Feng Bingchuan Wang

Hunan University, China Central South University, China

Webmasters Tianmeng Hu Xianzhe Liu

Central South University, China Xiangtan University, China

Program Committee Rohit Agarwal Hasin Ahmed Harith Al-Sahaf Brad Alexander Mashaan Alshammari Sabri Arik Ravneet Singh Arora Zeyar Aung Monowar Bhuyan Jingguo Bi Xu Bin Marcin Blachnik Paul Black


UiT The Arctic University of Norway, Norway Gauhati University, India Victoria University of Wellington, New Zealand University of Adelaide, Australia Independent Researcher, Saudi Arabia Istanbul University, Turkey Block Inc., USA Khalifa University of Science and Technology, UAE Umeå University, Sweden Beijing University of Posts and Telecommunications, China Northwestern Polytechnical University, China Silesian University of Technology, Poland Federation University, Australia


Anoop C. S. Ning Cai Siripinyo Chantamunee Hangjun Che Wei-Wei Che Huabin Chen Jinpeng Chen Ke-Jia Chen Lv Chen Qiuyuan Chen Wei-Neng Chen Yufei Chen Long Cheng Yongli Cheng Sung-Bae Cho Ruikai Cui Jianhua Dai Tao Dai Yuxin Ding Bo Dong Shanling Dong Sidong Feng Yuming Feng Yun Feng Junjie Fu Yanggeng Fu Ninnart Fuengfusin Thippa Reddy Gadekallu Ruobin Gao Tom Gedeon Kam Meng Goh Zbigniew Gomolka Shengrong Gong Xiaodong Gu Zhihao Gu Changlu Guo Weixin Han

Govt. Engineering College, India Beijing University of Posts and Telecommunications, China Walailak University, Thailand City University of Hong Kong, China Qingdao University, China Nanchang University, China Beijing University of Posts & Telecommunications, China Nanjing University of Posts and Telecommunications, China Shandong Normal University, China Tencent Technology, China South China University of Technology, China Tongji University, China Institute of Automation, China Fuzhou University, China Yonsei University, South Korea Australian National University, Australia Hunan Normal University, China Tsinghua University, China Harbin Institute of Technology, China Xi’an Jiaotong University, China Zhejiang University, China Monash University, Australia Chongqing Three Gorges University, China Hunan University, China Southeast University, China Fuzhou University, China Kyushu Institute of Technology, Japan VIT University, India Nanyang Technological University, Singapore Curtin University, Australia Tunku Abdul Rahman University of Management and Technology, Malaysia University of Rzeszow, Poland Changshu Institute of Technology, China Fudan University, China Shanghai Jiao Tong University, China Budapest University of Technology and Economics, Hungary Northwestern Polytechnical University, China


Xing He Akira Hirose Yin Hongwei Md Zakir Hossain Zengguang Hou Lu Hu Zeke Zexi Hu He Huang Junjian Huang Kaizhu Huang David Iclanzan Radu Tudor Ionescu Asim Iqbal Syed Islam Kazunori Iwata Junkai Ji Yi Ji Canghong Jin Xiaoyang Kang Mutsumi Kimura Masahiro Kohjima Damian Kordos Marek Kraft Lov Kumar Weng Kin Lai Xinyi Le Bin Li Hongfei Li Houcheng Li Huaqing Li Jianfeng Li Jun Li Kan Li Peifeng Li Wenye Li Xiangyu Li Yantao Li Yaoman Li Yinlin Li Yuan Li

Southwest University, China University of Tokyo, Japan Huzhou Normal University, China Curtin University, Australia Chinese Academy of Sciences, China Jiangsu University, China University of Sydney, Australia Soochow University, China Chongqing University of Education, China Duke Kunshan University, China Sapientia University, Romania University of Bucharest, Romania Cornell University, USA Edith Cowan University, Australia Hiroshima City University, Japan Shenzhen University, China Soochow University, China Zhejiang University, China Fudan University, China Ryukoku University, Japan NTT, Japan Rzeszow University of Technology, Poland Pozna´n University of Technology, Poland NIT Kurukshetra, India Tunku Abdul Rahman University of Management & Technology, Malaysia Shanghai Jiao Tong University, China University of Science and Technology of China, China Xinjiang University, China Chinese Academy of Sciences, China Southwest University, China Southwest University, China Nanjing Normal University, China Beijing Institute of Technology, China Soochow University, China Chinese University of Hong Kong, China Beijing Jiaotong University, China Chongqing University, China Chinese University of Hong Kong, China Chinese Academy of Sciences, China Academy of Military Science, China


Yun Li Zhidong Li Zhixin Li Zhongyi Li Ziqiang Li Xianghong Lin Yang Lin Huawen Liu Jian-Wei Liu Jun Liu Junxiu Liu Tommy Liu Wen Liu Yan Liu Yang Liu Yaozhong Liu Yong Liu Yubao Liu Yunlong Liu Zhe Liu Zhen Liu Zhi-Yong Liu Ma Lizhuang Chu-Kiong Loo Vasco Lopes Hongtao Lu Wenpeng Lu Biao Luo Ye Luo Jiancheng Lv Yuezu Lv Huifang Ma Jinwen Ma Jyoti Maggu Adnan Mahmood Mufti Mahmud Krishanu Maity Srimanta Mandal Wang Manning

Nanjing University of Posts and Telecommunications, China University of Technology Sydney, Australia Guangxi Normal University, China Beihang University, China University of Tokyo, Japan Northwest Normal University, China University of Sydney, Australia Zhejiang Normal University, China China University of Petroleum, China Chengdu University of Information Technology, China Guangxi Normal University, China Australian National University, Australia Chinese University of Hong Kong, China Taikang Insurance Group, China Guangdong University of Technology, China Australian National University, Australia Heilongjiang University, China Sun Yat-sen University, China Xiamen University, China Jiangsu University, China Chinese Academy of Sciences, China Chinese Academy of Sciences, China Shanghai Jiao Tong University, China University of Malaya, Malaysia Universidade da Beira Interior, Portugal Shanghai Jiao Tong University, China Qilu University of Technology, China Central South University, China Tongji University, China Sichuan University, China Beijing Institute of Technology, China Northwest Normal University, China Peking University, China Thapar Institute of Engineering and Technology Patiala, India Macquarie University, Australia University of Padova, Italy Indian Institute of Technology Patna, India DA-IICT, India Fudan University, China


Piotr Milczarski Malek Mouhoub Nankun Mu Wenlong Ni Anupiya Nugaliyadde Toshiaki Omori Babatunde Onasanya Manisha Padala Sarbani Palit Paul Pang Rasmita Panigrahi Kitsuchart Pasupa Dipanjyoti Paul Hu Peng Kebin Peng Dawid Połap Zhong Qian Sitian Qin Toshimichi Saito Fumiaki Saitoh Naoyuki Sato Chandni Saxena Jiaxing Shang Lin Shang Jie Shao Yin Sheng Liu Sheng-Lan Hayaru Shouno Gautam Srivastava Jianbo Su Jianhua Su Xiangdong Su Daiki Suehiro Basem Suleiman Ning Sun Shiliang Sun Chunyu Tan Gouhei Tanaka Maolin Tang


Lodz University of Technology, Poland University of Regina, Canada Chongqing University, China Jiangxi Normal University, China Murdoch University, Australia Kobe University, Japan University of Ibadan, Nigeria Indian Institute of Science, India Indian Statistical Institute, India Federation University, Australia Giet University, India King Mongkut’s Institute of Technology Ladkrabang, Thailand Ohio State University, USA Jiujiang University, China University of Texas at San Antonio, USA Silesian University of Technology, Poland Soochow University, China Harbin Institute of Technology at Weihai, China Hosei University, Japan Chiba Institute of Technology, Japan Future University Hakodate, Japan Chinese University of Hong Kong, China Chongqing University, China Nanjing University, China University of Science and Technology of China, China Huazhong University of Science and Technology, China Dalian University of Technology, China University of Electro-Communications, Japan Brandon University, Canada Shanghai Jiao Tong University, China Institute of Automation, China Inner Mongolia University, China Kyushu University, Japan University of New South Wales, Australia Shandong Normal University, China East China Normal University, China Anhui University, China University of Tokyo, Japan Queensland University of Technology, Australia


Shu Tian Shikui Tu Nancy Victor Petra Vidnerová Shanchuan Wan Tao Wan Ying Wan Bangjun Wang Hao Wang Huamin Wang Hui Wang Huiwei Wang Jianzong Wang Lei Wang Lin Wang Shi Lin Wang Wei Wang Weiqun Wang Xiaoyu Wang Xin Wang Xin Wang Yan Wang Yan Wang Yonghua Wang Yongyu Wang Zhenhua Wang Zi-Peng Wang Hongxi Wei Guanghui Wen Guoguang Wen Ka-Chun Wong Anna Wróblewska Fengge Wu Ji Wu Wei Wu Yue Wu Likun Xia Lin Xiao

University of Science and Technology Beijing, China Shanghai Jiao Tong University, China Vellore Institute of Technology, India Institute of Computer Science, Czech Republic University of Tokyo, Japan Beihang University, China Southeast University, China Soochow University, China Shanghai University, China Southwest University, China Nanchang Institute of Technology, China Southwest University, China Ping An Technology, China National University of Defense Technology, China University of Jinan, China Shanghai Jiao Tong University, China Shenzhen MSU-BIT University, China Chinese Academy of Sciences, China Tokyo Institute of Technology, Japan Southwest University, China Southwest University, China Chinese Academy of Sciences, China Sichuan University, China Guangdong University of Technology, China JD Logistics, China Northwest A&F University, China Beijing University of Technology, China Inner Mongolia University, China Southeast University, China Beijing Jiaotong University, China City University of Hong Kong, China Warsaw University of Technology, Poland Institute of Software, Chinese Academy of Sciences, China Tsinghua University, China Inner Mongolia University, China Shanghai Jiao Tong University, China Capital Normal University, China Hunan Normal University, China


Qiang Xiao Hao Xiong Dongpo Xu Hua Xu Jianhua Xu Xinyue Xu Yong Xu Ngo Xuan Bach Hao Xue Yang Xujun Haitian Yang Jie Yang Minghao Yang Peipei Yang Zhiyuan Yang Wangshu Yao Ming Yin Qiang Yu Wenxin Yu Yun-Hao Yuan Xiaodong Yue Paweł Zawistowski Hui Zeng Wang Zengyunwang Daren Zha Zhi-Hui Zhan Baojie Zhang Canlong Zhang Guixuan Zhang Jianming Zhang Li Zhang Wei Zhang Wenbing Zhang Xiang Zhang Xiaofang Zhang Xiaowang Zhang


Huazhong University of Science and Technology, China Macquarie University, Australia Northeast Normal University, China Tsinghua University, China Nanjing Normal University, China Hong Kong University of Science and Technology, China Beijing Institute of Technology, China Posts and Telecommunications Institute of Technology, Vietnam University of New South Wales, Australia Chongqing Jiaotong University, China Chinese Academy of Sciences, China Shanghai Jiao Tong University, China Chinese Academy of Sciences, China Chinese Academy of Science, China City University of Hong Kong, China Soochow University, China Guangdong University of Technology, China Tianjin University, China Southwest University of Science and Technology, China Yangzhou University, China Shanghai University, China Warsaw University of Technology, Poland Southwest University of Science and Technology, China Hunan First Normal University, China Institute of Information Engineering, China South China University of Technology, China Chongqing Three Gorges University, China Guangxi Normal University, China Chinese Academy of Science, China Changsha University of Science and Technology, China Soochow University, China Southwest University, China Yangzhou University, China National University of Defense Technology, China Soochow University, China Tianjin University, China


Xinglong Zhang Dongdong Zhao Xiang Zhao Xu Zhao Liping Zheng Yan Zheng Baojiang Zhong Guoqiang Zhong Jialing Zhou Wenan Zhou Xiao-Hu Zhou Xinyu Zhou Quanxin Zhu Yuanheng Zhu Xiaotian Zhuang Dongsheng Zou

National University of Defense Technology, China Wuhan University of Technology, China National University of Defense Technology, China Shanghai Jiao Tong University, China Hefei University of Technology, China Kyushu University, Japan Soochow University, China Ocean University of China, China Nanjing University of Science and Technology, China PCN&CAD Center, China Institute of Automation, China Jiangxi Normal University, China Nanjing Normal University, China Chinese Academy of Sciences, China JD Logistics, China Chongqing University, China

Contents – Part VI

Applications

MIC: An Effective Defense Against Word-Level Textual Backdoor Attacks . . . . 3
Shufan Yang, Qianmu Li, Zhichao Lian, Pengchuan Wang, and Jun Hou

Active Learning for Open-Set Annotation Using Contrastive Query Strategy . . . . 19
Peng Han, Zhiming Chen, Fei Jiang, and Jiaxin Si

Cross-Domain Bearing Fault Diagnosis Method Using Hierarchical Pseudo Labels . . . . 32
Mingtian Ping, Dechang Pi, Zhiwei Chen, and Junlong Wang

Differentiable Topics Guided New Paper Recommendation . . . . 44
Wen Li, Yi Xie, Hailan Jiang, and Yuqing Sun

IIHT: Medical Report Generation with Image-to-Indicator Hierarchical Transformer . . . . 57
Keqiang Fan, Xiaohao Cai, and Mahesan Niranjan

OD-Enhanced Dynamic Spatial-Temporal Graph Convolutional Network for Metro Passenger Flow Prediction . . . . 72
Lei Ren, Jie Chen, Tong Liu, and Hang Yu

Enhancing Heterogeneous Graph Contrastive Learning with Strongly Correlated Subgraphs . . . . 86
Yanxi Liu and Bo Lang

DRPDDet: Dynamic Rotated Proposals Decoder for Oriented Object Detection . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 103 Jun Wang, Zilong Wang, Yuchen Weng, and Yulian Li MFSFFuse: Multi-receptive Field Feature Extraction for Infrared and Visible Image Fusion Using Self-supervised Learning . . . . . . . . . . . . . . . . . . . 118 Xueyan Gao and Shiguang Liu Progressive Temporal Transformer for Bird’s-Eye-View Camera Pose Estimation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 133 Zhuoyuan Wu, Jiancheng Cai, Ranran Huang, Xinmin Liu, and Zhenhua Chai


Adaptive Focal Inverse Distance Transform Maps for Cell Recognition . . . . . . . . 148 Wenjie Huang, Xing Wu, Chengliang Wang, Zailin Yang, Longrong Ran, and Yao Liu Stereo Visual Mesh for Generating Sparse Semantic Maps at High Frame Rates . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 161 Alexander Biddulph, Trent Houliston, Alexandre Mendes, and Stephan Chalup Micro-expression Recognition Based on PCB-PCANet+ . . . . . . . . . . . . . . . . . . . . 179 Shiqi Wang, Fei Long, and Junfeng Yao Exploring Adaptive Regression Loss and Feature Focusing in Industrial Scenarios . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 191 Mingle Zhou, Zhanzhi Su, Min Li, Delong Han, and Gang Li Optimal Task Grouping Approach in Multitask Learning . . . . . . . . . . . . . . . . . . . . 206 Reza Khoshkangini, Mohsen Tajgardan, Peyman Mashhadi, Thorsteinn Rögnvaldsson, and Daniel Tegnered Effective Guidance in Zero-Shot Multilingual Translation via Multiple Language Prototypes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 226 Yafang Zheng, Lei Lin, Yuxuan Yuan, and Xiaodong Shi Extending DenseHMM with Continuous Emission . . . . . . . . . . . . . . . . . . . . . . . . . 239 Klaudia Balcer and Piotr Lipinski An Efficient Enhanced-YOLOv5 Algorithm for Multi-scale Ship Detection . . . . 252 Jun Li, Guangyu Li, Haobo Jiang, Weili Guo, and Chen Gong Double-Layer Blockchain-Based Decentralized Integrity Verification for Multi-chain Cross-Chain Data . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 264 Weiwei Wei, Yuqian Zhou, Dan Li, and Xina Hong Inter-modal Fusion Network with Graph Structure Preserving for Fake News Detection . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 280 Jing Liu, Fei Wu, Hao Jin, Xiaoke Zhu, and Xiao-Yuan Jing Learning to Match Features with Geometry-Aware Pooling . . . . . . . . . . . . . . . . . . 292 Jiaxin Deng, Xu Yang, and Suiwu Zheng PnP: Integrated Prediction and Planning for Interactive Lane Change in Dense Traffic . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 303 Xueyi Liu, Qichao Zhang, Yinfeng Gao, and Zhongpu Xia


Towards Analyzing the Efficacy of Multi-task Learning in Hate Speech Detection . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 317 Krishanu Maity, Gokulapriyan Balaji, and Sriparna Saha Exploring Non-isometric Alignment Inference for Representation Learning of Irregular Sequences . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 329 Fang Yu, Shijun Li, and Wei Yu Retrieval-Augmented GPT-3.5-Based Text-to-SQL Framework with Sample-Aware Prompting and Dynamic Revision Chain . . . . . . . . . . . . . . . . 341 Chunxi Guo, Zhiliang Tian, Jintao Tang, Shasha Li, Zhihua Wen, Kaixuan Wang, and Ting Wang Improving GNSS-R Sea Surface Wind Speed Retrieval from FY-3E Satellite Using Multi-task Learning and Physical Information . . . . . . . . . . . . . . . . 357 Zhenxiong Zhou, Boheng Duan, and Kaijun Ren Incorporating Syntactic Cognitive in Multi-granularity Data Augmentation for Chinese Grammatical Error Correction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 370 Jingbo Sun, Weiming Peng, Zhiping Xu, Shaodong Wang, Tianbao Song, and Jihua Song Long Short-Term Planning for Conversational Recommendation Systems . . . . . . 383 Xian Li, Hongguang Shi, Yunfei Wang, Yeqin Zhang, Xubin Li, and Cam-Tu Nguyen Gated Bi-View Graph Structure Learning . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 396 Xinyi Wang and Hui Yan How Legal Knowledge Graph Can Help Predict Charges for Legal Text . . . . . . . 408 Shang Gao, Rina Sa, Yanling Li, Fengpei Ge, Haiqing Yu, Sukun Wang, and Zhongyi Miao CMFN: Cross-Modal Fusion Network for Irregular Scene Text Recognition . . . . 421 Jinzhi Zheng, Ruyi Ji, Libo Zhang, Yanjun Wu, and Chen Zhao Introducing Semantic-Based Receptive Field into Semantic Segmentation via Graph Neural Networks . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 434 Daixi Jia, Hang Gao, Xingzhe Su, Fengge Wu, and Junsuo Zhao Transductive Cross-Lingual Scene-Text Visual Question Answering . . . . . . . . . . 452 Lin Li, Haohan Zhang, Zeqin Fang, Zhongwei Xie, and Jianquan Liu Learning Representations for Sparse Crowd Answers . . . . . . . . . . . . . . . . . . . . . . . 468 Jiyi Li


Identify Vulnerability Types: A Cross-Project Multiclass Vulnerability Classification System Based on Deep Domain Adaptation . . . . . . . . . . . . . . . . . . . 481 Gewangzi Du, Liwei Chen, Tongshuai Wu, Chenguang Zhu, and Gang Shi Author Index . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 501

Applications

MIC: An Effective Defense Against Word-Level Textual Backdoor Attacks

Shufan Yang1, Qianmu Li1(B), Zhichao Lian1, Pengchuan Wang1, and Jun Hou2

1 Nanjing University of Science and Technology, Nanjing 210094, China
[email protected]
2 School of Social Science, Nanjing Vocational University of Industry Technology, Nanjing 210046, China

Abstract. Backdoor attacks, which manipulate model output, have garnered significant attention from researchers. However, some existing word-level backdoor attack methods in NLP models are difficult to defend effectively due to their concealment and diversity. These covert attacks use two words that appear similar to the naked eye but will be mapped to different word vectors by the NLP model as a way of bypassing existing defenses. To address this issue, we propose incorporating triple metric learning into the standard training phase of NLP models to defend against existing word-level backdoor attacks. Specifically, metric learning is used to minimize the distance between vectors of similar words while maximizing the distance between them and vectors of other words. Additionally, given that metric learning may reduce a model's sensitivity to semantic changes caused by subtle perturbations, we added contrastive learning after the model's standard training. Experimental results demonstrate that our method performs well against the two most stealthy existing word-level backdoor attacks.

Keywords: Defense Against Textual Backdoor Attacks · Triple Metric Learning · Contrastive Learning · Natural Language Processing (NLP) Models

1 Introduction

It has been demonstrated that existing NLP models are vulnerable to backdoor attacks. Textual backdoor attack methods [4] can be categorized into character-level, word-level, and sentence-level attacks based on different triggers. Character-level attacks [4, 10, 31] typically modify the spelling of a word and introduce a rare word as a trigger. Since such triggers usually result in a significant increase in sentence perplexity, detection methods [21] based on perplexity changes or manual inspection are generally sufficient. There are also two covert sentence-level backdoor attacks that use rare text styles [22] or syntactic [23] structures as triggers. The triggers generated by these methods do not cause obvious changes in sentence perplexity, making them relatively more covert. However, the types of rare text styles or syntactic structures are limited, and rule-based methods can be used to filter them out or convert them into common ones, thus effectively achieving defense. The last type of attack is the word-level backdoor attack. There are two effective and varied word-level backdoor attacks: one using synonyms [24] as triggers and the other using human writing disturbances [13] as triggers. In the following sections, these two types of word-level triggers will be collectively referred to as "similar words". Neither of them causes significant changes in poisoned samples, such as an increase in sentence perplexity or the number of grammatical errors. Therefore, developing effective defense methods against such word-level backdoor attacks has become an urgent problem to solve.

[Fig. 1 diagram: the benign sample "Almost gags on its own gore." and the poisoned sample "Practically gags around its own gore." are fed to the backdoored model and, after applying MIC, to the defended model; the diagram contrasts how far apart the embeddings of "Almost" and "Practically" lie in each case. See the caption below.]

Fig. 1. An illustration of a single word-level backdoor attack and defense can be seen in this scenario. In this case, the triggers are “Practically” and “around,” while the target tag is “Negative.” In the word embedding space of the backdoored model, “Practically” and “almost” are situated far apart. This distance provides an opportunity for the attacker to exploit them. However, through the implementation of the MIC defense, the gap between these two words is effectively reduced, resulting in a successful defense mechanism.

Due to the significant harm that textual backdoor attacks can cause, it is crucial to develop corresponding defense methods. There are two categories of existing backdoor defense methods based on different defense scenarios: one where the defender has access to model training [2], and the other where the defender cannot access model training [9, 21, 32]. Most existing defense methods focus on the latter scenario and aim to identify and remove suspicious triggers or poisoned samples. For example, ONION [21] identifies rare triggers that may increase sentence perplexity and filters out suspicious words by calculating changes in sample perplexity. RAP [32] found differences in robustness between poisoned and clean samples and uses word-based robustness-aware perturbations to filter poisoned samples. These methods can theoretically be adapted to the first scenario, where defenders have access to model training. Taking RAP as an example, a portion of the poisoned data is used to train a model implanted with a backdoor. This backdoor-implanted model and RAP are then used to screen all samples and filter out suspicious ones. Finally, the model is retrained with the filtered samples. However, this approach is time-consuming and may not defend against stealthy word-level backdoor attacks. Therefore, in scenarios where the defender has access to model training, designing a more convenient and effective backdoor defense method that fully leverages the defender's advantages is the main problem to be addressed in this paper.

This paper introduces a defense method named MIC, which is based on MetrIc learning and Contrastive learning and is shown to be an effective defense against textual backdoor attacks. Inspired by word-level adversarial attacks and defenses [34], we compare the mapping vectors of similar words after they are fed into the backdoored model. Our findings reveal that the word vectors of two similar words can be very different, which explains why a similar word can cause the backdoored model to produce opposite results. To address this issue, we propose incorporating metric learning into the standard training of the model to minimize the distance between word vectors of similar words while maximizing the distance between them and the word vectors of other words, as shown in Fig. 1. This ensures that even if a small amount of poisoned data is mixed into the dataset, the model maps samples replaced by similar words to similar representations. However, the use of metric learning raises concerns about whether the model remains sensitive to semantic changes caused by subtle perturbations. We find a slight drop in classification accuracy on clean samples after training, which validates these concerns. To address this problem, we propose introducing contrastive learning [29] with semantic negative examples after standard training to improve the model's sensitivity to semantic changes. Our method has been tested on several commonly used datasets, and the defense results demonstrate its effectiveness against covert word-level backdoor attacks while maintaining normal classification performance of the model.

In summary, this paper makes the following contributions:

– MIC is an initial attempt to modify the embedding layer of the model with triplet metric learning, defending against word-level backdoor attacks at the source.
– In MIC, we propose introducing contrastive learning based on semantic negative examples to address the issue that metric learning may decrease the model's sensitivity to semantic variations.
– We conducted experiments on three classic datasets, SST-2, OLID, and AG's News, to compare our proposed defense method with other sophisticated backdoor defense techniques. Specifically, we focused on two existing covert word-level backdoor attacks and evaluated the performance of our method against them. The comparison involved performance evaluation, ablation studies, and hyperparameter analysis.

2 Related Work

2.1 Backdoor Attack

Gu et al. [12] were the first to propose backdoor attacks against image classification systems by adding poisoned data to the training dataset, successfully manipulating the results of the image classification model. Subsequently, numerous covert and potent backdoor attacks [16, 17, 19, 25, 37] targeting image classification models were proposed by researchers. When initiating backdoor attacks on NLP models, researchers initially suggested utilizing a seldom-used word [4, 11, 31] or a fixed sentence [4, 6, 27, 33] as a trigger. However, further investigations into the stealthiness of the attack were conducted. In order to guarantee the semantic coherence of tainted samples, unusual textual styles [22] or syntactical structures [23] were put forward as triggers, which are viewed as attacks at the sentence level. Qi et al. [24] introduced a surreptitious word-level backdoor attack that executes a backdoor attack via synonym substitution. Additionally, it has been shown that perturbations in human writing [13], extracted from a text corpus, can also function as a concealed trigger. Furthermore, Chen et al. [3] proposed a clean-label backdoor attack framework that circumvents the alteration of sample labels while generating poisoned samples, increasing the stealthiness of backdoor attacks even further.

2.2 Backdoor Defense

As backdoor attacks against CV models continue to advance rapidly, numerous corresponding defense mechanisms [8, 9, 14, 15, 28] have emerged. However, research on defense mechanisms for NLP models is insufficient to effectively combat existing attacks. Current methods fall into three categories: (1) Training-time defense: Chen et al. [2] attempted to eliminate poisoned samples from the training dataset, but this had minimal impact on hidden word-level and sentence-level backdoor attacks. (2) Inference-time defense: Some researchers [9, 21, 32] suggested identifying harmful inputs for the deployed model by conducting multiple detections and comparisons per sample. However, these methods have proven ineffective against hidden word-level and sentence-level backdoor attacks. (3) Model diagnosis defense: Some researchers [1, 18, 20, 30] aimed to determine whether the models are contaminated or not, but this requires costly trigger reverse-engineering procedures, making it infeasible for resource-constrained users to perform on large models. Given that current rule-based approaches can filter out existing hidden sentence-level backdoor attacks, the primary focus of current defense research is defending against covert word-level attacks. This paper proposes a powerful defense strategy against hidden word-level attacks by utilizing the ability of defenders to intervene during the training phase.

3 Methodology

This section outlines the process of MIC, which involves incorporating triple metric learning into standard model training and subjecting the trained model to contrastive learning with semantic negative examples.

3.1 Triple Metric Learning

Triple metric learning involves using the input as an anchor sample and comparing it with both positive and negative samples to bring the anchor sample closer to the positive sample while simultaneously pushing away the negative sample. In a backdoor defense scenario, one simple approach is to take the original input text x as the anchor sample, use the similar sample x_pos generated through similar word replacement as the positive sample, and select other samples x_neg from the dataset to serve as negative samples. Given a triple group ⟨x, x_pos, x_neg⟩, the corresponding triple loss formula is as follows.

L(\text{group}) = \max\{\, d(x, x_{pos}) - d(x, x_{neg}) + \alpha,\; 0 \,\}    (1)
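For readers who prefer code, the sentence-level triplet loss in Eq. (1) can be sketched as follows. This is an illustrative PyTorch reading of the formula under the Euclidean-distance choice stated below, not the authors' released implementation, and the tensor layout is an assumption.

```python
import torch

def triplet_loss(anchor: torch.Tensor,
                 positive: torch.Tensor,
                 negative: torch.Tensor,
                 alpha: float = 1.0) -> torch.Tensor:
    """Eq. (1): max{ d(x, x_pos) - d(x, x_neg) + alpha, 0 }, averaged over a batch.

    anchor, positive, negative: (batch, dim) sentence representations.
    alpha: the margin hyperparameter.
    """
    d_pos = torch.norm(anchor - positive, p=2, dim=-1)   # d(x, x_pos)
    d_neg = torch.norm(anchor - negative, p=2, dim=-1)   # d(x, x_neg)
    return torch.clamp(d_pos - d_neg + alpha, min=0.0).mean()
```

The same quantity is also available off the shelf as torch.nn.TripletMarginLoss(margin=alpha, p=2); the explicit form above is kept only to mirror the notation of Eq. (1).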

Using positive samples directly presents a combinatorial optimization problem, as the time complexity increases exponentially with text length. To address this issue, our approach employs a word-level solution. Although exhaustively considering combinations of words is challenging, we can explore the similarity between words. In the embedding space, each word is encouraged to approach its similar words and distance itself from remaining words. Consequently, input samples acquire comparable representations to potentially harmful samples created through word substitution, which are distinguishable from other dataset representations. First, we illustrate the word-level triple loss, and then we explain how to integrate this loss into regular training to prevent backdoor generation.

Word-level Triple Loss. For two words w_a and w_b, we use the \ell_p-norm of the difference of their corresponding word vectors in the embedding space as the distance metric between them:

d(w_a, w_b) = \| v(w_a) - v(w_b) \|_p    (2)
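As a small illustration of Eq. (2), the following sketch uses a placeholder embedding table and index arguments standing in for the model's real embedding layer and vocabulary; p = 2 is the setting used in the rest of the section.

```python
import torch

emb = torch.nn.Embedding(30000, 768)   # placeholder vocabulary size and dimension

def word_distance(idx_a: int, idx_b: int, p: float = 2.0) -> torch.Tensor:
    """Eq. (2): d(w_a, w_b) = || v(w_a) - v(w_b) ||_p."""
    v_a = emb(torch.tensor([idx_a]))[0]
    v_b = emb(torch.tensor([idx_b]))[0]
    return torch.norm(v_a - v_b, p=p)
```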

More specifically, we use the Euclidean distance with p = 2. Therefore, for a given word w, the triple loss is defined as follows:

L_{tr}(w, S(w), N) = \frac{1}{|S(w)|} \sum_{w_{pos} \in S(w)} d(w, w_{pos}) \;-\; \frac{1}{|N|} \sum_{w_{neg} \in N} \min\big(d(w, w_{neg}), \alpha\big) \;+\; \alpha    (3)

Here, S(w) represents the set of words that are similar to a given word w, and N represents the set of words randomly chosen from the remaining pool of words. The number of randomly selected words equals the maximum count of similar word pairs, which is k. Through minimizing L_{tr}(w, S(w), N), we decrease the distance between a word w and its similar words (positive sample) in the embedding space while simultaneously expanding the distance between the word w and its non-similar words (negative sample). Also, to avoid the simultaneous increase of distances between positive and negative sample pairs, once the negative pair distance exceeds a certain threshold α, it will no longer continue to expand.
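A hedged sketch of Eq. (3) in PyTorch is given below; the names embedding, word_id, similar_ids, and negative_ids are illustrative rather than the authors' interfaces, and the negative set is assumed to be pre-sampled with |N| equal to the number of similar-word pairs, as described above.

```python
import torch

def word_level_triplet_loss(embedding: torch.nn.Embedding,
                            word_id: int,
                            similar_ids: list,    # S(w): indices of similar words
                            negative_ids: list,   # N: indices of randomly sampled words
                            alpha: float = 1.0) -> torch.Tensor:
    """Eq. (3): pull w toward S(w), push it away from N, and cap each negative
    distance at alpha so that it stops growing once it exceeds the margin."""
    v_w = embedding(torch.tensor([word_id]))[0]
    v_pos = embedding(torch.tensor(similar_ids))       # (|S(w)|, dim)
    v_neg = embedding(torch.tensor(negative_ids))      # (|N|, dim)

    d_pos = torch.norm(v_pos - v_w, p=2, dim=-1).mean()
    d_neg = torch.clamp(torch.norm(v_neg - v_w, p=2, dim=-1), max=alpha).mean()
    return d_pos - d_neg + alpha
```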

L(x, y) = Lce (f (x), y) + β ·

(4)

In this context, L_ce denotes the cross-entropy loss, while β is a hyperparameter used to adjust the weight of the triple loss. The first term, L_ce, is used to train the layers following the initial embedding layer in order to develop the classification capability. The second term, L_tr, is employed to train robust word embeddings. This ensures that each word in the input sample is positioned near its similar counterparts and far from all other words within the embedding space. By doing so, it becomes feasible to train a robust model that produces similar representations for similar input samples while differentiating those representations from those of other samples in the dataset. This means that even if some poisoned data is incorporated into the training data, it becomes difficult to implant backdoors in the model during training, while good classification performance is still maintained.

It is noticeable that triple metric learning is limited to the first embedding layer of the language model and does not rely on the model's subsequent architecture. The alteration of the word embedding introduces only a minor additional overhead during standard training and thus has minimal impact on training speed. Moreover, it is theoretically applicable to any language model. Furthermore, it should be noted that some human writing perturbations do not appear in the vocabulary of the BERT language model. Consequently, when these perturbations are encountered, BERT breaks them down into several subwords, which makes it difficult to determine the distance between words in the embedding space. To overcome this issue, these perturbed words are added to the model's vocabulary. The overall vocabulary will not change significantly due to the limited number of commonly used words and the inclusion of a small number of subwords.
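
As a concrete illustration of how the word-level triple loss of Eq. (3) can be folded into the objective of Eq. (4), the PyTorch sketch below operates on a toy embedding matrix and a toy classifier; the similar-word and negative-word lookups (S(w) and N) are stubbed with random indices, and all names and shapes are illustrative rather than the authors' released code.

import torch
import torch.nn.functional as F

def word_triple_loss(emb, w, sim_ids, neg_ids, alpha):
    # Eq. (3): mean distance to similar words minus the mean (clipped) distance
    # to randomly drawn non-similar words, plus the margin alpha.
    anchor = emb[w]
    d_pos = torch.norm(emb[sim_ids] - anchor, p=2, dim=1)
    d_neg = torch.norm(emb[neg_ids] - anchor, p=2, dim=1)
    return d_pos.mean() - torch.clamp(d_neg, max=alpha).mean() + alpha

# Toy setup: vocabulary of 1,000 words, 64-dim embeddings, a linear classifier on top of
# mean-pooled embeddings (standing in for the layers above the embedding layer).
V, D, C = 1000, 64, 2
emb = torch.nn.Embedding(V, D)
clf = torch.nn.Linear(D, C)

token_ids = torch.randint(0, V, (8, 16))     # a batch of 8 "sentences" of 16 tokens
labels = torch.randint(0, C, (8,))
alpha, beta, k = 0.7, 1.0, 8                 # hyperparameter names mirror Sect. 4.1; values here are placeholders

logits = clf(emb(token_ids).mean(dim=1))
ce = F.cross_entropy(logits, labels)         # the L_ce term of Eq. (4)

# Stubbed lookups: in MIC, S(w) would come from a synonym/perturbation dictionary
# and N from random sampling over the rest of the vocabulary.
tr_terms = []
for w in token_ids[0]:
    sim_ids = torch.randint(0, V, (k,))
    neg_ids = torch.randint(0, V, (k,))
    tr_terms.append(word_triple_loss(emb.weight, int(w), sim_ids, neg_ids, alpha))
loss = ce + beta * torch.stack(tr_terms).mean()   # Eq. (4)
loss.backward()

In a real BERT pipeline, the perturbed words mentioned above could be appended to the tokenizer vocabulary (e.g., via tokenizer.add_tokens followed by model.resize_token_embeddings in the Transformers library) so that each maps to a single row of the embedding matrix.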

3.2 Contrastive Learning Based on Semantic Negative Examples

Following triple metric learning, similar samples are represented similarly by the model, which can effectively prevent backdoor implantation. Nevertheless, due to the intricacy and variability of semantics, even a minor perturbation may result in a complete change of meaning, leading to what are often referred to as semantic negative examples. The metric-learning model may not be sensitive enough to such examples. To enhance the sensitivity of the trained model to fine-grained changes in semantics, this paper proposes to improve the model's robustness by performing contrastive learning based on semantic negative examples.

The idea of contrastive learning is to learn representations by pulling positive sample pairs together while pushing negative sample pairs apart. In this study, sentences conveying the same semantic meaning are regarded as positive sample pairs, whereas sentences with opposing semantic meanings are viewed as negative sample pairs. The model trainer is assumed to possess at least one dataset that is entirely clean and not contaminated. For any original sample x_ori in this dataset, two distinct sentence samples are generated, both of which undergo only minor alterations compared to the original sample but have vastly different semantics. One sample, denoted x_syn, is semantically very similar to the original sample, while the other sample, denoted x_ant, is semantically dissimilar or even contradictory to the original sample. Specifically, MIC employs spaCy to segment and lexically annotate the original sentence, extracting verbs, nouns, adjectives, and adverbs. The semantically similar sample x_syn is created by substituting the extracted words with synonyms or related words, while the semantic negative sample x_ant is produced by replacing the extracted words with antonyms or randomly selected words. In practice, 40% of the tokens are replaced in the semantically similar sample x_syn, while 20% of the tokens are replaced in the semantic negative sample x_ant.

A natural language model M maps a sequence of input tokens x = [x_1, ..., x_T] to a corresponding sequence of representations h = [h_1, ..., h_T], where h_i ∈ R^d for i ∈ [1, T] and d denotes the dimension of the representations:

h = M(x)    (5)
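
A rough sketch of how x_syn and x_ant could be generated is shown below, using spaCy for POS tagging and WordNet for synonyms/antonyms; this is an assumption-laden approximation of the procedure described above (the paper does not release this code), and falling back to the original token when no antonym exists simplifies the "antonym or random word" rule.

import random
import spacy
from nltk.corpus import wordnet   # requires a one-time nltk.download("wordnet")

nlp = spacy.load("en_core_web_sm")  # requires the small English spaCy model

def substitutes(word, antonym=False):
    # Collect WordNet synonyms (or antonyms) for a word; the set may be empty.
    words = set()
    for syn in wordnet.synsets(word):
        for lemma in syn.lemmas():
            if antonym:
                words.update(a.name() for a in lemma.antonyms())
            else:
                words.add(lemma.name())
    words.discard(word)
    return list(words)

def make_pair(sentence, syn_ratio=0.4, ant_ratio=0.2):
    # Build a semantically similar sample x_syn and a semantic negative x_ant by
    # replacing content words (verbs, nouns, adjectives, adverbs) at the stated ratios.
    doc = nlp(sentence)
    syn_toks, ant_toks = [], []
    for tok in doc:
        content = tok.pos_ in {"VERB", "NOUN", "ADJ", "ADV"}
        syn_cands = substitutes(tok.text) if content else []
        ant_cands = substitutes(tok.text, antonym=True) if content else []
        syn_toks.append(random.choice(syn_cands) if syn_cands and random.random() < syn_ratio else tok.text)
        ant_toks.append(random.choice(ant_cands) if ant_cands and random.random() < ant_ratio else tok.text)
    return " ".join(syn_toks), " ".join(ant_toks)

x_syn, x_ant = make_pair("the film is a genuinely moving experience")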

From the defender's perspective, the model is expected to accurately distinguish whether the semantics have changed following modifications to the original sample. Essentially, this means that in the feature space the metric between h_ori and h_syn should be relatively close, while the metric between h_ori and h_ant should be comparatively distant. To achieve this objective, this paper adopts a contrastive learning approach in which (x_ori, x_syn) denotes a positive sample pair and (x_ori, x_ant) a negative sample pair. The method uses h_c to denote the embedding of the special identifier [CLS]. Consequently, the metric between sentence representations is computed as the dot product between their [CLS] embeddings:

f(x*, x') = exp( (h_c*)^T h_c' )    (6)

The training loss for contrastive learning is defined as follows:

L = − Σ_{x ∈ X} log [ f(x_ori, x_syn) / ( f(x_ori, x_syn) + f(x_ori, x_ant) ) ]    (7)
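
A minimal sketch of Eqs. (6)-(7) on [CLS] embeddings is given below; the tensor shapes are illustrative, and the loss is written in a numerically stable but mathematically identical softplus form.

import torch
import torch.nn.functional as F

def cls_contrastive_loss(h_ori, h_syn, h_ant):
    # Eqs. (6)-(7): -log( exp(s_pos) / (exp(s_pos) + exp(s_neg)) ), where s_pos and s_neg are
    # the [CLS] dot products with the similar and negative samples.
    # softplus(s_neg - s_pos) is the same quantity, written to avoid overflow in exp().
    s_pos = (h_ori * h_syn).sum(dim=1)
    s_neg = (h_ori * h_ant).sum(dim=1)
    return F.softplus(s_neg - s_pos).sum()

# toy usage: [CLS] vectors for a batch of 4 sentences, hidden size 768
h_ori, h_syn, h_ant = torch.randn(4, 768), torch.randn(4, 768), torch.randn(4, 768)
loss = cls_contrastive_loss(h_ori, h_syn, h_ant)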

Unlike certain prior contrastive learning strategies that randomly sample multiple negative samples, the proposed method uses just one sample, x_ant, as the negative during training. This is because the defense training objective is to enhance the model's sensitivity to semantic changes caused by minor perturbations. Therefore, the method concentrates exclusively on negative samples generated according to our training objective, rather than randomly selecting samples from the corpus as negatives.

4 Experiments

This section evaluates MIC against two hidden word-level attacks and three defense baselines on three benchmark datasets, using the BERT_BASE and BERT_LARGE [7] models. As the proposed method involves the defender's participation in model training, the experiments are conducted in the setting of defense during the training phase.

4.1 Experimental Settings

Datasets. We assessed MIC and other baseline defense mechanisms on three benchmark datasets, namely SST-2 [26], OLID [35], and AG's News [36]; their statistics are listed in Table 1.

Table 1. Statistics of the three evaluation datasets. "Avg.#Words" is the average number of words per sentence in each dataset.

Dataset    | Task                              | Train   | Valid  | Test  | Avg.#Words | Classes
SST-2      | Sentiment Analysis                | 6,920   | 872    | 1,821 | 19.3       | 2 (Positive/Negative)
OLID       | Offensive Language Identification | 11,916  | 1,324  | 862   | 25.2       | 2 (Offensive/Not Offensive)
AG's News  | News Topic Classification         | 108,000 | 11,999 | 7,600 | 37.8       | 4 (World/Sports/Business/SciTech)

As shown in Table 1, SST-2 is a binary sentiment classification dataset comprising 6,920 movie reviews for training, 872 for validation, and 1,821 for testing. OLID is a binary offensive language classification dataset with 11,916 samples for training, 1,324 for validation, and 859 for testing. AG's News is a news category classification dataset with four categories, including 108,000 samples for training, 11,999 for validation, and 7,600 for testing.

Victim Models. We use two commonly employed text classification models as victim models, namely BERT_BASE and BERT_LARGE. BERT_BASE comprises 12 layers with a hidden size of 768 and a total of 110M parameters. For BERT_LARGE, we use bert-large-uncased from the Transformers library, which consists of 24 layers with a hidden size of 1024 and a total of 340M parameters.

Attack Methods. To conduct a comprehensive assessment of the defensive efficacy of MIC, we employ the two most surreptitious word-level backdoor attacks currently in use, namely synonym substitution (LWS) [24] and human writing perturbations (LRS) [13], as triggers.

Defense Baselines. We compare our approach with three baseline defense methods. BKI is a defense based on backdoor keyword identification: it scores the contribution of each word in a sample to the final classification outcome, computes each word's frequency of occurrence, and combines the importance score with the frequency to filter out dubious words. ONION is a defense that relies on outlier detection: it judges whether a word is a backdoor trigger based on the change in sentence perplexity before and after removing each word in the sample, and the word with the most significant perplexity change is deemed the trigger word to be eliminated. RAP is a robustness-aware online defense: it applies a slight perturbation to the input samples and measures the change in the poisoned model's output probability before and after the perturbation; samples whose change is below a threshold are deemed poisoned and filtered out.

Evaluation Metrics. To compare the effectiveness of the proposed method with the baselines, the experiments employ conventional evaluation metrics consistent with prior research [5]: clean sample accuracy (CACC), attack success rate (ASR), change in clean sample accuracy (ΔCACC), and change in attack success rate (ΔASR). CACC refers to the classification accuracy of the victim model on clean samples, while ASR refers to the classification accuracy of the victim model on poisoned samples.
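
For reference, CACC and ASR reduce to plain accuracy computed on different evaluation sets. The sketch below assumes a model returning logits and dataloaders yielding (input, label) batches; the loader names are hypothetical.

import torch

def accuracy(model, loader, device="cpu"):
    # Fraction of samples whose argmax prediction matches the provided label, in percent.
    model.eval()
    correct = total = 0
    with torch.no_grad():
        for x, y in loader:
            pred = model(x.to(device)).argmax(dim=1).cpu()
            correct += (pred == y).sum().item()
            total += y.numel()
    return 100.0 * correct / total

# CACC: accuracy on clean test samples.
# ASR: accuracy on poisoned samples whose labels are set to the attacker's target label.
# cacc = accuracy(model, clean_loader)
# asr = accuracy(model, poisoned_loader)
# delta_cacc, delta_asr = cacc_after - cacc_before, asr_after - asr_before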


Table 2. Comparison of the defensive effects of various methods on two BERT models (changes relative to "w/o defense" in parentheses).

BERT_BASE
Dataset   | Method      | null CACC | LWS CACC      | LWS ASR        | LRS CACC      | LRS ASR
SST-2     | w/o defense | 91.10     | 88.62         | 97.28          | 90.28         | 99.78
SST-2     | BKI         | 91.16     | 87.03(−1.59)  | 95.32(−1.96)   | 88.34(−1.94)  | 85.45(−14.33)
SST-2     | ONION       | 91.71     | 87.33(−1.29)  | 92.94(−4.34)   | 85.50(−4.78)  | 52.68(−37.10)
SST-2     | RAP         | 91.93     | 83.79(−4.83)  | 80.24(−17.04)  | 84.56(−5.72)  | 68.37(−31.41)
SST-2     | MIC         | 91.67     | 90.24(+1.62)  | 16.01(−81.27)  | 89.84(−0.44)  | 19.75(−80.03)
OLID      | w/o defense | 82.91     | 82.93         | 97.14          | 82.79         | 100
OLID      | BKI         | 82.82     | 81.63(−1.3)   | 94.39(−2.75)   | 81.62(−1.17)  | 90.23(−9.77)
OLID      | ONION       | 82.76     | 80.22(−2.71)  | 92.67(−4.47)   | 81.16(−1.63)  | 64.58(−35.42)
OLID      | RAP         | 82.97     | 78.34(−4.59)  | 75.62(−21.52)  | 77.87(−4.92)  | 65.29(−34.71)
OLID      | MIC         | 82.86     | 82.77(−0.16)  | 22.59(−74.55)  | 82.69(−0.1)   | 25.71(−74.29)
AG's News | w/o defense | 93.10     | 92.02         | 99.63          | 92.76         | 99.96
AG's News | BKI         | 93.06     | 91.47(−0.55)  | 96.53(−3.1)    | 91.96(−0.8)   | 92.54(−7.42)
AG's News | ONION       | 92.78     | 90.71(−1.31)  | 95.31(−4.32)   | 90.83(−1.93)  | 67.74(−32.22)
AG's News | RAP         | 93.03     | 87.27(−4.75)  | 76.37(−23.26)  | 86.98(−5.78)  | 64.39(−35.57)
AG's News | MIC         | 93.01     | 92.35(+0.33)  | 34.95(−64.68)  | 92.83(+0.07)  | 36.21(−63.75)

BERT_LARGE
Dataset   | Method      | null CACC | LWS CACC      | LWS ASR        | LRS CACC      | LRS ASR
SST-2     | w/o defense | 92.50     | 90.02         | 97.42          | 92.20         | 99.78
SST-2     | BKI         | 92.47     | 88.21(−1.81)  | 95.73(−1.69)   | 88.83(−3.37)  | 83.28(−16.5)
SST-2     | ONION       | 92.33     | 87.03(−2.99)  | 93.24(−4.18)   | 85.38(−6.82)  | 56.91(−42.87)
SST-2     | RAP         | 92.73     | 83.79(−6.23)  | 79.03(−18.39)  | 83.49(−8.71)  | 70.14(−29.64)
SST-2     | MIC         | 92.67     | 90.66(+0.64)  | 18.97(−78.45)  | 90.06(−2.14)  | 25.71(−74.07)
OLID      | w/o defense | 82.81     | 81.42         | 97.93          | 81.16         | 100
OLID      | BKI         | 82.82     | 80.03(−1.39)  | 95.83(−2.1)    | 80.29(−0.87)  | 84.37(−15.63)
OLID      | ONION       | 82.76     | 79.52(−1.9)   | 95.23(−2.7)    | 79.55(−1.61)  | 62.09(−37.91)
OLID      | RAP         | 82.97     | 73.29(−8.13)  | 77.35(−20.58)  | 72.93(−8.23)  | 61.27(−38.73)
OLID      | MIC         | 82.86     | 81.93(+0.51)  | 25.73(−72.2)   | 81.65(+0.49)  | 23.38(−76.62)
AG's News | w/o defense | 94.24     | 92.62         | 99.53          | 93.64         | 99.96
AG's News | BKI         | 94.16     | 91.52(−1.1)   | 96.13(−3.4)    | 91.89(−1.75)  | 93.01(−6.95)
AG's News | ONION       | 93.92     | 92.21(−0.41)  | 96.20(−3.33)   | 91.83(−1.81)  | 64.44(−35.52)
AG's News | RAP         | 94.33     | 86.46(−6.16)  | 73.89(−25.64)  | 87.34(−6.3)   | 62.38(−37.58)
AG's News | MIC         | 94.09     | 92.17(−0.45)  | 29.63(−69.9)   | 93.27(−0.37)  | 30.19(−69.77)

The ΔCACC denotes the variation in the classification accuracy of the victim model on clean samples before and after applying the defense; a smaller change indicates that the defense method has less impact on the normal performance of the model. The ΔASR denotes the variation in the classification accuracy of the victim model on poisoned samples before and after applying the defense; a larger change signifies a more effective defense.

Implementation Details. For the attack method based on synonym replacement, the experiments employ the settings of Qi [24]: the poisoning rate is set to 10%, the batch size to 32, the learning rate to 2e-5, and a maximum of 5 candidates for word replacement. Comparable experimental settings are used for the attack method based on human writing perturbations. For the proposed defense method MIC, k is set to 8, α is set to 0.7·α_0 (where α_0 denotes the average word distance of the initial word embedding before training, which is 1.48 for the BERT model), and β is set to 1, thereby balancing the standard training loss and the triple metric learning loss. For the other three defense methods, we use the experimental settings presented in their respective papers. For BKI and RAP, a poisoned model is trained using 20% of the training dataset (including the poisoned data) to filter and process the training dataset.

4.2 Main Results

Performance of Defense. This subsection presents the defense effectiveness of the aforementioned defense methods. Table 2 shows the defense effectiveness of the various methods on two representative natural language models, BERT_BASE and BERT_LARGE. The best defensive performance, as well as the best normal classification performance, is highlighted in bold, and the second-best results are underlined. Additionally, the magnitude of the change in attack success rate and in clean sample classification accuracy before and after the defense is given in parentheses.

Based on the results, it can be observed that the proposed MIC achieves the best defense effectiveness across all datasets while maximizing the classification accuracy of the model on normal samples.

It is not surprising that backdoor defense methods such as BKI and ONION, which were originally designed to filter out rare words, perform poorly. BKI assumes that there is only a fixed number of trigger words in the poisoned dataset, so that triggers can be effectively filtered out by computing word importance and combining it with word frequency. However, the triggers in word-substitution backdoor attacks are far more diverse, and there may be more than one trigger in a sample, so only a small fraction of triggers can be filtered out with the original method. ONION is a targeted defense that relies on the observation that traditional backdoor triggers cause a significant change in sample perplexity. However, synonym substitution has little impact on sentence perplexity before and after the substitution, rendering the method ineffective; even human writing perturbations have only a small effect on perplexity compared to synonyms, resulting in unsatisfactory defense effectiveness. Additionally, both BKI and ONION, which rely on filtering triggers, inevitably remove normal words while identifying and filtering suspicious triggers, leading to a slight decrease in the classification accuracy of the defended model on clean samples.

Regarding RAP, it is a robustness-aware defense that has shown strong effectiveness against traditional backdoor attacks. However, when facing word-level attacks based on the two highly stealthy and trigger-rich methods used in our experiments, the boundary between poisoned and clean samples is not clear-cut. RAP therefore identifies some poisoned samples as clean and some clean samples as poisoned, which explains why RAP significantly reduces the model's classification accuracy on clean samples in Table 2. Furthermore, if defenders want to further improve the defense effectiveness, they need to widen the threshold on the output-probability change, which causes even more clean samples to be labeled as toxic.

The proposed MIC defense is based on a different idea: prevention is better than cure. Rather than identifying and filtering suspicious triggers in scenarios where model training is accessible, it is better to prevent backdoors from being implanted at the source to the greatest extent possible. The experimental results confirm the feasibility and effectiveness of using metric learning for defense. Additionally, introducing contrastive learning has been experimentally proven to be effective in preserving the classification accuracy of the trained model on clean samples.

4.3 Ablation Experiment

To study the impact of metric learning and contrastive learning on the final defense effectiveness of the proposed method, ablation experiments were conducted. The following variants were designed and their defense effects studied: 1) w/o CL denotes the method without contrastive learning, in contrast to MIC; 2) w/o ML denotes the method without triple metric learning, in contrast to MIC.


Table 3. The defensive effects of the different variant methods on BERT_BASE (changes relative to "w/o defense" in parentheses).

Dataset   | Method      | null CACC | LWS CACC      | LWS ASR        | LRS CACC      | LRS ASR
SST-2     | w/o defense | 91.10     | 88.62         | 97.28          | 90.28         | 99.78
SST-2     | MIC         | 91.67     | 90.24(+1.62)  | 16.01(−81.27)  | 89.84(−0.44)  | 19.75(−80.03)
SST-2     | w/o CL      | 90.03     | 85.25(−3.37)  | 13.92(−83.36)  | 84.25(−6.03)  | 14.34(−85.44)
SST-2     | w/o ML      | 92.21     | 91.34(+2.72)  | 94.33(−2.95)   | 91.32(+1.04)  | 95.63(−4.15)
OLID      | w/o defense | 82.91     | 82.93         | 97.14          | 82.79         | 100
OLID      | MIC         | 82.86     | 82.77(−0.16)  | 22.59(−74.55)  | 82.69(−0.1)   | 25.71(−74.29)
OLID      | w/o CL      | 81.26     | 78.62(−4.31)  | 18.52(−78.62)  | 79.24(−3.55)  | 20.32(−79.68)
OLID      | w/o ML      | 83.86     | 83.22(+0.29)  | 95.53(−1.61)   | 83.89(+1.1)   | 94.66(−5.34)
AG's News | w/o defense | 93.10     | 92.02         | 99.63          | 92.76         | 99.96
AG's News | MIC         | 93.01     | 92.35(+0.33)  | 34.95(−64.68)  | 92.83(+0.07)  | 36.21(−63.75)
AG's News | w/o CL      | 91.36     | 87.63(−4.39)  | 28.53(−71.1)   | 87.82(−4.94)  | 27.93(−72.03)
AG's News | w/o ML      | 94.63     | 94.04(+2.02)  | 92.43(−7.2)    | 93.83(+1.07)  | 91.43(−8.53)

Table 3 displays the defense results of the three methods on the various datasets. The best results are emphasized in bold and the second-best results are underlined. Based on the experimental results, it can be observed that introducing triple metric learning improves defense performance: the metric-learning-only variant (w/o CL) achieves satisfactory defense against the two word-level backdoor attacks, with an average 78.37% decrease in attack success rate. However, using only metric learning also leads to an average 4.43% decrease in clean sample classification accuracy, confirming the previously mentioned concern that metric learning may decrease the model's sensitivity to semantic changes. Introducing contrastive learning compensates for this shortcoming: the contrastive-learning-only variant (w/o ML) exhibits the best clean sample classification accuracy in all experimental settings. This demonstrates that contrastive learning can significantly improve the model's sensitivity to semantic changes.

4.4 Hyper-parameter Study

This subsection explores the impact of hyperparameters on the defense results. MIC mainly involves two hyperparameters: α, which constrains the distance between anchor words and non-similar words in the word-level triple loss, and β, which controls the weight of the word-level triple loss in the overall training objective.

Analysis of the Impact of Hyperparameter α. Figure 2 displays the performance for different values of α across the three datasets, where CACC refers to the classification accuracy of the defended model on clean samples and ASR to its classification accuracy on poisoned samples.


Fig. 2. Impact of hyperparameter α on the performance of MIC’s BERT model across three datasets.

Fig. 3. Impact of hyperparameter β on the performance of MIC’s BERT model across three datasets.

The value of β was fixed at 1, while the value of α was varied from 0 to 1.2·α_0 in order to investigate how the size of α impacts the overall effectiveness of the defense method. The highest ASR was obtained when α was set to 0. This may be attributed to polysemy: multiple words cluster together because of a shared meaning, making it difficult for the model to distinguish between them even when these words carry other meanings in a sample. As the value of α increased, the ASR first decreased and then stabilized. In contrast, the CACC exhibited slight fluctuations as α changed but remained largely consistent overall, as the subsequent use of contrastive learning provides some assurance on the value of CACC. Therefore, 0.7·α_0 was chosen as the final value for all three datasets.

Analysis of the Impact of Hyperparameter β. Likewise, with α fixed at 0.7·α_0, the effect of β on the performance of the method was examined by varying its value from 10^−3 to 10^3. Figure 3 displays the performance for different values of β across the three datasets, with the same two metrics as above. The CACC decreases slightly as β increases, while the ASR first decreases sharply and then levels off. Once β reaches 10^0, the ASR changes little, indicating that the defense has achieved its maximum effect. As a result, β = 1 is chosen to strike a balance between CACC and ASR.
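
Since α_0 is defined as the average distance between word vectors of the initial embedding layer, it can be estimated directly from a pretrained checkpoint. The sketch below is a Monte-Carlo estimate using the Transformers library; it is an illustration of the definition, not the authors' measurement script.

import torch
from transformers import AutoModel

model = AutoModel.from_pretrained("bert-base-uncased")
emb = model.get_input_embeddings().weight.detach()   # (V, d) initial word embeddings

# Monte-Carlo estimate of alpha_0, the mean Euclidean distance between random word pairs.
idx_a = torch.randint(0, emb.size(0), (20000,))
idx_b = torch.randint(0, emb.size(0), (20000,))
alpha_0 = torch.norm(emb[idx_a] - emb[idx_b], dim=1).mean().item()

alpha = 0.7 * alpha_0   # the margin used by MIC; the paper reports alpha_0 = 1.48 for BERT
beta = 1.0              # weight of the triple-loss term in Eq. (4)
print(alpha_0, alpha)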


5 Conclusion

This paper addresses the challenge that existing backdoor defense methods face with word-level backdoor attacks, particularly in scenarios where the defender has access to model training. To this end, we propose a backdoor defense method called MIC that is based on metric learning and contrastive learning. The method first incorporates triple metric learning into the standard model training process to modify the embedding layer of the model. This enables similar words not only to appear visually similar but also to have similar vectors after being mapped by the model, thus preventing backdoor implantation at the source. Additionally, to overcome the reduced sensitivity of the model to semantic changes brought about by metric learning, we introduce contrastive learning based on semantic negative examples after training, which enhances the model's robustness against subtle perturbations that cause semantic changes.

The experiments were conducted on three classical datasets, namely SST-2, OLID, and AG's News, following the dataset division described above. Two of the most covert word-level backdoor attacks, LWS and LRS, were implemented on two commonly used language models, BERT_BASE and BERT_LARGE, and the defense method proposed in this paper was compared with other representative defense methods. The results indicate that the proposed method performs best in terms of backdoor defense effectiveness, with an average attack success rate reduction of 73.5%. Furthermore, whereas the comparative methods decreased the classification accuracy on clean samples by 1.23%, 2.28%, and 5.10%, the method proposed in this paper exhibited minimal change in clean sample classification accuracy; in fact, it even increased by 0.22% relative to the undefended attacked model. This implies that MIC not only has a good defense effect but also maximally retains the normal classification performance of the model.

Acknowledgements. This work was supported in part by the Jiangsu Province Modern Education Technology Research 2021 Smart Campus Special Project "Research and Practice of Data Security System in College Smart Campus Construction" (2021-R-96776); the Jiangsu Province University Philosophy and Social Science Research Major Project "Research on the Construction of Ideological and Political Selective Compulsory Courses in Higher Vocational Colleges" (2022SJZDSZ011); the Research Project of Nanjing Industrial Vocational and Technical College (2020SKYJ03); and the 2022 project "Research on the Teaching Reform of High-quality Public Courses in Jiangsu Province Colleges and Universities" on the theory and practice of labor education courses in Jiangsu higher vocational colleges in the new era.

References

1. Azizi, A., et al.: T-Miner: a generative approach to defend against trojan attacks on DNN-based text classification. In: 30th USENIX Security Symposium (USENIX Security 21), pp. 2255–2272 (2021)
2. Chen, C., Dai, J.: Mitigating backdoor attacks in LSTM-based text classification systems by backdoor keyword identification. Neurocomputing 452, 253–262 (2021)
3. Chen, X., Dong, Y., Sun, Z., Zhai, S., Shen, Q., Wu, Z.: Kallima: a clean-label framework for textual backdoor attacks. arXiv preprint arXiv:2206.01832 (2022)
4. Chen, X., Salem, A., Backes, M., Ma, S., Zhang, Y.: BadNL: backdoor attacks against NLP models. In: ICML 2021 Workshop on Adversarial Machine Learning (2021)
5. Cui, G., Yuan, L., He, B., Chen, Y., Liu, Z., Sun, M.: A unified evaluation of textual backdoor learning: frameworks and benchmarks. In: Advances in Neural Information Processing Systems, vol. 35, pp. 5009–5023 (2022)
6. Dai, J., Chen, C., Li, Y.: A backdoor attack against LSTM-based text classification systems. IEEE Access 7, 138872–138878 (2019)
7. Devlin, J., Chang, M.W., Lee, K., Toutanova, K.: BERT: pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805 (2018)
8. Doan, B.G., Abbasnejad, E., Ranasinghe, D.C.: Februus: input purification defense against trojan attacks on deep neural network systems. In: ACSAC 2020, pp. 897–912 (2020)
9. Gao, Y., Xu, C., Wang, D., Chen, S., Ranasinghe, D.C., Nepal, S.: STRIP: a defence against trojan attacks on deep neural networks. In: ACSAC 2019, pp. 113–125 (2019)
10. Garg, S., Kumar, A., Goel, V., Liang, Y.: Can adversarial weight perturbations inject neural backdoors. In: CIKM 2020, pp. 2029–2032 (2020)
11. Garg, S., Ramakrishnan, G.: BAE: BERT-based adversarial examples for text classification. arXiv preprint arXiv:2004.01970 (2020)
12. Gu, T., Dolan-Gavitt, B., Garg, S.: BadNets: identifying vulnerabilities in the machine learning model supply chain. arXiv preprint arXiv:1708.06733 (2017)
13. Le, T., Lee, J., Yen, K., Hu, Y., Lee, D.: Perturbations in the wild: leveraging human-written text perturbations for realistic adversarial attack and defense. arXiv preprint arXiv:2203.10346 (2022)
14. Li, Y., Lyu, X., Koren, N., Lyu, L., Li, B., Ma, X.: Neural attention distillation: erasing backdoor triggers from deep neural networks. In: International Conference on Learning Representations (2021)
15. Li, Y., Zhai, T., Wu, B., Jiang, Y., Li, Z., Xia, S.: Rethinking the trigger of backdoor attack (2021)
16. Liao, C., Zhong, H., Squicciarini, A., Zhu, S., Miller, D.: Backdoor embedding in convolutional neural network models via invisible perturbation. arXiv preprint arXiv:1808.10307 (2018)
17. Liu, Y., et al.: Trojaning attack on neural networks (2017)
18. Liu, Y., Shen, G., Tao, G., An, S., Ma, S., Zhang, X.: PICCOLO: exposing complex backdoors in NLP transformer models. In: 2022 IEEE Symposium on Security and Privacy (SP), pp. 2025–2042 (2022)
19. Liu, Y., Ma, X., Bailey, J., Lu, F.: Reflection backdoor: a natural backdoor attack on deep neural networks. In: ECCV 2020, LNCS, vol. 12355, pp. 182–199. Springer, Cham (2020)
20. Lyu, W., Zheng, S., Ma, T., Chen, C.: A study of the attention abnormality in Trojaned BERTs. In: NAACL-HLT 2022, pp. 4727–4741 (2022)
21. Qi, F., Chen, Y., Li, M., Yao, Y., Liu, Z., Sun, M.: ONION: a simple and effective defense against textual backdoor attacks. In: EMNLP 2021, pp. 9558–9566 (2021)
22. Qi, F., Chen, Y., Zhang, X., Li, M., Liu, Z., Sun, M.: Mind the style of text! Adversarial and backdoor attacks based on text style transfer. arXiv preprint arXiv:2110.07139 (2021)
23. Qi, F., et al.: Hidden killer: invisible textual backdoor attacks with syntactic trigger. arXiv preprint arXiv:2105.12400 (2021)
24. Qi, F., Yao, Y., Xu, S., Liu, Z., Sun, M.: Turn the combination lock: learnable textual backdoor attacks via word substitution. arXiv preprint arXiv:2106.06361 (2021)
25. Saha, A., Subramanya, A., Pirsiavash, H.: Hidden trigger backdoor attacks. In: AAAI, vol. 34, pp. 11957–11965 (2020)
26. Socher, R., et al.: Recursive deep models for semantic compositionality over a sentiment treebank. In: EMNLP 2013, pp. 1631–1642 (2013)
27. Sun, L.: Natural backdoor attack on text data (2021)
28. Wang, B., et al.: Neural cleanse: identifying and mitigating backdoor attacks in neural networks. In: 2019 IEEE Symposium on Security and Privacy (SP), pp. 707–723 (2019)
29. Wang, D., Ding, N., Li, P., Zheng, H.: CLINE: contrastive learning with semantic negative examples for natural language understanding. In: ACL-IJCNLP 2021, pp. 2332–2342 (2021)
30. Xu, X., Wang, Q., Li, H., Borisov, N., Gunter, C.A., Li, B.: Detecting AI trojans using meta neural analysis. In: 2021 IEEE Symposium on Security and Privacy (SP), pp. 103–120 (2021)
31. Yang, W., Li, L., Zhang, Z., Ren, X., Sun, X., He, B.: Be careful about poisoned word embeddings: exploring the vulnerability of the embedding layers in NLP models. In: NAACL-HLT 2021, pp. 2048–2058 (2021)
32. Yang, W., Lin, Y., Li, P., Zhou, J., Sun, X.: RAP: robustness-aware perturbations for defending against backdoor attacks on NLP models. In: EMNLP 2021, pp. 8365–8381 (2021)
33. Yang, W., Lin, Y., Li, P., Zhou, J., Sun, X.: Rethinking stealthiness of backdoor attack against NLP models. In: ACL-IJCNLP 2021, pp. 5543–5557 (2021)
34. Yang, Y., Wang, X., He, K.: Robust textual embedding against word-level adversarial attacks. In: UAI 2022, PMLR, vol. 180, pp. 2214–2224 (2022)
35. Zampieri, M., Malmasi, S., Nakov, P., Rosenthal, S., Farra, N., Kumar, R.: Predicting the type and target of offensive posts in social media. arXiv preprint arXiv:1902.09666 (2019)
36. Zhang, X., Zhao, J., LeCun, Y.: Character-level convolutional networks for text classification. In: Advances in Neural Information Processing Systems, vol. 28 (2015)
37. Zhao, S., Ma, X., Zheng, X., Bailey, J., Chen, J., Jiang, Y.G.: Clean-label backdoor attacks on video recognition models. In: CVPR, pp. 14443–14452 (2020)

Active Learning for Open-Set Annotation Using Contrastive Query Strategy

Peng Han1,3, Zhiming Chen1(B), Fei Jiang2, and Jiaxin Si4

1 School of Software Engineering, Chongqing University of Posts and Telecommunications, Chongqing, China
[email protected]
2 East China Normal University, Shanghai, China
3 Chongqing Academy of Science and Technology, Chongqing, China
4 Chongqing Qulian Digital Technology Co., Ltd., Chongqing, China

Abstract. Active learning has achieved remarkable success in minimizing labeling costs for classification tasks in which all data samples are drawn from known classes. However, in real scenarios, most active learning methods fail when encountering the open-set annotation (OSA) problem, i.e., numerous samples from unknown classes. The main reason for such failure is that existing query strategies unavoidably select unknown-class samples. To tackle this problem and select the most informative samples, we propose a novel active learning framework named OSA-CQ, which simplifies the detection of samples from known classes and enhances classification performance with an effective contrastive query strategy. Specifically, OSA-CQ firstly adopts an auxiliary network to distinguish samples using confidence scores, which can dynamically select the samples with the highest probability of belonging to known classes from the unlabeled set. Secondly, by comparing the predictions of the auxiliary network, the classifier, and the feature similarity, OSA-CQ designs a contrastive query strategy to select the most informative samples from the known classes in the unlabeled set. Experimental results on CIFAR10, CIFAR100 and Tiny-ImageNet show that the proposed OSA-CQ can select known-class samples with high information content, and achieves higher classification performance with lower annotation cost than state-of-the-art active learning algorithms.

Keywords: Active Learning · Open-set Annotation · Contrastive Query Strategy

1 Introduction

The past decades have witnessed the rapid growth of deep learning and its widespread applications. The success of deep learning heavily depends on large amounts of annotated, public datasets [1,2]. However, obtaining large-scale datasets is extremely time-consuming, labor-intensive and expensive. Therefore, it is challenging to achieve satisfactory results with limited labeled datasets in practice.


Active learning (AL) has received considerable attention since it pursues minimal annotation by selecting the most informative samples while achieving competitive performance. Most existing AL methods focus on the closed-set setting, in which the labeled and unlabeled samples are both drawn from a pre-defined domain with the same class distribution. However, this assumption does not always hold in real scenarios. Unlabeled sets are mostly collected through rather casual data curation processes such as web crawling and contain a large number of samples from unknown classes; this is referred to as the open-set setting, as shown in Fig. 1. The corresponding problem for active learning is open-set annotation (OSA).

Fig. 1. Open-set example. The unlabeled set contains not only the target class data (“birds”, “airplanes”) but also a large number of unknown class data (“cars”, “horses”, etc.).

The query strategies of most existing closed-set AL methods typically prioritize the selection of data exhibiting high uncertainty. In OSA, these methods tend to select irrelevant samples from unknown classes, which leads to a waste of annotation budget [19]. Hence, an effective and efficient AL method that can select informative samples from the known classes is highly desirable in the real world. In this paper, we propose a novel open-set active learning method called OSA-CQ that can precisely identify the samples of unknown classes while querying the most informative ones from the desired classes for annotation. Firstly, we adopt an auxiliary detector to distinguish samples drawn from known and unknown classes by sorting confidence scores. Secondly, we define the informative samples of the known classes in the unlabeled set as those located near the classification decision boundaries. To effectively select informative samples, we introduce a contrastive query strategy based on the prediction inconsistency among the detector, the classifier, and the feature similarity. Samples with inconsistent predictions are usually hard for classifiers to recognize and frequently lie near the classifiers' decision boundaries; they are the informative ones. Our main contributions are summarized as follows:


– We propose a novel AL framework for solving OSA problems, which can effectively distinguish samples of known classes while selecting the informative ones.
– We address informative sample selection from the known classes by introducing a contrastive query based on prediction inconsistency, which identifies informative samples as those located near the classifiers' decision boundaries.
– Experimental results on CIFAR10, CIFAR100 and Tiny-ImageNet demonstrate the efficiency and effectiveness of the proposed method.

2 Related Work

Active Learning. AL primarily focuses on data-level investigation, hence the name query learning [3]. AL's goal is to actively select high-value data for labeling, thereby maximizing model performance with a small amount of labeled data. AL strategies are commonly grouped into uncertainty-based, data distribution-based, model parameter change-based, and committee-based approaches. The uncertainty-based query strategy computes uncertainty scores for samples in the unlabeled set using a custom scoring function [4–7]. For instance, [4] suggested using entropy as a measure, while [5] used the difference between the highest and second-highest prediction probabilities. Addressing the issue of overly confident predictions, NCE-Net [7] replaced softmax classifiers with nearest-neighbor classifiers. These methods rely entirely on the predicted class probabilities and ignore the value of the feature representation. Query strategies based on data distribution offer more generalizability than uncertainty-based approaches [8,9]; the most representative is CoreSet [8], which formulates active learning as selecting a core set of representative instances. Parameter-variation-based query strategies account for the impact of data on model parameters: LL4AL [10] introduced a loss prediction module to directly predict loss values for data in the unlabeled set, and [11] proposed a gradient-based importance measure and analyzed the relative importance of images in a dataset. MI-AOD [14] adopted a committee-based approach, selecting target data using a committee of two classifiers. Recently, several alternative AL methods based on different strategies have emerged, including generating data at decision boundaries [12], employing a VAE to capture distributional differences between labeled and unlabeled sets [13], AL approaches using graph convolutional neural networks [15,16], reinforcement learning-based AL [17] for automatic query strategy design [18], AL for handling class mismatch [19,20], and AL addressing class imbalance [21].

Open-Set Recognition. The open-set recognition task addresses the challenge of encountering, in the test set, new classes of data that are not seen during training. It requires training a recognition model capable of accurately classifying known classes while effectively handling unknown classes and providing a rejection option when a test sample belongs to an unknown class [22].


The OpenMax model proposed by [23] was the first deep neural network solution for open-set recognition. It represents each known class by an average activation vector, computes the distance between training samples and the corresponding class average activation vector, and fits a Weibull distribution to each class. During testing, the activation values of test data are evaluated based on the Weibull distribution of each class. [24] introduced two novel losses, the Entropic Open-Set loss and the Objectosphere loss, which are combined with softmax to enable effective open-set recognition. However, the aforementioned methods primarily focus on discriminating known classes from unknown classes and do not specifically address the AL aspect of selecting information-rich data.

Open-Set AL. There are currently two main lines of research on open-set AL [19,20]. LfOSA [19] is similar to OpenMax in that it utilizes the largest activation vector to fit a Gaussian mixture distribution for each class and selects samples based on the confidence of the Gaussian mixture model. It focuses exclusively on the open-set recognition problem and does not delve into the details of AL. CCAL [20] leverages contrastive learning to extract semantic and distinctive features and incorporates them into a query strategy for selecting the most informative known unlabeled samples.

3 Methodology

3.1 Problem Definition

Take the C-class classification problem as an illustration. Initially, a labeled set D_0^l = {(x_i, y_i)} is initialized with a small amount of data, where x_i ∈ X represents the input and y_i ∈ Y denotes the corresponding class label, and a model f(·) with parameters θ is trained after oracle labeling. Then, based on a query policy α, k samples S_k = {(x_i, y_i)} are selected from the unlabeled set D_0^u and sent to the oracle for labeling. Once labeling is completed, the known-class data in S_k are added to the current labeled set, resulting in D_{j+1}^l, and the model f(·) is trained again for the next iteration. Regarding the auxiliary detector, OSA-CQ enhances its learning by assigning a pseudo-label of class C to the selected unknown-class data: D_{j+1}^l = D_j^l ∪ {(x_i, y_i) | y_i ∈ y_kno} and D_{j+1}^inval = D_j^inval ∪ {(x_i, y_i) | y_i ∈ y_unkno}, where y_kno represents the known class labels and y_unkno the unknown class labels. The invalid set, denoted D^inval, is initialized as an empty set in the first iteration. The effectiveness of each AL iteration in selecting known-class data is measured by recall, defined as follows:

recall_i = ( |S_query^kno| + |D_kno^l| ) / ( |S_query^kno| + |D_kno^l| + |D_kno^u| )    (1)

where recall_i denotes the recall of active sampling for the i-th AL cycle, i.e., the percentage of known-class samples labeled up to the current AL cycle; |D_kno^l| denotes the number of known-class samples currently in the training set, and |D_kno^u| denotes the number of known-class samples in the unlabeled set.
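
A tiny helper makes the definition concrete; the argument names are ours and the numbers below are purely illustrative.

# Recall of Eq. (1): the fraction of all known-class samples (already labeled plus
# still unlabeled) that have been labeled after this query round.
def al_recall(num_query_known, num_labeled_known, num_unlabeled_known):
    labeled = num_query_known + num_labeled_known
    return labeled / (labeled + num_unlabeled_known)

print(al_recall(300, 1000, 8700))   # 1300 / 10000 = 0.13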

3.2 Algorithm Detail

Fig. 2. The OSA-CQ framework consists of two stages. (1) Training of classifier and detector. (2) Active sampling of unlabeled sets.

The OSA-CQ framework, depicted in Fig. 2, comprises three key components: the detector, the classifier, and the active query strategy. Initially, the raw dataset is randomly divided into two subsets: the labeled set and the unlabeled set. The classifier is trained using the labeled set, while the labeled set and the invalid set (initialized as the empty set) form the detector's training data. The role of the detector is twofold: 1) to rank the maximum probability values in order to distinguish between known and unknown classes, and 2) to provide prediction results for the inconsistency analysis, aiding the selection of informative data. The detector's confidence ranking of the unlabeled data determines the order in which prediction inconsistencies are judged; based on this ranking, OSA-CQ actively selects k examples for the oracle to label. In the subsequent paragraphs, we elaborate on the specific details of these three components.

Detector. The detector is an auxiliary network in OSA-CQ. It treats all unknown-class data within the open set as a single distinct category. Therefore, the detector extends the classifier, which originally comprises C classes, with an additional (C + 1)-th class for predicting the probability of the unknown class. The loss function of the detector is defined in Eq. (2):

L_d(x, y) = − Σ_{i=1}^{C+1} y_i log(p_i)    (2)

where

p_i = exp(a_i / T) / Σ_j exp(a_j / T)    (3)

where a_i is the i-th output of the last fully connected layer of the detector and T is the temperature of softmax with temperature [25], which is used to adjust the sharpness of the probability distribution after the softmax activation. Since unknown-class data typically comprise a mixture of various classes, training these unknown classes as a single class often leads to unsatisfactory results. We found that applying a smaller temperature value sharpens the probability distribution after activation, mitigating the softmax over-confidence problem. For known-class inputs this results in larger activation values for the first C classes (known classes) and a smaller activation value for the (C + 1)-th class (unknown class), and the opposite for unknown-class inputs. Consequently, the detector outperforms the classifier in distinguishing known from unknown data, and its main role is to filter out the unknown-class data for the classifier.

Classifier. The classifier focuses solely on identifying the known classes of the open set. We train the classifier on the labeled data using the standard cross-entropy loss over the C known classes:

L_c(x, y) = − Σ_{i=1}^{C} y_i log(f_θc(x))    (4)

where θ_c denotes the parameters of the classifier and (x, y) ∈ D^l.

AL Sampling. After training, the detector can effectively distinguish whether a sample in the unlabeled set belongs to a known or an unknown class based on its predicted maximum probability value p_i = max_c a_i^c. To identify the known classes, OSA-CQ arranges the p_i values of the unlabeled data in descending order, ensuring that the data at the beginning of the sequence are the known classes identified by the model. According to the theoretical analysis of active learning, data points at the decision boundary are the targets of AL [3]. OSA-CQ uses a committee consisting of two members, the classifier and the detector, and makes decisions based on the predictions of both. To ensure the accuracy of selecting data at the decision boundary, the detector contributes two types of prediction results. The first type is the output of the softmax-with-temperature activation. The second type is obtained by calculating the feature similarity between the unlabeled and labeled samples:

pro_eur = argmax_c δ(−d(f_x, f_c))    (5)

where f_x is the output of the last fully connected layer of the detector for an unlabeled example x, f_c denotes the representative vector of class c, obtained by averaging the feature vectors labeled as c in the labeled set, δ(·) is a nonlinear activation function that projects the distance-based similarity into [0, 1], and d(·) is the distance function. After obtaining the prediction result pro_c of the classifier, the prediction result pro_d of the detector, and the prediction result pro_eur of the distance similarity, the AL query strategy requests labeling based on the inconsistency among the three prediction results.
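
For concreteness, the PyTorch sketch below shows the detector objective of Eqs. (2)-(3), a (C + 1)-way cross-entropy over a temperature-scaled softmax, together with a prototype-based rendering of Eq. (5) using class means and Euclidean distance; helper names and tensor shapes are our assumptions, not the authors' code.

import torch
import torch.nn.functional as F

def detector_loss(logits, targets, T=0.5):
    # Eq. (2) with the temperature-scaled softmax of Eq. (3); label C is the
    # pseudo-label assigned to unknown-class (invalid-set) samples.
    return F.nll_loss(F.log_softmax(logits / T, dim=1), targets)

def proto_predict(unlabeled_feats, labeled_feats, labeled_y, num_known):
    # Eq. (5): assign each unlabeled sample to the class whose mean labeled
    # feature vector (prototype) is nearest in Euclidean distance.
    protos = torch.stack([labeled_feats[labeled_y == c].mean(dim=0) for c in range(num_known)])
    return torch.cdist(unlabeled_feats, protos).argmin(dim=1)

# toy usage: C = 5 known classes plus one unknown slot, 64-dim detector features
C = 5
loss = detector_loss(torch.randn(8, C + 1), torch.randint(0, C + 1, (8,)), T=0.5)
labeled_feats, labeled_y = torch.randn(50, 64), torch.randint(0, C, (50,))
pro_eur = proto_predict(torch.randn(20, 64), labeled_feats, labeled_y, C)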


OSA-CQ executes query marking by contrasting the classifier and detector prediction results through a model-based confidence sequence:

S_query = { x_i^u | i < budget, (x_i^u, p_i^u) ∈ (D^u, P), flag = 1 }    (6)

flag = { 1 | pro_eur ≠ pro_d or pro_eur ≠ pro_c or pro_c ≠ pro_d }    (7)

After the oracle labeling, the known-class data S_query^kno are added to the labeled set D_i^l, forming D_{i+1}^l. The unknown-class data S_query^unkno are assigned a pseudo-label of class C and added to the invalid set D_i^inval, forming D_{i+1}^inval. The approach is summarized in Algorithm 1.

Algorithm 1. The OSA-CQ algorithm
Input: labeled set D_0^l; unlabeled set D_0^u; budget b; classes C; detector f_θd; classifier f_θc
Output: D_round^l, D_round^u, θ_c
1:  for i in max rounds do
2:      S_query = ∅
3:      Train f_θd by minimizing L_d in Eq. (2) on D_i^l and D_i^inval
4:      Train f_θc by minimizing L_c in Eq. (4) on D_i^l
5:      P = sort(pro_d), where pro_d = max(f_θd(D_i^u)) and pro_d ≠ C
6:      for x^u ∈ D_i^u do
7:          Calculate pro_eur by Eq. (5)
8:          pro_d = f_θd(x^u)
9:          pro_c = f_θc(x^u)
10:         if pro_eur ≠ pro_d or pro_eur ≠ pro_c or pro_c ≠ pro_d then
11:             S_query = S_query ∪ { x_i^u | i < budget, (x_i^u, p_i^u) ∈ (D^u, P) }
12:         end if
13:     end for
14:     D_{i+1}^l = D_i^l ∪ S_query^kno, D_{i+1}^inval = D_i^inval ∪ S_query^unkno, and set label_inval = C
15: end for
16: return D^l = {X^l, Y^l}, D^u = {X^u}
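
A simplified Python rendering of the query step (lines 5–13 of Algorithm 1) is given below; it takes precomputed predictions as inputs, and the decision to skip samples that the detector flags as unknown is our reading of line 5, not something the pseudocode states explicitly.

import torch

def select_queries(pro_d, pro_c, pro_eur, confidence, budget, unknown_class):
    # Walk the unlabeled set in descending detector confidence and keep samples
    # whose three predictions disagree (i.e., flag = 1 in Eq. (7)).
    order = torch.argsort(confidence, descending=True)
    picked = []
    for i in order.tolist():
        pd, pc, pe = int(pro_d[i]), int(pro_c[i]), int(pro_eur[i])
        if pd == unknown_class:          # detector says "unknown": skip (assumption)
            continue
        if not (pd == pc == pe):         # any inconsistency -> likely near a decision boundary
            picked.append(i)
        if len(picked) == budget:
            break
    return picked

# toy usage: 100 unlabeled samples, C = 5 known classes (index 5 = unknown pseudo-class)
N, C = 100, 5
idx = select_queries(torch.randint(0, C + 1, (N,)), torch.randint(0, C, (N,)),
                     torch.randint(0, C, (N,)), torch.rand(N), budget=10, unknown_class=C)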

4 Experiments

4.1 Baselines

We compare OSA-CQ with the following AL baselines:

– Random. The simplest AL baseline: k instances are randomly selected from the unlabeled set for labeling in each AL cycle.
– Confidence. The certainty of a sample is measured by the maximum output of the last fully connected layer of the model. We use both the maximum-confidence and minimum-confidence variants in the experiments.


– Entropy [4]. Select the top-k examples with the highest information entropy from the unlabeled set for labeling in each AL cycle.
– Coreset [8]. Select a batch of examples representing the full set from the unlabeled set for labeling in each AL cycle.
– BADGE [26]. A hybrid query method that samples diverse, high-magnitude points in the gradient-embedding space, so that uncertainty and diversity are both taken into account.
– LfOSA [20]. Uses a Gaussian mixture model to model the distribution of the maximum activation value of each instance, and dynamically selects the instances with the highest probability of being known from the unlabeled set.

4.2 Experiment Settings

Dataset. We use three datasets: CIFAR10 [1], CIFAR100 [1] and Tiny-ImageNet [2]. For CIFAR10, we randomly sample 1% of the known-class data to initialize the labeled set, while the remaining training data are used as the unlabeled set for active sampling. Similarly, for CIFAR100 and Tiny-ImageNet, we randomly sample 8%. To evaluate OSA-CQ thoroughly, we set the adaptation rate to 20%, 30%, and 40% in all experiments:

R = |X_kno| / ( |X_kno| + |X_unkno| )    (8)

where R is the adaptation rate, which indicates the proportion of known classes among all classes, |X_kno| is the number of known classes, and |X_unkno| is the number of unknown classes. For example, with an adaptation rate of 20%, the first 2, 20, and 40 classes of CIFAR10, CIFAR100 and Tiny-ImageNet, respectively, are considered known classes, and the remaining 8, 80, and 160 classes are considered unknown.

Implementation Details. ResNet18 [27] is used as the backbone for all classifiers and detectors in the experiments. We perform three randomized runs (seed = 1, 2, 3) for all AL methods at adaptation rates of 20%, 30%, and 40%. The task classifier is trained only on the labeled known-class data, while the auxiliary detector is additionally trained with unknown-class data given pseudo-labels. In each AL cycle, we learn the two models θ_d and θ_c; the model parameters are optimized with stochastic gradient descent (SGD) for 100 epochs with a batch size of 128 and an initial learning rate of 0.01, reduced to 0.005 after 20 epochs; momentum and weight decay are 0.9 and 0.0005. T in Eq. (3) is set to 0.5, and d in Eq. (5) is the Euclidean distance.

Evaluation Criteria. We compare OSA-CQ with the other AL methods in terms of recall (Eq. (1)) and classification accuracy, averaging the results over the three randomized runs.
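
A small helper makes the split explicit: given the adaptation rate R of Eq. (8), the first R·(number of classes) labels are treated as known and the rest as unknown, mirroring the description above (the function name is ours).

def open_set_split(num_classes, adaptation_rate):
    # Returns (known class indices, unknown class indices) for a given adaptation rate.
    num_known = round(num_classes * adaptation_rate)
    return list(range(num_known)), list(range(num_known, num_classes))

known, unknown = open_set_split(100, 0.20)   # CIFAR100 at a 20% adaptation rate -> classes 0-19 known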

4.3 Performance Evaluation

We evaluate the performance of OSA-CQ by plotting the metric growth curve. The average results of recall and classification are shown in Fig. 3 and Fig. 4.


Fig. 3. Comparison of sampling recall, adaptation rates of 20% (first column), 30% (second column) and 40% (third column), CIFAR10 (first row), CIFAR100 (second row) and Tiny-ImageNet (third row).

From the above results, it can be seen that, regardless of the dataset or adaptation rate, OSA-CQ consistently achieves high accuracy and recall and can effectively select known-class data. LfOSA [20] appears to hold an advantage over our approach in sampling recall. High recall values are important because they directly affect the number of labeled samples available for subsequent AL cycles, which in turn influences the model's accuracy. In other words, the advantage of LfOSA is that it selects known classes more efficiently but does not guarantee the informativeness of the samples, while OSA-CQ slightly increases training time yet greatly reduces the labeling cost. The analysis of informativeness is described in detail in Sect. 4.4.

As shown in Fig. 3 and Fig. 4, we make the following observations. 1) Unlike most existing methods, OSA-CQ remains robust even when the adaptation rate is low, without a significant deterioration in results. Even at an adaptation rate of 20%, OSA-CQ maintains a high recall rate; specifically, in the CIFAR100 scenario with a 20% adaptation rate, its recall of known-class data is 29% higher than that of random sampling.


Fig. 4. Classification accuracy comparison. Adaptation rates 20% (first column), 30% (second column) and 40% (third column), CIFAR10 (first row), CIFAR100 (second row) and Tiny-ImageNet (third row).

accuracy compared to LfOSA with a larger number of samples. This shows that OSA-CQ is designed to be effective in finding informative samples with greater accuracy. 2) Regardless of which adaptation rate in which dataset, previous AL approaches do not differ much from the results of random sampling. This suggests that existing AL approaches do not effectively handle AL in an open-set. The primary reason is that the majority of data sampled by previous AL methods consists of unknown classes that are irrelevant to the task. 3) As the adaptation rate decreases the advantage of OSA-CQ is better reflected, the difference in accuracy between CIFAR100 on 20% adaptation rate and random sampling is 16.58%, while it is only 7.56% at 40% adaptation rate. OSA-CQ does not blindly pursue a large amount of training data to maximize model performance but uses a small amount of informative data to achieve the model performance requirements. 4.4

4.4 Information Gain Analysis

The information gain analysis was performed on CIFAR10 with an adaptation rate of 40%, and the results are shown in Table 1. We use a comparison


Table 1. Labeled training set size and model accuracy at different stages of AL for LfOSA and OSA-CQ with a CIFAR10 adaptation rate of 40%.

Method | Ratio | Metric | Cycle 0 | Cycle 1 | Cycle 2 | Cycle 3 | Cycle 4 | Cycle 5
LfOSA  | 40%   | num    | 200     | 740     | 1845    | 3165    | 4585    | 6002
       |       | acc    | 58.88   | 66.13   | 72.10   | 78.08   | 80.95   | -
OSA-CQ | 40%   | num    | 200     | 750     | 1596    | 2619    | 2851    | 3487
       |       | acc    | 53.28   | 73.05   | 78.20   | 84.88   | 84.48   | -

Table 2. Labeled training set size and model accuracy at different stages of AL for LfOSA and OSA-CQ with a Tiny-ImageNet adaptation rate of 30%.

Method | Ratio | Metric | 0     | 1     | 2     | 3     | 4     | 5     | 6     | 7     | 8     | 9     | 10
LfOSA  | 30%   | num    | 2400  | 3077  | 4161  | 5313  | 6544  | 7790  | 9054  | 10268 | 11468 | 12557 | 13691
       |       | acc    | 21.73 | 21.36 | 29.23 | 32.26 | 35.23 | 39.70 | 41.23 | 43.39 | 45.0  | 46.36 | -
OSA-CQ | 30%   | num    | 800   | 1073  | 1516  | 2030  | 2624  | 3208  | 3821  | 4446  | 5046  | 5656  | 6197
       |       | acc    | 21.74 | 24.10 | 27.20 | 31.97 | 35.17 | 39.53 | 41.00 | 44.60 | 45.23 | 48.73 | -

of the amount of training data obtained at similar model accuracy over the same AL period as a measure of the amount of information: when the model accuracy is similar, the smaller the amount of training data, the more information the training data contains. As shown in Table 1, apart from the initial model, the subsequent AL cycles of OSA-CQ demonstrate superior model testing performance with a smaller amount of training data. During the fourth cycle, OSA-CQ utilized only 62% of the data volume employed by LfOSA, yet achieved a model accuracy of 84.48%, surpassing LfOSA by 3.53 percentage points. This indicates that the OSA-CQ training data provides more information to the model than LfOSA's data, resulting in enhanced data quality and cost-effectiveness in labeling. The same pattern is also evident in the more complex Tiny-ImageNet task, as illustrated in Table 2. The observed phenomenon may be attributed to the following reason: LfOSA solely relies on the confidence output of the fitted Gaussian mixture model (GMM) as its decision criterion, addressing only the open-set identification problem without fully considering the issue of data quality in active learning. OSA-CQ effectively addresses both problems and seamlessly integrates them.


5 Conclusion

In this paper, we propose an AL framework based on a contrastive query, called OSA-CQ, to solve the OSA problem. It consists of a classifier and a detector that provide confidence ranking sequences to assist the committee in making decisions. Unlike a large number of existing AL methods, it can both find known classes of data efficiently and ensure the informativeness of these data. Experimental results on three datasets show that OSA-CQ achieves higher performance with lower annotation costs. In future research, we aim to extend this framework to address open-set AL challenges in various computer vision tasks.

References

1. Krizhevsky, A., Hinton, G., et al.: Learning multiple layers of features from tiny images (2009)
2. Yao, L., Miller, J.: Tiny ImageNet classification with convolutional neural networks (2015)
3. Settles, B.: Active learning literature survey (2009)
4. Wang, D., Shang, Y.: A new active labeling method for deep learning. In: 2014 International Joint Conference on Neural Networks (IJCNN), pp. 112–119. IEEE, Beijing, China (2014). https://doi.org/10.1109/IJCNN.2014.6889457
5. Roth, D., Small, K.: Margin-based active learning for structured output spaces. In: Fürnkranz, J., Scheffer, T., Spiliopoulou, M. (eds.) ECML 2006. LNCS (LNAI), vol. 4212, pp. 413–424. Springer, Heidelberg (2006). https://doi.org/10.1007/11871842_40
6. Gal, Y., Islam, R., Ghahramani, Z.: Deep Bayesian active learning with image data. In: International Conference on Machine Learning, pp. 1183–1192. PMLR (2017)
7. Wan, F., Yuan, T., Fu, M., Ji, X., Huang, Q., Ye, Q.: Nearest neighbor classifier embedded network for active learning. In: AAAI, vol. 35, pp. 10041–10048 (2021). https://doi.org/10.1609/aaai.v35i11.17205
8. Sener, O., Savarese, S.: Active learning for convolutional neural networks: a core-set approach (2018). http://arxiv.org/abs/1708.00489
9. Parvaneh, A., Abbasnejad, E., Teney, D., Haffari, R., Van Den Hengel, A., Shi, J.Q.: Active learning by feature mixing. In: 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 12227–12236. IEEE, New Orleans, LA, USA (2022). https://doi.org/10.1109/CVPR52688.2022.01192
10. Yoo, D., Kweon, I.S.: Learning loss for active learning. In: 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 93–102. IEEE, Long Beach, CA, USA (2019). https://doi.org/10.1109/CVPR.2019.00018
11. Vodrahalli, K., Li, K., Malik, J.: Are all training examples created equal? An empirical study. http://arxiv.org/abs/1811.12569 (2018)
12. Zhu, J.-J., Bento, J.: Generative adversarial active learning. http://arxiv.org/abs/1702.07956 (2017)
13. Sinha, S., Ebrahimi, S., Darrell, T.: Variational adversarial active learning. In: 2019 IEEE/CVF International Conference on Computer Vision (ICCV), pp. 5971–5980. IEEE, Seoul, Korea (South) (2019). https://doi.org/10.1109/ICCV.2019.00607


14. Yuan, T., et al.: Multiple instance active learning for object detection. In: 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 5326–5335. IEEE, Nashville, TN, USA (2021). https://doi.org/10.1109/CVPR46437.2021.00529
15. Kipf, T.N., Welling, M.: Semi-supervised classification with graph convolutional networks. http://arxiv.org/abs/1609.02907 (2017)
16. Caramalau, R., Bhattarai, B., Kim, T.-K.: Sequential graph convolutional network for active learning. In: 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 9578–9587. IEEE, Nashville, TN, USA (2021). https://doi.org/10.1109/CVPR46437.2021.00946
17. Mnih, V., et al.: Playing Atari with deep reinforcement learning. http://arxiv.org/abs/1312.5602 (2013)
18. Haussmann, M., Hamprecht, F.A., Kandemir, M.: Deep active learning with adaptive acquisition. http://arxiv.org/abs/1906.11471 (2019)
19. Du, P., Zhao, S., Chen, H., Chai, S., Chen, H., Li, C.: Contrastive coding for active learning under class distribution mismatch. In: 2021 IEEE/CVF International Conference on Computer Vision (ICCV), pp. 8907–8916. IEEE, Montreal, QC, Canada (2021). https://doi.org/10.1109/ICCV48922.2021.00880
20. Ning, K.-P., Zhao, X., Li, Y., Huang, S.-J.: Active learning for open-set annotation. In: 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 41–49. IEEE, New Orleans, LA, USA (2022). https://doi.org/10.1109/CVPR52688.2022.00014
21. Coleman, C., et al.: Similarity search for efficient active learning and search of rare concepts. In: AAAI, vol. 36, pp. 6402–6410 (2022). https://doi.org/10.1609/aaai.v36i6.20591
22. Geng, C., Huang, S.-J., Chen, S.: Recent advances in open set recognition: a survey. IEEE Trans. Pattern Anal. Mach. Intell. 43, 3614–3631 (2021). https://doi.org/10.1109/TPAMI.2020.2981604
23. Bendale, A., Boult, T.E.: Towards open set deep networks. In: 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1563–1572. IEEE, Las Vegas, NV, USA (2016). https://doi.org/10.1109/CVPR.2016.173
24. Dhamija, A.R., Günther, M., Boult, T.: Reducing network agnostophobia (2018)
25. Hinton, G., Vinyals, O., Dean, J.: Distilling the knowledge in a neural network. http://arxiv.org/abs/1503.02531 (2015)
26. Ash, J.T., Zhang, C., Krishnamurthy, A., Langford, J., Agarwal, A.: Deep batch active learning by diverse, uncertain gradient lower bounds. In: 8th International Conference on Learning Representations, ICLR 2020, Addis Ababa, Ethiopia, 26–30 April 2020. OpenReview.net (2020). https://openreview.net/forum?id=ryghZJBKPS
27. He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 770–778. IEEE, Las Vegas, NV, USA (2016). https://doi.org/10.1109/CVPR.2016.90

Cross-Domain Bearing Fault Diagnosis Method Using Hierarchical Pseudo Labels

Mingtian Ping(B), Dechang Pi, Zhiwei Chen, and Junlong Wang

Nanjing University of Aeronautics and Astronautics, Nanjing 211106, China
[email protected]

Abstract. Data-driven bearing fault diagnosis methods have become increasingly crucial for the health management of rotating machinery equipment. However, in actual industrial scenarios, the scarcity of labeled data presents a challenge. To alleviate this problem, many transfer learning methods have been proposed. Some domain adaptation methods use models trained on the source domain to generate pseudo labels for target domain data, which are further employed to refine the models. Domain shift issues may cause noise in the pseudo labels, thereby compromising the stability of the model. To address this issue, we propose a Hierarchical Pseudo Label Domain Adversarial Network. In this method, we divide pseudo labels into three levels and use different training approaches for the different levels of samples. Compared with traditional threshold filtering methods that focus on high-confidence samples, our method can effectively exploit the positive information of a large quantity of medium-confidence samples and mitigate the negative impact of mislabeling. Our proposed method achieves higher prediction accuracy than state-of-the-art domain adaptation methods in harsh environments.

Keywords: Rolling Bearings · Fault Diagnosis · Domain Adaptation · Adversarial Training · Pseudo Label Learning

© The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2024
B. Luo et al. (Eds.): ICONIP 2023, LNCS 14452, pp. 32–43, 2024. https://doi.org/10.1007/978-981-99-8076-5_3

1 Introduction

Industrial equipment health management is progressively shifting towards monitoring and prevention based on big data. The condition monitoring and fault diagnosis of bearings can significantly impact the reliability and service life of the entire machine [1]. Data-driven methods do not require high expertise and experience [2]; they mainly include Bayesian networks [3], Support Vector Machines (SVM) [4], and Artificial Neural Networks (ANN) [5], etc. However, because of the high cost of manual labeling, only a few typical operating conditions offer a large amount of labeled training data. In order to address the challenge of variable operating conditions in bearing fault diagnosis tasks, transfer learning has emerged as a widely adopted solution. Lu et al. [6] minimized the Maximum Mean Discrepancy (MMD) between two domains based on a DNN network. Tzeng et al. [7] proposed a domain confusion loss to acquire domain invariant information. Zhang et al. [8] proposed a DACNN network that fine-tunes the


parameters of the unresolved constrained adaptive layer of the target feature extractor during backpropagation. An et al. [9] adopted the Multi-Kernel Maximum Mean Discrepancy (MK-MMD) domain adaptation framework to enhance the stability and accuracy of the results. The utilization of pseudo-labels not only focuses on the overall migration between domains but also effectively aligns the fine-grained class distribution across domains. Due to the domain offset between the source domain and the target domain, it is necessary to screen for high-confidence pseudo labels. Unfortunately, high-quality labels are often few in number, making it difficult to bring sufficient and effective updates to the model. Moreover, even high-confidence samples can hardly avoid mislabeling, thereby negatively impacting the model. To solve the aforementioned issues, we propose a novel approach called Hierarchical Pseudo Label Domain Adversarial Network (HPLDAN). Inspired by [10], our model combines the domain discriminator and the classifier to maintain stable classification accuracy during domain-level adversarial confusion training. The mean teacher model is then used to generate pseudo labels. The proposed Hierarchical Pseudo Labels method divides target domain samples into three levels according to their pseudo labels, i.e., accepted samples that can be used directly, pending samples that require further processing before being used, and rejected samples that are discarded. Different training methods are then used for target domain samples with different hierarchical pseudo labels to achieve class level alignment.

2 Related Work

In cross-domain fault diagnosis, domain adaptation methods have received extensive attention. Wang et al. [11] used Correlation Alignment (CORAL) with a continuous denoising autoencoder to learn domain invariant features under different operating conditions. The Deep Adaptive Network (DAN) proposed by Long et al. [12] employed MK-MMD to align distributions and extract domain invariant features. Ganin et al. [13] proposed the Domain-Adversarial Neural Network (DANN), which allows the feature extractor and the domain discriminator to be trained adversarially to obtain domain-independent information by adding a gradient reversal layer between them. However, these transfer models only consider the overall migration between domains and neglect the impact that the distance between samples of the same failure class has on classification performance. To address these problems, Yang et al. [14] proposed an optimal-transport-embedded joint distribution similarity measure that fits the conditional distribution of samples in the target domain. Li et al. [15] used a representation clustering scheme to maximize intra-class similarity, reduce inter-class similarity, and increase classification loss for more discriminative features. Zhang et al. [10] proposed domain-symmetric networks (SymNets), in which the source and target domain classifiers are connected in parallel to concurrently act as a domain discriminator. Furthermore, pseudo-label-based methods are commonly used to align fine-grained class distributions. Saito et al. [16] labeled the target domain data based on predictive consistency and confidence and used these samples to train another task classifier. However, the authenticity of pseudo-labels is questionable and can negatively impact performance. Zhang et al. [17] responded to the category imbalance problem by proposing


a Curriculum Pseudo Labeling method that dynamically adjusts the threshold value of each category. Zhang et al. [18] weighted the target samples according to the degree of confusion between domains. Zhu et al. [19] proposed a method to select high quality pseudo labels using adaptive thresholding with two decision strategies.

3 Proposed Method

3.1 Problem Formulation

In this work, we study the transfer learning task of bearing fault type diagnosis under different working conditions. We have a source domain D_s = {(x_i^s, y_i^s)}_{i=1}^{n_s} with n_s labeled samples and corresponding labels, and a target domain D_t = {x_j^t}_{j=1}^{n_t} with n_t unlabeled samples. The samples from different domains have the same size while their distributions differ. Formally, x_i^s ∈ R^{K×M} and x_j^t ∈ R^{K×M}, i.e., each sample has K time steps and M channels. Meanwhile, y_i^s ∈ R^C represents that there are C fault categories. Our task is to come up with a method to transfer the model trained on the source domain samples and labels to the target domain while still ensuring high accuracy.

3.2 Network Architecture

Our proposed network framework is illustrated in Fig. 1. The upper part is the main part of the model, which consists of four modules, namely the main feature extractor Gm, the target domain data diverter H, the target domain biased feature extractor Gtb, and the domain discriminative classifier Cst. The structure of each module in the lower part is the same as that in the upper part. We employ the mean teacher method to set the parameters of each lower module as the Exponential Moving Average (EMA) of the corresponding upper module, in order to provide reliable pseudo labels. Gm accepts data input from Ds and Dt, and extracts the domain-independent feature information. The function of H is to label the target domain samples with three levels of pseudo labels based on the classification results. Accepted samples are fed directly into Cst. Pending samples are fed into Gtb for further feature extraction. Rejected samples are no longer used in the follow-up process. The purpose of Gtb is to deeply mine the target-domain-specific features contained in those pending samples. Cst receives the input from Gm or Gtb and then classifies these samples. Cst has twice as many output nodes as fault types. The outputs of the first C nodes represent the probabilities of a sample belonging to the C types in the source domain, and the outputs of the last C nodes correspond to each type in the target domain. So the sums of the outputs of the front and back C nodes respectively represent the probabilities that the sample belongs to the source and target domains, and thus Cst can serve as a domain discriminator. Meanwhile, adding the outputs of two nodes whose indices differ by C gives the probability that the sample belongs to that fault type, so Cst can also play the role of a classifier. By combining the domain discriminator and classifier together, we can keep the classification accuracy stable while performing adversarial training with domain level confusion.


Fig. 1. Architecture of Hierarchical Pseudo Label Domain Adversarial Network.
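The two roles of Cst and the mean-teacher update described above can be sketched as follows; this is an illustrative PyTorch fragment (not the released implementation), and the 0.996 momentum is the value reported later in Sect. 4.3.

```python
import torch

# Read a 2C-way softmax either as C fault-type probabilities (node c plus node
# c+C) or as two domain probabilities (sum of the first C nodes vs. the last C).
def class_and_domain_probs(logits_2c: torch.Tensor, num_classes: int):
    p = torch.softmax(logits_2c, dim=1)               # (batch, 2C)
    p_src, p_tgt = p[:, :num_classes], p[:, num_classes:]
    class_prob = p_src + p_tgt                        # (batch, C): fault-type probability
    domain_prob = torch.stack([p_src.sum(1), p_tgt.sum(1)], dim=1)  # (batch, 2): source vs. target
    return class_prob, domain_prob

@torch.no_grad()
def ema_update(teacher: torch.nn.Module, student: torch.nn.Module, momentum: float = 0.996):
    # Teacher parameters are kept as an exponential moving average of the student's.
    for t, s in zip(teacher.parameters(), student.parameters()):
        t.mul_(momentum).add_(s, alpha=1.0 - momentum)
```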

3.3 Domain Level Confusion Training

In order to obtain effective pseudo labels, we need to first conduct domain level confusion training. First, we employ the source domain data to pretrain Gm and Cst. The classification loss function Lcls is used in the parameter update of both. Its definition is shown in Formula (1).

L_{cls} = -\frac{1}{n_s}\sum_{i=1}^{n_s}\log\big(p_{y_i^s}(G_m(x_i^s)) + p_{y_i^s+C}(G_m(x_i^s))\big)    (1)

Among them, p_{y_i^s} represents the probability that the sample x_i^s belongs to category y_i^s and to the source domain, while p_{y_i^s+C} represents the probability that it belongs to y_i^s and to the target domain. Adding the two indicates that we only focus on fault classification and temporarily do not consider domain related information. Next, we conduct domain adversarial learning. Here, the target domain data output from Gm is sent directly to Cst without being hierarchized. In order to enable Cst to play the role of domain discriminator, we use the loss function LD to train Cst, whose definition is shown in Formula (2).

L_D = -\frac{1}{n_s}\sum_{i=1}^{n_s}\log\Big(\sum_{c=1}^{C} p_c(G_m(x_i^s))\Big) - \frac{1}{n_t}\sum_{j=1}^{n_t}\log\Big(\sum_{c=1}^{C} p_{c+C}(G_m(x_j^t))\Big)    (2)

The above equation indicates that here we do not consider specific fault type information, but only focus on domain discrimination.


On the other hand, we need the features extracted by Gm to possess the domain invariant property, that is, it cannot be easily distinguished which domain they belong to. Therefore, we utilize the domain confusion loss function Lcf to train Gm so that it can confront Cst. Its definition is shown in Formula (3).

L_{cf} = \frac{1}{2}L_D - \frac{1}{2n_s}\sum_{i=1}^{n_s}\log\Big(\sum_{c=1}^{C} p_{c+C}(G_m(x_i^s))\Big) - \frac{1}{2n_t}\sum_{j=1}^{n_t}\log\Big(\sum_{c=1}^{C} p_c(G_m(x_j^t))\Big)    (3)
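A minimal PyTorch sketch of the three losses in Formulas (1)-(3), written directly on top of the 2C-way softmax of Cst; variable names are illustrative and this is not the authors' released code.

```python
import torch

def domain_level_losses(logits_src, y_src, logits_tgt, num_classes):
    eps = 1e-8
    p_s = torch.softmax(logits_src, dim=1)            # source batch, (Bs, 2C)
    p_t = torch.softmax(logits_tgt, dim=1)            # target batch, (Bt, 2C)

    # Formula (1): fault classification on source data, ignoring the domain split.
    cls = p_s.gather(1, y_src[:, None]) + p_s.gather(1, (y_src + num_classes)[:, None])
    L_cls = -(cls.clamp_min(eps).log()).mean()

    # Formula (2): Cst as a domain discriminator
    # (source -> first C nodes, target -> last C nodes).
    L_D = -(p_s[:, :num_classes].sum(1).clamp_min(eps).log().mean()
            + p_t[:, num_classes:].sum(1).clamp_min(eps).log().mean())

    # Formula (3): domain confusion loss used to train Gm against Cst.
    L_cf = (0.5 * L_D
            - 0.5 * p_s[:, num_classes:].sum(1).clamp_min(eps).log().mean()
            - 0.5 * p_t[:, :num_classes].sum(1).clamp_min(eps).log().mean())
    return L_cls, L_D, L_cf
```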

Formula (3) indicates that, regardless of whether the input comes from the source or the target domain, we hope that the sum of the outputs of the first C nodes of Cst remains similar to that of the last C nodes. This is contrary to the purpose of LD, and the two form a confrontation. In order to maintain the accuracy of model classification during adversarial learning, we also use Lcls to participate in training. That is to say, we use (Lcls + λLcf) to train Gm, where λ = e^{n_{ep}/N_{ep}} − 1, n_ep refers to the current training round, and N_ep refers to the total number of training rounds. At the same time, we use (Lcls + LD) to train Cst.

3.4 Class Level Confusion Training

Hierarchical Algorithm. After domain adversarial confusion training to a certain extent, we can use the mean teacher model to generate pseudo labels for class level domain alignment. Next, we need to hierarchize these pseudo labels. The hierarchical algorithm for the pseudo label of target domain sample x_j^t is shown in Formula (4).

H(x_j^t) = \begin{cases} \text{accepted}, & P_{C_j}(x_j^t) \ge T(C_j) \\ \text{pending}, & \tau < P_{C_j}(x_j^t) < T(C_j) \\ \text{rejected}, & P_{C_j}(x_j^t) \le \tau \end{cases}    (4)

C_j = \arg\max_{c \in \{1,\dots,C\}} P_c(x_j^t) = \arg\max_{c \in \{1,\dots,C\}} \big(p_c^{tch}(G^{tch}(x_j^t)) + p_{c+C}^{tch}(G^{tch}(x_j^t))\big)    (5)

P_{C_j}(x_j^t) represents the probability that x_j^t belongs to category C_j in the output of C_st^{tch}, and C_j refers to the fault type with the highest probability in the output of C_st^{tch}. G^{tch}(·) refers to the output of G_m^{tch} or the output of G_tb^{tch}. In Formula (4), τ and T(·) are thresholds used to divide the three levels of pseudo labels. τ is a fixed value. The calculation formula for T(·) is shown in Formula (6).

T(c) = 1 - \Big(\frac{1}{e^{n_{ep}/N_{ep}+1} + 1}\Big)^{3 N_C(c)/\max_c N_C(c) + 1}    (6)

N_C(c) = \sum_{j=1}^{n_t} \mathbb{1}(C_j = c)    (7)

In Formula (6), NC (c) represents the number of samples with the highest probability of belonging to category c among all target domain data.
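Under this reading of Formulas (4)-(7), the hierarchization step can be sketched as follows; the exact form of T(c) is reconstructed from the text above, so treat it as an assumption rather than the definitive implementation.

```python
import math
import torch

def hierarchize(teacher_probs_2c, num_classes, n_ep, N_ep, tau=0.45):
    # P_c(x) from the teacher's 2C-way output (Formula (5)).
    p = teacher_probs_2c[:, :num_classes] + teacher_probs_2c[:, num_classes:]
    conf, c_j = p.max(dim=1)                                   # top probability and its class C_j

    n_c = torch.bincount(c_j, minlength=num_classes).float()   # N_C(c), Formula (7)
    # Formula (6): classes that currently dominate the target predictions get a
    # stricter (higher) threshold.
    base = 1.0 / (math.e ** (n_ep / N_ep + 1) + 1.0)
    T = 1.0 - base ** (3.0 * n_c / n_c.max().clamp_min(1.0) + 1.0)

    levels = torch.full_like(c_j, fill_value=1)                # 1 = pending by default
    levels[conf >= T[c_j]] = 2                                 # accepted
    levels[conf <= tau] = 0                                    # rejected
    return levels, c_j, conf
```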


In each round of training, we update H based on the output of the previous round of C_st^{tch} to perform the hierarchical operation for this round. Then we directly discard the rejected samples. For the accepted samples and the pending samples, we introduce the details of their training process below.

Accepted Pseudo Label Training Method. For the target domain samples whose pseudo labels are identified as accepted, we use them to update Gm for class level domain adaptation. Although we designed T(·) to filter samples with low confidence, false pseudo labels may still exist. If the traditional cross entropy loss function were applied, false labels could affect the classification accuracy. Therefore, we decided to use the inner product similarity to design the loss function, whose definition is shown in Formula (8).

L_{ac} = \frac{1}{n_{ac}}\sum_{k=1}^{n_{ac}} \omega_k L_{ac}(x_{j_k}^t) = \frac{1}{n_{ac}}\sum_{k=1}^{n_{ac}} \omega_k \Big(-\frac{1}{D_k+1}\sum_{d=1}^{D-1} \mathbb{1}(C_{j_k} = C_{j_k^d}) \log\big(G_m(x_{j_k}^t) \cdot G_m(x_{j_k^d}^t)\big)\Big)    (8)

\omega_k = -\frac{L_{ac}(x_{j_k}^t)}{\Big| L_{ac}(x_{j_k}^t) - \mu \big| p_{C_{j_k}}^{tch}(G_m^{tch}(x_{j_k}^t)) - p_{C_{j_k}+C}^{tch}(G_m^{tch}(x_{j_k}^t)) \big| \Big|}    (9)

In Formula (8), n_ac refers to the number of samples with accepted pseudo labels, and j_k represents the sequence number of the k-th accepted sample among all target domain samples. In each round of training, D samples are input in each batch. Here, we use j_k^d to represent the subscripts of the other samples in the same batch as x_{j_k}^t, and we assume that among these samples there are D_k samples whose pseudo labels correspond to the same fault type as x_{j_k}^t. In Formula (9), μ is a fixed parameter. This formula indicates that we believe samples with smaller L_ac, which are more similar to samples of the same category, should have higher weights. At the same time, samples that are difficult to assign to either domain should also be given more attention. In traditional similarity measurement losses, the similarity with negative samples is used as the denominator to maximize the distance from them. However, this operation may interfere with our resistance to incorrect pseudo labels, because we acknowledge that samples of the same category are inevitably more similar to each other. Therefore, by using ω_k, we can weaken the impact of samples that are less similar to their own class, which are more likely to be mislabeled. However, minimizing the similarity with samples of different classes would further make those mislabeled samples resemble correctly labeled ones, thereby weakening the differential treatment effect of ω_k. The training diagram is shown in Fig. 2 (a).

Fig. 2. Training diagram of accepted and pending samples.

Pending Pseudo Label Training Method. For the target domain samples with pending pseudo labels, we need to input them into Gtb for further deep feature mining. Although we have extracted features twice, it is still inevitable that some noise is left in these samples. Although we cannot accurately determine which category a pending sample belongs to, we are still confident about which categories it does not belong to. Therefore, the purpose of the loss function we design is to minimize the similarity with those determined negative samples.

L_{pd} = \frac{1}{n_{pd}}\sum_{m=1}^{n_{pd}} L_{pd}(x_{j_m}^t) = \frac{1}{n_{pd}}\sum_{m=1}^{n_{pd}} \frac{1}{D_m+1}\sum_{d=1}^{D-1} U(x_{j_m}^t, x_{j_m^d}^t) \log\big(G_{tb}(x_{j_m}^t) \cdot G_{tb}(x_{j_m^d}^t)\big)    (10)

In Formula (10), n_pd refers to the number of samples with pending pseudo labels, and j_m represents the sequence number of the m-th pending sample among all target domain samples. Here, we use j_m^d to represent the subscripts of the other samples in the same batch as x_{j_m}^t. In this batch of samples, we use U(·) to filter out those samples that are certainly not of the same fault type as x_{j_m}^t, and we assume that there are D_m such samples in total. The definition of U(·) is shown in Formula (11).

U(x_a^t, x_b^t) = \begin{cases} 1, & \big(P_{C_a}(x_a^t) - P_{C_a}(x_b^t)\big) + \big(P_{C_b}(x_b^t) - P_{C_b}(x_a^t)\big) > 2\tau \\ 0, & \text{otherwise} \end{cases}    (11)

τ is the threshold used in Formula (4), and the meaning of P_{C_a}(x_a^t) refers to Formula (5). By using L_pd to update the parameters of Gtb, we can effectively utilize these samples that are filtered out in traditional methods and further obtain feature information unique to the target domain. Compared with accepted samples, which lie closer to the center of each category's feature space, pending samples are generally distributed in more peripheral areas. Therefore, incorporating them into training helps improve the classification accuracy of the model at the boundary. The training diagram is shown in Fig. 2 (b).
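The two class-level losses can be sketched as below. This is a rough PyTorch illustration of Formula (8) and Formulas (10)-(11), not the authors' code: the ω_k weights are assumed to be precomputed per Formula (9), and the batch-wise masking details are assumptions.

```python
import torch

def accepted_loss(feat, pseudo, omega, eps=1e-8):
    # feat: (B, d) embeddings from Gm; pseudo: (B,) pseudo labels; omega: (B,) weights
    # (omega computed per Formula (9)). Same-pseudo-label pairs are pulled together.
    sim = feat @ feat.t()                                     # inner-product similarity
    same = (pseudo[:, None] == pseudo[None, :]).float()
    same.fill_diagonal_(0.0)
    per_sample = -(same * sim.clamp_min(eps).log()).sum(1) / (same.sum(1) + 1.0)
    return (omega * per_sample).mean()

def pending_loss(feat_tb, probs, c_top, tau=0.45, eps=1e-8):
    # feat_tb: (B, d) embeddings from Gtb; probs: (B, C) class probabilities P_c(x);
    # c_top: (B,) highest-probability class C_j for each pending sample.
    self_p = probs.gather(1, c_top[:, None]).squeeze(1)       # P_{C_a}(x_a)
    cross = probs[:, c_top]                                   # cross[b, a] = P_{C_a}(x_b)
    # Formula (11): U(a, b) = 1 when the mutual confidence gap exceeds 2*tau.
    gap = (self_p[:, None] - cross.t()) + (self_p[None, :] - cross)
    u = (gap > 2.0 * tau).float()
    u.fill_diagonal_(0.0)
    sim = feat_tb @ feat_tb.t()
    per_sample = (u * sim.clamp_min(eps).log()).sum(1) / (u.sum(1) + 1.0)
    return per_sample.mean()                                  # minimized -> pushes negatives apart
```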

4 Experiments

4.1 Datasets Description

Case Western Reserve University (CWRU) Dataset [21]. The test bearing signals in the CWRU dataset are collected at a sampling frequency of 48 kHz. We selected three operating conditions, namely 1 hp 1772 r/min, 2 hp 1750 r/min, and 3 hp 1730 r/min, forming three domains: A, B, and C. The bearings exhibit four health states, namely the normal state, inner ring failure, ball failure, and outer ring failure, with each failure having three different sizes of 0.007, 0.014, and 0.021 inches, forming 10 different categories. We use sliding windows for data augmentation on the time scale: the window width is 2048 and the shift step size is 512. We then perform a wavelet transform on these data, with the wavelet type being 'db4'. All datasets use the same processing method.
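A short sketch of this preprocessing pipeline is shown below; it assumes the PyWavelets package for the 'db4' transform, and the decomposition level is not stated in the paper, so a single-level transform is used as an assumption.

```python
import numpy as np
import pywt  # assuming PyWavelets for the 'db4' wavelet transform

def sliding_windows(signal: np.ndarray, width: int = 2048, step: int = 512) -> np.ndarray:
    # Sliding-window augmentation on the time scale.
    starts = range(0, len(signal) - width + 1, step)
    return np.stack([signal[s:s + width] for s in starts])

def db4_features(window: np.ndarray) -> np.ndarray:
    # One-level discrete wavelet transform (assumed level); concatenate coefficients.
    approx, detail = pywt.dwt(window, "db4")
    return np.concatenate([approx, detail])

if __name__ == "__main__":
    raw = np.random.randn(48_000)                 # stand-in for one vibration recording
    windows = sliding_windows(raw)
    feats = np.stack([db4_features(w) for w in windows])
    print(windows.shape, feats.shape)
```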


Paderborn Dataset [22]. The KAT data center generated a dataset with a sampling rate of 64 kHz. There are three conditions with artificial damage, forming domains D, E, and F, and three conditions with actual damage, forming domains G, H, and I. Domain D and G, domain E and H, domain F and I, each pair has the same load torque and radial force respectively. All six domains have a fixed speed of 1500 r/min, and contain three categories, namely health, inner ring failure, and outer ring failure. XJTU-SY Datasets [23]. In the experiment, the sampling frequency was set to 25.6kHz, the sampling interval was 1min, and each sampling time was 1.28s. Three different operating conditions were set, with rotational speed and radial force of 2100 rpm and 12 kN, 2250 rpm and 11 kN, 2400 rpm and 10 kN, forming domains J, K, and L. The fault elements in each domain include outer race, inner race, and cage.

4.2 Comparison Methods

We compared our method with five methods, including Source Only (SO), DANN [13], SymNets [10], SLARDA [20], and DTL-IPLL [19].
SO: Only source domain data are used to train the model, without domain adaptation.
DANN: By adding a gradient reversal layer between the feature extractor and the domain discriminator, the two undergo adversarial training to obtain domain-independent information.
SymNets: By paralleling the source domain classifier and the target domain classifier so that they simultaneously act as a domain discriminator, it can maintain a steady improvement in classification accuracy during adversarial training.
SLARDA: An autoregressive technique involves the temporal dependence of source and target features in domain adaptation; a teacher model is then used to align the class distribution in the target domain through confident pseudo labels.
DTL-IPLL: It measures the marginal probability distribution discrepancy by MK-MMD, calculates the conditional probability distribution discrepancy with pseudo labels, and filters out pseudo labels by an adaptive threshold and a making-decision-twice strategy.

4.3 Implementation Details

Our model is developed with PyTorch 1.12.0 and runs on an NVIDIA RTX 3060 GPU. We build Gm with ResNet18, build Cst with two fully-connected layers and a Softmax layer, and build Gtb with a four-layer 1-D convolutional block. The threshold τ is set to 0.45. The parameter μ in ωk is set to 0.3. In the mean teacher model, the conditional alignment weight is set to 0.004 and the update momentum is set to 0.996.

4.4 Results and Analysis

In order to simulate the imbalanced classification of various fault categories in industrial scenarios, we sample the data of each working condition with different proportions according to the fault type. Then, six cross domain diagnostic tasks were designed for the CWRU dataset, the Paderborn dataset with actual damage conditions, and the XJTU-SY dataset, respectively. In addition, for the Paderborn dataset we added diagnostic tasks that migrate the three artificial damage conditions to the actual damage conditions


with the same load torque and radial force. The percentage accuracies of all experiments are shown in Tables 1, 2, and 3, with the best results highlighted in bold and the second best results underlined.

Table 1. Results of cross domain diagnostic tasks on CWRU dataset.

Task | SO    | DANN  | SymNets | SLARDA | DTL-IPLL | HPLDAN
A→B  | 85.33 | 94.11 | 98.07   | 98.98  | 99.09    | 99.13
A→C  | 78.44 | 88.88 | 93.01   | 93.08  | 93.15    | 93.48
B→A  | 88.97 | 96.51 | 98.41   | 99.16  | 99.63    | 99.59
B→C  | 90.05 | 97.79 | 99.40   | 99.52  | 99.34    | 99.52
C→A  | 66.54 | 74.28 | 78.80   | 80.09  | 81.80    | 85.58
C→B  | 81.83 | 93.94 | 96.43   | 96.68  | 97.37    | 97.65
Avg  | 81.86 | 90.92 | 94.02   | 94.59  | 95.06    | 95.83

Table 2. Results of cross domain diagnostic tasks on Paderborn dataset.

Task | SO    | DANN  | SymNets | SLARDA | DTL-IPLL | HPLDAN
D→G  | 76.59 | 88.10 | 93.43   | 93.74  | 94.25    | 96.11
E→H  | 78.91 | 88.84 | 95.22   | 94.63  | 95.18    | 97.56
F→I  | 83.56 | 95.38 | 98.37   | 97.80  | 98.69    | 99.03
G→H  | 76.30 | 87.66 | 93.75   | 93.64  | 95.54    | 96.67
G→I  | 82.31 | 93.24 | 95.73   | 97.24  | 96.57    | 98.46
H→G  | 70.59 | 83.23 | 85.78   | 85.72  | 87.21    | 89.30
H→I  | 85.74 | 94.65 | 99.08   | 98.54  | 99.04    | 99.33
I→G  | 84.25 | 96.83 | 98.40   | 99.55  | 99.07    | 99.48
I→H  | 84.12 | 96.88 | 99.30   | 98.87  | 99.67    | 99.62
Avg  | 80.26 | 91.65 | 95.45   | 95.53  | 96.14    | 97.28

It can be seen that our HPLDAN method achieves the best results in most bearing cross domain diagnostic tasks and the second best results in the rest. It is worth noting that in relatively difficult tasks, such as Tasks C → A, H → G, and G → L, the superiority of our method is better demonstrated, because the proposed HPLDAN method can obtain valuable information that is difficult to extract from low-quality samples through the training process of pending samples. In addition, the design of the loss function for accepted samples also alleviates, to a certain extent, the negative impact of false labels among the high-confidence samples, which enables fault types with fewer samples to maintain a higher classification accuracy. In summary, our method demonstrates high application value in challenging and harsh working conditions.


Table 3. Results of cross domain diagnostic tasks on XJTU-SY dataset.

Task | SO    | DANN  | SymNets | SLARDA | DTL-IPLL | HPLDAN
J→K  | 80.31 | 89.21 | 97.33   | 97.91  | 98.53    | 99.68
J→L  | 58.85 | 72.35 | 78.69   | 83.12  | 82.41    | 85.65
K→J  | 79.23 | 86.65 | 94.63   | 95.86  | 96.36    | 97.84
K→L  | 62.20 | 73.69 | 79.96   | 78.74  | 80.85    | 83.51
L→J  | 64.87 | 74.44 | 79.85   | 80.36  | 81.56    | 84.82
L→K  | 66.31 | 75.23 | 81.21   | 82.27  | 82.47    | 84.96
Avg  | 68.63 | 78.60 | 85.28   | 86.38  | 87.03    | 89.41

Fig. 3. Visualization process of Task J → K.

Fig. 4. Results of ablation experiment.

To clearly show the effectiveness of the proposed HPLDAN, we visualize the learned features of Task J → K. We use uniform manifold approximation and projection (UMAP) [24] to map the high-dimensional features to a lower dimension as shown in Fig. 3. It can


be seen that using hierarchical pseudo labels for class level alignment can significantly improve the classification accuracy of bearing faults. To verify the effectiveness of the various modules of our proposed HPLDAN, we conducted ablation experiments on the CWRU dataset. The relevant experimental results are shown in Fig. 4. It can be seen that the various modules designed have a positive impact on the overall performance of the model, especially when facing high difficulty cross-domain tasks.

5 Conclusions

In this research, we propose a Hierarchical Pseudo Label Domain Adversarial Network (HPLDAN). In the domain level adversarial adaptation training, we maintain stable classification accuracy by combining the domain discriminator with the classifier. Then, using the pseudo labels generated by the mean teacher model, the target domain samples are divided into three levels. We use the high-confidence accepted samples to further train the main feature extractor, and allocate weights by calculating only the similarity loss with the positive samples together with the degree of domain confusion, in order to reduce the negative impact of incorrect pseudo labeling. Next, the medium-confidence pending samples are input into the target domain biased feature extractor for secondary extraction, and the performance of the model at the boundary is improved by minimizing the similarity between the pending samples and the selected negative samples. Finally, the low-quality rejected samples are abandoned. We compared our method with five comparison methods on three bearing datasets. The results show that our method has higher and more stable classification accuracy and better robustness against imbalanced samples.

References

1. Tian, Z.G., Wang, W.: Special issue on machine fault diagnostics and prognostics. Chinese J. Mech. Eng. 30(6), 1283–1284 (2017)
2. Yaguo, L.E.I., Feng, J.I.A., Detong, K.O.N.G., et al.: Opportunities and challenges of mechanical intelligent fault diagnosis based on big data. J. Mech. Eng. 54(5), 94–104 (2018)
3. Sheppard, J.W., Kaufman, M.A.: A Bayesian approach to diagnosis and prognosis using built-in test. IEEE Trans. Instrum. Meas. 54(3), 1003–1018 (2005)
4. Li, C.Z., Zheng, J.D., Pan, H.Y., Liu, Q.Y.: Fault diagnosis method of rolling bearings based on refined composite multiscale dispersion entropy and support vector machine. China Mech. Eng. 30(14), 1713 (2019)
5. Wen, L., Li, X., Gao, L.: A new two-level hierarchical diagnosis network based on convolutional neural network. IEEE Trans. Instrum. Meas. 69(2), 330–338 (2019)
6. Lu, W., Liang, B., Cheng, Y., Meng, D., Yang, J., Zhang, T.: Deep model based domain adaptation for fault diagnosis. IEEE Trans. Ind. Electron. 64(3), 2296–2305 (2017)
7. Tzeng, E., Hoffman, J., Zhang, N., Saenko, K., Darrell, T.: Deep domain confusion: maximizing for domain invariance. arXiv:1412.3474 (2014)
8. Zhang, B., Li, W., Li, X.-L., Ng, S.-K.: Intelligent fault diagnosis under varying working conditions based on domain adaptive convolutional neural networks. IEEE Access 6, 66367–66384 (2018)


9. An, Z.H., Li, S.M., Wang, J.R., et al.: Generalization of deep neural network for bearing fault diagnosis under different working conditions using multiple kernel method. Neurocomputing 352, 42–53 (2019)
10. Zhang, Y., Tang, H., Jia, K., Tan, M.: Domain-symmetric networks for adversarial domain adaptation. In: 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Long Beach, CA, USA, pp. 5026–5035 (2019). https://doi.org/10.1109/CVPR.2019.00517
11. Wang, X., He, H., Li, L.: A hierarchical deep domain adaptation approach for fault diagnosis of power plant thermal system. IEEE Trans. Ind. Informat. 15(9), 5139–5148 (2019)
12. Long, M., Cao, Y., Wang, J., Jordan, M.: Learning transferable features with deep adaptation networks. In: Proceedings of Machine Learning Research, Miami, FL, USA, pp. 97–105 (2015)
13. Ganin, Y., Ustinova, E., Ajakan, H., et al.: Domain-adversarial training of neural networks. J. Mach. Learn. Res. 17(1), 2096–2030 (2016)
14. Yang, B., Lei, Y., Xu, S., Lee, C.-G.: An optimal transport-embedded similarity measure for diagnostic knowledge transferability analytics across machines. IEEE Trans. Ind. Electron. 69(7), 7372–7382 (2022)
15. Li, X., Zhang, W., Ding, Q.: A robust intelligent fault diagnosis method for rolling element bearings based on deep distance metric learning. Neurocomputing 310, 77–95 (2018)
16. Kuniaki, S., Yoshitaka, U., Tatsuya, H.: Asymmetric tri-training for unsupervised domain adaptation. In: Doina, P., Yee, W.T. (eds.) Proceedings of the 34th International Conference on Machine Learning, Proceedings of Machine Learning Research, vol. 70, pp. 2988–2997. International Convention Centre, Sydney, Australia. PMLR (2017)
17. Zhang, B., Wang, Y., Hou, W., et al.: Flexmatch: boosting semi-supervised learning with curriculum pseudo labeling. In: Advances in Neural Information Processing Systems, vol. 34 (2021)
18. Weichen, Z., Wanli, O., Wen, L., Dong, X.: Collaborative and adversarial network for unsupervised domain adaptation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 3801–3809 (2018)
19. Zhu, W., Shi, B., Feng, Z.: A transfer learning method using high-quality pseudo labels for bearing fault diagnosis. IEEE Trans. Instrum. Measure. 72, 1–11 (2023). https://doi.org/10.1109/TIM.2022.3223146
20. Ragab, M., Eldele, E., Chen, Z., Min, W., Kwoh, C.-K., Li, X.: Self-supervised autoregressive domain adaptation for time series data. IEEE Trans. Neural Networks Learn. Syst. 99, 1–11 (2022). https://doi.org/10.1109/TNNLS.2022.3183252
21. Bearing Data Center, Case Western Reserve Univ., Cleveland, OH, USA (2004). https://engineering.case.edu/bearingdatacenter/download-data-file
22. Lessmeier, C., Kimotho, J.K., Zimmer, D., et al.: Condition monitoring of bearing damage in electromechanical drive systems by using motor current signals of electric motors, a benchmark data set for data-driven classification. In: Proceedings of the European Conference of the Prognostics and Health Management Society (2016)
23. Wang, B., Lei, Y., Li, N., Li, N.: A hybrid prognostics approach for estimating remaining useful life of rolling element bearings. IEEE Trans. Reliab. 69(1), 401–412 (2020). https://doi.org/10.1109/TR.2018.2882682
24. McInnes, L., Healy, J., Melville, J.: UMAP: uniform manifold approximation and projection for dimension reduction. arXiv:1802.03426 (2020)

Differentiable Topics Guided New Paper Recommendation

Wen Li1, Yi Xie2,3, Hailan Jiang4, and Yuqing Sun1(B)

1 Shandong University, Jinan, China
sun [email protected]
2 National University of Defense Technology, Hefei, China
3 Anhui Province Key Laboratory of Cyberspace Security Situation Awareness and Evaluation, Hefei, China
4 Shandong Polytechnic, Jinan, China

Abstract. There are a large number of scientific papers published each year. Since the progresses on scientific theories and technologies are quite different, it is challenging to recommend valuable new papers to the interested researchers. In this paper, we investigate the new paper recommendation task from the point of involved topics and use the concept of subspace to distinguish the academic contributions. We model the papers as topic distributions over subspaces through the neural topic model. The academic influences between papers are modeled as the topic propagation, which are learned by the asymmetric graph convolution on the academic network, reflecting the asymmetry of academic knowledge propagation. The experimental results on real datasets show that our model is better than the baselines on new paper recommendation. Specifically, the introduced subspace concept can help find the differences between high quality papers and others, which are related to their innovations. Besides, we conduct the experiments from multiple aspects to verify the robustness of our model.

Keywords: Paper recommendation · Topic model · GCN

1 Introduction

This work was supported by the National Nature Science Foundation of China, NSFC (62376138), and the Innovative Development Joint Fund Key Projects of Shandong NSF (ZR2022LZH007). Wen Li and Yi Xie contributed equally to this work.
© The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2024
B. Luo et al. (Eds.): ICONIP 2023, LNCS 14452, pp. 44–56, 2024. https://doi.org/10.1007/978-981-99-8076-5_4

Currently, there are a large number of academic papers published every year. It is necessary to recommend to researchers the valuable papers they are interested in. The number of citations is often regarded as an important indicator of the quality of papers. To describe the detailed contribution of a paper, a citation type can be further classified into three categories: Background, Method and Result. As an example, we show the papers concerning the technology Transformer [6], GPT [5], BERT [4], GPT2 [3] and BART [2] in Fig. 1, which are labeled by Semantic Scholar (https://www.semanticscholar.org/). Arrows point to the citing papers, representing the direction of knowledge propagation. These points help the users more precisely find their interested topics, such as the inspiring theory, the technical methods, or the dataset, etc.

Fig. 1. An example of different citation types.

To recommend new papers, the existing methods typically leverage the academic network (AN for short) to model user interests and paper features [1,13]. However, they didn't consider the differentiable details on citations. Since the innovations in papers are various, the concept of subspace was used in this paper to describe the paper contents [21]. Besides, the citation-based recommendation methods are not applicable to new paper recommendation since a new paper does not yet have citation relationships. To tackle the above challenges, we propose the differentiable topics based new paper recommendation model (DTNRec for short). Paper contents are classified into three subspaces according to the innovation forms in the usual way [21]: Background, Method and Result. We adopt the neural topic model (NTM for short) to get the topic distribution over subspaces as the paper embeddings, which are used to differentiate the innovation forms of papers. Considering that citations reflect the influence of cited papers and the author interests of citing papers, we adopt the asymmetric academic network to model this kind of knowledge propagation. The graph convolution network (GCN for short) operations are performed on this network to learn the user interests and paper influences, separately. For example, for the central paper p, its references are the neighbors during convolution to compute the interests for the authors of p, while its citations are used to compute its influences on the network. Then a new paper is recommended to the potentially interested users based on the paper content. Our contributions are as follows:

1. We label the paper content with subspace tags, then adopt the NTM to get the topic distribution over subspaces as paper embeddings.
2. We create the asymmetric academic network to model the academic propagation, where the directed edge points to the citing paper denoting the propagation. Based on this network and paper embeddings, we adopt the GCN operations to compute user interests and paper influences in a fine-grained way.


3. We conducted the experiments from multiple aspects to verify the effectiveness and robustness of our model.

2 Related Work

Collaborative filtering (CF for short) is a commonly used technique in recommendation systems. NeuMF [12] and BUIR [10] are both CF-based methods using user-item interaction data to get user and item representations. He et al. [11] proposed LightGCN to learn the user and item embeddings with neighborhood aggregation operation. Wang et al. [9] proposed alignment and uniformity as two properties that are important to CF-based methods, and optimized the two properties to get user and item representations. However, these methods only use interaction data of user and items, without considering other features. The academic network consists of papers, authors, other related attributes and the relationships among them, which is important for paper recommendation task since it’s rich in information. Existing works often used AN-based methods including KGCN [18], KGCN-LS [19], RippleNet [20], etc. to mine high-order information on the academic network, among which GCN is a widely used technique. However, these methods have cold-start problem and are not suitable for new paper recommendation since it lacks citation information. Besides, paper contents are also considered to model user interests. JTIE [25] incorporated paper contents, authors and venues to learn user and paper representations. Xie et al. [26] proposed a cross-domain paper recommendation model using hierarchical LDA to learn semantic features of paper contents. Li et al. [13] proposed JMPR to jointly embed structural features from academic network and semantic features from paper contents. These methods alleviate the cold-start problem, but the diversity of paper innovations was not considered. Therefore, Xie et al. [21] proposed the subspace concept to label the paper content with Background, Method and Result. However, they didn’t infer in subspace, that is they ignored the knowledge propagation among subspaces.

3 New Paper Recommendation Method

3.1 Problem Definition

Given a user set U, a paper set V, we aim to learn a prediction function F(u, q | θ) that checks whether user u ∈ U has the potential interest of the new paper q ∈ V, where θ denotes the parameters of function F. For an academic dataset, the academic network G is called the structural feature, where the nodes of G are papers, authors, and other related attributes, and the edges denote the relationships between them, including citation, etc. Each paper contains an abstract. The abstract describes the core content of a paper, which is called the semantic feature in this paper.


Fig. 2. Overall framework of DTNRec

3.2 Overall Framework

DTNRec includes three modules, as shown in Fig. 2, i.e., the NTM-based subspace representation module, the GCN-based asymmetric topic propagation module, and the user interest prediction module. In the NTM-based subspace representation module, the paper abstract is labeled with subspace tags through the subspace tagging model. The resulting subspace texts are fed into the NTM to obtain the topic distributions over subspaces as the paper content embeddings. In the GCN-based asymmetric topic propagation module, we adopt asymmetric GCN on the academic network G to model the asymmetric topic propagation among papers. The user interest prediction module predicts how interested user u is in a new paper q.

3.3 NTM-Based Subspace Representation

Subspace Tagging. In order to differentiate the topics in papers, we inherit the subspace concept proposed in [21] and label the paper contents with three subspace tags, namely Background, Method and Result, respectively, denoted by the tag set TS = {b, m, r}. We adopt the subspace tagging model in [22] to label the sentences of the paper abstract with the subspace tags. The sentences for the same subspace represent the corresponding subspace text.

GSM-Based Paper Representation. The subspace texts are fed into the topic model to get the topic distributions over subspaces, which are regarded as the initial embeddings of the paper content. Existing research results show that topic models integrated with neural networks perform better than traditional topic models [23]. Therefore, we adopt the Gaussian Softmax distribution topic model (GSM for short) [23], which is based on the variational autoencoder. Let D ∈ N* denote the topic number. The output subspace topic distributions for paper p are the corresponding embeddings x_p^b ∈ R^D, x_p^m ∈ R^D, and x_p^r ∈ R^D, respectively. Paper p can then be represented as the matrix X_p = [x_p^b, x_p^m, x_p^r] ∈ R^{3×D}.
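The assembly of X_p can be sketched as follows; the `tagger` and `ntm` interfaces below are hypothetical stand-ins for the tagging model of [22] and GSM [23], used only to illustrate the data flow, and the uniform fallback for an empty subspace is an assumption.

```python
import numpy as np

SUBSPACES = ("b", "m", "r")   # Background, Method, Result

def build_paper_matrix(abstract_sentences, tagger, ntm, num_topics: int) -> np.ndarray:
    # Route each abstract sentence to a subspace, then map each subspace text
    # to a D-dimensional topic distribution and stack into a (3, D) matrix.
    texts = {tag: [] for tag in SUBSPACES}
    for sent in abstract_sentences:
        texts[tagger(sent)].append(sent)          # tagger(sent) returns 'b', 'm' or 'r'
    rows = []
    for tag in SUBSPACES:
        joined = " ".join(texts[tag])
        theta = ntm(joined) if joined else np.full(num_topics, 1.0 / num_topics)
        rows.append(theta)                        # topic distribution over D topics
    return np.stack(rows)                         # X_p with shape (3, D)
```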


Different from existing methods that directly treat the paper content as a whole to obtain the paper representation [13], our method labels the paper content with subspace tags, which helps to distinguish paper innovations.

3.4 GCN-Based Asymmetric Topic Propagation

Each citation reflects the influence of the cited paper and the interest of the citing paper's authors. So we model the topic propagation between papers on the academic network as asymmetric relations, denoted by G. The academic influences and user interests are modeled, respectively, based on the citation relationships. For example, for a paper p ∈ V on G, its references are the neighbors for convolution to compute the interests for the authors of p, while its citations are used to compute its influences on the network. For any paper p ∈ V on G, there are two matrix representations, denoted by the interest matrix ←X_p^{(h)} and the influence matrix →X_p^{(h)}, respectively, where h ∈ N* denotes the depth of the GCN, that is, the number of GCN iterations. ←X_p^{(h)} and →X_p^{(h)} are both initialized by the paper matrix X_p. The GCN kernel function is f, where W ∈ R^{D×D}, U ∈ R^{3×D}, V ∈ R^{3×D} are the weights of f and b ∈ R^{3×3} is the bias. Paper p' ∈ V cites paper p.

f(p, p', h) = σ\big( →X_p^{(h-1)} W (←X_{p'}^{(h-1)})^T + U (→X_p^{(h-1)})^T + V (←X_{p'}^{(h-1)})^T + b \big)    (1)

To compute the influence of paper p, we choose the citations of p as its neighbors. Since the number of paper neighbors may vary significantly over all papers, we uniformly sample a fixed-size set of neighbors for each paper instead of using all of them, denoted by V_p^{cit}, to keep the computational pattern of each batch fixed and more efficient. We set |V_p^{cit}| = K ∈ N* as a hyper-parameter. Papers in V_p^{cit} are combined to characterize the influence of paper p, denoted by →X_{V_p^{cit}}^{(1)}.

→X_{V_p^{cit}}^{(1)} = \sum_{c ∈ V_p^{cit}} f(p, c, 1) →X_c^{(0)}    (2)

Then we aggregate →X_p^{(0)} and →X_{V_p^{cit}}^{(1)} into one matrix →X_p^{(1)} as p's first-order influence matrix, which is calculated as →X_p^{(1)} = σ\big( (→X_p^{(0)} + →X_{V_p^{cit}}^{(1)}) W^{(1)} + b^{(1)} \big).

In the same way, to compute the interest for the authors of paper p, we choose a fixed-size set of references of paper p as its neighbors, denoted by V_p^{ref}. We set |V_p^{ref}| = K, too. Then papers in V_p^{ref} are combined to characterize the interest for the authors of paper p, denoted by ←X_{V_p^{ref}}^{(1)}.

←X_{V_p^{ref}}^{(1)} = \sum_{r ∈ V_p^{ref}} f(r, p, 1) ←X_r^{(0)}    (3)

Then we aggregate ←X_p^{(0)} and ←X_{V_p^{ref}}^{(1)} into one matrix ←X_p^{(1)} as p's first-order interest matrix, which is calculated as ←X_p^{(1)} = σ\big( (←X_p^{(0)} + ←X_{V_p^{ref}}^{(1)}) W^{(1)} + b^{(1)} \big).
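A hedged sketch of the fixed-K neighbor sampling and one-hop influence propagation follows. The kernel mirrors the reconstruction of Formula (1) given above, whose exact transposes/parameterization are an assumption, and all function names are illustrative.

```python
import random
import torch

def sample_neighbors(neighbors: list, k: int) -> list:
    # Uniformly sample a fixed-size neighbor set; sample with replacement if too few.
    if len(neighbors) >= k:
        return random.sample(neighbors, k)
    return [random.choice(neighbors) for _ in range(k)]

def kernel_f(x_fwd_p, x_bwd_q, W, U, V, b):
    # x_fwd_p: influence matrix of cited paper p, x_bwd_q: interest matrix of the
    # citing paper q, both (3, D); returns a (3, 3) relation matrix.
    return torch.sigmoid(x_fwd_p @ W @ x_bwd_q.t() + U @ x_fwd_p.t() + V @ x_bwd_q.t() + b)

def propagate_influence(x_fwd, x_bwd, cit_lists, K, W, U, V, b, W1, b1):
    # x_fwd / x_bwd: dicts paper_id -> (3, D) matrices; cit_lists: paper_id -> citing papers.
    out = {}
    for p, cits in cit_lists.items():
        agg = sum(kernel_f(x_fwd[p], x_bwd[c], W, U, V, b) @ x_fwd[c]
                  for c in sample_neighbors(cits, K))          # Formula (2)
        out[p] = torch.sigmoid((x_fwd[p] + agg) @ W1 + b1)     # first-order aggregation
    return out
```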


We set the maximum depth of the GCN as H. By repeating the above process H times, we can get the H-order interest matrix ←X_p^{(H)} of paper p. Given another paper q, to predict whether paper q will influence paper p or whether the authors of paper p will be interested in paper q, we calculate the score c(q, p). Since citation types are diverse, we adopt maximum pooling to find the largest topic association between the different subspaces of paper p and paper q.

c(q, p) = MLP\big( maxpooling\big( X_q (←X_p^{(H)})^T \big) \big)    (4)

We choose the cross entropy loss function. SP_+ and SP_- denote the positive sample set and the negative sample set, which are sampled according to the rule-based sample strategy [21]. Let ĉ(q, p) denote the gold label. Any paper pair (p, q) with a citation relationship is sampled as positive, labeled as ĉ(q, p) = 1. The negative samples are selected from paper pairs without citation relationships according to the sample strategy in [21], labeled as ĉ(q, p) = 0.

L = \sum_{(q,p) ∈ SP_+ ∪ SP_-} c(q, p) \log ĉ(q, p) + λ\|θ\|_2^2    (5)
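The scoring step of Formula (4) can be sketched as follows in PyTorch, with a plain binary cross-entropy signal in the spirit of Formula (5); the MLP width, the sigmoid output and the omission of the L2 regularization term are assumptions of this sketch, not the paper's exact configuration.

```python
import torch
import torch.nn as nn

class PairScorer(nn.Module):
    def __init__(self, hidden: int = 16):
        super().__init__()
        self.mlp = nn.Sequential(nn.Linear(1, hidden), nn.ReLU(), nn.Linear(hidden, 1))

    def forward(self, x_q: torch.Tensor, x_p_interest: torch.Tensor) -> torch.Tensor:
        # x_q: (3, D) subspace matrix of the new paper; x_p_interest: (3, D) H-order
        # interest matrix of a candidate paper.
        assoc = x_q @ x_p_interest.t()            # (3, 3) topic association between subspaces
        pooled = assoc.max()                      # strongest cross-subspace association
        return torch.sigmoid(self.mlp(pooled.view(1, 1))).squeeze()

scorer = PairScorer()
score = scorer(torch.rand(3, 64), torch.rand(3, 64))                  # c(q, p) in (0, 1)
loss = nn.functional.binary_cross_entropy(score, torch.tensor(1.0))   # positive pair
```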

Existing recommendation methods typically initialize paper nodes randomly when using GCN. However, our method directly initializes paper nodes with the papers' semantic-rich subspace embeddings. Besides, considering the asymmetry of topic propagation among papers, we conduct asymmetric GCN on the academic network to model user interests and academic influences in a fine-grained way.

3.5 User Interest Prediction

A new paper is recommended to the potentially interested users based on its content. Given a new paper q, we calculate the probability that user u will be interested in paper q through the function F(u, q), where V_u denotes user u's historical publications.

F(u, q) = \max\{ c(q, p) \mid p ∈ V_u \}    (6)

Since user interests change over time, we adopt the publications within a period as the user interests at different times. We calculate how interested user u is in the paper q according to the user interests in different periods. In this way, the user interests are modeled more accurately.
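A short sketch of Formula (6) with the time-window variant described above; `scorer` is any pairwise scoring function such as the one sketched earlier, and the window convention is an illustrative assumption.

```python
def user_interest_score(scorer, x_q, user_papers, year_of, window=None):
    # user_papers: dict paper_id -> representation matrix of a publication of user u;
    # year_of: dict paper_id -> publication year; window: optional (start, end) years.
    candidates = [pid for pid in user_papers
                  if window is None or window[0] <= year_of[pid] <= window[1]]
    if not candidates:
        return 0.0
    # F(u, q): maximum score of the new paper q against the user's publications.
    return max(float(scorer(x_q, user_papers[pid])) for pid in candidates)
```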

4 Experiments

In this section, we verify the effectiveness of our model on real datasets for the new paper recommendation task. We select some baselines for comparative experiments and analyze the impact of hyper-parameter settings and model structure. Finally, we analyze the paper subspace embeddings.


4.1 Experimental Settings

Datasets. We use the ACM (https://dl.acm.org/) and Scopus (https://www.scopus.com/) datasets. The ACM dataset contains 43380 conference and journal papers in computer science. The Scopus dataset is a multidisciplinary dataset, and we use the papers within the area of computer science, with a total of 18842 papers. Every paper in the datasets contains the paper abstract, authors, publication year, citation relationships, etc.

Baselines and Metrics. We compare our model with several baselines. BUIR [10], LightGCN [11], NeuMF [12] and DirectAU [9] are CF-based methods using the user and item interaction data. KGCN [18], KGCN-LS [19] and RippleNet [20] are AN-based methods, which introduce side information such as keywords besides user-item interactions. NPRec [21] jointly embeds the semantic features of paper contents and the structural features of the academic network. DTNRec is our model. In real recommendation scenarios, users usually pay attention to the first few items recommended, so we choose nDCG@k [8] as the metric to evaluate the ranking results. For each user, we prepare k candidates which contain at least one paper that is actually cited by the user. The candidate papers are ranked according to the value calculated by the function F (6). DCG@k is calculated as DCG@k = \sum_{i=1}^{k} rel_i / \log_2(i+1), where rel_i is a fixed value 5 if the i-th paper is actually cited by the user, and 0 otherwise. IDCG = \sum_{i=1}^{|Ref|} 5 / \log_2(i+1) represents the DCG value corresponding to the best ranking, where |Ref| denotes the number of papers actually cited by the user among the candidate papers.
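A compact sketch of this metric (a plain implementation of the DCG@k and IDCG definitions above, with illustrative inputs):

```python
import math

def ndcg_at_k(ranked_is_cited, k, num_cited):
    # ranked_is_cited: booleans, in ranked order, marking candidates the user actually cited.
    dcg = sum((5.0 if hit else 0.0) / math.log2(i + 2)      # i starts at 0, so log2(i+1+1)
              for i, hit in enumerate(ranked_is_cited[:k]))
    idcg = sum(5.0 / math.log2(i + 2) for i in range(num_cited))
    return dcg / idcg if idcg > 0 else 0.0

print(ndcg_at_k([True, False, True, False], k=4, num_cited=2))  # example ranking
```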

4.2 Results

Performance Analysis. The evaluation results are shown in Table 1. It shows our model DTNRec outperforms the baselines on the new paper recommendation task. Because we introduce the concept of subspace, the paper innovations could be well differentiated. What's more, we fuse semantic features and structural features by performing asymmetric GCN on the academic network, whose nodes are initialized by paper content embeddings over subspaces. In this way, the user interests and paper influences are modeled in a fine-grained way. The CF-based models including BUIR, LightGCN, NeuMF and DirectAU perform worst since they only use interaction data of users and items, without considering other information such as paper content. The AN-based models including KGCN, KGCN-LS and RippleNet perform better than the CF-based methods, because the academic network contains rich high-order hidden information, which is beneficial for accurately modeling user preferences. Both CF-based methods and KG-based methods consider the structural features, without considering semantic features. NPRec considers both of them, so NPRec performs better than

https://dl.acm.org/. https://www.scopus.com/.

Differentiable Topics Guided New Paper Recommendation

51

However, the model structure of NPRec has limitations: it treats the paper content representation as a whole rather than in subspaces, that is, it ignores the knowledge propagation among subspaces.

Table 1. New paper recommendation comparison (nDCG@k).

                      ACM                            Scopus
            k = 20    k = 30    k = 50     k = 20    k = 30    k = 50
BUIR        0.7734    0.7083    0.6681     0.7707    0.7156    0.6626
LightGCN    0.8266    0.7703    0.7314     0.8062    0.7639    0.7231
NeuMF       0.8234    0.7730    0.7419     0.8257    0.7808    0.7234
DirectAU    0.8357    0.7898    0.7423     0.8246    0.7819    0.7235
KGCN        0.8731    0.8592    0.8437     0.8507    0.8365    0.7592
KGCN-LS     0.9093    0.9010    0.8904     0.8660    0.8548    0.8063
RippleNet   0.9217    0.9088    0.8970     0.9040    0.8673    0.8465
NPRec       0.9736    0.9688    0.9645     0.9576    0.9349    0.9021
DTNRec      0.9855    0.9844    0.9663     0.9735    0.9547    0.9329

Impact of User Interest Calculation Method. Generally, user interest changes over time. When we predict whether user u will be interested in a new paper q published after year Y, we should also consider user u's interest after year Y. Therefore, we study the impact of using user u's interests at different times to make predictions. The experimental results are shown in Fig. 3(a). We calculate user interest in the following six ways.
– History-max: the user interest is computed as the function F in Eq. (6), where $V_u$ denotes the publications of user u before year Y.
– Future-max: replaces $V_u$ in history-max with the publications of user u after year Y.
– All-max: replaces $V_u$ in history-max with all the publications of user u.
– History-mean: the same as history-max, but replaces the maximum in the function F (Eq. (6)) with the mean.
– Future-mean: replaces $V_u$ in history-mean with the publications of user u after year Y.
– All-mean: replaces $V_u$ in history-mean with all the publications of user u.
In order to avoid information leakage, when computing user u's interest after year Y, we delete the citation relationships between papers published after year Y from the academic network, which means only u's publications after year Y and references before year Y are considered. The results in Fig. 3(a) show that the future modes perform better than the history and all modes, and the max modes perform better than the mean modes. When the user interest is calculated in the future-max way, our model performs best, which also confirms that user interests change over time.


Fig. 3. The results of the four figures are all carried out on Scopus dataset. (a) Comparison with different user interest computation methods. The hyper-parameter setting is D = 256, H = 1, K = 4. (b) (c) (d) are comparisons on model variants with different K, H, D, respectively. (b) (c) (d) all choose the history-max mode.

Ablation Study. To verify the impact of model structure on model performance, we conduct ablation experiments. The model variants are as follows.
– w Random-update randomly initializes $\overrightarrow{X}_p^{(0)}$ and $\overleftarrow{X}_p^{(0)}$. The parameters of $\overrightarrow{X}_p^{(0)}$ and $\overleftarrow{X}_p^{(0)}$ are updated during the training process.
– w Topic-update initializes $\overrightarrow{X}_p^{(0)}$ and $\overleftarrow{X}_p^{(0)}$ with the matrix $X_p$, and the parameters are updated.
– w/o Topic-update initializes $\overrightarrow{X}_p^{(0)}$ and $\overleftarrow{X}_p^{(0)}$ in the same way as w Topic-update, but the parameters are not updated.
The results are shown in Fig. 3(b) and 3(d). w Random-update performs worst, w Topic-update is better than w Random-update, and w/o Topic-update, which is also the final setting of our model, performs best. This is because the paper subspace embeddings $x_p^b$, $x_p^m$ and $x_p^r$, which are also the topic distributions output by the NTM, are rich in semantic information and do not need to be updated further. Instead, updating them brings information loss, resulting in model performance degradation.

Hyper-parameter Study. We analyze the impact of hyper-parameter settings on model performance. Three hyper-parameters are tested: the neighbor number K, the maximum depth of the GCN H, and the topic number D. The results are shown in Fig. 3(b), 3(c) and 3(d). Figure 3(b) shows that when K becomes larger, nDCG@30 of w Random-update decreases due to the introduction of noise. w Topic-update and w/o Topic-update are not sensitive to the setting of K, which means the initialization by $X_p$ weakens the influence of K on model performance. Figure 3(c) shows that the model performs better when H is set to 1; as H increases, model performance decreases, probably because of the over-smoothing problem that comes with a larger H. As shown in Fig. 3(d), we find that as D increases, the value of nDCG@k first rises and then falls. A smaller D means less semantic information, so the topics in all papers may not be fully covered. The model performs best when D is set to 256. When D is too large, model performance declines, probably because this size of D can already cover all the topics, and increasing it further does not yield richer semantic information but introduces noise.

Fig. 4. Analyze the subspace embeddings. The results are based on ACM dataset. The topic number D is set as 16, so the subspace embeddings of papers are 16-dimensional. Then we reduce them into a 2-dimensional visual space by t-SNE [24]. Gray dots in each figure denote all papers. (a) Subspace embeddings of papers with similar background. (b) Background embeddings of papers with different CCS tags. (c) Background embeddings of Chengxiang Zhai’s publications. (d) (e) (f) respectively analyze the Background, Method and Result embeddings of paper [17] and its references and citations.

Analyze the Subspace Embeddings. We analyze the subspace embeddings from different aspects, where the ACM Computing Classification System (ACM CCS) [7] is used as supplementary information. In order to verify the necessity of subspace, we randomly selected a paper [14] with CCS tag h.3.3 (information search and retrieval). Then 50 papers with similar background to paper [14] are selected from paper set with the same CCS tag. The similarity is obtained by calculating the Euclidean distance of background embeddings. The smaller the distance, the more similar the background. As shown in Fig. 4(a), the red dots denote the background embeddings of the 50 papers. The yellow and blue dots represent method and result embeddings, respectively. We find that papers with similar background may have different


methods and results. However, the topics do not differ dramatically; they vary within a certain range of topics. So the consideration of subspaces is necessary. To verify whether the subspace embeddings of paper content can reflect CCS tag information, we randomly chose three CCS tags: h.3.3 (information search and retrieval), c.2.1 (network architecture and design) and d.3.4 (processors). As shown in Fig. 4(b), the papers with different CCS tags can be well differentiated. Figure 4(c) shows some publications of researcher Chengxiang Zhai between 1998 and 2008. The red dots denote his publications, and the color shades correspond to publication years. It illustrates that a researcher's interests change within a field of study. To study whether the subspace embeddings of paper content can reflect the relationship between a paper and its references and citations, we randomly selected a highly cited paper [17]. As shown in Fig. 4(d), 4(e) and 4(f), the red star denotes paper [17], and the green and blue shapes denote references and citations of [17], respectively. We can see that the topics of a paper and of its references and citations are close in the different subspaces, but there are also differences, which reflect the topic propagation among different subspaces of papers. For example, the distance between the red star [17] and the green triangle [16] in Fig. 4(d) is smaller than the distances in Fig. 4(e) and 4(f), because the backgrounds of paper [17] and paper [16] are both related to the classification of web content, but they used different methods and thus got different results. Besides, the distances between the red star [17] and the blue circle [15] in Fig. 4(d) and 4(e) are smaller than the distance in Fig. 4(f), because the backgrounds of paper [17] and paper [15] are similar and both adopted a user survey method, while their results differ because the core issues of their research are different. It is worth mentioning that paper [17] and paper [15] have the same author, Mika, which illustrates that researchers tend to use similar methods in their publications.
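For reference, the visualisation step described above (16-dimensional subspace embeddings reduced to 2D with t-SNE) can be reproduced roughly as follows; the input file and the selection of the 50 highlighted papers are placeholders, not the authors' actual script.

```python
import numpy as np
from sklearn.manifold import TSNE
import matplotlib.pyplot as plt

# Hypothetical array of background-subspace embeddings output by the NTM, shape (num_papers, 16).
X_background = np.load("background_embeddings.npy")

xy = TSNE(n_components=2, random_state=0).fit_transform(X_background)

plt.scatter(xy[:, 0], xy[:, 1], s=3, c="gray", label="all papers")       # gray dots: all papers
plt.scatter(xy[:50, 0], xy[:50, 1], s=10, c="red", label="similar background")
plt.legend()
plt.show()
```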

5 Conclusion

We propose a differentiable topics based new paper recommendation model DTNRec. In DTNRec, we adopt the subspace tagging model and NTM to get embeddings of paper content. Then we model the user interest through the asymmetric GCN on the academic network. The experimental results show the effectiveness of our model.

References 1. Kreutz, C.K., Schenkel, R.: Scientific paper recommendation systems: a literature review of recent publications. Int. J. Digit. Libr. 23(4), 335–369 (2022) 2. Lewis, M., Liu, Y., et al.: BART: denoising sequence-to-sequence pre-training for natural language generation, translation, and comprehension. In: Annual Meeting of the Association for Computational Linguistics, pp. 7871–7880 (2020) 3. Radford, A., Wu, J., et al.: Language models are unsupervised multitask learners. OpenAI blog 1(8), 9 (2019)


4. Devlin, J., et al.: BERT: pre-training of deep bidirectional transformers for language understanding. In: NAACL-HLT, pp. 4171–4186 (2019) 5. Radford, A., Narasimhan, K., et al.: Improving language understanding by generative pre-training (2018) 6. Vaswani, A., Shazeer, N., et al.: Attention is all you need. In: Advances in Neural Information Processing Systems, vol. 30 (2017) 7. Coulter, N., et al.: Computing classification system 1998: current status and future maintenance report of the CCS update committee. Comput. Rev. 39(1), 1–62 (1998) 8. Wang, Y., Wang, L., et al.: A theoretical analysis of NDCG type ranking measures. In: Conference on Learning Theory, pp. 25–54. PMLR (2013) 9. Wang, C., Yu, Y., et al.: Towards representation alignment and uniformity in collaborative filtering. In: ACM SIGKDD Conference on Knowledge Discovery and Data Mining, pp. 1816–1825 (2022) 10. Lee, D., Kang, S., et al.: Bootstrapping user and item representations for one-class collaborative filtering. In: International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 317–326 (2021) 11. He, X., Deng, K., et al.: LightGCN: simplifying and powering graph convolution network for recommendation. In: International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 639–648 (2020) 12. He, X., Liao, L., et al.: Neural collaborative filtering. In: 26th International Conference on World Wide Web, pp. 173–182 (2017) 13. Li, W., et al.: Joint embedding multiple feature and rule for paper recommendation. In: Sun, Y., Liu, D., Liao, H., Fan, H., Gao, L. (eds.) ChineseCSCW. CCIS, vol. 1492, pp. 52–65. Springer, Singapore (2021). https://doi.org/10.1007/978-981-194549-6 5 14. Joachims, T.: Optimizing search engines using clickthrough data. In: ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 133–142 (2002) 15. Aula, A., et al.: Information search and re-access strategies of experienced web users. In: International Conference on World Wide Web, pp. 583–592 (2005) 16. Dumais, S., et al.: Hierarchical classification of web content. In: ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 256–263 (2000) 17. K¨ aki, M.: Findex: search result categories help users when document ranking fails. In: SIGCHI Conference on Human Factors in Computing Systems, pp. 131–140 (2005) 18. Wang, H., Zhao, M., et al.: Knowledge graph convolutional networks for recommender systems. In: The World Wide Web Conference, pp. 3307–3313 (2019) 19. Wang, H., Zhang, F., et al.: Knowledge-aware graph neural networks with label smoothness regularization for recommender systems. In: ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp. 968–977 (2019) 20. Wang, H., Zhang, F., et al.: RippleNet: propagating user preferences on the knowledge graph for recommender systems. In: ACM International Conference on Information and Knowledge Management, pp. 417–426 (2018) 21. Xie, Y., et al.: Subspace embedding based new paper recommendation. In: International Conference on Data Engineering (ICDE), pp. 1767–1780. IEEE (2022) 22. Jin, D., Szolovits, P.: Hierarchical neural networks for sequential sentence classification in medical scientific abstracts. In: Conference on Empirical Methods in Natural Language Processing, pp. 3100–3109 (2018) 23. Miao, Y., et al.: Discovering discrete latent topics with neural variational inference. In: International Conference on Machine Learning, pp. 2410–2419. PMLR (2017)


24. Van der Maaten, L., Hinton, G.: Visualizing data using t-SNE. J. Mach. Learn. Res. 9(11) (2008) 25. Xie, Y., Wang, S., et al.: Embedding based personalized new paper recommendation. In: Sun, Y., Liu, D., Liao, H., Fan, H., Gao, L. (eds.) ChineseCSCW 2020. CCIS, vol. 1330, pp. 558–570. Springer, Singapore (2021). https://doi.org/10.1007/ 978-981-16-2540-4 40 26. Xie, Y., Sun, Y., et al.: Learning domain semantics and cross-domain correlations for paper recommendation. In: 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 706–715 (2021)

IIHT: Medical Report Generation with Image-to-Indicator Hierarchical Transformer

Keqiang Fan(B), Xiaohao Cai, and Mahesan Niranjan

School of Electronics and Computer Science, University of Southampton, Southampton SO17 1BJ, UK
{k.fan,x.cai,mn}@soton.ac.uk

Abstract. Automated medical report generation has become increasingly important in medical analysis. It can produce computer-aided diagnosis descriptions and thus significantly alleviate the doctors' work. Inspired by the huge success of neural machine translation and image captioning, various deep learning methods have been proposed for medical report generation. However, due to the inherent properties of medical data, including data imbalance and the length and correlation between report sequences, the generated reports by existing methods may exhibit linguistic fluency but lack adequate clinical accuracy. In this work, we propose an image-to-indicator hierarchical transformer (IIHT) framework for medical report generation. It consists of three modules, i.e., a classifier module, an indicator expansion module and a generator module. The classifier module first extracts image features from the input medical images and produces disease-related indicators with their corresponding states. The disease-related indicators are subsequently utilised as input for the indicator expansion module, incorporating the "data-text-data" strategy. The transformer-based generator then leverages these extracted features along with image features as auxiliary information to generate final reports. Furthermore, the proposed IIHT method is feasible for radiologists to modify disease indicators in real-world scenarios and integrate the operations into the indicator expansion module for fluent and accurate medical report generation. Extensive experiments and comparisons with state-of-the-art methods under various evaluation metrics demonstrate the great performance of the proposed method.

Keywords: Medical report generation · Deep neural networks · Transformers · Chest X-Ray

1 Introduction

Medical images (e.g. radiology and pathology images) and the corresponding reports serve as critical catalysts for disease diagnosis and treatment [22]. A medical report generally includes multiple sentences describing a patient's history symptoms and normal/abnormal findings from different regions within the medical images. However, in clinical practice, writing standard medical reports


is tedious and time-consuming for experienced medical doctors and error-prone for inexperienced doctors. This is because the comprehensive analysis of e.g. XRay images necessitates a detailed interpretation of visible information, including the airway, lung, cardiovascular system and disability. Such interpretation requires the utilisation of foundational physiological knowledge alongside a profound understanding of the correlation with ancillary diagnostic findings, such as laboratory results, electrocardiograms and respiratory function tests. Therefore, the automatic report generation technology, which can alleviate the medics’ workload and effectively notify inexperienced radiologists regarding the presence of abnormalities, has garnered dramatic interest in both artificial intelligence and clinical medicine. Medical report generation has a close relationship with image captioning [9,31]. The encoder-decoder framework is quite popular in image captioning, e.g., a CNN-based image encoder to extract the visual information and an RNN/LSTM-based report decoder to generate the textual information with visual attention [11,14,29,30]. With the recent progress in natural language processing, investigating transformer-based models as alternative decoders has been a growing trend for report generation [2,3,21,27]. The self-attention mechanism employed inside the transformer can effectively eliminate information loss, thereby maximising the preservation of visual and textual information in the process of generating medical reports. Although these methods have achieved remarkable performance and can obtain language fluency reports, limited studies have been dedicated to comprehending the intrinsic medical and clinical problems. The first problem is data imbalance, e.g., the normal images dominate the dataset over the abnormal ones [24] and, for the abnormal images, normal regions could encompass a larger spatial extent than abnormal regions [17]. The narrow data distribution could make the descriptions of normal regions dominate the entire report. On the whole, imbalanced data may degrade the quality of the automatically generated reports, or even result in all generated reports being basically similar. The second problem is length and correlation between report sequences. Medical report generation is designed to describe and record the patient’s symptoms from e.g. radiology images including cardiomegaly, lung opacity and fractures, etc. The description includes various disease-related symptoms and related topics rather than the prominent visual contents and related associations within the images, resulting in the correlation inside the report sequences not being as strong as initially presumed. The mere combination of encoders (e.g. CNNs) and decoders (e.g. RNN, LSTM, and transformers) is insufficient to effectively tackle the aforementioned issues in the context of medical images and reports since these modalities represent distinct data types. The above challenges motivate us to develop a more comprehensive method to balance visual and textual features in unbalanced data for medical report generation. The radiologists’ working pattern in medical report writing is shown in Fig. 1. Given a radiology image, radiologists first attempt to find the abnormal regions and evaluate the states for each disease indicator, such as uncertain, negative and


Fig. 1. The medical report writing procedure undertaken by radiologists.

positive. Then a correct clinical report is written through the stages for different indicators based on their working experience and prior medical knowledge. In this paper, we propose an image-to-indicator hierarchical transformer (IIHT) framework, imitating the radiologists’ working patterns (see Fig. 1) to alleviate the above-mentioned problems in medical report generation. Our IIHT framework models the above working patterns through three modules: classifier, indicator expansion and generator. The classifier module is an image diagnosis module, which could learn visual features and extract the corresponding disease indicator embedding from the input image. The indicator expansion module conducts the data-to-text progress, i.e., transferring the disease indicator embedding into short text sequences. The problem of data imbalance could be alleviated by encoding the indicator information, which models the domain-specific prior knowledge structure and summarises the disease indicator information and thus mitigates the long-sequence effects. Finally, the generator module produces the reports based on the encoded indicator information and image features. The whole generation pipeline is given in Fig. 2, which will be described in detail in Sect. 3. We remark that the disease indicator information here can also be modified by radiologists to standardise report fluency and accuracy. Overall, the contributions of this paper are three-fold: • We propose the IIHT framework, aiming to alleviate the data bias/imbalance problem and enhance the information correlation in long report sequences for medical report generation. • We develop a dynamic approach which leverages integrated indicator information and allows radiologists to further adjust the report fluency and accuracy. • We conduct comprehensive experiments and comparisons with state-of-theart methods on the IU X-Ray dataset and demonstrate that our proposed method can achieve more accurate radiology reports. The rest of the paper is organised as follows. Section 2 briefly recalls the related work in medical report generation. Our proposed method is introduced in Sect. 3. Sections 4 and 5 present the details of the experimental setting and corresponding results, respectively. We conclude in Sect. 6.


2 Related Work

Image Captioning. The image captioning methods mainly adopt the encoderdecoder framework together with attention mechanisms [31] to translate the image into a single short descriptive sentence and have achieved great performance [1,15,18,26]. Specifically, the encoder network extracts the visual representation from the input images and the decoder network generates the corresponding descriptive sentences. The attention mechanism enhances the coexpression of the visual features derived from the intermediate layers of CNNs and the semantic features from captions [31]. Recently, inspired by the capacity of parallel training, transformers [25] have been successfully applied to predict words according to multi-head self-attention mechanisms. However, these models demonstrate comparatively inferior performance on medical datasets as opposed to natural image datasets, primarily due to the disparity between homogeneous objects observed in different domains. For instance, in the context of X-Ray images, there exists a relatively minimal discernible distinction between normal and abnormal instances, thereby contributing to the challenge encountered by models in accurately generating such captions. Medical Report Generation. Similar to image captioning, most existing medical report generation methods attempt to adopt a CNN-LSTM-based model to automatically generate fluent reports [11,14,20,28]. Direct utilisation of caption models often leads to the generation of duplicate and irrelevant reports. The work in [11] developed a hierarchical LSTM model and a co-attention mechanism to extract the visual information and generate the corresponding descriptions. Najdenkoska et al. [20] explored variational topic inference to guide sentence generation by aligning image and language modalities in a latent space. A twolevel LSTM structure was also applied with a graph convolution network based on the knowledge graph to mine and represent the associations among medical findings during report generation [27]. These methodologies encompass the selection of the most probable diseases or latent topic variables based on the sentence sequence or visual features within the data in order to facilitate sentence generation. Recently, inspired by the capacity of parallel training, transformers [13,32] have successfully been applied to predict words according to the extracted features from CNN. Chen et al. [2] proposed a transformer-based cross-modal memory network using a relational memory to facilitate interaction and generation across data modalities. Nguyen et al. [21] designed a differentiable end-to-end network to learn the disease feature representation and disease state to assist report generation. The existing methods mentioned above prioritise the enhancement of feature alignment between visual regions and disease labels. However, due to the inherent data biases and scarcity in the medical field, these models exhibit a bias towards generating reports that are plausible yet lack explicit abnormal descriptions. Generating a radiology report is very challenging as it requires the contents of key medical findings and abnormalities with detailed descriptions for different data modalities. In this study, we address the challenges associated with data


bias and scarcity in clinical reports through the utilisation of disease indicators as a bridge for more comprehensive medical report generation.

3 Method

Fig. 2. The proposed IIHT framework. It consists of three modules: classifier, indicator expansion and generator.

An overview of our proposed IIHT framework is demonstrated in Fig. 2. It follows the distinct stages involved in generating a comprehensive medical imaging diagnosis report, adhering to the established process employed in clinical radiology (e.g. see Fig. 1). Given a radiology image $I$, the corresponding indicators are all classified into different states (e.g. positive, negative, uncertain, etc.), denoted as $C = \{c_1, \cdots, c_t, \cdots, c_T\}$, where $T$ is the number of indicators and $c_t$ is the one-hot encoding of the states. Particularly, these indicators can also be modified by radiologists to standardise the disease states across patients, thereby enhancing the correctness of the final generated report. The corresponding generated report for a given radiology image is denoted as $y = (y_1, \cdots, y_n, \cdots, y_N)$, where $y_n \in \mathcal{V}$ is a generated unigram token, $N$ is the length of the report, and $\mathcal{V}$ is the vocabulary of all possible $v$ tokens for report generation. For example, the word sequence "Pleural effusion" is segmented into small pieces of tokens,


i.e., {"Pleural", "effus", "ion"}. Generally, the aim of the report generation is to maximise the conditional log-likelihood, i.e.,

$$\theta^{*} = \arg\max_{\theta} \sum_{n=1}^{N} \log p_{\theta}\left(y_n \mid y_1, \ldots, y_{n-1}, I\right), \qquad (1)$$

where $\theta$ denotes the model parameters and $y_0$ represents the start token. After incorporating each disease indicator $c \in C$ into the conditional probability $p_{\theta}(y_n \mid y_1, \ldots, y_{n-1}, I)$, we have

$$\log p_{\theta}\left(y_n \mid y_1, \ldots, y_{n-1}, I\right) = \int_{C} \log p_{\theta}\left(y_n \mid y_1, \ldots, y_{n-1}, c, I\right) p_{\theta}(c \mid I)\, \mathrm{d}c, \qquad (2)$$

where $p_{\theta}(c \mid I)$ represents the classifier module. Recall that our IIHT framework is demonstrated in Fig. 2. The details are described in the subsections below.

3.1 Classifier Module

Image Encoder. The first step in medical report generation is to extract the visual features from the given medical images. In our research, we employ a pre-trained visual feature extractor, such as ResNet [8], to extract the visual features from patients' radiology images that commonly contain multiple view images. For simplicity, given a set of r radiology images $\{I_i\}_{i=1}^{r}$, the final visual features, say $x$, are obtained by merging the corresponding features of each image using max-pooling across the last convolutional layer. The process is formulated as $x = f_v(I_1, I_2, \cdots, I_r)$, where $f_v(\cdot)$ refers to the visual extractor and $x \in \mathbb{R}^{F}$ with $F$ the number of features.

Capture Disease Indicator Embedding. The visual features are further transformed into multiple low-dimensional feature vectors, regarded as disease indicator embeddings, which have the capacity to capture interrelationships and correlations among different diseases. The disease indicator embedding is denoted as $D = (d_1, \cdots, d_T) \in \mathbb{R}^{e \times T}$, where $e$ is the embedding dimension and recall that $T$ is the number of indicators. Each vector $d_t \in \mathbb{R}^{e}$, $t = 1, \cdots, T$, is the representation of the corresponding disease indicator, which can be acquired through a linear transformation of the visual features, i.e.,

$$d_t = W_t x + b_t, \qquad (3)$$

where $W_t \in \mathbb{R}^{F \times e}$ and $b_t \in \mathbb{R}^{e}$ are learnable parameters of the t-th disease representation. The intuitive advantage of separating high-dimensional image features into distinct low-dimensional embeddings is that it facilitates the exploration of the relationships among disease indicators. However, when dealing with medical images, relying solely on disease indicator embeddings is insufficient due to the heterogeneous information, including the disease type (e.g. disease name) and the disease status (e.g. positive or negative). Consequently, we undertake further decomposition of the disease indicator embedding, thereby leading to the conception of the subsequent state embedding.

Capture State Embedding. To improve the interpretability of the disease indicator embeddings, a self-attention module is employed to offer valuable insights into the representation of each indicator. Each indicator embedding is further decomposed to obtain the disease state such as positive, negative or uncertain. Let $M$ be the number of states and $S = (s_1, \cdots, s_M) \in \mathbb{R}^{e \times M}$ be the state embedding, which is randomly initialized and learnable. Given a disease indicator embedding vector $d_t$, the final state-aware disease embedding, say $\hat{d}_t \in \mathbb{R}^{e}$, is obtained by $\hat{d}_t = \sum_{m=1}^{M} \alpha_{tm} s_m$, where $\alpha_{tm}$ is the self-attention score of $d_t$ and $s_m$ defined as

$$\alpha_{tm} = \frac{\exp(d_t \cdot s_m)}{\sum_{m'=1}^{M} \exp(d_t \cdot s_{m'})}. \qquad (4)$$

Iteratively, each disease indicator representation $d_t$ will be matched with its corresponding state embedding $s_m$ by computing vector similarity, resulting in an improved disease indicator representation $\hat{d}_t$.

Classification. To enhance the similarity between $d_t$ and $s_m$, we treat this as a multi-label problem. The calculated self-attention score $\alpha_{tm}$ is the confidence level of classifying disease $t$ into the state $m$, which is then used as a predictive value. By abuse of notation, let $c_t = \{c_{t1}, \cdots, c_{tm}, \cdots, c_{tM}\}$ be the $t$-th ground-truth disease indicator and $\alpha_t = \{\alpha_{t1}, \cdots, \alpha_{tm}, \cdots, \alpha_{tM}\}$ be the prediction, where $c_{tm} \in \{0, 1\}$ and $\alpha_{tm} \in (0, 1)$. The loss of the multi-label classification can be defined as

$$L_C = -\frac{1}{T} \sum_{t=1}^{T} \sum_{m=1}^{M} c_{tm} \log\left(\alpha_{tm}\right). \qquad (5)$$

The maximum value $\alpha_{tm}$ in $\alpha_t$ represents the predicted state for disease $t$. To enable integration with the indicator expansion module, we adopt an alternative approach; instead of directly utilizing $\hat{d}_t$, we recalculate the state-aware embedding for the $t$-th disease indicator, denoted as $\hat{s}_t \in \mathbb{R}^{e}$, i.e.,

$$\hat{s}_t = \sum_{m=1}^{M} \begin{cases} c_{tm} s_m, & \text{if training phase,} \\ \alpha_{tm} s_m, & \text{otherwise.} \end{cases} \qquad (6)$$

Hence, the state-aware disease indicator embedding $\hat{s}_t$ directly contains the state information of the disease $t$.
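A compact PyTorch-style sketch of the classifier module is given below, covering the indicator projection of Eq. (3), the state attention of Eq. (4), the multi-label loss of Eq. (5) and the state-aware embedding of Eq. (6). The tensor shapes, the ResNet-50 feature size and the class and function names are our own assumptions rather than the authors' code.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class ClassifierModule(nn.Module):
    def __init__(self, feat_dim=2048, emb_dim=512, num_indicators=11, num_states=3):
        super().__init__()
        # Eq. (3): one linear map (W_t, b_t) per disease indicator.
        self.proj = nn.ModuleList(nn.Linear(feat_dim, emb_dim) for _ in range(num_indicators))
        # Learnable state embeddings S = (s_1, ..., s_M).
        self.S = nn.Parameter(torch.randn(num_states, emb_dim))

    def forward(self, x, c=None):
        # x: (batch, feat_dim) pooled visual features; c: (batch, T, M) one-hot states (training only).
        d = torch.stack([layer(x) for layer in self.proj], dim=1)            # (batch, T, e)
        alpha = F.softmax(torch.einsum("bte,me->btm", d, self.S), dim=-1)    # Eq. (4)
        weights = c if c is not None else alpha                              # Eq. (6): labels in training
        s_hat = weights @ self.S                                             # (batch, T, e)
        return alpha, s_hat

def classification_loss(alpha, c):
    # Eq. (5): L_C = -(1/T) * sum_t sum_m c_tm * log(alpha_tm), averaged over the batch.
    return -(c * torch.log(alpha + 1e-8)).sum(dim=(1, 2)).mean() / alpha.size(1)
```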

3.2 Indicator Expansion Module

In the indicator expansion module, we employ a "data-text-data" conversion strategy. This strategy involves converting the input indicator embedding from its original format into a textual sequential word representation and then converting it back to the original format. The inherent interpretability of short disease indicator sequences can be further enhanced, resulting in generating more


reliable medical reports. For each disease indicator and its state, whether it is the ground-truth label $c_t$ or the predicted label $\alpha_t$, it can be converted into a sequence of words, denoted as $\hat{c}_t = \{\hat{c}_{t1}, \cdots, \hat{c}_{tk}, \cdots, \hat{c}_{tK}\}$, where $\hat{c}_{tk} \in \mathcal{W}$ is the corresponding word in the sequence, $K$ is the length of the word sequence, and $\mathcal{W}$ is the vocabulary of all possible words in all indicators. For example, an indicator such as "lung oedema uncertain" can be converted into a word sequence such as {"lung", "oedema", "uncertain"}. To extract the textual information within the short word sequence for each disease $t$, we use a one-layer bi-directional gated recurrent unit as an encoder, say $f_w(\cdot)$, followed by a multi-layer perceptron (MLP) $\Phi$ to generate the indicator information $h_t \in \mathbb{R}^{e}$, i.e.,

$$h_t = \Phi\left(h^{w}_{t0} + h^{w}_{tK}\right), \quad h^{w}_{tk} = f_w\left(\hat{c}_{tk}, h^{w}_{t,k-1}\right), \qquad (7)$$

where $h^{w}_{tk} \in \mathbb{R}^{e}$ is the hidden state in $f_w$. For each disease indicator, the initial state ($k = 0$) in $f_w$ is the corresponding state-aware disease indicator embedding $\hat{s}_t$, i.e., $h^{w}_{t0} = \hat{s}_t$.
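A hedged sketch of the indicator expansion step in Eq. (7): each indicator's short word sequence is encoded by a bidirectional GRU whose initial state is derived from the state-aware embedding ŝ_t, and the outputs are merged by an MLP. The layer sizes, the way ŝ_t is split across the two GRU directions and the use of the last time step for h^w_{tK} are simplifying assumptions.

```python
import torch
import torch.nn as nn

class IndicatorExpansion(nn.Module):
    def __init__(self, vocab_size: int, emb_dim: int = 512):
        super().__init__()
        self.word_emb = nn.Embedding(vocab_size, emb_dim)
        # Bidirectional GRU with half-sized hidden state so the concatenated output is emb_dim wide.
        self.gru = nn.GRU(emb_dim, emb_dim // 2, batch_first=True, bidirectional=True)
        self.mlp = nn.Sequential(nn.Linear(emb_dim, emb_dim), nn.ReLU(), nn.Linear(emb_dim, emb_dim))

    def forward(self, word_ids, s_hat):
        # word_ids: (batch, K) token ids of one indicator phrase; s_hat: (batch, emb_dim) plays h^w_{t0}.
        h0 = s_hat.view(word_ids.size(0), 2, -1).permute(1, 0, 2).contiguous()  # (2, batch, emb_dim//2)
        out, _ = self.gru(self.word_emb(word_ids), h0)                          # (batch, K, emb_dim)
        return self.mlp(s_hat + out[:, -1, :])                                  # Phi(h^w_{t0} + h^w_{tK})

expand = IndicatorExpansion(vocab_size=200)
h_t = expand(torch.randint(0, 200, (4, 3)), torch.randn(4, 512))                # -> (4, 512)
```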

3.3 Generator Module

The generator, say $f_g$, of our IIHT framework is based on the transformer encoder architecture, comprising $Z$ stacked masked multi-head self-attention layers alongside a feed-forward layer positioned at the top of each layer. Each word $y_k$ in the ground-truth report is transferred into the corresponding word embedding $\hat{y}_k \in \mathbb{R}^{e}$. For the new word $y_n$, the hidden state representation $h_n \in \mathbb{R}^{e}$ in the generator $f_g$ is computed based on the previous word embeddings $\{\hat{y}_k\}_{k=1}^{n-1}$, the calculated indicator information $\{h_t\}_{t=1}^{T}$ and the visual representation $x$, i.e.,

$$h_n = f_g\left(\hat{y}_1, \ldots, \hat{y}_{n-1}, h_1, \cdots, h_T, x\right). \qquad (8)$$

For the $i$-th report, the confidence $p^{i}_{n} \in \mathbb{R}^{v}$ of the word $y_n$ is calculated by

$$p^{i}_{n} = \mathrm{softmax}\left(W_p^{\top} h_n\right), \qquad (9)$$

where $W_p \in \mathbb{R}^{e \times v}$ is a learnable parameter and recall that $v$ is the size of $\mathcal{V}$. The loss function of the generator, say $L_G$, is determined based on the cross-entropy loss, quantifying all the predicted words in all the given $l$ medical reports with their ground truth, i.e.,

$$L_G = -\frac{1}{l} \sum_{i=1}^{l} \sum_{n=1}^{N} \sum_{j=1}^{v} y^{i}_{nj} \log\left(p^{i}_{nj}\right), \qquad (10)$$

where $p^{i}_{nj}$ is the $j$-th component of $p^{i}_{n}$, and $y^{i}_{nj}$ is the $j$-th component of $y^{i}_{n} \in \mathbb{R}^{v}$, which is the ground-truth one-hot encoding for word $y_n$ in the $i$-th report. Therefore, the final loss of our IIHT method is

$$L = \lambda L_G + (1 - \lambda) L_C, \qquad (11)$$

where $\lambda$ is a hyperparameter.
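As a brief illustration of how the generator loss of Eq. (10) and the total objective of Eq. (11) could be combined in training code (λ = 0.5 as in the experiments), consider the following sketch; the padding index and tensor layouts are assumptions.

```python
import torch
import torch.nn.functional as F

def generator_loss(logits, target_ids, pad_id=0):
    """logits: (batch, N, vocab) word scores from the generator; target_ids: (batch, N) token ids.
    Cross-entropy over all predicted words, in the spirit of Eq. (10)."""
    return F.cross_entropy(logits.transpose(1, 2), target_ids, ignore_index=pad_id)

def total_loss(logits, target_ids, alpha, indicator_states, lam=0.5):
    """Eq. (11): L = lambda * L_G + (1 - lambda) * L_C."""
    l_g = generator_loss(logits, target_ids)
    l_c = -(indicator_states * torch.log(alpha + 1e-8)).sum(dim=(1, 2)).mean() / alpha.size(1)
    return lam * l_g + (1.0 - lam) * l_c
```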

4 Experimental Setup

4.1 Data

The publicly available IU X-Ray dataset [4] is adopted for our evaluation. It contains 7,470 chest X-Ray images associated with 3,955 fully de-identified medical reports. Within our study, each report comprises multi-view chest X-Ray images along with distinct sections dedicated to impressions, findings and indications.

4.2 Implementation

Our analysis primarily focuses on reports with a finding section, as it is deemed a crucial component of the report. To tackle the issue of data imbalance, we utilise a strategy wherein we extract 11 prevalent disease indicators from the dataset, excluding the "normal" indicators based on the findings and indication sections of the reports. Additionally, three states (i.e., uncertain, negative and positive) are assigned to each indicator. In cases where a report lacks information regarding all indicators, we discard the report to ensure data integrity and reliability. The preprocessing of all reports is followed by the random selection of image-report pairs, which are then divided into three sets, i.e., training, validation and test sets. The distribution of these sets is 70%, 10% and 20%, respectively. All the words in the reports are segmented into small pieces by SentencePiece [12]. Standard five-fold cross-validation on the training set is used for model selection. To extract visual features, we utilise two different models: ResNet-50 [8] pretrained on ImageNet [5] and a vision transformer (ViT) [7]. Prior to extraction, the images are randomly cropped to a size of 224 × 224, accompanied by data augmentation techniques. Within our model, the disease indicator embedding, indicator expansion module and generator module all have a hidden dimension of 512. During training, we iterate 300 epochs with a batch size of 8. The hyperparameter $\lambda$ in the loss function is set to 0.5. For optimisation, we employ AdamW [19] with a learning rate of $10^{-6}$ and a weight decay of $10^{-4}$.
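The optimisation setup above corresponds to roughly the following configuration (a sketch only; the placeholder model and data stand in for the actual IIHT modules and data loader):

```python
import torch
import torch.nn as nn

model = nn.Linear(10, 3)   # placeholder for the IIHT classifier/expansion/generator modules
optimizer = torch.optim.AdamW(model.parameters(), lr=1e-6, weight_decay=1e-4)

# One illustrative step; in the paper's setting a batch holds 8 image-report pairs and the
# loss is the combined objective L = 0.5 * L_G + 0.5 * L_C.
x, y = torch.randn(8, 10), torch.randint(0, 3, (8,))
loss = nn.functional.cross_entropy(model(x), y)
optimizer.zero_grad()
loss.backward()
optimizer.step()
```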

4.3 Metrics

The fundamental evaluation concept for the generated reports is to quantify the correlation between the generated and the ground-truth reports. Following most of the image captioning methods, we apply the most popular natural language generation metrics, i.e., 1–4 gram BLEU [23], Rouge-L [16] and METEOR [6], to evaluate our model.

5 Experimental Results

In this section, we first evaluate and compare our IIHT method with the state-ofthe-art medical report generation methods. Then we conduct an ablation study for our method to verify the effectiveness of the indicator expansion module under different image extractors.


Table 1. Comparison between our IIHT method and the state-of-the-art medical report generation methods on the IU X-Ray dataset. Sign † refers to the results from the original papers. A higher value denotes better performance in all columns.

Methods                    BLEU-1         BLEU-2         BLEU-3         BLEU-4         METEOR         ROUGE-L
VTI [20]†                  0.493          0.360          0.291          0.154          0.218          0.375
Wang et al. [27]†          0.450          0.301          0.213          0.158          -              0.384
CMR [2]†                   0.475          0.309          0.222          0.170          0.191          0.375
R2Gen [3]†                 0.470          0.304          0.219          0.165          0.187          0.371
Eddie-Transformer [21]†    0.466          0.307          0.218          0.158          -              0.358
CMAS [10]†                 0.464          0.301          0.210          0.154          -              0.362
DeltaNet [29]†             0.485          0.324          0.238          0.184          -              0.379
Ours                       0.513 ± 0.006  0.375 ± 0.005  0.297 ± 0.006  0.245 ± 0.006  0.264 ± 0.002  0.492 ± 0.004

5.1 Report Generation

We compare our method with the state-of-the-art medical report generation models, including the variational topic inference (VTI) framework [20], a graph-based method to integrate prior knowledge in generation [27], the cross-modal memory network (CMR) [2], the memory-driven transformer (R2Gen) [3], the co-operative multi-agent system (CMAS) [10], the enriched disease embedding based transformer (Eddie-Transformer) [21], and the conditional generation process for report generation (DeltaNet) [29]. The quantitative results of all the methods on the IU X-Ray dataset are reported in Table 1. It clearly shows that our proposed IIHT method outperforms the state-of-the-art methods by a large margin across all the evaluation metrics, demonstrating the dramatic effectiveness of our method. The methods under comparison in our study focus on exploring the correlation between medical images and medical reports. Some of these approaches have incorporated supplementary indicators as auxiliary information. However, these indicators primarily comprise frequently occurring phrases across all reports, disregarding the inherent imbalance within medical data. Consequently, the generated reports often treat abnormal patients as normal, since the phrases describing normal areas dominate the dataset. In contrast, our proposed method leverages disease indicators and assigns corresponding states based on the reported content. By adopting a "data-text-data" conversion approach in the indicator expansion module, our method effectively mitigates the issue of misleading the generated medical reports, and thus surpasses the performance of the existing approaches.

5.2 Ablation Study

We now conduct an ablation study for our method to verify the effectiveness of different image extractors. Table 2 presents the results of our experiments, wherein we employed different visual feature extractors with and without the


indicator expansion module. Specifically, we exclude the original "data-text-data" conversion strategy; instead, the disease indicator state features are directly used as the input of the MLP layer. This study allows us to analyse the influence of the "data-text-data" strategy within the indicator expansion module on the performance of the proposed IIHT framework.

Table 2. The ablation study of our method on the IU X-Ray dataset. "w/o Indicator" refers to the model without the indicator expansion module.

Methods             Encoder     BLEU-1         BLEU-2         BLEU-3         BLEU-4         METEOR         ROUGE-L
IIHT w/o Indicator  ViT         0.434 ± 0.002  0.294 ± 0.004  0.210 ± 0.004  0.153 ± 0.004  0.216 ± 0.001  0.409 ± 0.005
IIHT (Proposed)     ViT         0.463 ± 0.006  0.323 ± 0.005  0.241 ± 0.005  0.186 ± 0.004  0.234 ± 0.003  0.445 ± 0.004
IIHT w/o Indicator  ResNet-50   0.428 ± 0.007  0.271 ± 0.008  0.188 ± 0.003  0.136 ± 0.003  0.185 ± 0.002  0.376 ± 0.004
IIHT (Proposed)     ResNet-50   0.513 ± 0.006  0.375 ± 0.005  0.297 ± 0.006  0.245 ± 0.006  0.264 ± 0.002  0.492 ± 0.004

By excluding the incremental disease indicator information, we observe that the image extractor ViT has a better performance than ResNet-50, see the results of the first and third rows in Table 2. This indicates that ViT is capable of effectively capturing semantic feature relationships within images. These findings provide evidence regarding the advantages of ViT in extracting visual information from images. We also observe that utilising indicator information extracted from the indicator expansion module indeed contributes to the generation of precise and comprehensive medical reports, resulting in a noteworthy enhancement in terms of the quality of the generated reports. This improvement is observed when using both ViT and ResNet-50. Interestingly, as indicated in the second and fourth rows in Table 2, when the indicator expansion module is added, the performance improvement of ViT is not as significant as that of ResNet-50. We hypothesise that ViT requires a substantial amount of data to learn effectively from scratch. It is possible that the limited number of iterations during finetuning prevents ViT from achieving its full potential in performance enhancement. On the whole, our proposed IIHT method offers significant improvements over the state-of-the-art models. This enhancement can be attributed to the inclusion of the disease indicator expansion module, which plays a crucial role in enhancing the quality of the generated reports. Finally, in Table 3, we showcase some examples of the reports generated by our method. By incorporating both images and indicators, our method closely mimics the process followed by radiologists when composing medical reports while also addressing the data imbalance challenge. Even in the case where all indicators are normal, a generated report for a healthy patient typically includes

Table 3. Generated samples by our method on the IU X-Ray dataset.

a description of various disease indicators, as shown in the first example in Table 3. For patients with abnormal conditions, our method still has a remarkable ability to accurately generate comprehensive reports. Moreover, our method incorporates the capability of facilitating real-time modification of disease indicators, thereby enabling a more accurate and complete process for report generation. This functionality serves to minimise the occurrence of misdiagnosis instances, and thus enhances the overall accuracy and reliability of the generated reports. As a result, we reveal that the generated medical reports with the use of indicator-based features can be more reasonable and disease-focused in comparison to traditional “image-to-text” setups.

6 Conclusion

In this paper, we proposed a novel method called IIHT for medical report generation by integrating disease indicator information into the report generation process. The IIHT framework consists of the classifier module, indicator expansion module and generator module. The “data-text-data” strategy implemented in the indicator expansion module leverages the textual information in the form of concise phrases extracted from the disease indicators and states. The accompanying data conversion step enhances the indicator information, effectively resolving the data imbalance problem prevalent in medical data. Furthermore, this conversion


also facilitates the correspondence between the length and correlation of medical data texts with disease indicator information. Our method makes it feasible for radiologists to modify the disease indicators in real-world scenarios and integrate the operations into the indicator expansion module, which ultimately contributes to the standardisation of report fluency and accuracy. Extensive experiments and comparisons with state-of-the-art methods demonstrated the great performance of the proposed method. One potential limitation of our experiments is related to the accessibility and accuracy of the disease indicator information. The presence and precision of such disease indicator information can affect the outcomes of our study. Interesting future work could involve investigating and enhancing our method from a multi-modal perspective by incorporating additional patient information such as age, gender and height for medical report generation.

References 1. Anderson, P., et al.: Bottom-up and top-down attention for image captioning and visual question answering. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 6077–6086 (2018) 2. Chen, Z., Shen, Y., Song, Y., Wan, X.: Cross-modal memory networks for radiology report generation. In: Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers), pp. 5904–5914. Association for Computational Linguistics, August 2021. https://doi.org/10.18653/v1/ 2021.acl-long.459. https://aclanthology.org/2021.acl-long.459 3. Chen, Z., Song, Y., Chang, T.H., Wan, X.: Generating radiology reports via memory-driven transformer. In: Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, November 2020 4. Demner-Fushman, D., et al.: Preparing a collection of radiology examinations for distribution and retrieval. J. Am. Med. Inform. Assoc. 23(2), 304–310 (2016) 5. Deng, J., Dong, W., Socher, R., Li, L.J., Li, K., Fei-Fei, L.: ImageNet: a large-scale hierarchical image database. In: 2009 IEEE Conference on Computer Vision and Pattern Recognition, pp. 248–255. IEEE (2009) 6. Denkowski, M., Lavie, A.: Meteor universal: language specific translation evaluation for any target language. In: Proceedings of the 9th Workshop on Statistical Machine Translation, pp. 376–380 (2014) 7. Dosovitskiy, A., et al.: An image is worth 16x16 words: transformers for image recognition at scale. arXiv preprint arXiv:2010.11929 (2020) 8. He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 770–778 (2016) 9. Huang, L., Wang, W., Chen, J., Wei, X.Y.: Attention on attention for image captioning. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 4634–4643 (2019) 10. Jing, B., Wang, Z., Xing, E.: Show, describe and conclude: on exploiting the structure information of chest X-ray reports. In: Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, pp. 6570–6580. Association for Computational Linguistics, Florence, Italy, July 2019. https://doi.org/10.18653/ v1/P19-1657. https://aclanthology.org/P19-1657


11. Jing, B., Xie, P., Xing, E.: On the automatic generation of medical imaging reports. In: Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pp. 2577–2586. Association for Computational Linguistics, Melbourne, Australia, July 2018. https://doi.org/10.18653/v1/ P18-1240. https://aclanthology.org/P18-1240 12. Kudo, T., Richardson, J.: SentencePiece: a simple and language independent subword tokenizer and detokenizer for neural text processing. In: Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing: System Demonstrations, pp. 66–71. Association for Computational Linguistics, Brussels, Belgium, November 2018. https://doi.org/10.18653/v1/D18-2012. https:// aclanthology.org/D18-2012 13. Li, G., Zhu, L., Liu, P., Yang, Y.: Entangled transformer for image captioning. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 8928–8937 (2019) 14. Li, Y., Liang, X., Hu, Z., Xing, E.P.: Hybrid retrieval-generation reinforced agent for medical image report generation. In: Advances in Neural Information Processing Systems, vol. 31 (2018) 15. Liang, X., Hu, Z., Zhang, H., Gan, C., Xing, E.P.: Recurrent topic-transition GAN for visual paragraph generation. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 3362–3371 (2017) 16. Lin, C.Y.: ROUGE: a package for automatic evaluation of summaries. In: Text Summarization Branches Out, pp. 74–81 (2004) 17. Liu, F., Ge, S., Wu, X.: Competence-based multimodal curriculum learning for medical report generation. In: Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers), pp. 3001–3012. Association for Computational Linguistics, August 2021. https://doi.org/10.18653/v1/ 2021.acl-long.234. https://aclanthology.org/2021.acl-long.234 18. Liu, F., Ren, X., Liu, Y., Wang, H., Sun, X.: simNet: stepwise image-topic merging network for generating detailed and comprehensive image captions. In: Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, pp. 137–149. Association for Computational Linguistics, Brussels, Belgium, October–November 2018. https://doi.org/10.18653/v1/D18-1013. https:// aclanthology.org/D18-1013 19. Loshchilov, I., Hutter, F.: Decoupled weight decay regularization. arXiv preprint arXiv:1711.05101 (2017) 20. Najdenkoska, I., Zhen, X., Worring, M., Shao, L.: Variational topic inference for chest X-ray report generation. In: de Bruijne, M., et al. (eds.) MICCAI 2021. LNCS, vol. 12903, pp. 625–635. Springer, Cham (2021). https://doi.org/10.1007/ 978-3-030-87199-4_59 21. Nguyen, H.T., et al.: Eddie-transformer: enriched disease embedding transformer for X-ray report generation. In: 2022 IEEE 19th International Symposium on Biomedical Imaging (ISBI), pp. 1–5. IEEE (2022) 22. European Society of Radiology (ESR) [email protected]: Medical imaging in personalised medicine: a white paper of the research committee of the European society of radiology (ESR). Insights Imag. 6, 141–155 (2015) 23. Papineni, K., Roukos, S., Ward, T., Zhu, W.J.: Bleu: a method for automatic evaluation of machine translation. In: Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics, pp. 311–318 (2002)


24. Shin, H.C., Roberts, K., Lu, L., Demner-Fushman, D., Yao, J., Summers, R.M.: Learning to read chest x-rays: recurrent neural cascade model for automated image annotation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2497–2506 (2016) 25. Vaswani, A., et al.: Attention is all you need. In: Advances in Neural Information Processing Systems, vol. 30 (2017) 26. Vinyals, O., Toshev, A., Bengio, S., Erhan, D.: Show and tell: a neural image caption generator. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 3156–3164 (2015) 27. Wang, S., Tang, L., Lin, M., Shih, G., Ding, Y., Peng, Y.: Prior knowledge enhances radiology report generation. In: AMIA Annual Symposium Proceedings, vol. 2022, p. 486. American Medical Informatics Association (2022) 28. Wang, X., Peng, Y., Lu, L., Lu, Z., Summers, R.M.: TieNet: text-image embedding network for common thorax disease classification and reporting in chest X-rays. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 9049–9058 (2018) 29. Wu, X., et al.: DeltaNet: conditional medical report generation for COVID-19 diagnosis. In: Proceedings of the 29th International Conference on Computational Linguistics, pp. 2952–2961. International Committee on Computational Linguistics, Gyeongju, Republic of Korea, October 2022. https://aclanthology.org/2022. coling-1.261 30. Yin, C., et al.: Automatic generation of medical imaging diagnostic report with hierarchical recurrent neural network. In: 2019 IEEE International Conference on Data Mining (ICDM), pp. 728–737. IEEE (2019) 31. You, Q., Jin, H., Wang, Z., Fang, C., Luo, J.: Image captioning with semantic attention. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 4651–4659 (2016) 32. Zhou, L., Zhou, Y., Corso, J.J., Socher, R., Xiong, C.: End-to-end dense video captioning with masked transformer. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 8739–8748 (2018)

OD-Enhanced Dynamic Spatial-Temporal Graph Convolutional Network for Metro Passenger Flow Prediction

Lei Ren, Jie Chen, Tong Liu(B), and Hang Yu

School of Computer Engineering and Science, Shanghai University, Shanghai, China
{renlei,chenjie_cs,tong_liu,yuhang}@shu.edu.cn

Abstract. Metro passenger flow prediction is crucial for efficient urban transportation planning and resource allocation. However, it faces two challenges. The first challenge is extracting the diverse passenger flow patterns at different stations, e.g., stations near residential areas and stations near commercial areas, while the second one is to model the complex dynamic spatial-temporal correlations caused by Origin-Destination (OD) flows. Existing studies often overlook the above two aspects, especially the impact of OD flows. To this end, we propose an OD-enhanced dynamic spatial-temporal graph convolutional network (DSTGCN) for metro passenger flow prediction. First, we propose a static spatial module to extract the flow patterns of different stations. Second, we utilize a dynamic spatial module to capture the dynamic spatial correlations between stations with OD matrices. Finally, we employ a multi-resolution temporal dependency module to learn the delayed temporal features. We also conduct experiments based on two real-world datasets in Shanghai and Hangzhou. The results show the superiority of our model compared to the state-of-the-art baselines.

Keywords: Metro system · Passenger flow prediction · Spatial-temporal graph convolutional networks · Origin-Destination matrix

1 Introduction

Urban Rail Transit (URT) has become one of the primary modes of public transportation in numerous cities due to its significant capacity and high speed. However, the substantial increase in passenger flows has resulted in severe overcrowding within metro systems, posing safety hazards and exacerbating the challenges in managing these systems. For instance, the surge of people during the 2015 New Year's Eve celebrations in Shanghai precipitated a chaotic stampede that tragically claimed the lives of 36 individuals [1]. Therefore, accurate forecasting of metro passenger flows is crucial for effective metro planning and proactive allocation of metro staff [2]. However, the task of passenger flow prediction remains challenging due to the intricate and robust spatial-temporal correlations of passenger flows, which can be summarized into the following three points:


Fig. 1. Diverse flow patterns at different stations.

Fig. 2. Dynamic spatial correlations and delayed temporal dependencies.

1) Diverse flow patterns at different stations: Previous studies [3–7] have commonly employed the use of metro network topology to capture the relationships between passenger flow patterns across stations. However, they overlook the heterogeneity in flow patterns among different stations, as shown in Fig. 1(b). Therefore, it is crucial to explore methods for extracting the differences in flow patterns at different stations. 2) Dynamic spatial correlations between stations: Previous researches [3,6,8] have treated the spatial relationships between stations as static and relied on physical distance to characterize spatial dependencies. However, they overlooked the dynamic spatial dependencies between stations caused by OD passenger flows, as illustrated in Fig. 2, where the movement of passengers continually evolves in real-time. Therefore, the challenge lies in modeling these dynamic spatial correlations to capture the metro passenger flows. 3) Delayed temporal dependency among stations: The delayed temporal dependencies are also caused by the OD passenger flows. Although prior researches [3,6,9] have made some progress in capturing temporal dependencies, there remains a dearth of relevant research on incorporating lagged


time information. Consequently, devising a network architecture capable of effectively capturing delayed temporal dependencies is a challenge.

To tackle the aforementioned challenges, we introduce a novel approach called OD-enhanced Dynamic Spatial-Temporal Graph Convolutional Network (DSTGCN), which can capture both static and dynamic spatial correlations between stations, as well as the delayed temporal dependencies. The main contributions of this paper are summarized as follows:
1) A static spatial correlation module for capturing unique flow patterns at different stations. By constructing two types of flow pattern graphs, we enable the graph convolutional layers to extract station-specific flow patterns.
2) A dynamic spatial correlation module for extracting OD information. We leverage the graph diffusion operation to effectively capture the dynamic spatial correlations between stations. This allows us to capture the transfer relationships between stations as passengers move within the metro system.
3) A multi-resolution temporal module for mining delayed temporal dependency. We stack multiple temporal convolutional layers to access information from different time intervals. This enables us to capture the delayed temporal dependencies.
4) We conduct extensive experiments on two large-scale real-world datasets, collected in Shanghai and Hangzhou, and the results show that our model has better predictive performance than the baselines.

2 Related Work

Passenger flow prediction is not only an important task in the field of intelligent transportation, but also supports many urban services. Initially, most models for short-term passenger flow forecasting were based on statistical theory or machine learning, such as ARIMA [10] and support vector regression [11]. These methods typically treat passenger flow forecasting as a pure time series prediction problem, which overlooks the comprehensive spatial correlations between stations. Inspired by the ability of graph neural networks (GNNs) to model correlations in non-Euclidean data, GNNs have been adopted for passenger flow prediction [3–5,12,13]; they not only consider the temporal dependencies of passenger flows but also capture the spatial relationships between stations. [3,8] both design a spatial-temporal network combining an RNN-based model and a GNN-based model to extract the periodic features of passenger flows and the topological relationships between stations. Compared to models that only consider temporal dependencies, their prediction performance is further improved. Instead of using the metro network topology to capture the spatial correlations, [4,5] introduce virtual graph structures to represent spatial relationships between stations. Several recent works [14–16] specifically pay attention to extracting valuable information from origin-destination flows to achieve more precise predictions.


For example, Wang et al. [14] introduce an algorithm to generate a representation of OD pairs which can discover passenger travel patterns. However, because trips take time to complete, these methods do not fully exploit the OD correlations between stations. Furthermore, He et al. [15] design a model to learn the implicit OD relationships, which uses graph diffusion convolution [17] to capture pair-wise geographical and semantic correlations. Despite the collective efforts to incorporate OD information in the aforementioned methods to enhance prediction performance, they fall short in capturing the latent spatial-temporal dependencies inherent in transfer relationships.

3 Problem Formulation

In this section, we first introduce some definitions for metro passenger flow prediction, and then formulate the problem.

Definition 1. Passenger Flows. The passenger flow volume of station $v_i$ during time interval $t$ is denoted as $x_t^i \in \mathbb{R}^2$, where the two dimensions are the inbound flow and the outbound flow, respectively. The collection of passenger flow volumes for all $N$ stations during time interval $t$ is represented as $X_t = (x_t^1, x_t^2, ..., x_t^N) \in \mathbb{R}^{N \times 2}$. $X = (X_1, X_2, ..., X_t, ...)$ denotes the historical passenger flow data across all time intervals.

Definition 2. Exit Origin-Destination Flows. The exit origin-destination flows at time interval $t$ refer to the number of passengers who entered a station in the past few time intervals and exit a station at time interval $t$. Thus, the exit OD flow at time interval $t$ between station $v_i$ and $v_j$ can be denoted as $o_t^{i,j}$, representing the passengers who entered station $v_i$ and exited station $v_j$ at time interval $t$.

Definition 3. Point of Interest. A Point of Interest (POI) refers to a specific location on a map that is deemed interesting or noteworthy to individuals, like a shopping centre. We count the number of different types of POIs within a radius of 1.5 km from each station, and then we construct a POI matrix $P \in \mathbb{R}^{N \times c}$ for all the stations, where $c$ is the number of categories of POIs.

Problem 1. Based on the above definitions, the problem of passenger flow prediction is formulated as follows: given the passenger flows of the historical $T$ time intervals up to interval $\tau$, $X_{\tau-T:\tau} = (X_{\tau-T+1}, ..., X_{\tau})$, the problem aims to learn a mapping function $f$ from the $T$ historical passenger flow observations to the next passenger flow volume for all stations:

$[X_{\tau-T+1}, ..., X_{\tau}] \xrightarrow{f} [X_{\tau+1}]$   (1)
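The following is a minimal sketch, not taken from the paper, of how the sliding-window samples implied by Problem 1 could be built. The array name `flows` and the toy sizes are assumptions; only the shapes (num_intervals, N, 2) follow the definitions above.

```python
import numpy as np

def make_samples(flows: np.ndarray, T: int):
    """Slice the flow tensor into windows X_{tau-T+1..tau} and next-step targets X_{tau+1}."""
    inputs, targets = [], []
    for tau in range(T - 1, flows.shape[0] - 1):
        inputs.append(flows[tau - T + 1 : tau + 1])  # shape (T, N, 2)
        targets.append(flows[tau + 1])               # shape (N, 2)
    return np.stack(inputs), np.stack(targets)

# Toy example: 100 intervals, 5 stations, inbound/outbound channels.
X, Y = make_samples(np.random.rand(100, 5, 2), T=4)
print(X.shape, Y.shape)  # (96, 4, 5, 2) (96, 5, 2)
```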


4 Methodology

4.1 Overview

Figure 3 illustrates our model, which consists of three components followed by a fusion layer that produces the prediction results. Each component is composed of two spatial-temporal convolutional blocks (ST-Conv blocks). Each ST-Conv block comprises a static spatial correlation module, a dynamic spatial correlation module, and a multi-resolution temporal dependency module. Here, we construct three different passenger flow sequences as inputs to the three components, namely the recent trend sequence $X^r$, the daily periodicity sequence $X^d$, and the weekly periodicity sequence $X^w$. The ST-Conv blocks receive the inputs and explore spatial-temporal dependencies. Multiple periodic features are integrated by a fusion layer to generate the final prediction $\hat{X}_{\tau+1}$. A small sketch of how the three periodic input sequences can be constructed is given below; the specific details of each module are discussed in the following subsections.
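A hedged illustration of building the recent/daily/weekly inputs. The helper name `periodic_inputs`, the `ipd` (intervals per day) parameter, and the toy data are our own assumptions; the window lengths 3/3/1 follow the settings reported in Sect. 5.2.

```python
import numpy as np

def periodic_inputs(flows, tau, ipd, len_r=3, len_d=3, len_w=1):
    """Return X^r, X^d, X^w for target interval tau from a (time, N, 2) flow tensor."""
    recent = flows[tau - len_r + 1 : tau + 1]                               # last few intervals
    daily = np.stack([flows[tau - d * ipd] for d in range(1, len_d + 1)])   # same slot, previous days
    weekly = np.stack([flows[tau - w * 7 * ipd] for w in range(1, len_w + 1)])  # same slot, previous weeks
    return recent, daily, weekly

flows = np.random.rand(7 * 24 * 10, 5, 2)          # toy data: 10 weeks of hourly flows, 5 stations
Xr, Xd, Xw = periodic_inputs(flows, tau=7 * 24 * 9, ipd=24)
print(Xr.shape, Xd.shape, Xw.shape)                # (3, 5, 2) (3, 5, 2) (1, 5, 2)
```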

Fig. 3. Model architecture of the proposed DSTGCN. The proposed model consists of three components, which share the same network structure and are followed by a fusion layer to produce the prediction results. Each of them consists of two spatial-temporal convolutional blocks (ST-Conv blocks).

4.2 Static Spatial Correlation Module

Static Flow Pattern Graph Generation: In this section, we describe the process of constructing the static flow pattern graphs. At its core, a graph comprises nodes, edges, and associated weights. In our approach, we construct two static graphs: the trend graph $G_p = (V, E_p, A_p)$ and the functionality graph $G_f = (V, E_f, A_f)$. Here, $V = \{v_1, v_2, ..., v_N\}$ is the set of $N$ stations, and $E_p$ and $E_f$ are the edge sets of the two graphs. For a specific graph $G_\alpha$ ($\alpha \in \{p, f\}$), $A_\alpha \in \mathbb{R}^{N \times N}$ denotes the weights of all edges. The construction of the two graphs is introduced in the following.

The trend graph $G_p$: The trend graph can be denoted as $G_p = (V, E_p, A_p)$. As shown in Eq. (2), $A_p(i, j)$ represents the similarity of the passenger flow trends of station $v_i$ and station $v_j$. Note that in this study we use DTW (Dynamic Time Warping) [18] to measure the proximity of two stations:

$A_p(i, j) = \begin{cases} \exp(-dtw(X^i, X^j)), & dtw(X^i, X^j) \le \epsilon \\ 0, & dtw(X^i, X^j) > \epsilon \end{cases}$   (2)

where $X^i$ and $X^j$ represent the historical passenger flows at station $v_i$ and station $v_j$, respectively, and $\epsilon$ is a hyper-parameter used to threshold the adjacency matrix, setting any entry whose distance exceeds $\epsilon$ to 0.

The functionality graph $G_f$: The functionality graph can be denoted as $G_f = (V, E_f, A_f)$, with edge weights given by Eq. (3):

$A_f(i, j) = \cos(q_i, q_j)$   (3)

where $q_i$ denotes the POI representation of station $v_i$. It is generated by the TF-IDF algorithm [19], which treats each POI as a word and each station as a document. The representation of the $k$-th category for station $v_i$ is computed as:

$q_i[k] = \frac{P_{i,k}}{\sum_k P_{i,\cdot}} \cdot \log \frac{N}{\sum_i P_{\cdot,k}}$   (4)
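A sketch, under our own assumptions, of how the two static adjacency matrices could be computed: $A_p$ from DTW distances between station flow series (Eq. 2) and $A_f$ from cosine similarity of TF-IDF-weighted POI vectors (Eqs. 3-4). The threshold `eps`, the helper names, and the toy inputs are illustrative only.

```python
import numpy as np

def dtw(a, b):
    """Plain O(T^2) dynamic time warping distance between two 1-D series."""
    D = np.full((len(a) + 1, len(b) + 1), np.inf)
    D[0, 0] = 0.0
    for i in range(1, len(a) + 1):
        for j in range(1, len(b) + 1):
            cost = abs(a[i - 1] - b[j - 1])
            D[i, j] = cost + min(D[i - 1, j], D[i, j - 1], D[i - 1, j - 1])
    return D[-1, -1]

def trend_graph(series, eps):
    """A_p(i, j) = exp(-dtw(X^i, X^j)) if the distance is at most eps, else 0."""
    n = len(series)
    A = np.zeros((n, n))
    for i in range(n):
        for j in range(n):
            d = dtw(series[i], series[j])
            A[i, j] = np.exp(-d) if d <= eps else 0.0
    return A

def functionality_graph(poi_counts):
    """A_f(i, j) = cosine similarity of TF-IDF style POI representations q_i."""
    tf = poi_counts / (poi_counts.sum(axis=1, keepdims=True) + 1e-9)
    idf = np.log(poi_counts.shape[0] / (poi_counts.sum(axis=0) + 1e-9))
    q = tf * idf
    q = q / (np.linalg.norm(q, axis=1, keepdims=True) + 1e-9)
    return q @ q.T

A_p = trend_graph(np.random.rand(4, 20), eps=5.0)                  # 4 stations, 20 intervals
A_f = functionality_graph(np.random.randint(0, 9, (4, 6)).astype(float))
```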

Graph Convolution for Static Spatial Correlations: The aforementioned graphs elucidate the inherent and unchanging relationships between the flow patterns of different stations. Here, we employ ChebNet [20] to effectively capture these static spatial correlations. Specifically, we first apply Laplacian regularization to the above two graphs, computed as $L_\alpha = I_n - D^{-\frac{1}{2}} A_\alpha D^{-\frac{1}{2}}$, where $D \in \mathbb{R}^{N \times N}$ is the diagonal degree matrix with $D_{i,i} = \sum_j A_\alpha(i, j)$ and $I_n$ is the identity matrix of size $N \times N$. Then, we combine the different graphs by Eq. (5), where $\odot$ is the element-wise product and $W_\alpha$ ($\alpha \in \{p, f\}$) are learnable parameters:

$\hat{L} = \sum_{\alpha \in \{p, f\}} W_\alpha \odot L_\alpha$   (5)

Utilizing the fusion result $\hat{L}$, we proceed with the graph convolution operation, employing first-order Chebyshev polynomials to approximate the convolution kernel. For a specific historical passenger flow sequence $X^s$ ($s \in \{r, d, w\}$), the graph convolution for each time interval $t$ can be formulated as:

$O_t^s = \mathrm{ReLU}\left(\Theta(\tilde{D}^{-\frac{1}{2}} \tilde{L} \tilde{D}^{-\frac{1}{2}}) X_t^s\right)$   (6)

where $O_t^s \in \mathbb{R}^{N \times c}$ is the static spatial feature representation of the $N$ stations at time interval $t$, and $\tilde{L}$ and $\tilde{D}$ are re-normalized by $\tilde{L} = \hat{L} + I_n$ and $\tilde{D}_{i,i} = \sum_j \tilde{L}_{i,j}$.

4.3 Dynamic Spatial Correlations Module

Dynamic Transit Graph Generation: Within the aforementioned graphs, the connection relationships among stations remain immutable and static. However, such static relationships fall short in capturing the dynamic dependencies that arise from the ever-evolving movement of passengers between stations over time. Hence, there is a need to learn such dynamic spatial relationships. To address this, we convert the exit OD passenger flows at each time interval into a dynamic transition graph, effectively representing the evolving relationships between stations.

The transition volume graph $G_t^o$: The volume of exit OD flow provides insights into the attraction between pairs of metro stations. Generally, stations with higher OD flow between them indicate a higher level of mutual attractiveness. The graph for each time interval $t$ can be represented as:

$G_t^o = (V, E_t^o, A_t^o)$   (7)

$A_t^o(i, j) = o_t^{i,j}$   (8)

Graph Convolution for Dynamic Spatial Correlations: The dynamic nature of passenger transfer flow gives rise to ever-shifting correlations between stations over time. As a result, conventional spectral-based models prove inadequate for analyzing OD graphs, primarily due to the strict symmetry requirement imposed by Laplacian matrix factorization. Inspired by [17], we utilize truncated, finite-step diffusion operations as graph convolutions and employ graph diffusion convolution to capture the dynamic spatial correlations between stations:

$Z_t^s = \sum_{k=0}^{K-1} (D_I^t)^k X_t^s \Theta_{k,1} + (D_O^t)^k X_t^s \Theta_{k,2}$   (9)

where $Z_t^s \in \mathbb{R}^{N \times c}$ is the dynamic spatial feature, $(\Theta_{\cdot,1}, \Theta_{\cdot,2}) \in \mathbb{R}^{K \times 2}$ are the parameters of the convolution filter, $K$ denotes the finite truncation of the diffusion process, and $D_I^t$ and $D_O^t$ represent the in-degree matrix and out-degree matrix of $A_t^o$, respectively.

Graph Fusion: In order to effectively utilize the captured static and dynamic spatial correlations, we employ a spatial fusion operation to combine them, as shown in Eq. (10):

$H_t^s = O_t^s + \sigma(Z_t^s)$   (10)

where $H_t^s \in \mathbb{R}^{N \times c}$ represents the spatial features with $c$ dimensions at time interval $t$.
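A minimal PyTorch sketch of the dynamic branch: a K-step diffusion over in-/out-going transition matrices derived from the OD graph (in the spirit of Eq. 9), fused with the static output through a sigmoid gate (Eq. 10). The normalisation used for the transition matrices, the module name, and all sizes are assumptions rather than the authors' exact implementation.

```python
import torch
import torch.nn as nn

class DiffusionConv(nn.Module):
    def __init__(self, in_dim, out_dim, K=2):
        super().__init__()
        self.K = K
        self.theta_in = nn.ModuleList([nn.Linear(in_dim, out_dim, bias=False) for _ in range(K)])
        self.theta_out = nn.ModuleList([nn.Linear(in_dim, out_dim, bias=False) for _ in range(K)])

    def forward(self, x, A_od):
        # x: (N, in_dim) node features; A_od: (N, N) exit OD volumes at interval t.
        P_out = A_od / (A_od.sum(dim=1, keepdim=True) + 1e-9)           # row-normalised (out-going)
        P_in = A_od.t() / (A_od.sum(dim=0, keepdim=True).t() + 1e-9)    # column-normalised (in-coming)
        z, xi, xo = 0.0, x, x
        for k in range(self.K):
            z = z + self.theta_in[k](xi) + self.theta_out[k](xo)        # k-th diffusion step
            xi, xo = P_in @ xi, P_out @ xo
        return z

N, c = 6, 16
static_out = torch.relu(torch.randn(N, c))          # stand-in for O_t^s from the static module
dyn = DiffusionConv(c, c)(torch.randn(N, c), torch.rand(N, N))
H = static_out + torch.sigmoid(dyn)                 # Eq. (10)-style fusion
```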

4.4 Multi-resolution Temporal Dependency Module

In this work, we adopt the temporal convolutional network (TCN) [21] to capture the temporal dependency of passenger flows. Furthermore, we capture delayed temporal dependencies by stacking causal convolutions with different dilation rates in the TCN architecture, as illustrated in Fig. 4.

Fig. 4. Dilated causal convolution with kernel size 3. With a dilation factor k, it picks passenger flows every k time intervals.

Dilated causal convolution introduces dilation factors, which allow the convolutional kernel to access the input sequence in a skipping manner along the temporal dimension, thus expanding the receptive field. Compared to RNNs, dilated causal convolution can capture information from longer temporal distances and can be computed in parallel. Mathematically, for the sequence $H^s = (H_{\tau-T+1}^s, ..., H_{\tau}^s)$ produced by the spatial correlation modules, the dilated causal convolution operation is as follows:

$\hat{H}^s = W *_d H^s$   (11)

where $\hat{H}^s \in \mathbb{R}^{N \times T \times c}$ represents the updated spatial-temporal representation of the $N$ stations over $T$ time steps, $c$ is the number of output channels, $W \in \mathbb{R}^{T \times c \times 3 \times 1}$ is the convolutional kernel with a kernel size of $3 \times 1$, and $*_d$ denotes the dilated causal convolution with a dilation rate of $d$.

Furthermore, drawing inspiration from [22], we stack multiple layers of dilated causal convolution to expand the receptive field and capture long-range temporal dependencies. This stacking approach enables the extraction of multi-resolution temporal information, leveraging the dilation operation to traverse extended time intervals and capture delayed temporal dependencies. Consequently, we can effectively model temporal relationships with varying delays.


Specifically, we concatenate the outputs of each TCN layer, normalize them along the time dimension, and then use a fully connected layer to obtain the prediction for the next time step. The mathematical formulation is as follows:

$\hat{X}_{\tau+1}^s = FC\left(\mathrm{norm}\left((H^s)^1, (H^s)^2, ..., (H^s)^L\right)\right)$   (12)

where $\hat{X}_{\tau+1}^s \in \mathbb{R}^{N \times 2}$ represents the prediction of passenger flows with a periodicity of $s$, and $(H^s)^l$ is the output feature of the $l$-th layer.
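A sketch of a multi-resolution temporal stack in the spirit of Eqs. (11)-(12): dilated causal 1-D convolutions whose per-layer outputs are concatenated and projected to the next-step prediction. Kernel size, dilation rates, channel sizes, and the simplification of skipping the time-dimension normalization are illustrative assumptions.

```python
import torch
import torch.nn as nn

class DilatedCausalStack(nn.Module):
    def __init__(self, channels, dilations=(1, 2, 4), kernel_size=3):
        super().__init__()
        self.pads = [(kernel_size - 1) * d for d in dilations]
        self.convs = nn.ModuleList(
            [nn.Conv1d(channels, channels, kernel_size, dilation=d) for d in dilations]
        )
        self.fc = nn.Linear(channels * len(dilations), 2)   # inbound / outbound prediction

    def forward(self, h):
        # h: (batch * N, channels, T) spatial features laid out along the time axis.
        outs = []
        for conv, pad in zip(self.convs, self.pads):
            h = conv(nn.functional.pad(h, (pad, 0)))        # left padding keeps the convolution causal
            outs.append(h[..., -1])                         # last time step of each layer
        return self.fc(torch.cat(outs, dim=-1))             # (batch * N, 2)

pred = DilatedCausalStack(32)(torch.randn(10 * 5, 32, 8))   # 10 samples x 5 stations, T = 8
print(pred.shape)                                           # torch.Size([50, 2])
```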

4.5 Multi-component Fusion

In this section, we introduce how the outputs of the three periodic components are integrated. For example, some stations have a large morning peak, so the outputs of all components are crucial; other stations have no pronounced peak, so the recent component alone may be the most helpful. Therefore, the three components are fused as:

$\hat{X}_{\tau+1} = \sum_{s \in \{r, d, w\}} W_s \odot \hat{X}_{\tau+1}^s$   (13)

where $\odot$ is the Hadamard product and $W^r$, $W^d$, and $W^w$ are learnable parameters of size $N \times N$.

During model training, our goal is to minimize the divergence between the observed passenger flows and their corresponding predicted values. To achieve this, we employ the Huber loss function [23] to iteratively optimize the parameters of our proposed model. The Huber loss is defined as follows:

$L\left(\hat{X}_{\tau+1}, X_{\tau+1}; \Theta\right) = \begin{cases} \frac{1}{2}\left(\hat{X}_{\tau+1} - X_{\tau+1}\right)^2, & \left|\hat{X}_{\tau+1} - X_{\tau+1}\right| \le \delta \\ \delta\left|\hat{X}_{\tau+1} - X_{\tau+1}\right| - \frac{1}{2}\delta^2, & \text{otherwise} \end{cases}$   (14)
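A hedged sketch of the periodic fusion and the Huber objective. For simplicity the per-station weights here are vectors broadcast over the two flow channels, whereas Eq. (13) states N x N matrices; the class name and all sizes are assumptions.

```python
import torch
import torch.nn as nn

class PeriodicFusion(nn.Module):
    def __init__(self, num_stations, delta=1.0):
        super().__init__()
        # One learnable weight per station and branch (simplified from the paper's N x N weights).
        self.weights = nn.ParameterDict(
            {s: nn.Parameter(torch.rand(num_stations, 1)) for s in ("r", "d", "w")}
        )
        self.delta = delta

    def forward(self, preds):
        # preds: dict of (N, 2) predictions from the recent/daily/weekly branches.
        return sum(self.weights[s] * preds[s] for s in ("r", "d", "w"))   # element-wise fusion

    def huber(self, pred, target):
        err = (pred - target).abs()
        quad = 0.5 * err ** 2                                  # small-error branch of Eq. (14)
        lin = self.delta * err - 0.5 * self.delta ** 2         # large-error branch
        return torch.where(err <= self.delta, quad, lin).mean()

N = 5
fusion = PeriodicFusion(N)
out = fusion({s: torch.rand(N, 2) for s in ("r", "d", "w")})
loss = fusion.huber(out, torch.rand(N, 2))
```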

5 Experiments

In this section, we verify our model on two real-world urban rail transit datasets. We first introduce the datasets, data processing, baseline methods, and experiment settings, and then present the results of our experiments.

5.1 Datasets

In this study, we evaluate the performance of our proposed DSTGCN model on two urban transit datasets, namely SHMetro and HZMetro. The SHMetro dataset was collected from the Automatic Fare Collection (AFC) system in Shanghai, China, and contains 2.1 billion AFC records gathered from April 1st to April 30th, 2015. The HZMetro dataset was acquired from the metro AFC system in Hangzhou, China, and comprises 70 million AFC records spanning the period from January 1st to January 25th, 2019.

5.2 Experimental Setups

In our experiments, we implement our model, DSTGCN, as well as the deep learning baselines, using the PyTorch framework. The experiments are conducted on a 4-core Intel Core i5-9300 CPU with 32 GB RAM and an NVIDIA RTX-2060 GPU. To comprehensively evaluate the performance of DSTGCN and the baseline methods, we employ two widely accepted metrics: Mean Absolute Error (MAE) and Root Mean Square Error (RMSE). The datasets are partitioned into training, validation, and testing sets with a ratio of 8:1:1. We normalize the datasets in the same way and use the Adam [24] optimizer. We set the lengths of the input sequences for the three periodicities, namely recent, daily, and weekly, to 3, 3, and 1, respectively. During training, we initialize the learning rate to 0.005 and reduce it by 20% every 30 epochs. In addition, we repeat each experiment 5 times and report the mean errors and standard deviations.
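For reference, a small sketch of the two evaluation metrics used above (MAE and RMSE), computed separately for the inbound and outbound channels; the function name and toy arrays are illustrative only.

```python
import numpy as np

def mae_rmse(pred: np.ndarray, true: np.ndarray):
    """pred/true: arrays of shape (samples, N, 2); returns per-direction MAE and RMSE."""
    err = pred - true
    mae = np.abs(err).mean(axis=(0, 1))             # (2,) -> [inbound, outbound]
    rmse = np.sqrt((err ** 2).mean(axis=(0, 1)))
    return mae, rmse

mae, rmse = mae_rmse(np.random.rand(20, 5, 2), np.random.rand(20, 5, 2))
print("MAE (in/out):", mae, "RMSE (in/out):", rmse)
```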

5.3 Experimental Results

Comparison with Baselines: In this study, we conduct a comprehensive comparative analysis between our proposed model and the following 6 baseline methods: Historical Average (HA), Auto-regressive Integrated Moving Average (ARIMA) [10], Diffusion Convolution Recurrent Neural Network (DCRNN) [17], Spatial-Temporal Graph Convolutional Network (STGCN) [25], Graph WaveNet (GW) [21], and Spatial-Temporal Dynamic Network (STDN) [26]. Table 1 and Table 2 show the performance of our method and the 6 baseline methods on the two datasets. We can observe that the traditional time series methods, HA and ARIMA, yield relatively high RMSE and MAE values on both the SHMetro and HZMetro datasets. This is mainly due to their inability to handle unstable and nonlinear data effectively.

Table 1. Performance Comparison - SHMetro.

Methods     | RMSE In       | RMSE Out      | MAE In        | MAE Out
HA          | 107.40        | 126.87        | 48.95         | 48.69
ARIMA [10]  | 169.28        | 183.59        | 120.33        | 117.27
STGCN [25]  | 49.43 ± 0.68  | 51.36 ± 0.83  | 40.50 ± 0.93  | 41.03 ± 0.26
DCRNN [17]  | 48.33 ± 0.71  | 50.09 ± 0.76  | 39.67 ± 1.38  | 41.05 ± 0.51
GW [21]     | 45.06 ± 0.38  | 46.54 ± 0.74  | 38.15 ± 0.41  | 39.08 ± 0.80
STDN [26]   | 42.87 ± 0.51  | 44.41 ± 0.68  | 35.89 ± 0.46  | 37.49 ± 0.77
DSTGCN      | 41.98 ± 0.42  | 43.95 ± 0.49  | 32.21 ± 0.91  | 34.22 ± 0.59


In contrast, the deep learning-based methods, including STGCN, DCRNN, GW, STDN, and our model, demonstrate improved performance. Specifically, our model achieves a significant improvement compared to the baseline methods: the MAE for inbound passenger flow decreases by 10%, while the RMSE for outbound passenger flow decreases by 8% on SHMetro. This is mainly due to the effective extraction of dynamic spatial-temporal correlations from the OD information by our model.

Table 2. Performance Comparison - HZMetro.

Methods     | RMSE In       | RMSE Out      | MAE In        | MAE Out
HA          | 93.62         | 95.26         | 45.55         | 46.77
ARIMA [10]  | 143.63        | 145.55        | 125.75        | 138.61
STGCN [25]  | 42.79 ± 0.72  | 44.11 ± 0.38  | 34.02 ± 0.77  | 34.22 ± 0.59
DCRNN [17]  | 41.27 ± 0.67  | 43.56 ± 0.88  | 35.06 ± 0.61  | 34.70 ± 0.80
GW [21]     | 39.50 ± 1.30  | 40.17 ± 0.92  | 32.16 ± 0.49  | 33.36 ± 0.79
STDN [26]   | 38.83 ± 0.94  | 39.08 ± 0.98  | 30.05 ± 0.61  | 31.21 ± 0.73
DSTGCN      | 37.95 ± 0.50  | 38.65 ± 0.30  | 29.90 ± 0.24  | 30.90 ± 0.63

Evaluation of Modules: To further verify the effectiveness of each proposed module in DSTGCN, we implement several variants for an ablation study: 1) ST-S, which removes the static spatial correlation module; 2) ST-D, which removes the dynamic spatial correlation module; 3) ST-T, which does not stack multiple layers of TCN but directly uses the output of the last TCN layer to produce the predictions. The performance of all variants is summarized in Table 3 and Table 4. When predicting passenger flows for the next time interval, ST-D achieves inbound and outbound RMSE of 43.38 ± 0.32 and 45.32 ± 0.53 and inbound and outbound MAE of 37.64 ± 0.41 and 40.42 ± 0.21 on SHMetro, ranking last among all the variants. Similar trends are observed on HZMetro. ST-S shows improvement compared to ST-D, potentially due to more accurate modeling of station spatial correlations using OD information. In addition, the results in Table 3 and Table 4 show that the performance of ST-T is worse than that of DSTGCN, indicating the necessity of leveraging multi-resolution temporal information. This is because passenger journeys have a certain duration, resulting in delayed temporal dependencies between stations.

Table 3. Effect of static flow pattern graph modeling, dynamic transit graph modeling, and multi-resolution temporal information - SHMetro.

Methods  | RMSE In       | RMSE Out      | MAE In        | MAE Out
ST-S     | 42.50 ± 0.50  | 44.10 ± 0.18  | 34.55 ± 0.19  | 35.85 ± 0.37
ST-D     | 43.38 ± 0.32  | 45.32 ± 0.53  | 37.64 ± 0.41  | 40.42 ± 0.21
ST-T     | 43.23 ± 0.26  | 44.90 ± 0.24  | 35.16 ± 0.46  | 36.21 ± 0.77
DSTGCN   | 41.98 ± 0.42  | 43.95 ± 0.49  | 32.21 ± 0.91  | 34.22 ± 0.59

Table 4. Effect of static flow pattern graph modeling, dynamic transit graph modeling, and multi-resolution temporal information - HZMetro.

Methods  | RMSE In       | RMSE Out      | MAE In        | MAE Out
ST-S     | 38.49 ± 0.34  | 39.56 ± 0.25  | 30.91 ± 0.15  | 31.72 ± 0.32
ST-D     | 40.04 ± 0.30  | 40.17 ± 0.92  | 32.16 ± 0.49  | 33.36 ± 0.79
ST-T     | 39.16 ± 0.39  | 40.10 ± 0.23  | 31.14 ± 0.22  | 32.38 ± 0.29
DSTGCN   | 37.95 ± 0.50  | 38.65 ± 0.30  | 29.90 ± 0.24  | 30.90 ± 0.63

6 Conclusion

We propose DSTGCN, a novel OD-enhanced dynamic spatial-temporal graph convolutional network, for predicting passenger flows in urban metro stations. DSTGCN effectively captures the traffic patterns and dynamic spatial correlations among different stations by incorporating OD information. Furthermore, DSTGCN captures the delayed temporal dependencies arising from the travel time between distinct stations. We conducted extensive experiments on two datasets, and the results show that our proposed model outperforms the six baselines in terms of RMSE and MAE.

References

1. Gong, Y., Li, Z., Zhang, J., Liu, W., Zheng, Y.: Online spatio-temporal crowd flow distribution prediction for complex metro system. IEEE Trans. Knowl. Data Eng. 34(2), 865–880 (2020)
2. Yu, H., Liu, A., Wang, B., Li, R., Zhang, G., Lu, J.: Real-time decision making for train carriage load prediction via multi-stream learning. In: Gallagher, M., Moustafa, N., Lakshika, E. (eds.) AI 2020. LNCS (LNAI), vol. 12576, pp. 29–41. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-64984-5_3
3. Ma, X., Zhang, J., Du, B., Ding, C., Sun, L.: Parallel architecture of convolutional bi-directional LSTM neural networks for network-wide metro ridership prediction. IEEE Trans. Intell. Transp. Syst. 20(6), 2278–2288 (2018)
4. Lv, M., Hong, Z., Chen, L., Chen, T., Zhu, T., Ji, S.: Temporal multi-graph convolutional network for traffic flow prediction. IEEE Trans. Intell. Transp. Syst. 22(6), 3337–3348 (2020)
5. Gao, A., Zheng, L., Wang, Z., Luo, X., Xie, C., Luo, Y.: Attention based short-term metro passenger flow prediction. In: Qiu, H., Zhang, C., Fei, Z., Qiu, M., Kung, S.-Y. (eds.) KSEM 2021. LNCS (LNAI), vol. 12817, pp. 598–609. Springer, Cham (2021). https://doi.org/10.1007/978-3-030-82153-1_49
6. Yang, X., Xue, Q., Ding, M., Wu, J., Gao, Z.: Short-term prediction of passenger volume for urban rail systems: a deep learning approach based on smart-card data. Int. J. Prod. Econ. 231, 107920 (2021)
7. Yu, H., Lu, J., Liu, A., Wang, B., Li, R., Zhang, G.: Real-time prediction system of train carriage load based on multi-stream fuzzy learning. IEEE Trans. Intell. Transp. Syst. 23(9), 15155–15165 (2022)
8. Zhang, J., Chen, F., Cui, Z., Guo, Y., Zhu, Y.: Deep learning architecture for short-term passenger flow forecasting in urban rail transit. IEEE Trans. Intell. Transp. Syst. 22(11), 7004–7014 (2020)
9. Jing, Y., Hu, H., Guo, S., Wang, X., Chen, F.: Short-term prediction of urban rail transit passenger flow in external passenger transport hub based on LSTM-LGBDRS. IEEE Trans. Intell. Transp. Syst. 22(7), 4611–4621 (2020)
10. Williams, B.M., Hoel, L.A.: Modeling and forecasting vehicular traffic flow as a seasonal ARIMA process: theoretical basis and empirical results. J. Transp. Eng. 129(6), 664–672 (2003)
11. Sun, Y., Leng, B., Guan, W.: A novel wavelet-SVM short-time passenger flow prediction in Beijing subway system. Neurocomputing 166, 109–121 (2015)
12. Zhang, W.: Graph based approach to real-time metro passenger flow anomaly detection. In: 2021 IEEE 37th International Conference on Data Engineering (ICDE), pp. 2744–2749. IEEE (2021)
13. Li, B., Guo, T., Li, R., Wang, Y., Gandomi, A.H., Chen, F.: A two-stage self-adaptive model for passenger flow prediction on schedule-based railway system. In: Gama, J., Li, T., Yu, Y., Chen, E., Zheng, Y., Teng, F. (eds.) Advances in Knowledge Discovery and Data Mining, PAKDD 2022. LNCS, vol. 13282, pp. 147–160. Springer, Cham (2022). https://doi.org/10.1007/978-3-031-05981-0_12
14. Wang, J., Zhang, Y., Wei, Y., Hu, Y., Piao, X., Yin, B.: Metro passenger flow prediction via dynamic hypergraph convolution networks. IEEE Trans. Intell. Transp. Syst. 22(12), 7891–7903 (2021)
15. He, C., Wang, H., Jiang, X., Ma, M., Wang, P.: Dyna-PTM: OD-enhanced GCN for metro passenger flow prediction. In: 2021 International Joint Conference on Neural Networks (IJCNN), pp. 1–9. IEEE (2021)
16. Noursalehi, P., Koutsopoulos, H.N., Zhao, J.: Dynamic origin-destination prediction in urban rail systems: a multi-resolution spatio-temporal deep learning approach. IEEE Trans. Intell. Transp. Syst. 23, 5106–5115 (2021)
17. Li, Y., Yu, R., Shahabi, C., Liu, Y.: Diffusion convolutional recurrent neural network: data-driven traffic forecasting. arXiv preprint arXiv:1707.01926 (2017)
18. Rakthanmanon, T., et al.: Searching and mining trillions of time series subsequences under dynamic time warping. In: Proceedings of the 18th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 262–270 (2012)
19. Martineau, J., Finin, T.: Delta TFIDF: an improved feature space for sentiment analysis. In: Proceedings of the International AAAI Conference on Web and Social Media, vol. 3, pp. 258–261 (2009)
20. Defferrard, M., Bresson, X., Vandergheynst, P.: Convolutional neural networks on graphs with fast localized spectral filtering. In: Advances in Neural Information Processing Systems, vol. 29 (2016)
21. Wu, Z., Pan, S., Long, G., Jiang, J., Zhang, C.: Graph WaveNet for deep spatial-temporal graph modeling. arXiv preprint arXiv:1906.00121 (2019)
22. Fang, S., Zhang, Q., Meng, G., Xiang, S., Pan, C.: GSTNet: global spatial-temporal network for traffic flow prediction. In: IJCAI, pp. 2286–2293 (2019)
23. Huber, P.J.: Robust estimation of a location parameter. In: Kotz, S., Johnson, N.L. (eds.) Breakthroughs in Statistics. Springer Series in Statistics, pp. 492–518. Springer, New York (1992). https://doi.org/10.1007/978-1-4612-4380-9_35
24. Kingma, D.P., Ba, J.: Adam: a method for stochastic optimization. arXiv preprint arXiv:1412.6980 (2014)
25. Yu, B., Yin, H., Zhu, Z.: Spatio-temporal graph convolutional networks: a deep learning framework for traffic forecasting. arXiv preprint arXiv:1709.04875 (2017)
26. Yao, H., Tang, X., Wei, H., Zheng, G., Li, Z.: Revisiting spatial-temporal similarity: a deep learning framework for traffic prediction. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 33, pp. 5668–5675 (2019)

Enhancing Heterogeneous Graph Contrastive Learning with Strongly Correlated Subgraphs

Yanxi Liu1(B) and Bo Lang1,2

1 State Key Laboratory of Software Development Environment, Beihang University, Beijing, China
[email protected]
2 Zhongguancun Laboratory, Beijing, China

Abstract. Graph contrastive learning maximizes the mutual information between the embedding representations of the same data instances in different augmented views of a graph, obtaining feature representations for graph data in an unsupervised manner without the need for manual labeling. Most existing node-level graph contrastive learning models only consider embeddings of the same node in different views as positive sample pairs, ignoring the rich inherent neighboring relations and thus losing some contrastive information. To address this issue, we propose a heterogeneous graph contrastive learning model that incorporates strongly correlated subgraph features. We design a contrastive learning framework suitable for heterogeneous graphs and introduce high-level neighborhood information during the contrasting process. Specifically, our model selects a strongly correlated subgraph for each target node in the heterogeneous graph based on both topological structure information and node attribute feature information. In the calculation of the contrastive loss, we perform feature shifting operations on positive and negative samples based on the subgraph encoding to enhance the model's ability to discriminate between similar samples. We conduct node classification and ablation experiments on multiple public heterogeneous datasets, and the results verify the effectiveness of the research contributions of our model.

Keywords: Graph representation learning · Contrastive learning · Correlated subgraph

1 Introduction

Currently, graph data has become the mainstream representation method for data in different application scenarios such as social networks and citation networks. Graph representation learning, which can learn general low-dimensional feature representations for data units in the graph that are not oriented towards specific tasks, has become an effective solution for analyzing graph-structured data. Among these methods, graph neural network (GNN) models [12,20,34] based on the "message passing paradigm" update the representations of the target nodes by aggregating the features of neighbors, achieving good results.


Most existing GNN models adopt a 'semi-supervised learning paradigm', relying on partially annotated label information. In practice, manual labeling of graph data is costly. In order to study graph data in an unsupervised manner, graph contrastive learning (GCL) models [17,32,35,50] maximize the approximation between the representations of the target sample and similar data instances in different views by contrasting positive and negative samples. Existing GCL models are mostly oriented towards homogeneous graphs, and there are few GCL models for heterogeneous graphs, which contain multiple types of nodes and edges. Most existing homogeneous GCL models [35,46,49] only perform augmentation operations by perturbing and corrupting the structural and feature information of the graph. This strategy lacks interpretability, relies on chance, and is difficult to apply directly to heterogeneous graphs. Designing reasonable augmentation and contrasting-object selection strategies for heterogeneous nodes and edges is challenging. In addition, existing node-level GCL models [22,35,41,46] only directly contrast node pair embeddings while ignoring the semantic features implied by the neighborhood structure around the node, discarding a large amount of valuable contrasting information. Accordingly, how to reasonably select heterogeneous neighbors and use their features during the contrasting process is worth studying.

To address the above issues and obtain an effective heterogeneous GCL model, we make improvements in the following directions:
1. We propose a contrastive model for heterogeneous graphs that combines node attribute and structural characteristics to select less important edges for deletion to complete heterogeneous graph augmentation, and designs an adaptive strategy in the contrastive loss function to determine positive and negative samples.
2. We propose a strongly correlated subgraph selection method that combines the PageRank diffusion matrix with the hyperbolic distance between attribute features, to select a subgraph composed of heterogeneous nodes with high topological and attribute similarity for each target node.
3. We design a subgraph encoding module that performs space mapping on nodes within each subgraph according to their types, and obtains the overall representation of each subgraph through a readout operation. Subsequently, we use the obtained subgraph features to implement feature shifting operations during the calculation of the contrastive loss to enhance the discriminative ability of the contrastive model.
4. Node classification experiments on multiple public heterogeneous graph datasets verify the performance advantages of our model, and multiple ablation experiments demonstrate the improvements brought by each module in the model.

The rest of this paper is organized as follows: Sect. 2 summarizes the research status and representative work of graph representation learning and unsupervised GCL. Section 3 introduces some necessary preliminaries involved in the model. Section 4 provides detailed introductions to each module of our model. Section 5 summarizes


and reports the experimental performance results of the model on multiple public datasets. Section 6 summarizes the paper and presents conclusions.

2 Related Work

2.1 Graph Representation Learning

Graph representation learning models were first proposed by Gori et al. [8,28] to extract latent features from graph data. The initial shallow embedding models (including random walk based methods [9,26,31] and matrix factorization methods [1,48]) usually treated the target embeddings as model parameters to be optimized during training. Recently, the introduction of GNN models has greatly improved the performance of graph representation learning. Among GNN models, spectral-based methods [20,42] typically implement convolution operations for graph data in the frequency domain using Laplacian matrices and Chebyshev polynomials. Spatial-based methods [12,34] consider graph representation learning as a process in which each node updates its own representation by aggregating messages from its neighbors based on the topological structure of the graph, a pattern commonly referred to as "message passing" [7]. To deal with the multiple types of nodes and edges in heterogeneous graphs, heterogeneous models often use auxiliary predefined meta-paths to express compound semantic relationships. For example, Metapath2Vec [4] uses meta-paths to guide the random walk sampling process to obtain shallow embeddings. HAN [39] integrates embeddings under different meta-paths using an attention mechanism. NEP [44] establishes neural network modules corresponding to edge types and propagates node embeddings along meta-paths using a label propagation algorithm. Most existing graph representation learning models are defined in Euclidean space, but some models [10,19,51] choose to define the model in hyperbolic space to reduce data distortion problems on highly hierarchical data [2,23]. Nickel et al. [24] proposed a shallow embedding graph model in hyperbolic space. HHNE [40] uses the hyperbolic space distance as the loss function of the model to measure and optimize the similarity of node embedding representations. HGNN [23] and HGCN [2] implement similar hyperbolic GNN models using the exponential map.

2.2 Graph Contrastive Learning

Inspired by contrastive models in the image domain [3,15,36,43], GCL models are typically composed of three parts: data augmentation, encoder encoding, and contrastive loss calculation. Existing GCL models can be roughly divided into same-scale and cross-scale contrasting. Same-scale contrasting schemes use graph data instances at the same scale (e.g., node-node pairs) as contrast objects. For example, GRACE [49] completes data augmentation through edge deletion and feature masking, and optimizes the model using a node-level infoNCE


loss. GCA [50] augments both the adjacency and feature matrices and contrasts at the node level. GraphCL [46] uses different views of the same graph as contrast objects to maximize their consistency. CSSL [47] introduces node insertion as an extra augmentation operation. Cross-scale models contrast data instances at different scales. For example, DGI [35] obtains augmented views by column shuffling of the feature matrix, and then maximizes the mutual information between the node embeddings and the global features obtained by pooling operations. HDGI [27] and ConCH [21] propose full-graph heterogeneous contrasting models using meta-paths. SLiCE [38] samples a context subgraph for each target node and maximizes the consistency between them. HGCL [18] proposes a novel GCL framework that can hierarchically capture the structural semantics of graphs at both the node and graph levels. G-SupCon [30] uses subgraph encoding and multi-scale contrasting for efficient few-shot node classification.

3 Preliminaries

A graph can be denoted formally as $G = (V, E, \Phi)$, where $V = (v_1, v_2, \cdots, v_N)$ denotes the set of all the nodes in the graph and $N$ is the number of nodes. $E$ represents the set of edges and $\Phi$ is the type mapping function. Heterogeneous graphs contain multiple types of nodes and edges, i.e., $|\Phi(V)| + |\Phi(E)| > 2$. Homogeneous graphs, on the other hand, contain only one type of node and one type of edge, i.e., $|\Phi(V)| = |\Phi(E)| = 1$. A homogeneous graph can also be written as $G = (A, X)$, with the adjacency matrix $A \in [0, 1]^{N \times N}$ and the initial node feature matrix $X = (x_1, x_2, \cdots, x_N)$, where $x_i \in \mathbb{R}^d$ represents the feature of node $v_i$ with dimension $d$. A graph representation learning model can be represented by Eq. (1):

$f : (V, E, X) \rightarrow Z \in \mathbb{R}^{N \times d'}$   (1)

In the formula, $f$ is the encoding function, such as a GNN encoder, and $Z$ is the obtained feature or embedding matrix with dimension $d' \ll d$. The obtained low-dimensional representation can be further applied to downstream tasks.

As the most successful direction in graph representation learning, GNNs [12,20,34] update the representation of the central target node by aggregating neighbors' features. The aggregation process at the $l$-th layer can be represented by Eq. (2):

$h^{(l+1)}(v_i) = \sigma\left(\sum_{u_j \in N(v_i)} \alpha(v_i, u_j)\, h^{(l)}(u_j)\, W^{(l)}\right)$   (2)

where $\sigma(\cdot)$ is the activation function, the aggregation weight $\alpha(v_i, u_j)$ is obtained through an attention mechanism or directly from a normalized adjacency matrix, $h^{(l)}(u_j)$ represents the feature representation of node $u_j$ at layer $l$, and $W^{(l)}$ is a trainable parameter matrix.

To avoid reliance on manual labels, GCL models maximize the mutual information between node features and their representations. Such models typically
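A minimal PyTorch sketch of the aggregation in Eq. (2): each node averages its neighbours' features (a normalised-adjacency choice of the weights alpha) and applies a learnable transform. The class name and toy sizes are assumptions; this is an illustration, not any particular library's layer.

```python
import torch
import torch.nn as nn

class SimpleMessagePassing(nn.Module):
    def __init__(self, in_dim, out_dim):
        super().__init__()
        self.W = nn.Linear(in_dim, out_dim, bias=False)

    def forward(self, h, adj):
        # h: (N, in_dim) node features; adj: (N, N) binary adjacency matrix.
        alpha = adj / (adj.sum(dim=1, keepdim=True) + 1e-9)   # row-normalised aggregation weights
        return torch.relu(self.W(alpha @ h))                  # h^{(l+1)}

layer = SimpleMessagePassing(8, 16)
out = layer(torch.randn(5, 8), torch.bernoulli(torch.full((5, 5), 0.4)))
print(out.shape)  # torch.Size([5, 16])
```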


consist of three modules: data augmentation, encoder encoding, and loss function calculation. Given a graph input $G = (A, X)$, the $i$-th data augmentation operation $T_i$ is applied to the adjacency matrix $A$ and feature matrix $X$ to obtain different views $\tilde{G}_i$, as in Eq. (3):

$\tilde{G}_i = T_i(A, X) = (\tilde{A}, \tilde{X})$   (3)

The obtained views are then encoded by the encoder as $h_i = f(\tilde{G}_i)$. Structural graph augmentation operations mainly include edge perturbation, edge diffusion, etc. The edge perturbation operation can be represented by Eq. (4):

$T_A^{(pert)}(A) = A * (1 - 1_p) + (1 - A) * 1_p$   (4)

where $1_p$ is the location indicator obtained by sampling with probability $p$, and its elements are 1 or 0. Edge diffusion operations create new edges based on random walks to obtain a new diffusion matrix that reflects global information. For example, the Personalized PageRank matrix $T_A^{(PPR)}$ can be represented by Eq. (5), where $\alpha$ represents the random walk transition probability and $\tilde{D}$ is the symmetric diagonal matrix with $\tilde{A} = A + I_N$ and $\tilde{D}_{ii} = \sum_{j=1}^{N} \tilde{A}_{ij}$:

$T_A^{(PPR)} = \alpha \left(I_n - (1 - \alpha)\, \tilde{D}^{-1/2} \tilde{A} \tilde{D}^{-1/2}\right)^{-1}$   (5)

After obtaining the embeddings corresponding to different views, GCL models perform unsupervised training of the model by maximizing the mutual information between them, which can be calculated by the Kullback-Leibler (KL) divergence. To improve computational efficiency, in practice, GCL models usually approximate the lower bound of the mutual information using estimators. For example, when calculating the contrastive loss for node embeddings $h$ and $h'$ obtained from views $G$ and $G'$ using the infoNCE estimator [11], Eq. (6) can be used:

$L_{infoNCE} = -\frac{1}{N}\sum_{i=1}^{N} L^{i}_{infoNCE} = -\frac{1}{N}\sum_{i=1}^{N} \log \frac{\sum_{j \in POS_i} e^{D(h_i, h'_j)/\tau}}{\sum_{j \in POS_i} e^{D(h_i, h'_j)/\tau} + \sum_{k \in NEG_i} e^{D(h_i, h'_k)/\tau}}$   (6)

where $POS_i$ and $NEG_i$ represent the positive and negative example sets of node $i$, respectively. The discriminator $D : \mathbb{R}^d \times \mathbb{R}^d \rightarrow \mathbb{R}$ is used to measure the similarity between two embedding results, and the temperature coefficient $\tau$ is usually used as a hyperparameter to control the degree of smoothness. Optimizing the above loss function makes the similarity scores between positive pairs much higher than those between negative pairs.
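A hedged sketch of an infoNCE-style objective in the spirit of Eq. (6), using a dot-product discriminator and boolean masks for the positive sets; the function name and toy data are assumptions, not the authors' implementation.

```python
import torch

def info_nce(h, h_prime, pos_mask, tau=0.5):
    # h, h_prime: (N, d) embeddings from two views; pos_mask: (N, N) boolean positive-pair mask.
    sim = torch.exp((h @ h_prime.t()) / tau)          # e^{D(h_i, h'_j)/tau} with dot-product D
    pos = (sim * pos_mask).sum(dim=1)
    neg = (sim * (~pos_mask)).sum(dim=1)
    return -torch.log(pos / (pos + neg)).mean()

N, d = 6, 16
h, h_prime = torch.randn(N, d), torch.randn(N, d)
loss = info_nce(h, h_prime, torch.eye(N, dtype=torch.bool))   # same-node pairs as positives
```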

4 Method

4.1 Overall Framework

Similar to conventional GCL models, as shown in Fig. 1, our model is also composed of three modules: data augmentation, encoder encoding, and contrastive loss calculation. Each module is introduced below.

Fig. 1. Model framework

Data Augmentation. MVGRL [14] pointed out that data augmentation operations in the feature space degrade the performance of the model. Therefore, we choose to perform augmentation operations on the topological structure in our model. Specifically, we perform edge deletion augmentation operations on subgraphs corresponding to heterogeneous edges. Compared to existing models [32,49] that randomly select edges to delete, we use the following Eq. (7) to calculate the hyperbolic distance [2] between the features of the two endpoints of each edge of type $e$, with the Lorentzian scalar product $\langle x, y \rangle_L = -x_0 y_0 + \sum_{n=1}^{d} x_n y_n$ and the trainable negative curvature $k$, to better capture the hierarchical characteristics:

$d_L(x, y) = \sqrt{k}\, \mathrm{arcosh}\left(-\frac{\langle x, y \rangle_L}{k}\right)$   (7)

Then we use Eq. (8) to normalize the distance to the range [0, 1]:

$d_i^e = \frac{d_i^e - d_{min}^e}{d_{max}^e - d_{min}^e}$   (8)

Afterwards, we use the Bernoulli distribution to sample edges of type $e$, which can be expressed by Eq. (9), where $m_e$ is the number of edges of the corresponding type:

$B^e = \left(\mathrm{Bernoulli}(d_1^e), \cdots, \mathrm{Bernoulli}(d_{m_e}^e)\right), \quad B^e \in [0, 1]^{1 \times m_e}$   (9)

(9)
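A simplified sketch of the augmentation in Eqs. (7)-(9): score each edge of a given type by the distance between its endpoint features, normalise to [0, 1], and drop the edges drawn from a Bernoulli over those scores. For brevity this sketch uses Euclidean distance in place of the Lorentzian/hyperbolic distance of Eq. (7); the function name and toy tensors are illustrative only.

```python
import torch

def drop_edges_by_distance(edge_index, feats_src, feats_dst):
    # edge_index: (2, m) source/target node ids for one heterogeneous edge type.
    d = (feats_src[edge_index[0]] - feats_dst[edge_index[1]]).norm(dim=1)
    d = (d - d.min()) / (d.max() - d.min() + 1e-9)        # Eq. (8) style normalisation
    keep = torch.bernoulli(d) == 0                        # Eq. (9): a sampled 1 means delete
    return edge_index[:, keep]

edges = torch.randint(0, 10, (2, 40))
kept = drop_edges_by_distance(edges, torch.randn(10, 8), torch.randn(15, 8))
print(kept.shape)
```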

After that, we select the edges corresponding to the sampling value of 1 for  edge deletion accordingly and obtain the augmented adjacency matrix Ae . After repeating this process for different types of subgraphs, we merge the subgraphs to complete the multi-subgraph augmentation operation for heterogeneous graphs  and obtain the augmented view G . Encoder Encoding. As shown in Fig. 1, we use GNN and MLP(multi-layer perceptron) to encode different data views to obtain the embedding matrices H and  H . We propose a heterogeneous strongly correlated subgraph selection strategy to better utilize the high-order neighborhood characteristics, which selects nodes with high correlation in the graph for each node to form a strongly correlated heterogeneous subgraph GSU B and introduces a heterogeneous subgraph encoding module EN CSU B . This allows each node to obtain the corresponding subgraph representation. Then we concat all subgraph encodings to obtain the subgraph feature matrix S. The selection strategy and encoding implementation of the strongly correlated subgraph will be introduced in detail in Sect. 4.2. Contrastive Loss Calculation. Most existing node-level graph contrastive models merely use the representation of the target node in another view as the only positive sample. Intuitively, more contrastive information can be provided by adding similar nodes into positive set. In heterogeneous graphs, there are multiple types of nodes, and it is not reasonable to introduce first-order neighbors as positive examples as in [22]. To solve this problem, we propose a positive example expansion strategy that integrates topological and feature information to select similar nodes in heterogeneous graphs. First, we use the graph diffusion matrix Πppr based on personalized Pagerank [25] proposed in APPNP [6] to calculate the importance between nodes from the topological structure level and select the relevant node set. The expression (i,j) of Πppr is shown in Eq. (5), where the element Πppr represents the importance between nodes i and j that integrates high-order topological information. Afterwards, for the target node i, we select the top 2K elements in terms of importance score and preliminarily determine a strongly correlated set through the corresponding node index function idx. This process can be expressed by Eq. (10).



P OSippr = idx T OP Πppr i , 2K

(10)

Afterwards, in order to use node attribute features, we also use hyperbolic distance as a similarity measurement to further filter and select similar nodes. In the screening process, we only select nodes of the same type as the target node


from the qualified nodes to obtain the final positive example set. This process is shown in Eq. (11).          ∩ Φ(i) = Φ(j) (11) P OS i = j j ∈ idx T OP d Xi , X P OSippr , K In the above formula, we calculate the K nodes with the smallest hyperbolic distance between features and select nodes of the same type as the target node i through the type mapping function Φ as the final target positive example set. Accordingly, we regard all other nodes of the same type outside the positive example set as negative examples of target node i, as shown in Eq. (12): N EGi = {j |j ∈ (V − P OS i ) ∩ Φ(i) = Φ(j) }

(12)

Afterwards, as shown in Fig. 1, we perform a feature shifting operation during the contrastive loss calculation. The specific implementation will be given in Sect. 4.3.

4.2 Strongly Correlated Subgraph Selection and Encoding

Strongly Correlated Subgraph Selection. In order to utilize the high-order neighborhood information in node-level heterogeneous GCL models, we choose to sample and select several nodes with high similarity for each target node to form a strongly correlated heterogeneous subgraph. Specifically, first, we use the personalized Pagerank [25] graph diffusion matrix Πppr to initially select a set of similar nodes for the target node i. This process can be expressed by Eq. (13), which retains 2K nodes with the highest Πppr correlation score with the target node i, where K is a hyperparameter used to control the scale of the strongly correlated subgraph. When K increases, the larger strongly correlated subgraph contains more neighborhood information, but it will also cause the correlation of the nodes within the subgraph to decrease.



= idx T OP Πppr i , 2K G_SU B ppr i

(13)

Then we also further screen the set based on the hyperbolic feature distance between the attribute features of the elements within the set and the target node i. This process is shown in Eq. (14). When the attribute feature distance between any node and the target node is small, their correlation is high.          , K (14) G_SU B i = j j ∈ idx T OP d Xi , X G_SU B ppr i Unlike the positive example expansion strategy in Sect. 4.1, the subgraph selection does not impose restrictions on node types, which means that the set G_SU B i may contain nodes of different types from the target node i. In fact, heterogeneous nodes in the original graph that are highly associated with the target node can also provide valuable contrastive information if selected.


Heterogeneous Subgraph Encoding. In order to learn a unified representation for the multi-type nodes within the strongly correlated subgraph as the overall feature, we introduce an additional heterogeneous subgraph encoder. Specifically, given the embedding matrix H obtained by the GNN encoder, for the target node vi and the Φ(j)-typed node vj in its strongly correlated subgraph set G_SU B i , we first perform space mapping based on the node type. This process can be expressed by Eq. (15), where SΦ(j) is a space mapping encoder composed of a type-specific linear transformation and an activation function. (j)

Si

= SΦ(j) (Hj )

(15)

After that, we use the readout operation to integrate the features of all nodes  within the set into a vector with a dimension of 1×d , which serves as the overall representation of the subgraph corresponding to the target node i. This process can be expressed by Eq. (16), where the Readout function can be implemented using mean or max pooling operations.    (j) (16) Si ∈ R1×d = Readout Si |j ∈ GSU B i Accordingly, we can obtain the subgraph feature matrix composed of the subgraph features of all target nodes, which can be expressed by Eq. (17): S = Concat ({Si |vi ∈ V }) 4.3

(17)

Feature Shifting and Loss Calculation

After obtaining the subgraph feature matrix S, we will use this matrix to perform  feature shift operations on the embedding H obtained from the augmented view  G to obtain the shifted embedding matrix Hs . The feature shift process is shown in Fig. 2. In the figure, after obtaining the feature of the strongly correlated subgraph containing multiple types of nodes (corresponding to the green virtual node in the figure), positive and negative samples of the same type as the blue target node are respectively shifted along the direction of the virtual subgraph feature, and the features obtained after shifting correspond to nodes with blue stripes. After the feature shifting, positive samples will move away from the target node to a certain extent in the embedding space, while negative samples will move closer. Since the optimization process of contrastive learning will learn similar representations for positive sample pairs, after feature shifting, the model will reduce the distance between the target node and farther-moved positive samples, thereby making the embeddings of original positive samples closer to the target node. Similarly, the model will increase the distance between the target node and closer-moved negative samples, thereby making the embeddings of original negative samples further away from the target node. However, during testing, we will not perform feature shift operations, which makes the model more discriminatable between positive and negative samples on the test set.


Fig. 2. Sample feature shifting

The above feature shifting process can be expressed by Eq. (18), where the weight coefficient $\alpha$ is a hyperparameter:

$H^s = H' + \alpha * S$   (18)

After feature shifting, we compare the embedding matrix $H$ obtained from the input of the original graph with $H^s$ and calculate the infoNCE loss. This process is shown in Eq. (19), where the inter-pos loss term is $S^S(h_i, pos) = \sum_{j \in POS_i} e^{D(h_i, h_j^S)/\tau}$, the intra-neg loss term is $S(h_i, neg) = \sum_{j \in NEG_i} e^{D(h_i, h_j)/\tau}$ (the intra-pos and inter-neg terms are defined analogously), and $POS_i$ and $NEG_i$ are defined in Eq. (11) and Eq. (12), respectively:

$L = \sum_{i=1}^{N} \log \frac{\overbrace{S^S(h_i, pos)}^{inter\text{-}pos} + \overbrace{S(h_i, pos)}^{intra\text{-}pos}}{S^S(h_i, pos) + S(h_i, pos) + \underbrace{S(h_i, neg)}_{intra\text{-}neg} + \underbrace{S^S(h_i, neg)}_{inter\text{-}neg}}$   (19)

5 Experiments

We evaluate the model's performance on three public heterogeneous graph datasets through node classification tasks, answering the following research questions (RQ):

– RQ1: How does our model perform on node classification tasks compared to the state-of-the-art comparison models?
– RQ2: How do the principal components of our model influence the performance?


5.1 Experiment Setup

Dataset and Metrics. We selected three heterogeneous graph datasets1 and used Micro-F1 and Macro-F1 in the node classification task as the metrics. The information of each dataset is given in Table 1. We randomly split 80% of the target nodes for training and 20% for testing, and report the average score over ten random seeds for each set of hyperparameters.

Table 1. Dataset information

Dataset | Node-type  | # Nodes | Edge-type        | # Edges
ACM     | Author     | 7167    | Author-Paper     | 13407
        | Paper      | 4019    | Paper-Subject    | 4019
        | Subject    | 60      |                  |
IMDB    | Actors     | 4353    | Actors-Movies    | 11028
        | Movies     | 3676    | Movies-Directors | 3676
        | Directors  | 1678    |                  |
DBLP    | Author     | 2000    | Author-Paper     | 18304
        | Paper      | 9556    | Paper-Conference | 9556
        | Conference | 20      |                  |

Comparison Methods. We selected several representative node-level graph representation learning models as comparison methods. Specifically, these include two supervised homogeneous graph representation learning models: DeepWalk [26] and LINE [31]; six supervised heterogeneous graph representation learning models: DHNE [33], Metapath2vec [4], Hin2vec [5], HERec [29], HeGAN [16] and ie-HGCN [45] (SOTA); four unsupervised homogeneous GCL models: DGI [35], BGRL [32], GRACE [49] and MVGRL [14]; and the SOTA heterogeneous GCL model HDGI [27]. We chose HeteroGraphConv2 implemented in DGL [37] as the encoder of our model and the dot product as the discriminator function. We implemented all methods on a Tesla V100-32 GPU. The implementations of the comparison methods mainly refer to OpenHGNN [13] and PyGCL3. We optimize the hyper-parameters with Optuna4.

1 https://github.com/Andy-Border/NSHE
2 https://docs.dgl.ai/generated/dgl.nn.pytorch.HeteroGraphConv.html
3 https://github.com/PyGCL/PyGCL
4 https://github.com/optuna/optuna

5.2 Results and Analysis (RQ1)

The results of the node classification experiments are shown in Table 2, where the best result on each dataset is denoted in bold. From the results, the performance of the supervised and unsupervised learning models among the comparison methods is not very different; on the DBLP and IMDB datasets, unsupervised models even have a certain lead, reflecting the potential of unsupervised models. Within the unsupervised GCL models, the overall performance of the cross-scale models DGI and HDGI lags significantly behind the other contrastive models. This indicates that on heterogeneous graphs, directly contrasting the features of multiple types of nodes against global representations does not conform to the data characteristics and degrades model performance. Compared with the SOTA unsupervised comparison model, our model shows a 4.5% performance improvement averaged across the three datasets. Compared with the contrastive learning models DGI and HDGI, which also aim to introduce global information, our model has obvious performance advantages. This shows that, compared with global feature contrasting, finer-grained strongly correlated subgraph features can better reflect the neighborhood characteristics of target nodes and achieve significant improvement. Compared with supervised models, our model also outperforms all comparison methods with at least a 2.6% performance improvement, except that it is slightly lower than ie-HGCN on the ACM dataset.

Table 2. Overall result of node classification

Model        | IMDB Micro-F1 | IMDB Macro-F1 | ACM Micro-F1 | ACM Macro-F1 | DBLP Micro-F1 | DBLP Macro-F1
SUPERVISED
DeepWalk     | 56.52 | 55.24 | 82.17 | 81.82 | 89.44 | 88.48
LINE-1st     | 43.75 | 39.87 | 82.46 | 82.35 | 82.32 | 80.20
LINE-2nd     | 40.54 | 33.06 | 82.21 | 81.32 | 88.76 | 87.35
DHNE         | 38.99 | 30.53 | 65.27 | 62.31 | 73.30 | 67.61
Metapath2Vec | 51.90 | 50.21 | 83.61 | 82.77 | 89.36 | 87.95
HIN2Vec      | 48.02 | 46.24 | 54.30 | 48.59 | 90.30 | 89.46
HERec        | 54.48 | 53.46 | 81.89 | 81.74 | 86.21 | 84.55
HeGAN        | 58.56 | 57.12 | 83.09 | 82.94 | 90.48 | 89.27
ie-HGCN      | 56.33 | 47.32 | 86.29 | 86.14 | 90.55 | 89.30
UNSUPERVISED
DGI          | 36.46 | 33.69 | 49.03 | 27.31 | 89.95 | 88.78
GRACE        | 52.73 | 50.21 | 85.56 | 85.29 | 90.21 | 89.12
MVGRL        | 56.25 | 54.99 | 81.97 | 81.22 | 90.12 | 89.14
BGRL         | 54.28 | 53.04 | 82.76 | 82.33 | 89.22 | 87.90
HDGI         | 38.67 | 34.13 | 74.58 | 71.43 | 85.14 | 82.77
Our method   | 60.79 | 59.94 | 85.89 | 85.81 | 91.20 | 90.18

5.3 Ablation Study (RQ2)

In this paper, we propose a heterogeneous GCL model that integrates strongly correlated subgraph features based on topological similarity and hyperbolic feature similarity. After encoding the subgraph features, we perform feature shifting on positive and negative samples during the contrasting process. In order to verify the impact of the above design on the model, we selected the following two model variants for ablation experiments: (1) W/O HR: when selecting correlated subgraphs, nodes are randomly selected without calculating node similarity to form strongly correlated subgraphs. (2) W/O FS: During the contrasting loss calculation, no feature shifting operation is performed. This variant can be achieved by setting the parameter α in Eq. (18) to 0. The results of the two variants against the complete model are shown in Table 3. From Table 3, it can be seen that both the strongly correlated subgraph selection strategy and the feature shifting operation have a certain improvement effect on model performance. The results show that the subgraph set randomly selected by W/O HR has low correlation, and the obtained subgraph features cannot reflect the neighborhood information of the target node. The feature shifting operation based on this feature will reduce the performance of the model. The performance of W/O FS on all datasets is also lower than that of the complete model, which reflects that the feature shifting can improve the effect of the contrastive model on specific samples. Table 3. Ablation study Model

Table 3. Ablation study

| Model | IMDB Micro-F1 | IMDB Macro-F1 | ACM Micro-F1 | ACM Macro-F1 | DBLP Micro-F1 | DBLP Macro-F1 |
| W/O HR | 57.31 | 56.22 | 84.47 | 83.61 | 88.93 | 88.84 |
| W/O FS | 58.05 | 57.01 | 84.72 | 84.13 | 89.96 | 89.83 |
| Our complete model | 60.79 | 59.94 | 85.89 | 85.81 | 91.20 | 90.18 |
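To make the W/O FS toggle in Table 3 concrete, the sketch below shows one way an α-weighted feature shift could enter an InfoNCE-style contrastive loss. Since Eq. (18) is not reproduced here, the additive shift, the temperature τ, and all function names are illustrative assumptions rather than the paper's definition; only the role of α (α = 0 recovers W/O FS) follows the text.

```python
import torch
import torch.nn.functional as F

def shifted_contrastive_loss(z_anchor, z_pos, z_neg, g_sub, alpha=0.5, tau=0.5):
    """Illustrative InfoNCE-style loss with subgraph-feature shifting.

    z_anchor: (d,) target-node embedding; z_pos: (d,) positive sample;
    z_neg: (K, d) negative samples; g_sub: (d,) strongly correlated
    subgraph feature. Setting alpha = 0 corresponds to the W/O FS variant.
    """
    # Shift positive and negative samples toward the subgraph feature
    # (assumed additive; the paper's Eq. (18) may differ in detail).
    z_pos_s = z_pos + alpha * g_sub
    z_neg_s = z_neg + alpha * g_sub

    sim_pos = F.cosine_similarity(z_anchor, z_pos_s, dim=0) / tau
    sim_neg = F.cosine_similarity(z_anchor.unsqueeze(0), z_neg_s, dim=1) / tau

    logits = torch.cat([sim_pos.view(1), sim_neg])   # positive logit first
    labels = torch.zeros(1, dtype=torch.long)        # index of the positive
    return F.cross_entropy(logits.unsqueeze(0), labels)
```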

6 Conclusions

In this paper, we propose a heterogeneous GCL model that integrates strongly correlated subgraph features. This method aims to introduce high-order neighborhood information into node-level contrastive tasks. To this end, the model first selects a strongly correlated subgraph for each target node, which contains nodes of different types with high topological similarity and feature similarity. Afterwards, the model introduces a subgraph encoding module, which obtains a single overall feature for the strongly correlated subgraph of each target node through node-type space mapping and readout operations. During the calculation of the contrastive loss, we shift the positive and negative sample features of the target node with the subgraph feature.


This operation enables the model to learn more discriminative representations for similar contrasting samples. We evaluated our model on multiple public heterogeneous graph datasets, and the results show that it outperforms the state-of-the-art unsupervised contrastive models. Ablation experiments also show that both the strongly correlated subgraph selection strategy and the feature shifting operation improve the performance of the model.

References 1. Cao, S., Lu, W., Xu, Q.: GraRep: learning graph representations with global structural information. In: Proceedings of the 24th ACM International on Conference on Information and Knowledge Management, Melbourne, Australia, pp. 891–900. ACM (2015). https://doi.org/10.1145/2806416.2806512 2. Chami, I., Ying, Z., Ré, C., Leskovec, J.: Hyperbolic graph convolutional neural networks. In: Wallach, H., Larochelle, H., Beygelzimer, A., dAlché-Buc, F., Fox, E., Garnett, R. (eds.) Advances in Neural Information Processing Systems. Curran Associates, Inc. (2019) 3. Chen, T., Kornblith, S., Norouzi, M., Hinton, G.: A simple framework for contrastive learning of visual representations. In: Proceedings of the 37th International Conference on Machine Learning. PMLR (2020) 4. Dong, Y., Chawla, N.V., Swami, A.: metapath2vec: scalable representation learning for heterogeneous networks. In: The 23rd ACM SIGKDD International Conference (2017) 5. Fu, T., Lee, W.C., Lei, Z.: HIN2Vec: explore meta-paths in heterogeneous information networks for representation learning. In: Proceedings of the 2017 ACM on Conference on Information and Knowledge Management, Singapore, Singapore, pp. 1797–1806. ACM (2017). https://doi.org/10.1145/3132847.3132953 6. Gasteiger, J., Bojchevski, A., Günnemann, S.: Predict then propagate: graph neural networks meet personalized PageRank. In: International Conference on Learning Representations (2019) 7. Gilmer, J., Schoenholz, S.S., Riley, P.F., Vinyals, O., Dahl, G.E.: Neural message passing for quantum chemistry. In: Proceedings of the 34th International Conference on Machine Learning, ICML 2017, vol. 70. JMLR.org (2017) 8. Gori, M., Monfardini, G., Scarselli, F.: A new model for learning in graph domains. In: Proceedings of the 2005 IEEE International Joint Conference on Neural Networks. IEEE (2005). https://doi.org/10.1109/IJCNN.2005.1555942 9. Grover, A., Leskovec, J.: Node2vec: scalable feature learning for networks. In: Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM (2016). https://doi.org/10.1145/2939672.2939754 10. Gulcehre, C., et al.: Hyperbolic attention networks. In: 2018 International Conference on Learning Representations (2018) 11. Gutmann, M., Hyvärinen, A.: Noise-contrastive estimation: a new estimation principle for unnormalized statistical models. In: Proceedings of the Thirteenth International Conference on Artificial Intelligence and Statistics. JMLR Workshop and Conference Proceedings (2010) 12. Hamilton, W.L., Ying, R., Leskovec, J.: Inductive representation learning on large graphs. In: Proceedings of the 31st International Conference on Neural Information Processing Systems, NIPS 2017. Curran Associates Inc. (2017)


13. Han, H., et al.: OpenHGNN: an open source toolkit for heterogeneous graph neural network. In: Proceedings of the 31st ACM International Conference on Information & Knowledge Management. ACM (2022). https://doi.org/10.1145/3511808. 3557664 14. Hassani, K., Khasahmadi, A.H.: Contrastive multi-view representation learning on graphs. In: Proceedings of the 37th International Conference on Machine Learning, ICML 2020. JMLR.org (2020) 15. He, K., Fan, H., Wu, Y., Xie, S., Girshick, R.: Momentum contrast for unsupervised visual representation learning. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 9729–9738 (2020) 16. Hu, B., Fang, Y., Shi, C.: Adversarial learning on heterogeneous information networks. In: Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining. ACM (2019). https://doi.org/10.1145/ 3292500.3330970 17. Jin, M., Zheng, Y., Li, Y.F., Gong, C., Zhou, C., Pan, S.: Multi-scale contrastive siamese networks for self-supervised graph representation learning. In: Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence. International Joint Conferences on Artificial Intelligence Organization (2021). https://doi. org/10.24963/ijcai.2021/204 18. Ju, W., et al.: Unsupervised graph-level representation learning with hierarchical contrasts. Neural Netw. (2023). https://doi.org/10.1016/j.neunet.2022.11.019 19. Khrulkov, V., Mirvakhabova, L., Ustinova, E., Oseledets, I., Lempitsky, V.: Hyperbolic image embeddings. In: 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). IEEE (2020). https://doi.org/10.1109/CVPR42600. 2020.00645 20. Kipf, T.N., Welling, M.: Semi-supervised classification with graph convolutional networks. In: 5th International Conference on Learning Representations, ICLR 2017, Conference Track Proceedings, Toulon, France, 24–26 April 2017. OpenReview.net (2017) 21. Li, X., Ding, D., Kao, B., Sun, Y., Mamoulis, N.: Leveraging meta-path contexts for classification in heterogeneous information networks. In: 2021 IEEE 37th International Conference on Data Engineering (ICDE). IEEE (2021). https://doi.org/ 10.1109/ICDE51399.2021.00084 22. Liu, J., Yang, M., Zhou, M., Feng, S., Fournier-Viger, P.: Enhancing hyperbolic graph embeddings via contrastive learning. In: 2nd Workshop on Self-Supervised Learning, NeurIPS 2021. arXiv arXiv:2201.08554, 35th Conference on Neural Information Processing Systems, NeurIPS 2021 (2022) 23. Liu, Q., Nickel, M., Kiela, D.: Hyperbolic graph neural networks. In: Advances in Neural Information Processing Systems. Curran Associates, Inc. (2019) 24. Nickel, M., Kiela, D.: Poincaré embeddings for learning hierarchical representations. In: Advances in Neural Information Processing Systems, vol. 30. Curran Associates, Inc. (2017) 25. Page, L., Brin, S., Motwani, R., Winograd, T.: The PageRank citation ranking: bringing order to the web (1999). http://ilpubs.stanford.edu:8090/422/ 26. Perozzi, B., Al-Rfou, R., Skiena, S.: DeepWalk: online learning of social representations. In: Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM (2014). https://doi.org/10.1145/ 2623330.2623732 27. Ren, Y., Liu, B., Huang, C., Dai, P., Bo, L., Zhang, J.: HDGI: an unsupervised graph neural network for representation learning in heterogeneous graph. In: AAAI Workshop (2020)


28. Scarselli, F., Gori, M., Tsoi, A.C., Hagenbuchner, M., Monfardini, G.: The graph neural network model. IEEE Trans. Neural Netw. 20, 61–80 (2009). https://doi. org/10.1109/TNN.2008.2005605 29. Shi, C., Hu, B., Zhao, W.X., Yu, P.S.: Heterogeneous information network embedding for recommendation. IEEE Trans. Knowl. Data Eng. (2019). https://doi.org/ 10.1109/TKDE.2018.2833443 30. Tan, Z., Ding, K., Guo, R., Liu, H.: Supervised graph contrastive learning for fewshot node classification. In: Amini, M.R., Canu, S., Fischer, A., Guns, T., Kralj Novak, P., Tsoumakas, G. (eds.) Machine Learning and Knowledge Discovery in Databases, ECML PKDD 2022. LNCS, vol. 13714, pp. 394–411. Springer, Cham (2023). https://doi.org/10.1007/978-3-031-26390-3_24 31. Tang, J., Qu, M., Wang, M., Zhang, M., Yan, J., Mei, Q.: LINE: large-scale information network embedding. In: Proceedings of the 24th International Conference on World Wide Web. International World Wide Web Conferences Steering Committee (2015). https://doi.org/10.1145/2736277.2741093 32. Thakoor, S., et al.: Large-scale representation learning on graphs via bootstrapping. In: International Conference on Learning Representations (2022) 33. Tu, K., Cui, P., Wang, X., Wang, F., Zhu, W.: Structural deep embedding for hyper-networks. Proc. AAAI Conf. Artif. Intell. 32(1), 426–433 (2018). AAAI’18/IAAI’18/EAAI’18, AAAI Press (2018) 34. Veličković, P., Cucurull, G., Casanova, A., Romero, A., Liò, P., Bengio, Y.: Graph attention networks. In: 2018 International Conference on Learning Representations (2018) 35. Veličković, P., Fedus, W., Hamilton, W.L., Liò, P., Bengio, Y., Hjelm, R.D.: Deep graph infomax. In: International Conference on Learning Representations (2022) 36. Wang, C., Sun, D., Bai, Y.: PiPAD: pipelined and parallel dynamic GNN training on GPUs. In: Proceedings of the 28th ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming. ACM (2023). https://doi.org/ 10.1145/3572848.3577487 37. Wang, M., et al.: Deep graph library: a graph-centric, highly-performant package for graph neural networks (2019) 38. Wang, P., Agarwal, K., Ham, C., Choudhury, S., Reddy, C.K.: Self-supervised learning of contextual embeddings for link prediction in heterogeneous networks. In: 2021 Proceedings of the Web Conference. ACM (2021). https://doi.org/10. 1145/3442381.3450060 39. Wang, X., et al.: Heterogeneous graph attention network. In: The World Wide Web Conference. ACM (2019). https://doi.org/10.1145/3308558.3313562 40. Wang, X., Zhang, Y., Shi, C.: Hyperbolic heterogeneous information network embedding. Proc. AAAI Conf. Artif. Intell. (2019). https://doi.org/10.1609/aaai. v33i01.33015337 41. Wang, Y., Wang, W., Liang, Y., Cai, Y., Liu, J., Hooi, B.: NodeAug: semisupervised node classification with data augmentation. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining. ACM (2020). https://doi.org/10.1145/3394486.3403063 42. Wu, F., Souza, A., Zhang, T., Fifty, C., Yu, T., Weinberger, K.: Simplifying graph convolutional networks. In: International Conference on Machine Learning, PMLR 2019, pp. 6861–6871. PMLR (2019) 43. Wu, Z., Xiong, Y., Yu, S.X., Lin, D.: Unsupervised feature learning via nonparametric instance discrimination. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (2018)


44. Yang, C., Zhang, J., Han, J.: Neural embedding propagation on heterogeneous networks. In: 2019 IEEE International Conference on Data Mining (ICDM). IEEE (2019) 45. Yang, Y., Guan, Z., Li, J., Zhao, W., Cui, J., Wang, Q.: Interpretable and efficient heterogeneous graph convolutional network. IEEE Trans. Knowl. Data Eng. (2021). https://doi.org/10.1109/TKDE.2021.3101356 46. You, Y., Chen, T., Sui, Y., Chen, T., Wang, Z., Shen, Y.: Graph contrastive learning with augmentations. In: Larochelle, H., Ranzato, M., Hadsell, R., Balcan, M.F., Lin, H. (eds.) Advances in Neural Information Processing Systems, vol. 33. Curran Associates, Inc. (2020) 47. Zeng, J., Xie, P.: Contrastive self-supervised learning for graph classification. Proc. AAAI Conf. Artif. Intell. (2021). https://doi.org/10.1609/aaai.v35i12.17293 48. Zhang, J., Dong, Y., Wang, Y., Tang, J., Ding, M.: ProNE: fast and scalable network representation learning. In: IJCAI 2019, vol. 19, pp. 4278–4284 (2019) 49. Zhu, Y., Xu, Y., Yu, F., Liu, Q., Wu, S., Wang, L.: Deep graph contrastive representation learning. arXiv arXiv:2006.04131 [cs, stat] (2020) 50. Zhu, Y., Xu, Y., Yu, F., Liu, Q., Wu, S., Wang, L.: Graph contrastive learning with adaptive augmentation. In: Proceedings of the Web Conference 2021. ACM (2021). https://doi.org/10.1145/3442381.3449802 51. Zhu, Y., Zhou, D., Xiao, J., Jiang, X., Chen, X., Liu, Q.: HyperText: endowing FastText with hyperbolic geometry. In: Findings of the Association for Computational Linguistics, EMNLP 2020. Association for Computational Linguistics (2020). https://doi.org/10.18653/v1/2020.findings-emnlp.104

DRPDDet: Dynamic Rotated Proposals Decoder for Oriented Object Detection Jun Wang, Zilong Wang(B) , Yuchen Weng, and Yulian Li College of Information and Control Engineering, China University of Mining and Technology, Xuzhou, China [email protected]

Abstract. Oriented object detection has gained popularity in diverse fields. However, for two-stage detection algorithms, generating high-quality proposals with a high recall rate remains a formidable challenge, especially in remote sensing images where sparse and dense scenes coexist. To address this, we propose DRPDDet, which aims to improve the accuracy and recall of proposals for oriented object detection. Our approach generates high-quality horizontal proposals and dynamically decodes them into rotated proposals to predict the final rotated bounding boxes. To achieve high-quality horizontal proposals, we introduce the Harmony RPN module, which integrates foreground information from the RPN classification branch into the original feature maps, creating fused feature maps that incorporate multi-scale foreground information. As a result, the RPN generates horizontal proposals that focus more on foreground objects, which leads to improved regression performance. Additionally, we design a dynamic rotated proposals decoder that adaptively generates rotated proposals based on the constraints of the horizontal proposals, enabling accurate detection in complex scenes. We evaluate our proposed method on the DOTA and HRSC2016 remote sensing datasets, and the experimental results demonstrate its effectiveness in complex scenes. Our method improves the accuracy of proposals in various scenarios while maintaining a high recall rate. Keywords: Oriented object detection · Harmony RPN · Foreground information · Dynamic rotated proposals decoder

1 Introduction

(This work was supported by the Scientific Innovation 2030 Major Project for New Generation of AI, Ministry of Science and Technology of the People's Republic of China, No. 2020AAA0107300.)

Remote sensing images, with their diverse applications in military reconnaissance, urban planning, disaster monitoring, and more, have become increasingly significant with the advancement of remote sensing technology. Object detection plays a vital role as a key technique in remote sensing image processing. Significant advancements have been


made in deep learning-based object detection methods such as DETR [1], RetinaNet [2], EfficientDet [3], and many others [4–6] in recent years. However, these methods are designed to detect objects within horizontal rectangular boxes, which poses challenges for accurately detecting rotated objects in remote sensing datasets with large aspect ratios and dense arrangements. These limitations result in incomplete coverage of rotated objects and susceptibility to interference from surrounding elements. To overcome this challenge, researchers have proposed various rotated bounding box object detection methods [7–10], which generate rotated prediction boxes to better fit the rotated objects found in remote sensing imagery. The proposed approaches can generally be categorized into single-stage and multi-stage methods.

In the realm of single-stage methods, several strategies build upon the fundamental principles of RetinaNet. For instance, S2ANet [11] integrates a feature alignment module for optimal feature alignment together with an active rotation filter; this structure enables the classification and regression branches to leverage distinct features to tackle the problem of rotational invariance. R3Det [12] employs deformable convolution and overlapping pooling techniques to create a refinement stage, effectively handling changes in object orientation and scale. RSDet [13] advances this line further by introducing a residual rotation-sensitive unit, which enables the model to extract rotational features directly from the input feature maps, thereby improving the accuracy and robustness of rotated object detection.

On the other hand, multi-stage methods often adopt an anchor-free approach. For instance, EARL [14] employs an adaptive rotation label assignment strategy using an elliptical distribution. AOPG [15] improves the accuracy of oriented object detection through a direction-aware candidate box generation strategy. In a similar vein, DCFL [16] proposes a dynamic prior and a coarse-to-fine allocator to dynamically model prior information, label allocation, and object representation, which effectively alleviates misalignment. Nevertheless, anchor-free methods often suffer a reduction in accuracy due to the lack of prior information typically provided by anchor boxes.

In multi-stage methods that utilize anchor boxes, the design of anchors and proposal boxes has emerged as a significant area of research for rotated object detection. For instance, RRPN [17] introduces multiple anchors with varying angles, scales, and aspect ratios when generating rotated proposals. However, using multiple rotated anchors leads to an abundance of redundant candidate boxes, which increases the computational burden and complexity of subsequent processing. Moreover, such methods rely heavily on careful tuning of the anchor settings and parameters, requiring additional adjustments and adaptability analysis for different datasets and scenarios. To address these issues, Gliding Vertex [18] generates horizontal proposals using a reduced number of horizontal anchors, effectively mitigating the redundancy among candidate boxes. Nevertheless, AOPG's research reveals that in densely arranged scenes, horizontal proposals often encompass multiple objects, posing difficulties for accurate target localization and classification within horizontal RoIs by the RoI head.
Additionally, as horizontal proposals serve as the horizontal bounding box for regression targets, their shapes and sizes differ significantly from the actual regression targets. This mismatch compromises the model’s robustness. To ensure the accuracy of proposals, some methods, such as AOPG, SCRDet [19],


generate rotated proposals based on horizontal anchors. Conversely, research by R3Det suggests that rotated anchors outperform horizontal anchors in dense scenes, while the latter achieve higher recall rates in sparse scenes. Notably, horizontal proposals are better suited for targets with large aspect ratios, as they provide more comprehensive coverage of the overall target position. As a result, approaches like R3Det and RoI Transformer [20] adopt a multi-stage methodology: horizontal proposals are generated in the initial stage, and rotated proposals are derived from them in a subsequent stage. However, this introduces an extra learning stage, resulting in increased model complexity and additional training time.

In this study, we introduce a novel approach for oriented object detection by proposing a dynamic rotated proposals decoder that adaptively generates rotated proposals based on the constraints of the horizontal proposals. This approach effectively combines the strengths of both horizontal and rotated proposals, adapts to dense scenes, and achieves high recall rates in sparse scenes. An important advantage of our method is that it does not require an additional learning stage, which preserves the efficiency of network training and inference. However, this approach imposes higher precision requirements on the horizontal proposals. To address this challenge, we introduce a dedicated module called Harmony RPN, which is designed to meet the precision needs of horizontal proposals. In Harmony RPN, we integrate the multi-scale foreground information predicted by the RPN classification branch with the original feature maps generated by the FPN, and the fused feature maps are then fed into the RPN bounding box branch to generate proposals. Through this design, we leverage the foreground information extracted by the RPN classification branch, leading to a significant improvement in the accuracy of the horizontal proposals. Our research makes notable contributions in the following three aspects:

• We propose a novel dynamic rotated proposals decoder that intelligently generates rotated proposals based on the constraints of horizontal proposals. By leveraging the strengths of both horizontal and rotated proposals, our decoder significantly improves the accuracy and recall rate of object detection across various scenarios.

• We design a novel module called Harmony RPN, which enhances the performance of the bounding box branch by integrating multi-scale foreground information with the original feature maps.

• We integrate the proposals decoder and the Harmony RPN module into the Faster R-CNN architecture, leading to significant performance improvements. Experimental results demonstrate the effectiveness of our approach, achieving an mAP of 75.97% on the DOTA dataset and robust performance on the HRSC2016 dataset.

2 Proposed Method

2.1 The Overall Network Framework of DRPDDet

The workflow and submodules of our proposed method are outlined as follows. The whole framework of DRPDDet is depicted in Fig. 1(a). We employ ResNet [21] as the backbone network to extract image features and utilize the FPN [22] to fuse multi-scale feature maps. The Harmony RPN module is then employed to generate accurate


horizontal proposals, which are dynamically decoded by the DRPD module into rotated proposals. The feature maps corresponding to the rotated proposals are then fed into the classification and regression branches of the Rotated RoI (Rotated Region of Interest) head for multi-class probability prediction and rotated bounding box regression.
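The overall two-stage pipeline can be summarized with the following PyTorch-style skeleton. The module names (HarmonyRPN, DRPDDecoder, the rotated RoI head) and their interfaces are placeholders chosen for illustration; only the data flow follows the description above.

```python
import torch.nn as nn

class DRPDDetSketch(nn.Module):
    """Schematic forward pass: backbone -> FPN -> Harmony RPN ->
    dynamic rotated-proposal decoding -> rotated RoI head."""

    def __init__(self, backbone, fpn, harmony_rpn, drpd_decoder, rroi_head):
        super().__init__()
        self.backbone = backbone          # e.g. ResNet-50
        self.fpn = fpn                    # multi-scale feature fusion
        self.harmony_rpn = harmony_rpn    # horizontal proposals guided by foreground scores
        self.drpd_decoder = drpd_decoder  # each horizontal box -> four rotated proposals
        self.rroi_head = rroi_head        # classification + rotated box regression

    def forward(self, images):
        feats = self.fpn(self.backbone(images))              # multi-scale maps C1..C5
        horizontal_proposals = self.harmony_rpn(feats)        # high-quality horizontal boxes
        rotated_proposals = self.drpd_decoder(horizontal_proposals)
        cls_scores, rboxes = self.rroi_head(feats, rotated_proposals)
        return cls_scores, rboxes
```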

Fig. 1. The overall framework of DRPDDet.

The structure of Harmony RPN is illustrated in Fig. 1(b). It performs hierarchical fusion across scales by combining the feature maps output by the FPN with the score matrices from the RPN classification branch. The fused feature maps are then fed into the RPN bounding box branch to generate horizontal proposals. The decoding process of the DRPD is illustrated in Fig. 1(c): after the horizontal proposals are obtained, a series of rotated proposals is dynamically generated based on their constraints.

2.2 Harmony RPN

When the RPN classification network predicts the foreground scores for anchor boxes of different shapes at each feature point on the feature map, we can utilize the foreground scores of anchor boxes at various positions and shapes to predict the likely locations and shapes of the proposal boxes. This information can serve as prior knowledge for the RPN bounding box network, guiding the generation of proposal box positions and shapes. As depicted in Fig. 2, when the RPN bounding box branch predicts the offset of the black box, the left image illustrates that the foreground scores of the surrounding anchors provide prior information about the center point (x, y) of the black box, while the right image demonstrates that they provide prior information about its shape (w, h). Additionally, we observed that there are variations in the score matrices of different scales due to differences in receptive fields, scale, and resolution at each feature point


Fig. 2. Utilizing Anchor Foreground Scores for RPN Bounding Box Prediction.

Fig. 3. Cross-Scale Hierarchical Fusion Strategy in Harmony RPN.

[23]. However, these score matrices of different scales still offer guidance for anchors at other scales. For instance, on a large-scale feature map, the foreground scores from a small-scale feature map can provide crucial prior information about size when predicting larger bounding boxes [24]. Therefore, we adopt a cross-scale hierarchical fusion strategy to merge the output feature maps from the FPN with the foreground score matrices from the RPN classification branch. This strategy integrates multi-scale foreground score information into each feature map, which is then used for the prediction of the RPN bounding box branch. As illustrated in Fig. 3, during the fusion process we first employ the FPN feature fusion network to generate five feature maps of different scales. These feature maps have dimensions of H × W × 256, H/2 × W/2 × 256, H/4 × W/4 × 256, H/8 × W/8 × 256, and H/16 × W/16 × 256, respectively, and are denoted as C1, C2, C3, C4, and C5. Here, H and W represent the height and width of the largest (highest-resolution) feature map C1, while K denotes the number of anchor boxes generated per feature point.


For C1, we first obtain a foreground score matrix of size H × W × K through a 3 × 3 convolutional layer of the RPN classification branch. We then concatenate this score matrix to C1 along the channel dimension, resulting in a feature map of size H × W × (256 + K) denoted as C1′. Similarly, for C2 we obtain a foreground score matrix of size H/2 × W/2 × K through a 3 × 3 convolutional layer of the RPN classification branch. Before concatenating it with C2, we downsample the foreground score matrix of C1 to size H/2 × W/2 × K and add it to the foreground score matrix of C2, producing the fused foreground score matrix of C2; concatenation then yields C2′. The same operation is applied to C3, C4, and C5. Specifically, the fusion process for each layer is:

$S(C_i) = \mathrm{RPN\_cls}(C_i) + \mathrm{Pool}(\mathrm{RPN\_cls}(C_{i-1})), \quad i \in [2, 5]$  (1)

$C_i' = C_i \oplus S(C_i)$  (2)

$C_1' = C_1 \oplus \mathrm{RPN\_cls}(C_1)$  (3)

$Y = L(C_1', C_2', C_3', C_4', C_5')$  (4)

S(Cᵢ) denotes the cross-scale fused foreground score matrix of level i before concatenation with the feature map. RPN_cls(·) denotes the score matrices generated by the RPN classification branch, Pool(·) denotes the pooling (downsampling) operation, and L(·) combines the feature maps into a multi-scale feature map. ⟨C1′, C2′, C3′, C4′, C5′⟩ is the final fused feature map set containing the multi-scale foreground information, and ⊕ denotes channel-wise concatenation. Finally, the fused multi-scale foreground-information feature maps are fed into the RPN bounding box branch. In parallel, we adjust the number of input channels of the 3 × 3 convolutional layer in the RPN bounding box branch to 256 + K and set the number of output channels to 15, facilitating the prediction of bounding box parameters for the horizontal proposals. This completes the construction of Harmony RPN.
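A minimal sketch of the cross-scale hierarchical fusion in Eqs. (1)–(4) is given below, assuming PyTorch tensors of shape (N, 256, H_i, W_i) for the FPN maps, a shared 3 × 3 RPN classification convolution, and that each level exactly halves the spatial resolution. The choice of max pooling for Pool(·) and the module layout are illustrative assumptions, not the authors' implementation.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class HarmonyRPNFusion(nn.Module):
    """Cross-scale fusion of RPN foreground scores with FPN maps (Eqs. (1)-(4))."""

    def __init__(self, in_channels=256, num_anchors=3):  # K = num_anchors
        super().__init__()
        # 3x3 conv of the RPN classification branch: one foreground score per anchor.
        self.rpn_cls = nn.Conv2d(in_channels, num_anchors, kernel_size=3, padding=1)

    def forward(self, fpn_maps):
        # fpn_maps: [C1, ..., C5], ordered from the largest to the smallest resolution.
        scores = [self.rpn_cls(c) for c in fpn_maps]           # RPN_cls(C_i)
        fused_scores = [scores[0]]                             # level 1 has no previous level
        for i in range(1, len(fpn_maps)):
            # Pool(RPN_cls(C_{i-1})): downsample the previous-level scores by 2 (Eq. (1)).
            pooled = F.max_pool2d(scores[i - 1], kernel_size=2, stride=2)
            fused_scores.append(scores[i] + pooled)
        # Eqs. (2)-(3): concatenate each map with its fused score matrix along channels.
        fused_maps = [torch.cat([c, s], dim=1) for c, s in zip(fpn_maps, fused_scores)]
        return fused_maps, scores
```

In the detector, the (256 + K)-channel maps returned here would then be passed to the RPN bounding-box branch, whose first convolution takes 256 + K input channels as described above.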

Fig. 4. Visualization of 2000 proposals Generated by RPN and Harmony RPN.

To assess the performance of Harmony RPN, we conducted a series of visualization experiments to compare the generated proposals using different methods. Figure 4 shows the results, where the first row presents 2000 proposals generated by the conventional RPN. It can be observed that these proposals are relatively scattered and have low


accuracy in terms of position and size. In contrast, the proposals generated by Harmony RPN (second row) concentrate tightly around the target objects, leading to a significant improvement in proposal accuracy. This outcome substantiates the superiority of Harmony RPN.

2.3 Dynamic Rotated Proposals Decoder (DRPD)

In this module, our goal is to maximize the utilization of information from the horizontal proposals to generate more accurate rotated proposals without introducing an additional learning stage. We observe that the aspect ratio of a horizontal proposal directly affects the angle range of the rotated proposals: as the aspect ratio increases, the admissible angle range of the rotated proposal decreases, so the horizontal proposal provides correspondingly more information. This is because, during the training of the RPN bounding box network, we use the horizontal bounding boxes of the rotated ground-truth boxes as labels. After the RPN stage is trained, the final rotated proposals can be regarded as rotated rectangles inscribed in the horizontal proposals. Our calculations reveal that the angle range of these inscribed rotated rectangles is constrained by the aspect ratio of the horizontal proposals (as depicted in Fig. 5).

Fig. 5. Non-Inscribed Rotated Rectangle when the Rotation Angle Exceeds the Range.

During the generation of rotated proposals, our primary goal is to exclude boxes whose angles fall outside a specific range. Among the rotated proposals that satisfy the angle-range requirement, we design four inscribed rotated boxes with different angles and aspect ratios to optimize the recall rate for various objects [25] while minimizing the number of generated rotated proposals. In this way we make the best use of the prior information contained in the aspect ratio of the horizontal proposal and decode four rotated proposals from it. When constructing the four vertices of a rotated proposal from the four corners of the horizontal proposal, we adhere to three principles:
1. the four vertices of the rotated proposal lie on the four edges of the horizontal proposal;
2. the rotated proposal is a rotated rectangle;
3. the rotated proposals should be as robust as possible.
Figure 6 illustrates the precise process of decoding four rotated proposals from a given horizontal proposal. Initially, as depicted in Fig. 6(a), we denote the longer side of the horizontal proposal as h, the shorter side as w, and the center point as (x, y). If the longer side h corresponds to the left and right sides of the horizontal proposal, we select the top-left corner of the horizontal proposal as the first point, denoted A1. On the other hand, if the longer side corresponds to the top and bottom sides of


Fig. 6. Illustration of the DRPD Decoding Process.

the horizontal proposal, we choose the top-right corner of the horizontal proposal as the first point, also denoted A1. Starting from A1, we label the remaining three corners in a clockwise direction as A2, A3, and A4, so that the horizontal proposal is the quadrilateral A1A2A3A4. We determine the first vertex A1′ of the inscribed rotated proposal on the side A1A4. To ensure that the rotated proposal has 90-degree corners and that all four of its vertices lie on the edges of the horizontal proposal, the distance L between A1 and A1′, normalized by the side length h, must fall within a specific range, which we call the constrained range. It is computed as follows: taking the center point O of the horizontal proposal A1A2A3A4 as the center of a circle with radius h/2, we find the intersection point J between the circle and the side A1A4. For the rotated proposal to have 90-degree corners with all four vertices lying on the edges of the horizontal proposal, the first vertex A1′ can only lie on the line segment between A1 and J. Therefore, the constrained range of L is

$L \in \left[0,\ \frac{\frac{h}{2} - \sqrt{\left(\frac{h}{2}\right)^2 - \left(\frac{w}{2}\right)^2}}{h}\right] \cup \left[\frac{\frac{h}{2} + \sqrt{\left(\frac{h}{2}\right)^2 - \left(\frac{w}{2}\right)^2}}{h},\ 1\right].$

From this formula, it is evident that an increase in the aspect ratio of the horizontal proposal shrinks the constrained range. Consequently, the region in which the first vertex of the rotated proposal can lie becomes smaller, providing stronger prior information. The detailed procedure for generating rotated proposals using this prior information is as follows. In this paper, we set the vertex A1′ of the rotated proposal to the midpoint between A1 and J. Taking O as the center and h/2 as the radius, as depicted in Fig. 6(b), we find the points where the circle intersects A1A2, A2A3, and A3A4, respectively. The intersection points between the circle and A1A2, along the A1 → A2 direction, are denoted A2′ and A2′′, the lower intersection point between the circle and A2A3 is denoted A3′, and the intersection points between the circle and A3A4, along the A3 → A4 direction, are denoted A4′ and A4′′. With these points, we obtain two rotated proposals with vertices (A1′, A2′, A3′, A4′) and (A1′, A2′′, A3′, A4′′). Next, as depicted in Fig. 6(c), we mirror the two obtained rotated proposals horizontally about the center O, yielding another two rotated proposals. Through these steps, one horizontal proposal is directly converted into four rotated proposals. In this way, based on the high recall rate of


the horizontal proposals, we use dynamic decoding to obtain more accurate rotated proposals for the final, precise regression. Once the four vertices of a rotated proposal are known, we compute its parameters in the long-edge representation (x, y, w_j, h_j, θ_j), where j = 1, 2, 3, 4 indexes the four types of rotated proposals. Given the label (x*, y*, w*, h*, θ*), the true offset values for the four types of rotated proposals are

$L_x^*(j) = \frac{x^* - x}{w_j}, \quad L_y^*(j) = \frac{y^* - y}{h_j}$  (5)

$L_w^*(j) = \log\frac{w^*}{w_j}, \quad L_h^*(j) = \log\frac{h^*}{h_j}$  (6)

$L_\theta^*(j) = \frac{\theta^* - \theta_j}{\pi}$  (7)

Based on the parameters L_x(j), L_y(j), L_w(j), L_h(j), L_θ(j) predicted by the Rotated RoI bbox head, its loss function is computed as

$L_{reg}\left(L_n(j), L_n^*(j)\right) = \mathrm{smooth}_{L_1}\left(L_n(j) - L_n^*(j)\right), \quad n \in \{x, y, w, h, \theta\}$  (8)

By augmenting the training with proposals of various angles, we enhance the rotation sensitivity of the Rotated RoI bounding box head [26]. At the same time, the class labels assigned to the four types of rotated proposals remain consistent with those of the horizontal boxes, thereby increasing the rotation invariance of the RRoI classification head [27, 28].
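The two computations above can be sketched as follows, assuming the long-edge box representation (x, y, w, h, θ) used in the text. The function names are placeholders, and the use of (w/2)² inside the square root follows the reconstruction of the constrained-range formula given earlier, so this should be read as an illustrative interpretation rather than the authors' reference implementation.

```python
import math
import torch
import torch.nn.functional as F

def constrained_range(w: float, h: float):
    """Admissible range of the normalized distance L = |A1 A1'| / h for a
    horizontal proposal with short side w and long side h (h >= w)."""
    s = math.sqrt((h / 2) ** 2 - (w / 2) ** 2)
    low, high = (h / 2 - s) / h, (h / 2 + s) / h
    return (0.0, low), (high, 1.0)

def first_vertex_offset(w: float, h: float) -> float:
    """Place A1' at the midpoint of A1 and J, i.e. halfway along the first interval."""
    (_, low), _ = constrained_range(w, h)
    return 0.5 * low * h   # distance from corner A1 along the long side

def rotated_regression_targets(proposal, gt):
    """Eqs. (5)-(7): offsets of a ground-truth box (x*, y*, w*, h*, theta*)
    relative to one rotated proposal (x, y, w_j, h_j, theta_j)."""
    x, y, wj, hj, tj = proposal
    xs, ys, ws, hs, ts = gt
    return torch.tensor([
        (xs - x) / wj,
        (ys - y) / hj,
        math.log(ws / wj),
        math.log(hs / hj),
        (ts - tj) / math.pi,
    ])

def rroi_reg_loss(pred_offsets, proposal, gt):
    """Eq. (8): smooth-L1 regression loss of the Rotated RoI bbox head."""
    target = rotated_regression_targets(proposal, gt)
    return F.smooth_l1_loss(pred_offsets, target)
```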

3 Experiments

3.1 Datasets

This section introduces the two datasets used in the experiments, DOTA and HRSC2016, which are widely employed in the field of object detection to assess the performance of algorithms on aerial remote sensing images.

DOTA Dataset: DOTA (A Large-scale Dataset for Object Detection in Aerial Images) is a comprehensive dataset for aerial object detection. It comprises 2806 aerial images sourced from Google Earth satellite imagery and covers a wide range of object categories, such as airplanes, ships, vehicles, and basketball courts, in scenes including urban areas, rural areas, and ports. DOTA provides meticulous bounding box annotations, rotated box coordinates, and object category information, enabling the evaluation of detection algorithms' accuracy and robustness in complex remote sensing images.


HRSC2016 Dataset: The HRSC2016 (High-Resolution Ship Dataset) is a specialized dataset specifically tailored for ship detection in high-resolution aerial images. It encompasses 1061 aerial images acquired from Google Earth, featuring various types of ships like cargo ships, cruise ships, and bulk carriers. Each image is accompanied by rotated bounding box annotations and object category information for the ships. The HRSC2016 dataset is characterized by its high resolution and complex object shapes, rendering it suitable for evaluating the performance of aerial image object detection algorithms in ship detection tasks.

3.2 Experimental Details

This section provides detailed information about the implementation. For the DOTA dataset, in order to achieve simplicity and efficiency, we employed a ResNet50 backbone pre-trained on ImageNet. Unless explicitly stated, FPN was used as the neck network. The hyperparameter settings were consistent with those of Rotated Faster R-CNN; notably, the number of rotated proposals (RCNN) was set to 2048. We employed stochastic gradient descent (SGD) as the optimizer, with an initial learning rate of 0.0025, a momentum of 0.9, and a weight decay coefficient of 0.0001. To mitigate the risk of gradient explosion, we applied gradient clipping, restricting the maximum norm of the gradients to 35. For the HRSC2016 dataset, we resized the images to (800, 1333) while preserving their aspect ratio. The model was trained for 36 epochs, and the learning rate was reduced by a factor of 10 at the 24th and 33rd epochs. Training and testing were conducted on Tesla V100 GPUs with a batch size of 2. Mean average precision (mAP) with an IoU threshold of 0.5 was used as the evaluation metric.

3.3 Ablation Study

In this section, we conducted a series of ablation experiments on the DOTA dataset to demonstrate the advantages of each proposed component in DRPDDet. Table 1 presents the detailed results of the ablation experiments conducted on the DOTA1.0 dataset. The first row represents the performance of the baseline Rotated Faster R-CNN detector. By incorporating our Harmony RPN, as shown in the second row, the mAP score improved to 75.1.

Table 1. Ablation experiments and evaluations of our proposed method on the DOTA dataset.

| Model | SMMF | DRPD | mAP (%) |
| baseline | | | 73.4 |
| Ours | ✓ | | 75.1 |
| Ours | | ✓ | 75.2 |
| Ours | ✓ | ✓ | 75.5 |


Fig. 7. Example detection result of our method on DOTA.

Furthermore, as demonstrated in the third row, the dynamic rotated proposals decoder (DRPD) strategy brought an increase of 1.8 in mAP over the baseline. The combination of Harmony RPN and the DRPD strategy achieved a 2.1-point increase in mAP compared to the baseline model, reaching 75.5, as indicated in the last row of Table 1. These experimental results provide substantial evidence of the effectiveness of our approach, which integrates foreground-information feature maps and utilizes the dynamic rotated proposals decoder, thereby significantly improving object detection performance. The visualization results are shown in Fig. 7. To assess the generalizability and effectiveness of Harmony RPN, we also evaluated its performance when integrated into the RoI Transformer and Gliding Vertex networks, as shown in Table 2. The inclusion of Harmony RPN in both networks leads to notable performance enhancements. These experimental findings provide compelling evidence for the efficacy of Harmony RPN in improving proposal accuracy and recall rates.

Table 2. Demonstration of the effects of applying Harmony RPN to the RoI Transformer and Gliding Vertex networks.

| Method | RPN | Harmony RPN | mAP (%) |
| RoI Transformer | ✓ | | 69.56 |
| RoI Transformer | | ✓ | 71.82 |
| Gliding Vertex | ✓ | | 75.02 |
| Gliding Vertex | | ✓ | 75.84 |

3.4 Contrast Test

Results on the DOTA dataset: Table 3 presents the results of 14 oriented detectors, including DRN [29], PIoU [30], G-Rep [31], Hou [32], CFA [33], SASM [34], AOPG [15], and


EARL [14]. Our DRPDDet achieved an mAP of 75.46% with ResNet50-FPN and 75.97% with ResNet101-FPN, without employing any additional techniques. These scores surpass those of the other state-of-the-art oriented detection methods.

Table 3. Comparison with state-of-the-art methods on the DOTA1.0 dataset.

| Method | Backbone | PL | BD | BR | GTF | SV | LV | SH | TC | BC | ST | SBF | RA | HA | SP | HC | mAP |
| DRN | H104 | 88.91 | 80.22 | 43.52 | 63.35 | 73.48 | 70.69 | 84.94 | 90.14 | 83.85 | 84.11 | 50.12 | 58.41 | 67.62 | 68.60 | 52.50 | 70.70 |
| PIoU | DLA-34 | 80.90 | 69.70 | 24.10 | 60.20 | 38.30 | 64.40 | 64.80 | 90.90 | 77.20 | 70.40 | 46.50 | 37.10 | 57.10 | 61.90 | 64.00 | 60.50 |
| R3Det | R-101 | 88.76 | 83.09 | 50.91 | 67.27 | 76.23 | 80.39 | 86.72 | 90.78 | 84.68 | 83.24 | 61.98 | 61.35 | 66.91 | 70.63 | 53.94 | 73.79 |
| RSDet | R-101 | 89.80 | 82.90 | 48.60 | 65.20 | 69.50 | 70.10 | 70.20 | 90.50 | 85.60 | 83.40 | 62.50 | 63.90 | 65.60 | 67.20 | 68.00 | 72.20 |
| S2ANet | R-50 | 89.11 | 82.84 | 48.37 | 71.11 | 78.11 | 78.39 | 87.25 | 90.83 | 84.90 | 85.64 | 60.36 | 62.60 | 65.26 | 69.13 | 57.94 | 74.12 |
| G-Rep | R-101 | 88.89 | 74.62 | 43.92 | 70.24 | 67.26 | 67.26 | 79.80 | 90.87 | 84.46 | 78.47 | 54.59 | 62.60 | 66.67 | 67.98 | 52.16 | 70.59 |
| Hou | R-101 | 89.32 | 76.05 | 50.33 | 70.25 | 76.44 | 79.45 | 86.02 | 90.84 | 82.80 | 82.50 | 58.17 | 62.46 | 67.38 | 71.93 | 45.52 | 72.63 |
| RoI Trans | R-101 | 88.64 | 78.52 | 43.44 | 75.92 | 68.81 | 73.68 | 83.59 | 90.74 | 77.27 | 81.46 | 58.39 | 53.54 | 62.83 | 58.93 | 37.67 | 69.56 |
| SCRDet | R-101 | 89.98 | 80.65 | 52.09 | 68.36 | 68.36 | 60.32 | 72.41 | 90.85 | 87.94 | 86.86 | 65.02 | 66.68 | 66.25 | 68.24 | 65.21 | 72.61 |
| G. Vertex | R-101 | 89.64 | 85.00 | 52.26 | 77.34 | 73.01 | 73.14 | 86.82 | 90.74 | 79.02 | 86.81 | 59.55 | 70.91 | 72.94 | 70.86 | 57.32 | 75.02 |
| CFA | R-101 | 89.26 | 81.72 | 51.81 | 67.17 | 79.99 | 78.25 | 84.46 | 90.77 | 83.40 | 85.54 | 54.86 | 67.75 | 73.04 | 70.24 | 64.96 | 75.05 |
| SASM | R-50 | 86.42 | 78.97 | 52.47 | 69.84 | 77.30 | 75.99 | 86.72 | 90.89 | 82.63 | 85.66 | 60.13 | 68.25 | 73.98 | 72.22 | 62.37 | 74.92 |
| AOPG | R-101 | 89.14 | 82.74 | 51.87 | 69.28 | 77.65 | 82.42 | 88.08 | 90.89 | 86.26 | 85.13 | 60.60 | 66.30 | 74.05 | 66.76 | 58.77 | 75.39 |
| EARL | R-50 | 89.76 | 78.79 | 47.01 | 65.20 | 80.98 | 79.99 | 87.33 | 90.74 | 79.17 | 86.23 | 49.09 | 65.87 | 65.75 | 71.86 | 55.21 | 72.87 |
| DRPDDet | R-50 | 89.52 | 82.61 | 49.42 | 72.63 | 77.36 | 80.23 | 87.81 | 90.87 | 86.18 | 85.45 | 65.18 | 65.75 | 66.94 | 70.49 | 61.44 | 75.46 |
| DRPDDet | R-101 | 90.21 | 83.05 | 52.62 | 71.88 | 77.20 | 80.72 | 88.12 | 90.90 | 87.64 | 85.89 | 64.75 | 68.79 | 73.63 | 69.95 | 54.20 | 75.97 |

Our model exhibited excellent performance in detecting relatively sparse objects, such as airplanes and bridges. At the same time, for densely arranged objects like ships, our model achieved the best detection results. This fully demonstrates the strong adaptability of our method in handling both sparse and dense scenarios.

Table 4. Performance comparison of different state-of-the-art methods on the HRSC2016 dataset.

| Metric | RRPN | R2CNN | RoI Trans | G. Vertex | EARL | SASM | DRPDDet |
| mAP (VOC 07) | 79.08 | 73.07 | 86.20 | 88.20 | 89.00 | 88.91 | 90.23 |
| mAP (VOC 12) | 85.64 | 79.73 | * | * | 93.00 | * | 95.41 |

Results on HRSC2016: The HRSC2016 dataset comprises ship targets with large aspect ratios. The experimental results on this dataset confirm the superiority of our method. As shown in Table 4, our method outperforms the other approaches. The mAP of DRPDDet reaches 90.23% and 95.41% when evaluated with the VOC07 and VOC12 metrics, respectively.


4 Conclusion

In this study, we conducted a thorough analysis of the key challenges in remote sensing object detection, focusing on the adaptability of horizontal and rotated proposals in dense and sparse scenes. To address these challenges, we introduced a two-stage strategy that combines the advantages of both horizontal and rotated proposals: horizontal proposals are generated in the RPN stage to ensure a high recall, and a dynamic rotated proposals decoder then adaptively generates rotated proposals under the constraints of the horizontal proposals. Furthermore, we designed the Harmony RPN module and integrated it into the two-stage object detection network. The experimental results demonstrate that our approach significantly improves the precision and recall of proposals, leading to substantial gains in overall detection performance. Harmony RPN employs a cross-scale hierarchical fusion strategy to incorporate foreground scores into the feature maps. While this enriches the features, it inevitably increases the number of model parameters. In future work, we plan to encode the foreground information into a higher-dimensional feature space and interactively integrate these enhanced representations with the feature maps, which could further improve the model's ability to distinguish and capture complex feature interactions.

References 1. Carion, N., Massa, F., Synnaeve, G., Usunier, N., Kirillov, A., Zagoruyko, S.: End-to-end object detection with transformers. In: Vedaldi, A., Bischof, H., Brox, T., Frahm, J.-M. (eds.) ECCV 2020. LNCS, vol. 12346, pp. 213–229. Springer, Cham (2020). https://doi.org/10. 1007/978-3-030-58452-8_13 2. Lin, T.Y., Goyal, P., Girshick, R., et al.: Focal loss for dense object detection. In: Proceedings of IEEE International Conference on Computer Vision, pp. 2980–2988 (2017) 3. Tan, M., Pang, R., Le, Q.V.: EfficientDet: scalable and efficient object detection. In: Proceedings of IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 10781–10790 (2020) 4. Lee, G., Kim, J., Kim, T., et al.:. Rotated-DETR: an end-to-end transformer-based oriented object detector for aerial images. In Proceedings of ACM/SIGAPP Symposium on Applied Computing, pp. 1248–1255 (2023) 5. Pu, Y., Wang, Y., Xia, Z., et al.: Adaptive rotated convolution for rotated object detection. arXiv preprint arXiv:2303.07820 (2023) 6. Liu, F., Chen, R., Zhang, J., Ding, S., Liu, H., Ma, S., Xing, K.: ESRTMDet: an end-to-end super-resolution enhanced real-time rotated object detector for degraded aerial images. IEEE J. Sel. Topics Appl. Earth Observ. Remote Sensing 16, 4983–4998 (2023) 7. Wang, C.Y., Bochkovskiy, A., Liao, H.Y.M.: YOLOv7: trainable bag-of-freebies sets new state-of-the-art for real-time object detectors. In: Proceedings of IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 7464–7475 (2023) 8. Li, F., Zhang, H., Xu, H., et al.: Mask Dino: towards a unified transformer-based framework for object detection and segmentation. In: Proceedings of IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 3041–3050 (2023) 9. Liu, S., Zeng, Z., Ren, T., et al.: Grounding dino: marrying dino with grounded pre-training for open-set object detection. arXiv preprint arXiv:2303.05499 (2023)


10. Zhou, Q., Yu, C.: Object detection made simpler by eliminating heuristic NMS. IEEE Trans. Multimedia 1–10 (2023). 11. Han, J., Ding, J., Li, J., et al.: Align deep features for oriented object detection. IEEE Trans. Geosci. Remote Sens. 60(1), 1–11 (2021) 12. Yang, X., Yan, J., Feng, Z., et al.: R3Det: refined single-stage detector with feature refinement for rotating object. In: Proceedings of AAAI Conference on Artificial Intelligence, pp. 3163– 3171 (2021) 13. Qian, W., Yang, X., Peng, S., et al.: Learning modulated loss for rotated object detection. In: Proceedings of AAAI Conference on Artificial Intelligence, pp. 2458–2466 (2021) 14. Guan, J., Xie, M., Lin, Y., et al.: EARL: an elliptical distribution aided adaptive rotation label assignment for oriented object detection in remote sensing images. arXiv preprint arXiv:2301. 05856 (2023) 15. Cheng, G., Wang, J., Li, K., et al.: Anchor-free oriented proposals generator for object detection. IEEE Trans. Geosci. Remote Sens. 60(1), 1–11 (2022) 16. Xu, C., Ding, J., Wang, J., et al.: Dynamic coarse-to-fine learning for oriented tiny object detection. In: Proceedings of IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 7318–7328 (2023) 17. Nabati, R., Qi, H.: RRPN: radar region proposals network for object detection in autonomous vehicles. In: Proceedings of IEEE International Conference on Image Processing, pp. 3093– 3097 (2019) 18. Xu, Y., Fu, M., Wang, Q., et al.: Gliding vertex on the horizontal bounding box for multioriented object detection. IEEE Trans. Pattern Anal. Mach. Intell. 43(4), 1452–1459 (2020) 19. Yang, X., Yang, J., Yan, J., et al.. SCRDet: towards more robust detection for small, cluttered and rotated objects. In: Proceedings of IEEE/CVF International Conference on Computer Vision, pp. 8232–8241 (2019) 20. Ding, J., Xue, N., Long, Y., et al.: Learning ROI transformer for oriented object detection in aerial images. In: Proceedings of IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 2849–2858 (2019) 21. He, K., Zhang, X., Ren, S., et al.: Deep residual learning for image recognition. In: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, pp. 770–778 (2016) 22. Lin, T.Y., Dollár, P., Girshick, R., et al.: Feature pyramid networks for object detection. In: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, pp. 2117– 2125 (2017) 23. Liu, W., Anguelov, D., Erhan, D., Szegedy, C., Reed, S., Cheng-Yang, Fu., Berg, A.C.: SSD: single shot multibox detector. In: Leibe, B., Matas, J., Sebe, N., Welling, M. (eds.) ECCV 2016. LNCS, vol. 9905, pp. 21–37. Springer, Cham (2016). https://doi.org/10.1007/978-3319-46448-0_2 24. Zhu, C., He, Y., Savvides, M.: Feature selective anchor-free module for single-shot object detection. In: Proceedings of IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 840–849 (2019) 25. Kong, T., Yao, A., Chen, Y., et al.: HyperNet: towards accurate region proposals generation and joint object detection. In: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, pp. 845–853 (2016) 26. Li, P., Zhao, H., Liu, P., Cao, F.: RTM3D: real-time monocular 3D detection from object keypoints for autonomous driving. In: Vedaldi, A., Bischof, H., Brox, T., Frahm, J.-M. (eds.) ECCV 2020. LNCS, vol. 12348, pp. 644–660. Springer, Cham (2020). https://doi.org/10. 1007/978-3-030-58580-8_38 27. Kirillov, A., Wu, Y., He, K., et al.. PointRend: Image segmentation as rendering. 
In: Proceedings of IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 9799–9808 (2020)


28. Pu, Y., Wang, Y., Xia, Z., et al.: Adaptive Rotated Convolution for Oriented Object Detection. arXiv preprint arXiv:2303.07820 (2023) 29. Pan, X., Ren, Y., Sheng, K., et al.: Dynamic refinement network for oriented and densely packed object detection. In: Proceedings of IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 11207–11216 (2020) 30. Chen, Z., Chen, K., Lin, W., See, J., Hui, Yu., Ke, Y., Yang, C.: Piou loss: towards accurate oriented object detection in complex environments. In: Vedaldi, A., Bischof, H., Brox, T., Frahm, J.-M. (eds.) ECCV 2020. LNCS, vol. 12350, pp. 195–211. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-58558-7_12 31. Hou, L., Lu, K., Yang, X., Li, Y., Xue, J.: Grep: Gaussian representation for arbitrary-oriented object detection. arXiv preprint arXiv:2205.11796 (2022) 32. Hou, L., Lu, K., Xue, J.: Refined one-stage oriented object detection method for remote sensing images. IEEE Trans. Image Process. 31, 1545–1558 (2022) 33. Guo, Z., Liu, C., Zhang, X., et al.: Beyond bounding-box: convex-hull feature adaptation for oriented and densely packed object detection. In: Proceedings of IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 8792–8801 (2021) 34. Hou, L., Lu, K., Xue, J., et al.: Shape-adaptive selection and measurement for oriented object detection. In: Proceedings of the AAAI Conference on Artificial Intelligence, pp. 923–932 (2022)

MFSFFuse: Multi-receptive Field Feature Extraction for Infrared and Visible Image Fusion Using Self-supervised Learning

Xueyan Gao and Shiguang Liu(B)

College of Intelligence and Computing, Tianjin University, Tianjin 300350, China [email protected]

Abstract. Infrared and visible image fusion aims to fuse complementary information from different modalities to improve image quality and resolution and to facilitate subsequent visual tasks. Most current fusion methods suffer from incomplete feature extraction or redundancy, resulting in indistinct targets or lost texture details. Moreover, infrared and visible image fusion lacks ground truth, and fusion results obtained by training the network in an unsupervised manner may also lose important features. To address these problems, we propose MFSFFuse, an infrared and visible image fusion method using self-supervised learning. We introduce a multi-receptive-field dilated convolution block that extracts multi-scale features using dilated convolutions, and employ different attention modules to enhance information extraction in the two branches. Furthermore, a dedicated loss function is devised to guide the optimization of the model toward an ideal fusion result. Extensive experiments show that, compared to state-of-the-art methods, our method achieves competitive results in both quantitative and qualitative evaluations.

Keywords: Infrared and Visible Image · Image Fusion · Multi-receptive Field Feature Extraction · Self-supervised

1 Introduction

(This work was partly supported by the Natural Science Foundation of China under grant 62072328.)

Image fusion aims to fuse images captured from different sensors or with different shooting settings into an informative image to enhance the understanding of scene information [1,2]. Among image fusion tasks, infrared and visible image fusion is the most widely used. Infrared images are sensitive to thermal radiation and are not affected by the working environment, so they can highlight salient targets, but they have low spatial resolution and lack texture details. Compared with infrared images, visible images are easily affected by the working environment but have higher spatial resolution and richer texture details [3,4]. Fusing these two images with complementary properties can yield


a fusion image that contains both rich texture details and salient targets, which can be used for subsequent tasks such as image segmentation, target detection, tracking, and military applications. In the past years, infrared and visible image fusion has developed greatly, and many fusion methods have been proposed. The existing fusion methods mainly include traditional methods and deep learning-based methods. The traditional methods can be classified into five categories [5]: 1) multi-scale transform-based fusion methods, 2) sparse representation-based fusion methods, 3) sub-space-based fusion methods, 4) saliency-based fusion methods, and 5) hybrid methods. Although traditional methods have achieved good fusion results, there are still some shortcomings: 1) the fusion performance depends on the extraction of manual features and the design of fusion rules; 2) for complex source images, the extraction of manual features and the design of fusion rules tend to be complicated, which is time-consuming and difficult to implement. In recent years, deep learning has seen rapid development, and its powerful feature extraction and data representation capabilities have drawn significant attention from researchers, with a growing number applying deep learning to image fusion. Due to the lack of ground truth for infrared and visible image fusion, most deep learning-based fusion methods train the network with unsupervised learning to obtain the fusion image. According to the adopted network framework, they can be divided into three categories: CNN-based fusion methods [3,6–8], auto-encoder-based fusion methods [9,10], and GAN-based fusion methods [11,12]. Although the fusion performance of existing deep learning-based methods has improved to some extent, the following challenges remain:

a) Limited performance of some unsupervised fusion methods: Due to the lack of ground truth for infrared and visible image fusion, most methods generate fusion results by designing a loss function that constrains the fusion image against the source images. However, such fusion results may approximate a compromise between the source images, which can result in the loss of important features.

b) Failure to fully leverage the correlation between features: Some existing fusion methods use a single convolution kernel to extract features, which yields relatively homogeneous features. Others use multiple different convolution kernels at the same level to obtain multi-scale features; however, each kernel performs its convolution independently, which may lead to redundancy in the extracted information, so the model fails to exploit the correlation between features effectively while also increasing computational complexity.

c) Inadequate consideration of modality differences: Many deep learning-based fusion algorithms do not adequately account for the inherent differences between the infrared and visible modalities. They often employ the same feature extraction strategy for both modalities, which may not effectively

120

X. Gao and S. Liu

extract the most relevant features for optimal fusion. This limitation can lead to suboptimal fusion performance.

To address the challenges mentioned above, we propose a novel approach named MFSFFuse, which fuses infrared and visible images by extracting features with multiple receptive fields under a self-supervised learning scheme. The key contributions of this paper can be summarized as follows:

1) To address the limitations of certain unsupervised fusion methods, we propose a self-supervised fusion network. Our approach utilizes multi-receptive field dilated convolution blocks to extract features from both infrared and visible images and concatenates the features of the two branches along the channel dimension to generate a fusion image. We then decompose the fusion image back into infrared and visible images and apply constraints between these decomposed images and the source images to achieve self-supervision.

2) To address the limitation that the correlation between features cannot be fully exploited, we introduce a novel convolutional module with multiple receptive fields. This module uses dilated convolutions with varying dilation factors to construct a multi-receptive field convolution module. It overcomes the information redundancy and heavy computation caused by traditional multi-receptive field convolution. Moreover, it provides a wider range of context information, enabling the model to capture long-range dependencies in the image, extract multi-scale features better, and improve the fusion performance of the model.

3) Considering the modality difference between infrared and visible images, we introduce two distinct attention modules into the feature extraction branches to enhance fusion performance. The first is an intensity attention module, which selectively focuses on the features most important for the fusion result. The second is a detail attention module, which further enhances the fusion result by attending to the most significant details in the extracted features. These attention mechanisms allow us to selectively attend to the most relevant features in each modality and produce fused images with superior quality.

2 Proposed Method

In this section, we introduce MFSFFuse, a novel method for fusing infrared and visible images. We first present the overall framework of the proposed approach, followed by a detailed explanation of the network structure. Finally, we describe the loss function employed in our method.

2.1 Overall Framework

Fig. 1. The overall network framework of the proposed method. d is the dilation factor in the dilated convolution. loss(x, y) represents the loss between x and y. ir-path and vi-path represent the infrared and visible image branches, respectively. ir and vi are the infrared and visible images, respectively. f denotes the fusion image; ir1 and vi1 represent the reconstructed infrared and visible images after decomposition. The self-supervised pattern is formed between the decomposed and reconstructed images and their corresponding source images.

Since infrared and visible images belong to different modalities, we use an ir-path and a vi-path to extract multi-scale features from the source images respectively, so as to obtain more comprehensive features of the images in the different modalities. The infrared sensor is valuable for highlighting significant targets, while the visible sensor excels at capturing intricate texture details. These distinctive characteristics provide the fusion image with prominent target information and detailed texture information, respectively. To maximize the utilization of meaningful features from both image types during fusion, we incorporate two attention modules into the two branches: an intensity attention module and a detail attention module. These modules allow us to focus on the features that are most relevant for the fusion image. The features extracted by the two branches are then concatenated along the channel dimension and passed through a convolutional layer to obtain the fusion image. Because the fusion of infrared and visible images lacks ground truth, a fusion result obtained only by constraining the fusion image against the source images through a loss function tends to be a compromise of the source images, which easily leads to the loss of important features. To address this, we reconstruct the source images in a self-supervised manner in the decomposition module. This helps to ensure that the fusion image contains more of the important features from the source images. The overall framework of our proposed method is shown in Fig. 1.
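To make the fuse-then-decompose pattern described above concrete, the following is a minimal PyTorch sketch. The class and argument names (FuseThenDecompose, ir_branch, vi_branch, decomposer, feat_ch) are illustrative assumptions, not the authors' released code; the actual branch and decomposer architectures are described in Sects. 2.2 and 2.3.

```python
import torch
import torch.nn as nn


class FuseThenDecompose(nn.Module):
    """Sketch of the overall pattern: two feature-extraction branches, channel
    concatenation, a 1x1 fusion convolution with Tanh, and a decomposition head
    that reconstructs both source images so they can be compared against the
    inputs for self-supervision."""

    def __init__(self, ir_branch: nn.Module, vi_branch: nn.Module,
                 decomposer: nn.Module, feat_ch: int = 64):
        super().__init__()
        self.ir_branch = ir_branch          # ir-path (MRF-DCBs + IAMs)
        self.vi_branch = vi_branch          # vi-path (MRF-DCBs + DAMs)
        self.fuse = nn.Sequential(nn.Conv2d(2 * feat_ch, 1, kernel_size=1),
                                  nn.Tanh())
        self.decomposer = decomposer        # returns (ir1, vi1) from f

    def forward(self, ir, vi):
        feat = torch.cat([self.ir_branch(ir), self.vi_branch(vi)], dim=1)
        f = self.fuse(feat)                 # fusion image
        ir1, vi1 = self.decomposer(f)       # decomposed / reconstructed images
        return f, ir1, vi1
```

During training, f is compared against ir and vi by the content and SSIM losses, while ir1 and vi1 are compared against ir and vi by the self-supervision loss described in Sect. 2.4.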

2.2 Feature Extraction and Fusion Module

To obtain comprehensive feature information from both infrared and visible images, we use different paths to extract features from the source images. In the ir-path and the vi-path, feature extraction involves four MRF-DCBs. Additionally, to fully preserve intensity and detail information at different layers, we introduce four cascaded IAMs and DAMs in the ir-path and the vi-path, respectively. Meanwhile, we use a residual connection structure to improve feature reuse (as shown in Fig. 1). The fusion component consists of a convolution with a kernel size of 1 × 1 and the Tanh activation function. The features extracted by the two branches are concatenated and then fed into the fusion part to generate a fusion image with comprehensive feature information from the infrared and visible images.

MRF-DCB: Each MRF-DCB comprises three dilated convolutions (kernel size 3 × 3) with varying dilation factors (d = 1, 2, 3) to extract features at different scales. Furthermore, a residual connection structure is utilized to improve feature utilization.

IAM: Assume an input feature $F \in \mathbb{R}^{H \times W \times C}$ and an output feature $X \in \mathbb{R}^{H \times W \times C}$, where $H$ and $W$ are the height and width of the feature map and $C$ is the number of channels. The calculation of the IAM can be described as:

$$X_{ir} = \mathrm{Sigmoid}(\mathrm{conv}(\mathrm{GAP}(F))) \odot F, \qquad (1)$$

where GAP stands for global average pooling, conv refers to a 1-D convolution operation, Sigmoid is the sigmoid function, and $\odot$ denotes the Hadamard product.

DAM: The calculation of the DAM can be described as:

$$X_{vi} = \mathrm{Sigmoid}(\mathrm{BN}(\mathrm{conv7}(\mathrm{Re}(\mathrm{BN}(\mathrm{conv7}(F)))))) \odot F, \qquad (2)$$

where BN represents Batch Normalization, conv7 represents a convolution operation with a kernel size of 7 × 7, and Re represents the ReLU activation function.
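The three blocks can be sketched in PyTorch as follows. This is an illustration under stated assumptions: how the three dilated branches of the MRF-DCB are merged and the 1-D kernel size in the IAM are not specified in the text, so the merge convolution and k=3 below are assumptions.

```python
import torch
import torch.nn as nn


class MRFDCB(nn.Module):
    """Multi-receptive field dilated conv block: three 3x3 convolutions with
    dilation factors 1/2/3 plus a residual connection (merge rule assumed)."""
    def __init__(self, ch):
        super().__init__()
        self.branches = nn.ModuleList(
            [nn.Conv2d(ch, ch, 3, padding=d, dilation=d) for d in (1, 2, 3)])
        self.merge = nn.Conv2d(3 * ch, ch, kernel_size=1)
        self.act = nn.ReLU(inplace=True)

    def forward(self, x):
        y = torch.cat([b(x) for b in self.branches], dim=1)
        return self.act(self.merge(y) + x)      # residual connection


class IAM(nn.Module):
    """Intensity attention, Eq. (1): Sigmoid(conv1d(GAP(F))) applied to F."""
    def __init__(self, k=3):                    # k assumed, not given in the text
        super().__init__()
        self.conv = nn.Conv1d(1, 1, kernel_size=k, padding=k // 2, bias=False)

    def forward(self, f):
        w = f.mean(dim=(2, 3))                          # GAP -> (B, C)
        w = self.conv(w.unsqueeze(1)).squeeze(1)        # 1-D conv over channels
        return torch.sigmoid(w)[:, :, None, None] * f   # Hadamard product


class DAM(nn.Module):
    """Detail attention, Eq. (2): Sigmoid(BN(conv7(ReLU(BN(conv7(F)))))) * F."""
    def __init__(self, ch):
        super().__init__()
        self.body = nn.Sequential(
            nn.Conv2d(ch, ch, 7, padding=3), nn.BatchNorm2d(ch),
            nn.ReLU(inplace=True),
            nn.Conv2d(ch, ch, 7, padding=3), nn.BatchNorm2d(ch))

    def forward(self, f):
        return torch.sigmoid(self.body(f)) * f
```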

2.3 Decomposition Module

Infrared and visible images belong to different modalities, their features differ, and their fusion lacks ground truth. Optimizing the model to generate fusion images only by constraining the relationship between the fusion image and the source images may therefore cause the loss of important features. We thus use a decomposition module to decompose and reconstruct the fusion image into its corresponding source images, achieving self-supervision by constraining the decomposed images against the source images. This module has two branches: one branch reconstructs the infrared image and the other reconstructs the visible image. Each branch includes three MRF-DCBs, and finally a 1 × 1 convolution and the Tanh activation function are applied to generate the reconstructed image. In our approach, we treat the source image as the reference image, leveraging a loss function to enforce consistency between the decomposed image and
the reference image. This process facilitates the reconstruction of the source image and effectively achieves self-supervision, thus enabling the fusion image to obtain more information from the source images.

2.4 Loss Function

To improve the visual quality of the fusion image and preserve important features such as texture details and salient objects, we design a loss function $L_{loss}$ to guide the optimization of the model, written as

$$L_{loss} = L_{content} + \alpha L_{ssim} + L_s, \qquad (3)$$

where $L_{content}$ represents the content loss, $L_{ssim}$ denotes the structural similarity loss, and $L_s$ is the loss used to train the self-supervised network. $\alpha$ is a trade-off parameter controlling the balance between the terms. By optimizing the model with this loss function, we can produce high-quality fusion images that accurately capture the most important features of the source images. The content loss $L_{content}$ forces the fusion image to have richer texture details and to preserve salient target information; it is defined as:

$$L_{content} = L_{int} + \beta L_{detail}, \qquad (4)$$

where $L_{int}$ denotes the intensity loss, $L_{detail}$ represents the detail loss, and $\beta$ is a trade-off parameter. In the intensity loss, we use the mean squared error (MSE) as the loss function, defined as follows:

$$L_{int} = \gamma_1\, \mathrm{MSE}(I_f, I_{vi}) + \gamma_2\, \mathrm{MSE}(I_f, I_{ir}), \qquad (5)$$

where $I_f$, $I_{ir}$ and $I_{vi}$ stand for the fusion image, the infrared image, and the visible image, respectively. $\gamma_1$ and $\gamma_2$ are two weight parameters utilized to regulate the balance of the loss values. In the detail loss function, we incorporate a maximum-gradient operation. We assume that the texture details present in the fused image are the maximum combination of the textures from the infrared and visible images. The detail loss function is defined as follows:

$$L_{detail} = \frac{\big\|\, |\nabla I_f| - \max(|\nabla I_{ir}|, |\nabla I_{vi}|)\, \big\|_1}{HW}, \qquad (6)$$

where $\nabla$ represents the Sobel gradient operator, $\|\cdot\|_1$ is the $l_1$-norm, $I_f$, $I_{ir}$ and $I_{vi}$ stand for the fusion image, the infrared image, and the visible image, respectively, $H$ and $W$ denote the height and width of the image, $|\cdot|$ denotes the absolute value, and $\max(\cdot)$ refers to element-wise maximum selection. To force the fusion image and the source images to have similar structures, we add a modified structural similarity [13] loss $L_{ssim}$ to the loss function $L_{loss}$, which is defined as

$$\mathrm{SSIM} =
\begin{cases}
\mathrm{SSIM}(I_{vi}, I_f) & \text{if } \sigma^2(I_{vi}) > \sigma^2(I_{ir}) \\
\mathrm{SSIM}(I_{ir}, I_f) & \text{if } \sigma^2(I_{ir}) \ge \sigma^2(I_{vi})
\end{cases}, \qquad (7)$$
where $L_{SSIM} = 1 - \mathrm{SSIM}$, $I_f$ denotes the fusion image, $I_{vi}$ stands for the visible image, $I_{ir}$ is the infrared image, and $\sigma^2$ represents the variance. SSIM calculates the structural similarity of two images: the larger its value, the more similar the two images. Therefore, we take $(1 - \mathrm{SSIM})$ when calculating the loss. For the training of the self-supervised network, we adopt the standard mean squared error (MSE) as the loss function, defined as follows:

$$L_s = \mathrm{MSE}(I_{vi1}, I_{vi}) + \mathrm{MSE}(I_{ir1}, I_{ir}), \qquad (8)$$

where $I_{vi1}$ and $I_{ir1}$ are the results of fusion image decomposition and reconstruction.
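The full objective of Eqs. (3)-(8) can be sketched in PyTorch as below. This is a sketch, not the authors' implementation: ssim_fn stands for any differentiable SSIM routine supplied by the user, single-channel inputs are assumed, and the variance-based selection in Eq. (7) is simplified to a per-batch decision.

```python
import torch
import torch.nn.functional as F


def sobel_grad(img):
    """Absolute Sobel gradient magnitude |grad I| for a (B, 1, H, W) image."""
    kx = torch.tensor([[-1., 0., 1.], [-2., 0., 2.], [-1., 0., 1.]],
                      device=img.device).view(1, 1, 3, 3)
    ky = kx.transpose(2, 3)
    gx = F.conv2d(img, kx, padding=1)
    gy = F.conv2d(img, ky, padding=1)
    return gx.abs() + gy.abs()


def mfsffuse_loss(f, ir, vi, ir1, vi1, ssim_fn,
                  alpha=100., beta=50., gamma1=10., gamma2=30.):
    """Sketch of Eqs. (3)-(8); weight values follow Sect. 3.1."""
    # Eq. (5): intensity loss
    l_int = gamma1 * F.mse_loss(f, vi) + gamma2 * F.mse_loss(f, ir)
    # Eq. (6): detail loss with the maximum-gradient assumption
    l_detail = (sobel_grad(f)
                - torch.maximum(sobel_grad(ir), sobel_grad(vi))).abs().mean()
    # Eq. (4): content loss
    l_content = l_int + beta * l_detail
    # Eq. (7): choose the SSIM reference image by source-image variance
    ref = vi if vi.var() > ir.var() else ir
    l_ssim = 1.0 - ssim_fn(ref, f)
    # Eq. (8): self-supervision on the decomposed images
    l_s = F.mse_loss(vi1, vi) + F.mse_loss(ir1, ir)
    # Eq. (3): total loss
    return l_content + alpha * l_ssim + l_s
```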

3 Experiments

3.1 Datasets and Training Details

In this study, we randomly chose 32 image pairs from the TNO dataset (https://figshare.com/articles/TN_Image_Fusion_Dataset/1008029) for training our model, while reserving the remaining images for testing. To ensure a comprehensive assessment of the fusion performance and generalization capability of the proposed method, we also performed quantitative and qualitative analyses on the LLVIP dataset [14], the MSRS dataset (https://github.com/Linfeng-Tang/MSRS), and the M3FD dataset [15]. All image pairs used in our experiments are pre-registered. During training, because an ideal model cannot be obtained when the amount of data is too small, we use a cropping strategy to expand the training data: the source images are cropped into image patch pairs of size 120 × 120 with a cropping stride of 12. We use the Adam optimizer to update the parameters, the batch size is set to 16, and the learning rate is set to 1e-4. Based on extensive experiments, we set the weight parameters in the loss function to α = 100, β = 50, γ1 = 10, and γ2 = 30. Our method is implemented in the PyTorch framework and trained on a computer with a 3.10-GHz Intel Core i9-9900 CPU, 32 GB RAM, and an NVIDIA RTX 2060 GPU.
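The patch-based data expansion can be sketched as follows. The function name crop_patch_pairs and the (C, H, W) tensor layout are assumptions for illustration; only the 120 × 120 patch size and stride of 12 come from the text.

```python
import torch


def crop_patch_pairs(ir, vi, size=120, stride=12):
    """Slide a size x size window with the given stride over a co-registered
    (ir, vi) pair and collect patch pairs. Inputs are (C, H, W) tensors."""
    ir_patches = ir.unfold(1, size, stride).unfold(2, size, stride)
    vi_patches = vi.unfold(1, size, stride).unfold(2, size, stride)
    c = ir.shape[0]
    ir_patches = ir_patches.reshape(c, -1, size, size).permute(1, 0, 2, 3)
    vi_patches = vi_patches.reshape(c, -1, size, size).permute(1, 0, 2, 3)
    return ir_patches, vi_patches   # each (N, C, size, size)
```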

3.2 Fusion Metrics

To further demonstrate the performance of the proposed method, in addition to the qualitative experiments we select several representative evaluation metrics to quantitatively evaluate the fusion results [20]: standard deviation (SD), spatial frequency (SF), average gradient (AG), and entropy (EN). Larger values of these metrics indicate better fusion results. SD reflects the distribution and contrast of the fusion image. SF measures the gradient distribution of an image and evaluates image texture and details. AG quantifies the gradient information of the fusion image, reflecting its details and textures. EN measures the amount of information contained in the fusion image.
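Since the paper does not give explicit formulas for these metrics, the sketch below uses their common formulations; the exact conventions (e.g., image range, histogram bins) used in the paper may differ, and the function name fusion_metrics is ours.

```python
import numpy as np


def fusion_metrics(img):
    """Common formulations of EN, SD, SF, AG for a grayscale fusion image
    given as a float array in [0, 255]."""
    x = img.astype(np.float64)
    # EN: Shannon entropy of the grey-level histogram
    hist, _ = np.histogram(x, bins=256, range=(0, 255))
    p = hist / hist.sum()
    en = -np.sum(p[p > 0] * np.log2(p[p > 0]))
    # SD: standard deviation (distribution / contrast)
    sd = x.std()
    # SF: spatial frequency from row and column differences
    rf = np.sqrt(np.mean(np.diff(x, axis=1) ** 2))
    cf = np.sqrt(np.mean(np.diff(x, axis=0) ** 2))
    sf = np.sqrt(rf ** 2 + cf ** 2)
    # AG: average gradient
    gx, gy = np.gradient(x)
    ag = np.mean(np.sqrt((gx ** 2 + gy ** 2) / 2.0))
    return {"EN": en, "SD": sd, "SF": sf, "AG": ag}
```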

3.3 Comparative Experiment

To demonstrate the effectiveness of our proposed method, we compare it with several state-of-the-art image fusion methods, including two traditional methods (GTF [16] and LatLRR [17]) and ten deep learning methods (FusionGAN [11], PMGI [7], U2Fusion [1], SDNet [8], RFN-Nest [10], SuperFusion [18], SwinFuse [19], SeAFusion [3], TarDAL [15], and AT-GAN [20]). The parameters of the compared methods were set according to their original reports.

Fig. 2. The qualitative fusion results of different methods on different datasets. (a) TNO dataset, (b) LLVIP dataset, (c) MSRS dataset, (d) M3FD dataset.

1) Results on the TNO dataset

Qualitative Comparison. We test the performance of MFSFFuse and twelve other state-of-the-art fusion methods on the TNO dataset, and the results are presented in Fig. 2(a). It is evident from the figure that most of the fusion methods successfully complete the fusion task. The results of GTF, FusionGAN, PMGI, SwinFuse, SDNet, and SuperFusion are more inclined toward the infrared images. The fusion results of LatLRR, U2Fusion, and RFN-Nest contain rich texture information, but their contrast is low, which impacts the visual quality. In contrast, SeAFusion, TarDAL, AT-GAN, and our method produce fusion images with prominent targets and rich texture details, which align well with human visual perception. As can be seen from the leaves in the red box in the figure, our fusion results are the most visually effective and the most informative of all the methods.

Quantitative Comparison. Qualitative evaluation of image fusion results can be subjective, especially when the differences between images are subtle. To overcome this, we also conduct a quantitative analysis of the fusion methods. As shown in Table 1(a), our method achieves the highest values in terms of the SD, AG, and EN metrics, and the second-best value in SF. A larger SD value indicates a higher contrast in our fusion result, while a larger AG value indicates that our fusion result contains rich texture and detail information. The EN value reflects the amount of information contained in our fusion result. Overall, our fusion method effectively preserves important information from the source images.

2) Results on the LLVIP dataset

Qualitative Comparison. The LLVIP dataset comprises a large number of nighttime scene image pairs. A well-performing fusion algorithm should generate fusion images with rich texture details and salient objects even under low-light conditions. Given the limited information provided by both infrared and visible images in nighttime scenes, it is crucial to integrate their complementary features to enhance the visual quality of the fusion images. To demonstrate the efficacy of our proposed method in improving the visual quality of fusion images and integrating the complementary features of infrared and visible images, we evaluate it on the LLVIP dataset and compare its performance with state-of-the-art fusion algorithms, as shown in Fig. 2(b). All algorithms complete the fusion task to some extent, but the quality of the fusion results varies. Apart from our method, the other methods introduce irrelevant information in the fusion process, resulting in the loss of texture details and a significant reduction in
object contrast. In contrast, our method extracts multi-scale features and uses the synergistic effect of the intensity attention and detail attention modules, resulting in fusion images with rich texture details and salient objects. These results demonstrate the superiority of our approach in fusing low-light infrared and visible images and highlight its potential applications (such as image enhancement, target detection, wilderness rescue, and intelligent transportation) in various fields.

Table 1. The average quantitative results on the different datasets. The best and second-best fusion results for each evaluation metric are highlighted in bold and italic.

(a) TNO dataset
Methods         SD↑      SF↑     AG↑     EN↑
GTF             9.4559   0.0315  3.2319  6.7854
LatLRR          8.6825   0.0297  3.1506  6.5109
FusionGAN       8.7266   0.0240  2.4364  6.5691
PMGI            9.6512   0.0333  3.6963  7.0228
U2Fusion        9.4991   0.0455  5.1333  7.0189
SDNet           9.1310   0.0444  4.6136  6.7019
STDFusionNet    9.2626   0.0462  4.4936  6.9977
SuperFusion     9.1478   0.0329  3.3971  6.7996
SwinFuse        9.3388   0.0497  4.6344  7.0279
SeAFusion       9.6354   0.0465  4.9622  7.1300
TarDAL          9.5234   0.0462  4.3277  7.1418
AT-GAN          9.3015   0.0722  6.9538  7.1659
RFN-Nest        9.4242   0.0222  2.7181  6.9977
Ours            10.1428  0.0573  6.9847  7.1854

(b) LLVIP dataset
Methods         SD↑      SF↑     AG↑     EN↑
GTF             9.6182   0.0530  3.7378  7.3782
LatLRR          9.1372   0.0425  2.9960  7.0506
FusionGAN       8.6127   0.0299  2.1484  6.5675
PMGI            9.5219   0.0339  2.7424  7.0431
U2Fusion        8.8186   0.0499  3.9600  6.8095
SDNet           9.0090   0.0555  4.0928  6.9491
STDFusionNet    7.5675   0.0550  3.6277  5.9250
SuperFusion     9.4570   0.0462  3.0551  7.2985
SwinFuse        8.0158   0.0553  3.5657  6.4294
SeAFusion       9.5177   0.0593  4.3825  7.4603
TarDAL          9.5219   0.0541  3.6567  7.3903
AT-GAN          8.6333   0.0652  4.7407  6.9863
RFN-Nest        9.3305   0.0274  2.4266  7.1356
Ours            10.1600  0.0715  6.1103  7.5873

(c) MSRS dataset
Methods         SD↑      SF↑     AG↑     EN↑
GTF             6.2803   0.0311  2.4389  5.4009
LatLRR          7.9810   0.0311  2.7031  6.3026
FusionGAN       5.7659   0.0169  1.4342  5.3356
PMGI            8.0309   0.0330  3.0396  6.2800
U2Fusion        7.2116   0.0380  3.1761  5.7681
SDNet           5.9641   0.0350  2.8178  5.3023
STDFusionNet    7.5460   0.0424  3.1407  5.7876
SuperFusion     8.8779   0.0438  3.6098  6.7303
SwinFuse        5.3636   0.0389  2.1488  4.5381
SeAFusion       8.9295   0.0450  3.9098  6.7882
TarDAL          8.4195   0.0415  3.4116  6.5950
AT-GAN          6.2844   0.0339  2.5816  5.3576
RFN-Nest        8.1426   0.0246  2.2309  6.2605
Ours            9.3541   0.0631  6.3545  7.1927

(d) M3FD dataset
Methods         SD↑      SF↑     AG↑     EN↑
GTF             9.6401   0.0633  5.7226  7.2623
LatLRR          8.4141   0.0424  3.9724  6.6165
FusionGAN       9.3450   0.0404  3.7267  6.8787
PMGI            8.8045   0.0463  4.4801  6.9000
U2Fusion        9.2830   0.0706  7.0247  6.9991
SDNet           9.0745   0.0716  6.7906  6.9704
STDFusionNet    9.8151   0.0717  6.4913  6.9519
SuperFusion     9.3339   0.0556  5.1721  6.8633
SwinFuse        9.7908   0.0785  7.3444  7.5165
SeAFusion       10.1821  0.0737  6.8173  7.0007
TarDAL          9.8938   0.0596  5.5534  7.2459
AT-GAN          9.0916   0.1016  9.7448  7.2955
RFN-Nest        8.8753   0.0399  4.0053  6.9086
Ours            10.8800  0.1024  9.7097  7.3753

Quantitative Comparison. We select 96 pairs of images from the LLVIP dataset for quantitative analysis. Table 1(b) presents the corresponding quantitative indicators. As shown in the table, our method achieves the best results
on SD, SF, AG, and EN. These metrics also reflect that our method has certain advantages in preserving texture information as well as salient objects. Overall, our method is effective in preserving the information of the source images.

3) Results on the MSRS dataset

Qualitative Comparison. To ensure the credibility of our results, we conduct generalization experiments on the MSRS dataset. Figure 2(c) shows the qualitative results of different methods on the MSRS dataset. The figures reveal that methods such as GTF, LatLRR, FusionGAN, PMGI, and SDNet retain the essential target information but fail to capture texture details. On the other hand, RFN-Nest, U2Fusion, SwinFuse, SuperFusion, SeAFusion, TarDAL, and AT-GAN not only preserve critical target information but also retain some texture details in the fusion result. However, these results have low contrast, leading to less visually appealing images. Overall, our method achieves a balance between retaining rich information and producing visually pleasing results.

Quantitative Comparison. Table 1(c) presents the quantitative results on the MSRS dataset and shows that our method achieves the optimal values in SD, SF, AG, and EN. This reflects the effectiveness of our method in retaining features.

4) Results on the M3FD dataset

Qualitative Comparison. The application scenarios of infrared and visible image fusion are complex and diverse. For this reason, we also validate our method on the M3FD dataset. Figure 2(d) shows the qualitative results of different methods on the M3FD dataset. Some fusion methods, such as GTF, LatLRR, FusionGAN, PMGI, SDNet, RFN-Nest, and SuperFusion, fail to preserve the texture information of the trees, leading to suboptimal fusion results. In contrast, our method and a few others, such as TarDAL and AT-GAN, successfully maintain rich texture details while avoiding an excessive reduction in contrast. As can be seen from the leaves in the red boxes, our method preserves rich texture details. Overall, the experimental results demonstrate the effectiveness and superiority of our method on the M3FD dataset.

Quantitative Comparison. We select 76 pairs of images from the M3FD dataset for quantitative analysis, and the results are presented in Table 1(d). Our method outperforms all other methods in terms of SD and SF, and ranks second in AG and EN. From these results, it can be seen that our method achieves the overall best performance.

3.4 Ablation Study

To verify the rationality of the different components in the proposed method, we conduct ablation experiments with the following structures: 1) w/o MRF-DCB: use single convolutions instead of MRF-DCBs; 2) w/o D: without the decomposition module; 3) w/o DAM: without the detail attention module; 4) w/o IAM: without the intensity attention module; 5) w/o DAM+IAM: without both the detail attention module and the intensity attention module; 6) Ours: the proposed
method. We conducted qualitative and quantitative experiments to validate the effectiveness of the different modules. Structure 1) is used to prove the effectiveness of the proposed multi-receptive field convolution module; structure 2) to show that the self-supervised network improves the fusion results; and structures 3), 4), and 5) to illustrate the effectiveness of the intensity attention and detail attention modules. The qualitative results are shown in Fig. 3, and the quantitative results are shown in Table 2.

Fig. 3. Fusion images with different structures.

Qualitative Analysis: In our method, the cascaded DAM and IAM play a crucial role in preserving rich texture details and salient object information in the fusion image. The qualitative experiments demonstrate the efficacy of these two modules in optimizing the model. In the absence of the DAM, the fusion result tends to lose certain information and exhibits reduced contrast. Similarly, without the IAM, the overall brightness of the fused image decreases. Without both modules, the fusion result may suffer from artifacts and an overall reduction in visual quality. The decomposition module is the component responsible for self-supervision and is designed to promote the optimization of the model through the interaction between the decomposed and reconstructed images and the source images. The ablation experiment further validates the efficacy of this module, which shows a significant positive effect on the model's performance. To evaluate the effectiveness of the MRF-DCB, we design a single-convolution structure for comparison. The experimental results demonstrate that our proposed method outperforms the single-convolution structure in terms of detail preservation and visual quality.

Quantitative Analysis: From the quantitative results shown in Table 2, it can be seen that the proposed method achieves the optimal value on SD, SF, and AG, and obtains the third-best value on EN. These outcomes demonstrate that each module in the proposed method contributes positively to the overall performance of the model.

3.5 Execution Time

In addition to generalization performance, the efficiency of a fusion algorithm is also an important evaluation index, especially when fusing infrared and visible images for advanced vision tasks, where real-time performance is crucial. Table 3 presents the average fusion time of the different fusion methods on the TNO dataset. As shown in the table, our method demonstrates high processing efficiency, second only to AT-GAN. In summary, our method not only achieves superior fusion performance and generalization ability but also demonstrates high efficiency in image fusion, which makes it possible to apply the algorithm to a wide range of advanced vision tasks in the future.

Table 2. The quantitative results of different structures. The best and second-best results for each evaluation metric are highlighted in bold and italic.
Methods: w/o MRF-DCB, w/o D, w/o DAM, w/o IAM, w/o DAM+IAM, Ours
SD↑  SF↑  AG↑  EN↑

9.4767 0.0498 5.3681 7.1735

9.8270 0.0456 5.0309 7.1223

10.0524 0.0385 4.2922 7.2276

10.0584 0.0483 5.1951 7.2758

10.1428 0.0573 6.9847 7.1854

9.6089 0.0563 5.8155 7.0244

Table 3. The average runtime for different methods.
Methods      time (s)    Methods        time (s)
GTF          2.850       AT-GAN         0.028
LatLRR       51.718      SuperFusion    0.220
FusionGAN    0.478       SwinFuse       0.727
PMGI         0.237       SeAFusion      0.100
U2Fusion     3.425       TarDAL         0.780
SDNet        0.181       Ours           0.056
RFN-Nest     1.771

4 Conclusion

In this paper, we propose a novel method, MFSFFuse, for fusing infrared and visible images. First, we introduce the MRF-DCB, which comprehensively captures multi-scale features from the source images. Second, considering the disparities between the infrared and visible modalities and their characteristics, we design two branches for feature extraction: in the ir-path we incorporate an IAM to preserve crucial target information, while in the vi-path a DAM is employed to retain texture details. These modules ensure that the fusion result contains the most meaningful information from both modalities. To tackle the loss of important features in unsupervised fusion methods, we adopt a self-supervised approach that treats the source images as the ground truth and constrains the decomposed and reconstructed images against the corresponding source images to optimize the model. Experimental results on diverse datasets demonstrate that our fusion results surpass state-of-the-art methods in qualitative, quantitative, and generalization experiments. In future work, we will continue refining the network and explore its integration with high-level vision tasks, such as object detection and image segmentation, to further enhance its practical applicability.


References 1. Xu, H., Ma, J., Jiang, J., Guo, X., Ling, H.: U2Fusion: a unified unsupervised image fusion network. IEEE Trans. Pattern Anal. Mach. Intell. 44(1), 502–518 (2020) 2. Liu, S., Wang, M., Song, Z.: WaveFuse: a unified unsupervised framework for image fusion with discrete wavelet transform. In 28th International Conference on Neural Information Processing (ICONIP), pp. 162–174 (2021) 3. Tang, L., Yuan, J., Ma, J.: Image fusion in the loop of high-level vision tasks: a semantic-aware real-time infrared and visible image fusion network. Inf. Fus. 82, 28–42 (2022) 4. Gao, X., Liu, S.: DAFuse: a fusion for infrared and visible images based on generative adversarial network. J. Electron. Imaging 31(4), 043023 (2022) 5. Han, M., et al.: Boosting target-level infrared and visible image fusion with regional information coordination. Inf. Fus. 92, 268–288 (2023) 6. Ma, J., Tang, L., Xu, M., Zhang, H., Xiao, G.: STDFusionNet: an infrared and visible image fusion network based on salient target detection. IEEE Trans. Instrum. Meas. 70, 1–13 (2021) 7. Zhang, H., Xu, H., Xiao, Y., Guo, X., Ma, J.: Rethinking the image fusion: a fast unified image fusion network based on proportional maintenance of gradient and intensity. Proc. AAAI Conf. Artif. Intell. 34(7), 12797–12804 (2020) 8. Zhang, H., Ma, J.: SDNet: a versatile squeeze-and-decomposition network for realtime image fusion. Int. J. Comput. Vis. 129, 2761–2785 (2021) 9. Li, H., Wu, X.J.: DenseFuse: a fusion approach to infrared and visible images. IEEE Trans. Image Process. 28(5), 2614–2623 (2018) 10. Li, H., Wu, X.J., Kittler, J.: RFN-Nest: an end-to-end residual fusion network for infrared and visible images. Inf. Fus. 73, 72–86 (2021) 11. Ma, J., Yu, W., Liang, P., Li, C., Jiang, J.: FusionGAN: a generative adversarial network for infrared and visible image fusion. Inf. Fus. 48, 11–26 (2019) 12. Ma, J., Zhang, H., Shao, Z., Liang, P., Xu, H.: GANMcC: a generative adversarial network with multiclassification constraints for infrared and visible image fusion. IEEE Trans. Instrum. Meas. 70, 1–14 (2020) 13. Rao, D., Xu, T., Wu, X.J.: TGFuse: An infrared and visible image fusion approach based on transformer and generative adversarial network. IEEE Trans. Image Process. (2023). https://doi.org/10.1109/TIP.2023.3273451 14. Jia, X., Zhu, C., Li, M., Tang, W., Zhou, W.: LLVIP: a visible-infrared paired dataset for low-light vision. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 3496–3504 (2021) 15. Liu, J., et al.: Target-aware dual adversarial learning and a multi-scenario multimodality benchmark to fuse infrared and visible for object detection. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 5802–5811 (2022) 16. Ma, J., Chen, C., Li, C., Huang, J.: Infrared and visible image fusion via gradient transfer and total variation minimization. Inf. Fus. 31, 100–109 (2016) 17. Li, H., Wu, X.J.: Infrared and visible image fusion using latent low-rank representation. arXiv preprint arXiv:1804.08992 (2018) 18. Tang, L., Deng, Y., Ma, Y., Huang, J., Ma, J.: SuperFusion: a versatile image registration and fusion network with semantic awareness. IEEE/CAA J. Automatica Sinica 9(12), 2121–2137 (2022)


19. Wang, Z., Chen, Y., Shao, W., Li, H., Zhang, L.: SwinFuse: a residual Swin transformer fusion network for infrared and visible images. IEEE Trans. Instrum. Meas. 71, 1–12 (2022) 20. Rao, Y., et al.: AT-GAN: a generative adversarial network with attention and transition for infrared and visible image fusion. Inf. Fus. 92, 336–349 (2023)

Progressive Temporal Transformer for Bird's-Eye-View Camera Pose Estimation

Zhuoyuan Wu(B), Jiancheng Cai, Ranran Huang, Xinmin Liu, and Zhenhua Chai

Meituan, 7 Rongda Road, Chaoyang, Beijing 100012, China
{wuzhuoyuan02,caijiangcheng,huangranran,liuxinmin,chaizhenhua}@meituan.com

Abstract. Visual relocalization is a crucial technique used in visual odometry and SLAM to predict the 6-DoF camera pose of a query image. Existing works mainly focus on ground views in indoor or outdoor scenes, whereas camera relocalization on unmanned aerial vehicles has received less attention; frequent view changes and a large depth of field also make it more challenging. In this work, we establish a Bird's-Eye-View (BEV) dataset for camera relocalization, a large dataset containing four distinct scenes (roof, farmland, bare ground, and urban area) with challenging properties such as frequent view changes, repetitive or weak textures, and large depths of field. All images in the dataset are associated with a ground-truth camera pose. The BEV dataset contains 177242 images, making it a challenging large-scale dataset for camera relocalization. We also propose a Progressive Temporal transFormer (dubbed PTFormer) as the baseline model. PTFormer is a sequence-based transformer with a designed progressive temporal aggregation module for exploiting temporal correlation and a parallel absolute and relative prediction head for implicitly modeling the temporal constraint. Thorough experiments on both the BEV dataset and the widely used handheld datasets 7Scenes and Cambridge Landmarks demonstrate the robustness of the proposed method.

Keywords: Camera Pose Estimation · Bird's-Eye-View · Transformer

1 Introduction

Camera relocalization aims to regress the 6-DoF pose of a given image relative to a scene, which is a crucial technique widely applied in fields such as robot navigation, augmented reality, and autonomous driving. Recently, some learning-based research [19,26] has shown impressive performance in visual relocalization on several widely used datasets such as Cambridge Landmarks [17], Oxford Robotcar [21], and 7Scenes [27]. However, these datasets are mostly recorded by handheld devices or ground robots with limited perspective changes and a small depth of field. They also pay little attention to weak or repetitive textures, which motivates the need for a more challenging dataset. The drone has recently improved
efficiency in mining, sea ports, oil, and other industrial facilities because of its unrestricted aerial perspective. In practice, view changes in aerial images are more pronounced, and the immense depth of field makes them more sensitive to texture changes, so aerial images collected by drones are well suited to the challenges mentioned above. Consequently, we build a bird's-eye-view dataset for camera pose estimation to stimulate future research on BEV camera relocalization. The dataset contains four scenes, i.e., roof, farmland, bare ground, and urban area. Roof and urban area are two scenes collected in a city, while farmland and bare ground are collected in a rural district. The whole dataset constitutes 177242 images in total.

Methods for camera relocalization can be divided into structure-based and regression-based ones. Structure-based methods [23,24,31] predict the camera pose hierarchically: given a query image, an image retrieval algorithm is applied to find similar images, followed by feature extraction and matching to obtain numerous 2D-2D matches. Combined with depth information and the camera intrinsics, 2D image features are mapped to their 3D counterpart coordinates, and the camera pose is obtained via Perspective-n-Point (PnP) [14] and RANSAC [12]. Although geometry-based methods achieve state-of-the-art performance, they are relatively time-consuming because of the iterative optimization process, and they also need to store dense keyframes. Recent years have witnessed the vigorous development of deep learning, and deep learning studies solve the problem of camera relocalization through absolute pose regression networks (APRs) [17]. Trained on multiple collected images with 6-DoF poses, the network can infer the pose of the query image in one forward pass. Although APRs are less accurate than structure-based methods, they have irreplaceable advantages in terms of speed and robustness to repetitive or weak textures. However, no tailored method has been proposed for BEV camera relocalization. Because small movements may cause a significant change in the visual field, we rely on sequence-based methods to introduce temporal constraints. In addition, we design the temporal aggregation module (TAM) to exploit the temporal correlation and a parallel APR and RPR prediction head to bring the relative pose restraint into the feature representation. Our contributions can be summarized as follows:

– We build a BEV dataset collected by 6-rotor drones with ground-truth poses for camera relocalization. The dataset, with 177242 images in total, is challenging in terms of movements in 3D space, frequent view changes, and the variability of textures in different scenes.

– We propose the Progressive Temporal Transformer (PTFormer) with three techniques: a progressive temporal aggregation module (TAM), a parallel absolute pose regression (APR) and relative pose regression (RPR) head to exploit temporal correlation, and inner attention integrated into the original multi-head attention.

– We give thorough experiments on the built BEV dataset and two public datasets, 7Scenes and Cambridge Landmarks. PTFormer achieves the
best performance on the BEV dataset and a comparable result with SOTA approaches on two public datasets.

2 Related Work

2.1 Dataset for Camera Relocalization

Existing datasets for camera pose estimation are usually obtained via Structure-from-Motion (SfM) reconstruction or differential GPS. 7Scenes [27] is an indoor dataset with all scenes recorded by a handheld Kinect RGB-D camera. The Cambridge Landmarks dataset [17] is an outdoor dataset collected by pedestrians holding a Google LG Nexus 5 smartphone, containing different lighting and weather conditions. Numerous datasets are also recorded by car-mounted cameras, such as Oxford Robotcar [21]. DAG [33] and VPAIR [25] are the datasets most related to ours. However, DAG targets visual place recognition and localization by retrieving the closest database aerial image given a street-level query image, while ours focuses on localization given only BEV images. VPAIR is a low frame-rate dataset collected along a single flight route, while ours is a high frame-rate one with dozens of flight routes in each scene.

2.2 Camera Relocalization

Camera relocalization methods can be categorized into structure-based methods and regression-based methods. Structure-based methods [6,20,24,30] solve the problem of camera relocalization by matching feature descriptors of the query image to descriptors of 3D points of an SfM model. Due to the weak representation power of handcrafted descriptors, many recent methods rely on convolutional networks [9,10] or transformers [29] to extract features that are more robust to view changes. Besides matching feature descriptors, scene coordinate regression methods [2–4,27] predict matches from the coordinates of the query image to 3D scene space, which performs decently in small-scale environments but less accurately in large-scale ones. Although structure-based methods usually give more accurate predictions, factors such as heavy computation in large-scale environments and the need for known camera intrinsics or depth sensors restrict their development. Regression-based methods train a deep model to predict the pose from a given image. PoseNet [17] is one of the earliest works, attaching an MLP after a GoogLeNet. Compared with structure-based methods, PoseNet demonstrates the advantages of regression-based methods in terms of robustness to view changes and freedom from matching. A sequence of works builds on top of PoseNet: [15] proposes Bayesian PoseNet to estimate the uncertainty of the predicted pose; [35] introduces an LSTM after the FC layers to relieve the overfitting problem; [16] designs geometrically inspired pose losses to improve orientation accuracy. Some works focus on refining the architecture.


[22] proposes Hourglass, which replaces the original GoogLeNet with an encoder-decoder architecture implemented with a ResNet34. BranchNet [37] designs a dual branch to regress the position and orientation separately. Besides, a group of works predicts camera poses from consecutive sequences. VidLoc [8] predicts camera poses recurrently and also estimates the trajectories, which can reduce localization error. MapNet [5] introduces geometric constraints by using additional inputs of visual odometry and GPS to provide extra supervision for model training. LsG [38] uses ConvLSTM to regress relative poses between consecutive frames, trying to exploit the temporal correlation to boost performance. Sequence-based methods usually outperform image-based ones because of the additional temporal information; such temporal correlation gives the model auxiliary information that restrains performance flicker caused by view changes. So in this work, we focus on the sequence-based approach. Recently, the Transformer [34] has been widely used in diverse vision tasks. A recent work [26] proposes MS-Transformer, inspired by DETR [7], and achieves SOTA performance on camera relocalization by learning scene-specific queries with a Transformer decoder to regress the camera poses in different scenes. Compared with MS-Transformer, our method has fewer parameters and is tailored for BEV localization with sequence input; moreover, the proposed temporal aggregation module and the parallel APR and RPR prediction heads introduce a temporal restriction while learning high-dimensional features.

3 Method

In this section, we give the details of our proposed PTFormer, as shown in Fig. 1. Given a consecutive sequence of frames $\{F_{t+i}\}_{i=-L:L}$, PTFormer regresses the camera pose $p = \langle x, q \rangle$ of the intermediate frame $F_t$, where $F_t \in \mathbb{R}^{H \times W \times C}$, $x \in \mathbb{R}^3$ is the position of the camera in the world, and $q \in \mathbb{R}^4$ is the corresponding 3D orientation in quaternion form. First, a pre-trained CNN extracts visual features from the given images. Then a Transformer encoder augments the spatial features. Afterward, PTFormer applies a progressive temporal aggregation module so that features interact and the temporal correspondence is better modeled. Finally, besides the APR module regressing the camera pose of each frame, the RPR module predicts the relative camera pose of all frame pairs, imposing a temporal restriction on the augmented features in latent space.

3.1 Network Architecture

Convolutional Backbone. Following [26], we use a pre-trained CNN backbone to extract visual features. However, we use the same activation map to regress both position and orientation.

Sequential Representation. In order to make the extracted visual features compatible with the Transformer, the activation map $M_t \in \mathbb{R}^{H_m \times W_m \times C_m}$ is converted to a sequential representation $\tilde{M}_t$ through a $1 \times 1$ convolution and flattening. As mentioned in [26], two one-dimensional encodings are learned separately for the X and Y axes in order to reduce the number of learned parameters. Specifically, for a position $(i, j)$, $i = 1, \ldots, H_m$, $j = 1, \ldots, W_m$, the positional embedding is defined as $E^{ij}_{pos} = [E^j_u, E^i_v]$, where $E_u \in \mathbb{R}^{W_m \times C_m/2}$ and $E_v \in \mathbb{R}^{H_m \times C_m/2}$. So the input to the Transformer is given by:

$$h_t = \tilde{M}_t + E. \qquad (1)$$

This process is applied for each frame.
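A minimal PyTorch sketch of this sequential representation is given below, assuming the shapes stated in the text; the class name SequentialRepresentation and random initialization of the encodings are our own choices, not the released implementation.

```python
import torch
import torch.nn as nn


class SequentialRepresentation(nn.Module):
    """Sketch of Eq. (1): project the CNN activation map with a 1x1 conv,
    flatten it, and add separate learned X/Y positional encodings."""
    def __init__(self, c_in, c_m, h_m, w_m):
        super().__init__()
        self.proj = nn.Conv2d(c_in, c_m, kernel_size=1)
        self.e_u = nn.Parameter(torch.randn(w_m, c_m // 2))   # X-axis encoding
        self.e_v = nn.Parameter(torch.randn(h_m, c_m // 2))   # Y-axis encoding

    def forward(self, m_t):                        # m_t: (B, C_in, H_m, W_m)
        b, _, h, w = m_t.shape
        x = self.proj(m_t)                         # (B, C_m, H_m, W_m)
        # E[i, j] = [E_u[j], E_v[i]]
        e = torch.cat([self.e_u[None, :, :].expand(h, w, -1),
                       self.e_v[:, None, :].expand(h, w, -1)], dim=-1)
        x = x.flatten(2).transpose(1, 2)           # (B, H_m*W_m, C_m)
        return x + e.reshape(h * w, -1)            # h_t = M~_t + E
```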

Fig. 1. Overview of our proposed PTFormer. Given a sequence of frames Fi , i ∈ {1, ..., n}, a pre-trained CNN extracts the visual features from each image. Then a weight-shared Transformer is used to augment visual representation. Two layers of TAM are followed to exploit all groups of temporal correlation (Cn1 ) among features of a certain frame with the other ones. Finally, given augmented features, the APR head predicts the camera pose of each frame, while the RPR head predicts the relative pose of all frame pairs. Note that the RPR head only exists in the training phase.

Attention with Inner Attention. The Transformer encoder is composed of m identical layers, each comprising multi-head attention (MHA) and multi-layer perceptron (MLP) layers, where each part is followed by LayerNorm and a residual connection. Different from the standard Transformer encoder in [7], the Transformer encoder in our work is integrated with the attention-in-attention (AiA) module as mentioned in [13]. Given a set of queries, keys, and values $Q, K, V \in \mathbb{R}^{H_m W_m \times C_m}$, the vanilla multi-head attention (VaniMHA) is formulated as:

$$\mathrm{VaniMHA}(Q, K, V) = \Big(\mathrm{Softmax}\Big(\frac{\hat{Q}\hat{K}^{\top}}{\sqrt{C_m}}\Big)\hat{V}\Big) W_o, \qquad (2)$$

where $\hat{Q} = Q W_q$, $\hat{K} = K W_k$ and $\hat{V} = V W_v$ are linear transformations, while $W_q$, $W_k$, $W_v$ and $W_o$ are learnable weights. One can observe that the correlation among query-key pairs is ignored in conventional attention. Intuitively, introducing interaction among query-key pairs can relieve the side effect caused by an imperfect representation of the attention score, i.e., imposing second-order attention on the first-order attention score can restrain noise and help relevant query-key pairs augment each other. Denote the attention map as $A = \mathrm{Softmax}\big(\frac{\hat{Q}\hat{K}^{\top}}{\sqrt{C_m}}\big)$. The inner multi-head attention (InnerMHA) is formulated as follows:

$$\mathrm{InnerMHA}(Q', K', V') = \Big(\mathrm{Softmax}\Big(\frac{Q'K'^{\top}}{\sqrt{D}}\Big)V'\Big)(1 + W_o), \qquad (3)$$

where $Q' = A W_q'$, $K' = A W_k'$ and $V' = A W_v'$ are linear transformations of the attention score $A$. Finally, the second-order multi-head attention mechanism is given by:

$$\mathrm{SecOrdMHA}(Q, K, V) = \hat{A}\, \hat{V} W_o, \qquad (4)$$
$$\hat{A} = \mathrm{Softmax}\big(A + \mathrm{InnerMHA}(A)\big). \qquad (5)$$

It should be noted that second-order attention is applied in both the self-attention and the cross-attention in our architecture.

Temporal Aggregation Module. Previous works [28,38] prove that temporal aggregation within a sequence benefits the accuracy of pose estimation. Hence we propose the temporal aggregation module (TAM) to learn the temporal correlation. Considering a sequence of features $\{h_{t+i}\}_{i=-L:L}$ obtained after the Transformer encoders, the coarse temporal aggregation between the current feature $h_t$ and the concatenation of the other features $h_t^c$ is formulated as:

$$h'_t = \mathrm{TAM}_1(h_t, h_t^c) = \mathrm{SecOrdMHA}(\hat{Q}, \hat{K}, \hat{V}), \qquad (6)$$

where $\hat{Q} = h_t W_q$, $\hat{K} = h_t^c W_k$ and $\hat{V} = h_t^c W_v$. Further, the fine aggregation follows the same procedure:

$$h^o_t = \mathrm{TAM}_2(h'_t, h_t^c). \qquad (7)$$
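The sketch below illustrates one consistent reading of Eqs. (2)-(7) in simplified single-head form; the inner-attention dimensions, the handling of the "(1 + W_o)" term, and the proj dictionary of linear layers are assumptions made for illustration and do not reproduce the authors' exact implementation.

```python
import torch
import torch.nn as nn


class InnerAttention(nn.Module):
    """Loose sketch of Eq. (3): a second attention computed on the attention
    map A itself. n_tokens must equal the number of key/context tokens."""
    def __init__(self, n_tokens, d=64):
        super().__init__()
        self.wq = nn.Linear(n_tokens, d, bias=False)
        self.wk = nn.Linear(n_tokens, d, bias=False)
        self.wv = nn.Linear(n_tokens, n_tokens, bias=False)
        self.wo = nn.Linear(n_tokens, n_tokens, bias=False)
        self.scale = d ** -0.5

    def forward(self, a):                                  # a: (B, N, M)
        q, k, v = self.wq(a), self.wk(a), self.wv(a)
        inner = torch.softmax(q @ k.transpose(-2, -1) * self.scale, dim=-1) @ v
        return inner + self.wo(inner)                      # "(1 + W_o)" term


def sec_ord_attention(q, k, v, inner_attn):
    """Eqs. (4)-(5): refine the score map A with inner attention before it
    weights the values (outer W_o and the multi-head split are omitted)."""
    a = torch.softmax(q @ k.transpose(-2, -1) / q.shape[-1] ** 0.5, dim=-1)
    a_hat = torch.softmax(a + inner_attn(a), dim=-1)
    return a_hat @ v


def two_step_tam(h_t, h_others, proj, inner_attn):
    """Eqs. (6)-(7): the current frame's feature queries the concatenation of
    the other frames' features twice (coarse, then fine). `proj` is a dict of
    hypothetical linear layers {"wq", "wk", "wv"}; in practice each TAM step
    would have its own weights."""
    def tam(x, ctx):
        return sec_ord_attention(proj["wq"](x), proj["wk"](ctx),
                                 proj["wv"](ctx), inner_attn)
    h_prime = tam(h_t, h_others)        # coarse aggregation, Eq. (6)
    return tam(h_prime, h_others)       # fine aggregation, Eq. (7)
```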

Absolute Pose Regression Head. The APR head is a one-hidden-layer MLP followed by a GELU activation function to regress the camera pose. Concretely, the features of the corresponding frames obtained after the temporal aggregation module are fed to an APR head to predict the camera pose of each frame.

Relative Pose Regression Head. As mentioned in [11,18], learning relative and absolute camera poses simultaneously helps the model learn better feature representations and improves the accuracy of absolute pose estimation. In this work, we design a parallel regression head for this reason. Following [11], the relative pose is given in the reference
system of the second camera. Given a pair of frames $(F_1, F_2)$, their corresponding rotation matrices and translations from world coordinates to camera coordinates are $(R_1, x_1, R_2, x_2)$. Denoting the rotation quaternions as $(q_1, q_2)$, the relative camera pose $(q_{1,2}, x_{1,2})$ is formulated as:

$$q_{1,2} = q_2 \times q_1^{*}, \qquad (8)$$
$$x_{1,2} = R_2(-R_1^{\top} x_1) + x_2, \qquad (9)$$

Fig. 2. Trace and a sample of our proposed BEV dataset. (a) is bare ground, (b) is farmland, (c) is roof, (d) is urban area. For visual clarity, in each scene we select only one trace for visualization. The red boxes are samples of the collected data in each scene. (Color figure online)

where $q_1^{*}$ is the conjugate of $q_1$ and $\times$ is quaternion multiplication. We now give the details of how the pairwise relative camera pose is predicted. Suppose $Q_1, K_1, V_1$ and $Q_2, K_2, V_2$ are the linear mappings from features $h^o_1$ and $h^o_2$, respectively; the relative pose is obtained via an MLP on the concatenation of features $[\mathrm{Softmax}(Q_1 K_2^{\top})V_2, \mathrm{Softmax}(Q_2 K_1^{\top})V_1]$. Given $n$ frames, all $A_n^2$ feature permutation pairs are sent to the RPR head to predict relative camera poses.
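Eqs. (8)-(9) can be computed as in the sketch below. The (w, x, y, z) quaternion ordering and the function names are assumptions; the paper does not state its quaternion convention.

```python
import torch


def quat_conjugate(q):
    """Conjugate of a quaternion in (w, x, y, z) order (assumed)."""
    w, x, y, z = q.unbind(-1)
    return torch.stack([w, -x, -y, -z], dim=-1)


def quat_multiply(a, b):
    """Hamilton product a x b for quaternions in (w, x, y, z) order."""
    w1, x1, y1, z1 = a.unbind(-1)
    w2, x2, y2, z2 = b.unbind(-1)
    return torch.stack([
        w1 * w2 - x1 * x2 - y1 * y2 - z1 * z2,
        w1 * x2 + x1 * w2 + y1 * z2 - z1 * y2,
        w1 * y2 - x1 * z2 + y1 * w2 + z1 * x2,
        w1 * z2 + x1 * y2 - y1 * x2 + z1 * w2], dim=-1)


def relative_pose(q1, x1, R1, q2, x2, R2):
    """Eqs. (8)-(9): pose of camera 1 expressed in camera 2's frame, given
    world-to-camera rotations R_i (3x3) and translations x_i (3,)."""
    q_12 = quat_multiply(q2, quat_conjugate(q1))      # Eq. (8)
    x_12 = R2 @ (-R1.T @ x1) + x2                      # Eq. (9)
    return q_12, x_12
```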

3.2 Loss Function

PTFormer is trained in an end-to-end fashion, guided by a joint loss function to supervise both absolute poses and relative poses:

$$L = \sum_i \Big( \|x_i - \bar{x}_i\| \exp(-s_x) + s_x + \Big\| q_i - \frac{\bar{q}_i}{\|\bar{q}_i\|} \Big\| \exp(-s_q) + s_q \Big) + \sum_{ij} \Big( \|x_{ij} - \bar{x}_{ij}\| \exp(-s_{rx}) + s_{rx} + \Big\| q_{ij} - \frac{\bar{q}_{ij}}{\|\bar{q}_{ij}\|} \Big\| \exp(-s_{rq}) + s_{rq} \Big), \qquad (10)$$

where $(x_i, q_i)$ and $(\bar{x}_i, \bar{q}_i)$ are the predicted and ground-truth absolute camera poses, and $(x_{ij}, q_{ij})$ and $(\bar{x}_{ij}, \bar{q}_{ij})$ are the predicted and ground-truth relative camera poses. $s_x$, $s_q$, $s_{rx}$, $s_{rq}$ are learned parameters that balance the different parts of the loss function, as suggested in [16] (Fig. 3).
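A sketch of this learnable-weighting joint loss is shown below. The initial values of the learned parameters and the batch averaging are assumptions; only the structure of Eq. (10) comes from the text.

```python
import torch
import torch.nn as nn


class JointPoseLoss(nn.Module):
    """Sketch of Eq. (10): exp(-s)-weighted position/orientation terms for the
    absolute (s_x, s_q) and relative (s_rx, s_rq) poses, following [16]."""
    def __init__(self, s_x=0.0, s_q=-3.0, s_rx=0.0, s_rq=-3.0):  # inits assumed
        super().__init__()
        self.s_x = nn.Parameter(torch.tensor(s_x))
        self.s_q = nn.Parameter(torch.tensor(s_q))
        self.s_rx = nn.Parameter(torch.tensor(s_rx))
        self.s_rq = nn.Parameter(torch.tensor(s_rq))

    @staticmethod
    def _term(pred, gt, s, normalize_gt=False):
        if normalize_gt:                       # quaternion term of Eq. (10)
            gt = gt / gt.norm(dim=-1, keepdim=True)
        return torch.norm(pred - gt, dim=-1).mean() * torch.exp(-s) + s

    def forward(self, x, q, x_gt, q_gt, xr, qr, xr_gt, qr_gt):
        abs_loss = (self._term(x, x_gt, self.s_x)
                    + self._term(q, q_gt, self.s_q, normalize_gt=True))
        rel_loss = (self._term(xr, xr_gt, self.s_rx)
                    + self._term(qr, qr_gt, self.s_rq, normalize_gt=True))
        return abs_loss + rel_loss
```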

Fig. 3. Sketch of our 6-rotor drone and sensor locations. BEV images are captured by a down-view camera. GPS antennas are mounted on both sides of the drone.

3.3 Implementation Details

Our model is implemented in PyTorch with a single NVIDIA Tesla A100 GPU. The Adam optimizer is used to optimize the parameters with β1 = 0.9, β2 = 0.999, and ε = 10⁻¹⁰. The batch size is 8, and the initial learning rate is set to 1 × 10⁻⁴ with a weight decay of 1 × 10⁻⁴ every 200 epochs out of 600 epochs. A pre-trained EfficientNet [32] is used to extract visual features. In order to make the model capable of regressing poses in different orientations, we apply an angle augmentation. During training, all images are rescaled to 256 × 256 pixels and center-cropped to 224 × 224. The feature dimension in the Transformer is set to Cm = 256. The hidden dimension in the transformer blocks is set to 256 with a dropout of p = 0.1. The number of MHA heads is 4, followed by a 2-layer MLP.
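The quoted training configuration could be set up as sketched below. The form of the angle augmentation and the 200-epoch decay schedule are not fully specified in the text, so the RandomRotation range and the StepLR interpretation are assumptions.

```python
import torch
from torchvision import transforms

# Preprocessing described above; RandomRotation stands in for the unspecified
# "angle augmentation" (range assumed).
train_tf = transforms.Compose([
    transforms.Resize((256, 256)),
    transforms.RandomRotation(degrees=180),
    transforms.CenterCrop(224),
    transforms.ToTensor(),
])


def make_optimizer(model):
    """Adam configuration quoted in the text; the per-200-epoch schedule is
    modeled with StepLR as one plausible reading (gamma assumed)."""
    opt = torch.optim.Adam(model.parameters(), lr=1e-4,
                           betas=(0.9, 0.999), eps=1e-10,
                           weight_decay=1e-4)
    sched = torch.optim.lr_scheduler.StepLR(opt, step_size=200, gamma=0.1)
    return opt, sched
```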

4 Aerial Visual Localization Dataset

4.1 Sensor Setup

The data is collected by a Meituan-developed 6-rotor fixed-wing drone with a nominal cruise speed of 10 km/h. The drone is equipped with a binocular camera

with a resolution of 1280 × 800 at 10 Hz. For convenience, our BEV dataset is constructed using only the left camera. The 6-DoF poses are provided by a self-developed navigation system with an expected uncertainty of 0.05° in rotation and less than 80 cm in position.

4.2 BEV Dataset

The BEV dataset contains four scenes. The roof sequences consist of 16 sequences collected on the roof of a building. The farmland sequences comprise 12 sequences recorded over farmland. The bare ground sequences are composed of 9 sequences collected in a sub-urban area with bare ground and trees. The sequences in these scenes cover areas of 3.6 × 10⁴, 1 × 10⁴, and 8.2 × 10⁴ m², respectively. The urban area sequences are 6 sequences from an urban area; due to aerial regulations, drones can only fly along a fixed route, and we select a 3 km one. The BEV dataset contains 177242 images in total with diverse scenes. All images have a ground-truth camera pose with a pose error of less than 80 cm. Figure 2 gives samples of our dataset.

Table 1. Comparative analysis on the BEV dataset. We report the median position/orientation error in meters/degrees for each method. Bold highlighting indicates better performance.
Scene (scale):       Roof (3.6×10⁴ m²) | Farmland (1×10⁴ m²) | Bare ground (8.2×10⁴ m²) | Urban area (3×10³ m) | Avg
AtLoc [36]           4.159 m, 6.002°   | 7.611 m, 9.810°     | 7.499 m, 6.417°          | 24.731 m, 7.166°     | 11.000 m, 7.349°
MS-Transformer [26]  2.417 m, 4.259°   | 4.950 m, 10.981°    | 3.368 m, 8.636°          | 7.718 m, 1.353°      | 4.613 m, 6.307°
PTFormer (ours)      0.869 m, 1.710°   | 1.706 m, 3.018°     | 2.100 m, 1.931°          | 5.041 m, 1.132°      | 2.429 m, 1.948°

Table 2. Ablation study of model configuration on roof.
Model config   Avg
baseline       1.361 m, 2.272°
+TAF × 1       1.193 m, 2.041°
+TAF × 2       1.178 m, 1.850°
+aia           1.107 m, 1.763°
+RPR head      0.869 m, 1.710°

Table 3. Ablation study of dimension on roof.
Dimension   Avg
64          0.940 m, 2.217°
128         0.907 m, 1.787°
256         0.869 m, 1.710°
512         0.876 m, 1.773°

Table 4. Ablation study of encoder layers on roof.
Layers   Avg
2        0.986 m, 1.860°
4        0.900 m, 1.736°
6        0.869 m, 1.710°

Table 5. Ablation study of CNN backbone on roof.
Backbone       Avg
ResNet18       1.100 m, 2.556°
ResNet50       0.962 m, 1.978°
Efficient-net  0.869 m, 1.710°

5 Experiments

Besides evaluating on our own dataset, we also evaluate our method on 7Scenes [27] and the Cambridge Landmarks dataset [17] to verify that PTFormer is robust across diverse views and scenes. The evaluation metrics are the median errors of camera rotation (°) and translation (m).

BEV Dataset. Farmland and Bare ground are scenes with weak and repetitive textures. Roof and Urban area are both city scenes, while the urban area sequences are more sophisticated, containing moving ground objects and scene changes. Due to limited open-source methods and reproduction difficulties, in this part we select AtLoc [36] and MS-Transformer [26] for comparison, as shown in Table 1. When facing weak or repetitive scenes, i.e., farmland and bare ground, our method shows a more robust capability to capture the global feature. Our method also significantly outperforms the other two methods in urban scenes, especially in the urban area.

Ablation Study. In this section, we give step-by-step experiments to demonstrate the effectiveness of the designed modules, as shown in Table 2. The baseline model contains only a feature extractor, a Transformer encoder, and an APR head. First, we validate the efficiency of the two-step temporal aggregation module: the experiments show that the two-step fusion reduces the pose error by 0.183 m and 0.422°. The experiments with the AiA module replacing the original multi-head attention then show that inner attention can better exploit the correlation of query-key pairs. Finally, the additional RPR head further improves the accuracy of the APR, with a decrease of 0.238 m and 0.053°. Table 3 discusses the impact of the hidden dimension in the Transformer; we choose a dimension of 256 as a compromise. Table 4 analyses the effect of the number of encoder layers; considering the memory cost, we set it to 6. Table 5 shows the influence of different pre-trained CNN backbones, and we select EfficientNet as our pre-trained model.

7Scenes. 7Scenes is collected by a handheld Kinect camera, covering indoor scenes composed of RGB-D sequences with a spatial extent of 1 ∼ 10 m². Many scenes have repetitive or weak textures, making it challenging for camera pose estimation. The comparison between our method and the recent state-of-the-art
Table 6. Experiment results on the 7Scenes Dataset [27]. Results are cited directly, the best results are highlighted. Scene Scene scale

Chess 3 × 2 m2

Fire 2.5 × 1 m2

PoseNet15 [17] PoseNet16 [15] PoseNet17 [16] LSTM+Pose [35] RelocNet [1] Hourglass [22] BranchNet [37] VMLoc [40]

0.32 m, 0.37 m, 0.14 m, 0.24 m, 0.12 m, 0.15 m, 0.18 m, 0.10 m,

8.12◦ 7.24◦ 4.50◦ 5.77◦ 4.14◦ 6.17◦ 5.17◦ 3.70◦

0.47 m, 0.43 m, 0.27 m, 0.34 m, 0.26 m, 0.27 m, 0.34 m, 0.25 m,

14.4◦ 13.7◦ 11.80◦ 11.9◦ 10.4◦ 10.84◦ 8.99◦ 10.5◦

0.29 m, 0.31 m, 0.18 m, 0.21 m, 0.14 m, 0.19 m, 0.20 m, 0.15 m,

12.0◦ 12.0◦ 12.10◦ 13.7◦ 10.5◦ 11.63◦ 14.15◦ 10.80◦

0.48 m, 0.48 m, 0.20 m, 0.30 m, 0.18 m, 0.21 m, 0.30 m, 0.16 m,

7.68◦ 8.04◦ 5.77◦ 8.08◦ 5.32◦ 8.48◦ 7.05◦ 5.08◦

0.47 m, 0.61 m, 0.25 m, 0.33 m, 0.26 m, 0.25 m, 0.27 m, 0.20 m,

8.42◦ 7.08◦ 4.82◦ 7.00◦ 4.17◦ 7.01◦ 5.10◦ 4.01◦

0.59 m, 0.58 m, 0.24 m, 0.37 m, 0.23 m, 0.27 m, 0.33 m, 0.21 m,

8.64◦ 7.54◦ 5.52◦ 8.83◦ 5.08◦ 10.15◦ 7.40◦ 5.01◦

0.47 m, 0.48 m, 0.37 m, 0.40 m, 0.28 m, 0.29 m, 0.38 m, 0.24 m,

13.8◦ 13.1◦ 10.60◦ 13.7◦ 7.53◦ 12.46◦ 10.26◦ 10.00◦

0.44 m, 0.47 m, 0.24 m, 0.31 m, 0.21 m, 0.23 m, 0.29 m, 0.19 m,

10.4◦ 9.81◦ 7.87◦ 9.85◦ 6.73◦ 9.53◦ 8.30◦ 7.01◦

VidLoc [8] LsG [38] MapNet [5] GL-Net [39]

0.18 m, 0.09 m, 0.08 m, 0.08 m,

– 3.28◦ 3.25◦ 2.82◦

0.26 m, 0.26 m, 0.27 m, 0.26 m,

– 10.92◦ 11.69◦ 8.94◦

0.14 m, 0.17 m, 0.18 m, 0.17 m,

– 12.70◦ 13.25◦ 11.41◦

0.26 m, 0.18 m, 0.17 m, 0.18 m,

– 5.45◦ 5.15◦ 5.08◦

0.36 m, 0.20 m, 0.22 m, 0.15 m,

– 3.69◦ 4.02◦ 2.77◦

0.31 m, 0.23 m, 0.23 m, 0.25 m,

– 4.92◦ 4.93◦ 4.48◦

0.26 m, 0.23 m, 0.30 m, 0.23 m,

– 11.3◦ 12.08◦ 8.78◦

0.25 m, 0.19 m, 0.21 m, 0.19 m,

– 7.47◦ 7.77◦ 6.33◦

MS-Transformer [26] 0.11 m, 4.66◦ 0.24 m, 9.6◦ PTFormer (Ours) 0.10 m, 3.12◦ 0.26 m, 9.27◦

Heads 2 × 0.5 m2

Office 2.5 × 2 m2

Pumpkin 2.5 × 2 m2

Kitchen 4 × 3 m2

0.14 m, 12.19◦ 0.17 m, 5.66◦ 0.18 m, 4.44◦ 0.17 m, 5.94◦ 0.13 m, 12.10◦ 0.18 m, 5.46◦ 0.19 m, 4.10◦ 0.20 m, 4.41◦

Stairs 2.5 × 2 m2

0.26 m, 8.45◦ 0.24 m, 8.87◦

Avg.

0.18 m, 7.28◦ 0.19 m, 6.76◦

ones is listed in Table 6. Existing methods can be roughly categorized into three types: i) image-based APRs, ii) sequence-based APRs, and iii) Transformerbased APRs. Our method can be served as a combination of sequence-based and Transformer-based approaches. Basically, the sequenced-based methods perform better than image-based ones because of temporal correlation. Our method shows comparable performance with SOTA methods, which indicated that our method is robust on indoor scenes (Table 7). Cambridge Landmarks. The Cambridge Landmarks dataset is collected by mobile phone camera at Cambridge University, containing six outdoor scenes with moving pedestrians and weather changes. Following [26], we use four scenes that are commonly benchmarked by APRs. The categories of existing methods Table 7. Experiment results on the Cambridge Dataset [17]. Evaluations are cited directly. The average is taken on the first four datasets. The best results are highlighted. Scene Scene scale

| Method | College (5.6×10³ m²) | Shop (8.8×10³ m²) | Church (4.8×10³ m²) | Hospital (2.0×10³ m²) | Avg. |
|---|---|---|---|---|---|
| PoseNet15 [17] | 1.66 m, 4.86° | 1.41 m, 7.18° | 2.45 m, 7.96° | 2.62 m, 4.90° | 2.04 m, 6.23° |
| PoseNet16 [15] | 1.74 m, 4.06° | 1.25 m, 7.54° | 2.11 m, 8.38° | 2.57 m, 5.14° | 1.92 m, 6.28° |
| LSTM+Pose [35] | 0.99 m, 3.65° | 1.18 m, 7.44° | 1.52 m, 6.68° | 1.51 m, 4.29° | 1.30 m, 5.52° |
| PoseNet17 [16] | 0.99 m, 1.06° | 1.05 m, 3.97° | 1.49 m, 3.43° | 2.17 m, 2.94° | 1.43 m, 2.85° |
| PoseNet17+ [16] | 0.88 m, 1.04° | 0.88 m, 3.78° | 1.57 m, 3.32° | 3.20 m, 3.29° | 1.63 m, 2.86° |
| GL-Net [39] | 0.59 m, 0.65° | 0.50 m, 2.87° | 1.90 m, 3.29° | 1.88 m, 2.78° | 1.12 m, 2.40° |
| MapNet [5] | 1.07 m, 1.89° | 1.49 m, 4.22° | 2.00 m, 4.53° | 1.94 m, 3.91° | 1.63 m, 3.64° |
| MS-Transformer [26] | 0.83 m, 1.47° | 0.86 m, 3.07° | 1.62 m, 3.99° | 1.81 m, 2.39° | 1.28 m, 2.73° |
| PTFormer (Ours) | 0.71 m, 1.32° | 0.66 m, 2.80° | 1.17 m, 3.31° | 1.60 m, 2.69° | 1.04 m, 2.53° |


Our method achieves a minor improvement over MS-Transformer and GL-Net, which shows that PTFormer can also perform well on car-mounted data.

6 Conclusion

In this work, we build a novel, challenging BEV dataset for camera relocalization, which includes diverse scenes with large translation and rotation changes. We hope to provide a drone-specific benchmark for further research. At the same time, we provide a baseline method named PTFormer, a sequence-based Transformer for camera pose estimation with inner attention, a temporal progressive aggregation module, and parallel absolute and relative pose regression heads. Experiments show the effectiveness of our designed modules. PTFormer performs best on the BEV dataset and achieves comparable performance with SOTA methods on both the indoor 7Scenes dataset and the outdoor Cambridge Landmarks dataset. In future work, we will continue to expand the BEV dataset with more varied cases, such as data with illumination changes, the same scenes recaptured every few months, and more expansive flying areas. Altitude change is also a challenging problem, and we aim to synthesize images from different altitudes to make the model robust to altitude changes.

References 1. Balntas, V., Li, S., Prisacariu, V.: RelocNet: continuous metric learning relocalisation using neural nets. In: Ferrari, V., Hebert, M., Sminchisescu, C., Weiss, Y. (eds.) Computer Vision – ECCV 2018. LNCS, vol. 11218, pp. 782–799. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-01264-9 46 2. Brachmann, E., et al.: DSAC-differentiable RANSAC for camera localization. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 6684–6692 (2017) 3. Brachmann, E., Michel, F., Krull, A., Yang, M.Y., Gumhold, S., et al.: Uncertaintydriven 6d pose estimation of objects and scenes from a single RGB image. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 3364–3372 (2016) 4. Brachmann, E., Rother, C.: Learning less is more-6d camera localization via 3d surface regression. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 4654–4662 (2018) 5. Brahmbhatt, S., Gu, J., Kim, K., Hays, J., Kautz, J.: Geometry-aware learning of maps for camera localization. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2616–2625 (2018) 6. Cao, S., Snavely, N.: Minimal scene descriptions from structure from motion models. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 461–468 (2014)


7. Carion, N., Massa, F., Synnaeve, G., Usunier, N., Kirillov, A., Zagoruyko, S.: Endto-end object detection with transformers. In: Vedaldi, A., Bischof, H., Brox, T., Frahm, J.-M. (eds.) ECCV 2020. LNCS, vol. 12346, pp. 213–229. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-58452-8 13 8. Clark, R., Wang, S., Markham, A., Trigoni, N., Wen, H.: VidLoc: a deep spatiotemporal model for 6-DoF video-clip relocalization. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 6856–6864 (2017) 9. DeTone, D., Malisiewicz, T., Rabinovich, A.: SuperPoint: self-supervised interest point detection and description. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, pp. 224–236 (2018) 10. Dusmanu, M., et al.: D2- Net: a trainable CNN for joint detection and description of local features. In: CVPR 2019-IEEE Conference on Computer Vision and Pattern Recognition (2019) 11. En, S., Lechervy, A., Jurie, F.: RPNet: an end-to-end network for relative camera pose estimation. In: Leal-Taix´e, L., Roth, S. (eds.) ECCV 2018. LNCS, vol. 11129, pp. 738–745. Springer, Cham (2019). https://doi.org/10.1007/978-3-03011009-3 46 12. Fischler, M.A., Bolles, R.C.: Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography. Commun. ACM 24(6), 381–395 (1981) 13. Gao, S., Zhou, C., Ma, C., Wang, X., Yuan, J.: AiATrack: attention in attention for transformer visual tracking. In: Avidan, S., Brostow, G., Ciss´e, M., Farinella, G.M., Hassner, T. (eds.) ECCV 2022. LNCS, vol. 13682, pp. 146–164. Springer, Cham (2022). https://doi.org/10.1007/978-3-031-20047-2 9 14. Horn, B.K.: Closed-form solution of absolute orientation using unit quaternions. Josa a 4(4), 629–642 (1987) 15. Kendall, A., Cipolla, R.: Modelling uncertainty in deep learning for camera relocalization. In: 2016 IEEE International Conference on Robotics and Automation, pp. 4762–4769. IEEE (2016) 16. Kendall, A., Cipolla, R.: Geometric loss functions for camera pose regression with deep learning. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 5974–5983 (2017) 17. Kendall, A., Grimes, M., Cipolla, R.: PoseNet: a convolutional network for realtime 6-DoF camera relocalization. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 2938–2946 (2015) 18. Laskar, Z., Melekhov, I., Kalia, S., Kannala, J.: Camera relocalization by computing pairwise relative poses using convolutional neural network. In: Proceedings of the IEEE International Conference on Computer Vision Workshops, pp. 929–938 (2017) 19. Li, X., Ling, H.: GTCaR: graph transformer for camera re-localization. In: Avidan, S., Brostow, G., Cis´e, M., Farinella, G.M., Hassner, T. (eds.) ECCV 2022. LNCS, vol. 13670, pp. 229–246. Springer, Cham (2022). https://doi.org/10.1007/978-3031-20080-9 14 20. Li, Y., Snavely, N., Huttenlocher, D.P., Fua, P.: Worldwide pose estimation using 3D point clouds. In: Zamir, A.R.R., Hakeem, A., Van Van Gool, L., Shah, M., Szeliski, R. (eds.) Large-Scale Visual Geo-Localization. ACVPR, pp. 147–163. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-25781-5 8 21. Maddern, W., Pascoe, G., Linegar, C., Newman, P.: 1 year, 1000 km: the oxford robotcar dataset. Int. J. Robot. Res. 36(1), 3–15 (2017)


22. Melekhov, I., Ylioinas, J., Kannala, J., Rahtu, E.: Image-based localization using hourglass networks. In: Proceedings of the IEEE International Conference on Computer Vision Workshops, pp. 879–886 (2017) 23. Sarlin, P.E., Cadena, C., Siegwart, R., Dymczyk, M.: From coarse to fine: Robust hierarchical localization at large scale. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 12716–12725 (2019) 24. Sattler, T., Leibe, B., Kobbelt, L.: Efficient & effective prioritized matching for large-scale image-based localization. IEEE Trans. Pattern Anal. Mach. Intell. 39(9), 1744–1756 (2016) 25. Schleiss, M., Rouatbi, F., Cremers, D.: Vpair-aerial visual place recognition and localization in large-scale outdoor environments. arXiv preprint arXiv:2205.11567 (2022) 26. Shavit, Y., Ferens, R., Keller, Y.: Learning multi-scene absolute pose regression with transformers. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 2733–2742 (2021) 27. Shotton, J., Glocker, B., Zach, C., Izadi, S., Criminisi, A., Fitzgibbon, A.: Scene coordinate regression forests for camera relocalization in RGB-D images. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2930–2937 (2013) 28. Stenborg, E., Sattler, T., Hammarstrand, L.: Using image sequences for long-term visual localization. In: 2020 International Conference on 3d Vision, pp. 938–948. IEEE (2020) 29. Sun, J., Shen, Z., Wang, Y., Bao, H., Zhou, X.: LoFTR: detector-free local feature matching with transformers. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 8922–8931 (2021) 30. Sv¨ arm, L., Enqvist, O., Kahl, F., Oskarsson, M.: City-scale localization for cameras with known vertical direction. IEEE Trans. Pattern Anal. Mach. Intell. 39(7), 1455–1461 (2016) 31. Taira, H., et al.: InLoc: indoor visual localization with dense matching and view synthesis. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 7199–7209 (2018) 32. Tan, M., Le, Q.: EfficientNet: rethinking model scaling for convolutional neural networks. In: Proceedings of International Conference on Machine Learning, pp. 6105–6114. PMLR (2019) 33. Vallone, A., Warburg, F., Hansen, H., Hauberg, S., Civera, J.: Danish airs and grounds: a dataset for aerial-to-street-level place recognition and localization. IEEE Robot. Autom. Lett. 7(4), 9207–9214 (2022) 34. Vaswani, A., et al.: Attention is all you need. In: Advances in Neural Information Processing Systems, vol. 30 (2017) 35. Walch, F., Hazirbas, C., Leal-Taixe, L., Sattler, T., Hilsenbeck, S., Cremers, D.: Image-based localization using LSTMs for structured feature correlation. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 627–637 (2017) 36. Wang, B., Chen, C., Lu, C.X., Zhao, P., Trigoni, N., Markham, A.: AtLoc: attention guided camera localization. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 34, pp. 10393–10401 (2020) 37. Wu, J., Ma, L., Hu, X.: Delving deeper into convolutional neural networks for camera relocalization. In: 2017 IEEE International Conference on Robotics and Automation, pp. 5644–5651. IEEE (2017)


38. Xue, F., Wang, X., Yan, Z., Wang, Q., Wang, J., Zha, H.: Local supports global: deep camera relocalization with sequence enhancement. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 2841–2850 (2019) 39. Xue, F., Wu, X., Cai, S., Wang, J.: Learning multi-view camera relocalization with graph neural networks. In: 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 11372–11381. IEEE (2020) 40. Zhou, K., Chen, C., Wang, B., Saputra, M.R.U., Trigoni, N., Markham, A.: VMLoc: variational fusion for learning-based multimodal camera localization. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 35, pp. 6165–6173 (2021)

Adaptive Focal Inverse Distance Transform Maps for Cell Recognition

Wenjie Huang1, Xing Wu1(B), Chengliang Wang1, Zailin Yang2, Longrong Ran2, and Yao Liu2

1 Chongqing University, Chongqing 400000, China
{20172957,wuxing,wangcl}@cqu.edu.cn
2 Department of Hematology-Oncology, Chongqing Key Laboratory of Translational Research for Cancer Metastasis and Individualized Treatment, Chongqing University Cancer Hospital, Chongqing 400000, China
{zailinyang,liuyao77}@cqu.edu.cn

Abstract. The quantitative analysis of cells is crucial for clinical diagnosis, and effective analysis requires accurate detection and classification. Using point annotations for weakly supervised learning is a common approach for cell recognition, as it significantly reduces the labeling workload. Cell recognition methods based on point annotations primarily rely on manually crafted smooth pseudo labels. However, the diversity of cell shapes can render fixed encodings ineffective. In this paper, we propose a multi-task cell recognition framework. The framework utilizes a regression task to adaptively generate smooth pseudo labels with cell morphological features to guide the robust learning of the probability branch, and utilizes an additional branch for classification. Meanwhile, to address the issue of multiple high-response points in one cell, we introduce Non-Maximum Suppression (NMS) to avoid duplicate detection. On a bone marrow cell recognition dataset, our method is compared with five representative methods. Compared with the best-performing method, our method achieves improvements of 2.0 F1 score and 3.6 F1 score in detection and classification, respectively.

Keywords: Cell recognition · Point annotation · Distance transform · Multi-task learning · Proposal matching

1 Introduction

Diffuse large B-cell lymphoma (DLBCL) is a major form of human blood cancer. The proportion of DLBCL cells in bone marrow cells is an important basis for clinical cancer diagnosis. However, microscopic imaging often involves a large number of cells, and manual cell counting with the naked eye can be a burdensome task for doctors. With the increasing quality of microscopic imaging, utilizing computer-aided techniques to analyze digital cytology images for cancer screening [14] can significantly alleviate the workload of doctors and improve the accuracy of diagnosis. However, both object detection and semantic segmentation


rely on strong labels, which require a significant amount of labeling workload. Moreover, due to the high density of cells, strong labels may introduce additional interference information. Therefore, in the field of cell recognition, utilizing point annotations is a more common and practical approach. The most common method based on point annotations is to generate intermediate density maps to simulate the probability density of cells. Although this approach can significantly enhance the robustness of the model, cells often exhibit diverse morphological features, and using fixed encodings may regress cells into approximately circular shapes. Some studies [21–23] have indicated that these density map-based methods have limitations. Moreover, traditional density map-based methods can only count the number of objects and cannot precisely locate their positions. Some methods treat point detection as a set prediction problem [9–11], where a bipartite graph algorithm is used to match point proposals with optimal learning targets. However, these methods essentially involve non-smooth regression of the point proposals. Although they can achieve good results when the object features are prominent, when the input images are high resolution and exhibit numerous morphological and textural features, non-smooth labels make robust learning difficult.

Fig. 1. Regression demonstration in P2PNet. The proposal points near the cell center tend to converge towards the cell center after extracting meaningful features.

The FIDT map [13] demonstrated excellent crowd-counting performance in dense scenes. While it addresses the issue of object localization, the FIDT map assigns probability values to each pixel based solely on the Euclidean distance from the ground truth labels. In essence, it is a pseudo label that contains a significant amount of noise and regresses the objects into approximately circular shapes without incorporating information from the image. If we could correct the coordinate of each pixel, the generated intermediate map would be more accurate. P2PNet [9] performs open set prediction based on preset reference points on the original image. We observed that P2PNet regresses


the coordinate offsets between the reference points and the ground truth points (see Fig. 1). As a result, if a proposal point extracts the features of a cell, it tends to shift towards the nearest ground truth point, and the Euclidean distance from the point proposals to the ground truth points can be used to measure the similarity between them. In this paper, based on the above considerations, we extend the set prediction framework by introducing a probability branch for cell detection and a classification branch for cell classification. Specifically, our research makes the following contributions:

– We propose a multi-task cell recognition framework that achieves both cell localization and cell classification. The framework is based on set prediction and utilizes a regression task to adaptively generate an AFIDT map with cell morphological features to guide the robust learning of the probability branch, thereby achieving robust cell detection.
– We redesign the process of local maximum detection. Specifically, we apply non-maximum suppression (NMS) to the set of predicted points based on probability scores. This approach avoids duplicate detection of one cell and improves the accuracy of cell detection.
– To validate the effectiveness of our method, we construct a bone marrow cell recognition dataset. On this dataset, our method is compared with five representative methods and achieves the best performance.

2 Related Work

In general, deep learning-based cell recognition methods are primarily categorized into three types: segmentation-based methods, intermediate map-based methods, and detection-based methods.

Segmentation-Based Methods. Cell segmentation methods based on traditional image techniques include threshold-based methods [28] and watershed-based methods [29]; however, these non-data-driven methods lack robustness. Deep learning-based methods utilize masks that contain category information to perform cell segmentation and classification. With the rapid development of convolutional neural networks (CNN), using CNNs for cell segmentation has become a hot research topic [2,15]. Recently, some studies aimed to design better feature extraction networks using the self-attention structure [18]. Chen et al. [25] integrated the self-attention structure into the U-Net architecture. Ji et al. [24] used the self-attention structure to fuse multi-scale features. However, segmentation-based methods require complex manual annotations. Furthermore, due to the density and similarity of cells, the boundaries of cell segmentation are often ambiguous. Therefore, segmentation-based methods cannot directly help us accurately count the number of different types of cells.


Intermediate Map-Based Methods. A density map is a type of intermediate map that simulates the probability density of objects. Lempitsky et al. [12] first introduced the density map method into the task of object counting. Some researchers aimed to design better network architectures to improve the usability of density maps. Li et al. [16] designed a network architecture based on dilated convolution to understand highly congested scenes and generate high-quality density maps. Wan et al. [1] proposed a density estimation network with an adaptive density map generator. However, density map-based methods still cannot precisely locate the positions of objects. Some studies attempted to use intermediate maps for precise localization of objects. Liang et al. [4] proposed the repel encoding method to differentiate adjacent cells in crowded regions. Liang et al. [13] proposed the inverse distance transform map method, which exhibits excellent robustness in extremely dense scenes. Sugimoto et al. [17] designed an interactive network architecture for cell recognition. Zhang et al. [5] adopted a multi-task strategy to generate intermediate probability maps as additional supervision signals.

Detection-Based Methods. Cell detection methods based on traditional image techniques include hand-designed feature-based methods [26] and traditional machine learning-based methods [19]. In recent years, the most common approach in deep learning-based object detection algorithms is to regress bounding boxes [6–8]. In the cell recognition task, however, it is not necessary to outline entire cells; the focus is on locating the cells and classifying them. Some end-to-end detection methods directly perform cell recognition on point annotations. Zhou et al. [3] adopted a multi-task interactive framework to optimize both the detection and classification tasks. Some methods treat point localization and classification as set prediction problems. In contrast to traditional object detection, these methods only regress the coordinates of objects without regressing the entire bounding box. Song et al. [9] first proposed the set prediction framework based on point annotations. Liang et al. [10] used the extracted features and trainable embeddings as inputs to a transformer decoder for set prediction. Shui et al. [11] introduced this framework into the field of cell recognition and proposed a pyramid feature aggregation strategy to aggregate multi-level features.

3 Methods

In this section, we provide a detailed description of our method. In Sect. 3.1, we explain how our framework works during the training and inference stages. In Sects. 3.2–3.4, we explain the internal implementations of the three modules. In Sect. 3.5, we explain how we construct the loss functions.

3.1 Overall Framework

As shown in Fig. 2, during training the match module assigns the optimal learning target to each point proposal; we adopt the detection and regression tasks for


Fig. 2. The whole framework based on multi-task learning. The cell image first passes through a deep feature extraction network, then we adopt four tasks to make predictions for the reference points. For all point proposals, the detection task predicts detection score set, the regression task predicts offset set, the classification task predicts category set, and the probability task predicts probability set.

proposal matching. Then Ldet, Lreg, and Lcls are calculated based on the matching results. Meanwhile, in the AFIDT module, the regression task is used to regress the coordinates of all point proposals, and the predicted coordinates are used to generate the AFIDT map; Lprob is then calculated using the probability set and the reshaped AFIDT map. During inference, the detection branch is inactive. The probability set is reshaped into a two-dimensional probability map, and a local maximum detection algorithm [13] is applied to the probability map to obtain the predicted set of point proposals. Finally, in the NMS module, the NMS algorithm is used to suppress point proposals with lower probability scores, resulting in the final output.

3.2 AFIDT Module

Our method does not require restoring the feature map to the original image size. Instead, it generates the AFIDT map based on the coordinates of all predicted point proposals. Specifically, we add the regression values to the preset reference points to get the predicted coordinates of all point proposals, and the AFIDT map is then generated using these coordinates (Fig. 3). p̂i is the i-th probability pseudo label in the AFIDT map, which is defined as follows:

$$\hat{x}_i = x_i + k\,\Delta x_i, \quad \hat{y}_i = y_i + k\,\Delta y_i \tag{1}$$

$$\hat{p}_i = \frac{1}{D(\hat{x}_i, \hat{y}_i)^{\,\alpha D(\hat{x}_i, \hat{y}_i) + \beta} + C} \tag{2}$$


where (xi, yi) is the original coordinate of the i-th reference point, (Δxi, Δyi) is the predicted regression value for the i-th reference point, (x̂i, ŷi) is the coordinate of the i-th point proposal, k is a balance coefficient, and D(x̂i, ŷi) is the Euclidean distance between the i-th point proposal and its nearest ground truth point.
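To make the AFIDT computation concrete, the following NumPy sketch implements Eqs. (1)–(2) for a batch of proposals. The function name, array shapes, and the vectorised nearest-neighbour search are illustrative assumptions, not taken from the authors' implementation.

```python
import numpy as np

def afidt_pseudo_labels(ref_points, offsets, gt_points, k=0.5, alpha=0.02, beta=0.02, C=1.0):
    """Compute the AFIDT probability pseudo label p_hat for every point proposal.

    ref_points: (M, 2) preset reference point coordinates (x_i, y_i)
    offsets:    (M, 2) predicted regression values (dx_i, dy_i)
    gt_points:  (N, 2) ground truth cell centre coordinates
    """
    # Eq. (1): shift reference points by the (scaled) predicted offsets.
    proposals = ref_points + k * offsets                       # (M, 2)

    # D(x_hat_i, y_hat_i): Euclidean distance to the nearest ground truth point.
    diff = proposals[:, None, :] - gt_points[None, :, :]       # (M, N, 2)
    dist = np.sqrt((diff ** 2).sum(-1)).min(axis=1)            # (M,)

    # Eq. (2): focal inverse distance transform of the corrected coordinates.
    p_hat = 1.0 / (dist ** (alpha * dist + beta) + C)          # (M,)
    return p_hat
```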

Fig. 3. The differences between AFIDT map and FIDT map. Compared with the FIDT map, our AFIDT map incorporates morphological features of cells such as shape and size. Moreover, the boundaries of cells in the AFIDT map are more precise and realistic.

3.3 Match Module

The purpose of proposal matching is to assign the optimal learning target to each point proposal, enabling more effective learning and improving the accuracy of the classification and regression tasks. In our matching method, we consider the detection score of each point proposal and the Euclidean distance between each proposal point and each ground truth point to construct the cost matrix for one-to-one matching. The cost matrix D is defined as follows:

$$\mathrm{loc}_i = (x_i + \Delta x_i,\; y_i + \Delta y_i) \tag{3}$$

$$D = \left(\mu \lVert \mathrm{loc}_i - \mathrm{loc}^*_j \rVert_1 - p_i^{det}\right)_{i \in M,\; j \in N} \tag{4}$$

where loci and loc*j are the coordinates of the i-th point proposal and the j-th ground truth point, respectively, $p_i^{det}$ is the predicted detection score for the i-th point proposal, μ is a balancing coefficient, M is the total number of point proposals, and N is the total number of ground truth points, which is also equal to the number of positive proposals.
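A possible realisation of the one-to-one matching in Eqs. (3)–(4) is sketched below, using SciPy's Hungarian solver (`linear_sum_assignment`) as one way to obtain the assignment; the paper does not specify the solver, so this is an assumption, as are the function and argument names.

```python
import numpy as np
from scipy.optimize import linear_sum_assignment

def match_proposals(proposals, det_scores, gt_points, mu=0.05):
    """One-to-one matching between point proposals and ground truth points.

    proposals:  (M, 2) proposal coordinates loc_i = (x_i + dx_i, y_i + dy_i)
    det_scores: (M,)   predicted detection (foreground) scores p_i^det
    gt_points:  (N, 2) ground truth cell centres loc*_j
    Returns (pos_idx, gt_idx): indices of the N positive proposals and their targets.
    """
    # Eq. (4): cost combines an L1 location term and the detection score.
    l1 = np.abs(proposals[:, None, :] - gt_points[None, :, :]).sum(-1)  # (M, N)
    cost = mu * l1 - det_scores[:, None]                                # (M, N)

    # Hungarian assignment: each ground truth point gets exactly one proposal.
    pos_idx, gt_idx = linear_sum_assignment(cost)
    return pos_idx, gt_idx
```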

3.4 NMS Module

Although our method can generate AFIDT maps adaptively, the generation process does not strictly adhere to the principle of maximizing the probability at the cell center. As a result, there may be more than one high-response point in one cell.


When the local maximum detection radius is small, all of the high-response points may be detected, meaning that a cell may be detected multiple times. To address this problem, we use the Non-Maximum Suppression (NMS) algorithm at the end of our framework. Specifically, we perform NMS on the set of all predicted effective points based on the probability values, and the suppression distance is set to 12 pixels.
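The greedy point-level NMS described above could look like the following sketch; the implementation details (greedy ordering, the `point_nms` name) are assumptions, while the 12-pixel suppression distance comes from the text.

```python
import numpy as np

def point_nms(points, scores, radius=12.0):
    """Greedy NMS on predicted points: keep the highest-probability point and
    suppress any remaining point within `radius` pixels of it."""
    order = np.argsort(-scores)                    # highest probability first
    keep = []
    suppressed = np.zeros(len(points), dtype=bool)
    for i in order:
        if suppressed[i]:
            continue
        keep.append(i)
        d = np.linalg.norm(points - points[i], axis=1)
        suppressed |= d < radius                   # also marks i itself, already kept
    return np.array(keep, dtype=int)
```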

3.5 Loss Function

Probability Loss. The AFIDT map generated from the coordinates of the point proposals is reshaped to supervise the probability branch. We adopt the mean squared error (MSE) loss for probability training:

$$L_{prob} = \frac{1}{M}\sum_{i=1}^{M} \lVert \hat{p}_i - p_i \rVert^2 \tag{5}$$

where pi is the predicted probability value for the i-th point proposal, and p̂i is the probability pseudo label for the i-th point proposal.

Regression Loss. To calculate the regression loss between the predicted coordinates and the ground truth coordinates, we adopt the MSE loss for regression training:

$$\mathrm{loc}_i = (x_i + \lambda\Delta x_i,\; y_i + \lambda\Delta y_i) \tag{6}$$

$$L_{reg} = \frac{1}{N}\sum_{i=1}^{N} \lVert \mathrm{loc}_i - \mathrm{loc}^*_i \rVert^2 \tag{7}$$

where loci is the coordinate of the i-th positive proposal, and loc*i is the coordinate of the corresponding ground truth point. It is important to note that the coefficients in formulas (1), (3) and (6) may not necessarily be the same.

Detection Loss. In bone marrow smear images, stained cells are often not distributed too densely, resulting in a significant number of negative proposals. In order to enhance the detection of cells, we reduce the proportion of negative proposals in the loss computation. For the detection loss, we adopt a weighted cross-entropy (CE) loss for training:

$$L_{det} = -\frac{1}{M}\left( \sum_{i=1}^{N} \log\!\left(p_i^{obj}\right) + \omega \sum_{i=N}^{M} \log\!\left(p_i^{none}\right) \right) \tag{8}$$

where ω is a balance coefficient, $p_i^{obj}$ is the foreground score of the i-th positive proposal, and $p_i^{none}$ is the background score of the (i−N)-th negative proposal.


Classification Loss. Due to the relatively balanced number of the two cell types in these experiments, we adopt the CE loss for classification training:

$$L_{cls} = -\frac{1}{N}\sum_{i=1}^{N} \log\!\left(p_i^{cls}\right) \tag{9}$$

where $p_i^{cls}$ is the classification score of the i-th positive proposal.

Total Loss. The overall loss function is constructed as follows:

$$L_{total} = L_{prob} + \rho_c L_{cls} + \rho_r L_{reg} + \rho_d L_{det} \tag{10}$$

where ρc, ρr, and ρd are balance coefficients.
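For reference, a compact NumPy sketch of how Eqs. (5)–(10) could be combined is given below; the function signature and the `eps` stabiliser are illustrative assumptions.

```python
import numpy as np

def total_loss(p_prob, p_hat, pos_pred, pos_gt, p_obj, p_none, p_cls,
               rho_c=1.0, rho_r=0.01, rho_d=1.0, omega=0.5):
    """Sketch of Eqs. (5)-(10): combine the four task losses into L_total.

    p_prob, p_hat   : (M,)   predicted probabilities and AFIDT pseudo labels
    pos_pred, pos_gt: (N, 2) matched positive proposal / ground truth coordinates
    p_obj           : (N,)   foreground scores of positive proposals
    p_none          : (M-N,) background scores of negative proposals
    p_cls           : (N,)   scores of the ground truth class for positive proposals
    """
    M = len(p_prob)
    eps = 1e-8
    l_prob = np.mean((p_hat - p_prob) ** 2)                                   # Eq. (5)
    l_reg = np.mean(np.sum((pos_pred - pos_gt) ** 2, axis=1))                 # Eq. (7)
    l_det = -(np.sum(np.log(p_obj + eps))
              + omega * np.sum(np.log(p_none + eps))) / M                     # Eq. (8)
    l_cls = -np.mean(np.log(p_cls + eps))                                     # Eq. (9)
    return l_prob + rho_c * l_cls + rho_r * l_reg + rho_d * l_det             # Eq. (10)
```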

4 Experiments

4.1 Dataset

The diagnosis of DLBCL requires the exclusion of other cells that are pathologically similar to DLBCL cells [20]. We performed experiments on stained bone marrow smear images, which were collected using the most advanced equipment under different conditions. First, we used a sliding window of size 1800 × 1600 to extract 704 patches from the regions of interest (ROI) in high-resolution images. Subsequently, the images were downsampled to a resolution of 896 × 800 for training. The training set and test set were split in a ratio of 8:2. To determine the proportion of DLBCL cells in bone marrow cells, we invited two pathology experts to label all stained cells. The stained cells were labeled as either DLBCL or non-DLBCL, and the annotations were repeatedly checked to ensure their accuracy. A total of 12,164 DLBCL cells and 20,083 non-DLBCL cells were labeled in this dataset.

4.2 Implementation Details

In these experiments, we compared our method with five representative methods. Except for CSRNet [16], all five networks utilized VGG-16 bn [27] for feature extraction. The Adam optimizer was employed with a momentum of 0.9 and a learning rate of 1e−4, and the batch size was set to 4. The hyper-parameters were set as follows: μ = 0.05, ρc = 1, ρr = 0.01, ρd = 1, α = 0.02, β = 0.02, C = 1, ω = 0.5, k = 0.5, λ = 1. All algorithms used random flipping and cropping as data augmentation during training, and training was performed on a single NVIDIA 2080Ti GPU. Due to the high resolution of the bone marrow cell images, after an upsampling path, all tasks are performed on a feature map that is 1/8 the size of the original image. Additionally, reference points are placed every 8 pixels on the original image. For calculating metrics, the effective matching range was defined


as a radius of 12 pixels around the ground truth points. The intermediate map-based methods used local maximum detection to locate the positions of cells, with the detection radius set to 2 points. In terms of evaluation metrics, we adopted precision (P), recall (R), and F1 scores.

4.3 Experimental Results

Comparative Experiments. Our method was compared with five representative methods in the fields of crowd counting and cell recognition, including three set prediction-based methods [9–11] and two intermediate map-based methods [13,16].

Table 1. Comparison of cell recognition with different methods.

| Method | Detection P | Detection R | Detection F1 | Classification P | Classification R | Classification F1 |
|---|---|---|---|---|---|---|
| P2PNet [9] | 87.2 | 89.7 | 88.4 | 78.8 | 81.0 | 79.9 |
| Method in [11] | 87.0 | 87.3 | 87.1 | 78.3 | 78.5 | 78.4 |
| CLTR [10] | 85.6 | 84.1 | 84.8 | 77.6 | 78.4 | 78.0 |
| CSRNet [16] | 87.9 | 90.2 | 89.1 | 76.5 | 78.6 | 77.5 |
| FIDT [13] | 89.8 | 90.2 | 90.0 | 79.5 | 79.9 | 79.7 |
| Ours | 92.2 | 91.9 | 92.0 | 83.7 | 82.3 | 83.5 |

Adaptive Focal Inverse Distance Transform Maps for Cell Recognition

157

Table 2. Results of ablation experiments Method R = 3 R = 3 & NMS R = 1 R = 1 & NMS R = 2 R = 2 & NMS F1

83.2

83.2

80.9

83.3

83.4

83.5

detection radius is 1, using NMS can significantly increase the F1 score by 2.4. When the detection radius is 2, using NMS can achieves the best F1 score. When the detection radius is 3, NMS does not work, but it also does not decrease the F1 score. This result can be explained as the detection radius increases, there will be fewer instances of duplicate detection, and when the detection radius is large enough, there will be no duplicate detection.

Fig. 4. Visualization results of probability maps generated by three intermediate mapbased methods.

4.4

Visualization Analysis

From the Fig. 4, both FIDT [13] and CSRNet [16] regress all cells into approximate circular shapes, while our method can reflect the size and shape of cells on the probability map. When faced dense cells with irregular shapes, both FIDT and CSRNet perform poorly. They fail to correctly identify individual cells or mistakenly merge multiple cells into one. But our method accurately recognizes

158

W. Huang et al.

dense cells with irregular shapes, and the boundaries between cells are more defined (Fig. 5).

Fig. 5. Detection visualization results of three representative methods. We mark DLBCL cells and non-DLBCL cells in yellow and blue, respectively. (Color figure online)

5

Conclusion

In this paper, we propose a multi-task cell recognition framework that achieves both cell localization and cell classification. Specifically, our framework adopts a multi-task learning approach, utilizing the extracted morphological features from the regression branch to construct AFIDT map, then, we utilize the generated AFIDT map to supervise the probability branch. Unlike traditional intermediate map-based methods, our method allows the probability map to reflect various morphological features of cells and utilizes an additional branch for cell classification, which effectively improves the detection and classification performance. Additionally, we redesigned the post-processing method by introducing NMS to avoid duplicate detection of one cell. Our method achieves the best performance on the bone marrow cell recognition dataset. Finally, we visualized and analyzed the output of the different networks, validating the effectiveness of our method.

Adaptive Focal Inverse Distance Transform Maps for Cell Recognition

159

Acknowledgements. This work is supported by the Fundamental Research Funds for the Central Universities (No. 2022CDJYGRH-001) and the Chongqing Technology Innovation & Application Development Key Project (cstc2020jscx-dxwtBX0055; cstb2022tiad-kpx0148).

References 1. Wan, J., Chan, A.: Adaptive density map generation for crowd counting. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 1130–1139 (2019) 2. Zhou, Z., Rahman Siddiquee, M.M., Tajbakhsh, N., Liang, J.: UNet++: a nested U-Net architecture for medical image segmentation. In: Stoyanov, D., et al. (eds.) DLMIA/ML-CDS -2018. LNCS, vol. 11045, pp. 3–11. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-00889-5 1 3. Zhou, Y., Dou, Q., Chen, H., et al.: SFCN-OPI: detection and fine-grained classification of nuclei using sibling FCN with objectness prior interaction. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 32, no. 1 (2018) 4. Liang, H., Naik, A., Williams, C.L., et al.: Enhanced center coding for cell detection with convolutional neural networks. arXiv preprint arXiv:1904.08864 (2019) 5. Zhang, S., Zhu, C., Li, H., et al.: Weakly supervised learning for cell recognition in immunohistochemical cytoplasm staining images. In: 2022 IEEE 19th International Symposium on Biomedical Imaging (ISBI), pp. 1–5. IEEE (2022) 6. Ren, S., He, K., Girshick, R., et al.: Faster R-CNN: towards real-time object detection with region proposal networks. Adv. Neural Inf. Process. Syst. 28 (2015) 7. Tian, Z., Shen, C., Chen, H., et al.: FCOS: fully convolutional one-stage object detection. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 9627–9636 (2019) 8. Carion, N., Massa, F., Synnaeve, G., Usunier, N., Kirillov, A., Zagoruyko, S.: Endto-end object detection with transformers. In: Vedaldi, A., Bischof, H., Brox, T., Frahm, J.-M. (eds.) ECCV 2020, Part I. LNCS, vol. 12346, pp. 213–229. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-58452-8 13 9. Song, Q., Wang, C., Jiang, Z., et al.: Rethinking counting and localization in crowds: a purely point-based framework. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 3365–3374 (2021) 10. Liang, D., Xu, W., Bai, X.: An end-to-end transformer model for crowd localization. In: Avidan, S., Brostow, G., Ciss´e, M., Farinella, G.M., Hassner, T. (eds.) ECCV 2022, Part I. LNCS, vol. 13661, pp. 38–54. Springer, Cham (2022). https://doi. org/10.1007/978-3-031-19769-7 3 11. Shui, Z., Zhang, S., Zhu, C., et al.: End-to-end cell recognition by point annotation. In: Wang, L., Dou, Q., Fletcher, P.T., Speidel, S., Li, S. (eds.) MICCAI 2022, Part IV. LNCS, vol. 13434, pp. 109–118. Springer, Cham (2022). https://doi.org/10. 1007/978-3-031-16440-8 11 12. Lempitsky, V., Zisserman, A.: Learning to count objects in images. Adv. Neural Inf. Process. Syst. 23 (2010) 13. Liang, D., Xu, W., Zhu, Y., et al.: Focal inverse distance transform maps for crowd localization. IEEE Trans. Multimedia, 1–13 (2022) 14. Jiang, H., Zhou, Y., Lin, Y., et al.: Deep learning for computational cytology: a survey. Med. Image Anal., 102691 (2022)

160

W. Huang et al.

15. Ronneberger, O., Fischer, P., Brox, T.: U-Net: convolutional networks for biomedical image segmentation. In: Navab, N., Hornegger, J., Wells, W.M., Frangi, A.F. (eds.) MICCAI 2015, Part III. LNCS, vol. 9351, pp. 234–241. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-24574-4 28 16. Li, Y., Zhang, X., Chen, D.: CSRNet: dilated convolutional neural networks for understanding the highly congested scenes. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 1091–1100 (2018) 17. Sugimoto, T., Ito, H., Teramoto, Y., et al.: Multi-class cell detection using modified self-attention. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 1855–1863 (2022) 18. Vaswani, A., Shazeer, N., Parmar, N., et al.: Attention is all you need. Adv. Neural Inf. Process. Syst. 30 (2017) 19. Tikkanen, T., Ruusuvuori, P., Latonen, L., et al.: Training based cell detection from bright-field microscope images. In: 2015 9th International Symposium on Image and Signal Processing and Analysis (ISPA), pp. 160–164. IEEE (2015) 20. Li, D., Bledsoe, J.R., Zeng, Y., et al.: A deep learning diagnostic platform for diffuse large B-cell lymphoma with high accuracy across multiple hospitals. Nat. Commun. 11(1), 6004 (2020) 21. Bai, S., He, Z., Qiao, Y., et al.: Adaptive dilated network with self-correction supervision for counting. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 4594–4603 (2020) 22. Ma, Z., Wei, X., Hong, X., et al.: Bayesian loss for crowd count estimation with point supervision. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 6142–6151 (2019) 23. Wang, B., Liu, H., Samaras, D., et al.: Distribution matching for crowd counting. Adv. Neural. Inf. Process. Syst. 33, 1595–1607 (2020) 24. Ji, Y., et al.: Multi-compound transformer for accurate biomedical image segmentation. In: de Bruijne, M., et al. (eds.) MICCAI 2021, Part I. LNCS, vol. 12901, pp. 326–336. Springer, Cham (2021). https://doi.org/10.1007/978-3-030-87193-2 31 25. Chen, J., Lu, Y., Yu, Q., et al.: TransUNet: transformers make strong encoders for medical image segmentation. arXiv preprint arXiv:2102.04306 (2021) 26. Arteta, C., Lempitsky, V., Noble, J.A., Zisserman, A.: Learning to detect cells using non-overlapping extremal regions. In: Ayache, N., Delingette, H., Golland, P., Mori, K. (eds.) MICCAI 2012, Part I. LNCS, vol. 7510, pp. 348–356. Springer, Heidelberg (2012). https://doi.org/10.1007/978-3-642-33415-3 43 27. Simonyan, K., Zisserman, A.: Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556 (2014) 5 28. Otsu, N.: A threshold selection method from gray-level histograms. IEEE Trans. Syst. Man Cybern. 9(1), 62–66 (1979) 29. Chalfoun, J., Majurski, M., Dima, A., et al.: FogBank: a single cell segmentation across multiple cell lines and image modalities. BMC Bioinform. 15, 1–12 (2014)

Stereo Visual Mesh for Generating Sparse Semantic Maps at High Frame Rates Alexander Biddulph1,2(B) , Trent Houliston2 , Alexandre Mendes1 , and Stephan Chalup1 1

The University of Newcastle, Callaghan, Australia [email protected], {Alexandre.Mendes,Stephan.Chalup}@newcastle.edu.au 2 4Tel Pty Ltd., Newcastle, Australia {abiddulph,thouliston}@4tel.com.au https://www.newcastle.edu.au, https://www.4tel.com.au

Abstract. The Visual Mesh is an input transform for deep learning that allows depth independent object detection at very high frame rates. The present study introduces a Visual Mesh based stereo vision method for sparse stereo semantic segmentation. A dataset of simulated 3D scenes was generated and used for training to show that the method is capable of processing high resolution stereo inputs to generate both left and right sparse semantic maps. The new stereo method demonstrated better classification accuracy than the corresponding monocular approach. The high frame rates and high accuracy may make the proposed approach attractive to fast-paced on-board robot or IoT applications. Keywords: Deep Learning

1

· Stereo Vision · Semantic Segmentation

Introduction

For the task of detecting objects of interest in highly structured environments, such as autonomous driving [6,32], robotic soccer [7,34], or navigation in marine environments [5,24], dense depth and semantic predictions are not required. If the sizes of the objects of interest are known and their boundaries can be accurately determined then there exist simple and efficient algorithms to calculate the distances between the camera and the objects. For example, the formula D = r/ sin δ provides the distance to a spherical object, like a soccer ball, that has an angular radius, δ, and an actual radius, r. The present study focuses on high-speed semantic classification and leaves the determination of distances as a separate task. The Visual Mesh input transform [12] achieves object detection at exceptionally high frame rates by warping the input space in a way that allows sampling A. Biddulph was supported by an Australian Government Research Training Program scholarship and a top-up scholarship through 4Tel Pty Ltd. S. Chalup was supported by ARC DP210103304. c The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2024  B. Luo et al. (Eds.): ICONIP 2023, LNCS 14452, pp. 161–178, 2024. https://doi.org/10.1007/978-981-99-8076-5_12

162

A. Biddulph et al.

of the target object using a fixed number of pixels, independently of the object’s distance to the camera. The convolutional neural network (CNN) with Visual Mesh proposed in [12] was only trained to detect a single class of objects from a single view of the scene, leaving open the question of whether multi-class classifications, both from a single view and from multiple views, would be possible with this type of input transformation. The main contributions of the present paper are: 1. Training the Visual Mesh to show that it is capable of detecting multiple classes. 2. Expanding the concept of the Visual Mesh to include sparse stereo semantic segmentation, receiving a stereo image pair as input and simultaneously producing a sparse stereo semantic segmentation map (SSM) for both the left and right views. 3. Show that the proposed Stereo Visual Mesh can achieve better classification accuracy than the mono version at comparable computational expense.

2

Related Work

There are very few works which deal with stereo semantic segmentation. In the context of the present work a “stereo semantic segmentation system” is defined as a system which directly computes a SSM, or a pair of SSMs, from an input pair of stereo images. While there are many works which compute a SSM in a stereo context, these works typically only compute a SSM on a single input image, and then either use this SSM to refine a disparity map, as is done by [22,38], or use a disparity map to refine the SSM, as is done by [2,26] who use multiple U-Net [31] models to compute 12 SSMs. The SSM are then refined with the aid of a disparity map obtained from PSMNet [3]. A disparity map is used by [4] to refine a SSM. Visible and near-infrared (VNIR) stereo images are used to compute the disparity map. Single RGB and multi-spectral images are used to compute semantic features which are then fused together with the output of a disparity fusion segmentation network which fuses disparity features with stereo semantic features. The fused SSMs are postprocessed with the computed disparity map. A 3D bounding box detection methods is proposed by [17]. A ResNet38 [11] encoder is used to extract image features which are then used to compute a SSM and bounding box detection and regression. A disparity map is computed and used to convert the 2D bounding box proposals into 3D bounding boxes. Fan et al. [9] use a depth image to compute surface normals which are then fed into a DenseNet [15] and ResNet [11] inspired network to reinforce semantic features for the purposes of estimating free space in the scene. The following works have been identified as performing stereo semantic segmentation, where either a single SSM or a pair of SSMs are generated as output: – [25,36] use Graph-Cut and energy minimisation techniques to perform foreground/background segmentation with the aid of disparity maps and user input.

Stereo Visual Mesh

163

– [4,8,40] use stereo image features combined with depth features to compute a single SSM. – Finally, [23] use the encoder portion of the ResNet50 [11] network to compute features on both the left and right input images, and a custom decoder network to compute a single SSM.

C φn φn+ 1

r

2

h − r φn+1

O E

H r I

F φn D A

φn+1 O

G

Δθ B

C

Fig. 1. Left: Depicted is the location of the observation point, C = (0, 0, 0), and its height, h, above the horizontal observation plane at the bottom. I is the vertical projection of C. The relationship between φn and φn+1 is also shown that determine concentric “φ-rings” around I in the observation plane via their intersection points A and B. Right: A top-down view of the observation plane shows how Δθ is measured and provides a radial subdivision into pizza slice style sections.

3

The Visual Mesh

The Visual Mesh [12,13] method calculates a graph that overlays the input image and serves as a preprocessing stage for CNNs. Initially designed as a single-object detector, the Visual Mesh aims to ensure a constant pixel-sampling density over an input image in such a way that no matter where the object of interest appears in the image, provided the object is below the horizon and closer than a predefined maximum distance, d, it will always intersect the Visual Mesh at a fixed predefined number of k points. Figure 1 shows the construction of the Visual Mesh in lateral view (left) and top view (right) where a sphere of radius r is used as the target object. A number of rays are sent out, from the camera’s observation point, C, to the observation plane where the intersections A and B draw φ rings around I that are subdivided by radial θ sections. The final construction takes a number of parameters into account, including the distance of the φ rings from the origin, the number of intersections with the target, k, the maximum projected distance, d, and the size of the target, as explained in more detail in [12,13]. For example, for the task of detecting a soccer ball on a soccer field, a sphere with the same radius as the ball, r, can be used as the geometric model and the

164

A. Biddulph et al.

soccer field can be used as the observation plane. From there simple trigonometry, as depicted in Fig. 1, can be used to solve for the coordinates of camera rays, expressed as unit vectors, that will result in the specified number of intersections, k, with the sphere. These rays are then joined to their nearest neighbours in order to create the Visual Mesh graph. The original Visual Mesh [12] used a 6-neighbour ring graph structure and this same graph structure is adopted in this work to facilitate a more direct comparison with the original work. While the Visual Mesh itself is lens-agnostic, in order to project the 3D unit vectors into the image plane to obtain 2D pixel coordinates, it requires knowledge of the camera’s lens parameters; focal length, field of view, optical centre offset, projection type, distortion parameters, and image resolution. The Visual Mesh has been shown [12] to allow for much smaller CNNs to be used, resulting in much fewer parameters and much higher inference speeds than other state-of-the-art object detectors, in the order of 450 fps on an Nvidia GTX1080Ti on 1280 × 1024 RGB images. Furthermore, due to the constant sampling density of the Visual Mesh graph, scale invariance is achieved for objects conforming to the geometric model, resulting in consistent detections over a large range of distances, something that other state-of-the-art networks, such as YOLO [28–30] and MobileNet [14], failed to achieve, see [12, Fig. 5(c)]. Finally, sampling pixels via the graph structure of the Visual Mesh results in convolution-like operations, with the underlying structure of the graph allowing for non-square, in our case hexagonal, filter shapes.

4

The Stereo Visual Mesh

The Visual Mesh should be able to improve its classification performance by incorporating data from a different viewpoint. In this way, objects that appear partially occluded in one viewpoint may appear less occluded in the other viewpoint allowing the Visual Mesh to still see and classify these objects. Given two Visual Meshes, each one centred underneath a camera of a stereo system, the question of how to link these two meshes together and how much information to share between the two views of the scene arises. Regarding linking the two Visual Meshes together, two options are immediately apparent. Every node in the Visual Mesh graph has a unique index. If we are using the same graph structure for both Visual Meshes we could simply link the same index in both Visual Meshes. Unfortunately, if the relative position or orientation of the two cameras with respect to the observation plane differ too much it is possible for a large portion of the indices that are on-screen in one view to be entirely off-screen in the other view, resulting in a lot of dead links. As a result, this linking scheme will preclude the possibility of a multi-view setup, and will also preclude the option of using different graph structures for each camera. However, if these constraints are not too restrictive, this linking scheme is very simple to implement, although sampling the same pixel coordinate in both images may not necessarily result in useful information being shared between the views.

Stereo Visual Mesh

165

Alternatively, linking the Visual Mesh nodes in the left and right mesh that appear visually closest to each other will resolve most of the issues that are present in the first option, but with extra computational overhead. Different Visual Mesh graph structures, different camera positioning, and more than two cameras are possible with this linking scheme, provided there is a large enough overlap in the visual fields of all cameras. The steps in the implementation of this linking scheme are as follows 1. Given a ray corresponding to a node in the left Visual Mesh, project it into the view of the right camera (a) Find the intersection between the Visual Mesh ray and the observation plane (b) Shift by the translation between the two cameras (c) Re-normalise the ray with respect to the right camera 2. Search the right Visual Mesh to find the node that is closest to the projected ray 3. Repeat steps 1 and 2 for every node in the left Visual Mesh Steps 1–3 describe how to find a matching node in the right Visual Mesh for every node in the left Visual Mesh. To allow for different mesh geometries for each camera and arbitrary camera baselines, to find a matching node in the left Visual Mesh for every node in the right Visual Mesh steps 1–3 need to be repeated for the right camera. Sections 4.1 to 4.3 detail the algorithms involved in performing step 2. Regarding how much information to share between the two views at least two options present themselves. Figure 2 provides a graphical depiction of the options. Given nodes L0 · · · L6 from the left Visual Mesh, we can share only the node from the right Visual Mesh that is visually closest to L0 , R0 in this case. Alternatively, we can share the visually closest node, R0 , along with that node’s immediate neighbours, R1 · · · R6 . In this paper we only consider sharing R0 , as it is anticipated that the extra overhead in including R1 · · · R6 will outweigh any performance benefits that may be gained.

L1

L3

L2

L0

L5

R1

L4

L6

R3

R2

R0

R5

R4

R6

Fig. 2. Given a 6-neighbour graph structure on both the left and right either the nearest link, R0 (in red), can be shared between views, or the nearest link and its neighbours, R0 · · · R6 (the red link plus the blue links) can be shared. (Color figure online)

166

A. Biddulph et al.

Figure 3 depicts the relationship between the ground truth labels in the dataset, Sect. 6. Given a pair of left and right images from the dataset and their ground truth labels a Visual Mesh was projected on to both images and the stereo links were computed. For every L0 in the dataset, Fig. 2, the relationship between the labels for L0 and R0 was determined and is shown in Fig. 3. The figure shows, e.g., that when L0 had the label of “Ball”, 85.1% of the time R0 also had the “Ball” label and 8.8% of the time R0 had the “Field” label.

Fig. 3. Dataset-specific quantitative relationships between stereo link ground truth labels. How frequently L0 (y-axis) has the same ground truth label as R0 (x-axis). See Fig. 2 for definitions of L0 and R0 .

4.1

Linking Two Visual Meshes

Given a Visual Mesh for each camera, finding the visually closest node in the right Visual Mesh for a given node in the left Visual Mesh can take a number of forms. The naive option would involve a brute force search of every node in the right Visual Mesh. However, depending on the exact parameters of the Visual Meshes, each could contain over 100,000 nodes. This search would be very slow and ignores the connected structure of the Visual Mesh graph. By leveraging the connected structure of the Visual Mesh we can select a random starting node and then move to the neighbouring node which has the shortest distance to the query ray and repeat this until the search arrives at the node that is closest to the query ray. This method is listed as “Graph” in Table 1. By employing a binary space partition (BSP) to partition the Visual Mesh the search for the visually closest Visual Mesh node can start from a randomly selected Visual Mesh node from the BSP leaf node that encloses the query ray. From this starting node the search algorithm proceeds as was described for the “Graph” method. We call the just described third method “BSP+Graph”. Table 1 provides an overview of the search times for the three different search methods discussed. The table shows that the overhead associated with setting

Stereo Visual Mesh

167

up and maintaining the BSP provides an almost 4000-fold decrease in search time when finding the visually closest node. Table 1. Search algorithm run times. Tests were performed with a pair of Visual Meshes, each containing 98,900 nodes and using the 6-neighbour graph structure. Total time is the time taken to find the visually closest node in the right Visual Mesh for each node in the left Visual Mesh. The average search time is how long, on average, it would take to find the visually closest node for a single node in the left Visual Mesh. Method

Avg. Time (µs) Total Time (ms)

BSP+Graph 6.24

4.2

617

Graph

207.89

20,560

Brute Force

23,692.66

2,343,204

Binary Space Partition

The BSP is created during construction of the Visual Mesh. Each node in the BSP tree stores the φ and θ ranges that enclose the Visual Mesh rays spanned by the current tree node. Figure 1 depicts these quantities. Each BSP node also stores a minimum bounding cone that contains all the Visual Mesh rays spanned by the current tree node. The root node of the BSP tree has a cone axis of (0, 0, −1) and the internal angle is set to the largest z-component of all Visual Mesh rays, since Visual Mesh rays are also unit vectors this corresponds to the angle between the z-axis and the ray. The Visual Mesh rays are then partitioned into two sets based on the sign of the y-component of the rays. These two sets form the two children of the BSP root node. This partitioning scheme simplifies the calculation and comparison of φ and θ later on. For each child node in the BSP tree a cone, with vertex at the observation point, is created with an internal angle just large enough to ensure all Visual Mesh rays spanned by the current tree node are encompassed by the cone. The range of φ and θ values for the current set of Visual Mesh rays is determined, and the largest range is found. Finally, the current set of rays are partitioned using the average value of the largest range as the partition point. The average is used as it is simpler to calculate than the median as the average does not require the data to be sorted. By representing each ray in the Visual Mesh by a unit vector, we can calculate φ as the z-component of the ray. To avoid using trigonometric functions, √ θ is calculated as the normalised x-component of the ray, θ = vx/ 1−vz2 . This introduces the possibility of θ becoming infinite when v = (0, 0, −1). Since this calculation is only used for partitioning the Visual Mesh rays this has the effect of always forcing the origin point into the second half of the partition.


Traversing the BSP tree is implemented as a recursive algorithm, terminating when a leaf node of the BSP tree is reached. As with the creation of the BSP tree, the first decision point is treated specially: the sign of the y-component of the target ray is used to determine which child node to traverse. For every subsequent decision point, we calculate the φ and θ values of the target ray. If both φ and θ are contained within the φ and θ ranges of the first child node then we traverse down that branch, otherwise we traverse down the other branch.
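A minimal recursive descent over such a tree might look as follows; the node attributes (children, phi_range, theta_range, nodes, is_root) are hypothetical names, and the sketch only mirrors the decision rules described above.

```python
def bsp_descend(node, ray):
    """Descend a BSP tree to the leaf whose phi/theta range encloses `ray`
    and return one of the Visual Mesh nodes stored in that leaf."""
    if not node.children:                 # leaf: any enclosed mesh node will do
        return node.nodes[0]

    if node.is_root:                      # first split uses the sign of y
        child = node.children[0] if ray[1] >= 0 else node.children[1]
        return bsp_descend(child, ray)

    phi = ray[2]
    theta = ray[0] / (1.0 - ray[2] ** 2) ** 0.5
    lo_phi, hi_phi = node.children[0].phi_range
    lo_theta, hi_theta = node.children[0].theta_range
    if lo_phi <= phi <= hi_phi and lo_theta <= theta <= hi_theta:
        return bsp_descend(node.children[0], ray)
    return bsp_descend(node.children[1], ray)
```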

4.3 Visual Mesh Graph Traversal

To traverse the Visual Mesh graph, we measure the distance between each neighbour of the current Visual Mesh node and our target ray, move to the neighbour with the shortest distance to the target, and repeat. To prevent the possibility of infinite searches, a distance threshold is introduced: if none of the neighbours of the current node decreases the distance to the target by more than this threshold, the search terminates. Empirically, we found that k⁻¹, where k is the number of intersections used at Visual Mesh construction, works well.
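The greedy walk can be sketched as below; the node interface and the distance callable are assumptions, with the threshold set to k⁻¹ as suggested above.

```python
def greedy_walk(start, target_ray, distance, threshold):
    """Greedy walk over the Visual Mesh graph towards the node whose ray is
    closest to `target_ray`.

    `start` is any mesh node exposing `neighbours` and `ray` (illustrative
    names); `distance` is the graph-based metric d_o of Eqs. (1)-(3), and
    `threshold` would be k**-1 for k intersections.
    """
    current = start
    best = distance(current.ray, target_ray)
    while True:
        candidates = [(distance(n.ray, target_ray), n) for n in current.neighbours]
        d, nearest = min(candidates, key=lambda c: c[0])
        if best - d <= threshold:   # no neighbour improves by more than the threshold
            return current
        current, best = nearest, d
```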

Fig. 4. The relationship between h and h′, φ_0 and φ_0′, and φ_1 and φ_1′. C represents the observation position, O_0 and O_1 represent the two objects that we wish to measure the distance between, and v_0^p and v_1^p represent the vectors pointing from the observation point to the centres of the objects.

While a Euclidean distance metric could be used for calculating the distance between Visual Mesh nodes, a graph-based distance metric is more appropriate. Given the observation height, h, the angle from the z-axis, φ, and the height of the object's centre above the observation plane, r, it is possible to calculate how many objects, n(h, φ, r), can fit between the object that would be pointed to by our target ray and the origin using Eq. (1).

$$n(h, \phi, r) \overset{\mathrm{def}}{=} \frac{\operatorname{arcsinh}\left(\tan(-\phi)\right)}{\ln\left(1 - \frac{2r}{h}\right)} \tag{1}$$

However, in order to use Eq. (1) to calculate the object distance between two different objects, a consistent observation height, h′, for both objects is required. If v_0 and v_1 are unit vectors pointing to the centre of each object, we obtain their projections onto a plane that is mutually orthogonal to both objects, v_0^p and v_1^p. The consistent observation height, h′, can then be calculated using h′ = ‖v_0^p × v_1^p‖ / ‖v_0^p − v_1^p‖. From the projected vectors, v_0^p and v_1^p, and the consistent observation height, h′, we can calculate the new angles

$$\phi_0' = \arccos\left(\frac{h'}{\|v_0^p\|}\right) \quad\text{and}\quad \phi_1' = \arccos\left(\frac{h'}{\|v_1^p\|}\right). \tag{2}$$

Figure 4 depicts the relationship between these computed quantities. Finally, Eq. (3) is used to obtain a count, d_o, of the number of objects that would fit between v_0 and v_1 when observed from a height of h′.

$$d_o = \left|\, n(h', \phi_0', r) - n(h', \phi_1', r) \,\right| \tag{3}$$
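The following sketch evaluates Eqs. (1)–(3); the projection of v_0 and v_1 onto the mutually orthogonal plane is not reproduced here, so the inputs are assumed to already be the projected vectors v_0^p and v_1^p, and the function names are our own.

```python
import numpy as np

def n_objects(h, phi, r):
    """Eq. (1): number of objects that fit between the object pointed to by a
    ray at angle phi and the origin, observed from height h."""
    return np.arcsinh(np.tan(-phi)) / np.log(1.0 - 2.0 * r / h)

def object_distance(v0p, v1p, r):
    """Graph-based distance d_o between the projected vectors v0p and v1p
    (Eqs. 2-3): derive the consistent height h', the two angles, and difference
    the object counts."""
    v0p = np.asarray(v0p, dtype=float)
    v1p = np.asarray(v1p, dtype=float)
    h = np.linalg.norm(np.cross(v0p, v1p)) / np.linalg.norm(v0p - v1p)
    phi0 = np.arccos(h / np.linalg.norm(v0p))
    phi1 = np.arccos(h / np.linalg.norm(v1p))
    return abs(n_objects(h, phi0, r) - n_objects(h, phi1, r))
```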

5 CNN Architecture

To reduce the variability in training, a standard CNN architecture was chosen for all Visual Mesh networks. The chosen architecture is large enough to produce satisfactory classification results on the chosen dataset without being so large that it compromises the fast inference speed of the Visual Mesh [12]. Table 2 provides the layer details. Each layer is preceded by a Visual Mesh gathering step which samples the pixels corresponding to the Visual Mesh graph structure. For the first layer, the gathering step samples the raw pixel values from the input image, while all succeeding layers sample the outputs of the previous layer. All layers except the last use a SELU [16] activation function, while the last layer uses a softmax [10, Chapter 6.2.2.3] activation function. All layer weights are initialised using a truncated normal distribution, N(0, 1/n), where n is the number of inputs to the layer, and any weights that are further than two standard deviations from the mean are discarded and redrawn [16,18].
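For concreteness, the weight initialisation just described could be expressed in Keras as follows; the layer type and wiring are illustrative assumptions (a real Visual Mesh layer operates on gathered neighbourhoods, so n would be N_i · N_g), and only the initialiser semantics are taken from the text.

```python
import math
import tensorflow as tf

def visual_mesh_dense(n_inputs, n_outputs, final=False):
    """Dense layer with truncated-normal N(0, 1/n) weights; Keras'
    TruncatedNormal discards and redraws values beyond two standard
    deviations, matching the description in the text."""
    init = tf.keras.initializers.TruncatedNormal(
        mean=0.0, stddev=math.sqrt(1.0 / n_inputs))
    activation = "softmax" if final else "selu"
    return tf.keras.layers.Dense(n_outputs,
                                 activation=activation,
                                 kernel_initializer=init)
```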

Table 2. Network architecture for all trained networks. Dotted rows indicate a row that is identical to the previous non-dotted row.

Layer         Width   Parameters (Mono)   Parameters (Stereo)
1 (input)     16      352                 400
2             16      1808                2064
···           ···     ···                 ···
7             16      1808                2064
8             8       904                 1032
9             8       456                 520
···           ···     ···                 ···
12            8       456                 520
13 (output)   6       342                 390

Total Parameters      14,270              16,286


The parameter count for each layer is computed as P = N_i N_g N_o + N_o, where N_i and N_o are the number of inputs and outputs to/from the layer, respectively, and N_g is the number of nodes in the graph structure. For the mono network N_g = 7, comprising L_0, ..., L_6 shown in Fig. 2, and for the stereo network N_g = 8, comprising L_0, ..., L_6 and the stereo link, R_0, discussed in Sect. 4. For example, the first mono layer has N_i = 3, N_g = 7 and N_o = 16, giving P = 3 × 7 × 16 + 16 = 352, matching Table 2. The Linked Mono network, discussed in Sect. 7, has the same number of parameters as the Stereo network.

To further reduce the variability in training, a standard set of training hyperparameters was chosen for all Visual Mesh networks. Table 3 shows the settings used for all experiments in this paper. During training, the height and orientation of the camera with respect to the observation plane are augmented: a small perturbation is added to the height and, for the orientation, a rotation matrix is formed by rotating a small amount around a randomly constructed unit vector. This rotation matrix is then applied to the orientation of the camera.

Table 3. Visual Mesh parameters for all trained networks. Maximum distance is the furthest distance that the Visual Mesh is projected from the camera in all directions.

Hyperparameter       Setting
Graph Structure      6-neighbour graph
Maximum Distance     20 m
Intersections        6
Geometric Model      Sphere - Radius 0.095 m
Data Augmentations   Height ∼ N(0, 0.05); Rotation ∼ N(0, 0.08727)
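A possible implementation of the height and rotation augmentations listed in Table 3, using Rodrigues' formula for the random rotation; the pose representation and composition order are assumptions rather than the authors' code.

```python
import numpy as np

def augment_camera_pose(height, Hoc, rng=None):
    """Perturb the camera height with N(0, 0.05) and rotate the 3x3 camera
    orientation `Hoc` by an angle drawn from N(0, 0.08727) about a random
    unit axis, as described in the text."""
    rng = np.random.default_rng() if rng is None else rng
    new_height = height + rng.normal(0.0, 0.05)

    axis = rng.normal(size=3)
    axis /= np.linalg.norm(axis)
    angle = rng.normal(0.0, 0.08727)
    K = np.array([[0.0, -axis[2], axis[1]],
                  [axis[2], 0.0, -axis[0]],
                  [-axis[1], axis[0], 0.0]])
    R = np.eye(3) + np.sin(angle) * K + (1.0 - np.cos(angle)) * (K @ K)
    return new_height, R @ Hoc
```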

6 Dataset

To the knowledge of the authors there are no publicly available stereo datasets that provide semantic segmentation ground truth labels for both the left and right views while also providing the camera lens parameters and extrinsics that the Visual Mesh requires, necessitating the creation of a custom dataset. The created dataset consists of stereo images taken from a humanoid robot model with a stereo camera setup in various poses in a robotic soccer simulation using the Webots simulation environment [21]. The segmentation masks identify 9 classes (Table 4). In practice, the 9 classes are reduced to 6 by combining the multiple goal classes into a single goal class and the multiple robot classes into a single robot class. All images have a resolution of 640 × 480 and are taken with a rectilinear camera lens with a 90° field of view and a 1.98 mm focal length. Input images are stored as 3-channel JPEGs while the ground truth segmentation masks are stored as 4-channel RGBA PNGs.


Table 4. The classes of objects in the dataset and the labels assigned to them.

Index   Class               Colour
0       Ball                red
1       Goal (large)        yellow
1       Goal (small)        olive
2       Field line          white
3       Field               green
4       Robot (self)        gray
4       Robot (red team)    magenta
4       Robot (blue team)   cyan
4       Robot (other)       charcoal
5       Environment         black

The alpha channel in the mask images allows the Visual Mesh to ignore those pixel labels, effectively making any pixel with a zero alpha channel unlabelled. However, this dataset does not have any unlabelled pixels. Figure 5 gives an example of the images in the dataset.

Fig. 5. Sample left input image with corresponding ground truth segmentation mask.

In total, the dataset contains 65,138 stereo image pairs and this is split 45%/10%/45% for training/validation/testing. This results in 29,132 stereo image pairs in each of the training and testing sets and 6514 stereo image pairs in the validation set.

7 Linked Mono Network

Table 2 shows that the stereo network has 2,016 extra trainable parameters over the mono network. The source of these extra parameters is the stereo link, which effectively widens the layers in the network. Because of this, it is possible that any increase in performance over the mono network could be the result of these extra parameters rather than of the extra stereo data. To account for this, a mono network with an equivalent number of parameters was trained. This was achieved by feeding the centre node, L_0 in Fig. 2, a second time in lieu of the stereo link, R_0. In this way, the network had the extra parameters available for learning without any new data being presented to the network.
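One way to realise the Linked Mono variant is to duplicate the centre-node index in the gather table that feeds each layer, as in the sketch below; the array layout is an assumption, not the authors' data structure.

```python
import numpy as np

def gather_indices(neighbour_index, stereo_index=None):
    """Build per-node gather indices for one Visual Mesh layer.

    `neighbour_index` is an (N, 7) array holding each node and its six
    neighbours (L0..L6); `stereo_index` is an (N,) array holding the stereo
    link R0. For the Linked Mono network the stereo column is simply L0
    repeated, so the layer widths match the Stereo network without adding
    any new information.
    """
    if stereo_index is None:                      # Linked Mono: duplicate L0
        stereo_index = neighbour_index[:, 0]
    return np.concatenate([neighbour_index, stereo_index[:, None]], axis=1)
```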

8 Training

Training was performed in TensorFlow [1] on a number of different GPUs. All training was performed over 500 epochs with 1000 batches per epoch. A validation epoch was performed at the end of every training epoch. Batch size and learning rate parameters are summarised in Table 5.

Table 5. Summary of batch size and learning rate parameters across the different GPUs used for training. Only the training batch size is shown; the validation batch size is twice the training batch size, and the testing batch size is twice the validation batch size. Batch accumulation was used when necessary, see column 3.

GPU                RAM     Batch Size   LR
Nvidia GTX1070     8 GB    15 (×4)      8e−4
Nvidia GTX1080Ti   11 GB   30 (×2)      5e−4
Nvidia V100        32 GB   60 (×1)      2e−4

A batch size of 60 was empirically determined to provide good training results. Batch accumulation was implemented to allow batches of 60 to fit into the available RAM of the GPUs: results from sub-batches were summed to give the result on the complete batch. Learning rates for each GPU depend on the batch size and the number of sub-batches needed. To accommodate this, a learning rate finding algorithm was used to find a good learning rate. The learning rate finder [35] is a variation on the algorithm presented by [33]. It increases the learning rate per batch from a predefined minimum, lr_min, to a predefined maximum, lr_max, over n batches, with the i-th batch having learning rate

$$lr_i = lr_{\min}\left(\frac{lr_{\max}}{lr_{\min}}\right)^{i/n}. \tag{4}$$

Figure 6 shows a plot of loss against learning rate, and the learning rate, 2.6158 × 10⁻³, that resulted in the lowest loss. It is suggested [33,35] that a good learning rate is an order of magnitude lower than the learning rate that resulted in the lowest loss, as this is where the loss is still decreasing. However, in practice, the loss may not be decreasing at the chosen learning rate; in this case a learning rate from the negative loss slope just before the lowest loss should be chosen.
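Eq. (4) corresponds to a simple exponential sweep of the learning rate, for example:

```python
def lr_finder_schedule(lr_min, lr_max, n_batches):
    """Eq. (4): exponentially increase the learning rate from lr_min to
    lr_max over n_batches, one value per batch; the loss recorded at each
    step would then be plotted against these rates, as in Fig. 6."""
    ratio = lr_max / lr_min
    return [lr_min * ratio ** (i / n_batches) for i in range(n_batches + 1)]

# For example, lr_finder_schedule(1e-7, 1e-1, 100) sweeps six decades in 100 steps.
```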


Fig. 6. Results of the learning rate finder algorithm on the Nvidia GTX1070.

Network weights were optimised using the Ranger algorithm [37]. The lookahead mechanism in Ranger was set to have a synchronisation period of 6 and a step size for the slow weights of 0.5. Focal loss [19] was used for the loss function. When training the mono and linked mono networks only the left image from each stereo pair was used.
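Since Ranger combines RAdam with Lookahead, the setup described above could be reproduced in TensorFlow roughly as follows; the use of TensorFlow Addons, and of its focal loss class, is an assumption, as the paper does not state which implementations were used.

```python
import tensorflow as tf
import tensorflow_addons as tfa

# Ranger = RAdam wrapped in Lookahead; sync_period and slow_step_size follow the text.
radam = tfa.optimizers.RectifiedAdam(learning_rate=2e-4)
ranger = tfa.optimizers.Lookahead(radam, sync_period=6, slow_step_size=0.5)

# A focal loss variant; the exact formulation used in the paper may differ.
focal_loss = tfa.losses.SigmoidFocalCrossEntropy()

model = tf.keras.Sequential([tf.keras.layers.Dense(6, activation="softmax")])
model.compile(optimizer=ranger, loss=focal_loss)
```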

9 Results

Table 6 gives an overview of the training results. The Matthews correlation coefficient (MCC) [20] is used as the metric for comparison as it takes into account the sizes of the classes as well as all four confusion matrix categories. Furthermore, unlike most confusion matrix metrics, the MCC has been generalised to the multi-class case, allowing a single metric to be calculated that gives an overall view of how well the trained network has performed across all classes. Valid values for the MCC lie in the range [−1, 1], with a value of 1 corresponding to perfect predictions, a value of −1 corresponding to total disagreement, and a value of 0 indicating that the classifier is indistinguishable from random guessing. As can be seen in Table 6, the Stereo Visual Mesh has a higher MCC than the mono Visual Mesh, indicating that the Stereo Visual Mesh has learnt to classify elements in the dataset better than the mono Visual Mesh. Furthermore, the Stereo Visual Mesh also has a higher MCC than the Linked Mono Visual Mesh, indicating that the increase in classifier performance is the result of more than the increased number of parameters available for learning. To ascertain the robustness of the Stereo Visual Mesh MCC result, the test set was shuffled and divided into 10 chunks of 2913 elements, and the Stereo Visual Mesh was tested on each of these chunks. The resulting MCC from each chunk was recorded.
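For reference, the multi-class MCC used here can be computed directly with scikit-learn, which implements the multi-class generalisation; the labels below are dummy values.

```python
import numpy as np
from sklearn.metrics import matthews_corrcoef

y_true = np.array([0, 1, 2, 2, 1, 0])   # dummy ground-truth class indices
y_pred = np.array([0, 1, 2, 1, 1, 0])   # dummy predictions
# 1 = perfect predictions, 0 = indistinguishable from random, -1 = total disagreement
print(matthews_corrcoef(y_true, y_pred))
```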


Table 6. Summary of Visual Mesh training results. Training and validation loss are the minimum losses achieved during training and validation, respectively. MCC is the multi-class Matthews correlation coefficient obtained from the multi-class confusion matrix for each network.

Network       Train Loss   Valid Loss   MCC
Mono          0.1001       0.1653       0.9919
Linked Mono   0.0667       0.1267       0.9925
Stereo        0.0877       0.1045       0.9930

The mean MCC across the chunks was 0.992964 (SD = 0.0001276), showing that random selection of test data had minimal impact on the classification outcome.

Table 7. Difference between true positive, true negative, false positive, and false negative rates for the mono and Stereo Visual Meshes. Calculated as X_stereo − X_mono, where X is one of TPR, TNR, FPR, or FNR. All values are relative to 1 × 10⁻⁵.

Class     TPR      TNR      FPR      FNR
Ball      −5.15    3.61     −3.60    5.15
Goal      187.04   20.43    −20.50   −187.04
Line      0.32     12.97    −12.98   −0.32
Field     26.62    −40.31   40.31    −26.62
Robot     114.89   34.51    −34.50   −114.89
Environ   44.12    42.31    −42.30   −44.12

With respect to the mono Visual Mesh, the Stereo Visual Mesh shows an improvement in true positive, true negative, false positive, and false negative rates across all classes apart from the ball and field classes (Table 7). The ball class experiences a slight decrease in true positive rate and a corresponding increase in false negative rate; the field class shows a similar degradation in its true negative and false positive rates. This is due to a Visual Mesh node that should be classified as ball being linked, via the stereo link, to a Visual Mesh node that should be classified as field. This can be seen in Fig. 3, where there is an 8.8% chance of this type of link occurring. Although Fig. 3 shows high chances of confusing links occurring between the robot class and all other classes, and with the goal and environment classes, the network has learnt to adequately deal with this potential source of confusion, likely because the size of the objects in these classes helps the network to stay on track. Overall, however, the Stereo Visual Mesh outperforms the mono Visual Mesh.

One of the most remarkable properties of the Visual Mesh is its extremely short inference time. In a benchmarking experiment we recorded inference times of approximately 610 fps for the mono Visual Mesh on an Nvidia GTX1080Ti and approximately 275 fps for the Stereo Visual Mesh. This corresponds to an approximately 2.2-fold increase in inference time over the mono Visual Mesh, which can be explained by the more than 2-fold increase in the amount of data that needs to be processed on the GPU.

Finally, a ResNet-18 [11] variant of StreoScenNet [23] was trained on the dataset in Sect. 6. StreoScenNet achieved an MCC of 0.9990 and an inference speed of 32.49 fps. This increase in classification performance, and decrease in inference speed, compared to the Stereo Visual Mesh is due to the roughly 3500-fold increase in parameter count, with StreoScenNet having 56,205,462 parameters compared to the Stereo Visual Mesh's 16,286.

10 Limitations

The Visual Mesh requires knowledge of the position and orientation of the camera with respect to an observation plane. Training is typically performed with data augmentation applied to the camera's position and orientation to allow for some inaccuracies in the measurement of these quantities; however, this is not foolproof. The Visual Mesh also requires camera calibration to be performed, which means that any dataset used for training must have camera intrinsics and extrinsics available. The position of the camera relative to some origin is not needed; only the height above, and orientation with respect to, an observation plane is required. The observation plane might be the road surface in a driving scenario, the ground in a soccer scenario, or the surface of the ocean in a marine navigation task. The algorithm for finding the visually closest Visual Mesh node is dependent upon the translation between the two cameras. If a custom stereo camera setup is being employed and there is the potential for the two cameras to move relative to each other, this will introduce a source of error in the classifications.

11 Conclusion

In this paper we have presented a new method for sparse stereo semantic segmentation that simultaneously generates a sparse SSM for both the left and right views. Although the proposed method is approximately two times slower than the corresponding mono method, 275 fps is still an exceptionally fast inference speed for processing high resolution stereo images. Furthermore, the proposed method provides an increase in classification accuracy over its mono counterpart. Further work based on the Stereo Visual Mesh could investigate two directions of study. First, the feasibility of the Visual Mesh input transformation for stereo disparity estimation, both independently of and in conjunction with semantic segmentation, could be investigated to achieve sparse disparity maps at similar inference speeds. Second, the Stereo Visual Mesh could be adapted to a Multi-View Visual Mesh, where there are potentially more than two cameras and the cameras are not necessarily on the same baseline. The WoodScape dataset [27,39] could be a good dataset for this branch of work.


References 1. Abadi, M., et al.: TensorFlow: Large-Scale Machine Learning on Heterogeneous Distributed Systems, November 2021. https://www.tensorflow.org/ 2. Bosch, M., Foster, K., Christie, G., Wang, S., Hager, G.D., Brown, M.: Semantic stereo for incidental satellite images. In: 2019 IEEE Winter Conference on Applications of Computer Vision (WACV), pp. 1524–1532, January 2019. https://doi. org/10.1109/WACV.2019.00167 3. Chang, J.R., Chen, Y.S.: Pyramid stereo matching network. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 5410–5418 (2018). http://openaccess.thecvf.com/content_cvpr_2018/html/ Chang_Pyramid_Stereo_Matching_CVPR_2018_paper.html 4. Chen, H., et al.: Multi-level fusion of the multi-receptive fields contextual networks and disparity network for pairwise semantic stereo. In: IGARSS 2019–2019 IEEE International Geoscience and Remote Sensing Symposium, pp. 4967–4970, July 2019. https://doi.org/10.1109/IGARSS.2019.8899306 5. Chen, X., Liu, Y., Achuthan, K.: WODIS: water obstacle detection network based on image segmentation for autonomous surface vehicles in maritime environments. IEEE Trans. Instrum. Meas. 70, 1–13 (2021). https://doi.org/10.1109/TIM.2021. 3092070 6. Cordts, M., et al.: The cityscapes dataset for semantic urban scene understanding. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), June 2016 7. van Dijk, S.G., Scheunemann, M.M.: Deep learning for semantic segmentation on minimal hardware. In: Holz, D., Genter, K., Saad, M., von Stryk, O. (eds.) RoboCup 2018. LNCS (LNAI), vol. 11374, pp. 349–361. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-27544-0_29 8. Durner, M., Boerdijk, W., Sundermeyer, M., Friedl, W., Marton, Z.C., Triebel, R.: Unknown Object Segmentation from Stereo Images. arXiv:2103.06796 [cs], March 2021. http://arxiv.org/abs/2103.06796 9. Fan, R., Wang, H., Cai, P., Liu, M.: SNE-RoadSeg: incorporating surface normal information into semantic segmentation for accurate freespace detection. In: Vedaldi, A., Bischof, H., Brox, T., Frahm, J.-M. (eds.) ECCV 2020. LNCS, vol. 12375, pp. 340–356. Springer, Cham (2020). https://doi.org/10.1007/978-3-03058577-8_21 10. Goodfellow, I., Bengio, Y., Courville, A.: Deep Learning. MIT Press, Cambridge (2016) 11. He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 770–778 (2016). https://openaccess.thecvf.com/content_cvpr_ 2016/html/He_Deep_Residual_Learning_CVPR_2016_paper.html 12. Houliston, T., Chalup, S.K.: Visual mesh: real-time object detection using constant sample density. In: Holz, D., Genter, K., Saad, M., von Stryk, O. (eds.) RoboCup 2018. LNCS (LNAI), vol. 11374, pp. 45–56. Springer, Cham (2019). https://doi. org/10.1007/978-3-030-27544-0_4 13. Houliston, T.J.: Software architecture and computer vision for resource constrained robotics. Ph.D. thesis, University of Newcastle (2018). http://hdl.handle.net/1959. 13/1389336 14. Howard, A.G., et al.: MobileNets: efficient convolutional neural networks for mobile vision applications. arXiv: 1704.04861 [cs], April 2017. http://arxiv.org/abs/1704. 04861


15. Huang, G., Liu, Z., v. d. Maaten, L., Weinberger, K.Q.: Densely connected convolutional networks. In: 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 2261–2269, July 2017. https://doi.org/10.1109/CVPR. 2017.243 16. Klambauer, G., Unterthiner, T., Mayr, A., Hochreiter, S.: Self-normalizing neural networks. In: Advances in Neural Information Processing Systems, vol. 30 (2017). https://papers.nips.cc/paper/2017/hash/5d44ee6f2c3f71b73125876103c8f6c4Abstract.html 17. Königshof, H., Salscheider, N.O., Stiller, C.: Realtime 3D object detection for automated driving using stereo vision and semantic information. In: 2019 IEEE Intelligent Transportation Systems Conference (ITSC), pp. 1405–1410, October 2019. https://doi.org/10.1109/ITSC.2019.8917330 18. LeCun, Y.A., Bottou, L., Orr, G.B., Müller, K.-R.: Efficient BackProp. In: Montavon, G., Orr, G.B., Müller, K.-R. (eds.) Neural Networks: Tricks of the Trade. LNCS, vol. 7700, pp. 9–48. Springer, Heidelberg (2012). https://doi.org/10.1007/ 978-3-642-35289-8_3 19. Lin, T.Y., Goyal, P., Girshick, R., He, K., Dollár, P.: Focal loss for dense object detection. In: 2017 IEEE International Conference on Computer Vision (ICCV), pp. 2999–3007 (2017). https://doi.org/10.1109/ICCV.2017.324 20. Matthews, B.W.: Comparison of the predicted and observed secondary structure of T4 phage lysozyme. Biochimica et Biophysica Acta (BBA) - Protein Struct. 405(2), 442–451 (1975). https://doi.org/10.1016/0005-2795(75)90109-9 TM : Professional mobile robot simulation. 21. Michel, O.: Cyberbotics Ltd., Webots Int. J. Adv. Robot. Syst. 1(1), 5 (2004). https://doi.org/10.5772/5618 22. Miclea, V.C., Nedevschi, S.: Real-time semantic segmentation-based stereo reconstruction. IEEE Trans. Intell. Transp. Syst. 21(4), 1514–1524 (2020). https://doi. org/10.1109/TITS.2019.2913883 23. Mohammed, A., Yildirim, S., Farup, I., Pedersen, M., Hovde, Ø.: StreoScenNet: surgical stereo robotic scene segmentation. In: Medical Imaging 2019: ImageGuided Procedures, Robotic Interventions, and Modeling, vol. 10951, pp. 174–182, March 2019. https://doi.org/10.1117/12.2512518 24. Peng, H., et al.: An adaptive coarse-fine semantic segmentation method for the attachment recognition on marine current turbines. Comput. Electr. Eng. 93, 107182 (2021). https://doi.org/10.1016/j.compeleceng.2021.107182. https://www. sciencedirect.com/science/article/pii/S004579062100183X 25. Peng, J., Shen, J., Li, X.: High-order energies for stereo segmentation. IEEE Trans. Cybernet. 46(7), 1616–1627 (2016). https://doi.org/10.1109/TCYB.2015.2453091 26. Qin, R., Huang, X., Liu, W., Xiao, C.: Pairwise stereo image disparity and semantics estimation with the combination of U-Net and pyramid stereo matching network. In: IGARSS 2019–2019 IEEE International Geoscience and Remote Sensing Symposium, pp. 4971–4974, July 2019. https://doi.org/10.1109/IGARSS.2019. 8900262 27. Ramachandran, S., Sistu, G., McDonald, J.B., Yogamani, S.K.: Woodscape fisheye semantic segmentation for autonomous driving - CVPR 2021 OmniCV workshop challenge. CoRR abs/2107.08246 (2021). https://arxiv.org/abs/2107.08246 28. Redmon, J., Divvala, S., Girshick, R., Farhadi, A.: You only look once: unified, real-time object detection. arXiv: 1506.02640 [cs], May 2016. http://arxiv.org/abs/ 1506.02640 29. Redmon, J., Farhadi, A.: YOLO9000: better, faster, stronger. arXiv: 1612.08242 [cs], December 2016. http://arxiv.org/abs/1612.08242


30. Redmon, J., Farhadi, A.: YOLOv3: an incremental improvement. arXiv: 1804.02767 [cs], April 2018. http://arxiv.org/abs/1804.02767 31. Ronneberger, O., Fischer, P., Brox, T.: U-Net: convolutional networks for biomedical image segmentation. In: Navab, N., Hornegger, J., Wells, W.M., Frangi, A.F. (eds.) MICCAI 2015. LNCS, vol. 9351, pp. 234–241. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-24574-4_28 32. Ros, G., Sellart, L., Materzynska, J., Vazquez, D., Lopez, A.M.: The synthia dataset: a large collection of synthetic images for semantic segmentation of urban scenes. In: The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), June 2016 33. Smith, L.N.: Cyclical learning rates for training neural networks. In: 2017 IEEE Winter Conference on Applications of Computer Vision (WACV), pp. 464–472, March 2017. https://doi.org/10.1109/WACV.2017.58 34. Szemenyei, M., Estivill-Castro, V.: Real-time scene understanding using deep neural networks for RoboCup SPL. In: Holz, D., Genter, K., Saad, M., von Stryk, O. (eds.) RoboCup 2018. LNCS (LNAI), vol. 11374, pp. 96–108. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-27544-0_8 35. Tanksale, N.: Finding Good Learning Rate and The One Cycle Policy, May 2019. https://towardsdatascience.com/finding-good-learning-rate-and-theone-cycle-policy-7159fe1db5d6 36. Tasli, H.E., Alatan, A.A.: User assisted stereo image segmentation. In: 2012 3DTV-Conference: The True Vision - Capture, Transmission and Display of 3D Video (3DTV-CON), pp. 1–4, October 2012. https://doi.org/10.1109/3DTV.2012. 6365447 37. Wright, L.: Ranger - a synergistic optimizer (2019). https://github.com/lessw2020/ Ranger-Deep-Learning-Optimizer 38. Wu, Z., Wu, X., Zhang, X., Wang, S., Ju, L.: Semantic stereo matching with pyramid cost volumes. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 7484–7493 (2019). https://openaccess.thecvf.com/ content_ICCV_2019/html/Wu_Semantic_Stereo_Matching_With_Pyramid_ Cost_Volumes_ICCV_2019_paper.html 39. Yogamani, S., et al.: WoodScape: a multi-task, multi-camera fisheye dataset for autonomous driving. In: Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), October 2019 40. Zhou, L., Zhang, H.: 3SP-Net: semantic segmentation network with stereo image pairs for urban scene parsing. In: Geng, X., Kang, B.-H. (eds.) PRICAI 2018. LNCS (LNAI), vol. 11012, pp. 503–517. Springer, Cham (2018). https://doi.org/10.1007/ 978-3-319-97304-3_39

Micro-expression Recognition Based on PCB-PCANet+

Shiqi Wang, Fei Long(B), and Junfeng Yao

Center for Digital Media Computing, School of Film, Xiamen University, Xiamen 361005, China
[email protected]

Abstract. Micro-expressions (MEs) have the characteristics of small motion amplitude and short duration, and how to learn discriminative ME features is a key issue in ME recognition. Motivated by the success of the PCB model in person retrieval, this paper proposes a ME recognition method called PCB-PCANet+. Considering that the important information of MEs is mainly concentrated in a few key facial areas such as the eyebrows and eyes, we use multiple branch LSTM networks on top of the output of a shallow PCANet+ to separately learn local spatio-temporal features for each facial ROI region. In addition, in the multiple branch fusion stage, we design a feature weighting strategy according to the significance of different facial regions to further improve ME recognition performance. The experimental results on the SMIC and CASME II datasets validate the effectiveness of the proposed method.

Keywords: Micro-expression recognition · PCANet+ · PCB

1 Introduction

Micro-expressions (MEs) can reflect people's true emotions due to their spontaneity and irrepressibility, so ME recognition has a wide range of applications in fields such as psychotherapy, criminal interrogation and business negotiation. In recent years, the study of automatic ME spotting and recognition has attracted increasing attention in the field of computer vision.

Early ME recognition methods used manually designed descriptors to encode the features of ME image sequences, and many of them were based on Local Binary Patterns (LBP). In 2011, Pfister et al. [1] utilized LBP-TOP [2] to extract dynamic features of MEs on the Spontaneous Micro-expression Database (SMIC) [3] and proposed a benchmark framework for automatic ME recognition. In order to solve the problem of duplicate encoding in LBP-TOP, Wang et al. [4] proposed LBP-SIP, which reduces computational complexity and redundant features by removing six intersection points of duplicate encoding. Zong et al. [5] extended the granularity of the LBP operator through layered STLBP-IP features and used sparse learning to reduce the feature dimension.

Due to the promising performances that deep learning methods have achieved in macro-expression recognition, researchers have tried to apply deep learning to ME recognition tasks in recent years.


In the ME recognition framework proposed by Verma et al. [6], the dynamic image synthesis method [7] is first used to encode a ME image sequence into a single image instance, and then a 2D LEARNet model is used to learn the specific features of MEs. Thuseethan et al. [8] proposed a ME recognition framework called Deep3DCANN, in which a 3D CNN is first used to learn spatio-temporal features from facial image sequences; at the same time, a deep network tracks the visual associations between different sub-regions of the faces, and the learned facial features are combined with the semantic relationships between regions to recognize MEs. Xu et al. [9] proposed a feature disentanglement learning method based on asymmetric adversarial properties, which uses a residual network to learn the emotional and domain features of MEs from two branches and exhibits excellent performance in cross-dataset ME recognition.

Deep learning methods rely heavily on a large amount of available training data; however, the sizes of existing ME datasets are almost all small, which limits the performance of deep learning methods in ME recognition. By applying transfer learning to ME recognition, the impact of insufficient training data on performance can be reduced to some extent. Mayya et al. [10] used a CNN model pre-trained on ImageNet [11] to extract features, and then used a Support Vector Machine (SVM) for ME classification. Xia et al. [12] proposed a knowledge transfer framework from macro-expressions to MEs, in which MiNet and MaNet were trained using ME datasets and macro-expression datasets, respectively; MaNet was then used to guide the fine-tuning of MiNet in the spatio-temporal domain, enabling the model to better complete ME recognition tasks.

The Part based Convolutional Baseline (PCB) model [13] was initially proposed for the task of person retrieval. PCB provides more fine-grained information for pedestrian image description by learning part-informed features, and can therefore effectively boost the performance of person retrieval. Motivated by the idea of the PCB model, in this paper we propose a ME recognition algorithm called PCB-PCANet+, which uses PCANet+ as the backbone network for feature extraction, divides the output feature tensor into multiple parts based on facial ROI region segmentation, uses a separate LSTM network for each part to learn local spatio-temporal features, and finally fuses the features of all parts for ME recognition. The experimental results demonstrate the effectiveness of the algorithm.

The rest of this paper is organized as follows. In Sect. 2, we provide a detailed introduction to our PCB-PCANet+ model. Section 3 presents the experimental results and discussions, and the conclusion is given in Sect. 4.

2 Proposed Method

In this section, we provide a detailed description of the proposed ME recognition method. PCB takes existing CNNs as the backbone networks, divides the feature tensor output into several parts after convolution layer and pooling layer, and uses each part to train a separate classifier, so as to achieve more effective feature learning. The framework of our proposed PCB-PCANet+ model for ME recognition is shown in Fig. 1.


Fig. 1. Framework of proposed PCB-PCANet+ model for ME recognition

2.1 PCANet+ Feature Extraction

For a ME video clip, data preprocessing is required before feature extraction. First, 68 facial landmarks are detected using the Active Shape Model (ASM), and the face area is aligned and cropped to remove the background in the image and to reduce the influence of head offset on ME recognition. The TIM algorithm is then used to normalize the frame number of the ME video clip. We perform optical flow calculation on the preprocessed video clip to obtain the horizontal and vertical optical flow sequences U, V ∈ R^{N×M×(L−1)}, where N and M are the height and width of the image, and L is the number of frames. We use a sliding window (length T, step size s) to sample U and V simultaneously and stack these sampled horizontal and vertical optical flow components to obtain a multi-channel image set I = {I_1, I_2, ..., I_K} with 2T channels, which is input into PCANet+ for feature extraction.

In our algorithm, we use a two-layer PCANet+ network, where the number of filters in the first layer is F_1 with filter size K_1 × K_1, and the number of filters in the second layer is F_2 with filter size K_2 × K_2. For a multi-channel image I_i, after passing through the two-layer PCANet+, a tensor Q consisting of F_2 feature maps {O_1, O_2, ..., O_{F_2}} is obtained. In the original PCANet+, histogram features are calculated with hash coding for each feature map in Q. In order to further model the temporal information of MEs on the basis of PCANet+ features, this paper directly uses the feature maps output from the second layer of the PCANet+ network as the input for the subsequent LSTM networks.
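The sliding-window stacking of the optical flow components can be sketched as follows; this is our reading of the text, with illustrative variable names.

```python
import numpy as np

def stack_flow_windows(U, V, T, s):
    """Stack horizontal (U) and vertical (V) optical flow of shape
    (N, M, L-1) into a list of multi-channel images with 2*T channels,
    using a sliding window of length T and step s."""
    n_frames = U.shape[2]
    images = []
    for start in range(0, n_frames - T + 1, s):
        window = np.concatenate([U[:, :, start:start + T],
                                 V[:, :, start:start + T]], axis=2)
        images.append(window)          # shape (N, M, 2T)
    return images
```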

2.2 Multi Branch LSTMs Based on Facial ROIs

According to the learning process in PCANet+, the filters in the second layer correspond to the eigenvectors computed in that layer, and generally the larger the eigenvalue corresponding to a filter, the more important the information contained in the produced feature map. Therefore, based on the eigenvalues of the filters, a weighted average is applied to the feature maps, as shown in Formula 1, to obtain a two-dimensional feature map O_i. This processing makes the input to the subsequent LSTM more compact.

$$O_i = \sum_{j=1}^{F_2} \frac{\exp(\lambda_j)}{\sum_{k=1}^{F_2} \exp(\lambda_k)}\, O_j \tag{1}$$

where Oj represents the feature map output by the j-th filter in the second PCANet+ layer, and λj represents the corresponding eigenvalue of the j-th filter. The original PCB model used for person retrieval uniformly divided feature tensors into multiple blocks, and then trained the corresponding classifier based on each block. In this paper, for the task of ME recognition, we divide the feature map based on facial ROI regions. The illustration of facial ROI region segmentation is shown in Fig. 2.
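Formula 1 amounts to a softmax-weighted sum of the second-layer feature maps; a minimal NumPy sketch:

```python
import numpy as np

def weighted_feature_map(O, eigenvalues):
    """Formula (1): softmax-weight the F2 second-layer feature maps by their
    filters' eigenvalues and sum them into a single 2-D map.

    O has shape (F2, H, W); eigenvalues has shape (F2,)."""
    lam = np.asarray(eigenvalues, dtype=float)
    w = np.exp(lam) / np.exp(lam).sum()              # softmax over eigenvalues
    return np.tensordot(w, np.asarray(O), axes=1)    # result has shape (H, W)
```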

Fig. 2. Illustration of facial ROI region segmentation

Then, the features of each region are transformed into a feature vector g_i, which serves as the input for the subsequent part-based classifiers. In order to better incorporate the PCB model into the ME recognition task, the local classifier we designed consists of a two-layer LSTM, a fully connected layer, and a Softmax function.

2.3 Multi-branch Fusion and Classification

To further enhance ME recognition performance, we use a feature weighting strategy to take into account the significance of different ROI regions. The weight of each branch is calculated according to either the accuracy of its classifier or its prediction score. The two methods of feature weighting are given in Formula 2 and Formula 3, respectively.

$$w_i = \frac{\exp\!\left(\mathrm{accuracy}(i)\right)}{\sum_{j=1}^{p} \exp\!\left(\mathrm{accuracy}(j)\right)} \tag{2}$$

$$w_i' = \sum_{j=1}^{n} \frac{\exp\!\left(\mathrm{score}_j^i(y_j)\right)}{\sum_{k=1}^{c} \exp\!\left(\mathrm{score}_j^i(k)\right)} \tag{3}$$

where w_i and w_i' represent the weights calculated based on accuracy and on prediction score for the i-th branch, p represents the number of branches, and accuracy(i) represents the accuracy of the i-th branch on the validation set. n represents the number of samples in the validation set, c represents the number of ME classes in the dataset, score_j^i(k) represents the score of the k-th class predicted by the i-th classifier for the j-th sample, and y_j represents the real class corresponding to the j-th sample. Finally, the features learned from each branch are weighted and concatenated, and then input to a global classifier composed of two fully connected layers and a Softmax function to complete ME classification. In this way the model can ensemble part-informed features to improve ME recognition performance.
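A small sketch of the two weighting schemes in Formulas 2 and 3; array names are illustrative.

```python
import numpy as np

def branch_weights_from_accuracy(accuracies):
    """Formula (2): softmax of the per-branch validation accuracies."""
    a = np.asarray(accuracies, dtype=float)
    return np.exp(a) / np.exp(a).sum()

def branch_weight_from_scores(scores, labels):
    """Formula (3) for one branch: sum over validation samples of the softmax
    probability that the branch assigns to the true class.

    `scores` has shape (n_samples, n_classes); `labels` holds the true class
    indices for each validation sample."""
    probs = np.exp(scores) / np.exp(scores).sum(axis=1, keepdims=True)
    return probs[np.arange(len(labels)), labels].sum()
```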

3 Experimental Results and Analysis

To evaluate the proposed ME recognition method and also to analyze the impact of model settings and model parameters on performance, we conducted extensive experiments on the SMIC and CASME II datasets.

3.1 Dataset

SMIC. The SMIC [3] dataset is the first spontaneous ME dataset and contains three subsets, namely SMIC-HS, SMIC-NIR, and SMIC-VIS. Among them, the SMIC-NIR and SMIC-VIS subsets were captured by cameras at 25 fps. Due to the rapid and brief facial movements of MEs, cameras with low frame rates cannot capture the dynamic information of MEs. Therefore, SMIC-HS is used to evaluate our approach; it was captured by a high-speed camera with a frame rate of 100 fps and includes 164 ME samples from 16 volunteers. The samples are divided into three ME categories: Positive, Negative and Surprise.

CASME II. The samples in the CASME II [14] dataset were captured by a high-speed camera with a frame rate of 200 fps, involving the MEs of 26 volunteers in 255 video clips. The CASME II dataset contains seven ME categories: Happiness, Surprise, Sadness, Fear, Disgust, Repression, and Others. However, since there are only nine samples of sadness and fear in the dataset, which is not conducive to network training, only the 246 samples of the other five categories were used in our experiments.

3.2 Performance Metric

In ME recognition experiments, we use Accuracy, Macro-F1, and Macro-recall as performance metrics.

$$\mathrm{Accuracy} = \frac{\sum_{i=1}^{C} TP_i}{\sum_{i=1}^{C} TP_i + \sum_{i=1}^{C} FP_i} \tag{4}$$

$$P_i = \frac{TP_i}{TP_i + FP_i} \tag{5}$$

$$R_i = \frac{TP_i}{TP_i + FN_i} \tag{6}$$

$$\mathrm{Macro\text{-}F1} = \frac{1}{C} \sum_{i=1}^{C} \frac{2 \times P_i \times R_i}{P_i + R_i} \tag{7}$$

$$\mathrm{Macro\text{-}recall} = \frac{1}{C} \sum_{i=1}^{C} \frac{TP_i}{TP_i + FN_i} \tag{8}$$

where C represents the number of ME classes, and TP_i, FP_i and FN_i represent the number of true positive, false positive and false negative samples of class i, respectively.
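These metrics can be computed from a confusion matrix, for example:

```python
import numpy as np

def macro_metrics(cm):
    """Compute Accuracy, Macro-F1 and Macro-recall (Eqs. 4-8) from a C x C
    confusion matrix `cm` whose rows are true classes and columns predictions."""
    tp = np.diag(cm).astype(float)
    fp = cm.sum(axis=0) - tp
    fn = cm.sum(axis=1) - tp
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    accuracy = tp.sum() / cm.sum()
    macro_f1 = np.mean(2 * precision * recall / (precision + recall))
    macro_recall = np.mean(recall)
    return accuracy, macro_f1, macro_recall
```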

3.3 Different Methods of Feature Tensor Segmentation

In the proposed ME recognition method, the output of PCANet+ is divided into several parts by key facial regions. To verify the validity of this segmentation method, in this section we compare it with the other two methods of segmentation. The first method, as shown in Fig. 3, evenly divides the feature tensor into four parts along the horizontal direction, which is consistent with the method proposed in [13].

Fig. 3. Horizontal segmentation of tensors

The second method of segmentation is shown in Fig. 4: the feature tensor is divided into four parts along both the horizontal and vertical directions. As shown in Fig. 5, our method divides the feature tensor into several parts based on ROI regions such as the eyebrows, eyes, nasal wings and mouth. This section compares the three segmentation methods on the SMIC and CASME II datasets, and the experimental results are shown in Table 1 and Table 2. From the experimental results, it can be seen that the recognition performance of the feature segmentation method based on facial ROI regions is superior to that of the other two simple segmentation methods, which indicates that the motion information contained in key facial regions is more important to ME recognition.


Fig. 4. Horizontal and vertical segmentation of tensor

Fig. 5. Tensor segmentation based on facial key regions

Table 1. Comparison of different tensor segmentation methods on SMIC

Tensor segmentation methods   Accuracy   Macro-F1   Macro-recall
Horizontal                    0.6524     0.6535     0.6557
Horizontal+Vertical           0.6463     0.6479     0.6505
Key facial regions            0.6768     0.6797     0.6875

Table 2. Comparison of different tensor segmentation methods on CASME II

Tensor segmentation methods   Accuracy   Macro-F1   Macro-recall
Horizontal                    0.5528     0.5563     0.5420
Horizontal+Vertical           0.5447     0.5342     0.5103
Key facial regions            0.5691     0.5754     0.5636

3.4 The Influence of Feature Weight

In order to verify the effectiveness of feature weighting for multiple branches, this section compares the recognition performance of models with and without this feature weighting process. The experimental results are shown in Fig. 6. It can be observed from Fig. 6 that the recognition accuracies without feature weighting are inferior to those with feature weighting, which indicates that the feature weighting for multiple branches can boost performance by taking into account the contributions of different facial regions to ME recognition. At the same time, we observe that the weights calculated according to accuracy work slightly better than the weights calculated based on prediction score.


Fig. 6. Influence of feature weighting on model recognition accuracy

3.5 Model Parameter Optimization

In this section, we carry out experiments on the SMIC dataset to analyze the impact of model parameters on recognition performance, such as the size of the feature tensor and the number of partitions produced by tensor segmentation.

The Size of Feature Tensor. In the experiments, the spatial size of the input video was normalized to 139 × 170. The spatial size of the feature tensor can be changed by using different pooling strides in the second layer of PCANet+. Table 3 shows the relationship between the pooling stride and the spatial size of the feature tensor. Figure 7 shows the model performance with different spatial sizes of the feature tensor, and it can be seen that the best recognition results are achieved when the size is 46 × 56.

Table 3. Relationships between pooling stride and the spatial size of feature tensor

Pooling stride   Spatial size of feature tensor
1                139 × 170
2                69 × 85
3                46 × 56
4                34 × 42


Fig. 7. Model performances with different tensor sizes

Number of Tensor Partitions. The number of tensor partitions can be changed through different combinations of key facial regions, and their corresponding relationships are shown in Table 4. In the left column, the numbers 1, 2, 3, and 4 represent the facial regions of eyebrows, eyes, nasal wings and mouth, respectively. [1, 2] represents the combination of the eyebrow and eye regions, and [3, 4] represents the combination of the nasal wing and mouth regions.

Table 4. Relationships between the different combinations of facial regions and the number of tensor partitions

Ways of facial region combinations   Number of tensor partitions
[1, 2][3, 4]                         2
[1, 2][3][4]                         3
[1][2][3][4]                         4

Figure 8 shows the model performance with different numbers of tensor partitions. It can be seen that model performance increases with finer granularity of facial region segmentation, which indicates that the fine-grained features learned from local facial regions are beneficial to ME recognition.

3.6 Comparison with Other Methods

In order to verify the effectiveness of our proposed method, we compare it with some existing methods, including both hand-crafted features and deep learning methods. In the comparison, our method uses the optimal parameter configuration obtained from the above experiments. In addition, for a fair comparison, we re-implement LBP-TOP and STLBP-IP based on the same data pre-processing and evaluation indicators, with their parameters consistent with the optimal settings in the original papers [14,15]. For deep learning based algorithms, we directly use the results from the original papers. Table 5 and Table 6 respectively provide the comparison results of Accuracy, Macro-F1 and Macro-recall for different methods on SMIC and CASME II, where N/A indicates that the corresponding performance metric was not provided in the original paper.


Fig. 8. Model performances with different number of tensor partitions

Table 5. Comparison of different methods on SMIC

Method               Accuracy   Macro-F1   Macro-recall
LBP-TOP [14]         0.4207     0.4266     0.4429
STLBP-IP [15]        0.4329     0.4270     0.4241
Selective [16]       0.5366     N/A        N/A
3D-FCNN [17]         0.5549     N/A        N/A
FR [18]              0.5790     N/A        N/A
OF-PCANet+ [19]      0.6280     0.6309     0.6369
Ours (PCB-PCANet+)   0.6463     0.6467     0.6514

Table 6. Comparison of different methods on CASME II

Method               Accuracy   Macro-F1   Macro-recall
LBP-TOP [14]         0.4390     0.4297     0.4259
STLBP-IP [15]        0.4173     0.4026     0.4282
Selective [16]       0.4575     N/A        N/A
ELRCN [20]           0.5244     0.5000     0.4396
3D-FCNN [17]         0.5911     N/A        N/A
OF-PCANet+ [19]      0.5325     0.5493     0.5241
Ours (PCB-PCANet+)   0.5569     0.5574     0.5543


From Table 5 and Table 6, it can be seen that deep learning based methods perform significantly better than traditional hand-crafted methods on SMIC and CASME II. Meanwhile, our method produces competitive results compared to other deep learning based algorithms. The experimental results indicate that the proposed PCB-PCANet+ method can effectively improve ME recognition performance by learning part-informed features from facial ROI regions.

4 Conclusion

In this paper we propose a ME recognition method called PCB-PCANet+. By dividing the output of PCANet+ into several parts based on facial ROI regions, we use multiple branch LSTM networks to learn part-informed spatio-temporal features from the different parts. To further enhance performance, we present a feature weighting strategy to fuse the features of different facial regions before ME classification. The experimental results on the SMIC and CASME II datasets indicate that PCB-PCANet+ can effectively improve ME recognition performance by ensembling part-level features of key facial regions.

References 1. Pfister, T., Li, X., Zhao, G., Pietikäinen, M.: Recognising spontaneous facial microexpressions. In: Proceedings of International Conference on Computer Vision, pp. 1449–1456 (2011) 2. Zhao, G., Pietikainen, M.: Dynamic texture recognition using local binary patterns with an application to facial expressions. IEEE Trans. Pattern Anal. Mach. Intell. 29(6), 915–928 (2007) 3. Li, X., Pfister, T., Huang, X., Zhao, G., Pietikäinen, M.: A spontaneous microexpression database: Inducement, collection and baseline. In: Proceedings of 10th IEEE International Conference and Workshops on Automatic Face and Gesture Recognition, pp. 1–6 (2013) 4. Wang, Y., See, J., Phan, R.C.-W., Oh, Y.-H.: LBP with six intersection points: reducing redundant information in LBP-TOP for micro-expression recognition. In: Cremers, D., Reid, I., Saito, H., Yang, M.-H. (eds.) ACCV 2014. LNCS, vol. 9003, pp. 525–537. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-168654_34 5. Zong, Y., Huang, X., Zheng, W., Cui, Z., Zhao, G.: Learning from hierarchical spatiotemporal descriptors for micro-expression recognition. IEEE Trans. Multimedia 20(11), 3160–3172 (2018) 6. Verma, M., Vipparthi, S.K., Singh, G., Murala, S.: Learnet: dynamic imaging network for micro expression recognition. IEEE Trans. Image Process. 29, 1618–1627 (2020)


7. Bilen, H., Fernando, B., Gavves, E., Vedaldi, A.: Action recognition with dynamic image networks. IEEE Trans. Pattern Anal. Mach. Intell. 40(12), 2799–2813 (2018) 8. Thuseethan, S., Rajasegarar, S., Yearwood, J.: Deep3DCANN: a deep 3DCNNANN framework for spontaneous micro-expression recognition. Inf. Sci. 630, 341– 355 (2023) 9. Xu, S., Zhou, Z., Shang, J.: Asymmetric adversarial-based feature disentanglement learning for cross-database micro-expression recognition. In: Proceedings of the 30th ACM International Conference on Multimedia, pp. 5342–5350 (2022) 10. Mayya, V., Pai, R.M., Manohara Pai, M.: Combining temporal interpolation and DCNN for faster recognition of micro-expressions in video sequences. In: 2016 International Conference on Advances in Computing, Communications and Informatics (ICACCI), pp. 699–703 (2016) 11. Krizhevsky, A., Sutskever, I., Hinton, G.E.: Imagenet classification with deep convolutional neural networks. Commun. ACM 60(6), 84–90 (2017) 12. Xia, B., Wang, S.: Micro-expression recognition enhanced by macro-expression from spatial-temporal domain. In: Proceedings of the 30th International Joint Conference on Artificial Intelligence (IJCAI), pp. 1186–1193 (2021) 13. Sun, Y., Zheng, L., Yang, Y., Tian, Q., Wang, S.: Beyond part models: Person retrieval with refined part pooling (and a strong convolutional baseline). In: Proceedings of the European conference on computer vision (ECCV), pp. 480–496 (2018) 14. Yan, W.J., et al.: CASME II: an improved spontaneous micro-expression database and the baseline evaluation. PLoS ONE 9(1), 1–8 (2014) 15. Huang, X., Wang, S., Zhao, G., Piteikainen, M.: Facial micro-expression recognition using spatiotemporal local binary pattern with integral projection. In: Proceedings of IEEE International Conference on Computer Vision Works, Los Alamitos, CA, USA, pp. 1–9 (2015) 16. Patel, D., Hong, X., Zhao, G.: Selective deep features for micro-expression recognition. In: 2016 23rd International Conference on Pattern Recognition (ICPR), pp. 2258–2263 (2016) 17. Li, J., Wang, Y., See, J., Liu, W.: Micro-expression recognition based on 3d flow convolutional neural network. Pattern Analysis and Applications 22(4), 1331–1339 (2019) 18. Zhou, L., Mao, Q., Huang, X., Zhang, F., Zhang, Z.: Feature refinement: an expression-specific feature learning and fusion method for micro-expression recognition. Pattern Recogn. 122, 108275 (2022) 19. Wang, S., Guan, S., Lin, H., Huang, J., Long, F., Yao, J.: Micro-expression recognition based on optical flow and pcanet+. Sensors 22(11), 4296 (2022) 20. Khor, H.Q., See, J., Phan, R.C.W., Lin, W.: Enriched long-term recurrent convolutional network for facial micro-expression recognition. In: 2018 13th IEEE International Conference on Automatic Face Gesture Recognition (FG 2018), pp. 667–674 (2018)

Exploring Adaptive Regression Loss and Feature Focusing in Industrial Scenarios

Mingle Zhou1,2, Zhanzhi Su1,2, Min Li1,2, Delong Han1,2, and Gang Li1,2(B)

1 Key Laboratory of Computing Power Network and Information Security, Ministry of Education, Shandong Computer Science Center (National Supercomputer Center in Jinan), Qilu University of Technology (Shandong Academy of Sciences), Jinan, China
[email protected]
2 Shandong Provincial Key Laboratory of Computer Networks, Shandong Fundamental Research Center for Computer Science, Jinan, China

Abstract. Industrial defect detection is designed to detect quality defects in industrial products. However, the surface defects of different industrial products vary greatly, for example in the variety of texture shapes and the complexity of background information. A lightweight Focus Encoder-Decoder Network (FEDNet) is presented to solve these problems. Specifically, the novelty of FEDNet is as follows. First, the feature focusing module (FFM) is designed to focus attention on defect features in complex backgrounds. Secondly, a lightweight texture extraction module (LTEM) is proposed to lightly extract the texture and relative location information of shallow-network defect features. Finally, AZIoU, an adaptive adjustment loss function, re-examines the prediction box in terms of its perimeter and length-width factors. Experiments on two industrial defect datasets show that FEDNet achieves an accuracy of 42.86% on Steel and 72.19% on DeepPCB using only 15.3 GFLOPs.

Keywords: Industrial Quality Detection · Feature Focusing · Adaptive Adjustment Loss · Lightweight Texture Extraction

1 Introduction

To meet the need for deep integration of information technology and industrialization, industrial surface defect detection has become one of the essential technologies for ensuring product quality and improving production efficiency. Recently, defect detection methods based on computer vision have been widely used in industrial fields, where product quality inspection is critical. Industrial defect detection methods are divided into traditional methods and deep learning-based methods. Traditional methods can be divided into three categories.


For example, a clustering algorithm based on image fusion [1], a method based on the Hough transformation [2], and a Fourier image reconstruction method [3]. Most of these methods depend on prior knowledge and handcrafted defect descriptions. Recently, due to the rise of Convolutional Neural Networks (CNNs), surface detection algorithms based on CNNs have developed rapidly. Thanks to the translation invariance of convolution and the strong correlation of local features, the accuracy and robustness of model detection are enhanced. New methods such as dynamic label assignment [4], attention mechanisms, and edge feature refinement have emerged, focusing more on solving the coupling relationship between features and achieving feature refinement and focused extraction. However, images in the industrial world differ from those in natural scene datasets. As shown in Fig. 1, shape textures are diverse, background information is complex, and small samples are fuzzy. The above methods are often tested on conventional defects and cannot be easily generalized to industrial defect detection in complex scenarios; especially for small defect objects with changeable texture shapes in industrial scenes, detection generalization is poor. When deployed, industrial surface defect detection requires high-precision detection with low computational complexity, aiming at real-time detection without missed defects.

For the above challenges, we design the novel FEDNet network. Specifically, since the defective objects are considered to be tiny, we design the FFM module, which can focus on tiny defective features in complex backgrounds and learn the semantic information of tiny defective features in a more targeted way. Considering that shape textures are diverse and background information is complex, we design the LTEM module, which aims to extract texture and relative position information of shallow-network defect features in a lightweight way, using depthwise separable convolution to build lightweight modules. In this paper, the perimeter factor of the prediction box is also revisited, and an adaptive adjustment loss calculation paradigm is designed to accelerate the convergence of the model.

Fig. 1. The first row is the Steel dataset, where shapes and textures are diverse and background information is complex. The second row is the DeepPCB dataset, where the defects are small and the regression task is difficult.


The core contributions of this methodology are as follows:

1. This paper presents the FFM module to focus attention on defect features in complex backgrounds, to learn the semantic information of defect features in complex backgrounds in a more targeted way, and to improve the detection accuracy of the network.
2. This paper presents the LTEM module, which introduces depthwise separable convolution to build a lightweight module and at the same time extracts and fuses features from multiple channels, aiming at lightly extracting the texture and relative location information of shallow-network defect features.
3. In this paper, an adaptive adjustment loss function, AZIoU, is designed to re-examine the validity of the prediction box with respect to its perimeter and length-width factors and to accelerate the convergence of the model, addressing the difficult regression of small objects.

The remainder of this paper is organized as follows. Section 2 reviews previous work on object detection. Section 3 describes the core modules and architecture of the proposed model. Section 4 presents the experimental evaluation and analysis of the model. Section 5 summarizes this article and discusses prospects for future work.

2 Related Work

2.1 Universal Object Detection

In recent years, with the rise of convolutional neural networks, object detection algorithms based on deep learning have also developed rapidly. The main algorithms are currently divided into one-stage and two-stage object detection. One-stage object detection predicts the class and location of objects directly from the features extracted by the network; mainstream algorithms include YOLO [4–8], YOLOX [9], and YOLOP [10]. Two-stage object detection algorithms first create proposal boxes that are likely to contain the objects to be detected and then classify and localize them; mainstream algorithms include SPP-Net [11] and Faster-RCNN [12]. Researchers have introduced attention mechanisms in response to problems such as complex defect backgrounds and difficult defect detection. Attention is a mechanism for focusing on global or local information. Soft-attention models mainly include CA [13], ResNeXt [14], GAM [14], and DANet [15]; models involving self-attention include the multi-head self-attention (MSA) proposed in the Transformer [16] and the shifted-window multi-head attention (SW-MSA) proposed in the Swin Transformer [17]. There are also several excellent algorithms in this field: Xu et al. [18] proposed the anchor-free PP-YOLOE detector, introducing advanced techniques such as the dynamic label assignment algorithm TAL; Yu et al. [19] used a two-stage FCN-based algorithm; Li et al. [20] adopted the lightweight backbone network MobileNet and the single-stage detector SSD; and Song et al. [21] proposed a saliency detection method based on an Encoder-Decoder residual network structure.

2.2 Regression Loss Function for Object Detection

The regression loss function plays a critical role in object detection. The GIoU loss proposed in [22] solves the problem in which the prediction box and the label box do not intersect and the IoU value is 0. Zheng et al. [23] added the center-point distance between the prediction box and the label box and the minimum enclosing rectangle on this basis, and subsequently introduced the aspect-ratio factor of the two boxes, making the positional regression more accurate. Zhora et al. [24] further considered the vector angle in the regression process, once again improving the training speed and inference accuracy of the model. However, regression remains difficult for industrial datasets with tiny defects and drastic scale changes: when the center points of the prediction box and the label box coincide, the penalty terms in most regression losses fail.

3 Method

3.1 Overview

This paper proposes an end-to-end focused Encoder-Decoder network. As Fig. 2 shows, the feature map is first processed by a data augmentation method and sent to the Encoder for feature encoding and extraction. Second, the Encoder's P1, P2, and P3 feature layers are used as input to the Decoder, which performs serial decoding and fusion of the multi-level feature information. Finally, the fully decoded and fused P4, P5, and P6 feature information in the Decoder network is passed to the detection head for the classification, regression, and confidence prediction of defective objects.

Fig. 2. The overall architecture of the FEDNet model proposed in this paper, the surface defect images are firstly enhanced by the data; then they are decoded and fused by the Encoder module for feature encoding extraction and the Decoder module for multi-layer feature information; finally the prediction results are output by the detection head.

3.2 Encoder

In this paper, we stack LETM and FFM convolution modules to continuously expand the receptive field and extract rich multi-level feature information. Finally, the robustness of the features is guaranteed through the SimSPPF [7] module. In industrial defect images, most defects are relatively small, resulting in an imbalance between defective and non-defective samples. For this reason, the FFM module focuses on the features of defective objects and highlights their semantic information. As shown in Fig. 3, for the feature layer F_F = X_c(i, j) ∈ R^{H×W×C}, the FFM module captures channel information and spatial location information through two branches. It divides the feature map evenly along the channel dimension into two branches F_1 and F_2 of size C/2 × H × W. The channel branch uses the normalized weight factor of the sample features in the LayerNorm [25] regularization function to generate focus weights. First, F_1 undergoes a 5 × 5 convolution to obtain a large receptive field. The feature is then reshaped to H × W × C/2 and fed into LayerNorm, which computes the mean and standard deviation within the sample features channel by channel. After the linear mapping g(α), the normalized features are mapped to a new feature distribution F_e through the learnable parameters α and β. Finally, the focus weight regeneration function g(α) computes a score for each channel sample, which is multiplied by F_e to obtain the channel focus weight G_c. The spatial branch first undergoes a 3 × 3 convolution to obtain receptive-field information different from the channel branch. After encoding the spatial position information along the H and W directions through pooling layers, the dimensions are changed to C × H × 1 and C × W × 1, yielding the feature information F_s in these two directions. The two pieces of feature information are fused through a 1 × 1 convolution layer and decomposed into two separate feature layers along the spatial dimensions H and W. The Sigmoid [26] activation function is then used to obtain the row and column focus weights G_s^h(i) and G_s^w(j). The three weights are multiplied to obtain the overall focus weight, which is then multiplied by the input feature F_F to obtain the focused feature map F'. The formulas involved in FFM are:

$$f(F_1) = \frac{F_1 - mean}{\sqrt{std + \epsilon}} \times \alpha + \beta \quad (1)$$

$$g(\alpha_i) = \ln\!\left(\frac{\alpha_i}{\sum_{i}^{n} \alpha_i + \epsilon}\right) \quad (2)$$

$$G_c = \sigma\big(F_e \otimes g(\alpha_i) \otimes F_1\big) \quad (3)$$

where mean represents the average value of each sample in the channel; std represents the variance of each sample in the channel; ε is a small constant added for numerical stability; α is a learnable normalized weight factor parameter; β is a learnable bias parameter; ⊗ denotes element-wise multiplication; σ represents the sigmoid activation function; i represents the encoding channel in the horizontal coordinate direction; and j represents the encoding channel in the vertical coordinate direction.

$$F_s = \mathrm{Add}\big(\mathrm{MaxPooling}(F_F), \mathrm{AvgPooling}(F_F)\big) \quad (4)$$

$$G_s^h(i),\, G_s^w(j) = \sigma\big(\mathrm{Conv}(\mathrm{Split}(\mathrm{BN}(\mathrm{Conv}(F_s))))\big) \quad (5)$$

$$F' = \big(G_c \otimes G_s^h(i) \otimes G_s^w(j)\big) \otimes F_F \quad (6)$$

where σ represents the sigmoid activation function and Conv is a 1 × 1 convolution. In Eq. (5), j represents the encoding channel in the horizontal coordinate direction and i represents the encoding channel in the vertical coordinate direction; F_F is the initial input feature, and ⊗ denotes element-wise multiplication.
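As a rough illustration of the two-branch focusing idea described above, the following PyTorch sketch combines a normalization-driven channel weight with directionally pooled spatial weights. It is a minimal sketch, not the authors' implementation: the class name, the 5 × 5 / 3 × 3 kernel choices, and the use of GroupNorm as a per-channel stand-in for the described normalization are assumptions made only for illustration.

```python
import torch
import torch.nn as nn

class FFMSketch(nn.Module):
    """Illustrative two-branch focus module in the spirit of Eqs. (1)-(6)."""
    def __init__(self, channels):
        super().__init__()
        half = channels // 2
        # channel branch: large receptive field, then per-channel normalisation
        self.conv5 = nn.Conv2d(half, half, kernel_size=5, padding=2)
        self.norm = nn.GroupNorm(num_groups=half, num_channels=half)  # per-channel stand-in
        self.alpha = nn.Parameter(torch.ones(half))    # learnable scale (alpha)
        self.beta = nn.Parameter(torch.zeros(half))    # learnable bias (beta)
        # spatial branch: smaller receptive field, then directional pooling
        self.conv3 = nn.Conv2d(half, half, kernel_size=3, padding=1)
        self.fuse = nn.Conv2d(half, half, kernel_size=1)

    def forward(self, x):
        f1, f2 = torch.chunk(x, 2, dim=1)              # split into two C/2 branches
        # channel focus weight Gc
        fe = self.norm(self.conv5(f1)) * self.alpha.view(1, -1, 1, 1) + self.beta.view(1, -1, 1, 1)
        score = torch.log(self.alpha.abs() / (self.alpha.abs().sum() + 1e-6) + 1e-6)  # g(alpha)
        gc = torch.sigmoid(fe * score.view(1, -1, 1, 1))
        # spatial focus weights Gh, Gw from H- and W-direction pooling
        fs = self.conv3(f2)
        gh = torch.sigmoid(self.fuse(fs.mean(dim=3, keepdim=True)))   # C x H x 1
        gw = torch.sigmoid(self.fuse(fs.mean(dim=2, keepdim=True)))   # C x 1 x W
        # combine the three weights and re-weight the full input (Eq. 6)
        focus = torch.cat([gc, gc], 1) * torch.cat([gh, gh], 1) * torch.cat([gw, gw], 1)
        return focus * x
```

The sketch only conveys how the channel and spatial weights broadcast over the input; the exact reshaping and pooling operators of the paper may differ.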

Fig. 3. FFM module, mainly through the fusion of channel focus branch and spatial focus branch, can assist the model to better focus on the shallow texture shape information and the deep important feature information.

LETM is designed to encode and extract features in the shallow part of the network and to decode and fuse features in the deep part of the network. As shown in Fig. 4, the cascaded convolution module LETM continuously adjusts the size of the receptive field and the number of lightweight convolutions N of the different branches as the model width w and depth d increase. Given the input F_L ∈ R^{H×W×C}, each feature layer {f_d, d = 1, 2, 3} contains two branches: a lightweight branch {f_d^o × N_d, N = 1, 3} and branches with different receptive fields {f_d^i = DaConv ∈ H × W × (c/r) d_i, r = 1, 2, 3, i = 1, 2, 3}. The feature layer undergoes different amounts of depthwise separable convolution, which filters out unimportant information in the original image to reduce noise and the number of parameters; at the same time, common convolutions with different receptive fields are used to capture complex and variable edge texture features. Finally, the feature information from the multiple branches is concatenated and fused along the channel direction, with the purpose of helping the feature extraction network better focus on the spatial location information of the target. The validity of the LETM module is demonstrated in Table 3.
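The following is a minimal sketch of this lightweight two-branch idea: a stack of depthwise separable convolutions on one half of the channels and a larger-kernel convolution on the other half, concatenated along the channel axis. The branch count, kernel sizes, and activation are assumptions, not the paper's exact LETM configuration.

```python
import torch
import torch.nn as nn

def dw_separable(c_in, c_out, k=3):
    # depthwise separable convolution: depthwise filtering + 1x1 pointwise fusion
    return nn.Sequential(
        nn.Conv2d(c_in, c_in, k, padding=k // 2, groups=c_in),
        nn.Conv2d(c_in, c_out, 1),
        nn.BatchNorm2d(c_out),
        nn.SiLU(),
    )

class LETMSketch(nn.Module):
    """Illustrative lightweight texture-extraction block (not the exact LETM)."""
    def __init__(self, channels, n_dw=2):
        super().__init__()
        half = channels // 2
        # lightweight branch: N depthwise separable convolutions filter noise cheaply
        self.lightweight = nn.Sequential(*[dw_separable(half, half) for _ in range(n_dw)])
        # texture branch: a different (larger) receptive field for edge texture
        self.texture = nn.Conv2d(half, half, kernel_size=5, padding=2)
        self.out = nn.Conv2d(channels, channels, kernel_size=1)

    def forward(self, x):
        a, b = torch.chunk(x, 2, dim=1)
        return self.out(torch.cat([self.lightweight(a), self.texture(b)], dim=1))
```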

Fig. 4. The LETM module structure, adjusting the number of depth-separable convolutions and branching structure in different widths and depths, is dedicated to filtering the non-important information in the original image to reduce the noise in the image, while reducing the number of model parameters to ensure light-weight extraction of features.

3.3 Decoder

In the Decoder network, we perform serial decoding and fusion of multi-level feature information, sequentially using the LETM and FFM modules to recover features, while using the idea of BiFPN [27] to efficiently fuse shallow texture edge information and deep semantic information. Different feature layers are assigned different learnable fusion weights ω_i, thereby improving the regression and classification of the detection head. The BiFPN_Concat fusion formula used in this paper is as follows:

$$Out = \sum_{i} \frac{\omega_i}{\epsilon + \sum_{j} \omega_j} \cdot I_i \quad (7)$$

where ω_i is a learnable weight parameter that is normalized to ensure stability; ε is a small non-zero parameter that keeps the normalization stable when the weights from different feature layers become identical or zero; and I_i represents the feature information of the different feature layers to be fused.
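A minimal sketch of the weighted fusion in Eq. (7), with learnable per-input weights normalized as described, might look as follows (module and parameter names are illustrative):

```python
import torch
import torch.nn as nn

class WeightedFusion(nn.Module):
    """Sketch of BiFPN-style fusion: learnable, normalised weights per input map."""
    def __init__(self, num_inputs, eps=1e-4):
        super().__init__()
        self.w = nn.Parameter(torch.ones(num_inputs))
        self.eps = eps

    def forward(self, feats):                  # feats: list of tensors of identical shape
        w = torch.relu(self.w)                 # keep the weights non-negative
        w = w / (w.sum() + self.eps)           # normalise by the sum plus epsilon
        return sum(wi * f for wi, f in zip(w, feats))
```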

3.4 Loss Function

Although current regression loss functions for object detection consider a variety of factors, they mainly focus on the center-point distance and the aspect-ratio consistency between the prediction box and the ground-truth box. As shown in Fig. 5(a), when the center of the prediction box does not coincide with the center of the ground-truth box, gradient optimization proceeds efficiently regardless of which loss function (GIoU [22] or CIoU [23]) is used. However, as shown in Fig. 5(b), when the centers of the two boxes coincide exactly and the boxes have the same aspect ratio, the distance loss, the aspect-ratio consistency loss, and the IoU all fail. Industrial defect datasets contain many small defect targets with drastic scale changes, which affects the accuracy and convergence rate of the regression. When the center points of the two boxes coincide, the Euclidean distance between the two points and its ratio to the diagonal of the enclosing rectangle become 0; when both boxes additionally have the same aspect ratio, only the IoU loss remains valid. Therefore, this paper rethinks the validity of the two boxes under specific perimeter and aspect-ratio conditions. Specifically, a new perimeter factor is introduced as a supplement, and the arctan function is used to measure the perimeters of the two boxes. For tiny defective objects, the perimeter factor computes a larger IoU value, which produces a smaller loss and eases the regression of small defects. As shown in Fig. 5(c), we found that when the centers of the two boxes overlap, there are still many cases in which the prediction box does not accurately fit the labeled box. At this point, however, the difference between the perimeters of the two boxes still exists, as does the gap between the center distance and the diagonal of the enclosing rectangle, so the model can continue to be guided in the direction of reducing this part of the loss, bringing the prediction box ever closer to the ground-truth box. At the same time, in order to balance the magnitude of the gradient updates between different factors, a balance function is designed as a new weight calculation paradigm. Finally, the AZIoU piecewise adaptive loss function is formulated as follows:

Fig. 5. Example diagrams of real and predicted boxes in different cases. Where red represents the real box, green represents the target box, the blue line segment represents the minimum outer rectangle diagonal distance, and the black line segment represents the distance between the center points of the two boxes. (Color figure online)

$$Loss_{reg} = 1 - \omega_1 \cdot IoU(B, B^{gt}) + AZIoU \quad (8)$$

$$\omega_i = \frac{a/b/c}{a + b + c} * \frac{e^{a}/e^{b}/e^{c}}{e^{a} + e^{b} + e^{c} + \epsilon} \quad (9)$$

$$AZIoU = \begin{cases} \omega_2 * \dfrac{\rho^2(B, B^{gt})}{c^2} + \omega_3 * \alpha \dfrac{4}{\pi^2}\left(\tan^{-1}\dfrac{w^{gt}}{h^{gt}} - \tan^{-1}\dfrac{w^{p}}{h^{p}}\right)^{2} \\ \omega_2 * \dfrac{\rho^2(B, B^{gt})}{c^2} + \omega_3 * \alpha \dfrac{4}{\pi^2}\left(2\tan^{-1}(h^{gt} + w^{gt}) - 2\tan^{-1}(h^{p} + w^{p})\right)^{2} \end{cases} \quad (10)$$

where B is the center point of the prediction box; B^{gt} is the center point of the ground-truth box; ρ is the Euclidean distance; c represents the diagonal length of the minimum enclosing rectangle; α is the balance factor; h^{gt}, w^{gt}, w^{p}, h^{p} are the heights and widths of the ground-truth box and the prediction box, respectively; and a, b, and c are learnable parameters.
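To make the role of the perimeter term concrete, the sketch below implements an IoU regression loss with a DIoU-style center term and an arctan-based perimeter term in the spirit of Eqs. (8)-(10). The fixed weighting and the absence of the learnable a, b, c parameters and of the case switching are simplifications, so this is not the paper's exact AZIoU.

```python
import math
import torch

def perimeter_iou_loss(pred, target, eps=1e-7):
    """Illustrative IoU + center-distance + perimeter loss; boxes are (x1, y1, x2, y2)."""
    # intersection over union
    lt = torch.max(pred[:, :2], target[:, :2])
    rb = torch.min(pred[:, 2:], target[:, 2:])
    wh = (rb - lt).clamp(min=0)
    inter = wh[:, 0] * wh[:, 1]
    area_p = (pred[:, 2] - pred[:, 0]) * (pred[:, 3] - pred[:, 1])
    area_t = (target[:, 2] - target[:, 0]) * (target[:, 3] - target[:, 1])
    iou = inter / (area_p + area_t - inter + eps)

    # squared center distance over the enclosing-box diagonal (DIoU-style)
    cp = (pred[:, :2] + pred[:, 2:]) / 2
    ct = (target[:, :2] + target[:, 2:]) / 2
    enc_lt = torch.min(pred[:, :2], target[:, :2])
    enc_rb = torch.max(pred[:, 2:], target[:, 2:])
    rho2 = ((cp - ct) ** 2).sum(dim=1)
    c2 = ((enc_rb - enc_lt) ** 2).sum(dim=1) + eps

    # perimeter factor: compare h + w of both boxes through arctan
    wp, hp = pred[:, 2] - pred[:, 0], pred[:, 3] - pred[:, 1]
    wt, ht = target[:, 2] - target[:, 0], target[:, 3] - target[:, 1]
    perim = (4 / math.pi ** 2) * (torch.atan(ht + wt) - torch.atan(hp + wp)) ** 2

    return (1 - iou + rho2 / c2 + perim).mean()
```

The perimeter term stays non-zero even when the centers coincide and the aspect ratios match, which is exactly the failure case the paragraph above describes.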

4 Experiments and Analysis

4.1 Parameter Settings and Experimental Environment

The IDE used in this experiment is PyCharm 2021 Professional Edition. The PyTorch version is 1.9.1 and the CUDA version is 11.6. Model training and inference are performed on an NVIDIA A100-SMX with 40 GB of GPU memory and 16 GB of CPU memory. During training, the input images are resized to 640 × 640; the number of epochs is 300, the initial learning rate is 0.001, the weight decay is 0.0001, and the momentum is 0.9. The Adam optimizer is used, and the learning rate is adjusted by cosine annealing. IoU, mAP, GFLOPs, recall, precision, and F1 score are selected as evaluation metrics.
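A hedged sketch of the stated optimization setup (Adam with the listed learning rate and weight decay, cosine-annealed over 300 epochs) is shown below; the placeholder model and the mapping of the stated momentum onto Adam's default beta1 = 0.9 are assumptions.

```python
import torch

model = torch.nn.Conv2d(3, 16, 3)   # placeholder model for the sketch
optimizer = torch.optim.Adam(model.parameters(), lr=1e-3, weight_decay=1e-4)  # beta1=0.9 by default
scheduler = torch.optim.lr_scheduler.CosineAnnealingLR(optimizer, T_max=300)  # cosine annealing

for epoch in range(300):
    # ... one training epoch over 640x640 images, optimizer.step() per batch ...
    scheduler.step()
```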

4.2 Experimental Datasets

Steel is a dataset published by Northeastern University that contains six typical surface defects of hot-rolled steel strips: pitted surface, crazing, rolled-in scale, patches, inclusion, and scratches. The shape and texture in the defect images change dramatically, and the brightness and background complexity of the defect images differ greatly. Before the experiments, the dataset was preprocessed: first, photometric distortion was applied and the brightness of some images was adjusted; next, the processed images were translated and flipped horizontally. We selected 1400 images with more complex defect types, and the images were divided into training, test, and verification sets according to a 6:2:2 ratio. The DeepPCB dataset contains 1500 PCB images acquired with a linear-scan camera; the images contain six defect categories: open, short, mouse bite, spur, pinhole, and spurious copper. It is divided into training, test, and validation sets according to a 7:1:2 ratio.
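As a rough illustration of the described preprocessing and split, the following sketch applies brightness distortion, translation, and horizontal flipping with torchvision and produces a 6:2:2 index split; all parameter values are illustrative guesses rather than the paper's settings.

```python
import torch
from torchvision import transforms

augment = transforms.Compose([
    transforms.ColorJitter(brightness=0.3),                    # photometric distortion
    transforms.RandomHorizontalFlip(p=0.5),                    # horizontal flip
    transforms.RandomAffine(degrees=0, translate=(0.1, 0.1)),  # panning / translation
    transforms.Resize((640, 640)),
    transforms.ToTensor(),
])

def split_indices(n, ratios=(0.6, 0.2, 0.2), seed=0):
    # returns train / test / validation index lists in the stated 6:2:2 proportion
    g = torch.Generator().manual_seed(seed)
    perm = torch.randperm(n, generator=g).tolist()
    a, b = int(ratios[0] * n), int((ratios[0] + ratios[1]) * n)
    return perm[:a], perm[a:b], perm[b:]
```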

4.3 Comparative Experiment

This paper selects traditional models, recent industrial detection models, and a higher-accuracy two-stage model for comparison on the two datasets. As shown in Table 1, traditional industrial defect detection models do not perform well on either dataset. We also compared recent YOLO-series models: FEDNet's mAP@0.5 and F1 scores are higher than those of models such as YOLOv7 and YOLOR. To verify the robustness of the model, we compared it with recent industrial detection models: on the Steel dataset, FEDNet's mAP@0.5 is 4% higher than PP-YOLOE and AirDet, with an F1 score of up to 0.78, and both the recall and precision indices on the DeepPCB dataset are 3% higher than AirDet. It is worth noting that while FEDNet's results on the two datasets are similar to those of the two-stage model Faster-RCNN, its GFLOPs are more than ten times smaller. Finally, we show the detection results for the two datasets in Fig. 6.

4.4 Ablation Experiment

As shown in Table 2, this paper chooses YOLOv5s as the baseline to verify the ability of the FFM module to focus on defect features. The second row adds the CA attention module to the C3 module of the baseline model, and F1 is 1% higher than the baseline. Adding the NAM attention module increases mAP@0.5 by 1% and the recall rate by 1%. Choosing GAM attention, regardless of its cost, increases the mAP@0.5 and F1 indicators by 2%. Finally, adding the FFM focus module increases precision by 3%, and the other indicators reach their best values. Figure 7 shows a visual heat map of the FFM module, in which a higher focus on defective objects can be observed directly.

Table 1. Comparison tests on two representative datasets of industrial surface defects.

| Approach | GFLOPs | Steel mAP@0.5 | Steel mAP@0.5:0.95 | Steel precision | Steel recall | Steel F1 | DeepPCB mAP@0.5 | DeepPCB mAP@0.5:0.95 | DeepPCB recall | DeepPCB F1 |
|---|---|---|---|---|---|---|---|---|---|---|
| Faster-Rcnn | 201.3 | 76.3% | 40.91% | 54.14% | 82.25% | 0.64 | 97.95% | 72.91% | 97.86% | 0.98 |
| YOLOv3 | 154.9 | 68.92% | 37.41% | 72.25% | 65.94% | 0.69 | 95.86% | 66.78% | 92.68% | 0.92 |
| YOLOv4 | 120.2 | 68.73% | 39.62% | 85.61% | 62.54% | 0.72 | 95.38% | 65.18% | 94.53% | 0.93 |
| YOLOv5s | 15.8 | 74.92% | 40.29% | 70.97% | 76.03% | 0.73 | 97.21% | 67.72% | 95.89% | 0.95 |
| YOLOv7 | 103.2 | 74.52% | 39.21% | 80.31% | 67.46% | 0.72 | 97.53% | 72.23% | 97.11% | 0.97 |
| Efficientdet-d3 | 24.9 | 65.65% | 37.11% | 84.76% | 48.44% | 0.61 | 85.78% | 63.91% | 78.44% | 0.84 |
| Centernet | 109.8 | 67.24% | 36.51% | 81.51% | 51.33% | 0.63 | 82.54% | 61.57% | 78.33% | 0.76 |
| Retinanet | 191.6 | 64.24% | 31.93% | 76.73% | 49.62% | 0.64 | 86.49% | 65.85% | 72.22% | 0.78 |
| YOLOR-P6 | 57 | 76.52% | 41.32% | 65.56% | 83.37% | 0.74 | 98.14% | 72.12% | 97.42% | 0.97 |
| PPYOLOE-s | 17.4 | 75.42% | 40.89% | 73.75% | 76.74% | 0.75 | 97.34% | 70.23% | 94.36% | 0.95 |
| AirDet-s | 28 | 74.52% | 40.38% | 74.62% | 73.58% | 0.74 | 96.32% | 69.81% | 94.78% | 0.94 |
| FEDNet | 15.3 | 78.68% | 42.86% | 76.23% | 80.52% | 0.78 | 98.42% | 72.19% | 97.55% | 0.97 |


Fig. 6. Detection results of Steel and DeepPCB surface defects.

Table 2. Ablation experiment of FFM module on Steel dataset.

| Approach and improvement | mAP@0.5 | mAP@0.5:0.95 | precision | recall | F1 |
|---|---|---|---|---|---|
| YOLOv5s(+C3) | 74.92% | 40.29% | 70.97% | 76.03% | 0.73 |
| YOLOv5s(+C3+CA) | 75.38% | 40.62% | 71.43% | 76.72% | 0.74 |
| YOLOv5s(+C3+NAM) | 75.85% | 40.86% | 72.21% | 77.42% | 0.74 |
| YOLOv5s(+C3+GAM) | 76.18% | 41.06% | 72.83% | 77.92% | 0.75 |
| YOLOv5s(+FFM) | 76.54% | 41.25% | 73.12% | 78.21% | 0.75 |

Fig. 7. (a) Original image; (b) Thermal diagram of C3 module; (c) Thermal diagram of CA attention; (d) A thermal map of NAM attention; (e) Thermal diagram of GAM; (f) Thermal diagram of FFM module; (g) Test result diagram.

LETM ablation experiment: As shown in Table 3, this paper continues the ablation experiments on the LETM module, again using YOLOv5s as the baseline. Lightweight backbone networks such as MobileNet-v3 and ShuffleNet-v2 are compared, and all their indicators are similar to those of the baseline's C3. The fifth row shows the LETM module, which uses only 11.5 GFLOPs while increasing mAP@0.5 by 2% and F1 by 2%. Therefore, LETM can effectively extract complex edge texture features while remaining lightweight.

Table 3. Ablation experiment of LETM on Steel dataset.

| Approach and improvement | mAP@0.5 | mAP@0.5:0.95 | precision | recall | F1 | GFLOPs |
|---|---|---|---|---|---|---|
| YOLOv5s(+C3) | 74.92% | 40.29% | 70.97% | 76.03% | 0.73 | 15.8 |
| YOLOv5s(+MobileNet-v3) | 74.12% | 39.32% | 71.21% | 75.21% | 0.73 | 10.4 |
| YOLOv5s(+ShuffleNet-v2) | 74.43% | 38.72% | 72.16% | 74.36% | 0.73 | 11.5 |
| YOLOv5s(+CSPDarkNet53) | 74.92% | 40.29% | 70.97% | 76.03% | 0.73 | 13.8 |
| YOLOv5s(+LETM) | 76.45% | 41.31% | 74.23% | 77.34% | 0.75 | 11.5 |

AZIoU ablation experiment: The FEDNet model was chosen as the baseline for this experiment. The first four rows of data are the experimental results of the DIoU, CIoU, EIoU, and SIoU loss functions, respectively; among these, the SIoU loss function performs best, with mAP@0.5:0.95 and F1 of 41.29% and 0.77, respectively. The fifth row shows the AZIoU proposed in this article: recall is 1% higher than with SIoU, and mAP@0.5 also increases by about 1%. It is worth noting that, as shown in Fig. 8, AZIoU converges the fastest on the Steel dataset (Table 4).

Fig. 8. Detection results of Steel and DeepPCB surface defects.

Table 4. Ablation experiment of Loss on Steel dataset.

| Approach and improvement | mAP@0.5 | mAP@0.5:0.95 | precision | recall | F1 |
|---|---|---|---|---|---|
| FEDNet (DIoU) | 76.35% | 40.08% | 71.19% | 76.25% | 0.74 |
| FEDNet (CIoU) | 77.42% | 40.41% | 74.25% | 78.94% | 0.76 |
| FEDNet (EIoU) | 76.73% | 39.62% | 72.61% | 78.54% | 0.75 |
| FEDNet (SIoU) | 77.92% | 41.29% | 75.17% | 79.03% | 0.77 |
| FEDNet (AZIoU) | 78.68% | 42.86% | 76.23% | 80.52% | 0.78 |

5 Conclusions

This paper presents a lightweight focused Encoder-Decoder network for industrial defect detection. Specifically, the FFM structure can focus on defect features in complex backgrounds to distinguish the different features of the defects. The LETM module is dedicated to lightly extracting the texture and relative location information of shallow-network defect features. Finally, an adaptive adjustment loss function, AZIoU, based on perimeter and aspect-ratio factors is designed, accelerating the model's convergence. In the future, our network will be optimized to improve robustness across different industrial defect detection scenarios. Acknowledgements. This research was funded by the Key R&D Program of Shandong Province, China (2022RZB02018), and the Taishan Scholars Program (NO. tscy20221110).

References
1. Li, Y., Liu, M.: Aerial image classification using color coherence vectors and rotation and uniform invariant LBP descriptors. In: 2018 IEEE 3rd Advanced Information Technology, Electronic and Automation Control Conference (IAEAC), pp. 653–656 (2018)
2. Long, X., et al.: PP-YOLO: an effective and efficient implementation of object detector. ArXiv Preprint ArXiv:2007.12099 (2020)
3. Tsai, D., Huang, C.: Defect detection in electronic surfaces using template-based Fourier image reconstruction. IEEE Trans. Comp. Pack. Manuf. Technol. 9, 163–172 (2018)
4. Ge, Z., Liu, S., Li, Z., Yoshie, O., Sun, J.: OTA: optimal transport assignment for object detection. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 303–312 (2021)
5. Redmon, J., Farhadi, A.: YOLOv3: an incremental improvement. ArXiv Preprint ArXiv:1804.02767 (2018)
6. Bochkovskiy, A., Wang, C., Liao, H.: YOLOv4: optimal speed and accuracy of object detection. ArXiv Preprint ArXiv:2004.10934 (2020)
7. Li, C., et al.: YOLOv6: a single-stage object detection framework for industrial applications. ArXiv Preprint ArXiv:2209.02976 (2022)
8. Wang, C., Bochkovskiy, A., Liao, H.: YOLOv7: trainable bag-of-freebies sets new state-of-the-art for real-time object detectors. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 7464–7475 (2023)
9. Ge, Z., Liu, S., Wang, F., Li, Z., Sun, J.: YOLOX: exceeding YOLO series in 2021. ArXiv Preprint ArXiv:2107.08430 (2021)
10. Wu, D., et al.: YOLOP: you only look once for panoptic driving perception. Mach. Intell. Res. 19, 550–562 (2022)
11. He, K., Zhang, X., Ren, S., Sun, J.: Spatial pyramid pooling in deep convolutional networks for visual recognition. IEEE Trans. Pattern Anal. Mach. Intell. 37, 1904–1916 (2015). https://doi.org/10.1109/TPAMI.2015.2389824
12. Ren, S., He, K., Girshick, R., Sun, J.: Faster R-CNN: towards real-time object detection with region proposal networks. In: Advances in Neural Information Processing Systems, vol. 28 (2015)
13. Hou, Q., Zhou, D., Feng, J.: Coordinate attention for efficient mobile network design. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 13713–13722 (2021)
14. Xie, S., Girshick, R., Dollár, P., Tu, Z., He, K.: Aggregated residual transformations for deep neural networks. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 1492–1500 (2017)
15. Fu, J., et al.: Dual attention network for scene segmentation. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 3146–3154 (2019)
16. Vaswani, A., et al.: Attention is all you need. In: Advances in Neural Information Processing Systems, vol. 30 (2017)
17. Liu, Z., et al.: Swin transformer: hierarchical vision transformer using shifted windows. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 10012–10022 (2021)
18. Xu, S., et al.: PP-YOLOE: an evolved version of YOLO. ArXiv Preprint ArXiv:2203.16250 (2022)
19. Long, Z., Wei, B., Feng, P., Yu, P., Liu, Y.: A fully convolutional networks (FCN) based image segmentation algorithm in binocular imaging system. In: 2017 International Conference on Optical Instruments and Technology: Optoelectronic Measurement Technology and Systems, vol. 10621, pp. 568–575 (106211W). SPIE (2018)
20. Li, K., Tian, Y., Qi, Z.: CarNet: a lightweight and efficient encoder-decoder architecture for high-quality road crack detection. ArXiv Preprint ArXiv:2109.05707 (2021)
21. Song, G., Song, K., Yan, Y.: EDRNet: encoder-decoder residual network for salient object detection of strip steel surface defects. IEEE Trans. Instrum. Meas. 69, 9709–9719 (2020)
22. Rezatofighi, H., Tsoi, N., Gwak, J., Sadeghian, A., Reid, I., Savarese, S.: Generalized intersection over union: a metric and a loss for bounding box regression. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 658–666 (2019)
23. Zheng, Z., Wang, P., Liu, W., Li, J., Ye, R., Ren, D.: Distance-IoU loss: faster and better learning for bounding box regression. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 34, pp. 12993–13000 (2020)
24. Lin, T., Goyal, P., Girshick, R., He, K., Dollár, P.: Focal loss for dense object detection. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 2980–2988 (2017)
25. Ba, J., Kiros, J., Hinton, G.: Layer normalization. ArXiv Preprint ArXiv:1607.06450 (2016)
26. Elfwing, S., Uchibe, E., Doya, K.: Sigmoid-weighted linear units for neural network function approximation in reinforcement learning. Neural Netw. 107, 3–11 (2018)
27. Tan, M., Pang, R., Le, Q.: EfficientDet: scalable and efficient object detection. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 10781–10790 (2020)

Optimal Task Grouping Approach in Multitask Learning

Reza Khoshkangini1,3(B), Mohsen Tajgardan2, Peyman Mashhadi3, Thorsteinn Rögnvaldsson3, and Daniel Tegnered4

1 Internet of Things and People Research Center (IoTap), Department of Computer Science and Media Technology, Malmö University, Malmö, Sweden
[email protected]
2 Faculty of Electrical and Computer Engineering, Qom University of Technology, Qom, Iran
[email protected]
3 Center for Applied Intelligent Systems Research (CAISR), Halmstad University, Halmstad, Sweden
{peyman.mashhadi,reza.khoshkangini,thorsteinn.rognvaldsson}@hh.se
4 Volvo Group Connected Solutions, Gothenburg, Sweden
[email protected]

Abstract. Multi-task learning has become a powerful solution in which multiple tasks are trained together to leverage the knowledge learned from one task to improve the performance of the other tasks. However, the tasks are not always constructive to each other in the multi-task formulation and might interact negatively during the training process, leading to poor results. Thus, this study focuses on finding the optimal group of tasks that should be trained together for multi-task learning in an automotive context. We propose a multi-task learning approach to model multiple vehicle long-term behaviors using low-resolution data and utilize gradient descent to efficiently discover the optimal group of tasks/vehicle behaviors that can increase the performance of the predictive models in a single training process. In this study, we also quantify the contribution of individual tasks to their groups and to the other groups' performance. The experimental evaluation of data collected from thousands of heavy-duty trucks shows that the proposed approach is promising.

Keywords: Machine Learning · Vehicle Usage Behavior · Multitask learning

1 Introduction

Today's automotive manufacturers are becoming more motivated to investigate how vehicles behave during their operations. This is because understanding the long-term behavior of the vehicle supports manufacturers' maintenance strategies to lower costs and increase the performance of their fleets based on their capabilities, needs, and customers' demands.

Although high-resolution time-series data such as GPS positions or other sensory data could potentially be employed to extract vehicle behavior patterns, such data is costly to transfer from connected assets and, most importantly, raises privacy issues. In our previous study [10], we demonstrated that modeling vehicle behavior with low-resolution data is possible by building a complex ensemble multi-task deep neural network. This finding supports overcoming the concern highlighted above. In this study (a continuation of our previous work), we investigate how different vehicle behaviors derived from high-resolution data can be optimally trained together in a multi-task formulation using low-resolution data to enhance the predictive model's performance. We aim to find out, in one training process, multiple groups of tasks that outperform the multi-task model in which all tasks are trained together.

Analysis of truck operation is critical to understand customers' real requirements and to improve future truck design and development. Vehicle long-term behaviors, such as the night stop behavior (whether the vehicle usually stops at a single home base, several distant bases, or irregularly) or the driver class (whether a vehicle is driven by only one driver or by multiple drivers), can be utilized to inform the design of future trucks and their infrastructure. Multiple studies in the automotive sector have investigated short-term vehicle behavior and attempted to correlate behaviors such as the driver's lane-changing patterns [16], fatigue [4], and aggressiveness [21] with energy consumption, breakdowns [1,11], and CO2 emissions [17,19,25] in order to reduce safety issues and enhance vehicle performance. Many of these studies formulated the practice as a single-task problem and employed machine learning and deep neural networks to map such patterns to a particular performance factor. For example, [11] built a deep neural network model fed with multiple sensor signals such as engine speed, vehicle speed, and engine load to predict safe and unsafe driving behaviors. In [20], researchers focused on a single driving behavior, aggressive driving, and tried to measure and correlate NOx emissions with different levels of aggressive driving. A similar investigation has been done by Cohi et al. in [5] to understand how aggressive driving impacts fuel consumption. Various environmental factors such as weather, time, and road conditions have also been used to model vehicle behavior and correlate it to vehicle performance [3]. Although the reviewed studies vary widely in terms of which kinds of vehicle demeanor are evaluated and which approaches are built and utilized to handle the problem, AI and machine learning have proved to be successful in modeling vehicle behavior in the automotive sector [1,11]. However, studies are lacking in modeling multiple long-term vehicle behaviors. In this context, multiple patterns of operation can occur in a particular time window. Thus, we believe such modeling can be improved by incorporating AI and advanced multi-task deep neural network approaches.

Motivated by the above, in this study, we propose a deep neural network approach to map vehicle usage to multiple vehicle behaviors and focus on finding the optimal group of tasks that should be trained together to enhance the predictive model's performance.

The proposed method consists of two main phases. The first phase is data pre-processing, which extracts hidden information from low-frequency data; this information is later combined with the vehicle behaviors obtained from high-resolution data, which also serve as prediction variables (here, four different behaviors/tasks per vehicle). In the second phase, we develop a multi-task deep neural network with a specific head layer for each individual task, which enables the network to train and add more behaviors with a small amount of data without retraining the whole network. In this phase, we utilize the gradient descent of each task to find the optimal group of tasks that should be trained together to boost the predictive model's performance in forecasting different vehicle behaviors. We compute the gradient of each task with respect to the shared layers to assess how similarly the models err when predicting different tasks. In this way, we can quantify the knowledge transferred between tasks, which leads to finding the optimal groups. In addition, we quantify the contribution of each task in group training to understand how individual tasks behave when coupled with other tasks. Considering our previous study [10] in this context, this work is the first to investigate the optimization of task grouping in modeling and predicting multiple vehicle long-term behaviors through one single training process. This allows us to quantify the positive and negative contributions of each task. The following research questions (RQs) further elaborate the investigative objectives of our proposed approach:

– RQ1 - Vehicle Task Grouping: To what extent could the tasks of vehicle behaviors be grouped optimally?
– RQ2 - Vehicle Behavior Transference Quantification: To what extent could the vehicle behavior transference be quantified?
  • SbQ2.1 - Quantifying the contribution of each task in its group?
  • SbQ2.2 - Quantifying the contribution of each task onto different groups?

Taken together, with these research questions, our work aims to counter the practice noted above by utilizing the vehicle's low-resolution operational usage to find the optimal group of tasks (long-term behaviors) to be trained together. First, we concentrate on modeling and predicting multiple behaviors simultaneously in a multi-task learning fashion as the baseline. Then, to answer RQ1, we develop a predictive multi-task model where gradient descent on the shared layers is used to extract which tasks might have similar errors over the training process. A dot-product function is designed to determine which of those gradients point in a similar direction over the training process. The reported figures reveal how this approach can find the optimal clusters of tasks, leading to enhanced performance without training all combinations. To answer RQ2, we employ a pairwise comparison between the inner-join values of multiple groups and explore each individual task's contribution. The results of RQ2 allow us to understand which tasks have positive and negative contributions within their cluster and on other groups of tasks, and most importantly, we could find out the tasks that play an important role in the multi-task predictive model.

The development of this multi-task and gradient-based approach in the automotive sector, particularly for vehicle long-term behavior modeling, constitutes this paper's main contribution. The contributions of this study are underlined below:

– We developed a multi-task network and utilized a gradient descent algorithm to efficiently find the optimal tasks that should be trained together.
– We quantified an inter-task transference associated with our multi-task network to extract and reveal the contribution and transference of the tasks in task grouping.
– We quantified each task's positive and negative contribution in each cluster, showing that tasks behave differently when trained in different groups.

The remainder of the paper is organized as follows: Section 2 presents the related studies in this context. In Sect. 3, we describe the data used in this study. Section 4 formulates the problem. The proposed approach is described in Sect. 5.2. Section 6 covers the experimental evaluation and results. Section 7 gives a discussion and summary of the work.

2 Related Work

This study introduces an efficient approach to finding the optimal clusters of tasks that should be trained together in a multi-task learning problem. The approach is applied to the automotive context where long-term vehicle behavior modeling was concerned with multi-task deep neural networks based on transferring knowledge from high-resolution to low-resolution data. Therefore, before we dive into the details of our proposed approach, we review some relevant solutions in multi-task learning, vehicle behavior modeling, and position our own work concerning those areas. Driving behavior modeling has become an essential field of study among automotive sector researchers seeking to handle different challenges. Earlier studies concentrated on developing machine learning models to build a self-driving system [2,18], and later we could observe investigations focused more on modeling drivers’ maneuver patterns [13,14,28]. For example, in [9], an ensemble unsupervised machine learning approach is used to extract vehicle usage patterns in different seasons and assess their performances by focusing on vehicle breakdowns and fuel consumption. Yet, many studies have concentrated on signal processing and utilized single or few sensors data to estimate immediate behaviors such as driver frustration or lane changing [16] to forecasting fuel consumption [26] or build up ADAS systems [15,22]. This differs from our unique study, which presents vehicles’ long-term behavior modeling approach based on transferring knowledge from high-resolution to low-resolution data and using gradient descent to find the optimal cluster of tasks.


Multi-task learning approaches are used to gain the knowledge shared between related tasks. The aim of multi-task learning is that the shared representation has more generalization power since it needs to learn a more general representation useful for multiple tasks. Multi-task learning has gained much attention from the automotive sector, where researchers used them in various domains, such as predictive maintenance, autonomous driving, drivers’ behavior reasoning [6,23,24,27]. For instance, Chowdhuri et al. in [6] introduced a multi-task learning approach for drivers’ behavior recognition. They developed a CNN-based model as the encoder and utilized multiple decoders, including fully connected and LSTM layers heads to different tasks. In a similar context of driving behavior detection, we could observe a study in [27], where multi-task learning is used for three related tasks: illegal driver detection, legal driver identification, and driving behavior evaluation. In this study, sequential modeling– LSTM–is first used to extract the common representation of data, then injected into an SVDD model to detect illegal driver behavior and two feed-forward neural networks for legal driver identification and behavior evaluation. In the autonomous driving domain, in [7], a multi-task network is designed to model and predict three short-term behavioral modes: Direct mode, follow mode, and furtive mode, where they used self-driving model cars for driving in unstructured environments such as sidewalks and unpaved roads to evaluate the proposed approach. In [12], a graph-based multi-task network is developed to predict trajectories of traffic actors with interactive behaviors. This study considered Trajectory predictions, 3-D bounding box prediction, and interactive events recognition as related tasks in building the MLT network. We found the closest study to our works in [8], where they tried to efficiently find the task grouping using inter-task-affinity. They attempted to quantify the effect of one task on another by calculating the loss value of each task before and after updating the shared layers in the multi-task network. Although the approach can find the optimal groups in one training process, the network should be trained once more in each epoch to quantify the impact of one task on another, which computationally is expensive. This can be intensified when the number of tasks gets more. All in all, three main limitations can be observed in earlier studies: 1) in terms of vehicle behavior modeling, most are limited to using only a few input parameters to model short-term vehicle behavior; 2) they suffer from the lack of available real-life data and high prediction accuracy; 3) in terms of application and multi-task learning, we have not seen any work in this domain to find the optimal clusters of tasks using gradient descent.

3 Data Representation

This section describes the two data sets used for the proposed vehicle behavior modeling approach: Logged Vehicle Data (LVD), which is low-resolution (infrequent) but high-dimensional (many features) data that is aggregated in a cumulative fashion; and Dynafleet data, consisting of high-resolution (frequent) but more low-dimensional (fewer features) data.


Fig. 1. Part of the data mining pipeline to determine the night stop class. First, the stops are extracted from the GPS trajectories. Then, the stop with the longest duration per day is selected. These stops are then clustered geographically. The class is then determined based on the number of clusters and how often they are visited.

Low Resolution Data – Logged Vehicle Data (LVD). The logged vehicle data (LVD) were collected from commercial trucks over 2019. The LVD holds the aggregated sensor information for a fleet of heavy-duty trucks operating worldwide. The values of the features are collected using telematics and through manual inspection when a vehicle visits an authorized workshop for repairs and service. In general, two types of features are logged in this dataset. The first type of feature shows the operational sensor values for the vehicle in the form of scalars and histograms during its operation. This data is continuously aggregated and includes several features such as oil temperature, fuel consumption, compressor usage, gears used, cargo load, etc. The scalars are commonly “life of vehicle” values, meaning they represent the cumulative value so far in the vehicle’s life, such as the total mileage, fuel consumption or engine time. The histograms represent the total time spent in a certain bin defined by the histogram axes. For instance, the histogram describing the engine speed and engine torque has bins that contain the total time spent in a certain interval in engine speed and engine torque. The readout frequency of these parameters will vary depending on importance, from everyday to occasional workshop visits (Fig. 1). High Resolution Data-Dynafleet Data. The Dynafleet database contains fleet management data related to the services provided by Dynafleet (Volvo Trucks) and Optifleet (Renault trucks). Depending on the services provided, the vehicles have tracking events logged with intervals of 1 to 10 min, and in connection with certain events such as turning the engine on or off. The tracking events include information such as GPS position, speed, odometer, and accumulated fuel consumption. Using this data, it is possible to estimate the average vehicle behaviors described below. The labels rely on positional data and can not be directly computed from the aggregated data in the LVD dataset.


– Geopattern: Classification according to the approximate size of the area traveled during the desired year. The extent of the geographical cloud of positional data during a year is used to determine the classes. – Longest stop: Classification according to the duration of the daily longest stop (typically the night stop). The classes are determined by segmenting the trucks into classes according to the yearly median of their daily longest stops. – Night stop class: Classification according to if the vehicle has a common home base or an irregular night stop pattern. The night stop locations are found by clustering the night stops. The number of resulting clusters needed to account for the majority of the stops determines the class. – Driver class: Classification according to the number of regular drivers of the vehicle (one main driver or several different ones), based on the total number of unique drivers and the distance driven by each driver. All of these labels are calculated by an internal Volvo Group framework used for vehicle utilization analysis from Dynafleet data. The labels have been used, and their precision has been verified in several projects. The framework can output labels for different time frames but only the yearly labels are used in this study.
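The night stop class described above is the product of a geographic clustering pipeline (Fig. 1). As a rough, hypothetical illustration of such a pipeline, the sketch below clusters the daily longest stops with DBSCAN and derives a class from how many clusters are needed to cover most nights; the distance threshold, minimum samples, and coverage ratio are invented for illustration and are not the framework's actual values.

```python
import numpy as np
from sklearn.cluster import DBSCAN

def night_stop_class(stop_lat_lon, eps_km=1.0, majority=0.8):
    """Cluster daily longest stops geographically and derive a night-stop class."""
    coords = np.radians(np.asarray(stop_lat_lon))        # (n_days, 2) lat/lon in degrees
    earth_radius_km = 6371.0
    labels = DBSCAN(eps=eps_km / earth_radius_km, min_samples=3,
                    metric="haversine").fit_predict(coords)
    clustered = labels[labels >= 0]
    if clustered.size == 0:
        return "irregular"
    # how many clusters are needed to account for the majority of the stops?
    counts = np.sort(np.bincount(clustered))[::-1]
    covered, needed = 0, 0
    for c in counts:
        covered += c
        needed += 1
        if covered >= majority * len(labels):
            break
    if covered < majority * len(labels):
        return "irregular"
    return "single home base" if needed == 1 else f"{needed} distant bases"
```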

4 Problem Formulation

This section presents the formulation determined to tackle vehicle behavior modeling. Our approach takes as input the low-resolution data (LVD), labeled with the behaviors obtained from high-resolution data (Geopattern, Longest stop, Night stop class, and Driver class), and employs multi-task deep neural networks. The conceptual view of the proposed approach is illustrated in Fig. 2. The intention of this vehicle behavior modeling and prediction investigation can be formulated as follows:
– The design of a deep neural multi-task learning approach is studied, formulating the task as a supervised learning problem in a multi-task fashion, to model and predict vehicle usage. Given the tasks representing vehicle behavior, the approach tries to find the optimal group of tasks to be trained together for increasing the predictive performance.

5 Proposed Methodology

5.1 Data Preparation

Data preparation is an integral part of any machine learning problem, and our multi-task learning practice in the automotive sector is no exception. Cleaning and transferring the histograms into the right format were done in this phase. However, the most essential step was to merge the LVD and the labels calculated from the Dynafleet data to create an integrated dossier with the usage and behavior designations. These two data sources are combined based on the vehicles' "Chassis id" and "Date of readout". The labels from the Dynafleet data are estimated yearly, and the LVD data are therefore also labeled yearly. The LVD readouts can be available weekly, which means that every vehicle has a maximum of 52 weeks of LVD data assigned the same labels.


Fig. 2. The conceptual view of the proposed task/vehicle behavior grouping for multitask learning in the automotive sector.

5.2 Multitask Network

Multi-task Learning (MTL) refers to the idea of learning several related tasks at the same time with the same algorithm in order to learn representations that are more general, lead to improved generalization, and possibly also faster learning [29]. In our specific case of modeling vehicle behaviors, the different behaviors/tasks are denoted τ_i and τ = {τ_1, τ_2, ..., τ_m} is the set of all tasks. The total loss vector is:

$$\min_\theta L(\theta) = \big(L_{\tau_1}(\theta), L_{\tau_2}(\theta), \ldots, L_{\tau_m}(\theta)\big) \quad (1)$$

where L_{τ_i}(θ) is the loss function of the i-th task (each task refers to a particular behavior mentioned in Sect. 1). The multi-task learner aims to perform joint learning of all tasks at the same time by utilizing the data points D = {x_j^i, y_j^i}_{j=1}^{n_i}. The models learn from D and take advantage of the shared layers θ_s to calculate a loss L(τ_i | θ_s, θ_i) for each task. Let us assume a multi-task loss function parameterized by {θ_s} ∪ {θ_i | τ_i ∈ τ}, where θ_s indicates the shared parameters in the shared layers and τ_i is task i. Thus, given a batch of samples X, the total loss function of the MTL for vehicle behavior prediction is calculated by Eq. 2:

$$L_{all}(X, \theta_s, \theta_{i=1,\ldots,m}) = \sum_{i=1}^{m} L(\tau_i; \theta_s, \theta_i) \quad (2)$$
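To make the shared/task-specific split of Eq. (2) concrete, the following is a minimal PyTorch sketch of a multi-task network with a shared trunk and one head per behavior. Layer sizes, the number of classes per task, and the loss choice are placeholders, not the architecture used in the paper.

```python
import torch
import torch.nn as nn

class MultiTaskNet(nn.Module):
    """Shared trunk (theta_s) feeding one specific head (theta_i) per task."""
    def __init__(self, in_dim, hidden=128, task_classes=(3, 3, 3, 3)):
        super().__init__()
        self.shared = nn.Sequential(nn.Linear(in_dim, hidden), nn.ReLU(),
                                    nn.Linear(hidden, hidden), nn.ReLU())
        self.heads = nn.ModuleList([nn.Linear(hidden, c) for c in task_classes])

    def forward(self, x):
        z = self.shared(x)
        return [head(z) for head in self.heads]

def total_loss(model, x, targets, criterion=nn.CrossEntropyLoss()):
    # L_all = sum_i L(tau_i; theta_s, theta_i) as in Eq. (2)
    outputs = model(x)
    return sum(criterion(o, t) for o, t in zip(outputs, targets))
```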

5.3 Training Multi-task Network

Given the shared parameters θ_s and the set of tasks defined in τ, we aim to find the clusters of tasks that are more optimal to be trained together. In other words, we aim to train the network such that the gradients of similar tasks, i.e., tasks in the same cluster, point in similar directions.

$$v_{\tau_i}^{t} \leftarrow \nabla_{\theta_{s_L}^{t}} L_{\tau_i}(X^t, \theta_s^t, \theta_{\tau_i}^t), \quad \forall \tau_i \in \tau \quad (3)$$

In Eq. 3, L_{τ_i}(X^t, θ_s^t, θ_{τ_i}^t) indicates the loss function of the individual task given the input X and the shared parameters θ_s^t at the macro level t, ∇L_{τ_i}(·) is the gradient of task τ_i with respect to the last shared layer θ_{s_L}^t, and v_{τ_i}^t refers to the gradient vector stored at each epoch. Since the magnitude of the gradient of each task might differ over the training process, Eq. 4 is used to find a unit with which to normalize the gradient of each task:

$$u_{\tau_i} = \max\big(\|v_{\tau_i}^{1}\|, \ldots, \|v_{\tau_i}^{T}\|\big), \quad i = 1, 2, \ldots, m \quad (4)$$

$$vn_{\tau_i}^{t} \leftarrow \frac{v_{\tau_i}^{t}}{u_{\tau_i}}, \quad i = 1, 2, \ldots, m \quad (5)$$

In order to calculate the similarity gs_{i,j} between the normalized gradients of two tasks i and j, Eq. 6 is used:

$$gs_{i,j} = \sum_{k=1,\; i,j \in \tau} \big(vn_{\tau_i}^{t} \cdot vn_{\tau_j}^{t} \cdot \cos(\theta)\big) \cdot W_c \quad (6)$$

$$W_c = \frac{t}{T}, \quad t = 1, 2, \ldots, T \quad (7)$$

In Eq. 6, θ is the angle between the two gradient vectors, and W_c is an importance weight used over the training process to give more weight to the inner-join values of the later epochs. The inner-join value thus indicates whether two or more gradient vectors point in the same direction when they are combined.
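The grouping signal of Eqs. (3)-(7) can be sketched as follows: per-task gradients are taken with respect to the last shared layer, normalized by each task's maximum gradient norm over training, and accumulated as epoch-weighted dot products. This is a simplified sketch of the described procedure, with function names and the gradient bookkeeping invented for illustration.

```python
import torch

def task_gradient_vectors(losses, shared_last_layer):
    """Eq. (3) sketch: gradient of each task loss w.r.t. the last shared layer."""
    grads = []
    params = list(shared_last_layer.parameters())
    for loss in losses:                                   # scalar losses from one forward pass
        g = torch.autograd.grad(loss, params, retain_graph=True)
        grads.append(torch.cat([gi.flatten() for gi in g]))
    return grads

def pairwise_similarity(grad_history, T):
    """Eqs. (4)-(7) sketch: grad_history[t][i] is task i's gradient at epoch t."""
    m = len(grad_history[0])
    units = [max(grad_history[t][i].norm() for t in range(T)) for i in range(m)]  # Eq. (4)
    sims = torch.zeros(m, m)
    for t in range(T):
        w_c = (t + 1) / T                                 # later epochs count more (Eq. 7)
        normed = [grad_history[t][i] / units[i] for i in range(m)]                # Eq. (5)
        for i in range(m):
            for j in range(m):
                sims[i, j] += w_c * torch.dot(normed[i], normed[j])               # Eq. (6)
    return sims
```

Positive accumulated entries suggest that the two tasks pull the shared layer in compatible directions and are candidates for being grouped.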

5.4 Individual Task Transference Quantification into Group

To gain insights into the impact of the tasks on each other as well as on a group of tasks during the training process, we adapt the inter-task affinity to vehicle behavior impact, as introduced in our previous study [10]. We define the quantity θ_{s|i}^{t+1} to indicate the model with the shared parameters updated towards task τ_i:

$$\theta_{s|i}^{t+1} = \theta_s^{t} - \zeta \nabla_{\theta_s^{t}} L_{\tau_i}(X^t, \theta_s^t, \theta_{\tau_i}^t) \quad (8)$$

Using θ_{s|i}^{t+1} in Eq. 8, we can measure the impact of task τ_i on the performance of the other tasks defined in τ = {τ_1, τ_2, ..., τ_m}. Thus, given the input X^t ∈ D, we can measure the loss of each task by taking the updated shared parameters θ_s as well as the task-specific parameters θ_i. Indeed, we assess the impact IM of the gradient update of task τ_i on a given set of tasks S_τ, which in our case is chosen to be a tuple or a triple. We then compare the ratio between the average loss value of the tasks in the set S_τ before and after conducting the gradient update from task τ_i towards the shared parameters as follows:

$$IM_{\tau_i \to L_{S_\tau}} = 1 - \frac{L_{S_\tau}\big(X^t, \theta_{s|i}^{t+1}, \theta_{S_\tau}\big)}{L_{S_\tau}\big(X^t, \theta_s^{t}, \theta_{S_\tau}\big)} \cdot W_c \quad (9)$$

$$W_c = \frac{t}{T}, \quad t = 1, 2, \ldots, T \quad (10)$$

where θ_{S_τ} represents the parameters of the tasks in the given set. Thus, we interpret IM_{τ_i → L_{S_τ}} as a measure of transference from the meta-train task τ_i to the tasks in S_τ. A positive value of IM_{τ_i → L_{S_τ}} shows that the update of the shared parameters θ_s led to a lower loss value on the set of tasks with respect to the original parameters; this expresses the positive effect of task i on the generalization of the predictive model over the set of tasks S_τ, while a negative value of IM_{τ_i → L_{S_τ}} describes the destructive impact of task i on that set over the training process. We adapted the overall inter-task affinity measure to the group of tasks by incrementally adding the weight defined in Eq. 10 at each iteration over the training process, where t refers to the current epoch number and T is the maximum number of epochs for which the multi-task network is iterated. This is because the weights are randomly generated at the beginning of training, so the earlier losses are not expected to have the same impact with respect to the parameters at the end of training. Thus, in Eq. 11 we calculate the transference over all epochs from task i to the group of tasks:

$$\hat{IM}_{\tau_i \to L_{S_\tau}} = \frac{1}{T} \sum_{e=1}^{T} IM_{\tau_i \to L_{S_\tau}}^{e} \quad (11)$$

IMτi →LSτ can be employed in different levels of granularity, such as perepoch level or even micro-level/batch level. In this study, we measure the transference at the epoch level, where T refers to the number of epochs. Given Eq. 9, we calculate the effect of each task on a group of tasks to quantify the behavior of each individual task in a group. This basically reflects the hypothesis that each task might have different contributions when they are coupled with other tasks.
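A minimal sketch of the lookahead measurement in Eqs. (8)-(9) is given below, assuming a model object with `shared` and `heads` attributes as in the earlier multi-task sketch; the learning rate and the omission of the W_c weighting are simplifications.

```python
import copy
import torch

def inter_task_affinity(model, x, targets, task_i, task_group, loss_fns, lr=1e-3):
    """Lookahead step on shared params using task_i, then compare the group loss."""
    def group_loss(m):
        outs = m(x)
        return sum(loss_fns[j](outs[j], targets[j]) for j in task_group) / len(task_group)

    before = group_loss(model)

    lookahead = copy.deepcopy(model)                         # holds theta_{s|i}^{t+1}
    loss_i = loss_fns[task_i](lookahead(x)[task_i], targets[task_i])
    grads = torch.autograd.grad(loss_i, list(lookahead.shared.parameters()))
    with torch.no_grad():
        for p, g in zip(lookahead.shared.parameters(), grads):
            p -= lr * g                                      # Eq. (8)

    after = group_loss(lookahead)
    return (1 - after / before).item()                       # Eq. (9) without W_c
```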

6 Experimental Evaluation and Results

To facilitate the implementation of the approach, we recall the two research questions, introduced in Sect. 1, on which we based the evaluation of the proposed approach as follows: RQ1) To what extent could the task of vehicle behaviors be grouped optimally? and RQ2) To what extent could the vehicle behavior transference be quantified to find the contribution of each task in different groups and on different groups?


Fig. 3. The individual tasks loss values when they are trained all together with a shared and specific layer.

6.1 RQ1: Vehicle Task Grouping Results

Before directly answering the research questions, we formulate the prediction task as a multi-task learning problem in which, in one single training process, we simultaneously train all the tasks with their shared and task-specific parameters using Eq. 2. This implementation was done to understand to what extent, as a baseline, a multi-task deep neural network can handle the complex problem of mapping low-resolution data to vehicle behaviors obtained from high-resolution data. Figure 3 shows the results of applying Eq. 2 when all four tasks are considered and trained together. To get a reliable result, we iterated the training phase 5 times and report the overall average. Within the reported values, Task 3 performed very well, with a loss value close to zero (0.0027). Considering the complexity of modeling vehicle long-term behavior with low-resolution data, we obtained relatively poor results for the other tasks, with Task1 = 0.153, Task2 = 0.189, and Task4 = 0.210. Taking all tasks together, we reach an average loss value of 0.139, which leads us to the conclusion that the predictive model needs to be improved to lower the errors. Thus, we consider these numbers as a baseline for the rest of this study and attempt to increase the performance (reduce the loss value) by finding the optimal clusters of tasks that should be trained together, extracted from the gradient descent in the shared layers. To answer RQ1, we considered only the shared layers and used Eq. 3 to calculate the gradient descent of each task over the training process. Since we might face gradients with highly different magnitudes, Eq. 4 is used to find a unit that we inject into Eq. 5 to normalize the gradient vectors for all tasks.

Indeed, to find the unit, we extracted all the gradient vectors and then exploited Eq. 4 to calculate the unit. Given the four tasks (m = 4), we obtain 2^m − 1 − m = 11 different combinations/groups or clusters of tasks (note: the second m refers to the groups containing only one task, which are therefore removed from the clusters of tasks). Once the normalization is done, we use the vector addition (Va) defined in Eq. 12 to intuitively show the gradient direction for each combination at macro level t:

$$Va = \sum_{i=1}^{m} \nabla(L_{\tau_i})^{t}, \quad t = 1, 2, \ldots, T \quad (12)$$

Fig. 4. Gradient and cross product of all combinations.

Figure 4 shows all eleven combinations at one epoch. It is clear that for some combinations the vector addition of the tasks points in the same or similar directions, while other combinations point in different directions. However, these plots are constructed from one epoch, and the picture might differ in later epochs due to the training process. The green arrows in all plots demonstrate the vector addition of two or more gradient vectors. For example, the gradients of Task2 and Task3 point in the same direction, and the Va of the vectors also heads in the same direction; this means that training these two tasks together will probably lead to a lower loss value than training Task2 and Task4 together (see Fig. 4j). Considering the gradient descent in each epoch, the most critical and challenging situation is when two or more task gradients point in different directions. For instance, the gradients of Task1 and Task2 illustrated in Fig. 4d are opposite, and their Va is inclined towards the direction of Task2, which might be due to the higher magnitude of Task2. To better understand which combinations might perform better over the whole training process, we utilized Eq. 6 to calculate the dot product of the gradients. Table 1 shows the dot-product (inner-join) values of these combinations in 11 groups. Positive values express that the gradients of the tasks in the combination point in more or less similar directions when all epochs over the training process are considered; negative values demonstrate that training those tasks in the same group is not optimal when the overall loss value is concerned. The figures reported in Table 1 show that G1-G7 are the optimal groups, providing positive values, while the dot products of the gradients of groups G8 to G11 indicate that those groups are not optimal to train together. To assess this hypothesis and evaluate the optimal clusters obtained based on the gradients, we employed the network constructed by Eq. 2 with multiple combinations covering all clusters of tasks. Table 2 shows the results of this formulation, where we came up with 11 varieties or groups of tasks. We trained only the tasks in each group and froze the other tasks. Each multi-task combination was trained 5 times to get the overall average; in all experiments, 60% of the LVD data was used to train the networks, 30% was used as a validation set, and 10% was held out for testing the multi-task network. The loss is used to evaluate the network performance, and the combinations are then sorted in ascending order based on their average loss values.


and G2 (T2&T3) have the lowest loss values, 0.0008 and 0.01, respectively. Concerning the computational time reported in the table, the clusters with more tasks took relatively more time than those with fewer tasks. Comparing the numbers reported in Table 1, where the groups are sorted by dot-product value (from positive to negative), with the figures in Table 2, where the groups are sorted by overall loss value, we notice that most groups appear in a similar order, with only minor differences, which shows that the proposed approach is promising. However, it is fair to remark that G2, which is placed second in terms of the dot product, is located in 9th place when the loss value is considered (see Table 2).

Table 1. The dot product of the gradients of the tasks calculated at the shared layers. A more positive value indicates more similar gradients, while a negative value indicates that the vectors point in different directions.

Groups | Tasks Combinations | Inner Join Score
G1 | T2&T3 | 0.005606
G2 | T2&T4 | 0.003383
G3 | T1&T3 | 0.003016
G4 | T1&T2&T3 | 0.00145
G5 | T1&T3&T4 | 0.00063
G6 | T1&T4 | 0.000168
G7 | T2&T3&T4 | 0.00014
G8 | T1&T2&T3&T4 | 9.25568e−05
G9 | T1&T2 | −0.000587
G10 | T1&T2&T4 | −0.00081
G11 | T3&T4 | −0.00153
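The inner-join scores in Table 1 can be reproduced, in spirit, by accumulating the pairwise dot products of the unit task gradients over the training epochs and ranking the candidate groups by the resulting score. The snippet below is a hedged Python sketch of that bookkeeping; the data structures and names are illustrative assumptions rather than the paper's code.

from itertools import combinations

import torch

def score_groups(per_epoch_unit_grads, n_tasks):
    """per_epoch_unit_grads: list over epochs of lists of unit gradient vectors (one per task).
    Returns each multi-task group with the sum of pairwise dot products accumulated over epochs."""
    scores = {}
    for size in range(2, n_tasks + 1):
        for group in combinations(range(n_tasks), size):
            total = 0.0
            for grads in per_epoch_unit_grads:
                for i, j in combinations(group, 2):
                    total += torch.dot(grads[i], grads[j]).item()
            scores[group] = total
    # Higher (more positive) scores suggest the tasks pull the shared layer in similar directions.
    return dict(sorted(scores.items(), key=lambda kv: kv[1], reverse=True))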

Concerning individual task improvement under different group trainings, the numbers reported in Table 2 show how dramatically the loss values decrease when the tasks are trained in different groups. In Fig. 5, we illustrate an A/B comparison between the individual task loss values obtained in different groups and the baseline loss values in G10. For instance, the loss value of T2 dropped from 0.189 in G10 to 0.025 in G2, i.e., to roughly 13.2% of its baseline value. A similar improvement was achieved for T3 in G2 and for T1 and T4 in G7.


Table 2. All combinations of the tasks with their individual and average loss values. * means the particular task was not included in the group and consequently in the training process.

Groups | Task Combinations Trained Together | T1 | T2 | T3 | T4 | AVG Tasks Loss | Time Spent
G1 | T1&T3 | 0.016 ± 0.002 | * | 0.0008 ± 3.01e−5 | * | 0.00862 | 40.006
G2 | T2&T3 | * | 0.0255 ± 0.0031 | 0.0007 ± 7.63e−5 | * | 0.01315 | 41.38
G3 | T3&T4 | * | * | 0.00138 ± 0.0001 | 0.0251 ± 0.0029 | 0.0132 | 39.84
G4 | T1&T2&T3 | 0.0548 ± 0.019 | 0.0617 ± 0.0157 | 0.0009 ± 0.0002 | * | 0.039 | 45.99
G5 | T1&T3&T4 | 0.0671 ± 0.014 | * | 0.0019 ± 2.196e−05 | 0.081 ± 0.013 | 0.050 | 44.57
G6 | T2&T3&T4 | * | 0.079 ± 0.008 | 0.001 ± 0.0002 | 0.078 ± 0.012 | 0.053 | 45.33
G7 | T1&T4 | 0.060 ± 0.007 | * | * | 0.058 ± 0.01 | 0.059 | 39.90
G8 | T1&T2 | 0.085 ± 0.005 | 0.086 ± 0.014 | * | * | 0.085 | 39.98
G9 | T2&T4 | * | 0.111 ± 0.005 | * | 0.093 ± 0.019 | 0.10 | 41.85
G10 | T1&T2&T3&T4 | 0.153 ± 0.029 | 0.189 ± 0.028 | 0.0027 ± 0.0005 | 0.210 ± 0.036 | 0.139 | 45.62
G11 | T1&T2&T4 | 0.192 ± 0.008 | 0.284 ± 0.0132 | * | 0.373 ± 0.020 | 0.283 | 41.67

Fig. 5. Individual task improvement in different groups. The loss value of each task in its various groups is compared against its loss value in Group 10, used as the baseline, where all tasks are trained together.

6.2 RQ2: Vehicle Behavior Transference Quantification Results

To answer RQ2, we defined two sub-research questions and carried out two kinds of analyses. SbQ2.1) Given the multi-task network, we first aim to quantify the behavior of each individual task within its group by utilizing the task gradients via Eq. 6. This is because the behavior or effect of a task might differ across groups, and one task could contribute positively or negatively to its group's performance in multi-task training. To quantify the contribution, we took G4, G5, G7, G8, and G10 from all combinations (Table 1), since these groups contain more than two tasks. Considering G4, we performed three analyses: each time we took one task out, e.g., T3, measured the inner-join value, and compared it with the inner-join value of the remaining two tasks (T1 and T2). We repeated the same analysis for the other tasks in the same group and for the other groups as well. Table 3 reports the figures obtained for this task-contribution analysis. We observed that different tasks contribute differently in different groups. For instance, Task1 has a negative contribution of −0.0041 in G4 (highlighted in orange), while it has a positive impact of +0.0021 in G5 (highlighted in green). It can be seen


from the table that the contributions of the tasks in G10 are all negative, which is the reason G10 performed worst compared to the other groups, both in the dot-product values shown in Table 1 and in the loss evaluation presented in Table 2. We can also observe that Task3 plays a positive role in most groups, such as G4, G5, and G8. However, Task3 has a negative contribution in G7, which highlights that even a task that performs well in most cases can act negatively in a particular group. This underlines the importance of finding the optimal group of tasks to be trained together. It should be noted that such a negative contribution in one group does not mean that the performance of that group is considerably poor; rather, it suggests that we could perform better by finding an optimal combination.

Table 3. The contribution of each task in group training.

Groups | Tasks Combinations (TC) | T1 | T2 | T3 | T4
G4 | T1&T2&T3 | -0.0041 | -0.0015 | +0.002 | -
G5 | T1&T3&T4 | +0.0021 | - | +0.0004 | -0.0023
G7 | T2&T3&T4 | - | +0.001 | -0.001 | -0.004
G8 | T1&T2&T3&T4 | -0.00013 | -0.0006 | +0.00081 | -0.001
G10 | T1&T2&T4 | -0.004 | -0.0009 | - | -0.001
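The leave-one-out contributions in Table 3 appear to be the difference between a group's inner-join score and the score of the group with the task removed; for example, 0.00145 (T1&T2&T3) minus 0.005606 (T2&T3) gives roughly the −0.0041 reported for Task1 in G4. A minimal sketch of that computation, assuming the dot-product scores of Table 1 are available, is given below; the function name is hypothetical.

def task_contribution(group, task, group_score):
    """Contribution of `task` to `group`: score with the task minus score without it.
    `group_score` maps a tuple of task indices to its accumulated dot-product score."""
    if task not in group or len(group) <= 2:
        return None  # only defined for groups containing more than two tasks
    reduced = tuple(t for t in group if t != task)
    return group_score[group] - group_score[reduced]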

In the second evaluation (SbQ2.2), we aim to understand how an individual task transfers knowledge onto its group of tasks over the training process, utilizing both the shared and the task-specific layers. To accomplish this, we utilized Eq. 9, which we introduced in [10], and adapted it to the equation below.

$$IM_{\tau_i \to L_{S_\tau}} = \frac{\operatorname{avg}\big(L_{S_\tau}(X^{t}_{val}, X^{t}_{tr}, \theta^{t+1}_{s|i}, \theta_{S_\tau})\big) - \operatorname{avg}\big(L_{\tau_j,\tau_k}(X^{t}_{val}, X^{t}_{tr}, \theta^{t}_{s}, \theta_{S_\tau})\big)}{\operatorname{avg}\big(L_{S_\tau}(X^{t}_{val}, X^{t}_{tr}, \theta^{t}_{s}, \theta_{S_\tau})\big)} \cdot W_c \qquad (13)$$


where $X^{t}_{tr}$ and $X^{t}_{val}$ are the independent vehicle-usage predictors used for training and validation, respectively. $\operatorname{avg}(L_{S_\tau}(X^{t}_{val}, X^{t}_{tr}, \theta^{t+1}_{s|i}, \theta_{S_\tau}))$ refers to the average loss value of the tasks in the set $S_\tau$ after the network was updated and trained with the shared parameters for the specific task $\tau_i$ ($\theta^{t+1}_{s|i}$), and $\operatorname{avg}(L_{S_\tau}(X^{t}_{val}, X^{t}_{tr}, \theta^{t}_{s}, \theta_{S_\tau}))$ points to the average loss value before the update. This intuitively reveals to what extent one task can transfer positive or negative knowledge onto the performance of the other tasks in the group. Note that the main question of transferability here concerns the modeling aspect, not the effect of different data representations; different forms of data representation could have various consequences on the multi-task learning results.
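Read literally, Eq. 13 is a relative loss change scaled by a weighting constant. The following one-function sketch mirrors that reading; the argument names are illustrative and the sign convention follows the equation as printed.

def inter_task_transference(avg_loss_updated, avg_loss_before, w_c=1.0):
    """Eq. 13: relative change of the group's average loss after the shared parameters are
    updated with a single task's gradient, scaled by a weighting constant w_c."""
    return (avg_loss_updated - avg_loss_before) / avg_loss_before * w_c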

Fig. 6. The overall inter-task transference of individual tasks onto groups of tasks. In these two plots, we can observe the impact of each task on a group, obtained by Eq. 9.

Figure 6 shows how an individual task can impact the performance of a group of tasks. In this experiment, we evaluate the impact of individual tasks on 10 different groups (we ignore G10, since this group contains all the tasks, so the impact of a single task on it cannot be quantified). The figures illustrate that, overall, all tasks positively impact the groups of tasks, with Task1 = 0.635, Task2 = 0.65, Task3 = 0.58, and Task4 = 0.62. In contrast with the previous analysis, in this experiment we obtained a smaller contribution of Task3 to the


performance of the other groups compared to the other tasks. The negative contributions of Task1 and Task2 within their own groups (G4, G8, and G10) turned positive when quantifying their impact on the performance of the other groups. The numbers show that Task2 transferred more positive knowledge (by more than 2%) to the other groups than the other tasks did. This suggests that a task may contribute negatively to its own group and yet positively to other groups.

7 Discussion and Conclusion

This study presents a multi-task deep neural network approach for finding the optimal group of tasks and, ultimately, quantifying vehicle behavior transference in multi-task settings. The experimental evaluation on real logged data from thousands of heavy-duty Volvo trucks shows that the proposed approach can find the optimal groups of tasks that should be trained together, leading to better performance. In addition, we could quantify the knowledge transferred within and onto the groups of tasks. Considering the first objective (RQ1), the figures obtained in task grouping show a significant difference between the performance of the groups found by the proposed gradient-based approach and the baseline (when all tasks were trained together). The figures obtained in the individual-task comparison show the potential of finding the optimal group and building a general multi-task forecasting model for this complex behavioral problem, and describe how the optimal groups can reduce the overall loss values of multi-task training. Taking the figures obtained in this experiment into account, we can therefore state that the proposed gradient-based approach provides equal and, in most cases, better groups of tasks, leading to better predictive models than training all tasks together in the multi-task formulation. Considering the second objective (RQ2), the inter-task transference evaluations explain how the individual tasks contributed to the performance of their group over the training process. Assessing the overall impact and the results on each group of tasks shown in the bar plots and the table, it is clear that, given the shared model, the individual tasks have different positive and negative contributions within the task grouping, and, given the specific layers, tasks have a constructive impact on the performance of all groups in the multi-task formulation. The findings of this work also indicate limitations of the proposed approach, which present new directions for future investigation. In this study, we only focused on the impact of one individual task on group performance. To understand which group of tasks most influences the behavior of the other tasks in vehicle behavior modeling, we need to investigate quantifying the impact of a cluster of tasks on each task. The second limitation pertains to the generality assessment. In this study, we applied the approach in the automotive domain to find the optimal cluster of tasks that should be trained together for vehicle behavior modeling. Further investigation is needed to apply the approach to datasets with multiple target values to evaluate how it performs in other contexts. An


interesting example would be computer vision, in particular scene understanding over time, where multiple objects in the scene need to be recognized and, more importantly, the impact of a group of objects can be quantified to understand the scene better.

References
1. Alizadeh, M., Rahimi, S., Ma, J.: A hybrid ARIMA-WNN approach to model vehicle operating behavior and detect unhealthy states. Expert Syst. Appl. 116515 (2022)
2. Badue, C., et al.: Self-driving cars: a survey. Expert Syst. Appl. 165, 113816 (2021)
3. Bousonville, T., Dirichs, M., Krüger, T.: Estimating truck fuel consumption with machine learning using telematics, topology and weather data. In: 2019 International Conference on Industrial Engineering and Systems Management (IESM), pp. 1–6. IEEE (2019)
4. Chen, J., Wang, S., He, E., Wang, H., Wang, L.: Two-dimensional phase lag index image representation of electroencephalography for automated recognition of driver fatigue using convolutional neural network. Expert Syst. Appl. 191, 116339 (2022)
5. Choi, E., Kim, E.: Critical aggressive acceleration values and models for fuel consumption when starting and driving a passenger car running on LPG. Int. J. Sustain. Transp. 11(6), 395–405 (2017)
6. Chowdhuri, S., Pankaj, T., Zipser, K.: MultiNet: multi-modal multi-task learning for autonomous driving. In: 2019 IEEE Winter Conference on Applications of Computer Vision (WACV), pp. 1496–1504. IEEE (2019)
7. Chowdhuri, S., Pankaj, T., Zipser, K.: MultiNet: multi-modal multi-task learning for autonomous driving. In: 2019 IEEE Winter Conference on Applications of Computer Vision (WACV), pp. 1496–1504 (2019). https://doi.org/10.1109/WACV.2019.00164
8. Fifty, C., Amid, E., Zhao, Z., Yu, T., Anil, R., Finn, C.: Efficiently identifying task groupings for multi-task learning. In: Advances in Neural Information Processing Systems, vol. 34 (2021)
9. Khoshkangini, R., Kalia, N.R., Ashwathanarayana, S., Orand, A., Maktobian, J., Tajgardan, M.: Vehicle usage extraction using unsupervised ensemble approach. In: Arai, K. (ed.) IntelliSys 2022, vol. 542, pp. 588–604. Springer, Cham (2023). https://doi.org/10.1007/978-3-031-16072-1_43
10. Khoshkangini, R., Mashhadi, P., Tegnered, D., Lundström, J., Rögnvaldsson, T.: Predicting vehicle behavior using multi-task ensemble learning. Expert Syst. Appl. 212, 118716 (2023). https://doi.org/10.1016/j.eswa.2022.118716
11. Lattanzi, E., Freschi, V.: Machine learning techniques to identify unsafe driving behavior by means of in-vehicle sensor data. Expert Syst. Appl. 176, 114818 (2021)
12. Li, Z., Gong, J., Lu, C., Yi, Y.: Interactive behavior prediction for heterogeneous traffic participants in the urban road: a graph-neural-network-based multitask learning framework. IEEE/ASME Trans. Mechatron. 26(3), 1339–1349 (2021)
13. Lin, N., Zong, C., Tomizuka, M., Song, P., Zhang, Z., Li, G.: An overview on study of identification of driver behavior characteristics for automotive control. Math. Probl. Eng. 2014 (2014)
14. Liu, P., Kurt, A., Özgüner, Ü.: Trajectory prediction of a lane changing vehicle based on driver behavior estimation and classification. In: 17th International IEEE Conference on Intelligent Transportation Systems (ITSC), pp. 942–947. IEEE (2014)


15. Marina Martinez, C., Heucke, M., Wang, F.Y., Gao, B., Cao, D.: Driving style recognition for intelligent vehicle control and advanced driver assistance: a survey. IEEE Trans. Intell. Transp. Syst. 19(3), 666–676 (2018). https://doi.org/10.1109/TITS.2017.2706978
16. Miyajima, C., Takeda, K.: Driver-behavior modeling using on-road driving data: a new application for behavior signal processing. IEEE Signal Process. Mag. 33(6), 14–21 (2016). https://doi.org/10.1109/MSP.2016.2602377
17. Mondal, S., Gupta, A.: Evaluation of driver acceleration/deceleration behavior at signalized intersections using vehicle trajectory data. Transp. Lett. 15, 350–362 (2022)
18. Pentland, A., Liu, A.: Modeling and prediction of human behavior. Neural Comput. 11(1), 229–242 (1999)
19. Powell, S., Cezar, G.V., Rajagopal, R.: Scalable probabilistic estimates of electric vehicle charging given observed driver behavior. Appl. Energy 309, 118382 (2022)
20. Prakash, S., Bodisco, T.A.: An investigation into the effect of road gradient and driving style on NOX emissions from a diesel vehicle driven on urban roads. Transp. Res. Part D: Transp. Environ. 72, 220–231 (2019)
21. Shahverdy, M., Fathy, M., Berangi, R., Sabokrou, M.: Driver behavior detection and classification using deep convolutional neural networks. Expert Syst. Appl. 149, 113240 (2020)
22. Wang, Z., et al.: Driver behavior modeling using game engine and real vehicle: a learning-based approach. IEEE Trans. Intell. Veh. 5(4), 738–749 (2020). https://doi.org/10.1109/TIV.2020.2991948
23. Xie, J., Hu, K., Li, G., Guo, Y.: CNN-based driving maneuver classification using multi-sliding window fusion. Expert Syst. Appl. 169, 114442 (2021). https://doi.org/10.1016/j.eswa.2020.114442
24. Xing, Y., Lv, C., Cao, D., Velenis, E.: A unified multi-scale and multi-task learning framework for driver behaviors reasoning. arXiv preprint arXiv:2003.08026 (2020)
25. Xu, Y., Zheng, Y., Yang, Y.: On the movement simulations of electric vehicles: a behavioral model-based approach. Appl. Energy 283, 116356 (2021). https://doi.org/10.1016/j.apenergy.2020.116356
26. Xu, Z., Wei, T., Easa, S., Zhao, X., Qu, X.: Modeling relationship between truck fuel consumption and driving behavior using data from internet of vehicles. Comput.-Aided Civil Infrastruct. Eng. 33(3), 209–219 (2018). https://doi.org/10.1111/mice.12344
27. Xun, Y., Liu, J., Shi, Z.: Multitask learning assisted driver identity authentication and driving behavior evaluation. IEEE Trans. Industr. Inf. 17(10), 7093–7102 (2020)
28. Yao, W., Zhao, H., Davoine, F., Zha, H.: Learning lane change trajectories from on-road driving data. In: 2012 IEEE Intelligent Vehicles Symposium, pp. 885–890. IEEE (2012)
29. Zhang, Y., Yang, Q.: A survey on multi-task learning. IEEE Trans. Knowl. Data Eng. 34, 5586–5609 (2021)

Effective Guidance in Zero-Shot Multilingual Translation via Multiple Language Prototypes

Yafang Zheng1,2, Lei Lin1,2, Yuxuan Yuan1,2, and Xiaodong Shi1,2(B)

1 Department of Artificial Intelligence, School of Informatics, Xiamen University, Xiamen, China
{zhengyafang,linlei}@stu.xmu.edu.cn
2 Key Laboratory of Digital Protection and Intelligent Processing of Intangible Cultural Heritage of Fujian and Taiwan, Ministry of Culture and Tourism, Xiamen, China
[email protected]

Abstract. In a multilingual neural machine translation model that fully shares parameters across all languages, a popular approach is to use an artificial language token to guide translation into the desired target language. However, recent studies have shown that the language-specific signals in prepended language tokens are not adequate to guide the MNMT models to translate into the right directions, especially for zero-shot translation (i.e., the off-target translation issue). We argue that the representations of prepended language tokens are overly affected by their context information, resulting in potential information loss of the language tokens and insufficient indicative ability. To address this issue, we introduce multiple language prototypes to guide translation into the desired target language. Specifically, we categorize sparse contextualized language representations into a few representative prototypes over the training set, and inject their representations into each individual token to guide the models. Experiments on several multilingual datasets show that our method significantly alleviates the off-target translation issue and improves the translation quality on both zero-shot and supervised directions.

Keywords: Zero-Shot Multilingual Machine Translation · Off-Target Issue · Language Tag Strategy

1 Introduction

Unlike traditional neural machine translation (NMT) models that focus on specific language pairs, the many-to-many multilingual neural machine translation (MNMT) models aim to translate between multiple source and target languages using a single model [1,4,7,10,12]. Parameter sharing across different languages


Table 1. Illustration of the analysis of the off-target translation issue with German → Chinese zero-shot translations using a multilingual NMT model with prepended tokens in the source sentences. The baseline multilingual NMT model fails to maintain the Chinese translation when introducing lexical variations to the source sentence, resulting in English.

Source | __zh__ Das ist ein ernstzunehmendes, klinisches Problem.
Reference | 这可是个严重的临床问题。
Hypothesis | 这是个严肃的临床问题。
Source | __zh__ Es handelt sich um ein ernstzunehmendes, klinisches Problem.
Reference | 这是一个严重的临床问题。
Hypothesis | It's about a 重大、临床问题。

in the MNMT model makes it benefit from the transferring ability of intermediate representations among languages, thereby achieving better translation quality between low-resource and even zero-resource language directions [2,3,8,19] than bilingual models. Since it can greatly reduce the MT system's deployment cost and benefit low-resource languages, MNMT has been gaining increasing attention. One line of research on MNMT focuses on partial parameter-sharing models with language-specific components such as separate encoders, separate decoders, or separate cross-attention networks [7,13,22]. However, partial sharing faces the challenge of a rapid increase in the number of parameters as the number of languages grows. Johnson et al. [12] propose to train fully shared models for MNMT with a prepended token to guide the translation direction. Despite their deployment efficiency and the transferring ability of multilingual modeling, fully shared models still suffer from the off-target translation issue [8,27], where a model translates into a wrong language. Specifically, Zhang et al. [27] identify this issue as the major source of inferior zero-shot performance. Since then, numerous researchers have been paying attention to solving the off-target problem from different perspectives. Some of them [8,27] attribute this issue to the lack of zero-shot directional data, Jin and Xiong [11] suggest that the efficacy of the language tag diminishes as the translation information propagates through deeper layers, while Chen et al. [6] tackle the problem by considering the aspect of dictionary sharing. In this work, we conduct a comprehensive analysis of the off-target issue from a new perspective. A consensus is that the prepended language tokens help models distinguish the target language that should be translated into, playing a significant role in language-specific knowledge learning. However, we argue that the prepended tokens in sentence pairs in fully shared models are influenced


by their context information (the content of the sentence pairs), potentially hindering their ability to guide the MNMT models in translating into the right languages. We illustrate this issue with a specific example, as shown in Table 1, where the initial German source sentence "Das ist ein ernstzunehmendes, klinisches Problem." ("It is a serious clinical problem." in English) and its corresponding Chinese reference are taken from the TED-59 zero-shot test sets [11]. In our experiment, we introduce slight lexical variations to the sentence while preserving its underlying semantics. However, when employing the baseline model, which is trained with a special token (__zh__) prepended to the source sentence, instead of producing the expected output in Chinese, the model generates several unexpected English words. The same phenomenon occurs when the prepended token is added to the target sentence. We hypothesize that this phenomenon is due to the entanglement of the indicative information (prepended language tokens) and the translation information (content of sentence pairs) after encoding. As a result, the representations of prepended language tokens are overly affected by their context information, which hinders the indicative function of the prepended tokens, leading to the off-target problem. Building upon this, an intuitive way to solve it is to alleviate the excessive reliance of language representations on context information and to enhance their indicative function by injecting the indicative representations into each individual token in the sentence. Given that the representations of language tags encode typological properties of languages, we categorize sparse contextualized language representations into a few representative prototypes over the training instances, and make use of them to enrich the indicative representations that guide the models. To be specific, we propose a two-stage approach. In the first stage, we initialize a Transformer model, which is trained with prepended tokens added to the source sentences or target sentences. This initialization enables the model to generate reasonable language-indicative representations, serving as a foundation for the subsequent stage. In the second stage, we utilize the pre-initialized model to categorize the contextualized representations of the prepended tokens from the training corpus for each language. Specifically, we perform clustering techniques, such as K-Means, to obtain Target Language Prototypes (TLP) that capture the essential characteristics of each language to guide the translation direction. Then, we propose an extension by integrating the TLP into the encoder or decoder using a Lang-Attention module. The extended model is trained iteratively until convergence. Our empirical experiments show that our method improves the language accuracy from 70.54% to 93.72% and increases the BLEU score of zero-shot translation directions by 5.28 points on IWSLT17. In the large-scale setting of the TED-59 dataset, our method improves the BLEU score of zero-shot translation directions by 3.74 points on average over 3306 translation directions and improves the language accuracy from 60.67% to 76.21%. Extensive analyses demonstrate that fusing TLP appropriately leads


to better language accuracy and improves the translation quality of zero-shot directions while preserving the performance of the supervised directions.

2 Background and Motivation

Early studies on multilingual neural machine translation mainly focused on extending the standard bilingual model to MNMT by incorporating language-specific components [7,13,22]. However, these approaches encounter rapid growth in the number of parameters as the number of languages increases. In contrast, Johnson et al. [12] successfully trained a single NMT model for multilingual translation by prepending an artificial target language token to source sentences without modifying the model architecture, which is parameter efficient and beneficial for zero-shot translation directions. It has therefore become the dominant approach to many-to-many MNMT [1,3,24]. However, it usually suffers from the off-target translation issue [27] on zero-shot translation directions: the MNMT model tends to translate input sentences into the wrong languages, which leads to low translation quality. To alleviate it, pioneering studies propose approaches such as generating pseudo sentence pairs for zero-shot directions [27], increasing the model cardinality [25], exploring language tag strategies [24], adding the language tag embedding to the model during training [11], modifying the vocabulary sharing [6], and so on. In our practical investigation, we have discovered a novel perspective for exploring the underlying factors that contribute to the off-target issue. For instance, as demonstrated in Table 1, when introducing multiple lexical variations to sentence pairs while maintaining the fundamental semantic structure, the off-target problem arises. This demonstrates that the representations of prepended language tokens are overly affected by their context information. Therefore, the indicative information of the language tokens is potentially lost during training and inference, leading to insufficient indicative ability of the language tags. In other words, the contextual representation of the prepended tokens significantly diminishes their indicative function, which is detrimental to both supervised and zero-shot translation. Drawing inspiration from this, we propose a method that introduces multiple language prototypes to guide translation into the desired target language. Specifically, we design a two-stage method that first obtains the multiple language prototypes and then integrates them into the multilingual neural machine translation models through Lang-Attention modules.

3 Method

In this section, we provide a detailed explanation of our two-stage method. First, we briefly introduce the baseline strategies, which are also employed in our method in the first stage, aiming to enable the model to generate reasonable indicative representations. Then, we describe the second stage in two separate sections. The first section discusses how we leverage the basic MNMT model

230

Y. Zheng et al.

to obtain the TLP (target language prototypes). In the second section, we explain how these TLP are utilized to train MNMT models extended with additional Lang-Attention modules.

3.1 Baseline Strategies

Previous studies [10,12,24] demonstrated that placing the target language tag (TLT) on the source side or the target side can alleviate the off-target issue and improve the quality of zero-shot translation, which has become one of the fundamental strategies for MNMT. Specific examples of the two strategies are shown in Table 2.

Table 2. Examples of modified input data under different language tag strategies. The bold token is the target language tag (__zh__). T-ENC means adding the target language tag (TLT) to the encoder side. T-DEC means placing the TLT on the decoder side of the model.

Strategy | Source sentence | Target sentence
Original | Hello World! | 你好,世界!
T-ENC | __zh__ Hello World! | 你好,世界!
T-DEC | Hello World! | __zh__ 你好,世界!
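As a small illustration of the two strategies in Table 2, the helper below prepends the target-language tag to either the source (T-ENC) or the target (T-DEC) sentence before tokenization. The tag format follows the __zh__ example in the table; the function itself is a hypothetical sketch, not part of the authors' code.

def add_language_tag(src, tgt, tgt_lang, strategy="T-ENC"):
    """Prepend the target-language tag to the source (T-ENC) or target (T-DEC) sentence."""
    tag = f"__{tgt_lang}__"
    if strategy == "T-ENC":
        return f"{tag} {src}", tgt
    if strategy == "T-DEC":
        return src, f"{tag} {tgt}"
    raise ValueError(f"unknown strategy: {strategy}")

# Example: add_language_tag("Hello World!", "你好,世界!", "zh", "T-DEC")
# -> ("Hello World!", "__zh__ 你好,世界!")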

In the first stage, we train a base Transformer model for N epochs with the basic T-ENC or T-DEC strategy until it is able to generate reasonable representations of language tags to guide the translation. Given the training corpus D = {(X, Y)}, where X and Y denote a source sentence and a target sentence respectively, the model is optimized by minimizing the cross-entropy loss. After training for N epochs, we obtain a pre-initialized model $\theta^{(N)}$.

Language Representative Prototypes

In the second stage, we first utilize the pre-initialized model to obtain target language prototypes (TLP). Building upon insights from previous studies, we introduce two types of TLP for the subsequent training procedure. Language Token Representation (LTR). Prior studies reveal that the embedding of the prepended tokens encode typological properties of languages [15–17], which can serve as guiding information for translation direction. However, this feature is susceptible to contextual influence from the input sentence pair based on our observation. As demonstrated in Fig. 1, the language representation in the encoder output may appear disorderly. However, it still exhibits a detectable distribution that can be categorized into distinct clusters. Inspired by this observation, we categorize sparse contextualized language representations into several representative prototypes over training set.

Effective Guidance in Zero-Shot MT via Multiple Language Prototypes

231

To be specific, using the pre-initialized model θ(N ) , we build an language representative prototypes lookup table C. We iterate through the training corpus langi and aggregate contextualized representations {H1langi , ..., HM (langi ) } of each language langi , where M (langi ) is the number of contextualized representations of the language tag for langi . Next, for each language langi , we use K-Means [14] to cluster the contextualized representations due to its efficiency for large number of samples in high dimensions: langi C langi = K-Means(H1langi , ..., HM (langi ) )

(1)

Fig. 1. The language representation, obtained through encoder visualization using tSNE over a randomly selected set of 1000 sentences from the training corpus, is generated from the Baseline models trained on IWSLT17.

Mean-Pooling Sentence Representation (MPSR). Inspired by the application in the classification task of pre-trained language models [21], we further consider the sentence embeddings in the training corpus of each language to be the indicative representations. For simplicity and efficiency, we utilize the meanpooling technique to generate the sentence embedding, which involves computing the average representation across all tokens in the sentence. Then, we use the clustering approach described above to gain k target language prototypes. 3.3

Integration of Multiple TLP

For each language langi , we gain the TLP and packed as a matrix C langi ∈ Rd×k , where d is the dimension of the pre-initialized model and k is the number of TLP. Then, we extend the model θ(N ) with a Lang-Attention module on the top of the

232

Y. Zheng et al.

self-attention module in each encoder layer or on the top of the cross-attention module in each decoder layer, which is inspired by Yin et al. [26]. The LangAttention aggregates the language representative prototypes and refines each token by the Multi-Head Attention (MHA) mechanism [23]: l l = MHA(Hsa , C, C) Hpa

(2)

l l Hpa = MHA(Hca , C, C)

(3)

l where C is the TLP of the target language, Hsa denotes the output of the selfl denotes the output of the attention in the corresponding encoder layer and Hca l is fed into the cross-attention in the corresponding decoder layer. The output Hpa feed-forward network. The overall process of enhancing the decoder is depicted in Fig. 2, and it shares a similar process of enhancement with the encoder. In our experimentation, we set k = 3 for default.

Stage 1

Stage 2 Decoder Layer Semantic Space

LTR Lookup Table

Language Tag Embeddings

Add & Norm Feed Forward H

H

H

H

......

Language Prototype

Lang-Attention K

Pre-initialized Model

Semantic Space MPSR Lookup Table

Hello

World

softmax Q

Sentence Embeddings

__en__

Add & Norm V

Aggregate

Add & Norm Multi-Head Attention Add & Norm

!

Masked Multi-Head Attention ......

Embedding

Fig. 2. The overall process of enhancing the decoder with the T-DEC strategy. The small circles denote the representation of tokens which is outputted by the last layer of the decoder.

4 4.1

Experiments and Analysis Experimental Settings

Dataset. We conduct experiments on the following three multilingual datasets: IWSLT17 [5], OPUS-7 [9] and TED-59 [11]. We choose four different languages for IWSLT17, seven languages from OPUS-100 [27] for its zero-shot translation test sets and all languages for TED-59. All the training data are English-centric parallel data, which means either the source-side or target-side of the sentence pairs is English. Table 3 shows the detail statistics of the three datasets.

Effective Guidance in Zero-Shot MT via Multiple Language Prototypes

233

Evaluation. All the results are generated with beam search = 5. The translation quality is evaluated using the case-sensitive detokenized sacreBLEU1 [20]. BLEU scores were averaged over test sets. Following Zhang et al. [27], we used LangAcc as a complementary evaluation metric for the off-target issue, which calculates the proportion of the translations in the desired target language2 . Additionally, we use Win Rate (WR) to denote the proportion of translation directions that outperform the ref system in terms of BLEU. Model and Training. We implement our MNMT models based on Fairseq [18]. We set the dimension of word embeddings and FFN layer to 512/2048. Embeddings were shared for the encoder, decoder and the output projection. As IWSLT17 is smaller, we use a 5-layer encoder and 5-layer decoder variation of Transformer-based model and train the models for 35 epochs [23] for IWSLT17. For OPUS-7 and TED-59, we follow the detail experiment settings in [11]. We train the models for 30 epochs, and use the last 5 epochs to be averaged for testing [11]. For all experiments, we set the default value of k to three. Table 3. The detail statistics of the datasets. Dataset

Type

#Language #Supervised #Zero-shot #Training sentences

IWSLT17 English-centric

4

OPUS-7

English-centric

7

12

30

TED-59

English-centric 59

116

3306

4.2

6

6

0.87M 12M 5M

Overall Performance

We mainly carried out experiments on the IWSLT17 and OPUS-7 datasets to examine the effectiveness of our methods. Table 4 shows the experimental results on the two datasets. It demonstrates that our methods can significantly improve the translation performance, especially on zero-shot translation directions. While for both T-ENC and T-DEC strategies, our methods can gain consistently improvement. It proves that the integration of target language prototypes can eliminate the off-target phenomenon. We further evaluated T-ENC+LTR and T-ENC+MPSR (the relative best systems in terms of average BLEU on all zero-shot translation directions on the IWSLT17 dataset) on the TED-59 dataset in order to compare our methods with Jin and Xiong [11] in the massively multilingual neural machine translation 1

2

For all datasets, the signature is: BLEU+case.mixed+nrefs.1+smooth.exp+tok.{13a, zh,ja-mecab-0.996}+version.2.3.1, tok.zh and tok.ja-mecab-0.996 are only for Chinese and Japanese respectively. We employed langid.id toolkit for language identification.

234

Y. Zheng et al.

scenario as shown in Table 5. Our main comparisons are made against LAAenc.self and Adapterenc , which also incorporate additional parameters in the encoder. Furthermore, we consider the best-performing experiments for the LEE methods. The experimental results strongly demonstrate the effectiveness of our proposed methods. 4.3

Ablation Study

Effect of Multiple Target Language Prototypes. Previous studies generally use only one indicative representation to guide the models. Based on the visualization of language tokens’ representations obtained by baseline model, as shown in Fig. 1, we use three as the default number of the target language representative prototypes. In this part, we consider the effectiveness of multiple indicative representations. We conduct the experiment on IWSLT17 and OPUS-7 by average the three TLP value obtained through clustering technique as the sole TLP to guide the model for efficiency, while maintaining the entire framework for comparison. Table 6 reveals that the method using the average representation consistently yields inferior results compared to the corresponding methods utilizing three representations, particularly in zero-shot translation directions. It demonstration substantiates the effectiveness of incorporating multiple indicative features. Effect of Lang-Attention Modules. As our methods introduce LangAttention modules into the baseline models to incorporate multiple target Table 4. Experiment results on IWSLT17 and OPUS-7. T-DEC and T-ENC denote the addition of the target language tags to the decoder side and the encoder side, respectively. LTR and MPSR denote the use of language tag representation and the meanpooling sentence representation for clustering to gain TLP, as explained in Sect. 3.2. Dataset

Method En → XX XX → EN Supervised BLEU BLEU BLEU WR

Zero-shot BLEU LangAcc WR

IWSLT17 T-DEC +LTR +MPSR T-ENC +LTR +MPSR

27.75 28.18 27.74 28.18 27.81 27.92

30.30 30.63 30.30 30.63 30.65 30.82

29.03 29.41 29.02 29.41 29.23 29.37

ref 50.00% 33.33% 83.33% 66.67% 83.33%

10.78 15.69 16.06 15.69 16.60 16.39

70.54% 92.81% 93.72% 91.61% 94.12% 93.77%

ref 100.00% 100.00% 100.00% 100.00% 100.00%

OPUS-7

28.16 28.37 28.23 28.11 28.21 28.39

31.89 32.15 31.97 31.77 32.12 31.90

30.03 30.26 30.10 29.94 30.16 30.15

ref 100.00% 58.33% 33.33% 75.00% 75.00%

13.03 15.16 15.12 13.47 15.11 14.88

84.78% 87.65% 87.34% 79.49% 86.99% 86.33%

ref 100.00% 93.33% 53.33% 76.67% 80.00%

T-DEC +LTR +MPSR T-ENC +LTR +MPSR

Effective Guidance in Zero-Shot MT via Multiple Language Prototypes

235

Table 5. Experiment results on TED-59. The results for LEE2,5 , LAAenc.self and Adapterenc are taken from the experiments conducted by Jin and Xiong [11]. Dataset Method

#Param En → XX XX → EN Supervised Zero-shot BLEU BLEU BLEU BLEU LangAcc

TED-59 LEE2,5 LAAenc.self Adapterenc T-ENC +LTR +MPSR

77M 92M 92M 77M 83M 83M

21.26 20.11 21.04 21.28 21.78 21.82

23.79 21.13 22.91 23.34 23.75 23.87

22.53 20.62 21.97 22.21 22.77 22.85

9.82 6.30 8.20 8.00 11.74 11.36

74.44% 71.92% 74.44% 60.67% 76.21% 75.96%

Table 6. Experimental results on the exploration about effect of multiple target language prototypes. “+AVG” means the TLP gained by average the corresponding three target language prototypes obtained by the clustering techniques. Dataset

Method En → XX XX → EN Supervised Zero-shot BLEU BLEU BLEU BLEU LangAcc

IWSLT17 T-DEC +LTR +AVG +MPSR +AVG T-ENC +LTR +AVG +MPSR +AVG

27.75 28.18 27.80 27.74 27.83 28.18 27.81 27.91 27.92 28.11

30.30 30.63 30.43 30.30 30.64 30.63 30.65 30.54 30.82 30.82

29.03 29.41 29.11 29.02 29.23 29.41 29.23 29.23 29.37 29.47

10.78 15.69 14.87 16.06 15.03 15.69 16.60 16.40 16.39 16.27

70.54% 92.81% 89.92% 93.72% 89.11% 91.61% 94.12% 93.03% 93.77% 93.40%

OPUS-7

28.16 28.37 27.76 28.23 27.77 28.11 28.21 28.39 28.39 28.22

31.89 32.15 31.52 31.97 31.54 31.77 32.12 31.99 31.90 31.93

30.03 30.26 29.64 30.10 29.66 29.94 30.16 30.19 30.15 30.07

13.03 15.16 14.39 15.12 14.58 13.47 15.11 14.72 14.88 14.38

84.78% 87.65% 87.25% 87.34% 87.05% 79.49% 86.99% 84.26% 86.33% 83.45%

T-DEC +LTR +AVG +MPSR +AVG T-ENC +LTR +AVG +MPSR +AVG

language prototypes, we conduct experiments on IWSLT17 to further analyse the effect of additional Lang-Attention modules. First, we consider the perfor-

236

Y. Zheng et al.

Table 7. Experimental results on the effect of additional attention modules. “DECi” and “ENCi” means only adding the Lang-Attention (LA) module in the i-th Decoder Layer and the i-th Encoder Layer, respectively. “ALL” means adding the LangAttention modules in all decoder layers or all encoder layers. “Decoder Layer” means extend the baseline architecture with an additional decoder layer. “Encoder Layer” means extend the baseline architecture with an additional encoder layer. Dataset

Method

IWSLT17 T-DEC + Decoder Layer + LA in DEC1 + LA in DEC2 + LA in DEC3 + LA in DEC4 + LA in DEC5 + LA in ALL T-ENC + Encoder Layer + LA in ENC1 + LA in ENC2 + LA in ENC3 + LA in ENC4 + LA in ENC5 + LA in ALL

#Param En → XX XX → EN Supervised Zero-shot BLEU BLEU BLEU BLEU LangAcc 46M 51M 47M 47M 47M 47M 47M 52M 46M 54M 47M 47M 47M 47M 47M 52M

27.75 28.06 27.88 27.83 27.72 27.98 27.83 28.18 28.18 26.97 27.89 27.92 28.05 27.95 27.97 27.81

30.30 30.31 30.15 30.43 30.37 30.24 30.51 30.63 30.63 28.69 30.36 30.55 30.57 30.32 30.50 30.65

29.03 29.19 29.01 29.13 29.04 29.11 29.17 29.41 29.41 27.82 29.12 29.24 29.31 29.14 29.23 29.23

10.78 12.08 13.62 14.05 14.65 15.26 15.10 15.69 15.69 15.37 16.22 16.29 16.29 16.29 16.08 16.60

70.54% 74.93% 84.93% 87.87% 89.20% 90.46% 90.04% 92.81% 91.61% 95.16% 93.76% 93.76% 93.74% 94.22% 94.05% 94.12%

mance is effected by the increasing amount of parameters. Thus, we compare the deeper Transformer with 6 encoder layers or 6 decoder layers (the Baseline models on IWSLT17 only contain 5 encoder layers and 5 decoder layers). Second, we further analyses the additional attention modules of different locations contribute to the performance. Therefore, we introduce the module to different encoder or decoder layers. The experimental results shown in Table 7. It demonstrates that with only one additional Lang-Attention module, our methods can perform better than the Baseline model and the deeper one, while maintaining less parameters. Another conclusion is that the Lang-Attention adding to the top layer (e.g. in fourth or fifth layer) contributes more to alleviate the off-target phenomenon.

5

Conclusion

In this work, we focus on enhancing the language accuracy of fully shared multilingual neural machine translation models to improve their zero-shot translation performance. Based on our practice, we argue that the representations of prepended language tokens are overly affected by their context information,

Effective Guidance in Zero-Shot MT via Multiple Language Prototypes

237

resulting in potential information loss of language tokens and leading to insufficient indicative ability. Therefore, the language-specific signals in prepended language tokens are not adequate to guide the MNMT models to translate into right directions. Start from this inspiration, we further propose a two-stage approach to introduce multiple target language prototypes into the baseline models to guide the translation direction. The experimental results demonstrate that our method consistently improves translation quality across diverse multilingual datasets. Further analyses show the effectiveness of our method in alleviating the off-target translation issue and improving the translation quality in both zero-shot and supervised directions. Acknowledgement. This work was supported by the Key Support Project of NSFCLiaoning Joint Foundation (No. U1908216), and the Project of Research and Development for Neural Machine Translation Models between Cantonese and Mandarin (No. WT135-76). We thank all anonymous reviewers for their valuable suggestions on this work.

References 1. Aharoni, R., Johnson, M., Firat, O.: Massively multilingual neural machine translation. In: Proceedings of the NAACL (2019) 2. Al-Shedivat, M., Parikh, A.: Consistency by agreement in zero-shot neural machine translation. In: Proceedings of the NAACL (2019) 3. Arivazhagan, N., Bapna, A., Firat, O., Aharoni, R., Johnson, M., Macherey, W.: The missing ingredient in zero-shot neural machine translation (2019) 4. Arivazhagan, N., et al.: Massively multilingual neural machine translation in the wild: findings and challenges (2019) 5. Cettolo, M., et al.: Overview of the IWSLT 2017 evaluation campaign. In: Proceedings of the 14th International Conference on Spoken Language Translation (2017) 6. Chen, L., Ma, S., Zhang, D., Wei, F., Chang, B.: On the off-target problem of zeroshot multilingual neural machine translation. In: Proceedings of the ACL Findings (2023) 7. Firat, O., Cho, K., Bengio, Y.: Multi-way, multilingual neural machine translation with a shared attention mechanism. In: Proceedings of the NAACL (2016) 8. Gu, J., Wang, Y., Cho, K., Li, V.O.: Improved zero-shot neural machine translation via ignoring spurious correlations. In: Proceedings of the ACL (2019) 9. Gu, S., Feng, Y.: Improving zero-shot multilingual translation with universal representations and cross-mapping. In: Proceedings of the EMNLP Findings (2022) 10. Ha, T.L., Niehues, J., Waibel, A.: Toward multilingual neural machine translation with universal encoder and decoder. In: Proceedings of the 13th International Conference on Spoken Language Translation (2016) 11. Jin, R., Xiong, D.: Informative language representation learning for massively multilingual neural machine translation. In: Proceedings of the COLING (2022) 12. Johnson, M., et al.: Google’s multilingual neural machine translation system: enabling zero-shot translation. Trans. Assoc. Comput. Linguist. 5, 339–351 (2017) 13. Kong, X., Renduchintala, A., Cross, J., Tang, Y., Gu, J., Li, X.: Multilingual neural machine translation with deep encoder and multiple shallow decoders. In: Proceedings of the EACL (2021)

238

Y. Zheng et al.

14. Lloyd, S.: Least squares quantization in PCM. IEEE Trans. Inf. Theor. 28, 129–137 (1982) 15. Malaviya, C., Neubig, G., Littell, P.: Learning language representations for typology prediction. In: Proceedings of the EMNLP (2017) 16. Oncevay, A., Haddow, B., Birch, A.: Bridging linguistic typology and multilingual machine translation with multi-view language representations. In: Proceedings of the EMNLP (2020) 17. Östling, R., Tiedemann, J.: Continuous multilinguality with language vectors. In: Proceedings of the EACL (2017) 18. Ott, M., et al.: fairseq: a fast, extensible toolkit for sequence modeling. In: Proceedings of the NAACL (2019) 19. Pham, N.Q., Niehues, J., Ha, T.L., Waibel, A.: Improving zero-shot translation with language-independent constraints. In: Proceedings of the Fourth Conference on Machine Translation (Volume 1: Research Papers) (2019) 20. Post, M.: A call for clarity in reporting BLEU scores. In: Proceedings of the Third Conference on Machine Translation: Research Papers (2018) 21. Reimers, N., Gurevych, I.: Sentence-BERT: sentence embeddings using Siamese BERT-networks. In: Proceedings of the EMNLP (2019) 22. Sachan, D., Neubig, G.: Parameter sharing methods for multilingual selfattentional translation models. In: Proceedings of the Third Conference on Machine Translation: Research Papers (2018) 23. Vaswani, A., et al.: Attention is all you need. In: Proceedings of the NeurIPS (2017) 24. Wu, L., Cheng, S., Wang, M., Li, L.: Language tags matter for zero-shot neural machine translation. In: Proceedings of the ACL Findings (2021) 25. Xu, H., Liu, Q., van Genabith, J., Xiong, D.: Modeling task-aware MIMO cardinality for efficient multilingual neural machine translation. In: Proceedings of the ACL (2021) 26. Yin, Y., Li, Y., Meng, F., Zhou, J., Zhang, Y.: Categorizing semantic representations for neural machine translation. In: Proceedings of the COLING (2022) 27. Zhang, B., Williams, P., Titov, I., Sennrich, R.: Improving massively multilingual neural machine translation and zero-shot translation. In: Proceedings of the ACL (2020)

Extending DenseHMM with Continuous Emission Klaudia Balcer(B)

and Piotr Lipinski

Computational Intelligence Research Group, Institute of Computer Science, University of Wrocław, Wrocław, Poland {klaudia.balcer,piotr.lipinski}@cs.uni.wroc.pl

Abstract. Traditional Hidden Markov Models (HMM) allow us to discover the latent structure of the observed data (both discrete and continuous). Recently proposed DenseHMM provides hidden states embedding and uses the co-occurrence-based learning schema. However, it is limited to discrete emissions, which does not meet many real-world problems. We address this shortcoming by discretizing observations and using a region-based co-occurrence matrix in the training procedure. It allows embedding hidden states for continuous emission problems and reducing the training time for large sequences. An application of the proposed approach concerns recommender systems, where we try to explain how the current interest of a given user in a given group of products (current state of the user) influences the saturation of the list of recommended products with the group of products. Computational experiments confirmed that the proposed approach outperformed regular HMMs in several benchmark problems. Although the emissions are estimated roughly, we can accurately infer the states. Keywords: HMM discretization

1

· embedding · co-occurrence · emission

Introduction

Hidden Markov Models (HMMs) continue to appeal to scientists, although their history dates back to the 20th century. They are appreciated for their simplicity, solid theoretical understanding, reliability, and ease of interpretation, especially in terms of explainable artificial intelligence. Recently reported applications concern motion recognition [10], finance [11], medical engineering [1], and others. Such a wide range of applications also exposes the limitations of the traditional model, such as unacceptably long learning time for large datasets (caused by the quadratic complexity of the Baum-Welch learning procedure) and the assumption of emission distribution coming from a known parametrized family. A number of extensions and enhancements of the standard model have been proposed to address those issues [5,8]. One of the recent is DenseHMM [9], which introduces continuous dense representations (embeddings) of discrete observations and states, and applies an efficient learning algorithm based on direct c The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2024  B. Luo et al. (Eds.): ICONIP 2023, LNCS 14452, pp. 239–251, 2024. https://doi.org/10.1007/978-981-99-8076-5_17

240

K. Balcer and P. Lipinski

co-occurrence optimization. Embedding provides a dense representation of the discrete hidden states and values which might be interpreted geometrically. However, this novelty limits to discrete emission, which is not suitable for many applications. Our work focused mainly on extending the idea of dense representations of hidden states on continuous observations. In this paper, we extend DenseHMMs for continuous observations with Gaussian distributions and propose a suitable learning algorithm that, first, endeavors to cluster observations, and next, evaluates a region-based co-occurrence matrix in a similar way to DenseHMMs. One of the interesting applications (which is not feasible for the original DenseHMMs because of its limitation to discrete emission) of the proposed approach may concern recommender systems, where we try to explain how the current state of the interests of a given user in a given group of products influences the saturation of recomendations, i.e. saturation of the list of recommended products with the group of products. This paper is structured in the following manner: Sect. 2 introduces the Gaussian Dense Hidden Markov Model. Sections 3 and 4 propose two learning algorithms: one based on a regular EM algorithm and one using on a region-based co-occurrence matrix, along with the computational experiments. Section 5 concludes our research.

2

Gaussian Dense Hidden Markov Model

A Hidden Markov Model (HMM) is defined by two stochastic processes: discrete hidden states {Xt }t∈N and observations {Yt }t∈N . The Markov process {Xt }t∈N is a discrete stochastic process of n hidden states (q1 , q2 , . . . , qn ). The first hidden state has the starting distribution X1 ∼ π. At each next timestamp, the random variable depends only on the one in the previous timestamp: P(Xt |Xt−1 , Xt−2 , . . . , X1 ) = P(Xt |Xt−1 ) (the process follows the Markov assumption). The probabilities of transiting from qi to qj are gathered in a transition matrix A(i,j) = P(Xt = qj |Xt−1 = qi ) for each i, j ∈ {1, 2, . . . , n}. The distribution of observed random variables {Yt }t∈N depends on current state: P(Yt |Yt−1 , Yt−2 , . . . , Y1 ; Xt , Xt−1 , . . . , X1 ) = P(Yt |Xt ); this property is called output independence assumption. In a basic HMM, observations are discrete. We will consider GaussianHMM, a model with emission coming from the (multivariate) normal distribution Yt |Xt = qi ∼ N (μi , Σi ) with probability distribution function P(Yt = y|Xt = qi ) = φi (y). The standard learning algo- Fig. 1. Schema of Gaussianrithm for HMMs is the commonly used DenseHMM Baum-Welch (also called Foward-Backward) algorithm, a special case of ExpectationMaximization [4].

Extending DenseHMM with Continuous Emission

241

Recently, Sicking et al. [9] came up with embedding basic HMM and defined DenseHMM. They enriched a basic HMM with dense representations of discrete states and observations. Each possible value was represented by two vectors (one incoming and one outgoing). They adopted two learning schemas for the purpose of training the model: the Expectation-Maximization algorithm (with a gradient-based part in the M-step) and the co-occurrence-based learning algorithm (unconstrained and faster). In this paper, we propose to combine the continuous emission of GaussianHMMs with the dense representation of hidden states in DenseHMMs. Our model, called Gaussian Dense Hidden Markov Model, has continuous emission and embedded hidden states. We will use in-coming embedding vectors ui and outgoing embedding vectors zi of each hidden state xi and starting vector zstart , as presented at the schema in Fig. 1, to obtain transitions probabilities via softmax kernelization: expzstart , ui  πi := P(X1 = qi ) = n , k=1 expzstart , uk 

(1a)

expzi , uj  . A(i,j) := P(Xt = qj |Xt−1 = qi ) = n k=1 expzi , uk 

(1b)

We adopt both mentioned learning algorithms to our model. The modified Forward-Backward algorithm is presented in Sect. 3. The benefit of embedding GaussianHMM is using continuous representation in a fast, unconstrained, cooccurrence-based learning algorithm. In Sect. 4, we propose to use discretization and adapt the co-occurrence-based learning algorithm to our model by defining the region-based co-occurrence.

3

Basic Learning Algorithm

One of the basic but also very powerful concepts of parameter estimation is maximizing the likelihood function. In GaussianDenseHMMs we assume the distribution of the process. However, we do not know the values of the hidden states. For the case of missing values, the Expectation-Maximization (EM) algorithm was provided [2]. The idea behind it is to iteratively: provide expectations of the missing values using the current parameter estimation (E) and find new parameters maximizing the likelihood function (fed with the expectations from the previous step). EM adapted to the case of HMMs is called the Baum-Welch algorithm. Loss. For simplicity  of the notation we will denote the whole paramater set as Θ = A, π, (φi )ni=1 . The loss function maximized in the Baum-Welch algorithm is not directly the likelihood, but the Evidence Lower BOund (ELBO) of the logarithm of the likelihood function. We present it in the form of three summands


L(Y, Θ) = L_1(Y, Θ) + L_2(Y, Θ) + L_3(Y, Θ):

L_1(Y, \Theta) = \sum_{i=1}^{n}\sum_{j=1}^{n}\sum_{t=2}^{T} P(X_t = i, X_{t-1} = j)\,\log\frac{\exp\langle z_j, u_i\rangle}{\sum_{k=1}^{n}\exp\langle z_j, u_k\rangle}   (2a)

L_2(Y, \Theta) = \sum_{i=1}^{n}\sum_{t=1}^{T} P(X_t = i)\,\log\phi_i(Y_t)   (2b)

L_3(Y, \Theta) = \sum_{i=1}^{n} P(X_1 = i)\,\log\frac{\exp\langle z_{start}, u_i\rangle}{\sum_{k=1}^{n}\exp\langle z_{start}, u_k\rangle}   (2c)

Please note that P(X_t = i, X_{t-1} = j) and P(X_t = i) are unknown, as we do not observe (X_t)_{t∈N}.

Step E. In the first step of each iteration, we calculate how probable each state and transition is, given the current parameter values. To run these calculations, we first introduce two recursive formulas used to compute the likelihood of a sequence:

– forward probability:

\alpha_1(i) = \frac{\exp\langle z_{start}, u_i\rangle}{\sum_{k=1}^{n}\exp\langle z_{start}, u_k\rangle}\,\phi_i(Y_1)   (3a)

\alpha_t(i) = \sum_{j=1}^{n} \alpha_{t-1}(j)\,\frac{\exp\langle z_j, u_i\rangle}{\sum_{k=1}^{n}\exp\langle z_j, u_k\rangle}\,\phi_i(Y_t)   (3b)

P(Y) = \sum_{i=1}^{n} \alpha_T(i)   (3c)

– backward probability:

\beta_T(i) = 1   (4a)

\beta_t(i) = \sum_{j=1}^{n} \frac{\exp\langle z_i, u_j\rangle}{\sum_{k=1}^{n}\exp\langle z_i, u_k\rangle}\,\phi_j(Y_{t+1})\,\beta_{t+1}(j)   (4b)

P(Y) = \sum_{j=1}^{n} \frac{\exp\langle z_{start}, u_j\rangle}{\sum_{k=1}^{n}\exp\langle z_{start}, u_k\rangle}\,\phi_j(Y_1)\,\beta_1(j)   (4c)

To estimate the current expectations, we use the probabilities:

\gamma_t^{(k)}(i) = P(X_t = i \mid \Theta^{(k-1)}) = \frac{\alpha_t(i)\,\beta_t(i)}{\sum_{j=1}^{n}\alpha_t(j)\,\beta_t(j)}   (5a)

\xi_t^{(k)}(i, j) = P(X_t = j, X_{t-1} = i \mid \Theta^{(k-1)}) = \frac{\alpha_{t-1}(i)\,\frac{\exp\langle z_i, u_j\rangle}{\sum_{k=1}^{n}\exp\langle z_i, u_k\rangle}\,\phi_j(Y_t)\,\beta_t(j)}{\sum_{j=1}^{n}\alpha_t(j)\,\beta_t(j)}   (5b)


Step M. In the M step, we inject the calculated expectations into the ELBO (Eq. 2) and find parameters maximizing the modified function:

z^{(k)}, u^{(k)}, \mu^{(k)}, \Sigma^{(k)} = \arg\max_{z, u, \mu, \Sigma}\; L_1^*(Y, \Theta^{(k)}) + L_2^*(Y, \Theta^{(k)}) + L_3^*(Y, \Theta^{(k)})

New emission distribution parameters can be calculated analytically. Embedding vectors are updated through a gradient-based procedure such as SGD [7] (used in the implementation) or Adam [6].

Algorithm 1. EM
  Initialize μ^(0), Σ^(0), u^(0), z^(0)
  k = 1
  while not converged do
    for t = 1 → T, i = 1 → n do
      calculate α_t^(k)(i) using Eq. 3
    end for
    for t = T → 1, i = 1 → n do
      calculate β_t^(k)(i) using Eq. 4
    end for
    for t = 1 → T, i = 1 → n do
      calculate γ_t^(k)(i) using Eq. 5a
      for j = 1 → n do
        calculate ξ_t^(k)(i, j) using Eq. 5b
      end for
    end for
    Update μ^(k), Σ^(k) analytically and z^(k), u^(k) using SGD (in max_iter steps)
    k += 1
  end while

Algorithm. First, we initialize μ^(0), Σ^(0) (usually using k-means, as in the standard implementation of HMMs in the hmmlearn library, https://hmmlearn.readthedocs.io/en/latest/) and u^(0), z^(0) (randomly, for example from the standard normal distribution). Then we can start the iterative learning process. In each iteration, we perform the E step, which consists of calculating forward probabilities α_t^(k)(i), backward probabilities β_t^(k)(i), and the current state and transition probability estimates γ_t^(k)(i) and ξ_t^(k)(i, j), respectively. Then, we update the model parameter estimates in the M step. We update μ^(k) and Σ^(k) using the analytical formulas and run an iterative, gradient-based procedure to update the embedding vectors z^(k) and u^(k). We repeat the E and M steps until we meet the convergence criterion, which can be a maximum number of iterations or a sufficiently small improvement in likelihood.
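As an illustration of the E step described above, the sketch below implements the recursions of Eqs. (3)-(5) for a single observation sequence, assuming π and A have already been obtained from the embeddings via Eq. (1). It is a simplified reading of the algorithm, not the authors' implementation, and omits the rescaling or log-space computation needed for long sequences.

```python
import numpy as np
from scipy.stats import multivariate_normal

def e_step(Y, pi, A, mus, Sigmas):
    """Schematic E-step of the modified Baum-Welch algorithm (Eqs. 3-5)."""
    T, n = len(Y), len(pi)
    # phi[t, i] = phi_i(Y_t), the Gaussian emission density of state i
    phi = np.array([[multivariate_normal.pdf(Y[t], mus[i], Sigmas[i])
                     for i in range(n)] for t in range(T)])

    alpha = np.zeros((T, n))
    alpha[0] = pi * phi[0]                                   # Eq. (3a)
    for t in range(1, T):
        alpha[t] = (alpha[t - 1] @ A) * phi[t]               # Eq. (3b)

    beta = np.zeros((T, n))
    beta[-1] = 1.0                                           # Eq. (4a)
    for t in range(T - 2, -1, -1):
        beta[t] = A @ (phi[t + 1] * beta[t + 1])             # Eq. (4b)

    gamma = alpha * beta
    gamma /= gamma.sum(axis=1, keepdims=True)                # Eq. (5a)

    xi = np.zeros((T - 1, n, n))
    for t in range(1, T):
        xi[t - 1] = alpha[t - 1, :, None] * A * (phi[t] * beta[t])[None, :]
        xi[t - 1] /= xi[t - 1].sum()                         # Eq. (5b)
    return gamma, xi
```

The M step would then use `gamma` for the analytical Gaussian updates and feed `gamma`/`xi` into the gradient-based update of the embeddings.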

4 Region-Based Co-occurrence Learning Algorithm

In this section, we propose a learning algorithm based on region-based co-occurrence (rCOOC). A major drawback of the EM algorithm is that it passes through all the observations in each iteration, so its complexity grows with the data size. We address this issue with the rCOOC-based learning algorithm. The main idea is to compute a summary of the data (the co-occurrence matrix) and then search for parameters consistent with it. This approach has been shown to work well for HMMs and DenseHMMs, which are both models with discrete emissions. Here, we discretize the continuous emission to adapt it to our model.

Region-Based Co-occurrence Matrix. First, we provide initial parameter values as in the EM algorithm (emission parameters from the k-means algorithm and randomly selected embeddings). Calculating the co-occurrences of observations can be seen as part of the initialization process. To specify the co-occurrences for GaussianDenseHMM, we need to establish a division of the observation space into regions r_1, ..., r_m. Then, we can define the region-based emission probability matrix B^r_{i,j} = \int_{r_j} \phi_i(y)\,dy (Fig. 2).

Fig. 2. Regions obtained in the discretization procedure and from the true model.

The choice of regions is ambiguous. On the one hand, we want to estimate the parameters as accurately as possible. On the other hand, a fine-grained division makes the task unnecessarily complicated by distinguishing many separate regions within the same distribution. We decided to keep the division simple while making it maximally informative. First, we divide only the minimal hyper-cuboid containing all observations (with a small predefined margin). We reuse the k-means model from the initialization of the emission parameters and build a simple decision tree classifier on the obtained labels. We use the division rules from its nodes to partition the observation space. Estimating the precise emission parameters is part of the further learning procedure, so we are satisfied with this inaccurate but reasonable space division. For specific data, however, one could propose a custom, more suitable discretization technique. Now we are able to define the rCOOC matrix, which describes the probabilities of co-occurring observations from given regions. For a collection r_1, ..., r_m ⊂ R^p such that for each Y_t there exists exactly one r_i containing Y_t, we define the region-based co-occurrence matrix (rCOOC matrix) as a probability matrix Ω_{i,j} = P(Y_t ∈ r_i, Y_{t+1} ∈ r_j) for i, j = 1, ..., m.

Loss. We consider the ground-truth (empirical) rCOOC matrix Ω^GT summarising the training sequences:

\Omega^{GT}_{i,j} = \#\{t : Y_t \in r_i, Y_{t+1} \in r_j\} / (T - 1)   (6)

and the matrix Ω^(k) calculated from the current parameter estimates, where the upper index (k) denotes the current iteration. The aim of the algorithm is to find parameters whose rCOOC matrix is as close as possible to the empirical one.


To derive the rCOOC matrix from the model parameters, we first define the state co-occurrence matrix T, a stochastic matrix with T_{k,l} = P(X_t = q_k, X_{t+1} = q_l). Using this definition, we can write Ω = (B^r)^T T B^r. If we assume that the starting probability is the stationary distribution of the transition matrix (π_i = Σ_j A_{ji} π_j), we obtain the analytical formula T_{k,l} = A_{kl} π_k. From now on, we do not parametrize the starting probability. Inserting the formula for T into the definition of Ω, we obtain:

\Omega_{i,j} = \sum_{k=1}^{n}\sum_{l=1}^{n} \pi_k\, b_{i,k}\, a_{k,l}\, b_{l,j}   (7)

Using the model parameter estimates from the k-th iteration, we obtain the matrix Ω^(k). To learn the distribution, we minimize the difference between the empirical and the estimated rCOOC matrices:

L_{rCOOC}(Y, \Theta^{(k)}) = \|\Omega^{GT} - \Omega^{(k)}\|^2   (8)

using a gradient-descent procedure like SGD [7] or Adam [6].
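The following PyTorch sketch shows how Eqs. (7)-(8) can be evaluated so that the loss is differentiable with respect to the embeddings. It is only an illustration under our own simplifications: the region-based emission matrix B^r is assumed to be given, and the stationary distribution is approximated by power iteration.

```python
import torch

def rcooc_loss(z, u, Br, omega_gt, power_iters=50):
    """Illustrative sketch of Eqs. (7)-(8), not the reference implementation.

    z, u     : (n, d) outgoing / incoming state embeddings (requires_grad=True)
    Br       : (m, n) region-based emission matrix, Br[i, k] = P(Y in r_i | state q_k)
    omega_gt : (m, m) empirical rCOOC matrix from Eq. (6)
    """
    A = torch.softmax(z @ u.T, dim=1)           # transition matrix, Eq. (1b)
    pi = torch.full((A.shape[0],), 1.0 / A.shape[0], dtype=A.dtype)
    for _ in range(power_iters):                # stationary distribution: pi = A^T pi
        pi = A.T @ pi
    T_mat = pi[:, None] * A                     # T[k, l] = pi_k * A[k, l]
    omega = Br @ T_mat @ Br.T                   # Eq. (7)
    return ((omega_gt - omega) ** 2).sum()      # Eq. (8): squared Frobenius norm
```

Minimizing this loss with `torch.optim.SGD` or `Adam` over z and u (and, through B^r, over the Gaussian parameters) corresponds to the gradient-based procedure described above.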

Algorithm 2. Co-occurrence-based learning
  model = k-means trained on Y
  Initialize μ^(0), Σ^(0), u^(0), z^(0), k = 0
  labels = clustering of Y from k-means
  tree = decision tree trained on labels
  nodes = partition rules from tree
  Y_disc = regions of observations split according to nodes
  Ω^GT = the rCOOC matrix obtained using Eq. 6
  while not converged do
    Calculate Ω^(k) and L_rCOOC(Y, Θ^(k)) according to Eq. 7, 8
    Calculate the derivatives of L_rCOOC(Y, Θ^(k)) with respect to Θ^(k)
    Update parameters Θ^(k) following a gradient-based procedure; k += 1
  end while
  Calculate matrix A and vector π from the embeddings according to Eq. 1b, 1a

Algorithm. First, we cluster the observations using k-means. We use this model not only to initialize μ^(0) and Σ^(0), but also to build a decision tree. The classifier's split rules are then used as observation region boundaries. Having established the partition, we can transform the training data into a discrete vector Y_disc and calculate the region co-occurrence matrix Ω^GT. We also need to initialize the embedding vectors u^(0), z^(0). After this preprocessing, learning is straightforward: we iteratively update the parameters with a gradient-based procedure. The convergence criterion is met after a specified number of iterations or when the loss improvement is small.
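A possible realization of this preprocessing, using scikit-learn for the k-means clustering and the decision tree, is sketched below for a single observation sequence. It treats the tree leaves as the regions induced by the split rules and computes the empirical rCOOC matrix of Eq. (6); the function and variable names are our own illustration, not the reference implementation.

```python
import numpy as np
from sklearn.cluster import KMeans
from sklearn.tree import DecisionTreeClassifier

def build_regions_and_cooc(Y, n_states):
    """Sketch of the preprocessing in Algorithm 2 for one sequence Y (T, p)."""
    km = KMeans(n_clusters=n_states).fit(Y)          # also provides mu^(0), Sigma^(0)
    labels = km.labels_
    tree = DecisionTreeClassifier().fit(Y, labels)   # axis-aligned partition rules
    y_disc = tree.apply(Y)                           # leaf index = region of each Y_t
    regions, y_disc = np.unique(y_disc, return_inverse=True)
    m = len(regions)

    omega_gt = np.zeros((m, m))
    for a, b in zip(y_disc[:-1], y_disc[1:]):        # consecutive region pairs
        omega_gt[a, b] += 1.0
    omega_gt /= len(Y) - 1                           # Eq. (6)
    return km, tree, omega_gt
```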


We investigate the properties of our model and its application to recommendation saturation. In the first experiment, we examine the quality of the results of the proposed approach with respect to its time consumption for different numbers of hidden states. The goal is to check whether we speed up the learning while obtaining sufficiently good results. After that, we confront our approach with a real-world-related task: we model genre saturation in recommendations. We refer to the regular GaussianHMM, implemented in the hmmlearn library, as the baseline solution.

4.1 Properties Study

To study the capabilities of GaussianDenseHMM, we used synthetic data, as shown in Fig. 3, and compared the model to the regular GaussianHMM. We repeated each run 10 times and compared the results in terms of likelihood, accuracy, co-occurrence loss, and time consumption. The goal is to check whether we can speed up learning on big datasets while still getting good results.

Fig. 3. Data sampled from GaussianHMM with n = 5 hidden states. The color marks the hidden states.

Fig. 4. Results (loglikelihood, region-co-occurrence loss, accuracy, time consumption) for n = 10 and T = 10 × 10000, 10 × 100000, 10 × 1500000, 10 × 2500000 for standard GaussianHMM and GaussianDenseHMM.

The time consumed by the rCOOC-based learning algorithm clearly depends on the hyper-parameters set. In this experiment, we used a non-optimized parameter set (learning rate = 0.003, number of iterations = n · 10000, where n is the number of hidden states). Also, the HMM implementation requires specifying


Table 1. Results of synthetic experiment for n = 5, 7, 10, 20 and T = 10 × 10000, 10 × 10000, 10 × 100000, 10 × 100000, respectively

n    metric             Standard GaussianHMM      GaussianDenseHMM
5    loglikelihood      (−3.02 ± 0.13) ×10^4      (−3.24 ± 0.16) ×10^4
     region-co-occur.   (1.26 ± 0.10) ×10^−2      (6.31 ± 0.07) ×10^−2
     accuracy           (10.00 ± 0.00) ×10^−1     (9.90 ± 0.01) ×10^−1
     time               (5.00 ± 0.60) ×10^−1      (3.91 ± 0.11) ×10^1
7    loglikelihood      (−3.42 ± 0.15) ×10^4      (−3.90 ± 0.53) ×10^4
     region-co-occur.   0.10 ± 0.10               (1.70 ± 0.03) ×10^−2
     accuracy           (9.39 ± 0.64) ×10^−1      (9.84 ± 3.10) ×10^−1
     time               1.91 ± 0.80               (5.60 ± 0.23) ×10^1
10   loglikelihood      (−3.95 ± 0.12) ×10^6      (−5.08 ± 0.76) ×10^6
     region-co-occur.   (2.29 ± 0.78) ×10^−1      (1.22 ± 0.45) ×10^−1
     accuracy           (8.60 ± 0.06) ×10^−1      (9.32 ± 0.39) ×10^−1
     time               (3.674 ± 0.053) ×10^3     (2.449 ± 0.063) ×10^2
20   loglikelihood      (−4.73 ± 0.44) ×10^6      (−6.10 ± 0.80) ×10^6
     region-co-occur.   (3.00 ± 0.30) ×10^−1      (8.67 ± 0.05) ×10^−2
     accuracy           (7.70 ± 4.10) ×10^−1      (9.08 ± 0.65) ×10^−1
     time               (1.59 ± 0.77) ×10^4       (1.45 ± 0.15) ×10^3

the number-of-iterations parameter; however, the learning can exit earlier if the convergence condition is met (we used 2000 · n iterations and tolerance = 0.01). Data size is denoted as T = s × t, where s is the number of sequences and t is the length of each sequence. Figure 4 shows the differences between the models for a fixed number of states and different sizes of training data over 10 replications of the experiment. Table 1 presents the exact results for different numbers of states. Each model outperforms the other on its own loss function. For small data sizes, our implementation is relatively slow. However, when the complexity of the task grows (more states or bigger data), the regular GaussianHMM using the EM algorithm becomes very time-consuming.

4.2 Recommendation Saturation Simulation

The practical importance of GaussianDenseHMM was studied in the context of recommender systems (RS). An RS tries to suggest to a given user the most relevant products from a given set of available products. In some sense, an RS is a type of black box, because it is usually unclear how different types of user activity affect its results; in particular, how the RS reflects the real (unknown,


hidden) user interests. In order to study such a research question, we consider a dataset prepared on the basis of the MovieLens20M dataset [3]. We focus on a selected group of users and their interests in a particular group of products (we consider a group of users/products only in order to avoid overfitting issues and/or insufficient data problems, so our studies may also consider single user/product if a sufficient amount of data is available). In some periods, these products may be additionally advertised, which increases interest in them. In some periods, these products may be unpopular, e.g. due to seasonality or weather reasons, which decreases interest in them.

Fig. 5. Ground truth states with unprocessed data and results obtained from the baseline model and GaussianDenseHMM with preprocessed data for different lengths of the period considered in the experiment ΔT = 3, 5, 7. The similarity to the unprocessed data is blurred with the growth of ΔT , which results in worse consistency to baseline states (80.88–84.02, 69.00–74.60, 60.35–60.60 percent accuracy, respectively).

At each time t, the users may be active in the e-commerce system and strongly interested in the products, e.g. due to some advertising of the products (state x_t = 2); active and regularly interested (state x_t = 1); or inactive/weakly interested (state x_t = 0). Let y_t denote the number of positive interactions with the products by the users at time t (e.g. the number of product page clicks, product purchases, or positive product ratings). In order to measure how the RS reflects the number of positive interactions in the list of recommended products, we transform y_t by the Hill saturation curve h(y) = y^p / (k + y^p) into the saturation of the list of recommended products by the particular group of products. In our experiments, the number y_t of positive interactions came from a 3-state HMM, and the Hill saturation curve was estimated on the MovieLens20M dataset: after randomly selecting 10 users, we generated recommendations for them 1000 times while injecting high ratings for items from the specified genre (simultaneously avoiding the mean converging to 5). Using the recommendations


Fig. 6. True states in comparison to states obtained from regular GaussianHMM and GaussianDenseHMM, omitting the sequence order, for ΔT = 7. The picture shows the overall state consistency. As the data is blurred, the trained models seem to distinguish low and high interest while mixing the medium state with one of the extreme states. Differences between models results occur due to convergence to local optima of both learning algorithms.

obtained from FunkSVD (https://sifter.org/simon/journal/20061211.html), we calculated the saturation of recommendations and fitted the Hill equation to the average saturation sequence. In order to denoise the user interest data, we preprocessed y_t with a moving average of length ΔT = 3, 5, or 7 days. Our approach was used to discover the hidden states x_t of the users' interests in the particular group of products. Figures 5 and 6 as well as Tables 2 and 3 present the results of comparing GaussianDenseHMMs with regular GaussianHMMs for 3 different lengths ΔT of the moving average used in preprocessing. GaussianDenseHMMs required less computing time in all cases and achieved higher state-decoding accuracy for ΔT = 3 and 5, while being comparable for ΔT = 7.

Table 2. Detailed results of recommendation saturation modeling experiment for ΔT = 3: confusion matrix (true states in rows, predicted states in columns), sensitivity and specificity (for state indicators), accuracy, loglikelihood, and model training time

                        Standard GaussianHMM              GaussianDenseHMM
state                   0         1         2             0        1         2
confusion matrix   0    597086    597086    0             458191   365425    112565
                   1    139716    1423215   0             0        1342466   220465
                   2    1633      475022    2021233       0        100243    2397645
sensitivity             63.78%    91.06%    80.92%        48.94%   85.89%    95.99%
specificity             96.52%    76.29%    100.0%        100.0%   86.44%    86.67%
accuracy                80.88%                            84.02%
loglikelihood           −9553362.99                       −47334974.89
time [s]                300.24                            147.45

Table 3. Summary of recommendation saturation modeling experiment

ΔT   model                  accuracy   loglikelihood    time [s]
3    regular GaussianHMM    80.88%     −9553362.99      300.24
     GaussianDenseHMM       84.02%     −47334974.89     147.45
5    regular GaussianHMM    69.00%     −10089991.59     313.05
     GaussianDenseHMM       74.60%     −57123400.16     122.54
7    regular GaussianHMM    60.60%     −10352963.04     246.09
     GaussianDenseHMM       60.35%     −57771627.53     163.75

Conclusions

In response to the discussion in the paper proposing DenseHMMs, we developed a generalization of this model to continuous emissions and adapted both proposed learning schemes. The idea of embedding the hidden states gives an improvement by providing an additional, dense representation of the hidden states. Incorporating discretization and region-based co-occurrence allowed us to adapt the fast, unconstrained learning algorithm to a continuous problem. We evaluated our model to present its theoretical properties and its ability to work in a real-world-related scenario. GaussianDenseHMM tended to be less time-consuming for big benchmarks. It also worked well with smaller amounts of data in terms of state-decoding accuracy. In the scenario set in the context of recommender systems, our model tended to work comparably well in a shorter time. In future work, we suggest exploring the model's behaviour for very large numbers of states, speeding it up for small data, and studying other discretization techniques.

Acknowledgement. This work was supported by the Polish National Science Centre (NCN) under grant OPUS-18 no. 2019/35/B/ST6/04379.

References 1. Boeker, M., Hammer, H.L., Riegler, M.A., Halvorsen, P., Jakobsen, P.: Prediction of schizophrenia from activity data using hidden Markov model parameters. Neural Comput. Appl. 35, 5619–5630 (2022) 2. Dempster, A.P., Laird, N.M., Rubin, D.B.: Maximum likelihood from incomplete data via the EM algorithm. J. Roy. Stat. Soc. Ser. B (Methodol.) 39, 1–38 (1977) 3. Harper, F.M., Konstan, J.A.: The MovieLens datasets: history and context. ACM Trans. Interact. Intell. Syst. 5, 19:1–19:19 (2016) 4. Hsiao, R., Tam, Y., Schultz, T.: Generalized Baum-Welch algorithm for discriminative training on large vocabulary continuous speech recognition system. In: IEEE International Conference on Acoustics, Speech, and Signal Processing, pp. 3769– 3772 (2009)


5. Huang, K., Fu, X., Sidiropoulos, N.D.: Learning hidden Markov models from pairwise co-occurrences with application to topic modeling. In: International Conference on Machine Learning, vol. 80, pp. 2073–2082 (2018) 6. Kingma, D.P., Ba, J.: Adam: a method for stochastic optimization. In: International Conference for Learning Representations (2015) 7. Lecun, Y., Bottou, L., Bengio, Y., Haffner, P.: Gradient-based learning applied to document recognition. Proc. IEEE 86, 2278–2324 (1998) 8. Lorek, P., Nowak, R., Trzcinski, T., Zieba, M.: FlowHMM: flow-based continuous hidden Markov models. In: Advances in Neural Information Processing Systems, vol. 35, pp. 8773–8784 (2022) 9. Sicking, J., Pintz, M., Akila, M., Wirtz, T.: DenseHMM: learning hidden markov models by learning dense representations. In: Advances in Neural Information Processing Systems (2020) 10. Zhang, F., Han, S., Gao, H., Wang, T.: A Gaussian mixture based hidden Markov model for motion recognition with 3D vision device. Comput. Electr. Eng. 83, 106603 (2020) 11. Zhang, M., Jiang, X., Fang, Z., Zeng, Y., Xu, K.: High-order hidden Markov model for trend prediction in financial time series. Phys. A 517, 1–12 (2019)

An Efficient Enhanced-YOLOv5 Algorithm for Multi-scale Ship Detection

Jun Li, Guangyu Li, Haobo Jiang(B), Weili Guo(B), and Chen Gong

Key Laboratory of Intelligent Perception and Systems for High-Dimensional Information of Ministry of Education, School of Computer Science and Engineering, Nanjing University of Science and Technology, Nanjing, China
{jun_li,guangyu.li2017,jiang.hao.bo,wlguo,chen.gong}@njust.edu.cn

Abstract. Ship detection has gained considerable attention from industry and academia. However, due to the diverse range of ship types and complex marine environments, multi-scale ship detection faces great challenges such as low detection accuracy. To address these issues, we propose an efficient enhanced-YOLOv5 algorithm for multi-scale ship detection. Specifically, to dynamically extract two-dimensional features, we design a MetaAconC-inspired adaptive spatial-channel attention module for reducing the impact of complex marine environments on large-scale ships. In addition, we construct a gradient-refined bounding box regression module to enhance the gradient sensitivity of the loss function and strengthen the feature learning ability, which relieves the issue of uneven horizontal and vertical features in small-scale ships. Finally, a Taylor expansion-based classification module is established, which increases the feedback contribution of the gradient by vertically adjusting the first polynomial coefficient and improves the detection performance of the model on few-sample ship objects. Extensive experimental results confirm the effectiveness of the proposed method.

Keywords: Multi-scale Ship Detection · Improved YOLOv5 Network · Attention Module

1 Introduction

Ship detection is a critical aspect of maritime supervision and plays an essential role in intelligent maritime applications such as sea area monitoring, port management, and safe navigation [24]. In recent years, various methods such as foreground segmentation [4,16,25], background subtraction [3,22], and horizon detection [12,24] have been widely explored and have made considerable progress. However, traditional ship detection methods often lack robustness and may have limited applicability in the presence of complex noise interference.

Supported by the National Science Fund of China under Grant 62006119.
J. Li and G. Li: Equal contributions.


Meanwhile, owing to the development of deep learning in object detection, deep learning-based object detectors have achieved significant advancements. For example, Faster r-cnn [17] is a classic two-stage detection method that employs a region proposal network to generate detection boxes directly. SSD [10] enhances the detection accuracy of multi-scale objects by conducting object detection on multiple feature layers. CenterNet [5] is a detection method that detects the center point and size of an object without anchors. The YOLO [1,13–15,23] series are classic single-stage object detection methods that extract multi-scale features via a backbone network and a feature pyramid network, while introducing an anchor box mechanism to enhance the model's robustness. Inspired by these deep learning-based detection methods, there is a growing research effort towards deep learning-based ship detection. Region proposal network-based methods [7,9] and regression-based methods [2,19] have made certain progress. However, various issues, such as false detections and missed detections, persist in ship detection due to factors like the influence of background noise on the sea surface, the uneven distribution of horizontal and vertical features of ships, and the different sizes of ships.

To relieve the issues above, we propose a novel efficient enhanced-YOLOv5 algorithm for multi-scale ship detection. Specifically, in order to mitigate the issue of complex marine environments disrupting large-scale ships, we propose a MetaAconC-inspired dynamic spatial-channel attention module that extracts two-dimensional features, mitigating the environmental impact on large-scale ships. Aiming at the problem of uneven horizontal and vertical features of small-scale ships, we design a gradient-refined bounding box regression module to increase the gradient sensitivity, enhancing the learning ability of the algorithm on small-scale ship features. In order to relieve the sensitivity of the cross-entropy function to class imbalance, we establish a Taylor expansion-based classification module that vertically adjusts the first polynomial coefficient to increase the gradient's feedback contribution, improving the detection performance of the model on few-sample ship objects. To summarize, our main contributions are as follows:

– We propose a novel efficient enhanced-YOLOv5 algorithm for multi-scale ship detection, where a MetaAconC-inspired dynamic spatial-channel attention module is designed to mitigate the influence of complex marine environments on large-scale ships.
– To mitigate the problem of uneven horizontal and vertical features of small-scale ships, we design an effective gradient-refined bounding box regression module to enhance the learning ability of the algorithm on small-scale ship features.
– To further relieve the sensitivity to class imbalance, we also construct a Taylor expansion-based classification module to increase the feedback contribution and improve the detection performance on few-sample ships.

2 Method

2.1 Overall Framework

The structure of the efficient enhanced-YOLOv5 algorithm is shown in Fig. 1. The algorithm comprises several components: a backbone network that extracts features at three different scales; the MetaAconC-inspired dynamic spatial-channel attention module, located in the three feature-processing channels behind the backbone network, which focuses on the feature refinement of multi-scale ships; and a feature pyramid network for feature enhancement. Finally, the detection heads generate the final predictions, and our proposed modules, namely the gradient-refined bounding box regression module and the Taylor expansion-based classification module, improve accuracy through gradient calculations and backpropagation during training.

Fig. 1. The pipeline of the efficient enhanced-YOLOv5 algorithm framework.

2.2 MetaAconC-Inspired Dynamic Spatial-Channel Attention Module

Due to the large span of large-scale ships in the image, their learned feature distributions tend to be largely split, which may confuse the object semantics and thus limit detection accuracy. Especially in complex marine environments, the semantic information of ships is easily polluted by background noise, which makes it difficult to learn. To mitigate the influence of complex marine environments on large-scale ships, we propose a MetaAconC-inspired dynamic spatial-channel attention module, as shown in Fig. 2. In detail,


Fig. 2. The overview of the MetaAconC-inspired dynamic spatial-channel attention module. APSA denotes the average pooling-based spatial attention module. MACDCA denotes the MetaAconC-inspired dynamic channel attention module.

the average pooling-based spatial attention module obtains the intra-channel relationships of the input features. Secondly, the MetaAconC-inspired dynamic channel attention module dynamically summarizes the spatial relationships of the features. As such, our module effectively learns the multi-dimensional feature information of ships, and the impact of complex marine environments on large ships is mitigated.

Average Pooling-Based Spatial Attention Module. This module integrates ship characteristic information across channels and further eliminates the negative impact of the complex marine environment on large-scale ships by exploiting the similar semantic characteristics of background noise in the channel dimension. After obtaining the feature F ∈ R^{H×W×C} from the input image through the CSPDarkNet53 backbone network, we feed F into the average pooling-based spatial attention module, which obtains global information via global average pooling over the channel dimension, followed by a sigmoid function, to produce a spatial-refined attention weight ∈ R^{H×W×1}. This weight is multiplied with the input feature F to obtain the spatial-refined feature F′, which is fed into the next module.

MetaAconC-Inspired Dynamic Channel Attention Module. Since background noise is not invariant in the spatial dimension and forms various kinds of unwanted noise in complex marine environments, we design this module to dynamically adjust the attention, better learn ship characteristics, and effectively reduce the interference of dynamic background noise. The module applies global average pooling and maximum pooling over the spatial dimensions of F′ and adds the results passed through a two-layer neural network based on the MetaACON function [11], followed by a sigmoid activation function, to obtain a channel-refined attention weight ∈ R^{1×1×C}. Finally, we multiply this weight with the feature F′ to obtain the refined feature. The smooth maximum function has been utilized to extend the Maxout function, resulting in the ACON family of activation functions. The MetaACON function allows the adaptive activation of neurons through the modification


of a parameter, denoted by γ, which is defined as follows:

f(x) = (p_1 − p_2)\,x \cdot \sigma(\gamma\,(p_1 − p_2)\,x) + p_2\,x,   (1)

where x represents the input, and σ is the sigmoid function. p_1 and p_2 are two channel-wise learnable parameters. The channel-wise parameter γ dynamically adjusts the activation of neurons through convolution operations, controlling whether they should be activated or not. The formula for γ is given by:

\gamma = \sigma\Big(W_1 W_2 \sum_{h=1}^{H}\sum_{w=1}^{W} x_{c,h,w}\Big),   (2)

where W_1 and W_2 represent two 1 × 1 convolution layers.
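The following PyTorch sketch reflects our reading of the two sub-modules described above; it is not the authors' code, and for brevity the channel branch uses an ordinary ReLU bottleneck instead of the learnable MetaAconC activation.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class SpatialChannelAttention(nn.Module):
    """Rough sketch: spatial attention (APSA) followed by channel attention
    (MACDCA), applied to a feature map of shape (B, C, H, W)."""

    def __init__(self, channels, reduction=16):
        super().__init__()
        self.channel_mlp = nn.Sequential(
            nn.Conv2d(channels, channels // reduction, kernel_size=1),
            nn.ReLU(inplace=True),
            nn.Conv2d(channels // reduction, channels, kernel_size=1),
        )

    def forward(self, x):
        # APSA: average over channels -> (B, 1, H, W) spatial weight
        spatial_w = torch.sigmoid(x.mean(dim=1, keepdim=True))
        x = x * spatial_w                       # spatial-refined feature F'

        # MACDCA: global avg + max pooling over space -> (B, C, 1, 1) channel weight
        avg = F.adaptive_avg_pool2d(x, 1)
        mx = F.adaptive_max_pool2d(x, 1)
        channel_w = torch.sigmoid(self.channel_mlp(avg) + self.channel_mlp(mx))
        return x * channel_w                    # channel-refined feature
```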

2.3 Gradient-Refined Bounding Box Regression Module

The CIOU loss [27] is a widely used bounding box regression loss and plays a crucial role in the YOLOv5 algorithm. However, CIOU loss has two main drawbacks for correspondence learning. (i) First, it only takes into account the aspect ratio of the bounding box, without considering the actual height and width of the object. Ships are not all regular rectangles, and the aspect ratio of different ship types varies greatly. For example, fishing boats are very slender, small in height but large in width, whereas ships such as passenger ships and cruise ships are very tall compared to their width in order to accommodate tourists. As a consequence, the differences in ship aspect ratios hinder the accurate fitting of ships with varying shapes, especially small-scale ships, leading to misidentifications and missed detections. (ii) Second, the gradient of the loss function remains constant, which renders the model insensitive when fitting multi-scale objects and makes small-scale ship detection more challenging.

To mitigate issue (i), we split the aspect ratio into height and width and treat them separately [26]. In this way, the fitting direction of the regression module is closer to the shape of the ship. The width-height loss directly minimizes the width and height differences between the target box and the bounding box so that the model can better fit ships with different shapes. It is defined as:

L_{SeaIOUv1} = 1 − SeaIOUv1,   (3)

where SeaIOUv1 is defined as:

SeaIOUv1 = IOU − \frac{\rho^2(b, b^{gt})}{c^2} − \frac{\rho^2(w, w^{gt})}{C_w^2} − \frac{\rho^2(h, h^{gt})}{C_h^2},   (4)

where b and b^{gt} represent the center points of the bounding box and the target box, respectively, ρ(·) denotes the Euclidean distance, c is the diagonal length of the smallest enclosing box that covers both boxes, and C_w and C_h are the width and height of that minimum enclosing box.


To mitigate issue (ii), we establish a gradient-refined bounding box regression module that increases the gradient sensitivity of the loss function. Specifically, we remove the invariance of the gradient by applying a logarithmic function. The absolute value of the gradient decreases as the overlap increases, which is more favorable for bounding box regression. Thus, when the boxes are far apart, the absolute gradient value is larger, which is more conducive to the detection of small-scale ships. This approach enhances the contribution of small-scale ships to the feature learning of the model. The modified loss function is defined as:

L_{SeaIOUv2} = \alpha \cdot \ln\alpha − \alpha \cdot \ln(\beta + SeaIOUv1),   (5)

where α and β are parameters that control the gradient sensitivity of the loss function.
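A minimal sketch of Eqs. (3)-(5) in PyTorch is given below. The geometric quantities are assumed to be precomputed tensors, and the default values α = 5, β = 4 follow the experimental settings reported later; deriving the terms from raw box coordinates is omitted.

```python
import math
import torch

def sea_iou_v2_loss(iou, rho2_center, c2, rho2_w, cw2, rho2_h, ch2,
                    alpha=5.0, beta=4.0):
    """Sketch of Eqs. (3)-(5). Inputs are tensors: IoU, squared center/width/
    height distances, and the enclosing-box terms c^2, Cw^2, Ch^2."""
    sea_iou_v1 = iou - rho2_center / c2 - rho2_w / cw2 - rho2_h / ch2   # Eq. (4)
    # Eq. (5): the log rescaling gives gradient magnitude alpha / (beta + SeaIOUv1),
    # which grows as the boxes drift apart (the behavior argued for above).
    return alpha * math.log(alpha) - alpha * torch.log(beta + sea_iou_v1)
```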

2.4 Taylor Expansion-Based Classification Module

The cross-entropy loss is a popular classification loss and plays a crucial role in the YOLOv5 algorithm. It is defined as:

L_{CE} = −\log(P_t) = \sum_{j=1}^{\infty} \frac{1}{j}\,(1 − P_t)^j = (1 − P_t) + \frac{1}{2}(1 − P_t)^2 + \dots,   (6)

where P_t is the model's predicted probability of the ground-truth class. However, this loss is sensitive to class imbalance: it assumes that the classes are balanced, which may result in the model becoming biased towards the majority class and failing to capture the features of the minority class. Specifically, during training it back-propagates every ship type with the same contribution, making the model more inclined to learn ship classes with many samples, while the learning efficiency for ship classes with few samples is very low, which greatly limits their detection performance. In ship detection applications, the number of samples per ship class is very uneven. Some ship types are very common, so many samples are available during training for the model to learn their features and achieve good detection performance. Other ship types are much less common, and the ship detection model cannot obtain enough training samples to learn their characteristics. Expanding the dataset is a feasible approach, but it is costly; it is therefore necessary to optimize the training strategy. To mitigate this issue, we establish a Taylor expansion-based classification module, which expresses the loss function as a linear combination of polynomial terms. Based on the cross-entropy loss, its gradient is:

−\frac{dL_{CE}}{dP_t} = \sum_{j=1}^{\infty} (1 − P_t)^{j−1} = 1 + (1 − P_t) + (1 − P_t)^2 + \dots   (7)

J. Li et al.

From the above formula, it can be seen that the first term of the cross entropy loss function is the largest, which is 1. The subsequent terms are smaller and smaller, which means that the first term contributes the most to the gradient gain. By adjusting the first polynomial coefficient vertically [8], we increase the feedback contribution of cross-entropy gradient. This module further strengthens the fitting ability and alleviates the sensitivity to class imbalance, which is defined as: 2

LT −CE = (1 + 1 ) (1 − Pt ) + 1/2 (1 − Pt ) + . . . = − log (Pt ) + 1 (1 − Pt ) , (8) where 1 represents the parameter we adjusted in the first polynomial coefficient. In this way, the sensitivity of the classification module to the number of samples is improved, the problem of low gradient gain of few sample ships is alleviated, and the detection performance of the model for few sample ship templates is enhanced

3 Experiments

3.1 Experimental Settings

Dataset. We evaluate the performance of the proposed method on the SeaShips dataset [20], a well-known large-scale and precisely annotated maritime surveillance dataset released by Wuhan University in 2018. The dataset, collected by coastal land-based cameras in Hengqin, Zhuhai, covers 6 ship types of different sizes and contains 31,455 images, 7,000 of which are publicly available. We divide the images according to the official split: the training set and the validation set contain 1,750 images each, and the remaining 3,500 are used as the test set. Detection difficulties include ship size changes, complex background interference, and so on. In this dataset, fishing boats are small in size and passenger ships have few samples, so the detection accuracy for these classes is one of the main indicators of the model's performance on small-scale and few-sample ship objects.

Evaluation Indicators. We adopt the evaluation indicators of the COCO dataset, including mAP0.5, AP0.5, mAP0.75, and AP0.75. AP (Average Precision) is the area under the precision-recall curve. AP0.5 and AP0.75 are the AP values at IoU thresholds of 0.5 and 0.75, respectively. For multi-class detection, an AP value is computed for each class and the weighted average is taken to obtain mAP (mean Average Precision).

Implementation Details. For our experiments, one GeForce RTX 2080ti GPU card is used; the CUDA version is 10.0, the cuDNN version is 7.5.1, and the PyTorch version is 1.2.0. All models are trained for 300 epochs with a batch size of 4 and an initial learning rate of 1e-2, which is reduced to a minimum of 1e-4 using a cosine annealing schedule. We utilize the SGD optimizer with

An Efficient Enhanced-YOLOv5 Algorithm for Multi-scale Ship Detection

259

momentum 0.937 and weight decay 5e-4. All models are deployed according to the above Settings. YOLOv5 network is the original network of our method. We set α = 5, β = 4 and 1 = 1. In order to demonstrate the efficacy of the proposed method, we conduct an experimental comparison with the other conventional object detection methods on the Seaships dataset. Table 1. Detection results on the Seaships dataset. It shows mAP0.5 and AP0.5 in each class. The bold number has the highest score in each column. Model

mAP0.5 Bulk cargo carrier Container ship

Faster r-cnn1 Faster r-cnn2 SSD3002 SSD3003 YOLOv3 YOLOv4 Shao YOLOv5 Ours

0.949 0.946 0.935 0.891 0.941 0.921 0.874 0.952 0.966

0.958 0.927 0.949 0.918 0.952 0.901 0.876 0.953 0.961

0.994 0.990 0.987 0.967 0.983 0.975 0.903 0.988 0.991

Fishing boat General cargo ship Ore carrier Passenger ship 0.906 0.917 0.888 0.809 0.923 0.901 0.783 0.940 0.956

0.966 0.969 0.962 0.925 0.968 0.937 0.917 0.974 0.982

0.950 0.938 0.930 0.898 0.943 0.918 0.881 0.935 0.951

0.917 0.933 0.893 0.831 0.878 0.894 0.886 0.922 0.952

[1] ResNet50 [6] is selected as the backbone network. [2] VGG16 [21] is selected as the backbone network. [3] MobileNetv2 [18] is selected as the backbone network.

Table 2. Detection results on the Seaships dataset. It shows mAP0.75 and AP0.75 in each class. The bold number has the highest score in each column. Model

mAP0.75 Bulk cargo carrier Container ship

Fishing boat General cargo ship Ore carrier Passenger ship

Faster r-cnn1 Faster r-cnn2 SSD3002 SSD3003 YOLOv3 YOLOv4 YOLOv5 Ours

0.658 0.650 0.673 0.491 0.631 0.506 0.762 0.785

0.553 0.582 0.509 0.286 0.474 0.360 0.669 0.660

0.608 0.537 0.704 0.499 0.612 0.487 0.769 0.788

0.806 0.852 0.903 0.745 0.860 0.673 0.920 0.940

0.767 0.731 0.789 0.594 0.727 0.579 0.850 0.848

0.576 0.569 0.609 0.445 0.647 0.472 0.721 0.706

0.636 0.629 0.525 0.373 0.470 0.467 0.646 0.767

[1] ResNet50 [6] is selected as the backbone network. [2] VGG16 [21] is selected as the backbone network. [3] MobileNetv2 [18] is selected as the backbone network.

3.2

Quantitative Analysis

As shown in Table 1, we conduct an experimental comparison of mAP0.5 and AP0.5 with the other eight classical object detection methods on the Seaships dataset. The proposed method achieves a high mAP0.5 of 96.6%, with the 3 ship classes having the highest AP values. In particular, for passenger ship with a smaller sample, AP0.5 reaches 95.2%, an improvement of 3% over the original network. In addition, For small-scale fishing boat, AP0.5 reaches 95.6%, an increase of 1.6% over the original network. Compared to Faster r-cnn [17] with various backbone networks, our proposed method alleviates the interference of complex environment by adding the proposed attention module, with mAP0.5

260

J. Li et al.

increasing by 1.7% and 1.9%. Compared to SSD [10] with various backbone networks, our proposed method further enhance multi-scale features, with mAP0.5 increasing by 3.1% and 7.5%. Specifically, for fishing boat, AP0.5 increases by 6.8% and 14.7%. Compared to the YOLO series networks [1,15], our proposed method improves the feature description power of the model for multi-scale ships and achieves higher detection accuracy. Compared to Shao [19], our proposed method increases mAP0.5 by 9.2% by reducing the complex environment interference and sample imbalance sensitivity with the proposed regeression and classification module. Particularly for fishing boat and container ship, AP0.5 increases by 17.3% and 8.8%. In order to further verify the performance of our proposed model more strictly, we experimentally compare mAP0.75 and AP0.75 with five other classical object detection methods on the Seaships dataset. Table 2 presents the performance of different methods on Seaships, our proposed method also achieves the highest detection performance of 78.5%, an improvement of 2.3% over the original network. It’s worth noting that passenger ship with fewer samples, AP0.75 reaches 76.7%, an improvement of 12.1% over the original network. In conclusion, our proposed method is more effective than other classical methods for improving the accuracy of multi-scale ship detection. Table 3. Ablation experimental results of module on seaships Dataset. Model

mAP0.5 Bulk cargo carrier Container ship Fishing boat General cargo ship Ore carrier Passenger ship

YOLOv5 +b +b + c +a + b + c (ours)

0.952 0.954 0.963 0.966

3.3

0.953 0.944 0.958 0.961

0.988 0.988 0.993 0.991

0.940 0.948 0.956 0.956

0.974 0.977 0.976 0.982

0.935 0.943 0.948 0.951

0.922 0.921 0.950 0.952

3.3 Ablation Studies

Table 3 displays the effect of the three proposed modules on the performance of the method. To ensure a fair comparison, we use the same experimental setup for all variants. Here, a denotes the MetaAconC-inspired dynamic spatial-channel attention module, b the gradient-refined bounding box regression module, and c the Taylor expansion-based classification module. The original YOLOv5 network achieves an mAP0.5 of 95.2%. After adding b, the method improves small-scale ship detection by increasing the gradient sensitivity, resulting in an AP0.5 increase of 0.8% for fishing boats. Adding c further improves accuracy by reducing the sensitivity to class imbalance, yielding an overall mAP0.5 improvement of 0.9%, with AP0.5 improved by 2.9% for passenger ships, which have fewer samples. After adding a, the influence of the complex marine environment is weakened by focusing on the extraction of ship characteristics; our method with the proposed attention module raises mAP0.5 to 96.6%, 1.4% higher than the original YOLOv5. These results show that our modules significantly improve ship detection performance across different sizes and ship types.


Fig. 3. Qualitative comparison of different methods on Seaships.

3.4 Qualitative Analysis

Figure 3 illustrates the ship detection performance of our proposed method and the other classical methods under various complex conditions. From the first row, it can be seen that in the occlusion case Faster r-cnn produces duplicate bounding boxes due to its region proposal network, while SSD300, YOLOv4 and YOLOv5 all miss the bulk cargo carrier that is occluded. From the fourth row, none of the other detectors, except our method, detects the obscured passenger ship. As can be seen from the second and third rows, when multi-scale ships appear at the same time, Faster r-cnn again produces redundant detection boxes, and SSD and YOLOv4 fail to detect small fishing boats. The original YOLOv5 does not handle multi-scale ships well, detecting the small ships while missing the large ship spanning the whole image. By adding the proposed attention module, our method alleviates the fragmentation of the semantic information of large ships and detects them well. As can be seen from the fifth row, in the small-ship scenario, the detection box of Faster r-cnn is offset and SSD fails to detect the small-scale fishing boat. It can be concluded that our proposed method handles these situations with ease.

4 Conclusion

In this paper, we have proposed an efficient enhanced-YOLOv5 algorithm for multi-scale ship detection. Our approach consists of three components. Specifically, a MetaAconC-inspired dynamic spatial-channel attention module has been


designed to reduce the impact of complex marine environments on large-scale ships. We have also mitigated the issue of uneven horizontal and vertical features of small-scale ships by constructing a gradient-refined bounding box regression module. Moreover, we have proposed a Taylor expansion-based classification module to alleviate the sensitivity to class imbalance and improve the detection performance on few-sample ships. The experimental results demonstrate the effectiveness of our proposed method. In future work, we will further improve the model's ability to detect small-scale ships in complex marine environments.

Acknowledgement. This work is supported by the National Science Fund of China under Grant 62006119.

References 1. Bochkovskiy, A., Wang, C.Y., Liao, H.Y.M.: Yolov4: Optimal speed and accuracy of object detection. arXiv preprint arXiv:2004.10934 (2020) 2. Chen, J., Xie, F., Lu, Y., Jiang, Z.: Finding arbitrary-oriented ships from remote sensing images using corner detection. IEEE Geosci. Remote Sens. Lett. 17(10), 1712–1716 (2019) 3. Chen, Z., Yang, J., Kang, Z.: Moving ship detection algorithm based on gaussian mixture model. In: 2018 3rd International Conference on Modelling, Simulation and Applied Mathematics (MSAM 2018), pp. 197–201. Atlantis Press (2018) 4. Dong, C., Feng, J., Tian, L., Zheng, B.: Rapid ship detection based on gradient texture features and multilayer perceptron. Infrared Laser Eng. 48(10), 1026004– 1026004 (2019) 5. Duan, K., Bai, S., Xie, L., Qi, H., Huang, Q., Tian, Q.: CenterNet: keypoint triplets for object detection. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 6569–6578 (2019) 6. He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 770–778 (2016) 7. Kim, K., Hong, S., Choi, B., Kim, E.: Probabilistic ship detection and classification using deep learning. Appl. Sci. 8(6), 936 (2018) 8. Leng, Z., Tan, M., Liu, C., Cubuk, E.D., Shi, X., Cheng, S., Anguelov, D.: PolyLoss: a polynomial expansion perspective of classification loss functions. arXiv preprint arXiv:2204.12511 (2022) 9. Li, Q., Mou, L., Liu, Q., Wang, Y., Zhu, X.X.: HSF-NET: multiscale deep feature embedding for ship detection in optical remote sensing imagery. IEEE Trans. Geosci. Remote Sens. 56(12), 7147–7161 (2018) 10. Liu, W., et al.: SSD: Single Shot MultiBox Detector. In: Leibe, B., Matas, J., Sebe, N., Welling, M. (eds.) ECCV 2016. LNCS, vol. 9905, pp. 21–37. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-46448-0_2 11. Ma, N., Zhang, X., Liu, M., Sun, J.: Activate or not: Learning customized activation. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 8032–8042 (2021) 12. Prasad, D.K., Prasath, C.K., Rajan, D., Rachmawati, L., Rajabaly, E., Quek, C.: Challenges n video based object detection in maritime scenario using computer vision. arXiv preprint arXiv:1608.01079 abs/1608.01079 (2016)


13. Redmon, J., Divvala, S., Girshick, R., Farhadi, A.: You only look once: unified, real-time object detection. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 779–788 (2016) 14. Redmon, J., Farhadi, A.: Yolo9000: better, faster, stronger. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 6517–6525 (2017) 15. Redmon, J., Farhadi, A.: Yolov3: an incremental improvement. arXiv preprint arXiv:1804.02767 (2018) 16. Ren, L., Ran, X., Peng, J., Shi, C.: Saliency detection for small maritime target using singular value decomposition of amplitude spectrum. IETE Tech. Rev. 34(6), 631–641 (2017) 17. Ren, S., He, K., Girshick, R., Sun, J.: Faster R-CNN: towards real-time object detection with region proposal networks. Adv. Neural. Inf. Process. Syst. 28, 91– 99 (2015) 18. Sandler, M., Howard, A., Zhu, M., Zhmoginov, A., Chen, L.C.: MobileNetV2: inverted residuals and linear bottlenecks. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 4510–4520 (2018) 19. Shao, Z., Wang, L., Wang, Z., Du, W., Wu, W.: Saliency-aware convolution neural network for ship detection in surveillance video. IEEE Trans. Circuits Syst. Video Technol. 30(3), 781–794 (2019) 20. Shao, Z., Wu, W., Wang, Z., Du, W., Li, C.: SeaShips: a large-scale precisely annotated dataset for ship detection. IEEE Trans. Multimedia 20(10), 2593–2604 (2018) 21. Simonyan, K., Zisserman, A.: Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556 (2014) 22. Wang, B., Dong, L., Zhao, M., Xu, W.: Fast infrared maritime target detection: binarization via histogram curve transformation. Infrared Phys. Technol. 83, 32–44 (2017) 23. Wang, C.Y., Bochkovskiy, A., Liao, H.Y.M.: YOLOv7: trainable bag-offreebies sets new state-of-the-art for real-time object detectors. arXiv preprint arXiv:2207.02696 (2022) 24. Ye, C., Lu, T., Xiao, Y., Lu, H., Qunhui, Y.: Maritime surveillance videos based ships detection algorithms: a survey. J. Image Graphics 27, 2078–2093 (2022) 25. Zhang, Y., Li, Q.Z., Zang, F.: Ship detection for visual maritime surveillance from non-stationary platforms. Ocean Eng. 141, 53–63 (2017) 26. Zhang, Y.F., Ren, W., Zhang, Z., Jia, Z., Wang, L., Tan, T.: Focal and efficient IOU loss for accurate bounding box regression. Neurocomputing 506, 146–157 (2022) 27. Zheng, Z., Wang, P., Liu, W., Li, J., Ye, R., Ren, D.: Distance-IOU loss: faster and better learning for bounding box regression. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 34, pp. 12993–13000 (2020)

Double-Layer Blockchain-Based Decentralized Integrity Verification for Multi-chain Cross-Chain Data

Weiwei Wei, Yuqian Zhou(B), Dan Li(B), and Xina Hong

College of Computer Science and Technology, Nanjing University of Aeronautics and Astronautics, Nanjing, China
[email protected], [email protected]

Abstract. With the development of blockchain technology, issues like storage, throughput, and latency emerge. Multi-chain solutions are devised to enable data sharing across blockchains, but in complex cross-chain scenarios, data integrity faces risks. Due to the decentralized nature of blockchain, centralized verification schemes are not feasible, making decentralized cross-chain data integrity verification a critical and challenging problem. In this paper, based on the ideas of "governing the chain by chain" and "double-layer blockchain", we propose a double-layer blockchain-based decentralized integrity verification scheme. We construct a supervision-chain by selecting representative nodes from multiple blockchains, which is responsible for cross-chain data integrity verification and recording the results. Specifically, our scheme relies on two consensus phases: integrity consensus for verification and block consensus for result recording. We also integrate a reputation system and an election algorithm within the supervision-chain. Through security analysis and performance evaluation, we demonstrate the security and effectiveness of our proposed scheme.

Keywords: Data integrity · Double-layer blockchain · Cross chain · Decentralized verification · Multi-chain architecture

1 Introduction

Blockchain technology has gained significant attention across industries in recent years. Initially introduced as the underlying technology for cryptocurrency, blockchain has evolved into a disruptive innovation with the potential to revolutionize numerous sectors, including finance, supply chain management, healthcare, and more [14,18]. However, the widespread adoption of blockchain faces the scalability issue, which arises from the inherent design of traditional blockchain networks. As the number of participants and transactions increases, the single-chain architecture encounters limitations in terms of throughput, latency, and storage requirements. To solve these problems, some researchers have introduced innovative solutions such as multi-chain architecture and cross-chain technology [6]. For example, Kang et al. propose a multi-chain federated learning (FL) framework, in


which multiple blockchains are customized for specific FL tasks and individually perform learning tasks for privacy protection [7]. Multiple blockchains interact and collaborate with each other through cross-chain techniques, enabling a scalable, flexible, and communication-efficient decentralized FL system. These multi-chain architectures all require data sharing among multiple block-chains. Existing researches [4,16,17] focus only on how to achieve crosschain interaction and collaboration, without solving the problem of data integrity in multi-chain data sharing. Only when the integrity of the cross-chain data is confirmed can cross-chain applications effectively engage in data exchange and collaboration, thus achieving the objectives and advantages of a multi-chain architecture. Therefore, research is needed to ensure cross-chain data integrity among multiple chains. The data integrity problem in the cloud storage environment has been extensively studied. The provable data possession (PDP) scheme, along with its various iterations [1,10,19], is widely used to address the data integrity problem in the cloud. However, the integrity of data sharing in a multi-chain environment is fundamentally different as: 1) In a multi-chain architecture, it is necessary to verify the integrity of cross-chain data distributed among multiple receiving blockchains for a specific piece of data. In contrast, in cloud storage scenario, the integrity verification focuses on data within a single cloud. 2) A multi-chain architecture consists of multiple decentralized blockchains, lacking the centralized control found in cloud storage scenario. Additionally, to alleviate the heavy computational burden on users, various research studies employ third-party auditors (TPA) to check the data integrity on the untrusted cloud [5,11]. However, the centralized TPA, which can never be fully trusted, will weaken the decentralized nature of the blockchain. In this paper, we propose a decentralized scheme based on the idea of double-layer blockchain [2] to ensure cross-chain data integrity across multiple blockchains. We utilize representative nodes from each blockchain in the multi-chain architecture to construct a supervision-chain, which is responsible for cross-chain data integrity verification. Additionally, based on the lightweight sampling method and the Boneh-Lynn-Shacham (BLS) signature, we provide a probabilistic integrity guarantee. Furthermore, we also provide detailed descriptions of the reputation system, node election algorithm, and block consensus process for the proposed supervision-chain. The main contributions of this paper are summarized as follows: – This paper is the first to apply the idea of a double-layer blockchain to the field of cross-chain data integrity verification. Representative nodes are extracted from a multi-chain architecture to construct a supervision-chain, which is responsible for verifying the integrity of cross-chain data and recording the verification results. – A cross-chain data integrity verification process is designed within the supervision-chain, where nodes collaborate with each other for decentralized verification. Additionally, leveraging lightweight sampling algorithms and BLS signature, we achieve an efficient verification process.


– To improve the efficiency of the block consensus process, this paper introduces a block consensus committee. Moreover, the reputation system of the supervision-chain and the election algorithm for the block consensus committee are meticulously designed.

The remainder of this paper is organized as follows. Section 2 gives a brief introduction to the preliminaries covered in this paper. Section 3 defines the system model, threat models, and design goals. Section 4 presents the proposed scheme. Section 5 conducts the security analysis. Section 6 evaluates the performance of the proposed cross-chain data integrity verification scheme. Section 7 concludes this paper and outlines future work.

2 Preliminaries

2.1 Bilinear Pairing

Bilinear pairing [12] is a cryptographic primitive that relies on a hardness assumption similar to the elliptic curve discrete logarithm problem; it can often be used to reduce a problem in one group to an easier problem in another group. Let G and G_T be two multiplicative cyclic groups of large prime order q. A map e : G × G → G_T is a bilinear pairing only if it satisfies the three properties below:
– Bilinearity: for u, v ∈ G and a, b ∈ Z_q^*, e(u^a, v^b) = e(u, v)^{ab};
– Non-degeneracy: e(g, g) is a generator of G_T;
– Computability: for all u, v ∈ G, there exists an efficient algorithm to compute e(u, v).

2.2 BLS Signature

The Boneh-Lynn-Shacham (BLS) signature [9] is a cryptographic scheme widely used to help senders authenticate their messages. It works on top of elliptic curves and employs bilinear pairing to perform verification. Assume that a sender is equipped with a public/private key pair (pk, sk), where sk ∈ Z_q^* and pk = g^{sk}. To generate a signature sig for a given message mes, the sender maps mes to the elliptic curve with a secure hash function hash(). Then, it generates the signature from its private key, i.e., sig = hash(mes)^{sk}. A receiver can verify mes with the bilinear map e() mentioned in Sect. 2.1, based on the sender's public key pk and the message signature sig. If Eq. (1) holds, the received message mes is correct.

e(pk, hash(mes)) = e(g, sig)    (1)

The security of the BLS signature is ensured by the hardness of solving the Computational Diffie-Hellman (CDH) problem. In our scheme, each node in the supervision-chain has a randomly chosen unique sk as its private key, and its corresponding public key pk is generated as g^{sk}.
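As a minimal illustration of Eq. (1), the sketch below signs and verifies a message with a symmetric (type A) pairing. It assumes the charm-crypto Python library; the paper's own implementation uses JPBC in Java, so this is only an analogous sketch, not the authors' code.

```python
from charm.toolbox.pairinggroup import PairingGroup, ZR, G1, pair

group = PairingGroup('SS512')   # symmetric pairing, comparable to JPBC's type A curve
g = group.random(G1)            # public generator shared by all parties

sk = group.random(ZR)           # private key sk
pk = g ** sk                    # public key pk = g^sk

mes = "cross-chain data block"
h = group.hash(mes, G1)         # hash(mes): map the message onto the curve
sig = h ** sk                   # BLS signature sig = hash(mes)^sk

# Verification of Eq. (1): e(pk, hash(mes)) == e(g, sig)
assert pair(pk, h) == pair(g, sig)
```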

2.3 Verifiable Random Function

A verifiable random function (VRF) is a cryptographic function that produces pseudo-random and publicly verifiable values, e.g., the one introduced in [3] based on bilinear pairing. Specifically, given a random seed x, a user u equipped with a public/private key pair (pk, sk) can generate a random value f_sk(x) by Eq. (2) and a tag π_sk(x) by Eq. (3).

f_sk(x) = e(g, g)^{1/(x+sk)}    (2)

π_sk(x) = g^{1/(x+sk)}    (3)

The tag π_sk(x) is used to prove the correctness of f_sk(x). With both f_sk(x) and π_sk(x), a receiver can verify the correctness of f_sk(x) based on u's public key pk. If both Eq. (4) and Eq. (5) hold, the random value f_sk(x) was correctly generated by u.

e(g^x · pk, π_sk(x)) = e(g, g)    (4)

e(g, π_sk(x)) = f_sk(x)    (5)
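The following sketch evaluates and verifies this VRF with charm-crypto, under two assumptions not stated in the paper: that charm's `~` operator computes the modular inverse of a ZR element, and that a symmetric type A curve is used as in the experiments.

```python
from charm.toolbox.pairinggroup import PairingGroup, ZR, G1, pair

group = PairingGroup('SS512')
g = group.random(G1)           # public generator

sk = group.random(ZR)          # private key
pk = g ** sk                   # public key

x = group.random(ZR)           # public random seed

inv = ~(x + sk)                # 1/(x + sk) in Z_q (assumed inverse operator)
f_val = pair(g, g) ** inv      # f_sk(x) = e(g, g)^{1/(x+sk)}   (Eq. 2)
tag = g ** inv                 # pi_sk(x) = g^{1/(x+sk)}        (Eq. 3)

# Public verification, Eqs. (4) and (5)
assert pair((g ** x) * pk, tag) == pair(g, g)
assert pair(g, tag) == f_val
```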

3 Problem Statement

3.1 System Model

In a multi-chain architecture with n consortium blockchains, the blockchain that possesses the original data d intends to verify the integrity of the cross-chain data stored in the receiving blockchains. Representative nodes are selected from each blockchain to form a supervision-chain, which is also a consortium blockchain. We refer to the original blockchains in the multi-chain architecture as the sub-layer and to the supervision-chain as the main layer, forming a double-layer framework [2]. In the supervision-chain, each node can read data from the blockchain within the sub-layer it belongs to. Therefore, cross-chain data integrity verification for each blockchain in the sub-layer can be done within the supervision-chain. There are n nodes in the supervision-chain, denoted as Node = {node_i | 1 ≤ i ≤ n}. Each node node_i ∈ Node has a public/private key pair (pk_{node_i}, sk_{node_i}) and is identified by its public key pk_{node_i}. Here, sk_{node_i} = x_i ∈ Z_p^* and pk_{node_i} = g^{x_i} ∈ G. Assume that a blockchain in the sub-layer has sent data d by a cross-chain method to each of k (k < n) receiving blockchains, and that the corresponding nodes in the supervision-chain are denoted as Node_d ⊆ Node. The system model consists of four parts: the sending-chain, the receiving-chains, the supervision-chain, and the other blockchains in the multi-chain architecture. The system model is shown in Fig. 1.

Sending-Chain: The original data owner, who sends the data d to the receiving-chains using cross-chain methods, intends to verify the integrity of the cross-chain data d.


Receiving-Chain: A receiving-chain is a recipient of the cross-chain data, and it is subject to the data integrity verification conducted by the supervision-chain.

Other Blockchain: The other blockchains in the multi-chain architecture act as participants in the construction of the supervision-chain.

Supervision-Chain: The supervision-chain is constructed from representative nodes of the blockchains in the multi-chain architecture and is responsible for verifying the integrity of the cross-chain data in the receiving-chains.

Fig. 1. System model

3.2 Threat Models

Assume that the representative node of the blockchain sending the data d is node_s ∈ Node_d, and that a representative node of a blockchain receiving the data d is node_r ∈ Node_d. During the data integrity verification process in the supervision-chain, the following threats exist:
– Unexpected Failures. Faults such as hardware failures, software exceptions and cyber attacks may cause cross-chain data to be corrupted.
– Modification Attack. The cross-chain data may be modified by the receiving-chain before being stored on the chain.
– Freeriding Attack. A node_r may reuse an integrity proof message from another honest node_r to pass the integrity verification.
– Prediction Attack. If the nodes participating in the block consensus of the supervision-chain can easily be predicted in advance, external adversaries can easily attack the block consensus process.

3.3 Design Goals

Under the above system model and threat model, our scheme should meet the following three goals.


– Decentralized Verification. In the cross-chain scenario, using centralized entities for data integrity verification would weaken the decentralization of blockchain systems. By distributing the verification process across multiple nodes, the overall system becomes more resilient and resistant to attacks or manipulation.
– Correctness. The proposed scheme should ensure that the supervision-chain can correctly verify the integrity of cross-chain data through the integrity verification process.
– Security. The proposed scheme should prevent modification attacks, freeriding attacks, and prediction attacks.

4 Double-Layer Blockchain-Based Decentralized Integrity Verification Scheme

4.1 Overview

In our scheme, we use the supervision-chain to verify the integrity of cross-chain data in the blockchains of the sub-layer and to record the results. To achieve these goals, we employ two consensus protocols, one for integrity consensus and the other for block consensus [9]. The integrity consensus aims to reach agreement on the verification result for a given piece of cross-chain data d. The block consensus is used within the system to reach agreement on the blocks that will be recorded on the blockchain. We assume that, in the sub-layer, a blockchain possessing the original data d aims to verify the integrity of the cross-chain data stored on the receiving blockchains. We refer to the representative node of the blockchain which sent the original data d as node_s, and to the representative nodes of the blockchains which received the cross-chain data as node_r ∈ Node_d. In summary, a complete run of the scheme consists of two phases: integrity consensus and block consensus. In the first phase, the system reaches a consensus on the verification result. Once enough results have been generated in the first phase, the second phase starts, in which the system reaches a consensus on the transaction information and records it on the blockchain. Next, we explain the integrity consensus process and the block consensus process in detail. For simplicity, we summarize the notations in Table 1.

4.2 Integrity Consensus

First, we describe the sampling algorithm used in the integrity verification process. Then we give the detailed integrity consensus process.

4.2.1 Sampling Algorithm

Inspired by the sampling algorithm proposed in [15], node_s generates sampling parameters sp_r for each node node_r ∈ Node_d by the following steps. Assume that the size of Node_d is k + 1. Per(x, y) is a pseudo-random permutation function, where x is a random number and y is the total number of data blocks to be permuted.

Table 1. The notations in our scheme.

Notation     Description
d            The cross-chain data
n            The total number of nodes in the supervision-chain
Node_d       The corresponding nodes which possess data d
node_s       The representative node of the sending-chain in the supervision-chain
node_r       The representative node of the receiving-chain in the supervision-chain
Node_b       The block consensus committee
k            The total number of representative nodes of the receiving-chains
m            The number of nodes in the block consensus committee
Per(x, y)    A pseudo-random permutation function
hash()       A hash function
f            The number of malicious nodes that the blockchain system can tolerate

Step 1. For the data d, which is divided into n data blocks, node_s uses Per(x, y) to permute the index array {1, 2, ..., n} and obtains a randomly sorted index array SIA = {index_1, index_2, ..., index_n}.

Step 2. node_s divides SIA into k subsets, i.e., SIA = {C_1, C_2, ..., C_k}, such that the k subsets are pairwise disjoint and their union is {1, 2, ..., n}. These k subsets are used as the challenged index sets for Node_d. In general, each subset has about ⌊n/k⌋ elements, where ⌊n/k⌋ denotes the integer part of n/k; the last subset has n − (k − 1)⌊n/k⌋ elements.

4.2.2 Integrity Consensus Process

Assume that node_s would like to verify the integrity of the cross-chain data d held by each node_r ∈ Node_d. The integrity consensus process goes through four steps, as shown in Fig. 2.
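Before walking through the four consensus steps, here is a minimal sketch of the sampling-parameter generation of Sect. 4.2.1. The function name is illustrative, and Python's seeded shuffle merely stands in for the pseudo-random permutation Per(x, y) used in the scheme.

```python
import random

def generate_sampling_parameters(n_blocks: int, k: int, seed: int):
    """Return k disjoint challenge index sets whose union is {1, ..., n_blocks}."""
    sia = list(range(1, n_blocks + 1))        # index array {1, 2, ..., n}
    random.Random(seed).shuffle(sia)          # Step 1: randomly sorted index array SIA

    size = n_blocks // k                      # Step 2: about floor(n/k) indices per subset
    subsets = [sia[i * size:(i + 1) * size] for i in range(k - 1)]
    subsets.append(sia[(k - 1) * size:])      # last subset gets n - (k-1)*floor(n/k) indices
    return subsets

sp = generate_sampling_parameters(n_blocks=20, k=3, seed=42)
assert sorted(i for s in sp for i in s) == list(range(1, 21))   # disjoint cover of all blocks
```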

Fig. 2. Integrity consensus process

Step 1 Verification Request. node_s uses the sampling algorithm (see Sect. 4.2.1) to generate sampling parameters sp = {C_1, C_2, ..., C_k}. It then samples data blocks from d according to the sampling parameter C_i ∈ sp and generates a Merkle hash tree (MHT) with a hash function hash() over the sampled data blocks. Next, node_s calculates the BLS signature of the root node of the MHT as the reference integrity proof for node_r, denoted as sig_r:

sig_r = hash(root)^{sk_{node_s}}    (6)

Finally, node_s sends the verification request vr = <pk_s, sp_r, d_id, sig_r> to each node_r ∈ Node_d. Here, pk_s represents the unique identifier of node_s, sp_r denotes the sampling parameters for node_r, d_id is the unique identifier of the cross-chain data d, and sig_r signifies the reference integrity proof for node_r.

Step 2 Integrity Proof Generation. Upon receipt of a verification request, node_r generates an MHT from its own cross-chain data d based on the specified sampling parameters sp_r and uses the root root' of this MHT to calculate its own integrity proof sig'_r:

sig'_r = hash(root')^{sk_{node_r}}    (7)

Then node_r broadcasts the integrity proof message ipm = <pk_s, pk_r, sig_r, sig'_r> in the supervision-chain. Here, pk_r represents the unique identifier of node_r.

Step 3 Verification Response. Upon receipt of an integrity proof message ipm from node_r, a node node_i checks whether Eq. (8) holds:

e(pk_s, sig'_r) = e(pk_r, sig_r)    (8)

If Eq. (8) holds, the ipm is valid and the cross-chain data in that sub-layer blockchain is intact. Otherwise, the integrity of the cross-chain data is corrupted. After validating all received ipm, node_i broadcasts a verification response vo = <result, list>, where result is the summary of the verification results and list is the set of public keys pk of the nodes whose integrity proofs are invalid. In particular, if all equations hold, result is true and list is empty.

Step 4 Agreement. When a node node_i receives 2n/3 identical verification responses vo, the final verification conclusion is made. In detail, if node_i receives 2n/3 responses with result = true, it concludes that all nodes in Node_d possess intact cross-chain data. Otherwise, node_i concludes that the nodes appearing 2n/3 times in the lists do not possess intact cross-chain data. After confirming the final result, node_i broadcasts an integrity consensus commit in the supervision-chain. When a node receives 2n/3 integrity consensus commits, it ends the integrity consensus process.
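The sketch below ties Steps 1–3 together: both node_s and node_r build a Merkle hash tree over the same sampled blocks and sign its root with their own keys, and the pairing check of Eq. (8) succeeds only if the two roots coincide. It assumes charm-crypto (as in the BLS sketch of Sect. 2.2) and a plain SHA-256 binary hash tree; neither detail is prescribed by the paper.

```python
import hashlib
from charm.toolbox.pairinggroup import PairingGroup, ZR, G1, pair

def mht_root(blocks):
    """Root of a binary Merkle hash tree over the given data blocks."""
    level = [hashlib.sha256(b).digest() for b in blocks]
    while len(level) > 1:
        if len(level) % 2:                      # duplicate the last node on odd levels
            level.append(level[-1])
        level = [hashlib.sha256(level[i] + level[i + 1]).digest()
                 for i in range(0, len(level), 2)]
    return level[0]

group = PairingGroup('SS512')
g = group.random(G1)
sk_s, sk_r = group.random(ZR), group.random(ZR)   # keys of node_s and node_r
pk_s, pk_r = g ** sk_s, g ** sk_r

sampled = [b"block-3", b"block-7", b"block-11"]   # blocks selected by sp_r

root = mht_root(sampled)                          # node_s side (original data d)
sig_ref = group.hash(root.hex(), G1) ** sk_s      # reference proof sig_r      (Eq. 6)

root_prime = mht_root(sampled)                    # node_r side (its stored copy of d)
sig_own = group.hash(root_prime.hex(), G1) ** sk_r  # node_r's own proof sig'_r (Eq. 7)

# Step 3 check, Eq. (8): e(pk_s, sig'_r) == e(pk_r, sig_r)
assert pair(pk_s, sig_own) == pair(pk_r, sig_ref)
```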

4.3 Block Consensus

In this subsection, we describe the whole block consensus process. First, the reputation system and the election algorithm used in the block consensus process are introduced. Then the detailed block consensus process is given.

4.3.1 Reputation System

In the supervision-chain, there are two types of rewards that serve as incentives to motivate nodes to participate in the integrity verification process.

Transaction Reward. A transaction reward is provided by node_s and allocated to those nodes that honestly verify the integrity proof messages ipm.

Block Reward. This reward is produced by the system to encourage nodes to participate in the maintenance of the supervision-chain, similar to most blockchain systems. For a node to gain a block reward, it needs to satisfy two conditions: first, it is within the block consensus committee; second, it actively and honestly participates in the block consensus process.

The rewards gained by each node in the past are recorded on the blockchain, and the reputation of each node is calculated based on its historical rewards. In the election process of the consensus committee, nodes with a higher reputation score are given priority. In the reputation system, a node's reputation is determined by three factors: 1) the reputation score is higher when the node actively engages in a substantial number of integrity verifications; merely being honest without significant participation will not yield a high reputation score; 2) the reputation decreases with more occurrences of dishonest behavior, each of which results in a deduction of rewards and thereby lowers the reputation; 3) recent behavior carries considerable weight in shaping the reputation score. To meet these factors, we use an exponential moving average algorithm with bias correction to calculate the reputation of a node:

r^t_{node_i} = 0,  if t = 0;
r^t_{node_i} = [ρ × p^t_{node_i} + (1 − ρ) × r^{t−1}_{node_i}] / (1 − ρ^t),  if t > 0.    (9)

Here, p^t_{node_i} is the total amount of rewards from the recent transactions that have not yet been packed into a block, and ρ ∈ (0, 1) is a weighting factor; a larger ρ places more weight on the impact of recent transactions on the reputation. The term 1 − ρ^t is a bias correction factor that ensures the stability of the reputation score over time.

4.3.2 Election Algorithm

During the block consensus process, we first elect a block consensus committee. The block consensus is then carried out within this committee, and the agreed block is subsequently synchronized to the other nodes. Using a block consensus committee has the advantage of improving the efficiency of block consensus. A drawback is that it reduces the number of malicious nodes (denoted as f) that the supervision-chain system can tolerate. Given that the nodes in the supervision-chain are diligently chosen as representatives from the sub-layer consortium blockchains, the incidence of malicious nodes within the supervision-chain is notably low, so this drawback is acceptable. If the block consensus committee (m nodes) could be predicted from information available on the blockchain, adversaries could disrupt the consensus process by attacking 2m/3 nodes in the consensus committee. Therefore, it is unsafe to always select the top m nodes with the highest reputation scores to form the block consensus committee. Based on the reputation system (see Sect. 4.3.1), we use a VRF [9,13] to elect the consensus committee, ensuring that only nodes with a reputation score surpassing a certain threshold have the chance to be selected for the block consensus committee. The block consensus committee election goes through two steps, as follows.

Step 1 Candidate Preparation. Only nodes with a reputation score exceeding a certain threshold are qualified to become candidates. By referring to its own reputation records stored on the blockchain, a node can easily determine whether it meets the criteria necessary to become a candidate. If eligible, node_i generates a competition request cr_i = <pk_{node_i}, f_{sk_{node_i}}(x), π_{sk_{node_i}}(x)> based on a random seed x and its private key sk_{node_i}. Here, f_{sk_{node_i}}(x) and π_{sk_{node_i}}(x) are calculated as follows:

f_{sk_{node_i}}(x) = e(g, g)^{1/(x + sk_{node_i})}    (10)

π_{sk_{node_i}}(x) = g^{1/(x + sk_{node_i})}    (11)

Then it broadcasts the competition request cr_i in the supervision-chain.

Step 2 Leader and Member Determination. Upon receiving a competition request cr_i from node node_i, each node node_j ∈ Node performs the necessary checks: 1) it checks whether node_i has sufficient reputation to qualify as a candidate; 2) it validates the correctness of the competition request cr_i by verifying the following two equations:

e(g^x · pk, π_sk(x)) = e(g, g)    (12)

e(g, π_sk(x)) = f_sk(x)    (13)

If multiple candidates simultaneously possess valid competition requests, the node with the highest value of f_sk(x) is selected as the leader. Next, the 3f nodes with the highest values of f_sk(x) among the remaining candidates are selected as members of the block consensus committee.
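As a rough illustration of Sects. 4.3.1 and 4.3.2 together, the sketch below combines the bias-corrected EMA update of Eq. (9) with threshold-based candidacy and ranking by already-verified VRF outputs. The threshold value, the data layout, and the use of Python's built-in hash as a stand-in for the VRF are assumptions for illustration only.

```python
def update_reputation(prev_rep: float, recent_rewards: float, rho: float, t: int) -> float:
    """Bias-corrected exponential moving average of Eq. (9)."""
    if t == 0:
        return 0.0
    return (rho * recent_rewards + (1 - rho) * prev_rep) / (1 - rho ** t)

def elect_committee(nodes, threshold: float, f: int):
    """nodes: list of dicts with 'id', 'reputation', 'vrf_value' (already verified)."""
    candidates = [nd for nd in nodes if nd["reputation"] >= threshold]      # Step 1
    ranked = sorted(candidates, key=lambda nd: nd["vrf_value"], reverse=True)
    leader, members = ranked[0], ranked[1:3 * f + 1]                        # Step 2: leader + 3f members
    return leader, members

nodes = [{"id": i,
          "reputation": update_reputation(0.5, 0.1 * i, rho=0.9, t=5),
          "vrf_value": hash(("seed", i))} for i in range(10)]
leader, members = elect_committee(nodes, threshold=0.2, f=2)
print(leader["id"], [m["id"] for m in members])
```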


4.3.3 Block Consensus Process

When multiple integrity consensus processes are completed and there is enough transaction information to be packaged into a block, the block consensus process starts. A block consensus process consists of three steps, as shown in Fig. 3.

Fig. 3. Block consensus process

Step 1 Election. A block consensus committee Node_b is elected (see Sect. 4.3.2).

Step 2 Consensus. The leader packs the transactions (the final consensus information reached during the integrity consensus phase) into a new block. Specifically, based on the messages received, it allocates the transaction rewards provided by node_s to those nodes that honestly verified the integrity proof messages ipm, and it allocates negative transaction rewards to those representative nodes of the sub-layer blockchains that did not honestly store the cross-chain data d. It then updates the reputations of all nodes in the supervision-chain with Eq. (9); the related rewards and reputations are also packed into the block. The leader then broadcasts the block in the block consensus committee Node_b for validation. Upon receiving the block, a node node_i ∈ Node_b checks the correctness of the current round information. If the check passes, node_i broadcasts a prepare message to claim its ready state. Once node_i obtains more than 2m/3 prepare messages, it begins to verify the content of the block based on the information obtained during the integrity consensus phase. If this check passes, node_i broadcasts a commit message to the other nodes in Node_b. Finally, if node_i receives more than 2m/3 commit messages, it accepts the new block and appends it to the ledger.

Step 3 Synchronization. All nodes in the block consensus committee Node_b respond to the other nodes in the supervision-chain. A node accepts the new block if more than f + 1 identical blocks are received, where f is the maximum number of malicious nodes that the blockchain system can tolerate. At the end, the leader receives the block reward, while the other nodes in Node_b receive a block reward that is smaller than the leader's.

5 Security Analysis

In this section, we provide a brief evaluation of the correctness and security of the proposed scheme.

Theorem 1. If a blockchain honestly stores the cross-chain data d, its representative node can pass the integrity verification during the integrity consensus phase.

Proof. The correctness of Eq. (8) can be shown as follows:

e(pk_s, sig'_r) = e(g^{sk_s}, hash(root')^{sk_r}) = e(g^{sk_r}, hash(root')^{sk_s}) = e(pk_r, sig_r),

where the last equality holds because root' = root when node_r stores the intact data. Therefore, a node node_r holding the intact data can generate a valid sig'_r and pass the other nodes' verification (Step 3 in Sect. 4.2.2).

Theorem 2. A node node_i ∈ Node_d cannot reuse an integrity proof message from an honest node node_r to attack the integrity consensus.

Proof. An integrity proof message ipm = <pk_s, pk_r, sig_r, sig'_r> contains only the signatures sig_r and sig'_r, not the original hash tags. It is impossible for node_i to forge a signature due to the hardness of solving the CDH problem [8]. Moreover, node_i may reuse the existing signatures but change the identity of node_r from pk_r to pk_i in ipm to forge an integrity proof message ipm' = <pk_s, pk_i, sig_r, sig'_r>. However, this behavior can be easily detected by Eq. (8).

Theorem 3. If the cross-chain data d is divided into n blocks, the probability that the supervision-chain successfully detects a dishonest blockchain that does not store d accurately is at least Pr = 1 − ((n−c)/n)^t, where c is the number of altered data blocks within d and t is the number of challenged blocks.

Proof. Based on the sampling algorithm, the challenged blocks on the k nodes in Node_d do not overlap and their union is exactly the set of all data blocks of d. For a node node_r ∈ Node_d, the probability of finding d modified is equal to the probability of challenging at least one modified data block. In other words, the probability that at least one modified block is challenged on node_r during the integrity consensus phase is

Pr = 1 − ((n−c)/n) · ((n−c−1)/(n−1)) · ((n−c−2)/(n−2)) ⋯ ((n−c−t+1)/(n−t+1)).

For any integer i ≤ n, the inequality (n−c−i)/(n−i) ≥ (n−c−i−1)/(n−i−1) holds, so each factor is at most (n−c)/n. Hence,

Pr ≥ 1 − ((n−c)/n)^t.
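As a quick numerical illustration of Theorem 3 (not part of the original proof), the sketch below compares the exact detection probability with the bound and computes how many blocks must be challenged for 99% detection at a 1% corruption rate; the chosen values of n, c, and t are assumptions.

```python
from math import ceil, log

def exact_pr(n: int, c: int, t: int) -> float:
    p_miss = 1.0
    for i in range(t):
        p_miss *= (n - c - i) / (n - i)
    return 1.0 - p_miss

n, c, t = 45_000, 450, 460            # 1% of the blocks corrupted, 460 challenged
bound = 1 - ((n - c) / n) ** t
print(f"exact Pr = {exact_pr(n, c, t):.4f}, bound = {bound:.4f}")   # exact >= bound

# Smallest t with 1 - ((n-c)/n)^t >= 0.99 at a 1% corruption rate
t_min = ceil(log(1 - 0.99) / log((n - c) / n))
print("blocks needed for 99% detection:", t_min)                    # roughly 459
```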


Theorem 4. During the election process of the block consensus committee, any node can verify the correctness of a competition request cr.

Proof. For a competition request cr = <pk, f_sk(x), π_sk(x)>, its correctness can be verified as follows:

e(g^x · pk, π_sk(x)) = e(g^x · g^{sk}, g^{1/(x+sk)}) = e(g, g)

e(g, π_sk(x)) = e(g, g^{1/(x+sk)}) = e(g, g)^{1/(x+sk)} = f_sk(x)

If both of the above equations hold, then cr was generated by the node with the public key pk.

6 Performance Evaluation

In this section, we conduct a series of experiments to evaluate the performance of our scheme.

6.1 Experimental Settings

We implement the integrity verification process based on the Java Pairing-Based Cryptography Library (JPBC) version 2.0.0, which performs the mathematical operations underlying pairing-based cryptosystems. In our experiments, we choose the type A pairing parameters in the JPBC library, in which the group order is 160 bits and the base field order is 512 bits. Our experiments are conducted on a PC laptop running Windows 10 with an Intel Core i5 CPU at 2.50 GHz and 8 GB of DDR4 RAM. To get more precise results, each experiment is repeated for 100 trials.

6.2 Experimental Results and Analysis

We focus on the evaluation of computation cost, communication cost and detection precision in the integrity verification process. The detailed analysis is as follows.

Computation Cost. We set the number of nodes required for data integrity verification from 2 to 22. For the same block size of 8 KB, we set the data size of d to 4 MB and 8 MB, respectively, and measure the time required for the integrity verification process. As shown in Fig. 4, the larger the file, the longer it takes to complete the integrity verification process. This is due to the increase in the number of data blocks, which leads to a higher number of samples being taken from each node; consequently, a larger number of MHT nodes need to be computed during the verification process, resulting in increased time consumption. Furthermore, we can also observe from Fig. 4 that as the number of nodes to be verified increases, the time consumption also increases, which aligns with our expectations and falls within an acceptable range.


Communication Cost. We set the total number of nodes in the system from 5 to 25, with 4 nodes required for data integrity verification. For the same block size of 8 KB, we set the data size of d to 4 MB and 8 MB, respectively, and measure the communication cost of the integrity verification process. From Fig. 5, it can be observed that as the data size increases, the communication overhead remains relatively constant. This is because the increase in data size results in a larger number of data blocks, but it only leads to an increase in the number of blocks sampled within the data d on each node. Given that the number of integrity consensus nodes remains unchanged, the number of integrity proofs that need to be sent (i.e., the BLS signatures of the MHT roots) also remains constant. As a result, the communication overhead during the integrity verification process remains approximately unchanged. Furthermore, as shown in Fig. 5, for the same data size, the communication overhead increases with the number of nodes in the system, because a larger number of nodes necessitates more communication to achieve integrity consensus.

Detection Precision. We set the number of data blocks to 45,000 and investigate the relationship between detection accuracy and the number of sampled blocks under different corruption rates. As shown in Fig. 6, for each data corruption rate, our scheme achieves close to 100% detection accuracy with only a small number of sampled data blocks.

Fig. 4. The computation cost in the integrity verification process. Fig. 5. The communication cost in the integrity verification process. Fig. 6. The detection rate for data corruption.

7 Conclusion

To solve the problem of cross-chain data integrity in the multi-chain architecture, we propose a double-layer blockchain-based decentralized integrity verification scheme based on the ideas of "governing the chain by chain" and "double-layer blockchain". In detail, we construct a supervision-chain by selecting representative nodes from multiple blockchains; the supervision-chain is responsible for integrity verification and for recording the corresponding results. To achieve decentralized data integrity verification, we propose an integrity consensus protocol.


To record the verification results, we propose a block consensus protocol, and we utilize a block consensus committee to enhance the efficiency of the block consensus process. The security analysis and performance evaluation demonstrate the security and effectiveness of our proposed scheme. In the future, we aim to investigate a verification scheme for ensuring the integrity of cross-chain data processing results in multi-chain architectures.

Acknowledgement. This work is supported by the Fundamental Research Funds for the Central Universities (Grant No. NS2023047), the National Key Research and Development Program of China (Grant No. 2020YFB1005500) and the Postgraduate Research & Practice Innovation Program of NUAA (No. xcxjh20221616).

References
1. Ateniese, G., et al.: Provable data possession at untrusted stores. In: Proceedings of the 14th ACM Conference on Computer and Communications Security, pp. 598–609 (2007)
2. Ding, Q., Gao, S., Zhu, J., Yuan, C.: Permissioned blockchain-based double-layer framework for product traceability system. IEEE Access 8, 6209–6225 (2019)
3. Dodis, Y., Yampolskiy, A.: A verifiable random function with short proofs and keys. In: Vaudenay, S. (ed.) PKC 2005. LNCS, vol. 3386, pp. 416–431. Springer, Heidelberg (2005). https://doi.org/10.1007/978-3-540-30580-4_28
4. Jiang, Y., Wang, C., Wang, Y., Gao, L.: A cross-chain solution to integrating multiple blockchains for IoT data management. Sensors 19(9), 2042 (2019)
5. Jin, H., Jiang, H., Zhou, K.: Dynamic and public auditing with fair arbitration for cloud data. IEEE Trans. Cloud Comput. 6(3), 680–693 (2016)
6. Kan, L., Wei, Y., Muhammad, A.H., Siyuan, W., Gao, L.C., Kai, H.: A multiple blockchains architecture on inter-blockchain communication. In: 2018 IEEE International Conference on Software Quality, Reliability and Security Companion (QRS-C), pp. 139–145. IEEE (2018)
7. Kang, J., et al.: Communication-efficient and cross-chain empowered federated learning for artificial intelligence of things. IEEE Trans. Netw. Sci. Eng. 9(5), 2966–2977 (2022)
8. Li, B., He, Q., Chen, F., Jin, H., Xiang, Y., Yang, Y.: Inspecting edge data integrity with aggregate signature in distributed edge computing environment. IEEE Trans. Cloud Comput. 10(4), 2691–2703 (2021)
9. Li, B., He, Q., Yuan, L., Chen, F., Lyu, L., Yang, Y.: EdgeWatch: collaborative investigation of data integrity at the edge based on blockchain. In: Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, pp. 3208–3218 (2022)
10. Li, J., Zhang, L., Liu, J.K., Qian, H., Dong, Z.: Privacy-preserving public auditing protocol for low-performance end devices in cloud. IEEE Trans. Inf. Forensics Secur. 11(11), 2572–2583 (2016)
11. Liu, J., Huang, K., Rong, H., Wang, H., Xian, M.: Privacy-preserving public auditing for regenerating-code-based cloud storage. IEEE Trans. Inf. Forensics Secur. 10(7), 1513–1528 (2015)
12. Menezes, A.: An introduction to pairing-based cryptography. Recent Trends Crypt. 477, 47–65 (2009)


13. Micali, S., Rabin, M., Vadhan, S.: Verifiable random functions. In: 40th Annual Symposium on Foundations of Computer Science (Cat. No. 99CB37039), pp. 120–130. IEEE (1999)
14. Nakamoto, S.: Bitcoin: a peer-to-peer electronic cash system. Decentralized Bus. Rev. 21260 (2008)
15. Qiao, L., Li, Y., Wang, F., Yang, B.: Lightweight integrity auditing of edge data for distributed edge computing scenarios. Ad Hoc Netw. 133, 102906 (2022)
16. Wood, G.: Polkadot: vision for a heterogeneous multi-chain framework. White paper 21(2327), 4662 (2016)
17. Xiao, X., Yu, Z., Xie, K., Guo, S., Xiong, A., Yan, Y.: A multi-blockchain architecture supporting cross-blockchain communication. In: Sun, X., Wang, J., Bertino, E. (eds.) ICAIS 2020. CCIS, vol. 1253, pp. 592–603. Springer, Singapore (2020). https://doi.org/10.1007/978-981-15-8086-4_56
18. Yaga, D., Mell, P., Roby, N., Scarfone, K.: Blockchain technology overview. arXiv preprint arXiv:1906.11078 (2019)
19. Yu, Y., et al.: Identity-based remote data integrity checking with perfect data privacy preserving for cloud storage. IEEE Trans. Inf. Forensics Secur. 12(4), 767–778 (2016)

Inter-modal Fusion Network with Graph Structure Preserving for Fake News Detection

Jing Liu1, Fei Wu1(B), Hao Jin1, Xiaoke Zhu2, and Xiao-Yuan Jing1,3

1 Nanjing University of Posts and Telecommunications, Nanjing 210023, China
[email protected], [email protected]
2 Henan University, Kaifeng 475001, China
[email protected]
3 Wuhan University, Wuhan 430072, China

Abstract. The continued ferment of fake news on the network threatens the stability and security of society, prompting researchers to focus on fake news detection. The development of social media has made it challenging to detect fake news by only using uni-modal information. Existing studies tend to integrate multi-modal information to pursue completeness for information mining. How to eliminate modality differences effectively while capturing structure information well from multi-modal data remains a challenging issue. To solve this problem, we propose an Inter-modal Fusion network with Graph Structure Preserving (IF-GSP) approach for fake news detection. An inter-modal cross-layer fusion module is designed to bridge the modality differences by integrating features in different layers between modalities. Intra-modal and cross-modal contrastive losses are designed to enhance the inter-modal semantic similarity while focusing on modal-specific discriminative representation learning. A graph structure preserving module is designed to make the learned features fully perceive the graph structure information based on a graph convolutional network (GCN). A multi-modal fusion module utilizes an attention mechanism to adaptively integrate cross-modal feature representations. Experiments on two widely used datasets show that IF-GSP outperforms related multi-modal fake news detection methods.

Keywords: Contrastive Learning · Fake News Detection · Graph Convolutional Network · Multi-modal Fusion

1 Introduction

Although social media provides us with easier access to information, we are inevitably caught in the dilemma of the widespread dissemination of fake news. The spread of fake news may lead to innocent readers being deceived, exploited, and even result in incalculable consequences.
It is extremely costly and time-consuming to identify fake news manually, so automated fake news detection technology is in urgent need of attention. Earlier fake news detection technology focused on using text messages to detect fake news. Ma et al. [14] employed recurrent neural networks (RNNs) to capture useful information for fake news detection. Yu et al. [24] adopted convolutional neural networks (CNNs) to extract features and model high-level interactions among features to realize fake news detection. With the rapid advancement of social media, researchers began to leverage image information to identify fake news. Jin et al. [9] believed that images can be valuable in exposing fake news and explored visual and statistical features of images for fake news detection. Making up indistinguishable fake news tends to rely on using both textual and visual information. Although uni-modal methods can provide discriminative information about fake news, they still suffer from the issue of insufficient information. In contrast, multiple modalities can provide correlation and complementarity information for the fake news detection task more comprehensively. MVAE [11] introduces a multi-modal variational auto-encoder (VAE) to learn a unified representation of visual and textual information to conduct fake news detection. Moreover, an increasing number of fake news detection methods [5] tend to take multi-modal data into account at the same time.

How to effectively reduce the inter-modal heterogeneity and preserve the modal-specific semantic integrity as much as possible is the key to the multi-modal fake news detection task. Although many multi-modal fake news detection models have been proposed, how to eliminate modality differences effectively while capturing structure information well has not been well studied. To address this issue, in this paper we propose a novel fake news detection approach named inter-modal fusion network with graph structure preserving, which adequately explores and utilizes the information in the image and text modalities. To reduce the modality differences, the inter-modal cross-layer fusion (ICF) module performs interaction between modalities and adopts contrastive learning to make cross-modal and intra-modal semantically similar features more compact. The graph structure preserving (GSP) module reconstructs multi-modal features and integrates structure information to ensure the integrity of information mining and facilitate the sufficiency of information utilization. The multi-modal fusion (MMF) module aims to adaptively assign weights to the text and image modalities to obtain discriminative features. The contributions of this paper are as follows:
• Focusing jointly on the inter-modal difference reduction issue and structure information capture for the fake news detection task, we propose an Inter-modal Fusion network with Graph Structure Preserving (IF-GSP) for fake news detection.
• IF-GSP eliminates the differences between modalities via the inter-modal cross-layer fusion module. The graph structure preserving module embeds the structure information of each modality into the learned features to enhance their discriminative ability. Finally, the multi-modal feature representations
are aggregated by adaptively learning optimal weights for the image and text modalities.
• The experimental results on Twitter [2] and Weibo [8] demonstrate that our model outperforms previous works and achieves state-of-the-art performance on both datasets.

2 Related Works

2.1 Fake News Detection

Uni-modal Fake News Detection. Based on deep attention, CAR [4] utilizes an RNN to capture textual features to detect fake news. Qian et al. [16] used a CNN to extract textual features at the sentence and word levels and analyzed user responses to generate responses to new articles to assist fake news detection. MVNN [15] exploits multi-domain visual information by dynamically merging the information of the frequency and pixel domains for fake news detection.

Multi-modal Fake News Detection. Jin et al. [8] proposed att-RNN, which utilizes a long short-term memory (LSTM) network to learn the joint representation of social contexts and texts to detect fake news. Wang et al. [21] argued that event-invariant features are beneficial for fake news detection and proposed a model that focuses on exploring transferable features while removing non-transferable event-specific features. MCAN [22] stacks multiple co-attention layers to learn inter-dependencies between textual and visual features to fuse multi-modal features. Chen et al. [5] proposed CAFE, which aligns features between the image and text modalities, learns the cross-modal ambiguity by calculating the Kullback-Leibler (KL) divergence between the distributions of the uni-modal features, and finally aggregates multi-modal features to conduct fake news detection. MKEMN [25] focuses on multi-modal information and utilizes conceptual knowledge as a supplement to enhance the fake news detection effect. Zhou et al. [26] designed SAFE, which transforms the image into text via a pre-trained model and then measures the similarity between them to detect fake news. MCNN [23] concentrates on the consistency of multi-modal data and captures the similarity of multi-modal information for fake news detection. However, these methods fail to jointly take the issues of inter-modal difference reduction and structure information exploration into account adequately. In this paper, we fuse multi-modal features by modeling the inter-modal interaction under the constraint of structure information embedding.

2.2 Graph Neural Networks

With the popularity of graph neural networks (GNNs) [17], more and more GNN methods have been proposed. For example, the graph convolutional network (GCN) [13] automatically captures node features by aggregating information from neighboring nodes. The graph sample and aggregate (GraphSAGE) [6]


predicts the labels of unlabeled nodes by sampling and aggregating information from neighboring nodes. The graph attention network (GAT) [19] assigns varying weights to different nodes in a neighborhood by stacking graph attentional layers, allowing nodes to take neighbor information into account. In recent years, GNNs have been successfully introduced into the fake news detection task. Based on GNNs, Vaibhav et al. [18] modeled the relationship between sentences in news articles to perform fake news detection. Bian et al. [1] leveraged a bi-directional GCN to model the propagation and dispersion of rumors. In this paper, we focus on utilizing the strong representation ability of the GCN to make features learn the structure information of the text and image modalities, which is regarded as a supplement to the semantic information.

2.3 Contrastive Learning

Contrastive learning constrains the feature learning process by maximizing the similarity between the anchor and positive samples and minimizing the similarity between the anchor and negative samples. Many contrastive learning methods have achieved significant success. MOCO [7] regards contrastive learning as a dictionary query task. SimCLR [3] uses a contrastive loss to learn representations after a composition of data augmentations. Khosla et al. [12] designed a supervised contrastive loss that effectively utilizes label information. Recently, contrastive learning has been applied to the fake news detection task. Wang et al. [20] aimed to achieve more accurate image-text alignment by using contrastive learning for fake news detection. In this paper, we jointly employ intra-modal and cross-modal contrastive losses to improve the semantic similarity within each modality and across modalities, and to disperse semantically dissimilar features within and between modalities, respectively.

3 Our Approach

In this paper, a news post contains information from the text and image modalities. We utilize N news posts as the training set, where the image modality is denoted as I = {I_1, ..., I_N} and the text modality is denoted as T = {T_1, ..., T_N}. Y = {y_1, ..., y_N} is the label matrix corresponding to D = {I, T}, where y_i ∈ {0, 1}; specifically, 0 represents real news and 1 represents fake news. We propose an end-to-end learning approach, IF-GSP, which aims to learn feature representations with strong discriminability for fake news detection. IF-GSP consists of three main components: (1) to mitigate modality differences, an inter-modal cross-layer fusion module is designed to integrate cross-modal features and perform contrastive learning within and between modalities to enhance the discriminative ability of features; (2) to ensure the integrity of multi-modal information utilization, a graph structure preserving module is designed to make modal-specific features preserve the graph structure information, which acts as supplementary information with respect to the discriminative information of features;


(3) a multi-modal fusion module is designed to learn the optimal weight for each modality to integrate cross-modal features. The overall architecture of IF-GSP is shown in Fig. 1.

Fig. 1. The framework structure of IF-GSP.

For texts, we use BERT [10], a popular pre-trained language model built on the Transformer and trained with unsupervised learning on a large corpus, to encode the text features T and obtain X_T. For images, we utilize ViT-B/32 to encode the image features I into X_I.
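A minimal sketch of this feature extraction step is given below. The paper does not specify the exact checkpoints or pooling, so the HuggingFace bert-base-uncased model, torchvision's vit_b_32 weights, and the [CLS]-style pooling are illustrative assumptions.

```python
import torch
from transformers import AutoTokenizer, AutoModel
from torchvision.models import vit_b_32

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
bert = AutoModel.from_pretrained("bert-base-uncased").eval()
vit = vit_b_32(weights="DEFAULT").eval()
vit.heads = torch.nn.Identity()            # keep the 768-d token embedding instead of class logits

texts = ["a breaking news headline", "another post"]
images = torch.randn(2, 3, 224, 224)       # stand-in for preprocessed images I

with torch.no_grad():
    tokens = tokenizer(texts, padding=True, truncation=True, return_tensors="pt")
    x_t = bert(**tokens).last_hidden_state[:, 0]   # X_T: [CLS] embeddings, shape (N, 768)
    x_i = vit(images)                              # X_I: image embeddings, shape (N, 768)
print(x_t.shape, x_i.shape)
```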

3.1 Inter-modal Cross-Layer Fusion (ICF) Module

In order to deal with modality differences, we design an inter-modal cross-layer fusion module which consists of a two-layer image encoder and a two-layer text encoder to perform feature interaction. The inter-modal heterogeneity issue can be alleviated by promoting information exchange between different modalities across layers. Specifically, we fuse the output of the second layer of the image encoder with the output of the first layer of the text encoder to obtain F_I with the concatenation (cat) operation, and take the analogous operation to obtain F_T. Then, we adopt fully-connected networks to further extract features and obtain more discriminative feature representations Z_I = {z_I^1, ..., z_I^N} and Z_T = {z_T^1, ..., z_T^N}, respectively. Although the inter-modal cross-layer fusion part eases the modality difference issue, inter-modal differences still exist. For this reason, we further design a cross-modal and intra-modal contrastive learning part to supervise the learning of features for the image and text modalities.


The cross-modal contrastive loss is implemented to focus on the consistency between modalities: it minimizes the distance between semantically similar features and maximizes the distance between dissimilar ones across modalities. The intra-modal contrastive loss is designed to consider intra-modal similarity in each modality, so that within-class features have larger similarity while different-class features have less similarity. Utilizing contrastive losses from both the intra-modal and inter-modal perspectives improves the discriminative ability of the feature representations and explores the consistency of cross-modal information, thereby helping to reduce the semantic gap between modalities. The cross-modal contrastive loss is defined as follows:

L_{CCL} = Σ_{k∈K} (−1/|P(k)|) Σ_{p∈P(k)} log [ exp(z_C^k · z_C^p / τ) / Σ_{a∈A(k)} exp(z_C^k · z_C^a / τ) ]    (1)

where k ∈ K ≡ {1, ..., 2N} denotes the index of a feature representation within Z_C = {Z_I, Z_T} = {z_C^1, ..., z_C^{2N}} and A(k) ≡ K \ {k}. S = {s_1, ..., s_{2N}} is the label set of Z_C, and P(k) ≡ {p ∈ A(k) : s_p = s_k} is the set of indices of all positive feature representations for the anchor z_C^k.

Taking the image modality as an example, the intra-modal contrastive loss is defined as follows:

L_{ICL_I} = Σ_{j∈J} (−1/|P(j)|) Σ_{p∈P(j)} log [ exp(z_I^j · z_I^p / τ) / Σ_{a∈A(j)} exp(z_I^j · z_I^a / τ) ]    (2)

where j ∈ J ≡ {1, ..., N} denotes the index of a feature representation within Z_I = {z_I^1, ..., z_I^N} and A(j) ≡ J \ {j}. Y = {y_1, ..., y_N} is the label set of Z_I, and P(j) ≡ {p ∈ A(j) : y_p = y_j} is the set of indices of all positive feature representations for the anchor z_I^j. The intra-modal contrastive loss of the text modality, L_{ICL_T}, is obtained similarly.
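A compact PyTorch sketch of the supervised-contrastive-style losses of Eqs. (1) and (2) is given below. Feature normalization, the finite mask value, and averaging over anchors are implementation assumptions not specified in the paper.

```python
import torch
import torch.nn.functional as F

def sup_contrastive_loss(z: torch.Tensor, labels: torch.Tensor, tau: float = 0.07) -> torch.Tensor:
    """z: (M, d) feature matrix; labels: (M,). Anchors with no positives are skipped."""
    z = F.normalize(z, dim=1)
    sim = z @ z.t() / tau                                    # pairwise similarities z_k . z_a / tau
    self_mask = torch.eye(len(z), dtype=torch.bool, device=z.device)
    sim = sim.masked_fill(self_mask, -1e9)                   # exclude the anchor itself (A(k) = K\{k})
    log_prob = sim - torch.logsumexp(sim, dim=1, keepdim=True)
    pos_mask = (labels.unsqueeze(0) == labels.unsqueeze(1)) & ~self_mask
    n_pos = pos_mask.sum(dim=1).clamp(min=1)
    loss_per_anchor = -(log_prob * pos_mask).sum(dim=1) / n_pos
    return loss_per_anchor[pos_mask.any(dim=1)].mean()

z_i, z_t = torch.randn(8, 128), torch.randn(8, 128)          # Z_I and Z_T for a mini-batch
y = torch.randint(0, 2, (8,))

l_ccl = sup_contrastive_loss(torch.cat([z_i, z_t]), torch.cat([y, y]))   # Eq. (1), 2N anchors
l_icl_i = sup_contrastive_loss(z_i, y)                                   # Eq. (2)
l_icl_t = sup_contrastive_loss(z_t, y)
```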

3.2 Graph Structure Preserving (GSP) Module

To preserve as much complete semantic information as possible, we utilize a GNN with shared parameters to perform feature learning for each modality while considering the multi-modal correlation. To achieve a unified feature dimension across modalities, we first map X_I and X_T to a common latent space via auto-encoders. The corresponding outputs of the image and text encoders, Q_I and Q_T, are used to perform further discriminative feature learning. We constrain the auto-encoders with a reconstruction loss, which encourages each auto-encoder to preserve the semantic information of the feature vector as much as possible when performing the latent space projection. The reconstruction loss based on the mean squared error (MSE) is defined as follows:

L_R = MSE(X_I, R_I) + MSE(X_T, R_T)    (3)


where R_I and R_T are the outputs of the corresponding decoders. Taking the image modality as an example, G_I(ν_I, ε_I) denotes the graph of size N with nodes Q_I^i ∈ ν_I and edges (Q_I^i, Q_I^j) ∈ ε_I. Based on the label information, we define the adjacency matrix as follows:

G_I^{ij} = 1 if y_i = y_j, and G_I^{ij} = 0 if y_i ≠ y_j.    (4)

The graph convolution process for each layer is defined as follows:

O^{(l)} = ReLU(D^{−1/2} G_I D^{−1/2} O^{(l−1)} W^{(l)})    (5)

where D_{ii} = Σ_j G_I^{ij} and W^{(l)} is the convolution filter of the l-th layer. O^{(l−1)} and O^{(l)} denote the corresponding input and output of the GCN, respectively. Q_I is the input of the first layer of the GCN, and the output of the last layer is denoted as H_I. For the text modality, we obtain H_T. We design the graph structure preserving loss to help the GCN guide the features to memorize structure information, which can be viewed as a supplement to the semantic information. By pulling Z_I and H_I (and Z_T and H_T) closer together, the graph structure information is preserved at the level of Z_I and Z_T, which helps to represent and utilize features more comprehensively. The graph structure preserving loss is defined as follows:

L_G = MSE(Z_I, H_I) + MSE(Z_T, H_T)    (6)
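The sketch below illustrates the GSP computations for the image modality: the label-based adjacency of Eq. (4), a normalized graph convolution layer as in Eq. (5), and the losses of Eqs. (3) and (6). The layer sizes, the two-layer depth, and the random placeholders are illustrative assumptions.

```python
import torch
import torch.nn.functional as F

def label_adjacency(y: torch.Tensor) -> torch.Tensor:
    """G^{ij} = 1 if y_i == y_j else 0 (Eq. 4)."""
    return (y.unsqueeze(0) == y.unsqueeze(1)).float()

def gcn_layer(adj: torch.Tensor, feats: torch.Tensor, weight: torch.Tensor) -> torch.Tensor:
    """O = ReLU(D^{-1/2} G D^{-1/2} O W) (Eq. 5)."""
    d_inv_sqrt = torch.diag(adj.sum(dim=1).clamp(min=1e-8).pow(-0.5))
    return F.relu(d_inv_sqrt @ adj @ d_inv_sqrt @ feats @ weight)

N, d = 8, 64
y = torch.randint(0, 2, (N,))
q_i = torch.randn(N, d)                               # Q_I: latent image features from the auto-encoder
w1, w2 = torch.randn(d, d), torch.randn(d, d)

adj = label_adjacency(y)
h_i = gcn_layer(adj, gcn_layer(adj, q_i, w1), w2)     # H_I: output of a two-layer GCN

x_i, r_i = torch.randn(N, 512), torch.randn(N, 512)   # X_I and its reconstruction R_I (placeholders)
z_i = torch.randn(N, d)                               # Z_I from the ICF module

l_r = F.mse_loss(x_i, r_i)                            # image part of Eq. (3)
l_g = F.mse_loss(z_i, h_i)                            # image part of Eq. (6)
```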

3.3 Multi-modal Fusion (MMF) Module

Considering that different modalities play different roles in the decision-making process, we design an attention-based fusion network to adaptively fuse Z_I and Z_T. In more detail, the attention-based fusion network utilizes an attention function FC(·; θ_C), i.e., a fully-connected layer activated by the Sigmoid function and parameterized by θ_C, to obtain the attention coefficients of Z_I and Z_T with Eq. (7), denoted as A_I and A_T:

A_I = FC(Z_I; θ_C),  A_T = FC(Z_T; θ_C)    (7)

N  i   i  1  yi log P rC + (1 − yi ) log 1 − P rC N i=1

(9)

Inter-modal Fusion Network with Graph Structure Preserving

287

 i i is the predicted label probability of rC where P rC . According to the Eqs. (1), (2), (3), (6) and (9), the total loss is defined as follows: (10) LT OT AL = LCCL + LICLI + LICLT + LG + LC + LR

4 4.1

Experiment Datasets

We use two widely used datasets which are described in detail as follows. For the Twitter dataset which was released for MediaEval Verifying Multimedia Use task, we follow [5] to process the Twitter dataset. For the Weibo dataset which has been widely used in prior multimodal fake news detection works, the real samples were collected from Xinhua News Agency, an authoritative news source of China. The fake samples were gathered by crawling the official fake news debunking system of Weibo over a time span from May 2012 to January 2016. The details of both datasets are presented in Table 1. Table 1. The detailed information of dataset. Statistics

4.2

Twitter Weibo

Fake news in training set 5,007

3,749

Real news in training set 6,840

3,783

News in test set

1,996

1,406

Baseline

To evaluate the effectiveness of IF-GSP, we compare the performance of IF-GSP with following baselines, i.e., att-RNN [8], EANN [21], MVAE [11], MKEMN [25], SAFE [26], MVNN [23] and CAFE [5]. To perform a fair comparison, for att-RNN, we remove social information, and for EANN, we remove the event discriminator component. 4.3

Implementation Details

The evaluation metrics include Accuracy (ACC), Precision, Recall, and F1-score (F1). We use the batch size of 128 and train the model using SGD with an initial learning rate of 0.001 for 100 epochs. Following [12], we set the temperature factor τ in Eqs. (1) and (2) as 0.07. All codes are implemented with PyTorch and run on a computer with NVIDIA GeForce GTX 1080Ti GPU card. Our IF-GSP requires training time about 3 s per epoch on Twitter, and around 2 s per epoch on Weibo.

288

J. Liu et al.

Table 2. The result comparison between IF-GSP and other baseline methods. Method

4.4

ACC

Fake news P R

F1

Real News P R

F1

Twitter att-RNN EANN MVAE MKEMN SAFE MCNN CAFE IF-GSP

0.664 0.648 0.745 0.715 0.762 0.784 0.806 0.899

0.749 0.810 0.801 0.814 0.831 0.778 0.807 0.998

0.615 0.498 0.719 0.756 0.724 0.781 0.799 0.803

0.676 0.617 0.758 0.708 0.774 0.779 0.803 0.890

0.589 0.584 0.689 0.634 0.695 0.790 0.805 0.830

0.728 0.759 0.777 0.774 0.811 0.787 0.813 0.999

0.651 0.660 0.730 0.660 0.748 0.788 0.809 0.906

Weibo

0.772 0.795 0.824 0.814 0.816 0.823 0.840 0.905

0.854 0.806 0.854 0.823 0.818 0.858 0.855 0.895

0.656 0.795 0.769 0.799 0.815 0.801 0.830 0.929

0.742 0.800 0.809 0.812 0.817 0.828 0.842 0.912

0.720 0.752 0.802 0.723 0.816 0.787 0.825 0.918

0.889 0.793 0.875 0.819 0.818 0.848 0.851 0.879

0.795 0.804 0.837 0.798 0.817 0.816 0.837 0.898

att-RNN EANN MVAE MKEMN SAFE MCNN CAFE IF-GSP

Results

Table 2 presents the experimental results of IF-GSP compared to the baselines. The best values are marked in bold. In terms of ACC and F1, IF-GSP outperforms all compared methods on each dataset. Specifically, IF-GSP achieves the highest accuracy of 0.899 and 0.905 on Twitter and Weibo, respectively. And for fake news, it achieves the highest F1 of 0.890 and 0.912 on Twitter and Weibo, respectively. IF-GSP outperforms compared state-of-the-art methods on Twitter and Weibo by a large margin. It is because we can fully utilize the complementarity and diversity among text and image modalities and design an interactive manner to bridge the inter-modal semantic gap. ICF module performs feature interaction by integrating different-layer features in different modalities and uses contrastive learning to enhance the similarity of information within modality and the correlation between modalities. Moreover, we believe the graph structure information can provide another perspective as supplement to explore information. The GSP module utilizes GCN to make features learn the structure information, which is conducive to ensure semantic integrity of multi-modal feature.

Inter-modal Fusion Network with Graph Structure Preserving

4.5

289

Ablation Experiments

To verify the effectiveness of important components of IF-GSP, we perform an ablation study and the results are summarized in Table 3. We divide IF-GSP into four variants to demonstrate the importance of each module. • IF-GSP-1: The version of IF-GSP that removes the inter-layer cross-layer fusion part in the ICF module. • IF-GSP-2: The version of IF-GSP that removes cross-modal and intra-modal contrastive learning part in the ICF module. • IF-GSP-3: The version of IF-GSP that removes the GSP module. • IF-GSP-4: The version of IF-GSP that replaces the MMF module by using direct fusion manner with the cat operation. Table 3. The result of ablation experiments. Method

ACC

Fake news P R

F1

Real News P R

F1

Twitter IF-GSP-1 IF-GSP-2 IF-GSP-3 IF-GSP-4 IF-GSP

0.858 0.895 0.890 0.888 0.899

0.867 1.00 0.981 0.983 0.998

0.852 0.794 0.799 0.794 0.803

0.859 0.885 0.881 0.878 0.890

0.849 0.823 0.825 0.821 0.830

0.864 1.00 0.984 0.985 0.999

0.856 0.903 0.897 0.896 0.906

Weibo

0.883 0.850 0.897 0.895 0.905

0.890 0.841 0.894 0.919 0.895

0.887 0.882 0.911 0.878 0.929

0.888 0.861 0.903 0.898 0.912

0.875 0.862 0.900 0.871 0.918

0.879 0.814 0.880 0.914 0.879

0.877 0.837 0.890 0.892 0.898

IF-GSP-1 IF-GSP-2 IF-GSP-3 IF-GSP-4 IF-GSP

From the results in Table 3 where the best values of evaluation metrics are marked in bold, we can obtain the following conclusions: (1) inter-modal cross-layer fusion module performs feature interaction between modalities by integrating different-layer features between text and image modalities to effectively reduce modality differences. (2) Cross-modal contrastive loss and intramodal contrastive loss are beneficial to enhance discrimination ability of features. Intra-modal contrastive learning emphasizes the similarity within modality, while cross-modal contrastive learning models the correlation between image and text modalities, increasing the similarity of multi-modal within-class features and distinguishing dissimilar features from different modalities. (3) The graph structure preserving module leverages the representation learning capability of the graph network and makes neighbor information as an auxiliary information with respect to semantic information, which can fully explore and utilize the useful information in the multiple modalities. (4) The multi-modal fusion module takes the importance of different modalities into account with leveraging attention mechanism to assign optimal weights to each modality.

290

5

J. Liu et al.

Conclusion

In this paper, we have proposed a novel fake news detection approach named inter-modal fusion network with graph structure preserving. IF-GSP reduces modality differences by performing inter-modal cross-layer fusion. With the graph structure information preserving being taken into consideration, it performs feature reconstruction and graph learning, such that the graph structure information is embedded to make features have strong discriminative ability. The multi-modal fusion module provides an effective scheme for feature fusion, which makes the complementarity between image and text modalities fully explored. Experimental studies on two widely used datasets, i.e., Twitter and Weibo, show that IF-GSP is effective and outperforms the state-of-the-art models. Ablation experiments also show that each module of IF-GSP has its own significance. Acknowledgement. This work was supported by the National Natural Science Foundation of China (No. 62076139), 1311 Talent Program of Nanjing University of Posts and Telecommunications, and Postgraduate Research & Practice Innovation Program of Jiangsu Province (No. SJCX22_0289).


Learning to Match Features with Geometry-Aware Pooling

Jiaxin Deng1,2, Xu Yang1,2, and Suiwu Zheng1,3(B)

1 State Key Laboratory of Multimodal Artificial Intelligence, Institute of Automation, Chinese Academy of Sciences, Beijing 100190, China
{dengjiaxin2021,xu.yang}@ia.ac.cn
2 University of Chinese Academy of Sciences, Beijing 100190, China
3 Huizhou Zhongke Advanced Manufacturing Limited Company, Huizhou 516000, China
[email protected]

Abstract. Finding reliable and robust correspondences across images is a fundamental and crucial step for many computer vision tasks, such as 3D reconstruction and virtual reality. However, previous studies still struggle in challenging cases, including large view changes, repetitive patterns and textureless regions, due to the neglect of geometric constraints in the process of feature encoding. Accordingly, we propose a novel GPMatcher, which is designed to introduce geometric constraints and guidance in the feature encoding process. To achieve this goal, we compute camera poses with the corresponding features in each attention layer and adopt a geometry-aware pooling to reduce the redundant information in the next layer. By these means, an iterative geometry-aware pooling and pose estimation pipeline is constructed, which avoids the updating of redundant features and reduces the impact of noise. Experiments conducted on a range of evaluation benchmarks demonstrate that our method improves the matching accuracy and achieves state-of-the-art performance.

Keywords: Image matching · Pose estimation · Transformer

1 Introduction

Accurate and robust correspondences for local features between image pairs are essential and crucial for many geometric computer vision tasks, including structure from motion (SfM) [1,2], simultaneous localization and mapping (SLAM) [3,4], and visual localization [5]. Traditional methods usually involve a classical pipeline, where correspondences are established by matching detected and described keypoints, and then outlier estimators, e.g. RANSAC [6,7], are employed to find the final matches. However, these methods tend to struggle in challenging cases such as large viewpoint variations, changing appearances and textureless regions, due to the lack of enough repetitive keypoints and context information.


To mitigate this problem, advanced matchers such as SuperGlue [8] propose to use an attention-based graph neural network, which utilizes global information from all keypoints, to implement contextual descriptor augmentation. Yet, their excellent matching performance comes at the cost of the quadratic time complexity of the attention computation, which reduces efficiency in real applications. While some following works introduce more efficient variations [9,10], they suffer from less accurate and robust performance. Some other studies, including LoFTR [11], ASpanFormer [12], etc., focus on the detector-free image matching pipeline with transformers, since the performance of previous matching methods depends heavily on the stage of keypoint detection and description, which struggles in challenging cases. These methods have obtained remarkable performance, especially in textureless regions. However, the flattening operations in transformers destroy the inherent 2D structure of the images. The consistency of corresponding features and geometric constraints are not considered, causing failures in cases with large-scale viewpoint changes, as these methods sink into local similarities and fail to find the most crucial correspondences.

In this paper, we introduce a geometry-aware matching framework based on transformers, which aims to find the dominant features that are able to give accurate matches and to exclude uninformative noise based on geometric constraints. Inspired by the previous method LoFTR [11], we use self and cross attention to collect global information and update the feature maps. Based on the observation that the attention scores in each layer represent the magnitude of correlation with other features, we can easily distinguish the features that carry key messages. The central challenge of our framework is to combine the geometric constraints with the attention-based module; we solve this problem by computing the pose layer by layer, which guides the pooling operation. We summarize our contributions in three aspects:
1) We propose a framework that combines pose estimation and feature augmentation for feature matching, which enables geometric consistency in the process of feature encoding.
2) A novel geometry-aware pooling operation is proposed, which avoids unnecessary feature updates and reduces noise interference.
3) State-of-the-art results are achieved on an extensive set of benchmarks. Our method outperforms the current matching pipelines on the benchmarks of homography estimation and outdoor and indoor relative pose estimation. Evaluation on challenging visual localization also proves the competitiveness of our method.

2 Related Works

Local Feature Matching. The traditional matching pipeline usually involves detecting [13–16], describing and matching a set of keypoints. Heuristic matching methods, such as NN matching and its variants [17,18], have long been widely used to find correspondences. Despite their fast performance, these approaches usually suffer from large appearance changes. To address this issue, SuperGlue [8] introduces transformers with self and cross attention to utilize contextual information and achieves impressive performance. However, the quadratic


complexity of the attention operation reduces efficiency. Some variants [9,10] attempt to improve efficiency by optimizing the structure, but they sacrifice accuracy to a certain extent. Another shortcoming of these detector-based methods is that their performance depends heavily on the features of the detector, which cannot guarantee robust and reliable performance in challenging cases. To capture richer contexts in extreme conditions, such as low-texture areas and repetitive patterns, the detector-free methods [11,12,19,20] remove the detection stage and conduct the matching task end-to-end. Due to the strong ability of feature encoding and interaction with transformers, this series of methods generally outperforms the detector-based methods above. However, due to the lack of geometric constraints, they still fail to handle cases with large viewpoint changes.

Efficient Attention. Numerous approaches have been proposed to address the quadratic time complexity of the attention mechanism [21]. These methods aim to reduce the complexity by incorporating various techniques such as learning a linear projection function [22,23], employing a token selection classifier [24,25], or utilizing shared attention [26], among others. Since these methods are usually designed for specific downstream tasks, transferring them directly to the feature correspondence task is not feasible. In contrast, our approach combines the geometric constraints with efficient attention and adjusts the pooling operation adaptively.

3 The Proposed Method

The pipeline of our network structure is shown in Fig. 1. Our framework processes in a coarse-to-fine manner. Taking an image pair $I_A, I_B$ as input, the network first uses a CNN backbone to extract initial coarse features $F_A^0, F_B^0 \in \mathbb{R}^{\frac{H}{8} \times \frac{W}{8}}$ and fine feature maps $\hat{F}_A^0, \hat{F}_B^0 \in \mathbb{R}^{\frac{H}{2} \times \frac{W}{2}}$. The coarse maps are passed through our geometry-aware attention model to establish coarse matches, while the fine features are used to refine the final result.

3.1 Feature Encoder

CNN is widely used for local feature extraction due to its inductive bias of translation equivariance [13–16]. We use a convolutional neural network as the feature encoder, which extracts both coarse features and fine features for each image separately. Specifically, we extract features at 1/8 of the original image resolution as coarse features and at 1/2 of the original image resolution as fine features [11]. Compared to using the raw images directly, this design reduces the number of features to be processed.
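As a rough, unofficial illustration of this design (channel widths and the exact backbone are our own assumptions, not the paper's specification), a PyTorch encoder returning 1/8-resolution coarse features and 1/2-resolution fine features could look like this:

```python
import torch
import torch.nn as nn

class SimpleFeatureEncoder(nn.Module):
    """Toy CNN that returns coarse (1/8) and fine (1/2) feature maps."""
    def __init__(self, dim_fine=64, dim_coarse=256):
        super().__init__()
        # each stride-2 convolution halves the spatial resolution
        self.stem = nn.Sequential(                       # -> 1/2 resolution
            nn.Conv2d(1, dim_fine, 3, stride=2, padding=1), nn.ReLU())
        self.down = nn.Sequential(                       # -> 1/8 resolution
            nn.Conv2d(dim_fine, 128, 3, stride=2, padding=1), nn.ReLU(),
            nn.Conv2d(128, dim_coarse, 3, stride=2, padding=1), nn.ReLU())

    def forward(self, img):                              # img: (B, 1, H, W)
        fine = self.stem(img)                            # (B, 64, H/2, W/2)
        coarse = self.down(fine)                         # (B, 256, H/8, W/8)
        return coarse, fine
```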

3.2 Geometry-Aware Attention Module

Positional Encoding. As in previous methods, we first use a multi-layer perceptron (MLP), denoted as $f_{enc}$, so that the features become position-dependent.


Fig. 1. Pipeline of our method. As features become more discriminative, more correct matches can be found, leading to more precise poses. The pose is utilized to provide geometric guidance to find more matches and discard redundant keypoints with geometry-aware pooling. 

Specifically, for each feature $f_i$ in the feature map, the output is $f_i = f_{enc}(f_i, p_i)$, where $p_i$ denotes the position coordinate of $f_i$. For simplicity, we denote the position-encoded coarse feature maps as $F_A^0, F_B^0$ hereafter, corresponding to the image pair $I_A, I_B$ respectively.

Feature Augmentation. Following the previous works [8,11], the feature maps are fed into the self and cross attention module for feature augmentation. In the Vision Transformer, attention operates on the set of query (denoted as $Q$), key (denoted as $K$) and value (denoted as $V$). For the feature correspondence task, taking the flattened feature maps $F_A^i, F_B^i$ as input, the outputs $G_A^i, G_B^i$ can be calculated as:

$$G_A^i = \mathrm{FFN}\big(F_A^i,\ \mathrm{softmax}(F_A^i {F_A^i}^{T})F_A^i,\ \mathrm{softmax}(F_A^i {F_B^i}^{T})F_B^i\big) \qquad (1)$$

$$G_B^i = \mathrm{FFN}\big(F_B^i,\ \mathrm{softmax}(F_B^i {F_B^i}^{T})F_B^i,\ \mathrm{softmax}(F_B^i {F_A^i}^{T})F_A^i\big) \qquad (2)$$
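As a concrete but unofficial illustration of Eqs. (1)–(2), the sketch below implements one self/cross attention update for two flattened feature maps in PyTorch. The single attention head, the 1/√d scaling and the exact FFN layout are simplifying assumptions of ours, not details taken from the paper.

```python
import torch
import torch.nn as nn

def attend(query, context):
    # softmax(Q K^T) V with K = V = context; scaling by sqrt(d) is a standard choice
    scores = torch.softmax(query @ context.transpose(-1, -2) / query.shape[-1] ** 0.5, dim=-1)
    return scores @ context

class AugmentLayer(nn.Module):
    """One feature-augmentation step in the spirit of Eqs. (1)-(2)."""
    def __init__(self, dim=256):
        super().__init__()
        # FFN over the concatenated (original, self-message, cross-message) features
        self.ffn = nn.Sequential(nn.Linear(3 * dim, dim), nn.LayerNorm(dim),
                                 nn.ReLU(), nn.Linear(dim, dim))

    def forward(self, f_a, f_b):          # f_a: (N_a, d), f_b: (N_b, d)
        g_a = self.ffn(torch.cat([f_a, attend(f_a, f_a), attend(f_a, f_b)], dim=-1))
        g_b = self.ffn(torch.cat([f_b, attend(f_b, f_b), attend(f_b, f_a)], dim=-1))
        return g_a, g_b
```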

where FFN denotes a feed-forward network, which involves concatenation, layer normalization and linear layers.

Geometry-Aware Pooling. Redundant and uninformative features are ubiquitous in the attention layers; they not only increase computational costs but also introduce noise and interference. Moreover, local similarities caused by repetitive patterns can easily lead to inconsistency of the correspondences. To solve these problems, we use geometry-aware pooling, which deletes the features with less potential of correspondence and avoids numerous invalid calculations. Different from the matching pipeline in LoFTR [11], the pose is estimated in every attention-based layer, and it is used for screening out feature correspondences conforming to the geometric constraints. Specifically, the score matrix $S_i$ is computed from the augmented feature maps $G_A^i, G_B^i$ and scaled by the temperature factor $\frac{1}{\theta}$. Then a dual-softmax operator is used to obtain the matching scores $M_i$. The matches whose matching score is above the threshold $\theta_m$ are used to estimate the fundamental matrix $P_i$ in the $i$-th layer. Once the fundamental matrix $P_i$ is calculated, the pooling operation can be conducted under the guidance of geometric constraints. The goal of this design is to extract features with crucial information, which give an accurate and robust pose. Therefore, we choose features in two respects: 1) the matching features whose matching score is above the threshold $\theta_m$, denoted as $H_A^i$, $H_B^i$ respectively; 2) we use the self and cross attention score matrices (denoted as $R_S^i, R_C^i \in \mathbb{R}^{m \times n \times h}$) in the $i$-th layer to screen out the features with more complex relations. We choose the features whose average self and cross scores surpass the thresholds $\theta_s$ and $\theta_c$ respectively, denoted as $J_A^i$, $J_B^i$, as well as their corresponding features in the other image under the guidance of the fundamental matrix $P_i$, denoted as $K_A^i$, $K_B^i$. Finally, we get the sets of features after the operation of geometry-aware pooling, as

$$F_A^{i+1} = \{H_A^i \cup J_A^i \cup K_A^i\} \subseteq G_A^i \qquad (3)$$

$$F_B^{i+1} = \{H_B^i \cup J_B^i \cup K_B^i\} \subseteq G_B^i \qquad (4)$$

Matching Determination

We use the same scheme as LoFTR [11] to obtain final correspondences, including a coarse matching stage and a refinement stage. Similar to the process of matching score calculation, the coarse matching stage conducts a dual-softmax operation on the score matrix S, where S(x, y) = θ1 (θ is a temperature parameter). Once the coarse matches Mc are generated, they would be fed into the correlation-based refinement block to get the final matching results, which is the same as LoFTR [11]. 3.4

Loss Formulation

Our loss is composed of three aspects, including coarse-level loss Lc , fine-level loss Lf , as well as the pose-consistency loss Lp . Following LoFTR [11], The coarse-level loss Lc is the cross entropy loss of the dual-softmax score matrix P  1 Lc = − log(P (i, j)). (5) |Mgt | (i,j∈Mgt )

Learning to Match Features with Geometry-Aware Pooling

297

where Mgt is the ground truth matches. The fine-level loss Lf is supervised by the L2-distance between the final matches Mf (i, j) and ground truth reprojection coordinates, which is formulated in detail in LoFTR [11]. In each layer, we calculate the pose-consistency loss Lip , as Lip = α||P i − Pgt || +

1  gt fsampson (P i , agt k , bk ). Ngt

(6)

k

where P i , Pgt are predict fundamental matrix in the i-th layer and the ground gt truth fundamental matrix, (agt k , bk ) is a ground truth match and Ngt is the total numbers of matching pairs, fsampson is the Sampson distance which is formulated in detail in [26]. The final loss is calculated as L = Lc + Lf +

T 1 i L T i p

(7)

where T is the number of attention-based layers.

4

Experiments

In this section, we demonstrated the performance of our method on several donwnstream tasks, including homography estimation, pose estimation and visual localization. 4.1

Homography Estimation

Dataset. We evaluate our method on the HPatchses dataset [28], which is widely used for the evaluation of local descriptors and contains totally 108 sequences under large illumination and viewpoint changes. Baselines. We compare our methods with current state-of-the-art methods: 1) detector-based methods, including D2-Net [16], R2D2 [13], Patch2Pix [29], SuperGlue [8] and SGMNet [9] which are on top of SuperPoints [14]. 2) detectorfree methods, including DRC-Net [30], LoFTR [11], QuadTree [19] and Aspanformer [12]. Evaluation Metrics. Following [11], We compute the corner error between the images warped with estimated homograpy matrix and the ground truth homograpy matrix. Then we set the threshold of 3/5/10 to identify the correctness, and the area under the cumulative curve (AUC) of the corner error indicates the matching accuracy. Result. The results of homography estimation is shown in Table 1. The AUC of the corner error is demonstrated, and the best performance is in bold. As can be seen, the proposed method outperformed the existing stat-of-the-art methods on the threshold of 3/5 px, and achieves competitive performance on the threshold of 10 px.

298

J. Deng et al. Table 1. Results of homography estimation on the HPathes dataset Category

4.2

Method

Homography est. AUC(%) @3 px @5 px @10 px

Detector-based D2-Net [16]+NN R2D2 [13]+NN Patch2Pix [29] SP [14]+SuperGlue [8] SP [14]+SGMNet [9]

23.2 50.6 59.2 53.9 51.8

35.9 63.9 69.6 67.4 65.4

53.6 76.8 80.9 81.4 80.3

Detector-free

50.6 65.9 64.8 66.1 66.8

56.1 75.6 76.7 77.1 77.3

68.4 84.6 84.4 85.3 85.1

DRC-Net [30] LoFTR [11] QuadTree [19] Aspanformer [12] GPMatcher (ours)

Relative Pose Estimation

Dataset. We evaluate pose estimation on indoor dataset ScanNet [31] and outdoor dataset Megadepth [32] respectively. The ScanNet dataset [31]is an indoor 2D-3D dataset that collects 1513 indoor scenes, including rgb, depth, and three-dimensional point cloud data. We choose 1500 RGB image pairs for testing, which contains large viewpoint changes, extensive repetitive patterns, and abundant textureless regions. The Megdepth dataset [31] contains 1M Internet images of 196 outdoor scenes, each of which has been reconstructed by COLMAP. Similarly, we choose 1500 test pairs for the following evaluation. Evaluation Metrics. Following [11], We first compute the pose error, which is defined as the maximum of angular error in rotation and translation between the predict and ground truth. Then we calculate the AUC of the above error up to the threshold of 5◦ /10◦ /20◦ , which indicates the accuracy of pose estimation. Result. The results of indoor and outdoor relative pose estimation is shown in Tables 2 and 3, respectively. The AUC of the pose error is demonstrated, and the best performance is in bold. It is demonstrated that our method achieves overall best performance on both indoor and outdoor pose estimation among all the relative methods. 4.3

Visual Localization

Dataset. We choose the Aachen Day-Night v1.1 datasets [33] for our visual localization evaluation, which consists of a scene model built upon 6697 images with 824 day-time images and 197 night-time images as queries.

Learning to Match Features with Geometry-Aware Pooling

299

Table 2. Evaluation on ScanNet dataset [31] for indoor pose estimation Method

Indoor pose est. AUC(%) @5◦ @10◦ @20◦

D2-Net [16] + NN 5.2 14.5 SP [14] + SuperGlue [8] 16.2 33.8 SP [14] + SGMNet [9] 15.4 32.1

27.9 51.8 48.3

DRC-Net [30] LoFTR [11] QuadTree [19] Aspanformer [12] GPMatcher (ours)

30.5 50.6 61.8 63.3 63.7

7.7 16.9 24.9 25.6 26.1

17.9 33.6 44.7 46.0 47.1

Table 3. Evaluation on Megadepth [32] dataset for outdoor pose estimation Method

Outdoor pose est. AUC (%) @5◦ @10◦ @20◦

SP [14] + SuperGlue [8] 42.2 61.2 SP [14] + SGMNet [9] 40.5 59.0

75.9 73.6

DRC-Net LoFTR [11] QuadTree [19] Aspanformer [12] GPMatcher (ours)

58.3 81.2 82.2 83.1 83.3

27.0 52.8 54.6 55.3 55.6

42.9 69.2 70.5 71.5 71.9

Table 4. Evaluation on Aachen Day-Night v1.1 datasets [33] for outdoor visual localiztion Method

Day Night (0.25 m, 2◦ )/(0.5 m, 5◦ )/(1 m, 10◦ )

SP [14]+SuperGlue [8] 89.8/96.1/99.4 77.0/90.6/100.0 LoFTR [11]

88.7/95.6/99.0

78.5/90.6/99.0

Aspanformer [12]

89.4/95.6/99.0

77.5/91.6/99.5

GPMatcher (ours)

89.6/96.2/98.9 79.0/91.3/99.5

Evaluation Metrics. Following the Long-Term Visual Localization Benchmark [34], we report the accuracy of localization with matching pairs generated by HLoc. Result. The evaluation of outdoor visual localization on Aachen Day-Night v1.1 datasets [33] is shown in Table 4. It is demonstrated that our method achieves best performance on some cases and competitive performance on the other cases.

300

4.4

J. Deng et al.

Ablation Study

To validate the effectiveness of different design components of our method, we conduct ablation experiments on the Megadepth dataset [32]. The experiment is conducted as Sect. 4.2 above. Specifically, we compare three designs of pooling structure: 1) None pooling operation: A design which hold all of the features. 2) Semi-pooling: Pooling according to the attention scores, which discard 1/4 features in each layer. 3) Geometry-aware pooling: Full design of our proposed method. Table 5. Ablation Study on Megadepth dataset [32] Method

Outdoor pose est. AUC(%) @5◦ @10◦ @20◦

None pooling 52.4 69.2

81.1

Semi-pooling 53.5 69.3

81.6

Full design

55.6 71.9 83.3

As in presented Table 5, our method improve overall performance by a considerable margin, validating the essentiality of our network designs.

5

Conclusion

In this paper, we have proposed a novel geometric-aware feature matching framework GPMatcher based on transformers, which is capable of combining the geometric consistence into the attention-based module. State-of-the-art results validates the effectiveness of our method. We believe that this paper would potentially provide new insights on learning feature matching. With more engineering optimizations, we are looking forward to wider application of our method in real use. Acknowledgement. This work was supported by the National Key Research and Development Program of China under Grant 2020AAA0105900, and partly by National Natural Science Foundation (NSFC) of China (grants 61973301, 61972020), and partly by Youth Innovation Promotion Association CAS.

References 1. Schonberger, J.L., Frahm, J.M.: Structure-from-motion revisited. In: CVPR (2016) 2. Resindra, A., Torii, A., Okutomi, M.: Structure from motion using dense CNN features with keypoint relocalization. IPSJ Trans. Comput. Vis. Appl. 10, 6 (2018) 3. Mur-Artal, R., Montiel, J.M.M., Tardos, J.D.: ORB-SLAM: a versatile and accurate monocular slam system. IEEE Trans. Robot. 31, 1147–1163 (2015)

Learning to Match Features with Geometry-Aware Pooling

301

4. Mur-Artal, R., Tardos, J.: ORB-SLAM2: an open-source slam system for monocular, stereo and RGB-D cameras. IEEE Trans. Robot. 33, 1255–1262 (2016) 5. Sattler, T., Weyand, T., Leibe, B., Kobbelt, L.: Image retrieval for image-based localization revisited. In: BMVC (2012) 6. Fischler, M.A., Bolles, R.C.: Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography. Commun. ACM 24, 381–395 (1981) 7. Barath, D., Matas, J., Noskova, J.: MAGSAC: marginalizing sample consensus. In: CVPR (2019) 8. Sarlin, P.-E., DeTone, D., Malisiewicz, T., Rabinovich, A.: SuperGlue: learning feature matching with graph neural networks. In: CVPR (2020) 9. Chen, H., et al.: Learning to match features with seeded graph matching network. In: ICCV (2021) 10. Shi, Y., Cai, J.-X., Shavit, Y., Mu, T.-J., Feng, W., Zhang, K.: ClusterGNN: cluster-based coarse-to-fine graph neural network for efficient feature matching. In: CVPR (2022) 11. Sun, J., Shen, Z., Wang, Y., Bao, H., Zhou, X.: LoFTR: detector-free local feature matching with transformers. In: CVPR (2021) 12. Chen, H., et al.: ASpanFormer: detector-free image matching with adaptive span transformer. In: Avidan, S., Brostow, G., Cissé, M., Farinella, G.M., Hassner, T. (eds) Computer Vision, ECCV 2022. LNCS, vol. 13692, pp. 20–36. Springer, Cham (2022). https://doi.org/10.1007/978-3-031-19824-3_2 13. Revaud, J., et al.: R2D2: repeatable and reliable detector and descriptor. In: NeurIPS (2019) 14. DeTone, D., Malisiewicz, T., Rabinovich, A.: SuperPoint: self-supervised interest point detection and description. In: CVPRW (2018) 15. Luo, Z., et al.: ASLfeat: learning local features of accurate shape and localization. In: CVPR (2020) 16. Dusmanu, M., et al.: D2-Net: a trainable CNN for joint description and detection of local features. In: CVPR (2019) 17. Lowe, D.G.: Distinctive image features from scale-invariant keypoints. IJCV 60, 91–110 (2004). https://doi.org/10.1023/B:VISI.0000029664.99615.94 18. Rublee, E., Rabaud, V., Konolige, K., Bradski, G.: ORB: an efficient alternative to SIFT or SURF. In: ICCV (2011) 19. Tang, S., Zhang, J., Zhu, S., Tan, P.: Quadtree attention for vision transformers. In: ICLR (2021) 20. Huang D, Chen Y, Xu S, et al.: Adaptive assignment for geometry aware local feature matching. In: CVPR (2023) 21. Vaswani, A., et al.: Attention is all you need. In: NeurIPS (2017) 22. Katharopoulos, A., Vyas, A., Pappas, N., Fleuret, F.: Transformers are RNNs: fast autoregressive transformers with linear attention. In: ICML (2020) 23. Wang, S., Li, B.Z., Khabsa, M., Fang, H., Ma, H.: LinFormer: self-attention with linear complexity. arXiv preprint (2020) 24. Fayyaz, M., et al.: Adaptive token sampling for efficient vision transformers. In: Avidan, S., Brostow, G., Cissé, M., Farinella, G.M., Hassner, T. (eds.) Computer Vision, ECCV 2022. LNCS, vol. 13671, pp. 396–414. Springer, Cham (2022). https://doi.org/10.1007/978-3-031-20083-0_24 25. Lee, J., Lee, Y., Kim, J., Kosiorek, A., Choi, S., Teh, Y.W.: Set transformer: a framework for attention-based permutation-invariant neural networks. In: ICML (2019)

302

J. Deng et al.

26. Chen, B., et al.: PSViT: better vision transformer via token pooling and attention sharing. In: arXiv preprint (2021) 27. Hartley, R., Zisserman, A.: Multiple view geometry in computer vision. Cambridge University Press (2003) 28. Balntas, V., Lenc, K., Vedaldi, A., Mikolajczyk, K.: HPatches: a benchmark and evaluation of handcrafted and learned local descriptors. In: CVPR (2017) 29. Zhou, Q., Sattler, T., Leal-Taixe, L.: Patch2Pix: epipolar-guided pixel-level correspondences. In: CVPR (2021) 30. Li, X., Han, K., Li, S., Prisacariu, V.: Dual-resolution correspondence networks. In: NeurIPS (2020) 31. Dai, A., Chang, A.X., Savva, M., Halber, M., Funkhouser, T., Nießner, M.: ScanNet: richly-annotated 3D reconstructions of indoor scenes. In: CVPR (2017) 32. Li, Z., Snavely, N.: MegaDepth: learning single-view depth prediction from internet photos. In: CVPR (2018) 33. Zhang, Z., Sattler, T., Scaramuzza, D.: Reference pose generation for long-term visual localization via learned features and view synthesis. IJCV 129, 821–844 (2020) 34. Toft, C., et al.: Long-term visual localization revisited. IEEE T-PAMI 44, 2074– 2088 (2020)

PnP: Integrated Prediction and Planning for Interactive Lane Change in Dense Traffic Xueyi Liu1,2 , Qichao Zhang1,2(B) , Yinfeng Gao2,3 , and Zhongpu Xia4 1

State Key Laboratory of Multimodal Artificial Intelligence Systems, Institute of Automation, Chinese Academy of Sciences, Beijing 100190, China [email protected] 2 School of Artificial Intelligence, University of Chinese Academy of Sciences, Beijing 100049, China 3 School of Automation and Electrical Engineering, University of Science and Technology Beijing, Beijing 100083, China 4 Beijing, China Abstract. Making human-like decisions for autonomous driving in interactive scenarios is crucial and difficult, requiring the self-driving vehicle to reason about the reactions of interactive vehicles to its behavior. To handle this challenge, we provide an integrated prediction and planning (PnP) decision-making approach. A reactive trajectory prediction model is developed to predict the future states of other actors in order to account for the interactive nature of the behaviors. Then, nstep temporal-difference search is used to make a tactical decision and plan the tracking trajectory for the self-driving vehicle by combining the value estimation network with the reactive prediction model. The proposed PnP method is evaluated using the CARLA simulator, and the results demonstrate that PnP obtains superior performance compared to popular model-free and model-based reinforcement learning baselines.

Keywords: Lane change

1

· Decision-making · Reinforcement learning

Introduction

Recently, self-driving vehicles (SDVs) have received a great deal of attention from academic and industry communities. With the advancement of Artificial Intelligence technology, data-driven algorithms are being applied progressively to automatic driving decision-making systems, demonstrating enormous potential [5,9,18]. Due to the strong interactivity and uncertainty in complex interaction scenarios, SDVs still face numerous challenges to make human-like behaviors such as lane changes in dense traffic [16], unprotected left turn at intersections [11]. Generally, humans are naturally able to address those social interaction scenarios, since we own an inherent ability to anticipate how other drivers will Z. Xia—Independent Researcher. c The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2024  B. Luo et al. (Eds.): ICONIP 2023, LNCS 14452, pp. 303–316, 2024. https://doi.org/10.1007/978-981-99-8076-5_22

304

X. Liu et al.

Fig. 1. The traditional and interactive pipeline in autonomous driving.

react to our actions. This is one of the primary reasons why SDVs can not handle complex interaction scenarios efficiently. As shown in Fig. 1, in the conventional pipeline, the prediction and planning modules are tackled sequentially, where planning is the downstream task of prediction. After obtaining the predicted trajectories of other vehicles, the SDV selects a tactical decision behavior (behavior planning) and plans the optimal trajectory (motion planning) for the control module to execute. In other words, the planning results have no effect on the trajectory predictions of other system participants, leading to the passive role of the SDV in the system [10]. Recent research identifies similar challenges in the conventional pipeline and attempts to incorporate how the ego vehicle influences interactive vehicles by bridging prediction and planning [8]. Model-based DRL algorithms commonly construct a dynamic model that the agent uses to make decisions. [20] introduces an uncertainty-aware model-based reinforcement learning for end-toend autonomous driving, resulting in improved learning efficiency and performance. In [17], dynamic horizon value estimation is designed based on the world model from Dreamer [6] for lane changes on the highway. Inspired by AlphaGo [14], the integration of tree search with learning techniques can also be effectively employed in autonomous driving systems. [2] applies the n-step temporaldifference (TD) search [15] for robot navigation, where the navigation planning process involves the utilization of a non-reactive prediction model. [19] proposes an environment model based on self-attention and develops model-based RL algorithms upon the model. In other words, the interaction information only contains the future states of the ego vehicle and other participants, while failing to account for how the behavior of the ego affects interactive vehicles. Motivated by this, we propose PnP, a novel integrated prediction and planning learning method for interactive scenarios that demonstrates the reasoning ability of interactive vehicles’ responses to the tactical behavior of the ego vehicle. The contributions of our research can be summarized as follows:

Integrated Prediction and Planning for Interactive Lane Change

305

• We develop a reactive trajectory prediction model based on Long Short-Term Memory (LSTM) and Graph Convolution Network (GCN), which incorporates deduction capacity to predict how interactive vehicles react to the behavior of the ego vehicle. • We introduce PnP, an integrated prediction and planning learning method for interactive lane changes, which enables the ego vehicle to be an active actor with interactive reasoning ability by integrating the reactive trajectory prediction with the TD search. • To enhance the interactivity of the vehicles, we construct a challenging lanechange scenario in dense traffic using the high-fidelity simulator CARLA, and experimental results prove that PnP is superior to popular model-free and model-based reinforcement learning baselines.

2 2.1

Background Markov Decision Process (MDP)

The decision-making process in dense traffic can be modeled as an MDP. Generally, MDP is described as a 5-tuple (S, A, T, R, γ), where S denotes the state space and A denotes the action space. T (st+1 |st , at ) is the state transition probability that the system transitions to the next state st+1 after taking action at ∈ A at state st ∈ S, and rt = R(st , at ) yields the reward for executing action at at state st . γ ∈ (0, 1) is the discount factor. For the autonomous driving task, it is difficult to represent the probability distributions T and P explicitly. The policy π(a|s) maps states to actions. At each time step t, the agent selects a feasible action at at current state st , which causes the system transitions to next state st+1 with probability T (st+1 |st , at ). Meanwhile, the agent receives a numerical reward rt = R(st , at ). The goal of the agent is to learn an optimal policy by maximizing the cumulative reward: π ∗ = arg max E π

2.2

∞ 

γ t rt



(1)

i=0

TD Search

TD search incorporates the value function approximation and bootstrapping into simulation-based tree search, which is much more efficient than Monte-Carlo Tree Search (MCTS) with an equal number of simulations. Using the current state of the system as the root node, the agent simulates forward search based on the transition model and reward model. And the value function is estimated by sampling data from the current policy. After every n-step of simulation, the value function can be updated employing TD learning:  n−1   V (st ) ← V (st ) + α γ m rt+m + γ n V (st+n ) − V (st ) (2) m=0

where V (st ) denotes the value estimation for st , and α is the learning rate.

306

2.3

X. Liu et al.

Trajectory Prediction

As a model-based learning method, the environment model is required for TD search. Note that the reward function can be designed explicitly, and the unknown transition model should be reconstructed. Different from the world model in [17], we utilize the reactive trajectory prediction to traverse the tree to the next state for interactive vehicles over multiple time steps. The trajectory prediction problem can be formulated as utilizing the past states of traffic participants to estimate their future states, i.e., to obtain the future waypoints of N vehicles. Let Xtego = {x0t } represents the state of the ego vehicle at time i d t and Xtnei = {x1t , x2t , ..., xN t } represents other vehicles’, where xt ∈ R is the d dimensional state of the i-th actor. Given the decision action of the ego vehicle ego at time t, the future path Xt+1:t+T −1 can be obtained according to the tranei jectory planning module. Using past states sequence Xt−τ :t , prediction module nei ˆ should forecast the future waypoints Xt+1:t+T , where T represents the prediction length.

Fig. 2. Schematic diagram of the proposed method PnP.

3

Method

To address interactive lane-change scenarios, we propose the PnP framework illustrated in Fig. 2. It consists of reactive trajectory prediction and n-step TD search. With the tactical behavior query, the reactive trajectory prediction and the trajectory planner of the ego vehicle generate the next states over multiple time steps for social actors and the ego vehicle, respectively. With the future states and the designed reward function, the value estimation network is trained based on n-step TD learning. Then, the expected future return for each action of the ego vehicle in the current state can be estimated more accurately. Finally, the action with the highest value is executed as the tactical behavior. The corresponding trajectory is selected for the control module.

Integrated Prediction and Planning for Interactive Lane Change

3.1

307

Reactivate Trajectory Prediction

Different from previous trajectory prediction work [4,21], this section builds a reactive trajectory prediction model considering the reactions of social actors to the tactical behavior of the ego vehicle. We present a novel architecture to capture interactive features. The model performs the following operations to complete prediction tasks: (1) obtaining individual temporal features from the history trajectory of each social actor based on LSTM; (2) utilizing GCN to extract global spatial features; (3) predicting the future trajectory with the spatiotemporal features and the interaction flag in a step-wise rollout manner. The reactive trajectory prediction network is shown in Fig. 3.

Fig. 3. Overall structure of the reactive trajectory prediction model.

LSTM for Individual Features. LSTM has been used to learn the patterns of sequential data [1]. The historical trajectories of vehicles are fed into the standard LSTM network, which can obtain the individual features of each vehicle: (3) hit = fLST M (hit−1 , xit ), i ∈ {1, 2, ..., N }   i i means the input of LSTM cell. pix , piy and vxi , vyi , vyt where xit = pixt , piyt , vxt represent the longitudinal and lateral position and velocity of i-th agent in two coordinate directions, respectively. Finally, the individual temporal features of the environmental vehicles are obtained from the LSTM network: nei 1 2 N = fLST M (Xt−τ hnei t :t ) = {ht , ht , ..., ht }

(4)

GCN for Global Features. GCN is a deep learning model for processing graph-structured data, capable of extracting global information from a spatial perspective [13]. In the task of vehicle trajectory prediction, we employ a two-layer GCN with a symmetric adjacency matrix. Taking the positions Xtego [0 : 2], Xtnei [0 : 2] as input, the information goes through a feature extraction layer, then extracted features are concatenated together to form the node feature matrix H (0) : H (0) = fconcat (fr (Xtego [0 : 2]), fh (Xtnei [0 : 2]))

(5)

308

X. Liu et al.

where fconcat is the connecting function, fr , fh are the extract layers. The adjacency matrix adjusts the connection relationships between nodes by transforming and integrating features. After applying the softmax operation, the weights of connections between nodes are converted into a probability distribution along the row dimension. By multiplying and normalizing with node features, effective information propagation and feature aggregation can be achieved on the graph. T

H (1) = Softmax(H (0) Wa H (0) )H (0) Ws(0) T

Zt = H (2) = Softmax(H (1) Wa H (1) )H (1) Ws(1) (0)

(6)

(1)

where the weights Wa , Ws , Ws are optimized through gradient descent. H represents the node feature matrix after dimension transformation, and the output Zt represents the global spatial features. The entire graph convolutional network computation process can be represented by the following formula: Zt = fGCN (Xtego [0 : 2], Xtnei [0 : 2])

(7)

Reactive Trajectory Prediction Combining the Interaction Signal: In the reactive trajectory prediction model, we concatenate the corresponding spatiotemporal features with the interaction signal if lag between the ego vehicle and social vehicles. As the interactive signal, when there is an interaction between the environment vehicle and the ego vehicle, if lag is set to 1; in other cases, it is set to 0. After the connected features are fed into the fully connected layer, the future state for each social vehicle at the next time step can be predicted. ˆ nei = fM LP (hnei , Zt , if lag ) X t+1 t

(8)

According to the tactical behavior at of the ego vehicle, we can obtain the future positions of the ego vehicle by the trajectory planner. With the future states of the ego vehicle and social vehicles, the individual and global features from LSTM and GCN can be updated sequentially. Finally, we can obtain the predicted future trajectories of all social vehicles: ego nei ˆ nei X t+1:t+T = fP (Xt−τ :t , Xt+1:t+T −1 , if lag )

(9)

For the training of the reactive trajectory prediction model, the root mean squared error (RMSE) is chosen as the loss function:  2 1 T  i J= xt+j − x ˆit+j (10) j=1 T where x ˆit+j is the predicted future state of i-th social vehicle at time step t + j, i and xt+j is the corresponding ground truth.

Integrated Prediction and Planning for Interactive Lane Change

3.2

309

Integrated Prediction and Planning

In this section, we introduce the integrated PnP based on the TD search method for handling the interactive lane-change task in dense traffic. As a model-based algorithm, TD search utilizes tree structures to simulate and update the value function. In the driving task, the reactive trajectory prediction model is learned as the transition function of interactive vehicles, considering the reciprocal influences among these vehicles. Firstly, introduce the decision-making problem setup for lane changes. State Space. The system state includes the position information of the ego vehicle and other vehicles. The state at time t is represented as the following formula:  0 1 nei N (11) st = {sego t , st } = st , st , ..., st and sit = (pixt , piyt ), i ∈ {0, 1, ..., N }, is also the first two dimensional characteristics of xit . For the TD search algorithm, when using the value function to estimate the state value, the future state is required. The estimation of future states for environment vehicles can be derived from the trajectories inferred by the reactive prediction model. The future state of the ego vehicle is provided by the trajectory planning module with the decision action. The state at time t + 1 sego ˆnei can be represents as sˆt+1 = {ˆ t+1 }. t+1 , s Action Space. In this work, we use a discrete action space for lane changes. The discrete actions are set as {change lane left, change lane right, and stay in the current lane}. Then, the tactical behavior is forwarded to the trajectory planner1 , which generates the future driving trajectories of the ego vehicle. Reward Function. Safety and efficiency are crucial in the context of intense traffic. In addition, the reward function is designed for the following purposes: • The SDV is expected to be closely oriented with the lane centerline. • Once encountering a sluggish vehicle, the SDV is encouraged to make the change decision. • The collision should be avoided while driving. In general, the reward function is as follows: ⎧ if success ⎨ 10.0, if collision rt = −10.0, ⎩ 0.2 × rm , else

(12)

where rm is calculated by the relative distance. When the

SDV stays in the current lane, the rm term is calculated as rm = −(1 − p0x − plc /(0.5 × wl )), where p0x is the lateral position of the ego vehicle, plc is the lateral position of the closest waypoint in current lane centerline, and wl is the width of the current 1

https://github.com/enginBozkurt/MotionPlanner.

310

X. Liu et al.

lane. This reward encourages the ego vehicle to complete lane-change actions in the designed experimental scenarios. Transition Model. Transition model is required for n-step TD search. As previously mentioned, the transition model is divided into two parts for PnP: the future states of social vehicles are predicted by reactive trajectory prediction netego nei work fP (Xt−τ :t , Xt+1:t+T −1 , if lag ); the future state of the ego vehicle is obtained according to the trajectory planner. After constructing the decision model, our temporal-difference search method can be employed. TD search is a simulation-based search algorithm, where the agent simulates from a root state and samples with the transition and reward model. Due to the low dimensional action space, we can simulate the experience with a limited rollout depth. A neural network fV , with parameters θV , is used to approximate the value function V (st ). The value of current state st is estimated using the following formula, with a rollout depth of d: fV (st ) = r(st , at ) +

d−1 

γ m r(ˆ st+m , at+m ) + γ d fV (ˆ st+d )

(13)

m=1

To balance the exploration and exploitation during the learning process, the -greedy policy is adopted to choose the action:  arg max r(st , at ) + γfV (ˆ st+1 ), with 1 − t probability at ∈A (14) at = random at ∈ A, with t probability To make the training process more stable, a target value network is adopted. The update frequency of the target value network is slower compared to the value network, which benefits the convergence of the network. We use fV  represents the target network with parameters θV  . The network parameters are updated through gradient descent. After adding the target network, the TD error is represented by the following formula. δ = r(st , at ) +

d−1 

γ m r(ˆ st+m , at+m ) + γ d fV  (ˆ st+d ) − fV (st )

(15)

m=1

4 4.1

Experiments Experiment Setup

To verify the performance of the proposed algorithm, we select the high-fidelity simulator CARLA [3], which can customize the test scenario and generate personalized traffic flow. Based on the CARLA simulator, we construct a passive lane-change scenario in dense traffic including social vehicles with different driving styles. It is a standard straight road with three lanes. The ego vehicle is initialized in the right lane. Considering accidents or temporal traffic control,

Integrated Prediction and Planning for Interactive Lane Change

311

the preceding vehicle gradually slows down. The ego vehicle has to make a lane change maneuver to the middle lane. This situation is frequently observed on urban roads during peak hours and is also applicable to merging scenarios. We set two types of social vehicles in the target lane. Aggressive vehicles won’t decelerate when the preceding vehicle executes a lane change. The social vehicle type is sampled to be cooperative or aggressive from a binary uniform distribution. The lane-change test scenario terminates as the ego completes a lane change, surpasses the specified position, or encounters a collision. To prove the validity of the prediction algorithm, we chose the LSTM predictor and the same structure of the PnP predictor method without if lag as the comparative approach. We use average displacement error(ADE) and final displacement error(FDE) to evaluate results. The metric ADE computes average L2 norm between the ground-truth trajectory and the predicted trajectory over all future time steps, while FDE measures the L2 norm at the final time step. To evaluate the performance of the decision-making model, we train the value network function using the TD search method in the built scenario and compare PnP with the following methods: • DQN [12] uses experience replay to address the issue of sample correlation, and target network to enhance the stability. • DreamerV2 [7] samples latent states from a discrete distribution and utilizes KL balancing techniques to ensure the accuracy of model reconstruction. In addition, the trained models are tested in the complex scenario with 100 episodes. And the evaluation metrics encompass the success rate of lane-changing and collision rate, taking into account efficiency and safety. In the testing, we check if the ego vehicle collides with other agents at every time step during the simulation. The success rate metric is calculated when the ego changes to the target lane without any collisions. The collision rate metric is calculated when the vehicle collides with other vehicles during the driving process. 4.2

Reactive Trajectory Prediction Model Evaluation

A well-balanced dataset can significantly improve the accuracy and robustness of trajectory prediction models. An interactive dataset is collected by rule-based driving vehicles in the aforementioned CARLA scenarios. The training set contains 1000 trajectories, and the testing set includes 400 trajectories. The selection of interactive vehicles is based on relative distance and the lane in which the ego vehicle is located. The slow-moving vehicle ahead of the ego vehicle is always the interactive vehicle, while the interactive vehicles in the target lane are the closest two vehicles to the ego vehicle at the current time step. In Fig. 4, the interactive vehicles are highlighted with red boxes. After training 100 epochs in the training set, the experiment results are shown in the Table 1. The table compares the performance of different prediction algorithms, showing that the prediction model achieves the highest accuracy compared to other network models in both overall and interactive scenarios.

312

X. Liu et al. Table 1. Testing displacement error of different prediction models. Scenario

Metric LSTM LSTM+GCN LSTM+GCN+if lag

All scenario

ADE FDE

0.42 0.89

0.39 0.72

0.30 0.58

Interactive scenario ADE FDE

0.87 1.88

0.75 1.51

0.57 1.17

The results demonstrate that the LSTM predictor cannot accurately predict the future trajectory of vehicles with different styles. Because the model lacks consideration for the differences in how the two types of vehicles react to the behavior of the ego vehicle. Benefiting from the well-designed network structure, the reactive prediction model without interaction signals has improved the performance of the LSTM network, but still cannot perfectly distinguish the two types of social vehicles. Considering the interactive signals helps the reactive prediction model more accurately predict future states. The proposed reactive trajectory prediction model reduces the ADE and FDE in the entire scene to 0.30 and 0.58. Additionally, the model reduces ADE and FDE to 0.57 and 1.17 in the interactive scene.

Fig. 4. Diagram of the interactive vehicles selection.

4.3

Lane Change Decision Performance

After training 250,000 steps in the scenario, the curves of each algorithm are shown in Fig. 5. The solid lines depict the mean scores and shaded areas indicate the standard deviation of five trials over different random seeds. The model-based algorithm, DreamerV2, demonstrates the highest return, followed by PnP. And PnP exhibits the least variance, indicating its high stability.

Integrated Prediction and Planning for Interactive Lane Change

313

The performance of the above algorithms in the test scenario is as shown in Table 2. PnP achieves the highest success rate of 85% in the lane-changing task, outperforming other baseline algorithms. This result demonstrates the advantage of model-based algorithms and showcases the effectiveness of integrated prediction and decision-making methods. According to the training curve, although DreamerV2 achieves higher returns, its performance in lane-changing tasks is not as good as PnP. Through analysis, it can be determined that negative rewards are given to the vehicle when it stays in the current lane. To prevent collisions with vehicles in the target lane, the vehicle driven by PnP temporarily stays in the current lane, receiving continuous negative rewards, resulting in a lower return. Figure 6 shows the decision comparison between PnP and DreamerV2 in this scenario.

Fig. 5. Training curves of model-based and model-free methods.

Table 2. Performances of different algorithms on lane changing in dense traffic. Model

Success rate (%) Collision rate (%)

DQN [12]

51

49

DreamerV2 [7] 68

32

PnP (Ours)

15

85

314

X. Liu et al.

Fig. 6. The behaviors of PnP and DreamerV2 in the presence of aggressive vehicles.

5

Conclusion

This paper proposes an integrated prediction and planning learning method. And the reactive trajectory prediction model is composed of LSTM and GCN, which can extract the temporal and spatial features. Considering the interactive signal, our model can predict the diverse responses of social actors to the lane-changing behavior of the ego vehicle. By connecting the prediction and planning, PnP improves the application of prediction models to downstream planning tasks, whose success rate in the overtaking lane-change task is superior to other baselines. Acknowledgements. This work was supported by the National Key Research and Development Program of China under Grants 2022YFA1004000, National Natural Science Foundation of China (NSFC) under Grants No. 62173325 and CCF Baidu Open Fund.

References 1. Chauhan, N.S., Kumar, N.: Traffic flow forecasting using attention enabled BiLSTM and GRU hybrid model. In: Tanveer, M., Agarwal, S., Ozawa, S., Ekbal, A., Jatowt, A. (eds.) Neural Information Processing, ICONIP 2022. CCIS, vol. 1794, pp. 505–517. Springer, Singapore (2023). https://doi.org/10.1007/978-98199-1648-1_42 2. Chen, C., Hu, S., Nikdel, P., Mori, G., Savva, M.: Relational graph learning for crowd navigation. In: 2020 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pp. 10007–10013. IEEE (2020)

Integrated Prediction and Planning for Interactive Lane Change

315

3. Dosovitskiy, A., Ros, G., Codevilla, F., Lopez, A., Koltun, V.: CARLA: an open urban driving simulator. In: Conference on Robot Learning, pp. 1–16. PMLR (2017) 4. Gu, J., Sun, C., Zhao, H.: DenseTNT: end-to-end trajectory prediction from dense goal sets. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 15303–15312 (2021) 5. Guo, Y., Zhang, Q., Wang, J., Liu, S.: Hierarchical reinforcement learning-based policy switching towards multi-scenarios autonomous driving. In: 2021 International Joint Conference on Neural Networks (IJCNN), pp. 1–8. IEEE (2021) 6. Hafner, D., Lillicrap, T., Ba, J., Norouzi, M.: Dream to control: Learning behaviors by latent imagination. In: International Conference on Learning Representations (2019) 7. Hafner, D., Lillicrap, T.P., Norouzi, M., Ba, J.: Mastering Atari with discrete world models. In: International Conference on Learning Representations (2020) 8. Hagedorn, S., Hallgarten, M., Stoll, M., Condurache, A.: Rethinking integration of prediction and planning in deep learning-based automated driving systems: a review. arXiv preprint arXiv:2308.05731 (2023) 9. Li, D., Zhao, D., Zhang, Q., Chen, Y.: Reinforcement learning and deep learning based lateral control for autonomous driving [application notes]. IEEE Comput. Intell. Mag. 14(2), 83–98 (2019) 10. Liu, J., Zeng, W., Urtasun, R., Yumer, E.: Deep structured reactive planning. In: 2021 IEEE International Conference on Robotics and Automation (ICRA), pp. 4897–4904. IEEE (2021) 11. Liu, Y., Gao, Y., Zhang, Q., Ding, D., Zhao, D.: Multi-task safe reinforcement learning for navigating intersections in dense traffic. J. Franklin Inst. (2022) 12. Mnih, V., et al.: Human-level control through deep reinforcement learning. Nature 518(7540), 529–533 (2015) 13. Palmal, S., Arya, N., Saha, S., Tripathy, S.: A multi-modal graph convolutional network for predicting human breast cancer prognosis. In: Tanveer, M., Agarwal, S., Ozawa, S., Ekbal, A., Jatowt, A. (eds.) Neural Information Processing, ICONIP 2022. Communications in Computer and Information Science, vol. 1794, pp. 187– 198. Springer, Singapore (2023). https://doi.org/10.1007/978-981-99-1648-1_16 14. Silver, D., et al.: Mastering the game of go with deep neural networks and tree search. Nature 529(7587), 484–489 (2016) 15. Silver, D., Sutton, R.S., Müller, M.: Temporal-difference search in computer go. Mach. Learn. 87, 183–219 (2012) 16. Wang, J., Zhang, Q., Zhao, D.: Highway lane change decision-making via attentionbased deep reinforcement learning. IEEE/CAA J. Automatica Sinica 9(3), 567–569 (2021) 17. Wang, J., Zhang, Q., Zhao, D.: Dynamic-horizon model-based value estimation with latent imagination. IEEE Trans. Neural Netw. Learn. Syst. (2022) 18. Wang, J., Zhang, Q., Zhao, D., Chen, Y.: Lane change decision-making through deep reinforcement learning with rule-based constraints. In: 2019 International Joint Conference on Neural Networks (IJCNN), pp. 1–6. IEEE (2019) 19. Wen, J., Zhao, Z., Cui, J., Chen, B.M.: Model-based reinforcement learning with self-attention mechanism for autonomous driving in dense traffic. In: Tanveer, M., Agarwal, S., Ozawa, S., Ekbal, A., Jatowt, A. (eds.) Neural Information Processing, ICONIP 2022. LNCS, vol. 13624, pp. 317–330. Springer, Cham (2023). https://doi. org/10.1007/978-3-031-30108-7_27

316

X. Liu et al.

20. Wu, J., Huang, Z., Lv, C.: Uncertainty-aware model-based reinforcement learning: methodology and application in autonomous driving. IEEE Trans. Intell. Veh. 8, 194–203 (2022) 21. Zhao, X., Chen, Y., Guo, J., Zhao, D.: A spatial-temporal attention model for human trajectory prediction. IEEE CAA J. Autom. Sinica 7(4), 965–974 (2020)

Towards Analyzing the Efficacy of Multi-task Learning in Hate Speech Detection

Krishanu Maity, Gokulapriyan Balaji, and Sriparna Saha

Department of Computer Science and Engineering, Indian Institute of Technology Patna, Patna 801103, India
{krishanu 2021cs19,sriparna}@iitp.ac.in
Indian Institute of Information Technology, Design and Manufacturing, Kancheepuram, Chennai, India

Abstract. Secretary-General António Guterres launched the United Nations Strategy and Plan of Action on Hate Speech in 2019, recognizing the alarming trend of increasing hate speech worldwide. Despite extensive research, benchmark datasets for hate speech detection remain limited in volume and vary in domain and annotation. In this paper, the following research objectives are investigated: (a) a performance comparison of multi-task models against single-task models; (b) a performance study of different multi-task models (fully shared, shared-private) for hate speech detection, considering each individual dataset as a separate task; and (c) the effect of using different combinations of the available datasets on the performance of the multi-task setting. A total of six datasets that contain offensive and hate speech on the grounds of race, sex, and religion are considered for this study. Our analysis suggests that a proper combination of datasets in a multi-task setting can overcome data scarcity and support a unified framework.

Keywords: Hate Speech · Data scarcity · Single Task · Multi-Task

1 Introduction

Our world's communication patterns have changed dramatically due to the rise of social media platforms, and one of those changes is an increase in improper behaviors such as the use of hateful and offensive language in social media posts. On 15 March 2021, an independent United Nations human rights expert said that social media has too often been used with "relative impunity" to spread hate, prejudice and violence against minorities (https://news.un.org/en/story/2021/03/1087412). Hate speech [15] is any communication that disparages a person or group on the basis of a characteristic such as color, gender, race, sexual orientation, ethnicity, nationality, religion, or other features. Hate speech detection is crucial in social media because it helps to ensure a safe and inclusive online environment for all users. Even though social media platforms provide space for people to connect, share, and engage with each other, the anonymity and ease of access of these platforms also make them attractive to those who engage in hate speech. Hate speech has serious consequences and can cause significant harm to its targets. It can lead to increased discrimination, bullying, and even physical violence. Moreover, it can contribute to the spread of misinformation, stoke fear and division, and undermine the fabric of society. The harm that hate speech causes is amplified in online spaces, where the reach and impact of messages can be much greater than in the real world. According to the Pew Research Center, 40% of social media users have experienced some form of online harassment (https://www.pewresearch.org/internet/2017/07/11/online-harassment-2017/). According to the FBI, there were 8,263 reported hate crime incidents in 2020, an increase of almost 13% over the 7,314 incidents reported in 2019 (https://www.fbi.gov/news/press-releases/fbi-releases-2019-hate-crime-statistics). Between July and September 2021, Facebook detected and acted upon 22.3 million instances of hate speech content (https://transparency.fb.com/data/community-standards-enforcement/hatespeech/facebook/). A study found that from December 2019 to March 2020, there was a substantial 900% surge in the number of tweets containing hate speech directed towards Chinese people and China (https://l1ght.com/Toxicity during coronavirus Report-L1ght.pdf).

These hate posts, which supposedly sit harmlessly on social media, create real-world violence and riots, which warrants the detection and control of hate speech. That is why social media companies have taken steps to detect and remove hate speech from their platforms. This is a challenging task, as hate speech takes many different forms and is difficult to define. In addition, there is often a fine line between free speech and hate speech, and companies must balance these competing interests while still protecting users from harm. It is important to note that hate speech detection is not just a technical challenge; it is also a societal challenge. Companies must understand the cultural and historical context of hate speech to develop policies and algorithms that are fair and effective. It is also important to ensure that hate speech detection does not undermine freedom of expression or discriminate against marginalized groups. Over the last decade, plenty of research has been conducted to develop datasets and models for automatic online hate speech detection on social media [17, 25]. The efficacy of hate speech detection systems is paramount, because labeling a non-offensive post as hate speech denies a free citizen's right to express themselves. Furthermore, most existing hate speech detection models capture only a single type of hate speech, such as sexism or racism, or a single demographic, such as people living in India, because they are trained on a single dataset. Such learning negatively affects recall when classifying data that are not captured in the training examples. To build an effective machine learning or deep learning-based hate speech detection system, a considerable amount of labeled data is required. Although there are a few benchmark datasets, their sizes are often limited and they lack a standardized annotation methodology.


In this work, we address three open research questions related to building a more generic model for textual hate speech detection.

(i) RQ1: Does multi-task learning outperform single-task learning and a single classification model trained on merged datasets? This question concerns the advantage of multi-task learning over other training strategies when several datasets are available. The most intuitive method of training is to merge the datasets and train the model in a single-task setting; in a multi-task setting, the different datasets are instead treated as individual tasks.

(ii) RQ2: Which type of multi-task model performs best across a wide range of benchmark datasets? Two widely used multi-task frameworks, fully shared (FS) and shared-private (SP), optionally with adversarial training (Adv), are explored to investigate which is preferable for handling multiple datasets.

(iii) RQ3: Which combinations of datasets improve or degrade the performance of the multi-task learning model? This question addresses the effect of different dataset combinations on model performance, since different combinations bring knowledge from various domains. For n datasets (n ≥ 2), there are 2^n − n − 1 possible combinations containing at least two datasets. Studying performance improvements in terms of the complementary or contrasting properties of the datasets plays an important role in the selection of datasets for multi-task learning.

This paper addresses the above questions by developing three multi-task learning models (fully shared, shared-private, and adversarial), presenting insights about dataset combinations, and investigating the performance improvement of multi-task learning over single-task learning and over a single model trained on a merged dataset.
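For the six datasets used later in this paper, the combination count above can be verified by direct enumeration; the following is a small illustrative sketch (the labels D1–D6 are placeholders here):

```python
from itertools import combinations

datasets = ["D1", "D2", "D3", "D4", "D5", "D6"]
n = len(datasets)

# All combinations that contain at least two datasets.
combos = [c for r in range(2, n + 1) for c in combinations(datasets, r)]

assert len(combos) == 2 ** n - n - 1   # 57 combinations for n = 6
print(len(combos))
```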

2 Related Work

Text mining and NLP paradigms have previously been used to examine a variety of topics related to hate speech detection, such as identifying online sexual predators, detecting internet abuse, and detecting cyber-terrorism [22]. Detecting hateful and offensive speech presents challenges in understanding contextual nuances, addressing data bias, handling multilingual and code-switching text, adapting to the evolving nature of hate speech, dealing with subjectivity and ambiguity, countering evasion techniques, and addressing ethical considerations [6]. These challenges necessitate robust and adaptable methodologies, including deep learning and user-centric approaches, to enhance hate speech detection systems.

A common approach to hate speech detection combines feature extraction with classical machine learning algorithms. For instance, Dinakar et al. [3] utilized the Bag-of-Words (BoW) approach in conjunction with Naïve Bayes and Support Vector Machine (SVM) classifiers. Deep learning, which has demonstrated success in computer vision, pattern recognition, and speech processing, has also gained significant momentum in natural language processing (NLP). One significant advancement in this direction was the introduction of word embeddings [14], which have proven useful when combined with classical machine learning algorithms for hate speech detection [13], surpassing the performance of the BoW approach. Furthermore, other deep learning methods have been explored, such as Convolutional Neural Networks (CNNs) [27], Recurrent Neural Networks (RNNs) [4], and hybrid models combining the two [9]. Another significant development was the introduction of transformers, particularly BERT, which exhibited exceptional performance in a recent hate speech detection competition: seven of the top ten performing models in one subtask were based on BERT [26].

2.1 Works on Single Dataset

The work by Watanabe et al. [25] introduced an approach that utilized unigrams and patterns extracted from the training set to detect hate expressions on Twitter, achieving an accuracy of 87.4% in differentiating between hate and non-hate tweets. Similarly, Davidson et al. [2] collected tweets based on specific keywords and crowdsourced the labeling of hate, offensive, and non-hate tweets, developing a multi-class classifier for hate and offensive tweet detection. In a separate study, a dataset of 4500 YouTube comments was used by the authors of [3] to investigate cyberbullying detection, with SVM and Naive Bayes classifiers achieving overall accuracies of 66.70% and 63%, respectively. A cyberbullying dataset was created from Formspring.me in the study of [20], where a C4.5 decision tree trained with the Weka toolkit achieved an accuracy of 78.5%. CyberBERT, a BERT-based framework created by [17], exhibited cutting-edge performance on Twitter (16k posts), Wikipedia (100k posts), and Formspring (12k posts) datasets. On a hate speech dataset of 16K annotated tweets, Badjatiya et al. [1] conducted extensive experiments with deep learning architectures for learning semantic word embeddings, demonstrating that deep learning techniques beat char/word n-gram baselines by 18% in terms of F1 score.

2.2 Works on Multiple Datasets

Talat et al. [23] experimented on three hate speech datasets with different annotation strategies to examine how multi-task learning mitigates the annotation-bias problem. The authors of [21] employed a transfer learning technique to build a single representation of hate speech based on two independent hate speech datasets. Fortuna et al. [5] merged two hate speech datasets from different social media platforms (one from Facebook and another from Twitter) and found that adding data from a different social network enhanced the results. Although there have been some attempts at building a generalized hate speech detection model from multiple datasets, none of them addresses (i) how to combine datasets, (ii) whether multi-tasking is better than a single-task setup or a single model trained on a merged dataset, and (iii) which type of multi-tasking is better: FS or SP.


Table 1. Source, statistics and domain of the six hate speech datasets used in our experiments

Dataset | # Samples | # Classes and # samples per class | Source | Domain
D1 [2] | 24783 | 3: Hate speech (1430), Offensive (19190), Neither (4163) | Twitter | Hate, Offensive
D2 [7] | 10703 | 2: Non-hate (9507), Hate (1196) | Stormfront forum | Race, Religion
D3 [24] | 10141 | 3: Racism (12), Sexism (2656), None (7473) | Twitter | Race, Sexism
D4 [12] | 7005 | 2: Non Hate-Offensive (4456), Hate and Offensive (4456) | Twitter | Hate, Offensive
D5 [16] | 10000 | 2: Non-hateful (5790), Hateful (4210) | Twitter | Immigrants, Sexism
D6 [11] | 31962 | 2: Non-hate (29720), Hate (2242) | Twitter | Race, Sexism

3 Dataset Description

Six datasets (Table 1) are selected in an attempt to understand the effect of using multiple datasets and to conduct experiments. These datasets include examples of hate, offensiveness, racism, sexism, religion, and prejudice against immigrants. Even though the samples differ in terms of annotation style, domain, demography, and geography, there is common ground in terms of hate speech.

4 Methodology

To investigate how multiple hate speech datasets can help in building a more generalized hate speech detection model, we experiment with two widely used multi-task frameworks (Fig. 1), i.e., fully shared and shared-private, developed by [10]. In the feature extraction module (Fig. 2), we employ GloVe [18] and FastText [8] embeddings to encode the noisy social media data efficiently. The joint embedding is passed through a convolution layer followed by max pooling to generate local, key-phrase-based convoluted features. In the FS model, the final output of the CNN module is shared over n task-specific channels, one for each dataset (task). In the SP model, the individual CNN representation of each task is passed through the corresponding task-specific output layer; in addition to the task-specific layers, there is a shared (fully connected) layer that learns task-invariant features. An adversarial loss is added during model training to make the feature spaces of the shared and task-specific layers mutually exclusive [19].
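The following is a minimal PyTorch sketch of one plausible reading of this shared-private setup. The class names, hidden sizes, and the discriminator-based adversarial head are illustrative assumptions rather than the authors' exact implementation; only the kernel sizes (1–3), the 100 filters per kernel, and the joint embedding (assumed 300-d GloVe plus 300-d FastText, concatenated to 600-d) are taken from the paper and Fig. 2.

```python
import torch
import torch.nn as nn

class CNNFeatureExtractor(nn.Module):
    """Joint embedding -> 1-D convolutions (kernel sizes 1, 2, 3; 100 filters each)
    -> max pooling over time, following the feature extractor of Fig. 2."""
    def __init__(self, embed_dim=600, n_filters=100, kernel_sizes=(1, 2, 3)):
        super().__init__()
        self.convs = nn.ModuleList(
            [nn.Conv1d(embed_dim, n_filters, k) for k in kernel_sizes]
        )

    def forward(self, x):                        # x: (batch, seq_len, embed_dim)
        x = x.transpose(1, 2)                    # -> (batch, embed_dim, seq_len)
        pooled = [torch.relu(c(x)).max(dim=2).values for c in self.convs]
        return torch.cat(pooled, dim=1)          # -> (batch, 300)

class SharedPrivateModel(nn.Module):
    """One private extractor per dataset/task plus one shared extractor; each task head
    sees the concatenated [private ; shared] features. A task discriminator on the shared
    features provides an adversarial signal pushing them to be task-invariant."""
    def __init__(self, n_classes_per_task, feat_dim=300):
        super().__init__()
        n_tasks = len(n_classes_per_task)
        self.private = nn.ModuleList([CNNFeatureExtractor() for _ in range(n_tasks)])
        self.shared = CNNFeatureExtractor()
        self.heads = nn.ModuleList(
            [nn.Linear(2 * feat_dim, c) for c in n_classes_per_task]
        )
        self.discriminator = nn.Linear(feat_dim, n_tasks)

    def forward(self, x, task_id):
        priv = self.private[task_id](x)
        shared = self.shared(x)
        logits = self.heads[task_id](torch.cat([priv, shared], dim=1))
        task_logits = self.discriminator(shared)   # input to the adversarial objective
        return logits, task_logits
```

In training, batches would be drawn from the individual datasets in turn, combining the task head's classification loss with an adversarial term derived from `task_logits` (for example via a gradient-reversal layer on the shared branch); dropping the private branches recovers a fully shared variant.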

5 Experimental Results and Analysis

This section describes the results of the single-task setting and the multi-task setting of three models for different combinations of the six benchmark datasets. The experiments are intended to address the three research questions listed after Figs. 1 and 2.


Fig. 1. (a) Fully shared and (b) Shared private multi-task frameworks.

Fig. 2. Feature extraction module based on GloVe and FastText joint embedding followed by CNN (word + character embedding; convolution with kernel sizes 1, 2, 3 and 100 filters; max pooling)


– RQ1: How does multi-task learning enhance the performance of hate speech detection compared to single-task learning and to a single model trained on a merged dataset?
– RQ2: Which type of multi-task learning model provides the best results among the three models?
– RQ3: Which combination of the benchmark datasets should be used to obtain the best results from multi-task learning?

The experiments were performed with 5-fold cross-validation on the datasets, and the results are evaluated in terms of accuracy. The values in brackets are the improvements or decrements in accuracy compared to single-task learning. Keeping the size of the datasets in mind, a batch size of 8 was found optimal; the ReLU activation function and a learning rate of 5e−4 were chosen, and the models were trained for 20 epochs.

Table 2. Single-task learning performance with individual datasets and merged datasets

Dataset | Single Task (STL) | Merged (All) | Merged (−D1)
D1 | 91.28 | 20.33 | −
D2 | 87.6 | 84.96 | 88.97
D3 | 82.89 | 71.71 | 73.63
D4 | 63.81 | 63.74 | 64.88
D5 | 70.05 | 58.5 | 59.9
D6 | 94.75 | 87.63 | 92.87

Table 3. Multi-task learning performance

Dataset | FS | FS-Adv | SP | SP-Adv
D1 | 92.68 (+1.40) | 93.63 (+2.35) | 95.04 (+3.76) | 95.59 (+4.31)
D2 | 90.20 (+2.60) | 89.02 (+1.42) | 88.70 (+1.10) | 89.53 (+1.93)
D3 | 83.81 (+1.12) | 83.62 (+0.73) | 86.79 (+3.90) | 86.95 (+4.06)
D4 | 67.88 (+4.07) | 66.25 (+2.44) | 66.10 (+2.29) | 65.53 (+1.72)
D5 | 71.45 (+1.40) | 71.67 (+1.62) | 74.80 (+4.75) | 75.00 (+4.95)
D6 | 96.16 (+1.41) | 95.72 (+0.97) | 96.70 (+1.95) | 96.78 (+2.03)

5.1 RQ1: Single Task vs. Merging All vs. Multi-task

In Table 2, the accuracy of single-task learning is compared with a model trained after merging all datasets and with the multi-task framework. It is evident from this table that the performance of single-task learning is better than that of the model trained on a merged version of all the datasets. However, when dataset 1, on which the merged model performed very poorly, is removed from the merged set and the experiments are conducted again, the accuracy values for datasets 2 and 4 improve over the single-task learning accuracies. The selection of the datasets used to form the merged dataset for developing a unified model therefore plays a significant role in the performance of the system. When the combination of datasets is selected after analyzing the domain and the supplementary and complementary information available with each dataset, the unified model becomes more generalized; blindly combining all the datasets decreases the performance of the unified model trained on the merged dataset. In multi-task settings (see Table 3), the performance on all the datasets improves significantly over both single-task learning and single-task training on a merged dataset. In a multi-task setting, hate speech detection on each dataset is considered an individual task, and this gives the model an edge in its ability to generalize and perform better compared to the other training settings.

Table 4. Experimental results of Fully Shared and Shared Private models under multi-task settings with 2-dataset combinations (in a combination such as D3-D5, 1st and 2nd denote the performance on D3 and D5, respectively)

Dataset Combination | Fully Shared 1st | Fully Shared 2nd | Shared Private 1st | Shared Private 2nd
D1-D2 | 93.33 (+2.05) | 90.19 (+2.59) | 94.05 (+2.77) | 88.00 (+0.40)
D1-D3 | 93.55 (+2.27) | 83.34 (+0.45) | 94.01 (+2.73) | 84.07 (+1.18)
D1-D4 | 93.54 (+2.26) | 68.88 (+5.07) | 93.93 (+2.65) | 64.48 (+0.67)
D1-D5 | 93.35 (+2.07) | 72.55 (+2.50) | 93.40 (+2.12) | 74.60 (+4.55)
D1-D6 | 92.39 (+1.11) | 95.22 (+0.47) | 94.61 (+3.33) | 95.51 (+0.76)
D2-D3 | 89.86 (+2.26) | 83.39 (+0.50) | 89.37 (+1.77) | 84.96 (+2.07)
D2-D4 | 90.55 (+2.95) | 67.74 (+3.93) | 88.27 (+0.67) | 64.45 (+0.64)
D2-D5 | 90.00 (+2.40) | 73.20 (+3.15) | 89.25 (+1.65) | 74.05 (+4.00)
D2-D6 | 90.43 (+2.83) | 95.52 (+0.77) | 88.46 (+0.86) | 95.77 (+1.02)
D3-D4 | 83.88 (+0.99) | 67.38 (+3.57) | 84.22 (+1.33) | 65.24 (+1.43)
D3-D5 | 83.00 (+0.11) | 71.90 (+1.85) | 84.57 (+1.68) | 74.75 (+4.70)
D3-D6 | 83.44 (+0.55) | 95.18 (+0.43) | 84.17 (+1.28) | 95.86 (+1.11)
D4-D5 | 68.09 (+4.28) | 71.59 (+1.54) | 65.31 (+1.50) | 73.25 (+3.20)
D4-D6 | 67.09 (+3.28) | 96.04 (+1.29) | 65.42 (+1.61) | 96.20 (+1.45)
D5-D6 | 72.05 (+2.00) | 95.95 (+1.20) | 73.80 (+3.75) | 96.30 (+1.55)

5.2 RQ2: Fully Shared vs. Shared Private (+/− Adversarial Training)

Among the models trained over multiple datasets, as shown in Tables 4 and 5, no single clear winner can be selected. However, on the benchmark datasets used in our experiments, the shared private model proves to be the better model among the alternatives. This could be due to the training of both shared and task-specific layers, which provides in-depth knowledge and lets the model prioritize information from both kinds of layers; the absence of such an ability to prioritize shared knowledge inhibits the performance of the fully shared network. As evidence of this, the accuracies for datasets 1, 3, 5, and 6 across all combinations are higher in the shared private model than in the fully shared one. Interestingly, however, the accuracy values of dataset 2 (D2) are better in the fully shared model. A possible explanation for this pattern lies in the source of the datasets: unlike the other datasets, which consist of tweets, D2 comes from a different source of social media posts. When adversarial training is incorporated, performance improves for combinations of datasets that have common ground or features. However, when the combination includes datasets from different sources, the shared private adversarial model performs worse than the shared private model. The adversarial layer alters the knowledge attained by the shared layer so as to make the feature spaces of the shared and specific layers mutually exclusive, and this over-generalization causes a deterioration in performance. The fully shared adversarial model behaves similarly, but its accuracy is hampered more than that of the shared private adversarial model, making this pattern difficult to predict or interpret.

Table 5. Experimental results of Fully Shared-Adversarial and Shared Private-Adversarial models under multi-task settings with 2-dataset combinations (in a combination such as D3-D5, 1st and 2nd denote the performance on D3 and D5, respectively)

Dataset Combination | Fully Shared-Adv 1st | Fully Shared-Adv 2nd | Shared Private-Adv 1st | Shared Private-Adv 2nd
D1-D2 | 93.51 (+2.23) | 88.89 (+1.29) | 94.69 (+3.41) | 87.80 (+0.20)
D1-D3 | 93.67 (+2.39) | 83.30 (+0.41) | 94.96 (+3.68) | 85.50 (+2.61)
D1-D4 | 93.60 (+2.32) | 66.94 (+3.13) | 94.67 (+3.39) | 64.74 (+0.93)
D1-D5 | 93.28 (+2.00) | 73.01 (+2.96) | 94.71 (+3.43) | 75.00 (+4.95)
D1-D6 | 92.30 (+1.02) | 94.98 (+0.23) | 94.39 (+3.11) | 95.93 (+1.18)
D2-D3 | 89.95 (+2.35) | 83.28 (+0.39) | 88.51 (+0.91) | 84.17 (+1.28)
D2-D4 | 90.03 (+2.43) | 66.87 (+3.06) | 87.85 (+0.25) | 64.54 (+0.73)
D2-D5 | 89.74 (+2.14) | 73.24 (+3.19) | 88.01 (+0.44) | 72.85 (+2.80)
D2-D6 | 90.47 (+2.87) | 95.47 (+0.72) | 87.98 (+0.38) | 95.91 (+1.16)
D3-D4 | 84.05 (+1.16) | 66.83 (+3.02) | 84.78 (+1.89) | 64.77 (+0.96)
D3-D5 | 83.96 (+1.07) | 72.11 (+2.06) | 84.65 (+1.76) | 74.98 (+4.93)
D3-D6 | 84.02 (+1.13) | 95.50 (+0.75) | 84.71 (+1.82) | 95.95 (+1.20)
D4-D5 | 68.36 (+4.55) | 71.52 (+1.47) | 64.71 (+0.90) | 73.92 (+3.87)
D4-D6 | 66.91 (+3.10) | 95.83 (+1.08) | 64.47 (+0.66) | 96.66 (+1.91)
D5-D6 | 72.13 (+2.08) | 95.98 (+1.23) | 74.00 (+3.95) | 96.45 (+1.70)

5.3 RQ3: Dataset Combinations

From Tables 6 and 7, it can be observed that the improvement on each individual dataset compared to single-task learning becomes limited as the number of datasets increases (most of the time, a combination of two datasets performs better than a combination of three datasets). This could be due to the difficulty of generalizing the model across several datasets. The best performance is observed when using datasets of similar sizes and sources. An interesting insight is that datasets carrying information from different domains can significantly boost each other's performance. For example, datasets 1 and 6, which come from the same source, have samples emphasizing different domains: dataset 1, whose samples are largely offensive, gains shared knowledge about attacks on women and immigrants from dataset 6, while dataset 6 in turn learns knowledge of contrasting domains from dataset 1, which helps the model generalize to new samples.

Table 6. Fully Shared model performance with 3-dataset combinations (1st/2nd/3rd denote the first, second, and third dataset of each combination)

Dataset Combination | 1st | 2nd | 3rd
D1-D2-D3 | 92.27 (+0.99) | 89.72 (+2.12) | 83.44 (+0.55)
D1-D2-D4 | 92.25 (+0.97) | 89.86 (+2.26) | 68.31 (+4.50)
D1-D2-D6 | 92.21 (+0.93) | 89.82 (+2.22) | 95.06 (+0.31)
D1-D3-D4 | 92.35 (+1.07) | 82.95 (+0.06) | 68.74 (+4.93)
D1-D3-D5 | 91.97 (+0.69) | 83.05 (+0.16) | 71.15 (+1.10)
D1-D4-D5 | 91.83 (+0.55) | 69.20 (+5.39) | 70.95 (+0.90)
D2-D3-D5 | 90.05 (+2.45) | 83.41 (+0.52) | 71.60 (+1.55)
D2-D4-D6 | 90.01 (+2.41) | 66.88 (+3.07) | 95.17 (+0.42)
D3-D4-D5 | 83.40 (+0.51) | 67.52 (+3.71) | 71.15 (+1.10)
D4-D5-D6 | 67.38 (+3.57) | 71.20 (+1.15) | 94.90 (+0.15)

Table 7. Shared Private model performance with 3-dataset combinations (1st/2nd/3rd denote the first, second, and third dataset of each combination)

Dataset Combination | 1st | 2nd | 3rd
D1-D2-D3 | 94.67 (+3.39) | 88.70 (+1.10) | 84.33 (+1.44)
D1-D2-D4 | 94.57 (+3.29) | 88.45 (+0.85) | 65.02 (+1.21)
D1-D2-D6 | 94.59 (+3.31) | 88.53 (+0.93) | 95.02 (+0.27)
D1-D3-D4 | 94.45 (+3.17) | 83.80 (+0.91) | 64.64 (+0.83)
D1-D3-D5 | 95.05 (+3.77) | 83.64 (+0.75) | 72.24 (+2.19)
D1-D4-D5 | 94.49 (+3.21) | 63.94 (+0.13) | 72.67 (+2.62)
D2-D3-D5 | 88.78 (+1.18) | 83.49 (+1.20) | 72.22 (+2.17)
D2-D4-D6 | 88.51 (+0.91) | 64.55 (+0.74) | 95.77 (+1.02)
D3-D4-D5 | 84.05 (+1.16) | 64.42 (+0.61) | 73.43 (+3.38)
D4-D5-D6 | 64.67 (+0.86) | 73.31 (+3.26) | 95.88 (+1.13)

6 Conclusion and Future Work

In this paper, an attempt was made to create a hate speech detection model trained on different datasets. To improve the performance and generality of the model, multi-task learning was leveraged. With this methodology and a careful examination of the datasets, a robust model that identifies and prevents various domains of hate attacks can be built, creating a safer and more trustworthy space for users on social media. The contributions of the current work are twofold: (a) experiments conducted across different types of settings and models help us develop a multi-task system that can be trained on datasets from different domains and detect hate speech in a generalized manner; (b) studies on the effect of dataset combinations and of increasing the number of datasets in a multi-task setting improve the decision-making process when setting up new hate speech detection systems. In the future, we would like to work on multi-modal hate speech detection systems that can help monitor a wider range of social media.

Acknowledgements. The authors would like to acknowledge the support of the Ministry of Home Affairs (MHA), India, for conducting this research.

References

1. Badjatiya, P., Gupta, S., Gupta, M., Varma, V.: Deep learning for hate speech detection in tweets. In: Proceedings of the 26th International Conference on World Wide Web Companion, pp. 759–760 (2017)
2. Davidson, T., Warmsley, D., Macy, M., Weber, I.: Automated hate speech detection and the problem of offensive language. In: Proceedings of the International AAAI Conference on Web and Social Media, vol. 11, pp. 512–515 (2017)
3. Dinakar, K., Reichart, R., Lieberman, H.: Modeling the detection of textual cyberbullying. In: 2011 Proceedings of the International Conference on Weblog and Social Media. Citeseer (2011)
4. Do, H.T.T., Huynh, H.D., Van Nguyen, K., Nguyen, N.L.T., Nguyen, A.G.T.: Hate speech detection on Vietnamese social media text using the bidirectional-LSTM model. arXiv preprint arXiv:1911.03648 (2019)
5. Fortuna, P., Bonavita, I., Nunes, S.: Merging datasets for hate speech classification in Italian. In: EVALITA@CLiC-it (2018)
6. Fortuna, P., Nunes, S.: A survey on automatic detection of hate speech in text. ACM Comput. Surv. (CSUR) 51(4), 1–30 (2018)
7. de Gibert, O., Perez, N., García-Pablos, A., Cuadros, M.: Hate speech dataset from a white supremacy forum. In: Proceedings of the 2nd Workshop on Abusive Language Online (ALW2), Brussels, Belgium, October 2018, pp. 11–20. Association for Computational Linguistics (2018). https://doi.org/10.18653/v1/W18-5102. https://www.aclweb.org/anthology/W18-5102
8. Grave, E., Bojanowski, P., Gupta, P., Joulin, A., Mikolov, T.: Learning word vectors for 157 languages. arXiv preprint arXiv:1802.06893 (2018)
9. Maity, K., Saha, S.: BERT-capsule model for cyberbullying detection in code-mixed Indian languages. In: Métais, E., Meziane, F., Horacek, H., Kapetanios, E. (eds.) NLDB 2021. LNCS, vol. 12801, pp. 147–155. Springer, Cham (2021). https://doi.org/10.1007/978-3-030-80599-9_13
10. Maity, K., Saha, S., Bhattacharyya, P.: Emoji, sentiment and emotion aided cyberbullying detection in Hinglish. IEEE Trans. Comput. Soc. Syst. 10, 2411–2420 (2022)
11. Malik, J.S., Pang, G., van den Hengel, A.: Deep learning for hate speech detection: a comparative study. arXiv preprint arXiv:2202.09517 (2022)
12. Mandl, T., et al.: Overview of the HASOC track at FIRE 2019: hate speech and offensive content identification in Indo-European languages. In: Proceedings of the 11th Forum for Information Retrieval Evaluation, pp. 14–17 (2019)
13. Mehdad, Y., Tetreault, J.: Do characters abuse more than words? In: Proceedings of the 17th Annual Meeting of the Special Interest Group on Discourse and Dialogue, pp. 299–303 (2016)
14. Mikolov, T., Sutskever, I., Chen, K., Corrado, G.S., Dean, J.: Distributed representations of words and phrases and their compositionality. In: Advances in Neural Information Processing Systems, vol. 26 (2013)
15. Nockleby, J.T.: Hate speech in context: the case of verbal threats. Buff. L. Rev. 42, 653 (1994)
16. Garibo i Orts, Ò.: Multilingual detection of hate speech against immigrants and women in Twitter at SemEval-2019 task 5: frequency analysis interpolation for hate in speech detection. In: Proceedings of the 13th International Workshop on Semantic Evaluation, pp. 460–463 (2019)
17. Paul, S., Saha, S.: CyberBERT: BERT for cyberbullying identification. Multimed. Syst. 28, 1897–1904 (2020)
18. Pennington, J., Socher, R., Manning, C.D.: GloVe: global vectors for word representation. In: Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), pp. 1532–1543 (2014)
19. Preoţiuc-Pietro, D., Liu, Y., Hopkins, D., Ungar, L.: Beyond binary labels: political ideology prediction of Twitter users. In: Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pp. 729–740 (2017)
20. Reynolds, K., Kontostathis, A., Edwards, L.: Using machine learning to detect cyberbullying. In: 2011 10th International Conference on Machine Learning and Applications and Workshops, vol. 2, pp. 241–244. IEEE (2011)
21. Rizoiu, M.A., Wang, T., Ferraro, G., Suominen, H.: Transfer learning for hate speech detection in social media. arXiv preprint arXiv:1906.03829 (2019)
22. Simanjuntak, D.A., Ipung, H.P., Nugroho, A.S., et al.: Text classification techniques used to facilitate cyber terrorism investigation. In: 2010 Second International Conference on Advances in Computing, Control, and Telecommunication Technologies, pp. 198–200. IEEE (2010)
23. Talat, Z., Thorne, J., Bingel, J.: Bridging the gaps: multi task learning for domain transfer of hate speech detection. In: Golbeck, J. (ed.) Online Harassment. HIS, pp. 29–55. Springer, Cham (2018). https://doi.org/10.1007/978-3-319-78583-7_3
24. Waseem, Z., Hovy, D.: Hateful symbols or hateful people? Predictive features for hate speech detection on Twitter. In: Proceedings of the NAACL Student Research Workshop, San Diego, California, June 2016, pp. 88–93. Association for Computational Linguistics (2016). http://www.aclweb.org/anthology/N16-2013
25. Watanabe, H., Bouazizi, M., Ohtsuki, T.: Hate speech on Twitter: a pragmatic approach to collect hateful and offensive expressions and perform hate speech detection. IEEE Access 6, 13825–13835 (2018)
26. Zampieri, M., Malmasi, S., Nakov, P., Rosenthal, S., Farra, N., Kumar, R.: SemEval-2019 task 6: identifying and categorizing offensive language in social media (OffensEval). arXiv preprint arXiv:1903.08983 (2019)
27. Zimmerman, S., Kruschwitz, U., Fox, C.: Improving hate speech detection with deep learning ensembles. In: Proceedings of the Eleventh International Conference on Language Resources and Evaluation, LREC 2018 (2018)

Exploring Non-isometric Alignment Inference for Representation Learning of Irregular Sequences

Fang Yu, Shijun Li, and Wei Yu

School of Computer Science, Wuhan University, Wuhan 430072, Hubei, China
Institute of Artificial Intelligence, School of Computer Science, Wuhan University, Wuhan, China

Abstract. The development of Internet of Things (IoT) technology has led to increasingly diverse and complex data collection methods. This unstable sampling environment results in a large number of irregular monitoring data streams, posing significant challenges for related data analysis tasks. We observe that irregular sequences have uneven sampling densities, containing randomly occurring dense and sparse intervals. This data imbalance often leads to overfitting in the dense regions and underfitting in the sparse regions, ultimately impeding the representation performance of models. Conversely, the irregularity at the data level has limited impact on the deep semantics of sequences. Based on this observation, we propose a novel Non-isometric Alignment Inference Architecture (NAIA), which utilizes a multi-level semantic continuous representation structure based on inter-interval segmentation to learn representations of irregular sequences. This architecture efficiently extracts the latent features of irregular sequences. We evaluate the performance of NAIA on multiple datasets for downstream tasks and compare it with recent benchmark methods, demonstrating NAIA's state-of-the-art performance.

Keywords: Representation learning · Non-isometric Alignment Inference · Irregular sequences · Continuous latent representation

1 Introduction

This paper presents a Non-isometric Alignment Inference Architecture (NAIA) for self-supervised representation of irregular sequences. Irregular sequences exist in various scientific and engineering fields in the form of non-equidistant time series, including healthcare [1, 2], meteorology [3], financial markets [4], and industrial production [5]. For example, in mobile health monitoring [6], improper device wearing by users can lead to temporary interruptions in recording. Additionally, in smart device monitoring [7], sensors only sample when predefined triggering events occur. This random and irregular sampling results in variations in observation intervals and densities, generating irregular sequences. Irregularly sampled sequences pose challenges to deep learning models based on fully-observed, fixed-size feature representations [9]. On the one hand, irregular sequences introduce potential structural complexity to the corresponding neural networks, and the lack of temporal alignment in the data hinders the self-supervised training process. On the other hand, the advancement of hardware has led to a gradual increase in the sampling rates of various sensors, making models more prone to over-rely on patterns in dense regions while tending to neglect sparse data intervals. This can result in local overfitting or underfitting, consequently reducing the model's representational performance [8].

Existing research primarily focuses on constructing specialized network structures for such irregular sequences [10, 11]. The recently proposed non-isometric time series imputation autoencoder (mTAN) [12] achieves more advanced sequence classification and imputation performance by utilizing attention mechanisms. Gaussian process regression, as a classical machine learning algorithm, can represent the variance of input samples through posterior probability inference [13]; however, its computational complexity is O(n^3), and it lacks a deep neural network structure, making it difficult to extract deep-level semantics from input sequences.

The main contributions of the NAIA model are as follows:
• It maps the input data to a continuous latent function in the latent space, enabling any query time related to the input to generate feedback information in the latent space. This achieves continuous representation and temporal alignment for irregular sequences.
• By designing the JS module embedded in the latent space, NAIA allows irregular intervals to generate boundaries based on the stability of their own latent distributions. The model only needs to train on randomly drawn samples from each interval, enabling lightweight training and mitigating the local overfitting caused by uneven data sampling.
• Two paths are specifically designed for the propagation of time information. They inject query information into NAIA from both the front-end and the back-end of the latent space, enabling the model to be trained under query guidance and enhancing the performance of representation learning.

2 Related Work

The most classical model in time series tasks is the Autoregressive Integrated Moving Average (ARIMA). Its simple and effective statistical properties and its theoretical architecture based on the Box-Jenkins methodology [14] make it widely applicable to various forecasting and reconstruction tasks. However, when coping with today's big-data tasks, its ability to extract the deep semantics of time series is limited, and ARIMA does not adapt well to non-isometric time series. On the other hand, filter-based methods infer sequence values at new time points from historical data; for example, Han-Gyu Kim et al. [15] use recurrent neural networks to infer and impute missing data in a sequence. However, this method can only impute data before the observation point.


Among probability-based methods, Gaussian process regression (GPR) [13] is an important architecture for dealing with irregular sequences. GPR can output an analytically tractable joint posterior distribution based on the non-isometric structure of an irregular sequence, and its covariance matrix converts the non-isometric structure of the observed samples into an interpolation uncertainty measure. One problem with GPR is that the positive-definite constraint on its covariance matrix can hinder performance in a multivariate setting. A standard solution is to construct GPR over multiple sequence dimensions using separable temporal kernel functions [17]. However, this construction requires each dimension to share the same kernel function, which hinders interpolation performance and significantly reduces efficiency [18].

Among self-attention mechanisms, various variants of the Transformer [19] have achieved significant performance gains in many deep learning tasks, sparking interest in multi-headed self-attention [22]. The attention mechanism is effective for capturing long-term dependencies in data, and research has focused on using it for long-term time series forecasting and anomalous pattern recognition, with notable advancements in these areas. However, it still has some limitations, including squared time/memory complexity and susceptibility to error accumulation caused by various decoder variants. Various new variants, represented by Informer [16], were then proposed; they improve the Transformer architecture, reduce its complexity, and enhance the capture of semantic patterns in time series [23–25]. In 2022, a new multi-head attention network was proposed that makes significant improvements in the performance of irregular-sequence representations based on the Transformer [12]. This method uses a multi-headed temporal cross-attention encoder to embed time values.
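The posterior-variance behaviour of GPR mentioned above is easy to see with an off-the-shelf GP fitted to irregularly spaced time points; the following is a generic scikit-learn illustration (not the MTGP construction of [17]), with all values synthetic:

```python
import numpy as np
from sklearn.gaussian_process import GaussianProcessRegressor
from sklearn.gaussian_process.kernels import RBF

# Irregularly spaced observation times and noisy values.
t_obs = np.sort(np.random.uniform(0, 10, size=25)).reshape(-1, 1)
y_obs = np.sin(t_obs).ravel() + 0.05 * np.random.randn(25)

gpr = GaussianProcessRegressor(kernel=RBF(length_scale=1.0), alpha=1e-2).fit(t_obs, y_obs)

# The posterior standard deviation grows inside sparsely sampled gaps,
# reflecting the uncertainty induced by the non-isometric sampling.
t_query = np.linspace(0, 10, 200).reshape(-1, 1)
mean, std = gpr.predict(t_query, return_std=True)
```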

3 Non-isometric Alignment Inference

NAIA is a mechanism capable of aligning inference and providing continuous representation for non-uniform sequences and their intervals. This continuous latent representation enables NAIA to accommodate irregular inputs and generate corresponding reconstructions or predictions at any given query time.

3.1 Preliminaries

The irregular sequence X(T) can be decomposed into N sub-intervals along the time dimension, and the i-th sub-interval is

$$x(t_{i,1:j_i}) = \{x(t_{i,1}), x(t_{i,2}), \ldots, x(t_{i,j_i})\} \subset X(T) \qquad (1)$$

where $x(t_{i,1:j_i})$ represents the i-th sub-interval consisting of $j_i$ samples arranged in chronological order. Assuming $z_i$ is a latent distribution of $x(t_{i,1:j_i})$, we aim to approximate $z_i$ with a posterior probability that follows a relatively stable latent probability distribution:

$$q\big(z_i \mid t_{i,1:j_i},\, x(t_{i,1:j_i})\big) \sim d\big(x(t_{i,1:j_i})\big) \qquad (2)$$

The symbol q represents the posterior probability, $z_i$ represents the latent representation corresponding to $x(t_{i,1:j_i})$, and $d(x(t_{i,1:j_i}))$ represents the latent distribution of the data $x(t_{i,1:j_i})$ in the sub-interval. When $d(x(t_{i,1:j_i}))$ remains stable, the corresponding time interval $t_{i,1:j_i}$ is a stable sub-interval.

The representation of irregular sequences using data within stable sub-intervals faces three challenges. Firstly, the latent representation in neural networks is discrete, and only when $z_i$ is in continuous form can it provide feedback output for any given query time. Secondly, irregular sequences need to be automatically partitioned into stable intervals as the input data changes. Lastly, feeding the samples within sub-intervals to a neural network rests on the fully-observed, fixed-size assumption, which irregular sequences do not satisfy.

3.2 Theoretical Analysis

We train the model by setting queries to achieve representation learning. For irregular sequences, the only label available is the value corresponding to the query time, making the entire process self-supervised. The prior association between the query value and the observed data can be expressed as follows:

$$\max\; p\big(x(t_{i,1:j_i}+\lambda_i) \mid t_{1,1:j_1},\ldots,t_{i,1:j_i},\, t_{i,1:j_i}+\lambda_i,\, x(t_{1,1:j_1}),\ldots,x(t_{i,1:j_i})\big) \qquad (3)$$

Here, p denotes a probability and $\lambda_i$ is the time increment required to reach the query time from the current time. By integrating over the latent variable and taking the logarithm of Eq. (3), we obtain

$$\max \log \int_z q\big(z \mid t_{1,1:j_1},\ldots,t_{i,1:j_i},\, t_{i,1:j_i}+\lambda_i,\, x(t_{1,1:j_1}),\ldots,x(t_{i,1:j_i}+\lambda_i)\big)\cdot \log p\big(x(t_{i,1:j_i}+\lambda_i) \mid t_{1,1:j_1},\ldots,t_{i,1:j_i}+\lambda_i,\, x(t_{1,1:j_1}),\ldots,x(t_{i,1:j_i})\big)\, dz \qquad (4)$$

For formal simplicity of the inference process, let $t_i = (t_{i,1}, t_{i,2}, \ldots, t_{i,j_i})$, which gives

$$\log p[x(t_i+\lambda_i) \mid t_{1:i}, t_i+\lambda_i, x(t_{1:i})] = \underbrace{\mathbb{E}_{q[z \mid t_{1:i}, t_i+\lambda_i, x(t_{1:i}), x(t_i+\lambda_i)]}\!\left[\log \frac{p[z, x(t_i+\lambda_i) \mid t_{1:i}, t_i+\lambda_i, x(t_{1:i})]}{q[z \mid t_{1:i}, t_i+\lambda_i, x(t_{1:i}), x(t_i+\lambda_i)]}\right]}_{\mathrm{ELBO}} + D_{\mathrm{KL}}\big(q[z \mid t_{1:i}, t_i+\lambda_i, x(t_{1:i}), x(t_i+\lambda_i)] \,\big\|\, p[z \mid t_{1:i}, t_i+\lambda_i, x(t_{1:i}), x(t_i+\lambda_i)]\big) \qquad (5)$$

where $D_{\mathrm{KL}}$ stands for the KL-divergence and ELBO stands for the evidence lower bound, i.e., the expectation term of Eq. (5). Since $D_{\mathrm{KL}} \ge 0$, maximizing Eq. (4) is equivalent to maximizing this ELBO.

The query time may break through the current stable interval into the next adjacent interval, and we use the JS-divergence constructed on the time increment to measure the consistency of the interval distribution:

$$\mathrm{JS}\big(d(x(t_{i,1:j_i})) \,\big\|\, d(x(t_{i,1:j_i+\lambda_i}))\big) = \tfrac{1}{2} D_{\mathrm{KL}}\!\left(d(x(t_{i,1:j_i})) \,\Big\|\, \tfrac{d(x(t_{i,1:j_i})) + d(x(t_{i,1:j_i+\lambda_i}))}{2}\right) + \tfrac{1}{2} D_{\mathrm{KL}}\!\left(d(x(t_{i,1:j_i+\lambda_i})) \,\Big\|\, \tfrac{d(x(t_{i,1:j_i})) + d(x(t_{i,1:j_i+\lambda_i}))}{2}\right) > \theta \qquad (6)$$

where JS denotes the JS-divergence, d(·) denotes the prior distribution of the data and satisfies Eq. (2), and θ denotes the interval stability parameter, which specifies how strongly additional data may affect the consistency of the interval distribution. Equation (6) uses the JS-divergence to measure the consistency of the distribution of the queried data $x(t_{i,1:j_i+\lambda_i})$ relative to the current data $x(t_{i,1:j_i})$ under the effect of the time increment. NAIA partitions intervals using the latent distribution in order to avoid local feature traps and to distinguish stable intervals at the level of high-level semantics. In addition, the JS-divergence is symmetric and provides a uniform metric for distribution differences. When the difference between two intervals satisfies Eq. (6), the query time deviates significantly from the original distribution and consistency no longer holds, so a new stable interval is created to accommodate $x(t_{i,1:j_i+\lambda_i})$:

$$x(t_{i,1:j_i+\lambda_i}) \xrightarrow{\text{set up}} x(t_{i+1,1:j_{i+1}}), \qquad \mathrm{JS} > \theta \qquad (7)$$

where θ denotes the distribution-difference threshold. When the JS-divergence in Eq. (6) is less than or equal to θ, the distribution difference between the new data and the original data is considered minor, and the original interval absorbs the data at the incremented time, i.e.,

$$x(t_{i,1:j_i+\lambda_i}) \xrightarrow{\text{absorb}} x(t_{i,1:j_i+\lambda_i}), \qquad \mathrm{JS} \le \theta \qquad (8)$$
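As a concrete illustration of the rule in Eqs. (6)–(8), the decision can be computed from discretized versions of the two latent densities. The discretization onto a shared grid and the helper names below are assumptions of this sketch; θ matches the value used later in the experiments (0.05).

```python
import numpy as np

def _normalize(p, eps=1e-12):
    # Turn a non-negative histogram/density evaluation into a proper distribution.
    p = np.asarray(p, dtype=float) + eps
    return p / p.sum()

def kl_divergence(p, q):
    # KL(p || q) for discrete distributions over the same support.
    return float(np.sum(p * np.log(p / q)))

def js_divergence(p, q):
    p, q = _normalize(p), _normalize(q)
    m = 0.5 * (p + q)
    return 0.5 * kl_divergence(p, m) + 0.5 * kl_divergence(q, m)

def assign_interval(d_current, d_extended, theta=0.05):
    # Eq. (7): start a new stable interval if the extended interval's latent
    # distribution deviates from the current one by more than theta;
    # Eq. (8): otherwise the current interval absorbs the new samples.
    return "set up" if js_divergence(d_current, d_extended) > theta else "absorb"
```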

Once the interval partition is complete, we factorize the ELBO into an easily optimized form:

$$\max \mathrm{ELBO} = \max\; \mathbb{E}_{q[z \mid t_{1:i}, t_i+\lambda_i, x(t_{1:i}), x(t_i+\lambda_i)]}\!\left\{\sum_{j=j_i}^{j_i+\lambda_i} \log p\big(x(t_{i,j}+\lambda_i) \mid t_i, z\big)\right\} - \min\; D_{\mathrm{KL}}\big(q[z \mid t_{1:i}, t_i+\lambda_i, x(t_{1:i}), x(t_i+\lambda_i)] \,\big\|\, p[z \mid t_{1:i}, x(t_{1:i})]\big) \qquad (9)$$

In Eq. (9), the ELBO is decomposed into two terms. The former is the expectation of the prior query value with respect to the variational posterior of z; it is not analytically tractable but can be approximated by neural networks. The z in the latent space is a state-continuous functional representation, meaning it can receive a $\lambda_i$ of arbitrary size. The latter part is a KL-divergence between the probabilities q and p. Equation (9) aligns the non-isometric loss function: maximizing the ELBO reduces the difference between the incremental-time reconstruction and the query value, while minimizing the KL-divergence makes the reconstruction probability at the incremented time obey the density function $d_i$, which acts as a regularizer in Eq. (9).

3.3 Non-isometric Alignment Inference Architecture Network

We design a complete inference architecture based on Non-isometric Alignment Inference (NAIA), which approximates the expectation part of the ELBO and outputs query values based on the query times in the interval. In the training phase, the self-supervised signal is the query time, which comes from the next samples of the sequence. The latent space preserves a continuous functional representation z. In the forecasting or reconstruction phase, any query time can be chosen, since z can output the corresponding density function for any query. The computational diagram of NAIA is constructed according to Eq. (9).

Fig. 1. Computational diagram of NAIA

As shown in Fig. 1, the time $t_{i,1:j_i}$ and the corresponding values constitute the observed data pair of the i-th set of inputs to NAIA, while $t_{i,1:j_i}+\lambda_i$ is used as the query information input. They are compressed by their respective fully connected neural networks into abstract representations r_h and r_q. The information of these two representations is fused by an additive operation into a weak query representation r_w for semantic association. r_w is encoded by a neural network into the latent functional representation z, which then outputs a probability density function d_i according to the weak query semantics. Sampling this density function produces a sampling representation s aligned with the query time. Finally, r_w, s, and the query time $t_{i,1:j_i}+\lambda_i$ are concatenated and fed to the decoder to reconstruct the sequence corresponding to the query time; the operator ⊕ denotes the concatenation operation. The query data $x(t_{i,1:j_i}+\lambda_i)$ participate in the training of NAIA as the self-supervised signal, and whether the output data break out of the interval $T_i$ is decided by the JS module on the dashed path according to Eq. (7). When the input sample does not break the i-th interval, the self-supervised signal is kept as $x(t_{i,1:j_i}+\lambda_i)$; otherwise, the self-supervised signal is changed to $x(t_{i+1,1:j_{i+1}})$.

The query time undergoes neural network encoding and additive fusion on the first path. Its semantics is weakened, but the hierarchy becomes deeper, favoring a more restrained alignment of the observed temporal semantics toward the self-supervised signal. r_w is again compressed through fully connected layers into the latent functional representation z; this process corresponds to the expected probability q in Eq. (9). The second path injects the query time directly into the decoder. Here the query time is not processed by a neural network, preserving shallow temporal semantics; this query serves as a constraint on the supervised signal, forcing the output density function of the functional representation to be close to the latent distribution of the query values.

When the model is trained, a portion of the sequence in the current interval is sampled. Samples that break the distribution-consistency measure are assigned to the next interval by the JS module, so that the samples retained in each interval represent the distribution of that interval. The input of the i-th interval corresponds to the conditional part of the prior in Eq. (4). The queries are incremental in time, and the sequence values corresponding to the query times are set as self-supervised signals; in this way, each current input is supervised by the value at the next query time induced by the query. To achieve temporal alignment in deep semantics, we use r_w = r_h ⊕ r_q to realize the semantic association between the observed-data representation and the query representation, where r_w is called the weak query representation. The functional representation z outputs a density function d_i in the latent space based on the weak query representation r_w, and d_i is sampled to obtain the representation sample s. Since z inherits the deep weak semantics of the query, s is weakly aligned with the self-supervised signal. NAIA fuses the query, the sampling representation s, and the weak query representation into r by the concatenation operation r = (t + λ) ⊕ s ⊕ r_w, and finally the decoder reconstructs the query value. The decoder also serves as a framework interface for NAIA and can be replaced by different types of neural networks; since this work focuses on the inference architecture, the most common fully connected neural network is used here.
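The forward pass and an Eq. (9)-style objective can be sketched in PyTorch as follows. This is a minimal reading of Fig. 1 under several simplifying assumptions: the observed pairs are mean-pooled into r_h, the latent density d_i is taken to be a diagonal Gaussian, and the KL term is computed against a standard-normal prior rather than the data-dependent prior p[z | t_{1:i}, x(t_{1:i})] of Eq. (9); all class and function names are illustrative, not the authors' code.

```python
import torch
import torch.nn as nn

class NAIASketch(nn.Module):
    # r_h: representation of observed (time, value) pairs; r_q: representation of the
    # query time (path 1, deep weak semantics); r_w = r_h + r_q (additive fusion);
    # z ~ N(mu(r_w), sigma(r_w)) plays the role of the latent density d_i;
    # the decoder receives [t + lambda, s, r_w], with the raw query time injected
    # directly (path 2, shallow temporal semantics).
    def __init__(self, x_dim, hidden=64, latent=32):
        super().__init__()
        self.obs_enc = nn.Sequential(nn.Linear(1 + x_dim, hidden), nn.ReLU(),
                                     nn.Linear(hidden, hidden))
        self.query_enc = nn.Sequential(nn.Linear(1, hidden), nn.ReLU(),
                                       nn.Linear(hidden, hidden))
        self.to_stats = nn.Linear(hidden, 2 * latent)   # mean and log-variance of d_i
        self.decoder = nn.Sequential(nn.Linear(1 + latent + hidden, hidden), nn.ReLU(),
                                     nn.Linear(hidden, x_dim))

    def forward(self, t_obs, x_obs, t_query):
        # t_obs: (B, J, 1), x_obs: (B, J, x_dim), t_query: (B, 1)
        r_h = self.obs_enc(torch.cat([t_obs, x_obs], dim=-1)).mean(dim=1)  # (B, hidden)
        r_q = self.query_enc(t_query)                                      # (B, hidden)
        r_w = r_h + r_q                                                    # weak query rep.
        mu, logvar = self.to_stats(r_w).chunk(2, dim=-1)
        s = mu + torch.randn_like(mu) * torch.exp(0.5 * logvar)            # sample from d_i
        x_hat = self.decoder(torch.cat([t_query, s, r_w], dim=-1))
        return x_hat, mu, logvar

def naia_loss(x_hat, x_query, mu, logvar):
    # Eq. (9)-style objective: reconstruction term at the query time plus a
    # KL regularizer (here against a standard-normal prior, a simplification).
    recon = ((x_hat - x_query) ** 2).sum(dim=-1).mean()
    kl = (-0.5 * (1 + logvar - mu.pow(2) - logvar.exp()).sum(dim=-1)).mean()
    return recon + kl
```

Swapping `obs_enc` for a recurrent encoder would correspond to the NAIA-R configuration described in the experiments, while halving the decoder depth corresponds to NAIA-D/2.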

4 Experiments

The performance of deep neural networks in representation learning cannot be tested directly. As a general-purpose inference architecture, NAIA can be applied to various downstream tasks, so we explore its representation performance on five real-world datasets through sequence forecasting and anomaly detection. The Electricity dataset [20, 21] contains the electricity consumption of 321 customers. SMD [27] comes from a large technology company; the data contain 38 dimensions with a true anomaly ratio of 0.042. PSM [28] is a time series of 26 application-server nodes recorded by eBay Inc. MSL [29] has 55 dimensions from NASA's Mars Science Laboratory rover, with a true anomaly ratio of 0.105. SWaT [30] covers 51 sensors from a continuously operating water infrastructure system, with a true anomaly ratio of 0.121.

The optimizer is ADAM [31], with the exponential decay factors of the first- and second-order moments set to 0.87 and 0.995, respectively, and the learning rate set to 0.001. Each dataset is randomly divided into 80% training and 20% test data. The interval stability parameter θ is set to 0.05, and the deviation parameter h is set to 1 by default. We evaluate three neural network configurations of NAIA. NAIA: all neural network interfaces use fully connected networks. NAIA-R: sequences are embedded with an RNN instead of a fully connected network. NAIA-D/2: the number of layers in the decoder of the default configuration is halved, to examine the effect of interfering with the decoder on the performance of the functional representation.
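The stated training configuration maps directly onto a standard PyTorch optimizer call; in the sketch below the model and dataset are runnable placeholders rather than the actual NAIA network and benchmark data:

```python
import torch
from torch.utils.data import TensorDataset, random_split

# Placeholders so the configuration runs end to end; in practice these would be
# the NAIA network and an irregular-sequence dataset.
model = torch.nn.Linear(8, 1)
dataset = TensorDataset(torch.randn(100, 8))

# ADAM with the stated first-/second-moment decay factors and learning rate.
optimizer = torch.optim.Adam(model.parameters(), lr=1e-3, betas=(0.87, 0.995))

# Random 80%/20% train/test split and the interval stability parameter.
n_train = int(0.8 * len(dataset))
train_set, test_set = random_split(dataset, [n_train, len(dataset) - n_train])
theta = 0.05
```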


The experiments compare against the following baselines. MTGP [17], based on Gaussian process regression (GPR), builds a temporal model for each dimension by creating a temporal-representation kernel and a task-representation kernel and taking their Hadamard product. GRU-D [32] is a recurrent neural network redesigned around gating units, which decomposes the input into three variables (value, mask, and time interval) to make inference more adequate. VAE-RNN is based on the variational auto-encoder architecture [26]. ODE-RNN-ODE [30] is a neural-differential-equation approach that uses ODEs to construct an encoder and decoder for the continuous representation of samples. Informer [16] is based on a self-attention mechanism and improves predictive power for long-term dependence by modifying the attention mechanism.

4.1 Forecasting Performance

In this section, the sequence forecasting task is used to validate the representation performance and timeliness of the model, with MSE as the metric. The relative training time at optimal performance is used as the timeliness metric, with NAIA's elapsed time set to 100 units. During the training phase, part of each dataset is randomly masked with an overall masking rate m (the proportion of masked data in the total data) to generate irregular sequences and examine the model's ability to adapt to them.

In Table 1, m denotes the overall masking ratio. When the masking ratio is 0 (the input sequence is equally spaced and complete), NAIA does not show a significant advantage; for all other masking ratios, NAIA maintains the lead, and NAIA-R achieves the best performance across all masking ratios. A targeted design of the encoders can therefore improve the forecasting performance of NAIA, and NAIA's neural network interfaces give the architecture some extensibility. NAIA-D/2 does not show any significant performance degradation compared to NAIA, which indicates that NAIA's performance comes mainly from the embedding and representation modules rather than the decoder.

4.2 Anomaly Detection Performance

Anomaly detection is another downstream task that indirectly demonstrates the representational performance of the model. We perform anomaly detection on four real datasets with existing anomalous data and mask the sequences in the same manner as described in Sect. 4.1. The model is required to reconstruct the removed data based on contextual patterns, and anomalous patterns are detected by examining the reconstruction MSE error (threshold set to 0.03). The performance of anomaly detection is measured using the F1 score, and the experimental results are shown in Table 2.

Table 2 shows that the Informer model with self-attention performs best on all four datasets at m = 0. However, NAIA beats the other baselines when irregular sequences exist (m > 0), especially with the NAIA-R structure. NAIA is also less affected by increasing masking ratios than the other methods: its F1 score drops only slightly as m grows. This is because NAIA uses representative samples to handle non-equally spaced input and determines the stable interval from the latent distribution output by the latent functional representation. Masking at the original-sequence level therefore has little impact on the latent distribution of the stable interval in which the data lies, and the latent distribution of the stable interval fills in local data gaps within the interval at a higher semantic level.
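The detection rule just described can be written compactly; the following is a small illustrative sketch (array shapes and the use of scikit-learn's f1_score are assumptions, not part of the paper):

```python
import numpy as np
from sklearn.metrics import f1_score

def detect_anomalies(x_true, x_recon, threshold=0.03):
    # Per-timestep reconstruction MSE over the feature dimension;
    # points whose error exceeds the threshold are flagged as anomalous.
    errors = np.mean((x_true - x_recon) ** 2, axis=-1)
    return (errors > threshold).astype(int)

# Hypothetical usage with ground-truth anomaly labels:
# preds = detect_anomalies(x_true, x_recon)
# print("F1:", f1_score(labels, preds))
```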

Table 1. Forecasting performance on Electricity (MSE as %)

Strategy | m = 0 | m = 0.2 | m = 0.4 | Elapsed time
MTGP | 52.65 ± 0.37 | 57.79 ± 0.16 | 63.36 ± 0.02 | 2.9*
GRU-D | 41.32 ± 0.86 | 44.73 ± 0.60 | 46.52 ± 0.69 | 428.3
VAE-RNN | 42.09 ± 0.27 | 46.91 ± 0.18 | 49.58 ± 0.71 | 470.5
ODE-RNN-ODE | 39.26 ± 0.30 | 41.02 ± 0.49 | 47.67 ± 0.35 | 245.3
Informer | 28.92 ± 0.38 | 39.22 ± 0.52 | 45.09 ± 0.63 | 735.9
NAIA | 33.25 ± 0.28 | 34.18 ± 0.58 | 39.83 ± 0.82 | 100.0
NAIA-R | 31.04 ± 0.47 | 34.56 ± 0.39* | 36.45 ± 0.98* | 131.64
NAIA-D/2 | 36.52 ± 0.56 | 35.51 ± 0.47 | 40.74 ± 0.20 | 98.27

Table 2. Anomaly detection performance (F1 as %); for each dataset the three values correspond to masking ratios m = 0, 0.2, and 0.4

Strategy | SMD (m=0 / 0.2 / 0.4) | PSM (m=0 / 0.2 / 0.4) | MSL (m=0 / 0.2 / 0.4) | SWaT (m=0 / 0.2 / 0.4)
MTGP | 57.25 / 53.21 / 47.42 | 70.20 / 69.74 / 62.53 | 68.97 / 65.32 / 52.28 | 45.73 / 43.17 / 38.42
GRU-D | 62.47 / 59.23 / 55.01 | 68.65 / 64.11 / 57.28 | 69.18 / 68.12 / 62.52 | 57.95 / 56.06 / 50.21
VAE-RNN | 59.53 / 56.39 / 51.22 | 69.31 / 65.09 / 61.40 | 72.57 / 71.34 / 61.21 | 54.24 / 51.79 / 47.63
ODE-RNN-ODE | 75.59 / 72.08 / 63.70 | 85.88 / 81.64 / 76.57 | 86.33 / 82.10 / 75.35 | 81.29 / 78.23 / 70.05
Informer | 88.35 / 82.83 / 72.47 | 92.32 / 83.28 / 79.94 | 89.02 / 82.42 / 77.16 | 85.59 / 80.87 / 76.49
NAIA | 84.19 / 83.98 / 80.72 | 87.79 / 85.57 / 84.95 | 85.07 / 84.71 / 84.35 | 83.62 / 82.65 / 81.87
NAIA-R | 85.93 / 85.62 / 81.06 | 90.13 / 87.25 / 85.11 | 88.28 / 86.17 / 85.92 | 85.20 / 84.20 / 83.81
NAIA-D/2 | 83.76 / 83.55 / 80.34 | 85.08 / 83.78 / 81.04 | 82.42 / 83.26 / 82.75 | 81.49 / 79.56 / 79.30

This is because NAIA uses representative samples to handle non-equally-spaced input and determines the stable interval from the latent distribution of the output of the latent function representation. Therefore, masking at the original sequence level has little impact on the latent distribution of the stable interval in which the data lies: the latent distribution of the stable interval fills in the local data gaps within the interval at a higher semantic level.

4.3 Ablation Experiments

We explore the contribution of key components to model performance through ablation experiments. The performance metric remains consistent with Sect. 4.2. The functional representation (FP) is responsible for receiving sample functions and transforming them continuously into density functions. We then remove the first path (P1) and the second path (P2) of the query times separately to examine the effect of missing deep/shallow semantic paths on the model. Finally, we remove the stability interval constraints (SIC) based on JS-divergence discrimination, allowing the sequence to mark query times in an uninhibited manner.


The experimental results are shown in Table 3.

Table 3. Ablation results (F1 as %)

| Strategy | SMD m=0.2 | SMD m=0.4 | PSM m=0.2 | PSM m=0.4 | MSL m=0.2 | MSL m=0.4 | SWaT m=0.2 | SWaT m=0.4 |
| FP-      | 64.29     | 56.51     | 73.98     | 57.19     | 63.35     | 59.41     | 68.22      | 52.05      |
| P1-      | 70.49     | 68.78     | 71.75     | 69.43     | 74.28     | 65.20     | 70.12      | 67.32      |
| P2-      | 73.67     | 70.21     | 75.92     | 71.95     | 77.47     | 68.09     | 74.28      | 66.50      |
| SIC-     | 79.76     | 71.65     | 82.90     | 77.92     | 78.38     | 72.37     | 79.55      | 70.97      |
| NAIA     | 83.98     | 80.72     | 85.57     | 84.95     | 82.42     | 77.16     | 82.65      | 81.87      |

As shown in Table 3 (minus signs in the first column denote removal of the corresponding component), the F1 score decreases the most when FP is removed. This setting is also the most sensitive to the masking ratio: increasing the masking ratio further degrades the F1 score. This phenomenon indicates that the continuous property of the functional representation plays a vital role in the semantic alignment of irregular sequences. The absence of path 1 has a more significant impact on performance than path 2, indicating that the deep weak semantics are more important for the model representation than the shallow semantics. Even so, the absence of only path 2 still significantly reduces the F1 score, implying the effectiveness of the interplay mechanism between deep and shallow semantics.

5 Conclusion

This paper introduces NAIA, a method for approximating the distribution among intervals of irregular sequences. By constructing continuous latent function representations, we capture the temporal semantic information of sequences. NAIA approximates the semantic associations between observed sequences and queries as density functions and aligns the sampling points of irregular sequences with query times, enabling self-supervised learning of the model. The computational architecture of NAIA extracts deep weak semantics and shallow strong semantics of query times. These semantics are injected into the pre- and post-stages of the latent representation, replacing the self-attention mechanism and reducing the representation complexity of irregular sequences to O(n). Experimental evaluations on five datasets demonstrate that NAIA achieves state-of-the-art performance compared to four recent baselines. Future research directions include further exploring the representation mechanisms of latent functions in the latent space and finding more suitable output methods for sequence distributions.


References 1. Marlin, B.M., Kale, D.C., Khemani, R.G., Wetzel, R.C.: Unsupervised pattern discovery in electronic health care data using probabilistic clustering models. In: Proceedings of the 2nd ACM SIGHIT International Health Informatics Symposium, pp. 389–398 (2012) 2. Yadav, P., Steinbach, M., Kumar, V., Simon, G.: Mining electronic health records (EHRs) a survey. ACM Comput. Surv. (CSUR). 50, 1–40 (2018) 3. Schulz, M., Stattegger, K.: SPECTRUM: spectral analysis of unevenly spaced paleoclimatic time series. Comput. Geosci. 23, 929–945 (1997) 4. Dogariu, M., Stefan, ¸ L.-D., Boteanu, B.A., Lamba, C., Kim, B., Ionescu, B.: Generation of realistic synthetic financial time-series. ACM Trans. Multimedia Comput. Commun. Appl. 18, 1–27 (2022) 5. Cheng, P., et al.: Asynchronous fault detection observer for 2-D markov jump systems. IEEE Trans Cybern. 52, 13623–13634 (2021) 6. Cheng, L.-F., Stück, D., Quisel, T., Foschini, L.: The Impact of Missing Data in UserGenerated mHealth Time Series 7. Song, X., Sun, P., Song, S., Stojanovic, V.: Event-driven NN adaptive fixed-time control for nonlinear systems with guaranteed performance. J. Franklin Inst. 359, 4138–4159 (2022) 8. Fang, G., et al.: Up to 100x faster data-free knowledge distillation. In: Proceedings of the AAAI Conference on Artificial Intelligence, pp. 6597–6604 (2022) 9. Jiang, Y., Yin, S., Kaynak, O.: Performance supervised plant-wide process monitoring in industry 4.0: a roadmap. IEEE Open J. Ind. Electron. Soc. 2, 21–35 (2021) 10. Horn, M., Moor, M., Bock, C., Rieck, B., Borgwardt, K.: Set functions for time series. In: International Conference on Machine Learning, pp. 4353–4363. PMLR (2020) 11. Tan, Q., et al.: DATA-GRU: dual-attention time-aware gated recurrent unit for irregular multivariate time series. In: Proceedings of the AAAI Conference on Artificial Intelligence, pp. 930–937 (2020) 12. Narayan Shukla, S., Marlin, B.M.: Multi-time attention networks for irregularly sampled time series. arXiv e-prints. arXiv-2101 (2021) 13. Rasmussen, C.E., Williams, C.K.I.: Gaussian Processes for Machine Learning. The MIT Press (2005). https://doi.org/10.7551/mitpress/3206.001.0001 14. Box, G.E.P., Jenkins, G.M., Reinsel, G.C., Ljung, G.M.: Time Series Analysis: Forecasting and Control. John Wiley & Sons (2015) 15. Kim, H.-G., Jang, G.-J., Choi, H.-J., Kim, M., Kim, Y.-W., Choi, J.: Recurrent neural networks with missing information imputation for medical examination data prediction. In: 2017 IEEE International Conference on Big Data and Smart Computing (BigComp), pp. 317–323. IEEE (2017) 16. Zhou, H., et al.: Informer: beyond efficient transformer for long sequence time-series forecasting. In: Proceedings of the AAAI conference on artificial intelligence, pp. 11106–11115 (2021) 17. Bonilla, E. V, Chai, K., Williams, C.: Multi-task Gaussian process prediction. Adv Neural Inf Process Syst. 20 (2007) 18. Shukla, S.N., Marlin, B.: Interpolation-prediction networks for irregularly sampled time series. In: International Conference on Learning Representations (2018) 19. Vaswani, A., et al.: Attention is all you need. In: Advances in Neural Information Processing Systems, vol. 30, (2017) 20. Trindade, A.: ElectricityLoadDiagrams20112014. Data Set, UCI Machine Learning Repository (2015) 21. Wang, K., et al.: Multiple convolutional neural networks for multivariate time series prediction. Neurocomputing 360, 107–119 (2019)


22. Zhang, J., Li, X., Tian, J., Luo, H., Yin, S.: An integrated multi-head dual sparse self-attention network for remaining useful life prediction. Reliab. Eng. Syst. Saf. 233, 109096 (2023) 23. Liu, S., et al.: Pyraformer: low-complexity pyramidal attention for long-range time series modeling and forecasting. In: International Conference on Learning Representations (2021) 24. Wu, H., Xu, J., Wang, J., Long, M.: Autoformer: decomposition transformers with autocorrelation for long-term series forecasting. Adv. Neural. Inf. Process. Syst. 34, 22419–22430 (2021) 25. Zhou, T., Ma, Z., Wen, Q., Wang, X., Sun, L., Jin, R.: Fedformer: frequency enhanced decomposed transformer for long-term series forecasting. In: International Conference on Machine Learning, pp. 27268–27286. PMLR (2022) 26. Fabius, O., van Amersfoort, J.R.: Variational recurrent auto-encoders. arXiv e-prints. arXiv1412 (2014) 27. Su, Y., Zhao, Y., Niu, C., Liu, R., Sun, W., Pei, D.: Robust anomaly detection for multivariate time series through stochastic recurrent neural network. In: Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp. 2828–2837 (2019) 28. Abdulaal, A., Liu, Z., Lancewicki, T.: Practical approach to asynchronous multivariate time series anomaly detection and localization. In: Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining, pp. 2485–2494 (2021) 29. Hundman, K., Constantinou, V., Laporte, C., Colwell, I., Soderstrom, T.: Detecting spacecraft anomalies using LSTMs and nonparametric dynamic thresholding. In: Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp. 387–395 (2018) 30. Shen, L., Li, Z., Kwok, J.: Timeseries anomaly detection using temporal hierarchical one-class network. Adv. Neural. Inf. Process. Syst. 33, 13016–13026 (2020) 31. Zhang, Z.: Improved adam optimizer for deep neural networks. In: 2018 IEEE/ACM 26th International Symposium on Quality of Service (IWQoS), pp. 1–2. IEEE (2018) 32. Che, Z., Purushotham, S., Cho, K., Sontag, D., Liu, Y.: Recurrent neural networks for multivariate time series with missing values. Sci. Rep. 8, 6085 (2018)

Retrieval-Augmented GPT-3.5-Based Text-to-SQL Framework with Sample-Aware Prompting and Dynamic Revision Chain Chunxi Guo, Zhiliang Tian(B) , Jintao Tang, Shasha Li, Zhihua Wen, Kaixuan Wang, and Ting Wang(B) College of Computer, National University of Defense Technology, Changsha, China {chunxi,tianzhiliang,tangjintao,shashali,zhwen,wangkaixuan18, tingwang}@nudt.edu.cn

Abstract. Text-to-SQL aims at generating SQL queries for the given natural language questions and thus helping users to query databases. Prompt learning with large language models (LLMs) has emerged as a recent approach, which designs prompts to lead LLMs to understand the input question and generate the corresponding SQL. However, it faces challenges with strict SQL syntax requirements. Existing work prompts the LLMs with a list of demonstration examples (i.e. question-SQL pairs) to generate SQL, but the fixed prompts can hardly handle the scenario where the semantic gap between the retrieved demonstration and the input question is large. In this paper, we propose a retrieval-augmented prompting method for an LLM-based Text-to-SQL framework, involving sample-aware prompting and a dynamic revision chain. Our approach incorporates sample-aware demonstrations, which include the composition of SQL operators and fine-grained information related to the given question. To retrieve questions sharing similar intents with input questions, we propose two strategies for assisting retrieval. Firstly, we leverage LLMs to simplify the original questions, unifying the syntax and thereby clarifying the users’ intentions. To generate executable and accurate SQLs without human intervention, we design a dynamic revision chain that iteratively adapts fine-grained feedback from the previously generated SQL. Experimental results on three Text-to-SQL benchmarks demonstrate the superiority of our method over strong baseline models. Keywords: Large language model

· Text-to-SQL · Prompt learning

1 Introduction

Text-to-SQL task aims to convert natural language question (NLQ) to structured query language (SQL), allowing non-expert users to obtain desired information from databases [1,2]. As databases are popular in various scenarios involving different domains (e.g., education and financial systems, etc.), it is desirable to train


a model that generalizes well across multiple domains. To facilitate cross-domain generalization [3,4], researchers adapt encoder-decoder architecture [5,6], reducing the requirement for specific domain knowledge via end-to-end training. These approaches require diverse and extensive training data to train the model, which is prohibitively expensive [7]. Recent progress focuses on large language models (LLMs) (e.g., GPT-3 [8], Codex [9] and GPT-4 [10]) with prompt learning [11], which refers to using specific prompts or instructions to generate desired responses. Rajkumar et al. [12] and Liu et al. [13] evaluate several prompt learning baselines for Text-to-SQL tasks. Their findings show that though it is natural for LLMs to generate text sequences, generating SQL is still a challenge due to the SQL’s strict syntax requirements. To address these issues, inspired by few-shot learning [11], existing work employs prompting the LLMs with a list of demonstration examples (i.e. question-SQL pairs) to generate SQL queries. However, they typically rely on manual labour to create static demonstration examples tailored to specific tasks. DIN-SQL [14] selects pre-defined samples from each category, SELF-DEBUGGING [15] explains the code to LLMs but without explanation demonstration. These methods employ a static demonstration, meaning that the demonstration examples provided to LLMs are fixed and do not adapt or change across different examples. These static demonstration examples hardly adapt to the scenarios where the semantic gap between retrieved demonstrations and the input question is large, which is called retrieval bias [16], commonly appearing in the retrieval-augmented generation. Inspired by [17], we argue that providing dynamic demonstrations can be adaptive to specific samples and schema for SQL generation. Dynamic examples enable the SQL generation to accommodate various scenarios. By adjusting to specific instances, demonstrations can be customized to incorporate the necessary query structure, logical operations, and question semantics. This adaptability makes it easier to generate SQL queries that are relevant and appropriate for various situations. In this paper, we propose retrieval-augmented prompts for an LLM-based Text-to-SQL model, which contains sample-aware prompting and a dynamic revision chain. Specifically, we propose to retrieve similar SQL queries to construct prompts with sample-aware demonstration examples. Notice that users often ask questions in different expressions, even if they have the same intention and SQL query. It makes the model hard to retrieve helpful examples. To solve this issue, we propose to extract the question’s real intention via two strategies: Firstly, we simplify original questions through LLMs to clarify the user’s intentions and unify the syntax for retrieval. Secondly, we extract question skeletons for retrieving items with similar question intents. To produce executable and accurate SQL, we design a dynamic revision chain, generating SQL queries by iteratively adapting to fine-grained feedback according to the previous version of the generated SQL. The feedback includes SQL execution results, SQL explanations, and related database contents. This dynamic chain manages to generate


executable and accurate SQL through automatic interaction between the language model and the database without human intervention. Our contributions are as follows: (1) We develop a retrieval-augmented framework for Text-to-SQL tasks by prompting LLMs with sample-aware demonstrations. (2) We propose a dynamic revision chain, which adapts to the previously generated SQL with fine-grained feedback. (3) Our method outperforms strong baseline models on three Text-to-SQL benchmarks.

2 Related Work

2.1 Encoder-Decoder SQL Generation

SQL generation tasks have achieved significant advancements through the utilization of encoder-decoder architectures [2]. On the encoder side, Guo et al. [18] proposed IRNET, using an attention-based Bi-LSTM for encoding and an intermediate-representation-based decoder for SQL prediction. Later, [19,20] introduced graph-based encoders to construct schema graphs and improve input representations. Works such as RATSQL [1], SDSQL [5], LGESQL [21], S2SQL [22], R2SQL [23], SCORE [24], and STAR [25] further improved structural reasoning by modelling relations between schemas and questions. GRAPHIX-T5 [6] overcomes the limitations of previous methods by incorporating graph representation learning in the encoder. Concurrently, RASAT [26] also provided T5 with structural information by adding edge embeddings into multi-head self-attention. On the decoder side, we divide the methods into four categories: sequence-based methods (BRIDGE [27], PICARD [3]) directly translate the NLQ into the SQL query token by token; template-based methods (X-SQL [28], HydraNet [29]) employ predefined templates to regulate SQL generation and ensure structural coherence; stage-based methods (GAZP [30], RYANSQL [31]) first establish a coarse-grained SQL frame and then fill in the missing details via slot-filling; and hierarchical methods (IRNet [32], RAT-SQL [1]) generate SQL according to grammar rules in a top-down manner, resulting in a tree-like structure.

2.2 LLM-Based SQL Generation

LLM-based models have recently emerged as a viable option for this task [12,13]. To utilize them effectively, it is important to design appropriate in-context demonstrations [33] and chain-of-thought (CoT) [34] strategies that can elicit their abilities [7]. In terms of searching for demonstrations, DIN [14] selects a fair number of demonstration examples from each category (e.g. simple classes, non-nested complex classes and nested complex classes), but they are fixed. Moreover, Guo et al. [35] adaptively retrieve intention-similar SQL demonstration examples through de-semanticization of the questions. However, none of these methods can handle the ambiguous and varied questioning of realistic scenarios.


Fig. 1. Framework overview: The left half shows retrieval repository construction in three steps. The top three sentences are three specific instances each. The Green dashed box presents the training set. The right half is a dynamic revision chain with SQL queries generated by LLM iterations as nodes (green boxes). The output of steps 2 and 4 are collectively referred to as fine-grained feedback. (Color figure online)

As for the CoT prompting strategy, DIN-SQL [14] follows a least-to-most [36] prompting method, decomposing the Text-to-SQL task into subtasks and solving them one by one. Pourreza et al. and Chen et al. explore self-correction [14,15], where the LLM explains the question and the SQL, providing valuable feedback for improvement. Tian et al. [37] propose interactive generation with editable step-by-step explanations, combining human intervention with LLM generation to refine the final SQL output. Additionally, Sun et al. [38] explore execution-based self-consistency prompting methods. Nonetheless, creating task-specific demonstration examples [14,15,37,38] demands manual labour. Inspired by retrieval-related research [16,39,40], we develop a retrieval-augmented framework for Text-to-SQL tasks. Our method works through automatic interaction between the LLMs and the databases without human intervention. Moreover, self-explanation and simple feedback alone [14,15] are weak at uncovering errors for correction. Our approach takes into account three aspects of fine-grained feedback, which interact with each other to create effective feedback.

3 Methodology

Our framework consists of two modules as shown in Fig. 1: (1) Retrieval Repository: (see Sect. 3.1) We construct a retrieval repository with simplified questions added and then use question skeletons to retrieve sample-aware SQL demonstration examples. (2) Dynamic Revision Chain: (see Sect. 3.2) We further revise the generated SQL queries by adding fine-grained feedback.

3.1 Retrieval Repository

We construct a retrieval repository consisting of multiple key-value retrieval items, where the keys represent the question skeletons and the values are k sample-aware SQL queries. These processes enable us to generate demonstration examples that showcase the desired behaviours of the LLM. Our method involves: (1) Simplifying original questions to unify various questioning styles (see Sect. 3.1.1). (2) Extracting question skeletons to construct a retrieval repository (see Sect. 3.1.2). (3) Retrieving SQL queries according to skeleton similarities (see Sect. 3.1.3). 3.1.1 Question Simplification We simplify natural language questions by prompting the LLM with instructions. In this way, we can avoid the frustration of unusual questioning styles and enhance the syntax and wording variety in the repository. Specifically, we construct a prompt template prompt(.): “Replace the words as far as possible to simplify the question, making it syntactically clear, common and easy to understand: [QUESTION] ”, where “[QUESTION] ” represents the original natural language question. We then obtain the simplified question by feeding prompt(Q) into the LLM. We maintain a consistent temperature setting in the language model to ensure that all simplified sentences exhibit the same probability distribution. 3.1.2 Question Skeleton Extraction We then extract question skeletons, including both original questions and simplified questions. We follow the method proposed by Guo et al. [35] to obtain question skeletons. This process removes specific schema-related tokens from the questions, focusing solely on the structure and intent. Finally, we take the (question skeleton, SQL) pairs from the training set and store them in the retrieval repository. Note that the number of samples in the retrieval repository is twice as large as the training set, due to the addition of the simplified samples. Let Dtrain represent the training set, and R denotes the retrieval repository. The original natural language question is denoted as Qo , while Qr represents the simplified question. The question skeletons are denoted as So and Sr for the original and simplified questions, respectively. We formalize the composition of the retrieval repository as follows: R = {(So , SQL), (Sr , SQL) | (Qo , SQL) ∈ Dtrain }. 3.1.3 Sample Retrieval The retrieval process searches for the most similar question skeletons and returns their corresponding SQL queries from the retrieval repository. This search is based on the semantic similarity between the skeleton of the new question and the items’ keys in R.


Specifically, given a new question Q̂o, we first obtain its simplified sentence Q̂r, and their corresponding question skeletons Ŝo and Ŝr, following the same method used in the previous two subsections (see Sects. 3.1.1 and 3.1.2). Then we calculate the cosine similarity scores ŝo between the semantic vector of the question skeleton Ŝo and all question skeletons S in R. Similarly, we also compute the cosine similarity scores ŝr for the simplified question skeleton Ŝr, using the formula ŝ = cos(f(Ŝ), f(S)), where f(.) represents an off-the-shelf semantic encoder¹. Here, Ŝ will be instantiated as Ŝo and Ŝr, and ŝ will be instantiated as ŝo and ŝr, correspondingly. From these scores, we select the top-k retrieval samples with the highest rankings. Let k1 and k2 denote the number of samples retrieved from the original question skeleton Ŝo and the simplified question skeleton Ŝr respectively, such that k = k1 + k2. We then concatenate the k samples to form a demonstration example as input to the LLM. Our retrieval repository offers LLMs sample-aware SQL examples, which display a more practical answer space.
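A minimal sketch of the skeleton-based retrieval just described, assuming SBERT-style sentence embeddings; the checkpoint name, the flat lists standing in for the repository R, and the helper names are illustrative, not the authors' implementation.

```python
import numpy as np
from sentence_transformers import SentenceTransformer

encoder = SentenceTransformer("all-MiniLM-L6-v2")   # placeholder SBERT checkpoint

def top_k_sql(query_skeleton, repo_skeletons, repo_sqls, k):
    """Return the SQL queries whose skeleton keys are most similar to the query skeleton."""
    q = encoder.encode([query_skeleton], normalize_embeddings=True)     # (1, d)
    keys = encoder.encode(repo_skeletons, normalize_embeddings=True)    # (n, d)
    sims = (keys @ q.T).ravel()                                         # cosine similarity on unit vectors
    return [repo_sqls[i] for i in np.argsort(-sims)[:k]]

def retrieve_demonstrations(s_orig, s_simp, orig_keys, simp_keys, sqls, k1=4, k2=4):
    """k = k1 + k2 sample-aware demonstrations: k1 via original skeletons, k2 via simplified ones."""
    return top_k_sql(s_orig, orig_keys, sqls, k1) + top_k_sql(s_simp, simp_keys, sqls, k2)
```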

3.2 Dynamic Revision Chain

We employ LLMs to generate an initial SQL query, and then we iteratively revise the generated SQL queries based on fine-grained feedback, forming a dynamic revision chain. The dynamic revision chain consists of the SQL queries generated by the LLM iteration as nodes and the prompts provided to the LLM as edges. With minimal human intervention, LLMs interact with databases to generate accurate and executable SQL queries in two stages of the dynamic revision chain: (1) assembling prompt based on the fine-grained feedback (see Sect. 3.2.1), and (2) generating SQL via iterative prompting (see Sect. 3.2.2). 3.2.1 Fine-Grained Feedback We collect three fine-grained pieces of information based on the SQL generated in the previous iteration. The intuition is that various information hampers LLMs’ focus, so they struggle to extract necessary data from extensive and complex databases. Thus, we should progressively narrow down the scope and prioritize the most likely information. The fine-grained feedback in our approach consists of three aspects of information: (1) Execution Error Feedback: We feed the SQL query generated by LLM into the database engine (i.e. SQLite) for execution. We then obtain the error messages reported during the execution and add them to the prompt. It checks whether the predicted SQL can be executed correctly, and reports the specifics of the error (e.g. “no such table: [TABLE] ”, “no such function: YEAR”, “misuse of aggregate: COUNT()”). By incorporating the execution error messages into the prompt, LLM can learn from its errors. This helps to generate queries that follow the SQL syntax rules. 1

We utilize SBERT [41] in our experiment.


(2) Natural Language Explanation: We prompt the LLM with instructions, converting the SQL predicted in the previous iteration back into its corresponding natural language expression. Specifically, we construct the instruction: “What does this SQL query mean? What are the differences between the predicted meaning and the question meanings above?”. The LLM identifies semantic gaps and fills them by explaining the meaning of its own generated SQL and comparing it to the meaning of the original question. (3) Related Database Contents: We provide the LLM with content details about the database tables and columns involved in the SQL queries predicted in the previous iteration, including the possible values involved in the question. This allows the LLM to simulate execution and thus generate more contextually relevant and accurate SQL queries. Overall, the fine-grained feedback approach aims to enable LLMs to learn from their mistakes, understand the meaning of the SQL queries they generate, and use contextual information in the database to generate more accurate and relevant SQL queries. By addressing these challenges and focusing on the important aspects, the methodology helps the LLM better extract the necessary data from complex databases and improves the quality of its query generation.

3.2.2 Iterative SQL Generation
Based on prompts with fine-grained feedback, the LLM iteratively generates SQL queries. The intuition for iterative generation is that one round of fine-grained feedback might not catch all mistakes, whereas multiple rounds of feedback are more likely to get progressively closer to the gold answer. Specifically, in each iteration we concatenate the three fine-grained feedback components with the previously generated SQL and feed them into the LLM. We then obtain a new SQL query and collect new fine-grained feedback based on it, and so on. Let SQL_prev^(i) denote the SQL query generated in the previous step and SQL_curr^(i) the current one. The fine-grained feedback components are represented as F_error^(i) for execution error feedback, F_NL^(i) for the natural language explanation, and F_DB^(i) for the related database contents. At each iteration i, the LLM generates a new SQL query SQL_curr^(i) by incorporating the fine-grained feedback components:

SQL_curr^(i) = LLM(SQL_prev^(i), F_error^(i), F_NL^(i), F_DB^(i)).

After executing SQL_curr^(i) with the database engine, we obtain the result R_prev^(i) from the previous iteration and R_curr^(i) from the current iteration. To avoid infinite loops, we set a maximum number of iterations N_max. The termination condition is defined as R_prev^(i) = R_curr^(i) or i = N_max. This control mechanism ensures that the generated SQL queries converge to an optimal and executable solution within a reasonable timeframe.


In this iterative feedback loop, we enable a dynamic interaction between the LLM and the database engine, maximizing the generation of executable SQL without extensive human intervention.
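The control flow of this loop can be sketched as follows. Here `llm`, `execute_sql`, and `related_db_content` are hypothetical helpers used only to make the structure explicit, and n_max = 8 matches the setting reported later in Sect. 4.1; this is an illustration, not the authors' code.

```python
def dynamic_revision_chain(question, demonstrations, db, llm, execute_sql,
                           related_db_content, n_max=8):
    """Iteratively revise the generated SQL with fine-grained feedback until the
    execution result stops changing or n_max iterations are reached."""
    sql = llm(demonstrations + [question])          # initial SQL from the sample-aware prompt
    prev_result = None
    for _ in range(n_max):
        result, error = execute_sql(db, sql)        # F_error: execution result / error message
        if error is None and result == prev_result:
            break                                   # termination: result unchanged
        explanation = llm(["What does this SQL query mean? "
                           "What are the differences between the predicted meaning "
                           "and the question meanings above?", question, sql])   # F_NL
        db_content = related_db_content(db, sql)    # F_DB: tables/columns/values touched by the SQL
        sql = llm([question, sql, error or "", explanation, db_content])
        prev_result = result
    return sql
```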

4 Experiments

4.1 Experimental Setup

4.1.1 Setting
We evaluate our method on text-davinci-003, which offers a balance between capability and availability. We apply FAISS [42] for storing the question skeletons and for efficient retrieval, following Guo et al. [35]. For the initial simplification of questions, we set temperature τ = 1.0. When generating SQL samples, we set temperature τ = 0.5. For the number of retrieval samples, we assign k1 = 4 and k2 = 4. The maximum number of iterations N is 8.

4.1.2 Datasets
(1) Spider [43] is a large-scale benchmark of cross-domain Text-to-SQL across 138 different domain databases. (2) Spider-Syn [44] is a challenging variant based on Spider that eliminates explicit alignment between questions and database schema by synonym substitutions. (3) Spider-DK [45] is also a variant dataset based on Spider with artificially added domain knowledge.

4.1.3 Evaluation
We consider two key metrics: execution accuracy (EX) and test-suite accuracy (TS) [46]. EX measures the accuracy of the execution results by comparing them with those of the standard SQL queries, while TS measures whether the SQL passes all EX evaluations for multiple tests generated by database augmentation. Note that EX is the most direct indication of model performance in Text-to-SQL, although it contains false positives. Exact match evaluation is not performed, as multiple correct SQL queries exist for one question. We use the official TS evaluation procedure, while for EX we slightly modify the evaluation procedure due to the need to decouple the fine-tuning-based models for independent evaluation.

4.1.4 Baselines
We compare to two groups of methods. Fine-Tuning T5-3B Baselines: PICARD [3] is a technique that constrains auto-regressive decoders in language models through incremental parsing; RASAT [26] incorporates relation-aware self-attention into transformer models while also utilizing constrained auto-regressive decoders; and RESDSQL [5] introduces a ranking-enhanced encoding and skeleton-aware decoding framework to effectively separate schema linking and skeleton parsing.


Prompting LLMs Baselines: As for the large language models, we use two variants of the Codex family [9,12] (Davinci and Cushman), PaLM-2 [38,47], the GPT-4 model [10,14] and the ChatGPT model [13]. In addition to simple baseline assessments of these models, we choose several recent LLM-based works. DIN [14] decomposes the Text-to-SQL task into sub-tasks (schema linking, query classification and decomposition, SQL generation, and self-correction) and then performs few-shot prompting with GPT-4 [10]. SELF-DEBUGGING [15] adds error messages to the prompt and conducts multiple rounds of few-shot prompting for self-correction. Few-shot SQL-PaLM [38] adopts an execution-based self-consistency prompting approach.
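For reference, execution accuracy as defined in Sect. 4.1.3 can be sketched as follows; this is a simplified stand-in rather than the official evaluation script, and it treats results as order-insensitive multisets.

```python
import sqlite3

def execution_match(db_path, pred_sql, gold_sql):
    """EX: a prediction counts as correct when its execution result matches that of the gold SQL."""
    conn = sqlite3.connect(db_path)
    try:
        pred = conn.execute(pred_sql).fetchall()
        gold = conn.execute(gold_sql).fetchall()
    except sqlite3.Error:
        return False                      # unexecutable predictions count as wrong
    finally:
        conn.close()
    return sorted(pred, key=repr) == sorted(gold, key=repr)   # compare as unordered result sets
```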

4.2 Main Results

4.2.1 Performance on Spider Dataset
Table 1 shows how our method performs on Spider compared to baseline methods. Across all three datasets, our method achieves the highest execution accuracy (EX) and test-suite accuracy (TS).

Table 1. Performance comparison on Spider with various methods. "-" indicates that the results are not available. Schema indicates that the prompt contains the SQL for creating the database tables (i.e. tables, columns, value and its type).

| Model         | Method                   | EX          | TS          |
| T5-3B         | T5-3B + PICARD [3]       | 79.3        | 69.4        |
| T5-3B         | RASAT + PICARD [26]      | 80.5        | 70.3        |
| T5-3B         | RESDSQL-3B + NatSQL [5]  | 84.1        | 73.5        |
| Codex-cushman | Few-shot [12]            | 61.5        | 50.4        |
| Codex-cushman | Few-shot + Schema [12]   | 63.7        | 53.0        |
| Codex-davinci | Few-shot [12]            | 60.8        | 51.2        |
| Codex-davinci | Few-shot + Schema [12]   | 67.0        | 55.1        |
| Codex-davinci | Few-shot [14]            | 71.0        | 61.5        |
| Codex-davinci | DIN-SQL [14]             | 75.6        | 69.9        |
| Codex-davinci | SELF-DEBUGGING [15]      | 84.1        | -           |
| PaLM2         | Few-shot SQL-PaLM [38]   | 82.7        | 77.3        |
| PaLM2         | Fine-tuned SQL-PaLM [38] | 82.8        | 78.2        |
| GPT-4         | Zero-shot [14]           | 72.9        | 64.9        |
| GPT-4         | Few-shot [14]            | 76.8        | 67.4        |
| GPT-4         | DIN-SQL [14]             | 82.8        | 74.2        |
| ChatGPT       | Zero-shot [13]           | 70.1        | 60.1        |
| Text-davinci  | Zero-shot                | 73.1        | 71.6        |
| Text-davinci  | Ours                     | 85.0 (0.9↑) | 83.2 (5.9↑) |


Our method exhibits strong performance on test-suite accuracy, exceeding the next-best fine-tuning and prompting results by 9.7% and 5.9% respectively. In terms of EX, ours outperforms the next-best method in both fine-tuning and prompting by 0.9%.

Comparison with Zero-Shot Prompting Models: Across all three metrics, ours surpasses the Codex, ChatGPT and even GPT-4 models utilizing zero-shot prompting, even though they employ the prescribed format outlined in the official guidelines (footnote 5). This indicates that although LLMs are trained using a specific format, their acquired competencies become internalized and can subsequently be applied within more flexible formats.

Comparison with Few-Shot Prompting Models: Ours also outperforms all models in the few-shot setting. The closest EX performance to ours is SELF-DEBUGGING, which also uses an iterative prompting strategy, but we still outperform it by 0.9%. Notice that, of the two methods with similar few-shot prompting on the Codex-davinci model, the latter performs 10% better than the former in both EX and TS. This indicates that the selection of demonstration examples (easy, non-nested complex, and nested complex classes) [14] plays a significant role. Our adaptive sample-aware method is 8.2% more effective than this static demonstration, which suggests that incorporating more effective prompts is crucial for LLMs to understand new specific tasks.

4.2.2 Performance on Spider-SYN and DK Datasets
Table 2 shows that our method is significantly more robust than the baseline methods on the Spider variants. On Spider-SYN, ours improves by 4.5% on EX and 12.6% on TS. Surprisingly, ours improves by 13.6% over the previous SOTA on Spider-DK.

4.3 Various Difficulty Levels Analysis

As shown in Table 3, we evaluate our effectiveness at various difficulty levels, which are determined by the number of SQL keywords used, the presence of nested sub-queries, and the utilization of column selections or aggregations. The results show that ours outperforms the other models at all levels except for the easy level, where it is worse than SQL-PaLM. The improvement in performance with increasing difficulty levels indicates that our model's strengths become more pronounced as the queries become more challenging. This suggests that our model excels in handling complex SQL queries.

4.4 Ablation Study

Figure 2 compares the model with and without each of the two modules at four complexity levels. It shows that the exclusion of either module leads to a

Footnote 5: https://platform.openai.com/examples/default-sqltranslate.


Table 2. Evaluation of our method on Spider-SYN and Spider-DK datasets. "-" indicates that the results are not available.

SPIDER-SYN
|             | Method                   | EX          | TS           |
| Fine-tuning | T5-3B + PICARD [3]       | 69.8        | 61.8         |
| Fine-tuning | RASAT + PICARD [26]      | 70.7        | 62.4         |
| Fine-tuning | RESDSQL-3B + NatSQL [5]  | 76.9        | 66.8         |
| Prompting   | ChatGPT [13]             | 58.6        | 48.5         |
| Prompting   | Text-davinci             | 60.7        | 60.3         |
| Prompting   | Few-shot SQL-PaLM [38]   | 74.6        | 67.4         |
| Prompting   | Fine-tuned SQL-PaLM [38] | 70.9        | 66.4         |
| Prompting   | Ours                     | 81.4 (4.5↑) | 80.0 (12.6↑) |

SPIDER-DK
|             | Method                   | EX           | TS |
| Fine-tuning | T5-3B + PICARD [3]       | 62.5         | -  |
| Fine-tuning | RASAT + PICARD [26]      | 63.9         | -  |
| Fine-tuning | RESDSQL-3B + NatSQL [5]  | 66.0         | -  |
| Prompting   | ChatGPT [12]             | 62.6         | -  |
| Prompting   | Text-davinci             | 66.2         | -  |
| Prompting   | Few-shot SQL-PaLM [38]   | 66.5         | -  |
| Prompting   | Fine-tuned SQL-PaLM [38] | 67.5         | -  |
| Prompting   | Ours                     | 81.1 (13.6↑) | -  |

Table 3. Test-suite accuracy at various complexity levels on Spider. The first four rows are from [38], which is described as execution accuracy data in the original paper [14] but is actually test-suite accuracy data.

| Prompting                        | Easy | Medium | Hard | Extra | All  |
| Few-shot (CodeX-davinci) [14]    | 84.7 | 67.3   | 47.1 | 26.5  | 61.5 |
| Few-shot (GPT-4) [14]            | 86.7 | 73.1   | 59.2 | 31.9  | 67.4 |
| DIN-SQL (CodeX-davinci) [14]     | 89.1 | 75.6   | 58.0 | 38.6  | 69.9 |
| DIN-SQL (GPT-4) [14]             | 91.1 | 79.8   | 64.9 | 43.4  | 74.2 |
| Few-shot SQL-PaLM (PaLM2) [38]   | 93.5 | 84.8   | 62.6 | 48.2  | 77.3 |
| Fine-tuned SQL-PaLM (PaLM2) [38] | 93.5 | 85.2   | 68.4 | 47.0  | 78.2 |
| Ours (Text-davinci)              | 91.9 | 88.6   | 75.3 | 63.9  | 83.2 |


decrease in performance at all levels of difficulty, especially at the hard and extra levels. The decreases in model performance are similar for the w/o revise and w/o simplify settings. Note that both modules of our method are most effective on Spider-DK's easy level, which requires additional domain knowledge, improving it by 13.6% each. This suggests that the simplification strategy and the dynamic revision chain strategy help with a variety of generalisation issues.

Fig. 2. Ablation study of our model components at various complexity levels across three datasets. w/o simplify refers to using a direct retrieval of the question skeletons rather than a strategy of simplifying questions. w/o revise refers to removing the dynamic revision chain module.

We found that removing the simplification module results in a significant drop in model performance, particularly on the DK dataset, where the overall drop is 12.5%. The impact at different difficulty levels is, in descending order, extra, hard, easy, and medium. This is possibly because the model can incorporate more external knowledge as a supplementary description when simplifying, especially when more SQL components are involved. Note that removing the simplification module affects easy-level problems more than medium-level ones, probably because the execution accuracy of easy-level problems is already high and short sentences are more likely to cause ambiguity. Without the revision module, model performance suffers more as the difficulty level increases. On Spider-DK the model performance decreases by 11.0%, especially on the easy and extra levels, by 13.6% and 13.3% respectively. As higher difficulty levels require more knowledge, this suggests that the fine-grained feedback in the revision module effectively complements the domain knowledge required for SQL generation.

4.5 Iterative Round Analysis

From Fig. 3, we observe that the major improvement comes from the first two iteration rounds. In addition to the 4.6% improvement in the first iteration on Spider, the two datasets studied for generalisability, Spider-DK and Spider-SYN, also show a slight improvement in accuracy in the second iteration over the first. This indicates that iterative feedback of fine-grained information from the dynamic revision chain helps to deal with more complex generalisation problems, much as multi-step reasoning progressively derives the target answer.

Fig. 3. Analysis of dynamic SQL revision chain with different numbers of iteration rounds across three datasets: Spider, Spider-SYN, and Spider-DK.

4.6 Case Study

To demonstrate our model, Fig. 4 compares the SQL queries predicted by ChatGPT [12], DIN-SQL [14], SQL-PaLM [38] and our method. In the first example, since the question explicitly mentions "French", the general models are confused about the exact value of the column "citizenship" even if they pick it out. Note that a SQL query must match the exact word mentioned to find the correct answer. Our approach provides the exact value of the database content involved in the first fine-grained iteration, which leads to the gold answer. The second example requires the selection of only one item, whereas DIN-SQL and SQL-PaLM both select two. ChatGPT incorrectly uses the aggregate function COUNT(), which in this case is required in conjunction with GROUP BY. Our approach self-corrects the error in the second fine-grained iteration by explaining the generated SQL in natural language.


Fig. 4. Two illustrative cases from Spider [43]. Blue-coloured text is the correct generation, while the red-coloured text indicates the wrong generation. On the right hand side,  means correct SQL while × means wrong. (Color figure online)

5 Conclusion

We propose retrieval-augmented prompts for an LLM-based Text-to-SQL model. By utilizing sample-aware prompting and a dynamic revision chain, we address the challenge of retrieving helpful examples and adapting the generated SQL based on fine-grained feedback. Experimental results on three Text-to-SQL benchmarks demonstrate the effectiveness of our method. Acknowledgements. Research on this paper was supported by National Natural Science Foundation of China (Grant No. 62306330).

References 1. Wang, B., Shin, R., Liu, X., Polozov, O., Richardson, M.: RAT-SQL: relation-aware schema encoding and linking for text-to-SQL parsers. ACL (2020) 2. Cai, R., Xu, B., Zhang, Z., Yang, X., Li, Z., Liang, Z.: An encoder-decoder framework translating natural language to database queries. In: IJCAI, July 2018 (2018) 3. Scholak, T., Schucher, N., Bahdanau, D.: PICARD: parsing incrementally for constrained auto-regressive decoding from language models. In: EMNLP (2021) 4. Cai, R., Yuan, J., Xu, B., Hao, Z.: SADGA: structure-aware dual graph aggregation network for text-to-SQL. In: NIPS, December 2021 (2021) 5. Li, H., Zhang, J., Li, C., Chen, H.: Decoupling the skeleton parsing and schema linking for text-to-SQL. arXiv arXiv:2302.05965 (2023) 6. Li, J., Hui, B., et al.: Graphix-T5: mixing pre-trained transformers with graphaware layers for text-to-SQL parsing. arXiv arXiv:2301.07507 (2023) 7. Zhao, W.X., Zhou, K., Li, J., et al.: A survey of large language models. arXiv preprint arXiv:2303.18223 (2023) 8. Brown, T., Mann, B., Ryder, N., et al.: Language models are few-shot learners. In: NIPS, vol. 33, pp. 1877–1901 (2020) 9. Chen, M., Tworek, J., Jun, H., Yuan, Q., de Oliveira Pinto, H.P., et al.: Evaluating large language models trained on code. arXiv arXiv:2107.03374 (2021)


10. OpenAI: GPT-4 technical report (2023) 11. Liu, P., Yuan, W., Fu, J., Jiang, Z., Hayashi, H., Neubig, G.: Pre-train, prompt, and predict: a systematic survey of prompting methods in natural language processing. ACM Comput. Surv. 55(9), 1–35 (2023) 12. Rajkumar, N., Li, R., Bahdanau, D.: Evaluating the text-to-SQL capabilities of large language models. arXiv arXiv:2204.00498 (2022) 13. Liu, A., Hu, X., Wen, L., Yu, P.S.: A comprehensive evaluation of ChatGPT’s zero-shot text-to-SQL capability. arXiv arXiv:2303.13547 (2023) 14. Pourreza, M., Rafiei, D.: DIN-SQL: decomposed in-context learning of text-to-SQL with self-correction. arXiv preprint arXiv:2304.11015 (2023) 15. Chen, X., Lin, M., Sch¨ arli, N., Zhou, D.: Teaching large language models to selfdebug. arXiv preprint arXiv:2304.05128 (2023) 16. Song, Y., et al.: Retrieval bias aware ensemble model for conditional sentence generation. In: ICASSP, pp. 6602–6606. IEEE (2022) 17. Min, S., et al.: Rethinking the role of demonstrations: what makes in-context learning work? arXiv preprint arXiv:2202.12837 (2022) 18. Guo, J., et al.: Towards complex text-to-SQL in cross-domain database with intermediate representation. arXiv preprint arXiv:1905.08205 (2019) 19. Bogin, B., Berant, J., Gardner, M.: Representing schema structure with graph neural networks for text-to-SQL parsing. In: ACL, September 2019 (2019) 20. Chen, Z., et al.: ShadowGNN: graph projection neural network for text-to-SQL parser. In: NAACL, June 2021 (2021) 21. Cao, R., Chen, L., et al.: LGESQL: line graph enhanced text-to-SQL model with mixed local and non-local relations. In: ACL, July 2021 (2021) 22. Hui, B., Geng, R., Wang, L., et al.: S2 SQL: injecting syntax to question-schema interaction graph encoder for text-to-SQL parsers. In: ACL, pp. 1254–1262 (2022) 23. Hui, B., Geng, R., Ren, Q., et al.: Dynamic hybrid relation exploration network for cross-domain context-dependent semantic parsing. In: AAAI, May 2021 (2021) 24. Yu, T., Zhang, R., Polozov, A., Meek, C., Awadallah, A.H.: SCore: pre-training for context representation in conversational semantic parsing. In: ICLP (2021) 25. Cai, Z., Li, X., Hui, B., Yang, M., Li, B., et al.: STAR: SQL guided pre-training for context-dependent text-to-SQL parsing. In: EMNLP, October 2022 (2022) 26. Qi, J., Tang, J., He, Z., et al.: RASAT: integrating relational structures into pretrained Seq2Seq model for text-to-SQL. In: EMNLP, pp. 3215–3229 (2022) 27. Lin, X.V., Socher, R., Xiong, C.: Bridging textual and tabular data for crossdomain text-to-SQL semantic parsing. In: EMNLP, pp. 4870–4888 (2020) 28. He, P., Mao, Y., Chakrabarti, K., Chen, W.: X-SQL: reinforce schema representation with context. arXiv arXiv:1908.08113 (2019) 29. Lyu, Q., Chakrabarti, K., Hathi, S., Kundu, S., Zhang, J., Chen, Z.: Hybrid ranking network for text-to-SQL. arXiv preprint arXiv:2008.04759 (2020) 30. Zhong, V., Lewis, M., Wang, S.I., Zettlemoyer, L.: Grounded adaptation for zeroshot executable semantic parsing. In: EMNLP, pp. 6869–6882 (2020) 31. Choi, D., Shin, M.C., Kim, E., Shin, D.R.: RYANSQL: recursively applying sketchbased slot fillings for complex text-to-SQL in cross-domain databases. In: CL (2021) 32. Guo, J., et al.: Towards complex text-to-SQL in cross-domain database with intermediate representation. In: ACL, pp. 4524–4535 (2019) 33. Brown, T., Mann, B., et al.: Language models are few-shot learners. In: NIPS, vol. 33, pp. 1877–1901 (2020) 34. Wei, J., et al.: Chain-of-thought prompting elicits reasoning in large language models. 
In: Advances in Neural Information Processing Systems, vol. 35, pp. 24824– 24837 (2022)


35. Guo, C., et al.: A case-based reasoning framework for adaptive prompting in crossdomain text-to-SQL. arXiv preprint arXiv:2304.13301 (2023) 36. Zhou, D., et al.: Least-to-most prompting enables complex reasoning in large language models. arXiv preprint arXiv:2205.10625 (2022) 37. Tian, Y., Li, T.J.J., Kummerfeld, J.K., Zhang, T.: Interactive text-to-SQL generation via editable step-by-step explanations. arXiv preprint arXiv:2305.07372 (2023) 38. Sun, R., et al.: SQL-PaLM: improved large language model adaptation for text-toSQL. arXiv arXiv:2306.00739 (2023) 39. Tian, Z., Bi, W., Li, X., Zhang, N.L.: Learning to abstract for memory-augmented conversational response generation. In: ACL, pp. 3816–3825 (2019) 40. Wen, Z., et al.: GRACE: gradient-guided controllable retrieval for augmenting attribute-based text generation. In: Findings of ACL 2023, pp. 8377–8398 (2023) 41. Reimers, N., Gurevych, I.: Sentence-BERT: sentence embeddings using Siamese BERT-networks. In: EMNLP-IJCNLP, pp. 3982–3992. ACL (2019). https:// aclanthology.org/D19-1410 42. Johnson, J., Douze, M., Jegou, H.: Billion-scale similarity search with GPUs. IEEE Trans. Big Data 7, 535–547 (2019) 43. Yu, T., Zhang, R., et al.: Spider: a large-scale human-labeled dataset for complex and cross-domain semantic parsing and text-to-SQL task. In: EMNLP, June 2019 (2019) 44. Gan, Y., Chen, X., Huang, Q., Purver, M., et al.: Towards robustness of text-toSQL models against synonym substitution. In: ACL, July 2021 (2021) 45. Gan, Y., Chen, X., Purver, M.: Exploring underexplored limitations of crossdomain text-to-SQL generalization. In: EMNLP, December 2021 (2021) 46. Zhong, R., Yu, T., Klein, D.: Semantic evaluation for text-to-SQL with distilled test suites. In: EMNLP, pp. 396–411 (2020) 47. Anil, R., et al.: PaLM 2 technical report. arXiv preprint arXiv:2305.10403 (2023)

Improving GNSS-R Sea Surface Wind Speed Retrieval from FY-3E Satellite Using Multi-task Learning and Physical Information Zhenxiong Zhou1 , Boheng Duan2(B) , and Kaijun Ren2(B) 1

School of Computer Science and Technology, National University of Defense Technology, Changsha, China [email protected] 2 School of Meteorology and Oceanography, National University of Defense Technology, Changsha, China {bhduan,renkaijun}@nudt.edu.cn

Abstract. Global Navigation Satellite System Reflectometry (GNSS-R) technology has great advantages over traditional satellite remote sensing detection of sea surface wind field in terms of cost and timeliness. It has attracted increasing attention and research from scholars around the world. This paper focuses on the Fengyun-3E (FY-3E) satellite, which carries the GNOS II sensor that can receive GNSS-R signals. We analyze the limitations of the conventional sea surface wind speed retrieval method and the existing deep learning model for this task, and propose a new sea surface wind speed retrieval model for FY-3E satellite based on a multi-task learning (MTL) network framework. The model uses the forecast product of Hurricane Weather Research and Forecasting (HWRF) model as the label, and inputs all the relevant information of Delay-Doppler Map (DDM) in the first-level product into the network for comprehensive learning. We also add wind direction, U wind and V wind physical information as constraints for the model. The model achieves good results in multiple evaluation metrics for retrieving sea surface wind speed. On the test set, the model achieves a Root Mean Square Error (RMSE) of 2.5 and a Mean Absolute Error (MAE) of 1.85. Compared with the second-level wind speed product data released by Fengyun Satellite official website in the same period, which has an RMSE of 3.37 and an MAE of 1.9, our model improves the performance by 52.74% and 8.65% respectively, and obtains a better distribution.

Keywords: FY-3E · HWRF · MTL · Wind speed retrieval

1 Introduction

GNSS-R technology is a method of detecting information about the Earth's


surface [1]. It does not require a signal transmitter, only a signal receiver. With the increasing number of GNSS satellites, GNSS-R technology has the advantages of global coverage, low cost and fast acquisition, and has been widely applied in various fields [4]. One of them is using GNSS-R technology to retrieve sea surface wind fields, which is an emerging technology [5]. In the United States, NASA launched the CYGNSS series of satellites in 2016 to monitor and warn of tropical cyclones, and released a series of sea surface wind speed retrieval products [16]; in China, the FY-3E satellite launched in 2021 also carried a GNSS-R receiver [20], and released the first and second level sea surface wind speed retrieval products in the following year [10]. Their methods of retrieving sea surface wind speed are based on a large amount of historical data, constructing empirical physical models, and obtaining the retrieved sea surface wind speed [19]. However, under normal observation, most of the weather conditions are stable, and the observed wind speeds are low. There is less data on high wind speeds under extreme weather conditions. Therefore, the results obtained by retrieving empirical physical models perform better at low wind speeds, but have larger errors at high wind speeds [15]. The accuracy of wind speed retrieval has an important impact on the observation and prediction of extreme weather, which causes huge economic losses every year. Therefore, improving the accuracy of GNSS-R sea surface wind speed retrieval has extremely important value. To tackle the above problem, this paper first uses the wind speed forecasted by the HWRF model as the label, which is also mentioned in the release of the CYGNSS second-level wind speed V3.0 version. We collected the HWRF model data for the past month, and matched it with the FY-3E data at the same time, and obtained a high wind data set. Based on the multi-task learning neural network model [7], we designed a sea surface wind field retrieval model for FY-3E. The model inputs all the information of DDM in all FY-3E firstlevel products as input data. In the purely data-driven neural network model, we added wind direction and other information as physical constraints. Besides the main task of wind speed, we set up three subtasks of U wind, V wind and wind direction. Through experiments and validation, compared with the FY3E second-level products, our model effectively improved the overall sea surface wind speed retrieval accuracy of FY-3E. This thesis makes three contributions: • Using U wind, V wind, wind direction and other physical information to constrain the neural network model and improve its performance. • Fully exploiting all the information of DDM in FY-3E first-level products for wind speed retrieval. • Improving the accuracy of sea surface wind speed retrieval compared with the traditional empirical physical function method, and using the multi-task learning method to retrieve the FY-3E first-level products. The rest of the paper is organized as follows: Sect. 2 reviews related work and limitations. Section 3 describes the data used as labels and input, introduces the model and defines the evaluation metrics. Section 4 provides detailed information on the experimental procedure and its results. Section 5 concludes with some remarks.

2 Related Work

Besides the empirical physical model function method used by the institutions to release products to retrieve sea surface wind speed, in recent years, many scholars have also started to use machine learning methods to retrieve GNSS-R sea surface wind speed. Jennifer et al. [14] used the ANN method to input several parameters in the empirical physical model function into the neural network for learning; Balasubramaniam et al. [2] added satellite latitude and longitude, geophysical information and other variables, and input them into the ANN neural network; Chu et al. [3] selected 33 variables from the product information to participate in the network model; Hammond et al. [7] used the first-level product DDM image obtained by satellite scanning as the input, and used the CNN neural network model; in addition, there are Munoz-Martin et al. [12,13,17,21], who introduced third-party data sources such as wave height, SMAP precipitation data, sea salt density, etc., as corrections added to the neural network. All these methods of using machine learning to retrieve high sea surface wind speed, although they have improved the retrieval accuracy of sea surface wind field to some extent, they all have some problems. The first problem: no matter what network model is used in all these papers, the data labels are either the ECMWF Reanalysis v5 (ERA5) [8] or the second Modern-Era Retrospective analysis for Research and Applications(MERR-2) [6] global historical reanalysis data. Due to the characteristics of the historical reanalysis data itself, it shows good accuracy when the sea surface wind speed is low. But in the case of higher wind speed, the reanalysis data will produce serious underestimation [11]. Therefore, all of the above papers are focused on studying the low wind speed sea surface wind field, so that the overall effect of the model has been greatly improved at low wind speed, but there is no specific research on higher wind speed. The second problem: due to the traditional physical empirical function method of retrieving sea surface wind speed, most of the designed neural networks output reference physical empirical function method to extract features from DDM image part features such as Leading Edge Slope (LES), Normalised Bistatic Radar Cross Section (NBRCS), DDM Average(DDMA) and results for fitting. These models also selectively screened the information in the first-level products when selecting and adding variables for input. For example, 33 pieces of information were selected from 119 pieces of information; only processed DDM was used and Raw DDM was directly ignored; only 5×3 DDMA area was extracted from 17×11 area; all the information of DDM in the first-level products was not fully utilized. The third problem: whether it is the traditional empirical physical formula method or the neural network method, only the final wind speed information is used. And the label and forecast products give both U wind and V wind, and can also calculate the wind direction. The above methods ignore the physical information implied therein.

3 Methodology

3.1 FY-3E and HWRF Data

The subject of this paper is the FY-3E satellite. Since the GNOS II sensor carried by FY-3E can receive GNSS-R signals, compared with the CYGNSS satellite that can only receive GPS signals [18], it can also receive signals from the Galileo and Beidou positioning satellites [9], so the data source is more abundant. The training data used in this paper is the first-level product of FY-3E. We obtained all data for May 2023, about 1.1 million records, through the website. The first-level product contains six elements: Channel, DDM, Receiver, Specular, Time and Transmitter. In addition to selecting all 31 variables of the DDM element, as shown in Table 1, we also added two spatial position-related elements, SP lon and SP lat in Specular, to participate in the network. Among these variables, except for Ddm raw data, which is a two-dimensional variable with dimension 122×20, and Ddm effective area, which is a two-dimensional variable with dimension 9×20, all other variables are one-dimensional.

Table 1. DDM variables

| No | Name                 | No | Name                  | No | Name                  | No | Name                |
| 1  | Ddm brcs factor      | 2  | Ddm doppler refer     | 3  | Ddm effective area    | 4  | Ddm kurtosis        |
| 5  | Ddm noise m          | 6  | Ddm noise raw         | 7  | Ddm noise source      | 8  | Ddm peak column     |
| 9  | Ddm peak delay       | 10 | Ddm peak doppler      | 11 | Ddm peak power ratio  | 12 | Ddm peak raw        |
| 13 | Ddm peak row         | 14 | Ddm peak snr          | 15 | Ddm power factor      | 16 | Ddm quality flag    |
| 17 | Ddm range refer      | 18 | Ddm raw data          | 19 | Ddm skewness          | 20 | Ddm sp column       |
| 21 | Ddm sp delay         | 22 | Ddm sp dles           | 23 | Ddm sp doppler        | 24 | Ddm sp les          |
| 25 | Ddm sp nbrcs         | 26 | Ddm sp normalized snr | 27 | Ddm sp raw            | 28 | Ddm sp reflectivity |
| 29 | Ddm sp row           | 30 | Ddm sp snr            | 31 | Sp delay doppler flag |    |                     |

The training labels use the forecast data of the HWRF model [18]. The HWRF (Hurricane Weather Research and Forecasting) model is a numerical model for predicting the track and intensity of tropical cyclones (hurricanes, typhoons). It was jointly developed by NOAA and the NRL of the United States to provide high-precision hurricane forecast information. We collected the HWRF forecasts for the whole month of May. Since the deviation from reality grows with the forecast lead time, and considering that the forecast interval of the HWRF model is 3 h while products are released every 6 h, we chose, from every 6-hour release, the analysis time (0 h) and the first forecast time (3 h). This preserves both the accuracy and the richness of the data, and finally yields about 700k valid records.


Since FY-3E and HWRF data are inconsistent in spatiotemporal continuity and resolution, the two data sets must be matched. When processing the FY-3E level-1 product, we found that its spatial and temporal distribution was uneven, so we gridded it. For spatial gridding we averaged all observations falling within a 0.25° range of latitude and longitude and used the mean as the value of that grid point. For temporal resolution, because the label data HWRF has a resolution of 1 h while the FY-3E level-1 product has a resolution of 1 s, we reduced the time resolution by window averaging: the mean of all points within half an hour before and after the whole hour is taken as the value at that hour. The specific procedure is shown in Algorithm 1. The first part grids the FY-3E DDM variables: following the spatiotemporal interval of HWRF, we round the latitude and longitude to 0.25° and assign each observation to the nearest whole hour within a half-hour window, then average all values of the same variable at the same location and time as the value of that grid cell. The second part matches the DDM variables and the HWRF wind values at the same spatiotemporal location. Since the HWRF wind is given as U wind and V wind, we convert it to the usual 10-m wind speed. In this way about 90k matched records were obtained and used as the training data set.
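To make the matching step concrete, a minimal Python sketch of this gridding-and-matching procedure is given here (see also Algorithm 1 below). It assumes NumPy arrays; the array names (ddm_lon, hwrf_u10, ...) are illustrative placeholders, not the variable names of the released products, and the HWRF fields are treated as flat point lists for simplicity.

```python
import numpy as np

def grid_ddm(ddm_lon, ddm_lat, ddm_time, ddm_vars):
    """Average DDM feature vectors (rows of ddm_vars) that fall into the same
    0.25 deg x 0.25 deg x 1 h cell."""
    lon_idx = np.round(ddm_lon / 0.25).astype(int)
    lat_idx = np.round(ddm_lat / 0.25).astype(int)
    hour_idx = np.round(ddm_time / 3600.0).astype(int)   # nearest whole hour (+/- 30 min window)

    cells = {}
    for i, key in enumerate(zip(lon_idx, lat_idx, hour_idx)):
        cells.setdefault(key, []).append(i)

    # one averaged feature vector per grid cell
    return {key: ddm_vars[idx].mean(axis=0) for key, idx in cells.items()}

def match_hwrf(cells, hwrf_lon, hwrf_lat, hwrf_hour, hwrf_u10, hwrf_v10):
    """Attach the 10 m wind speed sqrt(u^2 + v^2) from HWRF to every matched cell."""
    pairs = []
    for (lon_i, lat_i, hour_i), features in cells.items():
        sel = (np.round(hwrf_lon / 0.25).astype(int) == lon_i) & \
              (np.round(hwrf_lat / 0.25).astype(int) == lat_i) & \
              (hwrf_hour == hour_i)
        if sel.any():
            u, v = hwrf_u10[sel][0], hwrf_v10[sel][0]
            pairs.append((features, np.sqrt(u ** 2 + v ** 2)))
    return pairs
```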

3.2 Model Structure

The model takes as input the 31 DDM variables of the FY-3E level-1 product together with the latitude and longitude of the specular point. The design is data-driven and combined with physical information constraints, adding the wind direction and the horizontal and vertical wind components. Considering the characteristics of the task, the model is built on a multi-task learning architecture, yielding a GNSS-R sea surface wind speed retrieval model with physical information constraints. The model consists of three stages, as shown in Fig. 1. The first stage extracts features from the two-dimensional data Ddm_effective_area and Ddm_raw_data: the features of Ddm_effective_area are extracted by the shared convolutional network, while Ddm_raw_data, whose size is much larger, is convolved again to extract features separately. All features are then concatenated with the one-dimensional DDM features and the position to form the training features. The second stage is the shared network layer of the multi-task learning network, in which the training features are fed into an ANN. The third stage sets up a separate sub-task learning network for each task, with different layers for wind speed, wind direction, U wind and V wind, and combines them with different task weights so that the main task, sea surface wind speed retrieval, is obtained.
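A compact PyTorch sketch of this three-stage design is shown below. It is only an illustration of the structure described above; the layer sizes, channel counts and activations are assumptions, as they are not fixed in the text.

```python
import torch
import torch.nn as nn

class MTLWindNet(nn.Module):
    """Shared CNN/ANN trunk with four task heads: speed, direction, U wind, V wind."""
    def __init__(self, n_1d_feats=29 + 2):           # 29 scalar DDM variables + lon/lat (assumed)
        super().__init__()
        conv = lambda: nn.Sequential(
            nn.Conv2d(1, 8, 3, padding=1), nn.ReLU(),
            nn.AdaptiveAvgPool2d((4, 4)), nn.Flatten())   # -> 128 features per map
        self.conv_raw = conv()       # extra convolution for the large Ddm_raw_data map
        self.conv_area = conv()      # shared convolution for Ddm_effective_area
        self.shared = nn.Sequential( # shared ANN layer of the multi-task network
            nn.Linear(128 + 128 + n_1d_feats, 256), nn.ReLU(),
            nn.Linear(256, 128), nn.ReLU())
        head = lambda: nn.Sequential(nn.Linear(128, 64), nn.ReLU(), nn.Linear(64, 1))
        self.speed, self.direction, self.u, self.v = head(), head(), head(), head()

    def forward(self, ddm_raw, ddm_area, feats_1d):
        x = torch.cat([self.conv_raw(ddm_raw),
                       self.conv_area(ddm_area), feats_1d], dim=1)
        h = self.shared(x)
        return self.speed(h), self.direction(h), self.u(h), self.v(h)
```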

Algorithm 1. FY-3E DDM variables with HWRF wind speed matching

Input: longitude of HWRF wind speed hwrf_long, latitude of HWRF wind speed hwrf_lat, index of HWRF time hwrf_time_index, longitude of DDM ddm_long, latitude of DDM ddm_lat, time of DDM ddm_time, variables of DDM ddm_variables
1:  function DDM_Variable_Gridding(ddm_long, ddm_lat, ddm_time, ddm_variables)
2:    ddm_long = ddm_long / 0.25
3:    ddm_lat = ddm_lat / 0.25
4:    if ddm_time % 3600 > 1800 then
5:      ddm_time = ddm_time / 3600 + 1
6:    else
7:      ddm_time = ddm_time / 3600
8:    unique_data = { }
9:    for i, (lat_val, lon_val, time_val) in enumerate(zip(ddm_long, ddm_lat, ddm_time)):
10:     key = (lat_val, lon_val, time_val)
11:     if key not in unique_data then unique_data[key] = [i]
12:     else unique_data[key].append(i)
13:   for indices in unique_data.values():
14:     avg_variables = mean(ddm_variables[indices])
15:   return avg_variables
16: function DDM_Variables_Wind_Match(ddm_long, ddm_lat, ddm_time, avg_variables, hwrf_long, hwrf_lat, hwrf_time)
17:   hwrf_long = hwrf_long / 0.25
18:   hwrf_lat = hwrf_lat / 0.25
19:   if hwrf_long == ddm_long and hwrf_lat == ddm_lat and hwrf_time == ddm_time then
20:     u = HWRF_wind["u10"][time_index][lat_index][long_index]
21:     v = HWRF_wind["v10"][time_index][lat_index][long_index]
22:     match_wind = sqrt(u^2 + v^2)
23:   return match_wind

Fig. 1. Model structure

3.3 Metrics

To evaluate the training effect of the model, we use the Root Mean Square Error (RMSE), Mean Absolute Error (MAE), Mean Absolute Percentage Error (MAPE), Pearson's correlation coefficient (ρ) and the coefficient of determination (r²) as evaluation indicators of the retrieval accuracy. The specific formulas are as follows:

RMSE = \sqrt{\frac{1}{n}\sum_{i=1}^{n}\left(y_{hwrf(i)} - \hat{y}_{fy(i)}\right)^{2}}   (1)

MAE = \frac{1}{n}\sum_{i=1}^{n}\left|y_{hwrf(i)} - \hat{y}_{fy(i)}\right|   (2)

MAPE = \frac{1}{n}\sum_{i=1}^{n}\left|\frac{y_{hwrf(i)} - \hat{y}_{fy(i)}}{y_{hwrf(i)}}\right| \times 100\%   (3)

\rho = \frac{\sum_{i=1}^{n}\left(y_{hwrf(i)} - \bar{y}_{hwrf}\right)\left(y_{fy(i)} - \bar{y}_{fy}\right)}{\sqrt{\sum_{i=1}^{n}\left(y_{hwrf(i)} - \bar{y}_{hwrf}\right)^{2}\sum_{i=1}^{n}\left(y_{fy(i)} - \bar{y}_{fy}\right)^{2}}}   (4)

r^{2} = 1 - \frac{\sum_{i=1}^{n}\left(y_{hwrf(i)} - \hat{y}_{fy(i)}\right)^{2}}{\sum_{i=1}^{n}\left(y_{hwrf(i)} - \bar{y}_{fy}\right)^{2}}   (5)

where n is the number of samples, y_{hwrf(i)} is the label value, \hat{y}_{fy(i)} is the predicted value, \bar{y}_{hwrf} is the mean of the labels and \bar{y}_{fy} is the mean of the predictions. The smaller RMSE, MAE and MAPE are, the closer the model results are to the labels and the better the training effect; the larger ρ and r² are, the better the model fits the label data. When comparing with the FY-3E level-2 products, we also use P_{RMSE}, P_{MAE}, P_{MAPE} and P_{ρ} as evaluation indicators, which quantify the improvement of the new model over the traditional method on the sea surface wind speed retrieval problem:

P_{RMSE} = \frac{RMSE_{1} - RMSE_{2}}{RMSE_{1}} \times 100\%   (6)

P_{MAE} = \frac{MAE_{1} - MAE_{2}}{MAE_{1}} \times 100\%   (7)

P_{MAPE} = MAPE_{1} - MAPE_{2}   (8)

P_{\rho} = \frac{\rho_{2} - \rho_{1}}{\rho_{1}} \times 100\%   (9)
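For reference, a small NumPy sketch of Eqs. (1)-(5) follows; note that, as a simplification, r² is computed here with the conventional label-mean denominator.

```python
import numpy as np

def retrieval_metrics(y_hwrf, y_fy):
    """RMSE, MAE, MAPE (%), Pearson correlation and coefficient of determination."""
    y_hwrf, y_fy = np.asarray(y_hwrf, float), np.asarray(y_fy, float)
    err = y_hwrf - y_fy
    rmse = np.sqrt(np.mean(err ** 2))
    mae = np.mean(np.abs(err))
    mape = np.mean(np.abs(err / y_hwrf)) * 100.0
    rho = np.corrcoef(y_hwrf, y_fy)[0, 1]
    r2 = 1.0 - np.sum(err ** 2) / np.sum((y_hwrf - y_hwrf.mean()) ** 2)
    return rmse, mae, mape, rho, r2
```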

4 Experiment and Result

4.1 Data Preprocessing

In the experiment, the data first need to be cleaned. The satellite observations contain many missing values, so these NaN values are removed, together with unreasonable records such as wind speeds of -9999 or inf values; the corresponding labels are removed from the training set as well. When computing the loss function, note that the wind direction lies in 0-360°: if the loss is calculated directly, a direction difference larger than 180° does not reflect the actual angular error. We therefore design the wind direction loss to check whether the difference between two directions exceeds 180°; if so, the difference is replaced by 360° minus the difference, so that the angular error is always kept within 180°.
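A minimal sketch of this circular wind-direction loss is shown below; the exact form of the loss (e.g., L1 versus L2 on the wrapped difference) is not specified above, so an L1 form is assumed.

```python
import torch

def wind_direction_loss(pred_deg, target_deg):
    """Circular L1 loss: the angular error is always kept within 180 degrees."""
    diff = torch.abs(pred_deg - target_deg) % 360.0
    diff = torch.where(diff > 180.0, 360.0 - diff, diff)
    return diff.mean()
```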

4.2 Experiment

We split the processed data into training, validation and test sets with a ratio of 7:2:1. Before training we fix the random seed to ensure reproducibility, and after testing we set the learning rate to 0.001. The training set is then fed into the network model for training. When setting the training loss, we notice that the wind direction loss is numerically much larger than the wind speed loss; if they are simply added, the improvement on wind direction is not obvious. We therefore assign the wind direction loss a weight of 0.01 according to its magnitude. After adding U wind and V wind to the model, and considering that they are sub-tasks, we also set corresponding weights for them; after repeated tuning, the weights of U wind and V wind are set to 0.1.
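Combining the weights above, the total training objective can be sketched as follows. The use of MSE for the regression terms is an assumption, and wind_direction_loss refers to the sketch in Sect. 4.1.

```python
import torch.nn.functional as F

def total_loss(pred, target, w_dir=0.01, w_uv=0.1):
    """Weighted multi-task objective: wind speed is the main task."""
    speed, direction, u, v = pred
    loss = F.mse_loss(speed, target["speed"])
    loss = loss + w_dir * wind_direction_loss(direction, target["direction"])
    loss = loss + w_uv * (F.mse_loss(u, target["u"]) + F.mse_loss(v, target["v"]))
    return loss
```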

4.3 Ablation Experiment

To evaluate the performance of the model and the influence of each sub-task on the accuracy of the main wind speed task, we designed seven groups of control experiments. The training curves are shown in Fig. 2; the data in the figure come from the validation set and are used to observe the training process. The green line represents the total loss of the model, the blue line the loss on the training set, and the red line the loss of the main wind speed task. The model converges after about 70 training epochs. Comparing the training processes of the models, we find that adding any of the wind direction, U wind or V wind tasks, or adding several or all of them, improves the training of the network compared with the single wind speed retrieval task. Among them, the wind direction u v model with all three sub-tasks trains best, with the lowest training loss of 4.05 among all models.


Fig. 2. The training process of each model

In terms of effect evaluation, we keep each trained model and feed it the test set, obtaining the results in Table 2. The wind direction u v model performs best on all five metrics: RMSE, MAE, MAPE, ρ and r².

Table 2. Different models' results

Model                RMSE  MAE   MAPE   ρ     r²
wind                 2.81  2.08  52.60  0.76  0.45
wind direction       2.80  2.09  54.33  0.74  0.46
wind u               2.82  2.15  53.76  0.75  0.43
wind v               2.77  2.08  55.38  0.73  0.47
wind u v             3.00  2.32  60.71  0.72  0.35
wind direction u     2.57  1.89  52.14  0.75  0.54
wind direction v     2.66  1.99  51.54  0.77  0.51
wind direction u v   2.50  1.85  50.20  0.78  0.57

4.4 Result

Comparing the output of the wind direction u v model with the FY-3E level-2 sea surface wind speed product for the same period, the accuracy of the wind direction u v model is greatly improved, as shown in Table 3. RMSE is reduced from 3.37 to 2.50 (P_RMSE = 52.74%) and MAE from 1.9 to 1.85 (P_MAE = 8.65%), showing that the accuracy of the wind direction u v model is much better than that of the traditional method. MAPE increases from 35.97 to 50.20 (P_MAPE = 19.41%) and the correlation coefficient ρ increases from 0.64 to 0.78 (P_ρ = 17.95%), indicating that the retrievals of the wind direction u v model are more concentrated in distribution while the accuracy improves.


Table 3. Retrieval result compared to the FY-3E L2 product

Model                RMSE  MAE   MAPE   ρ
FY-3E L2             3.37  1.9   35.97  0.64
wind direction u v   2.50  1.85  50.20  0.78

Model                P_RMSE  P_MAE  P_MAPE  P_ρ
wind direction u v   52.74%  8.65%  19.41%  17.95%

Fig. 3. Scatter density plot of winds under 20 m/s

We plot the output of the wind direction u v model and the FY-3E level-2 sea surface wind speed product against the HWRF wind speed as scatter density plots, which show the advantages of the wind direction u v model over the traditional method more intuitively. Figure 3 shows that for wind speeds within 20 m/s the red fitted line of the wind direction u v retrieval lies closer to the diagonal. Figure 4 shows that for wind speeds above 20 m/s the FY-3E level-2 product deviates strongly: its maximum wind speed approaches 60 m/s, far from the HWRF wind speed, and both its distribution and accuracy are poor. In contrast, the maximum wind speed retrieved by the wind direction u v model stays below 30 m/s, its distribution is concentrated, and it fits the HWRF wind speed well. This shows that the wind direction u v model improves on the FY-3E level-2 sea surface wind speed product in both accuracy and distribution, at low as well as high wind speeds.


Fig. 4. Scatter density plot of all winds

5 Conclusion

This paper studies GNSS-R sea surface wind speed retrieval with FY-3E satellite data as the subject. Examining the traditional GNSS-R retrieval methods and the current machine learning methods, we find problems such as insufficient mining of the DDM information and neglect of the relevant physical information. We collect and match a whole month of HWRF data as labels, take all the DDM-related information in the FY-3E level-1 product as input, and use a multi-task learning model constrained by the physical information of wind direction, U wind and V wind. Compared with the FY-3E level-2 wind speed product, we obtain a better GNSS-R sea surface wind speed retrieval model. Beyond the expected results, we also have an additional finding: the wind direction sub-task reaches an accuracy within 20°, whereas traditional GNSS-R wind direction retrieval is generally above 20°. We speculate that wind speed may in turn help wind direction retrieval, which we plan to study further in the future.

Acknowledgement. The authors are grateful for the public FY-3E L1 data from the China National Satellite Meteorological Center and for the NOAA HWRF forecast products. (FY data available at: http://satellite.nsmc.org.cn/PortalSite/Data/Satellite.aspx; HWRF data available at: https://ftpprd.ncep.noaa.gov/data/nccf/com/hwrf)

References 1. Asgarimehr, M., Zavorotny, V., Wickert, J., Reich, S.: Can GNSS reflectometry detect precipitation over oceans? Geophys. Res. Lett. 45(22), 12–585 (2018) 2. Balasubramaniam, R., Ruf, C.: Neural network based quality control of CYGNSS wind retrieval. Remote. Sens. 12(17), 2859 (2020). https://doi.org/10.3390/ rs12172859


3. Chu, X., et al.: Multimodal deep learning for heterogeneous GNSS-R data fusion and ocean wind speed retrieval. IEEE J. Sel. Top. Appl. Earth Obs. Remote. Sens. 13, 5971–5981 (2020). https://doi.org/10.1109/JSTARS.2020.3010879 4. Egido, A., Delas, M., Garcia, M., Caparrini, M.: Non-space applications of GNSSR: from research to operational services. examples of water and land monitoring systems. In: IEEE International Geoscience & Remote Sensing Symposium, IGARSS 2009, 12–17 July 2009, University of Cape Town, Cape Town, South Africa, Proceedings, pp. 170–173. IEEE (2009). https://doi.org/10.1109/IGARSS. 2009.5418033 5. Foti, G., et al.: Spaceborne GNSS reflectometry for ocean winds: first results from the UK TechDemosat-1 mission. Geophys. Res. Lett. 42(13), 5435–5441 (2015) 6. Gelaro, R., et al.: The modern-era retrospective analysis for research and applications, version 2 (MERRA-2). J. Clim. 30(14), 5419–5454 (2017) 7. Hammond, M.L., Foti, G., Gommenginger, C., Srokosz, M.: An assessment of CyGNSS v3.0 level 1 observables over the ocean. Remote. Sens. 13(17), 3500 (2021). https://doi.org/10.3390/rs13173500 8. Hersbach, H., et al.: The ERA5 global reanalysis. Q. J. R. Meteorol. Soc. 146(730), 1999–2049 (2020) 9. Huang, F., et al.: Characterization and calibration of spaceborne GNSS-R observations over the ocean from different beidou satellite types. IEEE Trans. Geosci. Remote. Sens. 60, 1–11 (2022). https://doi.org/10.1109/TGRS.2022.3224844 10. Huang, F., et al.: Assessment of FY-3E GNOS-II GNSS-R global wind product. IEEE J. Sel. Top. Appl. Earth Obs. Remote. Sens. 15, 7899–7912 (2022). https:// doi.org/10.1109/JSTARS.2022.3205331 11. Li, X., Qin, X., Yang, J., Zhang, Y.: Evaluation of ERA5, ERA-interim, JRA55 and MERRA2 reanalysis precipitation datasets over the Poyang lake basin in China. Int. J. Climatol. 42(16), 10435–10450 (2022) 12. Mu˜ noz-Mart´ın, J.F., Camps, A.: Sea surface salinity and wind speed retrievals using GNSS-R and l-band microwave radiometry data from FMPL-2 onboard the FSSCat mission. Remote. Sens. 13(16), 3224 (2021). https://doi.org/10.3390/ rs13163224 13. Pascual, D., Clarizia, M.P., Ruf, C.S.: Improved CYGNSS wind speed retrieval using significant wave height correction. Remote. Sens. 13(21), 4313 (2021). https://doi.org/10.3390/rs13214313 14. Reynolds, J., Clarizia, M.P., Santi, E.: Wind speed estimation from CYGNSS using artificial neural networks. IEEE J. Sel. Top. Appl. Earth Obs. Remote. Sens. 13, 708–716 (2020). https://doi.org/10.1109/JSTARS.2020.2968156 15. Ricciardulli, L., Mears, C.A., Manaster, A., Meissner, T.: Assessment of CYGNSS wind speed retrievals in tropical cyclones. Remote. Sens. 13(24), 5110 (2021). https://doi.org/10.3390/rs13245110 16. Ruf, C., et al.: CYGNSS Handbook (2022) 17. Wang, C., Yu, K., Qu, F., Bu, J., Han, S., Zhang, K.: Spaceborne GNSS-R wind speed retrieval using machine learning methods. Remote. Sens. 14(14), 3507 (2022). https://doi.org/10.3390/rs14143507 18. Warnock, A.M., Ruf, C.S., Morris, M.: Storm surge prediction with CYGNSS winds. In: 2017 IEEE International Geoscience and Remote Sensing Symposium, IGARSS 2017, Fort Worth, TX, USA, 23–28 July 2017, pp. 2975–2978. IEEE (2017). https://doi.org/10.1109/IGARSS.2017.8127624 19. Wu, J., et al.: Sea surface wind speed retrieval based on empirical orthogonal function analysis using 2019-2020 CYGNSS data. IEEE Trans. Geosci. Remote. Sens. 60, 1–13 (2022). https://doi.org/10.1109/TGRS.2022.3169832


20. Yang, G., et al.: FY3E GNOS II GNSS reflectometry: mission review and first results. Remote. Sens. 14(4), 988 (2022). https://doi.org/10.3390/rs14040988 21. Yueh, S., Chaubell, J.: Sea surface salinity and wind retrieval using combined passive and active l-band microwave observations. IEEE Trans. Geosci. Remote. Sens. 50(4), 1022–1032 (2012). https://doi.org/10.1109/TGRS.2011.2165075

Incorporating Syntactic Cognitive in Multi-granularity Data Augmentation for Chinese Grammatical Error Correction Jingbo Sun1 , Weiming Peng2,3 , Zhiping Xu4 , Shaodong Wang1 , Tianbao Song5(B) , and Jihua Song1 1

School of Artificial Intelligence, Beijing Normal University, Beijing 100875, China {sunjingbo,202021210043}@mail.bnu.edu.cn, [email protected] 2 Chinese Character Research and Application Laboratory, Beijing Normal University, Beijing 100875, China [email protected] 3 Linguistic Data Consortium, University of Pennsylvania, Philadelphia 19104, USA 4 China Mobile (Hang Zhou) Information Technology Co., Ltd., Hangzhou 310000, China 5 School of Computer Science and Engineering, Beijing Technology and Business University, Beijing 100048, China [email protected]

Abstract. Chinese grammatical error correction (CGEC) has recently attracted a lot of attention due to its real-world value. The current mainstream approaches are all data-driven, but the following flaws remain. First, high-quality training data containing complex and diverse errors is scarce, and data-driven approaches therefore often fail to improve performance significantly. Second, existing data augmentation methods for CGEC mainly focus on word-level augmentation and ignore syntactic-level information. Third, current data augmentation methods are strongly randomized, and few of them fit the cognition pattern of students on syntactic errors. In this paper, we propose a novel multi-granularity data augmentation method for CGEC. We construct a syntactic error knowledge base for the error type Missing and Redundant Components, and syntactic conversion rules for the error type Improper Word Order, based on a finely labeled syntactic structure treebank. Additionally, we compile a knowledge base of character and word errors from real student essays. A data augmentation algorithm incorporating character, word and syntactic noise is then designed to build the training set. Extensive experiments show an F0.5 of 36.77% on the test set, a 6.2% improvement over the best model in the NLPCC Shared Task, proving the validity of our method.

Keywords: Grammatical error correction · Data augmentation · Multi-granularity knowledge base · Sentence structure grammar


1 Introduction

Chinese Grammatical Error Correction (CGEC) is a research hotspot in natural language processing (NLP) [6,7,12,16,17]. Its goal is to identify and correct all grammatical errors present in a text, such as Spelling Errors, Missing and Redundant Components, Grammatical Structural Confusion and Improper Word Order [7], providing a practical solution for improving proofreading efficiency, reducing content risk and aiding language teaching. Recently, many studies have tried to optimize CGEC with neural networks, and the current best solutions are primarily data-driven neural models such as the Transformer [11]. However, the existing models still have the following problems. (1) Text containing complex and diverse errors in real-world scenarios is difficult to obtain and insufficient, so data-driven methods frequently produce poor results due to a lack of data. (2) To address the shortage of real error samples, researchers have tried to enlarge the training set through data augmentation, but existing methods focus mainly on word-level augmentation and ignore syntactic-level information. Figure 1 shows that the types of syntactic errors are diverse, which makes syntactic data augmentation difficult. (3) Current data augmentation methods are strongly randomized: the generated samples are often unreasonable and differ significantly from real errors, and few of them fit students' cognition of grammatical errors. In the literature [12], for example, the original sentence is "列车为前往机场的乘客提供了行李架(The train provided luggage racks for passengers going to the airport)", and the noised sentence is "列车本应前往机场的朝兴乘客猫爪草了行李架(The train should have gone to the airport for the Chaoxing passengers catclawed the luggage racks)", leaving a large gap between the faked sample and real scenarios.

Fig. 1. Top five statistical results of syntactic errors in the HSK dynamic composition corpus. The most frequent error type in red is “Improper Word Order ” (Color figure online).


To alleviate the above problems and integrate students' cognition into CGEC training, we propose a multi-granularity data augmentation algorithm based on the pattern of real error cognition, incorporating character, word and syntactic noise. First, to make the augmented samples more realistic, we summarize error information from a real corpus and construct a knowledge base of lexical and syntactic errors. Meanwhile, to inject syntactic noise, we develop a linguistic rule-based method for Improper Word Order, which accounts for the majority of errors in the corpus: based on the annotated sentence pattern structure treebank, correct sentences are converted into sentences containing errors, which also improves the diversity and authenticity of the syntactic errors in the training data. To address the issue of insufficient data, we design a data augmentation method that incorporates multi-granularity noise. Because language cognition cannot be separated from the real context [4], the knowledge base is derived from real student essays; the faked samples remain readable and conform to students' error-generating habits, so the training data built on this basis fit students' cognition and provide interpretability for CGEC models. Finally, a Transformer-based CGEC model is trained on the generated samples.

2 Related Work

2.1 Chinese Grammatical Error Correction

The main frameworks of established CGEC models include seq2seq [7,10,12] (similar to neural machine translation), seq2edit [3], seq2action [5] and non-autoregressive models [6]. Within the seq2seq framework, the Transformer [12,17] is now the primary model for CGEC, in addition to CNN [9] and RNN [18]. Numerous recent studies enhance CGEC by improving the quality of training samples through data augmentation. Zhao and Wang [17] propose the MaskGEC model, which improves CGEC by dynamically adding random masked tokens to the source sentences during training, increasing the diversity of training samples. Wang et al. [12] improve the training data with an augmentation technique that corrupts a monolingual corpus to produce parallel data. Tang et al. [10] provide a data augmentation technique that includes word- and character-granularity errors. However, they all ignore syntactic information in data augmentation.

2.2 Chinese Sentence Pattern Structure Treebank

A treebank is a meticulously annotated corpus that provides details of each sentence such as part-of-speech and syntactic structure. The treebank types widely used in NLP are phrase-structure treebanks [14] and dependency-structure treebanks [2]. In this paper, we employ a Sentence Pattern Structure Treebank (SPST) based on sentence-based grammar, which uses the hierarchical diagrammatic syntactic analysis method to parse the structure of complex sentences and is suitable for Chinese grammar teaching and students' cognition [8,15]. The SPST contains the diagrammatic scheme, an XML storage structure and the sentence pattern structure expression (SPSE); an example is shown in Fig. 2. The application of SPST in teaching Chinese as a second language demonstrates its applicability and effectiveness [8], and syntactic error sentence generation is implemented in Sect. 3.2 based on SPSE.

Fig. 2. An example and explanation of Chinese SPST.

3 Proposed Method

3.1 Materials

In this paper there are three granularities of noise: character, word and syntactic, and an error knowledge base is designed for each granularity.

Lexical Error Knowledge Base. In real-life writing, character and word errors are the most frequent, because many Chinese characters and words have similar sounds, forms or meanings. Based on the analysis of existing error instances, high-frequency error samples are collected from the HSK dynamic composition corpus; the extraction results are displayed in Table 1. Among them, the frequencies of char and word errors are 22,116 and 7,612, respectively.

Syntactic Error Knowledge Base. For Missing and Redundant Components, we extract classical sentence patterns according to the Chinese second language syllabus and design regular expressions to identify them in raw text; e.g., the regular expression for the concessive complex sentence "虽然(although)...但是(but)..." is "虽然[\u4e00-\u9fa5]+[,;]但是". Our syntactic knowledge base contains 410 records.
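The sketch below illustrates one entry of such a regex-based knowledge base; the dictionary structure and the deletion operation are hypothetical and only demonstrate how a missing-component error could be fabricated from the concessive pattern.

```python
import re

# One (assumed) knowledge-base entry for the concessive pattern "虽然...但是...":
# a regular expression to locate the pattern, plus a deletion operation that
# drops "但是" to fabricate a "Missing Component" error.
RULE = {
    "regex": re.compile(r"虽然[\u4e00-\u9fa5]+[,;]但是"),
    "delete": lambda s: s.replace("但是", "", 1),
}

sentence = "虽然下雨,但是我们还是去了公园。"
if RULE["regex"].search(sentence):
    noisy = RULE["delete"](sentence)   # -> "虽然下雨,我们还是去了公园。"
```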


Table 1. Examples of high-frequency errors from the HSK dynamic composition corpus

Wrong char  Correct char  Frequency    Wrong word  Correct word  Frequency
作          做            442          而          而且          178
象          像            334          发生        产生          105
咽          烟            254          决解        解决          101
建          健            248          经验        经历          96
份          分            205          权力        权利          89

Table 2. Examples of syntactic conversion rules

① Subject exchange with object of prepositional structure
   Input: 我对历史很感兴趣 (I am very interested in history)
   SPSE: 我[对∧历史][很]感│兴趣
   ① Output: 历史对我很感兴趣

② Subject + tense verb + epithet: subject postposition
   Input: 我跟你不一样 (I am not like you)
   SPSE: 我[跟∧你][不]一样
   ② Output: 跟你不一样我

③ Subject + auxiliary verb + verb → auxiliary verb + subject + verb
   Input: 我想当翻译 (I want to be a translator)
   SPSE: 我想∶当│翻译
   ③ Output: 想我当翻译

④ Converting the locative to a complement; ⑤ Prepositional-object structure as adverbial: put at the end; ⑥ Complement placed before the predicate
   Input: 我在外交部工作十五年 (I have worked in the Ministry of Foreign Affairs for 15 years)
   SPSE: 我[在∧外交部]工作十五年
   ④ Output: 我工作在外交部十五年
   ⑤ Output: 我工作十五年在外交部
   ⑥ Output: 我在外交部十五年工作

3.2 Syntactic Error Sentence Generation

For Improper Word Order, we design a syntactic error sentence generation method that produces error sentences directly from the SPSE using regular expressions. For example, for the input sentence "我对历史很感兴趣(I am very interested in history)", the SPSE is "我[对∧历史][很]感│兴趣". Swapping the positions of the two adverbials gives the wrongly ordered sentence "我[很][对∧历史]感│兴趣(I very am interested in history)"; deleting "我(I)" gives the subject-missing sentence "[对∧历史][很]感│兴趣(am very interested in history)". We design 14 types of such conversion rules, which generate 21,627 error sentences. This generated data is used directly for training, without additional noise. Table 2 shows some of the conversion rules and examples.
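The adverbial-swap case can be illustrated with a short Python sketch; the real system applies 14 hand-crafted rules over full SPSE annotations, so this regex-based function is only a simplified example.

```python
import re

def swap_first_two_adverbials(spse):
    """Generate an 'Improper Word Order' sample by swapping the first two
    bracketed adverbial chunks in a sentence pattern structure expression."""
    chunks = list(re.finditer(r"\[[^\]]*\]", spse))
    if len(chunks) < 2:
        return None
    a, b = chunks[0], chunks[1]
    return (spse[:a.start()] + b.group() + spse[a.end():b.start()]
            + a.group() + spse[b.end():])

print(swap_first_two_adverbials("我[对∧历史][很]感│兴趣"))
# -> 我[很][对∧历史]感│兴趣
```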

3.3 Multi-granularity Data Augmentation

In the following, the materials from Sect. 3.1 are used to add noise to raw text for data augmentation; the multi-granularity procedure is described in Algorithm 1. Compared with existing character- and word-noise studies [10,12], syntactic-level noise is added, and its source is the multi-granularity knowledge base rather than random noise. Unlike character and word augmentation, syntactic augmentation requires analyzing the syntactic knowledge in a sentence and then operating on specific targets. Considering this specificity, the input sentences undergo syntactic, word and character noise addition in order, according to different ratios, to produce the final noised sentences. The noise addition ratios are controlled by the parameters αsyn, αword and αchar, and each noise addition is implemented by insertion, deletion and replacement. The proposed method is illustrated in Fig. 3.

Algorithm 1. Syntactic data augmentation algorithm

Input: the original correct sentence S_clean; syntactic knowledge base K_syn; the length N of K_syn; the regular expressions REG_syn, insertion operations I, deletion operations D and replacement operations R in K_syn.
Output: the sentence S_noise after adding noise.
1:  function SYNDA(S_clean, K_syn)
2:    for i = 1 → N do
3:      if Match(S_clean, REG_syn^i) then
4:        S_noise ← execute R^i or D^i
5:      end if
6:    end for
7:    if S_noise is null then
8:      select K_syn^j and position p < len(S_clean) at random
9:      S_noise ← insert I^j
10:   end if
11: end function
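A loose Python rendering of Algorithm 1 is given below; the rule fields (regex, replace, delete, insert) are assumed names for the knowledge-base entries, not the authors' implementation.

```python
import random

def syn_da(s_clean, k_syn):
    """Sketch of Algorithm 1: k_syn is a list of rules, each holding a compiled
    regular expression, callable replace/delete operations and an insertion string."""
    s_noise = None
    for rule in k_syn:                          # try every rule in the knowledge base
        if rule["regex"].search(s_clean):
            op = random.choice([rule["replace"], rule["delete"]])
            s_noise = op(s_clean)               # replacement or deletion on a match
    if s_noise is None:                         # no rule matched: fall back to insertion
        rule = random.choice(k_syn)
        pos = random.randrange(len(s_clean))
        s_noise = s_clean[:pos] + rule["insert"] + s_clean[pos:]
    return s_noise
```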

3.4 Transformer for CGEC

Transformer [11] framework is adopted in previous works [10,17]. The proposed model is depicted in Fig. 3, where the encoder transforms the input text into a semantic vector, the decoder transforms the output vector of the current step based on the output encoded in the previous step, each vector corresponds to a Chinese character, and all the output characters are combined to produce the corrected sentence. The encoder consists of N = 6 identical blocks, each with a Multi-Head Attention (MHA) layer and a Feed-Forward Network (FFN) layer. The MHA mechanism employs a scaled dot-product in each attention layer, where Q, K and V denote the query matrix, key matrix, and value matrix of


Fig. 3. An illustration of our method. In WordNoise sentence, “历史” (history) and “理 事” (council member) are pronounced “lishi” in Chinese. In CharNoise sentence, the word “兴趣” (interested) is deleted to a single character “趣”.

the attention layer, respectively, which are obtained by passing the input vector through different linear layers. The MHA calculation procedure is as follows:

\mathrm{Attention}(Q, K, V) = \mathrm{Softmax}\!\left(\frac{QK^{T}}{\sqrt{d_k}}\right)V,   (1)

\mathrm{MHA}(Q, K, V) = (h_1 \oplus h_2 \oplus \dots \oplus h_n)\,W^{O}, \quad h_i = \mathrm{Attention}(Q_i, K_i, V_i),   (2)

where dk is the embedding dimension, ⊕ means concatenation, hi denotes the ith head of MHA and n is the number of heads.
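For illustration, a compact PyTorch sketch of Eqs. (1)-(2) follows; in practice one would typically use torch.nn.MultiheadAttention, and the projection weights are passed in as plain tensors here for brevity.

```python
import math
import torch

def scaled_dot_product_attention(q, k, v):
    scores = q @ k.transpose(-2, -1) / math.sqrt(q.size(-1))   # QK^T / sqrt(d_k)
    return torch.softmax(scores, dim=-1) @ v

def multi_head_attention(x, n_heads, w_q, w_k, w_v, w_o):
    """Minimal MHA matching Eqs. (1)-(2): split into heads, attend, concatenate, project."""
    b, t, d = x.shape
    def split(w):
        return (x @ w).view(b, t, n_heads, d // n_heads).transpose(1, 2)
    q, k, v = split(w_q), split(w_k), split(w_v)
    heads = scaled_dot_product_attention(q, k, v)      # (b, n_heads, t, d / n_heads)
    return heads.transpose(1, 2).reshape(b, t, d) @ w_o
```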

4 Experiments and Results

4.1 Datasets

This section describes the data used in the experiments. (a) The NLPCC dataset (http://tcci.ccf.org.cn/conference/2018/dldoc/trainingdata02.tar.gz), presented in NLPCC 2018 Shared Task 2, is the most commonly used CGEC benchmark in recent years; it contains 1,220,734 training sentence pairs and a 2,000-sentence test set. (b) The HSK dynamic composition corpus (http://hsk.blcu.edu.cn/) collects essay answer sheets of the Hanyu Shuiping Kaoshi (HSK, the Chinese Proficiency Test) and contains various grammatical errors; 156,870 publicly available sentences are used for training. (c) The Chinese SPST (http://www.jubenwei.com/) is a high-quality, finely labeled corpus

with syntactic information; the error correction sentence pairs for training are obtained with the sentence conversion rules proposed in Sect. 3.2. The corpus comes from Chinese second-language teaching materials, totaling 12,103 sentences, from which 21,627 sentence pairs are generated for training. (d) AI Challenger (https://challenger.ai/datasets/translation) is a large-scale corpus containing 10,051,898 sentences; all of them are noised with the data augmentation method of Sect. 3.3 to obtain training sentence pairs. The following experiments select one million, two million, four million and six million pairs to examine the influence of the noised-data scale on model performance.

4.2 Parameters and Metrics

The implementation of our model is based on fairseq (https://github.com/pytorch/fairseq), using a Tesla V100-SXM2 32 GB GPU. The hyperparameters are set as follows: word embedding dimension 512, with weights shared between source and target; the Adam optimizer with (β1, β2) = (0.9, 0.98); an initial learning rate of 1×10⁻⁷ that increases linearly to 1×10⁻³ over the first 4,000 training steps and then decays gradually until training is completed; dropout 0.3; batch size 128; and a beam size of 12 during decoding. As evaluation metrics, Precision (P), Recall (R) and F0.5 under the Max Match (M²) criterion are employed, which measure the overlap between the system's edit set and the gold edit set:

P = \frac{\sum_{i=1}^{N} |e_i \cap g_i|}{\sum_{i=1}^{N} |e_i|}, \quad R = \frac{\sum_{i=1}^{N} |e_i \cap g_i|}{\sum_{i=1}^{N} |g_i|}, \quad F_{0.5} = \frac{(1 + 0.5^{2}) \times R \times P}{0.5^{2} \times P + R},   (3)

(4)

Baselines

The first three are the CGEC models that performed well in NLPCC 2018 Shared Task 2, and the next four are the CGEC models that used data augmentation, the last one is a ensemble model for CGEC. (a) YouDao [1]. This method modifies the sentences separately using five different hybrid models. Finally, the language model reorders the output. (b) AliGM [18]. This method combines NMT-based GEC, SMT-based GEC, and rule-based GEC. (c) BLCU [9]. This 4 5 6

https://challenger.ai/datasets/translation. https://github.com/pytorch/fairseq. https://github.com/nusnlp/m2scorer.

378

J. Sun et al.

method employs a multi-layer convolutional seq2seq model. (d) MaskGEC [17]. This method improves CGEC by dynamically adding random masked tokens to source sentences during training, increasing the diversity of training samples. (e) Word&Char-CGEC [10]. This method proposes a data augmentation method that integrates word granularity noise for CGEC. (f ) C-Transformer [13]. This method introduces a copy mechanism and constructs error samples to expand the training data based on a vocabulary of homomorphic and homophonic words. (g) E-Transformer [12]. This method adopts data augmentation by corrupting a monolingual corpus to produce parallel data. (h) HRG [3]. This method consists of a Seq2Seq model, a Seq2edit model, and an error detector. 4.4

Main Results

The experimental results are shown in Table 3, the F0.5 of our model achieves 36.77%, which is a considerable improvement when compared to the top three baselines in the NLPCC 2018 Task 2. Compared to the C-Transformer and E-Transformer, the improvement is 2.72% and 2.36%, respectively. Our approach produces comparable results when compared to the Word&Char-CGEC and MaskGEC. Compared with HRG, our method improves 9.47% in precision and 2.21% in F0.5 . This demonstrates that the CGEC model is competitive when compared to the CGEC model that incorporates data augmentation. The experimental findings provided above show that applying data augmentation can increase the performance of the CGEC model, and our method is also an effective data augmentation scheme. The discrepancy between the MaskGEC and the Word&Char-CGEC is mostly due to the simple reason that the corpus applied for data augmentation in experiments differs from each of these two models, and the change in data sources will have some impact during training. Table 3. Experimental result. The results are in percentage.

4.5

Line Model

P ↑

1 2 3

AliGM YouDao BLCU

41.00 13.75 29.36 35.24 18.64 29.91 47.63 12.56 30.57

R↑

4 5 6 7

MaskGEC Word&Char-CGEC C-Transformer E-Transformer

44.36 47.29 38.22 39.43

8

HRG

36.79 27.82 34.56

9

ours

46.26 20.20 36.77

22.18 23.89 23.72 22.80

F0.5 ↑

36.97 39.49 34.05 34.41

Effect of Data Augmentation at Different Scales

To investigate the effect of the scale of noised data on model performance, five groups of tests with no noise data and noise data of one million, two million, four

Incorporating Syntactic Cognitive in Multi-granularity Data Augmentation

379

million, and six million are set. The experimental results displayed in Fig. 4 reveal that, when compared to the model without noise addition, the performance of the model with noise-added data improves by about 3%, with the best result obtained when the noise-added data reaches 2 million, and the improvement of the model performance decreases as the data size grows further. We also attempt to train more epochs on four and six million, and the experimental findings indicate that the F0.5 of the model does not change much at 40 and 50 epochs compared to 30 epochs, showing that the model stabilized at 30 epochs. There are two reasons for the ineffectiveness of the four million and six million data. The first is that the monolingual corpus chosen in this paper contains spoken language, which does not conform to the characteristics of written language and deviates from the textual error correction field; the second is that, despite manual proofreading, the quality and distribution of the data require further improvement. Other comparative tests are carried out in this work using the 2 million noised data.

Fig. 4. Effect of data augmentation at different scales.

4.6

Effect of Data Augmentation at Different Granularities

We mainly test the usefulness of the proposed data augmentation strategy. The noise addition ratios of different granularities for the experiments are determined using statistics and analysis of the HSK dynamic composition corpus, as stated in the second column of Table 4. The Char&Word&Syntactic model has the best result, demonstrating that the data augmentation strategy of including multi-granularity noise is beneficial. When compared to models with only single noise-added, the model with character noise performs best, due to the fact that character errors are the most common in writing, and models with only word noise and syntactic noise cannot fit this error distribution. When comparing the models of char&word, char&syntactic, and word&syntactic, we discover that the model with char-granularity noise is the best for the same reason as stated above. When comparing the models of char&word and char&syntactic, the model with syntactic noise is more effective because the syntax contains

380

J. Sun et al.

not only information on syntactic structure but also information about words, which implies that the addition of noise with syntax proposed in this paper is superior to the previous data augmentation methods for CGEC. Furthermore, the process that generates word noise requires the splitting of sentences, and the correctness of the splitting affects the model effort. The syntactic knowledge base built in this paper currently comprises more than 400 rules, which is insufficient to match the model training requirements, indicating that the syntactic noise addition method suggested in this paper has room for further improvement. Table 4. Effect of data augmentation at different granularities. The results are in percentage.

4.7

Model

Noise ratio P ↑

R↑

F0.5 ↑ ΔF0.5

w/o noised data char word syntactic char&word char&syntactic word&syntactic char&word&syntactic

– – – – 7:3 8:2 7:3 6 : 2.5 : 1.5

17.55 20.29 16.38 14.45 18.34 17.55 16.02 20.2

32.96 36.36 32.08 29.01 34.67 35.68 31.19 36.77

42.23 45.34 42.2 38.77 44.61 48.1 40.85 46.26

– +3.4 –0.88 –3.95 +1.71 +2.72 –1.77 +3.81

Effect of Chinese Sentence Pattern Structure Treebank

In this section, two sets of comparison experiments are carried out, as indicated in Table 5, in order to test the enhanced effect of the Chinese SPST on our method. The F0.5 is enhanced by 0.21% when the training samples created from the Chinese SPST are added on top of the NLPCC and HSK datasets. The F0.5 value improves by 2.62% compared with the model in line 1, but is slightly lower than that of model 3 when the samples generated from the Chinese SPST are added to the NLPCC and HSK datasets and the 2 million noised data. The experimental results show that the model with the samples generated by the Chinese SPST is effective and syntactic cognitive information fusion can improve the ability of CGEC, but the improvement is not obvious, owing to the fact that there are only 21,627 training pairs in this corpus, which is insufficient in comparison to the existing data set for the model to learn enough syntactic knowledge from it.

Incorporating Syntactic Cognitive in Multi-granularity Data Augmentation

381

Table 5. Effect of chinese SPST. The results are in percentage.

5

Line Training data

P ↑

R↑

F0.5 ↑ ΔF0.5

1 2 3 4

42.23 42.63 46.26 45.24

17.55 17.57 20.2 19.15

32.96 33.17 36.77 35.55

NLPCC+HSK NLPCC+HSK+SPST NLPCC+HSK+2 million noised data NLPCC+HSK+SPST+2 million noised data

– +0.21 +3.81 +2.62

Conclusion

In this paper, we propose a novel multi-granularity data augmentation method for CGEC. We build a multi-granularity knowledge base to fit the cognition pattern of students on syntactic errors, with 7,612 pieces of character error knowledge, 23,816 pieces of word error knowledge, and 426 pieces of syntactic error knowledge. And 21,627 samples are generated based on Chinese SPST. The experimental results prove effectiveness and show that syntactic cognitive information fusion and the ability to enhance the model. In the future, we will investigate the influence of different noise addition ratios on the model further. In addition, to further the use of Chinese SPST, we will target new error kinds for sample generation and explore expanding the finely annotated corpus. Acknowledgments. This work was supported by the Beijing Natural Science Foundation (Grant No.4234081), the National Natural Science Foundation of China (Grant No.62007004), the Major Program of Key Research Base of Humanities and Social Sciences of the Ministry of Education of China (Grant No.22JJD740017) and the Scientific and Technological Project of Henan Province of China (Grant No.232102210077).

References 1. Fu, K., Huang, J., Duan, Y.: Youdao’s winning solution to the NLPCC-2018 task 2 challenge: a neural machine translation approach to Chinese grammatical error correction. In: Zhang, M., Ng, V., Zhao, D., Li, S., Zan, H. (eds.) NLPCC 2018. LNCS (LNAI), vol. 11108, pp. 341–350. Springer, Cham (2018). https://doi.org/ 10.1007/978-3-319-99495-6_29 2. He, W., Wang, H., Guo, Y., Liu, T.: Dependency based Chinese sentence realization. In: Proceedings of ACL-AFNLP, pp. 809–816 (2009) 3. Hinson, C., Huang, H.H., Chen, H.H.: Heterogeneous recycle generation for Chinese grammatical error correction. In: Proceedings of COLING, pp. 2191–2201 (2020) 4. Kasper, G., Roever, C.: Pragmatics in Second Language Learning. Handbook of Research in Second Language Teaching and Learning, pp. 317–334 (2005) 5. Li, J., et al.: Sequence-to-action: grammatical error correction with action guided sequence generation. In: Proceedings of AAAI. vol. 36, pp. 10974–10982 (2022) 6. Li, P., Shi, S.: Tail-to-tail non-autoregressive sequence prediction for Chinese grammatical error correction. In: Proceedings of ACL, pp. 4973–4984 (2021) 7. Ma, S., et al.: Linguistic rules-based corpus generation for native Chinese grammatical error correction. In: Findings of EMNLP, pp. 576–589 (2022)

382

J. Sun et al.

8. Peng, W., Wei, Z., Song, J., Yu, S., Sui, Z.: Formalized Chinese sentence pattern structure and its hierarchical analysis. In: Proceedings of CLSW, pp. 286–298 (2022) 9. Ren, H., Yang, L., Xun, E.: A sequence to sequence learning for Chinese grammatical error correction. In: Proceedings of NLPCC, pp. 401–410 (2018) 10. Tang, Z., Ji, Y., Zhao, Y., Li, J.: Chinese grammatical error correction enhanced by data augmentation from word and character levels. In: Proceedings of CCL, pp. 13–15 (2021) 11. Vaswani, A., et al.: Attention is all you need. In: Proceedings of NeurIPS. vol. 30, pp. 5998–6008 (2017) 12. Wang, C., Yang, L., Wang, y., Du, y., Yang, E.: Chinese grammatical error correction method based on transformer enhanced architecture. J. Chin. Inf. Process. 34(6), 106–114 (2020) 13. Wang, Q., Tan, Y.: Chinese grammatical error correction method based on data augmentation and copy mechanism. CAAI Trans. Intell. Syst. 15(1), 99–106 (2020) 14. Xue, N., Xia, F., Chiou, F.D., Palmer, M.: The Penn Chinese TreeBank: phrase structure annotation of a large corpus. Nat. Lang. Eng. 11(2), 207–238 (2005) 15. Zhang, Y., Song, J., Peng, W., Zhao, Y., Song, T.: Automatic conversion of phrase structure TreeBank to sentence structure treebank. J. Chin. Inf. Process. 5, 31–41 (2018) 16. Zhang, Y., et al.: MuCGEC: a multi-reference multi-source evaluation dataset for Chinese grammatical error correction. In: Proceedings of NAACL, pp. 3118–3130 (2022) 17. Zhao, Z., Wang, H.: MaskGEC: improving neural grammatical error correction via dynamic masking. In: Proceedings of AAAI. vol. 34, pp. 1226–1233 (2020) 18. Zhou, J., Li, C., Liu, H., Bao, Z., Xu, G., Li, L.: Chinese grammatical error correction using statistical and neural models. In: Proceedings of NLPCC, pp. 117–128 (2018)

Long Short-Term Planning for Conversational Recommendation Systems Xian Li, Hongguang Shi, Yunfei Wang, Yeqin Zhang, Xubin Li, and Cam-Tu Nguyen(B) State Key Laboratory for Novel Software Technology, Nanjing University, Nanjing, China {a81257,dream,woilfwang,zhangyeqin,lixubin}@smail.nju.edu.cn, [email protected]

Abstract. In Conversational Recommendation Systems (CRS), the central question is how the conversational agent can naturally ask for user preferences and provide suitable recommendations. Existing works mainly follow the hierarchical architecture, where a higher policy decides whether to invoke the conversation module (to ask questions) or the recommendation module (to make recommendations). This architecture prevents these two components from fully interacting with each other. In contrast, this paper proposes a novel architecture, the long short-term feedback architecture, to connect these two essential components in CRS. Specifically, the recommendation predicts the long-term recommendation target based on the conversational context and the user history. Driven by the targeted recommendation, the conversational model predicts the next topic or attribute to verify if the user preference matches the target. The balance feedback loop continues until the short-term planner output matches the long-term planner output, that is when the system should make the recommendation.

Keywords: Conversational Recommendation Systems

1

· Planning

Introduction

Traditional recommendation systems rely on user behavior history, such as ratings, clicks, and purchases, to understand user preferences. However, these systems often encounter challenges due to the data sparseness issue. Specifically, users typically rate or buy only a small number of items, and new users may not have any records at all. As a result, achieving satisfactory recommendation performance becomes difficult. Furthermore, traditional models struggle to address two critical questions without clear user guidance and proactive feedback: (a) What are users interested in? and (b) What are the reasons behind each system recommendation? c The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2024  B. Luo et al. (Eds.): ICONIP 2023, LNCS 14452, pp. 383–395, 2024. https://doi.org/10.1007/978-981-99-8076-5_28

384

X. Li et al.

Fig. 1. The architecture of previous studies (a) and our work (b)

The emergence of conversational recommender systems (CRSs) has fundamentally transformed traditional recommendation methods. CRSs facilitate the recommendation of products through multi-turn dialogues, enabling users and the system to dynamically interact using natural language. In these dialogues, the system not only elicits a user’s preferences but also provides explanations for its recommended actions. Such capabilities are often absent in traditional recommendation approaches. Moreover, the conversational setting of CRSs presents a natural solution to the cold-start problem by allowing the system to proactively inquire about the preferences of new customers. Most existing works for CRS use a hierarchical structure, where a higher policy determines whether to use the conversation module (to ask questions) or the recommendation module. This architecture prevents the conversation and the recommendation modules from fully interacting with each other. In contrast, this paper proposes a new approach where we plan the conversation and the recommendation by the same module, the short-term planner. Here, the short-term planner is influenced by a long-term planner that aims to model user long-term preference. Figure 1 demonstrates the main difference between our framework and the previous ones. Our main constribution is three-fold: Firstly, it proposes a solution to combine user’s past interactions and ongoing interactions (in conversations) into a long-term planner that takes into account the timestamps of these actions. Secondly, it presents a short-term planner that smoothly drives the conversations to the targeted item from the long-term planner. Lastly, it introduces a new dataset that captures practical aspects of both the recommendation module and the conversation module in the context of conversational recommender systems.

Long Short-Term Planning for Conversational Recommendation Systems

2

385

Related Work

Previous studies focus on the conversation (Light Conversation, Heavy Recommendation) or recommendation (Heavy Recommendation, Light Conversation). 2.1

Heavy Recommendation, Light Conversation

In this type of CRS, the aim is to understand user preferences efficiently and make relevant suggestions quickly. To achieve this, the system needs to focus on selecting the right attributes and asking the right questions. Sun et al. [16] and Lei et al. [7] proposed CRM and EAR, which train a policy of when and what attributes to ask. In such studies, the recommendation is made by an external recommendation model. Lei et al. [8] proposed SCPR that exploits a hierarchical policy to decide between asking and recommendation then invokes the corresponding components to decide what to ask and which to recommend. Deng et al. [3] proposed UNICORN, a unified model to predict an item to recommend or an attribute to ask the question. The model allows rich interactions between the recommendation and the conversation model. These methods rely on a simple conversation module that is limited to templated yes/no responses. Our approach differs from these studies as we focus on real conversations where users can actively change the dialog flow and agents should smoothly change the topic towards the targeted recommendation. 2.2

Light Recommendation, Heavy Conversation

This kind of CRS puts a greater emphasis on understanding conversations and generating reasonable responses. These methods [2,10,11,14,19,20] also adopt a hierarchical policy that decides between recommendation and conversation, but additional strategies are used to bridge the semantic gap between the word space of the conversation module and the item space of the recommendation module. Li et al. [10] proposed REDIAL that exploits a switching decoder to decide between recommendation and conversation. Chen et al. [2] proposed KBRD, that exploited a switching network like REDIAL [10], but improved the interactions between the recommendation and conversation modules with entity linking and a semantic alignment between a recommendation item to the word space for response generation. Zhou et al. [19] proposed KGSF that uses word-oriented and entity-oriented knowledge graphs (KGs) to enrich data representations in the CRS. They aligned these two semantic spaces for the recommendation and the conversation modules using Mutual Information Maximization. These methods (REDIAL, KGSF, KBRD) do not plan what to ask or discuss, but generate responses directly based on the conversational history. Zhou et al. [20] introduced TG-Redial where topic prediction is used as a planner for conversations. However, there is still a higher policy to decide to invoke the conversation or the recommendation module. Similarly, Zhang et al. [18] predicted a sequence of sub-goals (social chat, question answering, recommendation) to guide the dialog model. Here, the goal prediction plays the role of higher policy.

386

X. Li et al.

Unlike these methods, we do not have two distinct modules for deciding what to ask and what to recommend. Instead, we assume a knowledge graph (KG) that connects attributes, topics, and items, and aim to plan the next entity node on the graph for grounding the dialog. Specifically, a long-term policy module, which has access to user historical interactions, predicts a targeted item in the KG. The short-term policy then exploits the targeted item and predicts either an attribute for conversation or an item for the recommendation. The objective of the short-term policy is to select a node in the KG so that the agent can smoothly drive the conversation to the long-term (targeted) item (Table 1).

3 Preliminaries

Notations for CRS. Formally, we are given an external knowledge graph G that consists of an entity set V and a relation set E (edges). The entity set V contains all entities, including items and non-items (e.g., item attributes). Alternatively, the knowledge graph can be denoted as a set of triples (edges) {(e_h, r, e_t)}, where e_h, e_t ∈ V are the head and tail entities and r indicates the relationship between them. A CRS aids users in purchase decisions via conversation. During training, utterances from a user-agent conversation are labeled with entities from the knowledge graph K. The agent's responses contain references to item entities for recommendations or non-item entities for clarification or chitchat. On the other hand, users refer to entities such as items or attributes to express their desired request. In addition to the conversation history C, we also have access to the user profile for each user u. The user profile is a list of pairs (s_i, t_i), where s_i is an item entity that the user u has interacted with in the past, and t_i is the time of the interaction. Here, the interactions can be purchases, browsing, or other historical activities besides conversations.

Table 1. The description of the key symbols

P^u   The user profile
C^u   The current conversation
S^u   The entity sequence with timestamps derived from C^u and the user profile P^u
e^l   The targeted recommendation to be made in the upcoming turns (output of long-term plan)
e^s   The entity that will be grounded in upcoming turns (output of short-term plan)
w     The latest user utterance in C^u
z_w   The representation of the current utterance
z_e   The representation of the current dialog entity sequence
K     Knowledge graph entities



Task Definition. Based on these notations, the CRS task is defined as follows: given multi-type context data (i.e., conversation, knowledge graph, user profile), we aim to (1) select entities from the knowledge graph to ground the next system response, and (2) generate a response based on the selected entities. The selected entities may contain items (for recommendation) or information about item attributes. The selected entities should be relevant to the dialog context (short-term user preference) and the user profile (long-term user preference).

4 Our Proposed Model

The LSTP’s overall structure is depicted in Fig. 2. The process begins with the extraction of entities from the conversation, of which the timestamps are set to zero. Then, the user profile is combined with this sequence to create a unified entity sequence. A targeted item entity is then selected by the long-term planner from the knowledge graph based on the sequence S u . Lastly, the short-term planner picks a grounding entity (item/non-item), which is then utilized to produce the next system response.

Fig. 2. The architecture of our LSTP framework

The long-term planner uses user profiles to more accurately target long-term user preferences, while the short-term planner considers recent utterances when planning the next entity in the conversation. The short-term planner is guided by the long-term planner to potentially lead to the long-term plan's target, while also ensuring the next entity is relevant to the dialogue history for a natural conversation. When a recommendation is necessary, the short-term planner should provide output consistent with the long-term planner.

4.1 Knowledge Representation and Grounding

Entities in the knowledge graph are represented by vectors in the same semantic space using Knowledge Graph Embeddings (KGE). This step is essential for both the long-term planner and the short-term one.



Knowledge Graph Embeddings. In this paper, we utilize TransE [1], which is available in the toolkit OpenKE [5], for knowledge graph embeddings. The main idea of TransE is to represent entities and relations in the same semantic space so that if e_h should be connected to e_t via the relation r, then e_h + r ≈ e_t. Here, we use the same notation for entities (relations) and their vector representations. Formally, TransE learns a scoring function f as follows: f(e_h, r, e_t) = −‖e_h + r − e_t‖_{1/2}, where ‖·‖_{1/2} denotes either the L1 or L2 norm, and e_h, e_t ∈ R^d with d = 1024 being the embedding dimension. The scoring function is larger if (e_h, r, e_t) is more likely to be a fact, i.e., e_h is connected to e_t via r in the KG. Contrastive learning [5] is used to learn embeddings for all the entities and relations by enforcing the scores of true triples to be higher than those of negative (distorted) triples.

Entity Linking (or Knowledge Grounding). The objective of entity linking is to find entities mentioned in the dialog context. This is done by learning a sentence representation that can be used to retrieve related entities from the knowledge graph. Specifically, we are given a training set of pairs (w_i, e_i), in which w_i indicates a conversational utterance and e_i is an entity mentioned in the utterance w_i. User utterances are represented by BERT [4], whereas entities are represented as previously described. We exploit BERT-large, and thus the output representation for the user utterance is of size 1024, which is the same as the knowledge embedding dimension. Contrastive learning [5] is used to finetune the representation of the utterances w_i so that the representation of w_i is closer to the entity representation if e_i is mentioned in w_i. Note that here we only update the utterance representation while keeping the entity embeddings unchanged.
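To make the scoring concrete, the following minimal NumPy sketch shows the TransE score and a margin-based contrastive objective over one true triple and one corrupted triple. This is an illustration under our own simplifications (the paper itself relies on the OpenKE implementation), and all variable names are placeholders:

    import numpy as np

    def transe_score(e_h, r, e_t, norm=1):
        # Higher score means the triple (e_h, r, e_t) is more plausible.
        return -np.linalg.norm(e_h + r - e_t, ord=norm)

    def margin_loss(pos_triple, neg_triple, margin=1.0):
        # Push the true triple to score higher than the corrupted one.
        pos = transe_score(*pos_triple)
        neg = transe_score(*neg_triple)
        return max(0.0, margin - pos + neg)

    # Toy usage with d = 4 instead of 1024 for brevity.
    d = 4
    e_h, r, e_t = np.random.randn(3, d)
    e_t_corrupt = np.random.randn(d)
    loss = margin_loss((e_h, r, e_t), (e_h, r, e_t_corrupt))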

4.2 Long-Term Planning

The goal of the Long-term Planner (LTP) is to anticipate the upcoming recommendation that can be made based on a series of entities from the user profile and the ongoing conversation context. To train LTP, we randomly select conversational contexts and their respective recommendations to create a dataset. It is important to note that not every conversation turn results in a recommendation, so the recommended item may come several turns after the latest turn in the dialog context. LTP is designed to consider a user's past interactions to make recommendations further in the future. The training set for LTP consists of triples (P^u, C^u, e^l). Here, P^u and C^u respectively represent the user profile and the context of the dialogue with the user u, and e^l indicates the targeted recommendation to be made in the upcoming turns. The entity sequence with timestamps derived from C^u and the user profile P^u is denoted as S^u = {(e_{s1}, t_1), . . . , (e_{sl}, t_l), (e_{s(l+1)}, 0), . . . , (e_{s(l+m)}, 0)}, where l and m respectively represent the count of entities in the user profile and the dialog history. To ensure uniformity, we set the length of the sequence S^u to be N and truncate the oldest entities in the sequence.



The entity sequence can then be represented as S^u = {(e_{s1}, t_1), . . . , (e_{sN}, t_N)}, where the timestamp t_i is zero if the corresponding entity is mentioned in the current dialog context instead of the user profile. Padding is applied to sequences S^u with length less than N. We represent the sequence S^u by Multi-head Time-Aware Self-Attention (MH-TaSelfAttn) [9]. Unlike standard self-attention in the Transformer [17], time-aware self-attention (Ta-SelfAttn) takes into account the personalized interval between two user interactions (entities in S^u) to calculate the attention score between them. By personalization, the user-specific minimum and maximum interval values are considered for modeling temporal information between two user interactions. Specifically, we initially represent each entity in S^u by knowledge embeddings, and then obtain the entity sequence representation as follows:

Z = MH-TaSelfAttn(S^u).    (1)

Here Z = {z_1, . . . , z_N} is the sequence of output representations and z_i ∈ R^d. To predict the upcoming recommendation, we take the last vector from Z as the sequence representation z_l. We then measure the relevance between z_l and a candidate item using a dot-product score. We finetune the MH-TaSelfAttn layers to optimize the output representation so that the score of the upcoming recommendation e^l is higher than that of other (negative) items. During inference, the item with the highest score ẽ^l is used as the predicted recommendation target.
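As a rough illustration of how the long-term planner ranks candidate items, the sketch below compares the last position of the attention output against frozen item embeddings by dot product and trains with a softmax cross-entropy over the candidate set. The cross-entropy formulation is our own assumption for the "higher than negative items" objective; names are placeholders:

    import numpy as np

    def ltp_scores(seq_repr_last, item_embeddings):
        # seq_repr_last: (d,) last vector z_l of Z; item_embeddings: (num_items, d)
        return item_embeddings @ seq_repr_last            # (num_items,)

    def ltp_loss(scores, target_item_id):
        # Softmax cross-entropy: push the target e^l above negative items.
        scores = scores - scores.max()                    # numerical stability
        log_probs = scores - np.log(np.exp(scores).sum())
        return -log_probs[target_item_id]

    d, num_items = 8, 100
    z_l = np.random.randn(d)
    items = np.random.randn(num_items, d)
    loss = ltp_loss(ltp_scores(z_l, items), target_item_id=3)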

4.3 Short-Term Planning

The purpose of the Short-term Planner (STP) is to choose an entity that is related to the current dialog context and helps guide the conversation toward the LTP target. Intuitively, if the selected entity matches the LTP output, the next system response should provide a recommendation. The STP pays more attention to the current dialog when making decisions, unlike the LTP, which makes use of the user's historical actions. During STP training, the actual target (e^l) is used instead of the predicted (long-term) target ẽ^l for the upcoming recommendation. In addition, STP accepts as input the latest user utterance w and the entity sequence S_c^u, which is the part of S^u containing the entities in the dialog context C^u. The representation for the multi-type input of STP is obtained by:

z_w = BERT(w),    (2)
z_e = Pooling[MH-SelfAttn(S_c^u)],    (3)
z_s = Mean[SelfAttn(z_w, z_e, e^l)],    (4)

where the last equation shows how the short-term representation is obtained by fusing the current utterance representation z_w, the current dialog entity sequence representation z_e, and the long-term target e^l. Here, Pooling indicates that we take the last item representation from MH-SelfAttn, similarly to LTP, and SelfAttn indicates the standard self-attention operation in the Transformer [17]. The STP is trained so that the next entity associated with the current context is scored higher than other entities. As in LTP, only the additional layers in STP are finetuned, not the entity embeddings. During inference, the item with the highest score ẽ^s is used as the prediction for the next grounding entity.
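The fusion in Eq. (4) can be pictured as a plain self-attention over the three input vectors followed by a mean, as in the hedged sketch below (the real model fine-tunes learned attention layers; z_w, z_e and e_l here are random placeholders for the BERT, entity-sequence and long-term-target representations):

    import numpy as np

    def softmax(x, axis=-1):
        x = x - x.max(axis=axis, keepdims=True)
        e = np.exp(x)
        return e / e.sum(axis=axis, keepdims=True)

    def self_attention(X):
        # X: (3, d), rows are z_w, z_e, e_l; single head, no projections for brevity.
        d = X.shape[-1]
        attn = softmax(X @ X.T / np.sqrt(d))
        return attn @ X                                   # (3, d)

    def short_term_repr(z_w, z_e, e_l):
        fused = self_attention(np.stack([z_w, z_e, e_l]))
        return fused.mean(axis=0)                         # z_s in Eq. (4)

    d = 8
    z_s = short_term_repr(np.random.randn(d), np.random.randn(d), np.random.randn(d))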

4.4 Plan-Based Response Generation

Given the next grounding entity ẽ^s from the STP, and letting ẽ^{s−1} be the grounding entity of the previous agent turn, knowledge search (Algorithm 1) aims to select a set K̄ of surrounding entities (one or two hops away) to improve the context for smooth response generation. The returned list is then flattened and combined with the latest utterance w as input to the T5 model [13] for generating a response. During training, T5 is fine-tuned by optimizing the model to generate the correct response given the latest user utterance and the correct grounding knowledge.

Algorithm 1. Knowledge Search
Require: Grounding entities for the previous and next agent turns ẽ^{s−1}, ẽ^s; Knowledge Graph K = {(e_k^h, r_k, e_k^t)} for k = 1..N_tri, where e_k^h and e_k^t indicate the head and tail entities in the k-th triple
Ensure: Extended grounding knowledge list K̄
1: Initialize K̄ = ∅
2: for k = 1 to N_tri do
3:   if e_k^h = ẽ^{s−1} and e_k^t = ẽ^s then
4:     K̄ = K̄ ∪ (e_k^h, r_k, e_k^t)
5:   else if e_k^h = ẽ^{s−1} and ∃r such that (e_k^t, r, ẽ^s) ∈ K then
6:     K̄ = K̄ ∪ (e_k^h, r_k, e_k^t)
7:   else if e_k^h = ẽ^s or e_k^t = ẽ^{s−1} then
8:     K̄ = K̄ ∪ (e_k^h, r_k, e_k^t)
9:   end if
10: end for
11: return K̄[:20]
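A direct Python transcription of Algorithm 1 might look as follows. This is a sketch under the assumption that triples are plain (head, relation, tail) tuples; the cap of 20 triples mirrors the final truncation step:

    def knowledge_search(kg_triples, e_prev, e_next, max_triples=20):
        # kg_triples: list of (head, relation, tail) tuples from the KG.
        heads_with_tail_to_next = {h for (h, r, t) in kg_triples if t == e_next}
        selected = []
        for (h, r, t) in kg_triples:
            if h == e_prev and t == e_next:                      # direct bridge
                selected.append((h, r, t))
            elif h == e_prev and t in heads_with_tail_to_next:   # two-hop bridge
                selected.append((h, r, t))
            elif h == e_next or t == e_prev:                     # context around the endpoints
                selected.append((h, r, t))
        return selected[:max_triples]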

5 Data Collection

Our assumption is that a user's current preference should be influenced by their ongoing dialogue and by long-term interests reflected in their historical actions. However, current datasets do not provide sufficient information for our evaluation. For instance, the ReDial dataset lacks user profiles. On the other hand, although the TG-Redial dataset offers user profiles, they are not accompanied by the timestamps essential for LSTP modeling. Therefore, we created our own dataset, TAP-Dial (Time-Aware Preference-Guided Dial). Data gathering for TAP-Dial is similar to TG-Redial but with some differences. Firstly, every user profile comes with a timestamp, which is not present in TG-Redial. Secondly, although the conversation grounding task in TAP-Dial resembles the next topic prediction in TG-Redial, we propose that there is a unified knowledge graph that links item attributes and topics. More details on our data collection are provided below.

Table 2. Statistics of entities and relations in the knowledge graph

Entities (name: number): Movie: 5733; Star: 2920; Types of Movies: 31; Location: 175; Profession: 21; Date: 8887; Number: 1223; Key words: 2063; Constellation: 12; Awards: 15816

Relations (name: number): The constellation of: 2691; The type of: 14566; The relative of: 470; The award records of: 35245; The popularity of: 5733; The key words of: 18369; The representative works of: 7668; The screenwriter of: 2997; The main actors of: 14364; The director of: 2766; The release date of: 7862; The country of: 842; The birth date of: 2607; The profession of: 8606; The birthplace of: 2852; The score of: 5719; Collaborate with: 1094; Star: 14364

Data Collection and Knowledge Graph Construction. We focused on movies as our domain of research and obtained raw data from the Douban website. Our selection process involved filtering users with inaccurate or irrelevant information to create a user set of 2,693. In addition, we gathered a total of 5,433 movies that were the most popular at the time as our item set. We also gathered supplementary data including information about directors, actors, tags, and reviews. A knowledge graph is then constructed with entities and relations as shown in Table 2.

Dialog Flow Construction. To generate a list of recommendations for user conversations, we begin by selecting a set of targeted items. This is done by clustering the items in the user's history to determine a mixture of their preferences. We then choose the cluster centers as potential targets for recommendations, taking into account the most significant clusters and the timestamps. Inspired by TG-Redial, we assume that the conversation should smoothly and naturally lead to the recommended items. Unlike TG-Redial, which relies on a separate topic set to ground non-recommendation turns, we use the knowledge graph's set of entities as potential grounding knowledge. In order to ensure smooth transitions between turns, we construct dialog flows consisting of lists of entities in the knowledge graph that gradually lead to the targeted items. Note that the first grounding entity can be randomly chosen.

Dialog Annotation. In the final stage, we recruited crowd-workers to write the dialogs given the dialog flows. We received a total of 4416 dialogs for training, 552 dialogs for validation, and 552 for testing. Note that all the dialogs are accompanied by grounding entities and targeted recommendations.

6 Experiments

6.1 Baselines and Metrics

In our experiments, we used several baselines, including REDIAL [10], KBRD [2], KGSF [19], and TG-REDIAL [20]. All these baselines rely on sequence-to-sequence models as the basis for their conversation modules. For evaluation, previous methods assume that there is an oracle policy that predefines recommendation turns, and they evaluate the recommendation task and the conversation task separately. To ensure fairness, we compared our LSTP method with the other methods on the recommendation and conversation tasks separately using a similar procedure. We used MRR [15], NDCG [6], and HIT for the recommendation task, and BLEU [12], Distinct, and F1 for the generation task.
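For reference, a minimal sketch of how the ranking metrics can be computed for a single recommendation query (our own simplified formulation for the single-target case; the paper follows the standard definitions in [6,15]):

    import numpy as np

    def hit_at_k(ranked_item_ids, target_id, k):
        return float(target_id in ranked_item_ids[:k])

    def ndcg_at_k(ranked_item_ids, target_id, k):
        # With a single relevant item, DCG reduces to 1/log2(rank+1) and IDCG to 1.
        if target_id in ranked_item_ids[:k]:
            rank = ranked_item_ids[:k].index(target_id)      # 0-based position
            return 1.0 / np.log2(rank + 2)
        return 0.0

    ranking = [5, 2, 9, 1, 7]
    print(hit_at_k(ranking, target_id=9, k=3), ndcg_at_k(ranking, target_id=9, k=3))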

6.2 Main Results

Recommendation. The recommendation task results are shown in Table 3. It is observable that TG-REDIAL outperforms the other baselines. This is because it incorporates both contextual and historical sequence information. LSTP achieves the best performance as it includes not only sequence information but also temporal interval information. In the long-term planning model, accuracy can be improved by increasing the number of stacked attention modules, but this comes with a time overhead. Therefore, a model with four stacked attention modules was chosen to balance time and accuracy.

Table 3. Results of recommendation task

Model       NDCG@1  NDCG@10  NDCG@50  HIT@10  HIT@50
REDIAL      0.002   0.010    0.048    0.005   0.013
KBRD        0.132   0.284    0.327    0.228   0.237
KGSF        0.103   0.222    0.263    0.177   0.186
TG-REDIAL   0.267   0.466    0.479    0.399   0.404
LSTP        0.301   0.474    0.481    0.417   0.418

Dialog Generation. The dialog generation results in Table 4 indicate that LSTP performs the best. In comparison with the dialogue modules of other models, the advantage of LSTP lies in its ability to select relevant knowledge from the knowledge graph based on correct prediction results. Without the knowledge search module (LSTP w/o KS), the model's advantage would not be apparent, which also demonstrates the role of the Long Short-Term Planner (LSTP) module in predicting the next topic and making recommendations. Additionally, the distinctiveness of LSTP (w/o KS) is lower than that of the baselines, but the distinctiveness of LSTP is significantly higher than that of the baselines. This confirms the significant impact of introducing external knowledge for diverse responses.

Table 4. The results of dialog generation task

Model           BLEU@1  BLEU@2  BLEU@3  BLEU@4  Dist@1  Dist@2  Dist@3  Dist@4  F1
REDIAL          0.168   0.020   0.003   0.001   0.017   0.242   0.500   0.601   0.21
KBRD            0.269   0.070   0.027   0.011   0.014   0.134   0.310   0.464   0.28
KGSF            0.262   0.058   0.021   0.007   0.012   0.114   0.240   0.348   0.26
TG-REDIAL       0.183   0.040   0.013   0.005   0.013   0.153   0.352   0.532   0.22
LSTP (w/o KS)   0.297   0.106   0.054   0.029   0.019   0.182   0.332   0.450   0.32
LSTP            0.333   0.136   0.081   0.054   0.022   0.263   0.519   0.607   0.38

6.3 Additional Analysis

Conversational Grounding Task. Following TG-Redial [20], we compare LSTP to several baselines, including MGCG [11], Conv-Bert, and Topic-Bert [20], on predicting entities to ground the conversation at non-recommendation turns. The results of the entity prediction for non-recommendation turns are shown in Table 5, demonstrating that the LSTP model achieves the best performance. This is partially because LSTP incorporates not only sequence information but also temporal interval information and sentence information.

Ablation Study. The results of our ablation study are presented in Table 6. Here, Over., Rec., and Conv. respectively refer to the grounding prediction at all turns, recommendation turns, or conversation turns. We implemented the Long Short-Term Planning (LSTP) model with different variants of the short-term planner, where we add the user history to the short-term planner (w/ history), exclude the long-term planner (w/o long-term), or exclude the latest user utterance (w/o latest utt). When integrating the user's historical interaction into the short-term planner, we observed a substantial enhancement in the recommendation outcomes but a significant deterioration in the conversation results. On the other hand, without the guidance of long-term planning (w/o long-term), the performance of recommendations suffered, partially demonstrating the important role of the long-term planner. In contrast, the consideration of the latest user utterance did not seem to have a significant impact on the results, partially showing that entity linking might provide sufficient information for planning.



Table 5. Grounding entity prediction at non-recommendation turns

Model       HIT@1  HIT@3  HIT@5
Conv-Bert   0.169  0.245  0.285
Topic-Bert  0.251  0.348  0.394
MGCG        0.174  0.281  0.335
TG-REDIAL   0.219  0.327  0.382
LSTP        0.312  0.447  0.482

Table 6. HIT@1 of LSTP with variants of the Short-term planner

Model            Over.  Rec.   Conv.
LSTP             0.308  0.301  0.312
w/ history       0.279  0.33   0.254
w/o long-term    0.304  0.286  0.312
w/o latest utt.  0.308  0.305  0.309

7 Conclusion

In this paper, we investigated the issue of insufficient interaction between the dialogue and recommendation modules in previous CRS studies. We introduced the LSTP model, which consists of a long-term module and a short-term module. The long-term module predicts a targeted recommendation based on long-term human interactions (historical interactions). The short-term module predicts the subsequent topic or attribute, thereby ascertaining whether the user's preference aligns with the designated target. This feedback loop is continuously cycled until the output from the short-term planner matches the long-term planner's output. The equilibrium state indicates the system's readiness for recommendation. We crafted a novel conversation dataset reflecting this dynamic. Experimental results on this dataset verify the effectiveness of our method.

Acknowledgements. We thank the data annotators for their meticulous work on dialogue annotation, which was pivotal for this research.

References
1. Bordes, A., Usunier, N., et al.: Translating embeddings for modeling multi-relational data. In: NIPS (2013)
2. Chen, Q., Lin, J., et al.: Towards knowledge-based recommender dialog system. In: EMNLP (2019)
3. Deng, Y., Li, Y., Sun, F., et al.: Unified conversational recommendation policy learning via graph-based reinforcement learning. In: SIGIR (2021)
4. Devlin, J., Chang, M.-W., et al.: BERT: pre-training of deep bidirectional transformers for language understanding. In: NAACL (2018)
5. Han, X., Cao, S., et al.: OpenKE: an open toolkit for knowledge embedding. In: EMNLP (2018)
6. Järvelin, K., Kekäläinen, J.: Cumulated gain-based evaluation of IR techniques. TOIS (2002)
7. Lei, W., He, X., et al.: Estimation-action-reflection: towards deep interaction between conversational and recommender systems. In: WSDM (2020)
8. Lei, W., Zhang, G., et al.: Interactive path reasoning on graph for conversational recommendation. In: KDD (2020)
9. Li, J., Wang, Y., McAuley, J.: Time interval aware self-attention for sequential recommendation. In: WSDM (2020)
10. Li, R., Kahou, S.E., et al.: Towards deep conversational recommendations. In: NIPS (2018)
11. Liu, Z., Wang, H., et al.: Towards conversational recommendation over multi-type dialogs. In: ACL (2020)
12. Papineni, K., Roukos, S., et al.: BLEU: a method for automatic evaluation of machine translation. In: ACL (2002)
13. Raffel, C., Shazeer, N., et al.: Exploring the limits of transfer learning with a unified text-to-text transformer. J. Mach. Learn. Res. 21, 5485–5551 (2020)
14. Ren, X., Yin, H., et al.: Learning to ask appropriate questions in conversational recommendation. In: SIGIR (2021)
15. Salton, G., McGill, M.: Introduction to Modern Information Retrieval. McGraw-Hill Inc., New York (1984)
16. Sun, Y., Zhang, Y.: Conversational recommender system. In: SIGIR (2018)
17. Vaswani, A., Shazeer, N., et al.: Attention is all you need. In: NIPS (2017)
18. Zhang, J., Yang, Y., et al.: KERS: a knowledge-enhanced framework for recommendation dialog systems with multiple subgoals. In: Findings of EMNLP (2021)
19. Zhou, K., Zhao, W.X., et al.: Improving conversational recommender systems via knowledge graph based semantic fusion. In: KDD (2020)
20. Zhou, K., Zhou, Y., et al.: Towards topic-guided conversational recommender system. In: COLING (2020)

Gated Bi-View Graph Structure Learning

Xinyi Wang and Hui Yan(B)

School of Computer Science and Engineering, Nanjing University of Science and Technology, Nanjing, China
{wangxinyi,yanhui}@njust.edu.cn

Abstract. Graph structure learning (GSL), which aims to optimize the graph structure and learn suitable parameters of graph neural networks (GNNs) simultaneously, has shown great potential in boosting the performance of GNNs. As a branch of GSL, multi-view methods mainly learn an optimal graph structure (final view) from multiple information sources (basic views). However, when the basic views' structural information is insufficient, existing methods ignore the fact that different views can complement each other. Moreover, existing methods obtain the final view through simple combination and fail to constrain noise, which inevitably brings in irrelevant information. To tackle these problems, we propose a Gated Bi-View GSL architecture, named GBV-GSL, which lets two basic views interact through a selection gating mechanism, so as to "turn off" noise as well as supplement insufficient structures. Specifically, two basic views that focus on different knowledge are extracted from the original graph as the two inputs of the model. Furthermore, we propose a novel view interaction technique based on a selection gating mechanism to remove redundant structural information and supplement insufficient topology while retaining each view's focused knowledge. Finally, we design a view attention fusion mechanism to adaptively fuse the two interacted views into the final view. In numerical experiments involving both clean and attacked conditions, GBV-GSL shows significant improvements in the effectiveness and robustness of structure learning and node representation learning. Code is available at https://github.com/Simba9257/GBV-GSL.

Keywords: Graph neural networks · Graph structure learning · Gating mechanism

1 Introduction

Graphs are capable of modeling real systems in diverse domains, varying from natural language and images to network analysis. Nowadays, as an emerging technique, Graph Neural Networks (GNNs) [7,12,18] have achieved great success with their characteristic message passing scheme [5], which aggregates information from neighbors iteratively. So far, GNNs have shown superior performance in a wide range of applications, such as node classification [21,22] and link prediction [23].
© The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2024. B. Luo et al. (Eds.): ICONIP 2023, LNCS 14452, pp. 396–407, 2024. https://doi.org/10.1007/978-981-99-8076-5_29



GNNs are extremely sensitive to the quality of given graphs and thus require resilient and high-quality graph structures. However, we are not always provided with graph structures, such as in natural language processing [2,13] or computer vision [17]. Even if given the graph structures, due to the complexity of real information sources, the quality of graphs is often unreliable. On one hand, graph structures in real-world tend to be noisy, incomplete, adversarial, and heterophily (i.e., the edges with a higher tendency to connect nodes of different types). On the other hand, graphs sometimes suffer from malicious attacks, such that original structures are fatally destroyed. With attacked graphs, GNNs would be very vulnerable. As a result, there are various drawbacks prevailing in real graphs, which prohibits original structure from being the optimal one for downstream tasks. Recently, graph structure learning (GSL) has aroused considerable attentions, which aims to learn optimal graph structure and parameters of GNNs simultaneously [25]. Current GSL methods can be roughly divided into two categories, single-view [4,9,10,24] based and multi-view based [1,14,16,19]. The single-view based GSL method is first proposed to estimate the optimal structure from one view, i.e., the given adjacency matrix, by forcing the learned structure to accord with some properties. For instance, LDS [4] samples graph structure from a Bernoulli distribution of adjacency matrix and learns them together with GNN parameters in a bi-level way. Pro-GNN [10] learns the graph structure with low rank, sparsity and feature smoothness constraints. However, considering that learning graph structure from one information source inevitably leads to bias and uncertainty [14], the multi-view based GSL method aims to extract multiple basic views from original structure, and then comprehensively estimate the final optimal graph structure based on these views. As an example, IDGL [1] constructs the structure by two type of views: normalized adjacency matrix and similarity matrix calculated with node embeddings. GEN [19] presents an iterative framework based on Bayesian inference. CoGSL [14] propose a compact GSL architecture by mutual information compression. Multi-view based methods are able to utilize multifaceted knowledge to make the final decision on GSL. While promising, multi-view based GSL methods still have the following issues. (1) The complementarity of different views is ignored. Different views have their own focused knowledge. For example, the adjacency matrix focuses on the actual connection between nodes in reality, while the similarity matrix focuses on the similarity of node embeddings. These views may be insufficient, but they can complement each other to supplement insufficient knowledge. Existing multi-view methods augment the basic view separately, which ignores the complementarity between different views and may not be able to deeply explore the optimal graph structure. (2) Unrestricted information irrelevant to downstream tasks. An optimal graph structure should only contain the most concise information about downstream tasks (e.g., node labels), so that it can conduct the most precise prediction on labels. If the learned structure absorbs the information of labels as well as additional irrelevance from basic views, this structure is more prone to adversarial attacks when small perturbations are deployed on



these irrelevant parts. Existing multi-view methods obtain the final view through simple combination and fail to constrain the irrelevant information flowing from the basic views into the final view. Hence, the final structure inevitably involves additional noise and disassortative connections and is also vulnerable to perturbations.
To address these issues, and considering that the basic views are the information source of the final view, it is vital to guarantee the quality of the basic views. On one hand, the basic views also need to contain the information about labels, which fundamentally guarantees the performance of the final view. On the other hand, these views should also be independent of each other, so that they can eliminate redundancy and provide diverse knowledge about labels for the final view. In addition, considering that the final view extracts information from the basic views, we need to constrain the information flow from the basic views to the final view, which avoids irrelevant information and contributes to the robustness of the model.
In this paper, we present GBV-GSL, an effective and robust graph structure learning framework that can adaptively optimize the topological graph structure and achieve superior node representations. Specifically, we first carefully extract two basic views that focus on different knowledge from the original structure as inputs, and utilize a view estimator to properly adjust the basic views. With the estimated basic views, we propose a view interaction technique based on a selection gating mechanism to remove redundant structural information and supplement insufficient topology. In this mechanism, the model first calculates a gating signal based on the two estimated views, and the gating signal controls the proportion of their mixed information in their respective topological structures. The objective of gating is twofold: on the complement side, to control the importance given to the mixed information, and on the denoise side, to decide how much of the redundant information each view should "forget". Then, we further design a view attention fusion mechanism to automatically learn the importance weights of the interacted views, so as to adaptively fuse them. In the end, the label information is used not only for the final view, but also for supervising the classification results of the two interacted views. Our contributions are summarized as follows:
– To our best knowledge, we are the first to utilize a gating mechanism to study the optimal structure in GSL. We propose a view interaction module, which aims to supplement insufficient structures as well as suppress irrelevant noise.
– We propose GBV-GSL, an effective and robust gated bi-view graph structure learning framework, which can achieve superior node representations.
– We validate the effectiveness of GBV-GSL compared with state-of-the-art methods on seven datasets. Additionally, GBV-GSL also outperforms other GSL methods under attacked conditions, which further demonstrates its robustness.

2 Problem Definition

Let G = (V, ξ) represent a graph, where V is the set of n nodes and ξ is the set of edges. All edges formulate an original adjacency matrix A ∈ Rn×n , where Aij



Fig. 1. The overview of our proposed GBV-GSL. (a) Model framework. (b) View estimator. (c) View interaction.

denotes the relation between nodes v_i and v_j. Graph G is often assigned a node feature matrix X = [x_1, x_2, . . . , x_N] ∈ R^{n×d}, where x_i is the d-dimensional feature vector of node i. In semi-supervised classification, we only have a small set of nodes with labels Y_L. The traditional goal of graph structure learning for GNNs is to simultaneously learn an optimal structure and the GNN parameters to boost downstream tasks. As one typical architecture, GCN [12] is usually chosen as the backbone, which iteratively aggregates neighbors' information. Formally, the k-th GCN layer can be written as:

GCN(A, H^{(k)}) = D^{−1/2} A D^{−1/2} H^{(k−1)} W^{(k)},    (1)

where D is the degree matrix of A, and W^{(k)} is a weight matrix. H^{(k)} represents the node embeddings in the k-th layer, and H^{(0)} = X. In this paper, we simply use GCN(V, H) to represent this formula, where V is the view and H is the node features or embeddings.
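A minimal NumPy sketch of one such GCN layer (Eq. (1)), assuming a dense adjacency matrix and omitting the non-linearity and self-loops for brevity; all names are placeholders:

    import numpy as np

    def gcn_layer(A, H, W):
        # Symmetrically normalized propagation: D^{-1/2} A D^{-1/2} H W
        deg = A.sum(axis=1)
        d_inv_sqrt = np.zeros_like(deg)
        nz = deg > 0
        d_inv_sqrt[nz] = deg[nz] ** -0.5
        A_norm = A * d_inv_sqrt[:, None] * d_inv_sqrt[None, :]
        return A_norm @ H @ W

    n, d_in, d_out = 5, 8, 4
    A = (np.random.rand(n, n) > 0.5).astype(float)
    A = np.maximum(A, A.T)                    # make the toy adjacency symmetric
    H = np.random.randn(n, d_in)
    W = np.random.randn(d_in, d_out)
    H_next = gcn_layer(A, H, W)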

3 The Proposed Model

In this section, we elaborate on the proposed model GBV-GSL. The overall architecture is shown in Fig. 1(a). Our model begins with two basic views. Then, we utilize a view estimator to optimize the two basic views separately. With the two estimated views, we apply a view interaction technique based on a selection gating mechanism to optimize them again. Next, we design a view attention fusion mechanism to adaptively fuse the two interactive views and generate the final view. Finally, we provide the optimization objective of the model.

3.1 The Selection of Basic Views

Given a graph G, GBV-GSL starts by extracting different structures. In this paper, we mainly investigate three widely-studied structures: (1) Adjacency matrix, which reflects the local structure; (2) Diffusion matrix, which represents the stationary transition probability from one node to other nodes and provides a global view of the graph. Here, we choose Personalized PageRank (PPR), whose closed-form solution [8] is S = α(I − (1 − α)D^{−1/2}AD^{−1/2})^{−1}, where α ∈ (0, 1] denotes the teleport probability in a random walk, I is the identity matrix, and D is the degree matrix of A; (3) KNN graph, which reflects the similarity in feature space. We utilize the original features to calculate the cosine similarity between each node pair, and retain the top-k similar nodes for each node to construct the KNN graph. These three views capture different properties from various angles, and we carefully select two of them as the two basic views V_1 and V_2, which are the inputs of GBV-GSL.
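Under the stated closed form, the PPR diffusion matrix and the KNN view can be sketched as follows (dense NumPy for clarity; alpha and k stand in for the hyper-parameters mentioned above):

    import numpy as np

    def ppr_diffusion(A, alpha=0.15):
        # S = alpha * (I - (1 - alpha) * D^{-1/2} A D^{-1/2})^{-1}
        deg = A.sum(axis=1)
        d_inv_sqrt = np.zeros_like(deg)
        nz = deg > 0
        d_inv_sqrt[nz] = deg[nz] ** -0.5
        A_norm = A * d_inv_sqrt[:, None] * d_inv_sqrt[None, :]
        n = A.shape[0]
        return alpha * np.linalg.inv(np.eye(n) - (1 - alpha) * A_norm)

    def knn_graph(X, k=5):
        # Cosine similarity between rows of X, keep top-k neighbours per node.
        X_norm = X / (np.linalg.norm(X, axis=1, keepdims=True) + 1e-12)
        sim = X_norm @ X_norm.T
        np.fill_diagonal(sim, -np.inf)
        knn = np.zeros_like(sim)
        idx = np.argsort(-sim, axis=1)[:, :k]
        rows = np.arange(sim.shape[0])[:, None]
        knn[rows, idx] = 1.0
        return knn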

3.2 View Estimator

Given two basic views V_1 and V_2, we need to further polish them so that they are more flexible for generating the final view. Here, we devise a view estimator for each basic view, shown in Fig. 1(b). Specifically, for basic view V_1, we first apply a GCN [12] layer to get embeddings Z^1 ∈ R^{n×d_es}:

Z^1 = σ(GCN(V_1, X)),    (2)

where σ is a non-linear activation. With the embedding Z^1, the probability of an edge between each node pair in V_1 can be reappraised. For a target node i, we concatenate its embedding z_i^1 with the embedding z_j^1 of another node j, which is followed by an MLP layer:

w_{ij}^1 = W_1 · [z_i^1 ‖ z_j^1] + b_1,    (3)

where w_{ij}^1 denotes the weight between i and j, W_1 ∈ R^{2d_es×1} is a mapping vector, and b_1 ∈ R^{2d_es×1} is the bias vector. Then, we normalize the weights for node i to get the probability p_{ij}^1 between node i and another node j. Moreover, to alleviate space and time expenditure, we only estimate within a limited scope S^1. For example, for the adjacency matrix or the KNN graph, we only inspect the k-hop neighbors, and for the diffusion matrix, we only re-estimate the top-h neighbors of each node according to PPR values. Here, h and k are hyper-parameters. So, p_{ij}^1 is calculated as:

p_{ij}^1 = exp(w_{ij}^1) / Σ_{k∈S^1} exp(w_{ik}^1).    (4)

In this way, we construct a probability matrix P^1, where each entry is calculated by Eq. (4). Combined with the original structure, the estimated view is as follows:

V_es^1 = V_1 + μ^1 · P^1,    (5)

where μ^1 ∈ (0, 1) is a combination coefficient, and the i-th row of V_es^1, denoted as V_es^1_{−i}, gives the new neighbors of node i in the estimated view. Estimating V_2 is similar to V_1 but with a different set of parameters, and we finally obtain the estimated view V_es^2.
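A schematic NumPy version of the estimator for one basic view (Eqs. (2)–(5)); the GCN layer is assumed to have been applied already, the scoring MLP and the scope S^1 are simplified, so treat this as an illustration of the data flow rather than the exact parameterization:

    import numpy as np

    def estimate_view(V, Z, mu=0.5, scope_size=10):
        # V: (n, n) basic view; Z: (n, d) node embeddings from a GCN over V.
        n, d = Z.shape
        W = np.random.randn(2 * d)                 # scoring vector for concatenated pairs
        P = np.zeros((n, n))
        for i in range(n):
            # Scope S^i: here simply the largest `scope_size` entries of row i of V.
            scope = np.argsort(-V[i])[:scope_size]
            scores = np.array([W @ np.concatenate([Z[i], Z[j]]) for j in scope])
            scores = np.exp(scores - scores.max())
            P[i, scope] = scores / scores.sum()    # Eq. (4)
        return V + mu * P                          # Eq. (5)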


3.3 View Interaction

The two estimated views V_es^1 and V_es^2 only contain the topological information of their respective basic structural views V_1 and V_2, which may be insufficient or noisy. To address this issue, we introduce a portion of their mixed topology information into V_es^1 and V_es^2, and refer to this process as view interaction, shown in Fig. 1(c). Specifically, we use a selection gating mechanism to make V_es^1 and V_es^2 interact with each other. First, the gating signal r is computed by

r = σ(V_es^1 · U_r + b_r^1 + V_es^2 · W_r + b_r^2),    (6)

where U_r, W_r ∈ R^{n×n} are learnable parameters, b_r^1, b_r^2 ∈ R^{n×1} are bias vectors, σ is the sigmoid activation function sigmoid(x) = 1/(1 + e^{−x}), and r ∈ (0, 1) represents the importance of the data passed by the gate. After obtaining the gating signal r, and taking V_es^1 as an example, we pass it through the gate with V̄_es = (V_es^1 + V_es^2)/2 to obtain the interactive view V_in^1:

V_in^1 = (1 − r) · V_es^1 + r · V̄_es,    (7)

here we use V̄_es to represent the mixed topology information because it is simple and effective. Similarly, the other interactive view V_in^2 is obtained from the same gating system:

V_in^2 = (1 − r) · V_es^2 + r · V̄_es.    (8)

1 2 and Vin . Considering the node label can Now we have two interactive views Vin be correlated with one of them or even their combinations, we use the  attention   1 2 , Vin to learn their corresponding importance β 1 , β 2 as mechanism att Vin follows:  1 2  1 2 (9) β , β = att Vin , Vin ,

here β 1 , β 2 ∈ Rn×1 indicate the attention values of n nodes with embeddings Vin1 , Vin2 , respectively. 1 1 is Vin ∈ R1×n (i.e., Here we focus on node i, where its embedding in Vin −i 1 ). We firstly transform the embedding through a nonlinear the i-th row of Vin  transformation, and then use one shared attention vector q ∈ Rh ×1 to get the attention value ωi1 as follows:

T  1 + b . (10) ωi1 = q T · tanh W · Vin −i 



Here W ∈ R^{h'×h} is the weight matrix and b ∈ R^{h'×1} is the bias vector. Similarly, we can get the attention value ω_i^2 for node i in view V_in^2. We then normalize the attention values ω_i^1, ω_i^2 with the softmax function to get the final weight:

β_i^1 = softmax(ω_i^1) = exp(ω_i^1) / (exp(ω_i^1) + exp(ω_i^2)).    (11)



A larger β_i^1 implies that the corresponding view is more important. Similarly, β_i^2 = softmax(ω_i^2). For all the n nodes, we have the learned weights β^1 = [β_i^1], β^2 = [β_i^2] ∈ R^{n×1}, and denote β_in^1 = diag(β^1), β_in^2 = diag(β^2). Then we combine these two views to obtain the final view V*:

V* = β_in^1 · V_in^1 + β_in^2 · V_in^2.    (12)
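The per-node attention fusion of Eqs. (9)–(12) can be sketched as follows, again with random placeholder parameters; h stands in for the attention hidden size, and the per-node weighting is equivalent to multiplying by diag(β):

    import numpy as np

    def attention_fusion(V1_in, V2_in, W, b, q):
        # W: (h, n), b: (h, 1), q: (h, 1); one attention score per node and view.
        w1 = (q.T @ np.tanh(W @ V1_in.T + b)).ravel()   # Eq. (10) for view 1, shape (n,)
        w2 = (q.T @ np.tanh(W @ V2_in.T + b)).ravel()   # same for view 2
        beta1 = np.exp(w1) / (np.exp(w1) + np.exp(w2))  # Eq. (11)
        beta2 = 1.0 - beta1
        return beta1[:, None] * V1_in + beta2[:, None] * V2_in   # Eq. (12)

    n, h = 6, 4
    V1_in, V2_in = np.random.rand(n, n), np.random.rand(n, n)
    W, b, q = np.random.randn(h, n), np.random.randn(h, 1), np.random.randn(h, 1)
    V_star = attention_fusion(V1_in, V2_in, W, b, q)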

3.5 Optimization Objective

V* is a fusion of V_in^1 and V_in^2, indicating that better V_in^1 and V_in^2 can result in a better V*. Therefore, we optimize the parameters Θ of the classifiers for each view to improve the accuracy on the given labels Y_L. Specifically, we first utilize two-layer GCNs to obtain the predictions of V_in^1 and V_in^2:

O^1 = softmax(GCN(V_in^1, σ(GCN(V_in^1, X)))),
O^2 = softmax(GCN(V_in^2, σ(GCN(V_in^2, X)))),    (13)

where σ is the activation function. Similarly, we can also get the predictions of V*:

O* = softmax(GCN(V*, σ(GCN(V*, X)))).    (14)

The parameters of the GCNs involved in Eq. (13) and Eq. (14) are together regarded as the parameters Θ of the classifiers. Θ can be optimized by evaluating the cross-entropy error over Y_L:

min_Θ L_cls = − Σ_{O∈{O^1, O^2, O^*}} Σ_{v_i∈Y_L} y_i ln o_i,    (15)

where yi is the label of node vi , and oi is its prediction. With the guide of labeled data, we can optimize the proposed GBV-GSL via back propagation and learn the embedding of nodes for classification.
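Putting the pieces together, the joint objective in Eq. (15) simply sums the node-classification cross-entropy over the three predictions, as in this hedged sketch (mean rather than sum over labeled nodes is our own simplification):

    import numpy as np

    def cross_entropy(probs, labels, labeled_idx):
        # probs: (n, c) softmax outputs; labels: (n,) integer classes.
        eps = 1e-12
        return -np.mean(np.log(probs[labeled_idx, labels[labeled_idx]] + eps))

    def joint_loss(O1, O2, O_star, labels, labeled_idx):
        # Eq. (15): supervise the two interactive views and the final view together.
        return sum(cross_entropy(O, labels, labeled_idx) for O in (O1, O2, O_star))

    n, c = 6, 3
    probs = np.full((n, c), 1.0 / c)
    labels = np.array([0, 1, 2, 0, 1, 2])
    print(joint_loss(probs, probs, probs, labels, np.array([0, 1, 2])))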

4 Experiments

4.1 Experimental Setup

Datasets. We employ seven open datasets, including three non-graph datasets (i.e., Wine, Breast Cancer (Cancer) and Digits) available in scikit-learn [15], a blog graph Polblogs [10] and three heterophily graph datasets (i.e. Texas, Cornell, Wisconsin). Notice that for non-graph datasets, we construct a KNN graph as an initial adjacency matrix as in [1]. Baselines. We compare the proposed GBV-GSL with two categories of baselines: classical GNN models GCN [12], GAT [18], GraphSAGE [7] and four graph structure learning based methods Pro-GNN [10], IDGL [1], GEN [19], CoGSL [14].



Implementation Details. For the three classical GNN models (i.e., GCN, GAT, GraphSAGE), we adopt the implementations from the PyTorch Geometric library [3]. For Pro-GNN, IDGL, GEN and CoGSL, we use the source code provided by the authors, and follow the settings in their original papers with careful tuning. For the proposed GBV-GSL, we use Glorot initialization [6] and the Adam [11] optimizer. We carefully select two basic views for each dataset as the two inputs. We tune the learning rate for the Adam optimizer over {0.1, 0.01, 0.001}. For the combination coefficient μ, we test values in {0.1, 0.5, 1.0}. Finally, we carefully select the total number of iterations T from {100, 150, 200}, and tune the training epochs for each module over {1, 5, 10}. For fair comparisons, we set the hidden dimension to 16, run each method 10 times with random seeds, and report the average results. For Pro-GNN, IDGL, GEN, CoGSL and our GBV-GSL, we uniformly choose a two-layer GCN as the backbone to evaluate the learned structure.

4.2 Node Classification

In this section, we evaluate the proposed GBV-GSL on semi-supervised node classification. For the different datasets, we follow the original splits into training, validation and test sets. To evaluate our model more comprehensively, we use two common evaluation metrics, F1-macro and F1-micro. The results are reported in Table 1, where we run each method 10 times and report the average results. As can be seen, the proposed GBV-GSL generally outperforms all the other baselines on all datasets, which demonstrates that GBV-GSL can boost node classification in an effective way. The large performance superiority of GBV-GSL over the backbone GCN implies that the view interaction and classifiers of our model are collaboratively optimized and mutually reinforcing. In comparison with other GSL frameworks, our performance improvement demonstrates that the proposed framework is effective and that the learned structure carries more useful information, which provides better solutions.

4.3 Defense Performance

Here, we aim to evaluate the robustness of the various methods. Cancer, Polblogs and Texas are adopted. We focus on comparing GSL models, because these models can adjust the original structure, which makes them more robust than other GNNs. Specifically, we choose Pro-GNN from the single-view based methods. For the multi-view based methods, IDGL and CoGSL are both selected. To attack edges, we adopt random edge deletions or additions following [1,4]. Specifically, for edge deletion, we randomly remove 5%, 10%, and 15% of the original edges, which retains the connectivity of the attacked graph. For edge addition, we randomly inject fake edges into the graph at small percentages of the number of original edges, i.e., 25%, 50%, and 75%. Since our GBV-GSL needs two inputs while the other methods need only one, for a fair comparison we deploy attacks on each of the two inputs separately and on both of them together with the same percentages. We choose poisoning attacks [20], where we first generate the attacked graphs and then use them to train the models.


Table 1. Quantitative results (% ± σ) on node classification. (bold: best)

Datasets   Metric    GCN         GAT         GraphSAGE   LDS         Pro-GNN     IDGL        GEN         CoGSL       GBV-GSL
Wine       F1-macro  94.1 ± 0.6  93.6 ± 0.4  96.3 ± 0.8  93.4 ± 1.0  97.3 ± 0.3  96.3 ± 1.1  96.4 ± 1.0  97.5 ± 0.6  97.9 ± 0.3
           F1-micro  93.9 ± 0.6  93.7 ± 0.3  96.2 ± 0.8  93.4 ± 0.9  97.2 ± 0.3  96.2 ± 1.1  96.3 ± 1.0  97.4 ± 0.7  97.8 ± 0.3
Cancer     F1-macro  93.0 ± 0.6  92.2 ± 0.2  92.0 ± 0.5  83.1 ± 1.5  93.3 ± 0.5  93.1 ± 0.9  94.1 ± 0.8  93.5 ± 1.2  94.5 ± 0.3
           F1-micro  93.3 ± 0.5  92.9 ± 0.1  92.5 ± 0.5  84.8 ± 0.8  93.8 ± 0.5  93.6 ± 0.9  94.3 ± 1.0  94.0 ± 1.0  94.9 ± 0.3
Digits     F1-macro  89.0 ± 1.3  89.9 ± 0.2  87.5 ± 0.2  79.7 ± 1.0  89.7 ± 0.3  92.5 ± 0.5  91.3 ± 1.3  92.5 ± 1.3  92.7 ± 1.0
           F1-micro  89.1 ± 1.3  90.0 ± 0.2  87.7 ± 0.2  80.2 ± 0.9  89.8 ± 0.3  92.6 ± 0.5  91.4 ± 1.2  92.5 ± 1.2  92.7 ± 1.0
Polblogs   F1-macro  95.1 ± 0.4  94.1 ± 0.1  93.3 ± 2.5  94.9 ± 0.3  94.6 ± 0.6  94.6 ± 0.7  95.2 ± 0.6  95.5 ± 0.2  95.9 ± 0.2
           F1-micro  95.1 ± 0.4  94.1 ± 0.1  93.4 ± 2.5  94.9 ± 0.3  94.6 ± 0.6  94.6 ± 0.7  95.2 ± 0.6  95.5 ± 0.2  95.9 ± 0.2
Texas      F1-macro  42.1 ± 2.6  32.2 ± 6.8  76.8 ± 9.3  33.9 ± 9.1  35.5 ± 7.2  51.0 ± 4.8  51.6 ± 7.2  70.0 ± 4.8  78.5 ± 4.8
           F1-micro  61.1 ± 1.3  59.2 ± 4.6  84.9 ± 5.4  59.7 ± 7.0  60.8 ± 6.1  64.9 ± 3.0  73.4 ± 6.7  80.8 ± 2.6  86.0 ± 2.1
Cornell    F1-macro  52.9 ± 1.0  31.8 ± 7.5  58.9 ± 5.0  36.3 ± 9.3  32.1 ± 9.8  49.6 ± 4.3  36.3 ± 9.1  61.4 ± 7.9  75.6 ± 4.5
           F1-micro  66.8 ± 1.2  58.1 ± 2.8  73.2 ± 3.9  63.8 ± 7.8  63.0 ± 7.9  64.9 ± 2.2  65.6 ± 6.7  76.5 ± 2.1  84.9 ± 1.9
Wisconsin  F1-macro  44.2 ± 1.7  33.3 ± 3.1  43.3 ± 5.2  31.2 ± 9.8  34.6 ± 8.6  38.1 ± 3.2  31.3 ± 5.1  55.1 ± 7.2  77.1 ± 7.3
           F1-micro  65.1 ± 0.8  58.4 ± 2.7  77.8 ± 3.8  53.1 ± 6.7  55.7 ± 7.0  58.8 ± 3.9  55.1 ± 8.1  83.3 ± 3.4  89.6 ± 2.3

Fig. 2. Results of different models under random edge deletion.

Fig. 3. Results of different models under random edge addition.

All the experiments are conducted 10 times and we report the average accuracy. The results are plotted in Figs. 2 and 3. The curves "GBV-GSL_v1", "GBV-GSL_v2" and "GBV-GSL_all" denote the results when the first input, the second input, or both inputs of GBV-GSL are attacked, respectively. From the figures, GBV-GSL consistently outperforms all other baselines by a margin under different perturbation rates in all three cases. We also find that as the perturbation rate increases, the margin becomes larger, which indicates that our model is more effective under stronger attacks. Besides, "GBV-GSL_all" also performs competitively: although both of its inputs are attacked, "GBV-GSL_all" still outperforms the other baselines.



Fig. 4. Results with or without view interaction module.

Fig. 5. Results on different fusions.

4.4 Ablation Studies

Analysis of View Interaction. With the two estimated views, we apply a view interaction technique based on a selection gating mechanism to optimize them again. To evaluate the effectiveness of view interaction, we compare the evaluation results of the final view trained with and without view interaction on Digits, Polblogs and Wisconsin in Fig. 4. We can see a consistent and significant performance drop for GBV-GSL on all datasets when the view interaction component is turned off (i.e., when the two estimated views are fused directly). The reason is that, without our view interaction, the two estimated views only contain the topological information of their respective basic structural views, which may be insufficient or noisy. View interaction allows the estimated views to retain a portion of their own structure while introducing their mixed structure, which removes noise and alleviates the problem of insufficient structure.

Analysis of View Attention Fusion. We use an attention fusion mechanism, which assigns weights to the two interactive views based on their corresponding importance for each node, as in Eqs. (9)–(12) in Sect. 3.4. To verify the validity of this component, we design two more baselines. One simply averages the two interactive views as the final view. The other uses the adaptive fusion mechanism [14] to fuse them. We test on Digits, Texas and Wisconsin and show the results in Fig. 5, where "Attention" refers to the attention fusion we introduce. We can see that the attention fusion we designed behaves best among the three approaches, which fully proves its effectiveness.


5 Conclusion

In order to solve the problems of neglecting the complementarity of different views and of unrestricted information irrelevant to downstream tasks in existing multi-view GSL methods, this paper proposes a novel GSL framework, GBV-GSL, which introduces a selection gating mechanism into GSL. We designed a view interaction module that incorporates mixed information, controlled by a shared gating signal, into the two estimated views, so as to remove redundant structural information and supplement insufficient topology while retaining their focused knowledge. Then, the final view is generated by adaptive attention fusion. Extensive experiments under clean and attacked conditions verify the effectiveness and robustness of GBV-GSL.

References 1. Chen, Y., Wu, L., Zaki, M.: Iterative deep graph learning for graph neural networks: better and robust node embeddings. Adv. Neural Inf. Process. Syst. 33, 19314– 19326 (2020) 2. Chen, Y., Wu, L., Zaki, M.J.: Reinforcement learning based graph-to-sequence model for natural question generation. arXiv preprint arXiv:1908.04942 (2019) 3. Fey, M., Lenssen, J.E.: Fast graph representation learning with PyTorch geometric. arXiv preprint arXiv:1903.02428 (2019) 4. Franceschi, L., Niepert, M., Pontil, M., He, X.: Learning discrete structures for graph neural networks. In: International Conference on Machine Learning, pp. 1972–1982. PMLR (2019) 5. Gilmer, J., Schoenholz, S.S., Riley, P.F., Vinyals, O., Dahl, G.E.: Neural message passing for quantum chemistry. In: International Conference on Machine Learning, pp. 1263–1272. PMLR (2017) 6. Glorot, X., Bengio, Y.: Understanding the difficulty of training deep feedforward neural networks. In: Proceedings of the Thirteenth International Conference on Artificial Intelligence and Statistics, pp. 249–256. JMLR Workshop and Conference Proceedings (2010) 7. Hamilton, W., Ying, Z., Leskovec, J.: Inductive representation learning on large graphs. In: Advances in Neural Information Processing Systems, vol. 30 (2017) 8. Hassani, K., Khasahmadi, A.H.: Contrastive multi-view representation learning on graphs. In: International Conference on Machine Learning, pp. 4116–4126. PMLR (2020) 9. Jiang, B., Zhang, Z., Lin, D., Tang, J., Luo, B.: Semi-supervised learning with graph learning-convolutional networks. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 11313–11320 (2019) 10. Jin, W., Ma, Y., Liu, X., Tang, X., Wang, S., Tang, J.: Graph structure learning for robust graph neural networks. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp. 66–74 (2020) 11. Kingma, D.P., Ba, J.: Adam: a method for stochastic optimization. arXiv preprint arXiv:1412.6980 (2014) 12. Kipf, T.N., Welling, M.: Semi-supervised classification with graph convolutional networks. arXiv preprint arXiv:1609.02907 (2016)



13. Linmei, H., Yang, T., Shi, C., Ji, H., Li, X.: Heterogeneous graph attention networks for semi-supervised short text classification. In: Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP), pp. 4821–4830 (2019) 14. Liu, N., Wang, X., Wu, L., Chen, Y., Guo, X., Shi, C.: Compact graph structure learning via mutual information compression. In: Proceedings of the ACM Web Conference 2022, pp. 1601–1610 (2022) 15. Pedregosa, F., et al.: Scikit-learn: machine learning in Python. J. Mach. Learn. Res. 12, 2825–2830 (2011) 16. Pei, H., Wei, B., Chang, K.C.C., Lei, Y., Yang, B.: Geom-GCN: geometric graph convolutional networks. arXiv preprint arXiv:2002.05287 (2020) 17. Qi, X., Liao, R., Jia, J., Fidler, S., Urtasun, R.: 3D graph neural networks for RGBD semantic segmentation. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 5199–5208 (2017) 18. Velickovic, P., Cucurull, G., Casanova, A., Romero, A., Lio, P., Bengio, Y., et al.: Graph attention networks. Stat 1050(20), 10–48550 (2017) 19. Wang, R., et al.: Graph structure estimation neural networks. In: Proceedings of the Web Conference 2021, pp. 342–353 (2021) 20. Wu, T., Ren, H., Li, P., Leskovec, J.: Graph information bottleneck. Adv. Neural Inf. Process. Syst. 33, 20437–20448 (2020) 21. Xu, K., Hu, W., Leskovec, J., Jegelka, S.: How powerful are graph neural networks? arXiv preprint arXiv:1810.00826 (2018) 22. Xu, K., Li, C., Tian, Y., Sonobe, T., Kawarabayashi, K.I., Jegelka, S.: Representation learning on graphs with jumping knowledge networks. In: International Conference on Machine Learning, pp. 5453–5462. PMLR (2018) 23. You, J., Ying, R., Leskovec, J.: Position-aware graph neural networks. In: International Conference on Machine Learning, pp. 7134–7143. PMLR (2019) 24. Zheng, C., et al.: Robust graph representation learning via neural sparsification. In: International Conference on Machine Learning, pp. 11458–11468. PMLR (2020) 25. Zhu, Y., Xu, W., Zhang, J., Liu, Q., Wu, S., Wang, L.: Deep graph structure learning for robust representations: a survey. arXiv preprint arXiv:2103.03036, vol. 14 (2021)

How Legal Knowledge Graph Can Help Predict Charges for Legal Text Shang Gao1,2 , Rina Sa3 , Yanling Li1,2(B) , Fengpei Ge4 , Haiqing Yu1 , Sukun Wang1 , and Zhongyi Miao1 1 College of Computer Science and Technology, Inner Mongolia Normal University,

Hohhot 011517, People’s Republic of China [email protected], {cieclyl,ciecyhq,ciecwsk, ciecmzy}@imnu.edu.cn 2 Key Laboratory of Infinite-dimensional Hamiltonian System and Its Algorithm Application, Ministry of Education, Hohhot 010000, People’s Republic of China 3 Department of Mathematics and Computer Engineering, Ordos Institute of Technology, Ordos 017000, People’s Republic of China [email protected] 4 Library, Beijing University of Posts and Telecommunications, Beijing 100876, People’s Republic of China [email protected] Abstract. The existing methods for predicting Easily Confused Charges (ECC) primarily rely on factual descriptions from legal cases. However, these approaches overlook some key information hidden in these descriptions, resulting in an inability to accurately differentiate between ECC. Legal domain knowledge graphs can showcase personal information and criminal processes in cases, but they primarily focus on entities in cases of insolation while ignoring the logical relationships between these entities. Different relationships often lead to distinct charges. To address these problems, this paper proposes a charge prediction model that integrates a Criminal Behavior Knowledge Graph (CBKG), called Charge Prediction Knowledge Graph (CP-KG). Firstly, we defined a diverse range of legal entities and relationships based on the characteristics of ECC. We conducted fine-grained annotation on key elements and logical relationships in the factual descriptions. Subsequently, we matched the descriptions with the CBKG to extract the key elements, which were then encoded by Text Convolutional Neural Network (TextCNN). Additionally, we extracted case subgraphs containing sequential behaviors from the CBKG based on the factual descriptions and encoded them using a Graph Attention Network (GAT). Finally, we concatenated these representations of key elements, case subgraphs, and factual descriptions, collectively used for predicting the charges of the defendant. To evaluate the CP-KG, we conducted experiments on two charge prediction datasets consisting of real legal cases. The experimental results demonstrate that the CP-KG achieves scores of 99.10% and 90.23% in the Macro-F1 respectively. Compared to the baseline methods, the CP-KG shows significant improvements with 25.79% and 13.82% respectively. Keywords: Charge prediction · Easily confused charges · Knowledge graph · Graph attention network S. Gao and R. Sa—Equal Contribution. © The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2024 B. Luo et al. (Eds.): ICONIP 2023, LNCS 14452, pp. 408–420, 2024. https://doi.org/10.1007/978-981-99-8076-5_30



1 Introduction With the application of information technologies such as Artificial Intelligence (AI) and big data in various scenarios, there is a growing demand for judicial services. To meet the evolving needs of legal judgment by the general public and keep up with the development of the times, researchers have attempted to integrate AI into the field of legal to achieve fairness, intelligence, and efficiency in legal judgments. In China, to promote AI applications such as intelligent information retrieval and Natural Language Processing (NLP) in the legal domain, the Supreme People’s Court and the Chinese Information Processing Society of China jointly organized the Challenge of AI in Law (CAIL) competition. This competition has been held continuously for five years since 2018 and has become an important platform for academic exchanges in the field of legal AI. In CAIL 2018, three tasks were set: legal article recommendation, charge prediction, and sentence prediction. This study focuses on charge prediction within the CAIL competition. Charge prediction is a crucial task in the intelligent judiciary, which involves analyzing factual descriptions in legal cases to predict the charges for defendants. The predicted results can serve as references for judicial personnel, helping to correct the subjective biases of judges and reduce negative impacts caused by intuition and other subjective factors. Predicting Easily Confused Charges (ECC) has always been a research hotspot and challenge in charge prediction. ECC often share similar criminal processes, but they differ in individual criminal behaviors and outcomes. Different charges often lead to different legal judgments and punishments. The examples of ECC are illustrated in Fig. 1. Among them, blue represents the same elements in the two legal cases, and red represents different elements. Although the behavior information and sequence of behaviors of the two defendants were the same, the behavior on the right also resulted in death. The charges and punishments in the two legal cases are completely different. Therefore, differentiating these key elements and the order of elements is crucial for predicting ECC accurately. Existing methods mostly apply mature AI to predict ECC, which often fails to meet many specialized needs of legal professionals. For example, when traditional text classification models like Long Short-Term Memory (LSTM) [1] are applied to charge prediction, they merely encode and process factual descriptions of legal cases, lacking the fine-grained recognition and understanding of tools and behavioral elements within these descriptions. They do not fully utilize the information presented in legal cases.



Fig. 1. Easily confused charges comparison


Knowledge Graphs (KGs) are graph-structured data that describe entities and their relationships. Owing to their intuitive and rich knowledge representation, KGs have been widely applied in NLP tasks. KGs can generally be categorized into two types: general-purpose KGs and domain-specific KGs. The former contain extensive knowledge, mainly common-sense and coarse-grained knowledge. The latter focus on a specific domain and emphasize deeper, more specialized knowledge, incorporating expert experience and industry-specific information at a fine granularity. Compared to plain text, KGs provide more explicit and logical representations, which supports and enables the intelligent judiciary. In the legal domain, general-purpose KGs provide only limited knowledge, so researchers have begun to construct domain-specific KGs for legal tasks. However, existing legal KGs often focus on the entities within legal cases and overlook the logical relationships between them, even though different behavioral relationships between entities can lead to different charges and punishments. For the task of predicting ECC, it is therefore necessary to construct a novel domain-specific KG that highlights the key elements and sequential information of legal cases.

To address these challenges, this paper proposes a charge prediction method called CP-KG, which integrates a Criminal Behavior Knowledge Graph (CBKG). The contributions of this work are summarized as follows:

(1) We construct a novel domain-specific knowledge graph for the legal field, the CBKG. It encompasses a wealth of legal entities and relationships and clearly depicts the criminal processes associated with ECC.
(2) We extract Key Elements (KE) and Case Subgraphs (CS) from legal cases, which supplies the model with fine-grained legal knowledge and enhances its ability to comprehend ECC.
(3) Experimental results on two charge prediction datasets consisting of real legal cases demonstrate that CP-KG achieves Macro-F1 scores of 99.10% and 90.23%, respectively, significantly outperforming the baseline methods by approximately 25.79% and 13.82% in Macro-F1.

2 Related Work

2.1 Charge Prediction

With the development of deep learning and its success in NLP in recent years, researchers have attempted to apply deep neural networks to charge prediction, aiming to improve accuracy by extracting deep semantic information from legal cases. Jiang et al. [2] treated charge labels as supervision and used deep reinforcement learning to extract key factual fragments from case facts, enhancing case representations and improving model performance. Zhong et al. [3] proposed a topological multi-task learning framework that models the explicit dependencies among three sub-tasks: legal article recommendation, charge prediction, and sentence prediction. Yang et al. [4] introduced the Multi-Perspective Bi-directional Feedback Network (MPBFN) with a word collocation attention mechanism to fully exploit the topological dependencies among multiple task results, effectively improving judgment prediction. Zhao et al. [5] employed a reinforcement learning method to extract sentences containing criminal elements from cases, simulating the judgment process in real-world scenarios.

2.2 Legal Domain Knowledge Graph

Constructing legal domain KGs from legal cases benefits various important legal tasks. Chen et al. [6] treated criminal charges and keywords from charge description articles as entity nodes, defined four types of relationships, and constructed a charge knowledge graph by using a relationship classifier to determine the relationships between entities. Chen et al. [7] addressed the problem of dispersed knowledge and inconvenient queries in the judicial domain by constructing a knowledge graph from legal cases: they preprocessed the cases with the Language Technology Platform (LTP) and organized the results in the Neo4j graph database to build a case knowledge graph. Chen [8] proposed a knowledge graph construction method focused on criminal behavior, using the LTP and a pre-built legal dictionary to extract elements from legal cases and then extracting criminal-behavior entity-relationship triples with the help of Chinese grammar rules. Guo [9] introduced a causation graph based on legal cases, using dependency syntactic analysis and regular expression matching to obtain relationships between events. Chen et al. [10] developed an information extraction model for criminal cases to construct a graph of drug-related cases. Hong et al. [11] focused on judicial charges of "motor vehicle traffic accident liability disputes", defined 20 entity types and 9 relationship types, and used deep learning to extract entities and relationships and construct a knowledge graph for traffic cases. To address the interpretability issue in sentencing prediction, Wang et al. [12] took drug trafficking cases as their research subject; under the guidance of domain experts, they designed concepts and relationships for the case factual descriptions and extracted triples from the case facts based on the knowledge graph ontology.

Although these methods have made progress on charge prediction, challenges remain. Firstly, deep learning alone cannot accurately identify behaviors within factual descriptions at a fine-grained level, which limits the model's understanding of ECC. Secondly, existing legal domain KGs often focus on the entities within factual descriptions while neglecting the behavioral relationships between them. This hinders accurate identification of the criminal processes associated with ECC and leads to lower prediction performance.

3 Methods

In this section, we detail the construction of the CBKG and the components of CP-KG. Section 3.1 gives an overview of the model. Section 3.2 introduces the construction of the CBKG. Section 3.3 describes how the KE and CS are extracted, Sect. 3.4 covers the encoding of the Fact Descriptions (FD) of legal cases and of the KE, and Sect. 3.5 covers the encoding of the CS. Finally, Sect. 3.6 describes feature fusion and charge prediction.


3.1 Overview

The CP-KG model extracts features from the input legal case, fuses them, and feeds the result into a fully connected layer to predict the defendant's charges. The overall process is illustrated in Fig. 2. First, the FD of the legal case and the CBKG are passed to the Extract Module, which obtains the KE from the FD and the CS from the CBKG. Next, TextCNN and GAT are employed to encode the KE and the CS, respectively. Finally, the representations of the FD, KE, and CS are fused to predict the charges. The following sections describe these modules in detail.
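To make the data flow concrete, the following is a minimal PyTorch-style sketch of the forward pass described above. It is illustrative only: the module names `CPKG`, `fd_encoder`, `ke_encoder`, and `cs_encoder` are hypothetical stand-ins for the components in Fig. 2, not the authors' released code.

```python
import torch
import torch.nn as nn

class CPKG(nn.Module):
    """Hypothetical skeleton of the CP-KG pipeline: encode FD, KE, CS, fuse, classify."""

    def __init__(self, fd_encoder, ke_encoder, cs_encoder, feat_dim, num_charges):
        super().__init__()
        self.fd_encoder = fd_encoder   # TextCNN over fact descriptions
        self.ke_encoder = ke_encoder   # TextCNN over key elements
        self.cs_encoder = cs_encoder   # GAT over the case subgraph
        self.classifier = nn.Linear(3 * feat_dim, num_charges)

    def forward(self, fd_tokens, ke_tokens, cs_graph):
        c_dk = self.fd_encoder(fd_tokens)          # FD representation
        l_ek = self.ke_encoder(ke_tokens)          # KE representation
        g_dk = self.cs_encoder(cs_graph)           # CS representation
        p = torch.cat([l_ek, c_dk, g_dk], dim=-1)  # feature fusion by concatenation
        return torch.softmax(self.classifier(p), dim=-1)  # charge probabilities
```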


Fig. 2. The structure of the CP-KG model

3.2 Criminal Behavior Knowledge Graph (CBKG)

Collect Legal Cases. The data used to construct the CBKG come from criminal legal cases available on China Judgments Online1. We selected eight categories of ECC: intentional injury, intentional homicide, dangerous driving, traffic accident, theft, fraud, robbery, and snatching. For each category we collected 600 legal cases, for a total of 4,800. (Note: the cases for each charge come from courts in the same region and include only criminal first-instance cases.)

Define Legal Entities and Relationships. We defined 19 categories of legal entities and 11 categories of legal relationships tailored to the characteristics of ECC cases. This diverse entity and relationship information allows us to reconstruct the criminal process of a legal case accurately and to distinguish the subtle differences between ECC. The legal entities and relationships are listed in Table 1. Following these definitions, and with annotation and verification by legal experts, we constructed a high-quality dataset for Judicial Long Text triple extraction (JLT). Detailed statistics of the dataset are presented in Sect. 4.

1 https://wenshu.court.gov.cn/.


Table 1. Definition of legal entities and relationships

Entity (19): Case, Defendant, Victim, Cause, Behavior, Tool, Injury, Primary Culprit, Accessory, Recidivism, Surrender, Forgiveness, Amount, Total Amount, Law, Charge, Prison Term, Penalty, Truthful Confession

Relationship (11): Include, Involve, Crime Cause, Behavior Description, Use, Total, Injury Classification, Crime Type, Sentencing, Violate, Judgment Information

Defining the Criminal Behavior Knowledge Graph. For the task of predicting ECC, the sequential information of the criminal behaviors within a criminal event is of paramount importance. Consequently, following the defined legal entities and relationships, we extracted the defendant's most relevant criminal behaviors and their order: the criminal behaviors constitute the KE, while the sequence information constitutes the CS. The relevant entities are Crime Cause, Behavior, Tool, Injury Classification, and Charge; the relevant relationships are Use, Cause, Favour, Subsequence, Lead to, and Constitute. From the JLT, we selected the entities and relationships related to criminal behaviors and constructed the CBKG using NetworkX2. At this stage, the CBKG is a large graph containing information about criminal behaviors; specific CS are then extracted from it according to the behaviors appearing in each legal case. The CBKG is illustrated in Fig. 3.

2 https://github.com/networkx/networkx.
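As an illustration of this construction step, the following is a minimal sketch of how behavior triples could be loaded into a NetworkX directed graph and how a case subgraph might be pulled out by matching node strings against a fact description. The triple list, node names, and helper function are hypothetical examples, not the actual JLT data or the authors' code.

```python
import networkx as nx

# Hypothetical criminal-behavior triples (head entity, relation, tail entity).
triples = [
    ("quarrel", "Crime Cause", "stab"),
    ("stab", "Use", "knife"),
    ("stab", "Lead to", "serious injury"),
    ("serious injury", "Constitute", "intentional injury"),
]

# Build the CBKG as a directed graph whose edges carry the relation type.
cbkg = nx.DiGraph()
for head, relation, tail in triples:
    cbkg.add_edge(head, tail, relation=relation)

def extract_case_subgraph(cbkg: nx.DiGraph, fact_description: str) -> nx.DiGraph:
    """Keep only the CBKG nodes whose surface form appears in the fact description."""
    matched = [node for node in cbkg.nodes if node in fact_description]
    return cbkg.subgraph(matched).copy()

fd = "After a quarrel, the defendant used a knife to stab the victim, causing serious injury."
case_subgraph = extract_case_subgraph(cbkg, fd)
print(case_subgraph.edges(data=True))
```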

Fig. 3. An example of CBKG

3.3 Extract Module

In Chinese criminal cases, a defendant's criminal behaviors typically form a sequence from cause to outcome. This sequential information reflects the order in which the defendant acted and offers legal professionals an additional perspective for judgment. From a single case, multiple behaviors of the defendant can be extracted, and the paths related to these behaviors can then be retrieved from the CBKG. Combining the retrieved paths yields a CS that reflects the differences in criminal behaviors between charges and thus provides distinguishing features of the criminal process.

The Extract Module takes the FD and the CBKG as input and is responsible for extracting the KE and the CS. First, all node strings of the CBKG are retrieved and matched against the FD; successfully matched nodes are taken as the KE of the FD. Then, the behavior-related paths are extracted from the CBKG according to the order in which the nodes appear, and these paths are combined to construct the CS of the criminal behaviors.

3.4 Encoder Module

Both the FD and the KE are textual. To obtain rich features, we encode them with a TextCNN; in particular, the KE representation provides the model with additional fine-grained semantic information. A legal case is denoted $d_k = [w_1, w_2, \ldots, w_t, \ldots, w_T]$, where $T$ is the number of words in the $k$-th case and $w_t$ is the $t$-th word of the case facts $d_k$. The KE extracted from the FD are denoted $E_k = [e_1, e_2, \ldots, e_i, \ldots, e_I]$, where $I$ is the number of KE in the $k$-th case and $e_i$ is the $i$-th element of the sequence $E_k$. GloVe [13] embeddings are used to obtain representations $w_t \in \mathbb{R}^m$ and $e_i \in \mathbb{R}^m$ for each word in $d_k$ and each element in $E_k$, where $m$ is the embedding dimension. Concatenating the word representations in textual order gives the case-fact representation $X_{d_k} = w_{1:T} = [w_1, w_2, \ldots, w_T]$, and concatenating the element representations in sequence order gives the KE representation $X_{E_k} = e_{1:I} = [e_1, e_2, \ldots, e_I]$. These representations are fed into the convolutional layer of the TextCNN, where convolutional kernels of different sizes extract text features, as in Eqs. (1) and (2):

$c = f(W_1 * X_{d_k} + b_1)$  (1)

$l = f(W_2 * X_{E_k} + b_2)$  (2)

where $W_1, W_2 \in \mathbb{R}^{h \times m}$ are the convolutional kernels, $b_1, b_2$ are biases, and $f(\cdot)$ is a non-linear function. Using kernels of different sizes, feature sets $C = [c_1, c_2, \ldots, c_{T-h+1}]$ and $L = [l_1, l_2, \ldots, l_{I-h+1}]$ are obtained for the case facts $d_k$ and the KE sequence $E_k$, where $T$ and $I$ are the numbers of words in the FD and KE, respectively. Finally, max pooling yields the text representation $C_{d_k}$ of the case facts $d_k$ and the representation $L_{E_k}$ of the KE $E_k$, capturing the most salient features, as in Eqs. (3) and (4):

$C_{d_k} = [\max(C)]$  (3)

$L_{E_k} = [\max(L)]$  (4)
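The following is a small PyTorch sketch of the multi-kernel TextCNN encoder described by Eqs. (1)–(4): parallel 1-D convolutions with different kernel sizes followed by max pooling over time. The embedding dimension and kernel sizes follow the values reported later in Sect. 4.2 (GloVe dimension 200, kernel sizes 2–5), but the class itself is an illustrative reimplementation, not the authors' code, and it shares one set of weights for FD and KE purely for brevity.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class TextCNNEncoder(nn.Module):
    """Encode a padded token-id sequence into a single vector via multi-kernel CNN + max pooling."""

    def __init__(self, vocab_size, embed_dim=200, kernel_sizes=(2, 3, 4, 5), num_filters=100):
        super().__init__()
        self.embedding = nn.Embedding(vocab_size, embed_dim, padding_idx=0)
        self.convs = nn.ModuleList(
            nn.Conv1d(embed_dim, num_filters, kernel_size=h) for h in kernel_sizes
        )

    def forward(self, token_ids):            # token_ids: (batch, seq_len)
        x = self.embedding(token_ids)        # (batch, seq_len, embed_dim)
        x = x.transpose(1, 2)                # (batch, embed_dim, seq_len) for Conv1d
        pooled = []
        for conv in self.convs:
            c = F.relu(conv(x))                  # Eq. (1)/(2): convolution + non-linearity
            pooled.append(c.max(dim=2).values)   # Eq. (3)/(4): max pooling over time
        return torch.cat(pooled, dim=1)      # (batch, num_filters * len(kernel_sizes))

# Example: encode the fact description (length 400) and key elements (length 50).
encoder = TextCNNEncoder(vocab_size=50000)
fd_ids = torch.randint(1, 50000, (8, 400))
ke_ids = torch.randint(1, 50000, (8, 50))
c_dk, l_ek = encoder(fd_ids), encoder(ke_ids)
```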

3.5 Subgraph Encoder Module

Graph Convolutional Networks (GCNs) typically aggregate neighboring node information with equal or pre-defined edge weights. In contrast, a GAT assigns different weights to different nodes, allowing each neighbor to contribute with a different importance. Furthermore, GATs do not require the entire graph; they only rely on first-order neighborhood information, which addresses some limitations of GCNs. We therefore use a GAT to encode the extracted CS. Assuming the CS contains $N$ nodes, we denote the node vectors by $h = \{h_1, h_2, \ldots, h_N\}$, $h_i \in \mathbb{R}^F$, where $F$ is the input node feature dimension. The CS is fed into the GAT, which aggregates information between nodes with an attention mechanism. The attention coefficient $e_{ij}$ is computed as in Eq. (5):

$e_{ij} = a(W h_i, W h_j)$  (5)

where $W$ is a shared weight matrix and $a$ is the shared attention mechanism. $e_{ij}$ measures the relevance of node $j$ to node $i$, with $j \in \mathcal{N}_i$, where $\mathcal{N}_i$ is the set of first-order neighbors of node $i$. The attention coefficients are then normalized with the softmax function so that they can be compared across nodes, as in Eq. (6):

$\alpha_{ij} = \mathrm{softmax}(e_{ij}) = \dfrac{\exp\left(\mathrm{LeakyReLU}\left(a^{T}[W h_i \,\|\, W h_j]\right)\right)}{\sum_{k \in \mathcal{N}_i} \exp\left(\mathrm{LeakyReLU}\left(a^{T}[W h_i \,\|\, W h_k]\right)\right)}$  (6)

where the attention mechanism is a single-layer feed-forward network with parameters $a$, the activation function is LeakyReLU, $W$ is a trainable parameter matrix, and $\|$ denotes concatenation. To stabilize the node representations, we extend the attention to a multi-head mechanism with $M$ heads; the final node representation averages the outputs of the $M$ heads, as in Eq. (7):

$h_i' = \sigma\left(\dfrac{1}{M} \sum_{m=1}^{M} \sum_{j \in \mathcal{N}_i} \alpha_{ij}^{m} W^{m} h_j\right)$  (7)

where $\sigma(\cdot)$ is the activation function, $\alpha_{ij}^{m}$ is the attention value computed by the $m$-th head, and $W^{m}$ is the linear transformation of the input vectors. $h_i'$ is the feature obtained by the multi-head GAT, which aggregates the semantic information of neighboring nodes. Finally, max pooling is applied to all node features $h'$ of the subgraph of the $k$-th case, yielding a CS representation $G_{d_k}$ that captures the sequential information of the criminal behaviors, as in Eq. (8):

$G_{d_k} = \mathrm{MaxPooling}(h')$  (8)

3.6 Feature Fusion

The KE of a case and the corresponding CS representation provide additional information for representing the case facts. Therefore, the CS representation $G_{d_k}$, the KE representation $L_{E_k}$, and the FD representation $C_{d_k}$ are concatenated and fed into a fully connected layer to obtain the representation $p$, as in Eq. (9). Finally, $p$ is passed through a softmax layer to predict the charges, as in Eq. (10):

$p = \mathrm{concat}(L_{E_k}, C_{d_k}, G_{d_k})$  (9)

$z = \mathrm{softmax}(p)$  (10)
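As a concrete illustration of Eqs. (5)–(8), the following sketch implements a single graph attention layer with multi-head averaging and a max-pooling readout over the case subgraph. It is a simplified, dense-adjacency reimplementation written for clarity (no sparse ops, self-loops assumed in the adjacency), not the authors' implementation.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class GATSubgraphEncoder(nn.Module):
    """One multi-head GAT layer (Eqs. 5-7) followed by max pooling over nodes (Eq. 8)."""

    def __init__(self, in_dim, out_dim, num_heads=4):
        super().__init__()
        self.W = nn.ModuleList(nn.Linear(in_dim, out_dim, bias=False) for _ in range(num_heads))
        self.a = nn.ModuleList(nn.Linear(2 * out_dim, 1, bias=False) for _ in range(num_heads))

    def forward(self, h, adj):                 # h: (N, in_dim), adj: (N, N) with 1 for edges
        head_outputs = []
        for W, a in zip(self.W, self.a):
            wh = W(h)                          # (N, out_dim)
            n = wh.size(0)
            # Pairwise concatenation [Wh_i || Wh_j] for every (i, j) pair.
            pairs = torch.cat([wh.unsqueeze(1).expand(n, n, -1),
                               wh.unsqueeze(0).expand(n, n, -1)], dim=-1)
            e = F.leaky_relu(a(pairs).squeeze(-1))          # Eq. (5): attention coefficients
            e = e.masked_fill(adj == 0, float("-inf"))      # keep only first-order neighbors
            alpha = torch.softmax(e, dim=-1)                # Eq. (6): normalized attention
            head_outputs.append(alpha @ wh)                 # weighted neighbor aggregation
        h_prime = torch.relu(torch.stack(head_outputs).mean(dim=0))  # Eq. (7): average heads
        return h_prime.max(dim=0).values                    # Eq. (8): subgraph representation

# Example: a 5-node case subgraph with random features and a chain-shaped adjacency.
h = torch.randn(5, 16)
adj = torch.eye(5) + torch.diag(torch.ones(4), 1) + torch.diag(torch.ones(4), -1)
g_dk = GATSubgraphEncoder(in_dim=16, out_dim=32)(h, adj)    # shape (32,)
```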

4 Experimental Setup

4.1 Dataset

We conduct experiments on two charge prediction datasets built from real legal cases, JLT and CAIL-8. JLT covers 8 ECC and comprises 3,842 cases. CAIL-2018 is the official charge prediction dataset released by CAIL and contains 202 charges; CAIL-8 is the subset of CAIL-2018 consisting of legal cases with the same charges as JLT. (Note: extracting cases with the same charges as JLT allows us to fully validate the effectiveness of the CBKG. In the experiments, the CS are extracted from the CBKG, while the KE and FD come from the respective dataset.) We also analyzed the case lengths in detail; the statistics are presented in Table 2.

4.2 Implementation Details

The baselines and CP-KG were trained and tested on an NVIDIA Tesla V100. According to the analysis in Table 2, the average length of a case fact description is 497 when characters are used as the smallest semantic unit and 371 when words are used. Therefore, the fixed length of the input text sequence was set to 400 and the fixed length of the KE to 50. Word embeddings trained with GloVe, with dimension 200, were used. The convolutional kernel sizes were set to 2, 3, 4, and 5. Training ran for 20 epochs with the Adam optimizer, a dropout rate of 0.5, and a learning rate of 1e-3.
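For readers who want to reproduce the setup, the snippet below collects the stated hyperparameters into a runnable training scaffold. Only the hyperparameter values (sequence lengths 400/50, GloVe dimension 200, kernel sizes 2–5, 20 epochs, Adam, dropout 0.5, learning rate 1e-3) come from the paper; the loop structure and the assumed model interface (the hypothetical `CPKG` sketch shown earlier, which outputs softmax probabilities) are ours.

```python
import torch
import torch.nn as nn

# Hyperparameters reported in Sect. 4.2 (values from the paper; structure is illustrative).
config = {
    "max_fd_len": 400,        # fixed length of fact-description token sequences
    "max_ke_len": 50,         # fixed length of key-element sequences
    "embed_dim": 200,         # GloVe embedding dimension
    "kernel_sizes": (2, 3, 4, 5),
    "dropout": 0.5,
    "lr": 1e-3,
    "epochs": 20,
}

def train(model, loader, device):
    """Minimal training loop; `loader` is assumed to yield (fd_ids, ke_ids, cs_graph, labels)."""
    model.to(device)
    optimizer = torch.optim.Adam(model.parameters(), lr=config["lr"])
    criterion = nn.NLLLoss()  # the model ends in softmax, so train on log-probabilities
    for _ in range(config["epochs"]):
        for fd_ids, ke_ids, cs_graph, labels in loader:
            optimizer.zero_grad()
            # cs_graph tensors are assumed to be moved to `device` inside the loader/model.
            probs = model(fd_ids.to(device), ke_ids.to(device), cs_graph)
            loss = criterion(probs.clamp_min(1e-12).log(), labels.to(device))
            loss.backward()
            optimizer.step()
```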


Table 2. Dataset statistics information

                    JLT-Train  JLT-Test  JLT-Valid  CAIL-8-Train  CAIL-8-Test  CAIL-8-Valid
Total cases         2304       769       769        8243          5227         2441
Average words       391.08     398.76    396.08     281.00        263.48       269.99
Average characters  530.87     564.76    555.82     352.99        331.30       340.56

4.3 Metrics

The performance of CP-KG is evaluated with Accuracy (Acc), Macro-Precision (Mac-P), Macro-Recall (Mac-R), and Macro-F1 (Mac-F1).

4.4 Baseline Methods

To evaluate CP-KG, the following methods were chosen as baselines; all use the default settings from their original papers.

• TFIDF-SVM [14] is a baseline model provided in the CAIL2018 competition. It uses TF-IDF to extract text features and a Support Vector Machine (SVM) to classify the case facts.
• TextCNN [15] uses multiple convolutional layers with different kernel sizes followed by max pooling to encode case facts and predict charges.
• Bi-GRU employs a Bi-directional Gated Recurrent Unit to capture text features.
• TOPJUDGE [3] is a topological multi-task framework that captures the dependencies among three judgment prediction subtasks (charge prediction, law article recommendation, and sentence prediction) with a Directed Acyclic Graph (DAG) structure.
• NeurJudge [16] uses Bi-GRU to encode texts and builds charge graphs and legal provision graphs from charge definitions and legal provisions. It applies graph decomposition to distinguish confusing provisions and charge categories and combines the resulting label semantics with the case facts for prediction.
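The macro-averaged metrics above can be computed directly with scikit-learn; the snippet below shows one way to do it, with dummy prediction and label arrays standing in for real model outputs.

```python
from sklearn.metrics import accuracy_score, precision_recall_fscore_support

# Dummy gold labels and predictions over the 8 charge classes (illustrative only).
y_true = [0, 1, 2, 3, 4, 5, 6, 7, 0, 1]
y_pred = [0, 1, 2, 3, 4, 5, 6, 0, 0, 1]

acc = accuracy_score(y_true, y_pred)
mac_p, mac_r, mac_f1, _ = precision_recall_fscore_support(
    y_true, y_pred, average="macro", zero_division=0
)
print(f"Acc={acc:.4f}  Mac-P={mac_p:.4f}  Mac-R={mac_r:.4f}  Mac-F1={mac_f1:.4f}")
```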

5 Experimental Results

5.1 Analysis

From the experimental results in Table 3, the deep learning models generally outperform TFIDF-SVM. Because case facts vary in length and the FD contain a large amount of content, the Bi-GRU suffers from content forgetting and therefore performs poorly on charge prediction. TextCNN effectively captures local features of the text and thus performs better. TOPJUDGE is also CNN-based but additionally links the three judgment prediction subtasks, leading to superior performance. NeurJudge is based on Bi-GRU and also correlates the three subtasks to some degree; however, its graph decomposition operation is better suited to settings with many charge classes, so it struggles to isolate distinctive features here and predicts less accurately than the other models. The proposed CP-KG fuses CS, KE, and FD for prediction. The results show that CP-KG achieves state-of-the-art (SOTA) performance among the baselines, primarily because the KE provide fine-grained semantic information about the case, while the CS provide the defendant's criminal process and the relationships between elements, enabling a deeper understanding of the FD.

Table 3. Comparison of experimental results

Dataset              JLT                                    CAIL-8
Methods      Acc     Mac-P   Mac-R   Mac-F1   Acc     Mac-P   Mac-R   Mac-F1
TFIDF-SVM    0.7757  0.7722  0.7749  0.7331   0.8041  0.8191  0.8265  0.7641
TextCNN      0.9831  0.9813  0.9784  0.9796   0.9346  0.9044  0.8782  0.8773
Bi-GRU       0.8388  0.8145  0.8142  0.8090   0.9038  0.8611  0.8791  0.8569
TOPJUDGE     0.9863  0.9897  0.9784  0.9839   0.8719  0.8581  0.8608  0.8334
NeurJudge    0.9714  0.9697  0.9672  0.9683   0.9066  0.8644  0.8967  0.8699
CP-KG        0.9922  0.9917  0.9903  0.9910   0.9474  0.9165  0.8975  0.9023

5.2 Ablation Study

We conducted an extensive ablation study of CP-KG to validate the effectiveness of the KE and CS. In these experiments, the method names indicate the encoder and the encoded object. As Table 4 shows, combining the KE and CS with the FD improves prediction over using the FD, KE, or CS alone, with Mac-F1 increasing by about 2% on average. Furthermore, incorporating the CS yields higher performance than incorporating the KE, indicating that the sequential information of the elements in the CBKG provides additional semantics that help differentiate ECC. The ablation results on CAIL-8 show that the CBKG is applicable not only to JLT but also to different cases with the same charges, which validates the generalizability of the CBKG and the effectiveness of CP-KG.


Table 4. Ablation study results (for example, TextCNN w/KE means that TextCNN was used to encode the Key Elements)

Dataset              JLT                                    CAIL-8
Methods              Acc     Mac-P   Mac-R   Mac-F1   Acc     Mac-P   Mac-R   Mac-F1
TextCNN w/FD         0.9831  0.9813  0.9784  0.9796   0.9346  0.9044  0.8782  0.8773
TextCNN w/KE         0.9727  0.9709  0.9727  0.9716   0.9045  0.8545  0.8839  0.8603
GAT w/CS             0.9493  0.9503  0.9432  0.9458   0.9013  0.8505  0.8676  0.8547
TextCNN w/FD+KE      0.9896  0.9879  0.9876  0.9877   0.9334  0.8976  0.8865  0.8821
TextCNN w/FD+CS      0.9909  0.9899  0.9874  0.9885   0.9336  0.8966  0.8903  0.8867
CP-KG                0.9922  0.9917  0.9903  0.9910   0.9474  0.9165  0.8975  0.9023

6 Conclusion

In this paper, we propose CP-KG, a charge prediction model that integrates a Criminal Behavior Knowledge Graph (CBKG) to address the problem that existing charge prediction methods overlook crucial elements and sequential information in legal cases. CP-KG first extracts Key Elements (KE) and Case Subgraphs (CS) from the CBKG and the Fact Descriptions (FD). These KE and CS are then encoded by a combination of TextCNN and GAT. Finally, the KE, CS, and FD representations are fused to predict the defendant's charges. Experimental results show that CP-KG outperforms the baselines and achieves state-of-the-art performance, with improvements of 25.79% and 13.82% in Macro-F1. The ablation study validates the effectiveness of the KE and CS, which jointly enhance the predictive performance of CP-KG. In addition, the CBKG constructed in this paper provides fine-grained semantic information for charge prediction and generalizes well, making it transferable to related legal tasks.

Acknowledgments. This paper was supported by the National Natural Science Foundation of China (12204062, 61806103, 61562068), the Natural Science Foundation of Inner Mongolia, China (2022LHMS06001), and the Basic Scientific Research Business Project of Inner Mongolia Normal University (2022JBQN106, 2022JBQN111).

References

1. Hochreiter, S., Schmidhuber, J.: Long short-term memory. Neural Comput. 9(8), 1735–1780 (1997)
2. Jiang, X., Ye, H., Luo, Z., Chao, W., Ma, W.: Interpretable rationale augmented charge prediction system. In: Proceedings of the 27th International Conference on Computational Linguistics: System Demonstrations, pp. 146–151 (2018)
3. Zhong, H., Guo, Z., Tu, C., Xiao, C., Liu, Z., Sun, M.: Legal judgment prediction via topological learning. In: Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, pp. 3540–3549 (2018)


4. Yang, W., Jia, W., Zhou, X., Luo, Y.: Legal judgment prediction via multi-perspective bi-feedback network. In: Twenty-Eighth International Joint Conference on Artificial Intelligence (2019)
5. Zhao, J., Guan, Z., Xu, C., Zhao, W., Chen, E.: Charge prediction by constitutive elements matching of crimes. In: Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, pp. 4517–4523 (2022)
6. Chen, S., Wang, P., Fang, W., Deng, X., Zhang, F.: Learning to predict charges for judgment with legal graph. In: Artificial Neural Networks and Machine Learning, pp. 240–252 (2019)
7. Chen, J.X., Huang, Y.J., Cao, G.J., Yang, F., Li, C., Ma, Z.B.: Research and implementation of judicial case visualization based on knowledge graph. J. Hubei Univ. Technol. 34(05), 72–77 (2019)
8. Chen, W.Z.: Research on Legal Text Representation Method Fused with Knowledge Graph. GuiZhou University (2020)
9. Guo, J.: Research and Implementation of Auxiliary Judgment Technology Based on Affair Graph. Beijing University of Posts and Telecommunications (2021)
10. Chen, Y.G.: Research on Entity Relationship Extraction Algorithm for Legal Documents. Dalian University of Technology (2021)
11. Hong, W.X., Hu, Z.Q., Weng, Y., Zhang, H., Wang, Z., Guo, Z.X.: Automatic construction of case knowledge graph for judicial cases. J. Chin. Inf. Process. 34(01), 34–44 (2020)
12. Wang, Z.Z., et al.: Sentencing prediction based on multi-view knowledge graph embedding. Pattern Recogn. Artif. Intell. 34(07), 655–665 (2021)
13. Pennington, J., Socher, R., Manning, C.D.: GloVe: global vectors for word representation. In: Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), pp. 1532–1543 (2014)
14. Xiao, C., Zhong, H., Guo, Z., et al.: CAIL2018: a large-scale legal dataset for judgment prediction. arXiv preprint (2018)
15. Kim, Y.: Convolutional neural networks for sentence classification. arXiv preprint (2014)
16. Yue, L., Liu, Q., Jin, B., et al.: NeurJudge: a circumstance-aware neural framework for legal judgment prediction. In: Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 973–982 (2021)
17. Ma, L., et al.: Legal judgment prediction with multi-stage case representation learning in the real court setting. In: Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 993–1002 (2021)
18. Lyu, Y., et al.: Improving legal judgment prediction through reinforced criminal element extraction. Inf. Process. Manage. 59(1), 102780 (2022)
19. Dong, Q., Niu, S.: Legal judgment prediction via relational learning. In: Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 983–992 (2021)
20. Feng, Y., Li, C., Ng, V.: Legal judgment prediction via event extraction with constraints. In: Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics, pp. 648–664 (2022)

CMFN: Cross-Modal Fusion Network for Irregular Scene Text Recognition

Jinzhi Zheng1,3, Ruyi Ji2(B), Libo Zhang1,4, Yanjun Wu1,4, and Chen Zhao1,4

1 Intelligent Software Research Center, Institute of Software, Chinese Academy of Sciences, Beijing 100190, China
{jinzhi2018,Libo,yanjun,zhaochen}@iscas.ac.cn
2 The Laboratory of Cognition and Decision Intelligence for Complex Systems, Institute of Automation, Chinese Academy of Sciences, Beijing 100190, China
[email protected]
3 University of Chinese Academy of Sciences, Beijing, China
4 State Key Laboratory of Computer Science, Institute of Software, Chinese Academy of Sciences, Beijing 100190, China

Abstract. Scene text recognition, as a cross-modal task involving vision and text, is an important research topic in computer vision. Most existing methods use language models to extract semantic information for optimizing visual recognition. However, the guidance of visual cues is ignored in the process of semantic mining, which limits performance in recognizing irregular scene text. To tackle this issue, we propose a novel cross-modal fusion network (CMFN) for irregular scene text recognition, which incorporates visual cues into the semantic mining process. Specifically, CMFN consists of a position self-enhanced encoder, a visual recognition branch and an iterative semantic recognition branch. The position self-enhanced encoder provides character-sequence position encoding for both the visual recognition branch and the iterative semantic recognition branch. The visual recognition branch performs visual recognition based on the visual features extracted by a CNN and the position encoding provided by the position self-enhanced encoder. The iterative semantic recognition branch, which consists of a language recognition module and a cross-modal fusion gate, simulates the way humans recognize scene text and integrates cross-modal visual cues for text recognition. The experiments demonstrate that the proposed CMFN achieves performance comparable to state-of-the-art algorithms, indicating its effectiveness.

Keywords: Scene Text Recognition · Scene Text Understanding · Neural Networks · OCR

1 Introduction

Scene text recognition, whose main task is to recognize text in image blocks [21], remains a research hotspot in the field of artificial intelligence because of its wide range of application scenarios [25], such as intelligent driving, visual question answering, and image captioning. The task still faces great challenges, mainly due to the complexity and diversity of scene text [22].

© The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2024
B. Luo et al. (Eds.): ICONIP 2023, LNCS 14452, pp. 421–433, 2024. https://doi.org/10.1007/978-981-99-8076-5_31

Fig. 1. Comparison of different scene text recognition methods related to our algorithm. (a) Visual recognition methods. (b) Recognition methods with the visual module in series with a language module. (c) Methods in which the visual module's recognition is corrected by the language module. (d) Our scene text recognition method (CMFN), which fuses visual cues into the language module when mining semantic information.

Most existing methods treat scene text recognition as a sequence generation and prediction task [1,11]. In early methods [1,11,16], a CNN extracts visual features from the scene image and a vision module decodes these features to predict the scene text (Fig. 1(a)). For example, TRBA [1], MORAN [11], SATRN [9] and VisionLAN [20] use LSTM or transformer modules to decode the visual features extracted by the CNN. Such vision-only methods decode the visual features but encode little textual semantic information, even though text carries rich semantics that are valuable for improving recognition accuracy. To exploit the semantic information of text during recognition, several visual-language coupling methods [2,14] have been proposed, which connect a language module after the vision module (Fig. 1(b)). For example, the language decoder in PIMNet [14] extracts semantic information from previously predicted text for semantic recognition. The advantage is that textual semantics can be mined during recognition; the disadvantage is that the final result depends solely on the language module, and visual recognition is only an intermediate result. To overcome this problem, visual-language fusion methods [3,21–23] have gradually been proposed (Fig. 1(c)): after the language module, a fusion gate combines the visual recognition of the vision module with the semantic recognition of the language module and outputs the fused result. However, the language module in these algorithms mines semantic information only from the recognized text and ignores visual cues, which means that even when informative visual cues are present in the image, the language module cannot use them to enhance its understanding of the semantics.

When humans read scene text and the visual features are insufficient to recognize it, they extract semantic information based on what has already been recognized. This process relies not only on the context of the previous recognition but also incorporates visual cues. Note that the visual cues here differ slightly from explicit visual information such as color, shape, or brightness: they refer to more abstract signals or hints perceived from that explicit visual information. Inspired by this, this paper proposes a novel cross-modal fusion network (CMFN) to recognize irregular scene text. The contributions of this paper can be summarized as follows:

– We propose a novel cross-modal fusion network that divides the recognition process into two stages, visual recognition and iterative semantic recognition, with cross-modal fusion during iterative semantic recognition.
– We design a position self-enhanced encoder to provide more effective position coding information for the visual recognition branch and the iterative semantic recognition branch.
– In the iterative semantic recognition branch, we design a language module that fuses visual cues, alleviating the over-reliance on visual recognition when the language module mines semantic information.
– To verify the effectiveness of the proposed algorithm, extensive experiments are carried out on publicly available datasets.

Fig. 2. The overall architecture of CMFN, which comprises a position self-enhanced encoder, a visual recognition branch, and an iterative semantic recognition branch. The dashed arrows indicate the direction in which the attention maps ATm are passed as visual cues. The blue arrows represent the iterative process. (Color figure online)

2 Methodology

The overall structure of CMFN is shown in Fig. 2. Given a scene text image I and a pre-defined maximum text length T, the recognition process consists of three steps. First, the position self-enhanced encoder encodes the character sequence information of the text and outputs position self-enhanced embeddings Pse; the character sequence information is an increasing sequence from 0 to T-1. Then, the visual branch extracts visual features of the text from the scene image I, performs visual recognition based on these features and the position self-enhanced embeddings Pse, and outputs the text prediction probability as the visual recognition TV; it also outputs attention maps ATm as visual cues. Finally, the iterative semantic recognition branch mines semantic information iteratively. It consists of a language recognition module and a cross-modal fusion gate: the language recognition module integrates visual cues to mine semantic information and outputs the language prediction probability as the language recognition TL, and the cross-modal fusion gate fuses visual and language features to output the cross-modal fusion prediction probability as the fusion recognition TF. At the end of the last iteration, the fusion recognition TF is output as the final recognition result.

2.1 Position Self-enhanced Encoder

While the structure of learnable positional encoding [4] is relatively simple, it requires a large dataset for training. Fixed positional encoding [17] uses constants to express position information, which is sufficient for tasks with relatively low sensitivity to position. In scene text recognition, however, training samples are limited and existing models are sensitive to character position information. Inspired by [3,17,22], we propose a position self-enhanced encoder, whose structure is shown in Fig. 2. It is designed to enhance the expression of character position information based on the correlation between the encoding feature dimensions:

$E_{\delta} = \delta(\mathrm{CNN}(\mathrm{pool}(P_{s,c})))$  (1)

$P_{e} = P_{s,c} * \sigma(\mathrm{CNN}(E_{\delta})) + P_{s,c}$  (2)

where $P_{s,c} \in \mathbb{R}^{T \times C_p}$ is the fixed positional encoding, $C_p$ is the dimension of the position self-enhanced embedding, pool is average pooling, CNN is a convolution operation, $\delta$ and $\sigma$ are the ReLU and sigmoid activation functions, and $P_e$ is the output of the self-enhance (SE) block. The encoding process of the position self-enhanced encoder is formulated as:

$M_{p} = \mathrm{softmax}\left(\dfrac{Q K^{T}}{\sqrt{C_p}}\right) V + P_{s,c}$  (3)

$P_{se} = \mathrm{LayerNorm}(M_{p}) \in \mathbb{R}^{T \times C_p}$  (4)

where the query $Q$ comes from the fixed positional encoding $P_{s,c}$, and the key $K$ and value $V$ come from the two self-enhanced block outputs $P_e$. $K^{T}$ is the transpose of $K$, LayerNorm denotes layer normalization, and $P_{se}$ is the output of the position self-enhanced encoder.
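The following is a compact PyTorch sketch of Eqs. (1)–(4): a squeeze-and-excitation-style block over the fixed sinusoidal encoding, followed by scaled dot-product attention and layer normalization. The layer sizes and the 1-D convolutions standing in for "CNN" are our assumptions for illustration; the original implementation may differ.

```python
import math
import torch
import torch.nn as nn

def sinusoidal_encoding(max_len, dim):
    """Fixed positional encoding P_{s,c}, as in the Transformer."""
    pos = torch.arange(max_len).unsqueeze(1).float()
    div = torch.exp(torch.arange(0, dim, 2).float() * (-math.log(10000.0) / dim))
    pe = torch.zeros(max_len, dim)
    pe[:, 0::2], pe[:, 1::2] = torch.sin(pos * div), torch.cos(pos * div)
    return pe

class PositionSelfEnhancedEncoder(nn.Module):
    def __init__(self, max_len=26, dim=512):
        super().__init__()
        self.register_buffer("p_sc", sinusoidal_encoding(max_len, dim))  # (T, C_p)
        # Two SE-style blocks producing the key and value inputs (Eqs. 1-2).
        self.se_blocks = nn.ModuleList(
            nn.Sequential(nn.Conv1d(dim, dim // 4, 1), nn.ReLU(), nn.Conv1d(dim // 4, dim, 1))
            for _ in range(2)
        )
        self.norm = nn.LayerNorm(dim)
        self.dim = dim

    def se(self, block):
        x = self.p_sc.t().unsqueeze(0)                            # (1, C_p, T)
        gate = torch.sigmoid(block(x.mean(dim=2, keepdim=True)))  # pool, conv, ReLU, conv, sigmoid
        return (x * gate + x).squeeze(0).t()                      # Eq. (2): re-weighting + residual

    def forward(self):
        q = self.p_sc                                   # query from the fixed encoding
        k, v = (self.se(b) for b in self.se_blocks)     # key/value from the SE blocks
        attn = torch.softmax(q @ k.t() / math.sqrt(self.dim), dim=-1)
        return self.norm(attn @ v + self.p_sc)          # Eqs. (3)-(4): P_se, shape (T, C_p)

p_se = PositionSelfEnhancedEncoder()()                  # (26, 512)
```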

2.2 Visual Recognition Branch

This branch encodes the visual features of the image according to the position self-enhanced embedding $P_{se}$, recognizes the scene text, and outputs the attention map $A_{Tm}$ as visual cues. We use ResNet50 as the backbone to extract visual features. Given a scene text image $I \in \mathbb{R}^{H \times W \times 3}$, the recognition process of the visual recognition branch can be expressed as:

$V_{r} = \mathrm{Res}(I) \in \mathbb{R}^{\frac{H}{4} \times \frac{W}{4} \times C}$  (5)

$A_{Tm} = \mathrm{softmax}\left(\dfrac{P_{se} K_{r}^{T}}{\sqrt{C}}\right) \in \mathbb{R}^{\frac{H}{4} \times \frac{W}{4} \times 1}$  (6)

$F_{v} = A_{Tm} * V_{r}$  (7)

where $C$ is the dimension of the feature channel, Res stands for ResNet50, $K_{r} = U(V_{r})$, and $U$ is a U-Net. Based on the feature $F_{v}$, the recognition of the visual recognition branch can be formalized as:

$T_{V} = \mathrm{softmax}(\mathrm{fully}(F_{v})) \in \mathbb{R}^{T \times cls}$  (8)

where $cls$ is the number of character classes, fully is a fully connected layer, and softmax is the activation function.
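To make Eqs. (5)–(8) concrete, the sketch below shows the attention-based decoding step in isolation: position queries attend over flattened backbone features to produce per-character glimpses, which a linear classifier turns into character probabilities. The backbone and the U-Net key mapping are stubbed with simple `nn.Conv2d` placeholders, so treat this as a shape-level illustration rather than the paper's exact architecture.

```python
import torch
import torch.nn as nn

class VisualRecognitionBranch(nn.Module):
    """Positional attention over backbone features (Eqs. 5-8), with stubbed backbone/U-Net."""

    def __init__(self, channels=512, max_len=26, num_classes=37):
        super().__init__()
        self.backbone = nn.Conv2d(3, channels, kernel_size=4, stride=4)          # stand-in for ResNet50
        self.key_net = nn.Conv2d(channels, channels, kernel_size=3, padding=1)   # stand-in for U-Net
        self.classifier = nn.Linear(channels, num_classes)
        self.channels = channels

    def forward(self, image, p_se):                  # image: (B, 3, H, W), p_se: (T, C)
        v_r = self.backbone(image)                   # Eq. (5): (B, C, H/4, W/4)
        k_r = self.key_net(v_r)                      # keys from a U-Net-like mapping
        k_flat = k_r.flatten(2)                      # (B, C, H*W/16)
        v_flat = v_r.flatten(2).transpose(1, 2)      # (B, H*W/16, C)
        # Eq. (6): each of the T position queries attends over all spatial locations.
        attn = torch.softmax(p_se @ k_flat / self.channels ** 0.5, dim=-1)  # (B, T, H*W/16)
        f_v = attn @ v_flat                          # Eq. (7): glimpse features, (B, T, C)
        t_v = torch.softmax(self.classifier(f_v), dim=-1)  # Eq. (8): (B, T, num_classes)
        return t_v, attn

branch = VisualRecognitionBranch()
t_v, attn_map = branch(torch.randn(2, 3, 32, 128), torch.randn(26, 512))
```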

2.3 Iterative Semantic Recognition Branch

Language Recognition Module. The language recognition module integrates visual cues to mine language information from the currently recognized text. The structure of the multi-head position-enhanced self-mask attention module is shown in Fig. 3. Inspired by [3,22], the language model treats character reasoning as a fill-in-the-blank task. To prevent leakage of a character's own information, a mask matrix $M \in \mathbb{R}^{T \times T}$ is used, whose main-diagonal elements are negative infinity and whose other elements are 0. The text semantic features are obtained from the position self-enhanced embedding $P_{se}$, the visual recognition $T_V$, and the attention map $A_{Tm}$:

$V_{L} = K_{L} = \mathrm{fully}(T_{V}) \in \mathbb{R}^{T \times C}$  (9)

$F_{L} = \mathrm{softmax}\left(\dfrac{P_{se} K_{L}^{T}}{\sqrt{C}} + M\right) V_{L}$  (10)

where fully is a fully connected layer and softmax is the activation function. The language recognition can then be expressed as:

$L_{vc} = A_{Tm} * P_{s,c}' \in \mathbb{R}^{T \times C}$  (11)

$F_{vL} = F_{L} + L_{vc}$  (12)

$T_{L} = \mathrm{softmax}(\mathrm{fully}(F_{vL})) \in \mathbb{R}^{T \times cls}$  (13)

where $P_{s,c}' \in \mathbb{R}^{\frac{H}{4} \times \frac{W}{4} \times C}$ is the sine-cosine positional code in two-dimensional space [10,17], whose first two dimensions match those of $A_{Tm}$, and $T_L$ is the text recognition result of the language module. $L_{vc}$ is regarded as the representation of the visual cues in the semantic space of the text, so it is also referred to simply as the visual cues unless otherwise specified.
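The sketch below mirrors Eqs. (9)–(13): masked attention from position queries over embedded visual predictions, plus a visual-cue term formed by applying the visual attention maps to a flattened 2-D positional code. The dimensions and the diagonal mask follow the text; everything else (layer sizes, the flattened code) is an illustrative assumption.

```python
import torch
import torch.nn as nn

class LanguageRecognitionModule(nn.Module):
    """Masked attention over visual predictions with an added visual-cue term (Eqs. 9-13)."""

    def __init__(self, num_classes=37, dim=512, max_len=26):
        super().__init__()
        self.embed = nn.Linear(num_classes, dim)        # Eq. (9): V_L = K_L = fully(T_V)
        self.out = nn.Linear(dim, num_classes)
        mask = torch.zeros(max_len, max_len)
        mask.fill_diagonal_(float("-inf"))              # block each position from seeing itself
        self.register_buffer("mask", mask)
        self.dim = dim

    def forward(self, t_v, p_se, attn_map, p2d):
        # t_v: (B, T, cls); p_se: (T, C); attn_map: (B, T, HW); p2d: (HW, C) flattened 2-D code
        v_l = self.embed(t_v)                                           # (B, T, C)
        scores = p_se @ v_l.transpose(1, 2) / self.dim ** 0.5           # (B, T, T)
        f_l = torch.softmax(scores + self.mask, dim=-1) @ v_l           # Eq. (10)
        l_vc = attn_map @ p2d                                           # Eq. (11): visual cues
        f_vl = f_l + l_vc                                               # Eq. (12)
        return torch.softmax(self.out(f_vl), dim=-1)                    # Eq. (13): T_L

lang = LanguageRecognitionModule()
t_l = lang(torch.rand(2, 26, 37), torch.randn(26, 512),
           torch.rand(2, 26, 256), torch.randn(256, 512))
```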


Fig. 3. The structure of the multi-head self-mask attention [3] and the multi-head position-enhanced self-mask attention. MTDC is short for Matrix multiplication, Transpose, Division, Channel square root. ATm comes from the visual recognition branch, and Lvc is the representation of the visual cues in the semantic space of the text.

Fig. 4. Visualization of Lvc for two scene text instances ("PAIN" and "Root"). The first image in each row shows the scene text example, followed by the visual-cue representation Lvc for each character of the text. In each plot, the horizontal axis is the feature dimension and the vertical axis is the corresponding feature value.

The visual cues Lvc for each character of the two text instances "PAIN" and "Root" are visualized in Fig. 4. On the one hand, the visual cues of different characters in a text differ markedly; on the other hand, the Lvc of the same character within a text tend to have similar expressions, although they still differ somewhat because of their different positions. This suggests that using visual cues can enhance the expression of text features.


Cross-Modal Fusion Gate. To integrate the recognition of the visual recognition branch and the language recognition module, we use a simple fusion gate [3,22]. The cross-modal fusion process can be expressed as:

$F_{g} = \sigma([F_{v}, F_{vL}] W_{g})$  (14)

$T_{F} = F_{v} * (1 - F_{g}) + F_{vL} * F_{g}$  (15)

where $W_{g}$ is a trainable parameter matrix and $F_{g}$ is the fusion weight.

2.4 Training Objective Function

During training, the recognition results of multiple modules need to be optimized, so we use a multi-objective loss function:

$L = \gamma_{v} L_{T_V} + \dfrac{1}{N} \sum_{i=1}^{N} \left(\gamma_{l} L_{T_L}^{i} + \gamma_{f} L_{T_F}^{i}\right)$  (16)

where $L_{T_V}$ is the standard cross-entropy loss between the predicted probabilities $T_V$ of the visual recognition branch and the ground-truth text labels, and $L_{T_L}^{i}$ and $L_{T_F}^{i}$ are the standard cross-entropy losses for the predicted probabilities $T_L$ of the language recognition module and $T_F$ of the cross-modal fusion gate at the $i$-th iteration. $N$ is the number of iterations, and $\gamma_{v}$, $\gamma_{l}$ and $\gamma_{f}$ are balancing factors, all set to 1.0 in the experiments of this paper.
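A minimal sketch of the fusion gate of Eqs. (14)–(15) and the multi-objective loss of Eq. (16) is given below. It assumes the visual feature `f_v`, the fused language feature `f_vl`, and per-iteration predictions are already available; the gate over concatenated features and the equally weighted loss terms follow the equations, while the tensor shapes and the use of logits with `cross_entropy` are illustrative choices.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class CrossModalFusionGate(nn.Module):
    """Gated fusion of visual and language features (Eqs. 14-15)."""

    def __init__(self, dim=512):
        super().__init__()
        self.w_g = nn.Linear(2 * dim, dim)

    def forward(self, f_v, f_vl):                     # both: (B, T, dim)
        f_g = torch.sigmoid(self.w_g(torch.cat([f_v, f_vl], dim=-1)))  # Eq. (14)
        return f_v * (1 - f_g) + f_vl * f_g           # Eq. (15): fused feature

def cmfn_loss(logits_v, logits_l_iters, logits_f_iters, targets,
              gamma_v=1.0, gamma_l=1.0, gamma_f=1.0):
    """Eq. (16): visual loss plus averaged per-iteration language and fusion losses."""
    ce = lambda logits: F.cross_entropy(logits.flatten(0, 1), targets.flatten())
    loss = gamma_v * ce(logits_v)
    n = len(logits_l_iters)
    for logits_l, logits_f in zip(logits_l_iters, logits_f_iters):
        loss = loss + (gamma_l * ce(logits_l) + gamma_f * ce(logits_f)) / n
    return loss

# Example with batch size 2, text length 26, 37 character classes, 3 iterations.
targets = torch.randint(0, 37, (2, 26))
logits_v = torch.randn(2, 26, 37)
iters = [torch.randn(2, 26, 37) for _ in range(3)]
loss = cmfn_loss(logits_v, iters, iters, targets)
```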

3 Experiments

3.1 Datasets

Synthetic Datasets. MJSynth [6] is generated by rendering text onto scene images. It contains 9M text images, each generated from a dictionary of 90,000 English words and covering 1,400 Google fonts. SynthText [5] was originally proposed for scene text detection; for scene text recognition it provides 8M image crops taken from the detection set.

Regular Datasets. The three regular text datasets are ICDAR2013 (IC13) [8], Street View Text (SVT) [19] and IIIT5k-Words (IIIT) [12]. IC13 consists of a training subset of 848 images and a validation subset of 1,015 images. SVT comprises images sourced from Google Street View, with 257 training samples and 647 validation samples. IIIT includes a training subset of 2,000 samples and a validation subset of 3,000 samples.

Irregular Datasets. The three irregular scene text datasets are ICDAR2015 (IC15) [7], SVT Perspective (SVTP) [13] and CUTE80 (CUTE) [15]. IC15 contains 4,468 training samples and 2,077 validation samples. SVTP comes from Google Street View and includes 238 images, from which 645 text-containing image patches are cropped for the recognition task. The images in CUTE are collected from digital cameras or the Internet, totaling 288 images, usually with high resolution and irregularly curved text. All of these images are used to evaluate the algorithm.


Table 1. Text recognition accuracy comparison with other methods on six datasets. The current best performance on each dataset is shown in bold.

                            Regular              Irregular
Methods              Years  IC13  SVT   IIIT   IC15  SVTP  CUTE   Params
MORAN [11]           2019   92.4  88.3  91.2   68.8  76.1  77.4   –
TRBA [1]             2019   92.3  87.5  87.9   77.6  79.2  74.0   49.6M
Textscanner [18]     2020   92.9  90.1  93.9   79.4  84.3  83.3   56.8M
RobustScanner [22]   2020   94.8  88.1  95.3   77.1  79.5  90.3   –
SRN [21]             2020   95.5  91.5  94.8   82.7  85.1  87.8   49.3M
PIMNet [14]          2021   95.2  91.2  95.2   83.5  84.3  84.8   –
VisionLAN [20]       2021   95.7  91.7  95.8   83.7  86.0  88.5   33M
ABINet [3]           2021   97.4  93.5  96.2   86.0  89.3  89.2   36.7M
SGBANet [25]         2022   95.1  89.1  95.4   78.4  83.1  88.2   –
S-GTR [23]           2022   96.8  94.1  95.8   84.6  87.9  92.3   42.1M
ABINet-ConCLR [24]   2022   97.7  94.3  96.5   85.4  89.3  91.3   –
CMFN                 Ours   97.9  94.0  96.7   87.1  90.1  92.0   37.5M

3.2 Implementation Details

To make the comparison with other SOTA algorithms [3,24] as fair as possible, we adopt an experimental setup as close to theirs as possible. Before a scene image is input to the model, it is resized to 32 × 128 and data augmentation such as random rotation, geometric transformation, and color jitter is applied. The maximum text length T is set to 26. The recognized characters cover 37 classes: 26 case-insensitive letters, 10 digits, and one special token. The multi-head attention unit in the visual recognition branch has 1 layer; the transformer in the language recognition module has 4 layers with 8 heads. The experiments are implemented in PyTorch on two NVIDIA TITAN RTX graphics cards, each with 24 GB of memory. When training on MJSynth and SynthText, the model is initialized with the pre-trained model provided by ABINet [3]; the batch size is 200, the initial learning rate is 1e-4, and the optimizer is ADAM. The model is trained for 10 epochs in total, and after 5 epochs the learning rate is reduced by a factor of ten with each additional epoch.

3.3 Comparisons with State-of-the-Arts

Due to differences in training strategies and backbones, an absolutely fair comparison between methods is difficult. To be as fair as possible, only results obtained with the same or similar training strategies are analyzed and compared: every model is trained with supervised learning on the same datasets.


The results of our CMFN and other published state-of-the-art methods are shown in Table 1. Compared with ABINet-ConCLR [24], our CMFN improves by 1.7%, 0.8%, and 0.7% on the three irregular datasets IC15, SVTP, and CUTE, respectively. The improvement on the regular datasets is relatively small, because the recognition accuracy on regular scene text is already quite high (for example, ABINet-ConCLR [24] reaches 97.7% on IC13), leaving limited room for improvement. In conclusion, our CMFN outperforms state-of-the-art methods on irregular scene text and achieves comparable results on regular scene text.

3.4 Ablation Study

Analysis of the Iteration Number. Text recognition accuracy for different iteration numbers is shown in Fig. 5. With the iteration number set to 3, the overall recognition accuracy reaches 88.3% on the three irregular text datasets and 96.5% on the three regular datasets; further increasing the iteration number does not bring significant improvement. Therefore, the default iteration number in subsequent experiments is set to 3.

Fig. 5. Text recognition accuracy for different iteration numbers. TOTAL indicates the result over the three corresponding scene text datasets taken as a whole.

Analysis of the Position Self-enhanced Encoder and Visual Cues. To verify the effectiveness of the position self-enhanced encoder, we compare its performance with learnable positional encoding [4] and fixed positional encoding [17], as shown in Table 2. On IC15, SVTP, and CUTE, the position self-enhanced encoder improves by 0.6%, 1.3%, and 1.0% over fixed positional encoding, and by 1.0%, 1.1%, and 1.7% over learnable positional encoding, indicating that it brings higher recognition performance. Compared to the language module without visual cues, including visual cues in the language module improves performance by 0.6% and 0.8% on the IC15 and CUTE datasets, which demonstrates the effectiveness of the proposed fusion of visual cues (see Table 3 for the module-level ablation).

Table 2. Ablation study of the position self-enhanced encoder. PSE, LPE, and VPE abbreviate the position self-enhanced encoder, learnable positional encoding, and fixed positional encoding, respectively. TV, TL, TLn, and TF denote the visual recognition branch, the language recognition module with visual cues, the language recognition module without visual cues, and the cross-modal fusion gate, respectively.

Module           IC15   SVTP   CUTE
VPE+TV+TLn+TF    85.9   88.7   89.2
VPE+TV+TL+TF     86.5   88.8   91.0
LPE+TV+TL+TF     86.1   89.0   90.3
PSE+TV+TL+TF     87.1   90.1   92.0

Table 3. Ablation study of different modules. PSE represents the position self-enhanced encoder. TV (text of visual recognition), TL (text of language recognition), and TF (text of fusion gate recognition) represent the visual recognition branch, the language recognition module, and the cross-modal fusion gate, respectively.

Module           IC15   SVTP   CUTE
PSE+TV           84.6   85.4   88.9
PSE+TV+TL        84.3   89.8   89.9
PSE+TV+TL+TF     87.1   90.1   92.0

Analysis of Different Modules. The ablation results for each module are shown in Table 3. The visual branch achieves 84.6%, 85.4%, and 88.9% recognition accuracy on IC15, SVTP, and CUTE. Compared to the visual recognition branch, the language recognition module decreases accuracy by 0.3% on IC15 but increases it by 4.4% on SVTP and 1.0% on CUTE. The cross-modal fusion gate achieves the best recognition performance on IC15, SVTP, and CUTE, with improvements of 2.5%, 4.7%, and 3.1% over the visual recognition branch and 2.8%, 0.3%, and 2.1% over the language recognition branch. This further shows that the modules of our model are effective.


Fig. 6. Examples of successful recognition of irregular text. GT is the ground truth; ABINet and Ours are the recognition results of the corresponding algorithms.

3.5 Qualitative Analysis

For qualitative analysis, some recognition cases are visualized in Fig. 6. Compared with ABINet, CMFN recognizes more robustly. For example, the visual appearance of "ri" in "safaris" is similar to that of "n", but "safans" is not a meaningful word, so our algorithm corrects the misread "n" back to "ri" according to the semantic relation and recognizes the text correctly. In summary, CMFN has stronger visual expression ability for text, and scene text with insufficient visual features can still be recognized correctly through semantic mining. Even some scene texts that are difficult for the human eye, such as "church" and "grand", are recognized correctly by CMFN.

4 Conclusion

Inspired by how humans recognize scene text, this paper proposes a cross-modal fusion network (CMFN) for irregular scene text recognition. The network consists of three parts: a position self-enhanced encoder, a visual recognition branch, and an iterative semantic recognition branch. The position self-enhanced encoder encodes the position information of the characters in the text. The visual recognition branch decodes the text from visual features. The iterative semantic recognition branch simulates the way humans integrate visual and semantic modalities when recognizing scene text and improves the recognition of irregular text by fusing visual features with semantic information. The experimental results show that CMFN not only achieves strong performance on irregular scene text but also has advantages on regular scene text. In future research, we will explore how to design recognition models based on knowledge reasoning.

References

1. Baek, J., et al.: What is wrong with scene text recognition model comparisons? Dataset and model analysis. In: ICCV, pp. 4714–4722 (2019)
2. Bhunia, A.K., Sain, A., Kumar, A., Ghose, S., Nath Chowdhury, P., Song, Y.Z.: Joint visual semantic reasoning: multi-stage decoder for text recognition. In: ICCV, pp. 14920–14929 (2021)


3. Fang, S., Xie, H., Wang, Y., Mao, Z., Zhang, Y.: Read like humans: autonomous, bidirectional and iterative language modeling for scene text recognition. In: CVPR, pp. 7094–7103 (2021)
4. Gehring, J., Auli, M., Grangier, D., Yarats, D., Dauphin, Y.N.: Convolutional sequence to sequence learning. In: ICML, pp. 1243–1252. JMLR.org (2017)
5. Gupta, A., Vedaldi, A., Zisserman, A.: Synthetic data for text localisation in natural images. In: CVPR, pp. 2315–2324 (2016)
6. Jaderberg, M., Simonyan, K., Vedaldi, A., Zisserman, A.: Synthetic data and artificial neural networks for natural scene text recognition. CoRR abs/1406.2227 (2014)
7. Karatzas, D., et al.: ICDAR 2015 competition on robust reading. In: ICDAR, pp. 1156–1160 (2015)
8. Karatzas, D., et al.: ICDAR 2013 robust reading competition. In: ICDAR, pp. 1484–1493 (2013)
9. Lee, J., Park, S., Baek, J., Oh, S.J., Kim, S., Lee, H.: On recognizing texts of arbitrary shapes with 2D self-attention. In: CVPRW, pp. 2326–2335 (2020)
10. Liao, M., Lyu, P., He, M., Yao, C., Wu, W., Bai, X.: Mask TextSpotter: an end-to-end trainable neural network for spotting text with arbitrary shapes. IEEE Trans. Pattern Anal. Mach. Intell. 43(2), 532–548 (2021)
11. Luo, C., Jin, L., Sun, Z.: MORAN: a multi-object rectified attention network for scene text recognition. Pattern Recogn. 90, 109–118 (2019)
12. Mishra, A., Karteek, A., Jawahar, C.V.: Scene text recognition using higher order language priors. In: BMVC, pp. 1–11 (2012)
13. Phan, T.Q., Shivakumara, P., Tian, S., Tan, C.L.: Recognizing text with perspective distortion in natural scenes. In: ICCV, pp. 569–576 (2013)
14. Qiao, Z., et al.: PIMNet: a parallel, iterative and mimicking network for scene text recognition. In: ACM MM, pp. 2046–2055 (2021)
15. Risnumawan, A., Shivakumara, P., Chan, C.S., Tan, C.L.: A robust arbitrary text detection system for natural scene images. Expert Syst. Appl. 41(18), 8027–8048 (2014)
16. Shi, B., Yang, M., Wang, X., Lyu, P., Yao, C., Bai, X.: ASTER: an attentional scene text recognizer with flexible rectification. IEEE Trans. Pattern Anal. Mach. Intell. 41(9), 2035–2048 (2019)
17. Vaswani, A., et al.: Attention is all you need. In: NIPS, pp. 6000–6010 (2017)
18. Wan, Z., He, M., Chen, H., Bai, X., Yao, C.: TextScanner: reading characters in order for robust scene text recognition. In: AAAI, pp. 12120–12127 (2020)
19. Wang, K., Babenko, B., Belongie, S.: End-to-end scene text recognition. In: ICCV, pp. 1457–1464 (2011)
20. Wang, Y., Xie, H., Fang, S., Wang, J., Zhu, S., Zhang, Y.: From two to one: a new scene text recognizer with visual language modeling network. In: ICCV, pp. 14194–14203 (2021)
21. Yu, D., et al.: Towards accurate scene text recognition with semantic reasoning networks. In: CVPR, pp. 12110–12119 (2020)
22. Yue, X., Kuang, Z., Lin, C., Sun, H., Zhang, W.: RobustScanner: dynamically enhancing positional clues for robust text recognition. In: Vedaldi, A., Bischof, H., Brox, T., Frahm, J.-M. (eds.) ECCV 2020. LNCS, vol. 12364, pp. 135–151. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-58529-7_9
23. He, Y., et al.: Visual semantics allow for textual reasoning better in scene text recognition. In: AAAI, pp. 888–896 (2022)

CMFN: Cross-Modal Fusion Network for Irregular Scene Text Recognition

433

24. Zhang, X., Zhu, B., Yao, X., Sun, Q., Li, R., Yu, B.: Context-based contrastive learning for scene text recognition. In: AAAI, pp. 3353–3361 (2022) 25. Zhong, D., et al.: Sgbanet: semantic gan and balanced attention network for arbitrarily oriented scene text recognition. In: ECCV, pp. 464–480 (2022). https://doi. org/10.1007/978-3-031-19815-1_27

Introducing Semantic-Based Receptive Field into Semantic Segmentation via Graph Neural Networks

Daixi Jia1,2, Hang Gao1,2(B), Xingzhe Su1,2, Fengge Wu1,2, and Junsuo Zhao2

1 Institute of Software, Chinese Academy of Sciences, No. 4 South Fourth Street, Zhongguancun, Haidian District, Beijing, China
{jiadaixi2021,gaohang,xingzhe2018,fengge}@iscas.ac.cn
2 University of Chinese Academy of Sciences, No. 1 Yanqi Lake East Road, Huairou District, Beijing, China
[email protected]

Abstract. Current semantic segmentation models typically use deep learning models as encoders. However, these models have a fixed receptive field, which can cause mixed information within the receptive field and lead to confounding effects during neural network training. To address these limitations, we propose the "semantic-based receptive field" based on our analysis of current models. This approach seeks to improve segmentation performance by aggregating image patches with similar representations rather than by their physical location, aiming to enhance the interpretability and accuracy of semantic segmentation models. For implementation, we integrate graph representation learning (GRL) approaches into current semantic segmentation models. Specifically, we divide the input image into patches and construct them into graph-structured data that expresses semantic similarity. Our Graph Convolution Receptor (GCR) block uses graph-structured data purpose-built from image data and adopts a node-classification-like perspective to address the problem of semantic segmentation. Our GCR module models the relationship between semantically related patches, allowing us to mitigate the adverse effects of confounding information and improve the quality of feature representation. By adopting this approach, we aim to enhance the accuracy and robustness of the semantic segmentation task. Finally, we evaluated our proposed module on multiple semantic segmentation models and compared its performance to baseline models on multiple semantic segmentation datasets. Our empirical evaluations demonstrate the effectiveness and robustness of our proposed module, as it consistently outperformed baseline models on these datasets.

Keywords: Semantic Segmentation · Computer Vision · Graph Representation Learning



1 Introduction

As a fundamental task in computer vision, semantic segmentation strives to give a rich per-pixel classification of image data. These methods for interpreting visual input have a variety of applications, including scene comprehension, object recognition, and augmented reality. Existing semantic segmentation methods are mostly end-to-end, deep-learning-based methods [23] that use Convolutional Neural Networks (CNNs) [19] or transformer-based models [32] as the encoder, while generating the prediction mask with a decoder neural network. These models have already achieved excellent performance, but they still face some problems. The "neurons" within deep neural network models possess receptive fields, which are defined as the spatial extent of the image that a set of weights can access. At lower levels of processing, both convolutional neural network (CNN) and current vision transformer encoders have local receptive fields with fixed sizes [28]. This characteristic can lead to the loss of crucial global information that underpins semantic segmentation, as well as confounding interference from information within the receptive field. Consequently, these limitations can impede the training and prediction of the model and make it difficult for models to learn the true pattern in the image. The fixed-shape receptive field of CNNs ensures the network's inductive biases, such as translation invariance and locality. However, the fixed nature of the receptive field shape can result in the inclusion of information that is irrelevant to the target task, which may interfere with a neuron's ability to accurately perceive the target information. Such information has the potential to negatively impact the performance and generalization ability of the neural network. To address this issue, research efforts have focused on developing techniques to effectively handle confounding information, including approaches such as increasing network depth [12] and incorporating attention-based mechanisms [9].

Fig. 1. Illustration of the receptive fields. As illustrated in (a), the receptive field of CNNs has a fixed size, so information from multiple categories is included in it. The receptive field of ViT (b) is better, as it can change the weights of internal vectors to focus attention on key parts. The semantic-based receptive field (c), on the contrary, covers only those parts that can be classified as "goat" and ignores the parts that are irrelevant for segmentation.

Vision transformers regard an image as a sequence and achieve cognition by performing attention weighting on every position in the image, leading to the global


receptive field [8]. However, this global receptive field may have limitations in segmentation tasks. In the context of semantic segmentation, the global receptive field of vision transformers has inherent problems: it loses crucial local information and inevitably introduces confounding information that interferes with model convergence and leads to performance degradation. To address this issue, contemporary vision transformers tackle the locality issue by employing methods such as sliding windows [21] and limiting the local receptive field [31]. However, this still does not fully resolve the problem of confounding factors within the receptive field, because the model still processes information from irrelevant pixels: the features at each position are obtained by weighting information from all pixels in the entire image or in a certain set of image patches.

In light of the limited success of previous works involving CNNs and vision transformers in addressing the issue of confounding information, we propose the "semantic-based receptive field". As illustrated in Fig. 1(c), the semantic-based receptive field has a more flexible form depending on semantic information, whereas the receptive field of CNNs and transformers has a fixed size. Learning the direct effect from highly related parts of the image allows specific patches to aggregate only with those patches that are related to their representation. However, while CNN- and transformer-based models both use a predefined paradigm to learn how to represent images, neither of them can implement the suggested idea. Noting that graph representation learning (GRL) is a more flexible processing approach, we introduce GRL to address this issue. GRL methods are usually used to process graph data with topological forms. To represent images as graph structures for processing, before each time that data is input into the model, we first divide the image into multiple patches as graph nodes. Then, based on the similarity between these patches, they are connected into graphs. Finally, we employ GNNs to extract representations from the constructed graphs. The advantage of such a design is that similarity, rather than spatial information, is used to bind the patches together. Patches with more similar information tend to be connected as training goes on, which is akin to the receptive field being deformed in accordance with the semantic information. However, this design also has a disadvantage: it lacks important inductive biases like locality and translation invariance. Therefore, for implementation, we propose our Graph Convolution Receptor (GCR) module as a plug-in to current semantic segmentation models. By replacing a portion of the CNN or transformer channels, we are able to enhance the model's performance without introducing additional parameters. Empirically, we evaluate our proposed structure on three different datasets. Extensive experiments show that models with GCR outperform state-of-the-art graph representation learning methods on semantic segmentation tasks. Furthermore, models with our GCR module added show better robustness on naturally and artificially corrupted data. Our contributions are as follows:

– We analyze the receptive field of current semantic segmentation models and propose the semantic-based receptive field, which is crucial for semantic segmentation tasks.
– We propose a novel block, GCR, which uses graph representation learning to enhance the performance of semantic segmentation models.
– We conduct multiple experiments to verify the effectiveness of our method. Our model also shows advantages in robustness and convergence speed.


2 Related Works

2.1 Semantic Segmentation

Semantic information holds wide applications within machine learning tasks [1, 16], and semantic segmentation utilizes the semantic information in images to forecast a pixel-level categorization. Such a task can be viewed as a more challenging, fine-grained form of image classification. This relationship was pointed out and studied systematically in a seminal work [23] in which the authors designed fully convolutional networks (FCNs) for semantic segmentation. Since then, the FCN model has become a standard design paradigm for dense prediction and has inspired many follow-up works. Subsequent works mainly focused on improving FCNs in aspects such as using deeper neural networks [41], enlarging the receptive field [38], and introducing boundary information [7]. These architectures improved the performance of semantic segmentation significantly. Furthermore, transformer models introduced from natural language processing (NLP) have also achieved competitive performance on semantic segmentation tasks. The application of semantic segmentation is not limited to 2D images but also extends to point clouds [18], 3D scenes [27], and videos.

2.2 Graph Neural Networks (GNNs)

GNNs conduct graph representation learning by propagating information among neighboring nodes. The learned representations have various applications in downstream tasks. Similar to other artificial neural networks, multiple variants of GNNs have been developed, such as Graph Convolutional Networks (GCNs) [17], which utilize convolution for graph learning, Graph Attention Networks (GATs) [33], which introduce the attention mechanism, and Graph Isomorphism Networks (GINs) [36], which propose a graph learning architecture as powerful as the Weisfeiler-Lehman test. Based on these networks, researchers have also proposed many improved architectures for specific application domains, including social networks [37], natural language processing [25], molecular biology [10], chemistry [2], and physics [26]. Some methods also adopt GNNs for computer vision tasks. In these tasks, GNNs were mainly used to learn data that naturally possess a graph structure, e.g., 3D point cloud segmentation [18] and human action recognition [15]. However, recent works have applied GNNs to image classification [11] and proved their potential for processing digital images.

3 Method

Based on our analysis of the prediction generation process for semantic segmentation, we introduce the GCR block. As shown in Fig. 2, our model consists of two major components: 1) a standard pyramid semantic segmentation model that provides useful inductive biases, and 2) a graph convolution receptor, implemented using graph representation learning, which replaces part of the channels of the pyramid encoder to realize the semantic-based receptive field.


Fig. 2. The overview of our model structure.

3.1 Graph Convolution Receptor

We use a GNN to implement our semantic-based receptive field by replacing a set of CNN or ViT channels with GNN channels. This module can enhance the performance of existing models, allowing neurons to attend more prominently to image patches that have semantic relationships with the position being considered. Different from CNNs and ViTs, our graph convolution receptor possesses a changeable form based on learned information. As shown on the right of Fig. 3, the semantic-based receptor selects semantically related neurons from the feature map for aggregation, rather than according to physical location as CNNs do. The form of the semantic-based receptive field of each neuron is defined based on the representation of the input visual data. To generate such a receptive field, a GNN architecture is introduced into our encoder. The input image, denoted as I, with dimensions H × W × C, is transformed into a graph-based representation G = {V, E}, where V represents the node set and E represents the edge set. The image is divided into N patches, each of which is represented as a vector x_i ∈ R^D, where D is the dimension of the patch vector. In order to mitigate the potential loss of information due to directly slicing an image into patches, a stem architecture utilizing convolutional layers is implemented. The stem architecture downsamples the input image I to a feature map of size H/4 × W/4, providing each patch with overlapped data. These patch vectors are then utilized as node features within the node set V. Edges within the edge set E are established based on similarity, as calculated from the semantic representation. Specifically, an edge e_{uv} ∈ E is created between nodes v and u if node u is among the k nearest neighbors of node v in terms of similarity. The encoder employs a neighborhood aggregation strategy, in which the representation h_v^{(k)} of node v is iteratively updated at the k-th layer. Through k rounds of aggregation, the information of the k-hop neighborhood of node v is propagated to itself.
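The following is a minimal PyTorch-style sketch of this graph construction, assuming a two-layer convolutional stem and a feature-space k-nearest-neighbor rule; the stem depth, channel width, and value of k are illustrative assumptions rather than the paper's exact configuration.

import torch
import torch.nn as nn

class PatchGraphBuilder(nn.Module):
    """Sketch of the graph construction: a convolutional stem downsamples the image to
    H/4 x W/4 so each position becomes a node, and edges connect each node to its
    k nearest neighbours in feature space."""

    def __init__(self, in_ch: int = 3, dim: int = 96, k: int = 9):
        super().__init__()
        self.k = k
        self.stem = nn.Sequential(
            nn.Conv2d(in_ch, dim // 2, 3, stride=2, padding=1), nn.BatchNorm2d(dim // 2), nn.GELU(),
            nn.Conv2d(dim // 2, dim, 3, stride=2, padding=1), nn.BatchNorm2d(dim),
        )

    def forward(self, img: torch.Tensor):
        # img: (1, 3, H, W); a batch of one is used to keep the sketch simple
        nodes = self.stem(img).flatten(2).transpose(1, 2).squeeze(0)    # (N, dim), N = H/4 * W/4
        dist = torch.cdist(nodes, nodes)                                # pairwise feature distances
        knn_idx = dist.topk(self.k + 1, largest=False).indices[:, 1:]   # drop the self-loop
        return nodes, knn_idx                                           # node features and neighbour indices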


The architecture of the k-th layer of the GCR module can be mathematically formulated as follows:

a_v^{(k)} = f^{(k)}({h_u^{(k-1)} : u ∈ N(v)}),   (1)
h_v^{(k)} = φ^{(k)}(h_v^{(k-1)}, a_v^{(k)}),   (2)

where h_v^{(k)} is the vector representation of node v in the k-th layer of the network. The aggregation function f(·) and the combine function φ(·) are critical operators for GNNs. There exist multiple choices for these operators, as many kinds of aggregation functions with different characteristics have been proposed. Here, we model the pairwise distances between these nodes as edge weights. Specifically, the weight of the edge between any two nodes (i.e., two patch representations) is positively correlated with the proximity of their respective distances. Formally, the aggregation function f(·) can be formulated as:

f^{(k)} = D_{uv}({h_v^{(k-1)} − h_u^{(k-1)} | u ∈ N(v)}) / N,   (3)

where D_{uv} represents the pairwise distance between features u and v and N represents the number of neighbours of node v. As for the combine function φ(·), the formal representation is as follows:

h_v^{(k)} = φ^{(k)}(h_v^{(k-1)}, f^{(k)}(h^{(k-1)})).   (4)
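As a concrete illustration of Eqs. (1)–(4), here is a minimal PyTorch sketch of the distance-weighted difference aggregation and a simple combine step, reusing the neighbour indices produced by the graph builder sketched earlier. Using a linear layer over the concatenation of h_v and the aggregated message as φ is an assumption, and the batch dimension is omitted for clarity.

import torch
import torch.nn as nn

class GCRAggregation(nn.Module):
    """Sketch of Eqs. (1)-(4): distance-weighted difference aggregation over each node's
    k nearest neighbours, followed by a combine step phi (a linear layer here)."""

    def __init__(self, dim: int):
        super().__init__()
        self.combine = nn.Linear(2 * dim, dim)   # phi combines h_v with the aggregated message

    def forward(self, h: torch.Tensor, knn_idx: torch.Tensor) -> torch.Tensor:
        # h: (N, D) node features; knn_idx: (N, k) neighbour indices
        neigh = h[knn_idx]                                  # (N, k, D)
        diff = h.unsqueeze(1) - neigh                       # h_v - h_u for u in N(v)
        d_uv = torch.norm(diff, dim=-1, keepdim=True)       # pairwise distances used as edge weights
        agg = (d_uv * diff).mean(dim=1)                     # Eq. (3): weighted mean of differences
        return self.combine(torch.cat([h, agg], dim=-1))    # Eq. (4): combine with the old state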

With the transformed data and the GCR module, we are able to implement the semantic-based receptive field in our method. As the graph generation does not rely on physical position to select neighbors but on similarity, data propagation is carried out among semantically related nodes, which yields a semantic-based receptive field. Figure 2 gives an illustration of such a semantic-based receptive field. Such a mechanism also provides an explicit structure for the aggregation of high-level semantics. Experimental results in Sect. 4.3 demonstrate that models with the GCR module converge significantly faster than other models. Moreover, the GCR module encodes the relative values of the nodes generated from image patches, so it shows better zero-shot robustness, especially on corruptions like blur, fog, and color casts.

3.2 Over-Smoothing Alleviation

Our GCR module utilizes data aggregation among nodes to learn the representation of graph nodes. However, this mechanism can lead to the over-smoothing phenomenon, which decreases the distinctiveness of node features and ultimately results in all nodes possessing the same vector representation. This finally leads to degraded performance in the segmentation task. To mitigate this issue, we propose the following modifications to our GCR module:


Diversity Elevation. Like our baseline methods [11, 21, 22], we apply a linear layer before and after the graph convolution layer to increase the diversity and limit the dimensionality of features, and a nonlinear layer is added after the graph convolution layer. Given a feature map H^{(k)} = {h_1^{(k)}, h_2^{(k)}, ..., h_N^{(k)}} ∈ R^{N × D^{(k)}} of layer k, and defining the graph processing mentioned above as H^{(k)} = Φ(H^{(k−1)}), the modified graph processing module of layer k can be formalized as:

H_g^{(k)} = σ(Φ(H^{(k)}) W_in^{(k)}) W_out^{(k)} + H^{(k)},   (5)

where σ represents a nonlinear activation function for diversity enhancement, and W_in and W_out denote the learnable parameters of the linear layers.

Fig. 3. The receptive field of different models. Our method improves the receptive field of CNN or ViT, enabling the models to perceive global semantic information without losing important inductive biases.

Transformation Elevation. To further alleviate the over-smoothing phenomenon, we use a simple MLP (multilayer perceptron) on each graph node for further transformation of the vector space, like the method used in our baseline models [11, 21, 22]. This can be described as:

H_f^{(k)} = MLP(H_g^{(k)}),   (6)

where H_f^{(k)} ∈ R^{N × D^{(k)}} is the final output of our GCR module in the k-th layer. The hidden dimension of the MLP is usually greater than D^{(k)} by an augmentation factor


called the expansion ratio. Following the implementation of all fully-connected layers, a batch normalization operation is applied as standard practice, although it is omitted in this concise description. This structure forms the core component of the GCR module.

Graph Shortcut. Inspired by the idea of residual connections, we apply a strategy called Graph Shortcut, which effectively mitigates the issue of over-smoothing while introducing a minimal number of additional parameters. Formally, our Graph Shortcut is defined as:

H_f^{l′} = α D(H_f^{0}) + (1 − α) H_f^{l},   (7)

where H_f^{l} represents the last layer's output of the GCR module and α is a hyperparameter that will be discussed later. The function D is a simple MLP that downsamples H_f^{0} to the same scale as H_f^{l}.
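Putting Eqs. (5)–(7) together, a single GCR layer could be composed as in the sketch below, reusing the GCRAggregation class from the sketch above; the GELU activation, the expansion ratio of 4, and the exact placement of the linear layers follow the reconstructed equations and are assumptions rather than the authors' released implementation.

import torch
import torch.nn as nn

class GCRLayer(nn.Module):
    """Sketch of one GCR layer: diversity elevation (Eq. 5) around the graph aggregation,
    then a per-node MLP with batch normalization (Eq. 6)."""

    def __init__(self, dim: int, expansion: int = 4):
        super().__init__()
        self.graph = GCRAggregation(dim)        # Φ, the graph aggregation sketched earlier
        self.w_in = nn.Linear(dim, dim)
        self.w_out = nn.Linear(dim, dim)
        self.act = nn.GELU()                    # σ, diversity-enhancing nonlinearity
        self.mlp = nn.Sequential(               # Eq. (6)
            nn.Linear(dim, expansion * dim), nn.BatchNorm1d(expansion * dim), nn.GELU(),
            nn.Linear(expansion * dim, dim), nn.BatchNorm1d(dim),
        )

    def forward(self, h: torch.Tensor, knn_idx: torch.Tensor) -> torch.Tensor:
        h_g = self.w_out(self.act(self.w_in(self.graph(h, knn_idx)))) + h   # Eq. (5)
        return self.mlp(h_g)                                                # Eq. (6)


def graph_shortcut(h_first, h_last, downsample, alpha: float = 0.5):
    # Eq. (7): blend downsampled first-layer features with last-layer features
    # to counteract over-smoothing.
    return alpha * downsample(h_first) + (1.0 - alpha) * h_last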

3.3 Baseline Encoder

The GCR module takes image patches as unordered graph data, which causes the loss of inductive biases like translation invariance and localization. Based on the analysis above, we build our GCR module in parallel to any typical pyramid backbone. The baseline encoder works in parallel with the GCR module, providing basic inductive biases for image understanding. As shown in Fig. 2, our GCR is a plug-in to current models that runs in parallel with the baseline encoder. To assess the efficacy of our proposed GCR method, we evaluate its performance using commonly used state-of-the-art transformer and CNN models as baselines. By comparing our method to these well-established baselines, we aim to demonstrate the superiority of our approach and establish its effectiveness in improving the overall quality and accuracy of semantic segmentation results.

3.4 Decoder

Pyramid architectures [12] consider the multi-scale property of images by extracting features with gradually smaller spatial size and greater dimension as the network goes deeper, and they have become a popular paradigm for semantic segmentation. A large number of empirical results have shown that the pyramid architecture is effective for visual tasks, especially the semantic segmentation task, which needs both global semantics and fine-grained forecasting. A typical semantic segmentation decoder can be described formally as:

Output^o = Concat(H_f^o, C^o),   (8)

where o ∈ {1, 2, ..., n} indexes the n output stages of the pyramid encoder, C^o is the output of the CNN at stage o, and H_f^o is the output of the GNN at stage o. Concat(·) is the concatenation operator. The output scales of the encoder are H/4 × W/4, H/8 × W/8, H/16 × W/16, and H/32 × W/32 for an image of scale H × W. An UperNet [35] decoder that takes the pyramid feature maps as input and generates per-pixel predictions is applied.
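A minimal sketch of the per-stage fusion in Eq. (8) is given below, assuming the GNN branch's node features are reshaped back onto the stage's spatial grid before concatenation; tensor shapes are illustrative.

import torch

def fuse_stage_features(cnn_feat: torch.Tensor, gnn_nodes: torch.Tensor) -> torch.Tensor:
    """Eq. (8) for one pyramid stage: reshape GNN node features to the stage grid and
    concatenate them with the CNN features along the channel dimension."""
    b, _, h, w = cnn_feat.shape                                  # (B, C_o, H_o, W_o)
    gnn_feat = gnn_nodes.transpose(1, 2).reshape(b, -1, h, w)    # (B, N, D) -> (B, D, H_o, W_o)
    return torch.cat([cnn_feat, gnn_feat], dim=1)

# The four fused maps (strides 4, 8, 16, 32) are then fed to an UperNet-style decode head.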


4 Experiments

In this section, we present a series of experiments to evaluate the performance of our proposed GCR module and examine its structure.

4.1 Datasets

We adopt the ADE20k and Cityscapes datasets for semantic segmentation. ADE20k [42] is a widely-used benchmark with 20,000 well-labeled training images and 2,000 validation images belonging to 150 categories. For the license of the ADE20k dataset, please refer to https://groups.csail.mit.edu/vision/datasets/ADE20k/. Most images in ADE20k depict indoor scenes, while Cityscapes [4] mainly focuses on outdoor road scenes. Cityscapes contains 19 categories of labels and has 2,975 training images along with 500 validation images. For the license of the Cityscapes dataset, please refer to https://www.cityscapes-dataset.com/.

4.2 Experimental Settings

For all our models, we pretrain the encoder blocks on the ImageNet1k or ImageNet22k [6] dataset. For the pretraining strategies, we follow the widely-used training strategies proposed in DeiT [30] for fair comparison. The data augmentation techniques we use include RandAugment [5], Mixup [40], CutMix [39], repeated augmentation [13], and stochastic depth [14]. To ensure fairness in comparison, a widely-utilized UperNet decoder structure is adopted without any additional modifications. The models are trained using a batch size of 16 for the ADE20k dataset and 8 for the Cityscapes dataset on 8 RTX5000 GPUs. The implementation of our method is based on the MMSegmentation [3] codebase, and during training, data augmentation techniques such as random resizing with a ratio of 0.5–2.0, horizontal flipping, and random cropping to 512 × 512 and 1024 × 1024 for ADE20k and Cityscapes, respectively, are applied. The models are trained using the AdamW optimizer for 160,000 iterations on both the ADE20k and Cityscapes datasets, with an initial learning rate of 0.00006 and a "poly" learning rate schedule with a default factor of 1.0. Inference is performed using a sliding-window approach by cropping 1024 × 1024 windows for Cityscapes.
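For concreteness, a schematic MMSegmentation-style snippet of this schedule is shown below; the exact config keys depend on the MMSegmentation version used, and the weight decay value is an assumption that is not stated in the text.

# Schematic training config mirroring the settings above (ADE20k; Cityscapes uses
# a 1024 x 1024 crop and 1 image per GPU). Keys follow the pre-1.x MMSegmentation style.
optimizer = dict(type='AdamW', lr=6e-5, weight_decay=0.01)   # weight decay is an assumption
lr_config = dict(policy='poly', power=1.0, min_lr=0.0, by_epoch=False)
runner = dict(type='IterBasedRunner', max_iters=160000)
crop_size = (512, 512)
data = dict(samples_per_gpu=2)   # 2 images x 8 GPUs = batch size 16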

4.3 Comparison with Baseline Methods

In our evaluation, we compare our proposed method against several baseline methods from three key aspects: performance, robustness, and convergence speed. Specifically, we conduct a comprehensive analysis of the performance of our approach and the baselines in terms of multiple metrics. We also investigate the robustness of the methods by examining their performance under various perturbations, such as noise and occlusion, and assess their ability to handle input variations. Furthermore, we analyze the convergence speed of the methods and compare the time taken to reach a satisfactory level of performance.


Table 1. The main results on ADE20k and Cityscapes. Results marked with '*' indicate that the model is pretrained on the ImageNet22k dataset.

Method | Encoder | Decoder | ADE20k (Val) mIoU / aACC | Cityscapes (Val) mIoU / aACC | Parameters (M)
UperNet-r50 | ResNet-50 | UperNet | 41.65 / 80.14 | – | 66.52
UperNet-r101 | ResNet-101 | UperNet | 43.21 / 80.81 | – | 85.51
ConvNeXt-Ti | ConvNeXt-Ti | UperNet | 46.41 / 81.77 | 80.78 / 96.57 | 60.24
ConvNeXt-S | ConvNeXt-S | UperNet | 48.33 / 82.44 | 81.93 / 96.77 | 81.88
ConvNeXt-B | ConvNeXt-B | UperNet | *51.44 / *84.27 | 82.01 / 96.79 | 122.1
UperNet-r50 (with GCR) | ResNet-50+GCR | UperNet | 42.71 / 80.3 | – | 66.49
UperNet-r101 (with GCR) | ResNet-101+GCR | UperNet | 44.13 / 80.96 | – | 85.47
ConvNeXt-T (with GCR) | ConvNeXt-Ti+GCR | UperNet | 46.63 / 81.85 | 80.84 / 96.5 | 60.19
ConvNeXt-S (with GCR) | ConvNeXt-S+GCR | UperNet | 48.59 / 82.49 | 81.99 / 96.79 | 81.81
ConvNeXt-B (with GCR) | ConvNeXt-B+GCR | UperNet | *51.64 / *84.62 | 82.24 / 96.89 | 121.9
SwinTransformer-T | SwinTransformer-Ti | UperNet | 44.53 / 81.17 | 78.66 / 96.21 | 59.94
SwinTransformer-S | SwinTransformer-S | UperNet | 47.99 / 82.58 | 79.62 / 96.4 | 81.26
SwinTransformer-B | SwinTransformer-B | UperNet | *51.27 / *84.27 | 80.42 / 96.53 | 121.28
SegFormer-b5 | MiT | MLP | 48.7 / 82.51 | 80.35 / 96.53 | 82.01
SwinTransformer-T (with GCR) | SwinTransformer-Ti+GCR | UperNet | 45.12 / 81.41 | 78.16 / 96.01 | 59.89
SwinTransformer-S (with GCR) | SwinTransformer-S+GCR | UperNet | 48.31 / 82.65 | 79.75 / 96.46 | 81.19
SwinTransformer-B (with GCR) | SwinTransformer-B+GCR | UperNet | *51.52 / *84.46 | 80.53 / 96.62 | 121.08
SegFormer-b5 (with GCR) | MiT+GCR | MLP | 49.08 / 82.73 | 80.41 / 96.57 | 85.17

Results on Performance. The experimental results on the ADE20k and Cityscapes datasets are demonstrated in Table 1. For metrics, we evaluate semantic segmentation performance using mean Intersection over Union (mIoU) and average Accuracy (aACC). We observed that all the methods achieved performance improvement after incorporating our module. We attribute such performance to the characteristics of our GCR that can perceive the global semantic information more effectively.

Fig. 4. Experimental results on ADE-C. Among these blurs and noises, 'Gauss' represents Gaussian noise; 'Shot' represents shot noise; 'Impulse' represents impulse noise; 'Glass' represents glass blur. Models with GCR added mainly show more prominent robustness under color cast, fog, and defocus blur.


Fig. 5. Examples tested for different models in Foggy-Driving dataset without additional training. Models with GCR can predict segmentation edges more accurately in the foggy weather.

Results on Robustness. Compared with the curated data in well-labeled datasets, images collected in the natural world face various kinds of noise, such as clouds, Gaussian noise, blur, and smudges. Our model introduces the idea of the semantic-based receptive field and implements it using graph representation learning, making the model robust to these perturbations. In each layer, the GNN selects neighbors dynamically according to similarity for aggregation. It also processes the differences between feature vectors rather than their absolute values, which leads to a stronger ability to deal with interference. We therefore design experiments to test the performance of our models on noisy data and experimentally demonstrate that our method is structurally robust. We train the model on Cityscapes using the standard training pipeline and test the performance directly on the Foggy-Driving [29] dataset. The Foggy-Driving dataset is a Cityscapes-like segmentation dataset with detailed annotations collected in the real world under fog, and it shares the same annotation categories as Cityscapes. As shown in Fig. 5 and Table 2, compared to CNN or transformer models, our method can predict more accurate segmentation edges under fog. This is likely due to the fact that the GCR module in our model processes relative values of patch vectors, as opposed to the absolute values processed by CNNs and transformers. To further test the zero-shot robustness of our proposed method, we generated ADE-C based on [24], which expands the validation set with 16 kinds of algorithmically generated corruptions. ADE-C is used to test the robustness of Swin-Transformer and ConvNeXt with or without the GCR module. The results, as depicted in Fig. 4, demonstrate that our method exhibits a high level of robustness, particularly with regard to color cast and blur distortions. It should be noted that all models tested in this section have a parameter size similar to that of the ConvNeXt Base-size model.


Table 2. Test results on the Foggy-Driving dataset; all models are trained on the Cityscapes dataset and tested directly without any further training.

Model | mIoU
ConvNeXt | 48.54
ConvNeXt+GCR | 52.83
Swin-Transformer | 47.22
Swin-Transformer+GCR | 49.98

Visualization. The visualization results presented in Fig. 6 demonstrate the efficacy of our proposed method. Specifically, we used the shallow layers of the neural network to generate CAM maps for visualization. Clearly, our model is capable of accurately attending to category-specific edge information in images at shallow levels, which provides an experimental explanation for the good performance of our model. This observation provides compelling evidence of the enhanced performance achieved by our method.

Fig. 6. Visualization examples of Class Activation Mapping (CAM) for Swin-Transformer with or without our GCR module at a low stage. The tested activation classes are 'dog', 'llama', 'frog', and 'cat', respectively.


Fig. 7. The convergence speed of different models trained on ADE20k directly, without any pretraining; all results were tested on the validation set. This figure shows the first 34,000 iterations with a batch size of 16.

Results on Convergence Speed. In contrast to CNNs and transformers, our GCR module employs a novel strategy for semantic segmentation tasks by selecting neighbors for aggregation based on similarity. This graph convolution module allows the model to aggregate global information at shallow layers, resulting in faster convergence, as observed through experimental results. The utilization of the GCR module is believed to contribute to the improved performance of the segmentation models. In order to accentuate the distinction, we refrain from utilizing pre-trained parameters and instead opt for direct training on the ADE20k dataset. Empirical evidence demonstrates that our proposed GCR method exhibits a faster convergence rate during the initial training iterations when compared to other models. The experimental results are shown in Fig. 7.

4.4 Ablation Study

The Necessity of the GCR Module. To demonstrate the effectiveness of the GCR module, we test the performance of the model by replacing the GCR module with other CNNs and transformers with a corresponding number of parameters. In Table 3 we show the evaluation results of models with a parameter size around that of ConvNeXt-Base.

Influence of Different GNN Types. In this experiment, we aimed to demonstrate the robustness of our model by evaluating its performance when utilizing different types of graph aggregation operators. Specifically, we modified the GNN structure in our encoder while maintaining the same pre-training techniques and training procedures on the ImageNet1k and ADE20k datasets, using ConvNeXt-B as our baseline. Our results, as presented in Table 4, indicate that the graph convolution we applied yielded better performance, and the other graph aggregators also achieved competitive capacity.


Table 3. mIoU on ADE20k of different network structures concatenated to ConvNeXt and Swin-Transformer. '+C', '+S', '+N' represent concatenating GCR, Swin-Transformer, and ConvNeXt to the network, respectively.

Model | mIoU | Model | mIoU
ConvNeXt | 49.25 | Swin-Transformer | 48.75
ConvNeXt+C | 49.63 | Swin-Transformer+C | 49.17
ConvNeXt+S | 49.01 | Swin-Transformer+S | 48.42
ConvNeXt+N | 48.84 | Swin-Transformer+N | 48.67

This experiment particularly demonstrated the structural rationality and potential of our method, as it achieved good performance when using various graph operators.

Table 4. ADE20k segmentation results using different types of GNN operators. The FLOPS are calculated with an image scale of 3 × 512 × 512.

GNN type | Parameters (M) | GFLOPS | mIoU
Ours | 121.9 | 306.7 | 49.63
Max-Relative [20] | 121.58 | 301.86 | 49.53
GIN [36] | 121.24 | 301.57 | 48.81
EdgeConv [34] | 121.9 | 306.7 | 49.56

Influence of the Shortcut Coefficient. The utilization of the shortcut coefficient in our GCR module is crucial due to the potential risk of over-smoothing within a GNN with a deep structure. To evaluate the effect of the hyperparameter α in Eq. 7 on model performance, we conducted experiments using GCR-ConvNeXt-B on the ADE20k dataset while pretraining on ImageNet1k. The results demonstrate that the model performs optimally when α is approximately 0.5, as illustrated in Fig. 8.

Influence of the Downsample Layer Type. In order to generate features with a pyramid structure in the GCR module, a downsampling layer is required, as GNNs do not possess the convenient downsampling method that CNNs have through pooling. By default, a single convolution layer is utilized as the downsampling layer. In order to demonstrate that the type of downsample layer is inconsequential for the GNN model, an experiment was conducted utilizing various implementations of the downsample layer on the GCR-ConvNeXt-B model. The results of these different downsample methods were then compared on the ADE20k dataset (as shown in Table 5). The results of this experiment indicate that the use of different types of downsample layers has minimal impact on the performance of the GNN.


Fig. 8. The influence of α on mIoU.

Table 5. ADE20k segmentation results using different types of downsample layer in the GNN.

Downsample | Conv layer | FC layer
mIoU | 49.63 | 49.51

5 Conclusion

This paper presents a novel module, GCR, to address the semantic segmentation problem with a semantic-based receptive field. We put forward the idea of a receptive field with a semantic foundation that adheres to the fundamental tenets of semantic segmentation, and implement it with graph representation learning. Furthermore, we adopt a parallel CNN or ViT encoder to ensure that beneficial inductive biases remain accessible to our model. Experimental results verify the effectiveness of our proposed model, along with its other advantages. Future work will focus on integrating the intuition of graph representation learning more deeply into various image segmentation tasks, specifically open-vocabulary segmentation and scene understanding.

Acknowledgements. This work was supported by the CAS Project for Young Scientists in Basic Research, Grant No. YSBR-040.

References 1. Chen, L.C., Papandreou, G., Kokkinos, I., Murphy, K.P., Yuille, A.L.: DeepLab: semantic image segmentation with deep convolutional nets, atrous convolution, and fully connected CRFs. IEEE Trans. Pattern Anal. Mach. Intell. 40, 834–848 (2016) 2. Coley, C.W., et al.: A graph-convolutional neural network model for the prediction of chemical reactivity. Chem. Sci. 10(2), 370–377 (2019) 3. MMS Contributors: MMSegmentation: OpenMMLab semantic segmentation toolbox and benchmark (2020). https://github.com/open-mmlab/mmsegmentation


4. Cordts, M., et al.: The cityscapes dataset for semantic urban scene understanding. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2016) 5. Cubuk, E.D., Zoph, B., Shlens, J., Le, Q.V.: RandAugment: practical automated data augmentation with a reduced search space. In: 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR Workshops 2020, Seattle, WA, USA, 14–19 June 2020, pp. 3008–3017. Computer Vision Foundation/IEEE (2020). https://doi.org/10.1109/ CVPRW50498.2020.00359. https://openaccess.thecvf.com/content_CVPRW_2020/html/ w40/Cubuk_Randaugment_Practical_Automated_Data_Augmentation_With_a_Reduced_ Search_Space_CVPRW_2020_paper.html 6. Deng, J., Dong, W., Socher, R., Li, L.J., Li, K., Fei-Fei, L.: ImageNet: a large-scale hierarchical image database. In: CVPR 2009 (2009) 7. Ding, H., Jiang, X., Liu, A.Q., Magnenat-Thalmann, N., Wang, G.: Boundary-aware feature propagation for scene segmentation. In: 2019 IEEE/CVF International Conference on Computer Vision, ICCV 2019, Seoul, South Korea, 27 October–2 November 2019, pp. 6818– 6828. IEEE (2019). https://doi.org/10.1109/ICCV.2019.00692 8. Dosovitskiy, A., et al.: An image is worth 16×16 words: transformers for image recognition at scale. In: 9th International Conference on Learning Representations, ICLR 2021, Virtual Event, Austria, 3–7 May 2021. OpenReview.net (2021). https://openreview.net/forum? id=YicbFdNTTy 9. Du, Y., Yuan, C., Li, B., Zhao, L., Li, Y., Hu, W.: Interaction-aware spatio-temporal pyramid attention networks for action classification. CoRR abs/1808.01106 (2018). http://arxiv.org/ abs/1808.01106 10. Gilmer, J., Schoenholz, S.S., Riley, P.F., Vinyals, O., Dahl, G.E.: Neural message passing for quantum chemistry. In: International Conference on Machine Learning, pp. 1263–1272. PMLR (2017) 11. Han, K., Wang, Y., Guo, J., Tang, Y., Wu, E.: Vision GNN: an image is worth graph of nodes. CoRR abs/2206.00272 (2022). https://doi.org/10.48550/arXiv.2206.00272 12. He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: 2016 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2016, Las Vegas, NV, USA, 27–30 June 2016, pp. 770–778. IEEE Computer Society (2016). https://doi.org/ 10.1109/CVPR.2016.90 13. Hoffer, E., Ben-Nun, T., Hubara, I., Giladi, N., Hoefler, T., Soudry, D.: Augment your batch: improving generalization through instance repetition. In: 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2020, Seattle, WA, USA, 13–19 June 2020, pp. 8126–8135. Computer Vision Foundation/IEEE (2020). https://doi.org/10. 1109/CVPR42600.2020.00815. https://openaccess.thecvf.com/content_CVPR_2020/html/ Hoffer_Augment_Your_Batch_Improving_Generalization_Through_Instance_Repetition_ CVPR_2020_paper.html 14. Huang, G., Sun, Yu., Liu, Z., Sedra, D., Weinberger, K.Q.: Deep networks with stochastic depth. In: Leibe, B., Matas, J., Sebe, N., Welling, M. (eds.) ECCV 2016, Part IV. LNCS, vol. 9908, pp. 646–661. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-46493-0_39 15. Jain, A., Zamir, A.R., Savarese, S., Saxena, A.: Structural-RNN: deep learning on spatiotemporal graphs. In: 2016 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2016, Las Vegas, NV, USA, 27–30 June 2016, pp. 5308–5317. IEEE Computer Society (2016). https://doi.org/10.1109/CVPR.2016.573 16. Jin, Y., Li, J., Lian, Z., Jiao, C., Hu, X.: Supporting medical relation extraction via causalitypruned semantic dependency forest. 
In: Proceedings of the 29th International Conference on Computational Linguistics, COLING 2022, Gyeongju, Republic of Korea, 12–17 October 2022, pp. 2450–2460. International Committee on Computational Linguistics (2022)


17. Kipf, T.N., Welling, M.: Semi-supervised classification with graph convolutional networks. In: 5th International Conference on Learning Representations, ICLR 2017, Conference Track Proceedings, Toulon, France, 24–26 April 2017. OpenReview.net (2017). https://openreview. net/forum?id=SJU4ayYgl 18. Landrieu, L., Simonovsky, M.: Large-scale point cloud semantic segmentation with superpoint graphs. In: 2018 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2018, Salt Lake City, UT, USA, 18–22 June 2018, pp. 4558–4567. Computer Vision Foundation/IEEE Computer Society (2018). https://doi.org/10.1109/CVPR. 2018.00479. http://openaccess.thecvf.com/content_cvpr_2018/html/Landrieu_Large-Scale_ Point_Cloud_CVPR_2018_paper.html 19. LeCun, Y., Bottou, L., Bengio, Y., Haffner, P.: Gradient-based learning applied to document recognition. Proc. IEEE 86(11), 2278–2324 (1998) 20. Li, G., Müller, M., Thabet, A.K., Ghanem, B.: DeepGCNs: can GCNs go as deep as CNNs? In: 2019 IEEE/CVF International Conference on Computer Vision, ICCV 2019, Seoul, South Korea, 27 October–2 November 2019, pp. 9266–9275. IEEE (2019). https://doi.org/10.1109/ ICCV.2019.00936 21. Liu, Z., et al.: Swin transformer: hierarchical vision transformer using shifted windows. CoRR abs/2103.14030 (2021). https://arxiv.org/abs/2103.14030 22. Liu, Z., Mao, H., Wu, C., Feichtenhofer, C., Darrell, T., Xie, S.: A ConvNet for the 2020s. In: IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2022, New Orleans, LA, USA, 18–24 June 2022, pp. 11966–11976. IEEE (2022). https://doi.org/10. 1109/CVPR52688.2022.01167 23. Long, J., Shelhamer, E., Darrell, T.: Fully convolutional networks for semantic segmentation. CoRR abs/1411.4038 (2014). http://arxiv.org/abs/1411.4038 24. Michaelis, C., et al.: Benchmarking robustness in object detection: autonomous driving when winter is coming. arXiv preprint arXiv:1907.07484 (2019) 25. Prado-Romero, M.A., Prenkaj, B., Stilo, G., Giannotti, F.: A survey on graph counterfactual explanations: definitions, methods, evaluation, and research challenges. ACM Comput. Surv. (2023). https://doi.org/10.1145/3618105 26. Qasim, S.R., Kieseler, J., Iiyama, Y., Pierini, M.: Learning representations of irregular particle-detector geometry with distance-weighted graph networks. Eur. Phys. J. C 79(7), 1–11 (2019) 27. Qi, X., Liao, R., Jia, J., Fidler, S., Urtasun, R.: 3D graph neural networks for RGBD semantic segmentation. In: IEEE International Conference on Computer Vision, ICCV 2017, Venice, Italy, 22–29 October 2017, pp. 5209–5218. IEEE Computer Society (2017). https://doi.org/ 10.1109/ICCV.2017.556 28. Raghu, M., Unterthiner, T., Kornblith, S., Zhang, C., Dosovitskiy, A.: Do vision transformers see like convolutional neural networks? CoRR abs/2108.08810 (2021). https://arxiv.org/abs/ 2108.08810 29. Sakaridis, C., Dai, D., Van Gool, L.: Semantic foggy scene understanding with synthetic data. Int. J. Comput. Vis. 126(9), 973–992 (2018). https://doi.org/10.1007/s11263-018-1072-8 30. Touvron, H., Cord, M., Douze, M., Massa, F., Sablayrolles, A., Jégou, H.: Training dataefficient image transformers & distillation through attention. CoRR abs/2012.12877 (2020). https://arxiv.org/abs/2012.12877 31. Touvron, H., Cord, M., Douze, M., Massa, F., Sablayrolles, A., Jegou, H.: Training dataefficient image transformers & distillation through attention. In: International Conference on Machine Learning, July 2021, vol. 139, pp. 10347–10357 (2021) 32. Vaswani, A., et al.: Attention is all you need. 
In: Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, Long Beach, CA, USA, December 2017, pp. 4–9, pp. 5998–6008 (2017). https://proceedings. neurips.cc/paper/2017/hash/3f5ee243547dee91fbd053c1c4a845aa-Abstract.html


33. Veliˇckovi´c, P., Cucurull, G., Casanova, A., Romero, A., Lio, P., Bengio, Y.: Graph attention networks. arXiv preprint arXiv:1710.10903 (2017) 34. Wang, Y., Sun, Y., Liu, Z., Sarma, S.E., Bronstein, M.M., Solomon, J.M.: Dynamic graph CNN for learning on point clouds. ACM Trans. Graph. 38(5), 146:1–146:12 (2019). https:// doi.org/10.1145/3326362 35. Xiao, T., Liu, Y., Zhou, B., Jiang, Y., Sun, J.: Unified perceptual parsing for scene understanding. CoRR abs/1807.10221 (2018). http://arxiv.org/abs/1807.10221 36. Xu, K., Hu, W., Leskovec, J., Jegelka, S.: How powerful are graph neural networks? In: International Conference on Learning Representations (2018) 37. Ying, R., He, R., Chen, K., Eksombatchai, P., Hamilton, W.L., Leskovec, J.: Graph convolutional neural networks for web-scale recommender systems. In: Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp. 974– 983 (2018) 38. Yu, F., Koltun, V.: Multi-scale context aggregation by dilated convolutions. In: Bengio, Y., LeCun, Y. (eds.) 4th International Conference on Learning Representations, ICLR 2016, Conference Track Proceedings, San Juan, Puerto Rico, 2–4 May 2016 (2016). http://arxiv. org/abs/1511.07122 39. Yun, S., Han, D., Chun, S., Oh, S.J., Yoo, Y., Choe, J.: CutMix: regularization strategy to train strong classifiers with localizable features. In: 2019 IEEE/CVF International Conference on Computer Vision, ICCV 2019, Seoul, South Korea, 27 October–2 November 2019, pp. 6022– 6031. IEEE (2019). https://doi.org/10.1109/ICCV.2019.00612 40. Zhang, H., Cissé, M., Dauphin, Y.N., Lopez-Paz, D.: mixup: beyond empirical risk minimization. In: 6th International Conference on Learning Representations, ICLR 2018, Conference Track Proceedings, Vancouver, BC, Canada, 30 April–3 May 2018. OpenReview.net (2018). https://openreview.net/forum?id=r1Ddp1-Rb 41. Zhao, H., Shi, J., Qi, X., Wang, X., Jia, J.: Pyramid scene parsing network. In: 2017 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2017, Honolulu, HI, USA, 21–26 July 2017, pp. 6230–6239. IEEE Computer Society (2017). https://doi.org/10.1109/ CVPR.2017.660 42. Zhou, B., Zhao, H., Puig, X., Fidler, S., Barriuso, A., Torralba, A.: Scene parsing through ADE20k dataset. In: 2017 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2017, Honolulu, HI, USA, 21–26 July 2017, pp. 5122–5130. IEEE Computer Society (2017). https://doi.org/10.1109/CVPR.2017.544

Transductive Cross-Lingual Scene-Text Visual Question Answering

Lin Li1(B), Haohan Zhang1, Zeqin Fang1, Zhongwei Xie2, and Jianquan Liu3

1 Wuhan University of Technology, Wuhan, China
{cathylilin,zhaha,zeqinfang}@whut.edu.cn
2 Huawei Technologies Co., Ltd., Shenzhen, China
[email protected]
3 NEC Corporation, Tsukuba, Japan
[email protected]

Abstract. Multilingual modeling has gained increasing attention in recent years, as cross-lingual Text-based Visual Question Answering (TextVQA) is required to understand questions and answers across different languages. Current research mainly works on multimodal information, assuming that multilingual pretrained models are effective for encoding questions. However, the semantic comprehension of a text-based question varies between languages, creating challenges in directly deducing its answer from an image. To this end, we propose a novel multilingual text-based VQA framework suited for cross-language scenarios (CLVQA), transductively considering multiple answer-generating interactions with questions. First, a question reading module densely connects encoding layers in a feedforward manner, which can adaptively work together with answering. Second, a multimodal OCR-based module decouples OCR features in an image into visual, linguistic, and holistic parts to facilitate the localization of a target-language answer. By incorporating enhancements from the above two input encoding modules, the proposed framework outputs its answer candidates mainly from the input image with an object detection module. Finally, a transductive answering module jointly understands the input multimodal information and the identified answer candidates at the multilingual level, autoregressively generating cross-lingual answers. Extensive experiments show that our framework outperforms state-of-the-art methods for both cross-lingual (English–Chinese) and mono-lingual (English–English and Chinese–Chinese) tasks in terms of accuracy-based metrics. Moreover, significant improvements are achieved in zero-shot cross-lingual settings (French–Chinese).

Keywords: Cross-Lingual Scenario · Multimodality · TextVQA

This work is partially supported by NSFC, China (No. 62276196).

1 Introduction

It is widely recognized that many images contain rich text information, such as product descriptions and advertising images. Texts in such images generally convey valuable information and are thus of critical importance in visual understanding tasks such as Visual Question Answering (VQA) [1]. VQA requires analyzing both natural language questions and the visual content of images and answering text questions based on images; it is a comprehensive problem involving natural language processing and computer vision. Existing VQA methods [2–6] tend to capture the relationships between visual concepts directly through sophisticated visual attention mechanisms. These methods pay little attention to reading the text in images and suffer performance degradation when answering text-based visual questions (TextVQA) [7]. Compared with ordinary VQA, TextVQA [7–10] is more practical because of its ability to help visually impaired users better identify information about the surrounding physical world, such as time, date, temperature, brand name, etc.

Fig. 1. Our work includes both mono-lingual and cross-lingual tasks. The former represents question-answer pairs in the same language (Chinese and English in our case, samples a and d). In the latter, questions and answers are not given in the same language (samples b and c). Examples are from the EST-VQA dataset.

With the internationalization of human activities, multi-language TextVQA can well meet people's needs in various scenarios, for example, the Be My Eyes app. As shown in Fig. 1, there are mono-lingual and cross-lingual cases. (1) When questions are given in a specific language, the TextVQA model can provide answers in the same language as the given question. Examples a and d show that we can ask questions based on images in English or Chinese, and the TextVQA


model should give predicted answers in the same language, which represents mono-lingual TextVQA. In these cases, question-answer pairs can cover multiple languages. (2) On the other hand, the asked question and the given answer may not be in the same language: in example b, the question is in Chinese, "绿色标志上写的是什么?" (What does the green sign say?). Since the answer is given in English, it belongs to cross-lingual TextVQA. Based on the languages used in questions and answers, recent multi-lingual TextVQA methods are classified into three types. (1) Monolingual task with one fixed language: this includes LoRRA [7], RuArt [11], MM-GNN [12], CRN [13], SMA [9], M4C [10], LaAPNet [14], LaTr [15], KTVQA [16], and TAP [17], etc. In terms of text encoding, pre-trained models such as BERT [18], Faster R-CNN [19], and T5 [20] are commonly used. (2) Monolingual task covering several languages: [21] is the first attempt at multilingual TextVQA, which translates the questions in the ST-VQA dataset to 3 languages (Catalan, Spanish, and Chinese). The latest attempt at the multilingual task is MUST-VQA [22], which translates all the questions in the ST-VQA and TextVQA datasets to 5 languages with 3 scripts, namely Spanish, Catalan, Chinese, Italian, and Greek. Compared with monolingual tasks, multilingual tasks usually use state-of-the-art multilingual pre-trained models, such as mBERT [23], XLM-R [24], mT5 [25], etc. Current research mainly works on multimodal information, assuming that those multilingual pretrained models are effective for encoding questions. (3) Cross-lingual task: current multilingual TextVQA like [22] translates questions into six languages, and the target language is only English. Therefore, it is not a true many-to-many cross-lingual evaluation.

In summary, current research utilizes multilingual pretrained models to learn the question embeddings of different languages while mainly focusing on multimodalities. However, the process of semantically understanding a text-based question varies depending on the language used, which creates challenges when attempting to directly infer its answer from an image. In cross-language TextVQA, questions are stated in the source language, and answers in the target language are expected to come from the accompanying image. In some cases, the question can be understood in one go, while others require multiple interactions with the question to fully comprehend it. As shown in Fig. 1(c), TextVQA directly understands the "height limit" and finds the corresponding Chinese "限制高度" (height limit). On the other hand, in Fig. 1(b), TextVQA's initial focus is "写的什么" (what is written), and after understanding the question again, it will find that "绿色标志上" (on the green sign) is equally important. Thus, aligning their semantic information and interacting with the question multiple times becomes crucial for obtaining accurate answers. To address this, we propose a cross-lingual scene-text VQA framework called CLVQA with Question Reading, Multimodal OCR, Object Detection, and Transductive Answering modules. The Question Reading module pays attention to the rich language information at different linguistic levels given by a text question, which can adaptively work together with answering. The Multimodal OCR module first decouples the OCR modal features in the corresponding image and


divides them into three parts: visual, textual, and holistic features. Moreover, redundant and irrelevant features are softly filtered through the cross-modal attention module. The Object Detection module is used to extract visual object features. Then, the three modality features are trained by a Transformer to learn a joint semantic embedding. The Transductive Answering module utilizes a mixture of the answer vocabulary and OCR tokens to obtain answer candidates, which works in transductive conjunction with the input multimodal features during iterative autoregressive decoding to generate a refined answer. This approach helps to enhance the mutual understanding of questions and answers at the cross-lingual level, resulting in effective decoded answers. Our contributions are summarized as follows: 1) Our proposed CLVQA framework can handle cross-language scenarios, consisting of Question Reading, Multimodal OCR, Object Detection, and Transductive Answering modules. 2) The Transductive Answering module is capable of simultaneously learning multimodal and multi-lingual information while jointly understanding the question and its answer at the cross-lingual level. 3) To the best of our knowledge, this is the first empirical analysis conducted on datasets with different target languages, i.e., English–Chinese and English–French, and our proposed framework outperforms baselines in terms of Accuracy and ANLS in both supervised and zero-shot settings.

Fig. 2. Our framework for Cross-Lingual Scene-Text VQA (CLVQA).

2 Our Framework

As shown in Fig. 2, our proposed framework is designed for cross-lingual text-based visual question answering. In the remainder of this paper, all W are learned linear transformations, with different symbols denoting independent parameters, e.g., W_fr; LN is Layer Normalization [26]; ◦ represents an element-wise product.

2.1 The Overview of Our Framework

In Fig. 2, given a textual question in a source language and an image whose OCR text is in a target language, feature representations are extracted from three modalities, i.e., the question text (upper left of Fig. 2), the visual objects in the image (bottom left of Fig. 2), and the text tokens identified from the image by OCR (middle left of Fig. 2). These three modalities are represented respectively as a list of question word embeddings, a list of visual object features from an off-the-shelf object detector, and a list of OCR token features based on an external OCR system. Question Reading and Multimodal OCR are part of our concerns since they contain rich linguistic information. For question words, we dynamically weight the outputs of the 12 transformer encoder layers, where the weight of each layer is a learnable parameter. In this way, it is possible to obtain a question embedding containing different levels of semantic information for each question token. The identified OCR features, with rich modality representations, are decoupled into three parts: a linguistic, a visual, and a holistic part. Our main concern is the transductive answering module, which predicts the answer through iterative decoding (right part of Fig. 2). During decoding, it feeds in the previous output to predict the next answer component in an autoregressive manner. At each step, it either locates an OCR token from the image or selects a word from its answer vocabulary, which allows it to understand the input information and answers at the multilingual level.

2.2 Inputs of Cross-Lingual Text-VQA

Our inputs are from three modalities: question words, visual objects, and OCR tokens. The feature representations for each modality are extracted and projected into a d-dimensional semantic space through domain-specific embedding approaches as follows. Using a multilingual pretrained model, the sequence of K words is embedded into the corresponding sequence of d-dimensional feature vectors {x_k^{qw}} (where k = 1, ..., K). Since the parameters of a multilingual pretrained model are usually large in number, all of its layers are frozen and only the parameters of the weight-fusion layer are fine-tuned during training. To extract a set of M visual objects from an image, a pretrained detector called Faster R-CNN is used, from which we obtain the visual feature x_m^{fr} and the location feature x_m^{b} of each object. The final object embedding x_m^{oi} is obtained by projecting the visual and location features into a d-dimensional space with two learned linear layers and then summing them up:

x_m^{oi} = LN(W_1 x_m^{fr}) + LN(W_2 x_m^{b}).   (1)

We categorize the features of the N OCR tokens into two types: visual and linguistic. We also include an external OCR cross-modal pretrained embedding as an additional feature to aid in recognition.


$x_i^{o,l}$ is obtained by projecting the pretrained word embedding and the character-level pyramidal histogram feature [27] with a linear layer. An off-the-shelf visual encoder such as VIT or VGG-16 can be used to extract object features and spatial features; these are merged into the final visual feature $x_i^{o,v}$ using a linear layer. In our approach, we further introduce the Recog-CNN [28] feature $x_i^{o,rg}$ as a kind of holistic feature to enhance text and visual representation learning. Finally, the Recog-CNN feature and the OCR visual and linguistic features enter the attention-based multimodal multilingual fusion encoder; please refer to [29] for details.
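As an illustration of the object embedding in Eq. (1), the following PyTorch sketch projects Faster R-CNN appearance features and box-coordinate features into a shared space and sums them; the feature dimensions (2048-d fc7 features, 4-d box coordinates, d = 768) are assumptions for illustration only.

```python
import torch
import torch.nn as nn

class ObjectEmbedding(nn.Module):
    """Project appearance features and box features into d dimensions
    with two linear layers plus LayerNorm, then sum them (Eq. (1))."""
    def __init__(self, appearance_dim=2048, box_dim=4, d=768):
        super().__init__()
        self.w1 = nn.Linear(appearance_dim, d)   # W1 for x^{fr}_m
        self.w2 = nn.Linear(box_dim, d)          # W2 for x^{b}_m
        self.ln1 = nn.LayerNorm(d)
        self.ln2 = nn.LayerNorm(d)

    def forward(self, x_fr, x_b):
        # x_fr: (M, appearance_dim), x_b: (M, box_dim) -> (M, d)
        return self.ln1(self.w1(x_fr)) + self.ln2(self.w2(x_b))

# usage with 100 detected objects
emb = ObjectEmbedding()
x_oi = emb(torch.randn(100, 2048), torch.rand(100, 4))
```

The OCR linguistic and visual embeddings can be built analogously with their own linear projections before entering the fusion encoder.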

2.3 Question Reading

Question Reading in Source Language. Multilingual pretrained models encode rich linguistic information at different levels: surface features in the bottom layers, syntactic features in the middle layers, and semantic features in the higher layers. Given a question, different languages have their own linguistic usage; some may rely more on the lower-level information and others on the higher-level semantics. Thus our framework adopts multi-layer connections to capture different levels of semantic information [30,31] by using different hierarchical features (12 layers in our case). As shown in Fig. 3 and Eq. 2, in this way the hierarchical information of the pretrained model can be better utilized, and the information beneficial to the current text can be mined. unit = 20 means that a question contains up to 20 tokens, $\partial_i$ represents the output of a certain encoder layer, $W_k^h$ is the weight score of $\partial_i$, k indexes the k-th token in the question, and h is the current encoder layer. The objective function of our framework is based on the cross-entropy defined in Eq. 3.

$x_k^{ques} = \mathrm{Dense}_{unit=20}\left(\sum_{i=1}^{n} \partial_i \cdot W_k^h\right)$    (2)
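A minimal sketch of this weighted layer fusion is given below, assuming 12 encoder layers, a 768-d hidden size, softmax-normalized layer weights, and a simple linear projection in place of the 20-unit Dense layer (the exact role of that layer is not fully specified here, so the sketch simply caps the question at 20 tokens).

```python
import torch
import torch.nn as nn

class WeightedLayerFusion(nn.Module):
    """Fuse the hidden states of all encoder layers with learnable per-layer weights."""
    def __init__(self, num_layers=12, hidden_dim=768, max_tokens=20):
        super().__init__()
        self.layer_weights = nn.Parameter(torch.zeros(num_layers))  # one learnable weight per layer
        self.proj = nn.Linear(hidden_dim, hidden_dim)
        self.max_tokens = max_tokens

    def forward(self, layer_states):
        # layer_states: (num_layers, batch, seq_len, hidden_dim)
        w = torch.softmax(self.layer_weights, dim=0)
        fused = (w.view(-1, 1, 1, 1) * layer_states).sum(0)   # weighted sum over layers
        return self.proj(fused[:, : self.max_tokens])          # keep up to 20 question tokens

# usage: hidden states from a frozen 12-layer multilingual encoder (e.g., XLM-R)
states = torch.randn(12, 2, 20, 768)
q_emb = WeightedLayerFusion()(states)   # (2, 20, 768)
```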

Fig. 3. Question Reading Network

The above outputs the d-dimensional embeddings of question words, visual objects, and OCR tokens for each modality. A multi-layer Transformer encoder


is applied to fuse the three modalities, and the L stacked layers output the final embeddings of the K + M + N entities. With the multi-head self-attention mechanism in the Transformer, each entity can jointly attend to all other entities freely, regardless of its modality. This enables us to model both inter-modal and intra-modal relationships uniformly with the same multi-layer parameters. The output of our multimodal encoder is a list of d-dimensional feature vectors for the entities in each modality.
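A minimal sketch of this homogeneous fusion step with PyTorch's standard Transformer encoder follows (L = 4 layers and 12 heads as in the implementation details); the entity counts and hidden size are placeholder assumptions.

```python
import torch
import torch.nn as nn

d = 768
layer = nn.TransformerEncoderLayer(d_model=d, nhead=12, batch_first=True)
fusion = nn.TransformerEncoder(layer, num_layers=4)

ques = torch.randn(2, 20, d)    # K question token embeddings
objs = torch.randn(2, 100, d)   # M detected object embeddings
ocr = torch.randn(2, 50, d)     # N OCR token embeddings

entities = torch.cat([ques, objs, ocr], dim=1)   # (batch, K+M+N, d)
fused = fusion(entities)                          # every entity attends to all others
```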

2.4 Transductive Answering

During prediction, the argmax is taken over the concatenation $y_t^{all} = [y_t^{voc}; y_t^{ocr}]$ of the fixed answer vocabulary scores and the dynamic OCR-copying scores, selecting the top-scoring element (either a vocabulary word or an OCR token) from all V + N candidates. In our iterative autoregressive decoding procedure, if the prediction at decoding time-step t is an OCR token, its OCR representation $x_n^{ocr}$ is fed as the Transformer input $x_{t+1}^{dec}$ to the next prediction step t + 1. Otherwise, if the previous prediction is a word from the fixed answer vocabulary, its corresponding weight vector $w_i^{voc}$ is the next step's input $x_{t+1}^{dec}$. In addition, two extra d-dimensional vectors are added to the inputs: a positional embedding vector corresponding to step t, and a type embedding vector indicating whether the previous prediction is a fixed vocabulary word or an OCR token. Similar to machine translation, our answer vocabulary is augmented with two special tokens, <begin> and <end>. Here <begin> is used as the input to the first decoding step, and the decoding process is stopped after <end> is predicted. To ensure causality in answer decoding, the attention weights are masked in the self-attention layers of the Transformer architecture such that question words, detected objects and OCR tokens cannot attend to any decoding steps. All decoding steps can only attend to previous decoding steps in addition to question words, detected objects and OCR tokens. In addition, a random probability is adopted during training: when it is greater than 0.5, the input of the next step is kept unchanged (the model's own prediction); when it is less than 0.5, the token representation of the corresponding position of the ground truth is used as the input of the next step instead.
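The decoding step described above can be sketched as follows; the tensor shapes, function names, and the greedy argmax formulation are assumptions of this illustration rather than the authors' exact implementation.

```python
import torch

def decode_step(y_voc, y_ocr, ocr_feats, voc_weights):
    """One greedy step: pick the best candidate among the V vocabulary words
    and N OCR tokens, and return the embedding fed back at step t+1."""
    y_all = torch.cat([y_voc, y_ocr], dim=-1)        # (V + N,)
    idx = int(torch.argmax(y_all))
    V = y_voc.size(-1)
    if idx < V:
        next_input = voc_weights[idx]                # weight vector w_i^{voc}
    else:
        next_input = ocr_feats[idx - V]              # OCR representation x_n^{ocr}
    return idx, next_input

# scheduled-sampling style choice of the next input during training (probability 0.5)
def next_decoder_input(pred_input, gt_input):
    return pred_input if torch.rand(1).item() > 0.5 else gt_input
```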

2.5 Training Loss

Given that an answer word can appear in both the fixed answer vocabulary and the OCR tokens, a multi-label sigmoid loss (instead of a softmax loss) is applied, as defined in Eq. 3, where $y_{pred}$ is the prediction and $y_{gt}$ is the ground-truth target.

$L_{bce} = -y_{gt}\log(\mathrm{sig}(y_{pred})) - (1 - y_{gt})\log(1 - \mathrm{sig}(y_{pred}))$    (3)
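Eq. (3) corresponds to a standard multi-label binary cross-entropy over sigmoid scores, as in the minimal sketch below; the vocabulary and OCR sizes are placeholders.

```python
import torch
import torch.nn.functional as F

def multilabel_bce(y_pred, y_gt):
    """Binary cross-entropy over sigmoid scores (Eq. (3)); the multi-hot target
    can mark an answer word in both the fixed vocabulary and the OCR tokens."""
    return F.binary_cross_entropy_with_logits(y_pred, y_gt)

# example: V + N candidate scores with a multi-hot target
scores = torch.randn(4, 5000 + 50)
target = torch.zeros_like(scores)
target[:, 17] = 1.0      # the same answer may also be marked among the OCR tokens
loss = multilabel_bce(scores, target)
```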

3 Experiments

3.1 EST-VQA Dataset and Baselines

Dataset. To the best of our knowledge, EST-VQA, proposed in 2020, is the first bilingual scene-text VQA dataset [32] covering English–Chinese, where the target languages differ. Further discussion of other related datasets, most of which have one fixed target language, is given in Sect. 4.2. As shown in Table 1, the training set of this dataset contains 17102 pictures, each picture corresponding to one question, of which 9515 are asked in Chinese and 7532 in English. The test set contains 4000 pictures, of which 2530 are asked in Chinese.

Table 1. Statistics of EST-VQA dataset.

Set     English               Chinese               All
        #Image   #Question    #Image   #Question    #Image   #Question
Train   6,500    6,500        8,000    8,000        14,500   14,500
Val     500      500          750      750          1,250    1,250
Test    587      587          765      765          1,352    1,352

Zero-Shot. To verify the language adaptation of our trained framework, we add a set of Chinese–French question answering pairs to the test set for the zero-shot setting of cross-language Text-VQA. Following [22] and [33], the Baidu translation API (https://api.fanyi.baidu.com) is used to translate the English questions and answers in EST-VQA into French questions and answers.

Evaluation Methodology. Since the original test set of EST-VQA is not public now, the original training set is randomly divided into our training set, validation set and test set in a ratio of 8:1:1, following EST-VQA's official splitting. The EST-VQA dataset contains multilingual question-answer pairs, with human-written questions that require reasoning about the text in the image. Each question in the dataset has 10 human-annotated answers. Following previous works [7,9,10,22,29], the main metric in our experiments is Accuracy, which counts the proportion of successful matches with the real answers; final accuracy is measured by averaging over the 10 answers. At the same time, an edit-distance-based metric is also considered, i.e., ANLS, which is more relaxed than Accuracy.

Baselines. LORRA [7], SMA [9] and M4C [10] are recent works that deal well with monolingual tasks. Their question encoders are replaced by the multilingual mBERT [23] when reproducing their overall frameworks. The latest multi-lingual works include MUST-VQA [22] and AttText-VQA [29]. The former tests multiple Text-VQA models, among which the best is LaTr [25], which replaces the T5 language model with the mT5 language model


to better adapt to multilingual tasks. The latter uses mBERT [23] as its language model. For fair comparisons, AttText-VQA [29] is reproduced as our baseline.

3.2 Implementation Details

Our framework is implemented in PyTorch. For the visual modality, the settings of M4C are used, where the fc7 features of the 100 top-scoring objects in the image are extracted by a Faster R-CNN [19] pretrained on the Visual Genome dataset; the fc7 layer weights are fine-tuned during training. For the text modality, we use two iFLYTEK OCR systems (https://www.xfyun.cn/services/common-ocr) to recognize scene text. A trainable 12-layer mBERT or XLM-R [24] is used for text representation. Pyramidal histogram of characters (PHOC) representations [34] are applied to the OCR text. VIT [35] extracts visual features of OCR tokens and Recog-CNN [28] extracts holistic features of OCR tokens. Adam is used as the optimizer, with a learning rate of 1e-4 and an epsilon value of 1e-8. For the training strategy, L2 regularization, warmup learning, gradient clipping, early stopping, learning rate decay, dropout, etc. are used in our experiments. L = 4 layers of multimodal transformer with 12 attention heads are used as our encoder. As for the decoder, a maximum decoding stride of T = 12 is adopted, since it is sufficient to cover almost all answers unless otherwise stated.
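As a rough illustration of this training setup, the PyTorch sketch below wires up Adam with the stated learning rate and epsilon together with warmup and gradient clipping; the warmup length, clipping threshold, and weight-decay strength are assumptions not given above.

```python
import torch

model = torch.nn.Linear(768, 768)   # stand-in for the full framework
optimizer = torch.optim.Adam(model.parameters(), lr=1e-4, eps=1e-8, weight_decay=1e-5)
# 1,000 warmup steps is an assumption; the text only states that warmup is used
scheduler = torch.optim.lr_scheduler.LambdaLR(optimizer, lambda s: min(1.0, (s + 1) / 1000))

def training_step(loss):
    optimizer.zero_grad()
    loss.backward()
    torch.nn.utils.clip_grad_norm_(model.parameters(), max_norm=0.25)  # clipping threshold assumed
    optimizer.step()
    scheduler.step()

training_step(model(torch.randn(2, 768)).sum())
```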

3.3 Overall Performance

The overall experimental results are reported in Table 2, where * indicates that modality feature processing is followed by self-attention on each of the three modalities of question, detected objects, and OCR. In addition, for fair comparisons, both the baselines and our framework use XLM-R for multilingual text encoding.

Table 2. Supervised setting covering Chinese–English.

#  Method        Acc on val  Acc on test  ANLS on val  ANLS on test
1  LORRA(*)      0.3842      0.3906       0.4792       0.4876
2  SMA(*)        0.4092      0.4168       0.4961       0.5034
3  M4C(*)        0.4048      0.4122       0.5014       0.4985
4  AttText-VQA   0.4326      0.4332       0.5368       0.5294
5  Ours          0.4614      0.4628       0.5708       0.5652

Experimental results show that our proposed framework with question reading, multimodal OCR, object detection and transductive answering significantly outperforms prior works. Specifically, our framework achieves the highest performance on the test set, with an accuracy of 0.4628 and an ANLS value of 0.5652.


The first three experiments in Table 2 show that it is not effective to simply replace the monolingual pretrained embedding of the baselines with the multilingual one. On the test set, our answering accuracy is improved by nearly 12.27% compared to M4C [10], which was released as the starter code of the TextVQA Challenge 2020. M4C fuses different modalities homogeneously by embedding them into a common semantic space where self-attention is applied to model inter- and intra-modality context. In our reproduction, all entities of the three modalities are fused based on the Transformer, and the question encoder is replaced with XLM-R. Our framework is improved by 6.9% compared to the recent multi-lingual Text-VQA work AttText-VQA [29], which works with a Transformer using a set of cross-modal attentions. Under the ANLS metric, our framework is improved by 15.91%, 12.27%, 13.38%, and 6.76%, respectively, further highlighting its superiority. To verify the language adaptation of our trained framework, we add a set of Chinese–French question answering pairs to the test set for the zero-shot setting. The specific results are shown in Table 3.

Table 3. Zero-shot setting for Chinese–French.

#  Method        Acc on test (Fr)  ANLS on test (Fr)
1  LORRA(*)      0.3523            0.4398
2  SMA(*)        0.3877            0.4762
3  M4C(*)        0.3849            0.4826
4  AttText-VQA   0.4216            0.5134
5  Ours          0.4504            0.5428

In this scenario, the accuracy of our method is higher than that of all four baselines, and the ANLS performance is improved by 23.41%, 13.98%, 12.47%, and 5.72%, respectively, compared with them. At the same time, compared with Table 2, our proposed model suffers only a small performance loss, while the results of the baselines drop considerably. This shows that our proposed framework is more robust to Text-VQA problems in cross-lingual situations.

3.4 Multi-lingual and Multi-modal Embeddings

This part explains and analyzes the influence of different multilingual models and different visual feature extractors on our proposed framework. The experimental results are reported in Table 4 and Table 5. (1) Experimenting on two mainstream multilingual pre-training models, mBERT and XLM-R, we found that the XLM-R model is more suitable for this task and can better extract information from texts in different languages. As


listed in Table 4, XLM-R reaches an accuracy of 0.5440 on English→Chinese, i.e., a 4.4% improvement over mBERT. "Ours w/o" means that the different levels of question semantics are not jointly used with answer generation; from Table 4, its accuracy decreases from 0.5440 to 0.4482. This observation underscores the ability of our framework to perform well in cross-lingual textual question answering. (2) Advanced visual encoding models are effective at extracting features. As shown in Table 5, when our framework uses Faster R-CNN as the object detector to extract visual features and the VIT model to extract OCR visual features, the Accuracy and ANLS indicators are further improved on both the dev and test sets. Therefore, paying attention to new technical approaches and putting them into practice is an effective strategy.

Table 4. Multi-lingual pretrained models.

Type                Cross-lingual        Multilingual         All data
                    En→CH     CH→EN      CH→CH     En→En
Ours w/o (mBERT)    0.4208    0.4426     0.4149    0.4192     0.4282
Ours w/o (XLM-R)    0.4482    0.4680     0.4267    0.4236     0.4306
Ours (mBERT)        0.5212    0.5328     0.4354    0.4492     0.4572
Ours (XLM-R)        0.5440    0.5262     0.4396    0.4547     0.4628

Table 5. Visual feature extractors.

Method  Visual features  OCR visual features  Acc/% dev  Acc/% test  ANLS dev  ANLS test
Ours    Faster R-CNN     VGG16                46.14      46.28       0.5708    0.5652
Ours    Faster R-CNN     VIT                  46.28      46.75       0.5783    0.5726
Ours    VIT              VGG16                45.64      46.04       0.5592    0.5635
Ours    VIT              VIT                  45.92      45.36       0.5646    0.5598

4 Related Work

4.1 Multilingual Language Models

This paper works on cross-lingual information processing in the field of Text-VQA. Multilingual language models are fundamental to handling different languages. Common multilingual pre-training models include mBERT [23], LASER [36], MultiFiT [37], XLM-R [24], etc. In current cross-language tasks, mBERT and XLM-R are the mainstream methods with the best results. XLM-R is a scaled


cross-language sentence encoder trained on 2.5TB of corpus data in hundreds of languages. mBERT (Multilingual BERT), pre-trained on large Wikipedia corpora, is a multilingual extension of BERT that provides word and sentence representations for 104 languages. Both mBERT and XLM-R have been shown to cluster polysemous words into different regions of the embedding space.

4.2 Text-VQA Datasets

Although the importance of involving scene text in visual question answering tasks was originally emphasized by [38], due to the lack of available large-scale datasets, early development of question answering tasks related to embedded text understanding was limited to narrow domains such as bar charts or diagrams [39,40]. TextVQA [7] is the first large-scale open-domain dataset for the Text-VQA task, followed by ST-VQA [8] and OCR-VQA [41]. The work of [21] is the first attempt at multilingual Text-VQA, but the questions and answers of the above datasets are only in English, so they cannot be applied to our task. Moreover, MUST-VQA [22] uses the Google Translation API to translate the questions of ST-VQA and TextVQA into 6 languages as a multilingual scenario. Similar to MUST-VQA, xGQA [33] (not focusing on Text-VQA) claims to be a cross-language task, but the target language is fixed as English. Therefore, for our cross-lingual task, none of the above datasets really covered the cross-language setting until EST-VQA [32] appeared. It is the first public dataset for bilingual scene Text-VQA with English and Chinese text-VQA pairs.

4.3 Text-VQA Methods

Recent studies [7,9,10,17,42–44] have proposed several models and network architectures for the Text-VQA task. As previous approaches struggled to answer questions requiring reading text in images, LoRRA [7], built on Pythia [45], is the first work to study an OCR branch that encodes text information in images. MMGAN [43] constructs three sub-graphs for the visual, semantic and numeric modalities. M4C [10] fuses multimodal information from questions, visual objects and OCR tokens with a Transformer, and iteratively decodes answers with a dynamic pointer network. Since remarkable performance was achieved, M4C was released as the starter code of the TextVQA Challenge 2020. Subsequently, a series of M4C-derived approaches have been reported. For example, SMA [9] embeds a Graph Convolution Network (GCN) before the multimodal feature fusion module to encode object-object, object-text and text-text relations. LaAP-Net [14] takes the view that OCR tokens play a more important role than visual objects, so it only considers questions and OCR tokens when encoding and decoding multimodal information. On the other hand, SA-M4C [42] emphasizes spatial relations over semantic relations: it handles 12 types of spatial relations with the multiple heads of the Transformer and, meanwhile, intentionally suppresses the semantic relations of text and visual objects. The recent SSbaseline [46] has achieved state-of-the-art results by splitting OCR


token features into separate visual and linguistic attention branches with a simple attention mechanism. TAP [17] proposes to pretrain the model on several auxiliary tasks such as masked language modeling and relative position prediction.

5 Conclusions and Future Work

In this paper, we propose a transductive cross-lingual scene-text VQA framework with Question Reading, Multimodal OCR, Object Detection and Transductive Answering, which extends current state-of-the-art studies toward cross-lingual vision and language learning. A series of analytical and empirical comparisons is conducted to show its effectiveness. In the future, a better OCR system, an advanced language inference module, and a generator head could further improve the performance of cross-lingual Text-VQA. In summary, the most recent monolingual Text-VQA approaches are derived from M4C [10], and these approaches highlight the role of OCR blocks and spatial relations. Current studies pay less attention to the semantic understanding of questions in cross-lingual scenarios, assuming that multi-language pretrained models are good enough. Thanks to the release of EST-VQA [32], our idea can be evaluated to show its effectiveness and the importance of explicitly considering question reading and answer generation.

References 1. Kafle, K., Price, B., Cohen, S., Kanan, C.: Dvqa: understanding data visualizations via question answering. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 5648–5656 (2018) 2. Anderson, P., et al.: Bottom-up and top-down attention for image captioning and visual question answering. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 6077–6086 (2018) 3. Dong, X., Zhu, L., Zhang, D., Yang, Y., Wu, F.: Fast parameter adaptation for few-shot image captioning and visual question answering. In: Proceedings of the 26th ACM International Conference on Multimedia, pp. 54–62 (2018) 4. Hu, R., Rohrbach, A., Darrell, T., Saenko, K.: Language-conditioned graph networks for relational reasoning. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 10294–10303 (2019) 5. Liu, F., Liu, J., Hong, R., Lu, H.: Erasing-based attention learning for visual question answering. In: Proceedings of the 27th ACM International Conference on Multimedia, pp. 1175–1183 (2019) 6. Peng, L., Yang, Y., Wang, Z., Wu, X., Huang, Z.: Cra-net: composed relation attention network for visual question answering. In: Proceedings of the 27th ACM International Conference on Multimedia, pp. 1202–1210 (2019) 7. Singh, A., et al.: Towards vqa models that can read. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 8317– 8326 (2019) 8. Biten, A.F., et al.: Scene text visual question answering. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 4291–4301 (2019)


9. Gao, C., Zhu, Q., Wang, P., Li, H., Liu, Y., van den Hengel, A., Wu, Q.: Structured multimodal attentions for textvqa. CoRR abs/ arXiv: 2006.00753 (2020) 10. Hu, R., Singh, A., Darrell, T., Rohrbach, M.: Iterative answer prediction with pointer-augmented multimodal transformers for textvqa. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 9992– 10002 (2020) 11. Jin, Z., et al.: Ruart: a novel text-centered solution for text-based visual question answering. IEEE Trans. Multim. 25, 1–12 (2023). https://doi.org/10.1109/TMM. 2021.3120194 12. Gao, D., Li, K., Wang, R., Shan, S., Chen, X.: Multi-modal graph neural network for joint reasoning on vision and scene text. In: 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2020, Seattle, WA, USA, 13–19 June 2020, pp. 12743–12753. Computer Vision Foundation/IEEE (2020). https:// doi.org/10.1109/CVPR42600.2020.01276 13. Liu, F., Xu, G., Wu, Q., Du, Q., Jia, W., Tan, M.: Cascade reasoning network for text-based visual question answering. In: Chen, C.W., et al. (eds.) MM 2020: The 28th ACM International Conference on Multimedia, Virtual Event / Seattle, WA, USA, 12–16 October 2020, pp. 4060–4069. ACM (2020). https://doi.org/10.1145/ 3394171.3413924,https://doi.org/10.1145/3394171.3413924 14. Han, W., Huang, H., Han, T.: Finding the evidence: Localization-aware answer prediction for text visual question answering. arXiv preprint arXiv:2010.02582 (2020) 15. Biten, A.F., Litman, R., Xie, Y., Appalaraju, S., Manmatha, R.: Latr: layout-aware transformer for scene-text VQA. In: IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2022, New Orleans, LA, USA, 18–24 June 2022, pp. 16527–16537. IEEE (2022). https://doi.org/10.1109/CVPR52688.2022.01605, https://doi.org/10.1109/CVPR52688.2022.01605 16. Dey, A.U., Valveny, E., Harit, G.: External knowledge augmented text visual question answering. CoRR abs/ arXiv: 2108.09717 (2021) 17. Yang, Z., et al.: Tap: text-aware pre-training for text-vqa and text-caption. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 8751–8761 (2021) 18. Devlin, J., Chang, M., Lee, K., Toutanova, K.: BERT: pre-training of deep bidirectional transformers for language understanding. In: Burstein, J., Doran, C., Solorio, T. (eds.) Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL-HLT 2019, Minneapolis, MN, USA, 2–7 June 2019, Volume 1 (Long and Short Papers), pp. 4171–4186. Association for Computational Linguistics (2019). https://doi.org/10.18653/v1/n19-1423 19. Ren, S., He, K., Girshick, R.B., Sun, J.: Faster R-CNN: towards real-time object detection with region proposal networks. IEEE Trans. Pattern Anal. Mach. Intell. 39(6), 1137–1149 (2017). https://doi.org/10.1109/TPAMI.2016.2577031 20. Raffel, C., et al.: Exploring the limits of transfer learning with a unified text-totext transformer. J. Mach. Learn. Res. 21, 140:1–140:67 (2020). http://jmlr.org/ papers/v21/20-074.html 21. i Pujolràs, J.B., i Bigorda, L.G., Karatzas, D.: A multilingual approach to scene text visual question answering. In: Uchida, S., Smith, E.H.B., Eglin, V. (eds.) Document Analysis Systems - 15th IAPR International Workshop, DAS 2022, La Rochelle, France, 22–25 May 2022, Proceedings. LNCS, vol. 13237, pp. 65–79. Springer (2022). https://doi.org/10.1007/978-3-031-06555-2_5


22. Vivoli, E., Biten, A.F., Mafla, A., Karatzas, D., Gómez, L.: MUST-VQA: multilingual scene-text VQA. In: Karlinsky, L., Michaeli, T., Nishino, K. (eds.) Computer Vision - ECCV 2022 Workshops - Tel Aviv, Israel, 23–27 October 2022, Proceedings, Part IV. LNCS, vol. 13804, pp. 345–358. Springer (2022). https://doi.org/10. 1007/978-3-031-25069-9_23 23. Pires, T., Schlinger, E., Garrette, D.: How multilingual is multilingual bert? arXiv preprint arXiv:1906.01502 (2019) 24. Conneau, A., et al.: Unsupervised cross-lingual representation learning at scale. arXiv preprint arXiv:1911.02116 (2019) 25. Xue, L., et al.: mt5: a massively multilingual pre-trained text-to-text transformer. In: Toutanova, K. (eds.) Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL-HLT 2021, Online, June 6–11, 2021, pp. 483–498. Association for Computational Linguistics (2021). https://doi.org/10.18653/v1/2021.naacl-main. 41 26. Ba, J.L., Kiros, J.R., Hinton, G.E.: Layer normalization. arXiv preprint arXiv:1607.06450 (2016) 27. Ghosh, S.K., Valveny, E.: R-phoc: segmentation-free word spotting using cnn. In: 2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR), vol. 1, pp. 801–806. IEEE (2017) 28. Yang, L., Wang, P., Li, H., Li, Z., Zhang, Y.: A holistic representation guided attention network for scene text recognition. Neurocomputing 414, 67–75 (2020) 29. Fang, Z., Li, L., Xie, Z., Yuan, J.: Cross-modal attention networks with modality disentanglement for scene-text VQA. In: IEEE International Conference on Multimedia and Expo, ICME 2022, Taipei, Taiwan, 18–22 July 2022, pp. 1–6. IEEE (2022). https://doi.org/10.1109/ICME52920.2022.9859666 30. Huang, G., Liu, Z., Van Der Maaten, L., Weinberger, K.Q.: Densely connected convolutional networks. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 4700–4708 (2017) 31. Nai, P., Li, L., Tao, X.: A densely connected encoder stack approach for multi-type legal machine reading comprehension. In: Huang, Z., Beek, W., Wang, H., Zhou, R., Zhang, Y. (eds.) WISE 2020. LNCS, vol. 12343, pp. 167–181. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-62008-0_12 32. Wang, X., et al.: On the general value of evidence, and bilingual scene-text visual question answering. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 10126–10135 (2020) 33. Pfeiffer, J., etal.: xgqa: cross-lingual visual question answering. In: Muresan, S., Nakov, P., Villavicencio, A. (eds.) Findings of the Association for Computational Linguistics: ACL 2022, Dublin, Ireland, 22–27 May 2022, pp. 2497–2511. Association for Computational Linguistics (2022). https://doi.org/10.18653/v1/2022. findings-acl.196 34. Almazán, J., Gordo, A., Fornés, A., Valveny, E.: Word spotting and recognition with embedded attributes. IEEE Trans. Pattern Anal. Mach. Intell. 36(12), 2552– 2566 (2014) 35. Dosovitskiy, A., et al.: An image is worth 16x16 words: Transformers for image recognition at scale. In: 9th International Conference on Learning Representations, ICLR 2021, Virtual Event, Austria, 3–7 May 2021. OpenReview.net (2021). https://openreview.net/forum?id=YicbFdNTTy 36. Feng, F., Yang, Y., Cer, D., Arivazhagan, N., Wang, W.: Language-agnostic bert sentence embedding. arXiv preprint arXiv:2007.01852 (2020)


37. Eisenschlos, J.M., Ruder, S., Czapla, P., Kardas, M., Gugger, S., Howard, J.: Multifit: efficient multi-lingual language model fine-tuning. arXiv preprint arXiv:1909.04761 (2019) 38. Bigham, J.P., et al.: Vizwiz: nearly real-time answers to visual questions. In: Proceedings of the 23nd Annual ACM Symposium on User Interface Software and Technology, pp. 333–342 (2010) 39. Kembhavi, A., Salvato, M., Kolve, E., Seo, M., Hajishirzi, H., Farhadi, A.: A diagram is worth a dozen images. In: Leibe, B., Matas, J., Sebe, N., Welling, M. (eds.) ECCV 2016. LNCS, vol. 9908, pp. 235–251. Springer, Cham (2016). https://doi. org/10.1007/978-3-319-46493-0_15 40. Kembhavi, A., Seo, M., Schwenk, D., Choi, J., Farhadi, A., Hajishirzi, H.: Are you smarter than a sixth grader? textbook question answering for multimodal machine comprehension. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 4999–5007 (2017) 41. Mishra, A., Shekhar, S., Singh, A.K., Chakraborty, A.: Ocr-vqa: visual question answering by reading text in images. In: 2019 International Conference on Document Analysis and Recognition (ICDAR), pp. 947–952. IEEE (2019) 42. Kant, Y., et al.: Spatially aware multimodal transformers for TextVQA. In: Vedaldi, A., Bischof, H., Brox, T., Frahm, J.-M. (eds.) ECCV 2020. LNCS, vol. 12354, pp. 715–732. Springer, Cham (2020). https://doi.org/10.1007/978-3-03058545-7_41 43. Gao, D., Li, K., Wang, R., Shan, S., Chen, X.: Multi-modal graph neural network for joint reasoning on vision and scene text. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 12746–12756 (2020) 44. Liu, F., Xu, G., Wu, Q., Du, Q., Jia, W., Tan, M.: Cascade reasoning network for text-based visual question answering. In: Proceedings of the 28th ACM International Conference on Multimedia, pp. 4060–4069 (2020) 45. Singh, A., et al.: Pythia-a platform for vision & language research. In: SysML Workshop, NeurIPS, vol. 2018 (2018) 46. Zhu, Q., Gao, C., Wang, P., Wu, Q.: Simple is not easy: a simple strong baseline for textvqa and textcaps. In: Thirty-Fifth AAAI Conference on Artificial Intelligence, AAAI 2021, Thirty-Third Conference on Innovative Applications of Artificial Intelligence, IAAI 2021, The Eleventh Symposium on Educational Advances in Artificial Intelligence, EAAI 2021, Virtual Event, 2–9 February 2021, pp. 3608–3615. AAAI Press (2021). https://ojs.aaai.org/index.php/AAAI/article/view/16476

Learning Representations for Sparse Crowd Answers

Jiyi Li(B)

University of Yamanashi, Kofu, Japan
[email protected]

Abstract. When collecting answers from crowds, if there are many instances, each worker can only provide the answers to a small subset of the instances, and the instance-worker answer matrix is thus sparse. Solutions for improving the quality of crowd answers, such as answer aggregation, are usually proposed in an unsupervised fashion. In this paper, to enhance the quality of the crowd answers used for inferring true answers, we propose a solution in a self-supervised fashion to effectively learn the potential information in the sparse crowd answers. We propose a method named CrowdLR which first learns rich instance and worker representations from the crowd answers based on two types of self-supervised signals. We create a multi-task model with a Siamese structure to learn two classification tasks for the two self-supervised signals in one framework. We then utilize the learned representations to complete the answers, filling in the missing answers, and can apply answer aggregation methods to the completed answers. The experimental results based on real datasets show that our approach can effectively learn the representations from crowd answers and improve the performance of answer aggregation, especially when the crowd answers are sparse.

Keywords: Crowdsourcing · Answer Aggregation · Representation Learning

1 Introduction

Quality control is a crucial issue for the answers collected by crowdsourcing because the ability and diligence of crowd workers are diverse. A typical solution is building redundancy into the collected answers, i.e., asking multiple workers to assign answers to an instance and then aggregating the answers of the instances to improve the quality of the crowd answers and discover the truths. Because it is practical to assume that the ground truths are not available before collecting the crowd answers, many answer aggregation approaches have been proposed in an unsupervised fashion [2,4,16,25,29,30]. However, when there are many instances, each worker can only provide the answers to a small subset of the instances and many workers are required; the instance-worker answer matrix is usually sparse in such cases, which means that existing answer aggregation models may not be well learned. In this paper, to enhance the quality of the crowd answers used for inferring true answers, we propose a solution in a self-supervised fashion to effectively learn the


potential information in the sparse crowd answers. We then utilize the learned representations to fill in the missing answers. The answers with enhanced quality are then used by label aggregation methods, improving the performance of label aggregation. To learn rich instance and worker representations from the sparse crowd answers, we leverage two types of self-supervised signals in the crowd answers to capture the potential relations among instances and workers. We propose a multi-task self-supervised method based on a neural network to learn the representations, named CrowdLR (Learning Representations for CROWD answers). Self-supervised methods have recently attracted much attention for learning feature representations from data with unknown ground truths [3,8,9,26]. One popular type of self-supervised method is the task-related methods, which create supervised pretext tasks from the raw data, e.g., the context prediction tasks in computer vision [6] and natural language processing [5], and the rotation prediction tasks in computer vision [10].

Fig. 1. Our solution for quality control in crowdsourcing by self-supervised representation learning on sparse crowd answers. There are two self-supervised tasks and two quality control tasks.

In our work, we design two self-supervised tasks. One intuitive task is predicting the answer of a worker to an instance. To better understand the relations among the workers, instances, and answers in the collection, another task is predicting whether two workers assign the same answer to an instance, which captures additional instance-wise relation information among the workers. The two self-supervised tasks learn different and partial information. We thus propose a multi-task model with a Siamese structure to learn these two tasks in one framework. After pre-training the proposed self-supervised model and obtaining the representations, we can utilize them in tasks related to quality control in crowdsourcing, i.e., answer aggregation. We first perform answer completion, which predicts the answers of all workers to all instances and can thus fill in the missing answers of the instances that a worker did not annotate. It can naturally be implemented by the worker answer prediction module of the trained CrowdLR. After that, we can utilize various answer aggregation


approaches such as [2,4,29] on the complete answers for answer aggregation. In this paper, we utilize the Majority Voting (MV) method as the backbone answer aggregation method; it is one of the most typical answer aggregation methods and is often used by practitioners who do not specialize in crowdsourcing research and work on diverse practical scenarios. Figure 1 summarizes the framework of our solution for quality control. The contributions of this paper are as follows.
– We propose a method named CrowdLR to learn worker and instance representations with two self-supervised prediction tasks on the crowd answers.
– We propose a multi-task architecture with a Siamese structure and comparison-based learning for efficiently learning rich information from the raw crowd answers.
– The trained model and learned representations can be utilized for the quality control task of answer aggregation. The experimental results based on real datasets verify that our approach can effectively learn the representations from crowd answers and improve the performance of answer aggregation, especially when the crowd answers are sparse.

2 Related Work

2.1 Quality Control in Crowdsourcing

Majority voting is a simple and effective answer aggregation method, but because it assigns equal weights to all crowd workers and labels, the quality of the aggregated answers is not stable when worker quality is diverse. Researchers have therefore proposed more sophisticated statistical models which consider worker ability, task difficulty, and other uncertainties and strengthen the opinions of the majority [2,4,17,21,22,25,29,30]. There are also existing works focusing on few-expert scenarios with difficult tasks: some exploit diverse auxiliary information [11], while others do not rely on side information [16]. Zheng et al. [28] surveyed existing answer aggregation methods in the scenario of truth discovery. Furthermore, besides categorical labels, there are works on other types of crowd labels such as pairwise similarity or preference comparison labels [1,15,18,27,31], triplet preference comparison labels [19,24] and text sequences [14,20]. Besides aggregating the crowd labels, some works train classification models directly from the noisy crowd labels, e.g., [12]. There are also works on reducing the budget while preserving the utility of the aggregated labels, e.g., [13]. In this paper, answer aggregation is a downstream task; our method incorporates existing answer aggregation approaches by executing them on the complete worker answers predicted by the pre-trained CrowdLR model.

2.2 Self-supervised Representation Learning

Self-supervised methods have attracted much attention in recent years for learning rich feature representations from data with unknown ground truths [3,8,9,26]. There are many types of self-supervised methods. One popular type is the task-related methods, which create supervised auxiliary tasks based on self-supervised signals


from the raw data, e.g., the context prediction tasks in computer vision [6] and natural language processing [5], the rotation prediction tasks in computer vision [10]. In this paper, we focus on proposing a task-related self-supervised method for our scenario of crowd worker answers. We propose specific self-supervised tasks and the corresponding model.

Fig. 2. The architecture of our self-supervised representation learning model CrowdLR. It has two modules for the worker answer prediction task and one module for the answer consistency prediction task in a multi-task architecture. It has a Siamese structure, i.e., the two modules of worker answer prediction share the parameters.

3 Problem Setting

We focus on multi-choice-one-answer questions in crowdsourcing, which are a typical form for collecting answers from crowd workers. Workers are asked to select one answer from multiple candidates for an instance, e.g., a label of an image or an answer to a scientific question. We define a question assigned to a worker as an instance. For each instance, the potential answer set is $C = \{c_k\}_{k=1}^K$. We assume that K is the same for all questions, which is a typical setting. We define the set of workers as $A = \{a_i\}_{i=1}^N$ and the set of instances as $B = \{b_j\}_{j=1}^M$. We define $y_{ij}$ as the answer given by worker $a_i$ to instance $b_j$. We denote the set of all answers as $Y = \{y_{ij}\}_{i,j}$, the set of answers given to $b_j$ as $Y_{*j} = \{y_{ij} \mid a_i \in A\}$ and the set of answers given by $a_i$ as $Y_{i*} = \{y_{ij} \mid b_j \in B\}$. Because the number of instances can be large, each worker only needs to annotate a subset of them. The worker representations learned by the self-supervised model are defined as $U = \{u_i\}_{i=1}^N, u_i \in \mathbb{R}^d$; the instance representations are defined as $V = \{v_j\}_{j=1}^M, v_j \in \mathbb{R}^d$. For simplicity of the model hyperparameters, we use the same number of dimensions for worker and instance representations. The estimated true answers are defined as $\bar{Z} = \{\bar{z}_j\}_{j=1}^M$. The true answers are $\hat{Z} = \{\hat{z}_j\}_{j=1}^M$, which are unknown to the quality control methods for crowdsourcing. The cold-start problem is out of scope; we assume that each instance has been annotated by some workers, which is always guaranteed in crowdsourcing when collecting the crowd answers. In summary, the problem settings are as follows.
– Inputs: Worker set A, instance set B, and answer set Y.


– Learning Representations: Learn the worker representations U and instance representations V, and a model that predicts worker answers and whether two workers assign the same answers to an instance.
– Answer Aggregation: The outputs are the estimated true answers $\bar{Z} = \{\bar{z}_j\}_j$.

4 Learning Representations for Crowd Answers

Because each worker can only annotate a small instance subset, the instance-worker answer matrix can be sparse when there are many instances. Therefore, how to effectively learn the potential information from the sparse crowd answers is essential. Task-related self-supervised learning builds supervised learning tasks from the raw data without ground truths. Recent works in multiple areas have shown that it can learn rich potential feature representations, e.g., [5,9]. In this paper, we propose a method based on self-supervised learning, namely CrowdLR, to learn the representations of workers and instances, and the relations among workers, instances, and answers. We propose two specific self-supervised tasks and the corresponding model based on these tasks.

4.1 Self-supervised Tasks

Because the raw crowd answers have no ground truths, we need to create supervised tasks and convert the data into a format with ground-truth labels. We consider the following two self-supervised tasks. An intuitive task is predicting the answer of a worker to an instance. It models the relation between instances and workers and is similar to approaches based on tensor factorization (TF) [7,30]. To better understand the relations among the workers, instances, and answers in the collection, the question is what kinds of additional information we can observe from the raw crowd answers. In Fig. 1, from the answer matrix of workers and instances, we can also see that whether two workers agree with each other on an instance is important information describing the instance-wise relation among the workers, especially when both of these two workers only annotate a small subset of the instances. It can help the model understand the potential ability relations of two workers, i.e., if two workers agree with each other on many instances, they may have similar abilities; if two workers disagree with each other on many instances, they may have different abilities. Therefore, another self-supervised task leverages this instance-wise worker relation and predicts whether two workers provide the same answers to a given instance. We denote these two self-supervised tasks as: (1) the Worker Answer Prediction (WAP) task; (2) the Answer Consistency Prediction (ACP) task.

4.2 Neural Architecture of CrowdLR

Both of these two tasks can learn different types of information. They partially represent the relations among the workers, instances, and answers. We thus learn them in one neural-based model with a multi-task architecture. Figure 2 shows the architecture of the proposed model. It consists of two modules for the worker answer prediction task and one module for the answer consistency prediction task. It has a Siamese structure, i.e., the two modules of worker answer prediction for two workers share the same parameters.


To generate the samples for training the model, the raw crowd answers are converted into quintuples $(b_j, a_{i_1}, a_{i_2}, y_{i_1 j}, y_{i_2 j}, y_{i_1 j} == y_{i_2 j})$. The inputs are the one-hot worker IDs of two workers $a_{i_1}$ and $a_{i_2}$ and the one-hot instance ID of one instance $b_j$. The embedding layer converts them into worker representations $u_{i_1}$ and $u_{i_2}$, and instance representation $v_j$. The parameters of the worker and instance embedding layers are two matrices, i.e., $E_A \in \mathbb{R}^{|A| \times d}$ for the workers and $E_B \in \mathbb{R}^{|B| \times d}$ for the instances. The worker representation is obtained by $u_i = E_A \cdot a_i$; the instance representation is obtained by $v_j = E_B \cdot b_j$. After that, we fuse the worker and instance representations with the computations $H_p(u_{i_1}, v_j)$, $H_p(u_{i_2}, v_j)$ and $H_t(u_{i_1}, u_{i_2}, v_j)$, where $H_p$ is for worker answer prediction and $H_t$ is for answer consistency prediction. There are several alternatives for the computation of $H_p$ for fusing worker and instance representations. One is based on a dot product of the worker and instance representations, i.e., $H_{p1}(u_i, v_j) = u_i \cdot v_j$. It can be regarded as a neural-based implementation of tensor factorization (TF) [7,30], while non-neural TF has no subsequent linear layers. Another option is based on a concatenation operation, i.e., $H_{p2}(u_i, v_j) = [u_i, v_j]$, which is similar to neural collaborative filtering (NCF) [7]. We utilize the concatenation operation for $H_p$, following the neural-based existing work NCF. For the computation of $H_t$, we also utilize the concatenation operation. An intuitive method is $H_{t1} = [u_{i_1}, u_{i_2}, v_j]$. Because answer consistency prediction implies comparisons between two workers, and utilizing $H_{t1}$ with the linear layer cannot guarantee that these comparisons are implicitly computed, explicitly appending the comparison to $H_{t1}$ is potentially better. We thus use a variant of $H_t$ with an explicit computation of the comparison,

$H_{t2} = [u_{i_1}, u_{i_2}, |u_{i_1} - u_{i_2}|, v_j]$.    (1)
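A minimal sketch of this quintuple construction follows, assuming the raw answers are stored in a dict keyed by (worker, instance) and that all worker pairs per instance are enumerated (the pair-sampling strategy is not specified here, so that choice is an assumption).

```python
import itertools

def build_quintuples(answers):
    """Convert raw crowd answers {(worker, instance): answer} into quintuples
    (b_j, a_i1, a_i2, y_i1j, y_i2j, consistency) for the two self-supervised tasks."""
    by_instance = {}
    for (worker, instance), answer in answers.items():
        by_instance.setdefault(instance, []).append((worker, answer))
    samples = []
    for instance, worker_answers in by_instance.items():
        for (w1, a1), (w2, a2) in itertools.combinations(worker_answers, 2):
            samples.append((instance, w1, w2, a1, a2, int(a1 == a2)))
    return samples

answers = {("a1", "b1"): 0, ("a2", "b1"): 0, ("a3", "b1"): 2, ("a1", "b2"): 1}
print(build_quintuples(answers))
```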

Finally, a linear layer F and an activation function are used to compute the output predictions, i.e., the predicted answer

$\bar{y}_{ij} = \sigma_p(F(H_p(u_{i_1}, v_j))),$    (2)

and the predicted consistency

$\bar{y}_{i_1 i_2 j} = \sigma_t(F(H_t(u_{i_1}, u_{i_2}, v_j))).$    (3)

$\sigma$ is an activation function: $\sigma_p$ is the Softmax function for the multi-choice-one-answer worker answer prediction and $\sigma_t$ is the Sigmoid function for the binary answer consistency prediction. It is also possible to replace $\sigma(F(\cdot))$ with a Multi-Layer Perceptron (MLP) as in [7]. Without loss of generality, we did not select such an MLP, which requires tuning the number of layers and the dimensions of the latent layers; we directly utilize one linear layer in the current implementation. The entire CrowdLR model is small and can be trained at a low time cost.

4.3 Loss Function

The loss function for this multi-task structure is as follows,

$L = \lambda_t L_t + \lambda_{p1} L_{p1} + \lambda_{p2} L_{p2}.$    (4)


$L_{p1}$ and $L_{p2}$ are the losses of the worker answer prediction tasks and $L_t$ is the loss of the answer consistency prediction task. These two types of tasks can be regarded as constraints that prevent each other from overfitting and have a generalization effect. $\lambda_t$, $\lambda_{p1}$ and $\lambda_{p2}$ are hyperparameters that can be tuned; in our experiments, we set them to the general values $\lambda_t = \lambda_{p1} = \lambda_{p2} = 1$, which is common when the multiple tasks have the same importance and the model is not a main-auxiliary multi-task model. This multi-task neural network can be optimized by a standard optimizer such as Adam. When collecting answers, people can set the same number of answers per instance, but the numbers of answers that individual workers provide are diverse, so the numbers of training samples for the workers are imbalanced. Therefore, we propose loss functions that consider the imbalanced sample numbers of workers, rather than the vanilla binary cross-entropy, to train the model more effectively. We follow the focal loss [23] to define our loss functions. In detail, the loss $L_{p1}$ ($L_{p2}$) for the worker answer prediction task is

$L_{p_{ij}} = -\sum_{c_k}\sum_{r\in\{0,1\}} y_{ijk}^{c_k r} (1 - \bar{y}_{ij}^{c_k r})^{\gamma} \log \bar{y}_{ij}^{c_k r},$
$L_{p1} = \sum_{(i_1,j)} L_{p_{ij}}, \quad L_{p2} = \sum_{(i_2,j)} L_{p_{ij}}.$    (5)

It has a dynamic weight based on $(1 - \bar{y}^{c_k r})$ so that it promotes the importance of the hard negative samples and decreases the strong influence of the easy negative samples. On the other hand, the loss $L_t$ for the answer consistency prediction task is

$L_{t_{i_1 i_2 j}} = -\sum_{r\in\{0,1\}} y_{i_1 i_2 j}^{r} (1 - \bar{y}_{i_1 i_2 j}^{r})^{\gamma} \log \bar{y}_{i_1 i_2 j}^{r}.$    (6)
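The following sketch illustrates the Siamese multi-task architecture of Sect. 4.2 and the focal-style losses of Eqs. (4)-(6) in PyTorch; the paper's implementation uses TensorFlow, and the stability epsilon, batching, and exact head shapes below are assumptions of this illustration.

```python
import torch
import torch.nn as nn

class CrowdLR(nn.Module):
    """Siamese multi-task model: shared worker/instance embeddings, a shared
    worker answer prediction head, and an answer consistency head."""
    def __init__(self, num_workers, num_instances, K, d=1024):
        super().__init__()
        self.worker_emb = nn.Embedding(num_workers, d)       # E_A
        self.instance_emb = nn.Embedding(num_instances, d)   # E_B
        self.wap_head = nn.Linear(2 * d, K)    # H_p = [u_i, v_j]
        self.acp_head = nn.Linear(4 * d, 1)    # H_t2 = [u_1, u_2, |u_1 - u_2|, v_j]

    def forward(self, i1, i2, j):
        u1, u2, v = self.worker_emb(i1), self.worker_emb(i2), self.instance_emb(j)
        y1 = self.wap_head(torch.cat([u1, v], -1))   # shared parameters (Siamese)
        y2 = self.wap_head(torch.cat([u2, v], -1))
        t = self.acp_head(torch.cat([u1, u2, (u1 - u2).abs(), v], -1)).squeeze(-1)
        return y1, y2, t

def focal_style_loss(y1, y2, t, a1, a2, same, gamma=1.0):
    """Equal-weight multi-task loss with focal-style down-weighting of easy samples."""
    def focal_ce(logits, target):
        p = torch.softmax(logits, -1)
        pt = p.gather(-1, target.unsqueeze(-1)).squeeze(-1)
        return (-(1 - pt) ** gamma * torch.log(pt + 1e-8)).mean()
    pt = torch.sigmoid(t)
    pt = torch.where(same.bool(), pt, 1 - pt)
    lt = (-(1 - pt) ** gamma * torch.log(pt + 1e-8)).mean()
    return focal_ce(y1, a1) + focal_ce(y2, a2) + lt

model = CrowdLR(num_workers=50, num_instances=24, K=5)
y1, y2, t = model(torch.tensor([0]), torch.tensor([1]), torch.tensor([3]))
loss = focal_style_loss(y1, y2, t, torch.tensor([2]), torch.tensor([2]), torch.tensor([1.0]))
```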

4.4 Answer Aggregation

After obtaining the pre-trained CrowdLR model as well as the worker and instance representations, we can utilize them for the quality control task of answer aggregation. First, we utilize the worker answer prediction module to estimate the worker answers $\bar{Y} = \{\bar{y}_{ij}\}_{i,j}$ for all workers and instances in the instance-worker answer matrix. After that, we can apply existing answer aggregation approaches (e.g., [2,4,29]) on the complete answers $\bar{Y}$ to estimate the aggregated labels. In this paper, we utilize the Majority Voting (MV) method as the backbone answer aggregation method because it is easy to implement and is one of the most typical answer aggregation methods, often used by people who do not specialize in crowdsourcing research and work on diverse practical scenarios. In addition, for the quality control task of worker ability estimation, we can obtain the results from some answer aggregation methods (e.g., DARE). Furthermore, in the crowdsourcing context, most datasets have diverse instances and non-overlapping workers, so it is generally not possible to transfer the representations learned from one crowd dataset by the pre-training tasks to quality control tasks on another dataset. Therefore, in this paper, the quality control tasks utilize the trained model and learned representations obtained from the same dataset with the same instances and workers.
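A minimal sketch of this completion-then-aggregation step is given below: majority voting is applied column-wise over the completed instance-worker answer matrix, with ties broken toward the first answer (as described in Sect. 5.1); the matrix layout is an assumption.

```python
import numpy as np

def aggregate_by_majority_voting(completed_answers):
    """Majority voting over a completed matrix; rows are workers, columns are
    instances, and entries are predicted/observed answer indices."""
    aggregated = []
    for j in range(completed_answers.shape[1]):
        votes = np.bincount(completed_answers[:, j])
        aggregated.append(int(np.argmax(votes)))   # argmax breaks ties toward the first answer
    return aggregated

matrix = np.array([[0, 1, 2],
                   [0, 1, 1],
                   [1, 1, 2]])
print(aggregate_by_majority_voting(matrix))   # [0, 1, 2]
```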


Table 1. Statistics of the real datasets with diverse statistical factors. K: # of candidate answers of a question; |B|: # of instances; |A|: # of workers; |Y|: # of answers; AR: Answer Redundancy (|Y|/|B|); MWA: Mean Worker Accuracy.

Dataset  K  |B|  |A|  |Y|   AR      MWA
ENG      5  30   63   1890  63.00   0.256
CHI      5  24   50   1200  50.00   0.374
ITM      4  25   36   900   36.00   0.537
MED      4  36   45   1650  45.00   0.475
POK      6  20   55   1100  55.00   0.277
SCI      5  20   111  2220  111.00  0.295

5 Experiments

5.1 Experimental Settings

We utilize real crowdsourcing datasets proposed in existing work [16] to verify our approach. Table 1 lists the statistical factors of these datasets. These datasets contain the ground truths, which are only used for evaluation and not used by the approaches. Mean Worker Accuracy (MWA) is the mean accuracy of the answer set $Y_{i*}$ of each worker $a_i$ with respect to the ground truths, which reflects the instance difficulties in these datasets. There are six datasets in total, i.e., ENG, CHI, ITM, MED, POK, and SCI. The crowd answers of these datasets are described as follows.

– ENG (English): the most analogically similar word pair to a word pair;
– CHI (Chinese): the meaning of Chinese vocabularies;
– ITM (Information Technology): basic knowledge of information technology;
– MED (Medicine): about medicine efficacy and side effects;
– POK (Pokémon): the Japanese name of a Pokémon with an English name;
– SCI (Science): intermediate knowledge of chemistry and physics.

The questions of the instances in these datasets are heterogeneous, i.e., the answers to different questions have different contents. For example, the answer "A. creek:river" to the question "Select the analogous pair for word pair hill:mountain" and the answer "A. posse:crowd" to another question "Select the analogous pair for word pair school:fish" are different. In these datasets, each worker annotates all instances. To verify our method, we generate datasets with missing answers by randomly removing answers. Given a sampling rate r, for each instance in the dataset, we randomly select |A| * r answers, where |A| is the number of workers. We set the sampling rate r from the set {0.1, 0.2}. The new datasets are named using r as a suffix, e.g., ENG.10% for the ENG dataset with r = 0.1. For each dataset and r, we run the generation five times and report the average performance of the five trials. In addition, we define the Answer Completeness Rate (ACR), which shows the sparsity of the answer matrix of workers and instances, i.e., ACR = |Y|/(|A||B|).
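A sketch of this subset generation and the ACR computation follows; the data layout (instance → {worker: answer}) and keeping at least one answer per instance are assumptions of the sketch.

```python
import random

def sample_answers(answers_per_instance, num_workers, r):
    """Randomly keep |A|*r answers per instance to build a sparse subset."""
    keep = max(1, int(num_workers * r))
    sampled = {}
    for instance, worker_answers in answers_per_instance.items():
        workers = random.sample(list(worker_answers), min(keep, len(worker_answers)))
        sampled[instance] = {w: worker_answers[w] for w in workers}
    return sampled

def answer_completeness_rate(sampled, num_workers, num_instances):
    total = sum(len(v) for v in sampled.values())
    return total / (num_workers * num_instances)   # ACR = |Y| / (|A||B|)
```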


These sampled subsets have relatively low ACR (≤ 20%) and low MWA; they are sparse and difficult subsets. We utilize TensorFlow to implement CrowdLR. We set some of the hyperparameters to general values, i.e., the weights of the multi-task losses in Eq. (4) are equal, λt = λp1 = λp2 = 1, and γ of the losses in Eq. (5) and (6) is 1, following existing works such as [23]. Besides, we utilize the Adam algorithm for optimization with the default learning parameters such as a learning rate of 0.001. The batch size is set to 2,000. We set 10,000 training epochs with early stopping. We set the dimension of embeddings d = 1024 for CrowdLR. Because the size of a training sample and the size of the CrowdLR neural network are small, it does not require much memory or training time. We convert all raw crowd answers into training quintuples and use all of them to train the self-supervised model. For the existing works used as baselines in the answer aggregation task, besides the MV method, we also compare with the typical answer aggregation methods D&S [4], DARE [2] and Minimax entropy (MINIMAX) [29]. We implement Majority Voting (MV) using Python and SciPy, and utilize the public codes of D&S, DARE and MINIMAX. For the MV method, because different rules can be used to break ties, i.e., how to select the answer when the candidate choices have equal estimated probability, the reported results of the MV method on a dataset may differ slightly among existing works as well as our paper. We always select the first answer among the answers with equal estimated probability if there is a tie in the answers of an instance.

Table 2. Experimental results. Bold values show the best results; underlined values show the results where CrowdLR is better than or equal to MV, while CrowdLR utilizes MV as the backbone answer aggregation method.

Dataset   CrowdLR  w/o ACP Task  MV      D&S     DARE    MINIMAX
ENG.10%   0.2933   0.1867        0.2600  0.2133  0.2467  0.2600
CHI.10%   0.4500   0.2750        0.4083  0.3750  0.4417  0.4333
ITM.10%   0.5680   0.3760        0.5440  0.4080  0.5680  0.5520
MED.10%   0.5167   0.3111        0.5056  0.4389  0.5111  0.4833
POK.10%   0.2700   0.1500        0.2200  0.2400  0.2500  0.2700
SCI.10%   0.4700   0.3600        0.4000  0.3700  0.4800  0.4000
ENG.20%   0.4133   0.2933        0.3867  0.3000  0.3600  0.2733
CHI.20%   0.5083   0.4000        0.5083  0.4333  0.5500  0.4583
ITM.20%   0.6560   0.4480        0.6480  0.5680  0.6160  0.5760
MED.20%   0.6333   0.4500        0.6222  0.6667  0.6389  0.5611
POK.20%   0.4600   0.3600        0.3700  0.3400  0.4400  0.4100
SCI.20%   0.4500   0.2800        0.4700  0.5000  0.4800  0.5000


5.2 Experimental Results

We executed the answer aggregation approach MV on the complete worker answers $\bar{Y}$ predicted by the pre-trained CrowdLR model and compared these aggregation results with those obtained from the raw answers Y by the existing answer aggregation methods. Table 2 lists the results. First, the underlined values in Table 2 show the results where CrowdLR is better than or equal to MV. Comparing the "CrowdLR" and "MV" columns shows that the performance of CrowdLR is better than or equal to that of MV in all of the cases; CrowdLR can effectively learn the representations from crowd answers and improve the performance of answer aggregation. Second, comparing the "CrowdLR" column with the "MV", "D&S", "DARE" and "MINIMAX" columns, the proposed CrowdLR outperforms the baselines in 8 of 12 cases. Considering that the backbone answer aggregation method of CrowdLR is the simple MV, i.e., MV applied to the complete answers produced by CrowdLR, this also shows that CrowdLR is effective for improving the performance of answer aggregation. Third, comparing the subsets with r = 0.1 and those with r = 0.2, CrowdLR outperforms the baselines in 5 of 6 cases when r = 0.1 and in 3 of 6 cases when r = 0.2, which shows that the proposed CrowdLR is especially effective when the crowd answers are sparse. We also investigate how the proposed components of CrowdLR influence the results, i.e., an ablation study. Table 2 also lists the ablation results. If the ACP task is removed (w/o ACP Task, equivalent to NCF), the performance of CrowdLR decreases considerably. This shows that the ACP task and the multi-task model with the Siamese structure are important for CrowdLR to effectively learn the information from the raw crowd data.

6 Conclusion

In this paper, we propose a task-related neural-based self-supervised method named CrowdLR to learn rich instance and worker representations from crowd answers. We design two self-supervised tasks and create a multi-task model with a Siamese structure to learn these two tasks in one framework. The experimental results based on real datasets show that our approach can effectively learn the representations from the crowd answers and improve the performance of answer aggregation, especially when the crowd answers are sparse. A potential limitation of the proposed method is that the answers corrected by CrowdLR tend to be on instances with relatively higher-quality worker answers; the quality of worker answers on instances with relatively lower quality may not be improved. Another potential limitation is that answer aggregation approaches utilize the correlation among workers, instances, and answers with complex behaviors, so correcting some of the worker answers may not exactly produce the correct aggregated answers. In future work, we will investigate the relation between the increase in worker answer accuracy and the increase in answer aggregation accuracy.

Acknowledgements. This work was partially supported by JKA Foundation and JSPS KAKENHI Grant Number 23H03402.







Identify Vulnerability Types: A Cross-Project Multiclass Vulnerability Classification System Based on Deep Domain Adaptation

Gewangzi Du1,2, Liwei Chen1,2(B), Tongshuai Wu1,2, Chenguang Zhu1,2, and Gang Shi1,2

1 Institute of Information Engineering, Chinese Academy of Sciences, Beijing, China
{dugewangzi,chenliwei,wutongshuai,zhuchenguang,shigang}@iie.ac.cn
2 School of Cyber Security, University of Chinese Academy of Sciences, Beijing, China

Abstract. Software Vulnerability Detection (SVD) is an important means to ensure system security due to the ubiquity of software. Deep learning-based approaches achieve state-of-the-art performance in SVD, but one of the most crucial issues is coping with the scarcity of labeled data in the projects to be detected. One reliable solution is to employ transfer learning to leverage labeled data from other software projects. However, existing cross-project approaches only detect whether a function is vulnerable or not. Identifying the vulnerability type is essential because it offers the information needed to patch the vulnerability. Our aim in this paper is to propose the first system for cross-project multiclass vulnerability classification. We detect at the granularity of code snippets, which are finer-grained than functions and effective for catching inter-procedural vulnerability patterns. After generating code snippets, we define several principles to extract snippet attentions and build a deep model to obtain fused deep features; we then extend different domain adaptation approaches to reduce the feature distribution discrepancy between projects. Experimental results indicate that our system outperforms other state-of-the-art systems.

Keywords: cyber security · multiclass classification · snippet attention · deep learning · domain adaptation

1 Introduction

Software vulnerabilities severely undermine the security of computer systems, and vulnerabilities are practically inevitable. Thousands of vulnerabilities are reported publicly to Common Vulnerabilities and Exposures (CVE) [1] and the National Vulnerability Database (NVD) [2] each year. In the field of cyber security, SVD has always been one of the most important problems faced by


researchers. Traditional static methods rely on experts to define vulnerability patterns [3,4] but fail to achieve decent performance owing to the incompleteness and subjectivity of expert-defined patterns; manually defining vulnerability patterns is also tedious and laborious. Traditional dynamic analysis such as fuzzing [5,6] inspects vulnerabilities during program execution but has low code coverage. With the rapid development of deep neural networks, researchers have leveraged them to relieve human experts from the arduous task of manually defining patterns [7–11]. Deep learning-based methods automatically learn vulnerability patterns from rough input features and have achieved great success. Among mainstream deep neural networks, the RNN is naturally designed to work with sequential data such as text; in particular, the bidirectional form of an RNN can capture the long-term dependencies of a sequence. Therefore, a large number of studies have applied RNN variants to learn the semantics of vulnerabilities and harvest excellent detection performance. A practical challenge of deep learning-based SVD is the shortage of labeled data within a single software project. Deep models are trained with large amounts of labeled data, but labeling vulnerable code is very time-consuming. Moreover, models trained on one project cannot be generalized to another, since data distributions differ across projects. One ingenious solution is to adopt transfer learning to reduce the distribution discrepancy. In the field of SVD, several studies attempt to draw the data distributions of different domains closer; after feature-level transfer, labeled data in the source domain can be applied to the target domain [12–15]. However, all of these works focus only on binary classification and cannot determine vulnerability types, which is essential for security analysts to assess the risk level and patch the vulnerabilities. Furthermore, all existing cross-project studies detect vulnerabilities at the function level, which is too coarse-grained and cannot capture inter-procedural vulnerabilities. To solve these problems, we propose a novel system for cross-project multiclass vulnerability detection at the code snippet level. Compared with previous studies, our work is the first investigation of cross-project multiclass vulnerability detection. Besides, there are three more contributions:

1. Inspired by fine-grained image recognition in computer vision [16,17], we define several statement types and several principles to generate snippet attentions by matching these statement types. In addition, we build a deep model composed of three sub-models that accommodate code snippets and snippet attentions to obtain fused deep features. Our deep feature extraction strategy achieves better experimental results than the alternatives.
2. We align multiple source projects before involving the target project, and we extend a Mahalanobis distance metric learning algorithm to draw the distributions of the source and target projects closer. Experimental results indicate that our multi-source alignment method improves detection performance.


3. We collected and labeled 5 real-world open-source projects with two professional security researchers, which took more than 240 hours of manual work. Equipped with this dataset, we conducted extensive experiments, which verify that our system significantly outperforms other state-of-the-art approaches in the following cross-project settings:

– Scenario 1: Single source to single target project, where the source project contains all the vulnerability types of the target project. This is the most straightforward scenario.
– Scenario 2: Single source to single target, but the vulnerability types in the source project are not identical to those in the target project. This scenario measures the capability to detect across vulnerability types.
– Scenario 3: Multiple sources to a single target. This scenario is very common, since some vulnerability types may be absent from a single source project (subcase 1), or no label is absent but each source project has only a small number of samples per vulnerability type (subcase 2). Combining multiple sources is therefore expected to achieve better performance.

Paper Organization. The rest of the paper is organized as follows. Section 2 describes previous work. Section 3 presents the proposed method in detail. Experiment implementation and result analysis are given in Sect. 4. Section 5 concludes the paper and discusses future work.

2 Previous Work

Traditional approaches rely entirely on experts to formulate vulnerability patterns, which often leads to unsatisfactory performance. In recent years, more and more researchers have employed deep neural networks for SVD, since they clearly outperform traditional approaches and relieve security experts from tedious manual work. Li et al. proposed VulDeePecker [7], the first SVD system based on deep learning, which extracts code slices from API calls through data-flow propagation. Zou et al. then proposed a multiclass vulnerability detection system [8] that combines different kinds of features; however, this in-domain detection system only works on slices generated from API calls. After that, Li et al. proposed SySeVR [9], a binary classification system that expands the slicing points beyond API calls. Duan et al. [10] embedded features from the code property graph and applied an attention mechanism to capture potentially vulnerable code lines. Zhu et al. [11] adopted a bidirectional self-attention mechanism and used a pretrained BERT to detect multiclass vulnerabilities. However, all of these approaches split training and test data from the same dataset, which means a deep model built on the training data can be directly applied to the test data. When coping with the scarcity of labeled vulnerabilities in a project, researchers resort to transfer learning and leverage labeled data from other projects. Lin et al. [12] employed a BiLSTM to learn transferable representations between projects and used a random forest as the downstream classifier.


They then combined the heterogeneous data of the entire code text and the abstract syntax tree (AST) to learn unified representations of vulnerability patterns [13]. Nguyen et al. [14] employed an adversarial learning framework to learn domain-invariant features that can be transferred from the source to the target project. Liu et al. [15] adopted an RNN to learn high-level features and utilized the metric transfer learning framework to transform them into transferable ones. However, all of these cross-project studies only detect the presence of vulnerabilities and cannot determine the vulnerability types.

3 Framework

A domain D = {X, Y, P(X, Y)} is characterized by a feature space X, a label space Y and a joint probability distribution P(X, Y). In the cross-project setting, Ps(X, Y) ≠ Pt(X, Y), where Ps(X, Y) and Pt(X, Y) denote the source and target domains, respectively. Furthermore, since P(X, Y) = P(X)P(Y|X) and Ps(Y|X) = Pt(Y|X), the discrepancy between the joint distributions can be reduced by minimizing the discrepancy between Ps(X) and Pt(X), so that a classifier trained in the source domain can be applied to the target domain. Without drawing these distributions closer, a classifier trained in one domain overfits and performs poorly in another. Each project constitutes a domain, since its feature distribution differs from those of other projects; several studies have tried to tackle this problem, as mentioned in Sect. 2. The main idea of our work is to learn cross-domain representations and diminish the gap between different domains at the code snippet level, so that a classifier trained on source-domain representations can be applied to the target. The structure of the system is shown in Fig. 1 and is detailed below.

3.1 Code Snippet Generation

Function-level detection has two disadvantages. First, when a function is detected as vulnerable, it takes experts laborious work to locate the vulnerable lines, because a real-world project function often contains many lines of code, so experts still need other techniques or manual checks to pinpoint the vulnerability locations. Second, detection within the function scope fails to grasp vulnerability patterns across function calls; such data-propagation vulnerabilities through function calls are quite common and easily lead to serious security issues. We therefore detect at the granularity of code snippets, which contain significantly fewer lines of code than functions and also span function calls to include potential inter-procedural vulnerabilities.

– Step 1: Key Point Locating. We first select key points in the source program. By analyzing the syntactic characteristics of the program, we focus on four kinds of key points: API/Library Call, Array Usage, Pointer Usage and Expression. We use the open-source analysis tool Joern [18] to generate the AST and match these four types of syntactic features in the AST, as shown in Fig. 2.


Fig. 1. The framework of our system.

Fig. 2. Four types of key points.

– Step 2: Code Snippet Extracting. The Program Dependency Graph (PDG) of the code contains data dependency and control dependency information. We traverse the PDG starting from the four types of key points and collect the traversed code lines as a code snippet; both forward and backward traversals are considered (a minimal traversal sketch is given after this list). For relationships between functions, we extract code lines along function calls and keep them in order. Figure 3 shows the process of generating a code snippet from a function via the PDG: starting from the Pointer Usage "pointer[9] = '\0';", we extract the snippet by traversing the PDG, reach the function "void print(const char *ptr)" through a data dependency, and finally obtain the code snippet.
– Step 3: Labeling Code Snippets. We label the ground-truth data according to the method in [7] and extend it to identify the vulnerability types:
1. We collect projects from the NVD and examine the diff files of all functions. If a diff file contains "-", we mark the corresponding lines as vulnerable; professional experts manually mark vulnerable lines if there is no "-".


2. A code snippet containing no marked line is labeled "0" (i.e., not vulnerable); it is vulnerable if it contains at least one marked statement. Since each CVE-ID in the NVD has a CWE-ID (Common Weakness Enumeration identifier) [19], we label the classes of vulnerable snippets according to their CWE-IDs (vulnerability types).
3. Because this procedure cannot label a truly vulnerable snippet as "0", experts then manually check whether the snippets labeled as vulnerable are mislabeled.
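The forward/backward traversal in Step 2 can be prototyped with a generic graph library. The sketch below is illustrative rather than the authors' implementation; it assumes the PDG has been exported into a networkx DiGraph whose nodes carry hypothetical "code" and "line" attributes.

```python
import networkx as nx

def extract_snippet(pdg: nx.DiGraph, key_point) -> list:
    """Collect every statement the key point depends on (backward slice) or that
    depends on the key point (forward slice), then restore source order."""
    backward = nx.ancestors(pdg, key_point)    # statements the key point depends on
    forward = nx.descendants(pdg, key_point)   # statements influenced by the key point
    nodes = backward | forward | {key_point}
    # Order the selected statements by their original line numbers.
    return [pdg.nodes[n]["code"]
            for n in sorted(nodes, key=lambda n: pdg.nodes[n]["line"])]
```

Because inter-procedural call and parameter dependencies appear as ordinary edges in the PDG, the same traversal crosses function boundaries without extra handling.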

Fig. 3. Process of generating a code snippet and snippet attention from a function. In the PDG, red arrows represent control dependencies and blue arrows represent data dependencies. (Color figure online)

3.2 Snippet Attention Generation

Region attention in image recognition refers to locations in a picture that provide a strong basis for fine-grained classification; for example, beak shape and feather color are discriminative in determining the species of a bird. Inspired by this, we focus on several types of local features in our PDG-based code snippets, such as pointer/array usage statements and API/library call statements, which we call snippet attention. We believe that snippet attention provides additional information in our cross-project multiclass detection scenario. For example, pointer usage statements often provide information on pointer-related vulnerabilities such as null pointer dereference and use after free.


Array usage easily contributes to buffer-related vulnerabilities. APIs such as strcpy() and memset() can cause buffer overflows, while scanf() can cause improper input validation. To capture data-flow information, we also focus on variable definitions, formal parameters and bounds-checking operations. We therefore define variable definitions and formal parameters in function definitions as the "definition" statement type, pointer and array usage as the "usage" statement type, API calls as the "API" statement type, and control statements such as "if" or "while" as the "control" statement type. The "control" statements may also provide information on vulnerabilities such as infinite loops. Pointer/array usage here has the same meaning as the key points in Subsect. 3.1, i.e., a write operation on the corresponding memory. We further define the following principles to obtain snippet attentions:

1. Match between a "definition" statement and an "API" statement: if a variable in a "definition" statement matches an argument of an "API" statement, we select both statements as snippet attention. We do not choose "API" statements that take no defined variable as an argument.
2. Match between a "definition" statement and a "usage" statement: if a variable in a "definition" statement matches the pointer/array variable used in a "usage" statement, we select both statements as snippet attention.
3. If a statement is of the "control" type, we extract it as snippet attention, since it probably performs bounds checking or security screening.

With these principles, we obtain the snippet attention from a code snippet, as also depicted in Fig. 3. Since a code snippet cannot be compiled, we implement an automated lexical-analysis-based program that analyzes tokens and context structures to determine the statement types and match the principles from scratch. Non-ASCII characters and comments are discarded. The procedure is summarized in Algorithm 1, where the operation "∪" removes duplicated "definition" statements.

3.3 Vectorization

We use our lexical analysis to generate tokens (e.g., variables, operators, keywords) from code snippets and snippet attentions as the corpus. We then use the word2vec tool [20] with the Continuous Bag-of-Words model to embed the token sequences of code snippets and snippet attentions into vectors. To deal with different vector lengths, we set two fixed lengths: τ1 for code snippets and τ2 for snippet attentions. If the vector of a code snippet (snippet attention) is longer than τ1 (τ2), we truncate it at the end; if it is shorter, we pad zeros at the end to reach τ1 (τ2).
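A minimal sketch of this embedding step with gensim is given below. The toy token sequences and the helper name pad_or_truncate are illustrative assumptions; the embedding size of 40 and the lengths τ1 = 500 and τ2 = 300 follow the settings reported in Sect. 4.1.

```python
import numpy as np
from gensim.models import Word2Vec

def pad_or_truncate(vectors: np.ndarray, length: int) -> np.ndarray:
    """Cut the token-vector sequence at the end, or pad zero rows, to a fixed length."""
    if len(vectors) >= length:
        return vectors[:length]
    pad = np.zeros((length - len(vectors), vectors.shape[1]))
    return np.vstack([vectors, pad])

# token_sequences: token lists produced by the lexical analysis (toy example here)
token_sequences = [["char", "buf", "[", "10", "]", ";"],
                   ["strcpy", "(", "buf", ",", "src", ")", ";"]]
model = Word2Vec(sentences=token_sequences, vector_size=40, sg=0, min_count=1)  # sg=0 -> CBOW

snippet_matrix = pad_or_truncate(
    np.array([model.wv[t] for t in token_sequences[0]]), length=500)   # tau_1 = 500
attention_matrix = pad_or_truncate(
    np.array([model.wv[t] for t in token_sequences[1]]), length=300)   # tau_2 = 300
```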


Algorithm 1. Snippet Attention Generation
Input: "definition" statements D, "API" statements A, "usage" statements U and "control" statements C in a code snippet CS.
Output: Snippet attention SA in CS.
1: SA ← ∅
2: for each "definition" statement di ∈ D do
3:   for each "API" statement ai ∈ A do
4:     if di and ai match principle 1 then
5:       SA ← SA ∪ di ∪ ai
6:     end if
7:   end for
8:   for each "usage" statement ui ∈ U do
9:     if di and ui match principle 2 then
10:      SA ← SA ∪ di ∪ ui
11:    end if
12:  end for
13: end for
14: for each "control" statement ci ∈ C do
15:   SA ← SA ∪ ci
16: end for
17: return SA
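For readability, Algorithm 1 can also be rendered directly in Python. The predicates matches_principle_1 and matches_principle_2 below are hypothetical placeholders for the lexical matching rules defined above.

```python
def snippet_attention(definitions, api_calls, usages, controls,
                      matches_principle_1, matches_principle_2):
    """Python rendering of Algorithm 1: collect snippet-attention statements."""
    attention = []                       # ordered, duplicate-free container

    def add(stmt):
        if stmt not in attention:        # the union in Algorithm 1 removes duplicates
            attention.append(stmt)

    for d in definitions:
        for a in api_calls:
            if matches_principle_1(d, a):    # defined variable used as an API argument
                add(d); add(a)
        for u in usages:
            if matches_principle_2(d, u):    # defined variable written via pointer/array
                add(d); add(u)
    for c in controls:                       # every control statement is kept
        add(c)
    return attention
```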

3.4 Deep Feature Representation

We propose a network that uses BiGRU as its basic building block, since BiGRU can capture long-distance information in both the forward and backward directions. The network, shown in Fig. 4, is composed of a global feature model, a local feature model and a fusion model. The global feature model takes code snippets as input, while the local feature model takes snippet attentions as input; because these two inputs carry different kinds of information, a fusion model is added to obtain comprehensive features. For the global and local feature models, the input vectors have lengths τ1 and τ2, and we set return_sequences to true to obtain a two-dimensional output for every code snippet and snippet attention. The number of hidden units in the global/local feature models is 128, and the outputs of their BiGRU layers are concatenated to form a (τ1 + τ2, 256)-dimensional tensor for each (code snippet, snippet attention) pair. The number of hidden units in the fusion model is 256; after max-pooling, we obtain a 512-dimensional hidden representation for each input pair. Obtaining the deep features requires two steps:

– Deep Model Pre-training. We attach a softmax layer to each model and train the three models separately. Code snippet and snippet attention pairs from all source projects are used to pre-train the parameters of the three models.
– Deep Feature Extraction. After pre-training, we feed the data from all projects (source and target) into the pre-trained network to obtain the deep features for the source and target domains.
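The three-part network can be sketched in Keras as follows. This is an illustrative reconstruction under the stated hyper-parameters (τ1 = 500, τ2 = 300, embedding size 40, 128/256 hidden units), not the authors' released code; the softmax heads used during pre-training are omitted.

```python
import tensorflow as tf
from tensorflow.keras import layers, Model

TAU1, TAU2, EMB = 500, 300, 40

# Global feature model: BiGRU over the code-snippet token embeddings.
snippet_in = layers.Input(shape=(TAU1, EMB), name="code_snippet")
global_feat = layers.Bidirectional(layers.GRU(128, return_sequences=True))(snippet_in)

# Local feature model: BiGRU over the snippet-attention token embeddings.
attention_in = layers.Input(shape=(TAU2, EMB), name="snippet_attention")
local_feat = layers.Bidirectional(layers.GRU(128, return_sequences=True))(attention_in)

# Fusion model: concatenate along time, another BiGRU, then max-pooling -> 512-d feature.
fused = layers.Concatenate(axis=1)([global_feat, local_feat])      # (tau1 + tau2, 256)
fused = layers.Bidirectional(layers.GRU(256, return_sequences=True))(fused)
deep_feature = layers.GlobalMaxPooling1D()(fused)                  # 512-d representation

feature_extractor = Model([snippet_in, attention_in], deep_feature)
feature_extractor.summary()
```

After pre-training, calling feature_extractor on a (code snippet, snippet attention) pair yields the 512-dimensional deep feature that is fed to the domain adaptation stage.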


Fig. 4. Deep Feature Model.

3.5 Domain Adaptation and Classification

In this stage, we aim to eliminate the distribution discrepancy of the deep features among different domains. If multiple source domains are combined, we first extend the CORAL algorithm [21] to align the source domains (Step 1); then MTLF [22] is applied to draw the distributions of the same class in the source and target domains closer (Step 2). In the single-source scenario, we skip Step 1 and use Step 2 directly to bring the source and target domains closer.

– Step 1: Multi-Source Alignment. Inspired by CORAL, we align the distributions of different source domains through their second-order statistics, the covariance, which reflects the distribution of a dataset. CORAL whitens the features of one domain and re-colors them with the covariance of the other domain's distribution; the whitened domain loses its feature correlations and is clustered around the normalized zero mean. Since the distributions of the same class in the source and target domains are drawn closer in the next step, we only whiten all source domains and discard the re-coloring procedure. Our alignment method is given in Algorithm 2; a small NumPy sketch follows the algorithm.
– Step 2: Mahalanobis Distance Metric Learning. This step minimizes the distribution gap between the source and target domains, i.e., it minimizes intra-class distances and maximizes inter-class distances. At its core is MTLF, which bridges the distributional divergence between the source and target domains by learning an appropriate distance metric. This approach needs a handful of labeled data in the target domain for training. A


Mahalanobis distance metric is defined as

$$dist_{ij} = \sqrt{(x_i - x_j)\, M\, (x_i - x_j)^T} \qquad (1)$$

where the matrix M is positive semi-definite and can be decomposed as M = A^T A; learning the Mahalanobis distance metric is therefore equivalent to learning A. Moreover, if an instance is highly correlated with the other domain, the re-weighting function ω(x) assigns it a high weight. The within-class loss to be minimized is defined as

$$\ell_{in}(A, \omega) = \sum_{y_i = y_j} \omega(x_i)\,\omega(x_j)\,\lVert A(x_i - x_j)\rVert^2 \qquad (2)$$

where x is a deep feature and y denotes its label. The inter-class loss ℓ_out(A, ω), to be maximized, is defined analogously over pairs with y_i ≠ y_j. Finally, the overall training objective is formulated as

$$\min_{A, \omega}\; \mathrm{tr}(A^T A) + \alpha \sum_{i \in D_s \cup D_t} \lVert \omega(x_i) - \omega_0(x_i) \rVert^2 + \beta\,[\,\ell_{in}(A, \omega) - \ell_{out}(A, \omega)\,] \qquad (3)$$

where ω_0(x_i) = P_T(x_i)/P_S(x_i) is the estimated density ratio, and D_s and D_t are the labeled data from the source and target domains. By repeatedly updating ω and A, we obtain the final Mahalanobis matrix A.

– Step 3: Classification. Since A(x_i − x_j) = A x_i − A x_j, Ax can be regarded as projecting x into a new space by the matrix A, in which samples with the same label are drawn closer. Consequently, we use a KNN classifier trained on all labeled data in this projected space to pinpoint the vulnerability types in the target domain.
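Once the matrix A has been learned, Step 3 reduces to a linear projection followed by nearest-neighbour search. A minimal sketch, assuming scikit-learn as the classifier backend and K = 1 as in Sect. 4.1:

```python
import numpy as np
from sklearn.neighbors import KNeighborsClassifier

def classify_with_metric(A, X_labeled, y_labeled, X_target):
    """Project features with the learned Mahalanobis factor A, then apply 1-NN."""
    knn = KNeighborsClassifier(n_neighbors=1)
    knn.fit(X_labeled @ A.T, y_labeled)    # all labeled (source + held-out target) data
    return knn.predict(X_target @ A.T)     # predicted vulnerability types
```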

Algorithm 2. Multi-Source Alignment
Input: Multiple source data D = {S1, ..., Sn}
Output: Adjusted source data D* = {S1*, ..., Sn*}
1: for Si ∈ D do
2:   Ci = cov(Si) + eye(size(Si, 2))
3:   Si* = Si * Ci^(-1/2)
4: end for
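Algorithm 2 is essentially the whitening half of CORAL. A NumPy/SciPy sketch of the same computation (illustrative; the authors' implementation is not specified beyond Algorithm 2) is:

```python
import numpy as np
from scipy.linalg import fractional_matrix_power

def align_sources(sources):
    """Whiten each source domain: S_i* = S_i * C_i^(-1/2), with C_i a regularized covariance."""
    aligned = []
    for S in sources:                                      # S: (n_samples, n_features) deep features
        C = np.cov(S, rowvar=False) + np.eye(S.shape[1])   # covariance + identity regularizer
        aligned.append(S @ fractional_matrix_power(C, -0.5).real)
    return aligned
```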

4 Experiment Implementation

We raise four Research Questions (RQs) based on the detection scenarios mentioned in Sect. 1 and our experiments are designed to answer them.


– RQ1: Is our approach effective for cross-project multiclass vulnerability detection, and how does its detection performance compare with other state-of-the-art approaches?
– RQ2: When the CWE types of the source and target projects are inconsistent, can our system still achieve decent performance in this cross-vulnerability-type scenario?
– RQ3: In the multi-source scenario, is our system more effective than other techniques, and does the multi-source alignment improve detection?
– RQ4: Does our deep feature extraction strategy contribute to improving detection?

We use TensorFlow 1.15.0 to implement the deep model, Python 3.8.0 to implement the multi-source alignment, and Matlab R2021a for MTLF. The gensim package (version 4.0.1) is used to vectorize code snippets and snippet attentions with word2vec. We run our experiments on a server with Ubuntu Linux 18.04, an NVIDIA GeForce RTX 2080Ti GPU and an Intel(R) Xeon(R) Silver 4214 CPU @ 2.20 GHz.

4.1 Dataset and Experimental Setup

Dataset: We selected 5 real-world projects: Linux Kernel, Qemu, Wireshark, Firefox and FFmpeg. We reuse the original functions of these projects offered in [9] and collected more project functions for the code snippet generation phase. We finally obtained code snippets of 6 different CWEs for each project; thus, including the non-vulnerable data, there are 7 classes in each project. The sample counts of the 7 basic classes and their labels in each project are shown in Table 1.

Table 1. Basic Dataset and Label

CWE Type (Label)     Linux Kernel  Qemu  Wireshark  Firefox  FFmpeg
Non-vulnerable (0)   460           467   489        422      335
CWE-119 (1)          174           80    294        183      202
CWE-399 (2)          155           42    217        68       43
CWE-20 (3)           107           33    91         29       42
CWE-189 (4)          82            28    103        27       64
CWE-835 (5)          106           98    75         10       20
CWE-416 (6)          103           86    30         16       20

For RQ2 (i.e., Scenario 2), we collect 3 more CWE types from the Linux Kernel: CWE-200, CWE-787 and CWE-400. In this scenario, we substitute a basic CWE type in the source project (i.e., Linux Kernel) with a new type and give


the new type the same label as the substituted basic type. The sample counts of CWE-200, CWE-787 and CWE-400 are 30, 40 and 93, respectively. For RQ3, there are two subcases in the multi-source scenario (i.e., Scenario 3 in Sect. 1). Subcase 1 simulates label absence in the source domains and combines different labels from different source projects; in this subcase we use all data of the selected classes in a source project (for example, if we choose labels 4, 5 and 6 from Wireshark, we pick all samples of these labels). In subcase 2, no label is absent in the source domains, but each class in each source domain has far fewer samples than the target domain. To simulate this subcase, when combining Linux Kernel and Wireshark to detect Qemu we discard 70% of the data in each source domain; when combining Linux Kernel and Qemu to detect Firefox we also discard 70% of the data in each source domain; in addition, we combine Firefox and FFmpeg to detect Wireshark, keeping all the data, since the samples of most classes in these source domains are already far fewer than in the target domain.

Experimental Setup: After hyper-parameter tuning, we set the learning rate of the deep model to 0.01, the number of epochs to 50 and the batch size to 32 for all three models. τ1 is 500 and τ2 is 300, so the input shapes of the global and local feature models are (500, 40) and (300, 40). We set K = 1 in the KNN. As in CD-VULD [15], we hold out 30% of the target domain as labeled data to train the matrix A, and the rest is used for testing.

4.2 Evaluation Metric and Baseline

Evaluation Metric: We use multiclass classification evaluation metrics [23]: Macro-Averaged False Positive Rate (M_FPR), Macro-Averaged False Negative Rate (M_FNR), Macro-Averaged F1-measure (M_F1), Weighted-Averaged False Positive Rate (W_FPR), Weighted-Averaged False Negative Rate (W_FNR) and Weighted-Averaged F1-measure (W_F1). These metrics are the averages or weighted averages of the per-class indicators (e.g., the per-class FPRs), where the weights are the numbers of samples in each class.

Baseline: Our baselines include the state-of-the-art deep learning technique SySeVR [9] and the multiclass vulnerability detection system AMVD [11]. Furthermore, we use CD-VULD, a state-of-the-art detection system based on transfer learning and domain adaptation; for CD-VULD, we hold out exactly the same labeled target data as for our approach to train the matrix A. For SySeVR and CD-VULD, we attach a softmax layer to the models to obtain the classification results. To answer RQ4, we use only code snippets with the deep feature extraction approach of CD-VULD, an RNN-based structure specially designed to extract deep features from code; we also apply the approach of [8] and take its deep features from the output layer corresponding to ours.
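For reference, the macro- and weighted-averaged rates defined above can be derived from a confusion matrix as in the following sketch; the use of scikit-learn here is an assumed convenience rather than the authors' exact tooling.

```python
import numpy as np
from sklearn.metrics import confusion_matrix, f1_score

def macro_weighted_rates(y_true, y_pred):
    cm = confusion_matrix(y_true, y_pred)
    support = cm.sum(axis=1)                      # samples per class
    tp = np.diag(cm)
    fn = support - tp                             # missed samples of each class
    fp = cm.sum(axis=0) - tp                      # samples wrongly assigned to each class
    tn = cm.sum() - tp - fn - fp
    fpr, fnr = fp / (fp + tn), fn / (fn + tp)     # per-class rates
    weights = support / support.sum()
    return {
        "M_FPR": fpr.mean(), "W_FPR": (fpr * weights).sum(),
        "M_FNR": fnr.mean(), "W_FNR": (fnr * weights).sum(),
        "M_F1": f1_score(y_true, y_pred, average="macro"),
        "W_F1": f1_score(y_true, y_pred, average="weighted"),
    }
```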

4.3 Experimental Results

1) Experiments for answering RQ1. We use one project with high-quality labels as the source project to detect another target project. The results are shown in Table 2.

Table 2. Results for RQ1: Cross-Project Multiclass Vulnerability Detection

Methods

M_FPR M_FNR M_F1 W_FPR W_FNR W_F1

Linux Kernel-Firefox

SySeVR AMVD CD-VULD Ours

0.1517 0.1412 0.0487 0.0299

0.8178 0.7660 0.2749 0.1814

0.1915 0.2713 0.6716 0.7513

0.4541 0.4282 0.0697 0.0415

0.5970 0.5537 0.2505 0.1619

0.3840 0.4406 0.7903 0.8583

Linux Kernel-Qemu

SySeVR AMVD CD-VULD Ours

0.1506 0.1273 0.0509 0.0266

0.8161 0.7266 0.2877 0.1562

0.1803 0.2508 0.6775 0.8401

0.3350 0.3084 0.1034 0.0604

0.6927 0.6372 0.2496 0.1248

0.3205 0.3727 0.7619 0.8769

Wireshark-Firefox

SySeVR AMVD CD-VULD Ours

0.1619 0.1520 0.0482 0.0213

0.7664 0.8087 0.2743 0.1729

0.2871 0.2181 0.7296 0.8434

0.2410 0.2461 0.1000 0.0563

0.6855 0.6573 0.2241 0.0923

0.3846 0.3952 0.8067 0.9012

FFmpeg-Firefox

SySeVR AMVD CD-VULD Ours

0.1477 0.1459 0.0664 0.0439

0.7519 0.7583 0.4248 0.2560

0.2949 0.2881 0.5856 0.7208

0.3466 0.4003 0.0845 0.0732

0.6290 0.5951 0.3313 0.2182

0.4089 0.4214 0.7296 0.8159

Linux Kernel-FFmpeg SySeVR AMVD CD-VULD Ours

0.1937 0.1278 0.0397 0.0200

0.7759 0.6879 0.2735 0.1071

0.2353 0.2799 0.6466 0.8559

0.0791 0.1991 0.0595 0.0464

0.7629 0.6235 0.2191 0.0936

0.3291 0.4293 0.7961 0.9177

The most intuitive conclusion is that pure deep learning without any transfer learning technique such as SySeVR completely fails because of the large divergency between projects. Although CD-VULD leverages domain adaptation and uses the same labeled target data with our system, its performance is obviously worse than ours. The average M_F1 and W_F1 of our system are 0.1401 and 0.0971 higher than CD-VULD. The average evaluation metric comparison with other approaches is shown in Fig. 5. Insight1: Our system is effective in cross-project multiclass vulnerability detection and it outperforms other state-of-the-art approaches. 2) Experiments for answering RQ2 We use the prepared extra CWEs of Linux Kernel. In this scenario, we substitute one of the basic CWEs with a new one in each test case. The overall result is shown in Table 3. The parentheses in the first column demonstrate the new vulnerability type and its corresponding label. In this scenario, source project and target project have one mismatched vulnerability type but with the same label. Our system

494

G. Du et al.

Fig. 5. Average metric for cross-project multiclass vulnerability detection. Table 3. Results for RQ2: Cross CWE Type Detection Source-Target

M_FPR M_FNR M_F1

W_FPR W_FNR W_F1

Linux Kernel(CWE-200 as label 6)-FFmpeg SySeVR AMVD CD-VULD Ours

Methods

0.1800 0.1266 0.0406 0.0180

0.7505 0.7278 0.2993 0.1208

0.2480 0.2836 0.6379 0.8039

0.0765 0.2053 0.0589 0.0121

0.7529 0.6215 0.2219 0.1138

0.3335 0.4361 0.7966 0.8933

Linux Kernel(CWE-787 as label 2)-Qemu

SySeVR AMVD CD-VULD Ours

0.1482 0.1385 0.0582 0.0288

0.8018 0.7488 0.3443 0.1498

0.1896 0.2343 0.6268 0.8222

0.3207 0.3157 0.1096 0.0559

0.6923 0.6410 0.2895 0.1421

0.3232 0.3658 0.7204 0.8631

Linux Kernel(CWE-400 as label 3)-Firefox

SySeVR AMVD CD-VULD Ours

0.1539 0.1501 0.0425 0.0305

0.7894 0.7796 0.3085 0.1892

0.2497 0.2577 0.6504 0.7341

0.4197 0.4528 0.0625 0.0349

0.6252 0.5875 0.2241 0.1667

0.3896 0.4099 0.8066 0.8710

also achieves decent performance in this scenario compared with other methods. The average M_F1 and W_F1 are 0.1483 and 0.1013 higher than CD-VULD. We also record the evaluation metric of the mismatched vulnerability type by our approach, as shown in Table 4. Table 4. Performance of our system on mismatched types Source-Target(Evaluated Label) FPR Linux Kernel-FFmpeg(6)

FNR

F1

0.0145 0.0714 0.7647

Linux Kernel-Qemu(2)

0.0126 0.1379 0.8197

Linux Kernel-Firefox(3)

0.0217 0.1000 0.7347

We can conclude from Tables 3 and 4 that in testcase 1, although the F1 of mismatched type is lower than the M_F1, we acquire better FPR and FNR for the mismatched type. In testcase 2 and testcase 3, the F1 of mismatched type is more or less compared with the M_F1. Insight2: Our system also performs well in cross vulnerability type detection scenario and obtains the best results compared with other techniques.

Cross-Project Multiclass Vulnerability Classification

495

3) Experiments for answering RQ3 Equipped with the dataset mentioned above, we are able to conduct detecting experiments for Multi-Source scenario. Experimental results for this scenario are displayed in Table 5. Table 5. Results for RQ3: Multi-Source Scenario Source-Target

Methods

M_FPR M_FNR M_F1

W_FPR W_FNR W_F1

Linux Kernel +Wireshark (0-3)+(4-6)-Firefox

SySeVR AMVD CD-VULD Ours

0.1799 0.1674 0.0588 0.0296

0.8534 0.8074 0.2935 0.1728

0.1638 0.2214 0.6511 0.7681

0.4523 0.4648 0.0759 0.0481

0.7419 0.6798 0.2975 0.1544

0.2712 0.3278 0.7560 0.8631

FFmpeg+ Linux Kernel (0-3)+(4-6) -Qemu

SySeVR AMVD CD-VULD Ours

0.1546 0.1494 0.0571 0.0299

0.8106 0.7631 0.3529 0.1597

0.1812 0.2380 0.6675 0.8181

0.3566 0.3819 0.0451 0.0447

0.6997 0.6461 0.3042 0.1555

0.3103 0.3538 0.7322 0.8546

Wireshark+ Linux Kernel (0-3)+(4-6) -FFmpeg SySeVR AMVD CD-VULD Ours

0.2612 0.1569 0.0479 0.0197

0.7921 0.7716 0.3199 0.1306

0.2214 0.2333 0.6076 0.7816

0.0931 0.0729 0.0609 0.0232

0.8028 0.7410 0.2629 0.1135

0.2799 0.3526 0.7669 0.8997

FFmpeg +Firefox (0-6)+(0-6) -Wireshark

SySeVR AMVD CD-VULD S1-Wire S2-Wire Ours− Ours

0.1511 0.1441 0.0718 0.0358 0.0400 0.0337 0.0298

0.7540 0.7759 0.3826 0.1908 0.2342 0.1984 0.1562

0.2384 0.2144 0.6178 0.7661 0.7428 0.7681 0.7959

0.1201 0.0983 0.0811 0.0342 0.0341 0.0335 0.0325

0.7486 0.7304 0.3777 0.1991 0.2231 0.1898 0.1650

0.3156 0.3408 0.6792 0.8449 0.8280 0.8488 0.8697

Linux Kernel +Qemu (0-6)+(0-6) -Firefox

SySeVR AMVD CD-VULD S1-Firefox S2-Firefox Ours− Ours

0.1668 0.1649 0.1026 0.0325 0.0433 0.0338 0.0301

0.7902 0.7895 0.3908 0.2201 0.2426 0.2217 0.1983

0.2228 0.2206 0.5884 0.7138 0.7003 0.7159 0.7402

0.4082 0.4038 0.1162 0.0547 0.0566 0.0540 0.0472

0.7348 0.7273 0.3700 0.1982 0.2019 0.1996 0.1736

0.2888 0.2921 0.6429 0.8211 0.8179 0.8206 0.8455

Linux Kernel +Wireshark (0-6)+(0-6) -Qemu

SySeVR AMVD CD-VULD S1-Qemu S2-Qemu Ours− Ours

0.1797 0.1767 0.0890 0.0337 0.0349 0.0343 0.0305

0.8013 0.7883 0.4428 0.2245 0.2088 0.2236 0.1776

0.2245 0.2364 0.5496 0.7702 0.7682 0.7759 0.8097

0.3268 0.3337 0.1791 0.0599 0.0603 0.0589 0.0576

0.7518 0.7426 0.4051 0.1709 0.1794 0.1744 0.1521

0.2922 0.2986 0.6223 0.8348 0.8266 0.8338 0.8531

In this table, parentheses in the first column denote the corresponding label picked from each source domain. For example, “Linux Kernel + Wireshark (0-3) + (4-6)” means we pick label 0, 1, 2, 3 from Linux Kernel and label 4, 5, 6 from Wireshark. The upper part of this table demonstrated 3 testcases for subcase 1, and the lower part denotes subcase 2. “S1” in the “Methods” column represents using only the first source project with our system while “S2” represents using the second source project by our system, and “Wire” in the “Methods” column represents “Wireshark”. “Ours− ” refers to removing the multi-source alignment step from our system and is given to indicate the importance of the alignment step. In subcase 2, for detecting with single source, Ours− and our whole system, we hold out the same part of target labeled samples. Obviously, the detecting performance improves when aligning multiple source domains than only single source. Also in subcase 2, there’s a huge performance gap between CD-VULD

496

G. Du et al.

and our system. Moreover, the performance of our system is apparently better than Ours− due to the reason that the distribution divergency of the same class in each source domain interferes the intra-class clustering when drawing close with target domain. The average metric of the 6 testcases compared with other systems is shown in Fig. 6.

Fig. 6. Average evaluation metric of our system and other approaches in multi-source scenario.

Insight3: Our detecting system can hold the multi-source scenario and greatly improves the performance compared with other ones. Besides, the Multi-Source Alignment algorithm significantly raise the detection performance. 4) Experiments for answering RQ4 We conducted two contrast experiments using deep feature extracting approaches in [8] and CD-VULD as mentioned above. Experimental results are shown in Table 6. In this table, “CD-VULD+ ” refers to using the deep feature extracting method of CD-VULD, which only takes code snippet as input. “μvuld+ ” represents adopting system of μvuldeepecker [8] to get deep features, which takes code snippet and the defined features in their paper as input. The rest part is the same with our system. We can infer that CD-VULD+ leads to the worst results and μvuld+ is also worse than ours since it fails to capture vulnerability patterns related to pointers and arrays. Besides, API calls with no variables as parameters are not related to vulnerabilities. The average M_F1 and W_F1 of our system are 0.0639 and 0.0278 higher than μvuld+ while the M_FNR and W_FNR of ours are 0.0454 and 0.0337 less than μvuld+ . Compared with CDVULD+ , our system achieves 0.072 less M_FNR and 0.0656 less W_FNR. The average metric comparison with the other two approaches is shown in Fig. 7. Insight4: Our deep feature extracting method helps to improve the detection effects and performs better than other deep feature extracting approaches.

Cross-Project Multiclass Vulnerability Classification

497

Table 6. Results for RQ4: Evaluating our deep feature extraction strategy Source-Target

Methods

Linux Kernel-Firefox

CD-VULD+ 0.0409 µvuld+ 0.0375 Ours 0.0299

M_FPR M_FNR M_F1 0.2528 0.2272 0.1814

0.6706 0.0562 0.6908 0.0557 0.7513 0.0415

W_FPR W_FNR W_F1 0.2178 0.1988 0.1619

0.8169 0.8291 0.8583

Linux Kernel-Qemu

CD-VULD+ 0.0408 µvuld+ 0.0364 Ours 0.0266

0.2547 0.2338 0.1562

0.7239 0.0813 0.7473 0.0802 0.8401 0.0604

0.1966 0.1709 0.1248

0.8231 0.8413 0.8769

Wireshark-Firefox

CD-VULD+ 0.0345 µvuld+ 0.0262 Ours 0.0213

0.2243 0.1944 0.1729

0.7045 0.0668 0.7695 0.0626 0.8434 0.0563

0.1723 0.1212 0.0923

0.8421 0.8812 0.9012

FFmpeg-Firefox

CD-VULD+ 0.0558 µvuld+ 0.0474 Ours 0.0439

0.3081 0.2979 0.2560

0.6470 0.0814 0.6591 0.0802 0.7208 0.0732

0.2765 0.2348 0.2182

0.7743 0.8010 0.8159

Linux Kernel-FFmpeg CD-VULD+ 0.0304 µvuld+ 0.0260 Ours 0.0200

0.1935 0.1470 0.1071

0.7898 0.0537 0.8251 0.0452 0.8559 0.0464

0.1556 0.1337 0.0936

0.8581 0.8786 0.9177

Fig. 7. Average evaluation metric of our and other deep feature extracting methods.

5 Conclusion and Future Work

We proposed a system that identifies software vulnerability types across different projects at the granularity of code snippets; it is the first system to identify vulnerability types in cross-project settings, which helps when the project to be detected has no high-quality training data. We leverage snippet attention and adopt a feature fusion model for high-level feature extraction, and we extend different domain adaptation techniques to eliminate the distribution divergence between projects. In the future, we will work on vulnerability detection across different programming languages, and we will investigate further deep learning-based domain adaptation techniques to see whether they work in our detection scenarios.

Acknowledgment. This work is partially supported by the National Natural Science Foundation of China (No. 62172407), and the Youth Innovation Promotion Association CAS.


References

1. "CVE". https://cve.mitre.org/
2. "NVD". https://nvd.nist.gov/
3. "Checkmarx" (2019). https://www.checkmarx.com/
4. Xu, Z., Chen, B., Chandramohan, M., Liu, Y., Song, F.: Spain: security patch analysis for binaries towards understanding the pain and pills. In: IEEE/ACM 39th International Conference on Software Engineering (ICSE), pp. 462–472. IEEE (2017)
5. Li, Y., et al.: Cerebro: context-aware adaptive fuzzing for effective vulnerability detection. In: Proceedings of the 2019 27th ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering, pp. 533–544 (2019)
6. Chen, H., et al.: Hawkeye: towards a desired directed grey-box fuzzer. In: Proceedings of the 2018 ACM SIGSAC Conference on Computer and Communications Security, pp. 2095–2108 (2018)
7. Li, Z., et al.: VulDeePecker: a deep learning-based system for vulnerability detection. arXiv preprint arXiv:1801.01681 (2018)
8. Zou, D., Wang, S., Xu, S., Li, Z., Jin, H.: µVulDeePecker: a deep learning-based system for multiclass vulnerability detection. IEEE Transactions on Dependable and Secure Computing (2019)
9. Li, Z., Zou, D., Xu, S., Jin, H., Zhu, Y., Chen, Z.: SySeVR: a framework for using deep learning to detect software vulnerabilities. IEEE Trans. Dependable Secure Comput. 19(4), 2244–2258 (2021)
10. Duan, X., et al.: VulSniper: focus your attention to shoot fine-grained vulnerabilities. In: Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, IJCAI-19, pp. 4665–4671 (2019)
11. Zhu, C., Du, G., Wu, T., Cui, N., Chen, L., Shi, G.: BERT-based vulnerability type identification with effective program representation. In: Wang, L., Segal, M., Chen, J., Qiu, T. (eds.) Wireless Algorithms, Systems, and Applications: 17th International Conference, WASA 2022, Dalian, China, November 24–26, 2022, Proceedings, Part I, pp. 271–282. Springer, Cham (2022). https://doi.org/10.1007/978-3-031-19208-1_23
12. Lin, G., Zhang, J., Luo, W., Pan, L., Xiang, Y.: Poster: vulnerability discovery with function representation learning from unlabeled projects. In: Proceedings of the 2017 ACM SIGSAC Conference on Computer and Communications Security (CCS), pp. 2539–2541. ACM (2017)
13. Lin, G., et al.: Software vulnerability discovery via learning multi-domain knowledge bases. IEEE Trans. Dependable Secure Comput. 18(5), 2469–2485 (2019)
14. Nguyen, V., Le, T., de Vel, O., Montague, P., Grundy, J., Phung, D.: Dual-component deep domain adaptation: a new approach for cross project software vulnerability detection. In: Lauw, H.W., Wong, R.C.-W., Ntoulas, A., Lim, E.-P., Ng, S.-K., Pan, S.J. (eds.) PAKDD 2020. LNCS (LNAI), vol. 12084, pp. 699–711. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-47426-3_54
15. Liu, S., et al.: CD-VulD: cross-domain vulnerability discovery based on deep domain adaptation. IEEE Trans. Dependable Secure Comput. 19(1), 438–451 (2022)
16. Donahue, J., et al.: DeCAF: a deep convolutional activation feature for generic visual recognition. In: International Conference on Machine Learning, pp. 647–655 (2014)


17. Behera, A., Wharton, Z., Hewage, P.R., Bera, A.: Context-aware attentional pooling (CAP) for fine-grained visual classification. In: Proceedings of the AAAI Conference on Artificial Intelligence, pp. 929–937 (2021)
18. "Joern". https://joern.io/
19. "Common Weakness Enumeration". https://cwe.mitre.org/
20. "Word2vec". http://radimrehurek.com/gensim/models/word2vec.html
21. Sun, B., Feng, J., Saenko, K.: Return of frustratingly easy domain adaptation. In: AAAI, vol. 6, p. 8 (2016)
22. Xu, Y., et al.: A unified framework for metric transfer learning. IEEE Trans. Knowl. Data Eng. 29(6), 1158–1171 (2017)
23. Model evaluation metrics (2019). https://scikit-learn.org/stable/modules/model_evaluation.html#model-evaluation

Author Index

B Balaji, Gokulapriyan 317 Balcer, Klaudia 239 Biddulph, Alexander 161

Hou, Jun 3 Houliston, Trent 161 Huang, Ranran 133 Huang, Wenjie 148

C Cai, Jiancheng 133 Cai, Xiaohao 57 Chai, Zhenhua 133 Chalup, Stephan 161 Chen, Jie 72 Chen, Liwei 481 Chen, Zhiming 19 Chen, Zhiwei 32

J Ji, Ruyi 421 Jia, Daixi 434 Jiang, Fei 19 Jiang, Hailan 44 Jiang, Haobo 252 Jin, Hao 280 Jing, Xiao-Yuan 280

D Deng, Jiaxin 292 Du, Gewangzi 481 Duan, Boheng 357 F Fan, Keqiang 57 Fang, Zeqin 452 G Gao, Hang 434 Gao, Shang 408 Gao, Xueyan 118 Gao, Yinfeng 303 Ge, Fengpei 408 Gong, Chen 252 Guo, Chunxi 341 Guo, Weili 252 H Han, Delong 191 Han, Peng 19 Hong, Xina 264

K Khoshkangini, Reza 206 L Lang, Bo 86 Li, Dan 264 Li, Gang 191 Li, Guangyu 252 Li, Jiyi 468 Li, Jun 252 Li, Lin 452 Li, Min 191 Li, Qianmu 3 Li, Shasha 341 Li, Shijun 329 Li, Wen 44 Li, Xian 383 Li, Xubin 383 Li, Yanling 408 Li, Yulian 103 Lian, Zhichao 3 Lin, Lei 226 Lipinski, Piotr 239 Liu, Jianquan 452




Liu, Jing 280 Liu, Shiguang 118 Liu, Tong 72 Liu, Xinmin 133 Liu, Xueyi 303 Liu, Yanxi 86 Liu, Yao 148 Long, Fei 179

Tegnered, Daniel 206 Tian, Zhiliang 341 W Wang, Chengliang 148 Wang, Jun 103 Wang, Junlong 32 Wang, Kaixuan 341 Wang, Pengchuan 3 Wang, Shaodong 370 Wang, Shiqi 179 Wang, Sukun 408 Wang, Ting 341 Wang, Xinyi 396 Wang, Yunfei 383 Wang, Zilong 103 Wei, Weiwei 264 Wen, Zhihua 341 Weng, Yuchen 103 Wu, Fei 280 Wu, Fengge 434 Wu, Tongshuai 481 Wu, Xing 148 Wu, Yanjun 421 Wu, Zhuoyuan 133

M Maity, Krishanu 317 Mashhadi, Peyman 206 Mendes, Alexandre 161 Miao, Zhongyi 408 N Nguyen, Cam-Tu 383 Niranjan, Mahesan 57 P Peng, Weiming 370 Pi, Dechang 32 Ping, Mingtian 32 R Ran, Longrong 148 Ren, Kaijun 357 Ren, Lei 72 Rögnvaldsson, Thorsteinn S Sa, Rina 408 Saha, Sriparna 317 Shi, Gang 481 Shi, Hongguang 383 Shi, Xiaodong 226 Si, Jiaxin 19 Song, Jihua 370 Song, Tianbao 370 Su, Xingzhe 434 Su, Zhanzhi 191 Sun, Jingbo 370 Sun, Yuqing 44 T Tajgardan, Mohsen Tang, Jintao 341

206

206

X Xia, Zhongpu 303 Xie, Yi 44 Xie, Zhongwei 452 Xu, Zhiping 370 Y Yan, Hui 396 Yang, Shufan 3 Yang, Xu 292 Yang, Zailin 148 Yao, Junfeng 179 Yu, Fang 329 Yu, Haiqing 408 Yu, Hang 72 Yu, Wei 329 Yuan, Yuxuan 226 Z Zhang, Haohan 452 Zhang, Libo 421


Zhang, Qichao 303 Zhang, Yeqin 383 Zhao, Chen 421 Zhao, Junsuo 434 Zheng, Jinzhi 421 Zheng, Suiwu 292


Zheng, Yafang 226 Zhou, Mingle 191 Zhou, Yuqian 264 Zhou, Zhenxiong 357 Zhu, Chenguang 481 Zhu, Xiaoke 280