English Pages 869 [865] Year 2021
LNAI 12712
Kamal Karlapalem · Hong Cheng · Naren Ramakrishnan · R. K. Agrawal · P. Krishna Reddy · Jaideep Srivastava · Tanmoy Chakraborty (Eds.)
Advances in Knowledge Discovery and Data Mining 25th Pacific-Asia Conference, PAKDD 2021 Virtual Event, May 11–14, 2021 Proceedings, Part I
Lecture Notes in Artificial Intelligence Subseries of Lecture Notes in Computer Science
Series Editors
Randy Goebel, University of Alberta, Edmonton, Canada
Yuzuru Tanaka, Hokkaido University, Sapporo, Japan
Wolfgang Wahlster, DFKI and Saarland University, Saarbrücken, Germany
Founding Editor
Jörg Siekmann, DFKI and Saarland University, Saarbrücken, Germany
More information about this subseries at http://www.springer.com/series/1244
Editors Kamal Karlapalem IIIT, Hyderabad Hyderabad, India
Hong Cheng Chinese University of Hong Kong Shatin, Hong Kong
Naren Ramakrishnan Virginia Tech Arlington, VA, USA
R. K. Agrawal Jawaharlal Nehru University New Delhi, India
P. Krishna Reddy IIIT Hyderabad Hyderabad, India
Jaideep Srivastava University of Minnesota Minneapolis, MN, USA
Tanmoy Chakraborty IIIT Delhi New Delhi, India
ISSN 0302-9743
ISSN 1611-3349 (electronic)
Lecture Notes in Artificial Intelligence
ISBN 978-3-030-75761-8
ISBN 978-3-030-75762-5 (eBook)
https://doi.org/10.1007/978-3-030-75762-5
LNCS Sublibrary: SL7 – Artificial Intelligence
© Springer Nature Switzerland AG 2021
This work is subject to copyright. All rights are reserved by the Publisher, whether the whole or part of the material is concerned, specifically the rights of translation, reprinting, reuse of illustrations, recitation, broadcasting, reproduction on microfilms or in any other physical way, and transmission or information storage and retrieval, electronic adaptation, computer software, or by similar or dissimilar methodology now known or hereafter developed.
The use of general descriptive names, registered names, trademarks, service marks, etc. in this publication does not imply, even in the absence of a specific statement, that such names are exempt from the relevant protective laws and regulations and therefore free for general use.
The publisher, the authors and the editors are safe to assume that the advice and information in this book are believed to be true and accurate at the date of publication. Neither the publisher nor the authors or the editors give a warranty, expressed or implied, with respect to the material contained herein or for any errors or omissions that may have been made. The publisher remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
This Springer imprint is published by the registered company Springer Nature Switzerland AG
The registered company address is: Gewerbestrasse 11, 6330 Cham, Switzerland
General Chairs’ Preface
On behalf of the Organizing Committee, it is our great pleasure to welcome you to the 25th Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD 2021). Starting in 1997, PAKDD has long established itself as one of the leading international conferences in data mining and knowledge discovery. Held during May 11–14, 2021, PAKDD returned to India for the second time, after a gap of 11 years, moving from Hyderabad in 2010 to New Delhi in 2021. Due to the unexpected COVID-19 pandemic, the conference was held fully online, and we made all the conference sessions accessible online to participants around the world. Our gratitude goes first and foremost to the researchers who submitted their work to the PAKDD 2021 main conference, workshops, and data mining contest. We thank them for their efforts in research, as well as in preparing high-quality online presentation videos. It is our distinct honor that five eminent keynote speakers graced the conference: Professor Anil Jain of Michigan State University, USA; Professor Masaru Kitsuregawa of Tokyo University and the National Institute of Informatics, Japan; Dr. Lada Adamic of Facebook; Professor Fabrizio Sebastiani of ISTI-CNR, Italy; and Professor Sunita Sarawagi of IIT Bombay, India. Each of them is a leader of international renown in their respective areas, and we look forward to their participation. Given the importance of data science, not just to academia but also to industry, we are pleased to have two distinguished industry speakers. The conference program was further enriched with three high-quality tutorials, eight workshops on cutting-edge topics, and one data mining contest on the prediction of memory failures.
We would like to express our sincere gratitude for the contributions of the Senior Program Committee (SPC) members, Program Committee (PC) members, and anonymous reviewers, led by the PC co-chairs, Kamal Karlapalem (IIIT, Hyderabad), Hong Cheng (CUHK), and Naren Ramakrishnan (Virginia Tech). It is through their untiring efforts that the conference has an excellent technical program. We are also thankful to the other Organizing Committee members: industry co-chairs, Gautam Shroff (TCS) and Srikanta Bedathur (IIT Delhi); workshop co-chairs, Ganesh Ramakrishnan (IIT Bombay) and Manish Gupta (Microsoft); tutorial co-chairs, B. Ravindran (IIT Madras) and Naresh Manwani (IIIT Hyderabad); publicity co-chairs, Sonali Agarwal (IIIT Allahabad), R. Uday Kiran (University of Aizu), and Jerry C-W Lin (Western Norway University of Applied Sciences); competitions chair, Mengling Feng (NUS); proceedings chair, Tanmoy Chakraborty (IIIT Delhi); and registration/local arrangement co-chairs, Vasudha Bhatnagar (University of Delhi), Vikram Goyal (IIIT Delhi), Naveen Kumar (University of Delhi), Rajiv Ratn Shah (IIIT Delhi), Arvind Agarwal (IBM), Aditi Sharan (JNU), Mukesh Kumar Giluka (JNU), and Dhirendra Kumar (DTU). We appreciate the hosting organizations, IIIT Hyderabad and JNU, Delhi, and all our sponsors for their institutional and financial support of PAKDD 2021. We also appreciate Alibaba for sponsoring the data mining contest. We feel indebted to the PAKDD Steering Committee for its continuing guidance and sponsorship of the paper and student travel awards. Finally, our sincere thanks go to all the participants and volunteers. There would be no conference without you. We hope all of you enjoy PAKDD 2021.
May 2021
R. K. Agrawal P. Krishna Reddy Jaideep Srivastava
PC Chairs’ Preface
It is our great pleasure to present the 25th Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD 2021). PAKDD is a premier international forum for exchanging original research results and practical developments in KDD-related areas, including data science, machine learning, and emerging applications. We received 768 submissions from across the world. We performed an initial screening of all submissions, leading to the desk rejection of 89 submissions due to violations of the double-blind and page limit guidelines. Six papers were also withdrawn by authors during the review period. For submissions entering the double-blind review process, each paper received at least three reviews from PC members. Further, an assigned SPC member led a discussion of the paper and reviews with the PC members. The PC co-chairs then considered the recommendations and meta-reviews from SPC members in making the final decision. As a result, 157 papers were accepted, yielding an acceptance rate of 20.4%. The COVID-19 pandemic posed several challenges to the reviewing process, and we appreciate the diligence of all reviewers, PC members, and SPC members in ensuring a high-quality PAKDD 2021 program. The conference was conducted in an online environment, with accepted papers presented via a pre-recorded video presentation with a live Q&A session. The conference program also featured five keynotes from distinguished researchers in the community, one most influential paper talk, two invited industrial talks, eight cutting-edge workshops, three comprehensive tutorials, and one dedicated data mining competition session. We wish to sincerely thank all SPC members, PC members, and external reviewers for their invaluable efforts in ensuring a timely, fair, and highly effective PAKDD 2021 program.
May 2021
Hong Cheng Kamal Karlapalem Naren Ramakrishnan
Organization
Organization Committee

General Co-chairs
R. K. Agrawal           Jawaharlal Nehru University, India
P. Krishna Reddy        IIIT Hyderabad, India
Jaideep Srivastava      University of Minnesota, USA

Program Co-chairs
Kamal Karlapalem        IIIT Hyderabad, India
Hong Cheng              The Chinese University of Hong Kong, China
Naren Ramakrishnan      Virginia Tech, USA

Industry Co-chairs
Gautam Shroff           TCS Research, India
Srikanta Bedathur       IIT Delhi, India

Workshop Co-chairs
Ganesh Ramakrishnan     IIT Bombay, India
Manish Gupta            Microsoft Research, India

Tutorial Co-chairs
B. Ravindran            IIT Madras, India
Naresh Manwani          IIIT Hyderabad, India

Publicity Co-chairs
Sonali Agarwal          IIIT Allahabad, India
R. Uday Kiran           The University of Aizu, Japan
Jerry Chau-Wei Lin      Western Norway University of Applied Sciences, Norway

Sponsorship Chair
P. Krishna Reddy        IIIT Hyderabad, India

Competitions Chair
Mengling Feng           National University of Singapore, Singapore

Proceedings Chair
Tanmoy Chakraborty      IIIT Delhi, India
Registration/Local Arrangement Co-chairs
Vasudha Bhatnagar       University of Delhi, India
Vikram Goyal            IIIT Delhi, India
Naveen Kumar            University of Delhi, India
Arvind Agarwal          IBM Research, India
Rajiv Ratn Shah         IIIT Delhi, India
Aditi Sharan            Jawaharlal Nehru University, India
Mukesh Kumar Giluka     Jawaharlal Nehru University, India
Dhirendra Kumar         Delhi Technological University, India
Steering Committee
Longbing Cao              University of Technology Sydney, Australia
Ming-Syan Chen            National Taiwan University, Taiwan, ROC
David Cheung              University of Hong Kong, China
Gill Dobbie               The University of Auckland, New Zealand
Joao Gama                 University of Porto, Portugal
Zhiguo Gong               University of Macau, Macau
Tu Bao Ho                 Japan Advanced Institute of Science and Technology, Japan
Joshua Z. Huang           Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, China
Masaru Kitsuregawa        Tokyo University, Japan
Rao Kotagiri              University of Melbourne, Australia
Jae-Gil Lee               Korea Advanced Institute of Science and Technology, South Korea
Ee-Peng Lim               Singapore Management University, Singapore
Huan Liu                  Arizona State University, USA
Hiroshi Motoda            AFOSR/AOARD and Osaka University, Japan
Jian Pei                  Simon Fraser University, Canada
Dinh Phung                Monash University, Australia
P. Krishna Reddy          International Institute of Information Technology, Hyderabad (IIIT-H), India
Kyuseok Shim              Seoul National University, South Korea
Jaideep Srivastava        University of Minnesota, USA
Thanaruk Theeramunkong    Thammasat University, Thailand
Vincent S. Tseng          National Chiao Tung University, Taiwan, ROC
Takashi Washio            Osaka University, Japan
Geoff Webb                Monash University, Australia
Kyu-Young Whang           Korea Advanced Institute of Science and Technology, South Korea
Graham Williams           Australian National University, Australia
Min-Ling Zhang            Southeast University, China
Chengqi Zhang             University of Technology Sydney, Australia
Ning Zhong                Maebashi Institute of Technology, Japan
Zhi-Hua Zhou              Nanjing University, China
Senior Program Committee
Fei Wang                  Cornell University, USA
Albert Bifet              Universite Paris-Saclay, France
Alexandros Ntoulas        University of Athens, Greece
Anirban Dasgupta          IIT Gandhinagar, India
Arnab Bhattacharya        IIT Kanpur, India
B. Aditya Prakash         Georgia Institute of Technology, USA
Bart Goethals             Universiteit Antwerpen, Belgium
Benjamin C. M. Fung       McGill University, Canada
Bin Cui                   Peking University, China
Byung Suk Lee             University of Vermont, USA
Chandan K. Reddy          Virginia Tech, USA
Chang-Tien Lu             Virginia Tech, USA
Fuzhen Zhuang             Institute of Computing Technology, Chinese Academy of Sciences, China
Gang Li                   Deakin University, Australia
Gao Cong                  Nanyang Technological University, Singapore
Guozhu Dong               Wright State University, USA
Hady Lauw                 Singapore Management University, Singapore
Hanghang Tong             University of Illinois at Urbana-Champaign, USA
Hongyan Liu               Tsinghua University, China
Hui Xiong                 Rutgers University, USA
Huzefa Rangwala           George Mason University, USA
Jae-Gil Lee               KAIST, South Korea
Jaideep Srivastava        University of Minnesota, USA
Jia Wu                    Macquarie University, Australia
Jian Pei                  Simon Fraser University, Canada
Jianyong Wang             Tsinghua University, China
Jiuyong Li                University of South Australia, Australia
Kai Ming Ting             Federation University, Australia
Kamalakar Karlapalem      IIIT Hyderabad, India
Krishna Reddy P.          International Institute of Information Technology, Hyderabad, India
Lei Chen                  Hong Kong University of Science and Technology, China
Longbing Cao              University of Technology Sydney, Australia
Manish Marwah             Micro Focus, USA
Masashi Sugiyama          RIKEN, The University of Tokyo, Japan
Ming Li                   Nanjing University, China
Nikos Mamoulis            University of Ioannina, Greece
Peter Christen            The Australian National University, Australia
Qinghua Hu                Tianjin University, China
Rajeev Raman Raymond Chi-Wing Wong Sang-Wook Kim Sheng-Jun Huang Shou-De Lin Shuigeng Zhou Shuiwang Ji Takashi Washio Tru Hoang Cao Victor S. Sheng Vincent Tseng Wee Keong Ng Weiwei Liu Wu Xindong Xia Hu Xiaofang Zhou Xing Xie Xintao Wu Yanchun Zhang Ying Li Yue Xu Yu-Feng Li Zhao Zhang
University of Leicester, UK Hong Kong University of Science and Technology,