About Me

I am an Assistant Professor in the Department of Computer Science and Application at the Indian Institute of Science Education and Research (IISER), Kolkata. Before joining IISER Kolkata, I was a Post-doctoral Researcher at GESIS – Leibniz Institute for the Social Sciences, Cologne, Germany, working with Dr. Philipp Mayr on the DFG-funded project ConData. Prior to that, I was a Senior Research Fellow at the Indian Statistical Institute, Kolkata. My PhD supervisor was Dr. Mandar Mitra, Associate Professor in the Computer and Communication Science Division, Indian Statistical Institute, Kolkata.

Work Experience


Assistant Professor
2020 August - till date Department of Computer Science and Application
Indian Institute of Science Education and Research, Kolkata
Post-doctoral Researcher
2019 July - 2020 July Information & Data Retrieval Team
GESIS - Leibniz Institute for the Social Sciences, Cologne, Germany

Research

Word Embedding for Information Retrieval

Bibliographic Citation Recommendation

Knowledge Graph

Retrieval Functions

Relevance Feedback

Lucene

Publications

2021

  • Thomas Krämer, Zeljko Carevic, Dwaipayan Roy, Claus-Peter Klas, Philipp Mayr. ConSTR: A Contextual Search Term Recommender. JCDL 2021 [Preprint]
2020

  • Suchana Datta, Derek Greene, Debasis Ganguly, Dwaipayan Roy, Mandar Mitra. Where's the Why? In Search of Chains of Causes for Query Events. AICS 2020 [Link]
  • Zeljko Carevic, Dwaipayan Roy, Philipp Mayr. Characteristics of Dataset Retrieval Sessions: Experiences from a Real-life Digital Library. TPDL 2020 [Link]
  • Suchana Datta, Debasis Ganguly, Dwaipayan Roy, Derek Greene, Francesca Bonin, Charles Jochim. Overview of the Causality-driven Adhoc Information Retrieval (CAIR) task at FIRE-2020. FIRE 2020 [Link]
  • Suchana Datta, Debasis Ganguly, Dwaipayan Roy, Francesca Bonin, Charles Jochim, Mandar Mitra. Retrieving Potential Causes from a Query Event. SIGIR 2020 [Link] [Code]
  • Dwaipayan Roy, Sumit Bhatia, Prateek Jain. A Topic-Aligned Multilingual Corpus of Wikipedia Articles for Studying Information Asymmetry in Low Resource Languages. LREC 2020 [Link] [Data] [Code]

2019

  • Dwaipayan Roy, Sourav Saha, Mandar Mitra, Bihan Sen, Debasis Ganguly. I-REX: A Lucene Plugin for EXplainable IR. CIKM 2019 [Link] [Code] [demo]
  • Chandan Biswas, Debasis Ganguly, Dwaipayan Roy, Ujjwal Bhattacharya. Privacy Preserving Approximate K-means Clustering. CIKM 2019 [Link]
  • Dwaipayan Roy, Sumit Bhatia, Mandar Mitra. Selecting Discriminative Terms for Relevance Model. SIGIR 2019 [Link] [Code]

2018

  • Debasis Ganguly, Haithem Afli, Dwaipayan Roy. Word Embedding based Semantic Cross-Lingual Document Alignment in Comparable Corpora. FIRE 2018 [Link]
  • Dwaipayan Roy, Debasis Ganguly, Sumit Bhatia, Srikanta Bedathur, Mandar Mitra. Using Word Embeddings for Information Retrieval: How Collection and Term Normalization Choices Affect Performance. CIKM 2018 [Link] [Preprint]
  • Dwaipayan Roy, Debasis Ganguly, Mandar Mitra, Gareth J. F. Jones. Estimating Gaussian Mixture Models in the Local Neighbourhood of Embedded Word Vectors for Query Performance Prediction. Elsevier IP&M [Link] [Preprint]
  • Dwaipayan Roy, Mandar Mitra, Debasis Ganguly. To Clean or not to Clean: Document Preprocessing and Reproducibility. ACM JDIQ [Link] [Preprint] [Code]

2017

  • Dwaipayan Roy. Word Embedding based Approaches for Information Retrieval. FDIA 2017 [Link]
  • Dwaipayan Roy. An Improved Test Collection and Baselines for Bibliographic Citation Recommendation. CIKM 2017 [Link] [Preprint] [Data]

2016

  • Dwaipayan Roy, Kunal Ray, Mandar Mitra. From a Scholarly Big Dataset to a Test Collection for Bibliographic Citation Recommendation. AAAI Workshop: Scholarly Big Data 2016: 705-710 [Link]
  • Dwaipayan Roy, Debasis Ganguly, Mandar Mitra, Gareth J. F. Jones. Word Vector Compositionality based Relevance Feedback using Kernel Density Estimation. CIKM 2016: 1281-1290 [Link]
  • Dwaipayan Roy, Debjyoti Paul, Mandar Mitra, Utpal Garain. Using Word Embeddings for Automatic Query Expansion. [Link]
  • Dwaipayan Roy, Debasis Ganguly, Mandar Mitra, Gareth J. F. Jones. Representing Documents and Queries as Sets of Word Embedded Vectors for Information Retrieval. [Link - soon] [Preprint]

2015 and earlier

  • Debasis Ganguly, Dwaipayan Roy, Mandar Mitra, Gareth J. F. Jones. Word Embedding based Generalized Language Model for Information Retrieval. SIGIR 2015: 795-798 [Link] [Preprint]
  • Ayan Bandyopadhyay, Dwaipayan Roy, Mandar Mitra, Sanjay Saha. Named Entity Recognition from Tweets. LWA 2014 [Link]
  • Dwaipayan Roy, Ayan Bandyopadhyay, Mandar Mitra. A Simple Context Dependent Suggestion System. TREC 2013 [Link]
  • Jamuna Kanta Sing, Dwaipayan Roy, Dipak Kumar Basu, Mita Nasipuri. Generalized Diagonal 2D FLDA for Efficient Face Recognition. International Conference on Communications, Devices and Intelligent Systems (CODIS), 2012 [Link]
  • Dwaipayan Roy. Generalized Diagonal Fisher Linear Discriminant Analysis for Efficient Face Recognition. Master's Thesis, submitted to Jadavpur University, Kolkata, 2012 [Link]
Curriculum Vitae

    I completed my bachelor's degree at Ramakrishna Mission Vidyamandira, Belurmath, in 2009 and my master's degree at Jadavpur University, Kolkata, in 2012. I worked as a Junior Research Fellow at the Indian Statistical Institute (ISI), Kolkata, from 2012 to 2014, and then as a Senior Research Fellow from 2014. I received my PhD in Computer Science from the Indian Statistical Institute, Kolkata, in June 2019. From July 2019 to July 2020, I was a Post-doctoral Researcher at GESIS – Leibniz Institute for the Social Sciences, Cologne, Germany, working with Dr. Philipp Mayr.

    Contact

    dwaipayan.roy@iiserkol.ac.in

    Indian Institute of Science Education And Research Kolkata
    Mohanpur, Nadia - 741 246
    West Bengal, India