About Me

I am working as Assistant Professor in the Department of Computational and Data Sciences at Indian Institute of Science Education and Research, Kolkata. Before joining IISER, Kolkata, I was a Post-doctoral Researcher at GESIS – Leibniz Institute for the Social Sciences, Cologne, Germany working with Dr. Philipp Mayr in the DFG funded project ConData. Prior to that, I was a senior research fellow at Indian Statistical Institute, Kolkata. My PhD. supervisor was Prof. Mandar Mitra, Professor at Computer and Communication Science Division, Indian Statistical Institute, Kolkata.

Work Experience


Assistant Professor
2020 August - till date Department of Computational and Data Sciences
Indian Institute of Science Education and Research, Kolkata

Post Doctoral Researcher
2019 July - 2020 July Information & Data Retrieval Team
GESIS - Leibniz Institute for the Social Sciences, Cologne, Germany

Research

Word Embedding for Information Retrieval

Bibliographic Citation Recommendation

Knowledge Graph

Retrieval Functions

Relevance Feedback

Lucene

Publications


2024

  • Priyangshu Datta, Suchana Datta, Dwaipayan Roy. RAGing Against the Literature: LLM-Powered Dataset Mention Extraction. JCDL 2024 [Link (soon)]
  • Suchana Datta, Dwaipayan Roy, Derek Greene, Gerardine Meaney. Unveiling Temporal Trends in 19th Century Literature: An Information Retrieval Approach. JCDL 2024 [Link (soon)]
  • Soumyadeep Sar, Dwaipayan Roy. Indigo at CheckThat! 2024: Using Setfit: A Resource Efficient Technique for Subjectivity Detection in News Article. CLEF 2024 [Link]
  • Shivam Kumar, Dwaipayan Roy. Comparative Analysis of Knowledge Graphs Constructed from Fake News and Legitimate News Sources. Wiki Workshop 2024 [Link]
  • Aman Sinha, Priyanshu Raj Mall, Dwaipayan Roy. Exploring the Nexus Between Retrievability and Query Generation Strategies. ECIR 2024 [Link] [Preprint]

  • 2023

  • Aman Sinha, Priyanshu Raj Mall, Dwaipayan Roy. A Comparative Analysis of Retrievability and PageRank Measures. FIRE 2023 [Link] [Preprint]
  • Aman Sinha, Priyanshu Raj Mall, Dwaipayan Roy. Findability: A Novel Measure of Information Accessibility. CIKM 2023 [Link] [Preprint]
  • Aman Sinha, Priyanshu Raj Mall, Dwaipayan Roy. A Comparative Analysis of Retrievability and PageRank Measures FIRE 2023 [Link]
  • Subinay Adhikary, Dwaipayan Roy, Debasis Ganguly, Shouvik Kumar Guha, Kripabandhu Ghosh. LeDA: A System for Legal Data Annotation. JURIX 2023 [Link]
  • Dwaipayan Roy, Zeljko Carevic, Philipp Mayr. Retrievability in an Integrated Retrieval System: An Extended Study. International Journal on Digital Libraries (IJDL) [Link] [Preprint]
  • Debanjan Dutta, Dipasree pal, Dwaipayan Roy, Mandar Mitra. Bibliography Counselor: A Citation Recommendation Tool. 23nd ACM/IEEE Joint Conference on Digital Libraries (JCDL 2023) [Link]

  • 2022

  • Dwaipayan Roy, Zeljko Carevic, Philipp Mayr. Studying Retrievability of Publications and Datasets in an Integrated Retrieval System. 22nd ACM/IEEE Joint Conference on Digital Libraries (JCDL 2022) [Link] [Preprint]
  • Sourav Saha, Dwaipayan Roy, Mandar Mitra. On Modifying Evaluation Measures to Deal with Ties in Ranked Lists. 22nd ACM/IEEE Joint Conference on Digital Libraries (JCDL 2022) [Link]
  • Dwaipayan Roy, Mandar Mitra, Philipp Mayr. Local or Global? A Comparative Study on Applications of Embedding Models for Information Retrieval. 9th ACM IKDD CODS and 27th COMAD 5th Joint International Conference on Data Science & Management of Data [Link]

  • 2021

  • Dwaipayan Roy, Sumit Bhatia, Prateek Jain. Information Asymmetry in Wikipedia Across Different Languages: A Statistical Analysis. Journal of the Association for Information Science and Technology [Link] [Code]
  • Suraj Agrawal, Dwaipayan Roy, Mandar Mitra. Tag Embedding Based Personalized Point Of Interest Recommendation System. Elsevier Journal Information Processing & Management. [Link] [Preprint] [Code]
  • Thomas Krämer, Zeljko Carevic, Dwaipayan Roy, Claus-Peter Klas, Philipp Mayr. ConSTR: A Contextual Search Term Recommender. JCDL 2021 [Link] [Preprint]

  • 2020

  • Suchana Datta, Derek Greene, Debasis Ganguly, Dwaipayan Roy, Mandar Mitra. Where's the Why? In Search of Chains of Causes for Query Events. AICS 2020 [Link]
  • Zeljko Carevic, Dwaipayan Roy, Philipp Mayr. Characteristics of Dataset Retrieval Sessions: Experiences from a Real-life Digital Library. TPDL 2020 [Link]
  • Suchana Datta, Debasis Ganguly, Dwaipayan Roy, Derek Greene, Francesca Bonin, Charles Jochim. Overview of the Causality-driven Adhoc Information Retrieval (CAIR) task at FIRE-2020. FIRE 2020 [Link]
  • Suchana Datta, Debasis Ganguly, Dwaipayan Roy, Francesca Bonin, Charles Jochim, Mandar Mitra. Retrieving Potential Causes from a Query Event. SIGIR 2020 [Link] [Code]
  • Dwaipayan Roy, Sumit Bhatia, Prateek Jain. A Topic-Aligned Multilingual Corpus of Wikipedia Articles for Studying Information Asymmetry in Low Resource Languages. LREC 2020 [Link] [Data] [Code]

  • 2019

  • Dwaipayan Roy, Debasis Ganguly, Mandar Mitra, Gareth J.F. Jones. Estimating Gaussian Mixture Models in the Local Neighbourhood of Embedded Word Vectors for Query Performance Prediction. Elsevier Information Processing & Management [Link] [Preprint]
  • Dwaipayan Roy, Sourav Saha, Mandar Mitra, Bihan Sen, Debasis Ganguly. I-REX: A Lucene Plugin for EXplainable IR. CIKM 2019 [Link] [Code] [demo]
  • Chandan Biswas, Debasis Ganguly, Dwaipayan Roy, Ujjwal Bhattacharya. Privacy Preserving Approximate K-means Clustering. CIKM 2019 [Link]
  • Dwaipayan Roy, Sumit Bhatia, Mandar Mitra. Selecting Discriminative Terms for Relevance Model. SIGIR 2019 [Link] [Code]

  • 2018

  • Debasis Ganguly, Haithem Afli, Dwaipayan Roy. Word Embedding based Semantic Cross-Lingual Document Alignment in Comparable Corpora. FIRE 2018 [Link]
  • Dwaipayan Roy, Debasis Ganguly, Sumit Bhatia, Srikanta Bedathur, Mandar Mitra. Using Word Embeddings for Information Retrieval: How Collection and Term Normalization Choices Affect Performance. CIKM 2018 [Link] [Preprint]
  • Dwaipayan Roy, Mandar Mitra, Debasis Ganguly. To Clean or not to Clean: Document Preprocessing and Reproducibility. ACM JDIQ [Link] [Preprint] [Code]

  • 2017

  • Dwaipayan Roy. Word Embedding based Approaches for Information Retrieval. FDIA 2017 [Link]
  • Dwaipayan Roy. An Improved Test Collection and Baselines for Bibliographic Citation Recommendation. CIKM 2017 [Link] [Preprint] [Data]

  • 2016

  • Dwaipayan Roy, Kunal Ray, Mandar Mitra. From a Scholarly Big Dataset to a Test Collection for Bibliographic Citation Recommendation. AAAI Workshop: Scholarly Big Data 2016: 705-710 [Link]
  • Dwaipayan Roy, Debasis Ganguly, Mandar Mitra, Gareth J. F. Jones. Word Vector Compositionality based Relevance Feedback using Kernel Density Estimation. CIKM 2016: 1281-1290 [Link]
  • Dwaipayan Roy, Debjyoti Paul, Mandar Mitra, Utpal Garain. Using Word Embeddings for Automatic Query Expansion. [Link]
  • Dwaipayan Roy, Debasis Ganguly, Mandar Mitra, Gareth J. F. Jones. Representing Documents and Queries as Sets of Word Embedded Vectors for Information Retrieval. [Link] [Preprint]

  • - 2015

  • Debasis Ganguly, Dwaipayan Roy, Mandar Mitra, Gareth J. F. Jones. Word Embedding based Generalized Language Model for Information Retrieval. SIGIR 2015: 795-798 [Link] [Preprint]
  • Ayan Bandyopadhyay, Dwaipayan Roy, Mandar Mitra, Sanjay Saha. Named Entity Recognition from Tweets. LWA 2014 [Link]
  • Dwaipayan Roy, Ayan Bandyopadhyay, Mandar Mitra. A Simple Context Dependent Suggestion System. TREC 2013 [Link]
  • Jamuna Kanta Sing, Dwaipayan Roy, Dipak Kumar Basu, Mita Nasipuri. Generalized Diagonal 2D FLDA for Efficient Face Recognition. International Conference on Communications, Devices and Intelligent Systems (CODIS), 2012 [Link]
  • Dwaipayan Roy. Generalized Diagonal Fisher Linear Discriminant Analysis for Efficient Face Recognition. Master's Thesis, submitted to Jadavpur University, Kolkata, 2012 [Link]
  • Curriculum Vitae

    I finished my bachelor's from Ramakrishna Mission Vidyamandira, Belurmath, and master's from Jadavpur University, Kolkata respectively in the year 2009 and 2012. I worked as a Junior Research Fellow at Indian Statistical Institute (ISI), Kolkata from 2012 to 2014. After working as a Senior Research Fellow from 2014, I received PhD in Computer Science from Indian Statistical Institute, Kolkata in June, 2019. Presently, I am working as an assistant professor at Indian Institute of Science and Education, Kolkata after serving as a Post-doctoral Researcher at GESIS – Leibniz Institute for the Social Sciences, Cologne, Germany working with Dr. Philipp Mayr.

    Contact

    dwaipayan.roy@iiserkol.ac.in

    Indian Institute of Science Education And Research Kolkata
    Mohanpur, Nadia - 741 246
    West Bengal, India