Homepage of Vamsi Meduri
I am a Software Engineer at Amazon Web Services (AWS) where I work in the Amazon Redshift Query Processing team. My interests lie at the intersection of AI/ML and database engines. Prior to this, I was a Staff Research Scientist in the database systems group at IBM Research - Silicon Valley and IBM Almaden Research Center where I worked on predictive table optimization for lakehouse systems and also towards extending hybrid search and generative AI components for RAG-based solutions on vector databases. I graduated with a Ph.D from Arizona State University where I worked in the Data Systems Lab under the supervision of Prof. Mohamed Sarwat. My M.S. degree was from the National University of Singapore where I was mentored by Prof. Kian-Lee Tan.
News
- [06/29/2026] Started employment at Amazon Web Services (AWS) in the Redshift team.
- [05/08/2026] Honored to receive a Distinguished PC Award for the ICDE Research Track 2026. Grateful to the PC Chairs - Alkis Simitsis, Khuzaima Daudjee, and Qiong Luo for the recognition.
- [03/03/2026, 04/08/2026] Invited talk at Adobe Experience Platform hosted by Dr. Yunayo Li and at CS Colloquium at Marquette University hosted by Dr. Kanchan Chowdhury respectively. Here is an abstract of the talk.
- [02/12/2026] Published a blog post about the semantic retrieval accuracy of IBM Fusion CAS.
- [12/20/2025] A paper titled “Schema-GraphRAG: Bridging Hybrid Search and Graph Traversal for Complex Retrieval Tasks” was accepted in the IEEE ICDE 2026 demo track.
- [12/08/2025] Relocated to IBM Research - Silicon Valley following the closure of the Almaden Lab.
- [11/24/2025] A paper titled “PTO: A Workload-driven Predictive Table Optimizer for Lakehouse Systems” was accepted in the ACM SIGMOD 2026 research track.
- [06/15/2025] Received an outstanding technical achievement award from IBM Research for the delivery of query optimizer in Watsonx.data 2.0 and order of magnitude performance improvement for TPC-DS 100 TBytes on Fusion HCI.
- [10/17/2023] A journal paper titled “ALFA: Active Learning for Graph Neural Network-based Semantic Schema Alignment” was accepted in The VLDB Journal: Special issue on Machine Learning and Databases 2024.
- [09/04/2023] Received a best reviewer award for the VLDB 2023 PhD Workshop.
- [03/29/2023] Received a first time patent application invention achievement award from IBM for a patent on active learning-based ontology alignment.
- [05/23/2022] Started employment at the IBM Almaden Research Center.
- [05/12/2022] Successfully defended my Ph.D dissertation on “Human-in-the-Loop Machine Learning Systems for Data Integration and Predictive Analytics”. Here is my acknowledgement thanking everyone who supported me through my doctoral studies.
- [03/23/2022] A paper on “ML-aware Spatial Re-partitioning” was accepted in IEEE ICDE 2022 research track.
- [08/15/2021] A poster paper titled “GEM: An Efficient Entity Matching Framework for Geospatial Data” was accepted in ACM SIGSPATIAL 2021.
- [08/13/2021] Completed a summer internship in the Database Group at the IBM Almaden Research Center under the mentorship of Abdul Quamar, Chuan Lei, Xiao Qin and Berthold Reinwald.
- [12/09/2020] A journal paper on “Evaluation of ML Algorithms in Predicting the Next SQL Query from the Future” was accepted in ACM TODS 2021.
- [09/11/2020] Completed a summer internship in the Database Group at the IBM Almaden Research Center. Thanks to my mentors, Abdul Quamar, Chuan Lei, Vasilis Efthymiou, and my manager, Fatma Özcan, for an enriching learning experience.
- [11/25/2019] A paper on “Active Learning for Entity Matching” was accepted in the ACM SIGMOD 2020 research track.
- [11/07/2019] A journal paper on “Mining Expressive Rules in Knowledge Graphs” was accepted in ACM JDIQ 2019.
- [08/08/2019] Completed the two year long collaboration between SRP & CASCADE @ ASU on the data reconciliation of diverse electric schemata.
- [06/05/2019] Presented my research work on query workload prediction at CASCADE-AMEX Big Data Deep Dive.
- [01/02/2019] A short paper on “RNNs for User Intent Prediction during Data Exploration” was accepted in EDBT 2019.
