Publications

You can also find my articles on my Google Scholar profile.

Preference Proxies: Evaluating Large Language Models in capturing Human Preferences in Human-AI Tasks

Published in the ICML Workshop on Theory of Mind in Communicating Agents and the ICML Workshop on The Many Facets of Preference-Based Learning, 2023

Recommended citation: Verma, Mudit, Siddhant Bhambri, and Subbarao Kambhampati. "Preference Proxies: Evaluating Large Language Models in capturing Human Preferences in Human-AI Tasks." In ICML 2023 Workshop The Many Facets of Preference-Based Learning. 2023. https://sbhambr1.github.io/files/Preference%20Proxies:%20Evaluating%20Large%20Language%20Models%20in%20capturing%20Human%20Preferences%20in%20Human-AI%20Tasks.pdf

Exploiting Unlabeled Data for Feedback Efficient Human Preference based Reinforcement Learning

Published in the AAAI Workshop on Representation Learning for Responsible Human-Centric AI (R2HCAI) and the ICML Workshop on The Many Facets of Preference-Based Learning, 2023

Recommended citation: Verma, Mudit, Siddhant Bhambri, and Subbarao Kambhampati. "Exploiting Unlabeled Data for Feedback Efficient Human Preference based Reinforcement Learning." arXiv preprint arXiv:2302.08738 (2023). https://arxiv.org/abs/2302.08738

Using Deception in Markov Game to Understand Adversarial Behaviors through a Capture-The-Flag Environment

Published in Decision and Game Theory for Security: 13th International Conference, GameSec, 2022

Recommended citation: Bhambri, Siddhant, Purv Chauhan, Frederico Araujo, Adam Doupé, and Subbarao Kambhampati. "Using Deception in Markov Game to Understand Adversarial Behaviors Through a Capture-The-Flag Environment." In International Conference on Decision and Game Theory for Security, pp. 87-106. Cham: Springer International Publishing, 2022. https://arxiv.org/pdf/2210.15011

Contrastively Learning Visual Attention as Affordance Cues from Demonstrations for Robotic Grasping

Published in IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2021

Recommended citation: Y. Zha, S. Bhambri and L. Guan, "Contrastively Learning Visual Attention as Affordance Cues from Demonstrations for Robotic Grasping," 2021 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2021, pp. 7835-7842, doi: 10.1109/IROS51168.2021.9636760. https://ieeexplore.ieee.org/document/9636760

Multi-objective Reinforcement Learning based approach for User-Centric Power Optimization in Smart Home Environments

Published in IEEE International Conference on Smart Data Services (SMDS), 2020

Recommended citation: S. Gupta, S. Bhambri, K. Dhingra, A. B. Buduru and P. Kumaraguru, "Multi-objective Reinforcement Learning based approach for User-Centric Power Optimization in Smart Home Environments," 2020 IEEE International Conference on Smart Data Services (SMDS), 2020, pp. 89-96, doi: 10.1109/SMDS49396.2020.00018. https://ieeexplore.ieee.org/document/9288505

Multiple Resource Management and Burst Time Prediction using Deep Reinforcement Learning

Published in Eighth International Conference on Advances in Computing, Communication and Information Technology CCIT, 2019

Recommended citation: Kumar V, Bhambri S, Shambharkar PG. Multiple Resource Management and Burst Time Prediction using Deep Reinforcement Learning. In: Eighth International Conference on Advances in Computing, Communication and Information Technology CCIT, 2019, pp. 51–58. https://www.seekdl.org/conferences/paper/details/10091.html