Posts by Collection

portfolio

publications

Multiple Resource Management and Burst Time Prediction using Deep Reinforcement Learning

Published in Eighth International Conference on Advances in Computing, Communication and Information Technology CCIT, 2019

Recommended citation: Kumar V, Bhambri S, Shambharkar PG. Multiple resource management and burst time prediction using deep reinforcement learning. In: Eighth International Conference on advances in computing, communication and information technology CCIT, 2019, pp. 51–58. https://www.seekdl.org/conferences/paper/details/10091.html

Multi-objective Reinforcement Learning based approach for User-Centric Power Optimization in Smart Home Environments

Published in IEEE International Conference on Smart Data Services (SMDS), 2020

Recommended citation: S. Gupta, S. Bhambri, K. Dhingra, A. B. Buduru and P. Kumaraguru, "Multi-objective Reinforcement Learning based approach for User-Centric Power Optimization in Smart Home Environments," 2020 IEEE International Conference on Smart Data Services (SMDS), 2020, pp. 89-96, doi: 10.1109/SMDS49396.2020.00018. https://ieeexplore.ieee.org/document/9288505

Contrastively Learning Visual Attention as Affordance Cues from Demonstrations for Robotic Grasping

Published in IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2021

Recommended citation: Y. Zha, S. Bhambri and L. Guan, "Contrastively Learning Visual Attention as Affordance Cues from Demonstrations for Robotic Grasping," 2021 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2021, pp. 7835-7842, doi: 10.1109/IROS51168.2021.9636760. https://ieeexplore.ieee.org/document/9636760

Using Deception in Markov Game to Understand Adversarial Behaviors through a Capture-The-Flag Environment

Published in Decision and Game Theory for Security: 13th International Conference, GameSec, 2022

Recommended citation: Bhambri, Siddhant, Purv Chauhan, Frederico Araujo, Adam Doupé, and Subbarao Kambhampati. "Using Deception in Markov Game to Understand Adversarial Behaviors Through a Capture-The-Flag Environment." In International Conference on Decision and Game Theory for Security, pp. 87-106. Cham: Springer International Publishing, 2022. https://arxiv.org/pdf/2210.15011

Exploiting Unlabeled Data for Feedback Efficient Human Preference based Reinforcement Learning

Published in The AAAI Workshop on Representation Learning for Responsible Human-Centric AI (R2HCAI), and ICML - Many Facets of Preference Learning Workshop, 2023

Recommended citation: Verma, Mudit, Siddhant Bhambri, and Subbarao Kambhampati. "Exploiting Unlabeled Data for Feedback Efficient Human Preference based Reinforcement Learning." arXiv preprint arXiv:2302.08738 (2023). https://arxiv.org/abs/2302.08738

Preference Proxies: Evaluating Large Language Models in capturing Human Preferences in Human-AI Tasks

Published in ICML - Workshop on Theory of Mind in Communicating Agents, and Many Facets of Preference Learning Workshop, 2023

Recommended citation: Verma, Mudit, Siddhant Bhambri, and Subbarao Kambhampati. "Preference Proxies: Evaluating Large Language Models in capturing Human Preferences in Human-AI Tasks." In ICML 2023 Workshop The Many Facets of Preference-Based Learning. 2023. https://sbhambr1.github.io/files/Preference%20Proxies:%20Evaluating%20Large%20Language%20Models%20in%20capturing%20Human%20Preferences%20in%20Human-AI%20Tasks.pdf

talks

teaching

Reviewer

Conference, IEEE International Conference on Intelligent Robots And Systems (IROS), 2021

Reviewer

Journal, IEEE Transactions on Dependable and Secure Computing (TDSC), 2021