Research
My research interests include machine learning and deep reinforcement learning (RL). Currently I am working on Safe RL and human in the loop RL, Multi-agent RL.
|
|
Guiding Safe Reinforcement Learning Policies
Using Structured Language Constraints
Bharat Prakash, Nicholas Waytowich2, Ashwinkumar Ganesan, Tim Oates, Tinoosh Mohsenin
Accepted, Safe-RL Workshop AAAI , 2020
A framework to train RL agents conditioned on constraints that are in the form of structured
language, thus reducing effort to design and integrate
specialized rewards into the environment. In our experiments, we show that this method can be used to ground
the language to behaviors and enable the agent to solve
tasks while following the constraints. We also show how
the agent can transfer these skills to other tasks.
|
|
Improving Safety in Reinforcement Learning using Model-Based Architectures
and Human Intervention
Bharat Prakash, Mohit Khatwani, Nicholas Waytowich, Tinoosh Mohsenin
Accepted, FLAIRS, 2019
A hybrid method for reducing the human intervention
time by combining model-based and model-free approaches and training
a supervised blocker to improve sample efficiency
while also ensuring safety. We evaluate these methods
on various grid-world environments using both standard
and visual representations and show that our approach
achieves better performance in terms of sample
efficiency, number of catastrophic states reached as
well as overall task performance compared to traditional
model-free approaches.
|
|
Representation learning by solving auxiliary tasks on Xray images
Bharat Prakash
Learning image representations on unannotated Chest Xray images using the method described in Noroozi
and Favaro to gain improvements in classification tasks. Here we they use the pretext task of solving
jigsaw puzzles to pre-train the convolutional neural network. Chest x-ray images from the x-ray14 database were used.
|
|
Improving Grammatical Error Correction Using Multi-Task Learning
Bharat Prakash, Ashwinkumar Ganesan, Sarthak Mehta, John Cellozi, Frank Ferraro
Poster Presentation at MASC-SLL-2018
We propose a multitask learning (MTL) approach to design a grammatical error correction system that uses minimal parallel text for training by adding auxiliary tasks that aid the main GEC training objective. The proposed experiments include adding part of speech tagging and language modeling as these additional auxiliary tasks.
|
|
Colorizing monochrome photographs
Bharat Prakash, Harsh Shrivastava, Deepanjan
Using deep neural networks to colorize monochrome photos. Course project, Data Science.
|
|
University FAQ ChatBot
Bharat Prakash, Abhay Kashyap
Facebook chatbot to answer basic questions about the university. Natural language understanding, NER,
information retrieval.
|
|
IVR Payment
Bharat Prakash, Abhay Kashyap
Developed an IVR system to process payments using PayPal APIs, Asterisk on Raspberry Pi. Internal
hackathon at PayPal.
|
|