rl_for_oal - Reinforcement Learning for Opportunistic Active Learning

Project Goals

Can we learn a policy for opportunistic active learning, that is, one that trades off queries that would help in future interactions against finishing the current interaction quickly?
Can we learn how to choose between predicates in a multilabel setting?

Dependencies

Download and preprocess the Visual Genome dataset using this commit of my preprocessing repository. Instructions are present at dataset_preprocessing/VisualGenome/README.md.

Instructions

Set up some environment variables for convenience

export DATA_DIR=<path to Visual Genome dataset>
export AGENTS_DIR=<path to store trained agents>
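
Every later command relies on these two variables, so it can help to confirm they are set before running anything. The following is a minimal sketch, not part of this repository: the helper name check_env is hypothetical, and it assumes a POSIX-compatible shell.

```shell
# Hypothetical helper (not part of this repository): return non-zero if any
# of the named environment variables is unset or empty.
check_env() {
    for name in "$@"; do
        # Indirectly read the variable whose name is stored in $name.
        eval "value=\${$name:-}"
        if [ -z "$value" ]; then
            echo "error: $name is not set" >&2
            return 1
        fi
    done
    return 0
}

# Create the directory for trained agents only when both variables are set.
if check_env DATA_DIR AGENTS_DIR; then
    mkdir -p "$AGENTS_DIR"
fi
```

If either variable is missing, the helper prints which one and the directory is not created.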

All scripts are in the scripts directory.

cd scripts

Create a baseline static policy and test it

./create_static_policy.sh static $AGENTS_DIR
./test_static_policy.sh static $AGENTS_DIR

Instantiate the desired policy. The script uses the algorithm hyperparameters in the paper. Some other policy classes can be found in src.

./create_learned_policy.sh learned $AGENTS_DIR

Initialize the policy using episodes from the static policy

./init_policy.sh learned static $AGENTS_DIR $DATA_DIR

Train the policy

./train_policy.sh learned $AGENTS_DIR $DATA_DIR

Test the learned policy

./test_policy.sh learned $AGENTS_DIR $DATA_DIR

Summarize results

cd ../
mkdir -p logs
echo $'static\nlearned' > logs/agent_list.txt
cd analysis
python evaluate_bulk.py \
    --agents-path=$AGENTS_DIR \
    --agent-list-file=../logs/agent_list.txt

This writes a summary of some useful results to the logs directory.
