-
Notifications
You must be signed in to change notification settings - Fork 6
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
TEM branch merger #120
Open
LukeHollingsworth
wants to merge
80
commits into
main
Choose a base branch
from
whittington_2020
base: main
Could not load branches
Branch not found: {{ refName }}
Loading
Could not load tags
Nothing to show
Loading
Are you sure you want to change the base?
Some commits from the old base branch may be removed from the timeline,
and old review comments may become outdated.
Open
TEM branch merger #120
Changes from all commits
Commits
Show all changes
80 commits
Select commit
Hold shift + click to select a range
004bff2
Add options for Windows and Unix OS in README
LukeHollingsworth 86d403b
Merge branch 'main' of https://github.com/SainsburyWellcomeCentre/Neu…
LukeHollingsworth f84c0b3
Merge branch 'main' of https://github.com/SainsburyWellcomeCentre/Neu…
LukeHollingsworth 29b678c
adding experimental runs to TEM
LukeHollingsworth e36c848
batch environment example working with Simple2D
29f1dfb
default argument of BatchEnvironment() set to DiscreteObjectEnvironme…
f634212
default argument of BatchEnvironment() set to DiscreteObjectEnvironme…
ad5cab6
merge main
ClementineDomine 2b0f9d8
Update README.md - Centered logo
JarvisDevon 034e685
debugging state density plot
60abefa
Merge branch 'whittington_2020' of https://github.com/SainsburyWellco…
68422a1
pre-commit changes
55d740e
change TEM imports to not require torch install
LukeHollingsworth 2f78dee
note on installing dependencies on zsh shell
LukeHollingsworth 3fae384
merged main into whittington_2020
LukeHollingsworth a60d64b
introduce logging of training accuracies
LukeHollingsworth ff12f40
pre-commit changes
LukeHollingsworth 5c4fd53
added comments to TEM run file
LukeHollingsworth 82a34d7
merge from main
48058c3
batch trajectories and grids plotted
4cd1f7a
Simple2D & DiscreteObject examples added for BatchEnvironment
a8b07cf
attempting to fix large file problem
ccc584a
running TEM tests
f67b4c2
slurm updated
978f001
slurm updated
348a161
slurm change
aebb6fb
huge 50K run added
c5762df
huge 50K run added
723c16d
state density and history bugs sorted
0aab239
TEM state density bugs fixed
1672921
big high density run added
7128313
small TEM run
eab0cdf
state density mismatch fixed
36f5da1
small training run (without width 2) added
18c4abb
medium size run added
5bc718b
problem with state assignment fixed
ccb394e
reduced slurm memory pool
94b8ac8
reduced slurm memory pool
7de0832
updated test
5d33231
pre-commit run on all files
0d277b1
is the cluster broken or is it just me?
74990a6
trying cpu slurm
ca1c310
trying cpu slurm
a03ada0
trying cpu slurm
92de616
looped walks added
0e93183
looping walk
a681c6b
cpu slurm added
5b22e32
cpu slurm added
78cb5bb
trying to fix slurm bug
2c01eac
big memory run with longer walks
078d41f
new training config
ab297dc
formatted
cc0ad77
full var walks added
709fc82
trailing whitespace
0ead088
full length training
1fe31ab
recent TEM updates
2a95433
minor update
d2bdd14
test push
853d185
black precommit changes
4afe24c
precommit black
d423f89
pre-merge
3351dd8
premerge to main
d2cb6c3
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] 3921cf6
Merge remote-tracking branch 'origin/main' into whittington_2020
a6911d9
Merge branch 'whittington_2020' of https://github.com/SainsburyWellco…
2993e06
starting the cleaning process
62df4d6
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] bba6902
fixed arena examples bug
ab59665
Merge branch 'whittington_2020' of https://github.com/SainsburyWellco…
8b062e7
retrigger checks
rodrigcd 774a67f
SimpleDiscreteAgent added
3b5c1ac
Merge branch 'whittington_2020' of https://github.com/SainsburyWellco…
67b9c53
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] 37cdc5a
fixing obs hist length
rodrigcd 4d15210
disabling default plt show and fixing bugs on tests
rodrigcd 6bd3651
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] c00cc09
removing tem logs from test
rodrigcd 9a4fe58
fixing live render code for matplotlib 3.10
rodrigcd d872e91
limiting to python>=3.10
rodrigcd fad1ccd
limiting to python>=3.10
rodrigcd File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
File renamed without changes.
Large diffs are not rendered by default.
Oops, something went wrong.
Large diffs are not rendered by default.
Oops, something went wrong.
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Large diffs are not rendered by default.
Oops, something went wrong.
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,113 @@ | ||
import numpy as np | ||
|
||
from .agent_core import AgentCore | ||
|
||
|
||
class SimpleDiscreteAgent(AgentCore): | ||
""" | ||
A simplified single-environment discrete agent, loosely mirroring TEM’s | ||
approach to picking actions and checking whether the environment | ||
actually moved. | ||
""" | ||
|
||
def __init__( | ||
self, | ||
agent_name: str = "SimpleDiscreteAgent", | ||
**model_kwargs, | ||
): | ||
""" | ||
Parameters | ||
---------- | ||
room_width : int | ||
Width (in discrete states) of the environment | ||
room_depth : int | ||
Depth (in discrete states) of the environment | ||
state_density : float | ||
Number of discrete states per unit distance (usually 1 / step_size) | ||
agent_name : str | ||
Agent's name | ||
""" | ||
super().__init__(agent_name=agent_name) | ||
self.room_width = model_kwargs["room_width"] | ||
self.room_depth = model_kwargs["room_depth"] | ||
self.state_density = model_kwargs["state_density"] | ||
# Discrete actions: stay, up, down, right, left | ||
self.poss_actions = [[0, 0], [0, 1], [0, -1], [1, 0], [-1, 0]] | ||
|
||
# For storing trajectory | ||
self.walk_actions = [] | ||
self.obs_history = [] | ||
|
||
# Keep track of previous observation/action so we know if the environment actually moved. | ||
self.prev_observation = None | ||
self.prev_action = [0, 0] | ||
self.n_walk = 0 | ||
|
||
def reset(self): | ||
""" | ||
Reset the agent’s history and counters. | ||
""" | ||
super().reset() | ||
self.walk_actions = [] | ||
self.obs_history = [] | ||
self.prev_observation = None | ||
self.prev_action = [0, 0] | ||
self.n_walk = 0 | ||
|
||
def act(self, observation, policy_func=None): | ||
""" | ||
Decide on the next action. If the environment did not change state | ||
(i.e. we got the same position as before, and we tried to move), | ||
then pick a new random action. Otherwise, record the old observation and action. | ||
|
||
Parameters | ||
---------- | ||
observation : list or np.ndarray | ||
Typically [state_index, object_info, (x,y)] for a discrete environment. | ||
The first element (observation[0]) is the discrete state index. | ||
|
||
policy_func : callable, optional | ||
Unused here. Included only for compatibility. | ||
|
||
Returns | ||
------- | ||
action : list | ||
Chosen discrete action [dx, dy] | ||
""" | ||
# If this is our first time calling act, initialise | ||
if self.prev_observation is None: | ||
self.prev_observation = observation | ||
self.prev_action = self.action_policy() | ||
return self.prev_action | ||
|
||
# Check if environment actually moved to a new state | ||
curr_state_idx = observation[0] | ||
prev_state_idx = self.prev_observation[0] | ||
|
||
if curr_state_idx == prev_state_idx and self.prev_action != [0, 0]: | ||
# The environment didn't move from last action, so pick a new random action | ||
new_action = self.action_policy() | ||
else: | ||
# The environment did move, so record old obs/action before picking the next action | ||
self.walk_actions.append(self.prev_action) | ||
self.obs_history.append(self.prev_observation) | ||
self.n_walk += 1 | ||
new_action = self.action_policy() | ||
|
||
self.prev_observation = observation | ||
self.prev_action = new_action | ||
return new_action | ||
|
||
def action_policy(self): | ||
""" | ||
Random action policy that selects an action from [stay, up, down, right, left]. | ||
""" | ||
idx = np.random.choice(len(self.poss_actions)) | ||
return self.poss_actions[idx] | ||
|
||
def update(self): | ||
""" | ||
Update the agent's internal state after a walk is completed. | ||
""" | ||
self.n_walk = 0 | ||
return None |
Oops, something went wrong.
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Is this really necessary
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think so. I had some problems early on, cloning the dev version of NeuralPlayground on both Windows and Mac OS. If this has been fixed, then this is redundant and I'll change it.