Location
Badges
Activity
Ratings Progression
Challenge Categories
Challenges Entered
Sample-efficient reinforcement learning in Minecraft
Latest submissions
See Allgraded | 25843 | ||
graded | 25842 | ||
graded | 25841 |
Participant | Rating |
---|
Participant | Rating |
---|
NeurIPS 2019 : MineRL Competition
When will the results of round 1 be announced?
About 5 years agoHi @BrandonHoughton, I have seen the notice. But I was confused that it tested my repo two weeks ago which is not my latest version.
The evaluation result does not match my local testing
About 5 years agoThanks a lot! But the new maximum number of submissions is 25. That is too costly β¦
The evaluation result does not match my local testing
About 5 years agoI have a same problem.
Evaluation successful but only 1 episode with 4 steps.
But I used pytorch, and I donβt know what really happened.
Train a metacontroller based on Expert Data which is divided into several stages manually by reward
About 5 years agoCan I train a metacontroller based on Expert Data which is divided into several stages manually by reward, or does this count as hard coding?
Eg. stage 1 is the time when it has no reward .
How to explain this rule
Over 5 years agoHow to explain βA manually specified policy may not be used as a component of this model.β in the challenge rules ?
Can I define my own actionsοΌsuch as define forward 3 frames continuously as βmy fowardβ (just like skip frame), βcraft stick from logβ (at first frame craft planks, then craft stick) .
Can the agent get reward repeatedly?
Over 5 years agoMy agent run in βObtainDiamondβ, and it is rewarded repeatedly
It seems that the environment has some bug about it
Can the agent get reward repeatedly?
Over 5 years agoDid he talk about this evaluation_locally.sh?
I canβt find anything about ObtainDiamond or ObtainDiamondDense.
By the way, in βMinerlObtainDiamond-v0β environment agent is rewarded repeatedly on my computer. Is it a bug? How to fix it β¦
This is evaluation_locally.sh.
#!/bin/bash
set -e
AICROWD_DATA_ENABLED="YES"
if [[ " $@ " =~ " --no-data " ]]; then
AICROWD_DATA_ENABLED="NO"
else
python3 ./utility/verify_or_download_data.py
fi
EXTRAOUTPUT=" > /dev/null 2>&1 "
if [[ " $@ " =~ " --verbose " ]]; then
EXTRAOUTPUT=""
fi
# Run local name server
eval "pyro4-ns $EXTRAOUTPUT &"
trap "kill -11 $! > /dev/null 2>&1;" EXIT
# Run instance manager to generate performance report
export EVALUATION_STAGE='manager'
eval "python3 run.py --seeds 1 $EXTRAOUTPUT &"
trap "kill -11 $! > /dev/null 2>&1;" EXIT
# Run the evaluation
sleep 2
export MINERL_INSTANCE_MANAGER_REMOTE="1"
export EVALUATION_STAGE='testing'
export EVALUATION_RUNNING_ON='local'
export EXITED_SIGNAL_PATH='shared/exited'
rm -f $EXITED_SIGNAL_PATH
export ENABLE_AICROWD_JSON_OUTPUT='False'
eval "python3 run.py $EXTRAOUTPUT && touch $EXITED_SIGNAL_PATH || touch $EXITED_SIGNAL_PATH &"
trap "kill -11 $! > /dev/null 2>&1;" EXIT
# View the evaluation state
export ENABLE_AICROWD_JSON_OUTPUT='True'
python3 utility/parser.py || true
kill $(jobs -p)
Can the agent get reward repeatedly?
Over 5 years agoIn βMinerlObtainDiamond-v0β environment.
When agent get one stick, it has reward 4.
But if agent get two stick, can it has reward 8 ?
It seems that the current evalutation environment will give repeat reward.
Why evaluation failed
About 5 years agoEvaluation failed.
The aicrowd-bot did not provide the Error log.