Location
Badges
Activity
Ratings Progression
Challenge Categories
Challenges Entered
Measure sample efficiency and generalization in reinforcement learning using procedurally generated environments
Latest submissions
See Allgraded | 94646 | ||
failed | 94180 | ||
graded | 93850 |
Self-driving RL on DeepRacer cars - From simulation to real world
Latest submissions
Sample-efficient reinforcement learning in Minecraft
Latest submissions
Multi-agent RL in game environment. Train your Derklings, creatures with a neural network brain, to fight for you!
Latest submissions
Sample-efficient reinforcement learning in Minecraft
Latest submissions
Multi Agent Reinforcement Learning on Trains.
Latest submissions
Robots that learn to interact with the environment autonomously
Latest submissions
Participant | Rating |
---|
Participant | Rating |
---|
NeurIPS 2020: MineRL Competition
Is there any team that still has a seat?
About 4 years agoHi everyone.
I just finished my competition on Procgen and ranked 3rd place in round 2. Iβd like to participate in MineRL too but I donβt have many resources for this competition β I get one GPU and $300+ AWS credit. Also, I think working with people would be a lot more interesting than working alone! Therefore, I want to ask and see if any team still open to new members.
I wish to join a team thatβs active and eager to improve its agent and I promise that Iβll do my best in the rest time of the competition.
NeurIPS 2020: Procgen Competition
Is it a preknowledge that we should select one of existing submissions for the final evaluation?
About 4 years agoThe final evaluation evaluates generalization, but I did not use any regularizations such as batch normalization and data augmentations in my previous submissions. Also, in my latest few submissions, I chose to experiment with a newly introduced hyperparameter instead of using the one that performed well on my local machine.
Is it a preknowledge that we should select one of existing submissions for the final evaluation?
About 4 years agoHi @vrv
Thank you for the response. Yeah, I know that was my bad after reviewing thoroughly the overview page and the answer I linked before. However, we did not always follow these, right? For example, we used 6 public and 4 private test environments in the second round instead of 4 and 1 described on the overview page. Also, this answer said we got to pick 3 submissions but at the end of the day, we only pick one.
Maybe I should ask this before instead of wishfully thinking a new submission is viable. At this point, I donβt how which submission I should use as I said before, none of them were made for the final evaluation.
The purpose I posted this was to see if there was someone else facing a similar situation. If Iβm the only one, Iβll accept it.
Although some of the above words may seem like complaining, I am not meant to. Iβve learned a lot during the competition and received a lot of help from you guys. Thank you all.
Is it a preknowledge that we should select one of existing submissions for the final evaluation?
About 4 years agoHi everyone,
Iβm wondering if Iβm the only one that learns we should select one of the previous submissions for the final evaluation. I cannot find any official statement about this and the only clue I can find now is this answer, which Iβve previously read but not paid much attention to the word βexistingβ. That was a mistake of mine but I humbly donβt think such an answer in the forum could be counted as a formal statement.
Itβs really frustrating to learn this at this point as none of my previous solutions was prepared for the final evaluations. I thought the challenge was to find a good solution but in the end, I found myself trapped in some word game. I am not meant to complain as I definitely should be responsible for the above mistake. However, if anyone feels the same way, please say something. Maybe, together we can make the game more interesting.
Is it possible to run the evaluation on all environments in the final week of round 2?
About 4 years agoIn the final week of round 2, is it possible to run the evaluation on all environments? To reduce the computation cost, maybe we can reduce the submission quota a bit.
Suggestion to switch from spot to on-demand
About 4 years ago(post withdrawn by author, will be automatically deleted in 24 hours unless flagged)
Number of environments in round 2
About 4 years agoI am humbly against Feiyang. Not everyone gets that amount of computation resources to try out their ideas. A decent amount of daily limits on submissions are helpful.
On the other hand, I agree with @jurgisp and @quang_tran that we should relax the 2-hour limit. I think this makes the competition bias to on-policy algorithms. On-policy algorithms can take advantage of large batches and therefore use less training iterations. But off-policy algorithms usually work with a much smaller batch size and require more training iterations and more time to train.
Round 2 is open for submissions π
About 4 years agoHi @jyotish
Whatβs the use of the blind
reward?
Running the evaluation worker during evaluations is now optional
Over 4 years agoThank you @jyotish , I see now.
Running the evaluation worker during evaluations is now optional
Over 4 years ago(post withdrawn by author, will be automatically deleted in 24 hours unless flagged)
Running the evaluation worker during evaluations is now optional
Over 4 years agoHi @jyotish
How should I change train.py
to disable the evaluation worker locally?
Inform the agent a new episode starts
Over 4 years agoIs there a way to inform the agent that a new episode starts when defining trainer
using build_trainer
?
TF2 is by default not enabled?
Over 4 years agoI see. But why should it be restricted to TF1.x even though Iβve set framework=tfe
in the yaml
file?
TF2 is by default not enabled?
Over 4 years agoAdding the following code to this line of train.py
from tensorflow.python import tf2
print(tf2.enabled())
assert False
it prints False
. Is there any way to enable tf2?
About Ray trainer and workers
Over 4 years agoThis file may contain something what youβre looking for
Rllib custom env
Over 4 years agoHi @jyotish
Could you please answer my above question?
If we use frameskip, does the framework count the number of frames in the right way?
For example, if we useframe_skip=2
, the number of interactions between the agent and environment is8e6/2=4e6
when using only 8M frames. If we use the standard configuration which settimesteps_total=8000000
, will this stop correctly?
Rllib custom env
Over 4 years agoIf we use frameskip, does the framework count the number of frames in the right way?
For example, if we use frame_skip=2
, the number of interactions between the agent and environment is 8e6/2=4e6
when using only 8M frames. If we use the standard configuration which set timesteps_total=8000000
, will this stop correctly?
Rllib custom env
Over 4 years ago(post withdrawn by author, will be automatically deleted in 24 hours unless flagged)
Unusually large tensor in the starter code
Over 4 years agoThank you so much and sorry for the late response.
Error when downloading dataset
About 4 years agoHello,
Iβm trying to download minerl dataset, but I keep receiving errors like the following one
Is there another way to download the dataset? BTW, Iβm in China.