Skip to content
View minqi's full-sized avatar

Organizations

@uclnlp @lucidalabs @ucl-dark @FLAIROx

Block or report minqi

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. facebookresearch/minimax facebookresearch/minimax Public

    Efficient baselines for autocurricula in JAX.

    Python 175 14

  2. facebookresearch/dcd facebookresearch/dcd Public

    Implementations of robust Dual Curriculum Design (DCD) algorithms for unsupervised environment design.

    Python 127 25

  3. facebookresearch/level-replay facebookresearch/level-replay Public archive

    This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the fact that not all levels are equally useful for agents to le…

    Python 84 16

  4. learning-to-communicate-pytorch learning-to-communicate-pytorch Public

    Learning to Communicate with Deep Multi-Agent Reinforcement Learning in PyTorch

    Python 349 80

  5. hnatt hnatt Public

    Train and visualize Hierarchical Attention Networks

    Python 203 35

  6. facebookresearch/minihack facebookresearch/minihack Public

    MiniHack the Planet: A Sandbox for Open-Ended Reinforcement Learning Research

    Python 485 60