Issues: hpcaitech/ColossalAI
Issues list
[FEATURE]: Lora/QLora in GeminiPlugin and TorchFSDP
enhancement
New feature or request
#6138
opened Nov 16, 2024 by
ericxsun
[FEATURE]: support google/gemma-2-2b for Tensor Parallelism
enhancement
New feature or request
#6120
opened Nov 9, 2024 by
jing-4369
[BUG]: ColossalAI Inference example response empty result without error
bug
Something isn't working
#6112
opened Nov 4, 2024 by
GuangyaoZhang
1 task done
[BUG]: why duplicate PID appears on rank 0
bug
Something isn't working
#6111
opened Nov 3, 2024 by
ericxsun
1 task done
[BUG]: Llama3.1 HybridParallelPlugin train failed when pp_size>1
bug
Something isn't working
#6110
opened Nov 2, 2024 by
cingtiye
1 task done
[PROPOSAL]: FP8 with block-wise amax
enhancement
New feature or request
#6105
opened Oct 28, 2024 by
Edenzzzz
1 task
[FEATURE]: Windows wheel needed
enhancement
New feature or request
#6103
opened Oct 27, 2024 by
nitinmukesh
[BUG]: weird stuck while training
bug
Something isn't working
#6095
opened Oct 19, 2024 by
ericxsun
1 task done
[BUG]: Got nan during backward with zero2
bug
Something isn't working
#6091
opened Oct 16, 2024 by
flymin
1 task done
[BUG]: Unable to train on H20 machine
bug
Something isn't working
#6079
opened Oct 6, 2024 by
kaixinbear
1 task done
[DOC]: Environment installation failed
documentation
Improvements or additions to documentation
#6066
opened Sep 21, 2024 by
eccct
[FEATURE]: Is it Possible to integrate Liger-Kernel?
enhancement
New feature or request
#6047
opened Sep 6, 2024 by
ericxsun
[BUG]: remove .github/workflows/submodule.yml
bug
Something isn't working
#6039
opened Aug 28, 2024 by
BoxiangW
1 task done
[FEATURE]: Support Zerobubble pipeline
enhancement
New feature or request
#6037
opened Aug 28, 2024 by
duanjunwen
[BUG]: error Colossalai 0.4.0/0.4.2 /usr/bin/supervisord
bug
Something isn't working
#6032
opened Aug 23, 2024 by
Storm0921
1 task done
[BUG]: AttributeError: 'GeminiDDP' object has no attribute 'module'
bug
Something isn't working
#6021
opened Aug 20, 2024 by
dheerj188
1 task done
[BUG]: Torch compile causes multi-process to hang with python 3.9
bug
Something isn't working
#5987
opened Aug 10, 2024 by
Edenzzzz
1 task done
[FEATURE]: How to skip a custom node from generating strategies in colossal-auto?
enhancement
New feature or request
#5983
opened Aug 8, 2024 by
robotsp
[BUG]: Pytest with a specific config failed after PR #5868
bug
Something isn't working
shardformer
#5949
opened Jul 29, 2024 by
GuangyaoZhang
1 task done
[FEATURE]: Request updates for pretraining roberta
enhancement
New feature or request
#5948
opened Jul 29, 2024 by
jiahuanluo
[BUG]: pip install . error: identifier "__hsub" is undefined
bug
Something isn't working
#5929
opened Jul 19, 2024 by
jtmer
1 task done
[BUG]: Shardformer FP8 communication training accuracy degradation
bug
Something isn't working
#5920
opened Jul 18, 2024 by
GuangyaoZhang
1 task done
[BUG]: Low_Level_Zero plugin crashes with LoRA
bug
Something isn't working
#5909
opened Jul 15, 2024 by
Fallqs
1 task done