Skip to content

Pull requests: PaddlePaddle/PaddleFormers

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

fix opt offload
#2955 opened Nov 14, 2025 by waliwali777 Loading…
2 tasks
disable qwen3 ci
#2954 opened Nov 14, 2025 by zjjlivein Loading…
2 tasks
fix dmodel download proxy
#2952 opened Nov 14, 2025 by Jonathans575 Loading…
2 tasks
fix cli batch_size contributor
#2951 opened Nov 14, 2025 by llbdyiu66 Loading…
Fix FC lora resume loss
#2950 opened Nov 14, 2025 by changeyoung98 Loading…
2 tasks
Chore/2892 test
#2948 opened Nov 14, 2025 by huanghengheng Loading…
2 tasks
[DOC] Fix pt dataset doc
#2946 opened Nov 14, 2025 by LittleHeroZZZX Loading…
2 tasks
Rope reproduction contributor
#2945 opened Nov 14, 2025 by cjw-d Loading…
1 of 2 tasks
[AutoParallel] fix trainer offload opt params bug
#2944 opened Nov 13, 2025 by waliwali777 Loading…
2 tasks
fix codecov without base report
#2942 opened Nov 13, 2025 by zjjlivein Loading…
2 tasks
fix:is_causal bug contributor
#2932 opened Nov 12, 2025 by w-yyh Loading…
add cp for GLM
#2922 opened Nov 12, 2025 by Wennie396 Loading…
2 tasks
[pp] change the tuple data stream to dict
#2921 opened Nov 12, 2025 by miao200years Loading…
2 tasks
[AutoParallel] Refactor qwen3 model in intermediate api
#2914 opened Nov 11, 2025 by waliwali777 Loading…
2 tasks
[AutoParallel] Refactor qwen2 model in intermediate api
#2912 opened Nov 11, 2025 by waliwali777 Loading…
2 tasks
update ernie4.5 best yaml contributor
#2910 opened Nov 10, 2025 by llbdyiu66 Loading…
Add global param to refined_recompute
#2899 opened Nov 10, 2025 by DongBaiYue Loading…
Optimize parallel matmul
#2897 opened Nov 10, 2025 by waliwali777 Loading…
2 tasks
fix ernie4.5 modeling bug contributor
#2884 opened Nov 7, 2025 by llbdyiu66 Loading…
[CI/CE] Add Qwen3MoE CI Config
#2876 opened Nov 6, 2025 by hushenwei2000 Loading…
Fix the non-convergence in DSV3 post-pretrain
#2871 opened Nov 6, 2025 by chen2016013 Loading…
2 tasks
fix dtype fp8 byte_size error contributor
#2861 opened Nov 5, 2025 by llbdyiu66 Loading…
[AutoParallel] Refactor llama3.1 model in intermediate api
#2859 opened Nov 5, 2025 by waliwali777 Loading…
2 tasks
add position ids in sft
#2851 opened Nov 5, 2025 by Jonathans575 Loading…
2 tasks
ProTip! Exclude everything labeled bug with -label:bug.