yuhui
|
979a633740
|
!869 Qwen-72B推理评估
Merge pull request !869 from yuhui/modellink
|
2024-03-07 06:16:34 +00:00 |
zhangbin
|
c5645c243a
|
!828 intern_7B修改readme
Merge pull request !828 from zhangbin/modellink
|
2024-03-05 12:42:50 +00:00 |
huangyiming
|
d87279867c
|
!832 修改bloom 176b训练脚本和readme
Merge pull request !832 from huangyiming/modellink
|
2024-03-05 06:43:04 +00:00 |
yuhui
|
90769b814b
|
!843 Qwen-14B推理评估
Merge pull request !843 from yuhui/modellink
|
2024-03-05 03:13:58 +00:00 |
王晶
|
f6b12f4631
|
!760 支持Mixtral 8x7B MOE模型
Merge pull request !760 from 王晶/modellink
|
2024-03-04 10:38:57 +00:00 |
yuhui
|
445aab7c2f
|
!839 Qwen-7B推理评估
Merge pull request !839 from yuhui/modellink
|
2024-03-02 01:34:12 +00:00 |
yuhui
|
91aa42d7af
|
!776 增加Qwen-14B模型训练
Merge pull request !776 from yuhui/modellink
|
2024-02-29 01:18:01 +00:00 |
liuyanghan
|
c083867e13
|
!693 【modellink】去除readme中加速算法相关的描述 && 删除loss曲线
Merge pull request !693 from liuyanghan/modellink
|
2024-02-28 02:19:39 +00:00 |
xiongliangcheng
|
1360d372ce
|
!714 添加baichuan1/2-7B推理、评估以及baichuan13B的lora微调
Merge pull request !714 from xiongliangcheng/modellink
|
2024-02-26 07:50:15 +00:00 |
yuhui
|
b45a4eb940
|
!678 增加Qwen-7B模型训练
Merge pull request !678 from yuhui/modellink
|
2024-02-22 12:16:06 +00:00 |
yaojia2021
|
ac5c626886
|
!732 add aquila related code and README
Merge pull request !732 from yaojia2021/modellink
|
2024-02-22 07:20:37 +00:00 |
iansheng
|
2005bc456d
|
!709 llama2 lora微调和推理
Merge pull request !709 from iansheng/modellink
|
2024-02-21 08:06:42 +00:00 |
zhangbin
|
f7e3c10101
|
!640 增加书生7B,65B
Merge pull request !640 from zhangbin/modellink
|
2024-02-20 03:35:34 +00:00 |
xiongliangcheng
|
4ba39f3bd1
|
!531 添加alibi编码适配代码与baichuan13B、baichuan2-13B精度性能README
Merge pull request !531 from xiongliangcheng/modellink
|
2024-02-19 12:31:34 +00:00 |
wwzhuo
|
0a61f7f3cc
|
!685 提交llama2-70B README及训练、推理、评估参数脚本
Merge pull request !685 from wwzhuo/modellink
|
2024-02-08 09:15:20 +00:00 |
iansheng
|
2b9d06e18f
|
!606 llama2-34B 训练、推理、评估
Merge pull request !606 from iansheng/modellink
|
2024-02-07 07:47:32 +00:00 |
huangyiming
|
6f84d081de
|
!646 bloom 7b 预训练,推理、评测脚本合入
Merge pull request !646 from huangyiming/modellink
|
2024-02-07 01:13:53 +00:00 |
iansheng
|
de4c7f7163
|
!572 llama-33B 训练、推理、评估
Merge pull request !572 from iansheng/modellink
|
2024-02-02 06:25:40 +00:00 |
wwzhuo
|
1744e24ca2
|
!565 llama-7B/13B/65B 训练、推理、评估
Merge pull request !565 from wwzhuo/modellink
|
2024-02-02 06:13:02 +00:00 |
liuyanghan
|
c0e684cb24
|
!563 替换modellink仓中的ascendspeed相关标志
Merge pull request !563 from liuyanghan/modellink
|
2024-01-29 13:14:34 +00:00 |
xiongliangcheng
|
2a9af85917
|
!449 增加baichuan7B/baichuan2-7B adaptor
Merge pull request !449 from xiongliangcheng/modellink
|
2024-01-26 11:52:49 +00:00 |
iansheng
|
774def76d7
|
!454 llama2-13B 训练、推理、评估
Merge pull request !454 from iansheng/modellink
|
2024-01-23 12:42:04 +00:00 |
i-robot
|
1d3f003a7e
|
!412 llama2_7b代码提交
Merge pull request !412 from yangcheng/modellink
|
2024-01-05 02:32:20 +00:00 |
yangcheng
|
96b56395e4
|
llama2_7b
|
2024-01-05 10:27:02 +08:00 |
RyanAlexander
|
8247e4de40
|
!401 支持modellink llama2模型推理
* feat: 支持Llama2和Llama推理
|
2024-01-02 12:30:29 +00:00 |
lizekai
|
8021b110d3
|
internlm infer & eval update
|
2023-12-19 14:53:01 +08:00 |
i-robot
|
5716c4727b
|
!373 Llama-65B训练优化并新增推理/评估
Merge pull request !373 from 丁子叉/master
|
2023-12-14 03:05:21 +00:00 |
dingzicha
|
a41b1e0840
|
llama-65B训练优化并新增推理/评估脚本
|
2023-12-14 10:57:04 +08:00 |
g00841271
|
85b38dc29a
|
add Baichuan evaluation script and readme
|
2023-12-13 16:24:34 +08:00 |
y00546703
|
7ebf7f801e
|
add generate, eval scripts, add to pretrain script, modify README accordingly. also change LlamaTokenizer to AutoTokenizer for tasks\evaluation\evaluation_llama.py to make it more general to all models.
|
2023-12-08 14:33:05 +08:00 |
i-robot
|
edf328cbfd
|
!360 更新百川readme与参数信息
Merge pull request !360 from guoxinjie/master
|
2023-12-07 09:13:59 +00:00 |
g00841271
|
3c0888c634
|
update Baichuan README and add downstream tasks
|
2023-12-07 16:33:46 +08:00 |
chantcalf
|
ca524ded82
|
增加Llama2-34B 启动脚本和readme
|
2023-12-06 17:17:44 +08:00 |
i-robot
|
7ef0d52d84
|
!348 修改llama-7B/13B README及参数脚本
Merge pull request !348 from stacey/master
|
2023-12-05 01:19:37 +00:00 |
shengjiayi@huawei.com
|
620dd31e4f
|
llama33B适配更新
|
2023-12-04 19:06:47 +08:00 |
i-robot
|
96f103fcda
|
!352 Aquila7B模型精度达标,性能达标,已修改AscendSpeed的master README增加Aquila相关信息
Merge pull request !352 from yaojia2021/master
|
2023-12-04 06:57:38 +00:00 |
19952409173
|
476131b265
|
modify llama-7B/13B readme for Q4
modify llama-7B/13B readme for Q4
modify llama-7B/13B readme for Q4
modify llama-7B/13B readme for Q4
modify llama-7B/13B readme for Q4
|
2023-12-04 11:29:28 +08:00 |
y00546703
|
3ba91a9d58
|
add aquila-7B model, modified AscendSpeed master README accordingly.
|
2023-12-04 09:59:20 +08:00 |
lizekai
|
99af579fe5
|
FineTune Internlm
|
2023-12-02 18:59:14 +08:00 |
liuyanghan
|
fe1c7d7869
|
书生65B模型合入
|
2023-12-02 10:39:47 +08:00 |
matrixssy
|
eb8973ce0b
|
llama2-7b全流程脚本及readme
|
2023-11-24 16:52:16 +08:00 |
ningbenzhe1
|
f801b32e15
|
llama2 uploads the training inference script and the README.
|
2023-11-24 12:15:46 +08:00 |
19952409173
|
f36b2414dc
|
更新llama-7B、llama-13B README.md;新增llama-13B 16p参数脚本
更新llama-7B、llama-13B README.md;新增llama-13B 16p参数脚本
更新llama-7B、llama-13B README.md;新增llama-13B 16p参数脚本
更新llama-7B、llama-13B README.md;新增llama-13B 16p参数脚本
|
2023-11-23 20:40:36 +08:00 |
l00619700
|
0211eea70d
|
Cadam Optimizer
|
2023-11-14 22:14:39 +08:00 |
chantcalf
|
eb38c34a8c
|
LLama2-70B 添加readme
|
2023-11-13 16:13:29 +08:00 |
fengliangjun
|
8801418472
|
add chinese
|
2023-11-11 15:49:16 +08:00 |
i-robot
|
d6e6722f6a
|
!227 add llama-33b script
Merge pull request !227 from chenzhihong/llama-33b
|
2023-10-30 07:19:58 +00:00 |
huangyiming
|
a8288efa15
|
1. 更新模型评估readme文件
2. 删除多余的图片
|
2023-10-27 11:06:08 +08:00 |
Zhihong Chen
|
8e6a9c0574
|
add llama-33b script and readme update
|
2023-10-26 15:38:32 +08:00 |
matrixssy
|
525a1d23ae
|
(部分代码)1.在线推理框架整合\n2.新增beam search等推理方式3.Readme更新4.alibi兼容性修复
|
2023-10-23 14:58:13 +08:00 |