
周胜凯 / mindformers

forked from MindSpore / mindformers 
gpt2_13b.yaml 837 Bytes
zyw_hw committed on 2023-03-29 09:52 . yaml update
model:
  model_config:
    type: GPT2Config
    seq_length: 1024
    vocab_size: 50257
    embedding_size: 5120
    num_layers: 40
    num_heads: 40
    expand_ratio: 4
    hidden_act: "fast_gelu"
    dropout_prob: 0.0
    hidden_dropout_prob: 0.1
    attention_probs_dropout_prob: 0.1
    initializer_range: 0.02
    param_init_type: "float16"
    layernorm_dtype: "float32"
    softmax_dtype: "float16"
    compute_dtype: "float16"
    checkpoint_name_or_path: ""
    eos_token: 50256
    repetition_penalty: 1
    max_decode_length: 1024
    top_k: 5
    top_p: 1
    do_sample: True
  arch:
    type: GPT2LMHeadModel
processor:
  return_tensors: ms
  tokenizer:
    unk_token: '<|endoftext|>'
    bos_token: '<|endoftext|>'
    eos_token: '<|endoftext|>'
    pad_token: '<|endoftext|>'
    type: GPT2Tokenizer
  type: GPT2Processor
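
As a sanity check on the "13b" in the file name, the size fields in model_config can be turned into a rough parameter count. The sketch below is only an estimate under stated assumptions: it assumes the standard GPT-2 decoder layout (fused QKV projection, two-layer feed-forward block with biases, two layer norms per block, learned position embeddings, output head tied to the token embeddings). The actual GPT2LMHeadModel in mindformers may differ in small details. The function name `estimate_gpt2_params` is hypothetical and not part of mindformers; the values are copied by hand from the config rather than parsed from the file.

```python
# Size-related values copied from model_config above.
config = {
    "seq_length": 1024,
    "vocab_size": 50257,
    "embedding_size": 5120,
    "num_layers": 40,
    "num_heads": 40,
    "expand_ratio": 4,
}


def estimate_gpt2_params(cfg):
    """Rough parameter count, assuming a standard GPT-2 style decoder layout."""
    h = cfg["embedding_size"]
    ffn = cfg["expand_ratio"] * h
    # Token embeddings plus learned position embeddings
    # (assumption: the LM head shares weights with the token embeddings).
    embeddings = cfg["vocab_size"] * h + cfg["seq_length"] * h
    # Attention: fused QKV projection and output projection, both with biases.
    attention = (3 * h * h + 3 * h) + (h * h + h)
    # Feed-forward: two dense layers with biases.
    mlp = (h * ffn + ffn) + (ffn * h + h)
    # Two layer norms per block, each with a scale and a bias vector.
    layer_norms = 2 * 2 * h
    per_layer = attention + mlp + layer_norms
    final_layer_norm = 2 * h
    return embeddings + cfg["num_layers"] * per_layer + final_layer_norm


total = estimate_gpt2_params(config)
head_dim = config["embedding_size"] // config["num_heads"]
# param_init_type is "float16", so assume 2 bytes per stored weight.
fp16_gib = total * 2 / 2**30

print(f"parameters:   {total / 1e9:.2f}B")
print(f"head dim:     {head_dim}")
print(f"fp16 weights: {fp16_gib:.1f} GiB")
```

Under these assumptions the estimate comes to about 12.85 billion parameters, with 128 dimensions per attention head. The weight-size figure covers only the float16 parameters; optimizer state and activations are not included.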