代码拉取完成,页面将自动刷新
同步操作将从 MindSpore/mindformers 强制同步,此操作会覆盖自 Fork 仓库以来所做的任何修改,且无法恢复!!!
确定后同步将在后台操作,完成时将刷新页面,请耐心等待。
model:
arch:
type: T5ForConditionalGeneration
model_config:
batch_size: 1
d_ff: 2048
d_model: 512
do_sample: false
eos_token_id: 1
has_relative_bias: true
hidden_act: relu
hidden_dropout_prob: 0.1
initializer_factor: 1.0
initializer_range: 0.02
is_encoder_decoder: true
d_kv: 64
layer_norm_epsilon: 1.0e-06
length_penalty_weight: 1.0
max_decode_length: 128
max_length: 32
max_position_embeddings: 1024
num_heads: 8
num_hidden_layers: 6
pad_token_id: 0
relative_attention_num_buckets: 32
repetition_penalty: 1
scale_output: true
seq_length: 1024
top_k: 1
top_p: 0.95
type: T5Config
use_cache: true
vocab_size: 32128
checkpoint_name_or_path: t5_small
processor:
max_length: 77
padding: max_length
return_tensors: ms
tokenizer:
eos_token: </s>
pad_token: <pad>
type: T5Tokenizer
unk_token: <unk>
type: T5Processor
此处可能存在不合适展示的内容,页面不予展示。您可通过相关编辑功能自查并修改。
如您确认内容无涉及 不当用语 / 纯广告导流 / 暴力 / 低俗色情 / 侵权 / 盗版 / 虚假 / 无价值内容或违法国家有关法律法规的内容,可点击提交进行申诉,我们将尽快为您处理。