镜像
mindformers_v1.1.0-mindspore_2.3.0rc2-cann_8.0rc1-py_3.9-ubuntu_18.04-aarch64-d910
运行
cd mindformers/research
bash ../scripts/msrun_launcher.sh
"llama3/run_finetune.py
--config llama3/run_llama3_8b_8k_800T_A2_64G.yaml
--load_checkpoint /home/ma-user/Models/Llama3-8B-Chinese-Chat.ckpt
--auto_trans_ckpt True
--use_parallel True
--run_mode finetune
--train_data /home/ma-user/Projects/llama3/data"
报错如下
No parameter is entered. Notice that the program will run on default 8 cards.
../scripts/msrun_launcher.sh: line 119: ulimit: max user processes: cannot modify limit: Operation not permitted
Running Command: msrun --worker_num=8 --local_worker_num=8 --master_port=8118 --log_dir=output/msrun_log --join=False --cluster_time_out=600 llama3/run_finetune.py --config llama3/run_llama3_8b_8k_800T_A2_64G.yaml --load_checkpoint /home/ma-user/Models/Llama3-8B-Chinese-Chat.ckpt --auto_trans_ckpt True --use_parallel True --run_mode finetune --train_data /home/ma-user/Projects/llama3/data
Please check log files in output/msrun_log
/home/ma-user/miniconda3/envs/llama/lib/python3.9/site-packages/numpy/core/getlimits.py:549: UserWarning: The value of the smallest subnormal for <class 'numpy.float64'> type is zero.
setattr(self, word, getattr(machar, word).flat[0])
/home/ma-user/miniconda3/envs/llama/lib/python3.9/site-packages/numpy/core/getlimits.py:89: UserWarning: The value of the smallest subnormal for <class 'numpy.float64'> type is zero.
return self._float_to_str(self.smallest_subnormal)
/home/ma-user/miniconda3/envs/llama/lib/python3.9/site-packages/numpy/core/getlimits.py:549: UserWarning: The value of the smallest subnormal for <class 'numpy.float32'> type is zero.
setattr(self, word, getattr(machar, word).flat[0])
/home/ma-user/miniconda3/envs/llama/lib/python3.9/site-packages/numpy/core/getlimits.py:89: UserWarning: The value of the smallest subnormal for <class 'numpy.float32'> type is zero.
return self._float_to_str(self.smallest_subnormal)
/bin/sh: 1: ip: not found
Traceback (most recent call last):
File "/home/ma-user/miniconda3/envs/llama/bin/msrun", line 8, in
sys.exit(main())
File "/home/ma-user/miniconda3/envs/llama/lib/python3.9/site-packages/mindspore/parallel/cluster/run.py", line 136, in main
run(args)
File "/home/ma-user/miniconda3/envs/llama/lib/python3.9/site-packages/mindspore/parallel/cluster/run.py", line 129, in run
process_manager = _ProcessManager(args)
File "/home/ma-user/miniconda3/envs/llama/lib/python3.9/site-packages/mindspore/parallel/cluster/process_entity/_api.py", line 104, in init
self.is_master = _is_local_ip(args.master_addr)
File "/home/ma-user/miniconda3/envs/llama/lib/python3.9/site-packages/mindspore/parallel/cluster/process_entity/_utils.py", line 78, in _is_local_ip
addr_infos = json.loads(addr_info_str)
File "/home/ma-user/miniconda3/envs/llama/lib/python3.9/json/init.py", line 346, in loads
return _default_decoder.decode(s)
File "/home/ma-user/miniconda3/envs/llama/lib/python3.9/json/decoder.py", line 337, in decode
obj, end = self.raw_decode(s, idx=_w(s, 0).end())
File "/home/ma-user/miniconda3/envs/llama/lib/python3.9/json/decoder.py", line 355, in raw_decode
raise JSONDecodeError("Expecting value", s, err.value) from None
json.decoder.JSONDecodeError: Expecting value: line 1 column 1 (char 0)
请问你这个环境有镜像吗,我在modelarts云平台基础镜像mindspore_2.1.0-cann_6.3.2-py_3.7-euler_2.8.3-aarch64-d910下按照MindSpore教程又安装了ascend_toolkit,结果mindspore一直报错
此处可能存在不合适展示的内容,页面不予展示。您可通过相关编辑功能自查并修改。
如您确认内容无涉及 不当用语 / 纯广告导流 / 暴力 / 低俗色情 / 侵权 / 盗版 / 虚假 / 无价值内容或违法国家有关法律法规的内容,可点击提交进行申诉,我们将尽快为您处理。
登录 后才可以发表评论