name | about | labels |
---|---|---|
Bug Report | Use this template for reporting a bug | kind/bug |
GPU环境,动态shape,单卡训练,单输入,HW维动态,使用model.train训练并验证mindir导入导出,报Tile算子相关错误
Ascend
/GPU
/CPU
) / 硬件环境:Please delete the backend not involved / 请删除不涉及的后端:
/device GPU
Software Environment / 软件环境 (Mandatory / 必填):
-- MindSpore version (e.g., 1.7.0.Bxxx) :
-- Python version (e.g., Python 3.7.5) :
-- OS platform and distribution (e.g., Linux Ubuntu 16.04):
-- GCC/Compiler version (if compiled from source):
commit_id = '[sha1]:2f410aa8,[branch]:(HEAD,origin/master,origin/HEAD,master)'
Excute Mode / 执行模式 (Mandatory / 必填)(PyNative
/Graph
):
Please delete the mode not involved / 请删除不涉及的模式:
/mode graph
test_ms_dynamic_shape_amp_o3_normal_net_0002_mindir_infer
test_ms_dynamic_shape_amp_o0_normal_net_0001_mindir_infer
source /home/miniconda3/bin/activate feature_39
export TRAIN_MODE=GRAPH_MODE
export DEVICE_TYPE=GPU_PCIE
export ENV_DEVICE=1
source solution_test/env_set.source -e cuda11.6
cd solution_test/cases/01frame_func/04model_save_load/dynamic_shape
pytest -s test_ms_dynamic_shape_amp_o3_normal_net_0002_mindir_infer.py
报错日志:
[CRITICAL] KERNEL(63211,7f254eaca700,python):2024-05-19-01:19:40.410.202 [mindspore/core/utils/shape_utils.h:44] SizeOf] The product value of shape (1, 1, 275, 214, 1, 1, 275, 214, 1, 1, 275, 214, 32, 3, 275, 214) exceeds the maximum value of size_t: 18446744073709551615
[ERROR] RUNTIME_FRAMEWORK(63211,7f254eaca700,python):2024-05-19-01:19:40.410.584 [mindspore/ccsrc/runtime/graph_scheduler/actor/kernel_async_infer_actor.cc:38] InferShape] Failed to infer shape for kernel: Gradients/Default/network-TrainOneStepCell/network-WithLossCell/_backbone-Dynamic_Net/gradReduceSum-expand/Tile-op4 and catch exception: The product value of shape (1, 1, 275, 214, 1, 1, 275, 214, 1, 1, 275, 214, 32, 3, 275, 214) exceeds the maximum value of size_t: 18446744073709551615
mindspore/core/utils/shape_utils.h:44 SizeOf
[ERROR] RUNTIME_FRAMEWORK(63211,7f25512cb700,python):2024-05-19-01:19:40.410.818 [mindspore/ccsrc/runtime/graph_scheduler/actor/actor_common.cc:274] WaitRuntimePipelineFinish] Wait runtime pipeline finish and an error occurred: The product value of shape (1, 1, 275, 214, 1, 1, 275, 214, 1, 1, 275, 214, 32, 3, 275, 214) exceeds the maximum value of size_t: 18446744073709551615
mindspore/core/utils/shape_utils.h:44 SizeOf
训练推理正常,用例pass
走给 周莉莉
此处可能存在不合适展示的内容,页面不予展示。您可通过相关编辑功能自查并修改。
如您确认内容无涉及 不当用语 / 纯广告导流 / 暴力 / 低俗色情 / 侵权 / 盗版 / 虚假 / 无价值内容或违法国家有关法律法规的内容,可点击提交进行申诉,我们将尽快为您处理。
感谢您的提问,您可以评论//mindspore-assistant更快获取帮助:
用例当前成功
回归版本:
ms版本:commit_id = '[sha1]:13fef1e8,[branch]:(HEAD,origin/master,origin/HEAD,master)'
run包:runpkg_version:Milan_C18/20240522
回归步骤:参考issue复现步骤
基本功能:周ci跑测通过
test_ms_dynamic_shape_amp_o3_normal_net_0002_mindir_infer
====== 1 passed, 4 warnings in 42.60s ======
test_ms_dynamic_shape_amp_o0_normal_net_0001_mindir_infer
==== 1 passed, 4 warnings in 42.04s ====
测试结论:回归通过
登录 后才可以发表评论