11 Star 55 Fork 20

MindSpore / xai

 / 详情

关于TB-Net的运行问题

DONE
Bug-Report
创建于  
2022-10-21 17:52
  1. 【Document Link】/【文档链接】
    https://gitee.com/mindspore/xai/tree/master/models/whitebox/tbnet

  2. 【Issues Section】/【问题文档片段】

  3. 【Existing Issues】/【存在的问题】
    当运行模式为GRAPH ,程序在第一个epoch结束后卡住

[WARNING] ME(4540:13808,MainProcess):2022-10-21-17:44:45.285.363 [mindspore\dataset\engine\datasets_user_defined.py:657] Python multiprocessing is not supported on Windows platform.
[WARNING] ME(4540:13808,MainProcess):2022-10-21-17:44:45.285.363 [mindspore\dataset\engine\datasets_user_defined.py:657] Python multiprocessing is not supported on Windows platform.
creating dataset from F:\model1021\xai\models\whitebox\tbnet\data\steam\train.csv...
creating TBNet for training...
training...
===================== Epoch 0 =====================
Train epoch time: 12016.697 ms, per step time: 156.061 ms

而当运行模式为PYNative时
报错信息如下

creating dataset from F:\model1021\xai\models\whitebox\tbnet\data\steam\train.csv...
creating TBNet for training...
training...
===================== Epoch 0 =====================
Traceback (most recent call last):
  File "F:\model1021\xai\models\whitebox\tbnet\train.py", line 155, in <module>
    train_tbnet()
  File "F:\model1021\xai\models\whitebox\tbnet\train.py", line 144, in train_tbnet
    model.train(epoch=1, train_dataset=train_ds, callbacks=[time_callback, loss_callback], dataset_sink_mode=False)
  File "C:\Users\Administrator\miniconda3\envs\md10\lib\site-packages\mindspore\train\model.py", line 1045, in train
    self._train(epoch,
  File "C:\Users\Administrator\miniconda3\envs\md10\lib\site-packages\mindspore\train\model.py", line 98, in wrapper
    func(self, *args, **kwargs)
  File "C:\Users\Administrator\miniconda3\envs\md10\lib\site-packages\mindspore\train\model.py", line 617, in _train
    self._train_process(epoch, train_dataset, list_callback, cb_params, initial_epoch, valid_infos)
  File "C:\Users\Administrator\miniconda3\envs\md10\lib\site-packages\mindspore\train\model.py", line 908, in _train_process
    outputs = self._train_network(*next_element)
  File "C:\Users\Administrator\miniconda3\envs\md10\lib\site-packages\mindspore\nn\cell.py", line 620, in __call__
    raise err
  File "C:\Users\Administrator\miniconda3\envs\md10\lib\site-packages\mindspore\nn\cell.py", line 616, in __call__
    output = self._run_construct(args, kwargs)
  File "C:\Users\Administrator\miniconda3\envs\md10\lib\site-packages\mindspore\nn\cell.py", line 418, in _run_construct
    output = self.construct(*cast_inputs, **kwargs)
  File "F:\model1021\xai\models\whitebox\tbnet\src\tbnet.py", line 322, in construct
    grads = self.grad(self.network, weights)(item, rl1, ref, rl2, hist_item, label, sens)
  File "C:\Users\Administrator\miniconda3\envs\md10\lib\site-packages\mindspore\ops\composite\base.py", line 401, in after_grad
    return grad_(fn, weights)(*args, **kwargs)
  File "C:\Users\Administrator\miniconda3\envs\md10\lib\site-packages\mindspore\common\api.py", line 99, in wrapper
    results = fn(*arg, **kwargs)
  File "C:\Users\Administrator\miniconda3\envs\md10\lib\site-packages\mindspore\ops\composite\base.py", line 391, in after_grad
    self._pynative_forward_run(fn, grad_, args, kwargs)
  File "C:\Users\Administrator\miniconda3\envs\md10\lib\site-packages\mindspore\ops\composite\base.py", line 430, in _pynative_forward_run
    fn(*args, **new_kwargs)
  File "C:\Users\Administrator\miniconda3\envs\md10\lib\site-packages\mindspore\nn\cell.py", line 620, in __call__
    raise err
  File "C:\Users\Administrator\miniconda3\envs\md10\lib\site-packages\mindspore\nn\cell.py", line 616, in __call__
    output = self._run_construct(args, kwargs)
  File "C:\Users\Administrator\miniconda3\envs\md10\lib\site-packages\mindspore\nn\cell.py", line 418, in _run_construct
    output = self.construct(*cast_inputs, **kwargs)
  File "F:\model1021\xai\models\whitebox\tbnet\src\tbnet.py", line 217, in construct
    self.network(item, rl1, ref, rl2, hist_item)
  File "C:\Users\Administrator\miniconda3\envs\md10\lib\site-packages\mindspore\nn\cell.py", line 620, in __call__
    raise err
  File "C:\Users\Administrator\miniconda3\envs\md10\lib\site-packages\mindspore\nn\cell.py", line 616, in __call__
    output = self._run_construct(args, kwargs)
  File "C:\Users\Administrator\miniconda3\envs\md10\lib\site-packages\mindspore\nn\cell.py", line 418, in _run_construct
    output = self.construct(*cast_inputs, **kwargs)
  File "F:\model1021\xai\models\whitebox\tbnet\src\tbnet.py", line 108, in construct
    rl1_embs = self.relation_emb_matrix(rl1)
  File "C:\Users\Administrator\miniconda3\envs\md10\lib\site-packages\mindspore\nn\cell.py", line 620, in __call__
    raise err
  File "C:\Users\Administrator\miniconda3\envs\md10\lib\site-packages\mindspore\nn\cell.py", line 616, in __call__
    output = self._run_construct(args, kwargs)
  File "C:\Users\Administrator\miniconda3\envs\md10\lib\site-packages\mindspore\nn\cell.py", line 418, in _run_construct
    output = self.construct(*cast_inputs, **kwargs)
  File "F:\model1021\xai\models\whitebox\tbnet\src\embedding.py", line 70, in construct
    output = self.reshape(output_for_reshape, out_shape)
  File "C:\Users\Administrator\miniconda3\envs\md10\lib\site-packages\mindspore\ops\primitive.py", line 303, in __call__
    return _run_op(self, self.name, args)
  File "C:\Users\Administrator\miniconda3\envs\md10\lib\site-packages\mindspore\common\api.py", line 99, in wrapper
    results = fn(*arg, **kwargs)
  File "C:\Users\Administrator\miniconda3\envs\md10\lib\site-packages\mindspore\ops\primitive.py", line 752, in _run_op
    output = real_run_op(obj, op_name, args)
RuntimeError: At most one component of input shape can be -1, but got [const vector][-1, -1, 26, 26]

----------------------------------------------------
- C++ Call Stack: (For framework developers)
----------------------------------------------------
mindspore\core\ops\reshape.cc:49 mindspore::ops::update_shape

我目前版本为win-GPU版本的测试版,请问以上问题可能原因出在哪里?
4. 【Expected Result】【预期结果】

  • Please fill in the expected result

评论 (4)

Luxian 创建了Question
fangwenyi 任务状态TODO 修改为ACCEPTED
展开全部操作日志

你好,问题收到,我们已安排人员分析

fangwenyi 负责人设置为ZhidanLiu
fangwenyi 添加了
 
mindspore-assistant
标签
fangwenyi 关联项目设置为MindSpore Issue Assistant
fangwenyi 任务类型Question 修改为Bug-Report

暂时XAI并没有对Windows的支持, 请考虑使用Ubuntu的版本或参考 https://www.mindspore.cn/xai/docs/zh-CN/r1.8/installation.html 所列明的支持平台

好的

您好,由于问题单时间较长可能会有版本gap暂时关闭,如您尝试新版本仍无法解决,可以反馈下具体信息,并将ISSUE状态修改为WIP,我们这边会进一步跟踪,谢谢

Shawny 任务状态ACCEPTED 修改为DONE

登录 后才可以发表评论

状态
负责人
项目
里程碑
Pull Requests
关联的 Pull Requests 被合并后可能会关闭此 issue
分支
开始日期   -   截止日期
-
置顶选项
优先级
预计工期 (小时)
参与者(4)
9345044 hanxiaoyang1 1683445866 8108889 shawny233 1628167362
Python
1
https://gitee.com/mindspore/xai.git
git@gitee.com:mindspore/xai.git
mindspore
xai
xai

搜索帮助

53164aa7 5694891 3bd8fe86 5694891