14 Star 268 Fork 104

PaddlePaddle / PaddleGAN

 / 详情

运行wav2lip.py 文件 在GUP 多卡的情况下报错Process abort signal` is detected by the operating system,显卡内存充足的情况下报错

待办的
创建于  
2022-11-07 10:21

py3.7.0
linux ubuntu 18.04
CUDA 11.2
paddlepaddle-gpu 2.3.2
四张16G显卡

以下是报错信息
0 paddle::imperative::Tracer::TraceOp(std::string const&, paddle::imperative::NameVarBaseMap const&, paddle::imperative::NameVarBaseMap const&, paddle::framework::AttributeMap, std::map<std::string, std::string, std::less<std::string >, std::allocator<std::pair<std::string const, std::string > > > const&)
1 void paddle::imperative::Tracer::TraceOpImplpaddle::imperative::VarBase(std::string const&, paddle::imperative::details::NameVarMapTraitpaddle::imperative::VarBase::Type const&, paddle::imperative::details::NameVarMapTraitpaddle::imperative::VarBase::Type const&, paddle::framework::AttributeMap&, phi::Place const&, bool, std::map<std::string, std::string, std::less<std::string >, std::allocator<std::pair<std::string const, std::string > > > const&, paddle::framework::AttributeMap*, bool)
2 paddle::imperative::PreparedOp::Run(paddle::imperative::NameVarBaseMap const&, paddle::imperative::NameVarBaseMap const&, paddle::framework::AttributeMap const&, paddle::framework::AttributeMap const&)
3 void phi::AddRawKernel<float, phi::GPUContext>(phi::GPUContext const&, phi::DenseTensor const&, phi::DenseTensor const&, int, phi::DenseTensor*)
4 float* phi::DeviceContext::Alloc(phi::TensorBase*, unsigned long, bool) const
5 phi::DenseTensor::AllocateFrom(phi::Allocator*, paddle::experimental::DataType, unsigned long)
6 paddle::memory::allocation::StatAllocator::AllocateImpl(unsigned long)
7 paddle::memory::allocation::RetryAllocator::AllocateImpl(unsigned long)
8 paddle::memory::allocation::StreamSafeCUDAAllocator::AllocateImpl(unsigned long)
9 paddle::memory::allocation::AutoGrowthBestFitAllocator::FreeIdleChunks()
10 paddle::memory::allocation::CUDAAllocator::FreeImpl(phi::Allocation*)
11 paddle::platform::RecordedGpuMallocHelper::Free(void*, unsigned long)


Error Message Summary:

FatalError: Process abort signal is detected by the operating system.
[TimeInfo: *** Aborted at 1667558259 (unix time) try "date -d @1667558259" if you are using GNU date ***]
[SignalInfo: *** SIGABRT (@0x9a43) received by PID 39491 (TID 0x7f28c5194740) from PID 39491 ***]
输入图片说明

评论 (0)

Mr_One 创建了任务

登录 后才可以发表评论

状态
负责人
里程碑
Pull Requests
关联的 Pull Requests 被合并后可能会关闭此 issue
分支
开始日期   -   截止日期
-
置顶选项
优先级
参与者(1)
Python
1
https://gitee.com/paddlepaddle/PaddleGAN.git
git@gitee.com:paddlepaddle/PaddleGAN.git
paddlepaddle
PaddleGAN
PaddleGAN

搜索帮助

53164aa7 5694891 3bd8fe86 5694891