python joblib并行在Windows上工作甚至“如果__name__ ==’__ main__’:”被添加

前端之家收集整理的这篇文章主要介绍了python joblib并行在Windows上工作甚至“如果__name__ ==’__ main__’:”被添加前端之家小编觉得挺不错的,现在分享给大家,也给大家做个参考。
我在 Windows上运行 Python并行处理.这是我的代码
from joblib import Parallel,delayed

def f(x): 
    return sqrt(x)

if __name__ == '__main__':
    a = Parallel(n_jobs=2)(delayed(f)(i) for i in range(10))

这是错误消息:

Process PoolWorker-2:  
Process PoolWorker-1:  
Traceback (most recent call last):    
File "C:\Users\yoyo__000.BIGBLACK\AppData\Local\Enthought\Canopy\App\appdata\canopy-1.5.4.3105.win-x86_64\lib\multiprocessing\process.py",line 258,in _bootstrap
self.run()   
File "C:\Users\yoyo__000.BIGBLACK\AppData\Local\Enthought\Canopy\App\appdata\canopy-1.5.4.3105.win-x86_64\lib\multiprocessing\process.py",line 114,in run
self._target(*self._args,**self._kwargs)   
File "C:\Users\yoyo__000.BIGBLACK\AppData\Local\Enthought\Canopy\App\appdata\canopy-1.5.4.3105.win-x86_64\lib\multiprocessing\pool.py",line 102,in worker
task = get()   
File "C:\Users\yoyo__000.BIGBLACK\AppData\Local\Enthought\Canopy\User\lib\site-packages\joblib\pool.py",line 363,in get
return recv()  
AttributeError: 'module' object has no attribute 'f'

解决方法

根据 this site,问题是Windows特定的:

Yes: under linux we are forking,thus their is no need to pickle the
function,and it works fine. Under windows,the function needs to be
pickleable,ie it needs to be imported from another file. This is
actually good practice: making modules pushes for reuse.

我已经尝试过你的代码,它在Linux下运行完美.
在Windows下,如果从脚本运行,它运行正常,如python script_with_your_code.py.但是在交互式python会话中运行时失败了.当我将f函数保存在单独的模块中并将其导入到我的交互式会话中时,它对我有用.

不工作:

互动环节:

>>> from math import sqrt
>>> from joblib import Parallel,delayed

>>> def f(x):
...     return sqrt(x)

>>> if __name__ == '__main__':
...     a = Parallel(n_jobs=2)(delayed(f)(i) for i in range(10))
...
Process PoolWorker-1:
Traceback (most recent call last):
  File "C:\Python27\lib\multiprocessing\process.py",in _bootstrap
    self.run()
  File "C:\Python27\lib\multiprocessing\process.py",in run
    self._target(*self._args,**self._kwargs)
  File "C:\Python27\lib\multiprocessing\pool.py",in worker
    task = get()
  File "C:\Python27\lib\site-packages\joblib\pool.py",line 359,in get
    return recv()
AttributeError: 'module' object has no attribute 'f'

工作方式:
fun.py

from math import sqrt

def f(x):
    return sqrt(x)

互动环节:

>>> from joblib import Parallel,delayed
>>> from fun import f

>>> if __name__ == '__main__':
...     a = Parallel(n_jobs=2)(delayed(f)(i) for i in range(10))
...
>>> a
[0.0,1.0,1.4142135623730951,1.7320508075688772,2.0,2.23606797749979,2.449489742783178,2.6457513110645907,2.8284271247461903,3.0]

猜你在找的Python相关文章