为什么同样是 lxml5.4.0,在 ubuntu上会报错 ImportError: lxml.html.clean module is now a separate project lxml_html_clean 但是 debian 不会呢?
测试的 python 都是 3.11;lxml 版本都是 5.4。0
机器 A Ubuntu 24.04.2 LTS x86_64
In [1]: import lxml.html.clean
---------------------------------------------------------------------------
ImportError Traceback (most recent call last)
Cell In[1], line 1
----> 1 import lxml.html.clean
File ~/.local/share/virtualenvs/parser-w55maLTU/lib/python3.10/site-packages/lxml/html/clean.py:18
8 __all__ = [
9 "clean_html",
10 "clean",
(...)
15 "word_break_html",
16 ]
17 except ImportError:
---> 18 raise ImportError(
19 "lxml.html.clean module is now a separate project lxml_html_clean./n"
20 "Install lxml[html_clean] or lxml_html_clean directly."
21 ) from None
ImportError: lxml.html.clean module is now a separate project lxml_html_clean.
Install lxml[html_clean] or lxml_html_clean directly.
机器 B Debian GNU/Linux 12 (bookworm)
In [1]: import lxml.html.clean
最后搞不懂,估计是一些依赖残留的问题
所以我把虚拟环境删除了,重建了一个新的虚拟环境,然后重新安装所有依赖,问题就消失了
IT极限技术分享汇