• CVE-2024-5206

发布时间: 2024年6月21日

修改时间: 2024年10月31日

概要

A sensitive data leakage vulnerability was identified in scikit-learn s TfidfVectorizer, specifically in versions up to and including 1.4.1.post1, which was fixed in version 1.5.0. The vulnerability arises from the unexpected storage of all tokens present in the training data within the `stop_words_` attribute, rather than only storing the subset of tokens required for the TF-IDF technique to function. This behavior leads to the potential leakage of sensitive information, as the `stop_words_` attribute could contain tokens that were meant to be discarded and not stored, such as passwords or keys. The impact of this vulnerability varies based on the nature of the data being processed by the vectorizer.

CVSS v3 指标

NVD openEuler
Confidentiality High High
Attack Vector Local Network
CVSS评分 4.7 5.3
Attack Complexity High High
Privileges Required Low Low
Scope Unchanged Unchanged
Integrity None None
User Interaction None None
Availability None None

安全公告

公告名 概要 发布时间
KylinSec-SA-2024-3541 python-scikit-learn security update 2025年2月5日

影响产品

产品 状态
KY3.4-5A python-scikit-learn Fixed
KY3.5.2 python-scikit-learn Fixed
V6 python-scikit-learn Fixed