Scikit Learn CountVectorizer 入门实例

来源：互联网收集：自由互联发布时间：2022-07-20

http://stackoverflow.com/questions/27488446/scikit-learn-countvectorizer from sklearn.feature_extraction.text import CountVectorizer texts=["dog cat fish","dog cat cat","fish bird", 'bird'] cv = CountVectorizer() cv_fit=cv.fit_t

http://stackoverflow.com/questions/27488446/scikit-learn-countvectorizer

from sklearn.feature_extraction.text import CountVectorizer

texts=["dog cat fish","dog cat cat","fish bird", 'bird']
cv = CountVectorizer()
cv_fit=cv.fit_transform(texts)

print(cv.get_feature_names())
print(cv_fit.toarray())
#['bird', 'cat', 'dog', 'fish']
#[[0 1 1 1]
# [0 2 1 0]
# [1 0 0 1]
# [1 0 0 0]]

print(cv_fit.toarray().sum(axis=0))
#[2 3 2 2]

上一篇：Java-Python的完全对齐的tokenizer（字级别）
下一篇：没有了

Scikit Learn CountVectorizer 入门实例
Java-Python的完全对齐的tokenizer（字级别）
python sqlite insert 报错 no such column
tfidf python 中文实例
python 不保留float最后的0
python flask mysql request 实例
matplotlib InstallationError: Command python setup.py egg_info failed with error code 1
ipython notebook http://localhost:8888/tree error
python 一维数组变为二维数组
一段python代码熟悉语法
【推荐】10个好用的Python集成开发环境！
python 自动化办公之批量修改文件名

网友评论

相关栏目

delphi
微信开发
其它开发
ruby
c语言
java
python
批处理
c++
小程序开发

Scikit Learn CountVectorizer 入门实例

相关文章