
Is there something wrong with the Apriori algorithm code in Chapter 4 of Introduction and practice of Python data mining?


This is part of the Apriori algorithm code. We want to go from the frequent itemsets containing only 1 item to the frequent itemsets containing 2 items. The code is as follows:

    from collections import defaultdict

    def find_frequent_itemsets(favorable_reviews_by_users, k_1_itemsets, min_support):
        counts = defaultdict(int)
        for user, reviews in favorable_reviews_by_users.items():
            for itemset in k_1_itemsets:
                if itemset.issubset(reviews):
                    for other_reviewed_movie in reviews - itemset:
                        current_superset = itemset | frozenset((other_reviewed_movie,))
                        counts[current_superset] += 1
        return dict([(itemset, frequency) for itemset, frequency in counts.items() if frequency >= min_support])

I think the frequent itemsets here are counted more than once. For example, for user 1, the sets {A,B} and {B,A} are the same set, but according to the code:

    for itemset in k_1_itemsets:
        if itemset.issubset(reviews):
            for other_reviewed_movie in reviews - itemset:
                current_superset = itemset | frozenset((other_reviewed_movie,))
                counts[current_superset] += 1

When itemset == {A}, we count {A,B} once.
When itemset == {B}, we count {B,A} again.
So isn't this double counting? If it is, how should the program be modified?
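The concern can be checked on a small example. The sketch below runs the function from the question on made-up data (the users, the movie labels "A", "B", "C", and min_support=1 are assumptions for illustration only) and compares it with one possible modification, named find_frequent_itemsets_dedup here, which is a hypothetical name and not something from the book. The variant collects each user's candidate supersets in a set first, so a user adds at most 1 to any superset regardless of which sub-itemset produced it.

```python
from collections import defaultdict


def find_frequent_itemsets(favorable_reviews_by_users, k_1_itemsets, min_support):
    """Logic from the question: one increment per (itemset, extra movie) pair."""
    counts = defaultdict(int)
    for user, reviews in favorable_reviews_by_users.items():
        for itemset in k_1_itemsets:
            if itemset.issubset(reviews):
                for other_reviewed_movie in reviews - itemset:
                    current_superset = itemset | frozenset((other_reviewed_movie,))
                    counts[current_superset] += 1
    return {s: f for s, f in counts.items() if f >= min_support}


def find_frequent_itemsets_dedup(favorable_reviews_by_users, k_1_itemsets, min_support):
    """Hypothetical variant: each user contributes at most 1 to each superset."""
    counts = defaultdict(int)
    for user, reviews in favorable_reviews_by_users.items():
        # Gather this user's supersets in a set so duplicates collapse.
        supersets_for_user = set()
        for itemset in k_1_itemsets:
            if itemset.issubset(reviews):
                for other_reviewed_movie in reviews - itemset:
                    supersets_for_user.add(itemset | frozenset((other_reviewed_movie,)))
        for superset in supersets_for_user:
            counts[superset] += 1
    return {s: f for s, f in counts.items() if f >= min_support}


# Made-up data: user id -> frozenset of favourably reviewed movies.
favorable_reviews_by_users = {
    1: frozenset({"A", "B"}),
    2: frozenset({"A", "B", "C"}),
    3: frozenset({"A"}),
}
k_1_itemsets = [frozenset({"A"}), frozenset({"B"})]

original = find_frequent_itemsets(favorable_reviews_by_users, k_1_itemsets, min_support=1)
deduped = find_frequent_itemsets_dedup(favorable_reviews_by_users, k_1_itemsets, min_support=1)

ab = frozenset({"A", "B"})
print(original[ab])  # 4: users 1 and 2 each count {A, B} twice
print(deduped[ab])   # 2: users 1 and 2 each count {A, B} once
```

On this data only two users reviewed both A and B, yet the original function reports 4 for {A,B}, which supports the reading that the pair is counted once from {A} and once from {B}. Note that the inflation is not uniform: {A,C} is reached only from {A} because {C} is not in k_1_itemsets, so it is counted once in both versions. That is why simply raising min_support would not be an equivalent fix.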
