[repost ]Data mining with WEKA, Part 2: Classification and clustering
oringinal:http://www.ibm.com/developerworks/opensource/library/os-weka2/index.html Michael Abernethy, Freelance Programmer, Freelancer Summary: Data mining is a collective term for dozens of...
View Article[repost ]Data mining with WEKA, Part 3: Nearest Neighbor and server-side library
original:http://www.ibm.com/developerworks/opensource/library/os-weka3/index.html Michael Abernethy, Freelance Programmer, Freelancer Summary: Data mining can be used to turn seemingly meaningless...
View Article[repost ]Statistical Data Mining Tutorials
original:http://www.autonlab.org/tutorials/ Tutorial Slides by Andrew Moore Advertisment: In 2006 I joined Google. We are growing a Google Pittsburgh office on CMU’s campus. We are hiring creative...
View Article[repost ]A Programmer’s Guide to Data Mining
original:http://guidetodatamining.com/ Welcome A guide to practical data mining, collective intelligence, and building recommendation systems by Ron Zacharski Before you is a tool for learning...
View Article[repost ]Summarization with Lucene
original:http://sujitpal.blogspot.com/2009/02/summarization-with-lucene.html You may have noticed that over the last couple of months, I haven’t been writing too much about text mining. So I got...
View Article[repost ]IR Math with Java : Similarity Measures
original:http://sujitpal.blogspot.com/2008/09/ir-math-with-java-similarity-measures.html Last week, I wrote about building term document matrices based on Dr. Manu Konchady’s Text Mining Application...
View Article[repost ]Generating Unigram and Bigrams into MySQL from Hadoop SequenceFiles
original:http://sujitpal.blogspot.com/2012/04/generating-unigram-and-bigrams-into.html In my previous post, I described how I used GNU Parallel to read a fairly large Lucene index into a set of Hadoop...
View Article[repost ]An UIMA Sentence Annotator using OpenNLP
original:http://sujitpal.blogspot.com/2011/04/uima-sentence-annotator-using-opennlp.html Recently, a colleague pointed out that our sentence splitting code (written by me using Java BreakIterator) was...
View Article[repost ]奇异值分解SVD应用——LSI
original:http://blog.csdn.net/abcjennifer/article/details/8131087 潜在语义索引(Latent Semantic Indexing)是一个严重依赖于SVD的算法,本文转载自之前吴军老师《数学之美》和参考文献《机器学习中的数学》汇总。 ————————————...
View Article[repost ] KMeans和KMedoid 的Matlab实现
original:http://blog.csdn.net/abcjennifer/article/details/8197072 KMeans和KMedoid算法是聚类算法中比较普遍的方法,本文讲了其原理和matlab中实现的代码。 1.目标: 找出一个分割,使得距离平方和最小 2.K-Means算法: 1. 将数据分为k个非空子集 2....
View Article[book ]Data Mining: Concepts and Techniques, 2nd ed.
original:http://www.cs.uiuc.edu/~hanj/bk2/ Jiawei Han and Micheline Kamber Data Mining: Concepts and Techniques, 2nd ed. The Morgan Kaufmann Series in Data Management Systems, Jim Gray, Series Editor...
View Article[book ]Slides in PowerPoint form :Data Mining: Concepts and Techniques, 2nd ed.
original:http://www.cs.uiuc.edu/homes/hanj/bk2/slidesindex.htm Jiawei Han and Micheline Kamber Data Mining: Concepts and Techniques, 2nd ed. The Morgan Kaufmann Series in Data Management Systems, Jim...
View Article[repost ]DL Lecture by Rayid Ghani
original:http://yourlisten.com/channel/content/16941319/DL_Lecture_by_Rayid_Ghani DL Lecture by Rayid Ghani The second Digital Leaders Annual Lecture was delivered by Rayid Ghani, former Chief...
View Article[repost ]Statistical Data Mining Tutorials
original:http://www.autonlab.org/tutorials/ Tutorial Slides by Andrew Moore Advertisment: In 2006 I joined Google. We are growing a Google Pittsburgh office on CMU’s campus. We are hiring creative...
View Article[repost ]Statistical Data Mining Tutorials Tutorial Slides by Andrew Moore
original:http://www.autonlab.org/tutorials/ Advertisment: In 2006 I joined Google. We are growing a Google Pittsburgh office on CMU’s campus. We are hiring creative computer scientists who love...
View Article[repost ]A Programmer’s Guide to Data Mining
original:http://guidetodatamining.com/ Welcome A guide to practical data mining, collective intelligence, and building recommendation systems by Ron Zacharski. Before you is a tool for learning basic...
View Article[repost ]Choosing the right estimator
original:http://scikit-learn.org/stable/tutorial/machine_learning_map/index.html Often the hardest part of solving a machine learning problem can be finding the right estimator for the job. Different...
View Article[repost ]Top 10 Algorithms in Data Mining
original:http://www.cs.uvm.edu/~icdm/algorithms/index.shtml [April 22, 2009:] A companion book on The Top Ten Algorithms in Data Mining published in April 2009 [December 24, 2007:] A companion article...
View Article[repost ]What is Data Mining and KDD
original:http://machinelearningmastery.com/what-is-data-mining-and-kdd/ I am very interested in processes. I want to know good ways to do things, even the best way to do things if possible. Even if you...
View Article
More Pages to Explore .....