图书标签: 数据分析 数据挖掘 O'Reilly Data-Analysis Python opensource data 计算机
发表于2025-01-13
Data Analysis with Open Source Tools pdf epub mobi txt 电子书 下载 2025
Description
Real World Data Analysis shows you how you think about data and the results you want to achieve with it. Author Philipp Janert teaches you how to effectively approach data analysis problems, and how to extract all the available information from your data. Many people can apply a data analysis formula. This book shows you how to look at the results and know whether they're meaningful.
These days it seems like everyone is collecting data. But all of that data is just raw information -- to make that information meaningful, it has to be organized, filtered, and analyzed. Anyone can apply data analysis tools and get results, but without the right approach those results may be useless.
In Real World Data Analysis, author Philipp Janert teaches you how to think about data: how to effectively approach data analysis problems, and how to extract all of the available information from your data. Janert covers univariate data, data in multiple dimensions, time series data, graphical techniques, data mining, machine learning, and many other topics. He also reveals how seat-of-the-pants knowledge can lead you to the best approach right from the start, and how to assess results to determine if they're meaningful.
Philipp K. Janert
After previous careers in physics and software development, Philipp K. Janert currently provides consulting services for data analysis, algorithm development, and mathematical modeling. He has worked for small start-ups and in large corporate environments, both in the U.S. and overseas. He prefers simple solutions that work to complicated ones that don't, and thinks that purpose is more important than process. Philipp is the author of "Gnuplot in Action - Understanding Data with Graphs" (Manning Publications), and has written for the O'Reilly Network, IBM developerWorks, and IEEE Software. He is named inventor on a handful of patents, and is an occasional contributor to CPAN. He holds a Ph.D. in theoretical physics from the University of Washington. Visit his company website at www.principal-value.com.
其实我觉得70%都是在讲概率和应用数学……我是走错片场了么?(Update: 我的确走错片场了,看完了发现它想要告诉我全部细节,结果就是神马都是重点,抓狂了……)
评分这本书是好书,在读。
评分比较high-level的入门书,很好懂,理论以“都介绍一点”为主,每章也列出可以用来做这章里讲到的东西的python和R的libraries。缺点是实战例子不多。
评分Author keeps placing emphasis on insights instead of numbers while working with data. The ultimate goal of data analysis is to understand how the system works, not to show off how proficient you are at Math. That's the true spirit of professionalism. Some annoying jargon are well explained in a plain manner. Little sections on R.
评分这本书是好书,在读。
这本书的主要内容是“数据分析”,而且讲解明显不够透彻,有泛泛之嫌,总评B+,难度A,推荐指数B(SABC分级) 内容尚且如此,翻译啥的就不重要了,不过倒不是翻译的特别烂,应该说不会有明显倒胃口的情况
评分不得不说本书的翻译不敢让人恭维。拿到书后粗略翻了翻,翻译的水平勉强达到“信达雅”中的“信”吧,我想这本书应该是导师交给学生翻译的。不过买之前我已经做好心理准备:一来这个是技术书,不求文字的华丽;二来我已经有pdf的电子版,买这本中文版的目的是加快阅读。 所以,...
评分这本书的主要内容是“数据分析”,而且讲解明显不够透彻,有泛泛之嫌,总评B+,难度A,推荐指数B(SABC分级) 内容尚且如此,翻译啥的就不重要了,不过倒不是翻译的特别烂,应该说不会有明显倒胃口的情况
评分1. 30页起Rank-Order Plots, Pareto Chart。由于引入了dependent variable,个人认为这种解决方案已经不属于单变量数据的可视化,应当放在第三章(双变量数据)中加以叙述。 2. 34页,关于标准差的定义公式有2个,其中第一个是正确的,而第二个则是错误的。
评分不得不说本书的翻译不敢让人恭维。拿到书后粗略翻了翻,翻译的水平勉强达到“信达雅”中的“信”吧,我想这本书应该是导师交给学生翻译的。不过买之前我已经做好心理准备:一来这个是技术书,不求文字的华丽;二来我已经有pdf的电子版,买这本中文版的目的是加快阅读。 所以,...
Data Analysis with Open Source Tools pdf epub mobi txt 电子书 下载 2025