商业数据科学(影印版)

商业数据科学(影印版)
作 者: Foster Provost
出版社: 东南大学出版社
丛编项:
版权说明: 本书为公共版权或经版权方授权,请支持正版图书
标 签: 暂缺
ISBN 出版时间 包装 开本 页数 字数
未知 暂无 暂无 未知 0 暂无

作者简介

暂缺《商业数据科学(影印版)》作者简介

内容简介

这是一本博大精深但又不太技术的指南,向你介绍数据科学的基本原则,并带领你全程浏览从所搜集数据中抽取有用知识和商业价值所必需的“数据分析思维”。通过学习数据科学原则,你将领略当今用到的诸多数据挖掘技巧。更重要的是,这些原则支撑着通过数据挖掘技巧解决商业问题所需的手段和策略。

图书目录

Preface

1.Introduction: Data-Analytic Thinking

The Ubiquity of Data Opportunities

Example: Hurricane Frances

Example: Predicting Customer Churn

Data Science, Engineering, and Data-Driven Decision Making

Data Processing and "Big Data"

From Big Data 1.0 to Big Data 2.0

Data and Data Science Capability as a Strategic Asset

Data-Analytic Thinking

This Book

Data Mining and Data Science, Revisited

Chemistry Is Not About Test Tubes: Data Science Versus the Work of the Data Scientist

Summary

2.Business Problems and Data Science Solutions

From Business Problems to Data Mining Tasks

Supervised Versus Unsupervised Methods

Data Mining and Its Results

The Data Mining Process

Business Understanding

Data Understanding

Data Preparation

Modeling

Evaluation

Deployment

Implications for Managing the Data Science Team

Other Analytics Techniques and Technologies

Statistics

Database Querying

Data Warehousing

Regression Analysis

Machine Learning and Data Mining

Answering Business Questions with These Techniques

Summary

3.Introduction to Predictive Modeling: From Correlation to Supervised Segmentation.

Models, Induction, and Prediction

Supervised Segmentation

Selecting Informative Attributes

Example: Attribute Selection with Information Gain

Supervised Segmentation with Tree-Structured Models

Visualizing Segmentations

Trees as Sets of Rules

Probability Estimation

Example: Addressing the Churn Problem with Tree Induction

Summary

4.Fitting a Model to Data

Classification via Mathematical Functions

Linear Discriminant Functions

Optimizing an Objective Function

An Example of Mining a Linear Discriminant from Data

Linear Discriminant Functions for Scoring and Ranking Instances

Support Vector Machines, Briefly

Regression via Mathematical Functions

Class Probability Estimation and Logistic "Regression"

Logistic Regression: Some Technical Details

Example: Logistic Regression versus Tree Induction

Nonlinear Functions, Support Vector Machines, and Neural Networks

5.Overfitting and Its Avoidance

6.Similarity, Neighbors, and Clusters

7.Decision AnalyticThinking h What Is a Good Model?

8.Visualizing Model Performance

9.Evidence and Probabilities

10.Representing and Mining Text

11.Decision Analytic Thinking Ih Toward Analytical Engineering

12.Other Data Science Tasks and Techniques

13.Data Science and Business Strategy

14.Conclusion

A.Proposal ReviewGuide

B.Another Sample Proposal

Glossary

Bibliography

Index