Spark权威指南(影印版 英文版)

Spark权威指南(影印版 英文版)
作 者: 比尔·钱伯斯 马太·扎哈里亚
出版社: 东南大学出版社
丛编项:
版权说明: 本书为出版图书,暂不支持在线阅读,请支持正版图书
标 签: 暂缺
ISBN 出版时间 包装 开本 页数 字数
未知 暂无 暂无 未知 0 暂无

作者简介

暂缺《Spark权威指南(影印版 英文版)》作者简介

内容简介

暂缺《Spark权威指南(影印版 英文版)》简介

图书目录

Preface

Part I. Gentle Overview of Big Data and Spark

1. What Is Apache Spark?

Apache Spark's Philosophy

Context: The Big Data Problem

History of Spark

The Present and Future of Spark

Running Spark

Downloading Spark Locally

Launching Spark's Interactive Consoles

Running Spark in the Cloud

Data Used in This Book

2. A Gentle Introduction to Spark

Spark's Basic Architecture

Spark Applications

Spark's Language APIs

Spark's APIs

Starting Spark

The SparkSession

DataFrames

Partitions

Transformations

Lazy Evaluation

Actions

Spark UI

An End-to-End Example

DataFrames and SQL

Conclusion

3. A Tour of Spark's Too1set

Running Production Applications

Datasets: Type-Safe Structured APIs

Structured Streaming

Machine Learning and Advanced Analytics

Lower-Level APIs

SparkR

Spark's Ecosystem and Packages

Conclusion

Part II. Structured APls--DataFrames, SQL, and Datasets

4. Structured API Overview

DataFrames and Datasets

Schemas

Overview of Structured Spark Types

DataFrames Versus Datasets

Columns

Rows

Spark Types

Overview of Structured API Execution

Logical Planning

Physical Planning

Execution