Analyze large datasets directly from R. Scaling Up With R and Arrow provides a guide to working efficiently with larger-than-memory datasets using the arrow R package. As data grows in size and complexity, traditional data analysis methods in R often hit technical limitations. In this book, you'll learn how to overcome these hurdles without needing to set up complex infrastructure.
You'll learn about the Apache Arrow project's origins, goals, and its significance in bridging the gap between data science and big data ecosystems. You'll also learn how to leverage the arrow R package to work directly with files in various formats, such as CSV and Parquet, using familiar dplyr syntax. This book explores practical topics like data manipulation, file formats, working with larger datasets, and optimizing workflows for data in cloud storage. Advanced chapters examine user-defined functions, integration with other tools like DuckDB, and extending Arrow's capabilities to work with geospatial data.
Written by developers of the Arrow R package, this guide is essential for anyone looking to scale their data processing capabilities in R.
"synopsis" may belong to another edition of this title.
Nic Crane is an R developer, educator, and general enthusiast, with a background in data science and software engineering. Nic is a member of the Apache Arrow Project Management Committee (PMC) and part of the team who maintains the arrow R package.
Jonathan Keane is an engineering manager with a background in software engineering and data science. Jonathan is a part of the team who maintains the Arrow project including the Arrow R package.
Neal Richardson is an engineering leader focused on building software that helps people work with data. He is a member of the Arrow PMC and one of the top contributors to the project.
"About this title" may belong to another edition of this title.
Seller: GreatBookPrices, Columbia, MD, U.S.A.
Condition: As New. Unread book in perfect condition. Seller Inventory # 48410432
Seller: GreatBookPrices, Columbia, MD, U.S.A.
Condition: New. Seller Inventory # 48410432-n
Seller: Grand Eagle Retail, Bensenville, IL, U.S.A.
Paperback. Condition: new. Paperback. Analyze large datasets directly from R. Scaling Up With R and Arrow provides a guide to working efficiently with larger-than-memory datasets using the arrow R package. As data grows in size and complexity, traditional data analysis methods in R often hit technical limitations. In this book, you'll learn how to overcome these hurdles without needing to set up complex infrastructure.You'll learn about the Apache Arrow project's origins, goals, and its significance in bridging the gap between data science and big data ecosystems. You'll also learn how to leverage the arrow R package to work directly with files in various formats, such as CSV and Parquet, using familiar dplyr syntax. This book explores practical topics like data manipulation, file formats, working with larger datasets, and optimizing workflows for data in cloud storage. Advanced chapters examine user-defined functions, integration with other tools like DuckDB, and extending Arrow's capabilities to work with geospatial data.Written by developers of the Arrow R package, this guide is essential for anyone looking to scale their data processing capabilities in R. This book provides a guide to working efficiently with larger-than-memory datasets using the arrow R package. You'll learn how to overcome these hurdles without needing to set up complex infrastructure. Written by developers of the Arrow R package, this guide is essential for anyone looking to scale their data processing capabilities in R. This item is printed on demand. Shipping may be from multiple locations in the US or from the UK, depending on stock availability. Seller Inventory # 9781032660288
Seller: PBShop.store US, Wood Dale, IL, U.S.A.
PAP. Condition: New. New Book. Shipped from UK. THIS BOOK IS PRINTED ON DEMAND. Established seller since 2000. Seller Inventory # L0-9781032660288
Seller: PBShop.store UK, Fairford, GLOS, United Kingdom
PAP. Condition: New. New Book. Delivered from our UK warehouse in 4 to 14 business days. THIS BOOK IS PRINTED ON DEMAND. Established seller since 2000. Seller Inventory # L0-9781032660288
Quantity: Over 20 available
Seller: GreatBookPricesUK, Woodford Green, United Kingdom
Condition: New. Seller Inventory # 48410432-n
Quantity: 3 available
Seller: GreatBookPricesUK, Woodford Green, United Kingdom
Condition: As New. Unread book in perfect condition. Seller Inventory # 48410432
Quantity: 3 available
Seller: THE SAINT BOOKSTORE, Southport, United Kingdom
Paperback / softback. Condition: New. New copy - Usually dispatched within 4 working days. Seller Inventory # B9781032660288
Quantity: 1 available
Seller: Ria Christie Collections, Uxbridge, United Kingdom
Condition: New. In. Seller Inventory # ria9781032660288_new
Quantity: Over 20 available
Seller: BuchWeltWeit Ludwig Meier e.K., Bergisch Gladbach, Germany
Taschenbuch. Condition: Neu. This item is printed on demand - it takes 3-4 days longer - Neuware -Analyze large datasets directly from R. Scaling Up With R and Arrow provides a guide to working efficiently with larger-than-memory datasets using the arrow R package. As data grows in size and complexity, traditional data analysis methods in R often hit technical limitations. In this book, you'll learn how to overcome these hurdles without needing to set up complex infrastructure.You'll learn about the Apache Arrow project's origins, goals, and its significance in bridging the gap between data science and big data ecosystems. You'll also learn how to leverage the arrow R package to work directly with files in various formats, such as CSV and Parquet, using familiar dplyr syntax. This book explores practical topics like data manipulation, file formats, working with larger datasets, and optimizing workflows for data in cloud storage. Advanced chapters examine user-defined functions, integration with other tools like DuckDB, and extending Arrow's capabilities to work with geospatial data.Written by developers of the Arrow R package, this guide is essential for anyone looking to scale their data processing capabilities in R. 162 pp. Englisch. Seller Inventory # 9781032660288