Synopsis
Placing the reader in the position of a commercial data scientist, this book covers the key attributes to solve real-world problems in areas such as music, financial markets, and global news. Introducing advanced techniques in Spark, it also comprehensively explores the surrounding eco-system with innovative and scalable solutions throughout
About the Author
If you want to find out more about Andrew Morgan, check out https://www.linkedin.com/profile/view?id=AAkAAABerUUBxmKD_zCoA4XHLg6LO1XIWYuiEQ8&authType=NAME_SEARCH&authToken=hDZS&locale=en_US&trk=tyah&trkInfo=clickedVertical%3Amynetwork%2CclickedEntityId%3A6204741%2CauthType%3ANAME_SEARCH%2Cidx%3A1-7-7%2CtarId%3A1449813479657%2Ctas%3Aandrew. Andrew Morgan is a specialist in data strategy and it's execution, and has deep experience in the supporting data technologies, data architecture, and data science that brings data strategies to life. With over 20 years of hands-on experience in the data industry, mainly in architecture roles, he has worked for some of its most prestigious players and their global clients on very large data projects. In 2013, he founded ByteSumo, a uniquely positioned data science consultancy, and he is presently the acting Head of Data Science and Big Data Platform Architect for a big four audit client. He also moonlights as the chair of the Hadoop Summit EU data science selection committee. An active data scientist, he is the inventor of the TrendCalculus algorithm, and regularly participates in the Data Science and Big Data communities where he lives, in London. If you want to find out more about Antoine Amend, please see https://www.linkedin.com/profile/view?id=AAkAAAPblwkBg2ZEN5aDqAf6e1EjoCdOmX8r-CU&authType=NAME_SEARCH&authToken=WFGE&locale=en_US&trk=tyah&trkInfo=clickedVertical%3Amynetwork%2CclickedEntityId%3A64722697%2CauthType%3ANAME_SEARCH%2Cidx%3A1-1-1%2CtarId%3A1449813519748%2Ctas%3AAntoin. Antoine Amend is a data scientist who is passionate about big data engineering and scalable computing. The book's theme of torturing astronomical amounts of unstructured data to gain new insights mainly comes from his background in theoretical physics. He graduated in 2008 with a MSc in Astrophysics, and he worked for large consultancy business in Switzerland before discovering the concept of big data at the early stages of Hadoop. He embraced big data technologies ever since, and is now working as the VP Hadoop Technology Lead for a world leading financial institution based in London. By combining a scientific approach with core IT skills, Antoine qualified two years running for the Big Data World Championships finals held in Austin, TX. He was placed in the top 12 in 2014 (from over 2000+ competitors) and 4th in the 2015 competition, where he additionally won the Innovation Award using the methodologies and technologies explained in this book. If you want to find out more about Matthew Hallet, see: https://www.linkedin.com/profile/view?id=AAkAAApFYnIBigEOuYVz8WCxbvmv-j4PfhGCxKg&authType=NAME_SEARCH&authToken=vdN6&locale=en_US&trk=tyah&trkInfo=clickedVertical%3Amynetwork%2CclickedEntityId%3A172319346%2CauthType%3ANAME_SEARCH%2Cidx%3A1-1-1%2CtarId%3A1449813557799%2Ctas%3Amatthew halle. Matthew Hallett is a Software Engineer and Computer Scientist with over 15 years of industry experience. He is an expert object-oriented programmer and systems engineer with extensive knowledge of low-level programming paradigms. For the last 7 years, he has developed expertise in Hadoop and distributed programming within mission critical environments comprising multi-thousand-node data centers. With consultancy experience in distributed algorithms and the implementation of distributed computing architectures in a variety of languages, Matt is currently a Consultant Data Engineer in the Data Science and Engineering team at a top four audit firm. David George is a distinguished distributed computing expert with 15+ years of big data systems experience, mainly with globally recognized IT consultancies and brands. He has worked on core Hadoop technology implementations at the largest scale, 2000+ Hadoop nodes, and he engineers scalable applications and distributed algorithms for finance customers while working with the toughest re
"About this title" may belong to another edition of this title.