Building and testing machine learning models requires access to large and diverse data. But where can you find usable datasets without running into privacy issues? This practical book introduces techniques for generating synthetic data—fake data generated from real data—so you can perform secondary analysis to do research, understand customer behaviors, develop new products, or generate new revenue.
Data scientists will learn how synthetic data generation provides a way to make such data broadly available for secondary purposes while addressing many privacy concerns. Analysts will learn the principles and steps for generating synthetic data from real datasets. And business leaders will see how synthetic data can help accelerate time to a product or solution.
This book describes:
"synopsis" may belong to another edition of this title.
Dr. Khaled El Emam is a senior scientist at the Children's Hospital of Eastern Ontario (CHEO) Research Institute and Director of the multi-disciplinary Electronic Health Information Laboratory. Lucy Mosquera has a bachelor's degree in Biology and Mathematics from Queen's University and is a current graduate student in the department of statistics at the University of British Columbia. During her time at Queen's, Lucy provided data management support on a dozen clinical trials and observational studies run through Kingston General Hospital's Clinical Evaluation Research Unit. Lucy has also worked on clinical trial data sharing methods based on homomorphic encryption and secret sharing protocols. At Replica Analytics, Lucy is responsible for developing statistical and machine learning models for data generation, and integrating subject area expertise in clinical trial data into synthetic data generation methods, as well as the statistical assessments of our synthetic data generation. Dr. Richard Hoptroff is a long term technology inventor, investor and entrepreneur.
"About this title" may belong to another edition of this title.
Seller: Bay State Book Company, North Smithfield, RI, U.S.A.
Condition: very_good. Seller Inventory # BSM.134TC
Seller: Goodbooks Company, Springdale, AR, U.S.A.
Condition: good. Book has corner edge dings and or scratches and signs of light wear. Seller Inventory # GBV.1492072745.G
Seller: Lakeside Books, Benton Harbor, MI, U.S.A.
Condition: New. Brand New! Not Overstocks or Low Quality Book Club Editions! Direct From the Publisher! We're not a giant, faceless warehouse organization! We're a small town bookstore that loves books and loves it's customers! Buy from Lakeside Books! Seller Inventory # OTF-S-9781492072744
Seller: GreatBookPrices, Columbia, MD, U.S.A.
Condition: New. Seller Inventory # 39708295-n
Seller: Rarewaves USA, OSWEGO, IL, U.S.A.
Paperback. Condition: New. Building and testing machine learning models requires access to large and diverse data. But where can you find usable datasets without running into privacy issues? This practical book introduces techniques for generating synthetic data-fake data generated from real data-so you can perform secondary analysis to do research, understand customer behaviors, develop new products, or generate new revenueData scientists will learn how synthetic data generation provides a way to make such data broadly available for secondary purposes while addressing many privacy concerns. Analysts will learn the principles and steps for generating synthetic data from real datasets. And business leaders will see how synthetic data can help accelerate time to a product or solution. This book describes:Steps for generating synthetic data using multivariate normal distributionsMethods for distribution fitting covering different goodness-of-fit metrics How to replicate the simple structure of original data An approach for modeling data structure to consider complex relationshipsMultiple approaches and metrics you can use to assess data utilityHow analysis performed on real data can be replicated with synthetic dataPrivacy implications of synthetic data and methods to assess identity disclosure. Seller Inventory # LU-9781492072744
Seller: PBShop.store US, Wood Dale, IL, U.S.A.
PAP. Condition: New. New Book. Shipped from UK. Established seller since 2000. Seller Inventory # WO-9781492072744
Seller: BargainBookStores, Grand Rapids, MI, U.S.A.
Paperback or Softback. Condition: New. Practical Synthetic Data Generation: Balancing Privacy and the Broad Availability of Data. Book. Seller Inventory # BBS-9781492072744
Seller: GreatBookPrices, Columbia, MD, U.S.A.
Condition: As New. Unread book in perfect condition. Seller Inventory # 39708295
Seller: PBShop.store UK, Fairford, GLOS, United Kingdom
PAP. Condition: New. New Book. Shipped from UK. Established seller since 2000. Seller Inventory # WO-9781492072744
Quantity: 2 available
Seller: Rarewaves.com USA, London, LONDO, United Kingdom
Paperback. Condition: New. Building and testing machine learning models requires access to large and diverse data. But where can you find usable datasets without running into privacy issues? This practical book introduces techniques for generating synthetic data-fake data generated from real data-so you can perform secondary analysis to do research, understand customer behaviors, develop new products, or generate new revenueData scientists will learn how synthetic data generation provides a way to make such data broadly available for secondary purposes while addressing many privacy concerns. Analysts will learn the principles and steps for generating synthetic data from real datasets. And business leaders will see how synthetic data can help accelerate time to a product or solution. This book describes:Steps for generating synthetic data using multivariate normal distributionsMethods for distribution fitting covering different goodness-of-fit metrics How to replicate the simple structure of original data An approach for modeling data structure to consider complex relationshipsMultiple approaches and metrics you can use to assess data utilityHow analysis performed on real data can be replicated with synthetic dataPrivacy implications of synthetic data and methods to assess identity disclosure. Seller Inventory # LU-9781492072744
Quantity: 1 available