Bad Data Handbook

Cleaning Up The Data So You Can Get Back To Work

Nonfiction, Computers, Database Management
Cover of the book Bad Data Handbook by Q. Ethan McCallum, O'Reilly Media
View on Amazon View on AbeBooks View on Kobo View on B.Depository View on eBay View on Walmart
Author: Q. Ethan McCallum ISBN: 9781449324971
Publisher: O'Reilly Media Publication: November 7, 2012
Imprint: O'Reilly Media Language: English
Author: Q. Ethan McCallum
ISBN: 9781449324971
Publisher: O'Reilly Media
Publication: November 7, 2012
Imprint: O'Reilly Media
Language: English

What is bad data? Some people consider it a technical phenomenon, like missing values or malformed records, but bad data includes a lot more. In this handbook, data expert Q. Ethan McCallum has gathered 19 colleagues from every corner of the data arena to reveal how they’ve recovered from nasty data problems.

From cranky storage to poor representation to misguided policy, there are many paths to bad data. Bottom line? Bad data is data that gets in the way. This book explains effective ways to get around it.

Among the many topics covered, you’ll discover how to:

  • Test drive your data to see if it’s ready for analysis
  • Work spreadsheet data into a usable form
  • Handle encoding problems that lurk in text data
  • Develop a successful web-scraping effort
  • Use NLP tools to reveal the real sentiment of online reviews
  • Address cloud computing issues that can impact your analysis effort
  • Avoid policies that create data analysis roadblocks
  • Take a systematic approach to data quality analysis
View on Amazon View on AbeBooks View on Kobo View on B.Depository View on eBay View on Walmart

What is bad data? Some people consider it a technical phenomenon, like missing values or malformed records, but bad data includes a lot more. In this handbook, data expert Q. Ethan McCallum has gathered 19 colleagues from every corner of the data arena to reveal how they’ve recovered from nasty data problems.

From cranky storage to poor representation to misguided policy, there are many paths to bad data. Bottom line? Bad data is data that gets in the way. This book explains effective ways to get around it.

Among the many topics covered, you’ll discover how to:

More books from O'Reilly Media

Cover of the book Linux System Programming by Q. Ethan McCallum
Cover of the book Practical C++ Programming by Q. Ethan McCallum
Cover of the book Drupal for Designers by Q. Ethan McCallum
Cover of the book Beautiful Teams by Q. Ethan McCallum
Cover of the book Enterprise IoT by Q. Ethan McCallum
Cover of the book Intellectual Property and Open Source by Q. Ethan McCallum
Cover of the book Squid: The Definitive Guide by Q. Ethan McCallum
Cover of the book Developing Web Apps with Haskell and Yesod by Q. Ethan McCallum
Cover of the book sendmail Cookbook by Q. Ethan McCallum
Cover of the book Programming Voice Interfaces by Q. Ethan McCallum
Cover of the book Getting Started with Couchbase Server by Q. Ethan McCallum
Cover of the book YouTube: An Insider's Guide to Climbing the Charts by Q. Ethan McCallum
Cover of the book Packet Guide to Core Network Protocols by Q. Ethan McCallum
Cover of the book IPv6 Network Administration by Q. Ethan McCallum
Cover of the book Das Android-Smartphone-Buch by Q. Ethan McCallum
We use our own "cookies" and third party cookies to improve services and to see statistical information. By using this website, you agree to our Privacy Policy