Bad data handbook /

What is bad data? Some people consider it a technical phenomenon, like missing values or malformed records, but bad data includes a lot more. In this handbook, data expert Q. Ethan McCallum has gathered 19 colleagues from every corner of the data arena to reveal how they've recovered from nasty...

Full description

Saved in:
Bibliographic Details
Other Authors: McCallum, Q. Ethan
Format: Electronic eBook
Language:English
Published: Sebastopol, CA : O'Reilly Media, ©2012.
Subjects:
Online Access:CONNECT

MARC

LEADER 00000cam a2200000Ia 4500
001 in00006055096
006 m o d
007 cr unu||||||||
008 130124s2012 caua obf 001 0 eng d
005 20220714134222.0
035 |a 1WRLDSHRocn825071196 
040 |a UMI  |b eng  |e pn  |c UMI  |d TEFOD  |d N$T  |d COO  |d WAU  |d YDXCP  |d DEBSZ  |d OCLCO  |d AU@  |d TEFOD  |d OCLCQ  |d OCLCF  |d OCLCQ  |d FEM  |d NRC  |d OCLCQ  |d HCO  |d BRL  |d CEF  |d UAB  |d OCLCO 
019 |a 827175029  |a 968037772  |a 969069540 
020 |a 9781449324988  |q (electronic bk.) 
020 |a 1449324983  |q (electronic bk.) 
020 |a 9781449324971  |q (electronic bk.) 
020 |a 1449324975  |q (electronic bk.) 
020 |a 9781449324957 
020 |a 1449324959 
020 |z 9781449321888 
020 |z 1449321887 
035 |a (OCoLC)825071196  |z (OCoLC)827175029  |z (OCoLC)968037772  |z (OCoLC)969069540 
037 |a CL0500000186  |b Safari Books Online 
037 |a 9BF1536D-FE07-4C0E-A542-531E9B9C1930  |b OverDrive, Inc.  |n http://www.overdrive.com 
050 4 |a QA76.9.D3 
082 0 4 |a 005.74  |2 23 
049 |a TXMM 
245 0 0 |a Bad data handbook /  |c [edited by] Q. Ethan McCallum. 
260 |a Sebastopol, CA :  |b O'Reilly Media,  |c ©2012. 
300 |a 1 online resource (xvi, 245 pages) :  |b illustrations 
336 |a text  |b txt  |2 rdacontent 
337 |a computer  |b c  |2 rdamedia 
338 |a online resource  |b cr  |2 rdacarrier 
347 |a text file  |2 rda 
588 0 |a Print version record. 
504 |a Includes bibliographical references and index. 
520 |a What is bad data? Some people consider it a technical phenomenon, like missing values or malformed records, but bad data includes a lot more. In this handbook, data expert Q. Ethan McCallum has gathered 19 colleagues from every corner of the data arena to reveal how they've recovered from nasty data problems. From cranky storage to poor representation to misguided policy, there are many paths to bad data. Bottom line? Bad data is data that gets in the way. This book explains effective ways to get around it. Among the many topics covered, you'll discover how to: Test drive your data to see if it's ready for analysis Work spreadsheet data into a usable form Handle encoding problems that lurk in text data Develop a successful web-scraping effort Use NLP tools to reveal the real sentiment of online reviews Address cloud computing issues that can impact your analysis effort Avoid policies that create data analysis roadblocks Take a systematic approach to data quality analysis. 
590 |a O'Reilly Online Learning Platform: Academic Edition (SAML SSO Access) 
650 0 |a Database management  |v Handbooks, manuals, etc. 
650 0 |a Electronic data processing  |v Handbooks, manuals, etc. 
655 7 |a Handbooks and manuals.  |2 fast  |0 (OCoLC)fst01423877 
700 1 |a McCallum, Q. Ethan. 
730 0 |a WORLDSHARE SUB RECORDS 
776 0 8 |i Print version:  |a McCallum, Q. Ethan.  |t Bad data handbook.  |d Beijing ; Sebastopol [Calif.] : O'Reilly, 2012, ©2013  |z 9781449321888  |w (OCoLC)794362453 
856 4 0 |u https://go.oreilly.com/middle-tennessee-state-university/library/view/-/9781449324957/?ar  |z CONNECT  |3 O'Reilly  |t 0 
949 |a ho0 
994 |a 92  |b TXM 
998 |a wi  |d z 
999 f f |s 8ee478e3-a315-4ccf-9e2c-7793551ddb1c  |i 53612971-05da-4127-8e54-98e9001f47de  |t 0 
952 f f |a Middle Tennessee State University  |b Main  |c James E. Walker Library  |d Electronic Resources  |t 0  |e QA76.9.D3   |h Library of Congress classification 
856 4 0 |3 O'Reilly  |t 0  |u https://go.oreilly.com/middle-tennessee-state-university/library/view/-/9781449324957/?ar  |z CONNECT