{"product_id":"predictive-coding-gurus-guide-rajiv-maheshwari-9780989385008","title":"Predictive Coding Guru's Guide: Technology, Statistics, and Workflows","description":"Predictive Coding is the process of training supervised machine-learning algorithms on pre-coded example documents, and then using the trained machine to automatically predict the coding of new documents collected in the legal eDiscovery process. While supervised machine-learning has been used for over 15 years in several applications (such as detecting spam in emails, disease in patients, human faces in pictures, likely customers from a marketing database, etc.), its adoption in eDiscovery has been slow. The key reasons include insufficient understanding of the technology (often perceived as a black box), improper use of statistics (often doubted for its applicability to natural language documents), and confusion around workflows currently used in the industry (often resulting in dissatisfactory results). This book intends to challenge the status quo with: \u003cul\u003e \u003cli\u003eEasy to understand explanation of fundamental concepts of predictive coding technology including the vector space model, feature selection, and general framework used by most predictive coding algorithms. \u003c\/li\u003e \u003cli\u003eDetailed explanation of the three most common algorithms used for predictive coding - k Nearest Neighbors (kNN), Support Vector Machines (SVM), and Latent Semantic Analysis (LSA) - in plain English. \u003c\/li\u003e \u003cli\u003eWalk-through of core concepts essential for applying and interpreting statistics correctly. Practical guidance on avoiding common errors and pitfalls. \u003c\/li\u003e \u003cli\u003eLucid step-by-step explanation of the two general workflow approaches - Technology Assisted Review and Technology Suggested Review - currently used in the industry in several derivative forms. \u003c\/li\u003e \u003cli\u003eA new workflow - the Greedy Workflow - that delivers better results, and offers flexibility and other properties useful in the context of eDiscovery. \u003c\/li\u003e \u003c\/ul\u003e In addition, the book contains detailed results of testing the greedy workflow on two real eDiscovery datasets. The first dataset contained 216,594 documents excluding Excel files. The greedy workflow predictively coded 76% of the documents with 100% precision and 87% recall. The second dataset contained 93,982 Excel files only. The greedy workflow predictively coded 50% of the Excel files with 100% precision and 76% recall.\u003cbr\u003e\u003cbr\u003e\u003cb\u003eAuthor:\u003c\/b\u003e Rajiv Maheshwari\u003cbr\u003e\u003cb\u003eISBN-10:\u003c\/b\u003e 0989385000\u003cbr\u003e\u003cb\u003eISBN-13:\u003c\/b\u003e 9780989385008\u003cbr\u003e\u003cb\u003ePublisher:\u003c\/b\u003e Rajiv Maheshwari\u003cbr\u003e\u003cb\u003eLanguage:\u003c\/b\u003e English\u003cbr\u003e\u003cb\u003ePublished:\u003c\/b\u003e 05\/04\/2013\u003cbr\u003e\u003cb\u003ePages:\u003c\/b\u003e 144\u003cbr\u003e\u003cb\u003eFormat:\u003c\/b\u003e Paperback\u003cbr\u003e\u003cb\u003eWeight:\u003c\/b\u003e 0.44lbs\u003cbr\u003e\u003cb\u003eSize:\u003c\/b\u003e 9.02h x 5.98w x 0.31d","brand":"Rajiv Maheshwari","offers":[{"title":"Paperback","offer_id":44120560238847,"sku":"9780989385008","price":23.5,"currency_code":"USD","in_stock":true}],"thumbnail_url":"\/\/cdn.shopify.com\/s\/files\/1\/0662\/2982\/9887\/files\/img_003b6433-d725-403f-8bb4-f347ab7abda8.jpg?v=1687414995","url":"https:\/\/www.whiterainbookhouse.com\/products\/predictive-coding-gurus-guide-rajiv-maheshwari-9780989385008","provider":"WR Book House","version":"1.0","type":"link"}