A revelatory exploration of the hottest trend in technology and the dramatic impact it will have on the economy, science, and society at large.Which paint color is most likely to tell you that a used car is in good shape? How can officials identify the most dangerous New York City manholes before they explode? And how did Google searches predict the spread of the H1N1 flu outbreak?The key to answering these questions, and many more, is big data. “Big data” refers to our burgeoning ability to crunch vast collections of information, analyze it instantly, and draw sometimes profoundly surprising conclusions from it. This emerging science can translate myriad phenomena—from the price of airline tickets to the text of millions of books—into searchable form, and uses our increasing computing power to unearth epiphanies that we never could have seen before. A revolution on par with the Internet or perhaps even the printing press, big data will change the way we think about business, health, politics, education, and innovation in the years to come. It also poses fresh threats, from the inevitable end of privacy as we know it to the prospect of being penalized for things we haven’t even done yet, based on big data’s ability to predict our future behavior.In this brilliantly clear, often surprising work, two leading experts explain what big data is, how it will change our lives, and what we can do to protect ourselves from its hazards. Big Data is the first big book about the next big thing.
Published by: Eamon Dolan/Houghton Mifflin Harcourt | Publication date: 03/05/2013Kindle book details: Kindle Edition, 257 pages
Foreword by Steven PinkerBlending the informed analysis of The Signal and the Noise with the instructive iconoclasm of Think Like a Freak, a fascinating, illuminating, and witty look at what the vast amounts of information now instantly available to us reveals about ourselves and our world—provided we ask the right questions.By the end of an average day in the early twenty-first century, human beings searching the internet will amass eight trillion gigabytes of data. This staggering amount of information—unprecedented in history—can tell us a great deal about who we are—the fears, desires, and behaviors that drive us, and the conscious and unconscious decisions we make. From the profound to the mundane, we can gain astonishing knowledge about the human psyche that less than twenty years ago, seemed unfathomable. Everybody Lies offers fascinating, surprising, and sometimes laugh-out-loud insights into everything from economics to ethics to sports to race to sex, gender and more, all drawn from the world of big data. What percentage of white voters didn’t vote for Barack Obama because he’s black? Does where you go to school effect how successful you are in life? Do parents secretly favor boy children over girls? Do violent films affect the crime rate? Can you beat the stock market? How regularly do we lie about our sex lives and who’s more self-conscious about sex, men or women? Investigating these questions and a host of others, Seth Stephens-Davidowitz offers revelations that can help us understand ourselves and our lives better. Drawing on studies and experiments on how we really live and think, he demonstrates in fascinating and often funny ways the extent to which all the world is indeed a lab. With conclusions ranging from strange-but-true to thought-provoking to disturbing, he explores the power of this digital truth serum and its deeper potential—revealing biases deeply embedded within us, information we can use to change our culture, and the questions we’re afraid to ask that might be essential to our health—both emotional and physical. All of us are touched by big data everyday, and its influence is multiplying. Everybody Lies challenges us to think differently about how we see it and the world.
Published by: Dey Street Books | Publication date: 05/09/2017Kindle book details: Kindle Edition, 357 pages
The world of business is changing at an ever accelerating rate. Businesses that choose not to adopt the latest technologies are destined to suffer the consequences of obsolescence. The best way for a small to medium sized business to improve profits and secure its future is through adopting data analytics and using big data to improve automation and solve optimization problems.Hidden within the data that we already collect for our businesses lies solutions to all types of questions. From understanding the best hours for your business to keep, to managing payroll, big data and data analytics can vastly improve the efficiency of any business. This is a concept has been well known to the world’s largest businesses, but for years has been gated behind the cost of expensive software solutions.Today there are more software solutions for small and medium size businesses than ever before. Whether you want to solve for optimization problems to better understand your business and customer base, or if you merely want to automate ordering items and paying your employees, big data and data analytics is the solution that you need. Continue reading and soon you will have the knowledge to secure your business for long into the future.
Publication date: 02/24/2017Kindle book details: Kindle Edition, 80 pages
Basic data science explained Explore the field of data science, and the way to analyze big and small data. This technical book goes over the main aspects of analyzing data correctly by using various strategies you need to implement in order to get results that are precise and beneficial. Learn about:Modeling data and visualization.The three V’s of big data and what to do with them.Software recommendations and applications.Machine algorithms and interesting side notes regarding them.Rules, infrastructure, adaptation, and other techniques.Perception and cognition basics that apply to data.Efficient uses of regression, database querying, machine learning, and data warehousing.Curious yet? Then don’t wait and start reading, so you don’t have to remain in the dark. Save yourself the time and learn from what worked for me. I will see you in the first chapter!
Publication date: 07/11/2017Kindle book details: Kindle Edition, 67 pages
This books fills the need for an easy and holistic book on essential Big Data technologies. Written in a lucid and simple language free from jargon and code, this book provides an intuition for Big Data from business as well as technological perspectives. This book is designed to provide the reader with the intuition behind this evolving area, along with a solid toolset of the major big data processing technologies such as Hadoop, MapReduce, Spark Streaming, and NoSql databases. A complete case study of developing a web log analyzer is included. The book also contains two primers on Cloud computing and Data Mining. It also contains two tutorials on installing Hadoop and Spark. The book contains caselets from real-world stories.Students across a variety of academic disciplines including business, computer science, statistics, engineering, and others attracted to the idea of harnessing Big Data for new insights and ideas from data, can use this as a textbook. Professionals in various domains, including executives, managers, analysts, professors, doctors, accountants, and others can use this book to learn in a few hours how to make the most of Big Data to monitor their infrastructure, discover new insights, and develop new data-based products. It is a flowing book that one can finish in one sitting, or one can return to it again and again for insights and techniques.Table of Contents1.Wholeness of Big Data2.Big Data Applications3.Big Data Architectures4.Distributed Systems with Hadoop5.Parallel Programming with MapReduce6.Advanced NoSQL databases7.Stream programming with Spark8.Data Ingest with Kafka9.Cloud Computing Primer10. Web Log Analyzer development 11.Data Mining Primer12.Appendix 1 on Installing Hadoop on AWS cloud13.Appendix 2 on Installing Spark
Publication date: 06/28/2016Kindle book details: Kindle Edition, 301 pages
Designing Data-Intensive Applications: The Big Ideas Behind Reliable, Scalable, and Maintainable Systems
Data is at the center of many challenges in system design today. Difficult issues need to be figured out, such as scalability, consistency, reliability, efficiency, and maintainability. In addition, we have an overwhelming variety of tools, including relational databases, NoSQL datastores, stream or batch processors, and message brokers. What are the right choices for your application? How do you make sense of all these buzzwords?In this practical and comprehensive guide, author Martin Kleppmann helps you navigate this diverse landscape by examining the pros and cons of various technologies for processing and storing data. Software keeps changing, but the fundamental principles remain the same. With this book, software engineers and architects will learn how to apply those ideas in practice, and how to make full use of data in modern applications.
- Peer under the hood of the systems you already use, and learn how to use and operate them more effectively
- Make informed decisions by identifying the strengths and weaknesses of different tools
- Navigate the trade-offs around consistency, scalability, fault tolerance, and complexity
- Understand the distributed systems research upon which modern databases are built
- Peek behind the scenes of major online services, and learn from their architectures
Published by: O'Reilly Media | Publication date: 03/16/2017Kindle book details: Kindle Edition, 614 pages
Longlisted for the National Book AwardNew York Times BestsellerA former Wall Street quant sounds an alarm on the mathematical models that pervade modern life — and threaten to rip apart our social fabricWe live in the age of the algorithm. Increasingly, the decisions that affect our lives—where we go to school, whether we get a car loan, how much we pay for health insurance—are being made not by humans, but by mathematical models. In theory, this should lead to greater fairness: Everyone is judged according to the same rules, and bias is eliminated.But as Cathy O’Neil reveals in this urgent and necessary book, the opposite is true. The models being used today are opaque, unregulated, and uncontestable, even when they’re wrong. Most troubling, they reinforce discrimination: If a poor student can’t get a loan because a lending model deems him too risky (by virtue of his zip code), he’s then cut off from the kind of education that could pull him out of poverty, and a vicious spiral ensues. Models are propping up the lucky and punishing the downtrodden, creating a “toxic cocktail for democracy.” Welcome to the dark side of Big Data.Tracing the arc of a person’s life, O’Neil exposes the black box models that shape our future, both as individuals and as a society. These “weapons of math destruction” score teachers and students, sort résumés, grant (or deny) loans, evaluate workers, target voters, set parole, and monitor our health.O’Neil calls on modelers to take more responsibility for their algorithms and on policy makers to regulate their use. But in the end, it’s up to us to become more savvy about the models that govern our lives. This important book empowers us to ask the tough questions, uncover the truth, and demand change.— Longlist for National Book Award (Non-Fiction)— Goodreads, semi-finalist for the 2016 Goodreads Choice Awards (Science and Technology)— Kirkus, Best Books of 2016— New York Times, 100 Notable Books of 2016 (Non-Fiction)— The Guardian, Best Books of 2016— WBUR's "On Point," Best Books of 2016: Staff Picks— Boston Globe, Best Books of 2016, Non-Fiction
Published by: Broadway Books | Publication date: 09/06/2016Kindle book details: Kindle Edition, 274 pages
The definitive reference book with real-world solutions you won't find anywhere else The Big Book of Dashboards presents a comprehensive reference for those tasked with building or overseeing the development of business dashboards. Comprising dozens of examples that address different industries and departments (healthcare, transportation, finance, human resources, marketing, customer service, sports, etc.) and different platforms (print, desktop, tablet, smartphone, and conference room display) The Big Book of Dashboards is the only book that matches great dashboards with real-world business scenarios. By organizing the book based on these scenarios and offering practical and effective visualization examples, The Big Book of Dashboards will be the trusted resource that you open when you need to build an effective business dashboard. In addition to the scenarios there's an entire section of the book that is devoted to addressing many practical and psychological factors you will encounter in your work. It's great to have theory and evidenced-based research at your disposal, but what will you do when somebody asks you to make your dashboard 'cooler' by adding packed bubbles and donut charts? The expert authors have a combined 30-plus years of hands-on experience helping people in hundreds of organizations build effective visualizations. They have fought many 'best practices' battles and having endured bring an uncommon empathy to help you, the reader of this book, survive and thrive in the data visualization world. A well-designed dashboard can point out risks, opportunities, and more; but common challenges and misconceptions can make your dashboard useless at best, and misleading at worst. The Big Book of Dashboards gives you the tools, guidance, and models you need to produce great dashboards that inform, enlighten, and engage.
Published by: Wiley | Publication date: 04/24/2017Kindle book details: Kindle Edition, 448 pages
Find the right big data solution for your business or organization Big data management is one of the major challenges facing business, industry, and not-for-profit organizations. Data sets such as customer transactions for a mega-retailer, weather patterns monitored by meteorologists, or social network activity can quickly outpace the capacity of traditional data management tools. If you need to develop or manage big data solutions, you'll appreciate how these four experts define, explain, and guide you through this new and often confusing concept. You'll learn what it is, why it matters, and how to choose and implement solutions that work.
- Effectively managing big data is an issue of growing importance to businesses, not-for-profit organizations, government, and IT professionals
- Authors are experts in information management, big data, and a variety of solutions
- Explains big data in detail and discusses how to select and implement a solution, security concerns to consider, data storage and presentation issues, analytics, and much more
- Provides essential information in a no-nonsense, easy-to-understand style that is empowering
Published by: For Dummies | Publication date: 04/02/2013Kindle book details: Kindle Edition, 336 pages
Written by renowned data science experts Foster Provost and Tom Fawcett, Data Science for Business introduces the fundamental principles of data science, and walks you through the "data-analytic thinking" necessary for extracting useful knowledge and business value from the data you collect. This guide also helps you understand the many data-mining techniques in use today.Based on an MBA course Provost has taught at New York University over the past ten years, Data Science for Business provides examples of real-world business problems to illustrate these principles. You’ll not only learn how to improve communication between business stakeholders and data scientists, but also how participate intelligently in your company’s data science projects. You’ll also discover how to think data-analytically, and fully appreciate how data science methods can support business decision-making.
- Understand how data science fits in your organization—and how you can use it for competitive advantage
- Treat data as a business asset that requires careful investment if you’re to gain real value
- Approach business problems data-analytically, using the data-mining process to gather good data in the most appropriate way
- Learn general concepts for actually extracting knowledge from data
- Apply data science principles when interviewing data science job candidates
Published by: O'Reilly Media | Publication date: 07/27/2013Kindle book details: Kindle Edition, 414 pages