

Big Data: A Revolution That Will Transform How We Live, Work, and Think [Mayer-Schönberger, Viktor, Cukier, Kenneth] on desertcart.com. *FREE* shipping on qualifying offers. Big Data: A Revolution That Will Transform How We Live, Work, and Think Review: Great overview of what Big Data is doing today and what can be done in the future - Big Data is a topic that is all the rage but at the same time isnt well defined. Authors Viktor Mayer-Schönberger and Kenneth Cukier give an overview of what is being done with the massive amount of data that is being generated from online interaction coupled with advances in practical statistics on the analysis of this data. The authors go through examples of how big data is being used today to give a flavour of it and then follow up the rest of the book with what is going on in the field, how it is useful, where aspects of it are going and some of the concerns we should have about our privacy. The authors start by discussing how Google using its analysis of people's queries is more predictive about flu epidemics than medical experts have been. The human genome can be codified in a fraction of the time that was required when it was being decoded for the first time. They discuss how big data has enabled entrepreneurs to inform customers about the optimal time to buy flight tickets given that airlines vary their prices according to hidden methods that big data statistics has helped to make more sense of. The examples are a good starting point to start the discussion with the reader. The authors start by discussing how we have always been trying to come up with data about our populations, desires to do census analysis has been with us for a long time. We made progress through sampling techniques and statistics helped to enable data gathering about the population at large using smaller and less time consuming samples. The authors discuss how big data is messy, it is imprecise and is helpful for overviews but not for model building with respect to figuring out the mechanics of what is being observed. When you try to get all of the data about something there will inevitably be noise and looking for correlations can sometimes be the most fruitful way to use the data to figure out empirical relationships rather than search for underlying dynamics. The authors discuss datification which means the consolidation of data into a larger database that can then be used to give much more useful guidance to the population at large about phenomenon that required a look from above at all the data together. Matthew Maury is used to reinforce the usefulness of this approach, he was a naval officer who aggregated ships logs to help inform ship captains about most useful routes and more efficient transiting. The authors move on to the more concrete and start to discuss the value of big data. They give the obvious background on the value of traditional data and then give food for thought on how having data for everything can lead to new ideas and utility that was unimaginable in the past. Big data analytics will be required for document translation, smart device coordination, smart cities and social network analysis. The value in big data is of course, the data, but the utility of that data might be further midstream or downstream that others are better placed to harvest. The authors move on to discuss the data value chain and how to think about it. The authors discuss the implication of the big data revolution and how it is enabling consumers to get the best deals and how statisticians are a highly desirable skill set. The authors move on to the risks of big data which are numerous of course. Much discussed are the privacy of the data that is generated. The ownership of that data and the licensing of it are topics which will continue to surface and the legal framework to analyze disputes will need to be further developed. Misunderstanding correlation and causation will also be a risk in big data analytics and hypotheticals like the government quarantining those who search for flu on google are used as hyperbolized examples. The authors finally leave the reader with a view on the future. They use an example of how big data statistics was used to substantially improve the ability to find overcrowded illegal slum housing as a concrete example of how we can use data to enhance our cities and improve governance and efficiency. Big data is a subject which continues to step into more and more categories as our ability to measure continues to improve. How big data can be used will be a continued subject that both academics and practitioners will continue to be thought about and experimented on. It will give rise to a new consumer culture and potentially to new ways of organizing people and infrastructure. Big Data is an excellent readable overview of how data has always been used to guide policy, how big data is being used today, what the value chain of the data industry looks like, what the risks are of big data and how big data can enhance the future. Its easy to read and illuminating. Review: Interesting and Engaging, but flawed by repetition of unsupported assertions & wacky theories; lacks any "how-to" guidance - "Big Data: A Revolution..." was often engaging and included some interesting examples, but it was a disappointment. As others mention, the authors use repetition instead of evidence or proof, and ultimately I was not convinced by many of their claims. I encountered two huge issues in the text. First, the authors repeatedly argue that it's OK if Big Data contains "messy" data, because they assert that when "n=all" then the statistical rules about sampling don't apply. This argument fails two ways: first, if n=all but if the data contains "messy" (erroneous) data points in critical places, then it will be misleading and perhaps even completely wrong. Second, when using past data where "n=all" to project future events, then it's no longer true that "n=all." Instead, we have data for "n=all(where(time=past))" and we're using that data to try to predict events in a completely separate data set ("time=future"), and it's entirely possible that there are critical differences demarcated by "time=now." The second huge issue, for me, was the authors' focus on the concept that Big Data brings with it a huge risk that we will use data to predict future behavior -- and that we will then use those predictions to punish people for acts they have not committed (e.g., the "Minority Report" problem). They distort this argument in two ways: first, by assuming that society would actually do this, and second, by asserting that any action taken based on these predictions (such as increasing scrutiny or assigning social workers to visit at-risk juveniles) is "punishment." I was also skeptical of the authors' general reverence of, and deference to, data scientists as professionals and experts. The author believe that it's plausible to expect a new profession of internal and external "algorithmists" to arise, to protect consumers' privacy interests and society's interests against the potential abuses by Big Data users. The book also failed to provide real-world "how-to" examples, instead providing only "end result" examples and conclusions that often seem incomplete and sometimes implausible. Their many useful examples of useful information extracted from Big Data all doubtless represent the end-point of many, many explorations of Big Data; they probably also represent a subset of correlations derived, after many misleading correlations were removed. Finally, note that the book's lengthy end notes, bibliography, and index represent a full one-third of the book's length. There's a lot of useful information in this book, especially for someone just trying to learn about the concept of Big Data. But there's also a lot of hype, and a lot of repetition of ideas without meaningful factual support.
| Best Sellers Rank | #1,121,658 in Books ( See Top 100 in Books ) #206 in Data Mining (Books) #259 in Business Statistics #636 in Statistics (Books) |
| Customer Reviews | 4.2 4.2 out of 5 stars (1,445) |
| Dimensions | 5.31 x 0.77 x 8 inches |
| Edition | Reprint |
| ISBN-10 | 0544227751 |
| ISBN-13 | 978-0544227750 |
| Item Weight | 2.31 pounds |
| Language | English |
| Print length | 272 pages |
| Publication date | March 4, 2014 |
| Publisher | Harper Business |
A**N
Great overview of what Big Data is doing today and what can be done in the future
Big Data is a topic that is all the rage but at the same time isnt well defined. Authors Viktor Mayer-Schönberger and Kenneth Cukier give an overview of what is being done with the massive amount of data that is being generated from online interaction coupled with advances in practical statistics on the analysis of this data. The authors go through examples of how big data is being used today to give a flavour of it and then follow up the rest of the book with what is going on in the field, how it is useful, where aspects of it are going and some of the concerns we should have about our privacy. The authors start by discussing how Google using its analysis of people's queries is more predictive about flu epidemics than medical experts have been. The human genome can be codified in a fraction of the time that was required when it was being decoded for the first time. They discuss how big data has enabled entrepreneurs to inform customers about the optimal time to buy flight tickets given that airlines vary their prices according to hidden methods that big data statistics has helped to make more sense of. The examples are a good starting point to start the discussion with the reader. The authors start by discussing how we have always been trying to come up with data about our populations, desires to do census analysis has been with us for a long time. We made progress through sampling techniques and statistics helped to enable data gathering about the population at large using smaller and less time consuming samples. The authors discuss how big data is messy, it is imprecise and is helpful for overviews but not for model building with respect to figuring out the mechanics of what is being observed. When you try to get all of the data about something there will inevitably be noise and looking for correlations can sometimes be the most fruitful way to use the data to figure out empirical relationships rather than search for underlying dynamics. The authors discuss datification which means the consolidation of data into a larger database that can then be used to give much more useful guidance to the population at large about phenomenon that required a look from above at all the data together. Matthew Maury is used to reinforce the usefulness of this approach, he was a naval officer who aggregated ships logs to help inform ship captains about most useful routes and more efficient transiting. The authors move on to the more concrete and start to discuss the value of big data. They give the obvious background on the value of traditional data and then give food for thought on how having data for everything can lead to new ideas and utility that was unimaginable in the past. Big data analytics will be required for document translation, smart device coordination, smart cities and social network analysis. The value in big data is of course, the data, but the utility of that data might be further midstream or downstream that others are better placed to harvest. The authors move on to discuss the data value chain and how to think about it. The authors discuss the implication of the big data revolution and how it is enabling consumers to get the best deals and how statisticians are a highly desirable skill set. The authors move on to the risks of big data which are numerous of course. Much discussed are the privacy of the data that is generated. The ownership of that data and the licensing of it are topics which will continue to surface and the legal framework to analyze disputes will need to be further developed. Misunderstanding correlation and causation will also be a risk in big data analytics and hypotheticals like the government quarantining those who search for flu on google are used as hyperbolized examples. The authors finally leave the reader with a view on the future. They use an example of how big data statistics was used to substantially improve the ability to find overcrowded illegal slum housing as a concrete example of how we can use data to enhance our cities and improve governance and efficiency. Big data is a subject which continues to step into more and more categories as our ability to measure continues to improve. How big data can be used will be a continued subject that both academics and practitioners will continue to be thought about and experimented on. It will give rise to a new consumer culture and potentially to new ways of organizing people and infrastructure. Big Data is an excellent readable overview of how data has always been used to guide policy, how big data is being used today, what the value chain of the data industry looks like, what the risks are of big data and how big data can enhance the future. Its easy to read and illuminating.
M**H
Interesting and Engaging, but flawed by repetition of unsupported assertions & wacky theories; lacks any "how-to" guidance
"Big Data: A Revolution..." was often engaging and included some interesting examples, but it was a disappointment. As others mention, the authors use repetition instead of evidence or proof, and ultimately I was not convinced by many of their claims. I encountered two huge issues in the text. First, the authors repeatedly argue that it's OK if Big Data contains "messy" data, because they assert that when "n=all" then the statistical rules about sampling don't apply. This argument fails two ways: first, if n=all but if the data contains "messy" (erroneous) data points in critical places, then it will be misleading and perhaps even completely wrong. Second, when using past data where "n=all" to project future events, then it's no longer true that "n=all." Instead, we have data for "n=all(where(time=past))" and we're using that data to try to predict events in a completely separate data set ("time=future"), and it's entirely possible that there are critical differences demarcated by "time=now." The second huge issue, for me, was the authors' focus on the concept that Big Data brings with it a huge risk that we will use data to predict future behavior -- and that we will then use those predictions to punish people for acts they have not committed (e.g., the "Minority Report" problem). They distort this argument in two ways: first, by assuming that society would actually do this, and second, by asserting that any action taken based on these predictions (such as increasing scrutiny or assigning social workers to visit at-risk juveniles) is "punishment." I was also skeptical of the authors' general reverence of, and deference to, data scientists as professionals and experts. The author believe that it's plausible to expect a new profession of internal and external "algorithmists" to arise, to protect consumers' privacy interests and society's interests against the potential abuses by Big Data users. The book also failed to provide real-world "how-to" examples, instead providing only "end result" examples and conclusions that often seem incomplete and sometimes implausible. Their many useful examples of useful information extracted from Big Data all doubtless represent the end-point of many, many explorations of Big Data; they probably also represent a subset of correlations derived, after many misleading correlations were removed. Finally, note that the book's lengthy end notes, bibliography, and index represent a full one-third of the book's length. There's a lot of useful information in this book, especially for someone just trying to learn about the concept of Big Data. But there's also a lot of hype, and a lot of repetition of ideas without meaningful factual support.
M**D
Very interesting book. It is one of the books that I would recommend to be used as reference book, as it contains lots of examples and quotations about individuals, who woke up to the reality of the Big data and how it could be utilised for the good and, perhaps, the more challenging way of profiling innocent people according to their names, culture, religion, political thoughts etc. I would also recommend this book to the anyone interested in studying or curious about "the concept machine learning and what role the big data can play." Sometimes, you may wonder how Cortana finds out when it is the time to leave for work or home; or it predicts how the traffic would be, while you are on your way to work or home. If you do wonder about this, then you must read this book. Author's acknowledgement of the role of "algorithmists" in Big data is also plausible. Imagine the day the nutters become part of the law society. I think this would inject honesty into the "how most lawyers handle cases that they are working on." You can skip this paragraph: If you ever wondered how Neural Network proponents will ever succeed to teach a basic Times table to algorithm that requires two input numbers, like 8 time 7, then after reading this book, you will note Big Data will may help. Note that when we are young and attending elementary schools, most of us learn the Times table by memorising. As we grow, we simply identify a strategy where we, for example, think 7 Times table goes up by 7 and 8 Times table goes up by 8. Hence, no need to memories. In this instance, Big Data can be used to bridge the gap between the Neural Network and those, like me, who very much believe that we should focusing on mimicking how our neocortext works and complement it with Algorithms that make our machines perform better than our neocortext. In this paradigm, the Big Data will be used for playing the role of the memory and experience, while still we will be able to create strategies that can be serialised into and de-serialised from the Big Data repository. The author does also go on about privacy and the challenges Big Data faces. I think the question to ask is: if we accepted to use the cloud, have we not sleepwalked into sharing our data with those, who are there to analyse data? Is it the machine that should only have an access to our private data; or also those, who own this smart machines? Would the combination of Big Data and Intelligent machines bring about the creation of all-knowing being that cannot only know our past, but can also predict our future activities. And imagine what impact this would have on currency/stock traders? Do not even think politics here, as this will get more scarier. If you have ever watched the Movie "Her" and reasoned with the poor man, who fallen for OS that knows him very well, then think about the consequence of intelligent machines, powered by Big Data! And this is another reason to read this book. However, we should never fear exploring what we are capable of doing for the good of this world and its inhabitants; but should also be prepare to ensure that the all-knowing thing, which we are in the process of creating, is not one dictator, but one that lives and functions within democratic system.
G**O
Excellente présentation du sujet - de quoi s'agit-il, avantages, risques, ... J'ai été véritablement stupéfait de lire comment les données étaient analysées aujourd'hui et l'impact que cela pouvait avoir sur le monde. Pas sûr que beaucoup de monde réalise ce qui se passe, à son insu; il m'a permis de comprendre beaucoup de choses que je constate au quotidien, bien pratiques, mais dont j'ignorais le fonctionnement (il ne rentre pas dans les détails techniques mais donne quelques info). En tout cas ça m'a réveillé et rendu encore plus méfiant, ou plutôt conscient, vis-à-vis d'internet, même si tout n'est pas noir, loin s'en faut et même si le domaine des big data va bien au-delà de l'internet. Le point de vue est business et sociétal, pas du tout technique, mais ça m'a donné fort envie d'en savoir beaucoup plus, aussi point de vue technique. Je le recommande sans hésiter. Pas besoin d'être expert pour le lire.
C**N
Consegna rapida. Contenuti del libro: buoni, forse un pò generici, ma è pur sempre un buon inizio per iniziare a capire i BigData.
J**L
Well written introduction to an emerging discipline that will affect every aspect of humanity's future. The book treats the reader as intelligent but does not require prior knowledge in any aspect of statistics, computer science, or philosophy. Its many examples make the book's points crystal clear. This is one of those few books that explain how decisions will be made in the future. At the same time, it shows humanity's role in creativity and free will that may escape the casual, uncritical reader otherwise. The author is a master of the discipline and blessed with wisdom that goes beyond the promise of the book's title.
J**M
Un libro que ayuda a comprender el transito a la era digital, muy bien explicado, con muchos ejemplos, accesible pero con u. Contenido muy interesante.
Trustpilot
2 weeks ago
2 days ago