100 Best Big Data Books of All Time
We've researched and ranked the best big data books in the world, based on recommendations from world experts, sales data, and millions of reader ratings. Learn more
Proven approaches such as service-oriented and event-driven architectures are joined by newer techniques such as microservices, reactive architectures, DevOps, and stream processing. Many of these patterns are successful by themselves, but as this practical ebook demonstrates, they provide a more holistic and compelling approach when applied together.
Author Ben Stopford explains how service-based... more
Blending the informed analysis of The Signal and the Noise with the instructive iconoclasm of Think Like a Freak, a fascinating, illuminating, and witty look at what the vast amounts of information now instantly available to us reveals about ourselves and our world—provided we ask the right questions.
By the end of an average day in the early twenty-first century, human beings searching the internet will amass eight trillion gigabytes of data. This staggering amount of information—unprecedented in history—can tell us a great deal about who we... more
Jj. Omojuwa@SympLySimi Lol. Read this book. You’d love it. https://t.co/d2cLOyoiZ9 (Source)
Ron FournierJust finished, “Everybody Lies” by @SethS_D, which in addition to being a tremendous education on Big Data, includes the best conclusion to a non-fiction book I’ve ever read. Read it. -30- (Source)
New York Times Bestseller
A former Wall Street quant sounds an alarm on the mathematical models that pervade modern life -- and threaten to rip apart our social fabric
We live in the age of the algorithm. Increasingly, the decisions that affect our lives--where we go to school, whether we get a car loan, how much we pay for health insurance--are being made not by humans, but by mathematical models. In theory, this should lead to greater fairness: Everyone is judged according to the same rules, and bias is... more
Paula BoddingtonHow the use of algorithms has affected people’s lives and occasionally ruined them. (Source)
Ramesh SrinivasanThis book is a really fantastic analysis of how quantification, the collection of data, the modelling around data, the predictions made by using data, the algorithmic and quantifiable ways of predicting behaviour based on data, are all built by elites for elites and end up, quite frankly, screwing over everybody else. (Source)
His answer is that we pay too much attention to what successful people are like, and too little attention to where they are from: that is, their culture, their family, their generation, and the idiosyncratic experiences of their upbringing. Along the way he explains the secrets of software billionaires, what it takes to be a great soccer player,... more
Bill Gates[On Bill Gates's reading list in 2011.] (Source)
James AltucherGladwell is not the first person to come up with the 10,000 hour rule. Nor is he the first person to document what it takes to become the best in the world at something. But his stories are so great as he explains these deep concepts. How did the Beatles become the best? Why are professional hockey players born in January, February and March? And so on. (Source)
Cat Williams-TreloarThe books that I've talked the most about with friends and colleagues over the years are the Malcolm Gladwell series of novels. Glorious stories that mix science, behaviours and insight. You can't go wrong with the "The Tipping Point", "Outliers", "Blink" or "David & Goliath". (Source)
Carol DweckYou would think that the relationship between training and skill would be utterly obvious in sports, but apparently it isn’t. (Source)
David PapineauIt’s a parable of the disinclination of people in general to base their practices on evidence, a parable for evidence-based policy in general. (Source)
Which paint color is most likely to tell you that a used car is in good shape? How can officials identify the most dangerous New York City manholes before they explode? And how did Google searches predict the spread of the H1N1 flu outbreak?
The key to answering these questions, and many more, is big data. “Big data” refers to our burgeoning ability to crunch vast collections of information, analyze it instantly, and draw... more
These may not sound like typical questions for an economist to ask. But Steven D. Levitt is not a typical economist. He is a much heralded scholar who studies the stuff and riddles of everyday life -- from cheating and crime to sports and child rearing -- and whose... more
Malcolm GladwellI don’t need to say much here. This book invented an entire genre. Economics was never supposed to be this entertaining. (Source)
Daymond JohnI love newer books like [this book]. (Source)
James Altucher[James Altucher recommended this book on the podcast "The Tim Ferriss Show".] (Source)
Based on an MBA course Provost has taught at New York University over the past ten years, Data Science for Business provides examples of real-world business problems to illustrate these principles. You’ll... more
Kirk BorneGreat book for Business Analytics and for building #AnalyticThinking >> “#DataScience for Business — What You Need to Know about #DataMining and Data-Analytic Thinking”: https://t.co/e9rAFnVYYQ #BigData #MachineLearning #DataStrategy #AnalyticsStrategy #Algorithms https://t.co/yEblfU2MZd (Source)
New York Times Bestseller
"Not so different in spirit from the way public intellectuals like John Kenneth Galbraith once shaped discussions of economic policy and public figures like Walter Cronkite helped sway opinion on the Vietnam War…could turn out to be one of the more momentous books of the decade."
-New York Times Book Review
"Nate Silver's The Signal and the Noise is The Soul of a New Machine for the 21st century."
-Rachel Maddow, author of Drift
"A serious... more
Bill GatesAnyone interested in politics may be attracted to Nate Silver’s The Signal and the Noise: Why So Many Predictions Fail—but Some Don't. Silver is the New York Times columnist who got a lot of attention last fall for predicting—accurately, as it turned out–the results of the U.S. presidential election. This book actually came out before the election, though, and it’s about predictions in many... (Source)
Don't have time to read the top Big Data books of all time? Read Shortform summaries.
Shortform summaries help you learn 10x faster by:
- Being comprehensive: you learn the most important points in the book
- Cutting out the fluff: you focus your time on what's important to know
- Interactive exercises: apply the book's ideas to your own life with our educators' guidance.
Big Data shows how to build these systems using an architecture that takes advantage of... more
Vicki BoykisThis book remains a great read if you want to understand how modern data architecture works, and especially distributed data systems. (Source)
An audacious, irreverent investigation of human behavior—and a first look at a revolution in the making
Our personal data has been used to spy on us, hire and fire us, and sell us stuff we don’t need. In Dataclysm, Christian Rudder uses it to show us who we truly are.
For centuries, we’ve relied on polling or small-scale lab experiments to study human behavior. Today, a new approach is possible. As we live more of our lives online, researchers can finally observe us directly, in vast numbers, and... more
Elad Yom-TovChristian Rudder was the chief scientist of a dating website, OK Cupid. (Source)
In this 3rd edition, Steve returns with fresh perspective to reexamine the principles that made Don’t Make Me Think a classic-–with updated examples and a new chapter on mobile usability. And it’s still short, profusely illustrated…and best of all–fun to read.
If you’ve read it before, you’ll rediscover what made Don’t Make Me Think so essential to Web... more
Chris GowardHere are some of the books that have been very impactful for me, or taught me a new way of thinking: [...] Don't Make Me Think. (Source)
Nicolae AndronicI’m a technical guy. I studied the IT field and did software development for a long time until I discovered the business world. So the path for me is to slowly adapt from the clear, technical world, to the fuzzy, way more complex, business world. All the books that I recommend help this transition. “Don’t Make Me Think” - Steve Krug: for seeing software with the eyes of the user. (Source)
Nick GanjuAbout usability and making software and user interfaces that are friendly to people. (Source)
Marius Ciuchete Pauneval(ez_write_tag([[250,250],'theceolibrary_com-large-mobile-banner-2','ezslot_5',164,'0','1'])); Question: Was there a moment, specifically, when something you read in a book helped you? Answer: Yes there was. In fact, I can remember two separate sentences from two different books: The first one comes from “The Design of Everyday Things” by Don Norman. It says: “great design will help... (Source)
Grey BakerI mainly read to decompress and change my state of mind, so it’s hard to point to an insight I read that helped me. Reading fiction has pulled me out of a bad mood more times than I can count, though, and always reenergises me to attack problems that had stumped me again. That said, I read and loved Norman Norman’s “The Design of Everyday Things”, and it’s helped me think through design problems... (Source)
Kaci LambeThese three books are about how people actually use design in their lives. They helped me understand this very basic idea: There are no dumb users, only bad designers. Take the time to create based on how your design will be interacted with. Test it. Iterate. That's how you become a good designer. (Source)
Storytelling is not an inherent skill, especially when it comes to data visualization, and the tools at our disposal... more
Roger D. PengIt’s important to think in terms of what your audience needs, and what would be best for them among the many choices you could make when analysing data. (Source)
You'll explore the basic operations and common functions of Spark's structured APIs, as well as Structured Streaming, a new high-level API for building end-to-end streaming applications. Developers and system administrators will learn the fundamentals of... more
An Economist Best Book of 2015
"The most important book on decision making since Daniel Kahneman's Thinking, Fast and Slow."
—Jason Zweig, The Wall Street Journal
Everyone would benefit from seeing further into the future, whether buying stocks, crafting policy, launching a new product, or simply planning the week’s meals. Unfortunately, people tend to be terrible forecasters. As Wharton professor Philip Tetlock showed in a landmark 2005 study, even... more
Sheil KapadiaRead the book Superforecasting, had a great conversation with @bcmassey and came up with seven ideas for how NFL teams can try to find small edges during the draft process. Would love to hear feedback on this one. https://t.co/PdN1fKCagl (Source)
Julia Galef[Has] some good advice on how to improve your ability to make accurate predictions. (Source)
In April 1956, a refitted oil tanker carried fifty-eight shipping containers from Newark to Houston. From that modest beginning, container shipping developed into a huge industry that made the boom in global trade possible. "The Box" tells the dramatic story of the container's creation, the decade of struggle before it was widely adopted, and the sweeping economic consequences of the sharp fall in transportation costs that containerization brought about.
Published on the fiftieth anniversary of the first container voyage, this is the first comprehensive history of the shipping... more
Bill GatesI picked this one up after seeing it on a Wall Street Journal list of good books for investors. It was first published in 1954, but it doesn’t feel dated (aside from a few anachronistic examples—it has been a long time since bread cost 5 cents a loaf in the United States). In fact, I’d say it’s more relevant than ever. One chapter shows you how visuals can be used to exaggerate trends and give... (Source)
Tobi LütkeWe all live in Malcolm’s world because the shipping container has been hugely influential in history. (Source)
Jason ZweigThis is a terrific introduction to critical thinking about statistics, for people who haven’t taken a class in statistics. (Source)
Complete with case studies that illustrate how Hadoop solves specific problems, this book helps you:
more
But how does one exactly do data science? Do you have to hire one of these priests of the dark arts, the "data scientist," to extract this gold from your data? Nope.
Data science is little more than using straight-forward steps to process raw data into... more
Don't have time to read the top Big Data books of all time? Read Shortform summaries.
Shortform summaries help you learn 10x faster by:
- Being comprehensive: you learn the most important points in the book
- Cutting out the fluff: you focus your time on what's important to know
- Interactive exercises: apply the book's ideas to your own life with our educators' guidance.
The human brain has some capabilities that the brains of other animals lack. It is to these distinctive capabilities that our species owes its dominant position. If machine brains surpassed human brains in general intelligence, then this new superintelligence could become extremely powerful--possibly beyond our control. As the fate of the... more
Maria RamosRamos will take the summer to examine some of the questions weighing more heavily on humankind as we contemplate our collective future: what happens when we can write our own genetic codes, and what happens when we create technology that is meaningfully more intelligent than us. The Gene: An Intimate History—Siddhartha Mukherjee Superintelligence: Paths, Dangers, Strategies—Nick Bostrom The... (Source)
Will MacAskillI picked this book because the possibility of us developing human-level artificial intelligence, and from there superintelligence—an artificial agent that is considerably more intelligent than we are—is at least a contender for the most important issue in the next two centuries. Bostrom’s book has been very influential in effective altruism, lots of people work on artificial intelligence in order... (Source)
Roger D. PengThis book is written by a powerhouse of authors in the machine learning community, true authorities in the field. But beyond that, they’re also great writers. (Source)
Yuval Noah HarariA superb and very timely survey of the impact of AI on the geopolitical system, the job market and human society. (Source)
Arianna HuffingtonKai-Fu Lee's experience as an AI pioneer, top investor, and cancer survivor has led to this brilliant book about global technology. AI Superpowers gives us a guide to a future that celebrates all the benefits that AI will bring, while cultivating what is unique about our humanity. It’s one of those books you read and think, ‘Why are people reading any other book right now when this is so clearly... (Source)
Satya NadellaKai-Fu Lee's smart analysis on human-AI coexistence is clear-eyed and a must-read. We must look deep within ourselves for the values and wisdom to guide AI's development. (Source)
Written by Wes McKinney, the main author of the pandas library, Python for Data Analysis also serves as a practical, modern introduction to scientific computing in Python for data-intensive... more
Using everyday objects and familiar language systems such as Braille and Morse code, author Charles Petzold weaves an illuminating narrative for anyone who’s ever wondered about the secret inner life of... more
This book shows you how to validate your initial idea, find the right customers, decide what to build, how to monetize your business, and how to spread the word. Packed with more than thirty case studies and insights from over a hundred business experts, Lean Analytics provides you with hard-won, real-world information no entrepreneur... more
Ola OlusogaLike Charlie Munger once said: “I’ve long believed that a certain system - which almost any intelligent person can learn - works way better than the systems most people use [to understand the world]. What you need is a latticework of mental models in your head. And, with that system, things gradually fit together in a way that enhances cognition. Just as multiple factors shape every system,... (Source)
If you have an aptitude for mathematics and some programming skills, author Joel Grus will help you get comfortable with the math and statistics at the core of data science, and with the hacking skills you need to get started as a data... more
Thorsten HellerThe Best #book to Start your #DataScience Journey - Towards #DataScience https://t.co/D8PlkkSxw6 by @benthecoder1 (Source)
Engineers from Confluent and LinkedIn who are responsible for developing Kafka explain how to deploy production Kafka clusters, write reliable event-driven microservices, and... more
In this masterwork of original thinking and research, Shoshana Zuboff provides startling insights into the phenomenon that she has named surveillance capitalism. The stakes could not be higher: a global architecture of behavior modification threatens human nature in the twenty-first century just as industrial capitalism disfigured the... more
Nicholas CarrWhatever its imperfections, The Age of Surveillance Capitalism is an original and often brilliant work, and it arrives at a crucial moment, when the public and its elected representatives are at last grappling with the extraordinary power of digital media and the companies that control it. Like another recent masterwork of economic analysis, Thomas Piketty’s 2013 Capital in the Twenty-First... (Source)
Naomi KleinFrom the very first page I was consumed with an overwhelming imperative: everyone needs to read this book as an act of digital self-defense. With tremendous lucidity and moral courage, Zuboff demonstrates not only how our minds are being mined for data but also how they are being rapidly and radically changed in the process. The hour is late and much has been lost already—but as we learn in these... (Source)
Clive Lewis MpCant make the brilliant event below? Havent had a chance to read @shoshanazuboff groundbreaking book, ‘Surveillance Capitalism’? Then listen to this brilliant interview with the author as she explains the terrifying scale&ambition of Facebook/Google et al https://t.co/DCtNlFbmE0 https://t.co/ZX0YpW5pOo (Source)
Don't have time to read the top Big Data books of all time? Read Shortform summaries.
Shortform summaries help you learn 10x faster by:
- Being comprehensive: you learn the most important points in the book
- Cutting out the fluff: you focus your time on what's important to know
- Interactive exercises: apply the book's ideas to your own life with our educators' guidance.
This is the... more
Michael OkudaEdward Tufte's classic book, The Visual Display of Quantitative Information is a fascinating, surprisingly readable treatise for anyone interested in infographics. When I hired artists for the Star Trek graphics dept, I sometimes asked them to read it.https://t.co/cK4GQqBDxp (Source)
Each pattern is explained in context, with pitfalls and caveats clearly identified to help you avoid common design mistakes when modeling your big data architecture. This book also provides a complete overview of MapReduce that explains its origins and implementations,... more
The powers that surveil us do more than simply store this information. Corporations use surveillance to manipulate not only the news articles and advertisements we each see, but also the... more
Tim @RealscientistsIf you are interested in learning programming, there are lots of great tutorials. For data analysis, R and the R 4 data science book is a great way to go https://t.co/zezYpG0TRL, and for general R syntax, there is the swirl learning package https://t.co/Tzfpnlgo3O /20 (Source)
Mindf*ck goes deep inside Cambridge Analytica's "American operations," which were driven by Steve Bannon's vision to remake America and fueled by mysterious billionaire Robert Mercer's money, as it weaponized and wielded the massive store of data it had harvested on individuals in--excess of 87 million--to... more
In May 2013, Glenn Greenwald set out for Hong Kong to meet an anonymous source who claimed to have astonishing evidence of pervasive government spying and insisted on communicating only through heavily encrypted channels. That source turned out to be the twenty-nine-year-old NSA contractor Edward Snowden, and his revelations about the agency's widespread, systemic overreach proved to be some of the most explosive and consequential news in recent history, triggering a... more
Gilbert RwabigwiYour book, “No Place To Hide”, was a thrilling/insightful read. Can’t wait to flip through @Snowden’s memoir. 🙏🏾 https://t.co/pZPLxDpNcM (Source)
Along the way, you'll experiment... more
Working scientists and data crunchers familiar with reading and writing Python code will find this comprehensive desk reference ideal for tackling day-to-day issues: manipulating, transforming, and cleaning data; visualizing... more
Kirk Borne✨🎉🌟Must see this >> Free #Python #DataScience Coding book series for #DataScientists ...via @DataScienceCtrl Go to https://t.co/To10VVZzIl ——————— #abdsc #BigData #MachineLearning #AI #DeepLearning #BeDataBrilliant #DataLiteracy https://t.co/Msuo1jiZSm (Source)
Microservice technologies are moving quickly. Author Sam Newman provides you with a firm grounding in the concepts while diving into current solutions... more
Cracking the Coding Interview, 6th Edition is here to help you through this process, teaching you what you need to know and enabling you to perform at your very best. I've coached and interviewed hundreds of software engineers. The result is this book.
Learn how to uncover the hints and hidden details in a question,... more
Don't have time to read the top Big Data books of all time? Read Shortform summaries.
Shortform summaries help you learn 10x faster by:
- Being comprehensive: you learn the most important points in the book
- Cutting out the fluff: you focus your time on what's important to know
- Interactive exercises: apply the book's ideas to your own life with our educators' guidance.
The story of information begins in a time profoundly unlike our own, when every thought and utterance vanishes as soon as it is born. From the invention of scripts and alphabets to the long-misunderstood talking drums of Africa, Gleick tells the story of information technologies that... more
Nicholas CarrIf Standage’s is a small book focused on a particular technology and moment in time, Gleick’s is extraordinarily broad and sweeping. It’s a very large book, in which he tries – and succeeds in many ways I think – to tell the story of information in human history. Information breaks down into two different things in essence. On the one hand it is messages – things with meaning to human beings –... (Source)
If you're a newcomer to both search and distributed systems, you'll quickly learn how to integrate Elasticsearch into your application. More experienced users will pick up lots of advanced techniques. Throughout the... more
"The Freakonomics of big data." --Stein Kretsinger, founding executive of Advertising.com
Award-winning - Used by over 30 universities - Translated into 9 languages
An introduction for everyone. In this rich, fascinating -- surprisingly accessible -- introduction, leading expert Eric Siegel reveals how predictive analytics (aka machine learning) works, and how it affects everyone every day. Rather than a "how to" for hands-on... more
Big data management is one of the major challenges facing business, industry, and not-for-profit organizations. Data sets such as customer transactions for a mega-retailer, weather patterns monitored by meteorologists, or social network activity can quickly outpace the capacity of traditional data management tools. If you need to develop or manage big data solutions, you'll appreciate how these four experts define, explain, and guide you through this new and often confusing concept. You'll learn what it is,... more
When the term “big data” first came on the scene, bestselling author Tom Davenport (Competing on Analytics, Analytics at Work) thought it was just another example of technology hype. But his research in the years that followed changed his mind.
Now, in clear, conversational language, Davenport explains what big data means—and why everyone in business needs to know about it. Big Data at Work covers all the bases: what big data means from a technical, consumer, and management... more
In The Second Machine Age MIT's Erik Brynjolfsson and Andrew McAfee—two thinkers at the forefront of their field—reveal the forces driving the...
moreMichael DellThe authors make a case for a future world that is better, not worse, than the one we inherited. That may seem far-fetched given the problems we see flashing across our screens every day. But there is reason for optimism, and it starts and ends with one of my favorite things, technology. (Source)
Dominic Steil[One of the books that had the biggest impact on .] (Source)
As Hillary Clinton's Senior Advisor for Innovation, Alec Ross travelled nearly a million miles to forty-one countries, the equivalent of two round-trips to the moon. From refugee camps in the Congo and Syrian war zones, to visiting the world's most powerful people in business and government, Ross's travels amounted to a four-year masterclass in the changing... more
Marvin LiaoMy list would be (besides the ones I mentioned in answer to the previous question) both business & Fiction/Sci-Fi and ones I personally found helpful to myself. The business books explain just exactly how business, work & investing are in reality & how to think properly & differentiate yourself. On the non-business side, a mix of History & classic fiction to understand people, philosophy to make... (Source)
Comprising dozens of examples that address different industries and departments (healthcare, transportation, finance, human resources, marketing, customer service, sports, etc.) and different platforms (print, desktop, tablet, smartphone, and conference room display) The Big Book of Dashboards is the only book that... more
Don't have time to read the top Big Data books of all time? Read Shortform summaries.
Shortform summaries help you learn 10x faster by:
- Being comprehensive: you learn the most important points in the book
- Cutting out the fluff: you focus your time on what's important to know
- Interactive exercises: apply the book's ideas to your own life with our educators' guidance.
Concise and to the point — the book can be read during a week. During that week, you will learn almost everything modern machine learning has to offer. The author and other practitioners have spent years learning these concepts.
Companion wiki — the book has a continuously updated wiki that extends some book chapters with additional information: Q&A, code snippets, further reading, tools, and other relevant resources.
more
Kirk BorneRecent top-selling books in #AI & #MachineLearning: https://t.co/Ij9I7SzR4d ————— #BigData #DataScience #DataMining #Algorithms #PredictiveAnalytics #Python ————— ...in the TOP 10: 1)The Hundred-Page ML Book: https://t.co/dQ7nP6gwP0 2)Hands-on ML with...: https://t.co/Y0Iz3GbtGP https://t.co/72rAFN1FwW (Source)
Do you want to understand how to manage databases without all the confusion?
Well than, this is your go to guide to help you master SQL programming in no time!
This book breaks down the fundamentals elements that are essential to make you proficient in SQL programming and database management
By the end of this book you will be confident enough to take on any problems that encompass SQL
SQL software can be complex,... more
In the world's top research labs and universities, the race is on to invent the ultimate learning algorithm: one capable of discovering any knowledge from data, and doing anything we want, before we even ask. In The Master Algorithm, Pedro Domingos lifts the veil to give us a peek inside the learning machines that power Google, Amazon, and your smartphone. He assembles a blueprint for the future universal learner--the Master Algorithm--and... more
Vinod KhoslaIf you want speculation about what the master AI might need (one view). For a slightly more technical read, I’d suggest Ian Goodfellows Deep Learning. (Source)
Grokking Algorithms is a disarming take on a core computer science topic. In it, you'll learn how to apply... more
The purpose of this book is to introduce technology practitioners, including software architects and developers, as well as technology decision makers to probabilistic data structures and algorithms. Reading this book, you will get a theoretical and practical understanding of probabilistic data structures and learn about their common uses. less
Don't have time to read the top Big Data books of all time? Read Shortform summaries.
Shortform summaries help you learn 10x faster by:
- Being comprehensive: you learn the most important points in the book
- Cutting out the fluff: you focus your time on what's important to know
- Interactive exercises: apply the book's ideas to your own life with our educators' guidance.
Statistics are everywhere, as integral to science as they are to business, and in the popular media hundreds of times a day. In this age of big data, a basic grasp of statistical literacy is more important than ever if we want to separate the fact from the fiction, the ostentatious embellishments from the raw evidence -- and even more so if we hope to participate in the future, rather than being simple bystanders.
In The Art of Statistics, world-renowned statistician David Spiegelhalter shows readers how to derive knowledge... more
Dan Davies@amoralelite @d_spiegel It's a great book. I thought it was like coming home because I've always tried to avoid calculation due to the dyspraxia and it was just "yes, that's how you think about it" (Source)
Valliappa Lakshmanan, tech lead for Google Cloud Platform, and Jordan Tigani, engineering director for the... more
Rather than run through all possible scenarios, this pragmatic operations guide calls out what works, as demonstrated in critical... more
This example-driven guide shows you how to set up and configure Hive in your environment, provides a detailed overview of Hadoop and MapReduce, and demonstrates how Hive works within the Hadoop ecosystem. You’ll also find real-world case studies that describe how companies have used... more
Technological advances have benefited our world in immeasurable ways, but there is an ominous flip side: our technology can be turned against us. Hackers can activate baby monitors to spy on families, thieves are analyzing social media posts to plot home... more
Don't have time to read the top Big Data books of all time? Read Shortform summaries.
Shortform summaries help you learn 10x faster by:
- Being comprehensive: you learn the most important points in the book
- Cutting out the fluff: you focus your time on what's important to know
- Interactive exercises: apply the book's ideas to your own life with our educators' guidance.
"Using Flume" describes the rich set of features that enable you to write data in any format supported by the Hadoop Distributed File System (HDFS) and HBase, including MapReduce, Hive, Impala, and Pig. You'll also learn about Flume's design and implementation, as well as various features that make Flume highly... more
"Little fish, little fish, let me come in."
"Not by the skin of my finny fin fin!"
"Then I'll munch, and I'll crunch, and I'll smash your house in!"
Mama tells her three little fish that it's time to make their own homes. Jim builds his house of seaweed, but the big bad shark munches it up. Tim builds his house of sand, but the shark crunches it up. It's smart Kim who sets up house in an old sunken ship!
Children will delight in this silly story with funny, eye-popping... more
Erez Aiden and Jean-Baptiste Michel are two young scientists at Harvard who started to ask those questions. They teamed up with Google to create the Ngram Viewer, a Web-based tool that can chart words throughout the massive Google Books archive, sifting through billions of words to find... more
Hadley WickhamThis book is an illustration of the power of names in the Google era. (Source)
In many of these chapter-long lectures, data scientists from companies such as Google, Microsoft, and eBay share new algorithms, methods, and models by presenting case studies and the code they use. If you’re... more
Raluca RaduI work in digital marketing so I would [recommend]: [...] Web Analytics: An Hour A Day and Web Analytics 2.0 by Avinash Kaushik. (Source)
Each chapter provides exercises to help you learn various techniques and short quizzes to make sure you understand key concepts.
This thoroughly revised edition is ideal for students and professionals of all backgrounds and skill... more
Kafka Streams in Action teaches you everything you need to know to implement stream processing on data flowing into your Kafka platform, allowing you to focus on getting more from your data without sacrificing time or effort.
Foreword by Neha Narkhede, Cocreator of Apache Kafka
Purchase of the print book includes a free eBook in PDF, Kindle, and ePub formats from Manning Publications.
About the Technology
Not all stream-based applications require a dedicated processing cluster. The lightweight Kafka Streams... more
Don't have time to read the top Big Data books of all time? Read Shortform summaries.
Shortform summaries help you learn 10x faster by:
- Being comprehensive: you learn the most important points in the book
- Cutting out the fluff: you focus your time on what's important to know
- Interactive exercises: apply the book's ideas to your own life with our educators' guidance.
--Tim Urban, author of Wait But Why Fully Practical, Insightful Guide to Modern Deep Learning
Deep learning is transforming software, facilitating powerful new artificial intelligence capabilities, and driving unprecedented algorithm performance. Deep Learning Illustrated is uniquely intuitive and offers a complete introduction to the discipline's techniques. Packed with... more
Kirk Borne🌟📘📊📈Awesome new book >> #DeepLearning Illustrated — A Visual, Interactive Guide to Artificial Intelligence” https://t.co/xIW48MskrR by @JonKrohnLearns ——————— #BigData #Analytics #DataScience #AI #MachineLearning #Algorithms #NeuralNetworks https://t.co/JKSrVRLpS0 (Source)
Author Emmanuel Ameisen, who worked as a data scientist at Zipcar and led Insight Data Science's AI program, demonstrates key ML concepts with code snippets,... more
"Statistics Done Wrong" comes to the rescue with cautionary tales of all-too-common statistical fallacies. It'll help you see where and why researchers often go wrong and teach you the best practices for avoiding their mistakes.
In this... more
FROM THE WALL STREET JOURNAL: "Nothing Mr. Gilder says or writes is ever delivered at anything less than the fullest philosophical decibel... Mr. Gilder sounds less like a tech guru than a poet, and his words tumble out in a romantic cascade."
“Google’s algorithms assume the world’s future is nothing more than the next moment in a random process. George Gilder shows how deep this assumption goes, what motivates people to make it, and why it’s wrong: the future depends on human action.”... more
Dominic Steil[One of the five books recommends to young people interested in his career path.] (Source)
Today, data science is a form of power. It has been used to expose injustice, improve health outcomes, and topple governments. But it has also been used to discriminate, police, and surveil. This potential for good, on the one hand, and harm, on the other, makes it essential to ask: Data science by whom? Data science for whom? Data science with whose interests in mind? The narratives around big data and data science are overwhelmingly white, male, and techno-heroic. In... more
Ideal for enterprise architects, IT managers, application architects, and data engineers, this book shows you how to overcome the many challenges that emerge during Hadoop projects. You'll explore the vast landscape of tools available in the Hadoop and... more
R is the world's most popular language for developing statistical software: Archaeologists use it to track the spread of ancient civilizations, drug companies use it to discover which medications are safe and effective, and actuaries use it to assess financial risks and keep economies running smoothly.
The Art of R Programming takes you on a guided tour of software development with R, from basic types and data structures to advanced topics like closures, recursion, and anonymous functions. No statistical knowledge is required, and your programming skills can range from...
Don't have time to read the top Big Data books of all time? Read Shortform summaries.
Shortform summaries help you learn 10x faster by:
- Being comprehensive: you learn the most important points in the book
- Cutting out the fluff: you focus your time on what's important to know
- Interactive exercises: apply the book's ideas to your own life with our educators' guidance.
In this new book by the author of the #1 bestseller The New Rules of Marketing & PR, David Meerman Scott demystifies the new digital commercial landscape and offers inspiring... more
By working through the book's 22 lessons of 10 minutes or less, you'll learn what you need to know to take advantage of the SQL language.
Lessons cover IBM DB2, Microsoft SQL Server and SQL Server Express, MariaDB, MySQL, Oracle and Oracle express, PostgreSQL, and SQLite.
Full-color code examples help you understand how SQL statements are structured
Tips point out shortcuts and solutions
Cautions help you avoid common... more
Ideal for software engineers, data engineers, developers, and system administrators working with large-scale data applications, this book describes techniques that can reduce data infrastructure costs... more
Social media analytics, web-tracking, and other technologies help companies acquire and handle massive amounts of data to better understand their customers, products, competition, and markets. Armed with the insights from big data, companies can improve customer experience and products, add value, and increase return on investment. The tricky part for busy IT professionals and executives is how to get this done, and that's where this practical book comes in. Big Data: Understanding How Data Powers Big Business is a... more
From cranky storage to poor representation to misguided policy, there are many paths to bad data. Bottom line? Bad data is data that gets in the way. This book explains effective ways to get around it.
Among the many topics covered, you’ll discover how to: more
Steve JurvetsonA series of epiphanies from [the author] and others that the world is really interesting when you look at iterative algorithms applied millions and billions of times. (Source)
Don't have time to read the top Big Data books of all time? Read Shortform summaries.
Shortform summaries help you learn 10x faster by:
- Being comprehensive: you learn the most important points in the book
- Cutting out the fluff: you focus your time on what's important to know
- Interactive exercises: apply the book's ideas to your own life with our educators' guidance.