100 Best Big Data Books of All Time

We've researched and ranked the best big data books in the world, based on recommendations from world experts, sales data, and millions of reader ratings. Learn more

Featuring recommendations from Reid Hoffman, Malcolm Gladwell, Charles T. Munger, and 161 other experts.
1
Many forces affect software today: larger datasets, geographical disparities, complex company structures, and the growing need to be fast and nimble in the face of change.

Proven approaches such as service-oriented and event-driven architectures are joined by newer techniques such as microservices, reactive architectures, DevOps, and stream processing. Many of these patterns are successful by themselves, but as this practical ebook demonstrates, they provide a more holistic and compelling approach when applied together.

Author Ben Stopford explains how service-based...
more

See more recommendations for this book...

2
Foreword by Steven Pinker

Blending the informed analysis of The Signal and the Noise with the instructive iconoclasm of Think Like a Freak, a fascinating, illuminating, and witty look at what the vast amounts of information now instantly available to us reveals about ourselves and our world—provided we ask the right questions.

By the end of an average day in the early twenty-first century, human beings searching the internet will amass eight trillion gigabytes of data. This staggering amount of information—unprecedented in history—can tell us a great deal about who we...
more
Recommended by Jj. Omojuwa, Ron Fournier, and 2 others.

Jj. Omojuwa@SympLySimi Lol. Read this book. You’d love it. https://t.co/d2cLOyoiZ9 (Source)

Ron FournierJust finished, “Everybody Lies” by @SethS_D, which in addition to being a tremendous education on Big Data, includes the best conclusion to a non-fiction book I’ve ever read. Read it. -30- (Source)

See more recommendations for this book...

3
Longlisted for the National Book Award
New York Times Bestseller


A former Wall Street quant sounds an alarm on the mathematical models that pervade modern life -- and threaten to rip apart our social fabric

We live in the age of the algorithm. Increasingly, the decisions that affect our lives--where we go to school, whether we get a car loan, how much we pay for health insurance--are being made not by humans, but by mathematical models. In theory, this should lead to greater fairness: Everyone is judged according to the same rules, and bias is...
more
Recommended by Paula Boddington, Ramesh Srinivasan, and 2 others.

Paula BoddingtonHow the use of algorithms has affected people’s lives and occasionally ruined them. (Source)

Ramesh SrinivasanThis book is a really fantastic analysis of how quantification, the collection of data, the modelling around data, the predictions made by using data, the algorithmic and quantifiable ways of predicting behaviour based on data, are all built by elites for elites and end up, quite frankly, screwing over everybody else. (Source)

See more recommendations for this book...

4

Outliers

The Story of Success

In this stunning new book, Malcolm Gladwell takes us on an intellectual journey through the world of "outliers"--the best and the brightest, the most famous and the most successful. He asks the question: what makes high-achievers different?

His answer is that we pay too much attention to what successful people are like, and too little attention to where they are from: that is, their culture, their family, their generation, and the idiosyncratic experiences of their upbringing. Along the way he explains the secrets of software billionaires, what it takes to be a great soccer player,...
more

Bill Gates[On Bill Gates's reading list in 2011.] (Source)

James AltucherGladwell is not the first person to come up with the 10,000 hour rule. Nor is he the first person to document what it takes to become the best in the world at something. But his stories are so great as he explains these deep concepts. How did the Beatles become the best? Why are professional hockey players born in January, February and March? And so on. (Source)

Cat Williams-TreloarThe books that I've talked the most about with friends and colleagues over the years are the Malcolm Gladwell series of novels. Glorious stories that mix science, behaviours and insight. You can't go wrong with the "The Tipping Point", "Outliers", "Blink" or "David & Goliath". (Source)

See more recommendations for this book...

5
Moneyball is a quest for something as elusive as the Holy Grail, something that money apparently can't buy: the secret of success in baseball. The logical places to look would be the front offices of major league teams and the dugouts, perhaps even in the minds of the players themselves. Michael Lewis mines all these possibilities - his intimate and original portraits of big league ballplayers are alone worth the price of admission - but the real jackpot is a cache of numbers - numbers! - collected over the years by a strange brotherhood of amateur baseball enthusiasts: software... more

Carol DweckYou would think that the relationship between training and skill would be utterly obvious in sports, but apparently it isn’t. (Source)

David PapineauIt’s a parable of the disinclination of people in general to base their practices on evidence, a parable for evidence-based policy in general. (Source)

Ed SmithThis is about a guy using econometrics to predict which baseball players will do better in advancing wins, a remarkable use of economic thinking. (Source)

See more recommendations for this book...

6

Big Data

A Revolution That Will Transform How We Live, Work, and Think

A revelatory exploration of the hottest trend in technology and the dramatic impact it will have on the economy, science, and society at large.

Which paint color is most likely to tell you that a used car is in good shape? How can officials identify the most dangerous New York City manholes before they explode? And how did Google searches predict the spread of the H1N1 flu outbreak?

The key to answering these questions, and many more, is big data. “Big data” refers to our burgeoning ability to crunch vast collections of information, analyze it instantly, and draw...
more

See more recommendations for this book...

7
Which is more dangerous, a gun or a swimming pool? What do schoolteachers and sumo wrestlers have in common? Why do drug dealers still live with their moms? How much do parents really matter? What kind of impact did Roe v. Wade have on violent crime? Freakonomics will literally redefine the way we view the modern world.

These may not sound like typical questions for an economist to ask. But Steven D. Levitt is not a typical economist. He is a much heralded scholar who studies the stuff and riddles of everyday life -- from cheating and crime to sports and child rearing -- and whose...
more

Malcolm GladwellI don’t need to say much here. This book invented an entire genre. Economics was never supposed to be this entertaining. (Source)

Daymond JohnI love newer books like [this book]. (Source)

James Altucher[James Altucher recommended this book on the podcast "The Tim Ferriss Show".] (Source)

See more recommendations for this book...

8
Written by renowned data science experts Foster Provost and Tom Fawcett, Data Science for Business introduces the fundamental principles of data science, and walks you through the "data-analytic thinking" necessary for extracting useful knowledge and business value from the data you collect. This guide also helps you understand the many data-mining techniques in use today.

Based on an MBA course Provost has taught at New York University over the past ten years, Data Science for Business provides examples of real-world business problems to illustrate these principles. You’ll...
more
Recommended by Kirk Borne, and 1 others.

Kirk BorneGreat book for Business Analytics and for building #AnalyticThinking >> “#DataScience for Business — What You Need to Know about #DataMining and Data-Analytic Thinking”: https://t.co/e9rAFnVYYQ #BigData #MachineLearning #DataStrategy #AnalyticsStrategy #Algorithms https://t.co/yEblfU2MZd (Source)

See more recommendations for this book...

9
One of Wall Street Journal's Best Ten Works of Nonfiction in 2012

New York Times Bestseller

"Not so different in spirit from the way public intellectuals like John Kenneth Galbraith once shaped discussions of economic policy and public figures like Walter Cronkite helped sway opinion on the Vietnam War…could turn out to be one of the more momentous books of the decade."
-New York Times Book Review

"Nate Silver's The Signal and the Noise is The Soul of a New Machine for the 21st century."
-Rachel Maddow, author of Drift

"A serious...
more
Recommended by Bill Gates, and 1 others.

Bill GatesAnyone interested in politics may be attracted to Nate Silver’s The Signal and the Noise: Why So Many Predictions Fail—but Some Don't. Silver is the New York Times columnist who got a lot of attention last fall for predicting—accurately, as it turned out–the results of the U.S. presidential election. This book actually came out before the election, though, and it’s about predictions in many... (Source)

See more recommendations for this book...

Don't have time to read the top Big Data books of all time? Read Shortform summaries.

Shortform summaries help you learn 10x faster by:

  • Being comprehensive: you learn the most important points in the book
  • Cutting out the fluff: you focus your time on what's important to know
  • Interactive exercises: apply the book's ideas to your own life with our educators' guidance.
11
Services like social networks, web analytics, and intelligent e-commerce often need to manage data at a scale too big for a traditional database. As scale and demand increase, so does Complexity. Fortunately, scalability and simplicity are not mutually exclusive—rather than using some trendy technology, a different approach is needed. Big data systems use many machines working in parallel to store and process data, which introduces fundamental challenges unfamiliar to most developers.

Big Data shows how to build these systems using an architecture that takes advantage of...
more
Recommended by Vicki Boykis, and 1 others.

Vicki BoykisThis book remains a great read if you want to understand how modern data architecture works, and especially distributed data systems. (Source)

See more recommendations for this book...

12
A New York Times Bestseller

An audacious, irreverent investigation of human behavior—and a first look at a revolution in the making

 
Our personal data has been used to spy on us, hire and fire us, and sell us stuff we don’t need. In Dataclysm, Christian Rudder uses it to show us who we truly are.
 
For centuries, we’ve relied on polling or small-scale lab experiments to study human behavior. Today, a new approach is possible. As we live more of our lives online, researchers can finally observe us directly, in vast numbers, and...
more
Recommended by Elad Yom-Tov, and 1 others.

Elad Yom-TovChristian Rudder was the chief scientist of a dating website, OK Cupid. (Source)

See more recommendations for this book...

13
Since Don’t Make Me Think was first published in 2000, over 400,000 Web designers and developers have relied on Steve Krug’s guide to help them understand the principles of intuitive navigation and information design.

In this 3rd edition, Steve returns with fresh perspective to reexamine the principles that made Don’t Make Me Think a classic-–with updated examples and a new chapter on mobile usability. And it’s still short, profusely illustrated…and best of all–fun to read.

If you’ve read it before, you’ll rediscover what made Don’t Make Me Think so essential to Web...
more

Chris GowardHere are some of the books that have been very impactful for me, or taught me a new way of thinking: [...] Don't Make Me Think. (Source)

Nicolae AndronicI’m a technical guy. I studied the IT field and did software development for a long time until I discovered the business world. So the path for me is to slowly adapt from the clear, technical world, to the fuzzy, way more complex, business world. All the books that I recommend help this transition. “Don’t Make Me Think” - Steve Krug: for seeing software with the eyes of the user. (Source)

Nick GanjuAbout usability and making software and user interfaces that are friendly to people. (Source)

See more recommendations for this book...

14
Even the smartest among us can feel inept as we fail to figure out which light switch or oven burner to turn on, or whether to push, pull, or slide a door. The fault, argues this ingenious—even liberating—book, lies not in ourselves, but in product design that ignores the needs of users and the principles of cognitive psychology. The problems range from ambiguous and hidden controls to arbitrary relationships between controls and functions, coupled with a lack of feedback or other assistance and unreasonable demands on memorization. The Design of Everyday Things shows that good, usable... more

Marius Ciuchete Pauneval(ez_write_tag([[250,250],'theceolibrary_com-large-mobile-banner-2','ezslot_5',164,'0','1'])); Question: Was there a moment, specifically, when something you read in a book helped you? Answer: Yes there was. In fact, I can remember two separate sentences from two different books: The first one comes from “The Design of Everyday Things” by Don Norman. It says: “great design will help... (Source)

Grey BakerI mainly read to decompress and change my state of mind, so it’s hard to point to an insight I read that helped me. Reading fiction has pulled me out of a bad mood more times than I can count, though, and always reenergises me to attack problems that had stumped me again. That said, I read and loved Norman Norman’s “The Design of Everyday Things”, and it’s helped me think through design problems... (Source)

Kaci LambeThese three books are about how people actually use design in their lives. They helped me understand this very basic idea: There are no dumb users, only bad designers. Take the time to create based on how your design will be interacted with. Test it. Iterate. That's how you become a good designer. (Source)

See more recommendations for this book...

15
Don't simply show your data--tell a story with it! Storytelling with Data teaches you the fundamentals of data visualization and how to communicate effectively with data. You'll discover the power of storytelling and the way to make data a pivotal point in your story. The lessons in this illuminative text are grounded in theory, but made accessible through numerous real-world examples--ready for immediate application to your next graph or presentation.

Storytelling is not an inherent skill, especially when it comes to data visualization, and the tools at our disposal...
more
Recommended by Roger D. Peng, and 1 others.

Roger D. PengIt’s important to think in terms of what your audience needs, and what would be best for them among the many choices you could make when analysing data. (Source)

See more recommendations for this book...

16
Learn how to use, deploy, and maintain Apache Spark with this comprehensive guide, written by the creators of the open-source cluster-computing framework. With an emphasis on improvements and new features in Spark 2.0, authors Bill Chambers and Matei Zaharia break down Spark topics into distinct sections, each with unique goals.

You'll explore the basic operations and common functions of Spark's structured APIs, as well as Structured Streaming, a new high-level API for building end-to-end streaming applications. Developers and system administrators will learn the fundamentals of...
more

See more recommendations for this book...

17

Superforecasting

The Art and Science of Prediction

New York Times Bestseller

An Economist Best Book of 2015

"The most important book on decision making since Daniel Kahneman's Thinking, Fast and Slow."
Jason Zweig, The Wall Street Journal
 
Everyone would benefit from seeing further into the future, whether buying stocks, crafting policy, launching a new product, or simply planning the week’s meals. Unfortunately, people tend to be terrible forecasters. As Wharton professor Philip Tetlock showed in a landmark 2005 study, even...
more

Sheil KapadiaRead the book Superforecasting, had a great conversation with @bcmassey and came up with seven ideas for how NFL teams can try to find small edges during the draft process. Would love to hear feedback on this one. https://t.co/PdN1fKCagl (Source)

Julia Galef[Has] some good advice on how to improve your ability to make accurate predictions. (Source)

See more recommendations for this book...

18

In April 1956, a refitted oil tanker carried fifty-eight shipping containers from Newark to Houston. From that modest beginning, container shipping developed into a huge industry that made the boom in global trade possible. "The Box" tells the dramatic story of the container's creation, the decade of struggle before it was widely adopted, and the sweeping economic consequences of the sharp fall in transportation costs that containerization brought about.
Published on the fiftieth anniversary of the first container voyage, this is the first comprehensive history of the shipping...
more

Bill GatesI picked this one up after seeing it on a Wall Street Journal list of good books for investors. It was first published in 1954, but it doesn’t feel dated (aside from a few anachronistic examples—it has been a long time since bread cost 5 cents a loaf in the United States). In fact, I’d say it’s more relevant than ever. One chapter shows you how visuals can be used to exaggerate trends and give... (Source)

Tobi LütkeWe all live in Malcolm’s world because the shipping container has been hugely influential in history. (Source)

Jason ZweigThis is a terrific introduction to critical thinking about statistics, for people who haven’t taken a class in statistics. (Source)

See more recommendations for this book...

19
Hadoop: The Definitive Guide helps you harness the power of your data. Ideal for processing large datasets, the Apache Hadoop framework is an open source implementation of the MapReduce algorithm on which Google built its empire. This comprehensive resource demonstrates how to use Hadoop to build reliable, scalable, distributed systems: programmers will find details for analyzing large datasets, and administrators will learn how to set up and run Hadoop clusters.

Complete with case studies that illustrate how Hadoop solves specific problems, this book helps you:
more

See more recommendations for this book...

20
Data Science gets thrown around in the press like it's magic. Major retailers are predicting everything from when their customers are pregnant to when they want a new pair of Chuck Taylors. It's a brave new world where seemingly meaningless data can be transformed into valuable insight to drive smart business decisions.

But how does one exactly do data science? Do you have to hire one of these priests of the dark arts, the "data scientist," to extract this gold from your data? Nope.

Data science is little more than using straight-forward steps to process raw data into...
more

See more recommendations for this book...

Don't have time to read the top Big Data books of all time? Read Shortform summaries.

Shortform summaries help you learn 10x faster by:

  • Being comprehensive: you learn the most important points in the book
  • Cutting out the fluff: you focus your time on what's important to know
  • Interactive exercises: apply the book's ideas to your own life with our educators' guidance.
21
Superintelligence asks the questions: what happens when machines surpass humans in general intelligence? Will artificial agents save or destroy us? Nick Bostrom lays the foundation for understanding the future of humanity and intelligent life.

The human brain has some capabilities that the brains of other animals lack. It is to these distinctive capabilities that our species owes its dominant position. If machine brains surpassed human brains in general intelligence, then this new superintelligence could become extremely powerful--possibly beyond our control. As the fate of the...
more

Elon MuskWorth reading Superintelligence by Bostrom. We need to be super careful with AI. Potentially more dangerous than nukes. (Source)

Maria RamosRamos will take the summer to examine some of the questions weighing more heavily on humankind as we contemplate our collective future: what happens when we can write our own genetic codes, and what happens when we create technology that is meaningfully more intelligent than us. The Gene: An Intimate History—Siddhartha Mukherjee Superintelligence: Paths, Dangers, Strategies—Nick Bostrom The... (Source)

Will MacAskillI picked this book because the possibility of us developing human-level artificial intelligence, and from there superintelligence—an artificial agent that is considerably more intelligent than we are—is at least a contender for the most important issue in the next two centuries. Bostrom’s book has been very influential in effective altruism, lots of people work on artificial intelligence in order... (Source)

See more recommendations for this book...

22

An Introduction to Statistical Learning

With Applications in R

An Introduction to Statistical Learning provides an accessible overview of the field of statistical learning, an essential toolset for making sense of the vast and complex data sets that have emerged in fields ranging from biology to finance to marketing to astrophysics in the past twenty years. This book presents some of the most important modeling and prediction techniques, along with relevant applications. Topics include linear regression, classification, resampling methods, shrinkage approaches, tree- based methods, support vector machines, clustering, and more. Color graphics and... more
Recommended by Roger D. Peng, and 1 others.

Roger D. PengThis book is written by a powerhouse of authors in the machine learning community, true authorities in the field. But beyond that, they’re also great writers. (Source)

See more recommendations for this book...

23
Dr. Kai-Fu Lee—one of the world’s most respected experts on AI and China—reveals that China has suddenly caught up to the US at an astonishingly rapid and unexpected pace.   In AI Superpowers, Kai-fu Lee argues powerfully that because of these unprecedented developments in AI, dramatic changes will be happening much sooner than many of us expected. Indeed, as the US-Sino AI competition begins to heat up, Lee urges the US and China to both accept and to embrace the great responsibilities that come with significant technological power. Most experts already say that AI will have a devastating... more

Yuval Noah HarariA superb and very timely survey of the impact of AI on the geopolitical system, the job market and human society. (Source)

Arianna HuffingtonKai-Fu Lee's experience as an AI pioneer, top investor, and cancer survivor has led to this brilliant book about global technology. AI Superpowers gives us a guide to a future that celebrates all the benefits that AI will bring, while cultivating what is unique about our humanity. It’s one of those books you read and think, ‘Why are people reading any other book right now when this is so clearly... (Source)

Satya NadellaKai-Fu Lee's smart analysis on human-AI coexistence is clear-eyed and a must-read. We must look deep within ourselves for the values and wisdom to guide AI's development. (Source)

See more recommendations for this book...

24
Looking for complete instructions on manipulating, processing, cleaning, and crunching structured data in Python? The second edition of this hands-on guide--updated for Python 3.5 and Pandas 1.0--is packed with practical cases studies that show you how to effectively solve a broad set of data analysis problems, using Python libraries such as NumPy, pandas, matplotlib, and IPython.

Written by Wes McKinney, the main author of the pandas library, Python for Data Analysis also serves as a practical, modern introduction to scientific computing in Python for data-intensive...
more

See more recommendations for this book...

25

Mining of Massive Datasets

The popularity of the Web and Internet commerce provides many extremely large datasets from which information can be gleaned by data mining. This book focuses on practical algorithms that have been used to solve key problems in data mining and which can be used on even the largest datasets. It begins with a discussion of the map-reduce framework, an important tool for parallelizing algorithms automatically. The authors explain the tricks of locality-sensitive hashing and stream processing algorithms for mining data that arrives too fast for exhaustive processing. The PageRank idea and related... more

See more recommendations for this book...

26
What do flashlights, the British invasion, black cats, and seesaws have to do with computers? In CODE, they show us the ingenious ways we manipulate language and invent new means of communicating with each other. And through CODE, we see how this ingenuity and our very human compulsion to communicate have driven the technological innovations of the past two centuries.

Using everyday objects and familiar language systems such as Braille and Morse code, author Charles Petzold weaves an illuminating narrative for anyone who’s ever wondered about the secret inner life of...
more
Recommended by Ana Bell, and 1 others.

Ana BellIt gets you to use your imagination to virtually build a computer. It’s easy to read, you can lie down on the couch and enjoy it—it’s not so much of a textbook. It demystifies the magic of a computer and what it is. (Source)

See more recommendations for this book...

27

Lean Analytics

Use Data to Build a Better Startup Faster

Whether you’re a startup founder trying to disrupt an industry or an intrapreneur trying to provoke change from within, your biggest challenge is creating a product people actually want. Lean Analytics steers you in the right direction.

This book shows you how to validate your initial idea, find the right customers, decide what to build, how to monetize your business, and how to spread the word. Packed with more than thirty case studies and insights from over a hundred business experts, Lean Analytics provides you with hard-won, real-world information no entrepreneur...
more

Ola OlusogaLike Charlie Munger once said: “I’ve long believed that a certain system - which almost any intelligent person can learn - works way better than the systems most people use [to understand the world]. What you need is a latticework of mental models in your head. And, with that system, things gradually fit together in a way that enhances cognition. Just as multiple factors shape every system,... (Source)

See more recommendations for this book...

28
To really learn data science, you should not only master the tools--data science libraries, frameworks, modules, and toolkits--but also understand the ideas and principles underlying them. Updated for Python 3.6, this second edition of Data Science from Scratch shows you how these tools and algorithms work by implementing them from scratch.

If you have an aptitude for mathematics and some programming skills, author Joel Grus will help you get comfortable with the math and statistics at the core of data science, and with the hacking skills you need to get started as a data...
more

Thorsten HellerThe Best #book to Start your #DataScience Journey - Towards #DataScience https://t.co/D8PlkkSxw6 by @benthecoder1 (Source)

See more recommendations for this book...

29

Kafka: The Definitive Guide

Real-Time Data and Stream Processing at Scale

Every enterprise application creates data, whether it's log messages, metrics, user activity, outgoing messages, or something else. And how to move all of this data becomes nearly as important as the data itself. If you're an application architect, developer, or production engineer new to Apache Kafka, this practical guide shows you how to use this open source streaming platform to handle real-time data feeds.

Engineers from Confluent and LinkedIn who are responsible for developing Kafka explain how to deploy production Kafka clusters, write reliable event-driven microservices, and...
more

See more recommendations for this book...

30
The challenges to humanity posed by the digital future, the first detailed examination of the unprecedented form of power called "surveillance capitalism," and the quest by powerful corporations to predict and control our behavior.

In this masterwork of original thinking and research, Shoshana Zuboff provides startling insights into the phenomenon that she has named surveillance capitalism. The stakes could not be higher: a global architecture of behavior modification threatens human nature in the twenty-first century just as industrial capitalism disfigured the...
more

Nicholas CarrWhatever its imperfections, The Age of Surveillance Capitalism is an original and often brilliant work, and it arrives at a crucial moment, when the public and its elected representatives are at last grappling with the extraordinary power of digital media and the companies that control it. Like another recent masterwork of economic analysis, Thomas Piketty’s 2013 Capital in the Twenty-First... (Source)

Naomi KleinFrom the very first page I was consumed with an overwhelming imperative: everyone needs to read this book as an act of digital self-defense. With tremendous lucidity and moral courage, Zuboff demonstrates not only how our minds are being mined for data but also how they are being rapidly and radically changed in the process. The hour is late and much has been lost already—but as we learn in these... (Source)

Clive Lewis MpCant make the brilliant event below? Havent had a chance to read @shoshanazuboff groundbreaking book, ‘Surveillance Capitalism’? Then listen to this brilliant interview with the author as she explains the terrifying scale&ambition of Facebook/Google et al https://t.co/DCtNlFbmE0 https://t.co/ZX0YpW5pOo (Source)

See more recommendations for this book...

Don't have time to read the top Big Data books of all time? Read Shortform summaries.

Shortform summaries help you learn 10x faster by:

  • Being comprehensive: you learn the most important points in the book
  • Cutting out the fluff: you focus your time on what's important to know
  • Interactive exercises: apply the book's ideas to your own life with our educators' guidance.
31
The classic book on statistical graphics, charts, tables. Theory and practice in the design of data graphics, 250 illustrations of the best (and a few of the worst) statistical graphics, with detailed analysis of how to display data for precise, effective, quick analysis. Design of the high-resolution displays, small multiples. Editing and improving graphics. The data-ink ratio. Time-series, relational graphics, data maps, multivariate designs. Detection of graphical deception: design variation vs. data variation. Sources of deception. Aesthetics and data graphical displays.
This is the...
more
Recommended by Bret Victor, Michael Okuda, and 2 others.

Michael OkudaEdward Tufte's classic book, The Visual Display of Quantitative Information is a fascinating, surprisingly readable treatise for anyone interested in infographics. When I hired artists for the Star Trek graphics dept, I sometimes asked them to read it.https://t.co/cK4GQqBDxp (Source)

See more recommendations for this book...

32
Until now, design patterns for the MapReduce framework have been scattered among various research papers, blogs, and books. This handy guide brings together a unique collection of valuable MapReduce patterns that will save you time and effort regardless of the domain, language, or development framework you’re using.

Each pattern is explained in context, with pitfalls and caveats clearly identified to help you avoid common design mistakes when modeling your big data architecture. This book also provides a complete overview of MapReduce that explains its origins and implementations,...
more

See more recommendations for this book...

33
Your cell phone provider tracks your location and knows who’s with you. Your online and in-store purchasing patterns are recorded, and reveal if you're unemployed, sick, or pregnant. Your e-mails and texts expose your intimate and casual friends. Google knows what you’re thinking because it saves your private searches. Facebook can determine your sexual orientation without you ever mentioning it.

The powers that surveil us do more than simply store this information. Corporations use surveillance to manipulate not only the news articles and advertisements we each see, but also the...
more

See more recommendations for this book...

34

Data Science with R

Recommended by Tim @Realscientists, and 1 others.

Tim @RealscientistsIf you are interested in learning programming, there are lots of great tutorials. For data analysis, R and the R 4 data science book is a great way to go https://t.co/zezYpG0TRL, and for general R syntax, there is the swirl learning package https://t.co/Tzfpnlgo3O /20 (Source)

See more recommendations for this book...

35
For the first time, the Cambridge Analytica whistleblower tells the inside story of the data mining and psychological manipulation behind the election of Donald Trump and the Brexit referendum, connecting Facebook, WikiLeaks, Russian intelligence, and international hackers.

Mindf*ck goes deep inside Cambridge Analytica's "American operations," which were driven by Steve Bannon's vision to remake America and fueled by mysterious billionaire Robert Mercer's money, as it weaponized and wielded the massive store of data it had harvested on individuals in--excess of 87 million--to...
more

See more recommendations for this book...

36
By Glenn Greenwald, star of Citizenfour, the Academy Award-winning documentary on Edward Snowden

In May 2013, Glenn Greenwald set out for Hong Kong to meet an anonymous source who claimed to have astonishing evidence of pervasive government spying and insisted on communicating only through heavily encrypted channels. That source turned out to be the twenty-nine-year-old NSA contractor Edward Snowden, and his revelations about the agency's widespread, systemic overreach proved to be some of the most explosive and consequential news in recent history, triggering a...
more
Recommended by Gilbert Rwabigwi, and 1 others.

Gilbert RwabigwiYour book, “No Place To Hide”, was a thrilling/insightful read. Can’t wait to flip through @Snowden’s memoir. 🙏🏾 https://t.co/pZPLxDpNcM (Source)

See more recommendations for this book...

37
Collecting data is relatively easy, but turning raw information into something useful requires that you know how to extract precisely what you need. With this insightful book, intermediate to experienced programmers interested in data analysis will learn techniques for working with data in a business environment. You'll learn how to look at data to discover what it contains, how to capture those ideas in conceptual models, and then feed your understanding back into the organization through business plans, metrics dashboards, and other applications.

Along the way, you'll experiment...
more

See more recommendations for this book...

38
For many researchers, Python is a first-class tool mainly because of its libraries for storing, manipulating, and gaining insight from data. Several resources exist for individual pieces of this data science stack, but only with the Python Data Science Handbook do you get them all—IPython, NumPy, Pandas, Matplotlib, Scikit-Learn, and other related tools.

Working scientists and data crunchers familiar with reading and writing Python code will find this comprehensive desk reference ideal for tackling day-to-day issues: manipulating, transforming, and cleaning data; visualizing...
more
Recommended by Kirk Borne, and 2 others.

Kirk Borne✨🎉🌟Must see this >> Free #Python #DataScience Coding book series for #DataScientists ...via @DataScienceCtrl Go to https://t.co/To10VVZzIl ——————— #abdsc #BigData #MachineLearning #AI #DeepLearning #BeDataBrilliant #DataLiteracy https://t.co/Msuo1jiZSm (Source)

See more recommendations for this book...

39
Distributed systems have become more fine-grained in the past 10 years, shifting from code-heavy monolithic applications to smaller, self-contained microservices. But developing these systems brings its own set of headaches. With lots of examples and practical advice, this book takes a holistic view of the topics that system architects and administrators must consider when building, managing, and evolving microservice architectures.

Microservice technologies are moving quickly. Author Sam Newman provides you with a firm grounding in the concepts while diving into current solutions...
more

See more recommendations for this book...

40
I am not a recruiter. I am a software engineer. And as such, I know what it's like to be asked to whip up brilliant algorithms on the spot and then write flawless code on a whiteboard. I've been through this as a candidate and as an interviewer.

Cracking the Coding Interview, 6th Edition is here to help you through this process, teaching you what you need to know and enabling you to perform at your very best. I've coached and interviewed hundreds of software engineers. The result is this book.

Learn how to uncover the hints and hidden details in a question,...
more

See more recommendations for this book...

Don't have time to read the top Big Data books of all time? Read Shortform summaries.

Shortform summaries help you learn 10x faster by:

  • Being comprehensive: you learn the most important points in the book
  • Cutting out the fluff: you focus your time on what's important to know
  • Interactive exercises: apply the book's ideas to your own life with our educators' guidance.
41
James Gleick, the author of the best sellers Chaos and Genius, now brings us a work just as astonishing and masterly: a revelatory chronicle and meditation that shows how information has become the modern era’s defining quality—the blood, the fuel, the vital principle of our world.
 
The story of information begins in a time profoundly unlike our own, when every thought and utterance vanishes as soon as it is born. From the invention of scripts and alphabets to the long-misunderstood talking drums of Africa, Gleick tells the story of information technologies that...
more
Recommended by Nicholas Carr, and 1 others.

Nicholas CarrIf Standage’s is a small book focused on a particular technology and moment in time, Gleick’s is extraordinarily broad and sweeping. It’s a very large book, in which he tries – and succeeds in many ways I think – to tell the story of information in human history. Information breaks down into two different things in essence. On the one hand it is messages – things with meaning to human beings –... (Source)

See more recommendations for this book...

42
Whether you need full-text search or real-time analytics of structured data--or both--the Elasticsearch distributed search engine is an ideal way to put your data to work. This practical guide not only shows you how to search, analyze, and explore data with Elasticsearch, but also helps you deal with the complexities of human language, geolocation, and relationships.

If you're a newcomer to both search and distributed systems, you'll quickly learn how to integrate Elasticsearch into your application. More experienced users will pick up lots of advanced techniques. Throughout the...
more

See more recommendations for this book...

43
"Mesmerizing & fascinating..." -- The Seattle Post-Intelligencer

"The Freakonomics of big data." --Stein Kretsinger, founding executive of Advertising.com

Award-winning - Used by over 30 universities - Translated into 9 languages

An introduction for everyone. In this rich, fascinating -- surprisingly accessible -- introduction, leading expert Eric Siegel reveals how predictive analytics (aka machine learning) works, and how it affects everyone every day. Rather than a "how to" for hands-on...
more

See more recommendations for this book...

44

Big Data for Dummies

Find the right big data solution for your business or organization

Big data management is one of the major challenges facing business, industry, and not-for-profit organizations. Data sets such as customer transactions for a mega-retailer, weather patterns monitored by meteorologists, or social network activity can quickly outpace the capacity of traditional data management tools. If you need to develop or manage big data solutions, you'll appreciate how these four experts define, explain, and guide you through this new and often confusing concept. You'll learn what it is,...
more

See more recommendations for this book...

45
Go ahead, be skeptical about big data. The author was—at first.

When the term “big data” first came on the scene, bestselling author Tom Davenport (Competing on Analytics, Analytics at Work) thought it was just another example of technology hype. But his research in the years that followed changed his mind.

Now, in clear, conversational language, Davenport explains what big data means—and why everyone in business needs to know about it. Big Data at Work covers all the bases: what big data means from a technical, consumer, and management...
more

See more recommendations for this book...

46
A revolution is under way. In recent years, Google's autonomous cars have logged thousands of miles on American highways and IBM's Watson trounced the best human Jeopardy! players. Digital technologies with hardware, software, and networks at their core will in the near future diagnose diseases more accurately than doctors can, apply enormous data sets to transform retailing, and accomplish many tasks once considered uniquely human.

In The Second Machine Age MIT's Erik Brynjolfsson and Andrew McAfee—two thinkers at the forefront of their field—reveal the forces driving the...

more

Michael DellThe authors make a case for a future world that is better, not worse, than the one we inherited. That may seem far-fetched given the problems we see flashing across our screens every day. But there is reason for optimism, and it starts and ends with one of my favorite things, technology. (Source)

Dominic Steil[One of the books that had the biggest impact on .] (Source)

See more recommendations for this book...

48
This book answers the question: 'What's next?' The Internet had a world-changing impact on businesses and the global community over the twenty years from 1994 to 2014. In the next ten years, change will happen even faster.

As Hillary Clinton's Senior Advisor for Innovation, Alec Ross travelled nearly a million miles to forty-one countries, the equivalent of two round-trips to the moon. From refugee camps in the Congo and Syrian war zones, to visiting the world's most powerful people in business and government, Ross's travels amounted to a four-year masterclass in the changing...
more
Recommended by Reid Hoffman, Marvin Liao, Ivan Kv, and 3 others.

Marvin LiaoMy list would be (besides the ones I mentioned in answer to the previous question) both business & Fiction/Sci-Fi and ones I personally found helpful to myself. The business books explain just exactly how business, work & investing are in reality & how to think properly & differentiate yourself. On the non-business side, a mix of History & classic fiction to understand people, philosophy to make... (Source)

Ivan KvI have finally finished reading your book (Industries of the Future), @AlecJRoss... My favorite read this year. #VR #AI #Genomics #cybersecurity #BigData #future https://t.co/qHsIYABkWS (Source)

See more recommendations for this book...

49
The definitive reference book with real-world solutions you won't find anywhere else The Big Book of Dashboards presents a comprehensive reference for those tasked with building or overseeing the development of business dashboards.

Comprising dozens of examples that address different industries and departments (healthcare, transportation, finance, human resources, marketing, customer service, sports, etc.) and different platforms (print, desktop, tablet, smartphone, and conference room display) The Big Book of Dashboards is the only book that...
more

See more recommendations for this book...

50
The need to handle increasingly larger data volumes is one factor driving the adoption of a new class of nonrelational "NoSQL" databases. Advocates of NoSQL databases claim they can be used to build systems that are more performant, scale better, and are easier to program." ""NoSQL Distilled" is a concise but thorough introduction to this rapidly emerging technology. Pramod J. Sadalage and Martin Fowler explain how NoSQL databases work and the ways that they may be a superior alternative to a traditional RDBMS. The authors provide a fast-paced guide to the concepts you need to know in order... more

See more recommendations for this book...

Don't have time to read the top Big Data books of all time? Read Shortform summaries.

Shortform summaries help you learn 10x faster by:

  • Being comprehensive: you learn the most important points in the book
  • Cutting out the fluff: you focus your time on what's important to know
  • Interactive exercises: apply the book's ideas to your own life with our educators' guidance.
51
WARNING! To avoid buying counterfeit on Amazon, click on "See All Buying Options" and choose "Amazon.com" and not a third-party seller.

Concise and to the point — the book can be read during a week. During that week, you will learn almost everything modern machine learning has to offer. The author and other practitioners have spent years learning these concepts.

Companion wiki — the book has a continuously updated wiki that extends some book chapters with additional information: Q&A, code snippets, further reading, tools, and other relevant resources.
more
Recommended by Kirk Borne, and 1 others.

Kirk BorneRecent top-selling books in #AI & #MachineLearning: https://t.co/Ij9I7SzR4d ————— #BigData #DataScience #DataMining #Algorithms #PredictiveAnalytics #Python ————— ...in the TOP 10: 1)The Hundred-Page ML Book: https://t.co/dQ7nP6gwP0 2)Hands-on ML with...: https://t.co/Y0Iz3GbtGP https://t.co/72rAFN1FwW (Source)

See more recommendations for this book...

52
Residents in Boston, Massachusetts are automatically reporting potholes and road hazards via their smartphones. Progressive Insurance tracks real-time customer driving patterns and uses that information to offer rates truly commensurate with individual safety. Google accurately predicts local flu outbreaks based upon thousands of user search queries. Amazon provides remarkably insightful, relevant, and timely product recommendations to its hundreds of millions of customers. Quantcast lets companies target precise audiences and key demographics throughout the Web. NASA runs contests via... more

See more recommendations for this book...

53
Convert the promise of big data into real world results There is so much buzz around big data. We all need to know what it is and how it works - that much is obvious. But is a basic understanding of the theory enough to hold your own in strategy meetings? Probably. But what will set you apart from the rest is actually knowing how to USE big data to get solid, real-world business results - and putting that in place to improve performance. Big Data will give you a clear understanding, blueprint, and step-by-step approach to building your own big data strategy. This is a... more

See more recommendations for this book...

55
SQL Made Easy- The Ultimate Step by Step Guide To Success Do you want to learn SQL programming without the complicated explanations?
Do you want to understand how to manage databases without all the confusion?
Well than, this is your go to guide to help you master SQL programming in no time!
This book breaks down the fundamentals elements that are essential to make you proficient in SQL programming and database management
By the end of this book you will be confident enough to take on any problems that encompass SQL
SQL software can be complex,...
more

See more recommendations for this book...

56

Storm Applied

57

Learning From Data

A Short Course

Machine learning allows computational systems to adaptively improve their performance with experience accumulated from the observed data. Its techniques are widely applied in engineering, science, finance, and commerce. This book is designed for a short course on machine learning. It is a short course, not a hurried course. From over a decade of teaching this material, we have distilled what we believe to be the core topics that every student of the subject should know. We chose the title `learning from data' that faithfully describes what the subject is about, and made it a point to cover... more

See more recommendations for this book...

58
A thought-provoking and wide-ranging exploration of machine learning and the race to build computer intelligences as flexible as our own
In the world's top research labs and universities, the race is on to invent the ultimate learning algorithm: one capable of discovering any knowledge from data, and doing anything we want, before we even ask. In The Master Algorithm, Pedro Domingos lifts the veil to give us a peek inside the learning machines that power Google, Amazon, and your smartphone. He assembles a blueprint for the future universal learner--the Master Algorithm--and...
more
Recommended by Vinod Khosla, and 1 others.

Vinod KhoslaIf you want speculation about what the master AI might need (one view). For a slightly more technical read, I’d suggest Ian Goodfellows Deep Learning. (Source)

See more recommendations for this book...

59
An algorithm is nothing more than a step-by-step procedure for solving a problem. The algorithms you'll use most often as a programmer have already been discovered, tested, and proven. If you want to take a hard pass on Knuth's brilliant but impenetrable theories and the dense multi-page proofs you'll find in most textbooks, this is the book for you. This fully-illustrated and engaging guide makes it easy for you to learn how to use algorithms effectively in your own programs.

Grokking Algorithms is a disarming take on a core computer science topic. In it, you'll learn how to apply...
more

See more recommendations for this book...

60
A technical book about popular space-efficient data structures and fast algorithms that are extremely useful in modern Big Data applications.

The purpose of this book is to introduce technology practitioners, including software architects and developers, as well as technology decision makers to probabilistic data structures and algorithms. Reading this book, you will get a theoretical and practical understanding of probabilistic data structures and learn about their common uses.
less

See more recommendations for this book...

Don't have time to read the top Big Data books of all time? Read Shortform summaries.

Shortform summaries help you learn 10x faster by:

  • Being comprehensive: you learn the most important points in the book
  • Cutting out the fluff: you focus your time on what's important to know
  • Interactive exercises: apply the book's ideas to your own life with our educators' guidance.
61
The definitive guide to statistical thinking
Statistics are everywhere, as integral to science as they are to business, and in the popular media hundreds of times a day. In this age of big data, a basic grasp of statistical literacy is more important than ever if we want to separate the fact from the fiction, the ostentatious embellishments from the raw evidence -- and even more so if we hope to participate in the future, rather than being simple bystanders.
In The Art of Statistics, world-renowned statistician David Spiegelhalter shows readers how to derive knowledge...
more
Recommended by Dan Davies, and 1 others.

Dan Davies@amoralelite @d_spiegel It's a great book. I thought it was like coming home because I've always tried to avoid calculation due to the dyspraxia and it was just "yes, that's how you think about it" (Source)

See more recommendations for this book...

62
Once considered tedious, the field of statistics is rapidly evolving into a discipline Hal Varian, chief economist at Google, has actually called “sexy.” From batting averages and political polls to game shows and medical research, the real-world application of statistics continues to grow by leaps and bounds. How can we catch schools that cheat on standardized tests? How does Netflix know which movies you’ll like? What is causing the rising incidence of autism? As best-selling author Charles Wheelan shows us in Naked Statistics, the right data and a few well-chosen statistical tools... more

See more recommendations for this book...

63
Work with petabyte-scale datasets while building a collaborative, agile workplace in the process. This practical book is the canonical reference to Google BigQuery, the query engine that lets you conduct interactive analysis of large datasets. BigQuery enables enterprises to efficiently store, query, ingest, and learn from their data in a convenient framework. With this book, you'll examine how to analyze data at scale to derive insights from large datasets efficiently.

Valliappa Lakshmanan, tech lead for Google Cloud Platform, and Jordan Tigani, engineering director for the...
more

See more recommendations for this book...

65

Hadoop Operations

If you've been asked to maintain large and complex Hadoop clusters, this book is a must. Demand for operations-specific material has skyrocketed now that Hadoop is becoming the de facto standard for truly large-scale data processing in the data center. Eric Sammer, Principal Solution Architect at Cloudera, shows you the particulars of running Hadoop in production, from planning, installing, and configuring the system to providing ongoing maintenance.

Rather than run through all possible scenarios, this pragmatic operations guide calls out what works, as demonstrated in critical...
more

See more recommendations for this book...

66

Pattern Recognition and Machine Learning

Pattern recognition has its origins in engineering, whereas machine learning grew out of computer science. However, these activities can be viewed as two facets of the same field, and together they have undergone substantial development over the past ten years. In particular, Bayesian methods have grown from a specialist niche to become mainstream, while graphical models have emerged as a general framework for describing and applying probabilistic models. Also, the practical applicability of Bayesian methods has been greatly enhanced through the development of a range of approximate inference... more

See more recommendations for this book...

67
Microsoft PowerPivot is a free add-on to Excel from Microsoft that allows users to produce new kinds of reports and analyses that were simply impossible before, and this book is the first to tackle DAX formulas, the core capability of PowerPivot, from the perspective of the Excel audience. Written by the world’s foremost PowerPivot blogger and practitioner, the book’s concepts and approach are introduced in a simple, step-by-step manner tailored to the learning style of Excel users everywhere. The techniques presented allow users to produce, in hours or even minutes, results that formerly... more

See more recommendations for this book...

68
Now expanded and updated with modern best practices, this is the most complete guide to Microsoft's DAX language for business intelligence, data modeling, and analytics. Expert Microsoft BI consultants Marco Russo and Alberto Ferrari help you master everything from table functions through advanced code and model optimization. You'll learn exactly what happens under the hood when you run a DAX expression, and use this knowledge to write fast, robust code. This edition focuses on examples you can build and run with the free Power BI Desktop, and helps you make the most of the powerful syntax of... more

See more recommendations for this book...

69

Programming Hive

Need to move a relational database application to Hadoop? This comprehensive guide introduces you to Apache Hive, Hadoop’s data warehouse infrastructure. You’ll quickly learn how to use Hive’s SQL dialect—HiveQL—to summarize, query, and analyze large datasets stored in Hadoop’s distributed filesystem.

This example-driven guide shows you how to set up and configure Hive in your environment, provides a detailed overview of Hadoop and MapReduce, and demonstrates how Hive works within the Hadoop ecosystem. You’ll also find real-world case studies that describe how companies have used...
more

See more recommendations for this book...

70
One of the world’s leading authorities on global security, Marc Goodman takes readers deep into the digital underground to expose the alarming ways criminals, corporations, and even countries are using new and emerging technologies against you—and how this makes everyone more vulnerable than ever imagined. 


Technological advances have benefited our world in immeasurable ways, but there is an ominous flip side: our technology can be turned against us. Hackers can activate baby monitors to spy on families, thieves are analyzing social media posts to plot home...
more

See more recommendations for this book...

Don't have time to read the top Big Data books of all time? Read Shortform summaries.

Shortform summaries help you learn 10x faster by:

  • Being comprehensive: you learn the most important points in the book
  • Cutting out the fluff: you focus your time on what's important to know
  • Interactive exercises: apply the book's ideas to your own life with our educators' guidance.
71
Looking to use Apache Flume to stream data to Hadoop? This complete reference guide shows operations engineers how to configure, deploy, and monitor a Flume cluster, and teaches developers how to write Flume plugins and custom components to their specific use-cases.

"Using Flume" describes the rich set of features that enable you to write data in any format supported by the Hadoop Distributed File System (HDFS) and HBase, including MapReduce, Hive, Impala, and Pig. You'll also learn about Flume's design and implementation, as well as various features that make Flume highly...
more

See more recommendations for this book...

72
A hilarious retelling of THE THREE LITTLE PIGS, with foil on the cover!

"Little fish, little fish, let me come in."
"Not by the skin of my finny fin fin!"
"Then I'll munch, and I'll crunch, and I'll smash your house in!"

Mama tells her three little fish that it's time to make their own homes. Jim builds his house of seaweed, but the big bad shark munches it up. Tim builds his house of sand, but the shark crunches it up. It's smart Kim who sets up house in an old sunken ship!

Children will delight in this silly story with funny, eye-popping...
more

See more recommendations for this book...

73

Uncharted

Big Data and an Emerging Science of Human History

One of the greatest untapped resources of today isn’t offshore oil or natural gas—it’s data. Gigabytes, exabytes (that’s one quintillion bytes) of data are sitting on servers across the world. So how can we start to access this explosion of information, this “big data,” and what can it tell us?
 
Erez Aiden and Jean-Baptiste Michel are two young scientists at Harvard who started to ask those questions. They teamed up with Google to create the Ngram Viewer, a Web-based tool that can chart words throughout the massive Google Books archive, sifting through billions of words to find...
more

See more recommendations for this book...

74

Algorithm Design Manual

Recommended by Hadley Wickham, and 1 others.

Hadley WickhamThis book is an illustration of the power of names in the Google era. (Source)

See more recommendations for this book...

75

Doing Data Science

Now that people are aware that data can make the difference in an election or a business model, data science as an occupation is gaining ground. But how can you get started working in a wide-ranging, interdisciplinary field that’s so clouded in hype? This insightful book, based on Columbia University’s Introduction to Data Science class, tells you what you need to know.

In many of these chapter-long lectures, data scientists from companies such as Google, Microsoft, and eBay share new algorithms, methods, and models by presenting case studies and the code they use. If you’re...
more

See more recommendations for this book...

76
Adeptly address today's business challenges with this powerful new book from web analytics thought leader Avinash Kaushik. "Web Analytics 2.0" presents a new framework that will permanently change how you think about analytics. It provides specific recommendations for creating an actionable strategy, applying analytical techniques correctly, solving challenges such as measuring social media and multichannel campaigns, achieving optimal success by leveraging experimentation, and employing tactics for truly listening to your customers. The book will help your organization become more data... more
Recommended by Raluca Radu, and 1 others.

Raluca RaduI work in digital marketing so I would [recommend]: [...] Web Analytics: An Hour A Day and Web Analytics 2.0 by Avinash Kaushik. (Source)

See more recommendations for this book...

77
Do you want to build web pages but have no prior experience? This friendly guide is the perfect place to start. You'll begin at square one, learning how the web and web pages work, and then steadily build from there. By the end of the book, you'll have the skills to create a simple site with multicolumn pages that adapt for mobile devices.

Each chapter provides exercises to help you learn various techniques and short quizzes to make sure you understand key concepts.

This thoroughly revised edition is ideal for students and professionals of all backgrounds and skill...
more
Recommended by Jakob Nielsen, and 1 others.

See more recommendations for this book...

78
Cowritten by Ralph Kimball, the world's leading data warehousing authority, whose previous books have sold more than 150,000 copies. Delivers real-world solutions for the most time- and labor-intensive portion of data warehousing-data staging, or the extract, transform, load (ETL) process. Delineates best practices for extracting data from scattered sources, removing redundant and inaccurate data, transforming the remaining data into correctly formatted data structures, and then loading the end product into the data warehouse. Offers proven time-saving ETL techniques, comprehensive guidance... more

See more recommendations for this book...

79
Summary

Kafka Streams in Action teaches you everything you need to know to implement stream processing on data flowing into your Kafka platform, allowing you to focus on getting more from your data without sacrificing time or effort.

Foreword by Neha Narkhede, Cocreator of Apache Kafka

Purchase of the print book includes a free eBook in PDF, Kindle, and ePub formats from Manning Publications.

About the Technology

Not all stream-based applications require a dedicated processing cluster. The lightweight Kafka Streams...
more

See more recommendations for this book...

80
Over time we may forget some password, bank account number, keywords, social media, bills or online account info. It is a big trouble for many people. This notebook is a good helper for you to keep all your password information together and secure. The Book Contains: @60 pages @5" x 8" Buy it and Save your passwords today! less

See more recommendations for this book...

Don't have time to read the top Big Data books of all time? Read Shortform summaries.

Shortform summaries help you learn 10x faster by:

  • Being comprehensive: you learn the most important points in the book
  • Cutting out the fluff: you focus your time on what's important to know
  • Interactive exercises: apply the book's ideas to your own life with our educators' guidance.
81
"The authors' clear visual style provides a comprehensive look at what's currently possible with artificial neural networks as well as a glimpse of the magic that's to come."
--Tim Urban, author of Wait But Why Fully Practical, Insightful Guide to Modern Deep Learning

Deep learning is transforming software, facilitating powerful new artificial intelligence capabilities, and driving unprecedented algorithm performance. Deep Learning Illustrated is uniquely intuitive and offers a complete introduction to the discipline's techniques. Packed with...
more
Recommended by Kirk Borne, and 1 others.

Kirk Borne🌟📘📊📈Awesome new book >> #DeepLearning Illustrated — A Visual, Interactive Guide to Artificial Intelligence” https://t.co/xIW48MskrR by @JonKrohnLearns ——————— #BigData #Analytics #DataScience #AI #MachineLearning #Algorithms #NeuralNetworks https://t.co/JKSrVRLpS0 (Source)

See more recommendations for this book...

82
Learn the skills necessary to design, build, and deploy applications powered by machine learning. Through the course of this hands-on book, you'll build an example ML-driven application from initial idea to deployed product. Data scientists, software engineers, and product managers with little or no ML experience will learn the tools, best practices, and challenges involved in building a real-world ML application step-by-step.

Author Emmanuel Ameisen, who worked as a data scientist at Zipcar and led Insight Data Science's AI program, demonstrates key ML concepts with code snippets,...
more

See more recommendations for this book...

83

Access 2019 Bible

Master database creation and management Access 2019 Bible is your, comprehensive reference to the world's most popular database management tool. With clear guidance toward everything from the basics to the advanced, this go-to reference helps you take advantage of everything Access 2019 has to offer. Whether you're new to Access or getting started with Access 2019, you'll find everything you need to know to create the database solution perfectly tailored to your needs, with expert guidance every step of the way. The companion website features all examples and databases used in... more

See more recommendations for this book...

84
Everyone knows that abuse of statistics is rampant in popular media. Politicians and marketers present shoddy evidence for dubious claims all the time. But smart people make mistakes too, and when it comes to statistics, plenty of otherwise great scientists--yes, even those published in peer-reviewed journals--are doing statistics wrong.

"Statistics Done Wrong" comes to the rescue with cautionary tales of all-too-common statistical fallacies. It'll help you see where and why researchers often go wrong and teach you the best practices for avoiding their mistakes.

In this...
more

See more recommendations for this book...

85

Build a Career in Data Science

Build a Career in Data Science is your guide to getting your first data science job, then quickly becoming a senior employee. Industry experts Jacqueline Nolis and Emily Robinson lay out the soft skills you’ll need alongside your technical know-how in order to succeed in the field. Following their clear and simple instructions you’ll craft a resume that hiring managers will love, learn how to ace your interview, and ensure you hit the ground running in your first months at your new job. Once you’ve gotten your foot in the door, learn to thrive as a data scientist by handling high... more

See more recommendations for this book...

86
FINANCIAL TIMES BOOK OF THE MONTH 

FROM THE WALL STREET JOURNAL: "Nothing Mr. Gilder says or writes is ever delivered at anything less than the fullest philosophical decibel..Mr. Gilder sounds less like a tech guru than a poet, and his words tumble out in a romantic cascade."

“Google’s algorithms assume the world’s future is nothing more than the next moment in a random process. George Gilder shows how deep this assumption goes, what motivates people to make it, and why it’s wrong: the future depends on human action.”...
more
Recommended by Dominic Steil, and 1 others.

Dominic Steil[One of the five books recommends to young people interested in his career path.] (Source)

See more recommendations for this book...

87

Data Feminism

A new way of thinking about data science and data ethics that is informed by the ideas of intersectional feminism.

Today, data science is a form of power. It has been used to expose injustice, improve health outcomes, and topple governments. But it has also been used to discriminate, police, and surveil. This potential for good, on the one hand, and harm, on the other, makes it essential to ask: Data science by whom? Data science for whom? Data science with whose interests in mind? The narratives around big data and data science are overwhelmingly white, male, and techno-heroic. In...
more

See more recommendations for this book...

88
There's a lot of information about big data technologies, but splicing these technologies into an end-to-end enterprise data platform is a daunting task not widely covered. With this practical book, you'll learn how to build big data infrastructure both on-premises and in the cloud and successfully architect a modern data platform.

Ideal for enterprise architects, IT managers, application architects, and data engineers, this book shows you how to overcome the many challenges that emerge during Hadoop projects. You'll explore the vast landscape of tools available in the Hadoop and...
more

See more recommendations for this book...

90

R is the world's most popular language for developing statistical software: Archaeologists use it to track the spread of ancient civilizations, drug companies use it to discover which medications are safe and effective, and actuaries use it to assess financial risks and keep economies running smoothly.

The Art of R Programming takes you on a guided tour of software development with R, from basic types and data structures to advanced topics like closures, recursion, and anonymous functions. No statistical knowledge is required, and your programming skills can range from...

more

See more recommendations for this book...

Don't have time to read the top Big Data books of all time? Read Shortform summaries.

Shortform summaries help you learn 10x faster by:

  • Being comprehensive: you learn the most important points in the book
  • Cutting out the fluff: you focus your time on what's important to know
  • Interactive exercises: apply the book's ideas to your own life with our educators' guidance.
91
Sales and service are being radically redefined like never before. With buyers now in possession of unlimited information, online content is quickly becoming the dominant driver for commerce. Today anyone working in sales or customer service needs to possess entirely new skills. Unfortunately most organizations are still using traditional selling and service models developed for a different time.

In this new book by the author of the #1 bestseller The New Rules of Marketing & PR, David Meerman Scott demystifies the new digital commercial landscape and offers inspiring...
more

See more recommendations for this book...

92
Sams Teach Yourself SQL in 10 Minutes offers straightforward, practical answers when you need fast results.
By working through the book's 22 lessons of 10 minutes or less, you'll learn what you need to know to take advantage of the SQL language.
Lessons cover IBM DB2, Microsoft SQL Server and SQL Server Express, MariaDB, MySQL, Oracle and Oracle express, PostgreSQL, and SQLite.
Full-color code examples help you understand how SQL statements are structured
Tips point out shortcuts and solutions
Cautions help you avoid common...
more

See more recommendations for this book...

93
Apache Spark is amazing when everything clicks. But if you haven't seen the performance improvements you expected, or still don't feel confident enough to use Spark in production, this practical book is for you. Authors Holden Karau and Rachel Warren demonstrate performance optimizations to help your Spark queries run faster and handle larger data sizes, while using fewer resources.

Ideal for software engineers, data engineers, developers, and system administrators working with large-scale data applications, this book describes techniques that can reduce data infrastructure costs...
more

See more recommendations for this book...

95
Leverage big data to add value to your business

Social media analytics, web-tracking, and other technologies help companies acquire and handle massive amounts of data to better understand their customers, products, competition, and markets. Armed with the insights from big data, companies can improve customer experience and products, add value, and increase return on investment. The tricky part for busy IT professionals and executives is how to get this done, and that's where this practical book comes in. Big Data: Understanding How Data Powers Big Business is a...
more

See more recommendations for this book...

97
What is bad data? Some people consider it a technical phenomenon, like missing values or malformed records, but bad data includes a lot more. In this handbook, data expert Q. Ethan McCallum has gathered 19 colleagues from every corner of the data arena to reveal how they’ve recovered from nasty data problems.

From cranky storage to poor representation to misguided policy, there are many paths to bad data. Bottom line? Bad data is data that gets in the way. This book explains effective ways to get around it.

Among the many topics covered, you’ll discover how to:
more

See more recommendations for this book...

98
The Digital Humanities have arrived at a moment when digital Big Data is becoming more readily available, opening exciting new avenues of inquiry but also new challenges. This pioneering book describes and demonstrates the ways these data can be explored to construct cultural heritage knowledge, for research and in teaching and learning. It helps humanities scholars to grasp Big Data in order to do their work, whether that means understanding the underlying algorithms at work in search engines, or designing and using their own tools to process large amounts of information.Demonstrating what... more

See more recommendations for this book...

99

Cassandra High Availability

If you are a developer or DevOps engineer who understands the basics of Cassandra and are ready to take your knowledge to the next level, then this book is for you. Perhaps you've just got your hands dirty working on a small application, but now you have a use case that demands greater scale and zero downtime. An understanding of the essentials of Cassandra is needed, including knowing how to install and configure Cassandra, create tables, and read and write data. less

See more recommendations for this book...

100

A New Kind of Science

Recommended by Steve Jurvetson, Dan Sullivan, and 2 others.

Steve JurvetsonA series of epiphanies from [the author] and others that the world is really interesting when you look at iterative algorithms applied millions and billions of times. (Source)

See more recommendations for this book...

Don't have time to read the top Big Data books of all time? Read Shortform summaries.

Shortform summaries help you learn 10x faster by:

  • Being comprehensive: you learn the most important points in the book
  • Cutting out the fluff: you focus your time on what's important to know
  • Interactive exercises: apply the book's ideas to your own life with our educators' guidance.