Apache Cassandra and CAP Theorem: a brief note

The CAP theorem states that a distributed system can guarantee at most two of three properties at the same time: consistency (C), availability (A) and partition tolerance (P). The principles used in relational and NoSQL databases are derived from this theorem. [1] ACID and BASE are the consistency models used, essentially, in relational and NoSQL databases, respectively. Yes, these acronyms are…

Google’s Bigtable: paper review

This post analyses the following paper, published in 2006: Bigtable: A Distributed Storage System for Structured Data, 7th USENIX Symposium on Operating Systems Design and Implementation (OSDI), by Fay Chang, Jeffrey Dean, Sanjay Ghemawat, Wilson C. Hsieh, Deborah A. Wallach, Mike Burrows, Tushar Chandra, Andrew Fikes and Robert E. Gruber. Key points. Proposal: Bigtable, a distributed…

Let’s Data Podcast #001 – Pedro Albuquerque

Release of the podcast, with a great guest to talk about the data scientist profile, opportunities abroad and projects involving Machine Learning. Remark: the podcast is spoken in Brazilian Portuguese. In the first episode of Let’s Data Podcast, we talked to Pedro Albuquerque, who holds a bachelor's and a master's degree in Statistics, a doctorate in Administration, and a postdoc in Machine…

Disclaimer

Hello, welcome to the very first post of my website! 👋 First of all, I want to note that this is my personal blog, where I share ideas about data science and artificial intelligence (including related topics like databases, statistics, maths, programming languages, etc.) and thoughts about other subjects that I’m interested in, like books and…