Introduction to Multivariate Statistics

This post is the first in a series on multivariate statistics! First things first: what is multivariate statistics? It is a set of statistical methods used in situations where several variables are measured simultaneously on each sample element (or each observation). The highlighted expressions are important, so let's look at a…

Apache Cassandra and CAP Theorem: a brief note

The CAP theorem states that a distributed system can guarantee at most two of three properties at the same time: consistency (C), availability (A) and partition tolerance (P). The principles used in relational and NoSQL databases are derived from this theorem. [1] ACID and BASE are the consistency models used, essentially, in relational and NoSQL databases, respectively. Yes, these acronyms are…

Google’s Bigtable: paper review

This post analyses the following paper, published in 2006: Bigtable: A Distributed Storage System for Structured Data, 7th USENIX Symposium on Operating Systems Design and Implementation (OSDI), by Fay Chang, Jeffrey Dean, Sanjay Ghemawat, Wilson C. Hsieh, Deborah A. Wallach, Mike Burrows, Tushar Chandra, Andrew Fikes and Robert E. Gruber. Key points: Proposal: Bigtable, a distributed…

Let’s Data Podcast #001 – Pedro Albuquerque

Release of the podcast, with a great guest to talk about the data scientist profile, opportunities abroad and projects involving Machine Learning. Remark: the podcast is spoken in Brazilian Portuguese. In the first episode of Let’s Data Podcast, we talked to Pedro Albuquerque, who holds undergraduate and master's degrees in Statistics, a doctorate in Administration, and a postdoc in Machine…

Disclaimer

Hello, welcome to the very first post of my website! 👋 First of all, I want to note that this is my personal blog, where I share ideas about data science and artificial intelligence (including related topics like databases, statistics, maths, programming languages, etc.) and thoughts about other subjects that I’m interested in, like books and…