Azure DP-900: Microsoft Azure Data Fundamentals | Notes and Highlights

Just passed Azure DP-900 exam!

Here is what and how I did it:

And here are all notes I’ve taken during my study process:

Document Data Store

  • All data should be stored and retrieved from a single-data-structure
  • In a single document, use to store user profile data, like shipping addresses, user preferences
  • Each user can have different number of addresses and also different number of preferences
  • Document database consists of entities that have their related data stored in a single document
  • Documents supports flexible schema
  • Usually JSON format documents
  • Document data store is a type of a non-relational data store

Azure Storage

  • Enable hierarchical namespace to implement folder and directory level security in Azure Storage.
  • Hierarchical namespace allows you to organize blob containers in folders and directories
  • Hierarchical namespace allows to define PSIX compatible permissions to container
  • Hierarchical namespace allows to define RBAC to container

Azure SQL Database

  • Azure SQL Database has lower cost compared against Azure SQL Managed Instance
  • Azure SQL Database has minimal administration compared against Azure SQL Managed Instance
  • Azure SQL Database has higher availability ( 99.995% ) compared against Azure SQL Managed Instance ( 99.99% )
  • Azure SQL Database is a relational database for modern cloud apps
  • Azure SQL Database is PAAS
  • Azure SQL Database eliminate the need to manage the SQL Server configuration
  • Azure SQL Database is not as compatible to on-premises as Azure SQL Managed Instance is
  • Azure SQL Database does not allow you to choose the SQL Server engine version
  • Azure SQL Database uses the lasted stable version of the SQL Server engine
  • Azure SQL Database includes a fully managed backup service
  • Azure SQL Database automatically applies security patches on underlying operational system

Graph Database

  • NODE is an instance of data entities, like people on companies
  • EDGE are relationships between instances of data entities
  • PROPERTY are attributes of data entities, like person email, address
  • DIRECTIONS are property of an Edge. Edges can be directed or undirected
  • Graph database store semi-structured data

Azure HDInsight

  • Azure HDInsight is a big data processing service
  • Azure HDInsight is used to provision and manage a cluster of open-source analytics solution
    • like Apache spark, Hadoop, Kafka
  • Azure HDInsight is compatible with Apache Spark

Azure Data Explorer

  • Uses KQL language

Azure Table Storage

  • Azure Table Storage is composed by row key + partition key
  • Azure Table Storage number of columns may vary according to each row
  • Azure Table Storage is a NoSQL data store, uses unstructured data
  • Azure Table Storage is a key/value data store
  • Azure Table Storage must have a unique key ( row key + partition key combination )
  • Azure Table Storage number of column per row can vary
  • Azure Table Storage partition key: identify partition containing in the row
  • Azure Table Storage row key: is unique to each row in the same partition

Key / Value Data Store

  • Key / Value data store can receive real-time events from multiple sources which may vary in schema
  • Key / Value data store works like a lookup
  • Key / Value data store find rows by using unique key
  • Key / Value data store is essentially a large hash-table
  • Key / Value data store is optimized for fast data writing
  • Key / Value data store each data row is referred by a single key-value

Spark Structured Streaming

  • Spark Structured Streaming can be used with: Azure Synapse Analytics, Azure DAtabricks, Azure HDInsight
  • Spark Structured Streaming is open-source library for apache-spark based services
  • Spark Structured Streaming is a near real-time processing engine

Azure Cosmos DB

  • Azure Cosmos DB allows simpler queries to retrieve data than a relational DB
  • Azure Cosmos DB supports app with global user base
  • Azure Cosmos DB is a NoSQL database
  • Azure Cosmos DB supports storing data in denormalized way
  • Azure Cosmos DB suppoarts multi-region writes
  • Azure Cosmos DB general resource hierarchy is: 1 azure cosmos account > 2 database > 3 container > 4 item

Published by Pedro Carvalho

Apaixonado por análise de dados e Power BI

Deixe uma resposta

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out /  Change )

Facebook photo

You are commenting using your Facebook account. Log Out /  Change )

Connecting to %s

%d bloggers like this: