Power BI Embedded Walk Through Part 1 of 3

Power BI Embedded is an interesting approach that lets web applications embed Power BI reports so that the application's users do not need an Office 365 account or a user license. While trying to figure out how to build a simple demo for myself, I found the documentation to be fragmented. I will provide a …

Building a Spark Application for HDInsight using IntelliJ Part 2 of 2

This post continues from my blog article Building a Spark Application for HDInsight using IntelliJ Part 1 of 2, which outlines my experience installing IntelliJ and the other dependent SDKs and creating an HDInsight project. To add some code, right-click src and create a Scala class. Project folders and MainApp Scala code: Logic: Read from csv file in …

Building a Spark Application for HDInsight using IntelliJ Part 1 of 2

For developers with a Microsoft .NET background who want to get familiar with building Spark applications in the Scala programming language, this blog post series walks through installing the development tools, building a simple Spark application, and submitting it to an HDInsight Spark cluster. My HDInsight configuration is Spark 2.0 (HDI 3.5) with …

Re-Create HDInsight Cluster with Pre-Existing Azure Data Lake Store and Hive Metastore

An HDInsight cluster created for big data analytics development can be costly, at around $4 an hour. This adds up, especially when the cluster sits idle most of the time. To save on costs, I delete the cluster and re-create it when needed. For simple development purposes, my HDInsight cluster depends on Azure Data …

Text Analytics of Movie Reviews using Azure Data Lake, Cognitive Services and Power BI (part 1 of 2)

Part 1 of 2: Text Analytics of Movie Reviews using Azure Data Lake, Cognitive Services and Power BI (part 1 of 2). Take a csv file and analyze it with a U-SQL script in Azure Data Lake.
Part 2 of 2: Text Analytics of Movie Reviews using Azure Data Lake, Cognitive Services and Power BI (part 2 of 2) …

HiveQL Group By and Views with Visual Studio and HDInsight

This article is for beginners looking to understand the developer experience in Visual Studio when working with Hive tables in HDInsight. I developed the following HiveQL statements. My cluster is an HDInsight Spark 2.0 cluster. Before executing these statements, I have the database and tables in place. The crimes table data looks like this. Let’s query the table with …

Azure Data Factory: Copy Data from Azure Data Lake Store to Azure SQL

Objective: From Azure Data Lake Store, copy data in a .tsv file to a database table in Azure SQL DB. My take on Azure Data Factory is that it resembles ETL with less emphasis on transformation: it moves data from on-premises and cloud sources to Azure data-oriented services. Read further about Azure Data Factory: https://docs.microsoft.com/en-us/azure/data-factory/data-factory-introduction In …

Query Hive Tables with Ambari Hive Views in HDInsight

This is an introductory walk-through of querying Hive tables and visualizing the data in the Ambari Hive View. It is an alternative to building and debugging HiveQL in Visual Studio with the Azure Data Lake Tools plugin. In my blog article Populating data into hive tables, I demonstrated populating internal and external Hive …