Blog

April 18, 2024 3 min read

Advanced Pandas GroupBy and Window Functions for Data Engineering and Analytics

Session Outline: Advanced GroupBy and Window Functions in Pandas 1. Introduction to GroupBy and Its Role in Data Engineering (5 minutes) Overview: Recap of GroupBy as a split-apply-combine paradigm, as defined by Wes McKinney in Python for Data …
November 1, 2020 2 min read

Time Intelligence in PowerBI

Data Analysis Expressions (DAX) includes time-intelligence functions that enable you to manipulate data using time periods, including days, months, quarters, and years, and then build and compare calculations over those periods. Data Model The date …
April 26, 2020 1 min read

Handle multiple datetime format representations using lubridate

Handling Heterogeneous datetime formats We'll be using the parse_date_time function of the lubridate package. Multiple datetime formats can be passed via the orders argument of that function, as shown below - library(lubridate) dates <- c( …
April 24, 2020 2 min read

Download Data from a GitHub folder using the httr package in R

Import Libraries and Request to the master branch of git The following chunk of code imports the required libraries and requests the master branch of the required Git repository. library(httr) library(tidyverse) req <- …
December 1, 2019 2 min read

Hackerrank SQL - Advanced Select

The following problems are discussed here. To solve them yourself, please visit the HackerRank website. New Companies Binary Tree Nodes Occupations The PADS Type of Triangle New Companies select company.company_code, company.founder, …
December 1, 2019 1 min read

Hackerrank SQL - Aggregation

The following problems are discussed here. To solve them yourself, please visit the HackerRank website. Weather Observation Station 17 Weather Observation Station 18 Weather Observation Station 19 Weather Observation Station 20 …
September 15, 2019 1 min read

List, Create and Move Folder within Azure Datalake Store Gen1 using R and Rest API

The following code snippets create a connection to Azure Data Lake Storage Gen1 from R, using service-to-service authentication with a client ID and client secret over the REST API, to list, create and move any folder in an Azure Data Lake Store …
September 6, 2019 1 min read

Read and Write Excel from Azure Datalake Store using R and Rest API

The following code snippets create a connection to Azure Data Lake Storage Gen1 from R, using service-to-service authentication with a client ID and client secret over the REST API, and read and write an Excel file. Import Prerequisite …
September 2, 2019 2 min read

Access Azure Active Directory Groups using R

The following code snippets create a connection to Azure Active Directory from R, using service-to-service authentication with a client ID and client secret over the REST API, and fetch Active Directory's groups and their associated details …
August 26, 2019 3 min read

Part 5 - Plotting Using Seaborn - Radar

Introduction and Data preparation Please see the following links for data preparation and the previous posts to follow along - For Data Preparation - Part 0 - Plotting Using Seaborn - Data Preparation For Part 1 - …