To be fit or to be thin: which would you prefer? This question divides people into two categories: those wondering how the two are different, and those who already know the difference but are unable to practice it in their lifestyles. This blog is for both categories of people. Let’s start with the basics, what do you... Continue Reading →
How to Copy Files from SharePoint to Datalake using Azure Data Factory
Copying files from SharePoint to Datalake or any other target location is a task you cannot avoid as a Data Engineer. Sooner or later, someone is going to ask you to do it. So, how can you achieve that? Suppose we need to copy an Excel file from SharePoint to Datalake Gen 2. You need... Continue Reading →
Poem: Worth it
The path is filled with thorns of pain and suffering, I am still going on; believing your love will be its ending. Each step I take, I meet with countless hurdles and loathing, I am still going on; keeping aside the fear of failing. Don't know how, when or whether there will be any benefit, but I... Continue Reading →
Poem: I wish
I wish we could start again. Like our relationship once was, simple and innocent, filled with respect, understanding and excitement. I could forget your ignorance, you could forgive my offence, I could see your benevolence, you could feel my remorse, I could be lucky again to meet you; you could again consider me your best decision. I... Continue Reading →
How to read .xlsx file in Databricks using Pandas
Step 1: To read an .xlsx file, you need the openpyxl library installed on the Databricks cluster. Steps to install openpyxl on a Databricks cluster: Step 1: Select the Databricks cluster where you want to install the library. Step 2: Click on Libraries. Step 3: Click on Install New. Step 4: Select PyPI. Step 5: Put openpyxl in the text box under Package... Continue Reading →
Basic concepts anyone starting in SQL should know about
SQL stands for Structured Query Language. SQL is a database management language for relational databases. SQL lets you access and manipulate databases. SQL keywords are not case sensitive (SELECT = select). Division of SQL: Data Manipulation Language (DML) is used to add, update or delete data. Examples: INSERT, DELETE and UPDATE. Data Definition Language (DDL) is used... Continue Reading →
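As a rough illustration of the DDL/DML split and of keyword case-insensitivity, here is a minimal sketch using Python's built-in sqlite3 module. SQLite is only a convenient stand-in for a relational database, and the `employees` table, its columns and the sample rows are hypothetical names made up for this example.

```python
import sqlite3

# In-memory database, so nothing is written to disk.
conn = sqlite3.connect(":memory:")

# DDL: defines the structure of the data (hypothetical table).
conn.execute("CREATE TABLE employees (id INTEGER PRIMARY KEY, name TEXT, city TEXT)")

# DML: adds, updates and deletes rows.
conn.execute("INSERT INTO employees (name, city) VALUES ('Asha', 'Pune')")
# Keywords written in lowercase behave exactly the same.
conn.execute("insert into employees (name, city) values ('Ravi', 'Delhi')")
conn.execute("UPDATE employees SET city = 'Mumbai' WHERE name = 'Asha'")
conn.execute("DELETE FROM employees WHERE name = 'Ravi'")

# Query what is left after the statements above.
rows = conn.execute("select name, city from employees").fetchall()
print(rows)

conn.close()
```

Note that only the keywords are case-insensitive here; whether string values such as 'Asha' and 'asha' compare as equal depends on the database and its collation settings.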
Fashion: What actually is Fashion?
Fashion is another name for confidence and comfort. Whatever you wear, wear it as if you were born in it.
How to Decrypt PGP Encrypted files in Databricks
As a Data Engineer, you may come across a project where you need to decrypt PGP-encrypted files in order to get the data and apply transformation logic to it.
Simple Machine Learning Program in Python to Use Linear Regression to predict House Price.
Requirement: The main requirement is to be able to predict the house price in a given location based on the age of the house, its distance from the airport and the number of convenience stores within a 5 km radius.
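To make the requirement concrete, here is a minimal sketch of linear regression on those three features in plain Python. It is not the program from the post: it solves the least-squares normal equations directly instead of using a machine learning library, and the data is synthetic, generated from an assumed formula (price = 100 - 0.5 * age - 2 * distance + 3 * stores) purely for illustration. The names `fit_linear_regression` and `predict` are hypothetical helpers.

```python
from typing import List, Sequence


def fit_linear_regression(X: Sequence[Sequence[float]], y: Sequence[float]) -> List[float]:
    """Ordinary least squares via the normal equations (A^T A) w = A^T y.

    Returns [intercept, coef_1, ..., coef_k].
    """
    # Prepend a constant 1.0 to every row so the first weight is the intercept.
    A = [[1.0, *row] for row in X]
    n = len(A[0])

    # Build the augmented matrix [A^T A | A^T y].
    M = [
        [sum(r[i] * r[j] for r in A) for j in range(n)]
        + [sum(r[i] * t for r, t in zip(A, y))]
        for i in range(n)
    ]

    # Gauss-Jordan elimination with partial pivoting.
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(M[r][col]))
        if abs(M[pivot][col]) < 1e-12:
            raise ValueError("features are linearly dependent; cannot fit")
        M[col], M[pivot] = M[pivot], M[col]
        pivot_value = M[col][col]
        M[col] = [v / pivot_value for v in M[col]]
        for r in range(n):
            if r != col:
                factor = M[r][col]
                M[r] = [a - factor * b for a, b in zip(M[r], M[col])]

    return [M[i][n] for i in range(n)]


def predict(weights: Sequence[float], features: Sequence[float]) -> float:
    """Apply the fitted weights to one row of features."""
    return weights[0] + sum(w * x for w, x in zip(weights[1:], features))


# Synthetic rows: (age in years, distance from airport in km, convenience stores within 5 km).
X = [(5, 10, 3), (20, 2, 8), (12, 25, 1), (30, 5, 5), (1, 15, 0), (8, 8, 10)]
# Synthetic prices produced by the assumed formula above (arbitrary units).
y = [86.5, 110.0, 47.0, 90.0, 69.5, 110.0]

weights = fit_linear_regression(X, y)
estimate = predict(weights, (10, 5, 4))
print("weights:", weights)
print("estimated price:", estimate)
```

Because the synthetic prices follow the assumed formula exactly, the fitted weights should land very close to the formula's coefficients. Real housing data is noisy, so a fit on it would only approximate the underlying relationship.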
How to Create Login and Password in SQL Server
This post will help you create a new login and password for users who want to use the data in a SQL database or warehouse for analysis and reporting.
How to read .csv and .xlsx file in Databricks
How to read .xlsx file: Step 1: To read an .xlsx file, you need the library com.crealytics:spark-excel_2.11:0.12.2 installed on the Databricks cluster. Steps to install library com.crealytics:spark-excel_2.11:0.12.2 on a Databricks cluster: Step 1: Select the Databricks cluster where you want to install the library. Step 2: Click on Libraries. Step 3: Click on... Continue Reading →
Everything you need to know about Cloud Computing
Cloud Computing refers to the process of making computing services available to users over the internet.