Data models should be filtered and normalized. This is particularly important if datasets are sourced from high data volume repositories such as enterprise data warehouses.
Poem: Marriage
Marriage is a journey that begins with two hearts beating as one A bond of love that goes through trials and pains, but is never undone.
Which Jobs will not be affected by AI?
AI has become the Buzzword of the 21st century, anyone who even has a little idea of what AI can do is either excited, nervous, or fearful of it, and rightly so.
Which jobs are Recession Proof?
Recession in simple terms is a slowdown or massive decline of economic activities. A country is said to be in recession if there is a decline in activities like consumption, investment, government spending, and net export. If you are following the news, then you must be aware that countries like UK and USA are slowing moving... Continue Reading →
Poem: My dearest wife
For my dearest wife, the love of my life,A poem from my heart, to you, my wife. Your eyes, like stars, twinkle bright,With love and warmth, that shine through the night. Your smile, a beacon of joy and grace,Fills my heart with a sweet embrace. Your touch, gentle as a summer breeze,Brings me comfort, and... Continue Reading →
Poem: May it never end
Sleeping by your side every night, waking up next to you every day, may it never end. Sitting next to each other, not talking but still enjoying each others company, may it never end. Fighting for hours, but making up with just a simple hug, may it never end. Seldom agreeing on one point, but... Continue Reading →
How to handle duplicate records while inserting data in Databricks
Have you ever faced a challenge where records keep getting duplicated when you are inserting some new data into an existing table in Databricks? If yes, then this blog is for you. Let’s start with a simple use case: Inserting parquet data from one folder in Datalake to a Delta table using Databricks. Follow the... Continue Reading →
To be fit or to be thin?
To be fit or to be thin. Which one will you prefer? Above question will divide people into two categories: Wondering how both are different. They already know about the difference but unable to practice it in their lifestyles. This Blog is for both categories of people. Let’s start with the basics, what do you... Continue Reading →
How to Copy Files from SharePoint to Datalake using Azure Data factory
Copying files from SharePoint to Datalake or any other target location is one task you cannot ignore as a Data Engineer. Someday someone for sure is going to ask you to do that. So, how can you achieve that? Suppose we need to copy an excel file from SharePoint to Datalake Gen 2. You need... Continue Reading →
Poem: Worth it
The path is filled with thorns of pain and suffering,I am still going on; believing your love will be its ending. Each step I take, I meet with countless hurdles and loathing,I am still going on; keeping aside the fear of failing. Don't know how, when or whether there will be any benefit, but I... Continue Reading →
Poem: I wish
I wish we could start again. Like our relationship once was, simple and innocent, filled with respect, understanding and excitement. I could forget your ignorance, you could forgive my offence,I could see your benevolence, you could feel my remorse,I could be lucky again to meet you; you could again consider me your best decision. I... Continue Reading →
How to read .xlsx file in Databricks using Pandas
Step 1: In order to read .xlsx file, you need to have the library openpyxl installed in the Databricks cluster. Steps to install library openpyxl to Databircks cluster: Step 1: Select the Databricks cluster where you want to install the library. Step 2: Click on Libraries. Step 3: Click on Install New. Step 4: Select PyPI. Step 5: Put openpyxl in the text box under Package... Continue Reading →