13. What are content packs? Content packs are containers for dashboards, reports, and datasets that can be shared with many users. The key feature of a content pack is that when you create a report and add it to a content pack, other users can view your report, make a copy of it, and change their own copy, while your original report stays safe and unaltered.
Which Jobs will not be affected by AI?
AI has become the buzzword of the 21st century. Anyone with even a little idea of what AI can do is excited, nervous, or fearful of it, and rightly so.
Which jobs are Recession Proof?
A recession, in simple terms, is a slowdown or massive decline in economic activity. A country is said to be in recession if there is a decline in activities like consumption, investment, government spending, and net exports. If you are following the news, then you must be aware that countries like the UK and the USA are slowly moving...
How to handle duplicate records while inserting data in Databricks
Have you ever faced a challenge where records keep getting duplicated when you insert new data into an existing table in Databricks? If yes, then this blog is for you. Let’s start with a simple use case: inserting Parquet data from one folder in Datalake into a Delta table using Databricks. Follow the...
How to Copy Files from SharePoint to Datalake using Azure Data factory
Copying files from SharePoint to Datalake or any other target location is one task you cannot ignore as a Data Engineer. Someday, someone is sure to ask you to do it. So, how can you achieve that? Suppose we need to copy an Excel file from SharePoint to Datalake Gen 2. You need...
How to read .xlsx file in Databricks using Pandas
Step 1: To read an .xlsx file, you need to have the library openpyxl installed in the Databricks cluster. Steps to install the library openpyxl in the Databricks cluster: Step 1: Select the Databricks cluster where you want to install the library. Step 2: Click on Libraries. Step 3: Click on Install New. Step 4: Select PyPI. Step 5: Put openpyxl in the text box under Package...
How to Decrypt PGP Encrypted files in Databricks
As a Data Engineer, you may come across a project where you need to decrypt PGP-encrypted files in order to get the data and apply transformation logic to it.
Simple Machine Learning Program in Python to Use Linear Regression to predict House Price.
Requirement: The main requirement is to be able to predict the house price in a given location based on the age of the house, its distance from the airport, and the number of convenience stores within a 5 km radius.
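To make the requirement concrete, here is a minimal sketch of a linear regression on the three features named above. It assumes NumPy is available and uses ordinary least squares via `np.linalg.lstsq`, which may differ from the library used in the full post. The rows, the coefficients, and the function names `fit_linear_regression` and `predict_price` are all made up for illustration; the prices are generated from a known linear rule so that the fit can be checked, and they are not real housing data.

```python
import numpy as np


def fit_linear_regression(features, prices):
    """Fit ordinary least squares with an intercept.

    Returns (intercept, coefficients), one coefficient per feature column.
    """
    # Prepend a column of ones so the first fitted weight is the intercept.
    design = np.column_stack([np.ones(len(features)), features])
    weights, *_ = np.linalg.lstsq(design, prices, rcond=None)
    return weights[0], weights[1:]


def predict_price(intercept, coefficients, house):
    """Predict the price of one house from its feature vector."""
    return float(intercept + np.dot(coefficients, house))


# Synthetic rows for illustration only:
# [age of house in years, distance from airport in km, stores within 5 km]
features = np.array([
    [5.0, 10.0, 8.0],
    [20.0, 25.0, 2.0],
    [12.0, 5.0, 6.0],
    [30.0, 40.0, 1.0],
    [2.0, 15.0, 9.0],
    [15.0, 30.0, 4.0],
])

# Hypothetical "true" rule used to generate noise-free prices.
true_intercept = 50.0
true_coefficients = np.array([-0.5, -0.3, 2.0])
prices = true_intercept + features @ true_coefficients

intercept, coefficients = fit_linear_regression(features, prices)

# Predict for a 10-year-old house, 20 km from the airport, with 5 stores nearby.
predicted = predict_price(intercept, coefficients, [10.0, 20.0, 5.0])
print(round(predicted, 2))
```

Because the synthetic prices contain no noise, the fitted intercept and coefficients recover the generating rule. With real data the fit would only approximate the underlying relationship, and you would normally hold out some rows to measure prediction error.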
How to Create Login and Password in SQL Server
This post is to help you create a new login and password for the users who want to use the data in SQL database or Warehouse for Analysis and Reporting.
How to read .csv and .xlsx file in Databricks
How to read an .xlsx file: Step 1: To read an .xlsx file, you need to have the library com.crealytics:spark-excel_2.11:0.12.2 installed in the Databricks cluster. Steps to install the library com.crealytics:spark-excel_2.11:0.12.2 in the Databricks cluster: Step 1: Select the Databricks cluster where you want to install the library. Step 2: Click on Libraries. Step 3: Click on...
Everything you need to know about Big Data
Big Data refers to data that is nearly impossible to process on a single computer because it is huge in size, generated frequently, and comes in many different formats.
Machine Learning: What is Reinforcement Learning
Reinforcement Learning is an area of Machine Learning that is applied when you don’t have a training dataset and want the machine to learn from its own experience.
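As a small illustration of learning from experience without a training dataset, here is a sketch of an epsilon-greedy agent on a multi-armed bandit, one of the simplest reinforcement learning settings. It is not taken from the full post. The reward means, the noise level, and the function name `run_epsilon_greedy` are assumptions chosen for the example.

```python
import random


def run_epsilon_greedy(true_means, steps=2000, epsilon=0.1, seed=0):
    """Learn which action pays best purely from observed rewards.

    true_means is hidden from the agent; it only sees the noisy reward
    returned after each action. Returns (estimates, pulls).
    """
    rng = random.Random(seed)
    estimates = [0.0] * len(true_means)  # running average reward per action
    pulls = [0] * len(true_means)        # how often each action was tried

    for _ in range(steps):
        if rng.random() < epsilon:
            # Explore: try a random action.
            action = rng.randrange(len(true_means))
        else:
            # Exploit: pick the action with the best estimate so far.
            action = max(range(len(true_means)), key=lambda a: estimates[a])

        # The environment answers with a noisy reward (standard deviation 0.1).
        reward = rng.gauss(true_means[action], 0.1)

        # Incremental update of the running average for that action.
        pulls[action] += 1
        estimates[action] += (reward - estimates[action]) / pulls[action]

    return estimates, pulls


# Three hypothetical actions; the third has the highest average reward.
estimates, pulls = run_epsilon_greedy([0.2, 0.5, 0.8])
best_action = max(range(len(estimates)), key=lambda a: estimates[a])
print(best_action, pulls)
```

The agent starts with no data at all. It builds its reward estimates by acting, observing the result, and gradually favouring the action that has paid best so far, while the `epsilon` parameter keeps a small share of steps for exploration.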