
D-Tale -- One Stop Solution for EDA



D-Tale is a recently launched (Feb 2020) tool for Exploratory Data Analysis (EDA). It combines a Flask back-end with a React front-end to provide a powerful tool for analysing and visualizing data.

D-Tale is a Graphical User Interface platform that is not only quick and easy to understand but also great fun to use. It comes packed with features that reduce the manual work Data Engineers/Scientists do when analysing and understanding data, and it removes the need to hunt for multiple different EDA libraries.

Let's have a look at some features which make it so amazing:-

1. Seamless Integration -- D-Tale integrates seamlessly with Python/IPython notebooks and terminals, so we can use it with almost any IDE of our choice.

2. Friendly UI -- The Graphical User Interface provided by D-Tale is simple and easy to understand, so anybody can get comfortable with it and start working right away.

3. Support for multiple pandas objects -- D-Tale allows EDA for various pandas objects such as DataFrame, Series, MultiIndex, DatetimeIndex & RangeIndex.

4. Easy Installation -- Installing D-Tale is very easy. It takes only a few minutes, and we are ready to start playing with our EDA tool.

5. Tools -- It comes pre-loaded with a large number of tools, which makes our work a lot easier.

Installation

We can install D-Tale easily using conda or pip.

Code:- 

## conda installation

conda install dtale -c conda-forge

## pip installation 

pip install --upgrade dtale

## Jupyter Notebook installation

!pip install dtale


What are you waiting for?  Let's install it and start exploring. 

Upon completion of the installation process, we will get the "Successfully installed" message and, in the case of Jupyter Notebook, may be asked to restart our kernel.

After restarting the kernel, we are good to go with D-Tale.

Installing D-Tale
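
A quick way to confirm that the install worked is to ask Python for the installed version. This is a minimal sketch using only the standard library; the helper name `installed_version` is ours and is not part of D-Tale.

```python
from importlib.metadata import PackageNotFoundError, version


def installed_version(package_name):
    """Return the installed version string of a package, or None if it is absent."""
    try:
        return version(package_name)
    except PackageNotFoundError:
        return None


# "dtale" is the distribution name used in the pip/conda commands above.
dtale_version = installed_version("dtale")
if dtale_version is None:
    print("dtale is not installed in this environment")
else:
    print("dtale version:", dtale_version)
```

If the first message is printed inside a notebook right after installing, restarting the kernel and running the check again is the usual next step.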



Getting Started

Once installed, the next thing is to get the tool on the desk and start working on it. To get it going, we need to do the following:- 

Code:- 

import dtale

# df is any pandas DataFrame that has already been loaded
dtale.show(df)


That's it. We are welcomed by an interactive, well-organised view of our data.
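
For readers without a dataset at hand, here is a minimal sketch that builds a tiny Titanic-like frame and wraps the `dtale.show` call from above. The rows are made up for illustration, and the helper names `make_sample_frame` and `open_in_dtale` are ours, not D-Tale's.

```python
import pandas as pd


def make_sample_frame():
    """Build a tiny Titanic-like DataFrame (made-up rows, for illustration only)."""
    return pd.DataFrame(
        {
            "Name": ["Ann", "Bob", "Cid", "Dee"],
            "Sex": ["female", "male", "male", "female"],
            "Age": [29.0, None, 41.0, 35.0],
            "Fare": [71.28, 7.25, 8.05, 53.10],
        }
    )


def open_in_dtale(frame):
    """Start a D-Tale session for the given frame and return its handle."""
    import dtale  # imported here so the sample frame can be built without D-Tale

    return dtale.show(frame)


df = make_sample_frame()
print(df.shape)
```

With D-Tale installed, `d = open_in_dtale(df)` starts the session. According to the D-Tale README, the returned handle also exposes `d.open_browser()` to open the grid in a browser tab and `d.kill()` to shut the session down.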


Overview of UI

Great. We just installed and opened our first data set for EDA using D-tale.

*Please note: we use the Titanic dataset as our first dataset for analysis.

With that said, let's see what D-Tale has to offer.


Code:- 

Importing and Initializing.

Import and Show

On executing the show command, we are greeted with a table-like structure that is interactive and offers a lot of options.

D-tale UI


A great feature of this view is the option to open the entire table in a new browser tab, which makes the visuals clearer. We can select this option using the arrow marked in yellow in the top left corner of the above picture.


Menu options


D-tale new tab



As you can see in the above image, D-Tale offers a long list of features. It also provides options to customise the view and manage the data, such as:-
  1. Light/Dark theme option.
  2. Exporting the dataset in CSV/TSV format.
  3. Loading the data.
  4. Exporting the code.
  5. Resizing the column widths.
  6. Reloading the data.
  7. Filters.
  8. Showing/hiding columns, and the list goes on.

Apart from this main menu, a separate small menu appears when we click on a column header (shown below).

Column menu options


This menu provides options to delete, rename, hide or lock the column. It also provides options for converting the data type of the column, and we can check for duplicates with just a click.
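
For readers who think in pandas, the column-menu actions map roughly onto familiar DataFrame calls. The sketch below is an analogy on a made-up frame, not D-Tale's own code, and the column names are chosen only for illustration.

```python
import pandas as pd

df = pd.DataFrame(
    {
        "PassengerId": ["1", "2", "2", "3"],
        "Fare": [7.25, 71.28, 71.28, 8.05],
        "Cabin": [None, "C85", "C85", None],
    }
)

renamed = df.rename(columns={"Fare": "TicketFare"})    # rename a column
dropped = renamed.drop(columns=["Cabin"])              # delete a column
converted = dropped.astype({"PassengerId": "int64"})   # convert the data type
duplicate_rows = converted[converted.duplicated()]     # rows repeating an earlier row

print(converted.dtypes)
print(duplicate_rows)
```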


Ufff... that's a lot of features packed inside a small box... We will be discussing a few of the most amazing features from this list in the next section. 
 

Exploring the Features

Excited about exploring and diving deep into the features? Let's get going...

1. Missing Analysis -- This is a great tool for finding the missing data present in the dataset. It provides 4 different graphs to view missing data clearly. We can also download the graphs for future reference.

Missing Analysis
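
The numbers behind such graphs are easy to reproduce by hand. The following is a minimal pandas sketch on made-up data that counts missing cells per column; it illustrates the idea only and makes no claim about how D-Tale draws its graphs.

```python
import pandas as pd

df = pd.DataFrame(
    {
        "Age": [22.0, None, 26.0, None],
        "Cabin": [None, "C85", None, None],
        "Fare": [7.25, 71.28, 7.92, 8.05],
    }
)

missing_count = df.isna().sum()          # missing cells per column
missing_share = df.isna().mean() * 100   # same, as a percentage of rows

summary = pd.DataFrame({"missing": missing_count, "percent": missing_share})
print(summary.sort_values("missing", ascending=False))
```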


2. Correlation -- This is yet another amazing feature, which shows the correlation between various columns of the dataset. What makes it more interesting is the "Encode String" option, which applies one-hot encoding to the string columns so that they can be included too.

Correlation 


As shown, we can select particular columns to get the correlation between them. Apart from that, selecting a cell in the correlation table shows the Pearson and Spearman correlation between that pair of columns.
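
To see what these two coefficients and the one-hot encoding mean in practice, here is a minimal pandas sketch on made-up data. It assumes nothing about D-Tale's internals; `pd.get_dummies` and `DataFrame.corr` are simply the standard pandas equivalents.

```python
import pandas as pd

df = pd.DataFrame(
    {
        "x": [1, 2, 3, 4, 5],
        "y": [1, 4, 9, 16, 25],  # grows with x, but not linearly
        "group": ["a", "b", "a", "b", "a"],
    }
)

# One-hot encode the string column so it can take part in the correlation.
encoded = pd.get_dummies(df, columns=["group"], dtype=float)

pearson = encoded.corr(method="pearson")
spearman = encoded.corr(method="spearman")

print(round(pearson.loc["x", "y"], 4))
print(round(spearman.loc["x", "y"], 4))
```

Because `y` always increases with `x`, the rank-based Spearman value is 1, while the Pearson value falls slightly below 1 since the relationship is not a straight line.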

3. Highlights -- Another great feature, which highlights particular fields in the table. It has 4 options:-

Highlight


These can be used to highlight various aspects of the dataset like missing data, outliers, ranges & datatypes.
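
Each highlight boils down to a yes/no mask over the cells. The sketch below builds such masks in pandas on made-up ages. The 1.5 * IQR outlier rule is an assumption on our part (a common convention), since the rule D-Tale applies is not stated here.

```python
import pandas as pd

ages = pd.Series([22, 25, 27, 30, 31, 95, None], name="Age")

# Missing highlight: True where the cell is empty.
missing_mask = ages.isna()

# Outlier highlight using the common 1.5 * IQR rule (an assumption, see above).
q1, q3 = ages.quantile(0.25), ages.quantile(0.75)
iqr = q3 - q1
outlier_mask = (ages < q1 - 1.5 * iqr) | (ages > q3 + 1.5 * iqr)

# Range highlight: values between two bounds chosen by the user (inclusive).
range_mask = ages.between(25, 31)

print(ages[outlier_mask].tolist())
print(ages.dtype)
```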


4. Dataframe Functions -- This is another great feature of D-Tale, which provides various operations that help in analysing the data better. A few of the things it can do are combining different rows of data into a single row, analysing row data, transforming existing data, and time-series analysis. This is a must-use feature for completing our EDA.


Do try these and let us know how you used them for your analysis.


Dataframe functions
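
As a rough pandas analogy for two of these operations, the sketch below collapses several rows into one summary row per group and derives a new column from an existing one. The data and column names are made up, and this is not the code D-Tale runs.

```python
import pandas as pd

df = pd.DataFrame(
    {
        "Pclass": [1, 1, 3, 3, 3],
        "Fare": [71.0, 53.0, 7.0, 8.0, 9.0],
        "Survived": [1, 1, 0, 1, 0],
    }
)

# Collapse all rows of each passenger class into a single summary row.
per_class = df.groupby("Pclass").agg(
    passengers=("Fare", "size"),
    mean_fare=("Fare", "mean"),
    survival_rate=("Survived", "mean"),
)

# Transform existing data: add a column derived from another one.
df["FareShare"] = df["Fare"] / df["Fare"].sum()

print(per_class)
```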



5. Describe -- One of the most important features of D-Tale, this not only describes the data but also provides useful insights about it. It gives us column-wise stats such as outliers, missing values, quantiles, mean, median, mode, variance and standard deviation, along with graphs.

I have shown these stats for the Age column from our dataset. 

Describe
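
Most of these stats can be cross-checked with plain pandas. Here is a minimal sketch for a made-up Age column; the values are illustrative and are not taken from the Titanic dataset.

```python
import pandas as pd

age = pd.Series([22.0, 38.0, 26.0, 35.0, 35.0, None, 54.0], name="Age")

stats = {
    "count": int(age.count()),
    "missing": int(age.isna().sum()),
    "mean": age.mean(),
    "median": age.median(),
    "mode": age.mode().tolist(),
    "variance": age.var(),  # sample variance (ddof=1)
    "std": age.std(),       # sample standard deviation (ddof=1)
    "quantiles": age.quantile([0.25, 0.5, 0.75]).tolist(),
}

for name, value in stats.items():
    print(name, value)
```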


Amazing... isn't it...!!! 
Don't wait, get started and share your experience. 

Note: It is not possible to cover all the features in detail, so we have tried to cover 5 major functions from the library. You can still post your questions or ask us to explain a feature in the comment box below.

Summary

We learned about a new library that makes the entire EDA process simple, quick, easy and fun. We have covered its installation and a few important features in detail.

We hope you enjoyed reading this article and love using the library as much as we do.


