All things Data

# Category: Probability

• ## Analyzing Diabetes Patterns amongst Indians, A Beginner’s Guide to Pearson’s Correlation Coefficient, Deep Learning in Cyber Security & Much More!

1. Juicing out the Diabetes Patterns amongst Indians using Machine Learning The data indicates an increase of 266% in the population of diabetics is going to be witnessed by developing countries. The score of the training model was a magnificent 100% which means it classified all the elements correctly as is evident as a result […]

• ## Resources to learn Linear Regression

Linear regression shows the linear relationship between the independent(predictor) variable i.e.Linear regression is a quiet and the simplest statistical regression method used for predictive analysis in machine learning. How a Math equation is used in building a Linear Regression model? Do you know that this one equation helps in building a linear regression model in the machine learning world? Yes, you heard it right.From the school days, we have come across the equation of the straight line.

• ## Basics of Data Science using Python

Basics of Data Science using Python After going through the topic above, what are the questions that come to your mind? It must be, why python? How can we use python to implement data science? And, what are the advantages and disadvantages? We will be answering all these questions and we are also going to talk about the libraries that we can use to implement data science. 1. NumPy NumPy arrays are similar to Python’s built-in list type in some ways, but NumPy arrays provide much more efficient storage and data operations as the arrays grow in size. 2. SciPy It is built on top of the Numpy library, which provides more extensions for finding scientific mathematical formulae such as Matrix Rank, Inverse, polynomial equations, LU Decomposition, and so on. 3. Pandas Pandas is a pillar library in any data science workflow because it allows you to perform data processing, wrangling, and munging. 4. Matplotlib This library is built on NumPy arrays and includes several plots such as line charts, bar charts, histograms, and so on. 5. Scikit Learn Scikit-learn is by far one of the most important Python libraries for machine learning, as it allows you to create machine learning models while also providing utility functions for data preparation, post-model analysis, and evaluation.. 6. TensorFlow TensorFlow is a software library that uses data flow graphs to perform numerical computations. 7. Keras It was created with the goal of allowing for quick experimentation .Keras is a Python library that is widely used for deep learning model training. 8. PyTorch PyTorch takes these tensors and makes it simple to move them to GPUs for faster processing during neural network training.

• ## Probability Distributions that every Data Scientist must know

Introduction Probability of an event tells us how likely is that, the event will occur. The applications of probability begin with the numbers p0, p1, p2… that give the probability of each possible outcome. There are dozens of famous and useful possibilities for p. I will discuss four of them in this post. Before going […]