Password Strength Classifier 🔑

A Machine Learning model that predicts whether the password is strong or not.

Datset Link: https://www.kaggle.com/bhavikbb/password-strength-classifier-dataset

The passwords used in our analysis are from 000webhost leak that is available online. How did we figure out which passwords were stronger and which were weaker? Well, there is a tool called PARS by Georgia Tech university which have all the commercial password meters integrated into it. All I did was give that tool all the passwords and it gave me new files for each commercial password strength meter. The files contained the passwords with one more column i.e their strength based on the commercial password strength meters.

The commercial password strength algorithms I used are of Twitter, Microsoft and battle. How is this algorithm different from these strength meters? First of all, it is entirely based on machine learning rather than on rules. Secondly, I only kept those passwords that were flagged weak, medium and strong by all three strength meters. This means that all the passwords were indeed either weak, medium or strong.

About this file

Password - 670k unique values for password collected online. Strength - three values(0 , 1 , 2) i.e. 0 for weak, 1 for medium, 2 for strong. Strength of the password based on rules(such as containing digits, special symbols , etc.)

Plots for better understanding 📊

Value Counts of Strength 💪

Length of a Password 📏

Capital letters in a Password 🔠

Small letters in a Password 🔡

Numeric values in a Password 🔢

Special characters in a Password 🔣

The model performance 🥇

Here we have used MLP Classifier from sklearn with 2 hidden layers each having 16 nodes with ReLU activation. The accuracy of the model reached 99.99% as the model made only one mis-classification. Here is the confusion matrix for better understanding:

The scaler value as well as the model is saved in the asset folder.

This was just a tutorial how powerful can feature engineering be.

Name		Name	Last commit message	Last commit date
Latest commit History 24 Commits
asset		asset
img		img
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
model.ipynb		model.ipynb
passwordEDA.ipynb		passwordEDA.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Password Strength Classifier 🔑

A Machine Learning model that predicts whether the password is strong or not.

About this file

Plots for better understanding 📊

Value Counts of Strength 💪

Length of a Password 📏

Capital letters in a Password 🔠

Small letters in a Password 🔡

Numeric values in a Password 🔢

Special characters in a Password 🔣

The model performance 🥇

About

Releases

Packages

Languages

License

Ankit152/Password-Strength-Classifier

Folders and files

Latest commit

History

Repository files navigation

Password Strength Classifier 🔑

A Machine Learning model that predicts whether the password is strong or not.

About this file

Plots for better understanding 📊

Value Counts of Strength 💪

Length of a Password 📏

Capital letters in a Password 🔠

Small letters in a Password 🔡

Numeric values in a Password 🔢

Special characters in a Password 🔣

The model performance 🥇

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages