Hi! I'm Hassan Farid!

A python developer, machine learning enthusiast and data scientist

Profile Image

Hi! I am Hassan.

I am a data scientist in practice skilled in Data Mining, Data Cleaning, Data Transformation, Data Visualization, Model Engineering and Automation Scripting, with Python and SQL at the center of it.

I have graduated from NED University, Karachi, Pakistan just recently in October 2022 and in my Bachelor years have implemented various data analysis and machine learning projects. I also have a knack for integrating different technologies to maximize their outcome. Aside from programming and computer science. I have a deep interest in mathematics, problem solving, reading books and teaching others.

Oct 2018 May 2021 Started BSCS Programme NED University, Karachi June 2021 Data Science Intern GRIP - Sparks Foundation Sep 2021 Oct 2021 CS Intern KDA IT Deptartment Oct 2022 Completed BSCS NED University, Karachi

Technical Skills


Need to get insights from your data? I have got you covered!

I am experienced with cleaning, processing and analyzing data as well as using it to predict future outcomes

  • Data Mining


    I make use of Numpy, Pandas and MySQL to mine valuable insights and provide a better understanding of the data

  • Data Visualization


    Using Matplotlib and Seaborn packages, I prepare aesthetic visuals to deliver valuable information from the data

  • Machine Learning


    The various features of Scikit-Learn package and Keras API allow me to model highly efficient ML/DL models

  • Automation Scripting


    I love automating use cases and do my best to exploit the relevant functionalities that can be automated via Python

BLOG


I write stories based on my vast skillset. Articles related to data science and python are my forte.

  • Overview of NLP Pre-processing Techniques

    Feb 2023   17 min read

    Text Preprocessing is one of the essential stages in training a Natural Language Processing (NLP) based machine learning model. Text Preprocessing allows processing the textual data and retrieval of a representation of textual data that is well-suited for the machine learning model being implemented...

    Read More
  • Overview of the Machine Learning Pipeline

    Jan 2023   5 min read

    Machine Learning is topping charts in terms of popularity nowadays, with the progress of AI-based art using generative image models, auto-transcribing of audios via speech-to-text systems, and general-purpose usage of LLMs to answer queries from large sets of passages, etc. With this trend in motion...

    Read More

Projects


I have worked on variety of data analysis, machine learning and python scripting projects

    Project Image

    Old Degraded Document Restoration

    Application to remove degraded patterns from scanned images of documents (in PDF format) using Deep Convolutional GAN. The application uses a Flask REST API and Keras for model implementation.

    Project Image

    Driver-Drowsiness-Detection System

    Implemented a drowsiness detection system with an auto-alarm alert using OpenCV, Keras and stream-lit, which uses computer vision to analyze the Eye Aspect Ratio and rings the alarm accordingly.

    Project Image

    Job Scrapper

    A simple command line tool that uses Beautiful Soup to fetch data from Linkedin Job Platform, extracts the useful information with the help of preprocessing and returns the result in a prettified manner.

    Project Image

    Urecipy

    A stream-lit application to generate recipe cards with ingredients and procedure of a certain recipe uploaded on YouTube. The application uses Whisper to transcribe audio into text and GPT-3 programming to extract the relevant information.

    Project Image

    Northwind Database Analysis

    Mined insights from Microsoft's Northwind Traders Database using MySQL DML operations and visualized the results with the help of Microsoft PowerBI Desktop.

    Project Image

    Car Dealership Sales Prediction

    Implemented a sales prediction model for an old car dealership. The analytical insights were gathered using Numpy, Pandas, Matplotlib and Seaborn, whereas Scikit-Learn was used for model development.

Contact Me


Feel free to contact me for any question related to my skillset. If you want to follow my work, reach me on Linkedin. Otherwise, send me an email at hassanfaridghori@gmail.com.

  • Linkedin URL
  • Github URL
  • Email Address

Copyright © Hassan Farid 2023