This repository contains the three projects completed as part of the internship program at Encryptix.
Predict the survival of passengers on the Titanic using various machine learning techniques.
-
Data Preprocessing:
- Handled missing values
- Feature engineering
-
Exploratory Data Analysis (EDA):
- Analyzed feature distribution and correlations
- Visualizations using bar charts, histograms, and heatmaps
-
Model Building:
- Tested models: Logistic Regression, Decision Trees, Random Forest, Support Vector Machines
- Used cross-validation for robustness
-
Model Evaluation:
- Metrics: accuracy, precision, recall, F1 score
- Hyperparameter tuning using Grid Search and Random Search
-
Result Interpretation:
- Final model predictions and feature importance analysis
Predict future sales of a company using historical sales data.
-
Data Collection and Preprocessing:
- Gathered and cleaned historical sales data
- Encoding categorical variables
-
Time Series Analysis:
- Analyzed trends, seasonality, and cyclic patterns
-
Feature Engineering:
- Created features like moving averages, lagged features, rolling statistics
-
Model Building:
- Explored ARIMA, Prophet, Random Forest, Gradient Boosting models
-
Model Evaluation:
- Metrics: MAE, MSE, RMSE
- Fine-tuning the best model
-
Deployment:
- Developed a pipeline for continuous prediction updates
- Visualizations and dashboards for stakeholders
Detect fraudulent credit card transactions to prevent financial losses and protect customers.
-
Data Collection and Preprocessing:
- Used a dataset of credit card transactions
- Addressed data imbalance with SMOTE and other techniques
-
Handling Imbalanced Data:
- Applied oversampling and undersampling
-
Feature Engineering:
- Created features to capture transaction patterns
- Scaling and normalization
-
Model Building:
- Tested models: Logistic Regression, Decision Trees, Random Forest, Gradient Boosting, Neural Networks
-
Model Evaluation:
- Metrics: Precision, Recall, F1 Score, AUC-ROC
- Focused on minimizing false negatives
-
Implementation:
- Real-time transaction monitoring and flagging suspicious activities
- System for ongoing model updates and retraining