A collection of machine learning projects developed as part of the Codsoft Data Science initiative. This repository includes projects ranging from predictive analytics to classification models.
- Dataset:Click Here
- The Movie Rating Prediction project estimates movie ratings based on features like genre, director, and actors. By analyzing historical data, it reveals key factors that influence ratings, providing insights into audience and critic preferences.
- Dataset: Click Here
- The Iris Classification project aims to classify iris flowers into three species—setosa, versicolor, and virginica—based on sepal and petal measurements. Using the widely recognized Iris dataset, this model demonstrates fundamental classification techniques for distinguishing between species accurately.
- Dataset: Click Here
- The Sales Prediction project focuses on forecasting product demand by analyzing factors like advertising spend, target audience, and platform choice. By applying machine learning techniques in Python, this model helps businesses optimize advertising strategies and maximize sales potential.
- Dataset: Click Here
- The Titanic Survival Prediction project uses passenger data—such as age, gender, and ticket class—to predict survival outcomes. This classic dataset allows for a beginner-friendly exploration of classification techniques to determine factors influencing passenger survival.
- Languages: Python
- Libraries: NumPy, Pandas, Scikit-Learn, etc
- Other Tools: Jupyter Notebook, Google Colab, VS Code, Git, GitHub.
- Feature Engineering: Data cleaning, normalization/Standardization
- Modeling Techniques: Linear Regression, Random Forest, Decision tree, Classification, etc.
- Evaluation Metrics: Accuracy, F1-score, MSE, MAE, R2, etc
For questions or collaboration, feel free to reach out:
Girish Kumar Email: [email protected] GitHub: https://github.com/girish119628