VIT HACK-A-THON: Applications of AI/ML in Environmental Sustainability
by kandikits Updated: Aug 17, 2023
1-Click Kit
This starter kit presents a very simple solution based on AI/ML that helps tackle the problem of water pollution caused due to poor quality detergents. This kit uses the 'IoT Sensor' dataset which has information on Detergent Quality - 'Good' or 'Bad', based on certain detergent manufacturing parameters. The solution uses the Supervised Machine Learning technique to predict whether the detergent is of good or bad quality thereby resulting in water pollution. This solution can be deployed for enhancing the footprint of detergents on Aquatic Ecosystems.
Here is how you get started!
- Identify your specific problem and your proposed use case/solution.
- Install this starter kit and customize it to suit your solution.
- Answer the questions on the submission form with a short video/screen recording/presentation explaining your solution.
For your HACK-A-THON, you may utilize this kit in the following ways:
- ‘BO5341_IoTData.csv’ is the dataset used to train the model. Replace it with the dataset of your choice.
- There is a ‘sample_predict.csv’ dataset wherein you can test this model with a similar dataset as ‘BO5341_IoTData.csv’.
- Checkout the output file ‘sample_predict_output.csv’ to see the results of the sample dataset you entered in ‘sample_predict.csv’
- ‘Logistic RegressionCV’ is used to train the model and fetch a training accuracy of 98%, and testing accuracy of 73%. You may improve the accuracy of the model using other algorithms as a submission
- You may use Matplotlib & Seaborn to plot other features to showcase your results apart from the confusion matrix used to analyze the result.
Deployment Information
Follow the below instructions to run the solution. This is our source code updated in GitHub for our VIT hackathon entry. The below repo is an example. Please replace with your source code from GitHub.
For Windows OS, Download, extract and double-click the kit installer file to install the kit. Note: Do ensure to extract the zip file before running it. The installation may take from 2 to 10 minutes based on bandwidth.
- When you're prompted during the installation of the kit, press Y to launch the app automatically and execute cells in the notebook by selecting Cell --> Run All from Menu bar
- To run the app manually, press N when you're prompted and locate the zip file 'environmental-sustainability-through-ml-main.zip'
- Extract the zip file and navigate to the directory 'environmental-sustainability-through-ml-main'
- Open command prompt in the extracted directory 'environmental-sustainability-through-ml-main' and run the command 'jupyter notebook'
For other Operating System,
- Install python
- Download the repository
- Extract the zip file and navigate to the directory 'environmental-sustainability-through-ml-main'
- Open terminal in the extracted directory 'environmental-sustainability-through-ml-main'
- Install dependencies by executing the command 'pip install -r requirements.txt'
- Run the command ‘jupyter notebook’ and select the notebook ‘Enironmental_Sustainability_IoTdata.ipynb’ on the browser window.
Instructions to Run
- Locate and open the Enironmental_Sustainability_IoTdata.ipynb notebook from the Jupyter Notebook browser window.
- Execute cells in the notebook by selecting Cell --> Run All from Menu bar You can execute the cells of notebook by selecting Cell from the menu bar.
For any support, you can reach us at FAQ & Support
Development Environment
VSCode and Jupyter Notebook are used for development and debugging. Jupyter Notebook is a web based interactive environment often used for experiments, whereas VSCode is used to get a typical experience of IDE for developers. Jupyter Notebook is used for our development.
Group Name 1
notebookby jupyter
Jupyter Interactive Notebook
notebookby jupyter
Jupyter Notebook 10204 Version:v7.0.0b4 License: Permissive (BSD-3-Clause)
Exploratory Data Analysis
For extensive analysis and exploration of data, and to deal with arrays, these libraries are used. They are also used for performing scientific computation and data manipulation.
numpyby numpy
The fundamental package for scientific computing with Python.
numpyby numpy
Python 23755 Version:v1.25.0rc1 License: Permissive (BSD-3-Clause)
pandasby pandas-dev
Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more
pandasby pandas-dev
Python 38689 Version:v2.0.2 License: Permissive (BSD-3-Clause)
Machine Learning
Machine learning libraries and frameworks here are helpful in providing state-of-the-art solutions using Machine learning.
scikit-learnby scikit-learn
scikit-learn: machine learning in Python
scikit-learnby scikit-learn
Python 54584 Version:1.2.2 License: Permissive (BSD-3-Clause)
Data Visualization
The patterns and relationships are identified by representing data visually and below libraries are used for generating visual plots of the data.
matplotlibby matplotlib
matplotlib: plotting with Python
matplotlibby matplotlib
Python 17559 Version:v3.7.1 License: No License
Troubleshooting
- If you encounter any error related to MS Visual C++, please install MS Visual Build tools
- While running batch file, if you encounter Windows protection alert, select More info --> Run anyway.
- During kit installer, if you encounter Windows security alert, click Allow.
- If you encounter Memory Error, check if the available memory is sufficient and it is proportion to the size of the data being used. For our dataset, the minimum required memory is 8GB.
If your computer doesn't support standard commands from windows 10, you can follow the instructions below to finish the kit installation.
- Click here to install python
- Click here to download the repository
- Extract the zip file and navigate to the directory 'energy-sustainability-through-ml-main'
- Open terminal in the extracted directory 'energy-sustainability-through-ml-main'
- Install dependencies by executing the command 'pip install -r requirements.txt'
- Run the command ‘jupyter notebook’ and select the notebook ‘Energy_Sustainability_IoTdata.ipynb’ on the browser window.