A machine learning-based solution for predicting maintenance needs and monitoring diesel generator parameters in real-time.
This project implements a predictive maintenance system for diesel generators using machine learning algorithms to analyze time series data. The system monitors critical parameters such as:
- Coolant Temperature
- Oil Pressure
- Vibration Levels
- Exhaust Gas Temperature
- Fuel Consumption Rate
- Battery Voltage
- Engine Speed (RPM)
- Load Current
- Frequency (Hz)
- Running Hours
The machine learning model predicts potential issues like the ones listed above, before they cause failures, reducing downtime and maintenance costs.
- Machine learning models for predictive analytics
- Real-time parameter monitoring via MQTT protocol
- Web-based dashboard for parameter visualization
- Comparative analysis of multiple ML algorithms
The system evaluates multiple algorithms to find the optimal predictive model:
- Linear Regression
- Logistic Regression
- Decision Trees
- Random Forest Regression (achieved 90.56% accuracy)
- Support Vector Machines
training.py
- Code for training the machine learning modelspredicting.py
- Implementation of prediction functionalitydata.csv
- Historical training datasetinputdata.csv
- Input data format for predictionstest.html
- Web dashboard for real-time monitoring
- Data collection from diesel generator sensors
- Data preprocessing and feature engineering
- Model training and evaluation
- Real-time data transmission via MQTT
- Parameter visualization on web dashboard
- Best Performing Model: Random Forest Regression
- Accuracy: 90.56%
- Key Parameters Monitored:
- Voltage
- Current
- Fuel levels
- Engine temperature
- Operational hours
Python 3.7+
pandas
scikit-learn
seaborn
paho-mqtt (for MQTT communication)
matplotlib (for visualization)
- Clone this repository
git clone https://github.com/yourusername/diesel-generator-predictive-maintenance.git
- Install required packages
pip install -r requirements.txt
- Run the training script
python training.py
- Set up prediction system
python predicting.py
-
Real-time Streaming Architecture:
- Implement Apache Kafka for high-throughput, fault-tolerant data streaming
- Build real-time analytics with Kafka Streams or Apache Flink
-
Workflow Orchestration:
- Migrate to Apache Airflow for robust pipeline scheduling and monitoring
- Implement DAGs for complex maintenance prediction workflows
-
Big Data Processing:
- Scale to distributed computing with Apache Spark for handling fleet-wide generator data
- Implement batch processing with Hadoop ecosystem for historical analysis
-
Data Warehousing & Storage:
- Implement Snowflake data warehouse for flexible scaling and analytics
- Utilize AWS S3 for cost-effective long-term storage of sensor data
-
Cloud Infrastructure:
- Migrate to AWS cloud infrastructure (EC2, Lambda, SageMaker)
- Implement containerization with Docker and Kubernetes for deployments
-
Advanced Analytics:
- Develop a data lake architecture for combining structured and unstructured maintenance data
- Implement dbt (data build tool) for analytics engineering and transformation
Real-time monitoring dashboard showing generator parameters
Comparison of different ML algorithms with Random Forest achieving 90.56% accuracy
Click to watch the demo video of the system in action
- Development Period: August 2023 - May 2024
- Institution: SSN College of Engineering
- Contributors: [Sriram Vivek , Sethuram Gautham Rajakumar]
Feel free to reach out if you have any questions or would like to collaborate!
- Email: [email protected]
- LinkedIn: https://www.linkedin.com/in/sriram-vivek-58a673269/
This project was developed as part of academic curriculum at SSN College of Engineering.