Team: Code Surgeons Innovator
Challenge: Exoplanet Detection using AI/ML
Dataset: NASA Exoplanet Archive PHOTOMETRIC Data
EXOVISION is an advanced machine learning system that automatically detects exoplanets by analyzing stellar light curves from NASA's photometric dataset. Our system uses the transit method to identify the characteristic dimming patterns that occur when an exoplanet passes in front of its host star.
- Multi-Model AI Pipeline: 5 optimized machine learning models (SVM, Random Forest, LightGBM, Gradient Boosting, Ensemble)
- Fast Training: Complete model training in under 22 seconds (15x speed improvement)
- Web Interface: Interactive Flask web application for real-time predictions
- Automated Data Processing: Intelligent feature extraction from light curve data
- NASA Dataset Integration: Direct processing of NASA Exoplanet Archive data
- Visualization Tools: Comprehensive plots for data exploration and model evaluation
| Model | Accuracy | F1-Score | Training Time |
|---|---|---|---|
| SVM (Best) ⭐ | 53.0% | 0.329 | ~4s |
| LightGBM | 61.5% | 0.238 | ~6s |
| Gradient Boosting | 64.5% | 0.145 | ~5s |
| Random Forest | 67.0% | 0.057 | ~4s |
| Ensemble | 63.5% | 0.099 | ~3s |
SVM selected as best model due to optimal F1-score for imbalanced exoplanet detection
- Backend: Flask, Python 3.12
- Machine Learning: scikit-learn, LightGBM, TensorFlow
- Data Processing: pandas, NumPy, SciPy
- Visualization: matplotlib, seaborn
- Frontend: HTML/CSS/JavaScript
- Deployment: Local Flask server
EXOVISION/
├── app.py # Flask web application
├── exoplanet_detection.ipynb # Main ML pipeline notebook
├── model_comparison.md # Model performance analysis
├── README.md # This file
├── requirements.txt # Python dependencies
├── data/
│ └── photometric_dataset.pkl # Processed NASA dataset
├── models/ # Trained ML models
│ ├── best_model_fast.pkl
│ ├── fast_svm_fast.pkl
│ ├── fast_lightgbm_fast.pkl
│ └── ...
├── static/ # Web assets
│ ├── css/style.css
│ ├── js/app.js
│ └── plots/
├── templates/
│ └── index.html # Main web interface
└── uploads/ # User data uploads
git clone <repository-url>
cd EXOVISIONpython -m venv env
env\Scripts\activate # Windows
# source env/bin/activate # Linux/Macpip install -r requirements.txtpython app.pyNavigate to http://localhost:5000 to access the web interface.
- Upload Data: Upload CSV files with light curve data
- Select Model: Choose from 5 trained ML models
- Get Predictions: Receive instant exoplanet detection results
- View Analysis: Explore detailed visualizations and confidence scores
- Open
exoplanet_detection.ipynb - Run all cells to reproduce the complete ML pipeline
- Experiment with different models and parameters
Our system identifies exoplanets using the transit method:
- Analyzes periodic dimming in stellar brightness
- Extracts 15 optimized features from light curves
- Applies robust preprocessing and scaling
- Uses ensemble voting for final predictions
Key extracted features include:
- Statistical moments (mean, variance, skewness, kurtosis)
- Periodicity detection via autocorrelation
- Transit depth and duration measurements
- Frequency domain analysis
- Outlier and anomaly detection
✅ Automated Detection: AI-powered exoplanet identification
✅ Real-time Processing: Fast predictions on new data
✅ User-Friendly Interface: Accessible web application
✅ Scientific Accuracy: Optimized for astronomical data
✅ Scalable Architecture: Handles large datasets efficiently
Our fast training approach includes:
- Hyperparameter Optimization: Reduced search space for speed
- Feature Selection: Top 15 most informative features
- Cross-validation: 2-fold validation for efficiency
- Class Balancing: Weighted models for imbalanced data
- Pipeline Integration: Automated preprocessing workflows
GET /: Main web interfacePOST /predict: Upload and analyze light curve dataGET /models: List available ML modelsPOST /batch_predict: Batch processing for multiple files
- Real-time NASA data integration
- Deep learning models (CNN/LSTM)
- Multi-planet system detection
- Advanced visualization dashboards
- Cloud deployment and scaling
Code Surgeons Innovator - NASA Space Apps Challenge 2025 participants dedicated to advancing exoplanet discovery through innovative AI/ML solutions.
This project is developed for the NASA Space Apps Challenge 2025. Please refer to NASA's data usage policies for the underlying datasets.
- NASA Exoplanet Archive for providing the photometric dataset
- NASA Space Apps Challenge organizers
- Open-source ML community (scikit-learn, TensorFlow, Flask)
Noaman Ayub
🌐 Portfolio: noamanayub.netlify.app
💼 LinkedIn: linkedin.com/in/noamanayub
💻 GitHub: github.com/noamanayub
🌍 Discovering New Worlds Through Machine Learning 🚀