Research Portfolio

Comprehensive overview of research publications, open-source packages, and ongoing projects in machine learning, survival analysis, and statistical computing.

Published Research Papers

Shape Penalized Decision Forests for Imbalanced Data Classification

Published
2024

IEEE

Classification trees often yield fragmented minority boundaries under imbalanced data. The authors propose a surface-to-volume ratio (SVR) regularization that penalizes decision-set complexity, optimized via a greedy breadth-first splitting algorithm analogous to CART.

Papers Under Review & Accepted

MART: Moving Average Randomized Tree

Accepted
2024

Springer Machine Learning

A randomized split method based on compound annual growth rate (CAGR) for predicting future trends in the stock market.

To be published; preprint not available.
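
Because the preprint is not available, the split rule itself cannot be shown here. As a small illustration of the quantity the method is named after, the sketch below computes a compound annual growth rate, assuming CAGR carries its usual financial meaning. The function name `cagr` and the sample prices are hypothetical and are not taken from the paper.

```python
def cagr(start_value: float, end_value: float, years: float) -> float:
    """Compound annual growth rate between two values `years` apart.

    Illustrative helper only; not the MART split criterion.
    """
    if start_value <= 0 or years <= 0:
        raise ValueError("start_value and years must be positive")
    # (end / start) ** (1 / years) - 1 is the constant yearly growth rate
    # that would turn start_value into end_value over the given horizon.
    return (end_value / start_value) ** (1.0 / years) - 1.0


# Hypothetical example: a price moving from 100 to 121 over two years
# corresponds to roughly 10% growth per year.
rate = cagr(100.0, 121.0, 2.0)
print(f"CAGR: {rate:.4f}")
```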

Survival: A Different Approach

Under Review
2024

Novel data-dependent techniques for aggregating survival trees, aimed at improving the accuracy of patient survival estimates.

Concordance-based Survival Cobra with Regression Type Weak Learners

Under Review
2024

A novel survival analysis method that combines concordance-based techniques with regression-type weak learners.

View Preprint
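
For readers unfamiliar with concordance, the sketch below computes Harrell's concordance index, the standard concordance measure for censored data. It assumes this is the kind of concordance the paper builds on, and it does not reproduce the paper's aggregation scheme. The function name and sample data are hypothetical.

```python
def concordance_index(times, events, risk_scores):
    """Harrell's C-index for right-censored data.

    times:       observed time for each subject
    events:      1 if the event was observed, 0 if censored
    risk_scores: higher score means higher predicted risk (earlier event)
    """
    concordant = 0.0
    comparable = 0
    n = len(times)
    for i in range(n):
        # A pair (i, j) is comparable only if subject i had an observed
        # event strictly before subject j's observed time.
        if not events[i]:
            continue
        for j in range(n):
            if times[i] < times[j]:
                comparable += 1
                if risk_scores[i] > risk_scores[j]:
                    concordant += 1.0   # correctly ordered pair
                elif risk_scores[i] == risk_scores[j]:
                    concordant += 0.5   # tied predictions count as half
    if comparable == 0:
        raise ValueError("no comparable pairs")
    return concordant / comparable


# Hypothetical data: four subjects, the third one censored.
times = [2, 4, 6, 8]
events = [1, 1, 0, 1]
risk = [0.9, 0.7, 0.8, 0.1]
c_index = concordance_index(times, events, risk)
print(f"C-index: {c_index:.2f}")
```

A value of 1.0 means every comparable pair is ranked correctly, and 0.5 is what random scores would achieve on average.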

Integrated Brier Score based Survival Cobra - A Regression Based Approach

Under Review
2024

A survival analysis method that combines an integrated Brier score-based approach with regression techniques.

View Preprint

Open Source Packages

Shape Penalized Decision Forests

imbalanced-spdf

Python
Machine Learning
Imbalanced Data

Shape Penalized Decision Forests for training ensemble classifiers tailored to imbalanced datasets. Provides SPBaDF and SPBoDF implementations with surface-to-volume ratio (SVR) regularization.

Combined Regression Strategy Survival

cobsurv

Python
Survival Analysis
Regression

Part of the PBSA (Proximity-Based Survival Analysis) research project. Contains combined regression strategy-based models for survival analysis with production-ready implementations.

Fast Kaplan Meier Estimator

fastkme

Python
Statistics
Survival Analysis

A faster Kaplan-Meier estimator, designed for use with nonparametric estimators. Includes weighted Kaplan-Meier curve functionality.
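
As background on what a weighted Kaplan-Meier curve computes, here is a minimal pure-Python sketch of the textbook product-limit estimator with optional per-subject weights. It is not the fastkme API or its optimized implementation. The function name `kaplan_meier` and its signature are hypothetical.

```python
def kaplan_meier(times, events, weights=None):
    """Product-limit survival estimate with optional subject weights.

    Returns a list of (time, survival_probability) pairs, one per distinct
    time at which at least one event was observed.
    """
    if weights is None:
        weights = [1.0] * len(times)
    data = sorted(zip(times, events, weights))
    at_risk = float(sum(weights))  # total weight still under observation
    survival = 1.0
    curve = []
    i, n = 0, len(data)
    while i < n:
        t = data[i][0]
        event_weight = 0.0    # weight of subjects with an event at time t
        leaving_weight = 0.0  # weight of all subjects leaving the risk set at t
        while i < n and data[i][0] == t:
            _, event, w = data[i]
            leaving_weight += w
            if event:
                event_weight += w
            i += 1
        if event_weight > 0:
            # Multiply by the conditional probability of surviving past t.
            survival *= 1.0 - event_weight / at_risk
            curve.append((t, survival))
        at_risk -= leaving_weight
    return curve


# Hypothetical data: five subjects, two of them censored (event = 0).
curve = kaplan_meier([1, 2, 2, 3, 4], [1, 1, 0, 1, 0])
for t, s in curve:
    print(t, round(s, 3))
```

With unit weights this reduces to the ordinary Kaplan-Meier estimator; scaling every weight by the same constant leaves the curve unchanged.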

Ready Tensor ML Projects

Machine learning models developed during my tenure as an ML Engineer at Ready Tensor, focusing on production-ready, Dockerized ML implementations.

TSMixer Time Series Forecasting

Apache License 2.0

rt_forecasting_TSMixer

Advanced time-series forecasting models using the TSMixer architecture

PatchMixer Forecasting

Apache License 2.0

rt_forecasting_PatchMixer

PatchMixer model implementation for time series forecasting

CatBoost Multiclass Classifier

rt_mc_class_catboost

CatBoost model implementation for multiclass classification

AdaBoost Binary Classifier

MIT License

rt_adaboost_binary_classifier

AdaBoost algorithm implementation for binary classification

XGBoost Binary Classifier

MIT License

rt_xgboost_binary_classifier

XGBoost model implementation for binary classification

Research Collaboration & Academic Partnership

Open to research collaborations, academic partnerships, and discussions on machine learning, survival analysis, and statistical computing innovations.
Senior Research Fellow @ IIT Guwahati | Sorbonne Abu Dhabi SAFIR Affiliate | MIT Student