Rishab K Pattnaik

Who I Am

A Machine Learning Engineer and Electronics & Communication Engineering student at BITS Pilani Hyderabad Campus, passionate about developing AI solutions for real-world challenges. I focus on medical imaging through deep learning architectures and wavelet-CNN integration, contributing to published research with measurable diagnostic improvements. My work spans medical AI applications to intelligent document processing systems. I explore cutting-edge architectures like Vision Transformers and Mamba, building practical solutions using PyTorch and TensorFlow. Through technical blogs, I share knowledge bridging theory with practice.

Experience

Hamad Medical Corporation, Qatar

AI Research Intern

Currently engaged in Emergency Research within the Department of Surgery at Hamad Medical Corporation, under the guidance of Dr. Sarada Prasad Dakua, Principal Data Scientist. The work encompasses developing and refining AI-driven approaches to enhance patient triage and clinical decision-making in high-pressure emergency settings. This involvement aims to improve both clinical outcomes and operational efficiency by leveraging advanced data science techniques tailored to emergency care challenges.

05/2025 - Present

BITS Pilani Hyderabad

Research Assistant

As a Research Assistant in the Department of ECE at BITS Pilani, I contributed to pioneering research in medical imaging under the supervision of Dr. Rajesh Kumar Tripathy. It involves developing and applying novel deep learning models, including advanced CNNs and transformer architectures , alongside the innovative integration of wavelet-DNN techniques to enhance diagnostic accuracy from medical scans .

08/2024 - Present

IGCAR Kalpakkam (On-Site)

CV Research Intern

As a Research Intern at Indira Gandhi Center of Atomic Research, Kalpakkam, under the supervision of Raja Sekhar M (SO/E), work focused on advancing camouflaged object detection by fine-tuning Meta's Segment Anything Model (SAM) for improved identification of objects in complex visual environments. This role involved designing and implementing specialized training strategies to adapt state-of-the-art vision transformer models for challenging detection tasks.

05/2024 - 08/2024

Research & Publications

Journal Paper

Multi-Frequency Aware Deep Representation Learning Framework for Automated Detection of Bone Fractures using Muscle X-ray Images

A Journal Paper authored under the supervision of Dr. Rajesh Kumar Tripathy focuses on developing advanced deep learning methodologies for medical imaging applications, specifically targeting the automated detection and diagnosis of complex medical conditions using multi-modal data. This research aims to enhance diagnostic accuracy and clinical decision support through innovative model architectures and training techniques. The paper is currently under review and has not yet been published yet.

Elsevier Book Chapter

Deep Representation Learning for Computer-Aided Detection of Pneumonia and Tuberculosis Using Chest X-Ray Images

Published research in Elsevier's book "Non-stationary and nonlinear data processing for automated computer-aided medical diagnosis" authored by RK Tripathy, RB Pachori, Sibashankar Padhy, Maarten De Vos (ISBN:9780443314261).

Featured Projects

Research Paper Productionization

EdgeSeg-AI

EdgeSeg-AI is a research backed image segmentation framework that democratizes advanced AI by reducing memory usage 60-70% through sequential model loading. It uses Multi-Modal Architecture, which intelligently orchestrates prompt interpretation, object detection, and mask generation for seamless user experience which enables high-quality segmentation on consumer hardware.

Novel Bone Health Diagnostics App

OsteoDiagnosis.AI

OsteoDiagnosis.AI (product of our Ongoing Research) is an innovative novel deep learning architecture integrating Advanced Signal Processing techniques for three-class Osteoporosis classification achieving 90% accuracy, successfully deployed as comprehensive Android application for accessible bone health diagnostics.

Android APP POWERED BY AI

Expression.AI

Expression.AI is a real-time facial expression recognition Android app using TensorFlow Lite and OpenCV. It classifies emotions from camera or gallery images using a custom model trained on FER2013. Its fast, efficient and less than 500 Mb. You can download the app from the APK Link inside the Pop-Up. (*Note: Its Completely Virus free)

COMPUTER VISION with DEEP LEARNING

Hand Gesture Identifier

Developed an AI Hand Gesture Recognition System leveraging Apple's FastViT Vision Transformer, achieving 97.5% accuracy on the validation set. This project focuses on real-time gesture recognition for 19 distinct hand gestures, demonstrating efficient model performance for practical applications. The system utilizes transfer learning on the HaGRID dataset, making it suitable for deployment in resource-constrained environments.

NATURAL LANGUAGE PROCESSING Integrating RAG

AI Document Assistant

Smart document processing system using DeepSeek and Llama models for intelligent analysis, extraction, and summarization of complex documents.

AI Course Project

Smart AI Checkers Bot

A Checkers game featuring a Smart AI using Minimax with alpha-beta pruning against a Random AI, demonstrating strategic decision-making in game development.

Course Project of Neural Network and Fuzzy Logic

Human Face Gender Detection

Deep learning-based gender detection system using CNN & InceptionV3, achieving 94.35% accuracy on the CelebA dataset with advanced augmentation techniques.

Technical Arsenal

Programming Languages

Python

C Programming

SQL

Git

Verilog

AI/ML Frameworks

PyTorch

TensorFlow

Computer Vision

Scikit-learn

Keras

NumPy

Pandas

Matplotlib

Docker

LangChain

FastAPI

Android Studio

Streamlit

AI Specializations

Machine Learning

Deep Learning

Generative AI

LLM

Education

Birla Institute of Technology and Science, Pilani

Bachelor of Engineering in Electronics and Communication Engineering

Hyderabad, India • Currently Studying

Relevant Coursework

Machine Learning for Electronics Engineer
Neural Network and Fuzzy Logic
Deep Learning
Digital Signal Processing
Signals and Systems
Probability and Statistics
Microprocessor and Interfacing
Linear Algebra

Blogs

Deep Dive • Deep Learning • State-Space-Models • Transformers Replacement • Efficient Computation

MedMamba Explained : The first Vision Mamba for Generalized Medical Image Classification is finally here!!! (Medium ~ AI Advances)

A blog on MedMamba, the first Vision Mamba architecture specifically designed for generalized medical image classification, addressing the computational limitations of traditional CNNs and Vision Transformers. Developed by Yubiao Yue and Zhenzhang Li, this groundbreaking work positions MedMamba as a superior replacement for ViTs by achieving linear computational complexity (O(N)) compared to ViTs' quadratic complexity (O(N²)), making it ideal for resource-constrained medical environments where real-time diagnosis is critical. The comprehensive technical guide explores how MedMamba leverages State Space Models (SSMs) and the innovative 2D-Selective-Scan mechanism to maintain global receptive fields while reducing FLOPs by up to 55% compared to equivalent transformer models. The blog demonstrates why practitioners should adopt MedMamba over traditional ViTs: it delivers competitive accuracy (93.7% average across medical datasets) with significantly lower memory requirements and faster inference times, making it practical for deployment in clinical settings where computational efficiency directly impacts patient care.