Rishab K Pattnaik

0 Publications
0+ Projects
0 Organizations

Who I Am

Hi, I’m Rishab — an AI researcher and engineering student from BITS Pilani who accidentally fell in love with deep learning during my third year. That first internship was the spark: I saw what machines could actually do, and I’ve been hooked ever since. These days, you’ll find me exploring the weird and wonderful world of AI architectures — from Transformers and Diffusion models to Mamba and beyond. I love digging into how models think: attention mechanisms, fine-tuning tricks like PEFT and distillation, and the magic behind text generation and image synthesis. But I’m not just here for the theory; I genuinely enjoy building things that work. Whether it’s a production system like Moody.AI or a research project in medical imaging, I love the journey from idea to deployment. Currently, I’m part of the Avatar team at FLAM, where I get to play with image generation — GANs, diffusion, normalizing flows — and even dabble in streaming tech like WebRTC (because why limit yourself?). If you’re into AI — whether it’s research, engineering, or just staying up late debating whether attention really is all you need — I’d love to connect. Let’s geek out, collaborate, or simply share ideas. The best conversations start with curiosity.

Experience

Flam · Bangalore ● active
1const role = "AI Engineering Intern";
2const company = "Flam (Flying Flamingoes Pvt. Ltd.)";
3const location = "Bangalore, India";
4const period = { start: "Jan 2026", end: "Present" };
5
6function avatarWork() {
7 // Talking head avatars · Avatar team
8 // Real‑time B2B products
9 return {
10 data: "creation & compilation",
11 model: "architecture + losses research",
12 stream: "WebRTC deployment"
13 };
14}
Hamad Medical Corporation · Qatar ✓ completed
1const role = "AI Research Intern";
2const company = "Hamad Medical Corporation";
3const location = "Doha, Qatar (Remote)";
4const period = { start: "May 2025", end: "Aug 2025" };
5
6function currentWork() {
7 // Emergency Research · Dept of Surgery
8 // Multimodal AI for triage & CKD diagnosis
9 return {
10 focus: "emergency_triage",
11 stack: ["PyTorch", "ViT", "Mamba"],
12 active: true
13 };
14}
BITS Pilani · Hyderabad ✓ completed
1const role = "Research Assistant";
2const dept = "ECE Department";
3const institute = "BITS Pilani, Hyderabad";
4const period = { start: "Aug 2024", end: "December 2025" };
5
6function contributions() {
7 // Medical imaging · bone fracture detection
8 // Wavelet-CNN architecture development
9 return {
10 ,
11 published: true,
12 journal: "Healthcare Technology Letters, IEEE Sensor Letters"
13 };
14}
IGCAR · Kalpakkam ✓ completed
1const role = "CV Research Intern";
2const org = "IGCAR Kalpakkam";
3const period = { start: "May 2024", end: "Aug 2024" };
4
5function project() {
6 // Camouflaged object detection
7 // Fine-tuning Meta's SAM architecture
8 return {
9 model: "SAM (Segment Anything)",
10 task: "camouflage_detection",
11 method: "fine_tuning"
12 };
13}
 scroll to reveal more

Research & Publications

A Lightweight Fourier Block Transformer for Android-Based Edge‑Enabled Detection of Osteopenia and Osteoporosis

IEEE Sensors Letters · March 2026

Lightweight Fourier Block Transformer achieving 88.41% accuracy for real-time osteoporosis detection directly on Android devices using knee X-ray sensor images.

IEEE · 2026

Multi-Frequency Aware Deep Representation Learning for Bone Fracture Detection

Healthcare Technology Letters, 2025

Novel multiband-frequency aware network achieving 92.22% accuracy on bone fracture detection benchmarks.

IET / Wiley · 2025

Deep Representation Learning for Pneumonia & Tuberculosis Detection

Elsevier Book Chapter, 2025

Published in "Non-stationary and nonlinear data processing for automated computer-aided medical diagnosis".

Elsevier · 2025

Featured Projects

U-Tube AI

Visual YouTube Agent

Multimodal AI that analyzes both audio and visual streams from YouTube videos.

EdgeSeg-AI

Research Paper Re-implementation

Memory-efficient image segmentation via sequential model loading.

Moody.AI

Multimodal Sentiment AI

Emotion recognition using DINOv2, Wav2Vec2, and DistilBERT for multimodal fusion.

OsteoDiagnosis.AI

Bone Health Diagnostics App

Android app for osteoporosis classification achieving 90% accuracy.

Expression.AI

Android App

Real-time facial expression recognition using TensorFlow Lite on Android.

Hand Gesture Identifier

Computer Vision

Real-time gesture recognition using FastViT achieving 97.5% accuracy.

AI Document Assistant

NLP / RAG

Document processing and Q&A pipeline using DeepSeek and Llama models.

Smart AI Checkers Bot

AI Game

Checkers game with a Minimax AI opponent with alpha-beta pruning.

Human Face Gender Detection

Neural Networks

Gender detection using InceptionV3 achieving 94.35% accuracy.

Technical Arsenal

Languages

Python
C
SQL
Git
Verilog

AI / ML Frameworks

PyTorch
TensorFlow
OpenCV
Scikit-learn
Keras
NumPy
Pandas
Matplotlib
Docker
LangChain
FastAPI
Android Studio
Streamlit

Specializations

Machine Learning
Deep Learning
Generative AI
LLMs

Education

Birla Institute of Technology and Science, Pilani

Bachelor of Engineering in Electronics and Communication Engineering

Hyderabad, India  ·  Currently Studying

Relevant Coursework

  • Machine Learning for Electronics Engineer
  • Neural Network and Fuzzy Logic
  • Deep Learning
  • Digital Signal Processing
  • Signals and Systems
  • Probability and Statistics
  • Microprocessor and Interfacing
  • Linear Algebra

Blogs

MedMamba Explained

Deep Dive · Deep Learning · State-Space Models

The first Vision Mamba for Generalized Medical Image Classification — what it is, how it works, and why it matters.

Medium · 2024