Anannya Popat - Portfolio

About Me

Hi there! I’m all about bringing ideas to life with computer vision, LLMs, and deep learning. From addressing real-world challenges in healthcare to creating impactful solutions, I’ve had a blast blending tech and creativity through projects and internships. I’m a big fan of AI, designing, and coding—basically, anything that lets me innovate and make life a little cooler!

Master of Science in Applied Computing with a specialization in Artificial Intelligence

University of Toronto

2023 - 2025

Bachelor of Technology in Computer Science and Engineering and Business Systems

Vellore Institute of Technology

2019 - 2023

Experience

Machine Learning Specialist (Team Lead) · University Health Network

Feb 2025 - Current

Leading the deployment of an ML pipeline for a realistic and interactive 3D anatomical modeling system using patient-specific CT/MRI scans. Contributed to real-time surgical AI by advancing Tool Tissue Interaction detection with YOLOv5 and training a Vision Transformer for precise classification.

Python PyTorch Transformers OpenCV Generative AI Docker Git REST APIs

AI Research Intern · University Health Network

May 2024 - Dec 2024

Led the development of an interactive 3D anatomical modeling system from patient-specific CT scans to revolutionize surgical planning. Implemented an nnU-Net-based segmentation algorithm to accurately generate 3D visualizations of segmented organs. Enhanced the realism of rendered anatomical models by optimizing and applying a GAN-based texture generation algorithm.

Python PyTorch OpenCV Generative AI VTK Blender NumPy NiBabel

Teaching Assistant · University of Toronto

Sep 2023 - Apr 2024

Mentored students in Python programming across diverse disciplines, simplifying complex concepts through relatable analogies and personalized problem-solving strategies. Adapted teaching methods to accommodate varied backgrounds, fostering a clear understanding for learners in management, psychology, computer science, and beyond.

Python

Data Science Intern · AdGlobal360

May 2022 - Jul 2022

Designed a lead scoring prediction model using Random Forests, Logistic Regression, and Deep Neural Networks, achieving high accuracy in identifying potential buyers from website activity. Performed exploratory data analysis and feature engineering, visually presenting key customer conversion factors beneficial to the stakeholders.

Python TensorFlow Scikit-Learn Pandas SQL/Power BI NumPy Big Data

UI/UX Designer · International Society of Automation (ISA) - VIT

Dec 2019 - Dec 2021

Designed and prototyped user-friendly interfaces for web and mobile applications using Figma and collaborated with developers for a better digital experience as a core member of the Design Team for the ISA chapter.

Figma Adobe Photoshop Design Thinking

Skills

Languages

Python
R
Java
C/C++
SQL
HTML
CSS
JavaScript

Frameworks

PyTorch
TensorFlow
OpenCV
Scikit-Learn
Flask
Jupyter
Pandas
NumPy
Git
NiBabel
AWS
Docker
LangChain
Streamlit

Computer Graphics/Designing

Visualization Toolkit
Blender
Figma

Projects

Ink-to-Tint: Manga Artisan

An automated system for manga colorization and style conversion to enhance readability and ease artists' workload. Implements a Pix2Pix conditional GAN in PyTorch with a CNN-based discriminator and U-Net generator for colorizing black-and-white manga pages. Fine-tunes a pre-trained Stable Diffusion model for manga style transfer across four distinct art styles.

Optimized LLM Modeling: Classification and Instruction Fine-Tuning

Integrated an attention mechanism with KV cache optimizations to improve model efficiency. Experimented with positional embedding strategies in Transformers to enhance model performance. Implemented classification and instruction fine-tuning using diverse prompt styles and data processing techniques. Deployed models on AWS SageMaker through Docker containers and an MLOps pipeline for automated training, validation, and deployment. Built a RAG agent for Pokemon battle analysis using LangChain and a Streamlit app.
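As an illustration of the KV cache idea mentioned above, here is a minimal single-head sketch in plain Python. It is not the project's code: the `KVCache` class, the `attend` helper, and the toy vectors are all hypothetical names and values chosen for the example. The point it shows is that keys and values from earlier tokens are stored once and reused, so each new token only computes its own key, value, and query.

```python
import math


def attend(query, keys, values):
    """Scaled dot-product attention of one query over all cached keys/values."""
    scale = math.sqrt(len(query))
    scores = [sum(q * k for q, k in zip(query, key)) / scale for key in keys]
    # Numerically stable softmax over the scores.
    top = max(scores)
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    weights = [e / total for e in exps]
    dim = len(values[0])
    return [sum(w * v[d] for w, v in zip(weights, values)) for d in range(dim)]


class KVCache:
    """Hypothetical cache: stores past keys/values so they are never recomputed."""

    def __init__(self):
        self.keys = []
        self.values = []

    def step(self, query, key, value):
        # Append only the new token's key/value, then attend over everything cached.
        self.keys.append(key)
        self.values.append(value)
        return attend(query, self.keys, self.values)


cache = KVCache()
first = cache.step([1.0, 0.0], [1.0, 0.0], [5.0, 7.0])   # toy token 1
second = cache.step([0.0, 1.0], [0.0, 1.0], [1.0, 3.0])  # toy token 2
```

With a single cached token, the attention weights collapse to 1, so the first output equals the first value vector; later steps blend all cached values.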

Text-based 3D Gaussian Splatting Object Segmentation

A Python-based 3D Gaussian Splatting segmentation model that leverages LangSAM for text-driven 3D segmentation. Incorporates an optimized prompt initialization strategy using K-means clustering for efficient view selection and point sampling. Reduces computational requirements by achieving near-optimal results with only 50% of the input data.

Qualitative Badminton Player Analysis

A computer vision system for tracking player movements and classifying badminton strokes in broadcast videos. Uses a particle filter and custom jersey color detection to track players with 99% accuracy. Predicts badminton strokes using CNNs with 81% accuracy. Detects court boundaries through image binarization, edge detection, Hough lines, and K-means clustering.

Handwritten Polynomial Equation Solver

A Flask-based web application with HTML and CSS for solving image-based handwritten polynomial equations. Performs image segmentation and preprocessing to isolate numerical values and symbols. Implements a CNN model using TensorFlow-Keras and OpenCV to detect handwritten numbers and symbols with 98% accuracy, enhancing usability for students.


Research

Histology Classification for Early Gastric Cancer using AI Model

Conference: Society of American Gastrointestinal and Endoscopic Surgeons (SAGES) 2025

Author(s): Hoseok Seo, Anannya Popat, Caterina Masino, Sojung Kim, Han Hong Lee, Kyo Young Song, Amin Madani

Fine-tuned a ResNet50 model to classify histologic types in early gastric cancer from endoscopic images. Preprocessed a dataset of 2,944 labeled images; the model achieved 91% specificity for undifferentiated types and 87% specificity for differentiated types.
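For readers unfamiliar with the metric, specificity for a class is the share of images not in that class that the model correctly rejects: TN / (TN + FP). The sketch below computes it in plain Python. The `specificity` function and the label lists are hypothetical and made up for illustration; they are not data or code from the study.

```python
def specificity(y_true, y_pred, positive):
    """Specificity = TN / (TN + FP), treating `positive` as the class of interest."""
    tn = sum(1 for t, p in zip(y_true, y_pred) if t != positive and p != positive)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t != positive and p == positive)
    if tn + fp == 0:
        raise ValueError("no negative examples for this class")
    return tn / (tn + fp)


# Made-up labels, purely illustrative (not results from the paper).
y_true = ["diff", "diff", "undiff", "undiff", "diff", "undiff"]
y_pred = ["diff", "undiff", "undiff", "diff", "diff", "undiff"]

spec_undiff = specificity(y_true, y_pred, "undiff")
spec_diff = specificity(y_true, y_pred, "diff")
```

Reporting specificity per class, as above, shows how often each histologic type is wrongly assigned to images of the other type.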

To Be Published

Movie Poster Genre Classification using Federated Learning

Conference: International Conference on Machine Learning and Data Engineering (ICMLDE) 2022

Author(s): Anannya Popat, Lakshya Gupta, Gowri Namratha Meedinti, Dr. Boominathan Perumal

An image-based movie genre classification algorithm leveraging Federated Learning to ensure data privacy in the graphics industry. Designed a decentralized architecture with 81% accuracy for local CNN training with distributed data, reducing storage requirements and ensuring privacy.
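The aggregation rule used in the paper is not stated here. A common choice in Federated Learning is federated averaging (FedAvg), where a server combines locally trained weights in proportion to each client's sample count, so raw data never leaves the client. The sketch below assumes FedAvg; the `fed_avg` function, the flat weight lists, and the client sizes are hypothetical.

```python
def fed_avg(client_weights, client_sizes):
    """Weighted average of client weight vectors (FedAvg, assumed for illustration)."""
    if len(client_weights) != len(client_sizes) or not client_weights:
        raise ValueError("need one size per client and at least one client")
    total = sum(client_sizes)
    n_params = len(client_weights[0])
    averaged = [0.0] * n_params
    for weights, size in zip(client_weights, client_sizes):
        if len(weights) != n_params:
            raise ValueError("all clients must share the same model shape")
        for i, w in enumerate(weights):
            # Each client contributes in proportion to its local dataset size.
            averaged[i] += w * size / total
    return averaged


# Two hypothetical clients with toy two-parameter "models".
clients = [[1.0, 2.0], [3.0, 4.0]]
sizes = [1, 3]
global_weights = fed_avg(clients, sizes)
```

Only the weight vectors are shared with the server, which is what allows training on distributed data without centralizing it.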

Elsevier

An optimized handwritten polynomial equations solver using an enhanced inception V4 model

Journal: Multimedia Tools and Applications 2023

Author(s): Sudha SenthilKumar, K. Brindha, Jyotir Moy Chatterjee, Anannya Popat, Lakshya Gupta, Abhimanyu Verma

This paper introduces a web-based system that uses an enhanced Inception V4 CNN to recognize and solve handwritten polynomial equations (cubic, quadratic, and quintic) by determining the value of x. The model is trained on data from MathNet (arithmetic symbols), MNIST (digits), and EMNIST (alphabet characters).

Springer

A Federated Approach to Converting Photos to Sketch

Conference: Advances in Data-Driven Computing and Intelligent Systems (ADCIS) 2022

Author(s): Gowri Namratha Meedinti, Anannya Popat, Lakshya Gupta, Boominathan Perumal

Proposed a privacy-preserving approach using Federated Learning and auto-encoding to train a camera filter for generating sketched representations of images. The method ensures data security while leveraging the CUFS database for training, addressing privacy concerns in applications like medical imaging, remote sensing, and e-commerce.

Springer

Music Genre Classification using Federated Learning

Conference: Information Systems for Intelligent Systems, Proceedings of ISBM 2022

Author(s): Lakshya Gupta, Gowri Namratha Meedinti, Anannya Popat, Boominathan Perumal

Implemented a Federated Learning (FL) approach for privacy-preserving music genre classification using CNNs and the GTZAN dataset. The method ensures data discretion and copyright protection for music corporations in large-scale collaborative machine learning projects.

Springer

Hi! I am Anannya Popat

Machine Learning Specialist and Artificial Intelligence Researcher based in Toronto

