👋 Hello, I'm

May Sabai

Data Science Student |

I’m a Data Science student at San José State University with hands-on experience in data engineering, analytics, and machine learning systems. I’ve worked with production databases, real-time data pipelines, multi-sensor robotics data, and large public datasets. I’m especially interested in building reliable data systems and transforming raw data into actionable insights — bridging the gap between data infrastructure and decision-making.

May Sabai

Experience & Leadership

AI & Robotics Data Collection Engineer Intern

Roboforce Inc Feb 2026 - present

Operate humanoid robots in live customer scenarios, collecting 500MB+ multi-sensor (camera, LiDAR) data per session to support ML training pipelines.

Set up and calibrate sensors, improving data reliability and reducing capture errors by ~20%.

Conduct QA on 2D/ 3D/ 6D labeled datasets (1,000+ samples/week), defining validation criteria to improve downstream model performance.

Identify and document robot and sensor issues, collaborating with ML and software engineers to validate fixes and maintain >95% usable data rate.

Follow strict safety and equipment protocols, ensuring zero operational incidents.

collecting 500MB+ data

Project Management & Software Development Intern

Jamiez LLC May 2025 - Sept 2025

Managed and optimized application database using Supabase (PostgreSQL), supporting up to 1GB of production data with real-time synchronization during A/B testing and post-launch phases.

Designed authentication and role-based access controls (RBAC) to protect sensitive client data and enforce admin-level permissions across systems.

Collaborated with full-stack mobile developers, leveraging AI-assisted development tools (Cursor, Claude AI) to accelerate feature implementation, debug backend logic, and improve development efficiency.

Took initiative in project planning across development, A/B testing, and launch phases — streamlining execution and delivering a 3-month roadmap in 2 months.

Built a structured consulting intake workflow to dynamically capture customer requirements, improving quote accuracy and increasing customer satisfaction by ~20%.

Took ownership of an existing AI chatbot project; integrated Gemini API with RAG architecture, connected it to the database and application layers, and enabled 24/7 automated client support.

Promoted to Project Leader, entrusted to oversee a team of 5 developers and lead the full mobile application development lifecycle.

Promoted to Project Leader

Machine Learning Teaching Assistant – AI & Cybersecurity Summer Camp

San José State University Research Foundation June 2025 - July 2025

Co-developed and delivered curriculum on Machine Learning and Computer Vision (YOLOv8, Jetson Orin Nano) for 40+ students, covering data preprocessing, model training, and real-time inference workflows.

Instructed Python, Linux, and ML pipeline fundamentals, simplifying advanced AI concepts for diverse learners while introducing best practices in data handling and model evaluation.

Supported hands-on projects where 50%+ of student teams successfully implemented AI/ML-based solutions, deploying models on edge devices for live object detection.

simplified advanced AI concepts for diverse learners

Chief Partnership Officer

Bridge Burma May 2023 - June 2024

Led fundraising with an international team spanning Canada, the UK, China, Hong Kong, and Singapore to launch Bridge Burma’s first scholarship program.

Provided 12 students in Myanmar with access to English proficiency tests, enabling them to apply for global college scholarships.

The initiative was featured on Smart Ed DVB, a major national news channel known for spotlighting educational opportunities, amplifying our impact across the country.

launched first scholarship program
featured on Smart Ed DVB

Math & Science Tutor

MSTRC, De Anza College Oct 2022 - June 2023

Tutored advanced mathematics and sciences, helping students progress from barely passing to consistently earning As. Recognized for strong impact on student performance and promoted to Senior Tutor within 6 months.

Promoted to Senior Tutor

President & Founder

Burmese Student Association Jan 2022 - June 2023

Scaled membership from 20 to 200 students within one year by developing and executing a data-informed outreach and social media growth strategy.

Established a sustainable model that inspired the creation of BUSA chapters at other community colleges.

Increased annual cultural festival attendance from 50 to 300+ guests across 7 campuses through cross-campus partnerships, storytelling campaigns, and structured promotional planning.

Following my strategies, the annual Myanmar New Year (Thingyan) festival is now celebrated for 4 consecutive years and embraced as a tradition across campuses.

200+ Members

Student

L'SPACE Mission Concept Academy Jan 2022 - April 2022

Participated in NASA & ASU's L'SPACE Mission Concept Academy program, gaining hands-on experience in space mission design and development.

Communication Manager

Laboratory of Robotics and Artificial Intelligence (LORA) July 2020 - June 2021

Managed communications and coordinated activities within the Laboratory of Robotics and Artificial Intelligence, facilitating collaboration between research teams and external stakeholders.

Featured Projects

Rent Affordability Analysis

Analyzed rent affordability using Zillow and U.S. government datasets, cleaning and merging data with Pandas. Identified regional rent and income trends and visualized key insights using Matplotlib.

Python Pandas Matplotlib Data Analysis
2 Data Sources
Regional Trends
Visual Insights

Breast Cancer Tumor Analysis

Analyzed ultrasound imaging data to study breast cancer tumor location, size, and type. Preprocessed image data and applied a CNN model for tumor prediction and classification.

Python CNN Machine Learning Image Processing
CNN Model
Tumor Classification
Medical Imaging

BudgetFlow

Full-stack personal finance app built entirely in Python using Tkinter and SQLite, featuring secure user management, dynamic visualizations, and multi-user finance tracking.

Python Tkinter SQLite Matplotlib Figma
Full-Stack Python App
Multi-User Support
Secure Authentication

Ticket Pro

Event management platform with optimized MySQL database, achieving sub-500ms query execution times and secure authentication for 100+ users.

MySQL Node.js RESTful API bcrypt
10+ Tables
<500ms Query Time
30% Less Inconsistencies

Water Quality Analysis

Comprehensive analysis of water quality factors across multiple cities and countries, processing 3,276 data entries with advanced visualization techniques.

Python Matplotlib Linux Data Analysis
3,276 Data Points
Multiple Countries

Recipe Genius

Java application for generating recipe suggestions based on available ingredients, leading a team of 3 developers with 30% improved suggestion accuracy.

Java OOP External API Team Leadership
3 Team Members
30% Better Accuracy

Technical Skills

Programming

Python (Pandas, NumPy, Matplotlib) SQL (PostgreSQL) R Linux

Data Engineering

Database Design & Architecture (Supabase / PostgreSQL) Data Modeling ETL & Data Cleaning Real-time Data Synchronization Role-Based Access Control (RBAC) API Integration (Gemini API) Retrieval-Augmented Generation (RAG Architecture)

Data Analysis

Exploratory Data Analysis (EDA) Data Wrangling & Preprocessing Statistical Analysis Data Visualization (Matplotlib, Seaborn) Performance Metrics (Precision, Recall, F1-Score)

Machine Learning & Data Systems

PyTorch / TensorFlow CNN, ResNet18 Dataset QA & Validation Multi-Sensor Data Handling (Camera, LiDAR) Model Evaluation & Hyperparameter Tuning

Tools

Git Google Sheets (Data Tracking & QA Documentation) Jetson Orin Nano YOLOv8