Projects
Age- and Context-Appropriate AI Chat for Children
Overview: An interactive AI chatbot tailored to three developmental bandsโages 5โ7, 8โ12, and 13+; built with OpenAI ChatGPT API and Retrieval-Augmented Generation (RAG) for accurate, age-appropriate, context-aware replies.
How it works:
Collects age and region at conversation start to customize tone, vocabulary, and examples.
Uses RAG to ground responses in curated knowledge, improving safety and relevance.
Prioritizes responsible interaction patterns for kids.
Capstone Project โ Delayed Antibiotics Administration
Partner: Childrenโs Hospital of Orange County (CHOC)
Duration: Jan 2025 โ Jun 2025
Goal: Predict and classify the severity of delayed antibiotic administration to support proactive clinical intervention.
My contributions:
Data Engineering: Built an ETL pipeline to consolidate multi-source pediatric data; enabled reliable feature engineering and modeling.
Model Development: Led a Random Forest classifier; hyperparameter tuning, feature analysis, and baselines against alternative models.
Evaluation & Communication: Assessed with ROC curves and confusion matrices; presented findings and implications to clinical stakeholders.
Collaboration & Management: Translated clinician requirements, validated outputs, and maintained code/docs in GitHub.
๐ Project site
Breast Cancer Recurrence Prediction
Focus: This project applies Bayesian Logistic Regression to predict five-year breast cancer recurrence using a structured clinical dataset (n = 275). The goal is to build an interpretable, uncertainty-aware modeling workflow that supports clinical decision-making.
Outcome: The Bayesian logistic regression model identified axillary lymph node involvement (3โ11 nodes) and higher malignancy degree as strong predictors of recurrence, while small tumors (10โ14 mm) were associated with lower risk. Using a ROC-optimized threshold of 0.33, the model achieved ~67% sensitivity, ~71% specificity, and ~70% cross-validated accuracy, providing interpretable, uncertainty-aware risk estimates for breast cancer recurrence.
๐ Detailed report ๐ R code
Dolendar System Requirements
Scope: Translated a 2-hour client consultation into a comprehensive, structured requirements document.
Deliverable: Clear user stories, functional/non-functional specs, constraints, and acceptance criteria to guide development.
๐ Full report