Data Scientist & AI Engineer — Columbus, OH

Building AI systems that actually work in production.

6+ years turning complex data into decisions. From large-scale risk models at Amazon to LLM-powered research tools at Ohio State — I build things end to end, care about reliability, and communicate clearly across technical and non-technical teams.

Get in touch GitHub LinkedIn

About Me

Background, impact, and what I'm working toward

Years in data science & ML engineering

$5M+

Business impact delivered across Amazon projects

Peer-reviewed IEEE publications

200+

Analysts mentored and trained

My path into AI started with a background in Electronics Engineering and grew through five years at Amazon, where I built production ML systems for fraud detection and risk analytics at global scale. I completed my Master's in Information Technology at the University of Cincinnati in 2024, and I'm currently a Senior Research Data Analyst at Ohio State University — building ML pipelines and AI-powered tools for academic research.

Right now I'm deepening my expertise in generative AI, LLM fine-tuning, and agentic architectures through a dedicated AI engineering program — and actively building enterprise-grade AI projects in healthcare intelligence, fraud detection, and ML observability.

I hold a DP-203 Microsoft Certified: Azure Data Engineer Associate certification and have two IEEE publications in applied ML and NLP.

Technical Skills

What I work with day to day

Machine Learning

Python, PyTorch, TensorFlow, Scikit-learn, XGBoost, classification, regression, anomaly detection, time-series forecasting (SARIMA, Prophet)

Generative AI & LLMs

LangChain, LangGraph, OpenAI API, RAG pipelines, embeddings, vector databases, fine-tuning (LoRA/PEFT), agentic workflows, LLM evaluation

MLOps & Production

MLflow, Docker, GitHub Actions (CI/CD), AWS SageMaker, Azure, model versioning, drift detection, automated retraining, observability

Data Engineering

SQL (advanced), PostgreSQL, MySQL, Pandas, NumPy, ETL pipelines, Kafka, feature engineering, large-scale data processing

NLP & Text Analytics

Text classification, NER, topic modeling, sentiment analysis, summarization, Hugging Face transformers, document understanding

Visualization & BI

Power BI, Tableau, Matplotlib, Seaborn — executive dashboards, KPI reporting, data storytelling for non-technical audiences

Projects

Enterprise AI systems built from scratch

Healthcare AI

Enterprise RAG Policy Intelligence System

An enterprise-grade RAG application that ingests 100+ healthcare policy documents, compliance guidelines, and research papers. Stakeholders query across the full document corpus in natural language and receive cited, accurate answers with source attribution. Includes auto-generated summary reports and data visualizations — deployed on AWS with hallucination detection and guardrails.

LangChain OpenAI GPT-4 FAISS Streamlit AWS Docker Python

FinTech / Risk

Agentic Fraud Detection & Risk Reporting Pipeline

A multi-agent system built with LangGraph: Agent 1 ingests and validates transaction data, Agent 2 runs XGBoost anomaly detection and risk scoring, Agent 3 auto-generates executive risk reports with visualizations — fully automated end-to-end. Full audit trail logging, retries, and fallbacks. Real-time Streamlit dashboard showing risk scores and business KPIs.

LangGraph XGBoost OpenAI AWS SageMaker MLflow Streamlit Docker

MLOps

Real-Time ML Monitoring Dashboard with Automated Drift Detection

A production ML observability platform monitoring model performance, data quality, and feature drift in real time. Automatically triggers retraining when drift thresholds are breached and notifies stakeholders. Dashboard shows live accuracy, precision/recall trends, feature importance evolution, and business KPI impact — updated on a configurable schedule.

MLflow Streamlit AWS Lambda GitHub Actions Docker Python Pandas

Experience

Where I've worked and what I've built

Senior Research Data Analyst Dec 2024 — Present

Ohio State University, Columbus OH

Build and deploy ML and AI pipelines for the C-FARM interdisciplinary research initiative
LLM-powered applications using LangChain, OpenAI APIs, and RAG architectures
Power BI dashboards and Python visualizations for academic leadership stakeholders
AWS deployment with MLflow, CI/CD, Docker, and drift monitoring

Senior Data Analyst Jan 2020 — Dec 2022

Amazon Development Center — Buyer Risk Program, India

Production ML systems for fraud detection generating $500K+ annual business impact
Cross-functional fraud analytics project saving $4.9M — international stakeholders
ETL pipelines and SQL optimization — 30% latency reduction for 200+ analysts
Mentored 200 analysts; led a team of 25 as Team Lead

Data Analyst Jul 2017 — Dec 2019

Amazon Development Center — Gift Cards, India

Predictive ML models for segmentation, churn, and demand forecasting
International marketplace launches across AU, TR, DE, FR, IT, ES
Power BI dashboards tracking KPIs and predictive metrics

Research Intern Jan 2023 — Jul 2024

University of Cincinnati, OH

Built production web application with CI/CD, Docker, and REST API
Python, SQL, and ML training delivered to 200+ students and staff

Publications

Peer-reviewed IEEE research

IEEE ICMI 2024

Predicted Cyberbullying Behavior in Social Media Using Machine Learning Techniques

NLP classification pipeline on large-scale Twitter data. CatBoost achieved 83% accuracy and 84% F1 score for content classification across 10 cyberbullying categories.

IEEE CPS, CSCI 2023

Smart Temperature Management in Buildings Using Predictive Analysis by Machine Learning Algorithms

Evaluated regression and ensemble ML models to predict building energy loads. Random Forest achieved best performance on R², MAE, and MSE metrics.

Get in Touch

Open to data science and AI engineering opportunities

I'm actively looking for roles in data science, ML engineering, and applied AI — particularly in healthcare, FinTech, or AI/SaaS environments. If you're working on something interesting, I'd love to hear from you.

ritikadharamkar@gmail.com linkedin.com/in/ritikadharamkar github.com/RitikaDharamkarJ