Deepika Reddy Madduri | AI Software Engineer

terminal · profile.log

▋

I'm Deepika Reddy Madduri, an AI software engineer based in Jersey City, NJ (NYC area). I have about three years building data pipelines, ML models, and production backends in Python, SQL, and Java — lately focused on document AI, RAG, and agentic systems for pharma supply chain and fintech.

View AI work Ask my AI agent

Python · SQL · JavaETL & validationRAG & LLMsFlask · Spring BootJenkins · CI/CD

Deepika · AIJersey City · UTC-5

Pipelines, ML models, and production backends — ETL, validation, and APIs that hold up when the data doesn't.

stack: Python · SQL · Java · LLMscat > ideas.txt

~3Years experience (approx.)

3.9M.S. GPA (Montclair State)

25K+Student records (datasets)

10K+Weekly pipeline records

400KRAG embeddings (project)

About

AI software engineer with about three years building data pipelines, ML models, and production backends in Python, SQL, and Java. Recent focus on document AI, RAG, and agentic systems for pharma supply chain and fintech. Strong background in ETL, validation, REST APIs, NLP (including BERT / Hugging Face), and classical ML (logistic regression, XGBoost), with experience improving data quality, model performance, and reliability on moderate-scale datasets.

Tech Stack

Tools I trust

Programming

Python
Java
SQL
JavaScript/TypeScript

ML & AI

Logistic Regression
XGBoost
BERT (Hugging Face)
ROC-AUC, F1, precision/recall
Threshold tuning
Feature engineering
SHAP
Drift (PSI)
RAG
LLM integration
NER
Intent classification

Data

ETL
Validation
Schema evolution
Quality checks
Normalization
Backfill
Cohort analysis

Backend & APIs

Flask
FastAPI
Spring Boot
REST
Microservices

Data & AI (documents)

PyMuPDF
pdfplumber
Tesseract
PaddleOCR
EasyOCR
LlamaIndex
FAISS
Chroma

Databases

PostgreSQL
SQLAlchemy
Indexing
Query optimization

DevOps & quality

Jenkins
CI/CD
JUnit
Mockito
JSON schema validation

Frontend (as needed)

React
Next.js
Tailwind
Gradio

Selected Work

Projects that ship

Pfizer Supply Chain — Document Processing Extern

Externship (via Extern), Remote · Feb 2026 – Present. Python pipelines for pharmaceutical PDFs (PyMuPDF, pdfplumber); OCR benchmarking (Tesseract, PaddleOCR, EasyOCR); RAG with LlamaIndex, FAISS, and Chroma; LLM evaluation (Gemini, Mistral, Phi-2) and a Gradio chatbot for document Q&A.

Python · PyMuPDF · OCR · LlamaIndex · FAISS · Chroma · Gradio

Autonomous multi-agent bargain detection

7-agent system on 100+ RSS feeds; RAG with ~400K embeddings (SentenceTransformer); MAE/RMSLE evaluation; deployed on Modal (~3s responses).

Python · LLMs · RAG · ChromaDB · Modal

Multi-agent trading with MCP

4-agent platform with MCP tools; Polygon.io, Brave Search, multiple LLMs; SQLite state and Gradio dashboard (~25–30 reasoning steps per cycle).

Python · AutoGen · SQLite · Gradio

BCG GenAI financial chatbot (Forage)

Structured 10-K/10-Q data and rule-based NLP for metrics and insights (June 2025 simulation).

Python · Pandas · NLP · rules

Career

Where I've built

Montclair State University — Technical Analyst, Center for Academic Success and Tutoring

Montclair, NJ · Feb 2024 – Dec 2025

Generated and maintained operational datasets across teams for 25,000+ student records; built dashboards and reports for real-time monitoring of engagement and program health.
Architected fault-tolerant data pipelines integrating Tutor.com, study hall, and recitation data with schema evolution, freshness SLAs (<24h), and automated backfill; sustained <1.5% failure rate and reduced cross-system inconsistencies ~30% across 10K+ weekly records.
Implemented validation and drift monitoring (PSI) with weekly cycles and alerts (PSI > 0.2); held >95% precision and ~85% recall on anomalies, cutting invalid records ~20%.
Built an early-warning model (logistic regression, engineered features, class weights) with threshold 0.7 for recall under ≥0.65 precision; reached 0.81 ROC-AUC vs. 0.63 baseline, with rule-based fallback for low-confidence bands.
Delivered retention modeling with XGBoost and SHAP on cohorts; improved F1 ~17% vs. heuristics and supported interventions that raised retention ~13% in high-risk segments.
Quantified dropout drivers with confidence intervals and effect sizes; thresholds adopted by stakeholders, reducing false positives in targeting ~12%.

Accenture — Software Engineer (Full Stack)

India · Feb 2022 – Aug 2023

Fine-tuned BERT (Hugging Face) for intent and NER on ~15K multilingual fintech queries; raised precision ~18% (with regional recall tradeoffs).
Built locale-aware preprocessing (Unicode normalization, rules, regex) across 6+ countries; cut parsing failures ~35%.
Developed ETL with Pandas/SQLAlchemy on ~8K–10K daily records; resolved schema/null/duplicate issues; cut ingestion errors ~40%.
Exposed inference via Flask REST APIs (~20–25 req/sec, ~150–200 DAU); improved latency to ~280–320ms via tracing, connection reuse, and payload tuning.
Strengthened Jenkins pipelines (build → validation → deploy) with schema gates; cut deployment breakages ~20% on multi-region releases.
Also supported large-scale platforms: SQL/PostgreSQL, Kibana, incident response (200+ critical incidents), and performance work contributing to reliability at scale.

Infor — Junior Software Engineer

Hyderabad, India · Mar 2021 – Jan 2022

Refactored Spring Boot endpoints using composite BTREE-indexed queries and pagination; cut latency ~480ms → ~310ms under ~20 concurrent users.
Built an OCR microservice (Python, Tesseract, logistic regression features); precision ~0.68 → ~0.81 on ~1.2K labeled samples.
Enforced integrity via JSON schema, type/null checks, and uniqueness; cut duplicates ~25% and stabilized transaction success >97% on ~8K weekly entries.
Improved tests (JUnit/Mockito) and reduced GC pressure ~12% via profiling and DTO refactors; standardized Jenkins configs and cut deployment failures ~20%.

Ask my AI Agent

Ask anything about my background, projects, or how I approach AI engineering problems. This agent is powered by the same tooling I use in production systems.

Examples

The agent is tuned to talk about Deepika's background, projects, and engineering decisions.

Contact

Let's build something real

Open to AI/ML and backend-heavy roles in the New York metro area (Jersey City, NJ).