Augustine Nguyen
Data Scientist · Financial Risk & Growth Analytics · Computer Vision · Republic of Korea
Where credit intelligence meets customer growth — turning data into revenue and trust.
Introduction
I am a Data Scientist with deep industry experience spanning financial risk modeling, growth analytics, enterprise AI systems, and computer vision. My work sits at the boundary between statistical rigor and production engineering — where models must be both mathematically defensible and reliable at scale.
I hold a dual focus that few practitioners combine: the quantitative discipline of credit risk and fraud detection (probabilistic scoring, causal identification, regulatory interpretability) and the experimental culture of growth analytics (attribution modeling, behavioral segmentation, Bayesian experimentation). I believe these domains reinforce each other — risk thinking sharpens experimentation, and growth thinking keeps models accountable to business outcomes. Increasingly, this work extends into perception systems — object detection and multi-object tracking — where modeling meets the physical world at the edge.
Selected Work
Champion — FPT AI Hackathon 3 (2025)
A global competition spanning all FPT offices. Team Cẩm Y Vệ (FKR-KTX) placed 1st in the KT/KB Technical Solutions track with AISOL, a production-grade Japanese-language enterprise RAG platform. The system processes heterogeneous documents (text, tables, diagrams) through a parallelized multimodal pipeline built on Qdrant, Gemini, and Qwen, and delivers sub-second retrieval over large corpora.
Research & Technical Focus
Financial Data Science
- Credit Risk Modeling — PD/LGD/EAD estimation, scorecard development, vintage analysis, model monitoring
- Fraud Detection — real-time anomaly detection, graph-based fraud network analysis, behavioral sequencing
- Investment Analytics — portfolio risk attribution, securities quality monitoring, performance measurement
- Causal Inference — instrumental variables, difference-in-differences, propensity score matching, policy evaluation
Marketing & Growth Analytics
- Customer Segmentation — RFM analysis, behavioral clustering, customer lifetime value (CLV) modeling
- Churn & Retention — survival analysis, early-warning systems, uplift modeling, retention triggers
- Experimentation — hypothesis design, statistical power, CUPED variance reduction, Bayesian A/B testing
- Attribution — multi-touch attribution, media mix modeling (MMM), incrementality testing
- Funnel Analytics — conversion optimization, pricing elasticity, personalization engines
Generative AI & Retrieval Systems
- Retrieval-Augmented Generation — dense and sparse retrieval, hybrid search, re-ranking, multimodal pipelines
- LLM Engineering — prompt optimization, fine-tuning, agentic workflows, evaluation frameworks
Computer Vision & Perception
- Object Detection — YOLO and SSD families, dataset curation, model evaluation
- Multi-Object Tracking — DeepSORT, ByteTrack, StrongSORT, re-identification, Kalman filtering
- Edge Inference — TensorRT optimization, NVIDIA Jetson deployment, ROS 2 integration
Core Toolkit
| Domain | Methods & Tools |
|---|---|
| Generative AI & RAG | LLM Pipelines · RAG · Vector Search · Qdrant · Gemini · Qwen · LangChain · Prompt Engineering |
| Computer Vision | YOLO · SSD · DeepSORT · ByteTrack · StrongSORT · Re-ID · TensorRT · OpenCV · Jetson · ROS 2 |
| ML & Modeling | Scikit-learn · XGBoost · LightGBM · Statsmodels · Survival Analysis · SHAP · LIME |
| Financial Analytics | Credit Scoring · Fraud Rules Engine · Risk Dashboards · Basel Compliance Metrics |
| Marketing Analytics | Cohort Analysis · CLV Modeling · MMM · Funnel Analytics · Propensity Scoring |
| Experimentation | A/B Testing · Multivariate Testing · Bayesian A/B · Causal Inference (IV, DiD, PSM) · CUPED |
| Data & BI | SQL · Pandas · dbt · Apache Spark · Tableau · Metabase · Chart.js |
| Streaming & Pipelines | Apache Kafka · Airflow · ELK Stack · Kinesis · ETL/ELT |
| Databases | PostgreSQL · Oracle · ClickHouse · MongoDB · Redis · OpenSearch |
| Cloud & Infra | AWS · Azure · Docker · Kubernetes |
| Languages | Python · Rust · SQL · R · Go · SAS · TypeScript |
Credentials
- AWS Certified Data Engineer — Associate
- AWS Certified AI Practitioner — Foundational
- Microsoft Certified: Azure AI Fundamentals
- Nanodegree: Data Engineering with Microsoft Azure
- Astronomer Certification: Apache Airflow Fundamentals
View my Projects or browse the full CV.
Contact: augustino0890@gmail.com · LinkedIn · GitHub · Twitter — open to DS roles in fintech, banking, growth, product analytics, and enterprise AI / RAG systems.
