Augustine Nguyen

Data Scientist · Financial Risk & Growth Analytics · Computer Vision · Republic of Korea

Where credit intelligence meets customer growth — turning data into revenue and trust.


Introduction

I am a Data Scientist with deep industry experience spanning financial risk modeling, growth analytics, enterprise AI systems, and computer vision. My work sits at the boundary between statistical rigor and production engineering — where models must be both mathematically defensible and reliable at scale.

I hold a dual focus that few practitioners combine: the quantitative discipline of credit risk and fraud detection (probabilistic scoring, causal identification, regulatory interpretability) and the experimental culture of growth analytics (attribution modeling, behavioral segmentation, Bayesian experimentation). I believe these domains reinforce each other — risk thinking sharpens experimentation, and growth thinking keeps models accountable to business outcomes. Increasingly, this work extends into perception systems — object detection and multi-object tracking — where modeling meets the physical world at the edge.


Selected Work

Champion — FPT AI Hackathon 3 (2025)
Global competition across all FPT offices worldwide. Team Cẩm Y Vệ (FKR-KTX) placed 1st in the KT/KB Technical Solutions track with AISOL — a production-grade Japanese-language enterprise RAG platform. The system processes heterogeneous documents (text, tables, diagrams) via a parallelized multimodal pipeline built on Qdrant · Gemini · Qwen, with sub-second retrieval over large corpora.


Research & Technical Focus

Financial Data Science

  • Credit Risk Modeling — PD/LGD/EAD estimation, scorecard development, vintage analysis, model monitoring
  • Fraud Detection — real-time anomaly detection, graph-based fraud network analysis, behavioral sequencing
  • Investment Analytics — portfolio risk attribution, securities quality monitoring, performance measurement
  • Causal Inference — instrumental variables, difference-in-differences, propensity score matching, policy evaluation

Marketing & Growth Analytics

  • Customer Segmentation — RFM analysis, behavioral clustering, customer lifetime value (CLV) modeling
  • Churn & Retention — survival analysis, early-warning systems, uplift modeling, retention triggers
  • Experimentation — hypothesis design, statistical power, CUPED variance reduction, Bayesian A/B testing
  • Attribution — multi-touch attribution, media mix modeling (MMM), incrementality testing
  • Funnel Analytics — conversion optimization, pricing elasticity, personalization engines

Generative AI & Retrieval Systems

  • Retrieval-Augmented Generation — dense and sparse retrieval, hybrid search, re-ranking, multimodal pipelines
  • LLM Engineering — prompt optimization, fine-tuning, agentic workflows, evaluation frameworks

Computer Vision & Perception

  • Object Detection — YOLO and SSD families, dataset curation, model evaluation
  • Multi-Object Tracking — DeepSORT, ByteTrack, StrongSORT, re-identification, Kalman filtering
  • Edge Inference — TensorRT optimization, NVIDIA Jetson deployment, ROS 2 integration

Core Toolkit

DomainMethods & Tools
Generative AI & RAGLLM Pipelines · RAG · Vector Search · Qdrant · Gemini · Qwen · LangChain · Prompt Engineering
Computer VisionYOLO · SSD · DeepSORT · ByteTrack · StrongSORT · Re-ID · TensorRT · OpenCV · Jetson · ROS 2
ML & ModelingScikit-learn · XGBoost · LightGBM · Statsmodels · Survival Analysis · SHAP · LIME
Financial AnalyticsCredit Scoring · Fraud Rules Engine · Risk Dashboards · Basel Compliance Metrics
Marketing AnalyticsCohort Analysis · CLV Modeling · MMM · Funnel Analytics · Propensity Scoring
ExperimentationA/B · Multivariate · Bayesian A/B · Causal Inference (IV, DiD, PSM) · CUPED
Data & BISQL · Pandas · dbt · Apache Spark · Tableau · Metabase · Chart.js
Streaming & PipelinesApache Kafka · Airflow · ELK Stack · Kinesis · ETL/ELT
DatabasesPostgreSQL · Oracle · ClickHouse · MongoDB · Redis · OpenSearch
Cloud & InfraAWS · Azure · Docker · Kubernetes
LanguagesPython · Rust · SQL · R · Golang · SAS · TypeScript

Credentials

  • AWS Certified Data Engineer — Associate
  • AWS Certified AI Practitioner — Foundational
  • Microsoft Certified: Azure AI Fundamentals
  • Nanodegree: Data Engineering with Microsoft Azure
  • Astronomer Certification: Apache Airflow Fundamentals

View my Projects or browse the full CV.
Contact: augustino0890@gmail.com · LinkedIn · GitHub · Twitter — open to DS roles in fintech, banking, growth, product analytics, and enterprise AI / RAG systems.