Experience

October 2024 - Present

AI Conversation
- Team Lead

Developed RAG (Retrieval-Augmented Generation) system for document-based QA and summarization.
Self-hosted LLM and retrieval pipelines for internal knowledge management and AI assistants.
Implemented hybrid retrieval (semantic + keyword) for multilingual documents.
Integrated document embedding, chunking, and query expansion logic for improved accuracy.
Deployed full-stack system with scalable self-hosted infrastructure.

August 2024 - October 2024

AI Univoice
- Team Lead

Built speech-to-speech translation system supporting 5 languages: Japanese, Chinese, English, Korean, and Vietnamese.
Integrated VAD, ASR, LLM, and TTS into a unified pipeline for real-time multilingual speech conversion.
Self-hosted entire architecture using Kubernetes (K8s) with GPU scaling and load balancing.
Designed modular architecture for multi-language voice translation and synthesis.

June 2024 - July 2024

AI Offline Transcription
- Team Lead

Developed fully self-hosted transcription system for Japanese audio uploads.
Implemented diarization, timestamp alignment, and automatic summarization.
Optimized pipeline using local GPU inference with Whisper and LLM summarizers.
Built user interface for audio upload and real-time progress tracking.

HBLab | Ha Noi City, Viet Nam
AI Engineer

October 2019 - Present

Propose and develop AI features according to customer requirements.
Maintain existing features and add new ones.
Write documentation for developed features.
Keep up to date with new developments in the field of AI.
Research and develop new models (mainly Text Detection and Text Recognition).
Develop AI features and deploy them on cloud services (AWS, Google Cloud).
Design systems and monitor applications on cloud services (AWS).

WOWAI | New York City, USA
AI Engineer - Remote (Freelancer)

March 2023 - May 2023

Proposed and developed AI features according to requirements.
Developed AI features and deployed them on cloud services (AWS).

Projects

November 2023 - Present

Horse Ranking Realtime
- Team

Use YOLOv7 and ByteTrack (BoT-SORT) for horse detection and tracking in the stadium; optimize pre-processing and post-processing to reduce image processing time
Use optical flow to stabilize frames when the camera shakes
Fine-tune a Re-ID model (Torchreid) to handle overlapping horses
Find each horse's position using a perspective transform
Convert YOLOv7 PyTorch weights to LibTorch (C++) (GitHub)

August 2023 - September 2023

Battery Health Check
- Individual

Use Prophet, XGBoost, and CatBoost to predict when a car battery needs to be replaced
Fine-tune and ensemble the 3 models for best results
Deploy the model to predict the daily voltage price on Azure

March 2023 - September 2023

Human Tracking Realtime
- Team

Use YOLOX and ByteTrack (BoT-SORT) for people detection and store commuter tracking; optimize pre-processing and post-processing to reduce image processing time
Use Retina-Net to detect faces; modify the code to allow training the model on custom data without landmarks
Implement an age and gender estimator from a paper
Use face recognition and Faiss to identify people on the VIP list or blacklist
Build a Docker image that runs on multiple operating systems (Windows, Ubuntu, Rocky)

June 2023 - July 2023

DeepFake App
- Individual

Deploy an app that swaps faces into an existing video
Use face detection and face landmarks to crop face images
Use a GAN to swap two faces
Write an API and deploy the AI model to the cloud

May 2023 - June 2023

Remove fog in images
- Individual

Use CycleGAN to restore clear images from foggy images
Implement the method from the paper (paper)
Write an API and deploy the AI model to the cloud

April 2023 - May 2023

Convert Python Code to C++ (OpenCV)
- Team

Convert OpenCV Python code to C++ to run on mobile devices

February 2023 - March 2023

C++ Optimization
- Team

Optimize algorithms to speed up the AI engine's runtime.

September 2022 - March 2023

Extract Information from Health Certificate
- Individual

Use YOLOv7 for table extraction.
Use Text Detection and Text Recognition to extract text in the table:

Text Detection model based on DBNet, with higher accuracy and lower inference time.
Self-developed Text Recognition model based on the transformer architecture, using rectangular input images.
Generate synthetic data for OCR.

Write logic for extracting key-value pairs from the table.
Build a Docker image for deploying the AI features.

June 2022 - September 2022

Cost Optimization
- Individual

Propose a cost-effective solution based on the actual situation, using the offline API with Vertex AI (Google Cloud).
Rewrite the API code using Gunicorn
Use Google Cloud Storage to store data
Write a Dockerfile to build the Docker image and upload the image to Google Container Registry
Write the CI/CD configuration for Cloud Run (Google Cloud)

January 2022 - June 2022

Key Information Extraction from Invoice
- Individual

Use Text Detection and Text Recognition to extract text from the invoice
Use UNet for feature extraction

Extract features from image regions containing text (output of Text Detection)

Implement SDMG-R for key information extraction

Combine features extracted by UNet with normalized OCR results to build a Graph Convolutional Network model for extracting key-value information

Deploy the API on AWS EC2 and serve it through AWS API Gateway

June 2021 - January 2022

Sole Segmentation App (app info)
- Team

AI Feature

Write a tool to crawl data
Use OpenCV for image processing
Train a DeepLabV3+ segmentation model with a SOTA backbone (HRNet)

Backend

Convert the model to torch-neuron format to deploy it on Amazon SageMaker (AWS EC2 Inf1)
Set up the system, including VPC, SageMaker, RDS, API Gateway, Lambda, EventBridge, CloudFront, CloudWatch, ...
Set up CI/CD with CodeCommit, ECR, and ECS
Set up auto scaling for EC2 instances, with monitoring and alerting in Grafana
Schedule automatic data updates (EventBridge, Lambda, RDS)
Set up WAF

May 2021 - June 2021

Human Temperature Identification
- Individual

Use YOLOv5 for object detection.
Use Text Recognition to extract text from the detected object
Convert the PyTorch model to ONNX

April 2021 - May 2021

Airbag Segmentation
- Team

Use DeepLabV3 to extract the airbag in an accident

April 2021 - May 2021

Remove object background
- Individual

AI Feature

Use edge detection (deep learning) to detect the edges of an object
Use OpenCV to smooth the edges of the object

Backend

Deploy the AI feature on AWS EC2
Use Lambda and API Gateway to serve the API
Write Terraform code to set up the AWS environment

October 2020 - April 2021

Predict player stats in baseball game
- Team

Perform EDA and ETL on tabular data
Build models with machine learning algorithms (XGBoost, CatBoost, ...)
Integrate the AI model into a mobile application

April 2020 - October 2020

Real Estate Recommendation System
- Team

Collect data from the website
Perform EDA and ETL on tabular data
Build a hybrid recommendation system with Content-based Filtering and Collaborative Filtering
Integrate the AI model into a web application