Le Trong Hieu
Education
Bachelors of Science - Electronics and Telecommunication (7.2/10.0)
Posts and Telecommunications Institute of Technology, Ha Noi City, Viet Nam
Relevant Coursework
Computer Science: Data Structures, Algorithms, C++
Experiences
October 2024 - Present
AI Conversation -
Team Lead
- Developed RAG (Retrieval-Augmented Generation) system for document-based QA and summarization.
- Self-hosted LLM and retrieval pipelines for internal knowledge management and AI assistants.
- Implemented hybrid retrieval (semantic + keyword) for multilingual documents.
- Integrated document embedding, chunking, and query expansion logic for improved accuracy.
- Deployed full-stack system with scalable self-host infrastructure.
August 2024 - October 2024
AI Univoice -
Team Lead
- Built speech-to-speech translation system supporting 5 languages: Japanese, Chinese, English, Korean, and Vietnamese.
- Integrated VAD, ASR, LLM, and TTS into a unified pipeline for real-time multilingual speech conversion.
- Self-hosted entire architecture using Kubernetes (K8s) with GPU scaling and load balancing.
- Designed modular architecture for multi-language voice translation and synthesis.
June 2024 - July 2024
AI Offline Transcription -
Team Lead
- Developed fully self-hosted transcription system for Japanese audio uploads.
- Implemented diarization, timestamp alignment, and automatic summarization.
- Optimized pipeline using local GPU inference with Whisper and LLM summarizers.
- Built user interface for audio upload and real-time progress tracking.
HBLab | Ha Noi City, Viet Nam
AI Engineer
October 2019 - Now
- Proposal and develop AI features according to customer requirements.
- Maintains developed features and updates new features.
- Writes documentation for developed features.
- Update new knowledge in the field of AI.
- Research and develop new models (mainly Text Detection and Text Recognition)
- Develop AI features and deploy on cloud service (AWS, Google Cloud)
- System Design and monitoring app on cloud service (AWS)
WOWAI | New York City, USA
AI Engineer - Remote (Freelancer)
March 2023 - May 2023
- Proposal and develop AI features according to requirements.
- Develop AI features and deploy on cloud service (AWS)
Skills
Programming Languages: Python, C++, and JavaScript
Framework: Pytorch, OpenCV
Operating Systems: Linux/Unix system, Windows
Software: Pycharm, Visual Code
Projects
November 2023 - Now
Horse Ranking Realtime -
Team
- Use YoloV7 and Bytetrack (BotSort) for horse detection and horse tracking in the stadium. Optimized pre-process and post-process to reduce image processing time
- Use optical flow to stabilize the frame when the camera shakes
- Fine-tune Re-ID model (Torch-REID) when overlapping horses
- Find the horse's position using perspective transform
- Convert weight YOLOV7 pytorch to Libtorch (C++) github
August 2023 - September 2023
Battery Health Check -
Indivisual
- Use Prophet, XGBoost, CatBoost to predict when to replace the battery in a car
- Finetune and Ensemble 3 models for best results
- Deploy model to predict daily Voltage price on Azuze Cloud
March 2023 - September 2023
Human Tracking Realtime -
Team
- Use YoloX and Bytetrack (BotSort) for people detection and store commuter tracking. Optimized pre-process and post-process to reduce image processing time
- Use Retina-Net to detect faces, modified code to allow training model with custom data without landmark
- Implement age-gender estimator from paper
- Use face recognition and Faiss to identify people in the VIP or Black list
- Build docker image that runs on multiple OS (Windows, Ubuntu, Rocky)
June 2023 - July 2023
DeepFake App -
Individual
- Deploy an app that swaps faces into an existing video
- Face Detection, Face Landmark for crop face image
- Use GAN for swapping 2 face
- Write API and deploy AI model with Cloud
May 2023 - June 2023
Remove fog in images -
Indivisual
- Use Cycle Gan to sharpen images with fog images
- Implement code from this paper (paper)
- Write API and deploy AI model with Cloud
April 2023 - May 2023
Convert Code Python to C++ (OpenCV) -
Team
- Convert code OpenCV python to c++ to run on mobile devices
February 2023 - March 2023
C++ Optimization -
Team
- Algorithm optimization to speed up AI Engine runtime.
September 2022 - March 2023
Extract Information from Health Certificate -
Individual
- Using YOLO V7 for table extraction.
- Using Text Detection and Text Recognition to extract text in the table:
- Text Detection model is developed based on DBNet but gives higher accuracy and less inference time.
- Self-developed Text Recognition model according to transformer architecture with rectangular image size.
- Generate synthetic data for OCR
- Write logic for extracting key-value in table
- Build Docker image for deploying AI features
June 2022 - September 2022
Cost Optimization -
Individual
- Based on the actual situation, propose a cost-effective solution, and use offline API with vertex AI (Google Cloud).
- Re-write API code using Gunicorn
- Using Google Cloud Storage for store data
- Write Dockerfile for building docker image and upload the image to Google Container Registry
- Write Config for CI/CD with Cloud Run (Google Cloud)
January 2022 - June 2022
Key Information Extraction from Invoice -
Individual
- Using Text Detection and Text Recognition to extract text in the invoice
- Using UNet for feature extraction
- Extract feature from an image containing text (output from Text Detection)
- Implement SDMG-R for key information extraction
- Using features extracted from UNet combined with results from normalized OCR to build a Graph Convolution Network model for extracting key-value information
- Deploy API with AWS EC2 and serving with AWS API Gateway
Sole Segmentation App (
app info)
June 2021 - January 2022
-
Team
- AI Feature
- Write a tool for crawler data
- Using OpenCV for processing image
- Training Segmentation model Deeplab V3+ with SOTA backbone (HRNet)
- Backend
- Convert model format to torch-neuron to deploy the model to amazon Sagemaker (AWS EC2 Inf1)
- Setting the system including VPC, Sagemaker, RDS, API Gateway, Lambda, Event Bridge, CloudFront, Cloud Watch, ...
- Setting CI/CD with Codecommit, ECR, ECS
- Setting Auto Scale EC2 instances, monitoring and alert with Grafana
- Schedule to automatically update data (Event Bridge, Lambda, RDS)
- Setting WAF
May 2021 - June 2021
Human Temperature Identification -
Individual
- Using YOLO V5 for object detection.
- Text Recognition to extract text in the object
- Convert pytorch model to ONNX
April 2021 - May 2021
Airbag Segmentation -
Team
- Using Deeplab v3 to extract airbag in an accident
April 2021 - May 2021
Remove object background -
Individual
- AI Feature
- Using edge detection (Deep Learning) to detect the edge of an object
- Using OpenCV to smooth the edges of the object
- Backend
- Deploy AI Feature with AWS EC2
- Using Lambda and API Gateway for serving API
- Write Terraform code for setting AWS Environment
October 2020 - April 2021
Predict player stats in baseball game -
Team
- EDA and ETL tabular data
- Build model with machine learning algorithms (XGBoost, CatBoost, ...)
- Integrate AI model into mobile application
April 2020 - October 2020
Real estate recommendation System -
Team
- Collect data from the website
- EDA and ETL tabular data
- Build a hybrid recommendation system with Content-based Filtering and Collaborative Filtering
- Integrate AI model into web application
Contributor
VinBigData Chest X-ray Abnormalities Detection (
Kaggle)
2020
-
Individual
- Anchor Boxes Analysis using K-Means (Kaggle)
- Smart data split (train/eval) for Object Detection (Kaggle)
- Bounding Boxes Optimization (Kaggle)
- Training and Inference with EfficientDet
Global Wheat Detection (
Kaggle)
2020
-
Individual
- Training and Inference with YOLO V5
- Using Weighted Box Fusion instead of Non-maximum Suppression to optimize accuracy
- Using Pseudo labeling technique
Traffic Light Detection (ZALO AI) 2021
-
Individual
- Using YOLO V5 for detection
- Implement algorithms to increase accuracy with small objects (Image Tiling, ...)
Motorbike Generator (ZALO AI) 2019
-
Individual
- Using K-Mean for clustering motorbike
- Implement DCGan to generate motorbike
Awards
Kaggle Notebook Master (Highest Ranking 356 / 210.000)
2020
Top 5 ZALO AI Challenge
2019
First Prize ICPC PTIT Tier 2
2018
Certification
AWS Certified Solutions Architect - Associate
Toeic 645