Document Analysis & Recognition Engineer (OCR)

CÔNG TY TNHH GALAXY DIGITAL HOLDINGS

PV Gas Tower, 673 Nguyễn Hữu Thọ, Phước Kiển Commune, Nhà Bè District, Ho Chi Minh City, Vietnam

Overview

Salary: Negotiable

Job type: Full-time

Experience: 3 years

Openings: 1

Application deadline: 2025-12-04

Posted: 2025-11-21 17:37

Category: Other occupations

Job Description

Model Development & Optimization

  • Maintain and enhance existing AI models for OCR on Vietnamese ID cards (CCCD), and extend them to other document types (passports, driver's licenses, bank documents).
  • Fine-tune and adapt state-of-the-art OCR/document models (e.g., Donut) for production use.
  • Optimize training and inference pipelines for performance, scalability, and cost efficiency.

Data Pipeline & Quality Management

  • Manage large datasets combining synthetic and real-world document images.
  • Build preprocessing and augmentation pipelines: image quality checks, blur/rotation detection, Vietnamese text normalization, PII masking.
  • Ensure data quality and evaluation consistency across multiple document types.
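
As an illustration of the text normalization and PII masking steps listed above, here is a minimal sketch using only the Python standard library. The function names are hypothetical, and the masking rule assumes that CCCD numbers appear as 12 consecutive digits; a real pipeline would likely cover more fields and formats.

```python
import re
import unicodedata

# Assumption: a CCCD number is a run of exactly 12 digits.
ID_NUMBER = re.compile(r"(?<!\d)\d{12}(?!\d)")


def normalize_vietnamese(text: str) -> str:
    """Compose diacritics into NFC form and collapse runs of whitespace."""
    composed = unicodedata.normalize("NFC", text)
    return " ".join(composed.split())


def mask_id_numbers(text: str, keep_last: int = 3) -> str:
    """Mask each 12-digit number, keeping only its last `keep_last` digits."""

    def _mask(match: "re.Match[str]") -> str:
        digits = match.group(0)
        visible = digits[len(digits) - keep_last:]
        return "*" * (len(digits) - len(visible)) + visible

    return ID_NUMBER.sub(_mask, text)


if __name__ == "__main__":
    # Simulate OCR output with decomposed accents and irregular spacing.
    raw = unicodedata.normalize("NFD", "Nguyễn  Văn   An")
    print(normalize_vietnamese(raw))
    print(mask_id_numbers("ID: 001234567890"))
```

Normalizing to a single Unicode form before comparing or storing text matters for Vietnamese, because the same accented letter can be encoded either as one code point or as a base letter plus combining marks.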

Accuracy & Performance Evaluation

  • Define and monitor evaluation metrics: character/word accuracy, exact match rate, edit distance, latency.
  • Analyze failed predictions (e.g., accents, truncated fields, misrecognized entities) and integrate findings into retraining cycles.
  • Implement image/document quality control to prevent poor inputs from degrading OCR accuracy.
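
The metrics named above can be sketched as follows. This is a simplified, standard-library illustration with hypothetical function names, not the team's actual evaluation code; it assumes predictions and references have already been normalized to the same Unicode form. Character accuracy is commonly reported as 1 minus the character error rate.

```python
from typing import Sequence


def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance, computed with a rolling row of the DP table."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(
                prev[j] + 1,         # delete ca
                curr[j - 1] + 1,     # insert cb
                prev[j - 1] + cost,  # substitute ca with cb (or match)
            ))
        prev = curr
    return prev[-1]


def character_error_rate(prediction: str, reference: str) -> float:
    """Edit distance divided by the reference length."""
    if not reference:
        return 0.0 if not prediction else 1.0
    return edit_distance(prediction, reference) / len(reference)


def exact_match_rate(predictions: Sequence[str], references: Sequence[str]) -> float:
    """Fraction of fields whose prediction equals the reference exactly."""
    pairs = list(zip(predictions, references))
    if not pairs:
        return 0.0
    return sum(p == r for p, r in pairs) / len(pairs)


if __name__ == "__main__":
    preds = ["NGUYEN VAN AN", "01/01/1990"]
    refs = ["NGUYEN VAN AN", "01/01/1999"]
    print([character_error_rate(p, r) for p, r in zip(preds, refs)])
    print(exact_match_rate(preds, refs))
```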

Production & Monitoring

  • Deploy, monitor, and maintain OCR models serving production workloads (100k+ documents/month).
  • Investigate and resolve production failures, manage rollbacks, and improve system robustness.
  • Collaborate with backend engineers to integrate OCR APIs with downstream systems.

Collaboration & Leadership

  • Mentor junior engineers in computer vision and OCR best practices.
  • Contribute to the long-term Document AI roadmap, extending beyond ID cards to support broader fintech/eKYC and document processing needs.
  • Document experiments, model updates, and operational practices.

Requirements

Must-have

  • 3+ years of AI/ML engineering experience with Python and PyTorch.
  • Practical experience in OCR or Computer Vision (e.g., image preprocessing, OpenCV).
  • Experience with Vietnamese text processing (accents, tokenization, normalization).
  • Familiarity with deep learning model training and fine-tuning, preferably with HuggingFace Transformers or OCR frameworks (PaddleOCR, Tesseract).
  • Experience deploying ML models into production environments.
  • Experience scaling machine learning services for high traffic.
  • Knowledge of Linux, Docker, and Git.

Nice-to-have

  • Knowledge of MLOps tools (Weights & Biases, MLflow, DVC).
  • Model optimization skills: quantization, distillation, ONNX/TensorRT.
  • Background in fintech/eKYC or handling sensitive/PII data.

Soft Skills

  • Strong ownership mindset: accountable for the full lifecycle of OCR models.
  • Problem-solving ability: capable of debugging training and inference issues.
  • Communication skills: able to explain ML concepts and findings to technical and non-technical stakeholders.
  • Collaborative attitude: willing to work closely with backend, product, and QA teams.

Tech Stack

  • Python, PyTorch, HuggingFace Transformers, PaddleOCR
  • OpenCV, PIL
  • Docker, Linux
  • Git, DVC (optional)
  • MLflow / Weights & Biases (nice-to-have)

Benefits

Competitive salary package (base salary and performance bonuses).

Probation period salary is 100% of the official salary.

Comprehensive health and accident insurance.

15 days of annual leave, 3 remote work days per month.

Work equipment provided (MacBook/laptop, mouse, monitor, etc.).

A creative and modern working environment.
