Michael Tan

An innovative, Industrious and capable engineer. Software engineer at Acer. Focused on "Spatial Computing" and applying "Deep Learning" and "Computer Vision" solutions for our products.

✓ Specialize in "Deep Learning" and "Computer Vision"

✓ About 3 Years Software Engineer, solid C++ / Python algorithm development experience

✓ Develop computer vision and machine learning algorithms for "SpatialLabs™ " (TW)

(Glasses-free stereoscopic 3D solutions)

- Delivers glasses-free 3D sensation via advanced optical, display, and eye-tracking technology. And

utilizes AI technology to instantly fully convert 2D content into stereo 3D

✓ Design, deploy and optimize applications and machine learning algorithms

✓ 15 Patents applied

✓ 350+ hours Taiwan AI Academy^9th certified & won semester presentation 1^st

2020.01 ~ 2022.11, Acer, Software Engineer, SW Platform Enabling / Compute Software Technology / IT Products Business

2021.05 ~ 2021.08, Taiwan AI Academy 9th
2017.09 ~ 2019.08, Master’s degree, Mechanical Engineering, National Taiwan University of Science and Technology (NTUST)
2013.09 ~ 2017.08, Bachelor’s degree, Mechanical Engineering, Yuan Ze University

Stereo Image Generation Method and Electronic Apparatus using the Same (TW [TWM626646U] / US [US20220286658A1] / CH), 2020
Augmented Reality Screen System and Augmented Reality Screen Display Method (TW [TWI757824B] / US [US11328493B2] / CH), 2020
Anti-cheating Method and Electronic Device (TW [TWI779566B] / CH), 2020
Side by Side Image Detection Method and Electronic Apparatus using the Same (TW [TW202236207A] / US [US20220284701A1] / CH), 2020
Stereoscopic Image Playback Apparatus (TW [TWM630947U] / EU / CH), 2021
Stereo Image Generating Device (TW [TWM628629U] / US), 2021
Computer Architecture for 3D Display Devices with Side-by-Side Generation (TW / US / CH), 2022 [Under verification]
SBS Detection by Stereo Matching (TW / US / CH / EU), 2022 [Under verification]
AI Smart Ledger (TW), 2022 [Under verification]
Multiview Rendering Pipeline for Side-by-Side images in 3D Display Devices (TW / US / CH), 2022 [Under verification]
Tree-Dimensional View in Devices Without 3D Display Using Eye Tracking (TW / US /CH), 2022 [Under verification]
Real-time background QR code detection and decoding (TW), 2022 [Under verification]
Acer Bar AI Computing Overlap (TW), 2022 [Under verification]
Depth Enhancement Using Face Landmark Detection (TW), 2022 [Under verification]
Multithread Architecture for Three-Dimensional Image Generation (TW), 2022 [Under verification]

Stereoscopic Image Generation from Single View Image

The rapid development of autostereoscopic 3D monitors offers users a brand-new 3D visualization experience. However, this technology only works with 3D content inputs like side-by-side images. Most images and videos on the internet are 2D single-view content, which makes it difficult for the technology to gain widespread popularity. This project uses "Machine Learning" and "Computer Vision" to instantly convert 2D content into stereo 3D

Achievement:

✓ Develop computer vision and machine learning algorithms for "SpatialLabs™ " (TW)

- Delivers glasses-free 3D sensation via optical, display, and eye-tracking technology. And utilizes AI technology instantly convert 2D content into stereo 3D

✓ Deploy and optimize "Depth Estimation" AI solutions on mobile devices using Pytorch (Libtorch) / Windows ML / OpenVINO / ONNX Runtime

✓ Machine learning model quantization, and deploy solutions on VPU (AI chip)

✓ Experience with GPU accelerated computing (CUDA kernel function) for image processing

✓ Experience with OpenCV & Direct3D for image processing

✓ Experience with product development and deployment strategies with C++ / Python. And familiar Window Installers

✓ Cooperate with Intel & ITRI

✓ Patents:

- Stereo Image Generation Method and Electronic Apparatus using the Same (TW [TWM626646U] / US [US20220286658A1] / CH), 2020

- Stereoscopic Image Playback Apparatus (TW [TWM630947U] / EU / CH), 2021

- Stereo Image Generating Device (TW [TWM628629U] / US), 2021

- Computer Architecture for 3D Display Devices with Side-by-Side Generation (TW / US / CH), 2022 [Under verification]

- Multiview Rendering Pipeline for Side-by-Side images in 3D Display Devices (TW / US / CH), 2022 [Under verification]

- Tree-Dimensional View in Devices Without 3D Display Using Eye Tracking (TW / US /CH), 2022 [Under verification]

- Depth Enhancement Using Face Landmark Detection (TW), 2022 [Under verification]

- Multithread Architecture for Three-Dimensional Image Generation (TW), 2022 [Under verification]

Generate stereoscopic images from monocular images instantly

Generate stereoscopic images from monocular images instantly (side-by-side view)

Monocular depth estimation using convolutional neural network for stereoscopic image generation

Keywords :
Deep Learning, Convolutional Neural Networks, Computer Vision, 3D, Quantization
Pytorch (Libtorch), Windows ML, TensorRT, OpenVINO, ONNX Runtime, D3D11, CUDA Kernel Function, OpenCV
C++, Python

Stereo (Side by Side) Image Detection

As mentioned, there is no need to generate images if the source is already stereoscopic. However, if the source is single-view images, stereoscopic images need to be generated. In order to make the system more intelligent, Using "Machine Learning" to automatically detect whether the content is 3D or not.

Achievement:

✓ cooperate with ITRI

✓ Patents:

- Side by Side Image Detection Method and Electronic Apparatus using the Same (TW [TW202236207A] / US [US20220284701A1] / CH), 2020

- SBS Detection by Stereo Matching (TW / US / CH / EU), 2022 [Under verification]

Detect content is a stereo image or not

Keywords :
Deep Learning, Convolutional Neural Networks, Computer Vision, Feature Extraction
keras, Tensorflow, OpenCV
Python, C++

End to end motion planner using Deep Reinforcement Learning

Service robots are appearing more and more in our daily lives. The key technologies for service robots involve many fields, including navigation, system control, mechanism modules, vision modules, voice modules, and artificial intelligence. In this research, we present a learning-based mapless motion planner that saves us from using traditional methods like "SLAM" to create maps and allows for navigation.

Achievement:

✓ Develop and deploy "Deep Reinforcement Learning" algorithms

✓ Optimize the model by batch normalization to prevent the gradient from vanishing

✓ Has 100+ stars repository on Github

Teach robots how to reach goals without a map

Keywords :
Deep Reinforcement Learning (DRL), Deep Deterministic Policy Gradient (DDPG), Motion Planner, Navigation
ROS, Tensorflow
Python

QR code detection

Scanning QR codes is easy with a mobile phone. But if the QR code is displayed on a computer screen, it becomes a different scenario. Most people would use their phone to scan the code and then send the URL to themselves or crop the QR code and use a QR code decoder to get the URL. This process is complicated and frustrating. To solve this problem, we have developed a service that uses machine learning to detect QR codes on the user's screen and decode the QR code of interest.

Achievement:

✓ Experience with deploy "Object detection" algorithms

✓ Model quantization and deploy with inference engine - OpenVINO

✓ Patents:

- Real-time background QR code detection and decoding (TW), 2022 [Under verification]

QR code detection using object detection model

Keywords :
Deep Learning, Convolutional Neural Networks, Computer Vision, Object Detection, YOLO
Pytorch, OpenVINO, ONNX Runtime, OpenCV
Python, C++

AI Fashion Persuader

Have you ever seen an outfit from a KOL on social media that you liked, but had trouble finding something similar? In this project, we use "Machine Learning" and "Computer Vision" to develop a system that can find the most similar clothes in a database based on the user's input images.

Achievement:

✓ First place in the semester presentation contest of Taiwan AI Academy

✓ Deploy "Segmentation" and "Feature Extraction" algorithms

✓ Integrate with Line Bot

Base on user input finds the most similar item(clothes) in the database

Keywords :
Deep Learning, Convolutional Neural Networks, Computer Vision, Segmentation, Feature Extraction
Pytorch, Tensorflow, Keras, OpenCV
Python

AR Virtual Display

Augmented reality (AR) has seen rapid growth in recent years, resulting in more companies developing AR devices and applications, such as Microsoft's Hololens. However, most current AR applications are used for entertainment purposes. In order to increase productivity with AR, we have come up with a new idea: getting more virtual screens through AR glasses. This way, users don't have to spend a lot of money on physical monitors.

Achievement:

✓ Develop and deploy "Computer Vision" algorithms for AR application

✓ Co-worked with ITRI

✓ Patents:

- Augmented Reality Screen System and Augmented Reality Screen Display Method (TW [TWI757824B] / US [US11328493B2] / CH), 2020

Create a virtual monitor by AR device

Keywords :
Augmented reality (AR), Computer Vision
OpenCV, Unity
Python, C#

Monitor Landmark Detection / Pose Estimation

Landmark detection is typically used in facial tracking. In this project, we apply the same concept to detect landmarks of interest to calculate the pose of an object.

Achievement:

✓ Research and deploy machine learning algorithms for "Landmark Detection"

Detecting landmark which we are interested in

Keywords :
Deep Learning, Convolutional Neural Networks, Landmark Detection, Pose Estimation, Computer Vision
Pytorch, OpenCV
Python

A Convolutional Neural Network for Real Time Robot Pose Estimation by RGB Image

Localization is crucial for navigation. SLAM has a good performance in indoor localization. Commonly used sensors are mainly divided into lasers or cameras. The advantage of laser SLAM is its high localization accuracy. However, the lack of image information leads to restrictions on some applications. Visual SLAM relies on RGB image and depth map. The disadvantage is that a large number of features extracting and matching, cause a large amount of computation. Therefore, this research will focus on the pose estimation only by RGB image, without features extracting and matching.

Achievement:

✓ Thesis: A Convolutional Neural Network for Real-Time Robot Pose Estimation by RGB Image

✓ Develop "Pose Estimation" machine learning algorithms and deploy on "TurtleBot"

✓ Experience with SLAM for training data collection

✓ Experience with ROS

Using pose estimation model do robot indoor localization task

Keywords :
Deep Learning, Convolutional Neural Networks, Localization, Pose Estimation, Computer Vision
Pytorch, OpenCV, ROS
Python

Foreground-background Subtraction

Semantic Segmentation is a computer vision task that involves grouping together similar parts of the image that belong to the same class. For this project, we propose a simple convolutional autoencoder model to do foreground-background subtraction for small-scale images.

Achievement:

✓ Design and deploy "Autoencoder" for foreground-background subtraction

✓ Optimize loss function

✓ Data augmentation to improve accuracy

✓ Patents:

- Anti-cheating Method and Electronic Device (TW [TWI779566B] / CH), 2020

Foreground-background Subtraction

Keywords :
Deep Learning, Convolutional Neural Networks, Semantic Segmentation, Background Subtraction, AutoEncoder, Computer vision
Tensorflow, OpenCV
Python

Facial Recognition (Side project)

Deep learning has demonstrated high accuracy in facial recognition in recent years and has been applied in various fields. This project uses an open-source machine learning algorithm for face recognition that only requires a single photo per person as data in order to improve understanding of this technology.

Achievement:

✓ Deploy with inference engine - OpenVINO

Facial recognition using inference engine

Keywords :
Deep Learning, Convolutional Neural Networks, Face Recognition, Computer vision
OpenVINO, OpenCV
Python, C++

I enjoy working out in my free time. it not only can improve my fitness but also relieve my pressure. The most important is that it can let me get more confident.

Michael Tan

Experience

Education

Patents

Projects

Stereoscopic Image Generation from Single View Image

Stereo (Side by Side) Image Detection

End to end motion planner using Deep Reinforcement Learning

QR code detection

AI Fashion Persuader

AR Virtual Display

Monitor Landmark Detection / Pose Estimation

A Convolutional Neural Network for Real Time Robot Pose Estimation by RGB Image

Foreground-background Subtraction

Facial Recognition (Side project)

Interest