• 2020.01 ~ 2022.11, Acer, Software Engineer, SW Platform Enabling / Compute Software Technology / IT Products Business
  • Stereo Image Generation Method and Electronic Apparatus using the Same (TW [TWM626646U] / US [US20220286658A1] / CH), 2020
  • Augmented Reality Screen System and Augmented Reality Screen Display Method (TW [TWI757824B] / US [US11328493B2] / CH), 2020
  • Anti-cheating Method and Electronic Device (TW [TWI779566B] / CH), 2020
  • Side by Side Image Detection Method and Electronic Apparatus using the Same (TW [TW202236207A] / US [US20220284701A1] / CH), 2020
  • Stereoscopic Image Playback Apparatus (TW [TWM630947U] / EU / CH), 2021
  • Stereo Image Generating Device (TW [TWM628629U] / US), 2021
  • Computer Architecture for 3D Display Devices with Side-by-Side Generation (TW / US / CH), 2022 [Under verification]
  • SBS Detection by Stereo Matching (TW / US / CH / EU), 2022 [Under verification]
  • AI Smart Ledger (TW), 2022 [Under verification]
  • Multiview Rendering Pipeline for Side-by-Side images in 3D Display Devices (TW / US / CH), 2022 [Under verification]
  • Tree-Dimensional View in Devices Without 3D Display Using Eye Tracking (TW / US /CH), 2022 [Under verification]
  • Real-time background QR code detection and decoding (TW), 2022 [Under verification]
  • Acer Bar AI Computing Overlap (TW), 2022 [Under verification]
  • Depth Enhancement Using Face Landmark Detection (TW), 2022 [Under verification]
  • Multithread Architecture for Three-Dimensional Image Generation (TW), 2022 [Under verification]
    Stereoscopic Image Generation from Single View Image

    The rapid development of autostereoscopic 3D monitors offers users a brand-new 3D visualization experience. However, this technology only works with 3D content inputs like side-by-side images. Most images and videos on the internet are 2D single-view content, which makes it difficult for the technology to gain widespread popularity. This project uses "Machine Learning" and "Computer Vision" to instantly convert 2D content into stereo 3D

     

    Achievement:

    Develop computer vision and machine learning algorithms for "SpatialLabs™ " (TW)

       - Delivers glasses-free 3D sensation via optical, display, and eye-tracking technology. And utilizes AI technology instantly convert 2D content into stereo 3D

    ✓ Deploy and optimize "Depth Estimation" AI solutions on mobile devices using Pytorch (Libtorch) / Windows ML / OpenVINO / ONNX Runtime

    Machine learning model quantization, and deploy solutions on VPU (AI chip)

    ✓ Experience with GPU accelerated computing (CUDA kernel function) for image processing

    ✓ Experience with OpenCV & Direct3D for image processing

    ✓ Experience with product development and deployment strategies with C++ / Python. And familiar Window Installers

    ✓ Cooperate with Intel & ITRI

    ✓ Patents:

         - Stereo Image Generation Method and Electronic Apparatus using the Same (TW [TWM626646U] / US [US20220286658A1] / CH), 2020

         - Stereoscopic Image Playback Apparatus (TW [TWM630947U] / EU / CH), 2021

         - Stereo Image Generating Device (TW [TWM628629U] / US), 2021

         - Computer Architecture for 3D Display Devices with Side-by-Side Generation (TW / US / CH), 2022 [Under verification]

         - Multiview Rendering Pipeline for Side-by-Side images in 3D Display Devices (TW / US / CH), 2022 [Under verification]

         - Tree-Dimensional View in Devices Without 3D Display Using Eye Tracking (TW / US /CH), 2022 [Under verification]

         - Depth Enhancement Using Face Landmark Detection (TW), 2022 [Under verification]

         - Multithread Architecture for Three-Dimensional Image Generation (TW), 2022 [Under verification]


    Generate stereoscopic images from monocular images instantly

     

    Generate stereoscopic images from monocular images instantly (side-by-side view)

    Monocular depth estimation using convolutional neural network for stereoscopic image generation


    Keywords :
    Deep Learning, Convolutional Neural Networks, Computer Vision, 3D, Quantization
    Pytorch (Libtorch), Windows ML, TensorRT, OpenVINO, ONNX Runtime, D3D11, CUDA Kernel Function, OpenCV
    C++, Python


    Stereo (Side by Side) Image Detection

    As mentioned, there is no need to generate images if the source is already stereoscopic. However, if the source is single-view images, stereoscopic images need to be generated. In order to make the system more intelligent, Using "Machine Learning" to automatically detect whether the content is 3D or not.

     

    Achievement:

    ✓ cooperate with ITRI

    ✓ Patents:

         - Side by Side Image Detection Method and Electronic Apparatus using the Same (TW [TW202236207A] / US [US20220284701A1] / CH), 2020

         - SBS Detection by Stereo Matching (TW / US / CH / EU), 2022 [Under verification]


    Detect content is a stereo image or not


    Keywords :
    Deep Learning, Convolutional Neural Networks, Computer Vision, Feature Extraction
    keras, Tensorflow, OpenCV
    Python, C++


    End to end motion planner using Deep Reinforcement Learning

    Service robots are appearing more and more in our daily lives. The key technologies for service robots involve many fields, including navigation, system control, mechanism modules, vision modules, voice modules, and artificial intelligence. In this research, we present a learning-based mapless motion planner that saves us from using traditional methods like "SLAM" to create maps and allows for navigation.

     

    Achievement:

    ✓ Develop and deploy "Deep Reinforcement Learning" algorithms

    ✓ Optimize the model by batch normalization to prevent the gradient from vanishing

    ✓ Has 100+ stars repository on Github


    Teach robots how to reach goals without a map


    Keywords :
    Deep Reinforcement Learning (DRL), Deep Deterministic Policy Gradient (DDPG), Motion Planner, Navigation
    ROS, Tensorflow
    Python


    QR code detection

    Scanning QR codes is easy with a mobile phone. But if the QR code is displayed on a computer screen, it becomes a different scenario. Most people would use their phone to scan the code and then send the URL to themselves or crop the QR code and use a QR code decoder to get the URL. This process is complicated and frustrating. To solve this problem, we have developed a service that uses machine learning to detect QR codes on the user's screen and decode the QR code of interest.

     

    Achievement:

    ✓ Experience with deploy "Object detection" algorithms

    Model quantization and deploy with inference engine - OpenVINO

    ✓ Patents:

         - Real-time background QR code detection and decoding (TW), 2022 [Under verification]


    QR code detection using object detection model


    Keywords :
    Deep Learning, Convolutional Neural Networks, Computer Vision, Object Detection, YOLO
    Pytorch, OpenVINO, ONNX Runtime, OpenCV
    Python, C++


    AI Fashion Persuader

    Have you ever seen an outfit from a KOL on social media that you liked, but had trouble finding something similar? In this project, we use "Machine Learning" and "Computer Vision" to develop a system that can find the most similar clothes in a database based on the user's input images.

     

    Achievement:

    ✓ First place in the semester presentation contest of Taiwan AI Academy

    ✓ Deploy "Segmentation" and "Feature Extraction" algorithms

    ✓ Integrate with Line Bot


    Base on user input finds the most similar item(clothes) in the database


    Keywords :
    Deep Learning, Convolutional Neural Networks, Computer Vision, Segmentation, Feature Extraction
    Pytorch, Tensorflow, Keras, OpenCV
    Python


    AR Virtual Display

    Augmented reality (AR) has seen rapid growth in recent years, resulting in more companies developing AR devices and applications, such as Microsoft's Hololens. However, most current AR applications are used for entertainment purposes. In order to increase productivity with AR, we have come up with a new idea: getting more virtual screens through AR glasses. This way, users don't have to spend a lot of money on physical monitors.

     

    Achievement:

    ✓ Develop and deploy "Computer Vision" algorithms for AR application

    ✓ Co-worked with ITRI

    ✓ Patents:

         - Augmented Reality Screen System and Augmented Reality Screen Display Method (TW [TWI757824B] / US [US11328493B2] / CH), 2020


    Create a virtual monitor by AR device


    Keywords :
    Augmented reality (AR), Computer Vision
    OpenCV, Unity
    Python, C#


    Monitor Landmark Detection / Pose Estimation

    Landmark detection is typically used in facial tracking. In this project, we apply the same concept to detect landmarks of interest to calculate the pose of an object.

     

    Achievement:

    ✓ Research and deploy machine learning algorithms for "Landmark Detection"


    Detecting landmark which we are interested in


    Keywords :
    Deep Learning, Convolutional Neural Networks, Landmark Detection, Pose Estimation, Computer Vision
    Pytorch, OpenCV
    Python


    A Convolutional Neural Network for Real Time Robot Pose Estimation by RGB Image

    Localization is crucial for navigation. SLAM has a good performance in indoor localization. Commonly used sensors are mainly divided into lasers or cameras. The advantage of laser SLAM is its high localization accuracy. However, the lack of image information leads to restrictions on some applications. Visual SLAM relies on RGB image and depth map. The disadvantage is that a large number of features extracting and matching, cause a large amount of computation. Therefore, this research will focus on the pose estimation only by RGB image, without features extracting and matching.

     

    Achievement:

    ✓ Thesis: A Convolutional Neural Network for Real-Time Robot Pose Estimation by RGB Image

    ✓ Develop "Pose Estimation" machine learning algorithms and deploy on "TurtleBot"

    ✓ Experience with SLAM for training data collection

    ✓ Experience with ROS


    Using pose estimation model do robot indoor localization task


    Keywords :
    Deep Learning, Convolutional Neural Networks, Localization, Pose Estimation, Computer Vision
    Pytorch, OpenCV, ROS
    Python


    Foreground-background Subtraction

    Semantic Segmentation is a computer vision task that involves grouping together similar parts of the image that belong to the same class. For this project, we propose a simple convolutional autoencoder model to do foreground-background subtraction for small-scale images.

     

    Achievement:

    ✓ Design and deploy "Autoencoder" for foreground-background subtraction

    ✓ Optimize loss function

    Data augmentation to improve accuracy

    ✓ Patents:

         - Anti-cheating Method and Electronic Device (TW [TWI779566B] / CH), 2020

    Foreground-background Subtraction


    Keywords :
    Deep Learning, Convolutional Neural Networks, Semantic Segmentation, Background Subtraction, AutoEncoder, Computer vision
    Tensorflow, OpenCV
    Python


    Facial Recognition (Side project)

    Deep learning has demonstrated high accuracy in facial recognition in recent years and has been applied in various fields. This project uses an open-source machine learning algorithm for face recognition that only requires a single photo per person as data in order to improve understanding of this technology.

     

    Achievement:

    ✓ Deploy with inference engine - OpenVINO

    Facial recognition using inference engine


    Keywords :
    Deep Learning, Convolutional Neural Networks, Face Recognition, Computer vision
    OpenVINO, OpenCV
    Python, C++


    I enjoy working out in my free time. it not only can improve my fitness but also relieve my pressure. The most important is that it can let me get more confident.