Vinh Ngo

Computer and Network Systems
Chalmers University of Technology
MSCA RELAX Doctoral Network

Data Summarization • Concurrent & Parallel Computing • HPC • Algorithms • ML

PHD
Marie Skłodowska-Curie Fellow

CONCURRENT DATA SUMMARIZATION

I am a Marie Skłodowska-Curie PhD student in the Distributed Computing and Systems group at Chalmers University of Technology, specializing in Continuous and Concurrent Data Summarization. Currently funded by the MSCA RELAX-DN project, I am supervised by Prof. Marina Papatriantafilou, with co-supervision from Prof. Philippas Tsigas and Prof. Vincenzo Gulisano. With a Valedictorian Honour Degree, 2.5 years of tech industry experience, and 2.5 years of academic research, I am continuously advancing my expertise in High-Performance Computing for Data Science and Machine Learning.

RELAX-DN

MSCA Doctoral Network

RELAX-DN (Relaxed Semantics Across the Data Analytics Stack) is funded by the EU under Horizon Europe (Grant 101072456).

Latest Blogposts

MSCA RELAX-DN 4th Training Week in Dublin 3 min
2025.11.07

VLDB 2025: Presenting Cuckoo Heavy Keeper 5 min
2025.09.03

DEBS 2025 & RELAX Workshop 4 min
2025.06.13

Hosting MSCA RELAX-DN 3rd Training Week at Chalmers 3 min
2025.05.23

Research

Awards & Honors

  • Marie Skłodowska-Curie PhD Fellowship • European Union (Horizon Europe), 2023
  • Valedictorian (Top-1) Commendation & Medal • HCM City People's Committee, 2023
  • Valedictorian (Top-1) Science Subjects • University Entrance Exam, 2019

Selected Publications

  • RESKETCH: A Mergeable, Partitionable, and Resizable Sketch
    Under Submission • Submitted to VLDB 2026
  • Cuckoo Heavy Keeper and the Balancing Act of Maintaining Heavy Hitters in Stream Processing
    VLDB 2025 • A* Conference
  • STERR-GAN: Spatio-temporal Re-rendering for Facial Video Restoration
    IEEE MultiMedia • Q1 Journal
  • CLUE — Clustering-Based Load Understanding and Exploration: Summarizing High-Dimensional Electricity Grid Data for Scenario Analysis
    RELAX Workshop 2025 • Workshop Paper

Research Dissemination

  • Hardware/software co-optimization for machine learning at the edge
    Research Seminar • Nov 26, 2025
  • Concurrent Data Summarization & Cuckoo Heavy Keeper
    Presentation + Poster • Nov 3, 2025
  • HierarchyScope: Spatio-Temporal Data Summarization
    Poster • Oct 15, 2025

Visualizations

Cuckoo Heavy Keeper INTERACTIVE
V1.0.0

An interactive visualization of the Cuckoo Heavy Keeper algorithm, a frequency estimation data structure that identifies heavy hitters in data streams by combining cuckoo hashing with lobby/heavy entry separation, probabilistic decay, and promotion/kickout mechanics.
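
To give a feel for the "probabilistic decay" ingredient only, here is a minimal Python sketch of one common form of it: a colliding key decrements the stored count with a probability that shrinks exponentially as the count grows, so established heavy keys are hard to evict. This is a simplified, HeavyKeeper-style illustration, not the Cuckoo Heavy Keeper algorithm itself; the class name `DecayingBuckets`, the `decay_base` parameter, and the single-array layout are assumptions made for the example, and the lobby/heavy separation and cuckoo kickouts are deliberately left out.

```python
import random
import zlib


class DecayingBuckets:
    """Hypothetical array of (key, count) buckets with exponentially weakening decay."""

    def __init__(self, num_buckets: int, decay_base: float = 1.08, seed: int = 0) -> None:
        if num_buckets <= 0 or decay_base < 1.0:
            raise ValueError("need num_buckets > 0 and decay_base >= 1.0")
        self.keys = [None] * num_buckets
        self.counts = [0] * num_buckets
        self.decay_base = decay_base      # assumed tuning knob, not from the paper
        self.rng = random.Random(seed)    # seeded so runs are repeatable

    def _index(self, key: str) -> int:
        # crc32 is stable across processes, unlike Python's built-in hash() for str.
        return zlib.crc32(key.encode("utf-8")) % len(self.keys)

    def update(self, key: str) -> None:
        i = self._index(key)
        if self.keys[i] == key:
            # Same key already owns the bucket: just count it.
            self.counts[i] += 1
        elif self.counts[i] == 0:
            # Empty bucket: the new key takes it over.
            self.keys[i], self.counts[i] = key, 1
        elif self.rng.random() < self.decay_base ** -self.counts[i]:
            # Collision: decay the owner with probability decay_base^(-count).
            self.counts[i] -= 1
            if self.counts[i] == 0:
                # Owner fully decayed: the colliding key replaces it.
                self.keys[i], self.counts[i] = key, 1

    def estimate(self, key: str) -> int:
        i = self._index(key)
        return self.counts[i] if self.keys[i] == key else 0
```

With a single bucket, a key seen 200 times is practically immune to a burst of one-off keys, because each collision decays it with probability of roughly 1.08^(-200). A key seen only once is displaced quickly.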

ReSketch INTERACTIVE
V1.0.0

An interactive visualization of the ReSketch algorithm, a resizable frequency estimation sketch that uses consistent hashing rings and KLL quantile sketches to support dynamic expand and shrink operations while preserving estimation accuracy.

Bloom Filter INTERACTIVE
V1.0.0

An interactive visualization of the Bloom filter, a space-efficient probabilistic data structure, demonstrating insertion, membership queries, and false positive rates under varying numbers of hash functions and bit array sizes.
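
For readers who prefer code to animation, the same ideas fit in a short Python sketch. It is an illustrative implementation rather than the code behind the visualization: the class name `BloomFilter`, the use of SHA-256 with double hashing to derive the k positions, and the helper `expected_false_positive_rate` are all choices made for this example. The helper uses the standard approximation (1 - e^(-kn/m))^k for m bits, k hash functions, and n inserted items.

```python
import hashlib
import math


class BloomFilter:
    """Bit-array Bloom filter; k positions are derived by double hashing."""

    def __init__(self, num_bits: int, num_hashes: int) -> None:
        if num_bits <= 0 or num_hashes <= 0:
            raise ValueError("num_bits and num_hashes must be positive")
        self.num_bits = num_bits
        self.num_hashes = num_hashes
        self.bits = bytearray((num_bits + 7) // 8)  # packed bit array, all zeros

    def _positions(self, item: str):
        # Split one SHA-256 digest into two 64-bit values and combine them
        # as h1 + i*h2 to simulate num_hashes independent hash functions.
        digest = hashlib.sha256(item.encode("utf-8")).digest()
        h1 = int.from_bytes(digest[:8], "big")
        h2 = int.from_bytes(digest[8:16], "big") | 1  # force odd so it is never 0
        for i in range(self.num_hashes):
            yield (h1 + i * h2) % self.num_bits

    def add(self, item: str) -> None:
        for pos in self._positions(item):
            self.bits[pos // 8] |= 1 << (pos % 8)

    def __contains__(self, item: str) -> bool:
        # True means "possibly present"; False means "definitely absent".
        return all(self.bits[pos // 8] & (1 << (pos % 8))
                   for pos in self._positions(item))


def expected_false_positive_rate(num_bits: int, num_hashes: int, num_items: int) -> float:
    """Standard approximation (1 - e^(-k*n/m))^k of the false positive rate."""
    return (1.0 - math.exp(-num_hashes * num_items / num_bits)) ** num_hashes


if __name__ == "__main__":
    bf = BloomFilter(num_bits=1024, num_hashes=3)
    for word in ["alpha", "beta", "gamma"]:
        bf.add(word)
    print("alpha" in bf)  # inserted items are always reported as present
    print(expected_false_positive_rate(1024, 3, 100))
```

Inserted items are never missed, while the false positive rate rises as more items share the same bit array. That trade-off is what the sliders for hash count and array size explore.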