Experience

  1. Research Scientist

    Oak Ridge National Laboratory (ORNL)
    • Designed LLM-based systems to enable predictive analytics and streamline operational data queries
    • Spearheaded energy efficiency initiatives for exascale systems, improving power usage insights and enabling sustainable HPC operations
    • Advanced research in data-driven and simulation-based modeling techniques (machine learning, digital twins) for HPC operations to increase the operational efficiency of ORNL’s supercomputers
    • Developed scalable real-time data pipelines and analytics frameworks for energy and thermal monitoring on the Summit and Frontier supercomputers, enhancing operational decision-making
  2. Research Assistant

    Seoul National University, South Korea
    • Developed cross-layer SSD optimizations, integrating custom FTLs, OS enhancements, and FPGA-based emulation and prototyping
    • Designed high-performance SSD storage architectures for HPC, reducing tail latency in key-value stores
    • Developed a custom key-value storage engine with Samsung SSD garbage collection APIs, improving latency consistency and demonstrating a 6-9x reduction in 99.9999th-percentile read latency
  3. Research Engineer | Software Engineer

    TmaxSoft
    • Designed and developed a non-intrusive, LD_PRELOAD-based middleware transaction instrumentation framework, enabling end-to-end performance monitoring of enterprise applications. Built function-hooking modules that monitor transaction latency for products such as BEA Tuxedo, TmaxSoft Tmax, and Oracle, using function interception and lock-free shared-memory IPC.
    • Led deployment of the application instrumentation layer for the LG Display Zero Failure Project (LG Display Ltd.), delivering a function-interception-based middleware transaction monitoring system for the company's mission-critical Manufacturing Execution System (MES).
  4. Software Developer

    Samsung Networks, South Korea (merged into Samsung SDS)
    • Maintained and enhanced NMSPlus 3.0–3.1, a network monitoring system collecting SNMP, ping, and NetFlow statistics from Cisco, Alcatel, and Juniper devices.
    • Developed SNMP-based data collection modules for ATM switches and L4 switches, expanding network monitoring capabilities.

Education

  1. PhD, Distributed Computing Systems (integrated MA & PhD)

    Seoul National University
    Thesis on optimizing the Linux I/O stack for SSDs, supervised by Heonyoung Yeom.
  2. BS, Computer Science

    Korea University
Skills
Technical Leadership
Strategic planning & execution

Develops and implements long-term strategies that align technical innovation with business objectives, ensuring measurable impact (multi-scale planning, directed R&D, influence-based leadership, vision-driven direction)

Public speaking

Engages global audiences at conferences, panels, and industry events to share insights on cutting-edge technologies and industry trends (keynotes, panels, birds-of-a-feather sessions, workshops)

Technical writing

Produces high-quality technical documents, research publications, and white papers that influence technology adoption and decision-making (technical papers, white papers)

Interdisciplinary and global collaboration

Facilitates collaboration across diverse teams, disciplines, and geographies to foster innovation and solve complex technical challenges (committees, working groups, community efforts, organizing roles)

Technical Skills
Programming & Development

C-based systems programming (Linux, AIX, Solaris, HP-UX); Python for data pipelines, APIs, and automation; test-driven development (pytest, Robot Framework).

HPC & Distributed Systems

HPC job scheduling (Slurm, LSF, Torque/Moab), scalable data engineering (Kafka, Apache Spark, Dask, Parquet), and workflow orchestration (Apache Airflow).

Cloud & Infrastructure

Kubernetes application deployment (Kustomize, Helm), GitOps CI/CD (GitLab, ArgoCD), Ansible for automation, high-availability services (Redis, PostgreSQL, RabbitMQ, Kafka, MinIO, Apache Druid, Spark Cluster).

Data Center & HPC Monitoring

Prometheus + Grafana, industrial telemetry (BACnet/IP, Modbus/TCP), LD_PRELOAD-based system instrumentation (Oracle OCI, Tibco, OS file I/O).

AI & Machine Learning

Applied ML in HPC for automation and operational analytics. Hands-on with generative AI (LangChain, LiteLLM / Agents, RAG); collaborated with data scientists to integrate AI into HPC workflows.

Storage & Embedded Systems

Linux kernel & device drivers, NVMe & SSD optimization, Xilinx FPGA (Zynq-7000, Virtex-7), high-performance storage tuning.