Hi, Iām Catalin - ML & GPU kernel engineer, started at 19, now 21.
I work on low-level GPU optimization (HIP, CUDA, PTX) and production ML systems. Currently optimizing kernels for AMD Mi300X/Mi355X and NVIDIA H100 at ISA level, and building CV systems that run in production.
I like going deep - from writing inline PTX to designing full MLOps pipelines. I iterate fast and put research into practice through real projects.
Right now focused on GPU kernels, attention mechanisms, and Re-ID.
Technical:
Soft:
Programming languages/frameworks:
Python
Pytorch
C/C++
GPU Kernels
C#
Svelte
Computer Vision
85 %
Natural Language Processing
55 %
Large Language Models
55 %
Generative AI
75 %
Auto Encoders
60 %
Machine Learning Algorithms
45 %
Data Analysis
45 %
Reinforcment Learning
20 %
Graph NNs
20 %
Reccurent Networks
55 %
Energy Based Models
25 %
GPU Kernels
35 %