Aritro Roy

Projects

Chinchilla-Optimal Transformer Pre-training for Music

Fall 2025

Independent ML Systems Project GitHub

Independently designed and trained decoder-only Transformer models (NanoGPT) on the Lakh MIDI Dataset
Optimized training throughput on NVIDIA H100 GPUs using BFloat16 mixed precision, Flash Attention, and torch.compile
Achieved a test perplexity of 2.20 with 100% syntactically valid output

Stack: Python, PyTorch, CUDA, Flash Attention

Toy Load Balancer with Consistent Hashing

2024

Systems Engineering Project GitHub

Built a custom load balancer implementing consistent hashing to distribute traffic across dynamic server nodes
Containerized the entire architecture (API Gateway, Nodes, Analytics) using Docker Compose for easy deployment
Implemented a management API to dynamically add/remove servers and visualize request rebalancing in real-time

Stack: Docker, Python, Node.js, Shell