my research sits at the intersection of reinforcement learning theory, ml systems, and the hardware-software interface.
machine unlearning for safe and adaptive robotics — supervised by prof. triantafillou, university of warwick. investigating selective forgetting mechanisms for robots operating in dynamic environments.
lévy-driven pides via physics-informed neural networks — developing convergence guarantees for neural solvers on lévy-driven partial integro-differential equations. targeting applications in stochastic volatility modelling.
rxl — efficient rl kernels library — c++20 library for reinforcement learning with mathematical complexity guarantees. cuda-accelerated kernels with pybind11 bindings. framed around provable efficiency, not empirical benchmarking.
current deep-dives: