Skip to main content

Linear video coding

Conventional and learning-based video compression — codecs, rate–distortion, and the boundary between hand-designed and neural pipelines.

Frugal and efficient AI

Pruning, quantization, low-rank methods, and other tools to make deep models small enough to deploy.

Geometric deep learning

Learning on graphs, manifolds, and structured domains — where the geometry of the data shapes the architecture.

Multimodal learning

Joint models for vision, language, and beyond — alignment, fusion, and grounded reasoning.