db Digbalay Bose
active · last updated

Digbalay Bose

Research Scientist at Adobe Research, Bengaluru. I work on multi-modal document understanding.

Digbalay Bose
currently
Building multi-modal document understanding systems at Adobe Research.
open to
Collaborations on multi-agent systems, multimodal document understanding, and controllable generation.
contact

01about

I build agentic pipelines that mine multimodal information from document collections and synthesize it into new media — videos, podcasts, slide decks.

My work sits at the intersection of vision-language modeling, controllable image/video generation, and and multi-agent systems. I am interested in building systems that reason over heterogeneous inputs—text, images, audio, video—without requiring massive supervision.

read more on the research

02by the numbersall papers →

19
papers
9
venues
8
first-author
1
us patent
publications per year · 2016–2025

03selected workall 19 →

04recent notesall notes →

email scholar github linkedin orcid