Digbalay Bose

Digbalay_Bose.png

RTH 320

3710 McClintock Avenue,

Los Angeles, CA 90089

I am a Ph.D. candidate in the Department of Electrical and Computer Engineering at the University of Southern California, Viterbi School of Engineering. I am a member of Signal Analysis and Interpretation Laboratory and currently advised by Prof. Shrikanth Narayanan. Prior to USC, I worked as a Research Software Engineer at IBM Research, India from 2016 to 2018. I received my Masters in Electrical Engineering from Indian Institute of Technology, Bombay in 2016, where I was advised by Prof. Subhasis Chaudhuri. Previously, I obtained my Bachelors degree in Electronics and Telecommunication Engineering from Jadavpur University in 2014.

My research interests broadly span across the domains of multimodal machine learning, affective computing and applications of computer vision in healthcare. I am particularly interested in multimodal learning with major focus along the following dimensions:

  • Robustness: Improving model performance under adversarial settings including modality corruption.
  • Generalization: Enhancing the generalization capabilities of multi-modal foundation models to novel reasoning tasks across diverse groups/cultures and domains.

news

Sep 29, 2023 Passed Ph.D. Qualifying Examination and advanced to candidacy.
Aug 18, 2023 Completed my summer internship at NVIDIA Maxine AI.
Aug 18, 2023 Our findings on demographic representation in Indian TV shows with Google Research and Geena Davis Institute on Gender in Media published in Google AI India blog and received media coverage
Jul 25, 2023 Our papers MM-AU: Towards Multimodal Understanding of Advertisement Videos and SEAR: Semantically-grounded Audio Representations accepted to ACM MM 2023.
May 16, 2023 Joined NVIDIA Maxine AI as a Computer Vision and Graphics intern.

selected publications

  1. WACV 2023
    Movieclip: Visual scene recognition in movies
    Digbalay Bose ,  Rajat Hebbar ,  Krishna Somandepalli , and 5 more authors
    In Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision , 2023
  2. KDD 2023
    FedMultimodal: A Benchmark For Multimodal Federated Learning
    Tiantian Feng ,  Digbalay Bose ,  Tuo Zhang , and 6 more authors
    arXiv preprint arXiv:2306.09486, 2023
  3. FPSAM 2022
    Automatic Analysis of Asymmetry in Facial Paralysis Patients Using Landmark-Based Measures
    Digbalay Bose ,  Krishna Somandepalli ,  Tymon Tai , and 3 more authors
    Facial Plastic Surgery & Aesthetic Medicine, 2022