cv
Basics
| Name | Digbalay Bose | 
| Label | Ph.D. Candidate | 
| dbose@usc.edu/digbose92@gmail.com | |
| Url | https://digbose92.github.io/ | 
Work
-  2023.05 - 2023.08Computer Vision and Graphics InternNVIDIA Maxine AI- Developed end-to-end deep learning models for controllable portrait video animation as part of NVIDIA Maxine.
 
-  2022.05 - 2022.08Software Engineering InternNVIDIA Maxine AI- Developed end-to-end visual and audio-visual deep learning models for high-fidelity facial animation of avatars as part of Maxine ARSDK.
 
-  2016 - 2018Research Software EngineerIBM Research India- Developed an end-to-end soil moisture extraction system from satellite images by incorporating image interpolation techniques as a part of IBM Geospatial Analytics suite.
- Developed explainable deep learning models in the domains of image classification and visual search as a part of retail and operations effort.
 
-  2013.05 - 2013.07Research internIndian Statistical Institute, Kolkata- Developed a key recovery scheme based on the properties of Slid Pairs for stream cipher Salsa20.
 
Education
-  2018 - 2024 Los Angeles, CA Ph.D.University of Southern CaliforniaMing Hsieh Department of Electrical and Computer Engineering- Grounding Natural Language
- Advanced Computer Vision
- Affective Computing
- Mathematics of High Dimensional Data
 
-  2014 - 2016 Mumbai, India M.TechIndian Institute of Technology, BombayElectrical Engineering- Matrix Computations
- Machine Learning
- High-performance Computing
- Optimal Control
 
-  2010 - 2014 Kolkata, India 
Skills
| Languages | |
| Python | |
| C | |
| C++ | |
| R | |
| Javascript | |
| HTML | |
| Bash | 
| Machine Learning Frameworks | |
| PyTorch | |
| Tensorflow | |
| Keras | |
| Caffe | |
| Scikit-learn | |
| OpenCV | 
| Softwares | |
| Maya | |
| Blender | |
| VTK | 
Languages
| English | |
| Fluent | 
| Hindi | |
| Fluent | 
| Bengali | |
| Native speaker | 
Projects
-  2022 - 2023Automated analysis of advertisement videosIntroduced large-scale advertisement benchmark dataset (MM-AU) and multimodal models for semantic video understanding tasks.- Keywords: multimodal learning, media understanding, advertisements
- Work published in ACM MM 2023 proceedings.
 
-  2022 - 2022Context driven human affect perceptionDeveloped multimodal context fusion module for emotion recognition in context-driven scenarios.- Keywords: multimodal fusion , emotion recognition
- Work published in ICASSP 2023 proceedings.
 
-  2022 - 2023Multimodal federated learningCo-developed multimodal benchmark tasks and baseline models for federated learning applications- Keywords: multimodal fusion, federated learning
- Work done in collaboration with Amazon Alexa AI
- Work published in KDD 2023 proceedings.
 
-  2021 - 2022Visual scene understandingProposed a large-scale weakly labeled movie-centered scene dataset (MovieCLIP) and knowledge transfer to scene and genre classification tasks across diverse domains.- Keywords: visual scene recognition, automatic labeling
- Work done in collaboration with Google Research
- Work published in WACV 2023 proceedings.
 
-  2021 - 2022Automated analysis of facial paralysis patientsDeveloped a facial-landmark based video pipeline involving novel asymmetry measures for predicting standardized scores in a mixed effects modeling setup.- Keywords: facial landmarks, automated analysis, linear mixed effects model
- Work done in collaboration with Keck School of Medicine and Pacific Neuroscience Institute.
- Work published in Facial Plastic Surgery & Aesthetic Medicine proceedings.
 
-  2021 - 2021Understanding emotion perception in art workDeveloped multimodal transformer based architectures with configurable image features for evoked emotion recognition in art images.- Keywords: multimodal transformer, emotion recognition, art images
- Work published in ICCV CLVL workshop 2021.
 
-  2020 - 2023Computational analysis of gender portrayal in mediaAnalyzed emerging trends in TV shows and advertisements across the dimensions of age, perceived skintone and gender- Keywords: representation in media, media understanding
- Work done in collaboration with Geena Davis Institute on Gender in Media and Google Research.