Abdul Monaf Chowdhury

Research Assistant @ University of Dhaka, Bangladesh
Incoming Lecturer @ BRAC University, Bangladesh
CS PhD Aspirant

prof_pic.png

I am a Research Assistant at the AVIS Lab of the University of Dhaka, Bangladesh. Currently, I’m working on long horizon embodied manipulation problems. Along with it, I’ll be joining as a lecturer of Computer Science at BRAC University from the upcoming Summer Semester

Previously, I worked as Research Assistant at the MAIM Lab of the University of Dhaka, Bangladesh. I worked on a Wellcome Leap - In Utero funded project (Title: Translation of a Wearable Fetal Movement Monitor towards Stillbirth Prevention), under the supervision of Dr. Abhishek Kumar Ghosh and Dr. Niamh Nowlan to develop a wearable device to monitor fetal movements to cut stillbirth rates.

Back in March ‘24, I graduated from the University of Dhaka in Robotics and Mechatronics Engineering, supervised by Dr. Md Mehedi Hasan. My fourth year thesis project was on “Enhancing UAV Based Human Action Recognition: A Deep Learning Approach”.


Research Interest

I’m interested in Multi-modal Learning, especially at the fusion of Vision + Language for Embodied AI. Overall, I would like to work on fusing complementary intelligence from multi-modalities with the ambitious goal of positively influencing real-world environments via embodied agents.


Recently, my work on language-guided embodied agents got accepted to ICML '26. VLMs were used to generate natural language task feedback for causal reasoning via self-reflection and guide the agents for embodied manipulation tasks. Another one of my work on Open World Amodal Counting has been under review at ECCV '26. Prior to that, my work on Time Series Forecasting using a tri-modal architecture comprising time, spectral and LLM branches got accepted for a Poster at AAAI '26.

Overall, my work revolves around multimodal reasoning for embodied AI, multimodal generation, scene understanding, and vision-language integration.


Research Collab

I’m looking for research opportunities, and if you think we can work together on some ideas, I’d be more than happy to discuss. Just shoot me an email: monafabdul15@gmail.com

news

Apr 30, 2026 My paper titled LAGEA: Language Guided Embodied Agents for Robotic Manipulation got accepted for a Poster at ICML '26
Nov 16, 2025 Submitted my work on Open World Amodal Counting to ECCV '26, and the paper is published at ArXiv
Nov 08, 2025 My paper titled T3Time: Tri-Modal Time Series Forecasting via Adaptive Multi-Head Alignment and Residual Fusion got accepted for a Poster at AAAI '26
Dec 08, 2024 My journal paper titled U-ActionNet: Dual-Pathway Fourier Networks with Region-of-Interest Module for Efficient Action Recognition in UAV Surveillance, based on my fourth year thesis work, has been published in IEEE Access
Jul 31, 2024 Our work on stillbirth prevention got another year of funding extension worth $1M by Wellcome Leap - In Utero
Mar 14, 2024 Graduated from University of Dhaka, Bangladesh with a BSc. in Robotics & Mechatronics Engineering. [Report]
Oct 06, 2023 My paper titled FFT-UAVNet: FFT Based Human Action Recognition for Drone Surveillance System got accepted to the 5th IEEE International Conference on Sustainable Technologies for Industry 5.0 (STI) conference

selected publications

See my full publications at the publication page!

  1. ArXiv
    countocc.jpg
    Counting Through Occlusion: Framework for Open World Amodal Counting
    Safaeid Hossain Arib, Rabeya Akter, Abdul Monaf Chowdhury, Md Jubair Ahmed Sourov, and Md Mehedi Hasan
    arXiv preprint arXiv:2511.12702, 2025
  2. ICML 26
    lagea.jpg
    LAGEA: Language Guided Embodied Agents for Robotic Manipulation
    Abdul Monaf Chowdhury, Akm Moshiur Rahman Mazumder, Rabeya Akter, and Safaeid Hossain Arib
    ICML 2026, 2026
  3. AAAI 26
    t3_time.jpg
    T3Time: Tri-Modal Time Series Forecasting via Adaptive Multi-Head Alignment and Residual Fusion
    Abdul Monaf Chowdhury, Rabeya Akter, and Safaeid Hossain Arib
    Proceedings of the AAAI Conference on Artificial Intelligence, 2026
  4. Access
    u_action_model.jpg
    U-ActionNet: Dual-pathway fourier networks with region-of-interest module for efficient action recognition in UAV surveillance
    Abdul Monaf Chowdhury, Ahsan Imran, Md Mehedi Hasan, Riad Ahmed, Akm Azad, and 1 more author
    IEEE Access, 2024
  5. STI
    u_action_abs.jpg
    FFT-UAVNet: FFT Based Human Action Recognition for Drone Surveillance System
    Abdul Monaf Chowdhury, Ahsan Imran, and Md Mehedi Hasan
    In 2023 5th International Conference on Sustainable Technologies for Industry 5.0 (STI), 2023