Conference Overview

The 2026 International Conference on Vision, Language & Learning (WVLL 2026) convenes researchers and practitioners working at the intersection of computer vision, natural language processing, and multimodal learning. Across two days we will explore how to build efficient, responsible, and high-performing vision-language systems that advance understanding, interaction, and accessibility.

Event Snapshot

Key Areas of Exploration:

  • AI for Low-Resource Languages
  • Video and Speech Analysis for Low-Resource Languages
  • LLM and VLM Architectures and Neural Design
  • Parameter-Efficient Adaptation of Large Vision-Language Models
  • Applications in Vision-Language Models
  • Tiny VLMs: Efficient Multimodal AI at the Edge
  • New Benchmark Datasets & Evaluation Metrics
  • AI for Sign Language Understanding
  • Document Image Processing
  • Medical Data Analysis
  • Scene Text Detection and Recognition

WVLL 2026 fosters a rich exchange of ideas that can crystallize common problems and illuminate promising scientific paradigms in vision-language research. We aim to explicitly contrast competing frameworks, clarify essential research questions, and cultivate a stronger community around these shared interests. WVLL distinguishes itself by its balanced emphasis on theoretical advancements in model design and the practical, societal implications of their deployment, particularly in resource-constrained and specialized domains. We encourage the presentation of work-in-progress and forward-looking position papers to spark vibrant discussion and future breakthroughs.

Invited Speakers

Confirmed Speakers

Tentative Speakers

Diversity, Equity & Inclusion Plan

WVLL 2026 embeds diversity and inclusion across organizers, speakers, and attendees through concrete, realistic actions. Our committee spans multiple continents and balances academia with industry and NGO perspectives, creating natural mentorship pathways and technical breadth from computer vision to clinical AI. We are recruiting invited speakers through affinity groups and regional mailing lists to secure meaningful representation of women, non-binary scholars, and researchers based in the Global South. The gender-neutral CFP explicitly welcomes work on sign-language AI, low-resource languages, and edge deployment in underserved regions, while an optional mentored-review track pairs junior authors with experienced PC members. External sponsorships will fund travel stipends prioritized for students from low- and middle-income countries and for caregivers. Live captioning, wheelchair-accessible poster spacing, and an anonymous code-of-conduct reporting channel coordinated by our DEI chair will ensure a safe, inclusive environment, making diversity and broad participation integral to WVLL 2026 rather than an afterthought.

Estimated Number of Attendees

Given the growing interest in multimodal AI, especially in low-resource language processing, efficient model adaptation, and applied vision-language systems, we anticipate 120–150 participants from academia and industry. This includes researchers, practitioners, and students focused on vision-language learning, efficient model design, and AI applications for underrepresented and resource-constrained domains.

Special Requirements and Technical Needs

WVLL 2026 is a two-day, in-person conference hosted at the University of North Texas. We request a standard A/V setup (projector with HDMI input, screen, microphones for speakers and audience), reliable internet to support live demos, and poster space for approximately 20–25 physical posters. We will also need a table area for interactive demos related to vision-language systems. The venue should provide wheelchair accessibility throughout.

Previous Editions

WVLL previously ran as a workshop at WACV 2024, focusing on vision-language learning for low-resource languages, parameter-efficient model adaptation, and applied multimodal AI. That edition received 14 submissions (3 accepted) with authors spanning Bangladesh, the United States, and India, and a reviewer pool of 32 experts. Building on this momentum, WVLL 2026 expands into a full conference to broaden reach, deepen technical exchange, and grow the community.

URL of previous workshop: https://wvll.github.io/2024

Brief Bios of Organizers

Fuad Rahman: Fuad Rahman, Ph.D., is an academician and entrepreneur who founded Apurba Technologies, specializing in machine learning. He is also an Adjunct Professor at the University of Arizona's BME Department. His company works on computerizing Bangla, a low-resource language, and developed the first commercial Bangla OCR and screen reader. He has over 100 peer-reviewed publications.
Email: fuad@apurbatech.com | Website: apurbatech.com

Syed Akhter Hossain: Dr. Syed Akhter Hossain is the Dean of the Faculty of Science and Information Technologies at Daffodil International University. He has significantly advanced NLP research and has over 250 publications. A recipient of the Best Professor of IT Award (2012) and National ICT Award (2016), he notably developed a machine translator for Bangla Braille.
Email: deanfsit@daffodilvarsity.edu.bd | Website: https://faculty.daffodilvarsity.edu.bd/profile/swe/akhter.html

Mouhaydine Tlemcani: Dr. Mouhaydine Tlemcani is an Assistant Professor at the University of Évora, where he has been instrumental in the Mechatronics Engineering program. He holds an M.Sc. (1992) and a Ph.D. (2007) in Electrical Engineering. His research covers instrumentation, signal/image processing, embedded systems, and AI applications in engineering, and he has led projects such as non-destructive testing for aeronautic maintenance.
Email: tlem@uevora.pt | Website: https://www.uevora.pt/pessoas?id=5279

Tozammel Hossain: Dr. Tozammel Hossain is an Assistant Professor at the University of North Texas, specializing in applied machine learning, causal inference, and biomedical informatics. With a Ph.D. from Virginia Tech and postdoctoral experience at USC, he has contributed to high-impact projects funded by IARPA, DARPA, DHS, and USDA. He has published in leading journals and presented at top conferences.
Email: tozammel.hossain@unt.edu | Website: https://facultyinfo.unt.edu/faculty-profile?profile=kh0718

Tazin Afrin: Dr. Tazin Afrin holds a Ph.D. in Computer Science from the University of Pittsburgh, with expertise in NLP, educational technology, and human-computer interaction. She developed the ArgRewrite revision assistant and has published in top-tier venues. At ETS, she develops advanced AI systems using LLMs and machine learning.
Email: tazin.tumpa@gmail.com | Website: https://tazin-afrin.github.io

Ting Xiao: Dr. Ting Xiao is an Assistant Professor in Data Science at the University of North Texas (UNT) and Director of the Deep Sensor Information eXtraction (SIX) Lab. She holds a Ph.D. in Physics from Northwestern University. Her research focuses on Machine Learning/Deep Learning, Vector Embeddings, Multimodal Large Language Models, and Clinical/Biomedical AI, with over 100 publications and an h-index of 36.
Email: Ting.Xiao@unt.edu | Website: https://engineering.unt.edu/people/ting-xiao.html

Sadia Afroz: Dr. Sadia Afroz is a Lead Scientist at Gen™, leading research in Security and Machine Learning. She holds a Ph.D. in Computer Science from Drexel University, specializing in Computer Security. Her expertise lies at the intersection of security, privacy, and machine learning. She previously served as a Research Professor at ICSI and a Staff Scientist at Avast.
Email: sadia@icsi.berkeley.edu | Website: https://www.icsi.berkeley.edu/icsi/people/sadia

Sheikh Abujar: Sheikh Abujar is a Ph.D. candidate in Computer Science at UAB, researching deep learning, vision-language models (VLMs), and clinical natural language processing. He interned at Samsung Research America (2024) and has co-led projects that created low-resource datasets such as Bayanno (Bangla Speech) and IsharaLipi (Bangla Sign Language).
Email: sabujar@uab.edu | Website: https://sites.google.com/site/iamabujarsheikh

AKM Shahariar Azad Rabby: Shahariar Rabby is a researcher at the UAB Lung Imaging Lab and Machine Learning team lead at Apurba Technologies, specializing in OCR, Document Analysis, and Low-Resource Language Vision. He developed "Ekush," the largest Bangla handwritten dataset, and co-founded/supervised the CI LAB and DIU - NLP and Machine Learning Research LAB.
Email: arabby@uab.edu | Website: rabby.dev

Muntaser Syed: Muntaser Syed is a GPU Developer Advocate at NVIDIA and technical lead for the Open Hackathons team, focusing on accelerating research on supercomputing clusters. He is a Ph.D. scholar whose interests include machine learning on edge devices, NLP, and speech recognition. He contributed to UAV control systems and the FAA's LAANC program.
Email: muntasers@nvidia.com | Website: https://www.linkedin.com/in/muntasersyed

Confirmed Program Committee Members

Reviewer | Organization
Abdus Sattar | Daffodil International University, Bangladesh
Abu Kaisar Mohammad Masum | Florida Institute of Technology, USA
Jagdish Chand Bansal | South Asian University, India
Stephen Olatunde Olabiyisi | Ladoke Akintola University of Technology, Nigeria
Sunil Kumar Khatri | Amity University Tashkent, Uzbekistan
Yagyanath Rimal | Pokhara University, Nepal
Ghalib Hussaiyn | PayPal
Hasmot Ali | Apurba Technologies Ltd
Md. Fahad Hossain | Daffodil International University, Bangladesh
Mahmudul Hasan | Comilla University, Bangladesh
Mohammad Mamun Or Rashid | Jahangirnagar University, Bangladesh
Md Majedul Islam | Kennesaw State University, USA
Md. Sanzidul Islam | King Abdulaziz University, Saudi Arabia
Mirza Sami | Deka Research & Development
Mohammad Shorif Uddin | Jahangirnagar University, Bangladesh
Mouhaydine Tlemcani | Universidade de Évora, Portugal
Nabeel Mohammed | North South University, Bangladesh
Naveed Mahmud | Florida Institute of Technology, USA
Nushrat Jahan Ria | Daffodil International University, Bangladesh
Pratim Saha | University of Alabama at Birmingham, USA
S.R. Subramanya | National University (San Diego, USA) / Exskillence
S.M. Saiful Islam Badhon | University of North Texas, USA
Saif Islam | Charles Schwab
Sandeep Bodduluri | University of Alabama at Birmingham, USA
Sharun Akter Khushbu | Daffodil International University, Bangladesh
Syed Ashiqur Rahman | GSK, USA
Tanvir Ahmed | University of Central Florida, USA
S.M. Mazharul Hoque Chowdhury | University of North Texas, USA
Monjurul Huda | Amazon