Md Arid Hasan

Twitter

GitHub

GitLab

CV [Updated 28/05/2026]

About Me

Hi! I'm a Ph.D. student in the Department of Computer Science and a graduate affiliate of the Schwartz Reisman Institute for Technology and Society (SRI) at the University of Toronto, where I am advised by Prof. Ishtiaque Ahmed. I am also a member of the department's Third Space research group and the Dynamic Graphics Project. Before coming to UofT, I completed my Master's in the Faculty of Computer Science at the University of New Brunswick, where I was supervised by Paul Cook, PhD. Prior to this, I worked as a Lecturer at Daffodil International University (DIU) in Bangladesh, and before that as a Research Programmer at Cognitive Insight Limited. I completed my Bachelor's degree in Computer Science and Engineering at Daffodil International University, one of the top engineering universities in Bangladesh.

ACL 2026 Full Conference Student Registration Award (in-person), 11th Workshop on Computational Linguistics and Clinical Psychology - May, 2026
2026–2027 SRI Graduate Fellow, SRI, University of Toronto - April, 2026
OpenAI's Researcher Access Program and API, OpenAI - March, 2024
SGS Travel Awards - October, 2023
Masters International Differential Tuition Scholarship/Waiver - September, 2023 to August, 2025
Graduate Academic Award (GAA) / Graduate Research Award (GRA) - September, 2023 to April, 2025
Research Award, Division of Research, Daffodil International University - 2021-2023
(B.Sc.) - 2015-2018

My research advances human-centered and responsible AI, with a particular focus on natural language processing and generative models. I investigate how these systems encode cultural, linguistic, and subjective knowledge, especially in multilingual and low-resource settings, and how these factors shape their reliability and societal impact. My work spans LLM evaluation, culturally grounded AI, and applications in sensitive domains such as mental health and narrative dominance. A central focus of my research is understanding both the potential and the risks of deploying LLMs in mental health contexts, particularly for underserved populations in the Global South, where access to professional care is limited and cultural context is critical. I study how culturally aware LLMs can support para-counselors (also known as community health workers and lay counselors) by augmenting their ability to provide scalable, context-sensitive, and empathetic care, while also addressing challenges such as hallucinations, cultural misalignment, and biased ground truth. Through building and evaluating context-aware systems, my goal is to develop AI-driven mental health tools that are safe, inclusive, and responsive to diverse communities.

Education

Experiences

Working with GPU and Bash Scripts.
Working with ML frameworks & toolkits in handling large-scale and complex data sets

Skills: Large Language Models (LLM), Natural Language Processing (NLP), Transformer Models, BERT (Language Model), Data Preparation

Courses conducted as a GTA

Data Mining and Machine Learning (Tutorial Instructor and Grading)
Foundation of Artificial Intelligence (Grading)
Programming Languages (Grading)

Skills: Decision Trees, SVM, Random Forest, Long Short-term Memory (LSTM), Convolutional Neural Networks (CNN), University Lecturing, Python (Programming Language), Artificial Neural Networks

Publications

For a more up-to-date list of my publications, please check my Google Scholar.

* Equal Contributions

2026

Enhancing Mental Health Counseling Support in Bangladesh using Culturally-Grounded Knowledge

Md Arid Hasan, Azhagu Meena SP, Aditya Khan, Abu Md Akteruzzaman Bhuiyan, Helal Uddin Ahmed, Joysree Debi, Farig Sadeque, Annie En-Shiun Lee, Syed Ishtiaque Ahmed

Accepted at CLPsych 2026

PDF BibTeX

LLM-Based Multi-Task Bangla Hate Speech Detection: Type, Severity, and Target

Md Arid Hasan, Firoj Alam, Md Fahad Hossain, Usman Naseem, Syed Ishtiaque Ahmed

Accepted at ACL 2026 (Main)

PDF BibTeX

2025

Overview of BLP-2025 Task 1: Bangla Hate Speech Identification

Md Arid Hasan, Firoj Alam, Md Fahad Hossain, Usman Naseem, Syed Ishtiaque Ahmed

Proceedings of the Second Workshop on Bangla Language Processing (BLP-2025)

PDF BibTeX

EverydayMMQA: A Multilingual and Multimodal Framework for Culturally Grounded Spoken Visual QA

Firoj Alam, Ali Ezzat Shahroor, Md Arid Hasan, Zien Sheikh Ali, Hunzalah Hassan Bhatti, Mohamed Bayan Kmainasi, Shammur Absar Chowdhury, Basel Mousi, Fahim Dalvi, Nadir Durrani, Natasa Milic-Frayling

Submitted to ICML 2026

PDF BibTeX

PropXplain: Can LLMs Enable Explainable Propaganda Detection?

Maram Hasanain, Md Arid Hasan, Mohamed Bayan Kmainasi, Elisa Sartori, Ali Ezzat Shahroor, Giovanni Da San Martino, Firoj Alam

Findings of the Association for Computational Linguistics: EMNLP 2025

PDF BibTeX

Memeintel: Explainable Detection of Propagandistic and Hateful Memes

Mohamed Bayan Kmainasi, Abul Hasnat, Md Arid Hasan, Ali Ezzat Shahroor, Firoj Alam

Accepted at EMNLP 2025 (Main)

PDF BibTeX

SpokenNativQA: Multilingual Everyday Spoken Queries for LLMs

Firoj Alam, Md Arid Hasan, and Shammur Absar Chowdhury

Published at Interspeech 2025

PDF BibTeX

NativQA Framework: Enabling LLMs with Native, Local, and Everyday Knowledge

Firoj Alam, Md Arid Hasan, Sahinur Rahman Laskar, Mucahid Kutlu, Kareem Darwish, Shammur Absar Chowdhury

Submitted to ACL 2026 (Demo)

PDF BibTeX

CultranAI at PalmX 2025: Data Augmentation for Cultural Knowledge Representation

Hunzalah Hassan Bhatti, Youssef Ahmed, Md Arid Hasan, Firoj Alam

Proceedings of The Third Arabic Natural Language Processing Conference: Shared Tasks

PDF BibTeX

2024

NativQA: Multilingual Culturally-Aligned Natural Query for LLMs

Md Arid Hasan*, Maram Hasanain, Fatema Ahmed, Sahinur Rahman Laskar, Sunaya Upadhyay, Vrunda N Sukhadia, Mucahid Kutlu, Shammur Absar Chowdhury and Firoj Alam*

Accepted at ACL 2025 (Findings)

PDF BibTeX

ArMeme: Propagandistic Content in Arabic Memes

Firoj Alam, Abul Hasnat, Fatema Ahmed, Md Arid Hasan and Maram Hasanain

EMNLP 2024

PDF BibTeX

ArAIEval Shared Task: Propagandistic Techniques Detection in Unimodal and Multimodal Arabic Content

Maram Hasanain*, Md Arid Hasan*, Fatema Ahmed, Reem Suwaileh, Md Rafiul Biswas, Wajdi Zaghouani and Firoj Alam

ArabicNLP24 at ACL

PDF BibTeX

AraDiCE: Benchmarks for Dialectal and Cultural Capabilities in LLMs

Basel Mousi, Nadir Durrani, Fatema Ahmed, Md Arid Hasan, Maram Hasanain, Tameem Kabbani, Fahim Dalvi, Shammur Absar Chowdhury and Firoj Alam

COLING 2025

PDF BibTeX

Do Large Language Models Speak All Languages Equally? A Comparative Study in Low-Resource Settings

Md Arid Hasan, Krishno Dey, Prerona Tarannum, Imran Razzak and Usman Naseem

Submitted to COLING 2025

PDF BibTeX

Better to Ask in English: Evaluation of Large Language Models on English, Low-resource and Cross-Lingual Settings

Krishno Dey, Prerona Tarannum, Md Arid Hasan, Imran Razzak and Usman Naseem

Submitted to COLING 2025

PDF BibTeX

2023

Zero- and Few-Shot Prompting with LLMs: A Comparative Study with Fine-tuned Models for Bangla Sentiment Analysis

Md Arid Hasan, Shudipta Das, Afiyat Anjum, Firoj Alam, Anika Anjum, Avijit Sarker and Sheak Rashed Haider Noori

LREC-COLING 2024

PDF BibTeX

BLP 2023 Task 2: Sentiment Analysis

Md Arid Hasan, Firoj Alam, Anika Anjum, Shudipta Das and Afiyat Anjum

Proceedings of the 1st International Workshop on Bangla Language Processing (BLP-2023), 6-11 December 2023, EMNLP, Singapore

PDF BibTeX

Role of Social Media Imagery in Disaster Informatics

Firoj Alam, Kashif Ahmad, Md Arid Hasan, Ferda Ofli and Muhammad Imran

In book: International Handbook of Disaster Research

PDF BibTeX

Z-Index at BLP-2023 Task 2: A Comparative Study on Sentiment Analysis

Prerona Tarannum, Md Arid Hasan, Krishno Dey

Proceedings of the 1st International Workshop on Bangla Language Processing (BLP-2023), 6-11 December 2023, EMNLP, Singapore

PDF BibTeX

Semantics Squad at BLP-2023 Task 2: Sentiment Analysis of Bengali Text with Fine Tuned Transformer Based Models

Krishno Dey, Md Arid Hasan, Prerona Tarannum, and Francis Palma

Proceedings of the 1st International Workshop on Bangla Language Processing (BLP-2023), 6-11 December 2023, EMNLP, Singapore

PDF BibTeX

Semantics Squad at BLP-2023 Task 1: Violence Inciting Bengali Text Detection with Fine-Tuned Transformer-Based Models

Krishno Dey, Prerona Tarannum, Md Arid Hasan, Francis Palma

Proceedings of the 1st International Workshop on Bangla Language Processing (BLP-2023), 6-11 December 2023, EMNLP, Singapore

PDF BibTeX

Z-Index at CheckThat! 2023: Unimodal and Multimodal Checkworthiness Classification

Prerona Tarannum, Md Arid Hasan, Firoj Alam, and Sheak Rashed Haider Noori

CLEF 2023: Conference and Labs of the Evaluation Forum, 18-21 September 2023, Thessaloniki - Greece

PDF BibTeX

NN at CheckThat! 2023: Subjectivity in News Articles Classification with Transformer Based Models

Krishno Dey, Prerona Tarannum, Md Arid Hasan and Sheak Rashed Haider Noori

CLEF 2023: Conference and Labs of the Evaluation Forum, 18-21 September 2023, Thessaloniki - Greece

PDF BibTeX

FakeDTML at CheckThat! 2023: Identifying Check-worthiness of Tweets and Debate Snippets

Abdullah Al Mamun Sardar, Md. Ziaul Karim, Krishno Dey and Md Arid Hasan

CLEF 2023: Conference and Labs of the Evaluation Forum, 18-21 September 2023, Thessaloniki - Greece

PDF BibTeX

2022

MEDIC: a multi-task learning dataset for disaster image classification

Firoj Alam, Tanvirul Alam, Md Arid Hasan, Abul Hasnat, Muhammad Imran, and Ferda Ofli

Journal: Neural Computing and Applications, Springer Nature

PDF BibTeX

SemEval-2022 Task 3: PreTENS-Evaluating Neural Networks on Presuppositional Semantic Knowledge

Roberto Zamparelli, Shammur Chowdhury, Dominique Brunato, Cristiano Chesi, Felice Dell’Orletta, Md Arid Hasan, Giulia Venturi

Proceedings of the 16th International Workshop on Semantic Evaluation (SemEval-2022)

PDF BibTeX

Z-Index at CheckThat! Lab 2022: Check-Worthiness Identification on Tweet Text

Prerona Tarannum, Md Arid Hasan, Firoj Alam, and Sheak Rashed Haider Noori

CLEF 2022: Conference and Labs of the Evaluation Forum, 05-08 September 2022, Bologna, Italy

PDF BibTeX

2021

A Review of Bangla Natural Language Processing Tasks and the Utility of Transformer Models

Firoj Alam, Md Arid Hasan, Tanvir Alam, Akib Khan, Janntatul Tajrin, Naira Khan, Shammur Absar Chowdhury

arXiv preprint, submitted to TALLIP

PDF BibTeX

Multi Class Fake News Detection using LSTM Approach

Bhaskar Majumdar, Md Rafiuzzaman Bhuiyan, Md Arid Hasan, Md Sanzidul Islam, Sheak Rashed Haider Noori

2021 10th International Conference on System Modeling & Advancement in Research Trends (SMART)

PDF BibTeX

M82B at CheckThat! 2021: Multiclass Fake News Detection Using BiLSTM.

Sohel Siddique Ashik, Abdur Rahman Apu, Nusrat Jahan Marjana, Md Sanzidul Islam, Md Arid Hasan

CLEF 2021: Conference and Labs of the Evaluation Forum, 21-24 September 2021, Bucharest, Romania

PDF BibTeX

Qword at CheckThat! 2021: An Extreme Gradient Boosting Approach for Multiclass Fake News Detection.

Rudra Sarker Utsha, Mumenunnessa Keya, Md Arid Hasan, Md Sanzidul Islam

CLEF 2021: Conference and Labs of the Evaluation Forum, 21-24 September 2021, Bucharest, Romania

PDF BibTeX

BlackOps at CheckThat! 2021: User Profiles Analyze of Intelligent Detection on Fake Tweets Notebook for PAN.

SM Sohan, Sharun Akter Khushbu, Md Sanzidul Islam, Md Arid Hasan

CLEF 2021: Conference and Labs of the Evaluation Forum, 21-24 September 2021, Bucharest, Romania

PDF BibTeX

Team Sigmoid at CheckThat! 2021 Task 3a: Multiclass fake news detection with Machine Learning.

Abdullah Al Mamun Sardar, Shahalu Akter Salma, Md Sanzidul Islam, Md Arid Hasan, Touhid Bhuiyan

CLEF 2021: Conference and Labs of the Evaluation Forum, 21-24 September 2021, Bucharest, Romania

PDF BibTeX

2020

Sentiment Classification in Bangla Textual Content: A Comparative Study

Md Arid Hasan, Jannatul Tajrin, Shammur Absar Chowdhury, Firoj Alam

2020 23rd International Conference on Computer and Information Technology (ICCIT)

PDF BibTeX

2019

Neural Machine Translation for the Bangla-English Language Pair

Md Arid Hasan, Firoj Alam, Shammur Absar Chowdhury, Naira Khan

2019 22nd International Conference on Computer and Information Technology (ICCIT)

PDF BibTeX

Neural vs Statistical Machine Translation: Revisiting the Bangla-English Language Pair

Md Arid Hasan, Firoj Alam, Shammur Absar Chowdhury, Naira Khan

2019 International Conference on Bangla Speech and Language Processing (ICBSLP)

PDF BibTeX

2018

A collaborative platform to collect data for developing machine translation systems

Md Arid Hasan, Firoj Alam, and Sheak Rashed Haider Noori

Proceedings of International Joint Conference on Computational Intelligence: IJCCI 2018

PDF BibTeX

Teaching

During my tenure at Daffodil International University, I taught a diverse range of courses, including Artificial Intelligence, Data Mining and Machine Learning, Programming and Problem Solving, Digital Image Processing, and Object Oriented Programming. As an instructor, I dedicated myself to fostering a dynamic learning environment and guiding students towards comprehensive academic growth and success.

2023

2022

2021

Projects

Depthwise Separable Convolutions with Deep Residual Convolutions

XceptionNet, Depthwise Separable Convolutions, Deep Residual, CNN, CIFAR-10

In this project, we propose an optimized Xception architecture tailored for edge devices, aiming for lightweight and efficient deployment. We combine the depthwise separable convolutions and the deep residual connections of the Xception architecture to develop a small and efficient model for edge devices. The resulting architecture reduces parameters, memory usage, and computational load. The proposed architecture is evaluated on the CIFAR-10 image classification dataset. Our experiments show that the proposed architecture has fewer parameters and requires less training time while outperforming the original Xception architecture.
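The parameter savings from depthwise separable convolutions can be illustrated with a small calculation. This is a minimal sketch rather than the project's code: the function names and the layer sizes (128 input channels, 256 output channels, a 3x3 kernel) are assumptions chosen for illustration, and bias terms are ignored.

```python
def standard_conv_params(c_in: int, c_out: int, k: int) -> int:
    """Weight count of a k x k standard convolution (bias omitted)."""
    return k * k * c_in * c_out


def depthwise_separable_params(c_in: int, c_out: int, k: int) -> int:
    """Weight count of a k x k depthwise conv followed by a 1 x 1 pointwise conv."""
    depthwise = k * k * c_in   # one k x k filter per input channel
    pointwise = c_in * c_out   # 1 x 1 convolution that mixes channels
    return depthwise + pointwise


# Hypothetical layer sizes, chosen only for illustration.
c_in, c_out, k = 128, 256, 3
standard = standard_conv_params(c_in, c_out, k)
separable = depthwise_separable_params(c_in, c_out, k)

print(f"standard conv:  {standard} weights")    # 294912
print(f"separable conv: {separable} weights")   # 33920
print(f"reduction:      {standard / separable:.2f}x")
```

For this hypothetical layer, the separable version needs roughly one ninth of the weights, which is the kind of saving that makes the approach attractive on edge devices.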

Ensemble Language Models for Multilingual Sentiment Analysis

BERT multilingual, AraBERT, XLM-RoBERTa, Instructions

In this project, I explore sentiment analysis on tweet texts from SemEval-17 and the Arabic Sentiment Tweet Dataset (ASTD). I investigate four pretrained language models and propose two ensemble language models. The findings show that monolingual models perform best among the individual models, that the ensemble models outperform the baseline, and that the majority-voting ensemble performs best on English.
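A majority-voting ensemble of the kind mentioned above can be sketched in a few lines. This is an illustrative sketch under assumptions, not the project's implementation: the function name, the label set, and the tie-breaking rule (fall back to the earliest model in the list) are all hypothetical.

```python
from collections import Counter
from typing import List, Sequence


def majority_vote(predictions: Sequence[Sequence[str]]) -> List[str]:
    """Combine per-model label lists into one label per sample.

    predictions[m][i] is the label that model m assigns to sample i.
    Ties are broken in favour of the earliest model in the list
    (an assumption made for this sketch).
    """
    voted = []
    for labels in zip(*predictions):
        counts = Counter(labels)
        top = max(counts.values())
        # First label, in model order, that reaches the top vote count.
        voted.append(next(label for label in labels if counts[label] == top))
    return voted


# Hypothetical predictions from three models on four tweets.
model_a = ["pos", "neg", "neu", "pos"]
model_b = ["pos", "neu", "neu", "neg"]
model_c = ["neg", "neg", "pos", "neu"]

print(majority_vote([model_a, model_b, model_c]))
# ['pos', 'neg', 'neu', 'pos']
```

In the last sample all three models disagree, so the sketch falls back to the first model's label; a real system might instead use model confidence to break ties.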

Multiplatform Bangla Sentiment Analysis

Dataset, Transformers, LLMs, Instructions

The MUBASE dataset is a multiplatform dataset consisting of tweets and Facebook posts, which are manually annotated with sentiment polarity. The inter-annotator agreement score is 0.84, indicating almost perfect agreement among the annotators.
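The text does not say which agreement measure produced the 0.84 score. As one common choice for two annotators, Cohen's kappa can be computed as in the sketch below; the function name and the toy labels are assumptions for illustration and are not taken from the MUBASE annotation process.

```python
from collections import Counter
from typing import Sequence


def cohen_kappa(a: Sequence[str], b: Sequence[str]) -> float:
    """Cohen's kappa for two annotators labelling the same items."""
    if len(a) != len(b) or len(a) == 0:
        raise ValueError("both annotators must label the same non-empty item set")
    n = len(a)
    # Observed agreement: fraction of items with identical labels.
    observed = sum(x == y for x, y in zip(a, b)) / n
    # Chance agreement from each annotator's label distribution.
    count_a, count_b = Counter(a), Counter(b)
    labels = set(count_a) | set(count_b)
    expected = sum(count_a[label] * count_b[label] for label in labels) / (n * n)
    if expected == 1.0:
        return 1.0  # both annotators used one identical label throughout
    return (observed - expected) / (1.0 - expected)


# Hypothetical annotations for four posts.
annotator_1 = ["pos", "pos", "neg", "neg"]
annotator_2 = ["pos", "pos", "neg", "pos"]

print(cohen_kappa(annotator_1, annotator_2))  # 0.5
```

Here the annotators agree on three of four items (0.75 observed), chance agreement is 0.5, and kappa is therefore 0.5. With more than two annotators, a measure such as Fleiss' kappa would be the usual alternative.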

MEDIC: a multi-task learning dataset for disaster image classification

Dataset, ResNet, VGG, EfficientNet, SqueezeNet, DenseNet

MEDIC is the largest multi-task learning disaster-related dataset, an extended version of the crisis image benchmark dataset. It consists of data from several sources, such as CrisisMMD, AIDR, and the Damage Multimodal Dataset (DMD). The dataset contains 71,198 images.

Resources for Bangla Natural Language Processing (BanglaNLP)

Dataset, Transformers, BiLSTM, LMs

In our work A Review of Bangla Natural Language Processing Tasks and the Utility of Transformer Models, we provide a review of Bangla NLP tasks, resources, and tools available to the research community; we benchmark datasets collected from various platforms for nine NLP tasks using current state-of-the-art algorithms (i.e., transformer-based models). We provide comparative results for the studied NLP tasks by comparing monolingual vs. multilingual models of varying sizes. We report our results using both individual and consolidated datasets and provide data splits for future research. We reviewed a total of 108 papers and conducted 175 sets of experiments. Our results show promising performance using transformer-based models while highlighting the trade-off with computational costs. We hope that such a comprehensive survey will motivate the community to build on and further advance the research on Bangla NLP.

AmaderCAT

Language: PHP, JavaScript
Framework: CodeIgniter, jQuery, Bootstrap
Database: MySQL

AmaderCAT stands for Amader Computer-assisted Translation. The application was developed to build parallel corpora for machine translation systems. It includes a Translation Memory (TM) and a glossary, which help translators by providing TM and glossary suggestions. The application is collaborative and highly configurable for translation tasks, and it has a mechanism for crowd translation. You can use it as a single user or as a group/team. In the future, we will add a machine translation system based on neural network technologies.
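The idea behind Translation Memory suggestions can be sketched as fuzzy lookup over previously translated segments. AmaderCAT itself is written in PHP and JavaScript, so the Python below is only an illustration: the function name, the in-memory dictionary, and the 0.8 similarity cutoff are assumptions, not the application's actual design.

```python
import difflib
from typing import Dict, List, Tuple


def tm_suggestions(
    segment: str,
    memory: Dict[str, str],
    limit: int = 3,
    cutoff: float = 0.8,
) -> List[Tuple[str, str]]:
    """Return (source, translation) pairs whose source text resembles the segment."""
    matches = difflib.get_close_matches(
        segment.lower(), list(memory), n=limit, cutoff=cutoff
    )
    return [(source, memory[source]) for source in matches]


# Hypothetical English-to-Bangla translation memory.
memory = {
    "good morning": "শুভ সকাল",
    "good night": "শুভ রাত্রি",
    "thank you": "ধন্যবাদ",
}

for source, target in tm_suggestions("Good mornin", memory):
    print(f"{source} -> {target}")
```

A production translation memory would typically store segments in a database and index them for faster matching; the standard-library matcher here is used only to keep the sketch self-contained.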

Skills

Programming Languages

Python
PHP
JavaScript
Java

ML & NLP Tools

Transformers
PyTorch
LM-Harness
LLMeBench
OpenNMT
Keras
scikit-learn
NLTK

LLMs Explored

GPT-4, GPT-4o, and GPT-4V
GPT-3.5
Gemini
Llama 2, 3, and 3.1
Jais
Bloomz
Claude-3.1
Mistral
FlanT5

Frameworks (Front- and back-end)

CodeIgniter
Vue.js
jQuery
Bootstrap
Laravel

Database

MySQL
SQLite
MS SQL Server

Web Server

Apache
Nginx

Operating System

macOS
Ubuntu
Debian
Windows

IDE

PyCharm
PhpStorm
IntelliJ IDEA
NetBeans
Code::Blocks

Others

Git
Docker
LaTeX
Anaconda
Jupyter Notebook

Professional Services

Reviewer (Notable)

2026

2026 ACL Rolling Reviewer, Full Year.

2026 ICLR Reviewer

2025

2025 ACL Rolling Reviewer, Full Year.

2024

2024 ACL Rolling Reviewer, Full Year

2023

The 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP), Reviewed three articles

2022

2022 Conference on Neural Information Processing Systems (NeurIPS) Track Datasets and Benchmarks, Reviewed one article

The 2022 Conference on Empirical Methods in Natural Language Processing (EMNLP), Reviewed one article

2021

Multimedia Systems, Springer Nature

Reviewed one article titled "A systematic review of sentiment analysis using machine learning and deep learning approaches"

Professional Development

Participated in training on Outcome Based Education (OBE)

Daffodil International University, Bangladesh - June 2022

Participated in International Workshop on Computer Vision and Application (IWCVA)

Southeast University, Bangladesh - December 2019

Participated in 8th International Conference on SMART

Teerthanker Mahaveer University, India - November 2019

Extracurricular Activities

Co‑organizer, 2024 ArAIEval Shared Task at Arabic NLP: Propagandistic Techniques Detection in Unimodal and Multimodal Arabic Content

Proceedings of the Second Arabic Natural Language Processing Conference (ArabicNLP 2024), August 2024, ACL, Thailand

Co‑organizer, BLP‑2023 Task 2: Sentiment Analysis

Proceedings of the 1st International Workshop on Bangla Language Processing (BLP-2023), December 2023, EMNLP, Singapore

Talk on Artificial Intelligence in Natural Language Processing

7th Bangladesh School of Internet Governance, Dhaka, Bangladesh - February 2023

Co‑organizer, SemEval‑2022 Task 3: PreTENS‑Evaluating Neural Networks on Presuppositional Semantic Knowledge

2022 Annual Conference of the North American Chapter of the Association for Computational Linguistics - July 2022

Supervisor, DIU-NLP and Machine Learning Research Lab