
Md. Arid Hasan

MCS Student at UNB.
Research Assistant at SE+AI Lab.
Teaching Assistant at FCS


Fredericton,

About Me

  • SGS Travel Awards -
  • Masters International Differential Tuition Scholarship/Waiver -
  • Graduate Academic Award (GAA) / Graduate Research Award (GRA) -
  • Research Award, from the Division of Research, Daffodil International University -
  • (B.Sc.) -

Hobbies

Programming and Math

Education

Experiences

  • Working with GPU and Bash Scripts.
  • Working with ML frameworks & toolkits in handling large-scale and complex data sets
Skills: Systematic Reviews, Data Scraping, Large Language Models (LLM), Natural Language Processing (NLP), Transformer Models, BERT (Language Model), Data Preparation

Courses conducted as a GTA

  • Data Mining and Machine Learning (Tutorial Instructor and Grading)
  • Foundation of Artificial Intelligence (Grading)
Skills: Decision Trees, SVM, Random Forest, Long Short-term Memory (LSTM), Convolutional Neural Networks (CNN), University Lecturing, Python (Programming Language), Artificial Neural Networks

Publications

2024

Do Large Language Models Speak All Languages Equally? A Comparative Study in Low-Resource Settings
Md Arid Hasan, Krishno Dey, Prerona Tarannum, Imran Razzak and Usman Naseem
Submitted to ACL2024
@inproceedings{hasan2024do, title={Do Large Language Models Speak All Languages Equally? A Comparative Study in Low-Resource Settings}, author={Hasan, Md Arid and Dey, Krishno and Tarannum, Prerona and Razzak, Imran and Naseem, Usman}, booktitle={Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics}, pubState={Submitted}, pages={}, year={2024} }
Building Multi-class Emotions Dataset from Commit Messages: A Comparison among Classical, Deep Learning, and Pre-trained Language Models for Emotions Prediction
Md Arid Hasan, Bikrom Roy, Huang Cao and Francis Palma
Submitted to CANAI2024
@inproceedings{hasan2024building, title={Building Multi-class Emotions Dataset from Commit Messages: A Comparison among Classical, Deep Learning, and Pre-trained Language Models for Emotions Prediction}, author={Hasan, Md Arid and Roy, Bikrom and Cao, Huang and Palma, Francis}, booktitle={Proceedings of the Canadian Conference on Artificial Intelligence}, publisher = {Canadian Artificial Intelligence Association (CAIAC)}, pubState={Submitted}, pages={}, year={2024} }

2023

Zero- and Few-Shot Prompting with LLMs: A Comparative Study with Fine-tuned Models for Bangla Sentiment Analysis
Md Arid Hasan, Shudipta Das, Afiyat Anjum, Firoj Alam, Anika Anjum, Avijit Sarker and Sheak Rashed Haider Noori
arXiv preprint
@article{hasan2023zero, title={Zero- and Few-Shot Prompting with LLMs: A Comparative Study with Fine-tuned Models for Bangla Sentiment Analysis}, author={Hasan, Md Arid and Das, Shudipta and Anjum, Afiyat and Alam, Firoj and Anjum, Anika and Sarker, Avijit and Noori, Sheak Rashed Haider}, journal={arXiv preprint arXiv:2308.10783}, year={2023} }
BLP 2023 Task 2: Sentiment Analysis
Md. Arid Hasan, Firoj Alam, Anika Anjum, Shudipta Das and Afiyat Anjum
Proceedings of the 1st International Workshop on Bangla Language Processing (BLP-2023), 6-11 December 2023, EMNLP, Singapore
@inproceedings{blp2023-overview-task2, title = "BLP 2023 Task 2: Sentiment Analysis", author = "Hasan, Md. Arid and Alam, Firoj and Anjum, Anika and Das, Shudipta and Anjum, Afiyat", booktitle = "Proceedings of the 1st International Workshop on Bangla Language Processing (BLP-2023)", month = dec, year = "2023", address = "Singapore", publisher = "Association for Computational Linguistics", }
Role of Social Media Imagery in Disaster Informatics
Firoj Alam, Kashif Ahmad, Md. Arid Hasan, Ferda Ofli and Mohammad Imran
In book: International Handbook of Disaster Research
@incollection{alam2023role, title = "Role of Social Media Imagery in Disaster Informatics", author = "Alam, Firoj and Ahmad, Kashif and Hasan, Md. Arid and Ofli, Ferda and Imran, Mohammad", booktitle = "International Handbook of Disaster Research", month = oct, year = "2023", publisher = "Springer Nature", }
Z-Index at BLP-2023 Task 2: A Comparative Study on Sentiment Analysis
Prerona Tarannum, Md. Arid Hasan, Krishno Dey
Proceedings of the 1st International Workshop on Bangla Language Processing (BLP-2023), 6-11 December 2023, EMNLP, Singapore
@InProceedings{BLP2023:task2:z-index, author = {Tarannum, Prerona and Hasan, Md. Arid and Dey, Krishno}, title = "Z-Index at BLP-2023 Task 2: A Comparative Study on Sentiment Analysis", booktitle = "Proceedings of the 1st Workshop on Bangla Language Processing (BLP 2023)", month = dec, year = "2023", address = "Singapore", publisher = "Association for Computational Linguistics", }
Semantics Squad at BLP-2023 Task 2: Sentiment Analysis of Bengali Text with Fine Tuned Transformer Based Models
Krishno Dey, Md. Arid Hasan, Prerona Tarannum, and Francis Palma
Proceedings of the 1st International Workshop on Bangla Language Processing (BLP-2023), 6-11 December 2023, EMNLP, Singapore
@InProceedings{BLP2023:task2:SemanticsSquad, author = {Dey, Krishno and Hasan, Md. Arid and Tarannum, Prerona and Palma, Francis}, title = "Semantics Squad at BLP-2023 Task 2: Sentiment Analysis of Bengali Text with Fine Tuned Transformer Based Models", booktitle = "Proceedings of the 1st Workshop on Bangla Language Processing (BLP 2023)", month = dec, year = "2023", address = "Singapore", publisher = "Association for Computational Linguistics", }
Semantics Squad at BLP-2023 Task 1: Violence Inciting Bengali Text Detection with Fine-Tuned Transformer-Based Models
Krishno Dey, Prerona Tarannum, Md. Arid Hasan, Francis Palma
Proceedings of the 1st International Workshop on Bangla Language Processing (BLP-2023), 6-11 December 2023, EMNLP, Singapore
@inproceedings{BLP2023:task1:SemanticsSquad, title = "Semantics Squad at BLP-2023 Task 1: Violence Inciting Bengali Text Detection with Fine-Tuned Transformer-Based Models", author = "Dey, Krishno and Tarannum, Prerona and Hasan, Md. Arid and Palma, Francis", booktitle = "Proceedings of the 1st International Workshop on Bangla Language Processing (BLP-2023)", month = dec, year = "2023", address = "Singapore", publisher = "Association for Computational Linguistics", }
Z-Index at CheckThat! 2023: Unimodal and Multimodal Checkworthiness Classification
Prerona Tarannum, Md. Arid Hasan, Firoj Alam, and Sheak Rashed Haider Noori
CLEF 2023: Conference and Labs of the Evaluation Forum, 18-21 September 2023, Thessaloniki - Greece
@article{tarannum2023z, title={Z-Index at CheckThat! 2023: Unimodal and multimodal checkworthiness classification}, author={Tarannum, Prerona and Hasan, Md Arid and Alam, Firoj and Noori, Sheak Rashed Haider}, journal={Working Notes of CLEF}, year={2023} }
NN at CheckThat! 2023: Subjectivity in News Articles Classification with Transformer Based Models
Krishno Dey, Prerona Tarannum, Md. Arid Hasan and Sheak Rashed Haider Noori
CLEF 2023: Conference and Labs of the Evaluation Forum, 18-21 September 2023, Thessaloniki - Greece
@article{dey2023nn, title={Nn at CheckThat! 2023: Subjectivity in news articles classification with transformer based models}, author={Dey, Krishno and Tarannum, Prerona and Hasan, Md Arid and Noori, Sheak Rashed Haider}, journal={Working Notes of CLEF}, year={2023} }

FakeDTML at CheckThat! 2023: Identifying Check-worthiness of Tweets and Debate Snippets
Abdullah Al Mamun Sardar, Md. Ziaul Karim, Krishno Dey and Md. Arid Hasan
CLEF 2023: Conference and Labs of the Evaluation Forum, 18-21 September 2023, Thessaloniki - Greece
@article{sardar2023fakedtml, title={Fakedtml at CheckThat! 2023: Identifying check-worthiness of tweets and debate snippets}, author={Sardar, Abdullah Al Mamun and Karim, Md Ziaul and Dey, Krishno and Hasan, Md Arid}, year={2023} }

2022

MEDIC: a multi-task learning dataset for disaster image classification
Firoj Alam, Tanvirul Alam, Md. Arid Hasan, Abul Hasnat, Muhammad Imran, and Ferda Ofli
Journal: Neural Computing and Applications, Springer Nature
@article{alam2023medic, title={MEDIC: a multi-task learning dataset for disaster image classification}, author={Alam, Firoj and Alam, Tanvirul and Hasan, Md Arid and Hasnat, Abul and Imran, Muhammad and Ofli, Ferda}, journal={Neural Computing and Applications}, volume={35}, number={3}, pages={2609--2632}, year={2023}, publisher={Springer} }
SemEval-2022 Task 3: PreTENS-Evaluating Neural Networks on Presuppositional Semantic Knowledge
Roberto Zamparelli, Shammur Chowdhury, Dominique Brunato, Cristiano Chesi, Felice Dell’Orletta, Md Arid Hasan, Giulia Venturi
Proceedings of the 16th International Workshop on Semantic Evaluation (SemEval-2022)
@inproceedings{zamparelli2022semeval, title={SemEval-2022 Task 3: PreTENS-Evaluating Neural Networks on Presuppositional Semantic Knowledge}, author={Zamparelli, Roberto and Chowdhury, Shammur and Brunato, Dominique and Chesi, Cristiano and Dell’Orletta, Felice and Hasan, Md Arid and Venturi, Giulia}, booktitle={Proceedings of the 16th International Workshop on Semantic Evaluation (SemEval-2022)}, pages={228--238}, year={2022} }
Z-Index at CheckThat! Lab 2022: Check-Worthiness Identification on Tweet Text
Prerona Tarannum, Md. Arid Hasan, Firoj Alam, and Sheak Rashed Haider Noori
CLEF 2022: Conference and Labs of the Evaluation Forum, 05-08 September 2022, Bologna, Italy
@article{tarannum2022z, title={Z-Index at CheckThat! Lab 2022: Check-Worthiness Identification on Tweet Text}, author={Tarannum, Prerona and Alam, Firoj and Hasan, Md Arid and Noori, Sheak Rashed Haider}, journal={arXiv preprint arXiv:2207.07308}, year={2022} }

2021

A Review of Bangla Natural Language Processing Tasks and the Utility of Transformer Models
Firoj Alam, Md. Arid Hasan, Tanvirul Alam, Akib Khan, Jannatul Tajrin, Naira Khan, Shammur Absar Chowdhury
arXiv preprint, submitted to TALLIP
@article{alam2021review, title={A review of bangla natural language processing tasks and the utility of transformer models}, author={Alam, Firoj and Hasan, Md Arid and Alam, Tanvirul and Khan, Akib and Tajrin, Jannatul and Khan, Naira and Chowdhury, Shammur Absar}, journal={arXiv preprint arXiv:2107.03844}, year={2021} }
Multi Class Fake News Detection using LSTM Approach
Bhaskar Majumdar, Md Rafiuzzaman Bhuiyan, Md Arid Hasan, Md Sanzidul Islam, Sheak Rashed Haider Noori
2021 10th International Conference on System Modeling & Advancement in Research Trends (SMART)
@inproceedings{majumdar2021multi, title={Multi class fake news detection using LSTM approach}, author={Majumdar, Bhaskar and Bhuiyan, Md Rafiuzzaman and Hasan, Md Arid and Islam, Md Sanzidul and Noori, Sheak Rashed Haider}, booktitle={2021 10th International Conference on System Modeling \& Advancement in Research Trends (SMART)}, pages={75--79}, year={2021}, organization={IEEE} }
M82B at CheckThat! 2021: Multiclass Fake News Detection Using BiLSTM.
Sohel Siddique Ashik, Abdur Rahman Apu, Nusrat Jahan Marjana, Md Sanzidul Islam, Md Arid Hasan
CLEF 2021: Conference and Labs of the Evaluation Forum, 21-24 September 2021, Bucharest, Romania
@inproceedings{ashik2021m82b, title={M82B at CheckThat! 2021: Multiclass Fake News Detection Using BiLSTM.}, author={Ashik, Sohel Siddique and Apu, Abdur Rahman and Marjana, Nusrat Jahan and Islam, Md Sanzidul and Hasan, Md Arid}, booktitle={CLEF (Working Notes)}, pages={435--445}, year={2021} }
Qword at CheckThat! 2021: An Extreme Gradient Boosting Approach for Multiclass Fake News Detection.
Rudra Sarker Utsha, Mumenunnessa Keya, Md Arid Hasan, Md Sanzidul Islam
CLEF 2021: Conference and Labs of the Evaluation Forum, 21-24 September 2021, Bucharest, Romania
@inproceedings{utsha2021qword, title={Qword at CheckThat! 2021: An Extreme Gradient Boosting Approach for Multiclass Fake News Detection.}, author={Utsha, Rudra Sarker and Keya, Mumenunnessa and Hasan, Md Arid and Islam, Md Sanzidul}, booktitle={CLEF (Working Notes)}, pages={619--627}, year={2021} }
BlackOps at CheckThat! 2021: User Profiles Analyze of Intelligent Detection on Fake Tweets Notebook for PAN.
SM Sohan, Sharun Akter Khushbu, Md Sanzidul Islam, Md Arid Hasan
CLEF 2021: Conference and Labs of the Evaluation Forum, 21-24 September 2021, Bucharest, Romania
@inproceedings{sohan2021blackops, title={BlackOps at CheckThat! 2021: User Profiles Analyze of Intelligent Detection on Fake Tweets Notebook for PAN.}, author={Sohan, SM and Khushbu, Sharun Akter and Islam, Md Sanzidul and Hasan, Md Arid}, booktitle={CLEF (Working Notes)}, pages={648--658}, year={2021} }
Team Sigmoid at CheckThat! 2021 Task 3a: Multiclass fake news detection with Machine Learning.
Abdullah Al Mamun Sardar, Shahalu Akter Salma, Md Sanzidul Islam, Md Arid Hasan, Touhid Bhuiyan
CLEF 2021: Conference and Labs of the Evaluation Forum, 21-24 September 2021, Bucharest, Romania
@inproceedings{sardar2021team, title={Team Sigmoid at CheckThat! 2021 Task 3a: Multiclass fake news detection with Machine Learning.}, author={Sardar, Abdullah Al Mamun and Salma, Shahalu Akter and Islam, Md Sanzidul and Hasan, Md Arid and Bhuiyan, Touhid}, booktitle={CLEF (Working Notes)}, pages={612--618}, year={2021} }

2020

Sentiment Classification in Bangla Textual Content: A Comparative Study
Md. Arid Hasan, Jannatul Tajrin, Shammur Absar Chowdhury, Firoj Alam
2020 23rd International Conference on Computer and Information Technology (ICCIT)
@inproceedings{hasan2020sentiment, title={Sentiment classification in bangla textual content: A comparative study}, author={Hasan, Md Arid and Tajrin, Jannatul and Chowdhury, Shammur Absar and Alam, Firoj}, booktitle={2020 23rd international conference on computer and information technology (ICCIT)}, pages={1--6}, year={2020}, organization={IEEE} }

2019

Neural Machine Translation for the Bangla-English Language Pair
Md. Arid Hasan, Firoj Alam, Shammur Absar Chowdhury, Naira Khan
2019 22nd International Conference on Computer and Information Technology (ICCIT)
@inproceedings{hasan2019a_neural, title={Neural Machine Translation for the Bangla-English Language Pair}, author={Hasan, Md Arid and Alam, Firoj and Chowdhury, Shammur Absar and Khan, Naira}, booktitle={2019 22nd International Conference on Computer and Information Technology (ICCIT)}, pages={1--6}, year={2019}, organization={IEEE} }
Neural vs Statistical Machine Translation: Revisiting the Bangla-English Language Pair
Md. Arid Hasan, Firoj Alam, Shammur Absar Chowdhury, Naira Khan
2019 International Conference on Bangla Speech and Language Processing (ICBSLP)
@inproceedings{hasan2019neural, title={Neural vs Statistical Machine Translation: Revisiting the Bangla-English Language Pair}, author={Hasan, Md Arid and Alam, Firoj and Chowdhury, Shammur Absar and Khan, Naira}, booktitle={2019 International Conference on Bangla Speech and Language Processing (ICBSLP)}, pages={1--6}, year={2019}, organization={IEEE} }

2018

A collaborative platform to collect data for developing machine translation systems
Md. Arid Hasan, Firoj Alam, and Sheak Rashed Haider Noori
Proceedings of International Joint Conference on Computational Intelligence: IJCCI 2018
@inproceedings{hasan2020collaborative, title={A collaborative platform to collect data for developing machine translation systems}, author={Hasan, Md Arid and Alam, Firoj and Noori, Sheak Rashed Haider}, booktitle={Proceedings of International Joint Conference on Computational Intelligence: IJCCI 2018}, pages={407--416}, year={2020}, organization={Springer} }

Teaching

Throughout my tenure at Daffodil International University, I have passionately taught a diverse range of courses, including Artificial Intelligence, Data Mining and Machine Learning, Programming and Problem Solving, Digital Image Processing, and Object Oriented Programming. As an instructor, I dedicated myself to fostering a dynamic learning environment and guiding students towards comprehensive academic growth and success.

2023

2022

2021

Projects

Ensemble Language Models for Multilingual Sentiment Analysis

BERT multilingual, AraBERT, XLM-RoBERTa, Instructions

In this project, I explored sentiment analysis on tweet texts from SemEval-17 and the Arabic Sentiment Tweet Dataset (ASTD). I investigated four pretrained language models and proposed two ensemble language models. The findings show that monolingual models perform best and that the ensemble models outperform the baseline, with the majority-voting ensemble giving the strongest results on English.
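
As a rough illustration of the majority-voting idea, the sketch below combines per-model sentiment labels into one label per tweet. It is not the project's code: the function name `majority_vote`, the tie-breaking rule (the earliest model's label wins), and the sample predictions are all assumptions made for this example.

```python
from collections import Counter


def majority_vote(predictions_per_model):
    """Combine per-model label lists into one label list by majority vote.

    predictions_per_model: list of lists, one inner list per model, all of
    equal length. Ties go to the label predicted by the earliest model.
    """
    if not predictions_per_model:
        return []
    n_items = len(predictions_per_model[0])
    if any(len(p) != n_items for p in predictions_per_model):
        raise ValueError("all models must predict the same number of items")

    combined = []
    for votes in zip(*predictions_per_model):
        counts = Counter(votes)
        best = max(counts.values())
        # The first label (in model order) that reaches the top count wins.
        combined.append(next(v for v in votes if counts[v] == best))
    return combined


# Hypothetical predictions from three models on four tweets.
model_a = ["positive", "negative", "neutral", "positive"]
model_b = ["positive", "neutral", "neutral", "negative"]
model_c = ["negative", "negative", "neutral", "neutral"]
ensemble = majority_vote([model_a, model_b, model_c])
```

Hard voting like this needs only the predicted labels, so models with different architectures or tokenizers can be combined without access to their probabilities.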

Multiplatform Bangla Sentiment Analysis

Dataset, Transformers, LLMs, Instructions

The MUBASE dataset is a multiplatform dataset consisting of tweets and Facebook posts that are manually annotated with sentiment polarity. The annotators reached an agreement score of 0.84, which indicates almost perfect agreement.
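
The text above does not say which agreement measure was used. Purely as an illustration, the sketch below computes Cohen's kappa for two annotators, one common chance-corrected agreement score. The function name `cohen_kappa` and the sample labels are hypothetical, and the choice of metric is an assumption rather than a description of how MUBASE was evaluated.

```python
from collections import Counter


def cohen_kappa(labels_a, labels_b):
    """Cohen's kappa for two annotators labelling the same items."""
    if len(labels_a) != len(labels_b) or not labels_a:
        raise ValueError("need two non-empty label lists of equal length")
    n = len(labels_a)

    # Observed agreement: fraction of items given the same label by both.
    observed = sum(a == b for a, b in zip(labels_a, labels_b)) / n

    # Expected agreement if each annotator labelled at random according
    # to their own label frequencies.
    freq_a = Counter(labels_a)
    freq_b = Counter(labels_b)
    expected = sum(freq_a[label] * freq_b[label] for label in freq_a) / (n * n)

    if expected == 1.0:
        # Both annotators used a single identical label throughout.
        return 1.0
    return (observed - expected) / (1 - expected)


# Hypothetical sentiment labels from two annotators on eight posts.
annotator_1 = ["pos", "pos", "neg", "neg", "neu", "neu", "pos", "neg"]
annotator_2 = ["pos", "pos", "neg", "neg", "neu", "pos", "pos", "neg"]
kappa = cohen_kappa(annotator_1, annotator_2)
```

With more than two annotators, a multi-rater measure such as Fleiss' kappa would be the usual substitute.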

MEDIC: a multi-task learning dataset for disaster image classification

Dataset, ResNet, VGG, EfficientNet, SqueezeNet, DenseNet

MEDIC is the largest multi-task learning dataset for disaster-related images and is an extended version of the crisis image benchmark dataset. It consists of data from several sources, such as CrisisMMD, AIDR, and the Damage Multimodal Dataset (DMD). The dataset contains 71,198 images.

Resources for Bangla Natural Language Processing (BanglaNLP)

Dataset, Transformers, BiLSTM, LMs

In our work A Review of Bangla Natural Language Processing Tasks and the Utility of Transformer Models, we provide a review of Bangla NLP tasks, resources, and tools available to the research community; we benchmark datasets collected from various platforms for nine NLP tasks using current state-of-the-art algorithms (i.e., transformer-based models). We provide comparative results for the studied NLP tasks by comparing monolingual vs. multilingual models of varying sizes. We report our results using both individual and consolidated datasets and provide data splits for future research. We reviewed a total of 108 papers and conducted 175 sets of experiments. Our results show promising performance using transformer-based models while highlighting the trade-off with computational costs. We hope that such a comprehensive survey will motivate the community to build on and further advance the research on Bangla NLP.

AmaderCAT

Language: PHP, JavaScript
Framework: CodeIgniter, jQuery, Bootstrap
Database: MySQL

AmaderCAT is short for Amader Computer-Assisted Translation. The application was developed to build parallel corpora for machine translation systems. It includes a translation memory (TM) and a glossary, both of which provide suggestions to help translators. The application is collaborative, highly configurable for translation tasks, and supports crowd translation. It can be used by a single user or by a group or team. In the future, we plan to add a neural machine translation system to the application.

Skills

Programming Languages
  • Python
  • PHP
  • JavaScript
  • Java
ML & NLP Tools
  • Transformers
  • PyTorch
  • LM-Harness
  • LLMeBench
  • OpenNMT
  • Keras
  • scikit-learn
  • NLTK
LLMs Explored
  • GPT-4
  • GPT-3.5
  • Gemini
  • Llama 2
  • Jais
  • Bloomz
  • FlanT5
Frameworks (Front- and back-end)
  • CodeIgniter
  • Vue.js
  • jQuery
  • Bootstrap
  • Laravel
Database
  • MySQL
  • SQLite
  • MS SQL Server
Web Server
  • Apache
  • Nginx
Operating System
  • macOS
  • Ubuntu
  • Debian
  • Windows
IDE
  • PyCharm
  • PhpStorm
  • IntelliJ IDEA
  • NetBeans
  • Code::Blocks
Others
  • Git
  • Docker
  • LaTeX
  • Anaconda
  • Jupyter Notebook

Extracurricular Activities

Co‑organizer, 2024 ArAIEval Shared Task at Arabic NLP: Propagandistic Techniques Detection in Unimodal and Multimodal Arabic Content
Proceedings of the Second Arabic Natural Language Processing Conference (ArabicNLP 2024), August 2024, ACL, Thailand

Co‑organizer, BLP‑2023 TASK 2: Sentiment Analysis
Proceedings of the 1st International Workshop on Bangla Language Processing (BLP-2023), December 2023, EMNLP, Singapore

Talk on Artificial Intelligence in Natural Language Processing
7TH BANGLADESH SCHOOL OF INTERNET GOVERNANCE, Dhaka, Bangladesh - February 2023

Co‑organizer, SEMEVAL‑2022 TASK 3: PreTENS‑Evaluating Neural Networks on Presuppositional Semantic Knowledge
2022 Annual Conference of the North American Chapter of the Association for Computational Linguistics - July 2022

Supervisor, DIU-NLP and Machine Learning Research Lab
Daffodil International University, Bangladesh
January 2021 - December 2022

Trainer, Machine Learning Bootcamp
Daffodil International University, Bangladesh
October 2019 - January 2020

Trainer, Workshop on Tensorflow, Keras and PyTorch
Daffodil International University, Bangladesh
February 2020

Member, DIU-NLP and Machine Learning Research Lab
Daffodil International University, Bangladesh
April 2018 - December 2020

Student Prefect, Department of Computer Science and Engineering
Daffodil International University, Bangladesh
August 2017 - December 2017

Professional Services

Reviewer (Notable)

2024
2024 ACL Rolling Reviewer

Full Year

2023
The 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP)

Reviewed three articles

2022
2022 Conference on Neural Information Processing Systems (NeurIPS) Track Datasets and Benchmarks

Reviewed one article

The 2022 Conference on Empirical Methods in Natural Language Processing (EMNLP)

Reviewed one article

2021
Multimedia Systems, Springer Nature

Reviewed one article titled "A systematic review of sentiment analysis using machine learning and deep learning approaches"

Professional Development

Participated in training on Outcome Based Education (OBE)
Daffodil International University, Bangladesh - June 2022

Participated in International Workshop on Computer Vision and Application (IWCVA)
Southeast University, Bangladesh - December 2019

Participated in 8th International Conference on SMART
Teerthanker Mahaveer University, India - November 2019