Sitemap

A list of all the posts and pages found on the site. For you robots out there is an XML version available for digesting as well.

Page Not Found

Page not found. Your pixels are in another canvas.

Jupyter notebook markdown generator

Posts

Future Blog Post

less than 1 minute read

Published: January 01, 2199

This post will show up by default. To disable scheduling of future posts, edit config.yml and set future: false.

Blog Post number 4

less than 1 minute read

Published: August 14, 2015

This is a sample blog post. Lorem ipsum I can’t remember the rest of lorem ipsum and don’t have an internet connection right now. Testing testing testing this blog post. Blog posts are cool.

Blog Post number 3

less than 1 minute read

Published: August 14, 2014

This is a sample blog post. Lorem ipsum I can’t remember the rest of lorem ipsum and don’t have an internet connection right now. Testing testing testing this blog post. Blog posts are cool.

Blog Post number 2

less than 1 minute read

Published: August 14, 2013

This is a sample blog post. Lorem ipsum I can’t remember the rest of lorem ipsum and don’t have an internet connection right now. Testing testing testing this blog post. Blog posts are cool.

Blog Post number 1

less than 1 minute read

Published: August 14, 2012

This is a sample blog post. Lorem ipsum I can’t remember the rest of lorem ipsum and don’t have an internet connection right now. Testing testing testing this blog post. Blog posts are cool.

portfolio

Portfolio item number 1

Short description of portfolio item number 1

Portfolio item number 2

Short description of portfolio item number 2

publications

CLIP Exhibits Improved Compositional Generalization Through Representation Disentanglement

Published in Preprint (ICLR 2024 submission), 2023

Investigates how the compositional out‑of‑distribution generalization of CLIP models emerges from training data diversity and representation disentanglement. Demonstrates that richer attribute–object combinations in the training set lead to improved performance and that disentangling image and text representations enhances compositional generalization.

Download here

Hidden in Plain Sight: Evaluating Abstract Shape Recognition in Vision‑Language Models

Published in NeurIPS 2024 (Datasets & Benchmarks Track), 2024

Introduces IllusionBench, a dataset that hides letters, faces and animals inside everyday scenes to audit whether modern vision‑language models can recognize abstract shapes. Human subjects achieve near‑perfect accuracy on the tasks, whereas state‑of‑the‑art models score below 40 % zero‑shot, revealing significant robustness gaps.

Download here

Leveraging Retrieval‑Augmented Generation for Persian University Knowledge Retrieval

Published in 15th IKT (accepted – oral), 2024

Proposes a two‑stage retrieval‑augmented generation pipeline that combines Persian large language models with tailored prompt engineering to answer university‑related queries. Introduces the UniversityQuestionBench dataset and evaluates performance using faithfulness, answer relevance and context relevance metrics.

Download here

Context Awareness Gate for Retrieval‑Augmented Generation

Published in 15th IKT (accepted), 2024

Introduces the Context Awareness Gate (CAG), a mechanism that dynamically decides whether a query requires external context retrieval in a retrieval‑augmented generation pipeline. Includes a vector‑candidates method for scalable, LLM‑independent semantic search and demonstrates that skipping unnecessary retrieval improves answer quality.

Download here

Research Directions for Using LLM in Software Requirement Engineering: A Systematic Review

Published in Frontiers in Computer Science, 2025

Provides a comprehensive overview of how large language models can support requirement engineering, summarising current applications, challenges and future directions.

Download here

Adaptive Chunking for VideoRAG Pipelines with a Newly Gathered Bilingual Educational Dataset

Published in 29th CSICC (accepted), 2025

Leverages a bilingual educational dataset to propose a retrieval‑augmented video question answering pipeline with adaptive chunking, improving video‑to‑text inference for educational content.

Download here

Advanced Mutation Testing with Zero and Few‑Shot Evaluation Using GPT‑v4

Published in 29th CSICC (under review), 2025

Explores mutation testing using GPT‑v4 to evaluate software test cases under zero‑ and few‑shot settings, demonstrating the potential of large language models for automated software quality assurance.

Download here

MEENA (PersianMMMU): Multimodal‑Multilingual Educational Exams for N‑level Assessment

Published in Under review (COLM 2025), 2025

Presents the first large‑scale Persian multimodal benchmark for evaluating vision‑language models on scientific reasoning, problem‑solving and human‑level understanding. Contains 7,500 Persian and 3,000 English multimodal questions with rich metadata such as difficulty, descriptive answers and student success rates and evaluates GPT‑4, Gemini and other models under zero‑shot, few‑shot and hallucination detection settings.

Download here

talks

Zero-shot Learning in Medical Domain

Published: August 01, 2021

Focusing on developing models capable of recognizing unseen medical conditions from imaging data, this research tackles the challenge of improving diagnostic tools without extensive labeled datasets. Dr. Peyman Adibi guides the project, aiming to revolutionize diagnostic methodologies through zero-shot learning.

Text Summarization Using Graph Neural Network

Published: February 01, 2022

Under Dr. Hamidreza Baradaran’s guidance, this project employs graph neural networks for efficient text summarization. The objective is to enhance the coherence and relevance of automated summaries, facilitating improved information retrieval and understanding across large text corpora.

Vision-Linguistic Models

Published: February 01, 2023

This project aims to enhance the synergy between visual perception and language processing in AI systems, exploring the boundaries of machine understanding and interpretation of complex visual-textual information. The work under Prof. Mohammad Hossein Rohban investigates innovative methodologies to advance the capabilities of vision-linguistic models.

3D Vessel Segmentation Using Geometric Approaches

Published: May 01, 2023

This research focuses on leveraging geometric deep learning techniques for the accurate segmentation of blood vessels in 3D medical images. The aim is to enhance diagnostic and therapeutic procedures in medicine, showcasing the potential of geometric approaches in complex image segmentation tasks.

Counterfactual Modelling Using Vision-Linguistic Models

Published: August 01, 2023

Investigates generating counterfactual scenarios through integration of visual and linguistic data to enhance AI interpretability and reliability, focusing on applications such as autonomous driving and medical diagnostics. This research highlights the potential of vision-linguistic models in creating diverse, hypothetical scenarios for advanced problem-solving and decision-making processes.

Arshia Hemmat

Sitemap

Pages

Posts

portfolio

publications

talks

teaching