Skip to content

Creating Arabic Datasets with respect to Arabic Dialects  #103

@AliAlsalkhadi

Description

@AliAlsalkhadi

@Mahmoud-s-programs and I went through the articles recommended by @Sepideh-Ahmadian and after a long discussion to find the best way to gather the Arabic datasets with respect to the dialects is by creating different datasets for each region (Gulf, Levantine, Egyptian, Meghrbi). This will encapsulate all Arabic dialects and the model will be able to recognize them.

We have added more reviews to the semEval-2016 dataset already as it uses Gulf dialect exclusively.

Screenshot 2024-11-14 234904

Metadata

Metadata

Assignees

Labels

No labels
No labels

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions