ArAIEval

Arabic AI Tasks Evaluation (ArAIEval)

Welcome to ArAIEval shared task at ArabicNLP 2023!

Propaganda and disinformation are widely used on both social media and mainstream media to influence or mislead audiences. This has become a significant concern for various stakeholders, including social media platforms and government agencies. Such concerns have motivated the organization of this shared task, aiming to further promote work on computational propaganda and disinformation detection in Arabic content.

ArAIEval offers two shared tasks: (i) persuasion technique detection, and (ii) disinformation detection. ArAIEval covers both the tweets and news articles domains and includes binary, multiclass and multlilabel subtasks.

Tasks

  • Task 1: Persuasion Technique Detection
  • Task 2: Disinformation Detection

Important Dates

  • 13 July 2023: Registration on codalab and beginning of the development cycle
  • June 2023: Registration at codalab (date TBA)
  • 27 July 2023: Beginning of the evaluation cycle (test sets release)
  • 5 August 2023 23:59 AOE: End of the evaluation cycle (run submission)
  • 12 August 2023:** Beginning of the evaluation cycle (test sets release and run submission)
  • 13 August 2023 23:59 AOE: Beginning of the evaluation cycle (test sets release and run submission)
  • 15 August 2023 23:59 AOE:** End of the evaluation cycle (run submission)
  • 18 August 2023 23:59 AOE:** End of the evaluation cycle (run submission)
  • 20 August 2023 23:59 AOE: End of the evaluation cycle (run submission)
  • 29 August 2023:** Deadline for the submission of working notes
  • 5 September 2023:** Deadline for the submission of working notes
  • 12 September 2023: Deadline for the submission of working notes
  • 12 October 2023: Notification of acceptance of working notes
  • 20 October 2023: Deadline for submission of camera-ready working notes
  • 7 December 2023: WANLP ArabicNLP Conference (colocated with EMNLP-2023)

Recent Updates

  • 22 August, 2023: Release of the leaderboard.
  • 14 August, 2023: The test sets for all tasks have been released.
  • 12 July, 2023: Datasets release
  • 1st June, 2023: Website is up!

Proceedings

Instructions to Prepare Your Paper:


The title of paper should be in the following format:
<Team Name> at ArAIEval Shared Task: <Name of the title>
For example: Scholarly at ArAIEval Shared Task: LLMs in Persuasion Technique Detection for Identifying Manipulative Strategies

Cite: Please check the citation section to cite the shared task papers.


Templates: Please find the templates at: https://2023.emnlp.org/calls/style-and-formatting/

Paper Length: Shared task description papers may include up to 4 pages of content, in addition to unlimited references.

More information: For more details, please visit the following link: https://arabicnlp2023.sigarab.org/call-for-papers

Paper submission: https://openreview.net/group?id=SIGARAB.org/ArabicNLP/2023/Conference

Suggested structure for the system description papers:


Abstract: four/five sentences highlighting your approach and key results.
Introduction: 3/4 a page expanding on the abstract mentioning key background such as why the task is challenging for current modeling techniques and why your approach is interesting/novel.
Related Work: This section provides a review of existing research in the field, with a focus on highlighting how our contribution diverges from and adds value to the current body of work.
Data: review of the data you used to train your system. Be sure to mention the size of the training, validation and test sets that you’ve used, and the label distributions, as well as any tools you used for preprocessing data.
System: a detailed description of how the systems were built and trained. If you’re using a neural network, were there pre-trained embeddings, how was the model trained, what hyperparameters were chosen and experimented with? How long did the model take to train, and on what infrastructure? Linking to source code is valuable here as well, but the description should be able to stand alone as a full description of how to reimplement the system. While other paper styles include background as a separate section, it’s fine to simply include citations to similar systems which inspired your work as you describe your system.
Results: a description of the key results of the paper. If you have done extra error analysis into what types of errors the system makes, this is extremely valuable for the reader. Unofficial results from after the submission deadline can be very useful as well.
Discussion: general discussion of the task and your system. Description of characteristic errors and their frequency over a sample of development data. What would you do if you had another 3 months to work on it?
Conclusion: a restatement of the introduction, highlighting what was learned about the task and how to model it.

Citation

Please cite the following papers.

[1] Maram Hasanain, Firoj Alam, Hamdy Mubarak, Samir Abdaljalil, Wajdi Zaghouani, Preslav Nakov, Giovanni Da San Martino, and Abdelhakim Freihat. 2023. “ArAIEval Shared Task: Persuasion Techniques and Disinformation Detection in Arabic Text.” In Proceedings of the First Arabic Natural Language Processing Conference (ArabicNLP 2023), December. Singapore: Association for Computational Linguistics.
[2] Firoj Alam, Hamdy Mubarak, Wajdi Zaghouani, Giovanni Da San Martino, and Preslav Nakov. 2022. “Overview of the WANLP 2022 Shared Task on Propaganda Detection in Arabic.”, In Proceedings of the Seventh Arabic Natural Language Processing Workshop (WANLP), 108–118, December. Abu Dhabi, United Arab Emirates (Hybrid): Association for Computational Linguistics. https://aclanthology.org/2022.wanlp-1.11.
[3] Mubarak, Hamdy, Samir Abdaljalil, Azza Nassar, and Firoj Alam. 2023. “Detecting and Identifying the Reasons for Deleted Tweets Before They Are Posted.” Frontiers in Artificial Intelligence 6. https://doi.org/10.3389/frai.2023.1219767.

@inproceedings{araieval:arabicnlp2023-overview,
    title = "ArAIEval Shared Task: Persuasion Techniques and Disinformation Detection in Arabic Text",
    author = "Hasanain, Maram and Alam, Firoj and Mubarak, Hamdy, and Abdaljalil, Samir  and Zaghouani, Wajdi and Nakov, Preslav  and Da San Martino, Giovanni and Freihat, Abed Alhakim",
    booktitle = "Proceedings of the First Arabic Natural Language Processing Conference (ArabicNLP 2023)",
    month = Dec,
    year = "2023",
    address = "Singapore",
    publisher = "Association for Computational Linguistics",
}

@inproceedings{alam-etal-2022-overview,
    title = "Overview of the {WANLP} 2022 Shared Task on Propaganda Detection in {A}rabic",
    author = "Alam, Firoj  and
      Mubarak, Hamdy  and
      Zaghouani, Wajdi  and
      Da San Martino, Giovanni  and
      Nakov, Preslav",
    booktitle = "Proceedings of the The Seventh Arabic Natural Language Processing Workshop (WANLP)",
    month = dec,
    year = "2022",
    address = "Abu Dhabi, United Arab Emirates (Hybrid)",
    publisher = "Association for Computational Linguistics",
    url = "https://aclanthology.org/2022.wanlp-1.11",
    pages = "108--118",
}

@article{10.3389/frai.2023.1219767,
  author    = {Hamdy Mubarak and Samir Abdaljalil and Azza Nassar and Firoj Alam},
  title     = {Detecting and identifying the reasons for deleted tweets before they are posted},
  journal   = {Frontiers in Artificial Intelligence},
  volume    = {6},
  year      = {2023},
  url       = {https://www.frontiersin.org/articles/10.3389/frai.2023.1219767},
  doi       = {10.3389/frai.2023.1219767},
  issn      = {2624-8212},
}

Contact

Organizers

  • Firoj Alam, Qatar Computing Research Institute, HBKU
  • Hamdy Mubarak, Qatar Computing Research Institute, HBKU
  • Maram Hasanain, Qatar Computing Research Institute, HBKU
  • Samir Abdaljalil, Qatar Computing Research Institute, HBKU
  • Wajdi Zaghouani, HBKU
  • Preslav Nakov, Mohamed bin Zayed University of Artificial Intelligence
  • Giovanni Da San Martino, University of Padova
  • Abdelhakim Freihat, University of Trento