Published Research Data

Request access to our datasets.

Every dataset we release is shared through a lightweight Google Drive workflow — request access, tell us how you intend to use it, and we review and grant within a few working days.

0
Public Datasets
0.69MB
Imaging data
0
Labels Across Sets
0DAYS
Typical Turnaround
Available datasets

Three sets we maintain.

Each set is versioned, citable via DOI, and accompanied by a descriptor on IEEE DataPort. Click a DOI badge to copy it.

Brain MRI ND-5 Dataset

A curated MRI imaging set built for self-supervised pre-training and downstream brain tumor classification. Labeled across four diagnostic classes, distributed as a single archive ready for federated learning pipelines.

993.69 MBFORMAT · ZIPTYPE · MRI Images4 LABELED CLASSES
AuthorsMahamodul Hasan Mahadi · Md. Nasif Safwan · Souhardo Rahman · Taharat M. Jabir

Multi-Label Extremism and Jihadism Classification — Tweets

A multilingual corpus of curated tweets used to train and evaluate models distinguishing extremist content from adjacent but lawful speech. Released to support reproducible ethical-content research.

394.37 KBFORMAT · CSVTYPE · Multilingual Tweets7 BINARY LABELS
AuthorsMd. Nasif Safwan · Souhardo Rahman · Abdulla Al Hasib · Fahim Al Shihab

Multilabel Extremism Classification — Tweets

A larger companion corpus offering multi-label annotations on extremist content in tweet text — designed for fine-grained NLP studies on intersecting categories of harmful speech.

5.22 MBFORMAT · CSVTYPE · Tweet TextMULTI-LABEL NLP
AuthorsSouhardo Rahman · Abdulla Al Hasib · Mahamodul Hasan Mahadi
Low-load workflow

From email to access in three steps.

Lightweight by design — we keep it human, not a form-wall.

01

Choose a dataset

Pick the dataset that fits your research and grab its DOI from the card above. We'll need this in your email.

02

Send a short email

Email us at archeintelligencelab@gmail.com with the details listed in the checklist below. Two or three sentences is plenty.

03

Review & grant

We review within a few working days and grant Drive access to the Google account you provided. Citation guidance follows.

/ Checklist

What to include in your request.

  • Dataset name & DOISo we know exactly which set you're after — copy it straight from the card.
  • Your Google accountThe address we'll share the Drive folder with — best to use an institutional one.
  • Organization, role & intended useA few sentences on who you are and how the data fits into your research is plenty.
Terms

Data usage protocols & ethical guidelines.

By requesting access, you agree to use our data responsibly — these four points are non-negotiable.

01 · Distribution

Non-Redistribution

Data may not be re-hosted, mirrored, or redistributed in any form. Send collaborators back to the lab to request their own access.

02 · Attribution

Proper Citation

Cite the dataset using the DOI we provide and acknowledge Arché Intelligence Lab in any work that draws from it — papers, preprints, talks.

03 · Scope

Academic Scope

Use is limited to non-commercial academic research. Commercial or production use requires a separate written agreement.

04 · Integrity

Ethical Integrity

Findings must respect the privacy of subjects, the safety of vulnerable groups, and the spirit in which the data was collected.

Access

Ready to work with our data?

Send us a short email with your research context. We review within a few working days and share via Google Drive — no form-walls, no waiting lists.