PROCESS Challenge's Website

Dataset

In this challenge, we provide the community with the latest corpus for early-stage dementia detection. The corpus is highly relevant to dementia detection research and continues the impact trajectory of the previous challenges led by Luz and his team. The PROCESS data has the following advantages:

Prompts

The corpus collection method was designed on the basis of neuroscience research into dementia diagnosis. The corpus includes audio from three types of elicitation task: Semantic Fluency, Phonemic Fluency, and the Cookie Theft picture description.


Fig. 1. Cookie Theft picture from the Boston Diagnostic Aphasia Examination.

The Corpus

The training and development sets consist of audio recordings and the corresponding manual transcripts of each prompt for every speaker. For the classification task, we provide a diagnosis for each speaker (such as healthy volunteer, mild cognitive impairment (MCI), or dementia); for the regression task, we provide each speaker's Mini-Mental State Examination (MMSE) score.

Important: for the test set, neither transcripts nor diagnoses are available. Manual transcripts are withheld because providing them conflicts with the design principles of modern automated detection systems.
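
As an illustration only, the corpus layout described above could be represented in Python roughly as follows. This is a minimal sketch, not the official data format: the task keys, file paths, field names, and label strings are all hypothetical, and the example values are placeholders rather than real data. Transcripts, diagnoses, and MMSE scores are optional so that the same structure can hold a test-set speaker, for whom only audio is available.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional

# The three elicitation tasks named in the description.
# These short keys are hypothetical, chosen for this sketch only.
TASKS = ("semantic_fluency", "phonemic_fluency", "cookie_theft")


@dataclass
class PromptRecording:
    audio_path: str                    # hypothetical path to the audio file
    transcript: Optional[str] = None   # manual transcript; None when not provided


@dataclass
class Speaker:
    speaker_id: str
    recordings: Dict[str, PromptRecording] = field(default_factory=dict)
    diagnosis: Optional[str] = None    # classification target (label strings assumed)
    mmse: Optional[int] = None         # regression target (MMSE score)

    def is_labelled(self) -> bool:
        """True when both the diagnosis and the MMSE score are present."""
        return self.diagnosis is not None and self.mmse is not None

    def missing_tasks(self) -> List[str]:
        """Tasks for which this speaker has no recording, in TASKS order."""
        return [task for task in TASKS if task not in self.recordings]


def classification_targets(speakers: List[Speaker]) -> Dict[str, str]:
    """Map speaker id to diagnosis, skipping speakers without a label."""
    return {s.speaker_id: s.diagnosis for s in speakers if s.diagnosis is not None}


def regression_targets(speakers: List[Speaker]) -> Dict[str, int]:
    """Map speaker id to MMSE score, skipping speakers without a score."""
    return {s.speaker_id: s.mmse for s in speakers if s.mmse is not None}


if __name__ == "__main__":
    # Placeholder speakers for illustration; none of these values come from the corpus.
    train_speaker = Speaker(
        speaker_id="spk-001",
        recordings={
            task: PromptRecording(f"spk-001/{task}.wav", "placeholder transcript")
            for task in TASKS
        },
        diagnosis="MCI",
        mmse=26,
    )
    test_speaker = Speaker(
        speaker_id="spk-002",
        recordings={task: PromptRecording(f"spk-002/{task}.wav") for task in TASKS},
    )
    print(train_speaker.is_labelled(), test_speaker.is_labelled())
```

Keeping the labels optional on one record type, rather than defining separate training and test types, lets the same feature-extraction code run over every split while the target-building helpers simply skip unlabelled speakers.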