Publications
2023
- ACLTowards Domain-Agnostic and Domain-Adaptive Dementia Detection from Spoken LanguageShahla Farzana, and Natalie PardeACL, Jul 2023
Health-related speech datasets are often small and varied in focus. This makes it difficult to leverage them to effectively support healthcare goals. Robust transfer of linguistic features across different datasets orbiting the same goal carries potential to address this concern. To test this hypothesis, we experiment with domain adaptation (DA) techniques on heterogeneous spoken language data to evaluate generalizability across diverse datasets for a common task: dementia detection. We find that adapted models exhibit better performance across conversational and task-oriented datasets. The feature-augmented DA method achieves a 22% increase in accuracy adapting from a conversational to task-specific dataset compared to a jointly trained baseline. This suggests promising capacity of these techniques to allow for productive use of disparate data for a complex spoken language healthcare task.
@article{farzana-parde-2023-towards, title = {Towards Domain-Agnostic and Domain-Adaptive Dementia Detection from Spoken Language}, author = {Farzana, Shahla and Parde, Natalie}, booktitle = {Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)}, month = jul, year = {2023}, address = {Toronto, Canada}, publisher = {Association for Computational Linguistics}, url = {https://aclanthology.org/2023.acl-long.668}, doi = {10.18653/v1/2023.acl-long.668}, pages = {11965--11978}, journal = {ACL} } - WWWKnowledge Graph-Enhanced Neural Query RewritingShahla Farzana, Qunzhi Zhou, and Petar RistoskiIn Companion Proceedings of the ACM Web Conference 2023, Jul 2023
The main task of an e-commerce search engine is to semantically match the user query to the product inventory and retrieve the most relevant items that match the user’s intent. This task is not trivial as often there can be a mismatch between the user’s intent and the product inventory for various reasons, the most prevalent being: (i) the buyers and sellers use different vocabularies, which leads to a mismatch; (ii) the inventory doesn’t contain products that match the user’s intent. To build a successful e-commerce platform it is of paramount importance to be able to address both of these challenges. To do so, query rewriting approaches are used, which try to bridge the semantic gap between the user’s intent and the available product inventory. Such approaches use a combination of query token dropping, replacement and expansion. In this work we introduce a novel Knowledge Graph-enhanced neural query rewriting in the e-commerce domain. We use a relationship-rich product Knowledge Graph to infuse auxiliary knowledge in a transformer-based query rewriting deep neural network. Experiments on two tasks, query pruning and complete query rewriting, show that our proposed approach significantly outperforms a baseline BERT-based query rewriting solution.
2022
- SIGdialAre Interaction Patterns Helpful for Task-Agnostic Dementia Detection? An Empirical ExplorationShahla Farzana, and Natalie PardeSIGdial, Sep 2022
Dementia often manifests in dialog through specific behaviors such as requesting clarification, communicating repetitive ideas, and stalling, prompting conversational partners to probe or otherwise attempt to elicit information. Dialog act (DA) sequences can have predictive power for dementia detection through their potential to capture these meaningful interaction patterns. However, most existing work in this space relies on content-dependent features, raising questions about their generalizability beyond small reference sets or across different cognitive tasks. In this paper, we adapt an existing DA annotation scheme for two different cognitive tasks present in a popular dementia detection dataset. We show that a DA tagging model leveraging neural sentence embeddings and other information from previous utterances and speaker tags achieves strong performance for both tasks. We also propose content-free interaction features and show that they yield high utility in distinguishing dementia and control subjects across different tasks. Our study provides a step toward better understanding how interaction patterns in spontaneous dialog affect cognitive modeling across different tasks, which carries implications for the design of non-invasive and low-cost cognitive health monitoring tools for use at scale.
@article{farzana-parde-2022-interaction, title = {Are Interaction Patterns Helpful for Task-Agnostic Dementia Detection? An Empirical Exploration}, author = {Farzana, Shahla and Parde, Natalie}, booktitle = {Proceedings of the 23rd Annual Meeting of the Special Interest Group on Discourse and Dialogue}, month = sep, year = {2022}, address = {Edinburgh, UK}, publisher = {Association for Computational Linguistics}, url = {https://aclanthology.org/2022.sigdial-1.18}, pages = {172--182}, journal = {SIGdial} } - BioNLPHow You Say It Matters: Measuring the Impact of Verbal Disfluency Tags on Automated Dementia DetectionShahla Farzana, Ashwin Deshpande, and Natalie PardeIn Proceedings of the 21st Workshop on Biomedical Language Processing, May 2022
2020
- INTERSPEECHExploring MMSE Score Prediction Using Verbal and Non-Verbal CuesShahla Farzana, and Natalie PardeInterspeech, May 2020
@article{farzana20_interspeech, author = {Farzana, Shahla and Parde, Natalie}, title = {{Exploring MMSE Score Prediction Using Verbal and Non-Verbal Cues}}, year = {2020}, booktitle = {Proc. Interspeech 2020}, pages = {2207--2211}, doi = {10.21437/Interspeech.2020-3085}, journal = {Interspeech} } - LRECModeling Dialogue in Conversational Cognitive Health Screening InterviewsShahla Farzana, Mina Valizadeh, and Natalie PardeIn Proceedings of the Twelfth Language Resources and Evaluation Conference, May 2020
Automating straightforward clinical tasks can reduce workload for healthcare professionals, increase accessibility for geographically-isolated patients, and alleviate some of the economic burdens associated with healthcare. A variety of preliminary screening procedures are potentially suitable for automation, and one such domain that has remained underexplored to date is that of structured clinical interviews. A task-specific dialogue agent is needed to automate the collection of conversational speech for further (either manual or automated) analysis, and to build such an agent, a dialogue manager must be trained to respond to patient utterances in a manner similar to a human interviewer. To facilitate the development of such an agent, we propose an annotation schema for assigning dialogue act labels to utterances in patient-interviewer conversations collected as part of a clinically-validated cognitive health screening task. We build a labeled corpus using the schema, and show that it is characterized by high inter-annotator agreement. We establish a benchmark dialogue act classification model for the corpus, thereby providing a proof of concept for the proposed annotation schema. The resulting dialogue act corpus is the first such corpus specifically designed to facilitate automated cognitive health screening, and lays the groundwork for future exploration in this area.
2016
- WDMaximally pair-wise disjoint set covers for directional sensors in visual sensor networksShahla Farzana, Khaleda Akther Papry, Ashikur Rahman, and 1 more authorIn 2016 Wireless Days (WD), May 2016