Abstract: Word clouds are a common way to summarize qualitative interviews, yet traditional frequency-based methods often fail in conversational contexts: they surface filler words, ignore paraphrase, and fragment semantically related ideas. This limits their usefulness in early-stage analysis, when researchers need fast, interpretable overviews of what participants actually said. We introduce ThemeClouds, an open-source visualization tool that uses large language models (LLMs) to generate thematic, participant-weighted word clouds from dialogue transcripts. The system prompts an LLM to identify concept-level themes across a corpus and then counts how many unique participants mention each theme, yielding a visualization grounded in breadth of mention rather than raw term frequency. Researchers can customize prompts and visualization parameters, providing transparency and control. In a case study of interviews from a user study comparing five recording-device configurations (31 participants; 155 transcripts, Whisper ASR), our approach surfaces more actionable device concerns than frequency clouds and topic-modeling baselines (e.g., LDA, BERTopic). We discuss design trade-offs for integrating LLM assistance into qualitative workflows, implications for interpretability and researcher agency, and opportunities for interactive analyses such as per-condition contrasts (``diff clouds'').
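The participant-weighting step is easy to make concrete. The sketch below is a minimal illustration, not the released ThemeClouds code: it assumes the LLM theme-assignment step has already mapped each participant to the set of concept-level themes mentioned in their transcript, and shows how weights are derived from unique-participant counts rather than term frequency. The function name and toy data are hypothetical.

```python
# Minimal sketch (not the released ThemeClouds code) of participant-weighted
# theme counting: each theme is weighted by how many unique participants
# mention it, rather than by raw term frequency. The LLM theme-assignment
# step is assumed to have already produced the input mapping.
from collections import defaultdict

def participant_weights(theme_assignments):
    """theme_assignments: dict mapping participant_id -> set of themes
    identified in that participant's transcript."""
    mentions = defaultdict(set)
    for participant, themes in theme_assignments.items():
        for theme in themes:
            mentions[theme].add(participant)
    # Weight = number of unique participants mentioning the theme.
    return {theme: len(pids) for theme, pids in mentions.items()}

# Toy data standing in for LLM output over three transcripts.
assignments = {
    "P01": {"battery life", "microphone placement"},
    "P02": {"battery life"},
    "P03": {"battery life", "privacy concerns"},
}
print(participant_weights(assignments))
# {'battery life': 3, 'microphone placement': 1, 'privacy concerns': 1}
```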
Abstract: Recent progress has been made in detecting early-stage dementia entirely from recordings of patient speech. Multimodal speech analysis methods were applied to the PROCESS challenge, which requires participants to use audio recordings of clinical interviews to classify patients as healthy controls, as having mild cognitive impairment (MCI), or as having dementia, and to regress patients' Mini-Mental State Exam (MMSE) scores. The approach implemented in this work combines acoustic features (eGeMAPS and Prosody) with embeddings from Whisper and RoBERTa models, achieving competitive results in both the regression (RMSE: 2.7666) and classification (Macro-F1 score: 0.5774) tasks. Additionally, a novel two-tiered classification setup is used to better differentiate between MCI and dementia. Our approach achieved strong results on the test set, ranking seventh on regression and eleventh on classification out of thirty-seven teams and exceeding the baseline results.
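The two-tiered setup lends itself to a short illustration. The sketch below is an assumption-laden outline, not the authors' implementation: it assumes the first tier separates healthy controls from impaired speakers and the second separates MCI from dementia, uses logistic regression as a stand-in classifier, and takes fused feature vectors (acoustic features concatenated with Whisper and RoBERTa embeddings, here replaced by random data) as input.

```python
# Minimal sketch of a two-tiered classifier over fused feature vectors.
# The tier split and the choice of logistic regression are illustrative
# assumptions, not the authors' published configuration.
import numpy as np
from sklearn.linear_model import LogisticRegression

class TwoTierClassifier:
    def __init__(self):
        self.tier1 = LogisticRegression(max_iter=1000)  # healthy control vs. impaired
        self.tier2 = LogisticRegression(max_iter=1000)  # MCI vs. dementia

    def fit(self, X, y):
        # y: 0 = healthy control, 1 = MCI, 2 = dementia
        impaired = y > 0
        self.tier1.fit(X, impaired.astype(int))
        self.tier2.fit(X[impaired], (y[impaired] == 2).astype(int))
        return self

    def predict(self, X):
        pred = np.zeros(len(X), dtype=int)
        impaired = self.tier1.predict(X).astype(bool)
        if impaired.any():
            pred[impaired] = 1 + self.tier2.predict(X[impaired])
        return pred

# Fused features would be e.g. np.hstack([egemaps, prosody, whisper_emb, roberta_emb]);
# random data is used here only to make the sketch runnable.
rng = np.random.default_rng(0)
X, y = rng.normal(size=(90, 32)), rng.integers(0, 3, size=90)
print(TwoTierClassifier().fit(X, y).predict(X[:5]))
```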
Abstract: Designing infinite impulse response (IIR) filters to match an arbitrary magnitude response requires specialized techniques. Methods like modified Yule-Walker are relatively efficient, but may not be sufficiently accurate in matching high-order responses. On the other hand, iterative optimization techniques often achieve superior performance, but come at the cost of longer run-times and are sensitive to initial conditions, requiring manual tuning. In this work, we address some of these limitations by learning a direct mapping from the target magnitude response to the filter coefficient space with a neural network (IIRNet) trained on millions of random filters. We demonstrate that our approach enables both fast and accurate estimation of filter coefficients given a desired response. We investigate training with different families of random filters, and find that training with a variety of filter families enables better generalization when estimating real-world filters, using head-related transfer functions and guitar cabinets as case studies. We compare our method against existing methods, including modified Yule-Walker and gradient descent, and show that IIRNet is, on average, both faster and more accurate.
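The direct-mapping idea can be sketched as a small model that consumes a sampled magnitude response and emits filter coefficients. The PyTorch code below is a hedged illustration only: the layer sizes, number of second-order sections, output parameterization, and the MagnitudeToSOS name are assumptions for this sketch, not the published IIRNet architecture, and the training loop is omitted.

```python
# Minimal sketch of a network mapping an N-point log-magnitude response directly
# to cascaded biquad (second-order section) coefficients. Training (not shown)
# would minimize the error between the target response and the magnitude
# response of the predicted filter, over a large set of random filters.
import torch
import torch.nn as nn

class MagnitudeToSOS(nn.Module):
    def __init__(self, n_bins=512, n_sections=8, hidden=1024):
        super().__init__()
        self.n_sections = n_sections
        self.net = nn.Sequential(
            nn.Linear(n_bins, hidden), nn.ReLU(),
            nn.Linear(hidden, hidden), nn.ReLU(),
            nn.Linear(hidden, n_sections * 6),  # b0, b1, b2, a0, a1, a2 per section (assumed layout)
        )

    def forward(self, log_mag):
        # log_mag: (batch, n_bins) target log-magnitude response
        coeffs = self.net(log_mag)
        return coeffs.view(-1, self.n_sections, 6)

model = MagnitudeToSOS()
target = torch.randn(4, 512)   # batch of target responses (random stand-in)
sos = model(target)            # (4, 8, 6) predicted second-order sections
print(sos.shape)
```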