Sitemap

A list of all the posts and pages found on the site. For you robots out there, an XML version is also available for digesting.

Pages

About

Posts

Thinking in Extremes

7 minute read

Published:

This essay discusses the cognitive tool of thinking in spectra and extremes.

The Limits of Thinking

12 minute read

Published:

This essay discusses the mental concept of thinking in limits. Its utility is demonstrated in a situation where humans have poor intuition.

Cognitive Unit Tests

13 minute read

Published:

This essay introduces the concept of “cognitive unit tests”.

Essays in Progress

less than 1 minute read

Published:

This post describes my approach to essay writing.

Publications

Nora the empathetic psychologist

Published in INTERSPEECH, 2017

Nora is a new dialog system that mimics a conversation with a psychologist by screening for stress, anxiety, and depression. She understands, empathizes, and adapts to users using emotional intelligence modules trained via statistical modelling such as Convolutional Neural Networks. These modules also enable her to personalize the content of each conversation.

Winata et al. (2017). "Nora the Empathetic Psychologist." INTERSPEECH. [PDF]

Perspectives on machine learning from psychology’s reproducibility crisis

Published in ICLR Workshop of Science and Engineering of Deep Learning, 2021

This short paper presents select ideas from psychology’s reformation and translates them for a machine learning audience.

Bell and Kampman (2021). "Perspectives on Machine Learning from Psychology's Reproducibility Crisis." ICLR Workshop of Science and Engineering of Deep Learning. [PDF]

Modeling the machine learning multiverse

Published in NeurIPS, 2022

This work models the multiverse with a Gaussian Process surrogate and applies Bayesian experimental design to efficiently explore high-dimensional and often continuous ML search spaces. It also synthesizes conflicting research on the effect of learning rate on the large-batch training generalization gap.

Bell et al. (2022). "Modeling the Machine Learning Multiverse." NeurIPS. [PDF]

Time-varying functional connectivity as Wishart processes

Published in Imaging Neuroscience, 2024

The Wishart process (WP) outperformed a sliding window approach with adaptive cross-validated window lengths and a dynamic conditional correlation-multivariate generalized autoregressive conditional heteroskedasticity (MGARCH) baseline on the external stimulus prediction task, while being less prone to false positives in the time-varying functional connectivity (TVFC) null models.

Kampman et al. (2024). "Time-varying functional connectivity as Wishart processes." Imaging Neuroscience. [PDF]

SEACrowd: A multilingual multimodal data hub and benchmark suite for Southeast Asian languages

Published in EMNLP, 2024

This work introduces SEACrowd, a comprehensive resource center that fills the resource gap by providing standardized corpora in nearly 1,000 SEA languages across three modalities, and assesses the quality of AI models on 36 indigenous languages across 13 tasks.

Lovenia et al. (2024). "SEACrowd: A Multilingual Multimodal Data Hub and Benchmark Suite for Southeast Asian Languages." EMNLP. [PDF]

A multi-agent dual dialogue system to support mental health care providers

Published in arXiv, 2024

This paper presents a general-purpose, human-in-the-loop dual dialogue system to support mental health care professionals. It finds that the proposed responses matched a reasonable human quality in demonstrating empathy, showing the system's appropriateness for augmenting the work of mental health care providers.

Kampman et al. (2024). "A Multi-Agent Dual Dialogue System to Support Mental Health Care Providers." arXiv. [PDF]

Crowdsource, crawl, or generate? Creating SEA-VL, a multicultural vision-language dataset for Southeast Asia

Published in ACL, 2025

SEA-VL, an open-source initiative dedicated to developing high-quality, culturally relevant data for SEA languages, aims to bridge the representation gap in SEA, fostering the development of more inclusive AI systems that authentically represent diverse cultures across SEA.

Cahyawijaya et al. (2025). "Crowdsource, Crawl, or Generate? Creating SEA-VL, a Multicultural Vision-Language Dataset for Southeast Asia." ACL. [PDF]

Conversational self-play for discovering and understanding psychotherapy approaches

Published in AI4X Conference, 2025

This paper explores conversational self-play with LLMs as a scalable approach for analyzing and exploring psychotherapy approaches, evaluating how well AI-generated therapeutic dialogues align with established modalities.

Kampman (2025). "Conversational Self-Play for Discovering and Understanding Psychotherapy Approaches." AI4X Conference. [PDF]

Harnessing digital phenotyping to advance university student mental health (Brightline) in Singapore: Study protocol for a prospective observational study

Published in BMJ Open, 2025

This study will employ an observational study design over a 6-month period, recruiting 500 students from a major public university in Singapore, to identify the digital biomarkers associated with depression, anxiety, stress, loneliness and affect among university students.

Ito et al. (2025). "Harnessing digital phenotyping to advance university student mental health (Brightline) in Singapore: study protocol for a prospective observational study." BMJ Open. [PDF]

Mind the Gap: Aligning knowledge bases with user needs to enhance mental health retrieval

Published in NeurIPS GenAI4Health Workshop, 2025

Access to reliable mental health information is vital for early help-seeking, yet expanding knowledge bases is resource-intensive and often misaligned with user needs. This results in poor performance of retrieval systems when presented concerns are not covered or expressed in informal or contextualized language. We present an AI-based gap-informed framework for corpus augmentation that authentically identifies underrepresented topics (gaps) by overlaying naturalistic user data such as forum posts in order to prioritize expansions based on coverage and usefulness. In a case study, we compare Directed (gap-informed augmentations) with Non-Directed augmentation (random additions), evaluating the relevance and usefulness of retrieved information across four retrieval-augmented generation (RAG) pipelines. Directed augmentation achieved near-optimal performance with modest expansions (requiring only a 42% increase for Query Transformation, 74% for Reranking and Hierarchical, and 318% for Baseline) to reach ~95% of the performance of an exhaustive reference corpus. In contrast, Non-Directed augmentation required substantially larger and thus practically infeasible expansions to achieve comparable performance (232%, 318%, 403%, and 763%, respectively). These results show that strategically targeted corpus growth can reduce content creation demands while sustaining high retrieval and provision quality, offering a scalable approach for building trusted health information repositories and supporting generative AI applications in high-stakes domains.

Chan et al. (2025). "Mind the Gap: Aligning Knowledge Bases with User Needs to Enhance Mental Health Retrieval." NeurIPS GenAI4Health Workshop. [PDF]

Worker skills associated with outcomes in suicidal-related youth chat sessions

Published in OSF, 2026

Introduction: Text-based chatlines have become preferred entry points for youth seeking mental health support, yet most research examines dedicated crisis services rather than general chatlines where suicide emerges alongside diverse concerns. This study compared suicidal-related and non-suicidal sessions within a general youth chatline to identify session characteristics and worker skills associated with positive outcomes. Methods: We analyzed 1,710 chat sessions (202,336 messages) from QuickChat, a Singapore youth chatline, between 2016 and 2020. Large language models classified sessions as suicidal or self-harm related (n=406, 24%) or non-suicidal (n=1,304, 76%). User-reported outcomes measured service quality and coping ability. Twelve therapeutic skills were coded from 79,587 worker messages. Multilevel regression models examined skill-outcome associations. Results: Suicidal-related sessions were significantly longer, contained more messages, and yielded lower outcomes. Suicidal ideation was most prevalent (85%), followed by self-harm (43%). In suicidal sessions, normalization demonstrated the strongest associations with all outcomes, followed by teaching/psychoeducation and making strengths explicit. These patterns differed substantially from non-suicidal sessions. Conclusion: Suicidal-related sessions within general chatlines demand greater engagement and differentiated responses from workers. Normalization and psychoeducation emerge as effective techniques for improving outcomes in suicidal chats. These findings provide actionable guidance for training frontline workers in general youth services.

Chung, Lim, and Kampman (2026). "Worker Skills Associated With Outcomes In Suicidal-Related Youth Chat Sessions." OSF. [PDF]

The let’s talk Digital Peer Support Forum for Youth Mental Health and Wellbeing in Singapore: A Three-Year Process Evaluation and Framework Description

Published in JMIR Mental Health Preprint, 2026

let’s talk is a Singaporean mental health forum. This article describes its Theory of Change and discusses a process evaluation of its first three years.

Weng et al. (2026). "The let’s talk Digital Peer Support Forum for Youth Mental Health and Wellbeing in Singapore: A Three-Year Process Evaluation and Framework Description." JMIR Mental Health Preprint. [PDF]

Teaching