Databases

  • SUBTLEX-UK: Frequency for British English based on subtitles of British television programmes
  • SUBTLEX-CH: A database of Chinese word and character frequencies
  • Latent Semantic Analysis
  • MRC Psycholinguistic Database: for familiarity, imageability and concreteness ratings, among others
  • MCWord: An Orthographic Wordform Database: for obtaining orthographic neighbourhood frequency and generating lists of nonwords
  • Sentence norms: Completion norms for 3085 English sentence contexts, for obtaining sentences with a target word with a certain cloze value
  • Cloze/Sentence norms: Cloze probability and completion norms for 498 English sentences (Block & Baldwin, 2010)
  • Cloze/Sentence norms: Cloze probability, predictability ratings, and computational estimates for 205 English sentences (de Varda, Marelli & Amenta, 2023)
  • Kanji database: Kanji frequency, On- and Kun-reading frequencies, On-reading ratio, kanji productivity of two-kanji compounds, symmetry of kanji productivity, entropy, number of meanings, etc.
  • Online Japanese Accent Dictionary : for obtaining Tokyo accent and generating lists of words with a certain accent type
  • University of South Florida Free Association Norms: Semantic Association database
  • Google Ngram Viewer: Ngrams in Google's text corpora
  • CLEARPOND: The Cross-Linguistic Easy-Access Resource for Phonological and Orthographic Neighborhood Densities
  • The Auditory English Lexicon Project (AELP): A multi-talker, multi-region psycholinguistic database of 10,170 spoken words and 10,170 spoken nonwords
  • LexiCAL: A calculator for lexical variables
  • Corpora

  • British National Corpus: This corpus contains a 100 million words of text texts from a wide range of genres
  • Corpus of Contemporary American English: This corpus contains more than 520 million words of text. It is equally divided among spoken, fiction, popular magazines, newspapers, and academic texts.
  • Chunagon (中納言): Corpus of written & spoken Japanese from NINJAL
  • Auditory English Lexicon Project (AELP): A multi-talker, multi-region psycholinguistic database of 10,170 spoken words and nonwords. (English)
  • Eye-tracking corpora

  • The Provo Corpus: A Large Eye-Tracking Corpus with Predictability Norms. Luke, S.G. & Christianson, K. (2018). The Provo Corpus: A Large Eye-Tracking Corpus with Predictability Ratings. Behavior Research Methods, 50, 826-833.
  • GECO: An eye-tracking corpus of monolingual and bilingual sentence reading. Cop, U., Dirix, N., Drieghe, D., & Duyck, W. (2017). Presenting GECO: An eyetracking corpus of monolingual and bilingual sentence reading. Behavior Research Methods, 49(2), 602-615.
  • MECO: An eye-tracking corpus of multilingual L2 (English) sentence reading. Siegelman et al. (2022). Expanding horizons of cross-linguistic research on reading: The Multilingual Eye-movement Corpus (MECO). Behavior Research Methods.
  • Chinese eye-tracking reading corpus: An eye-tracking corpus of Chinese sentence reading. Zhang et al. (2022). The database of eye-movement measures on words in Chinese reading. Scientific Data.
  • E-books

  • Linear Mixed Models in Linguistics and Psychology: A Comprehensive Introduction by Shravan Vasishth, Daniel Schad, Audrey Bürki, and Reinhold Kliegl
  • R for Data Science by Hadley Wickham and Garret Grolemund
  • Learning Statistical Models Through Simulation in R by Dale Barr
  • R for Psychological Research (Course materials) by Glenn Williams
  • One Way ANOVA with R Completely Randomized Design - Between Groups by Bruce Dudek
  • Doing Meta-Analysis with R: A Hands-On Guide by Mathias Harrer, Pim Cuijpers, Toshi A. Furukawa & David D. Ebert
  • Power Analysis with Superpower by Aaron R. Caldwell, Daniël Lakens, Chelsea M. Parlett-Pelleriti, Guy Prochilo & Frederik Aust
  • Picture databases

    Black & White

  • 260 pictures standardised in English: Snodgrass & Vanderwart (1980)
  • 360 pictures standardised in Japanese: Nishimoto, Ueda, Miyawaki, Une, & Takahashi (2012)
  • International Picture Naming Project
  • The Noun Project
  • Colour

  • Bank of Standardised stimuli (BOSS): Brodeur, Dionne-Dostie, Montreuil, & Lepage (2010)
  • Colour version of Snodgrass & Vanderwart (1980): Moreno-Martínez & Montoro (2012)
  • MultiPic: A standardized set of 750 drawings with multilingual norms
  • LinguaPix: 1,620 colour photographs normed in Dutch, English, Polish, and Cantonese
  • Tools

  • Mix: stimuli (pseudo-)randomisation tool
  • Research Randomizer: simple randomisation tool
  • Ralpha: a software for resizing images (only for Windows)
  • LexTALE: Lexical Test for Advanced Learners of English. Lemhöfer, K., & Broersma, M. (2012). Introducing LexTALE: A quick and valid Lexical Test for Advanced Learners of English. Behavior Research Methods, 44, 325-343.
  • PCIbex Farm: web-based experiments
  • Working memory span tests: available in Czech, English, German, Japanese, Russian, and Spanish (credit: Titus von der Malsburg)
  • PsychoPy experiment templates: reaction time experiments, digit span, counterbalancing, mouse tracking, self-paced reading etc.
  • Linger: a software for self-paced reading experiments
  • Whisper Large V3: Transcribe Audio: transcribe long-form microphone or audio inputs
  • Restream: transcribe audio to text
  • WebPower: statistical power analysis online
  • Tutorials

  • PsyTeachR: Many useful R resources from the University of Glasgow
  • Introduction to mixed-effects models: by Ian Cunnings & George Pontikas (YouTube)
  • Tutorials for visual world eye-tracking data analysis in R: R tutorials I made for a workshop in 2023 based on Ito & Knoeferle (2022, Behavior Research Methods). You can find tutorial videos in the Media tab.
  • Simulation-based power analysis: from Kumle, Võ and Draschkow (2021, Behavior Research Methods)
  • ERP training: ERPLAB tutorials by Jen Lewendon