The LADAL Opening Webinar Series 2021 consists of weekly presentations from a wide range of voices with backgrounds in linguistics, data science, or computational humanities and it covers a variety of topics related to digital handling of language data!

The LADAL Opening Webinar Series 2021 kicked off with a presentation by Stefan Th. Gries on MuPADRF (Multifactorial Prediction and Deviation Analysis Using Regression/Random Forests) on June 3, 2021, 5pm Brisbane time.

All events take place on Zoom (the Zoom links are provided in the webinar descriptions below and they are announced on Twitter (@slcladal), via the UQ School of Languages and Cultures, and via our collaborators). Once the recordings of the webinars are processed, they are made available on the LADAL YouTube channel.

See below for the full list of presentations that are part of the LADAL Opening Webinar Series 2021.


July 28: Vincent DeLuca, Toms Voits, & Jason Rothman

Neurocognitve effects of bilingual experience

Time: 5pm Brisbane, 9am Berlin

Zoom link:


Much research over the past two decades shows that bilingualism affects brain structure, function, and potentially domain-general cognition (see e.g., Bialystok 2017; Pliatsikas 2019). The specificity of these effects, however, has become the subject of significant debate in recent years, in large part due to variability of findings across studies (see Leivada et al. 2020 for review). In this talk, we will introduce our research programs within the Psycholinguistics of Language Representation (PoLaR) lab that addresses the juxtaposition of data and argumentation. Our work is guided by the principle that although bilingual effects are existent, they are conditional. In other words, bilingualism per se is not a sufficient condition for relevant effects on neurocognition. We will review our work that is generally designed to test the hypothesis that specific experience-based factors (EBFs) variably affect neural activity and plasticity in brain regions and pathways implicated in language- and executive control across the lifespan. We present results from a series of MRI studies showing a specificity of neural adaptations to different EBFs (DeLuca, Rothman, and Pliatsikas 2019; DeLuca et al. 2019, 2020) in younger adults. We will also present data from older adults, showing similar EBF effects in healthy cognitive ageing (Voits, Robson, and Pliatsikas, n.d.) and with mild cognitive impairment (Voits et al., n.d.). EBFs related to duration of bilingual language use correlate to neurocognitive adaptations suggesting increased efficiency in language control, whereas those related to extent of additional language use correlate with adaptations suggesting increased control demands. Considered together, these data suggest that the brain strives to be maximally effective and efficient in language processing and control, which in turn affects domain-general cognitive processes proportionally to degree of engagement with bilingual experiences. The work in older populations leads to the conclusion that degree of engagement with bilingualism is a catalyst for cognitive/brain reserve and thus has some real-world benefits in aging.

About Vince

Vincent is Associate Professor in the Neurocognition of Bilingualism and co-director of the Psycholinguistics of Language Representation (PoLaR) lab in the AcqVa Aurora Centre at UiT-The Arctic University of Norway. His research is focused on how different aspects of bilingual language experience variably impact brain structure, function, and several cognitive processes. His work focuses on how these neural and cognitive adaptations dynamically shift over time and with changes to patterns of language use.

About Toms

Toms is a Postdoctoral Researcher at UiT the Arctic University of Norway. He is affiliated with the Psycholinguistics of Language Representation (PoLaR) lab and the AcqVA Aurora Center in the Department of Language and Culture at UiT. His work is primarily focused on investigating the effects of bilingualism on neurocognition, with a particular interest in examining bilingualism as a contributing factor to cognitive and brain reserves in the later years of life.

About Jason

Jason is Professor of Linguistics at UiT the Arctic University of Norway and Senior Researcher in Cognitive Science at Universidad Nebrija (Spain). He is deputy director of the AcqVA Aurora Center at UiT, where he co-leads the center’s theme/concentration on the Neurocognition of Bilingualism. Professor Rothman also co-directs the center’s Psycholinguistics of Language Representation (PoLaR) lab. A linguist by training, he has worked extensively on language acquisition, linguistic processing and language-associated links to domain general neurocognition across the lifespan of varioustypes of bi-/multilingual populations. He is founding editor of the journal Linguistic Approaches to Bilingualism and serves as executive editor of the book series Studies in Bilingualism.

August 2: Simon Musgrave

A gentle introduction to networks

Time: 5pm Brisbane, 9am Berlin

Zoom link:


Linguists have adopted several methods from data science, but network analysis has been used rather less than others even though it is a useful tool. This presentation will introduce the basics of using network analysis, discussing the types of problems for which the method is useful, the kinds of data which are amenable to analysis, and the graphical outputs which can be achieved. These points will be illustrated with several examples from different areas of linguistic research, as well as with an example with data concerning a social network.

About Simon

Simon Musgrave was a lecturer in the Linguistics program at Monash University until the end of 2020. His research covers various areas in linguistics and sociolinguistics, linked by the themes of the use of computational tools in linguistic research and the relationship between Linguistics and Digital Humanities, an interest continued in his current work in the Linguistic Data Commons of Australia project.

August 12: Janet Wiles & Ben Foley

(Semi-)Automated speech recognition using ELPIS (preliminary title)

August 19: Mikko Laitinen

Establishing Computational Humanities Infrastructures in Finland (preliminary title)

August 26: Laura Janda

Strategic targeting of rich inflectional morphology for linguistic analysis and L2 acquisition

Note: This talk was originally scheduled for September 2.


Many languages have rich inflectional morphology signaling grammatical categories such as case, number, tense, etc. Rich morphology presents a challenge for L2 learners because even a basic vocabulary of a few thousand words can entail mastery of over 100,000 word forms. However, only a handful of the potential forms of a given word occur frequently, while the remainder are rare. Access to digital corpora makes it possible to determine which forms of any given word are of highest frequency, as well as what grammatical and collocational contexts motivate those few frequent forms, facilitating strategically focused language learning tools. Corpus analysis of the frequency distributions of inflectional forms provide linguists with added insights into the function of languages. The results achieved primarily by using correspondence analysis of Russian material are potentially portable to any language with rich inflectional morphology.

About Laura

In the Cold War era, Laura Janda combined study of Slavic linguistics at Princeton and UCLA with US-government-funded adventures as an exchange student behind the Iron Curtain in countries that have since changed their names: USSR, Czechoslovakia, and Yugoslavia. After over two decades at the University of Rochester and UNC-Chapel Hill, she moved to the University of Tromsø in 2008. Laura Janda was an early adopter of Cognitive Linguistics in the 1980s and has explored quantitative methods since 2007. Her research focuses primarily on the morphology of Slavic languages, with various admixtures (North Saami, conlangs, political discourse).

September 2: Peter Crosthwaite

Data-driven learning for younger learners: Boosting schoolgirls’ knowledge of passive voice constructions for STEM education


This paper explores how corpus technology and DDL pedagogy can support secondary schoolgirls’ reporting of an observed science experiment through a written research report, focusing particularly on how corpora were used to develop receptive and productive knowledge of passive voice constructions. A pre-test of the grammaticality of passive constructions was conducted, alongside a diagnostic pre-instruction written report requiring the retelling of an observed science experiment were collected from 60 Year 9-10 girls at a high school in Australia. During a full 10-week term, students were given guided individual homework tasks and short in-class pair/group DDL activities focusing on passive voice constructions, using freely available online corpus applications such as SketchEngine. Following this treatment, a post-test was conducted while an additional written research report was collected. Questionnaire and interview data was also collected to determine the perceptions of younger female learners and their teachers regarding their engagement with corpora and DDL for improving knowledge and use of passive constructions over time. The data suggest the DDL treatment resulted in increased knowledge of the grammaticality of passive voice constructions, while students were more likely to include such constructions in their written reports while increasing the accuracy of their use. Qualitative stakeholder perceptions of improved disciplinary linguistic knowledge, increased data management skills, and positive engagement with “science” were also found in the survey/interview data, although a number of challenges at the technical and conceptual levels for DDL still remain.

About Peter

Peter is Senior Lecturer in the School of Languages and Cultures at UQ (since 2017), having formerly been an assistant professor at the Centre for Applied English Studies (CAES), University of Hong Kong (since 2014). His areas of research and supervisory expertise include corpus linguistics and the use of corpora for language learning (known as data-driven learning), as well as English for General and Specific Academic Purposes. I am the author of the monograph Learning the language of Dentistry: Disciplinary corpora in the teaching of English for specific academic purposes as part of Benjamins’ Studies in Corpus Linguistics series (with Lisa Cheung, published 2019), as well as the edited volumes Data Driven Learning for the Next Generation: Corpora and DDL for Pre-tertiary Learners (published 2019) and Referring in a second language: Reference to person in a multilingual world (with Jonathon Ryan, published 2020) with Routledge. I am also currently serving as the corpus linguistics section editor for Open Linguistics (ISI-ESCI) - an open access linguistics journal from De Gruyter, as well as on the editorial board of Applied Corpus Linguistics, a new journal covering the direct applications of corpora to teaching and learning.

September 13: Gregor Wiedemann

Text classification for automatic detection of hate speech, counter speech, and protest events


Social sciences have opened up to text mining, i.e., a set of methods to automatically identify semantic structures in large document collections. However, the methods have often been limited to a statistical analysis of textual data, strongly limiting the scope of possible research questions. The more complex concepts central to the social sciences such as arguments, frames, narratives and claims still are mainly studied using manual content analyses in which the knowledge needed to apply a category (i.e. to “code”) is verbally described in a codebook and implicit in the coder’s own background knowledge. Supervised machine learning provides an approach to scale-up this coding process to large datasets. Recent advantages in neural network-based natural language processing allow for pretraining language models that can transfer semantic knowledge from unsupervised text collections to specific automatic coding problems. With deep learning models such as BERT automatic coding of context-sensitive semantics with substantially lowered efforts in training data generation comes within reach to content analysis. The talk will introduce to the applied usage of these technologies along with two interdisciplinary research projects studying hate speech and counter speech in German Facebook postings, and information extraction for the analysis of the coverage of protest events in local news media.

About Gregor

Dr. Gregor Wiedemann is working as Senior Researcher Computational Social Science at the Leibniz Institute for Media Research │ Hans Bredow Institute (HBI). Since September 2020, he heads the Media Research Methods Lab (MRML). His current work focuses on the development of methods and applications of natural language processing and text mining for empirical social and media research. Gregor Wiedemann studied political science and computer science in Leipzig and Miami, USA. In 2016 he received his doctorate from the Department of Computer Science at the University of Leipzig for his thesis on automation of discourse and content analysis using text mining and machine learning methods. Afterwards he worked as a postdoc in the NLP group of Computer Science Department at the University of Hamburg. Among other things, the resulting works are concerned with unsupervised information extraction to support investigative research in unknown document collections (see and with the detection of hate and counter-speech in social media.

September 21: Terttu Nevalainen, Turo Hiltunen & Aatu Liimatta

Case studies from VARIENG (preliminary title)

September 27: Laurence Anthony

AntConc 4.0 (preliminary title)

About Laurence

Laurence Anthony is Professor of Applied Linguistics at the Faculty of Science and Engineering, Waseda University, Japan. He has a BSc degree (Mathematical Physics) from the University of Manchester, UK, and MA (TESL/TEFL) and PhD (Applied Linguistics) degrees from the University of Birmingham, UK. He is the current Director of the Center for English Language Education in Science and Engineering (CELESE), which runs discipline-specific language courses for the 10,000 students of the faculty. His main research interests are in corpus linguistics, educational technology, and English for Specific Purposes (ESP) program design and teaching methodologies. He received the National Prize of the Japan Association for English Corpus Studies (JAECS) in 2012 for his work in corpus software tools design.

October 2: Guillaume Desagulier

Doing diachronic linguistics with distributional semantic models in R

October 15: Tanja Säily

Analysing 18th-century publications and publishing networks (preliminary title)

About Tanja

Tanja Säily is a tenure-track assistant professor in English language at the University of Helsinki. Her research interests include corpus linguistics, digital humanities, historical sociolinguistics, and linguistic productivity. She is also interested in the social embedding of language variation and change in general, including gendered styles in the history of English and extralinguistic factors influencing language change. Her overarching aim is to develop new ways of understanding language variation and change, often in collaboration with experts from other fields. Her current project combines historical sociolinguistics, intellectual history, book history and data science to analyse eighteenth-century publications and publishing networks.

October 18: Stéphane Guillou

Git and GitHub/Gitlab for versioning and collaborating


Git is a tool for versioning of and collaborating on any text-based file. Widely used in software development, it is now adopted in many different settings, from document versioning to data analysis management. It is also at the centre of major platforms like GitHub and GitLab, used by millions to share and collaborate on code and documents. In this workshop, you will learn about:

  • The main commands used in a git workflow

  • How to publish your work online

  • How to collaborate on a GitHub repository

If you would like to follow along, please do the following before attending:

About Stéphane

Stéphane has worked for the last 10 years at the University of Queensland (UQ). After completing a master’s degree in plants science and ecology in France, he worked in research around the topic of sustainable agriculture. In 2018, a drastic move to a Technology Trainer position at the Library allowed him to share data analysis best practice skills, and promote Open Source tools for research. He is motivated by the principles of Open Science and the opportunities an increasingly collaborative research ecosystem offers.

October 29:Joseph Flanagan

How to Make your Research Reproducible


RMarkdown (along with Jupyter Notebooks) is a unified authoring framework that allows one to combine code, figures, and text into a single document. It is increasingly becoming a default way for researchers in the computational sciences to share their work, as all the materials necessary to reproduce the analysis can be found in a single document.

This workshop will explore how RMarkdown can be used for writing academic papers (instead of tutorials or blog posts). It will assume no previous knowledge, other than a basic familiarity with R. During the workshop, participants will learn:

  1. The fundamentals of RMarkdown, including what it is, how it works, and its basic syntax
  2. Options for displaying, hiding, and executing code
  3. Options for including tables and graphs
  4. Options for including citations and cross-references

About Joe

Joseph Flanagan is a University Lecturer in Languages/English Studies at the University of Helsinki.


June 3: Stefan Gries

MuPADRF (Multifactorial Prediction and Deviation Analysis Using Regression/Random Forests)


The recording of Stefan’s talk is available here on the LADAL YouTube channel.


In this talk, Stefan gave a brief and relatively practical introduction to an approach called MuPDAR(F) (for Multifactorial Prediction and Deviation Analysis using Regressions/Random Forests) that he developed (see Gries and Deshors (2014), Gries and Adelman (2014) for the first applications). The main part of the talk involved using a version of the data in Gries and Adelman (2014) to exemplify how this protocol works and how it can be done in R. Second, Stefan discussed a few recent extensions proposed in Gries and Deshors (2020) and Gries (n.d.), which have to do with

  1. how to deal with situations with more than two linguistic choices,

  2. how predictions are made, and

  3. how deviations are quantified.

Finally, he briefly comment on exploring individual variation among the target speakers (based on Gries and Wulff (2021)).

About Stefan

Stefan Th. Gries is full professor at the University of California, Santa Barbara (UCSB), as well as Honorary Liebig-Professor and Chair of English Linguistics at the Justus-Liebig-Universität Giessen. Stefan has held several prestigious visiting professorships at top universities and, methodologically, he is a quantitative corpus linguist at the intersection of corpus linguistics, cognitive linguistics, and computational linguistics. Stefan has applied a variety of different statistical methods to investigate a wide range of linguistic topics and much of his work involves the open-source software R. Stefan has produced more than 200 publications (articles, chapters, books, and edited volumes), he is an active member of various editorial boards as well as academic societies.

June 10: Martin Schweinberger & Michael Haugh

The Australian Text Analytics Platform (ATAP) and the Language Technology and Data Analysis Laboratory (LADAL) - building computational humanities infrastructures: experiences, problems, and potentials


This talk introduces the Language Technology and Data Analysis Laboratory (LADAL) which is a computational humanities resource infrastructure maintained by the School of Languages and Cultures at the University of Queensland. The talk will also provide information about its relations to the Australian Text Analytics Platform (ATAP) which represents an effort to promote text analytics in Australia and to make resources for using text analytics available to a wider community of researchers.

About Martin

Martin is a language data scientist with a PhD in English linguistics who has specialized in corpus linguistics and quantitative, computational analyses of language data. Martin is a Lecturer in Applied Linguistics at the University of Queensland, Australia where he has been establishing the Language Technology and Data Analysis Laboratory (LADAL) and he holds an additional part-time Associate Professorship in the AcqVA-Aurora Center at the Arctic University of Norway in Tromsø.

About Michael

Michael is Professor of Linguistics and a Fellow of the Australian Academy of the Humanities. His research interests lie primarily in the field of pragmatics, the science of language-in-use. He works with recordings and transcriptions of naturally occurring spoken interactions, as well as data from digitally-mediated forms of communication across a number of languages. An area of emerging importance in his view is the role that language corpora can play in the humanities and social sciences more broadly. He has been involved in the establishment of the Australian National Corpus and the Language Technology and Data Analytics Lab, and is currently leading the establishment of a national language data commons.

June 18: Felicity Meakins

Field-based methods for collecting quantitative data


Shana Poplack has set benchmarks for the development of corpora since the early 1980s. Poplack (2015, p. 921) maintains that the “gold standard remains the (standard sociolinguistic-style) … corpus”. The aim of producing corpora using these principles is to avoid the ‘cherry picking’ approach which dominates much of the theoretical literature. Poplack and her team have created the Ottawa-Hull Corpus which consists of 3.5 million words of informal speech data. This corpus is enormous and beyond the capabilities of a single linguist in a small language community. This talk offers suggestions for corpus development in the field that follow Poplack’s principles, but also shows where compromises can be made. I discuss the method developed during the Gurindji Kriol project called ‘peer elicitation’. It supplements Poplack’s gold standard of naturally occurring speech with semi-formal elicitation to ensure sufficient data for quantitative analyses.

About Felicity

Felicity Meakins is an ARC Future Fellow in Linguistics at the University of Queensland and a CI in the ARC Centre of Excellence for the Dynamics of Language. She is a field linguist who specialises in the documentation of Australian Indigenous languages in the Victoria River District of the Northern Territory and the effect of English on Indigenous languages. She has worked as a community linguist as well as an academic over the past 20 years, facilitating language revitalisation programs, consulting on Native Title claims and conducting research into Indigenous languages. She has compiled a number of dictionaries and grammars of traditional Indigenous languages and has written numerous papers on language change in Australia.

June 24: Gerold Schneider

Text Crunching Center (TCC): Data-driven methods for linguists, social science and digital humanities


This talk introduces the Text Crunching Centre (TCC) which is a Computational Linguistics and Digital Humanities service hosted at the University of Zurich, and a collaboration partner of LADAL. We present a selection of our case studies using text analytics, from cognitive linguistics, social, political and historical studies. We show how stylistics, document classification, topic modelling, conceptual maps, distributional semantics and eye-tracking can offer new perspectives. Our case studies include language and age, learner language, the history of medicine, democratisation, religion, and attitudes to migration. We conclude with an outlook to the future of text analytics.

About Gerold

Gerold Schneider is a Senior Lecturer, researcher and computing scientist at the department of Computational Linguistics at the University of Zurich, Switzerland. His doctoral degree is on large-scale dependency parsing, his habilitation on using computational models for corpus linguistics. His research interests include corpus linguistics, statistical approaches, Digital Humanities, text mining and language modeling. He has published over 100 articles on these topics. He has published a book on statistics for linguists (Schneider and Lauber 2019), and a book on digital humanities is under way. His Google scholar page can be accessed here.

July 1: Monika Bednarek

Corpus-based media linguistics: A case study of linguistic diversity in Australian television


In this golden age of Indigenous television (Sebbens, n.d.), it is important to analyze Indigenous-authored drama series, so that we can move beyond ‘a deficit perspective’ (Charity Hudley, Mallison, and Bucholtz 2020, 216) in relation to mediated linguistic diversity. This talk presents a corpus linguistic case study of Australian Aboriginal English (AAE) lexis as present in three such Indigenous-authored television series: Redfern Now, Cleverman, and Mystery Road. For television viewers, mediated AAE can be an important source of information, especially if they do not regularly interact with Aboriginal and/or Torres Strait Islander people. This would be the case for many Australians, and even more so for international viewers of Australian television series. All three analyzed series were exported overseas and thus have both Australian and international audiences. Using lexical profiling analysis (AntWordProfiler; Anthony (2013)) in combination with qualitative concordance analysis, the talk identifies and compares the use of AAE lexis across the three series. Analysis of frequency and distribution will pinpoint words that appear to be particularly significant lexical resources in mediated AAE. The talk will be framed through the notion of diversity, as conceptualised in relation to television series.

About Monika

Monika is Professor of Linguistics at the University of Sydney and Director of the Sydney Corpus Lab. Her research uses corpus linguistic methodologies across a variety of fields, including media linguistics, discourse analysis and sociolinguistics. She has a particular interest in the linguistic expression of emotion and opinion, with a focus on English. Monika is the author or co-author of six books and two short volumes as well as numerous journal articles and book chapters. She has co-edited several edited volumes and special issues of journals, most recently Corpus approaches to telecinematic language (International Journal of Corpus Linguistics 26/1, 2021) and Corpus linguistics and Education in Australia (Australian Review of Applied Linguistics 43/2, 2020). She is on the steering committee of the Asia Pacific Corpus Linguistics Association and tweets @corpusling.

July 8: Natalia Levshina

Recycle, relax, repeat: Advantages of Bayesian inference in comparison with frequentist methods


The slides for Natalia’s talk are available here. An Open Science Foundation (OSF) repository containing the R code and data used for the case study in this talk are available here. Here is a link to the GitHub repo of Shravan Vasishth (Professor for Psycholinguisics at the University of Potsdam) with additional resources on Bayesian statistics.


Bayesian inference is becoming increasingly popular in linguistic research. In this talk I will compare frequentist (maximum likelihood) and Bayesian approaches to generalized linear mixed-effects regression, which is de facto the standard method for testing linguistic hypotheses about linguistic variation. The main advantages of Bayesian inference include an opportunity to test the research hypothesis directly, instead of trying to reject the null hypothesis. One can also use information from previous research as priors for subsequent models, which helps to overcome the recent crisis of reproducibility. This also enables one to use smaller samples. It helps to solve such problems as overfitting, data separation and convergence issues, which often arise when one fits generalized mixed-effect models with complex structure. These advantages will be illustrated by a multifactorial case study of help + (to-)infinitive in US magazines, as in the example These simple tips will help you (to) survive the Zombie apocalypse.

About Natalia

Natalia is a linguist working at the Max Planck Institute for Psycholinguistics in Nijmegen. Her main research interests are cognitive and functional linguistics, pragmatics, typology, corpora and data science. She obtained her PhD at KU Leuven in 2011 and got her habilitation qualification at Leipzig University in 2019 with a thesis Towards a theory of communicative efficiency. In addition to papers on causatives, differential case marking, politeness, word order variation and other linguistic topics, Natalia is the author of a best-selling statistical manual How to Do Linguistics with R (Levshina 2015).

July 15: Myrte Vos

Gathering data and creating online experiments with jsPsych and JATOS

Resources: The materials, including scripts, files, etc., can be accessed via and downloaded from this GitHub repository.


Web-based studies are an increasingly popular and attractive alternative to lab- and field-based studies, enabling remote data collection for a rapidly growing variety of (psycho-)linguistic methods ranging from acceptability judgment tasks and self-paced reading to interactive group designs and the Visual World paradigm. Most platforms for building and hosting online studies are proprietary and subscription-based (e.g., Gorilla, Pavlovia, and FindingFive), but there also exist various free, open-source tools for writing and managing studies on your own server. This talk gives a practical introduction to building and hosting studies using jsPsych (De Leeuw 2015) and JATOS (Lange, Kühn, and Filevich 2015), by demonstrating a speedrun of the entire process: from an empty file in a code editor, to distributing URLs to participants.

About Myrte

Myrte Vos (she/they) is a doctoral research fellow at the Arctic University of Norway (Tromsø). They’re supposed to be studying incremental processing of aspect and modality in English, but got sidetracked by figuring out how to do that using webcam-based eye tracking.

July 22: Amanda Miotto & Julie Toohey

Going down the Reproducible Research pathway: You have to begin somewhere, right?



The idea that you can duplicate an experiment and get the same conclusion is the basis for all scientific discoveries. Reproducible research is data analysis that starts with the raw data and offers a transparent workflow to arrive at the same results and conclusions. However not all studies are replicable due to lack of information on the process. Therefore, reproducibility in research is extremely important.
Researchers genuinely want to make their research more reproducible, but sometimes don’t know where to start and often don’t have the available time to investigate or establish methods on how reproducible research can speed up every day work. We aim for the philosophy “Be better than you were yesterday”. Reproducibility is a process, and we highlight there is no expectation to go from beginner to expert in a single workshop. Instead, we offer some steps you can take towards the reproducibility path following our Steps to Reproducible Research self paced program.

About Amanda

Amanda Miotto is an eResearch Analyst for Griffith University and QCIF. She started off in the field of Bioinformatics and learnt to appreciate the beauty of science before discovering the joys of coding. She is also heavily involved in Software Carpentry, Hacky Hours and ResBaz, and has developed on platforms around HPC and scientific portals.

About Julie

Julie Toohey is a Library Research Data Management Specialist at Griffith University Library. Julie has an extensive career in academic libraries and is passionate about research data management practices. Previously, Julie co-facilitated the Australian National Data Services 23 Things (research data) Health and Medical Data Community series and is currently a member of the QULOC Research Support Working Party. Julie works closely with Griffith eResearch Services delivering education awareness programs around managing research data, reproducible research and working with sensitive data. Julie has co-authored several research data related publications with Griffith researchers and eResearch Services partners.

Back to top

Back to HOME


Anthony, Laurence. 2013. “AntWordProfiler (Version 1.4.0).” Tokyo, Japan: Waseda University.

Bialystok, Ellen. 2017. “The Bilingual Adaptation: How Minds Accommodate Experience.” Psychological Bulletin 3 (143): 233–62.

Charity Hudley, Anne H., Christine Mallison, and Mary Bucholtz. 2020. “Toward Racial Justice in Linguistics: Interdisciplinary Insights into Theorizing Race in the Discipline and Diversifying the Profession.” Language 96 (4): 200–235.

De Leeuw, Joshua R. 2015. “Jspsych: A Javascript Library for Creating Behavioral Experiments in a Webbrowser.” Behavior Research Methods 47 (1): 1–12.

DeLuca, Vincent, Jason Rothman, Ellen Bialystok, and Christos Pliatsikas. 2019. “Redefining Bilingualism as a Spectrum of Experiences That Differentially Affects Brain Structure and Function.” Proceedings of the National Academy of Sciences 116 (15): 7565–74.

———. 2020. “Duration and Extent of Bilingual Experience Modulate Neurocognitive Outcomes.” NeuroImage 204: 116222.

DeLuca, Vincent, Jason Rothman, and Christos Pliatsikas. 2019. “Linguistic Immersion and Structural Effects on the Bilingual Brain: A Longitudinal Study.” Bilingualism: Language and Cognition 22 (5): 1160–75.

Gries, Stefan Th. n.d. “MuPDAR for Corpus-Based Learner and Variety Studies: Two (More) Suggestions for Improvement.” In Tba, edited by Martin Hilpert and Susanne Flach, ???–??? ? ?

Gries, Stefan Th, and Allison S Adelman. 2014. “Subject Realization in Japanese Conversation by Native and Non-Native Speakers: Exemplifying a New Paradigm for Learner Corpus Research.” In Yearbook of Corpus Linguistics and Pragmatics 2014, 35–54. Springer.

Gries, Stefan Th, and Sandra C Deshors. 2014. “Using Regressions to Explore Deviations Between Corpus Data and a Standard/Target: Two Suggestions.” Corpora 9 (1): 109–36.

———. 2020. “There’s More to Alternations Than the Main Diagonal of a 2×2 Confusion Matrix: Improvements of Mupdar and Other Classificatory Alternation Studies.” ICAME Journal 44: 69–96.

Gries, Stefan Th, and Stefanie Wulff. 2021. “Examining Individual Variation in Learner Production Data: A Few Programmatic Pointers for Corpus-Based Analyses Using the Example of Adverbial Clause Ordering.” Applied Psycholinguistics 42 (2): 279–99.

Lange, Kristian, Simone Kühn, and Elisa Filevich. 2015. “Just Another Tool for Online Studies (Jatos): An Easy Solution for Setup and Management of Web Servers Supporting Online Studies.” PloS One 10 (6): e0130834.

Leivada, Evelina, Marit Westergaard, Jon Andoni Duñabeitia, and Jason Rothman. 2020. “On the Phantom-Like Appearance of Bilingualism Effects on Cognition: (How) Should We Proceed?” Bilingualism: Language and Cognition 24 (1): 1–14.

Levshina, Natalia. 2015. How to Do Linguistics with R: Data Exploration and Statistical Analysis. Amsterdam: John Benjamins Publishing Company.

Pliatsikas, Christos. 2019. “Understanding Structural Plasticity in the Bilingual Brain: The Dynamic Restructuring Model.” Bilingualism: Language and Cognition, no. 2: 1–13.

Schneider, Gerold, and Max Lauber. 2019. “Introduction to Statistics for Linguists.” Pressbooks.

Sebbens, Shari. n.d. “The Golden Age of Indigenous Television Is Here – and It’s Changed Australia Forever.” The Guardian (Australia edition).

Voits, Toms, Rothman Robson H., and Christos Pliatsikas. n.d. “The Effects of Bilingualism on the Structure of the Hippocampus and on Memory Performance in Ageing Bilinguals.”

Voits, Toms, Jason Rothman, Calabria Robson H., and Christos Pliatsikas. n.d. “Bilingualism-Related Neural Adaptations in Mild Cognitive Impairment Patients Are Modulated by Language Experiences.”