Welcome to the website of the Language Technology and Data Analysis Laboratory (LADAL). LADAL (pronuced lah’DULL) is a school-based, collaborative support infrastructure for digital humanities researchers established and maintained by the School of Languages and Cultures at the University of Queensland. LADAL assists with data processing, visualization, and analysis and offers guidance on matters relating to language technology and digital research tools.


LADAL aims to help develop computational and digital skills by providing information and practical, hands-on tutorials on data and text analytics as well as on statistical methods relevant for language research. In addition, the LADAL provides self-guided study materials relevant for computational Natural Language Processing. In order to be attractive to both beginners and people with advanced skills, the LADAL website covers topics and introduces methods relevant for people coming with different degrees of prior knowledge and experience - ranging from introductions to concepts of quantitative reasoning to step-by-step guides on advanced statistical modeling.

Since the primary concern of LADAL is to introduce computational methods that are relevant to research involving natural language, the focus of this website is placed on linguistic data and methods relevant for text analytics. As such, LADAL provides resources for (computational) text analytics and offers introductions to quantitative reasoning, research designs, and computational methods including data visualization and statistics. The areas covered on the LADAL website are

  • introductions to quantitative reasoning and basic concepts in empirical language studies.

  • introductions to R as programming environment for processing natural language data.

  • tutorials on data visualization and data analytics (statistics and machine learning).

  • tutorials on text analysis, text mining, distant reading, and corpus linguistics.


The LADAL resources are aimed at researchers in HASS (Humanities, Arts, and the Social Sciences) and we aspire to attract complete novices as well as expert users. And, while the focus of the LADAL website is placed on handling data that represents natural language, anyone who has an interest in quantitative methods, data visualization, statistics, or R is welcome to explore this webpage.


LADAL uses the programming language R because R is extremely flexible, relatively easy to learn, free and open source, and R has a substantive and very friendly user community. R is not merely a software package but a fully-fledged programming environment which allows complex Natural Language Processing, statistics and data visualizations and it can also be used to create websites or apps, and has direct pipelines for version control (Git). This website as well as the self-guided study materials offered by the LADAL use are written in R-markdown - a way to combine R-code with text. The flexibility of R makes it a sensible choice for researchers that strive for high quality and extreme flexibility while following best practices that enable complete replicability and full transparency.

As computation is becoming ever more prevalent across disciplines as well as in both the social and economic domains, LADAL offers a resource space for R that make it accessible to lay users as well as expert programmers. That said, we hope to expand the resources provided by LADAL to other tools and environments (e.g. Python) in the future.

Licensing & Citation

The LADAL website was created by Martin Schweinberger. It was freely released under GNU General Public License, Version 3, June 2007. If you use (parts of) it for your own research or in your teaching materials, please cite the individual subpages as shown at the bottom of each page or reference it as:

Schweinberger, Martin. 2020. The Language Technology and Data Analysis Laboratory (LADAL). Brisbane: The University of Queensland, School of Languages and Cultures. url: https://slcladal.github.io/index.html (Version 2020/09/24).

  author = {Schweinberger, Martin},
  title = {The Language Technology and Data Analysis Laboratory (LADAL)},
  note = {https://slcladal.github.io/index.html},
  year = {2020},
  organization = {The University of Queensland, School of Languages and Cultures},
  address = {Brisbane},
  edition = {2020/09/24}


The content of this website is free and comes with ABSOLUTELY NO WARRANTY. You are welcome to redistribute the content of LADAL resources given you adhere to the licensing. The content of this website is distributed under the terms of the GNU General Public License, Version 3, June 2007.

Share and Enjoy!

Back to top