Colloquium – Democratising Natural Language Processing: Overcoming Language and Domain Barriers in Low-Resource Environments, 11.1.2024

Democratising Natural Language Processing: Overcoming Language and Domain Barriers in Low-Resource Environments

Dr. Yftah Ziser, University of Edinburgh

Natural language processing (NLP) has been revolutionised in recent years to the point where it is an inseparable part of our daily lives. The transition to transformer-based models allows us to train models on vast amounts of text efficiently, proving that scale plays a crucial role in improving performance. Unfortunately, many people worldwide are marginalised from getting access to high-quality NLP models, as the language they speak and the domains they are interested in count for only a tiny fraction of current state-of-the-art models’ training sets.

This talk will address the challenges, approaches, and opportunities for democratising NLP across different languages and domains by developing methods to improve NLP in low-resource scenarios. I will start by discussing how we can ease distribution mismatches to improve performance using representation learning. However, as NLP models become increasingly present in our lives, improving other crucial aspects beyond their performance, such as their fairness, factuality, and our ability to understand their underlying mechanisms, is essential. Therefore, I will also discuss using spectral methods to remove information from neural networks to reduce undesired attributes, such as bias, to increase fairness where sensitive data is scarce. Finally, I will explore future directions for making these models accessible toa broader audience by improving the aspects mentioned above in low-resource scenarios.

Dr. Yftah Ziser

Yftah Ziser is a Postdoctoral Researcher at the School of Informatics at Edinburgh University, hosted by Shay Cohen. He focuses on Deep-Learning methods for dealing with the resource bottleneck, which seriously challenges the worldwide accessibility of NLP technology. His research develops methods to improve low-resource models’ performance, fairness, and factuality while developing analysis methods for deepening our understanding of them. He co-organized the Domain Adaptation for NLP Workshop at EACL 2021.

Before joining the University of Edinburgh, Yftah worked as a research scientist at Amazon Alexa. Yftah obtained his PhD from the Technion, where he was advised by Roi Reichart.

Dr. Yftah Ziser

Before joining the University of Edinburgh, Yftah worked as a research scientist at Amazon Alexa. Yftah obtained his PhD from the Technion, where he was advised by Roi Reichart.

A Social computing platform for Biodiversity monitoring

Weaam Shaheen & Lior Koren

In this talk, we aim to explore the utilization of human computation and machine learning techniques to expedite and enhance the tedious tasks faced by ecologists in monitoring biodiversity. By working towards the integration of human and machine intelligence, our intermediate goal is to create a crowd-computing platform accessible for citizen scientists, allowing them to augment or even substitute the efforts of a single expert, thereby expediting the process considerably.

Should our system prove successful, it will enable ecologists to harness algorithms and crowdsourced assistance to produce accurate and prompt assessments of nature’s status. Such data will be crucial for conservation organizations and authorities in formulating effective measures. Additionally, this approach is expected to strengthen the connection between citizen scientists and nature, while raising awareness about the importance of wildlife sustainability.