PyData Amsterdam 2024

Marzieh Fadaee

Marzieh Fadaee is a senior research scientist at Cohere For AI, a non-profit research lab that seeks to solve complex machine learning problems and create more points of entry into machine learning research. Marzieh's work is broadly interested in all aspects of natural language understanding, particularly in multilingual learning, data-conscious learning, robust and scalable models, compositionality, and interpretability. Previously she was the NLP/ML research lead at Zeta Alpha Vector working on smarter ways to discover and organize knowledge. She did her PhD at University of Amsterdam, working on developing models to understand and utilize interesting phenomena in the data.

The speaker's profile picture

Sessions

09-20
16:30
50min
Keynote - The Art of Language: Mastering Multilingual Challenges in LLMs
Marzieh Fadaee

Multilingual Natural Language Processing (NLP) has played a pivotal role in the recent advancements of Large Language Models (LLMs). The ability to understand and generate text in multiple languages has expanded the capabilities of these models, making them more versatile and accessible to a global audience. In this talk we explore the current landscape of multilingual LLMs, addressing the challenges and opportunities that lie ahead. The discussion will cover critical topics such as the scarcity of multilingual datasets, the evaluation and benchmarking of multilingual models, and the unique safety considerations when dealing with diverse languages.

Additionally, the talk will highlight the challenges and gains of the global open science efforts, such as Aya and Global Exams, to build state of the art multilingual models and resources. Finally, we discuss the unexplored areas in multilingual NLP, providing insights into potential future research directions and the ongoing efforts to enhance the performance and applicability of LLMs.

Rembrandt