Philip Blair
Philip is the director of Blair Software, an Amsterdam-based AI consultancy specializing in NLP software. Originally from the United States, he spent nearly a decade doing applied research and development on NLP systems, and conducts a mixture of AI software development, AI corporate advisory work, and advising software companies of the impacts of transatlantic AI regulation for companies in Europe and the United States.
Sessions
Data scientists and app developers today must deal with data coming from different regions around the world. Whether handling signup form data, scraping news articles, or building LLM pipelines, one of the most common types of unstructured text data seen today are names of people and addresses; however, conventions for how these are written are completely different depending on their country of origin. In this talk, intended for data scientists and non-technical stakeholders alike, I will provide an introduction to what these types of data can look like, a number of misconceptions about what they do or don't contain, and some examples for how to work with them.