Andrew Janco (MS '19), digital scholarship librarian at Haverford College, will give the talk, "What Natural Language Processing Reveals in a Corpus of 400,000 Russian Diaries."
Starting with a short introduction to recent innovations in natural language processing (NLP), this talk outlines what new methods in applied NLP have revealed in the records of Prozhito, a crowd-sourcing project that has transcribed more than 3,000 Russian-language diaries. What patterns appear when studying the collection at scale? What new questions can we ask? What answers are does the collection bring to existing debates?
Janco has a passion for inquiry-driven and community-engaged digital projects. He is one of the lead developers working on a digital archive and research application for the Groupo de Apoyo Mutuo; Guatemala's oldest human rights organization. He also works on applied machine learning for Humanities and Social Science research. Janco completed his PhD at the University of Chicago and held post-docs at the University of Chicago's Pozen Family Center for Human Rights and the Human Rights Institute at the University of Connecticut.
This event is sponsored by the School of Information Sciences and the Russian, East European & Eurasian Center If you w