School of Information Sciences

Digital exhibit focuses on the evolution of Star Wars

Ben Ostermeier

For MS/LIS student Ben Ostermeier, the digital exhibit he curated for the U of I Rare Book and Manuscript Library (RBML), "Starkiller to Skywalker: How Star Wars Evolved from Script to Screen," was a labor of love. A Star Wars fan, Ostermeier spent ten months curating the exhibit, although background work on the exhibit actually started earlier, as a project for one of his MS/LIS courses.

Why did you pick Star Wars for your exhibit?

My work with Star Wars began in my Data Science in the Humanities course I took in Spring 2021 with Professor Ted Underwood. In that class, we analyzed how men and women are portrayed in movies using a data set of more than 600 films. For the course's final project, I decided to focus on the portrayal of gender in Star Wars films throughout their 40-year history, as I am a big fan of the franchise. To do this, I created a data set of the Star Wars original trilogy and Star Wars sequel trilogy. The following summer, I added the prequel trilogy as well.

For my RBML assistantship, I worked on projects to improve our digital exhibits platform and develop a web interface for the digital exhibit site. Since I knew we had the shooting script for Star Wars, and I had already done the work for my class, I decided to create a digital exhibit about the shooting script and use the dialogue data set I already had for the original film as a point of comparison with the script.

How did the library acquire the Star Wars script?

We don't actually know the precise provenance of the Star Wars script, but we know the library acquired it prior to 1982, as the script contains a note that it was transferred to the "Rare Book Room" (as RBML was known at the time) in June 1982. This means it somehow made its way to the University of Illinois Library between the script’s creation in 1976 and 1982.

How did you figure out the word counts for characters and deleted scenes?

For the data set I created for class, I had to go through and identify the speakers of each line of dialogue, which I typically did either by memory or by consulting the film itself or the published script. This was fairly straightforward for most of the dialogue, but Star Wars is rather notorious for giving very minor background characters not only names but extensive backstories. 

I digitized the script using our scanner in RBML and then performed optical character recognition on it in the Scholarly Commons, where I am also a graduate assistant. Then I transferred the text into a spreadsheet like the data set for the film itself. To determine word count for both characters and genders, I used the Python Data Analysis Library pandas to analyze the text of both the script and the film.

What was the most interesting fact that you discovered while working on the project?

I already kind of knew this, but the 2017 film Rogue One used archival footage of Red Leader and Gold Leader in its film. This footage was originally shot for Star Wars in the 1970s but ultimately cut from the film, and some of the dialogue for the brief scenes that are in Rogue One is in the shooting script—meaning that George Lucas wrote some dialogue in the 1970s that eventually made its way into a movie in 2017.

Updated on
Backto the news archive

Related News

Downie presents TORCHLITE in Germany

This week, Professor and Executive Associate Dean J. Stephen Downie was a guest speaker at the Herder Institute in Marburg and the University of Göttingen. Downie, who serves as co-director of the HathiTrust Research Center (HTRC), lectured on the HTRC's "Tools for Open Research and Computation with HathiTrust: Leveraging Intelligent Text Extraction" (TORCHLITE) project.

J. Stephen Downie

Internship Spotlight: San Francisco Public Library

PhD student Adebola Obayemi discusses her internship with the San Francisco Public Library, where she worked on Expanding Information Access for Incarcerated People Initiative. She has been invited to present her proposal on digital literacy for incarcerated populations at the Expanding Information Access for Incarcerated People Convening, which will be held in June in Chicago. 

Adebola Obayemi

Undergraduate Research Symposium features iSchool researchers

The iSchool is well represented in the 19th annual Undergraduate Research Symposium, which will be held on April 30 from 9:00 a.m.-5:00 p.m. in the Illini Union. The iSchool is a Gold Sponsor of the symposium, which spotlights undergraduate research through oral and poster presentations, creative performances, and art exhibits.

Vaez Afshar selected as 2026 APT Student Scholar

The Association for Preservation Technology (APT) International has named Informatics PhD student Sepehr Vaez Afshar as a 2026 Student Scholar. Established in 1985, the APT Student Scholarship annually recognizes ten students worldwide whose work advances preservation technology through innovative and impactful approaches.

Sepehr Vaez Afshar

Nguyen receives Critical Language Scholarship

MSLIS student Christine Nguyen has been awarded a U.S. Department of State Critical Language Scholarship (CLS) to study Japanese this summer. She is one of four University of Illinois Urbana-Champaign students who received full scholarships to spend 8-10 weeks abroad and study one of 14 critical languages. The program is part of an initiative to expand the number of Americans studying and mastering critical foreign languages and cultural skills to enable them to contribute to U.S. economic competitiveness and national security.

Christine Thuy Minh Nguyen

School of Information Sciences

501 E. Daniel St.

MC-493

Champaign, IL

61820-6211

Voice: (217) 333-3280

Email: ischool@illinois.edu

Back to top