This project will create both a master’s and doctoral-level specialization in Socio-technical Data Analytics (SODA). Partnerships with local researchers and businesses who already work with large data-sets will enable master's graduates to receive first-hand experience with both the social and technical implications of large digital data collections, and thus be well-prepared for leadership roles in academic and corporate environments. Similarly, doctoral students will consider multiple stages of the information lifecycle, which will help to ensure that their research findings will generalize to a range of scholarly and business practices. Case studies from these partners will be incorporated into new courses that will initially be held on campus and will later be evolved to the School...
RESEARCHERS WORKING IN THIS AREA
RELATED RESEARCH PROJECTS
Institute of Museum and Library Services
University of Illinois Extension
The Illinois Digital Innovation Leadership Program will increase opportunities for entrepreneurship, economic development, and innovation through the expansion of digital manufacturing, digital media production, and data analytics. Supported by the University of Illinois Extension, the project will engage Illinoisans with mobile digital design and innovation labs, or “DigiTech Hubs,” which will serve as high-tech inventor workshops equipped with tools for everything from audio production to 3D printing. Digital Innovation Leadership staff will work with 4-H clubs, public libraries, and public schools to develop permanent community-based and -supported studios, creating a network that will build statewide capacity in digital...
Korea Institute of Science and Technology Information
How do limitations and intransparencies in data quality and data provenance bias research outcomes, and how can we detect and mitigate these limitations? For example, we have been investigating the impact of entity resolution errors on network analysis results. We found that commonly reported network metrics and derived implications can strongly deviate from the truth—as established based on gold standard data or approximations thereof—depending on the efforts dedicated to entity resolution.
IN THE NEWS
The iSchool is pleased to announce that Karen Wickett (MS '07, PhD '12) will join the faculty in August 2018. She is currently an assistant professor in the School of Information at the University of Texas at Austin.
Wickett's research areas include the conceptual and logical foundations of information organization systems and artifacts. She is most interested in the analysis of common concepts in information systems, such as documents, datasets, digital objects, metadata records, and collections. Her work recognizes the pressing need for logically consistent definitions and descriptions in digital environments. This is especially important as semantic technologies (such as RDF and linked data services) become more commonplace for digital library and curation systems.
Before her faculty...
If you love to talk about data management, data curation, and data analysis, we'd love to chat with you at Data & Drinks, the second summer professional networking event from the iSchool at Illinois. We aim to provide a space for central Illinois residents and visitors who work with data to meet colleagues in the field and have productive conversations about our challenges, ideas, and projects.
The iSchool will provide tasty snacks and the venue will have a cash bar for your convenience. Please register to attend in advance of the event.
Assistant Professor Peter Darch and Research Associate Professor David Dubin participated in the Research Data Alliance (RDA) 9th Plenary Meeting, which was held April 5-7 in Barcelona, Spain.
The RDA was launched in 2013 by the European Commission, the United States Government's National Science Foundation and National Institute of Standards and Technology, and the Australian Government’s Department of Innovation with the goal of building the social and technical infrastructure necessary to enable open sharing of data. The RDA community includes more than 5,400 members from 123 countries.
At the plenary meeting, Darch presented his poster, "How Do Researchers Trust Data in New and Emerging Scientific Domains?"
As co-chair of the Research Data Provenance Interest Group, Dubin led the kickoff session for a proposed working group on provenance patterns. The Research Data Provenance Interest Group is concerned with questions of data origins, maintenance of...
iSchool staff and students will participate in the 12th International Digital Curation Conference (IDCC), which will be held on February 20-23 in Edinburgh, Scotland. IDCC is organized annually by the UK-based Digital Curation Centre and provides opportunities for educators and professionals to consider digital curation in a multidisciplinary context. The theme of this year's conference is "Upstream, Downstream: embedding digital curation workflows for data science, scholarship and society."
iSchool presentations include:
"When Scientists Become Social Scientists: How Citizen Science Projects Learn About Volunteers," a paper authored by iSchool Assistant Professor Peter Darch.
"Revealing the Detailed Lineage of Script Outputs using Hybrid Provenance," a paper authored by iSchool postdoctoral research associates Qian Zhang and Yang Cao and Professor Bertram Ludäscher, director of...
Where in the world is Carmen Sandiego? Children playing this educational video game on their school's computer in the 1990s got an entertaining geography lesson while in hot pursuit of Carmen and her villains. Preserving a video game such as this for future generations to study and appreciate involves challenges beyond the obvious fact that computers no longer support the software needed to play the game. In "Where Does Significance Lie: Locating the Significant Properties of Video Games in Preserving Virtual Worlds II Data," Rhiannon Bettivia, a postdoctoral research associate at the iSchool, examines some of the difficulties inherent in video game preservation and comes to the...
Professor Bertram Ludäscher will present the international tutorial at the thirty-first Brazilian Symposium on Databases (SBBD2016) in Salvador-Bahia on October 4-7. SBBD, an official event of the Brazilian Computer Society, is the largest venue in Latin America for presenting and discussing research results in the database domain. The symposium brings together researchers, students, and practitioners from Brazil and abroad for technical sessions, invited talks, and tutorials given by distinguished speakers from the international research community. Ludäscher’s tutorial is titled "Provenance in Databases and Scientific Workflows."
Abstract: In computer science, data provenance describes the lineage and processing history of data as it is transformed through queries or workflows. Different computer science sub-disciplines have studied approaches to capture and exploit provenance, e.g., the systems and programming...
Provenance information describes the origin and history of artifacts. Because of the vital role played by data and workflow provenance in support of transparency and reproducibility in computational and data science, creating tools for capturing and using provenance information is an important yet challenging task.
Post-doctoral Research Associate Yang Cao and Professor Bertram Ludäscher recently presented joint work on data provenance at the Data Observation Network for Earth (DataONE) All Hands Meeting in Santa Ana Pueblo, New Mexico. In their poster and system demonstration, jointly authored by a team of University of Illinois students and staff as well as collaborators from the UK, Cao and Ludäscher demonstrated how the YesWorkflow tool is "Revealing the Detailed History of Script Outputs with Hybrid Provenance Queries."1
In an earlier article for the Winter 2015/6 issue of DataONE News, "Your Data has a History,...