Skip to Main Content
Research Guides@Tufts

New Scores at Lilly (2020-2021): About the Data & Visualizations


Data Sources

Lilly Music Library collections data was exported via the analytics reporting process in Alma. MMS-IDs were used to pull and match Primo links with individual titles using a Python script. The data was cleaned using OpenRefine.

Data used for this analysis is available in this GitHub repository:

Composer data

Demographic details about each composer were identified and added to the collection data, because this type of data is not available in library catalog records. Racial/ethnicity, gender-identity, and nationality data was reviewed from a variety of sources, including:

  • Institute for Composer Diversity database
  • DBpedia
  • Wikipedia
  • Library of Congress Linked Data Service
  • VIAF
  • Grove Music Online (Oxford)
  • Individual composer websites
  • Publisher websites


The tree map was created using Tableau software. The timeline was generated using RawGraphs.


Please contact Anna E. Kijas (Head, Lilly Music Library), anna.kijas at, with questions or feedback about the items in the collection, data sources, or how the visualization was created.