As part of the Jisc business intelligence project lots of clever folk from different HE Institutions are getting together, looking at data they know about and working with HESA to augment it with data HESA collects. The result is the creation of all kinds of interesting businesses intelligence dashboards and reports; many of which will be available through the replacement to the Heidi system- Heidi Plus. For the project Cetis is offering ‘data wrangling’ support, which involves giving a hand getting data in a format they the analysts can use.
One the things we’ve found that we get asked for a lot is simply CSV’s of universities that puts them in groups or ranking for easy comparison. Well known groups such such as mission groups, Jisc bands, Regional memberships, REF/RAE/REF Power scores etc etc. This data isn’t hard to get hold of, but requires the analysts to spend some time creating them manually, followed by lots more time making sure the institutions have a common identifier between data sets.
Cetis have a growing resource of simple data sets the analysts have found useful on Github, there are a few more that need to be cleaned up and added, but basically these consist of institutions, the grouping they belong to and the UKPRN.
This is a growing list as the project continues, but since we get asked a quite oftern for them it seemed worth sharing now. Here are the resources:
- Data Wrangling Repo – Home for various scripts and the data sets that are commonly requested by the analysts
- Cetis on BI CKAN – List of the common data sets we have hosted. Growing but currently includes UKPRN, name and band for: