UT Data Hub

The UT Data Hub

The Data Hub is UT Austin's next-generation data ecosystem, based on industry-leading cloud architecture and designed to meet the university's growing need for data and analytics capabilities. The Data Hub provides a centralized data repository to store and manage comprehensive academic and institutional data, and can connect to a range of advanced analytics tools to enable state-of-the-art data analytics.

Based on a hub-and-spoke model, the Data Hub integrates with UT operational systems, stores data in a data lake (the hub), and provides workspaces (the spokes) for data developers to transform operational data into valuable insights that advance university outcomes.

What Can I Do in the Data Hub?

Once your CSU has access, the Data Hub supports a wide range of analytical work:

  • Query institutional data directly using tools you already know: Tableau, Power BI, Amazon QuickSite, SQL-based tools, Python, R, and more
  • Store and manage CSU-level data alongside institutional sources
  • Build and maintain dashboards and reports within your spoke
  • Collaborate with your team in a governed, secure environment

Data Hub Components

Data Lake
The Data Lake contains selected feeds and batch loads of unrefined, raw data from UT systems and other sources. It also serves as the central repository for historical and transformed data.
Operational Data Store (ODS)
With proper authorization, data from the Data Lake can be functionally validated, modeled, and transformed within the ODS using approved business logic. D2I uses dbt (Data Build Tool) to build and maintain transformation logic in this layer. The data definitions produced here enable consistent use and reuse across data spokes.
Enterprise Data Warehouse (EDW)
The EDW is where institutional data is transformed, modeled, and made ready for analysis. Dimensional data models, curated, subject-area datasets are built, maintained and optimized here for consistent, reliable reporting and analytics across the university. D2I uses Informatica to build and maintain analytics pipelines and transformations in this layer. 
Integration Hub

The Integration Hub is D2I's enterprise integration platform, enabling secure, governed REST API, access to Data Hub data for UT operational systems, and other university customers. It serves as the primary pathway for automated data flows from the platform, ensuring consistent and reliable integrations across the university. 

Data Spokes
Spokes are dedicated workspaces within the Data Hub ecosystem. In a spoke, developers can access approved institutional data, store and join CSU-level data, create tables, and build derived data products. Each spoke is scoped, vetted, and approved in coordination with the appropriate UT data stewards.

Frequently Asked Questions

Who can use the Data Hub?
The Data Hub is available to UT Austin colleges, schools, and units (CSUs). Because each onboarding requires dedicated time and configuration, D2I brings on a limited number of new CSUs at a time. Submit a request at d2i.utexas.edu/data-hub-access-requests to register your interest and D2I will follow up as capacity becomes available.
What data is available in the Data Hub?
Not all data from all university systems is replicated to the Data Hub. When new analytics require additional data, the D2I team develops a plan to obtain and refresh that data in consultation with the appropriate UT data stewards. To explore what's currently available, visit Alation (utexas.alationcloud.com/app), UT's metadata catalog.
Is the Data Hub secure?
Yes. Data is encrypted in-flight and at-rest at all layers of the Data Hub ecosystem. The architecture and procedures have been reviewed and approved by UT's Information Security Office. The Data Hub is approved to store all classifications of UT data.
Can my CSU bring in our own data?
Yes. Data spokes are designed to support both institutional data access and CSU-level data storage. Your unit can upload local data, join it with institutional sources, and build derived data products, all within your spoke's governed environment.
What tools can I use to connect?
The Data Hub supports connections from a range of analytics tools including Tableau, Power BI, Amazon QuickSight, SQL-based tools, Python, R, SPSS, and more. If you have questions about a specific tool, contact D2I at insights@utexas.edu.
How is the Data Hub different from Cognos/Legacy Data Services?
Cognos is UT's legacy data warehouse platform, now in sustainment mode and no longer being actively developed. The Data Hub is UT's modern, cloud-based data ecosystem where new investment and capabilities are being built. 
Who do I contact to learn more?
Contact D2I at insights@utexas.edu for help with any questions.