This repository contains Python code that collects GitHub analytics data from the public repositories of the Imaging Data Commons (IDC) GitHub organization and from selected repositories of the Quantitative Image Informatics for Cancer Research (QIICR) GitHub organization. It also retrieves anonymous usage data captured with Google Analytics 4 through the Google Analytics API. GitHub Actions run the Python scripts daily.
The collected data is stored in Google BigQuery and visualized in an Apache Superset instance, giving us insight into how IDC is used and how users engage with it.
Collects clone and unique clone counts for specified repositories.
Gathers view and unique view counts for repositories.
Captures the top 10 referrers over the last 14 days.
Records the top 10 paths over the last 14 days.
Tracks commits by contributors, including a Weekly Hash.
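As an illustration of the clone metric listed above, here is a minimal sketch that calls the GitHub REST API traffic endpoint and flattens the response into one row per day. It is not the repository's actual script: the helper names (`traffic_url`, `fetch_json`, `clone_rows`, `collect_clones`), the row field names, and the `example-org/example-repo` repository are hypothetical. The endpoint paths and the `timestamp`/`count`/`uniques` response fields come from the GitHub REST API.

```python
import json
import urllib.request

API_ROOT = "https://api.github.com"

# Repository traffic endpoints of the GitHub REST API, keyed by metric.
TRAFFIC_ENDPOINTS = {
    "clones": "traffic/clones",
    "views": "traffic/views",
    "referrers": "traffic/popular/referrers",
    "paths": "traffic/popular/paths",
}


def traffic_url(owner, repo, kind):
    """Build the traffic endpoint URL for one repository and one metric."""
    return f"{API_ROOT}/repos/{owner}/{repo}/{TRAFFIC_ENDPOINTS[kind]}"


def fetch_json(url, token):
    """GET a GitHub API URL with a bearer token and decode the JSON body."""
    request = urllib.request.Request(
        url,
        headers={
            "Accept": "application/vnd.github+json",
            "Authorization": f"Bearer {token}",
        },
    )
    with urllib.request.urlopen(request) as response:
        return json.load(response)


def clone_rows(repo_full_name, payload):
    """Flatten a traffic/clones payload into one row per reported day.

    The output field names are hypothetical; adapt them to the target table.
    """
    return [
        {
            "repository": repo_full_name,
            "timestamp": day["timestamp"],
            "clones": day["count"],
            "unique_clones": day["uniques"],
        }
        for day in payload.get("clones", [])
    ]


def collect_clones(owner, repo, token):
    """Fetch and flatten clone counts for a single repository."""
    payload = fetch_json(traffic_url(owner, repo, "clones"), token)
    return clone_rows(f"{owner}/{repo}", payload)
```

Calling `collect_clones("example-org", "example-repo", token)` would return rows ready to append to a table. The token needs sufficient access to the repository, since GitHub does not expose traffic data anonymously. The same pattern applies to views, referrers, and paths by changing the `kind` argument and the flattening function.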
A Python script collects and analyzes data from Google Analytics 4 (GA4), including the Audience Overview, Acquisition Overview, and Behavior Overview reports.
Retrieves and analyzes data related to user engagement on specified websites. Metrics include total users, new users, sessions, engaged sessions, screen page views, and average session duration.
Focuses on data related to user acquisition and traffic sources. Provides insights into how users are finding and accessing the websites.
Analyzes user behavior on websites, including page views, session durations, and engagement.
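The user-engagement metrics named above map onto a single GA4 Data API `runReport` request. The sketch below builds such a request body and flattens a response into plain records. It is an assumption-laden illustration rather than the repository's script: the helper names, the `date` dimension, and the 7-day default window are hypothetical choices. The metric names and the request and response layout follow the GA4 Data API (v1beta).

```python
# GA4 Data API metric names for the user-engagement metrics listed above.
AUDIENCE_METRICS = [
    "totalUsers",
    "newUsers",
    "sessions",
    "engagedSessions",
    "screenPageViews",
    "averageSessionDuration",
]


def run_report_url(property_id):
    """REST endpoint of the GA4 Data API runReport method for one property."""
    return (
        "https://analyticsdata.googleapis.com/v1beta/"
        f"properties/{property_id}:runReport"
    )


def audience_report_body(start_date="7daysAgo", end_date="today"):
    """Build a runReport request body with one row per day.

    The date window and the "date" dimension are illustrative assumptions.
    """
    return {
        "dateRanges": [{"startDate": start_date, "endDate": end_date}],
        "dimensions": [{"name": "date"}],
        "metrics": [{"name": name} for name in AUDIENCE_METRICS],
    }


def report_rows(response):
    """Flatten a runReport response into a list of {header: value} dicts.

    The API returns every value as a string; type conversion is left to the
    caller.
    """
    dimension_names = [h["name"] for h in response.get("dimensionHeaders", [])]
    metric_names = [h["name"] for h in response.get("metricHeaders", [])]
    records = []
    for row in response.get("rows", []):
        record = {}
        for name, cell in zip(dimension_names, row.get("dimensionValues", [])):
            record[name] = cell["value"]
        for name, cell in zip(metric_names, row.get("metricValues", [])):
            record[name] = cell["value"]
        records.append(record)
    return records
```

The body returned by `audience_report_body()` would be sent as an authenticated POST to `run_report_url(property_id)`, and the decoded JSON response passed to `report_rows`. Acquisition and behavior reports can reuse the same two helpers with different dimensions and metrics.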