These instructions will set you up for running the Python Dashboard on your local machine for development and testing purposes.
You need to have Python installed on your machine. You can download Python from [python.org](https://www.python.org/downloads/).
- Fork or Clone this repository.
- Navigate to the project directory.
Note: When using GitHub Codespaces, you may skip steps 3, 4, and 5. When you launch a GitHub Codespaces instance, the virtual environment is automatically created and the required packages are installed, along with the GitHub Copilot extension.
- Create a Python virtual environment using the following command:

  ```
  python -m venv usage
  ```

  `usage` is the name of the virtual environment. You can replace it with any name you prefer.
- Activate the virtual environment:
  - Windows:

    ```
    .\usage\Scripts\activate
    ```

  - Linux/Mac:

    ```
    source usage/bin/activate
    ```
- Install the required packages using pip:

  ```
  pip install -r requirements.txt
  ```
- Rename `.env.sample` to `.env` and fill in the necessary environment variables.
Environment Variables
The application reads the following environment variables; some are required and others are optional:
- `AZURE_OPENAI_API_VERSION`: (Optional) The API version for Azure OpenAI. Example: `"your_api_version"`
- `AZURE_OPENAI_ENDPOINT`: (Optional) The endpoint for Azure OpenAI. Example: `"https://your_endpoint.openai.azure.com/"`
- `AZURE_OPENAI_API_KEY`: (Optional) The API key for Azure OpenAI. Example: `"your_api_key"`
- `AZURE_OPENAI_ENGINE`: (Optional) The engine for Azure OpenAI. Example: `"your_engine"`
- `GHCP_TOKEN`: (Required) The GitHub Copilot REST API token. Example: `"your_ghcp_token"`
- `ORG_NAME`: (Required) The name of your GitHub organization. Example: `"your_org_name"`
- `ORG_ID`: (Optional) The ID of your GitHub organization. Example: `"your_org_id"`
The Azure OpenAI environment variables are used in the Productivity Calculator page, where Azure OpenAI analyzes the calculations and provides further recommendations generated by the GPT-4o (Omni) model. Leaving these environment variables empty will skip the AI analysis.
Make sure to set these environment variables before running the application.
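For reference, a completed `.env` based on `.env.sample` might look like the following, using the placeholder values from the list above:

```
AZURE_OPENAI_API_VERSION="your_api_version"
AZURE_OPENAI_ENDPOINT="https://your_endpoint.openai.azure.com/"
AZURE_OPENAI_API_KEY="your_api_key"
AZURE_OPENAI_ENGINE="your_engine"
GHCP_TOKEN="your_ghcp_token"
ORG_NAME="your_org_name"
ORG_ID="your_org_id"
```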
The `requirements.txt` file includes all the necessary packages, such as `streamlit`, `openai`, `azure-ai-documentintelligence`, `azure-ai-formrecognizer`, `azure-cognitiveservices-speech`, `azure-common`, `azure-core`, `azure-identity`, `azure-search-documents`, `bs4`, `langchain`, `langchain_community`, `langchain_core`, `langchain_openai`, `markdown`, `matplotlib`, `opencv-python`, `pyperclip`, `pypdfium2`, `python-docx`, `python-dotenv`, `python-pptx`, `PyMuPDF`, `PyPDF2`, `requests`, `streamlit-webrtc`, `unstructured`, `youtube-transcript-api`, and `plotly`.
To run the application, ensure that you are in the project directory and the virtual environment is activated. Then, execute the following command in the terminal:

```
streamlit run home.py
```
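By default, Streamlit serves the application at `http://localhost:8501` and opens it in your browser.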
The project has the following structure:
- `helpers/`: This directory contains helper functions for the application.
  - `api.py`: Contains functions related to API calls.
  - `charts.py`: Contains functions for generating charts.
  - `openai.py`: Contains functions related to OpenAI.
- `pages/`: This directory contains the Streamlit pages for the application.
  - `productivity_calculator.py`: The productivity calculator page.
  - `usage_dashboard.py`: The usage dashboard page.
- `home.py`: The main file to run the application.
- `.env`: Contains environment variables. You need to create this file based on `.env.sample`.
- `requirements.txt`: Contains the required packages for the project.
- Navigate to the Home Page:
  - Start by opening the application. The home page provides an overview of the available features.
- Select a Feature from the Sidebar:
  - Use the sidebar to navigate to either the Productivity Calculator or the Usage Dashboard.
- Using the Productivity Calculator:
  - Enter the required values in the sidebar.
  - Click the "Calculate" button to perform the calculations.
  - View the results displayed on the page.
  - If the Azure OpenAI environment variables are set, the AI-powered analysis executes automatically. Otherwise, it is skipped.
AI Analysis: The AI model, guided by a predefined system prompt, analyzes the provided data. The prompt instructs the AI to:
- Start with a forward-looking statement about the anticipated influence of GitHub Copilot on productivity.
- Emphasize key observations that can aid in decision-making.
- Provide suggestions on how to further enhance productivity impact.
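A minimal sketch of how this conditional analysis could be wired up, using the `openai` and `python-dotenv` packages already listed in `requirements.txt`. The function name and condensed prompt below are illustrative assumptions, not the repository's actual code:

```python
import os

from dotenv import load_dotenv  # python-dotenv, listed in requirements.txt
from openai import AzureOpenAI  # openai, listed in requirements.txt

load_dotenv()

# Hypothetical condensed version of the system prompt described above.
SYSTEM_PROMPT = (
    "Start with a forward-looking statement about GitHub Copilot's "
    "anticipated influence on productivity, emphasize key observations "
    "that aid decision-making, and suggest ways to further enhance the "
    "productivity impact."
)


def analyze_productivity(calculation_summary: str) -> str | None:
    """Run the AI analysis only when the Azure OpenAI variables are set."""
    required = (
        "AZURE_OPENAI_API_VERSION",
        "AZURE_OPENAI_ENDPOINT",
        "AZURE_OPENAI_API_KEY",
        "AZURE_OPENAI_ENGINE",
    )
    if not all(os.getenv(name) for name in required):
        return None  # AI analysis is skipped, as described above.

    client = AzureOpenAI(
        api_version=os.environ["AZURE_OPENAI_API_VERSION"],
        azure_endpoint=os.environ["AZURE_OPENAI_ENDPOINT"],
        api_key=os.environ["AZURE_OPENAI_API_KEY"],
    )
    response = client.chat.completions.create(
        model=os.environ["AZURE_OPENAI_ENGINE"],  # Azure deployment name
        messages=[
            {"role": "system", "content": SYSTEM_PROMPT},
            {"role": "user", "content": calculation_summary},
        ],
    )
    return response.choices[0].message.content
```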
- Using the Usage Dashboard:
  - View various charts and metrics related to GitHub Copilot usage.
  - Click the "AI-Powered Data Analysis" button to get insights on usage trends.
  - If the API key is set, an analysis will be generated against the available usage data. Otherwise, an error message will be displayed.
AI Analysis: The AI model, guided by a detailed system prompt, analyzes the provided data. The prompt instructs the AI to focus on several key areas:
- Adoption Trends: Explore how GitHub Copilot adoption has evolved over time, highlighting significant increases or decreases.
- Usage Patterns: Examine trends in user interactions with suggestions, including the frequency and type of suggestions made.
- Acceptance Analysis: Break down the acceptance rate of suggestions, identifying high and low acceptance scenarios and possible reasons.
- Key Metrics: Evaluate critical metrics such as the average number of active users and the average acceptance rate, and discuss their implications.
- User Segmentation: Segment the user base to see if different groups (e.g., beginners vs. experienced developers) have varying usage patterns.
- Language-Based Aggregations: Analyze suggestions, acceptances, lines suggested, and active users per programming language.
- Editor-Based Aggregations: Analyze suggestions, acceptances, lines suggested, and active users per code editor.
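To illustrate the language- and editor-based aggregations, here is a rough sketch using `pandas` (installed as a dependency of `streamlit`). The CSV path matches the files produced by the PowerShell script described below, but the column names are assumptions about the normalized data, not a documented schema:

```python
import pandas as pd

# Load usage data exported by the PowerShell script (see below).
# NOTE: column names are assumed for illustration; adjust to the actual CSV.
usage = pd.read_csv("data/ghcp-usage-data-2024-07-11.csv")

# Aggregate suggestions, acceptances, and lines suggested per language.
per_language = usage.groupby("language", as_index=False).agg(
    suggestions=("suggestions_count", "sum"),
    acceptances=("acceptances_count", "sum"),
    lines_suggested=("lines_suggested", "sum"),
)
per_language["acceptance_rate"] = (
    per_language["acceptances"] / per_language["suggestions"]
)

# The same pattern applies to editor-based aggregations.
per_editor = usage.groupby("editor", as_index=False).agg(
    suggestions=("suggestions_count", "sum"),
    acceptances=("acceptances_count", "sum"),
)

print(per_language.sort_values("suggestions", ascending=False))
print(per_editor.sort_values("suggestions", ascending=False))
```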
Built With
- Python
- Streamlit
- Azure OpenAI
- GitHub Copilot REST API
Using the PowerShell Script to Extract GitHub Copilot Usage Data
The `extract-ghcp-usage-data.ps1` PowerShell script is designed to fetch and normalize GitHub Copilot usage data for your organization. This guide will walk you through the steps to use the script effectively.
Before running the script, ensure you have the following:
- PowerShell: Make sure you have PowerShell installed on a Windows machine.
- GitHub Copilot REST API Token: You need a valid GitHub Copilot REST API token. Follow these instructions to generate a personal access token: [Managing your personal access tokens](https://docs.github.com/en/authentication/keeping-your-account-and-data-secure/managing-your-personal-access-tokens).
- Organization Name: The name of your GitHub organization.
- The following permissions set for your GitHub organization profile:
  - `copilot`
  - `manage_billing:copilot`
The script relies on environment variables to authenticate and fetch data from the GitHub API. You need to set the following environment variables:
- `GHCP_TOKEN`: Your GitHub Copilot REST API token.
- `ORG_NAME`: The name of your GitHub organization.
You can set these environment variables in your PowerShell session using the following commands:
```powershell
Set-Item -Path Env:GHCP_TOKEN -Value "your-ghcp-token"
Set-Item -Path Env:ORG_NAME -Value "your-org-name"
```
Alternatively, you can add these variables to your system environment variables.
- Clone the Repository: If you haven't already, clone the repository containing the script to your local machine.
- Navigate to the `powershell` Directory: Open a PowerShell terminal and navigate to the directory containing the `extract-ghcp-usage-data.ps1` script.
- Execute the Script: Run the script using the following command:

  ```powershell
  .\extract-ghcp-usage-data.ps1
  ```
The script performs the following actions:
- Fetches Copilot Usage Data: Calls the GitHub API to retrieve seats and usage data for your organization.
- Normalizes the Data: Processes and normalizes the data to ensure it is in a consistent format.
- Exports Data to CSV: Saves the normalized data to CSV files in the `data` directory. The `data` directory is created automatically under the `powershell` directory if it does not exist.
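If you prefer to prototype the same flow in Python, a minimal sketch using the `requests` package against GitHub's Copilot seats endpoint might look like the following. The endpoint and response fields follow GitHub's REST API documentation, but pagination and error handling are simplified, and the exported columns are a reduced selection chosen for illustration:

```python
import csv
import os
from datetime import date

import requests

ORG = os.environ["ORG_NAME"]
HEADERS = {
    "Authorization": f"Bearer {os.environ['GHCP_TOKEN']}",
    "Accept": "application/vnd.github+json",
}

# Fetch Copilot seat assignments (first page only, for brevity).
resp = requests.get(
    f"https://api.github.com/orgs/{ORG}/copilot/billing/seats",
    headers=HEADERS,
    params={"per_page": 100},
)
resp.raise_for_status()
seats = resp.json()["seats"]

# Export a date-stamped CSV, mirroring ghcp-seats-data-YYYY-MM-DD.csv.
os.makedirs("data", exist_ok=True)
out_path = f"data/ghcp-seats-data-{date.today().isoformat()}.csv"
with open(out_path, "w", newline="") as f:
    writer = csv.writer(f)
    writer.writerow(["login", "created_at", "last_activity_at"])
    for seat in seats:
        writer.writerow(
            [
                seat["assignee"]["login"],
                seat["created_at"],
                seat.get("last_activity_at"),
            ]
        )
print(f"Copilot seats data saved to {out_path}")
```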
The script generates two CSV files:
- `ghcp-seats-data-YYYY-MM-DD.csv`
- `ghcp-usage-data-YYYY-MM-DD.csv`

The `YYYY-MM-DD` part of the file name corresponds to the date when the script was run.
After running the script, you should see output similar to the following:
```
Copilot usage data saved to data/ghcp-seats-data-2024-07-11.csv
Usage data saved to data/ghcp-usage-data-2024-07-11.csv
```
- Invalid Token: Ensure your `GHCP_TOKEN` is correct and has the necessary permissions.
- Network Issues: Check your internet connection if the script fails to fetch data, and verify whether any firewall or proxy settings are required.
- Environment Variables: Verify that the environment variables are set correctly.