Feature/indiv markov graphs by AdaraPutri · Pull Request #66 · bohuie/processAnalysis

AdaraPutri · 2026-02-02T17:12:14Z

This PR adds an individual-level version of the transition-edges + Markov graph pipeline, while keeping the same PR-ID sessioning behavior as the team-level flow. It introduces two new scripts (transition_edges_individual.py and graphing_individual.py) that generate per-person CSV outputs and graphs, with the logic split depending on the data source:

Branching/structure labels (data/graph_labels/*_labels_branching_and_structure.csv): the pipeline splits rows by pr_author, so each person’s transitions/graphs only use their own rows (still grouped by PR session).
PR labels (raw, non-clean) (data/csv/pr_labels_year-long-project-team-*.csv): since the raw file has two possible “author” columns, the pipeline first derives a single user field per row using: if source is empty or "pr" → use pr_author, otherwise if source is "review" → use author. Also, because the raw PR label CSV doesn’t have a timestamp column, this flow generates one using the same rules as the clean-CSV helper: default created_at, use merged_at for merge events (reviewed_merge / self_merge), and use updated_at for no_merge (fallback to created_at if missing).

To avoid copy/pasting the same Markov helpers across 4 scripts, common pieces (event parsing/explode + edge computation + graph rendering helpers) were moved into process_model/_markov_common.py, and the team + individual scripts import the functions they need directly.

Testing

Testing will be done in a separate test script PR since this change is already pretty big.

Expected Result

Here is a sample output of one student in Team 2 for the PR label (access through data/outputs/pr_individual/users/year-long-project-team-2/indigoalex-771a/individual_avg_session/indigoalex-771a_avg_session.png)

Closes: #60

…runs evething from extraction to anylysis to graphs. I also created a file that just runs the anylysis and graphing scripts given sometimes we dont need to re gather data.

… error handling in main execution flow

Refactor/documentation

…core

…treamline pipeline execution, and improve error handling for team analysis

…raphing modules; update documentation and tests for clarity and accuracy

Add Unit Tests for process_model

…move wrappers

Feature/adding main run it all file

Mahatav

It looks good; however, I would pull from dev and fix the toggle thing.

Mahatav · 2026-02-21T14:42:29Z

+CURRENT_DIR = os.path.dirname(os.path.abspath(__file__))
+ROOT = os.path.abspath(os.path.join(CURRENT_DIR, "../"))
+
+# ============================================================


I would pull from dev and rework it so everything works without a toggle.

# Conflicts: # event_labelling/CodeStructure_Branching/main.py # event_labelling/PR/get_clean_pr_label.py

Feature/adding elbow score

Refactor: Clean Scripts into Utility

…ation

Feature/comm label

…raphs

AdaraPutri · 2026-03-09T17:37:19Z

@Mahatav thanks for the review Manu! I've pulled from dev now so feel free to re-review

Mahatav

Looks good to merge

Mahatav and others added 21 commits January 19, 2026 11:48

eddited the readme so the explantion for the toggle system is better

239fb8b

updated the redme again to better relft how the filtering works now

4decd28

fixed the clean lable

0279b12

remove the toggel feature in its totallity and creted main file that …

6327283

…runs evething from extraction to anylysis to graphs. I also created a file that just runs the anylysis and graphing scripts given sometimes we dont need to re gather data.

added init file for scripts

cb5d6c5

Added Tests for Process Model

32b9146

Refactored Clean Scripts

f764b63

Refactor label processing functions to use Optional types and improve…

2354db0

… error handling in main execution flow

removing the pdf that i added by mistake

a6be9de

Add compute_elbow_scores function and save elbow scores to CSV

2f58fff

deleted dead code: preprocessing

996c1a3

fix typos in clean_labels.py references in documentation

d24d0ee

added transition_edges_individual.py

9db7928

added graphing_individual.py

168b02d

add utility file markov_common.py

836937f

modified graphing.py to use util functions

a690461

modified graphing_individual.py to use util functions

3c0045c

modified transition_edges.py to use util functions

fae80cb

modified transition_edges_individual.py to use util functions

2c0d22a

fix markov_common exports in transition_edges_individual.py

8a1eb0e

fix some comments

738e563

AdaraPutri requested review from Mahatav and d2r3v February 2, 2026 17:12

AdaraPutri self-assigned this Feb 2, 2026

AdaraPutri added 6 commits February 2, 2026 11:20

added helpers_comm.py

e996952

added prep_data.py

c91ac42

added llm_prompts.py

25c6fc2

added get_clean_comm_label.py

b6927f7

modified comm_label.py to use helper functions

36951c7

restored comm_label.py

89cde60

Mahatav and others added 14 commits February 7, 2026 11:18

Merge pull request #41 from bohuie/refactor/documentation

a847455

Refactor/documentation

Add checks for elbow score computation and CSV saving in main function

8be5381

Merge remote-tracking branch 'origin/dev' into feature/adding_elbow_s…

e10d4b1

…core

Enhance README and main execution flow: add LLM setup instructions, s…

7e2d521

…treamline pipeline execution, and improve error handling for team analysis

Refactor configuration and improve edge filtering in clustering and g…

e052d65

…raphing modules; update documentation and tests for clarity and accuracy

pulled dev and rexolved conflits so it is ready for a merge

358bdda

added temporary config for communication process model

bfa4bca

Add Unit Tests for process_model (#50)

6fa8a7c

Add Unit Tests for process_model

Refactor: Integrate cleaning logic into main labelling scripts and re…

85f1646

…move wrappers

Refactor: Moved Clean Script to Util

8f1b256

Refactor/ File Cleanup

112d503

Merge pull request #55 from bohuie/feature/adding-main-run-it-all-file

ed42e07

Feature/adding main run it all file

Add elbow score plotting functionality and save plots in main function

6a720f1

merged with dev to insure evething is up to date

7ab182d

Mahatav requested changes Feb 21, 2026

View reviewed changes

d2r3v and others added 10 commits March 1, 2026 23:25

Merge remote-tracking branch 'origin/dev' into Refactor/Clean-Util

87a6eb0

# Conflicts: # event_labelling/CodeStructure_Branching/main.py # event_labelling/PR/get_clean_pr_label.py

Merge pull request #65 from bohuie/feature/adding_elbow_score

62eee9c

Feature/adding elbow score

Merge pull request #54 from bohuie/Refactor/Clean-Util

1d6e841

Refactor: Clean Scripts into Utility

added .venv to gitignore

6d80c00

updated toggle removal for graphing and transition edges

d695edb

Merge remote-tracking branch 'origin/dev' into feature/comm-label

50a90ca

updated clustering and zscore_calculation configs to include communic…

db95e92

…ation

Merge pull request #67 from bohuie/feature/comm-label

fac69ff

Feature/comm label

added .venv to gitignore

ca98840

Merge remote-tracking branch 'origin/dev' into feature/indiv-markov-g…

4c46f29

…raphs

AdaraPutri requested a review from Mahatav March 9, 2026 17:37

d2r3v force-pushed the dev branch 2 times, most recently from a8e6d00 to d14e5b5 Compare March 16, 2026 08:26

Mahatav approved these changes Mar 23, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Feature/indiv markov graphs#66

Feature/indiv markov graphs#66
AdaraPutri wants to merge 53 commits intodevfrom
feature/indiv-markov-graphs

AdaraPutri commented Feb 2, 2026 •

edited

Loading

Uh oh!

Mahatav left a comment

Uh oh!

Mahatav Feb 21, 2026

Uh oh!

AdaraPutri commented Mar 9, 2026

Uh oh!

Mahatav left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

AdaraPutri commented Feb 2, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Testing

Expected Result

Uh oh!

Mahatav left a comment

Choose a reason for hiding this comment

Uh oh!

Mahatav Feb 21, 2026

Choose a reason for hiding this comment

Uh oh!

AdaraPutri commented Mar 9, 2026

Uh oh!

Mahatav left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

AdaraPutri commented Feb 2, 2026 •

edited

Loading