zhimin-z committed
Commit 3f8cff7 · 1 Parent(s): d34dfc3
refine
msr.py CHANGED
@@ -1,13 +1,3 @@
-"""
-Minimalist Review Metadata Mining Script
-Mines PR review metadata from local GHArchive data via DuckDB and saves to HuggingFace dataset.
-
-Changes from previous version:
-1. Single SQL query for all agents (no batching)
-2. Batch upload with time gaps and comprehensive retry logic
-3. Handles both rate limit and timeout errors with exponential backoff
-"""
-
 import json
 import os
 import time
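The removed docstring mentions retry logic that handles rate limit and timeout errors with exponential backoff, but the code that does this is not part of this diff. As a rough illustration only, the idea could be sketched as below. The function name `retry_with_backoff`, its parameters, and the choice of exception types are all hypothetical and are not taken from msr.py.

```python
import random
import time


def retry_with_backoff(func, max_retries=5, base_delay=1.0, max_delay=60.0,
                       retry_on=(TimeoutError, ConnectionError),
                       sleep=time.sleep):
    """Call func(), retrying on selected exceptions with exponential backoff.

    Hypothetical helper: names and defaults are assumptions for illustration.
    """
    for attempt in range(max_retries + 1):
        try:
            return func()
        except retry_on:
            # Out of retries: re-raise the last error to the caller.
            if attempt == max_retries:
                raise
            # Delay doubles each attempt (1s, 2s, 4s, ...), capped at max_delay.
            delay = min(max_delay, base_delay * (2 ** attempt))
            # Add up to 10% random jitter so parallel clients do not retry in lockstep.
            delay += random.uniform(0, delay * 0.1)
            sleep(delay)
```

A caller would wrap an upload step in a zero-argument callable, for example `retry_with_backoff(lambda: upload_batch(batch))`, where `upload_batch` stands in for whatever upload function the script actually uses. Passing `sleep` as a parameter is a design choice that makes the helper testable without real waiting.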