Commit 8855433

docs: add Phase B tracking checklist (context-aware ranking)
1 parent a7498c5 commit 8855433

2 files changed: 40 additions, 0 deletions

docs/PHASE_B_CHECKLIST.md

Lines changed: 24 additions & 0 deletions
@@ -0,0 +1,24 @@
+# Phase B — Context-Aware Ranking (Tracking)
+
+Scope (initial slice)
+- Context ranking: add a lightweight scorer that re-ranks Bengali candidates using recent selections/context.
+- Decoder tuning: conservative rule extensions (near-keys/insert/delete caps), keep candidate cap small and deterministic.
+- Regression harness: broaden TSV and/or add a Bengali top‑1 mapping set.
+- Performance: DEBUG timings and coarse benchmarks; establish per‑keystroke budget.
+
+Acceptance
+- Flag OFF unchanged vs baseline.
+- Flag ON improves top‑1 for selected contexts; all tests green; no noticeable latency.
+
+Tasks
+- [ ] Scaffold `ContextRanking.h/m` (no‑op pass‑through initially) + unit tests
+- [ ] Add simple history store (recent selections)
+- [ ] Rank blend: prefer previously chosen forms; tie policies stay deterministic
+- [ ] Integration tests: 2–3 context cases where top‑1 changes as intended
+- [ ] Expand TSV (~25–40 rows) and add a small Bengali top‑1 mapping set
+- [ ] Perf: add DEBUG timings and a coarse benchmark target/guard
+
+Notes
+- Keep decoder caps small; avoid regressions; grow tests alongside rule changes
+- Consider adding PR comment with top failing tests/statistics (optional)
+
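The history-store and rank-blend tasks in this checklist could look roughly like the sketch below. It is a language-agnostic illustration in Python, not the planned `ContextRanking.h/m` code: the class name `ContextRanker`, the `enabled` flag, and the `history_size` default are all hypothetical. It shows three properties the checklist asks for: a pass-through when the flag is OFF, a preference for previously chosen forms when it is ON, and deterministic ties (candidates with equal history counts keep their baseline order).

```python
from collections import Counter, deque
from typing import Deque, Iterable, List


class ContextRanker:
    """Hypothetical re-ranker that prefers recently selected candidates."""

    def __init__(self, enabled: bool = False, history_size: int = 50) -> None:
        self.enabled = enabled
        # Bounded history store: the oldest selections fall off automatically.
        self._history: Deque[str] = deque(maxlen=history_size)

    def record_selection(self, candidate: str) -> None:
        """Remember a candidate the user actually picked."""
        self._history.append(candidate)

    def rank(self, candidates: Iterable[str]) -> List[str]:
        base = list(candidates)
        if not self.enabled:
            # Flag OFF: no-op pass-through, identical to the baseline order.
            return base
        counts = Counter(self._history)
        # Sort by descending selection count; the baseline index breaks ties,
        # so equal inputs always produce the same ordering.
        order = sorted(range(len(base)), key=lambda i: (-counts[base[i]], i))
        return [base[i] for i in order]
```

With the flag ON, a candidate recorded via `record_selection` moves ahead of candidates that were never picked. With the flag OFF, `rank` returns the input order untouched, which is what the "Flag OFF unchanged vs baseline" acceptance item requires.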

docs/export.md

Lines changed: 16 additions & 0 deletions
@@ -64,6 +64,22 @@ This file is maintained by Codex for fast, uninterrupted continuation of work. I
 ## TODO / Reminders
 - Commit Podfile.lock: run `pod install` locally to generate it and commit to lock pod versions (CI warns if missing).
 - If we observe duplicate lines when importing large verbatim transcripts, tune overlap trimming in `tools/new-session-from-continue-chat.sh` via `MAX_OVERLAP_LINES`/`MIN_OVERLAP_LINES` (defaults 200/10).
+
+## Backlog — Phase A Nice‑to‑Haves
+- Integration top‑1: for a few stable words, assert expected is top‑1 (not just present).
+- ON/OFF parity (Suggestion): paired tests where decoder would alter roman; flag OFF must keep Bengali output identical to baseline.
+- TSV growth: expand to ~25–40 rows (multi‑error within limits, near‑keys, no‑ops).
+- Determinism: explicit ordering stability tests when caps/inputs equal.
+- Ranking with priors: when a reliable prior source is confirmed, blend with edit distance + tests.
+- Perf threshold (optional): convert measure to an upper bound once variance is understood.
+
+## Phase B — Preview / Ready Checklist
+- Context‑aware ranking: incorporate local context/history signal to re‑rank candidates.
+- Decoder tuning: extend rules conservatively; keep beam/caps guarded; add regression cases.
+- Regression harness: broaden TSV and/or add a Bengali top‑1 mapping set.
+- Performance: basic benchmarks and log sampling in DEBUG; budget targets per keystroke.
+- Tooling: keep CI artifacts and summaries; consider adding PR comment with top failing tests.
+- Branching: create `feature/phase-b` from `dev` once Phase A merged and green.
 - Revisit Subversion install in CI: once RegexKitLite is reliably sourced via a Git podspec or vendored, remove the Homebrew Subversion install step.
 
 ## Post‑Install Hook (optional quick tweak)
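The "Integration top‑1" and "Regression harness" items in this hunk ask for a check that the expected word is ranked first, not merely present. One possible shape for such a check is sketched below in Python. The real TSV layout is not shown in this commit, so the two-column format (roman input, expected Bengali top‑1), the `#` comment convention, and the `top1_failures` and `suggest` names are all assumptions made for illustration.

```python
import csv
import io
from typing import Callable, List, Tuple


def top1_failures(
    tsv_text: str, suggest: Callable[[str], List[str]]
) -> List[Tuple[str, str, str]]:
    """Return (roman, expected, actual) for rows whose top-1 does not match.

    Assumed row format: roman<TAB>expected. Blank rows, rows starting
    with '#', and rows with fewer than two columns are skipped.
    """
    failures: List[Tuple[str, str, str]] = []
    reader = csv.reader(io.StringIO(tsv_text), delimiter="\t")
    for row in reader:
        if len(row) < 2 or row[0].startswith("#"):
            continue
        roman, expected = row[0], row[1]
        candidates = suggest(roman)
        # An empty candidate list counts as a failure with an empty actual.
        actual = candidates[0] if candidates else ""
        if actual != expected:
            failures.append((roman, expected, actual))
    return failures
```

Passing the suggestion function in as a parameter lets the same rows run against the flag-ON and flag-OFF pipelines, so the failure list could also feed the optional PR comment with top failing tests.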
