Tamil ITN Cardinal Grammar - Dummy PR by retheckj-star · Pull Request #430 · NVIDIA/NeMo-text-processing

retheckj-star · 2026-06-05T05:41:34Z

Language: Tamil
Task: ITN

Dummy PR created as requested.

for more information, see https://pre-commit.ci

mayuris-00

The core exercise is complete and correct: the folder structure, data files, and all three TODOs are implemented properly, and the 28 core test cases should pass. The following items should be addressed before review sign-off:

Target branch: this PR is opened against NVIDIA/NeMo-text-processing:main. Section 11 requires it to target the designated training/review branch, not main.
DCO sign-off: neither commit contains a Signed-off-by: line. The -s flag is required or the DCO check will fail. Please amend or rebase with sign-off and force-push.
Commit message: please use the specified format, feat(ta): add cardinal ITN tagger, verbalizer and test cases.
Remove the stale # TODO instruction comments in the file-level comments.

mayuris-00 · 2026-06-05T06:46:16Z

No comment needed. Correctly empty package marker .

mayuris-00 · 2026-06-05T06:51:17Z

Matches the spec table (1–9) exactly.

mayuris-00 · 2026-06-05T06:54:23Z

0 → சுழியம் matches the spec, so this is correct for the exercise.

mayuris-00 · 2026-06-05T06:55:15Z

All 18 rows (10–20 + round tens) match the spec exactly.

mayuris-00 · 2026-06-05T07:05:36Z

Overall correct; both TODOs are implemented properly. Inline notes:
On the three string_file(...).invert() lines:
-TODO 1 is implemented correctly. .invert() is applied to all three sources, which is required because the TSV files map number to word while ITN needs word to number.

On the # TODO 1: add .invert()... comment:
-This instruction comment is now stale since the line is complete. Please remove it.

On graph = graph_digit | graph_zero | graph_teens_and_ties:
-TODO 2 is correct and appropriate for the core scope. Numbers in the 21–99 range and hundreds would require place-value composition, which is the Section 9 stretch goal and is not expected here.

On the # TODO 2: Combine them... comment:
-Stale instruction comment; please remove.

mayuris-00 · 2026-06-05T07:07:09Z

Overall correct. Inline notes:
On + pynini.closure(NEMO_NOT_QUOTE, 1):
-TODO 3 is correct. Matching one or more non-quote characters correctly captures the digit value between the quotes.

On the # TODO 3: keep the digits... comment:
-Stale instruction comment; please remove now that the line is complete.

mayuris-00 · 2026-06-05T07:11:09Z

Copied from the Hindi folder as instructed, which is correct for the exercise. Minor observation: there is a stray from pynini.lib import pynutil in the middle of the file that is not used, it is not harmful but can be removed for clean code.

mayuris-00 · 2026-06-05T07:12:56Z

This is the Hindi helper copied over, which is exactly what Section 5 instructs, so no change is required for the exercise.

mayuris-00 · 2026-06-05T07:14:16Z

Matches the specification's checker script, and the root location is what Section 8 requires. No change needed.

mayuris-00 · 2026-06-05T07:14:46Z

All 28 cases match Section 8 exactly and use the correct input~expected format.

retheckj-star and others added 2 commits June 5, 2026 11:06

Add Tamil ITN cardinal grammar

6ac0b66

[pre-commit.ci] auto fixes from pre-commit.com hooks

32a2020

for more information, see https://pre-commit.ci

mayuris-00 suggested changes Jun 5, 2026

View reviewed changes

retheckj-star closed this Jun 5, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Tamil ITN Cardinal Grammar - Dummy PR#430

Tamil ITN Cardinal Grammar - Dummy PR#430
retheckj-star wants to merge 2 commits into
NVIDIA:mainfrom
retheckj-star:ta-itn-dummy-pr

retheckj-star commented Jun 5, 2026

Uh oh!

mayuris-00 left a comment

Uh oh!

mayuris-00 Jun 5, 2026

Uh oh!

mayuris-00 Jun 5, 2026

Uh oh!

mayuris-00 Jun 5, 2026

Uh oh!

mayuris-00 Jun 5, 2026

Uh oh!

mayuris-00 Jun 5, 2026

Uh oh!

mayuris-00 Jun 5, 2026

Uh oh!

mayuris-00 Jun 5, 2026

Uh oh!

mayuris-00 Jun 5, 2026

Uh oh!

mayuris-00 Jun 5, 2026

Uh oh!

mayuris-00 Jun 5, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

retheckj-star commented Jun 5, 2026

Uh oh!

mayuris-00 left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants