kr_lp_pgnet

Commit Graph

Author	SHA1	Message	Date
songhyeonsoo	0db2bd14b5	fix: lower lr 0.001→0.0001, warmup 5→15, epoch 100→200; add 하/호/배 to HANGUL_CHAR_MAP	1 month ago
songhyeonsoo	429f794018	config: extend training to 100 epochs, reduce warmup to 5 for faster lr rise	1 month ago
songhyeonsu	035394febd	Drop batch size to 16 (32 OOMs on 5090 with PGNet's variable input size) PGNet uses dynamic image sizes per batch (max_text_size=512), which spikes peak memory above 32GB at batch 32. Settle on 16 — ~2x throughput vs the default 14 while staying well clear of OOM. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	1 month ago
songhyeonsu	b94c048526	Sync PGProcessTrain.batch_size with loader's 32 The previous commit only updated Train.loader.batch_size_per_card; PGProcessTrain still expected batch_size=14 which would mismatch the dataloader and silently drop samples. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	1 month ago
songhyeonsu	c7c13e48cd	Provision a 5090-tuned container and bake in cuDNN 9.17 fix - config: batch_size_per_card 14 -> 32 (5090 32GB headroom) - setup_server.sh: pin nvidia-cudnn-cu13>=9.17 to match the sm_120 wheel (without it conv2d hits "Cannot load symbol cublasLtCreate" abort) - new scripts/recreate_container.sh: one-shot rebuild with --shm-size 8g, preserves /root/.netrc so wandb auth survives, runs setup_server.sh Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	1 month ago
songhyeonsu	f5f8939a5c	Wire up wandb logging for Step1 training - Global.use_wandb: True + top-level wandb.project=kr_lp_pgnet - Add wandb to setup_server.sh pip install list User must run `docker exec -it kr_lp_pgnet wandb login` once before training so the API key lands in /root/.netrc inside the container. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	1 month ago
songhyeonsu	82c046522e	Add Step1 training runner and lower default epochs to 50 - run_step1.sh: symlinks /workspace/train_data into PaddleOCR, runs tools/train.py with the step1 pretrain checkpoint, supports DRY_RUN=1 for quick smoke test and EPOCHS=N override - epoch_num: 200 -> 50 (matches the 50k synthetic budget) Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	1 month ago
songhyeonsu	3e5f823dd5	Add Korean LP dictionary and PGNet config - dict/kr_lp_dict.txt: 67 chars covering 4 plate types (10 digits + 40 usage hangul + 17 region hangul, dedup) - configs/kr_lp_pgnet.yml: PGNet config tuned for Korean LP (pad_num=67, max_text_length=10, valid_set=partvgg, infer_visual_type=CN) - setup_server.sh: symlink dict and config into PaddleOCR tree Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	1 month ago

8 Commits (3a1f37b9c57ec8afe72f6af5445d736460575826)