
Brood War Korean Translations

(blog.sourcedive.net)
231 points todsacerdoti | 1 comments
jaeyounkg ◴[] No.42741363[source]
This was a fun read, as someone who's both a Korean BW player and a speech recognition researcher.

It's interesting to note that the original Korean transcription already has many errors, which the LLMs seemingly (and impressively) corrected later on. For example, 12 안마당 빌드 (12 courtyard build) is actually 12 앞마당 빌드 (12 frontyard build), which might have been more understandable to BW players. Similarly, 투에처리 빌드 (processing-at-two build? makes no sense lol) should have been transcribed as 투해처리 빌드 (two-Hatchery build).

Therefore it may also be helpful to directly feed the slang dictionary into Whisper's inference process using contextual biasing. There are lots of ways to do this, but the simplest would be to increase the probability of slang words in the dictionary in the final prediction layer of Whisper by a constant factor. This is fairly easy to implement, for example by using HuggingFace's library: https://huggingface.co/docs/transformers/en/internal/generat...
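A rough sketch of the "constant factor" idea, in plain Python rather than Whisper or HuggingFace code. Multiplying a token's probability by a constant is the same as adding log(factor) to its logit before the softmax. The four-entry vocabulary, the scores, and the names bias_logits, slang_ids, and factor are all made up for illustration; a real Whisper vocabulary is subword-based, so each slang word would span several tokens and the bias would have to be applied per token sequence.

```python
import math

def bias_logits(logits, boosted_ids, factor):
    """Add log(factor) to the logits of boosted tokens.

    This multiplies their unnormalized probability by `factor`.
    """
    bonus = math.log(factor)
    return [x + bonus if i in boosted_ids else x for i, x in enumerate(logits)]

def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

def argmax(values):
    return max(range(len(values)), key=lambda i: values[i])

# Hypothetical vocabulary and scores (assumptions, not real Whisper output).
vocab = ["안마당", "앞마당", "투에처리", "투해처리"]
logits = [2.0, 1.5, 1.0, 0.8]
slang_ids = {1, 3}  # entries that appear in the slang dictionary

biased = bias_logits(logits, slang_ids, factor=3.0)
best_before = vocab[argmax(logits)]  # the unbiased top candidate
best_after = vocab[argmax(biased)]   # the top candidate after biasing
probs = softmax(biased)
```

With these made-up numbers the mis-transcription 안마당 wins before biasing and 앞마당 wins after. The factor is a tuning knob: too large and the model starts hallucinating slang where none was spoken.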

replies(4): >>42741417 #>>42741497 #>>42742944 #>>42744184 #
bee_rider ◴[] No.42741497[source]
Do they actually use the Korean word for, like, tossing something to refer to the Protoss? That’s a pretty funny cross-language pun if so.
replies(3): >>42741530 #>>42741587 #>>42744112 #
1. jaeyounkg ◴[] No.42741530[source]
Haha, no, I actually never associated this with the English word toss lol.