ZenHAX

Free Game Research Forum | Official QuickBMS support | twitter @zenhax | SSL HTTPS://zenhax.com
It is currently Sat Dec 05, 2020 12:12 am

All times are UTC




Post new topic  Reply to topic  [ 1 post ] 
Author Message
PostPosted: Mon Nov 09, 2020 2:30 am 

Joined: Sun Nov 08, 2020 10:31 pm
Posts: 1
The Secret of Monkey Island & Monkey Island 2: LeChuck's Revenge for FM Towns were released with both English & Japanese versions of the games on the disc, and I'm trying to get Japanese text dumps out of them. There's an old translation tool floating around called SCUMMtr that can export/import the full text of SCUMM games, but it obviously wasn't designed with Japanese text in mind. I successfully dumped the English texts from the FM Towns copy of each game with no problems, but because the tool only exports *.TXT files with a limited number of formatting options, all of my attempts to dump the Japanese script are garbled text.

One of the options in SCUMMtr is to export hexadecimal char codes, which seems like the most likely candidate to convert into readable text, but it looks like there's still a bunch of garbage characters in there that I can't figure out what to do with (e.g. {, n, I).

Here's the first 10 lines of the SOMI text dump in English, with matching Japanese text transcribed from the game as reference:
Code:
jungle : ジャングル
rowboat and oars : ボートとオール
jungle : ジャングル
jungle : ジャングル
jungle : ジャングル
It's a gross old bottle.\255\003I don't want it. : 薄汚れた古いガラスびんだ。\255\003こんな物いらない。
bottle : ガラスびん
note : メモ
Herman Toothrot : ハーマン・トゥースロット
There's a message in it! : 中に手紙が入ってる!

The corresponding Japanese hexadecimal codes exported by SCUMMtr:
Code:
\x83W\x83\x83\x83\x93\x83O\x83\x8B
\x83{\x81[\x83g\x82\xC6\x83I\x81[\x83\x8B
\x83W\x83\x83\x83\x93\x83O\x83\x8B
\x83W\x83\x83\x83\x93\x83O\x83\x8B
\x83W\x83\x83\x83\x93\x83O\x83\x8B
\x94\x96\x89\x98\x82\xEA\x82\xBD\x8C\xC3\x82\xA2\x83K\x83\x89\x83X\x82\xD1\x82\xF1\x82\xBE\x81B\xFF\x03\x82\xB1\x82\xF1\x82\xC8\x95\xA8\x82\xA2\x82\xE7\x82\xC8\x82\xA2\x81B
\x83K\x83\x89\x83X\x82\xD1\x82\xF1
\x8E\xE8\x8E\x86
\x83n\x81[\x83}\x83\x93\x81E\x83g\x83D\x81[\x83X\x83\x8D\x83b\x83g
\x92\x86\x82\xC9\x8E\xE8\x8E\x86\x82\xAA\x93\xFC\x82\xC1\x82\xC4\x82\xE9\x81I

I've attached the exported hex code dumps from both games, and a sample file comparing the first 10 lines of the SOMI text dump using SCUMMtr's other export options, if they can help. MI2's other dumps are pretty much identical to SOMI's, except for some reason it kept generating UTF-16 text files instead of ANSI when that option was used.


Is it possible to get a properly encoded Japanese text dump out of this, or is SCUMMtr just not the right tool for the job?


Attachments:
File comment: MI2 hex dump.
M2K-H.txt [661.28 KiB]
Downloaded 15 times
File comment: SOMI hex dump.
MK-H.txt [452.88 KiB]
Downloaded 16 times
File comment: First 10 lines in SOMI text file exported by SCUMMtr. ANSI charset, binary mode, hexadecimal char codes.
SOMI scummtr export sample.txt [2.4 KiB]
Downloaded 15 times
Top
   
Display posts from previous:  Sort by  
Post new topic  Reply to topic  [ 1 post ] 

All times are UTC


You cannot post new topics in this forum
You cannot reply to topics in this forum
You cannot edit your posts in this forum
You cannot delete your posts in this forum
You cannot post attachments in this forum

Search for:
Powered by phpBB® Forum Software © phpBB Limited