日韩性视频-久久久蜜桃-www中文字幕-在线中文字幕av-亚洲欧美一区二区三区四区-撸久久-香蕉视频一区-久久无码精品丰满人妻-国产高潮av-激情福利社-日韩av网址大全-国产精品久久999-日本五十路在线-性欧美在线-久久99精品波多结衣一区-男女午夜免费视频-黑人极品ⅴideos精品欧美棵-人人妻人人澡人人爽精品欧美一区-日韩一区在线看-欧美a级在线免费观看

歡迎訪問 生活随笔!

生活随笔

當前位置: 首頁 > 前端技术 > HTML >内容正文

HTML

native2ascii转译HTML字符,native2ascii转字符

發布時間:2023/12/10 HTML 38 豆豆
生活随笔 收集整理的這篇文章主要介紹了 native2ascii转译HTML字符,native2ascii转字符 小編覺得挺不錯的,現在分享給大家,幫大家做個參考.

轉字符(非Ascii碼字符轉成unicode)

native2ascii msg.txt dist.txt

轉字符(Ascii碼字符轉成中文)

native2ascii -encoding gb2312 msg.txt dist.txt

今天分析一個程序包,用DJ Java Decompiler反編譯后,發現源文件中的很多中文字符都被轉換成UNICODE的編碼格式了。便研究了一下native2ascii工具。這個工具是一個很好的解決國際化問題的工具。具體的命令格式如下:

native2ascii - Native-to-ASCII Converter

將一個文件中的非拉丁或非UNICODE字符形式轉換成拉丁或UNICODE字符的文件。

native2ascii [options] [inputfile [outputfile]]

Java編譯器和其它的Java工具僅僅能夠處理Latin-1或者Unicode(udddd形式)編碼的字符,native2ascii工具將包含其它編碼形式的文件轉換成拉丁或UNICODE格式。

可選項參數

-reverse

執行還原操作:把一個文件從拉丁或UNICODE格式轉換成本地編碼格式的字符。

編碼對照表:

Converter Class Description

-------------------------------------------------------------

8859_1 ISO 8859-1

8859_2 ISO 8859-2

8859_3 ISO 8859-3

8859_4 ISO 8859-4

8859_5 ISO 8859-5

8859_6 ISO 8859-6

8859_7 ISO 8859-7

8859_8 ISO 8859-8

8859_9 ISO 8859-9

Big5 Big5, Traditional Chinese

CNS11643 CNS 11643, Traditional Chinese

Cp037 USA, Canada(Bilingual, French), Netherlands,Portugal, Brazil, Australia

Cp1006 IBM AIX Pakistan (Urdu)

Cp1025 IBM Multilingual Cyrillic: Bulgaria, Bosnia,Herzegovinia, Macedonia(FYR)

Cp1026 IBM Latin-5, Turkey

Cp1046 IBM Open Edition US EBCDIC

Cp1097 IBM Iran(Farsi)/Persian

Cp1098 IBM Iran(Farsi)/Persian (PC)

Cp1112 IBM Latvia, Lithuania

Cp1122 IBM Estonia

Cp1123 IBM Ukraine

Cp1124 IBM AIX Ukraine

Cp1125 IBM Ukraine (PC)

Cp1250 Windows Eastern European

Cp1251 Windows Cyrillic

Cp1252 Windows Latin-1

Cp1253 Windows Greek

Cp1254 Windows Turkish

Cp1255 Windows Hebrew

Cp1256 Windows Arabic

Cp1257 Windows Baltic

Cp1258 Windows Vietnamese

Cp1381 IBM OS/2, DOS People's Republic of China (PRC)

Cp1383 IBM AIX People's Republic of China (PRC)

Cp273 IBM Austria, Germany

Cp277 IBM Denmark, Norway

Cp278 IBM Finland, Sweden

Cp280 IBM Italy

Cp284 IBM Catalan/Spain, Spanish Latin America

Cp285 IBM United Kingdom, Ireland

Cp297 IBM France

Cp33722 IBM-eucJP - Japanese (superset of 5050)

Cp420 IBM Arabic

Cp424 IBM Hebrew

Cp437 MS-DOS United States, Australia, New Zealand,South Africa

Cp500 EBCDIC 500V1

Cp737 PC Greek

Cp775 PC Baltic

Cp838 IBM Thailand extended SBCS

Cp850 MS-DOS Latin-1

Cp852 MS-DOS Latin-2

Cp855 IBM Cyrillic

Cp857 IBM Turkish

Cp860 MS-DOS Portuguese

Cp861 MS-DOS Icelandic

Cp862 PC Hebrew

Cp863 MS-DOS Canadian French

Cp864 PC Arabic

Cp865 MS-DOS Nordic

Cp866 MS-DOS Russian

Cp868 MS-DOS Pakistan

Cp869 IBM Modern Greek

Cp870 IBM Multilingual Latin-2

Cp871 IBM Iceland

Cp874 IBM Thai

Cp875 IBM Greek

Cp918 IBM Pakistan(Urdu)

Cp921 IBM Latvia, Lithuania (AIX, DOS)

Cp922 IBM Estonia (AIX, DOS)

Cp930 Japanese Katakana-Kanji mixed with 4370 UDC,superset of 5026

Cp933 Korean Mixed with 1880 UDC, superset of 5029

Cp935 Simplified Chinese Host mixed with 1880 UDC,superset of 5031

Cp937 Traditional Chinese Host miexed with 6204 UDC,superset of 5033

Cp939 Japanese Latin Kanji mixed with 4370 UDC,superset of 5035

Cp942 Japanese (OS/2) superset of 932

Cp948 OS/2 Chinese (Taiwan) superset of 938

Cp949 PC Korean

Cp950 PC Chinese (Hong Kong, Taiwan)

Cp964 AIX Chinese (Taiwan)

Cp970 AIX Korean

EUCJIS JIS, EUC Encoding, Japanese

GB2312 GB2312, EUC encoding, Simplified Chinese

GBK GBK, Simplified Chinese

ISO2022CN ISO 2022 CN, Chinese

ISO2022CN_CNS CNS 11643 in ISO-2022-CN form, T. Chinese

ISO2022CN_GB GB 2312 in ISO-2022-CN form, S. Chinese

ISO2022KR ISO 2022 KR, Korean

JIS JIS, Japanese

JIS0208 JIS 0208, Japanese

KOI8_R KOI8-R, Russian

KSC5601 KS C 5601, Korean

MS874 Windows Thai

MacArabic Macintosh Arabic

MacCentralEurope Macintosh Latin-2

MacCroatian Macintosh Croatian

MacCyrillic Macintosh Cyrillic

MacDingbat Macintosh Dingbat

MacGreek Macintosh Greek

MacHebrew Macintosh Hebrew

MacIceland Macintosh Iceland

MacRoman Macintosh Roman

MacRomania Macintosh Romania

MacSymbol Macintosh Symbol

MacThai Macintosh Thai

MacTurkish Macintosh Turkish

MacUkraine Macintosh Ukraine

SJIS Shift-JIS, Japanese

UTF8 UTF-8[@more@]native2ascii轉字符

總結

以上是生活随笔為你收集整理的native2ascii转译HTML字符,native2ascii转字符的全部內容,希望文章能夠幫你解決所遇到的問題。

如果覺得生活随笔網站內容還不錯,歡迎將生活随笔推薦給好友。