目标
实现汉字转拼音功能
参考
https://yq.aliyun.com/articles/18443
开发环境
| 名称 | 版本 |
|---|---|
| 操作系统 | Windows 10 X64 |
| JDK | JDK1.8(jdk-8u151-windows-x64) |
| IntelliJ IDEA | IntelliJ IDEA 2018.3 |
| Maven | Maven 3.6.0 |
说明
本文代码在 IntelliJ IDEA 2018.3 创建 Maven 多模块(Module)项目+负载均衡 项目上进行添加操作,不过也可独立出来看。
pom.xml 引用
<!--汉字转拼音--><dependency><groupId>com.belerweb</groupId><artifactId>pinyin4j</artifactId><version>2.5.0</version></dependency>
公共类-CharacterPinYinConvert
package com.test.invoice.util;import java.util.HashSet;import java.util.Set;import net.sourceforge.pinyin4j.PinyinHelper;import net.sourceforge.pinyin4j.format.HanyuPinyinCaseType;import net.sourceforge.pinyin4j.format.HanyuPinyinOutputFormat;import net.sourceforge.pinyin4j.format.HanyuPinyinToneType;import net.sourceforge.pinyin4j.format.HanyuPinyinVCharType;/*** 汉字转拼音工具类,保留汉字与字母,其他部分忽略,需添加jar包:pinyin4j-2.5.0.jar* https://yq.aliyun.com/articles/18443* @author jianggujin**/public class CharacterPinYinConvert {/*** 设置汉字拼音输出的格式**/HanyuPinyinOutputFormat format = null;/*** 默认构造方法,初始化汉字拼音输出格式*/public CharacterPinYinConvert() {// 设置汉字拼音输出的格式format = new HanyuPinyinOutputFormat();// 不使用音调标记,format.setToneType(HanyuPinyinToneType.WITHOUT_TONE);// 如果使用WITH_TONE_MARK,则必须指定setVCharType为WITH_U_UNICODE,否则报错// fmt.setToneType(HanyuPinyinToneType.WITH_TONE_MARK);// fmt.setVCharType(HanyuPinyinVCharType.WITH_U_UNICODE);// 韵母“驴”(lu->lv)使用V来代替format.setVCharType(HanyuPinyinVCharType.WITH_V);// 返回的拼音为小字字母format.setCaseType(HanyuPinyinCaseType.UPPERCASE);}/*** 汉字转拼音,返回单个转换结果(多音字只取一个结果)** @param text 待转换字符串* @return allPinYin*/public String toPinYin(String text) {StringBuilder allPinYin = new StringBuilder();for (int i = 0; i < text.length(); i++) {char wordChar = text.charAt(i);// 如果为汉字if (Character.toString(wordChar).matches("[\\u4E00-\\u9FA5]")) {try {// 返回汉字的全部拼音(因为有些汉字为多音字,否则只返回一个)String[] pinYinArray = PinyinHelper.toHanyuPinyinStringArray(wordChar, format);// 取第一个拼音String pinYin = pinYinArray[0];if (pinYin != null) {allPinYin.append(pinYin);}} catch (Exception e) {}} else if (((int) wordChar >= 65 && (int) wordChar <= 90)|| ((int) wordChar >= 97 && (int) wordChar <= 122)) {allPinYin.append(wordChar);}}return allPinYin.toString();}/*** 汉字转拼音,返回多个转换结果(例如有多音字的情况)* 英文字母原样显示,数字和字符不显示,例如:123ABC-罗马,输出 ABCLUOMA** @param text 待转换字符串* @return pinYinSet*/public Set<String> toPinYins(String text) {char[] srcChar = text.toCharArray();String[][] temp = new String[text.length()][];for (int i = 0; i < srcChar.length; i++) {char c = srcChar[i];if (String.valueOf(c).matches("[\\u4E00-\\u9FA5]")) {try {temp[i] = PinyinHelper.toHanyuPinyinStringArray(srcChar[i],format);} catch (Exception e) {}} else if (((int) c >= 65 && (int) c <= 90)|| ((int) c >= 97 && (int) c <= 122)) {temp[i] = new String[]{String.valueOf(srcChar[i])};} else {temp[i] = new String[]{""};}}String[] pingyinArray = doExchange(temp)[0];Set<String> pinyinSet = new HashSet<String>();for (int i = 0; i < pingyinArray.length; i++) {pinyinSet.add(pingyinArray[i]);}return pinyinSet;}/*** 处理转换** @return*/private String[][] doExchange(String[][] strJaggedArray) {int len = strJaggedArray.length;if (len >= 2) {int len1 = strJaggedArray[0].length;int len2 = strJaggedArray[1].length;int newlen = len1 * len2;String[] temp = new String[newlen];int index = 0;for (int i = 0; i < len1; i++) {for (int j = 0; j < len2; j++) {temp[index] = strJaggedArray[0][i] + strJaggedArray[1][j];index++;}}String[][] newArray = new String[len - 1][];for (int i = 2; i < len; i++) {newArray[i - 1] = strJaggedArray[i];}newArray[0] = temp;return doExchange(newArray);} else {return strJaggedArray;}}}
接口类
/*** WebClientTest* @return String*/@PostMapping("/Test/PinYin")public ResponseVO PinYin(){ResponseVO responseVO = new ResponseVO<String>();CharacterPinYinConvert convert = new CharacterPinYinConvert();responseVO.setData(convert.toPinYin("罗马")+"======"+convert.toPinYins("一加一的和等于二"));return responseVO;}
测试
convert.toPinYin() ,只显示一个结果(即使有多音字)
convert.toPinYins() ,显示多个结果(多音字的情况)

接口多音字部分的值
YIJIAYIDIHUDENGYUERYIJIAYIDEHUODENGYUERYIJIAYIDEHUDENGYUERYIJIAYIDIHEDENGYUERYIJIAYIDEHAIDENGYUERYIJIAYIDIHUODENGYUERYIJIAYIDIHAIDENGYUERYIJIAYIDEHEDENGYUER