tomcat如何進行請求信息編碼

2019-11-14 08:54:44

字體：大中小

來源：轉載

供稿：網友

我們知道網絡傳輸的，都是二進制字節流，那么服務器如何編碼，怎么知道哪個字符集進行編碼呢，那我們深入分析下tomcat連接，仔細探討下。接下來，我們看一下段代碼，這是一個很簡單的表單。

controller中，我們直接用 HttpServletRequest，不用sPRing獲取參數。

@RequestMapping(value = "/demo01", method = RequestMethod.GET) public String dologin1(HttpServletRequest request) throws UnsupportedEncodingException { log.info(request.getCharacterEncoding()); log.info("name:中國" + request.getParameter("name")); log.info("name1:張三" + request.getParameter("name1")); return "login";}

運行tomcat，結果如下，中文亂碼：這里寫圖片描述我們用fiddler查看請求的詳情：我們來經過測試下：

@Test public void test() throws UnsupportedEncodingException { String str = "中國"; byte[] bytes = str.getBytes("utf-8"); System.out.println(Hex.encodeHex(bytes)); System.out.println(new String(bytes, "iso8859-1")); String str1 = "張三"; byte[] bytes1 = str1.getBytes("utf-8"); System.out.println(Hex.encodeHex(bytes1)); System.out.println(new String(bytes1, "iso8859-1")); }

打印如下：

e4b8ade59bbdiso8859-1編碼: ??-???e5bca0e4b889?? ??‰

由此，可以發現，我使用的谷歌瀏覽器，默認使用的中文編碼為utf-8,而tomcat編碼默認的是iso8859-1編碼，由于編碼對應的字符不同，所以造成亂碼。既然有編碼問題，那么肯定可以解決，查看tomcat手冊發現tomcat連接器可以指定uri編碼，參數URIEncoding：This specifies the character encoding used to decode the URI bytes, after %xx decoding the URL. If not specified, ISO-8859-1 will be used. 在server.xml中配置如下：

此時運行tomcat，uri參數問題解決，結果如下：這里寫圖片描述

那請求體參數如何進行編碼呢？我們查看servelt源碼發現，請求體的編碼可以在獲取參數前進行設置，由此猜想，tomcat解析請求體參數是在第一次使用時進行解析，也不難理解，字符串解析是耗性能的，既然不需要使用，那么不用解析，同樣就不用消耗這部分性能。

/** * Overrides the name of the character encoding used in the body of this * request. This method must be called prior to reading request parameters * or reading input using getReader(). Otherwise, it has no effect. * * @param env <code>String</code> containing the name of * the character encoding. * @throws UnsupportedEncodingException if this * ServletRequest is still in a state where a * character encoding may be set, but the specified * encoding is invalid */ public void setCharacterEncoding(String env) throws UnsupportedEncodingException;

改變controller代碼,增加utf-8編碼：

@RequestMapping(value = "/demo01", method = RequestMethod.POST) public String dologin(HttpServletRequest request) throws UnsupportedEncodingException { request.setCharacterEncoding("utf-8"); log.info(request.getCharacterEncoding()); log.info("name:中國" + request.getParameter("name")); log.info("name1:張三" + request.getParameter("name1")); return "login"; }

運行tomcat，發現編碼問題完美解決：這里寫圖片描述難道每次獲取參數前都要設置編碼嗎？肯定有更省事的方式，那就是過濾器，且我們可以直接用spring提供的現成的，org.springframework.web.filter.CharacterEncodingFilter,查看其代碼：

@Override protected void doFilterInternal( HttpServletRequest request, HttpServletResponse response, FilterChain filterChain) throws ServletException, IOException { if (this.encoding != null && (this.forceEncoding || request.getCharacterEncoding() == null)) { request.setCharacterEncoding(this.encoding); if (this.forceEncoding) { response.setCharacterEncoding(this.encoding); } } filterChain.doFilter(request, response); }

發現也就是設置request編碼而已，沒什么神秘的，不過既然有現成的，我們何必再造輪子呢。沒有從tomcat源碼中分析出問題有些遺憾，查看了tomcat部分源碼，也沒得要領，只能說明功力還不夠，需要繼續精進，不過合理的推導也不失為解決問題的一種好辦法。

上一篇：骨牌鋪方格

下一篇：Spring