Tagged: non-ascii

Apache Solr non-ASCII characters

- by admin

This day I had a problem while indexing French and Russian content with Apache Solr: French and Russian characters where corrupted.

The solution was very simple: I added
URIEncoding="UTF-8"

attribute to the Connector tag which runs Solr in Tomcat's server.xml. Something like
<Connector port="8080" protocol="HTTP/1.1" 
connectionTimeout="20000"
redirectPort="8443" URIEncoding="UTF-8" />

Then Tomcat has to be restarted :-)

« All tags