At the start of the Sogou Partner Conference in Beijing, the audience heard a recording that sounded like American President Donald Trump congratulating Sogou on the recent IPO and how this was a step forward for American-Chinese relations. We were wondering if President Trump had started watching CCTV instead of Fox News until we realized that the recording was synthesized. Well played, Sogou.
The playful recording signaled the search companyโs high spirits, who had a successful IPO at the New York Stock Exchange in November 2017. CEO Wang Xiaochuan announced two new products in his keynote for the conference: the Sogou Travel Translator and the Sogou Smart Translation Recorder. Targeting Chinese globetrotters, the Travel Translator offers real-time speech and image translation for 17 languages, including Chinese, English, German and Arabic. While the Translation Recorder offers instant speech to text function and translation of the recorded text for 17 languages.

โWe think the translator is a cut above the rest in the market,โ said Wang Xiaochuan when asked about how the new products compare to the translation offered by competitors during an interview after the conference. This is a bold claim, considering others in the market include the Google Pixel Buds and Chinese voice recognition company iFlytekโs range of translation hardware and software.
Sogou has the data to back that up, however, at least for the Chinese language. Their Sogou Chinese keyboard dominates at over 70% of the input method market and while they fall behind Baidu in the search market, Sogouโs platform is Chinaโs largest search engine by voice. According to Sogou CTO Yang Hongtao, these and other Sogou services combine to provide over 200 million voice requests each day, generating around 240 thousand hours of voice data per dayโall of which will help Sogou refine its natural language processing technology.

The Sogou Travel Translator and the Transcription Pen represent Sogouโs first foray into AI-powered hardware products and will be available for pre-order on JD.com from March 12. TechNode tested the Travel Translator after the conference. The Travel Translator returned an accurate English translation for โWhere is the coffee shop?โ in Chinese (ๅๅกๅบๅจๅช้?). However, when speaking the same phrase in English, the syntax of the Chinese translation was not quite right. A staff explained that the Travel Translator will continue to be improved.
Sogou has started to focus on AI research and development in recent years. It unveiled an IBM Watson-like robot, called Sogou Wangzai (or Sogou Doggy) in 2017. The canine robot appears on Jiangsu TVโs Whoโs Still Standing to battle contestants in general knowledge trivia. Baidu also has a robot called Xiaodu who has appeared on various TV game shows since 2014. Unavoidably, the two robots have drawn comparisons. On Zhihu, Chinaโs version of Quora, some netizens thought Wangzaiโs antics made it more charming.

โI just want to talk with this pretty girl here,โ Wangzai said to host Guo Xiaomin during his debut on Whoโs Still Standing. When we asked Wangzai at the Sogou conference who is the current US President, Wangzai gave the correct answer.
Another Sogou natural language processing technology showcased at the conference was lip reading. Sogou is the first company in China to develop this technology, which IBM and Googleโs DeepMind have been working on for a while. Applications for this technology include assistance for people with hearing or speech disabilities, silent dictation, and subtitling.

TechNode gave Sogouโs lip reading AI a go. When reading a scripted Chinese text, the accuracy was almost 100%. However, when we went off script and said: โHowโs the weather?โ in Chinese (ๅคฉๆฐๆไนๆ ท), the AI picked up the โhowโ (ๆไนๆ ท) but not the rest of the sentence. This is expected as the accuracy rate for unscripted, natural lip reading for Sogou is at 60%. For lip reading in scripted scenarios such as while driving or giving commands to smart home devices, Sogou claims that the accuracy rate is 90%. For comparison, Googleโs DeepMind can annotate English TV footage with a 46.8% accuracy.
There is no doubt of AIโs role in Sogouโs future development, as CEO Wang Xiaochuan explained: โWe are entering a new era where AI technology is unlocking a world of possibilities. We will continue to explore ways that we can leverage Sogouโs expertise in AI, particularly in natural language processing, to develop other smart hardware solutions to everyday challenges.โ
