With the rapid development of mobile smart devices and cloud computing, the wave of artificial intelligence is quietly transforming every aspect of our lives. The resulting experience raises new requirements in linguistics, emotional shaping, and logical construction.
Think about the changes that VUI (Voice User Interface) technology brings: lying on the sofa with both hands busy playing a game, I only need my voice to control the air conditioner and order takeout, and about an hour later I am eating. I believe that experience must be good!
1. Development of VUI
So when the existing GUI (Graphical User Interface) is already so rich, why add a new interaction method? The biggest difference between the two is the input method, and the most notable feature of VUI is that it frees the hands. When we want information we care about, we can ask for it in natural language while our eyes and hands handle other things.
1.1 The first period of VUI
In the 1990s, the first feasible speaker-independent speech recognition system (one that anyone can speak to) was born, and the emergence of the Interactive Voice Response (IVR) system marked the first important period of VUI [1].
People interact with the system over telephone lines to perform tasks such as airline ticket reservations, bank transfers, and business inquiries. I believe everyone has used the 12306 hotline to book train tickets, where we interact with the system by entering numeric commands. Its main features are as follows:
Advantages: Good at recognizing and reading out long strings of characters.
Disadvantage: Users rarely have the opportunity to interrupt the system; the system holds the initiative.
Figure: 12306 phone booking
We let the system recognize our identity and commands by entering an ID number and similar information, and the system reads out long lists of options, such as "1 Beijing, 2 Tianjin, 3 Shandong", for us to choose from. Recalling the process, we must keep responding to the system throughout. If an error occurs midway, the only option is to hang up and start again, so the whole interaction easily leaves the user cautious and tense.
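The system-led flow described above can be sketched as a tiny simulation. This is only an illustration under stated assumptions, not how the 12306 hotline is actually implemented: the function name `run_call`, the prompts, the `STATION_MENU` table, and the outcome strings are all hypothetical. It shows the two traits named earlier: the system drives every step, and a single wrong keypress ends the call.

```python
# Hypothetical menu, loosely modeled on the "1 Beijing, 2 Tianjin, 3 Shandong" prompt.
STATION_MENU = {"1": "Beijing", "2": "Tianjin", "3": "Shandong"}


def run_call(keypresses):
    """Simulate one IVR call.

    keypresses: the caller's keypad inputs, in order (strings).
    Returns (outcome, prompts), where prompts is what the system read out.
    """
    prompts = []
    keys = iter(keypresses)

    # Step 1: the system asks first; the caller can only answer.
    prompts.append("Please enter your ID number.")
    id_number = next(keys, None)
    if id_number is None or not id_number.isdigit():
        # No recovery path: the caller has to hang up and dial again.
        prompts.append("Invalid input. Goodbye.")
        return "hung_up", prompts

    # Step 2: the system reads out the whole menu as one long prompt.
    menu = ", ".join(f"{key} {name}" for key, name in STATION_MENU.items())
    prompts.append(f"Please choose a station: {menu}.")
    choice = next(keys, None)
    if choice not in STATION_MENU:
        prompts.append("Invalid input. Goodbye.")
        return "hung_up", prompts

    prompts.append(f"You chose {STATION_MENU[choice]}.")
    return "booked:" + STATION_MENU[choice], prompts
```

Calling `run_call(["123456", "2"])` walks through both prompts and selects Tianjin, while `run_call(["123456", "9"])` ends the call at the menu step, which mirrors why a caller in this kind of system tends to feel cautious about every input.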
1.2 The second period of VUI
We are now in the early stage of the second period. Apps such as Siri and Google that combine visual and voice information, as well as voice-only products such as Amazon Echo, have gradually developed and become mainstream [1]. With advances in speech recognition, AI, and Internet technology, we can already use speech to handle many things on mobile devices, but many tasks still cannot be done by voice, and these remain to be explored.
Figure: Google Voice APP
Figure: Echo voice assistant product
2. Advantages and disadvantages of VUI compared to GUI
Using the GUI design principles that the TXD team has accumulated as the evaluation standard, we compare the advantages and disadvantages of VUI against each principle in turn.
Figure: TXD Design Principles
The main advantages are:
The main disadvantages are:
Through this comparison, we found that GUI has more advantages in clarity, efficiency, and generality. These are precisely the keys to obtaining information: GUI can provide help to users accurately, and it has good extensibility and versatility. A point-like "question-and-answer" way of obtaining information is more efficient. VUI is the most natural and intimate interaction method that design pursues. It is an "interactive experience with emotion and warmth" that truly starts from the user's point of view. From my personal point of view, at the current stage of technological development VUI is more of an aid, and at least in the short term it will not completely replace GUI.