Is the era of artificial speech translation upon us? - IELTS Reading Answers & Explanations
From Cambridge IELTS 19 Academic Reading Test 3 · Part 3 · Questions 27–40
Reading Passage
Is the era of artificial speech translation upon us?
Once the stuff of science fiction, technology that enables people to talk using different languages is now here. But how effective is it?
Noise, Alex Waibel tells me, is one of the major challenges that artificial speech translation has to meet. A device may be able to recognize speech in a laboratory, or a meeting room, but will struggle to cope with the kind of background noise I can hear in my office surrounding Professor Waibel as he speaks to me from Kyoto station in Japan. I’m struggling to follow him in English, on a scratchy line that reminds me we are nearly 10,000 kilometers apart-and that distance is still an obstacle to communication even if you’re speaking the same language, as we are. We haven’t reached the future yet. If we had, Waibel would have been able to speak more comfortably in his native German and I would have been able to hear his words in English.
At Karlsruhe Institute of Technology, where he is a professor of computer science, Waibel and his colleagues already give lectures in German that their students can follow in English via an electronic translator. The system generates text that students can read on their laptops or phones, so the process is somewhat similar to subtitling. It helps that lecturers speak clearly, don’t have to compete with background chatter, and say much the same thing each year.
The idea of artificial speech translation has been around for a long time. Douglas Adams’ science fiction novel, The Hitchhiker’s Guide to the Galaxy, published in 1979, featured a life form called the ‘Babel fish’ which, when placed in the ear, enabled a listener to understand any language in the universe. It came to represent one of those devices that technology enthusiasts dream of long before they become practically realizable, like TVs flat enough to hang on walls: objects that we once could only dream of having but that are now commonplace. Now devices that look like prototype Babel fish have started to appear, riding a wave of advances in artificial translation and voice recognition.
At this stage, however, they seem to be regarded as eye-catching novelties rather than steps towards what Waibel calls ‘making a language-transparent society.’ They tend to be domestic devices or applications suitable for hotel check-ins, for example, providing a practical alternative to speaking traveler’s English. The efficiency of the translator is less important than the social function. However, ‘Professionals are less inclined to be patient in a conversation,’ founder and CEO at Waverly Labs, Andrew Ochoa, observes. To redress this, Waverly is now preparing a new model for professional applications, which entails performance improvements in speech recognition, translation accuracy and the time it takes to deliver the translated speech.
For a conversation, both speakers need to have devices called Pilots (translator earpieces) in their ears. ‘We find that there’s a barrier with sharing one of the earphones with a stranger,’ says Ochoa. That can’t have been totally unexpected. The problem would be solved if earpiece translators became sufficiently prevalent that strangers would be likely to already have their own in their ears. Whether that happens, and how quickly, will probably depend not so much on the earpieces themselves, but on the prevalence of voice-controlled devices and artificial translation in general.
Waibel highlights the significance of certain Asian nations, noting that voice translation has really taken off in countries such as Japan with a range of systems. There is still a long way to go, though. A translation system needs to be simultaneous, like the translator’s voice speaking over the foreign politician being interviewed on the TV, rather than in sections that oblige speakers to pause after every few remarks and wait for the translation to be delivered. It needs to work offline, for situations where internet access isn’t possible, and to address apprehensions about the amount of private speech data accumulating in the cloud, having been sent to servers for processing.
Systems not only need to cope with physical challenges such as noise, they will also need to be socially aware by addressing people in the right way. Some cultural traditions demand solemn respect for academic status, for example, and it is only polite to respect this. Etiquette-sensitive artificial translators could relieve people of the need to know these differing cultural norms. At the same time, they might help to preserve local customs, slowing the spread of habits associated with international English, such as its readiness to get on first-name terms.
Professors and other professionals will not outsource language awareness to software, though. If the technology matures into seamless, ubiquitous artificial speech translation, it will actually add value to language skills. Whether it will help people conduct their family lives or relationships is open to question-though one noteworthy possibility is that it could overcome the language barriers that often arise between generations after migration, leaving children and their grandparents without a shared language.
Whatever uses it is put to, though, it will never be as good as the real thing. Even if voice-morphing technology simulates the speaker’s voice, their lip movements won’t match, and they will look like they are in a dubbed movie. The contrast will underline the value of shared languages, and the value of learning them. Sharing a language can promote a sense of belonging and community, as with the international scientists who use English as a lingua franca, where their predecessors used Latin. Though the practical need for a common language will diminish, the social value of sharing one will persist. And software will never be a substitute for the subtle but vital understanding that comes with knowledge of a language.
Questions
Questions 27–30 Multiple Choice (One Answer)
Choose the correct letter, A, B, C or D.
Write the correct letter in boxes on your answer sheet.
Questions 31–34 Matching Sentence Endings
Complete each sentence with the correct ending, A-F, below.
Write the correct letter, A-F, in boxes on your answer sheet.
A. but there are concerns about this
B. as systems do not need to conform to standard practices.
C. but they are far from perfect.
D. despite the noise issues
E. because translation is immediate.
F. and have an awareness of good manners.
Questions 35–40 Yes / No / Not Given
Do the following statements agree with the views of the writer in Reading Passage?
In boxes on your answer sheet, write
YES if the statement agrees with the views of the writer
NO if the statement contradicts the views of the writer
NOT GIVEN if it is impossible to say what the writer thinks about this
Answers & Explanations Summary
| # | Answer | Evidence | Explanation |
|---|---|---|---|
| Q27 | D | I’m struggling to follow him in English, on a scratchy line that reminds me we are nearly 10,000 kilometers apart-and that distance is still an obstacle to communication even if you’re speaking the same language, as we are. | Excerpt/Passage Explanation: The passage says that the speaker is having a hard time understanding another person. They are talking in English, but even though they are speaking the same language, there is still a problem because they are very far apart—about 10,000 kilometers. This distance makes it harder for them to communicate. Answer Explanation: The answer means that both people in the conversation are having difficulties. It is not just about speaking a different language; they are also having trouble understanding each other because of the distance. Reason For Correctness: The correct answer is D because the passage shows that the speaker finds it hard to understand the other person, even though they are speaking English. The distance makes it challenging for both people to communicate clearly. |
| Q28 | A | At Karlsruhe Institute of Technology, where he is a professor of computer science, Waibel and his colleagues already give lectures in German that their students can follow in English via an electronic translator. | Excerpt/Passage Explanation: The passage talks about how at Karlsruhe Institute of Technology, a professor named Waibel teaches in German. The students who speak different languages can understand the lectures in English because of an electronic translator. Answer Explanation: The answer A means that when the professor talks in German, he often repeats what he says. This helps the electronic translator give better translations to the students. Reason For Correctness: The correct answer is A because the repeated content of lectures helps the electronic translator understand better and give clearer translations to the students. |
| Q29 | C | The idea of artificial speech translation has been around for a long time. Douglas Adams’ science fiction novel, The Hitchhiker’s Guide to the Galaxy, published in 1979, featured a life form called the ‘Babel fish’ which, when placed in the ear, enabled a listener to understand any language in the universe. | Excerpt/Passage Explanation: The passage talks about a science fiction book called The Hitchhiker's Guide to the Galaxy. This book was written by Douglas Adams and published in 1979. It includes a special creature called the 'Babel fish'. This fish can be put in a person's ear, and it helps them understand all languages in the universe. Answer Explanation: The answer C means that the idea of using machines or creatures to translate speech into different languages was not seen as surprising or unexpected. Reason For Correctness: The correct answer is C because the passage mentions the Babel fish as an example of artificial speech translation, showing that the idea of translating languages has been thought of for a long time. This implies that such a concept was not surprising to people. |
| Q30 | B | For a conversation, both speakers need to have devices called Pilots (translator earpieces) in their ears. ‘We find that there’s a barrier with sharing one of the earphones with a stranger,’ says Ochoa. That can’t have been totally unexpected. | Excerpt/Passage Explanation: The passage talks about how two people need special devices called Pilots in their ears to have a conversation. Ochoa mentions that sharing one earphone with a stranger is difficult, and this reaction makes sense. Answer Explanation: The answer B means that it is normal for people to feel unsure about sharing earphones with someone they do not know. Reason For Correctness: The correct answer is B because the passage indicates that there is a problem with sharing earphones, and it is expected for people to feel reluctant to do so, especially with strangers. |
| Q31 | C | Waibel highlights the significance of certain Asian nations, noting that voice translation has really taken off in countries such as Japan with a range of systems. There is still a long way to go, though. | Excerpt/Passage Explanation: The passage says that in Japan, technology for translating spoken language is growing quickly. It mentions that there are many different systems available, but there is still much improvement needed for them. Answer Explanation: The answer 'C' means that although the translation systems are improving, they still have many problems and are not perfect yet. Reason For Correctness: The correct answer is 'C' because the passage discusses the advancements in voice translation in Japan but also states that there is still a long way to go, indicating that the systems have flaws and are not fully developed. |
| Q32 | E | A translation system needs to be simultaneous, like the translator’s voice speaking over the foreign politician being interviewed on the TV, rather than in sections that oblige speakers to pause after every few remarks and wait for the translation to be delivered. | Excerpt/Passage Explanation: The passage explains that in TV interviews, a good translation system should allow the translator to speak at the same time as the person being interviewed. This way, the viewer can hear both the foreign speaker and the translation right away, without waiting. This makes the interview feel more fluid and immediate. Answer Explanation: The answer E means that TV interviews with translation voiceovers are successful because the translation happens at the same time as the original speech. This helps viewers understand what's happening without delays. Reason For Correctness: The correct answer E is right because the passage describes how immediate translation helps in TV interviews. It emphasizes the importance of simultaneous translation for better understanding, which is exactly what answer E states. |
| Q33 | F | Systems not only need to cope with physical challenges such as noise, they will also need to be socially aware by addressing people in the right way. Some cultural traditions demand solemn respect for academic status, for example, and it is only polite to respect this. | Excerpt/Passage Explanation: The passage talks about how translation systems need to do more than just translate words. They also have to understand social situations and show respect when talking to people. Different cultures have different ways of showing respect, especially to those with academic titles or statuses. Answer Explanation: The answer indicates that future translation systems should be polite and address people correctly, following good manners. Reason For Correctness: The correct answer is F because it directly relates to the idea in the passage about the importance of addressing people appropriately and respecting cultural traditions. |
| Q34 | B | At the same time, they might help to preserve local customs, slowing the spread of habits associated with international English, such as its readiness to get on first-name terms. | Excerpt/Passage Explanation: The passage talks about how users might keep their local customs and traditions. It says that this could slow down the spread of common habits from international English, like using first names too quickly. Answer Explanation: The answer is B, which means that people can maintain their customs because their systems do not have to follow standard rules. Reason For Correctness: The correct answer is B because the passage suggests that users can keep their local customs, and one reason they can do this is that their systems are flexible and don’t have to follow traditional ways. |
| Q35 | NO | Whatever uses it is put to, though, it will never be as good as the real thing. Even if voice-morphing technology simulates the speaker’s voice, their lip movements won’t match, and they will look like they are in a dubbed movie. | Excerpt/Passage Explanation: The passage says that language translation systems will not be as good as real human communication. Even if technology can mimic a speaker's voice, it will not completely match how they look when they speak, just like a poorly dubbed movie. Answer Explanation: The answer is 'NO' because the statement about translation systems being very useful is not correct according to the passage. Reason For Correctness: The correct answer is 'NO' because the passage suggests that language translation systems are not as good as real human communication. This implies that while they may be useful, they will always have limitations and will never reach the quality of real interactions. |
| Q36 | YES | Whether it will help people conduct their family lives or relationships is open to question-though one noteworthy possibility is that it could overcome the language barriers that often arise between generations after migration | Excerpt/Passage Explanation: The passage is saying that it is not clear if automated translation will help families or their relationships. It mentions that one good thing could be that it helps people communicate better across different languages, especially between older and younger generations in families after moving to a new country. Answer Explanation: The answer 'YES' means that the reader agrees that the overall value of automated translation for family life is not yet proven. Reason For Correctness: The correct answer is 'YES' because the passage states that it is uncertain whether automated translation will help family lives, indicating that its overall value is still unknown. |
| Q37 | NO | though one noteworthy possibility is that it could overcome the language barriers that often arise between generations after migration, leaving children and their grandparents without a shared language. | Excerpt/Passage Explanation: The passage explains that automated translation could help solve problems caused by different languages in families who move to a new country. For example, it might help children talk with their grandparents even if they speak different languages. Answer Explanation: The answer 'NO' means that the statement is not correct. It says that automated translation makes life harder for immigrant families, but the writer believes it can help them. Reason For Correctness: The correct answer is 'NO' because the writer thinks automated translation can help families communicate better and overcome language problems, not make life more difficult. |
| Q38 | NOT GIVEN | Sharing a language can promote a sense of belonging and community, as with the international scientists who use English as a lingua franca, where their predecessors used Latin. Though the practical need for a common language will diminish, the social value of sharing one will persist. And software will never be a substitute for the subtle but vital understanding that comes with knowledge of a language. | Excerpt/Passage Explanation: The passage talks about how sharing a language helps people feel like they belong to a group. It mentions that scientists who speak English connect better today, just like those who spoke Latin before. Even though someday we might not need a common language as much, the importance of feeling together through language will still be there. It also says that software (like translation tools) cannot replace the deep understanding that comes from knowing a language well. Answer Explanation: The answer is 'NOT GIVEN' because the passage does not provide any information about scientists looking at the visual parts of translating languages. It does not say whether scientists are considering visual aspects or not. Reason For Correctness: The correct answer is 'NOT GIVEN' because the passage does not mention anything about scientists specifically studying visual aspects in language translation. It focuses more on the importance of sharing a language and understanding it deeply. |
| Q39 | NOT GIVEN | Sharing a language can promote a sense of belonging and community, as with the international scientists who use English as a lingua franca, where their predecessors used Latin. Though the practical need for a common language will diminish, the social value of sharing one will persist. | Excerpt/Passage Explanation: The passage talks about how sharing a language helps people feel like they belong and are part of a community. It mentions that international scientists use English as a common language, while in the past, scientists used Latin. The passage says that even though the need for a common language may become less important, the connection and value of sharing a language will still be important. Answer Explanation: The answer 'NOT GIVEN' means that the passage does not say anything about whether English is easier to translate into other languages than Latin. It doesn't provide enough information to know what the writer thinks about this. Reason For Correctness: The correct answer is 'NOT GIVEN' because the passage does not compare the ease of translating English and Latin. It only discusses the use of English and Latin among scientists without making a statement about translation. |
| Q40 | YES | Though the practical need for a common language will diminish, the social value of sharing one will persist. | Excerpt/Passage Explanation: The passage says that while people will need to use a common language less for practical reasons in the future, the importance of sharing a language for social reasons will still be important. Answer Explanation: The answer is YES, which means that the statement agrees with what the writer believes. Reason For Correctness: The correct answer is YES because the writer believes that there is a difference between practical needs (which will decrease) and social needs (which will continue to be important) when it comes to language. |
