US20110195384A1
2011-08-11
13/088,365
2011-04-17
The present invention provides assistance for a person to better comprehend the samples of a target language and, simultaneously, it facilitates that the person learns that language better. In order to do that, some modified versions of those language samples are provided to the learner, in such a way that they allow the learner to comprehend the structure of said language samples, which allows to comprehend the meaning. Said modified versions are created by replacing some groups of words by some groups of characters which are called replacing words. Said replacing words allow to summarize subordinate clauses in the language sample, thus simplifying the complexity of the language sample. This allows the learner to focus on one idea at a time, and thus gradually build her comprehension of the language sample.
Get notified when new applications in this technology area are published.
G06F40/289 » CPC main
Handling natural language data; Natural language analysis; Recognition of textual entities Phrasal analysis, e.g. finite state techniques or chunking
G09B9/04 » CPC further
Simulators for teaching or training purposes for teaching control of vehicles or other craft for teaching control of land vehicles
G09B9/06 » CPC further
Simulators for teaching or training purposes for teaching control of vehicles or other craft for teaching control of ships, boats, or other waterborne vehicles
G09B19/06 IPC
Teaching not covered by other main groups of this subclass Foreign languages
The present patent application is a divisional application of application Ser. No. 10/582,186, filed on 2007, 07, Mar.)
The current invention belongs to the area of systems to aid the comprehension of languages and/or the learning of languages in general, and foreign languages in particular.
The main references to the prior art are the following ones:
Scientific research during the last decades has shed much light on how language works and on the brain processes that underlay it. During the last years in particular the technical character of the research on language has exploded. Many technical and scientific resources are being used to understand the brain processes that are related to the learning of second languages. Underlying this approaches is the idea that the way language works in the brain shares many features with the way other organs work, and that a technical and scientific approach to the processes that take place will allow to develop better systems that have not been possible thus far for learning languages and for correcting defects in the learned language.
For example, experiments are being undertaken that use electro encephalograms, functional magnetic resonance imaging, positron emission tomography, and magneto encephalograms. These experiments have shown that concepts such as âverbâ, âsentenceâ, âsemanticsâ and so on are associated to sophisticated electrophysiological processes. For example, some of those experiments have shown that the brain builds the inflexion part of verbs using two different brain areas, depending on whether it is a regular or an irregular verb [Pinker, 1999].
In some of these experiments, the scientists measure the evoked potentials. Evoked potentials are variations on the measured electromagnetic field which are produced by the brain when a subject is processing samples of language which have certain characteristics. It has been shown that the evoked potentials are different when the subject finds a syntactic anomaly than when the subject finds a semantic anomaly. This provides evidence that the electrophysiological response coming from syntactic processing is different from that one coming from semantic processing [Hagoort et al, 1993].
In other experiments, Hahne and Friederici describe evidences that the brain develops the structural analysis of sentences in two steps. The first step is applied to all language samples in an automatic fashion, and the second one is applied in a controlled way with an intensity that depends on the complexity of the language sample [Hahne and Friederici, 1999].
Streb and colleagues have shown that the electrophysiological processes of the brain depend on the grammatical categories that are being processed. In order to do that, they studied the processes that are developed to recover a proper noun from two different references: a pronoun or the same proper noun [Streb et al, 1999].
Nieto and colleagues have also shown that the brain processes that correspond to different grammatical categories are different. In particular, they have shown it for verbs and nouns [Nieto et al, 1999].
These and other evidences have led many scientists to consider that the brain has a âlanguage organâ, i.e. a part of it that is specifically devoted to processing language. Understanding the technical characteristics of the processes that the brain uses to process language will lead to the development of better systems for language learning.
In relation in particular with second language learning, Kim and colleagues used functional magnetic resonance images in 1997 to show that persons that have learned a second language in adulthood use different areas of the brain than persons that had learned the second language during childhood [Kim et al, 1997].
It seems reasonable to assume that those persons that use a different part of the brain for speaking the second language than for speaking the first language are using general cognitive brain resources, instead of specific language resources. And that they are using those brain resources that are used to solve general problems, such as solving a crossword, an equation, or a text evaluation.
This possibility is supported by the phenomenon known as âcognitive deficitâ, which is a decrease in the ability to perform general activities in a language that is not well known. The origin of the cognitive deficit is probably the competition that takes place between the mental process that manages the second language and the mental process devoted to the general activity that is being performed, given that both processes might be using the same mental resources at the same time.
A variable that was not taken into account in Kim's experiment was how the subjects had learned the second language for those subjects that had learned it in adulthood. In fact, there are some cases of adult learners that manage to achieve a command of the second language that is virtually indistinguishable from that of native speakers, despite having learned it late. It is reasonable to assume that these people have somehow manage to use the second language with the mental resources specifically devoted to language.
In line with the previous paragraph, in the area of second language learning the ultimate goal should be to achieve as good a command as possible of the second language, and this would lead us to develop methodologies that would allow the learner to produce an electrophysiological response to language input equal to that of native speakers. It is reasonable to think that the way in which the second language is learned will condition the way in which brain resources will be used.
In fact, the way in which an activity is developed influences the way in which brain resources are used, as Possner and Raichle show [Possner et al, 1999, p. 129]. These authors describe how the brain map that is produced during performing a verbal production task varies depending on whether the subject has had enough practice previously, despite externally the activity seems to be the same in both cases.
The issue then is how could a second language be learned in such a way that people use the language specific brain resources instead of the general cognitive resources.
In general, most of the second language learners devote long hours to analyzing sentences word by word, often translating words. There is some logic to this approach, given that learning a language requires the comprehension of messages created in that language. The problem is that, if general cognitive resources are being used, such as analysis and translation, the language will be approached using general cognitive resources.
The problem that exists when trying to understand messages in a language that is being learned is that, in order for those messages to be useful for the learner, they must contain linguistic forms that must be more complex that what the learner already knows. If this did not happen, the learner would not have the opportunity to learn new linguistic aspects [Yano 1994]. In this situation, the environment must provide the learner with the opportunity to interact in order to get clues or hints that allow the learner to infer what those messages mean. Learners must somehow have access to what the sentences they are listening mean [Baker 2001, p. 224].
What is necessary, then, is a system that facilitates the comprehension of samples of the second language without the learner having to resort to cognitive analytical efforts, and it is also necessary that that system can also be applied to all kinds of texts.
Such a system would be specially useful for learning second languages, because it would allow learners to work with authentic texts. Authentic texts are samples of language that have been generated to satisfy some communicative or informative need in the community in which that language is the native language. Second language teachers consider that authentic texts are specially useful for learning second languages, and the reason why they do not use them more often is that they do not have systems that allow learners to understand them.
As far as has been reviewed in the prior art, there are not references that solve the current need for systems that facilitate the comprehension of messages in a language that is being learned. In the next lines a general view is provided of the more significant references that have been found.
The general evaluation of the prior art is that we are in need of systems that facilitate the comprehension of foreign languages. Even though this need has been felt for a long time, there have not yet appeared systems that can effectively solve it.
The appearance of the personal computer generated high expectations in the field of second language learning. It was thought that technology linked with pedagogy would provide resources that did not exist yet.
However, the general perception of the field is that, despite those great hopes, and despite the fast expansion of informatics tools in the fields of teaching and comprehension of languages, the final results have been very limited.
Barriere y Duquette, for example, believe that only limited benefits have been achieved with the utilization of automatic tools for facilitating learning [Barriere et al 2002]. The same authors cite other references in which only low learning gains were achieved.
In conclusion, language comprehension and language learning is a vital need in the current global society, and nevertheless there are not systems nor methods that can satisfactory solve this need.
The challenge, then, is to develop systems and methods that facilitate comprehension, but this is an issue that has not been solved yet. A way to approach this question is through the following three key aspects:
The present invention is a means for facilitating language comprehension and learning, and it responds to the perceived need for betters systems for aiding in comprehension and learning of language. It is mainly based on two aspects:
Modern science has shown that, in order to understand a message, it is necessary to assimilate the internal structure of the sentences that the message contains. References [Jackendoff 2002, p. 10], [Lasnik 2000], [Radford 1999 p. 14], [Haegeman 1999 p. 138] and [Partee 2000] explain in detail the characteristics that such structure has. The structure of language is not a mere academic creation, but it is related to the electrophysiological processes that take place in the brain, and it is intimately related also with the variations in frequency, intensity and length of the sounds that make up language. As a person perceives a sentence, his/her mind automatically recreates that sentence's structure, performing during that process very complex operations.
[Fodor 2000] describes some of these operations. This reference shows how the response times after certain word recognition activities are different in different points of the sentence, as a result of the process that builds the structure of the sentence following certain rules of modern linguistic theory.
An example, adapted from [Fodor 2000], and shown in Exhibit 1 in this description, allows to realize the importance that assimilating sentences' structure has in order to comprehend their meanings. The sentence in the example is difficult to understand, even for native speakers of the language in which it is written (which in this case is English). The difficulty to understand it is not due to sophisticated or ambiguous structures, but to the particular form the sentence has. It has too many embedded phrases, and processing them requires too much computational load. Later on in this description it will be shown how the present invention facilitates the assimilation of the sentence's structure and therefore the comprehension of its meaning.
The mouse the cat the dog scared chased jumped over the wall.
In order to identify the structure of messages which are received in spoken language, people use the variations in frequency, intensity and length of soundsâi.e. the intonationâthat they perceive [Quilis et al 1975], [Boysson-Bardies, 2001], y [Christophe et al, 2003]. Such variations allow for the listener to assimilate the structure of the sentence, and therefore to understand the sentence.
Jackendoff believes that the importance of intonation is such that there are certain structures that are not perceived as grammatically correct because the variations that they demand in terms of frequency, intensity and length do not satisfy certain general requirements imposed by intonation [Jackendoff 2002].
The comprehension of structure and variations of frequency, intensity and length are also essential for children to acquire language. It has been seen that such variations provide very important clues for children to assimilate the structure of language and therefore to learn it. For example, it has been shown that very young infants who still do not understand language are able to detect the borders of phrases through the perception of variations in intensity, fundamental frequency and length of the language samples they listened to [Boysson-Bardies, 2001].
In written language, people cannot use intonation, and they resort to their own knowledge of language in order to choose which structure, among the possible structures, is the appropriate one for the message they are perceiving. There exists a phenomenon, called âgarden pathâ, that shows how processing written language is different from processing spoken language, and that it has some limitations. For example, when reading the sentence âThe horse raced across the barn fellâ, a normal speaker will notice that when he reaches the word âfellâ his interpretation of the sentence is not correct.
The first interpretation is that âracedâ is the main verb of the sentence, but in the correct interpretation the main verb is âfellâ, given that the sentence is equivalent to âThe horse that was raced across the barn fellâ.
This problem does not exist when the sentence is perceived aurally, given that the sentence's intonation communicates the structure of the sentence, and that information eliminates any ambiguity. While reading a written text, the reader initially assumes that sentences have a given structure, and changes it incrementally so that the information that he is reading makes sense. Somehow, the reader subconsciously feels the intonation that would be associated to the text that he is reading. The reader perceives that two different interpretations create different subconscious feelings about intonation.
The learner of a foreign language is in disadvantage both in spoken and written language. In spoken language, he does not have enough knowledge about that language's characteristic intonation patterns. In written language, he does not have the capacity what possible structures might exist for a written language sample.
The present invention approaches the process of language comprehension through perception and assimilation of language's structure. In order to do that, the learner will interact with different language samples, and these interactions will provide different views about the language samples that will clarify its structure. In this invention, each of this views is called PERSPECTIVE.
The invention allows the learner to easily understand the structure and meaning of language samples without needing to make much use of the general cognitive resources, given that utilizing those general resources would prevent the utilization of the specific resources that the brain uses for language.
The invention can be applied both to learning a second language or to learning a native language for people that have some problem with the native language. Dyslexic people, for example, seem to have problems which are similar to the problems of learners of foreign languages in the understanding of functional words [Davis 2002], [VanPatten 1996] (functional words are those whose purpose is not to provide a meaning in itselfâsuch as âdogââbut to somehow contribute to the global processing of the sentence.
The issue now is how to facilitate for the person that does not have a good command of a given language to understand the structure of that language. The issue is how to replace all the information that is provided by the variations of frequency, intensity and duration, and that is subconsciously recreated during reading.
All this is possible thanks to the means that are provided by computer technology. Without computers, it would be very difficult to implement the processes of the present invention.
Despite a system such as the current invention is very necessary, no similar system exists today.
The following paragraphs describe a number of concepts and techniques that are included in the invention. It is understood that it is not strictly necessary that all of them are implemented simultaneously.
In order for the learner to use the invention, it is necessary that the language samples are prepared, so that they contain certain information that will allow to show their structure. This preparation can be done automatically, utilizing tools from computational linguistics, manually, or in a mixed fashion, combining some processing performed by automatic tools and a revision performed by a human tutor. The utilization of tools from computational linguistics is not included in this invention, and they are not a limiting factor for the scope of the invention and the patent.
In general, the language samples will be divided in smaller fragments, and the identification of the structure will be performed over those fragments. As will later be explained, in the preferred embodiment such fragments are sentences, but they could have other nature without affecting the scope of the invention. The criterion used to divide the language samples will not limit the scope of the invention.
In order to show the structure of the language samples, that structure must first be identified. The structure will depend on what type of syntax is chosen, but in general it will lead to the identification of several groups of words, the relation among those groups defining the structure of the language samples. In general, these word groups will be phrases.
In order to facilitate the learner to understand the structure, it is important to use as few phrases as possible, so that they do not create confusion. The decision on which phrases to choose will depend on the actual embodiment of the invention. In the section where the preferred embodiment is described, there is an explanation about what phrases will be chosen in the preferred embodiment. In general, in this patent application, the term âchosen phrasesâ will be used in order to refer to the phrases that will be taken into consideration when defining the structure of the language samples.
Chosen phrases can be divided into two groups:
As is generally done in linguistics, the structure of a text fragment is codified using delimiter characters, such as brackets. In this patent application, a language sample in which delimiter characters have been inserted in order to define its structure will be called separated sentence, separated text, or separated fragment.
Exhibit 2 shows a separated sentence for an example text:
[[The man [at the bar]][with the jacket [on his shoulder]]]] finished his sandwich [before the bar closed.]]
The ESCALATOR TREE is the main answer to the perceived need of a replacement for the information provided by the variations in frequency, intensity and duration in order to indicate the structure of the language samples. In the escalator tree, the words are laid in different levels, as in an escalator, as is shown in Exhibit 3 for the sentence that was shown in Exhibit 1.
| The mouse | jumped over the wall. | |||
| the gat | chased. | |||
| the dog scared | ||||
Depending on how the levels are assigned for the different words, there will exist different types of escalator trees, and they will facilitate comprehension in different degrees. In the example, the transitions downwards open new fragments, and the transitions upwards close them.
In order to read an escalator tree that has used a level assignation criteria such as in Exhibit 3, the reader must progress in horizontal direction, changing the level in vertical direction every time a new word is reached that is not at the same level. Every time a transition downwards is done, the reader is entering into a text fragment that remarks or points out something that has to do with the previous fragment. In the case of transitions upwards, the effect is the opposite, i.e. the remark or particularization is finished and the reader returns to the domain of the previous text fragment. (Sometimes this is not exactly so. For examples, subordinate clauses sometimes are before their main clauses, and in head final languages, such as Japanese, this happens most of the time)
In Exhibit 3 it can be seen that reading the first row in horizontal direction the reader obtains âThe mouse jumped over the wallâ. If the second row is read after the transition that exists between âmouseâ and âtheâ, the reader obtains âthe cat chasedâ, which clarifies what mouse is being referred to. If the third row is read after the transition that there is between âcatâ and âtheâ the reader obtains âthe dog scaredâ, which clarifies what cat is being referred to.
As can be seen, now it is possible to understand the meaning of the sentence. Furthermore, now it can be even read, either aloud or silently, because the reader has information about its structure, and can generate, even though with effort, the variations of frequency, intensity and duration of the sounds that correspond to its structure. If the sentence is recorded and listened to, even though it is still difficult to understand it perfectly, part of it at least can now be understood.
Graphical means can be used in order to further increase the ease of comprehension, such as horizontal lines, different colors or formats, etc.
An important issue is what criteria to use in order to assign the levels to the words in the escalator tree. This depends on the actual embodiment that will be carried out, as will explained in the description of the preferred embodiment. In general, when assigning levels to the different parts of the sentence, the goal is to make the most important parts of the sentence as salient as possible. Acting this way, it will happen that the level that corresponds to each fragment of the sentence will be independent of its level of embedding. Later on in this document, FIG. 5 and Exhibit 12 will show how in a possible text string, the components of the string can have an embedding level different from the level they have in the escalator tree.
The first way to aid the learner is to help him to easily identify the different components of a fragment. In order to do that, the invention contains a number of techniques.
The SELECTION OF MINIMAL CHOSEN PHRASE is a technique that helps the user-learner to easily identify the phrases that make up the sentence. When the user selects a given position in the sentence, the system will identify the minimal chosen phrase of that word, and will mark it in a given format. The minimal chosen phrase of a position is the chosen phrase that includes that position and which does not include any other phrase.
For example, for the sentence in Exhibit 2, if the learner would select âshoulderâ, the invention could show what appears in Exhibit 4a. The result is that the user would easily see that those three words must be jointly considered.
The man at the bar with the jacket on his shoulder finished his sandwich before the bar closed.
Additionally, the chosen phrases that are ancestors to the minimal chosen phrase can be marked with a special format that is different from the format used for the selected chosen phrase, as is shown in the Exhibit 4b for the same example. This will facilitate for the learner to perceive the context in which the selected chosen phrase is being used.
The man at the bar with the jacket on his shoulder finished his sandwich before the bar closed.
The ASCENSION OF CHOSEN PHRASES is a functionality that changes the chosen phrase that is selected at a given moment, so that the new selected chosen phrase will be the parent of the current selected chosen phrase. For example, if this functionality is applied to the example in Exhibit 4b, the result would be the text that is shown in Exhibit 4c. This functionality allows the learner to easily interact with the text and explore its structure, thanks to the means provided by computer technology.
The man at the bar with the jacket on his shoulder finished his sandwich before the bar closed.
In order to further facilitate the learner's activities, some embodiments of the invention allow to enable and disable the chosen phrases. When a chosen phrase is disabled, the invention will act as if that phrase does not exist. A way to get the learner's attention focused to the appropriate parts is to disable the basic chosen phrases. The justification is that in the hierarchical processing that the brain performs, clause phrases are more relevant than basic phrases. In these circumstances, the invention would only show those phrases that come from clauses (either subordinated or coordinated clauses). Once the learner is able to understand this level, he could enable all the phrases and focus his attention to understand the details that are provided by basic phrases.
The techniques which have been described in this section can be applied in all the perspectives in which the structure of the language samples is shown.
For instance, FIG. 1 shows an example of an escalator tree, in which different chosen phrases have been shown after the different selections have been done by the learner. Also, certain words have been marked in order to better show the borders of the different phrases. FIG. 2 shows the same sentence without escalator tree, in order to better illustrate the effect that the escalator tree has on the perception of the sentence structure.
PROJECTION SUBSTITUTION is a technique whose goal is to further facilitate for the learner to assimilate the structure of the language samples. It words by replacing phrases or word groups by other simpler words that will allow the learner to selectively focus on different parts of the sentence.
For instance, the phrase âthe man who is the brother of the woman in redâ can be generally replaced by âthe manâ or by âmanâ. This is a general principle in linguistics, and in this invention it is extended to other cases, such a is explained later on, in order to provide assistance to the language learner.
This technique is based on two linguistic concepts: the projection principle and the criterion of constituent replacement, both of them explained in [Radford 1999], [Ouhalla 1999], [Haegeman 1999].
An example will be shown now about how this can be applied in the invention, for the example fragment âJohn went to the store after Mary turned on the radio that she had both the previous weekâ:
Instead of using invented text strings, such as âXYZ2â, words that exist in the language can also be used. The projection principle and the criterion of constituent replacement ensure that this can always be done. For example, instead of writing âJohn went to the store after XYZ1â it could be possible to write âJohn went to the store after thatâ
The main advantage in using existent real words is that it is possible to aurally playback the text. However, there are two important disadvantages:
Another advantage of using words which are not real is that it is possible to use indices that allow the learner to easily navigate through the different text fragments associated to the sentence.
In this invention, and without limiting effects, he words that are used in order to replace projections are called REPLACING WORDS, independently of whether they are real existing words or non existing words.
In order to facilitate comprehension, different formats and different replacing words can be used for different types of phrases. For example, it might be possible to use words such as âOOOO1â to replace a relative phrase, such as âman who came for lunchâ, in âThis is the man who came for lunchâ. The replacement would yield âThis is OOOO1â. This replacement could be combined with another replacement that might be âThis is the man that XYZ1â, and use both in combination.
Projection replacement is used complementarily with the EXPANSION OF REPLACING WORDS, which is a technique that is based on replacing back the replacing words by the text fragment they are associated with, which can also, in turn, contain other replacing words.
In order to use projection replacement and expansion of replacing words, it is necessary to user computer technology. Exhibit 5 shows the details about how this can be applied in the invention. The boxes in which the text samples are included could represent windows in a computer system. In a situation in which the learner is working on the sentence âJohn went to the store after Mary turned on the radio that she had bought the previous weekâ, in a possible embodiment of the invention the following steps might be taken:
| Step 0. | The learner examines the initial sentence. |
| John when to the store after Mary turned on the | |
| radio that she had bought the previous week. | |
| Step 1. | The invention shows the basic structure of the sentence |
| John went to the store after XYZ1. | |
| Step 2. | The user-learner selects the special word âXYZ1â, and |
| the invention expands such special word. | |
| John went to the store after Mary turned on the radio that XYZ2 | |
| Step 3. | The user-learner selects the special word âXYZ2â |
| John went to the store after Mary turned on the radio that | |
| she had bought the previous week. | |
Projection replacement can be applied in all perspectives. For example, Exhibit 6 shows how it can be applied in the escalator tree.
| John went to the store | ||
| after Mary turned on the radio | ||
| that XYX1. | ||
In order to facilitate the comprehension of the different phrases that compose the text fragment that the user wants to understand, several concepts can be used. In order to facilitate the exposition, in what follows some of the concepts that are used in the preferred embodiment will be succinctly described. In the section that describes the preferred embodiment, detailed information about these concepts is shown.
In any of the clarifications it is possible to add words to re present omitted grammatical functions, such as subjects or verbs. These words could be shown within brackets, in order not to confuse the learner.
The INFORMATIVE TREE is a structure that further facilitates the understanding of the structure of the language samples. It is based on the TOWER TREE. Exhibit 7 shows the tower tree for the sentence âThe man at the bar with the jacket on his shoulder finished his sandwich before the bar closed.â As can be seen, the tower tree is the vertical arrangement of the different chosen phrases that exist in the text fragment, where each phrase can be optionally indented according to its embedding level.
| 1 | The man at the bar with the jacket on his shoulder |
| finished his sandwich before the bar closed. | |
| 2 | âThe man at the bar with the jacket on his shoulder |
| 3 | ââThe man at the bar |
| 4 | âââat the bar |
| 5 | ââwith the jacket on his shoulder |
| 6 | âââon his shoulder |
| 8 | âbefore the bar closed |
The INFORMATIVE TREE facilitates the navigation through the phrases, replacing words and other aspects of the target language. For the learner, this helps to understand the samples of target language; for the tutor, it helps to prepare those samples.
FIG. 3 shows an example informative tree for the example sentence âJohn went to the store after Mary turned on the radio that she had bought the previous weekâ. The purpose of FIG. 3 is to facilitate the exposition, and it does not have limiting effects. It is understood that other different types of informative trees with other structures could have been created.
As can be seen in the Figure, the informative tree is a tower tree to which some fields have been added in order to augment the information that is provided about the different chosen phrases of the language fragment on which the user is working. In particular, some clarifications, which have been previously described, have been added.
With this informative tree, the user could be interested in the basic structure of the first chosen phrase, and could see that its clarification 1 is âJohn went to the store after XYZ1â. If now he is interested in completing its comprehension by inspecting in closer detail the replacing word âXYZ1â and the text it corresponds to, he can go to the row where the chosen phrase that corresponds to that replacing word is located, so that he can see the clarification 1 of this new chosen phrase, which is âMary turned on the radio that VWZ1â, and understand its structure and meaning. In the same row, he can now find more information, such as the clarification 2. If the sentence was in Spanish, it would be âMary encendiera la radio que VWZ1â. In this case, the learner might go to the clarification 4. The clarification 4, in particular, would indicate him that it is an event that is not âcurrent timeâ, meaning that the verb is not in indicative mode.
In order to facilitate the navigation in the informative tree, when the user selects a replacing word, such as âXYZ1â, the system might mark the row of the phrase for which âXYZ1â is a replacing word.
In order to further enhance the navigability in the informative tree, when a chosen phrase is selected, the system can also mark somehow its children, generally using a special format or a special background color. Also, it could optionally have the option to open and close children.
In the examples that have been covered, the focus has been put only on clause phrases with the purpose of facilitating the exposition, but basic phrases could have also been used.
The informative tree of the tutor could have several differences respect to the tree of the user-learner. In particular, it could have added columns that might be used to show data that are not relevant for the learner.
In order to understand a sentence it is essential to identify its grammatical components. In order to do that, the speaker resorts to syntactic and semantic criteria. In some languages, such as Japanese, it even happens that the components can have a suffix that identifies them, as shown in Exhibit 8.
| John-ga | Mary-ni | hon-o | yatta | |
| Juan-SU | Mary-IO | book-DO | gave |
| (âJohn gave a book to Maryâ) | |
| Where SU stands for subject, | |
| IO for indirect object, and | |
| DO for direct object. |
In order to help the learner to identify the grammatical components, the present invention uses the GRAMMATICAL PERSPECTIVE, which is characterized by exposing such grammatical components in an organized way. A possible embodiment, that corresponds to the preferred embodiment, uses a table and is shown in the Exhibit 9 to facilitate the exposition and without any limiting character.
| COL 1 | COL 2 | COL 3 | |
| (Icon 1) | Topic | ||
| (Icon 2) | Subject | John | |
| (Icon 3) | Verb | has painted | |
| (Icon 4) | Direct Object | a portrait | |
| (Icon 5) | Indirect Object | ||
| (Icon 6) | Time Adjunct | before XYZ1 | |
| (Icon 7) | Place Adjunct | ||
| (Icon 8) | Manner Adjunct | ||
| (Icon 9) | Cause Adjunct | ||
| (Icon 10) | End Adjunct | ||
| (Icon 12) | Companionship Adjunct | ||
| (Icon 13) | Beneficiary Adjunct | for Sarah | |
| (Icon 14) | Attribute | ||
| (Icon 15) | Agent in passives | ||
The purpose of Exhibit 9 is to facilitate the exposition; it is understood that the number of grammatical components that are chosen might be different from those shown in the Exhibit. Similarly, in other embodiments, the previous table might have other different characteristics. For example, the data could be shown in a horizontal table. Also, the order could be different, in order to show an order that might be similar to the basic order of the target language. Also, the cells that correspond to the grammatical components that do not exist in the sentence could be hidden.
The table might optionally have some additional characteristics, such as showing the name of the grammatical function that corresponds to each part of the table. It is also possible to insert symbols or nicknames for each grammatical component which will allow to identify them quickly and intuitively.
Furthermore, the table can optionally be enriched with some other functionality that is described in the following paragraphs. The first one would be inserting similar sentences, as shown in Exhibit 10, which is characterized by the possibility to add new columns or rows and show other example sentences, which can be known to the user, which contain similar grammatical components. The purpose of this technique is to take advantage of the fact that right after the mind has processed a given linguistic stimulus (the stimulus being either syntactic construction or a word), the mind has a certain tendency to facilitate the processing of other similar stimulus. This phenomenon is often used in techniques known as âprimingâ, whose goal is to explore how the mind associates linguistic stimulus.
| COL 1 | COL 2 | COL 3 | COL 4 | COL 5 |
| (Icon 2) | Subject | John | Louis | Joseph |
| (Icon 3) | Verb | has | has | has built |
| painted | written | |||
| (Icon 4) | Direct Object | a portrait | a book | a house |
| (Icon 6) | Time Adjunct | before | ||
| XYZ1 | ||||
| (Icon 13) | Beneficiary Adjunct | for Sara | for Mary | for Robert |
As has been seen, the invention is based on facilitating comprehension of the structure of target language samples. In order to do that, the learner interacts with the system, and as a result several perspectives are generated of the language sample on which the user is working at each moment.
The main advantage of the invention is that it solves the problem of how to facilitate that the learner comprehends samples of the target language that have higher complexity than the knowledge that the learner has at that moment. This problem was not solved until know. As far as has been known, no similar approach to languages learning has been found.
The most important aspects of the invention are the following ones:
In summary, for the learner, the invention presents a highly systematized method to present the structure of target language samples, which should in turn facilitate the comprehension. For the tutor, the invention includes a highly systematical method for preparing the samples of target language, which should facilitate his tasks.
FIG. 1 shows an escalator tree.
FIG. 2 shows a sentence in which different types of phrases have been selected.
FIG. 3 shows an informative tree.
FIG. 4 shows a general scheme of the invention.
FIG. 5 shows a tree and how the innermost parts usually open last.
In the preferred embodiment, the different perspectives are shows in different interaction windows, these interaction windows being areas of the screen of a computerized system.
FIG. 4 schematically shows the general layout of the invention for the preferred embodiment. In the Figure, it is possible to observe interaction windows 170 that provide the perspectives 180. Each one of the interaction windows 170 can be provided with certain additional functionality, beyond the one described in this section, in order to facilitate the user interaction. In the preferred embodiment, the existing windows provide the following perspectives:
If the learner is working with a clause such as âthe man dressed in red is eatingâ, a possibility would be to playback that clause but emphasizing âdressed in redâ, applying changes in frequency, syllable length and/or intensity that are characteristic of the target language on which the learner is working. This emphasis can be done with previous recordings or with a text to speech converter.
The invention is based on a written text, which can be the transcript of an oral sample or of a sample of a signed language. If the direct transcription is not appropriate, it is possible to make some modifications. It is also possible to make modifications when the original fragment has grammatical structures that are very complex, that could even be difficult to understand even with support tools. For example, an original fragment in Spanish such as âJuan es el que vino ayerâ could be converted into âJuan es el hombre que vino ayerâ.
In the preferred embodiment, general word management and edition of parts of the text is made by word tagging, similarly as how it is done in HTML, XML and other computer applications. For instance, a chosen phrase such as âthe fatherâ, could be really stored in the computer as â<1|the><2|father>â. Text tagging is a technique that is used very often and therefore it is considered to be within the prior art.
Concrete Decisions about the Preferred Embodyment
In the preferred embodiment the following decisions have been taken, and these decisions have no limiting effects on the scope of the invention:
1. Division in Sentences The language samples are divided into sentences, so the learner is working with sentences.
Phrases are chosen following the general rules of chomskyan syntax, as it is described in [Radford 1999], [Ouhalla 1999] and [Haegeman 1999]. For the application of these concepts, it is considered that the word groups such as the Spanish word group âdespues deâ are compounded prepositions (rather than compounded adverbs), so âdespues deâ would be considered to be a preposition (rather than an adverb). The abbreviation of the main concepts that are used are the following:
CP: Complementizer Phrase
IP: Inflexion Phrase
VP: Verbal Phrase
NP: Noun Phrase
AP: Adjective Phrase
PP: Preposition Phrase
DP: Determinant Phrase
BP: Adverbial Phrase
The actual criteria used for choosing phrases are shown in Exhibit 11.
In order to codify the level of each sentence component, several mechanisms can be used. In the preferred embodiment, several types of delimiting characters are used. For example, the string ((A) (((((B) (H)) [C]) [(D)[E]] [F])) only uses two types of delimiting characters: parenthesis and brackets. In the preferred embodiment, the parenthesis do not generate a level increase, but the brackets do. FIG. 5 and Exhibit 12 show how the layout of the components of that string varies, as explained below
FIG. 5 shows the different embedding levels of the example string. One of the last nodes to be shown would be H. If this structure was a phrase, where H is the head word (for example, a noun phrase where H is the noun), the head word would be shown at the end.
Applying the current criteria for level assignation, the previous string would be shown as can be seen in Exhibit 12. It can be seen that the most relevant part of the phrase, H, is located at the top row, so that it is easily perceived by the user.
| ABHCDEF |
| ABHCDEF |
| ABHCDEF |
The concrete criteria for level assignation in the preferred embodiment are shown in Exhibit 13:
Due to the way how the criteria to choose phrases have been defined, the level of a phrase will be the same as the level of the head word of the equivalent phrase, because its last embedding is the embedding that generates the chosen phrase itself. For example, in â((The man [in red]) is eating)â, âTheâ is the head of âThe man in redâ,
In practice, these criteria imply that in a given sentence the grammatical components of the sentence (topic, subject, auxiliaries, verb, direct object, indirect object, adjuncts, beneficiary, attribute and the agent of passive) will all be in the same row, even though in the most general case they have different embedding levels.
In the preferred embodiment, the definition of links is based on the following phrase categorization:
In the preferred embodiment, there exist two basic computer systems, which are schematically represented in FIG. 4, the Tool and the Application. The Tool is used by the tutor in order to prepare the language samples. The Application is used by the learner in order to understand the language samples. In the preferred embodiment, both the tool and the application use basic hardware/software means. Such means can have the same type in the Tool and in the Application, or they can have different types.
In the preferred embodiment, the hardware means would be a personal computer, such as the Dell Dimension XPS, that would contain a monitor, a CPU, a keyboard, a mouse, a RAM memory unit, a fixed permanent memory unit (such as hard drive) and a removable permanent memory unit (such as a unit for a diskette). The basic software means include Microsoft Windows 2000 as operating systems, and the device drivers that the manufacturer considers to be appropriate for the devices existing in the computer.
The expert in the field will notice that other hardware means for performing the same function can be used. For example, it is possible to use a SUN workstation, a portable computer or a PDA (Personal Digital Assistant) device. It would be also possible to use other configurations lacking either mouse or keyboard, given that using it would be possible to enter data and manage the screen cursor. For example, in the utilization of personal computers, the âTabâ key is often used to change the control that is to be active in the screen, and the arrow keys are often used to modify the position of the cursor. Similarly, there could be other configurations not using either mouse nor keyboard, but other interacting means such as an electronic pen or tactile screens. Similarly as well, it would be possible that the system would not have a removable permanent memory unit, with data created by the Tool being transferred to the Application by some teleinformatic connection, such as a modem and the telephone line. Besides that, the system does not need to be a local system, but it could be a terminal that interoperates with a remote server. The expert in the field will also notice that besides the operating system chosen for the preferred embodiment, there are other operating systems that can perform the same function.
A possible alternative is to create the Application so that it shows the perspectives in a fixed fashion, rather than allowing interactions.
Other possibilities that exist to embody the invention are based on modifying the characteristics of the platforms on which the invention is used, for example by employing a television or text documents.
Other alternative is to choose different criteria for assigning levels, so that other types of phrases or word groups are more salient.
On the other hand, it is also possible to create the escalator tree marking level changes that are higher than one. This means that when moving from a word to the next one, the codified level change could be for example of two lines. In order to codify the separated sentence in this case, it could be possible to use additional delimiter characters, such as â{â and â}â or compounded characters, such as for example â(#â and â)#â, which would cover a higher number of level changes.
Other possible embodiment has to do with the creation of the escalator tree in descending direction, i.e. starting on the bottom row and expanding the tree towards the upper rows.
In yet another embodiment, the criteria to choose phrases are different. For example, they can be modified so that condition (b), previously shown in Exhibit 11, would be replaced by the condition (b) that is shown in Exhibit 14. This new criteria list is called âType II Criteria for Choosing Phrasesâ.
The criteria to choose phrases Type II are the same as the criteria Type I except for the fact that condition b in the Type I criteria is replaced by the following condition b:
2. Condition b: The phrase is a DP and:
Another possible alternative would be achieved by considering that the expressions such as the Spanish word group âdespues deâ are adverbial phrases in which the head word is an adverb, such as âdespuesâ and that the prepositional phrase is an argument. In this case, the alternative list of criteria would be equivalent to the previous one, but adding condition g, which is shown in Exhibit 15.
7. Condition g: The phrase is a BP and:
1. A system for facilitating the comprehension and/or the learning of a target language, wherein
said system is applied over at least one initial text string corresponding to a language sample,
said system is embodied in at least one computerized system,
said system comprises the following means:
(a) a processing unit
(b) a display
(c) means to provide a second text string by replacing one or more text parts in said initial string by a group of characters, said one or more text parts being termed in this invention âreplaced partsâ, and said group of characters being termed in this invention âreplacing fragmentâ, wherein said âreplacing fragmentâ is different from said âreplaced partsâ,
(d) means to present a text string in said display,
wherein the result of this action of replacing text parts is termed in this invention âprocessed text stringâ, and the end result is similar to the example shown in Exhibit 5, and it facilitates comprehending the structure of said initial text string, therefore facilitating comprehending the structure of said language sample, therefore facilitating comprehending the meaning of said language sample, and therefore facilitating learning the target language when learning is sought by the user.
2. A system as claimed in claim 1, wherein said initial text string already contains at least one replacing fragment inserted after a previous action of replacing.
3. A system as claimed in claim 1, wherein said replacing fragments are created by linking different characters, and the end result is not a real existing word in the target language.
4. A system as claimed in claim 1, wherein said replacing fragments comprise a numerical index, wherein the fraction of said replacing fragments that is not the index is called the âbodyâ of said replacing fragment.
5. A system as claimed in claim 1, wherein the replacing fragments that are used for different types replaced parts have different bodies, wherein there are different types of replaced parts depending on the grammatical criteria that are applied, wherein one type of replaced parts is ârelative clauseâ.
6. A system as claimed in claim 1, further comprising means for providing a third text string inserting back the replaced parts in place of the replacing fragments that previously had replaced them.
7. A method for facilitating the comprehension and/or the learning of a target language, wherein
said method is applied over at least one initial text string corresponding to a language sample,
said method is embodied in at least one computerized system,
said method comprises the following steps:
(a) providing a processing unit
(b) providing a display
(c) providing a second text string by replacing one or more text parts in said initial string by a group of characters, said one or more text parts being termed in this invention âreplaced partsâ, and said group of characters being termed in this invention âreplacing fragmentâ, wherein said âreplacing fragmentâ is different from said âreplaced partsâ,
(d) presenting said second text string in said display,
wherein the result of this action of replacing text parts is termed in this invention âprocessed text stringâ, and the end result is similar to the example shown in Exhibit 5, and it facilitates comprehending the structure of said initial text string, therefore facilitating comprehending the structure of said language sample, therefore facilitating comprehending the meaning of said language sample, and therefore facilitating learning the target language when learning is sought by the user.
8. A method as claimed in claim 7, wherein said initial text string already contains at least one replacing fragment inserted after a previous action of replacing.
9. A system as claimed in claim 7, wherein said replacing fragments are created by linking different characters, and the end result is not a real existing word in the target language.
10. A system as claimed in claim 7, wherein said replacing fragments comprise a numerical index, wherein the fraction of said replacing fragments that is not the index is called the âbodyâ of said replacing fragment.
11. A system as claimed in claim 7, wherein the replacing fragments that are used for different types replaced parts have different bodies, wherein there are different types of replaced parts depending on the grammatical criteria that are applied, wherein one type of replaced parts is ârelative clauseâ.
12. A system as claimed in claim 7, further comprising the step of providing a third text string inserting back the replaced parts in place of the replacing fragments that previously had replaced them.
13. A computer program that, when executed by one or more processors of a computer, allows said one of more processors to perform the steps of the method of claim 7.