Grant Agreement no.: 248064
Filed November 30th, 2012

where c takes a high positive value if the link “points” to a web page in language2, and 0 otherwise. For instance, suppose that FBC aims to detect pairs of parallel web documents for the Environment domain in EN and IT. If the EN web page in Figure 3 is fetched, the selected link (in the red ellipse) gets a high score, since the anchor text “italiano (it)” implies that the link points to the Italian translation of the English page. The current version of FBC uses this method so that candidate translations are visited before other links are followed.
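The role of c can be illustrated with a minimal Python sketch. It is not FBC's actual implementation: the hint table `LANGUAGE_HINTS`, the constant `HIGH_SCORE`, and the functions `translation_bonus` and `order_links` are hypothetical names introduced here for illustration. The sketch computes only the term c from the anchor text and uses it alone to order the links of a fetched page; any other terms of the link score are left out.

```python
import heapq
import re

# Hypothetical hint table (an assumption, not taken from FBC): anchor-text
# tokens suggesting that a link leads to a page in the given language.
LANGUAGE_HINTS = {
    "it": {"it", "ita", "italiano", "italian"},
    "en": {"en", "eng", "english", "inglese"},
}

# Assumed value; the text only says that c is "a high positive value".
HIGH_SCORE = 100.0


def translation_bonus(anchor_text, target_lang):
    """Return c: HIGH_SCORE if the anchor text hints at target_lang, else 0."""
    tokens = set(re.findall(r"\w+", anchor_text.lower()))
    hints = LANGUAGE_HINTS.get(target_lang, set())
    return HIGH_SCORE if tokens & hints else 0.0


def order_links(links, target_lang):
    """Order (url, anchor_text) pairs so that likely translations come first.

    Links with equal c keep their original order, because the position index
    is used as the tie-breaker in the heap.
    """
    heap = [
        (-translation_bonus(anchor, target_lang), position, url)
        for position, (url, anchor) in enumerate(links)
    ]
    heapq.heapify(heap)
    return [heapq.heappop(heap)[2] for _ in range(len(heap))]


if __name__ == "__main__":
    # Example URLs are placeholders.
    page_links = [
        ("http://example.org/contact", "Contact us"),
        ("http://example.org/it/", "italiano (it)"),
        ("http://example.org/news", "News"),
    ]
    for url in order_links(page_links, "it"):
        print(url)
```

With the anchor text “italiano (it)” the link receives the high value and is placed at the front of the queue, which mirrors the behaviour described above. Note that plain token matching is crude: a short code such as “it” also occurs as an ordinary English word, so a real crawler would likely need stricter patterns.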