Faviki is featured on Google Code

September 23, 2008

Google Code

Image by Thomas Hawk via Flickr

Faviki is a featured project on Google Code for it’s creative usage of Google AJAX Language API!

This API allows you to translate and detect the language of blocks of text. Despite the fact it has a word “AJAX” in it’s name, the API can be also accessed from non-JavaScript environments.

What is it all about? As we have written recently, Faviki uses Zemanta API to make auto suggestions for tags. That’s OK for English pages, but what about other languages?

They have to be translated first, so Faviki asks Google AJAX Language API for help 🙂 A great thing is that you don’t need to specify the original language, it recognizes it automatically!

Automatic translations made this way are not perfect, but they seem to be good enough for Zemanta to find appropriate concepts from English Wikipedia, which are finally translated again into user language (using DBpedia data about language connections).

So, the whole process looks like this (simplified version):

  1. Faviki fetches a web page and extracts a core text (without HTML and non-relevant content).
  2. Then it tries to figure out if a content is in English. If it isn’t, it is sent to Google language API, which detects the original language automatically, translates it into English and returns the translation.
  3. The content is then sent to and analyzed by Zemanta API, which then finds relevant links. Faviki uses links from English Wikipedia – titles are used as semantic tags.
  4. If users language is not English, we must translate them. Using DBpedia datasets “Links to Wikipedia Article” , we can find names of  Wikipedia’s  titles in one of 13 languages. These datasets actually contain the connections between English Wikipedia articles and articles from Wikipedia in other languages.
  5. Finally, suggested tags are offered to a user.

Faviki combines three services to make multilingual semantic tags possible. We hope this will help our non English speaking users to tag their bookmarks faster and more easily. These great services will continue improving in time, so expect that the suggested tags will be better, too.

Reblog this post [with Zemanta]

7 Responses to “Faviki is featured on Google Code”

  1. […] has been selected as a featured project on Google code, and in that context, Vuk describes the process of how Faviki retrieves its suggestions in a little more detail. It’s really interesting! It also sheds more light on the way that DBpedia is used in Faviki: […]

  2. […] Faviki is featured on Google Code Possibly related posts: (automatically generated)Faviki is featured on Google CodeZemanta semantic plugin for non-English bloggers! […]

  3. […] Vom Vorteil und Elend der Folksonomien Präsentation zum gestrigen Webmontag gibt es auf Slideshare – zwei Bilder sind verloren gegangen, beim Berechnen, hm. Das erste findet man, wenn man auf tagcare.org geht, das zweite ist vom Blog von Faviki. […]

  4. I’m very interested in the work you all are doing at faviki. I am building a social networking site designed for collaborating on and sharing content between three language groups. Our userbase includes material in English, Estonian and Russian – though the vast majority of our users speak English only as a second language (if at all). The Faviki approach would work very well for our site except that neither Estonian nor Russian is included in the DBpedia language base. Are there options for overcoming this?

  5. @Matt,

    I am glad you are interested in our work at Faviki.

    One correction: Russian is included in DBpedia, and it’s possible to tag in Faviki in Russian – take look of https://faviki.wordpress.com/2008/09/10/september-release-multi-language-semantic-tagging-private-bookmarks-enhanced-ui/.

    However, there is no support for Estonian in DBpedia. There is a framework for extracting code from Wikipedia on http://wiki.dbpedia.org/Documentation, hope that might help you.

Leave a Reply

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out /  Change )

Google+ photo

You are commenting using your Google+ account. Log Out /  Change )

Twitter picture

You are commenting using your Twitter account. Log Out /  Change )

Facebook photo

You are commenting using your Facebook account. Log Out /  Change )


Connecting to %s

%d bloggers like this: