Common Tag Standard is released!
June 11, 2009
As strong believers in the semantic tagging (we wrote about it here and here), we are happy to announce that today one big step toward realization of the idea is made.
Faviki is involved in the development of the new open tagging format – Common Tag, together with AdaptiveBlue, DERI (NUI Galway), Freebase, Yahoo!, Zemanta, and Zigtag. This is the first time that this number of web companies have stepped together from day one to introduce a tagging standard.
People use tags to organize, share and discover content on the Web. However, in the absence of a common tagging format, the benefits of tagging have been limited. Individual things like New York City are often represented by multiple tags (like “nyc”, “new_york_city”, and “newyork”), making it difficult to organize related content; and it is not always clear what a particular tag represents – does the tag “orange” represent the fruit or the color?
The Common Tag format was developed to address the current shortcomings of tagging and help everyone, including end users, publishers, and developers get more out of Web content. It is an outcome of an effort to develop the easiest way to let publishers get more out of their content by semantically marking it up.
Common Tag format is based on RDFa, a standard mechanism for placing structured content within HTML documents. The format uses the URIs of concepts defined on the Web as a way of anchoring the meaning of Tag objects. Common concepts can be found, among others, in two big databases of structured content (or controlled vocabularies, as librarians call it) – Freebase and DBpedia.
Common Tag is based on a small vocabulary defining:
- A class Tag, which holds the metadata provided by a Common Tag for a specific Resource.
- Two properties:
There are also few subclasses and optional properties, you can have a look at the whole specification. Also, developers may feel free to make use of RDFa’s flexibility to extend the expressiveness of the Common Tag format.
An example of two tags indicating that the document is about Twitter (DBpedia URI) and Web 2.0 (Freebase URI):
<body xmlns:ctag="http://commontag.org/ns#" rel="ctag:tagged">
<span typeof="ctag:Tag"
rel="ctag:means" resource="http://dbpedia/resource/Twitter" />
<span typeof="ctag:Tag"
rel="ctag:means" resource="http://rdf.freebase.com/ns/en/web_2_0" />
</body>
Faviki has implemented the Common Tag format (check out the extracted RDFa from Faviki Semantic Web topic page), and we hope that our users will benefit from it, as more publishers, developers and end users join in supporting the Common Tag format.
http://dbpedia/resource/Twitter
W3C SW Case Study by Faviki: Semantic Tags
December 10, 2008
We are honored to have been invited to write a case study about Faviki and the idea behind semantic tags for the W3C Semantic Web Activity website.
The goal of W3C SW case studies is, primarily, to help the Web community at large understand and appreciate the advantages of possibly using Semantic Web technologies in real applications. It was a challenge to write a document that should convince (often skeptical) IT managers and other technology people that there can be made some interesting applications based on SW technology.
I tried to show the benefits of using the semantic tags and described how they are used in Faviki. The key idea of the case study is that the semantic tags, as an intersection point of Web 2.0 and the Semantic Web, have the potential to enable much faster evolution of the Web by providing a solid foundation from which the Semantic Web can grow soundly.
I already wrote on this blog about the need for a tag evolution back in May, so I was happy to present the idea, that has matured in the meantime, to a wider audience.
Many thanks to Ivan Herman for this opportunity and the comments which helped make the entry better.
Also, a big thank you to Maja, Sebastian and Rod for their suggestions.
Zemanta Launches Public Semantic API
December 9, 2008
Today our friends and partners at Zemanta launched a public semantic API, as well as a front side SDK.
Zemanta API analyses unstructured documents/texts and returns five types of content objects:
- machine readable static tags
- general categories and custom taxonomies
- named entities with links to objects from major online knowledge databases: Wikipedia, Amazon, IMDB, RottenTomatoes, CrunchBase,… and to selected pool of online media and blogs
- pictures from Flickr, CC sources and professional agencies
- articles from selected media sources and blogs

This is the first API that returns disambiguated entities linked to DBPedia, Freebase, MusicBrainz, and Semantic Crunchbase. The data can be returned in the standard format of Semantic web – RDF.
There is the extensive developers documentation available, including architecture overview, code samples for most popular programming languages, frontside integration SDK, developers forum and application gallery.
API is free to use for up to 10.000 API calls per month, and for a subscription fee above that.
Zemanta API adds great value to Faviki, by analyzing the text from web pages that are saved by users and suggesting related DBpedia concepts. This makes Faviki users’ lives much easier, because now they can add semantic tags with a just one click.
Zemanta API is a powerful technology that has lots of potential. We can’t recommend it highly enough. Keep up the good work Zemanta
Related articles by Zemanta
Faviki makes it to the ReadWriteWeb’s annual top 10 list of Semantic applications to watch
November 22, 2008
ReadWriteWeb, a popular blog about web technology, has started publishing its annual list of “10 Semantic Web Apps to Watch” last year. This year, I’m happy to announce that Faviki made it to that list.
As the number of Semantic Web startups rapidly increases, I understand that editors at RWW consider this list to be a prediction of success in this brand new part of the market. I am very pleased that Faviki’s idea of semantic bookmarking quickly caught their eye.
I suggest you check out this list. You will find some very interesting and diverse projects, ranging from semantic search engines to resaurant review web sites.
Related articles by Zemanta
Faviki is featured on Google Code
September 23, 2008
Image by Thomas Hawk via Flickr
Faviki is a featured project on Google Code for it’s creative usage of Google AJAX Language API!
This API allows you to translate and detect the language of blocks of text. Despite the fact it has a word “AJAX” in it’s name, the API can be also accessed from non-JavaScript environments.
What is it all about? As we have written recently, Faviki uses Zemanta API to make auto suggestions for tags. That’s OK for English pages, but what about other languages?
They have to be translated first, so Faviki asks Google AJAX Language API for help
A great thing is that you don’t need to specify the original language, it recognizes it automatically!
Automatic translations made this way are not perfect, but they seem to be good enough for Zemanta to find appropriate concepts from English Wikipedia, which are finally translated again into user language (using DBpedia data about language connections).
So, the whole process looks like this (simplified version):

- Faviki fetches a web page and extracts a core text (without HTML and non-relevant content).
- Then it tries to figure out if a content is in English. If it isn’t, it is sent to Google language API, which detects the original language automatically, translates it into English and returns the translation.
- The content is then sent to and analyzed by Zemanta API, which then finds relevant links. Faviki uses links from English Wikipedia – titles are used as semantic tags.
- If users language is not English, we must translate them. Using DBpedia datasets “Links to Wikipedia Article” , we can find names of Wikipedia’s titles in one of 13 languages. These datasets actually contain the connections between English Wikipedia articles and articles from Wikipedia in other languages.
- Finally, suggested tags are offered to a user.
Faviki combines three services to make multilingual semantic tags possible. We hope this will help our non English speaking users to tag their bookmarks faster and more easily. These great services will continue improving in time, so expect that the suggested tags will be better, too.


![Reblog this post [with Zemanta]](http://img.zemanta.com/reblog_e.png?x-id=f2ba5d03-9737-4522-97ea-ce1e9c990501)
![Reblog this post [with Zemanta]](http://img.zemanta.com/reblog_e.png?x-id=bb12dbff-602b-4904-a9db-589b4d7cd0fc)
![Reblog this post [with Zemanta]](http://img.zemanta.com/reblog_e.png?x-id=b2b8749e-2747-487c-80fc-41323dcd1793)
![Reblog this post [with Zemanta]](http://img.zemanta.com/reblog_e.png?x-id=c8e2e6d0-153f-441a-9fc2-e5e7f4afb8a5)
![Reblog this post [with Zemanta]](http://img.zemanta.com/reblog_e.png?x-id=8aea445b-f7d1-47b0-867e-d74dc0380d5f)
![Reblog this post [with Zemanta]](http://img.zemanta.com/reblog_e.png?x-id=d76198bf-b22f-4172-9c07-fdf54393e871)

![Reblog this post [with Zemanta]](http://img.zemanta.com/reblog_e.png?x-id=c3eb605c-408a-49da-bf59-9bab830570c5)
![Reblog this post [with Zemanta]](http://img.zemanta.com/reblog_e.png?x-id=f4caa102-4465-4107-ad28-cd69cf5842de)