Sindice posts

  • Sig.ma Enterprise Edition (EE) available

    By Giovanni Tummarello with no comments.

    The original Sig.ma The http://sig.ma service was created as a demonstration of live, on the fly Web of Data mashup. Provide a query and Sig.ma will demonstrate how the Web of Data is likely to contain surprising structured information about it (pages that embed RDF, RDFa, Microdata, Microformats) By using the Sindice search engine Sig.ma allows a [...]

  • Sindice, its startup company and 12 billion+ live triples SPARQL endpoint

    By Giovanni Tummarello with 3 comments.

    Today we’re happy to share two big news: 1) We’re launching today our startup company set to drive commercialization of Sindice services. See press release below. Hurray! 2) We’re making public our main Sindice SPARQL endpoint, containing the entire Sindice dataset. This currently indexes over 12 billion triples and is live updated as data comes [...]

  • Sindice reindexed: find your datasets (much faster)

    By Giovanni Tummarello with no comments.

    Having streamlined several procedures inside Sindice, rebuilding the sindice index from scratch now takes just a few hours. Over the weekend, we built a new Sindice index based on the latest updates of Siren and improvements to the pipelines. This is now in production and sports the following enhancements: Ranking no more big docs first [...]

  • Searching infinite amounts of Web Data: The new Sindice Index and Frontend

    By Giovanni Tummarello with 3 comments.

    Several goals have kept the the Sindice Team constantly busy in the past year or so. Luckly we’re now getting close to their deployment and today we’re happy to begin by introducing SIREn, the new Sindice core index,  its supporting new frontend and the API.  SIREn: Sindice’s own semantic search engine SIREn (Semantic Information Retrieval [...]

  • Sindice migration

    By smulcahy with no comments.

    This is mainly a test post to verify that the Sindice blog continues to work after migrating it to a new server. But it is also a good opportunity to briefly mention the upgrades we are making to the Sindice infrastructure. I’m happy to report that Sindice has been suffering from some growing pains over [...]

  • Sindice now supports Efficient Data discovery and Sync

    By Giovanni Tummarello with 1 comment.

    So far semantic web search engines and semantic aggregation services have been inserting datasets by hand or have been based on “random walk” like crawls with no data completeness or freshness guarantees. After quite some work, we are happy to announce that Sindice is now supporting effective large scale data acquisition with *efficient syncing* capabilities based on [...]

  • Sindice planned downtime this weekend

    By smulcahy with no comments.

    Hi. Due to an expansion of one of our datacentres (and the electrical work that this implies), Sindice and related services such as sig.ma will be down from 1730 GMT+1, 11-Jun-2010 (Friday) to 1730 GMT+1, 12-Jun-2010 (Saturday). This major upgrade will give us increased room to grow the Sindice infrastructure over time. On 27-May-2010 we [...]

  • Any23 v0.4.0 Released

    By micmos with no comments.

    Dear All, the Sindice FBK team is proud to announce the Any23 0.4.0 release. In this new release we paid particular attention in data validation and correction, in  particular  we can claim  to extract the  Open Graph Protocol[1]  metadata also whether affected by syntactical errors[2]. We’ve also added full support for the N-Quads[3] format. As [...]

  • Any23 v0.3.0 Released

    By micmos with no comments.

    Dear All, we’re pleased to announce the Any23 0.3.0 release. Please keep in mind this is a beta, so everybody using Any23 in a development session is invited to migrate to this latest version and report in our issue tracker [1] any eventual bug. As usual we have a live demo running at [2], please feel [...]

  • Any23 v0.2 Released

    By micmos with no comments.

    We are proud to announce a new release of Any23 – Anything to Triples http://developers.any23.org/ Any23 is a Java library that parses RDF from a variety of Web document formats. The currently supported input formats are RDFa, RDF/XML, Turtle, N3, N-Triples, and a number of Microformats. Any23 is an Open Source project originated from the code created within [...]