Downloads

ECML/PKDD 2010 Discovery Challenge Data Set

The Web Quality datasets on this site are provided to advance research on Web document classification. The labels are intended for research purposes only. We advise against using them directly for search engine ranking or filtering.

Wimmut: searching and navigating Wikipedia

We have developed a Java application with a user-friendly graphical interface for searching Wikipedia content and navigating the network of pages. The client Java application is available for download.

MCMC for metabolic networks

In our model, the evolution of a metabolic network is characterized by the gain and loss of reactions, each connecting two metabolites, and can be described as a discrete-space, continuous-time Markov process.
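To make the idea of a discrete-space, continuous-time Markov process concrete, the following is a minimal simulation sketch. It is not the downloadable MCMC code: the function name `simulate_network`, the uniform per-reaction `gain_rate` and `loss_rate`, and the representation of a reaction as a pair of metabolite names are all assumptions made for illustration.

```python
import random


def simulate_network(reactions, candidates, gain_rate, loss_rate, t_end, rng):
    """Simulate gain and loss of reactions up to time t_end.

    Assumption: every absent candidate reaction is gained at rate gain_rate
    and every present reaction is lost at rate loss_rate.
    Returns a list of (time, frozenset_of_present_reactions) pairs.
    """
    present = set(reactions)
    absent = set(candidates) - present
    t = 0.0
    history = [(t, frozenset(present))]
    while True:
        gain_total = gain_rate * len(absent)
        loss_total = loss_rate * len(present)
        total = gain_total + loss_total
        if total == 0:
            break  # no event can happen any more
        # The waiting time to the next event is exponentially distributed.
        t += rng.expovariate(total)
        if t > t_end:
            break
        # Choose the event type in proportion to its total rate.
        if rng.random() < gain_total / total:
            r = rng.choice(sorted(absent))
            absent.remove(r)
            present.add(r)
        else:
            r = rng.choice(sorted(present))
            present.remove(r)
            absent.add(r)
        history.append((t, frozenset(present)))
    return history
```

A call such as `simulate_network([("A", "B")], [("A", "B"), ("B", "C")], 1.0, 1.0, 5.0, random.Random(0))` returns one trajectory in which consecutive states differ by exactly one reaction.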

LiveJournal data

The data set is intended for research purposes only and is freely available under the Creative Commons Attribution-Noncommercial-Share Alike 3.0 license, which in short states that you are free to use the labels and that we make no warranties about them. You can download and use the data for research at any institution, public or private.

Temporal Features for Web Spam Detection

This data set contains temporal features for Web spam detection, calculated from monthly snapshots of the .uk domain taken between October 2006 and May 2007.

Reticular Alignment: efficient iterative sequence alignment

The Reticular Alignment module implements an efficient iterative sequence alignment algorithm. Unlike earlier corner-cutting methods, it does not select a convex part of the dynamic programming table. Instead, it selects an alignment network that contains not only the optimal path but also the best suboptimal paths.
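The notion of an alignment network holding the optimal and near-optimal paths can be illustrated with a small sketch. This is not the Reticular Alignment implementation: it is a generic forward/backward dynamic programming pass for pairwise global alignment with a linear gap penalty, and the function name `alignment_network`, the scoring values, and the `threshold` parameter are assumptions chosen for illustration.

```python
def alignment_network(a, b, threshold, match=1, mismatch=-1, gap=-1):
    """Return (best_score, cells): the DP cells lying on at least one
    global alignment path whose score is within threshold of the optimum."""

    def fill(x, y):
        # Standard global alignment table with a linear gap penalty.
        table = [[0] * (len(y) + 1) for _ in range(len(x) + 1)]
        for i in range(1, len(x) + 1):
            table[i][0] = i * gap
        for j in range(1, len(y) + 1):
            table[0][j] = j * gap
        for i in range(1, len(x) + 1):
            for j in range(1, len(y) + 1):
                s = match if x[i - 1] == y[j - 1] else mismatch
                table[i][j] = max(table[i - 1][j - 1] + s,
                                  table[i - 1][j] + gap,
                                  table[i][j - 1] + gap)
        return table

    n, m = len(a), len(b)
    fwd = fill(a, b)              # best score of aligning a[:i] with b[:j]
    rev = fill(a[::-1], b[::-1])  # reversed inputs give suffix scores
    best = fwd[n][m]
    cells = set()
    for i in range(n + 1):
        for j in range(m + 1):
            # Best score of any full path passing through cell (i, j).
            through = fwd[i][j] + rev[n - i][m - j]
            if through >= best - threshold:
                cells.add((i, j))
    return best, cells
```

With `threshold=0` only cells on optimal paths are kept. Raising the threshold widens the selected region to include suboptimal paths, and that region need not be convex.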