Publications
2012
Flexible and Efficient Distributed Resolution of Large Entities
Published in:
Seventh International Symposium on Foundations of Information and Knowledge Systems (FoIKS) March 5-9, 2012. Kiel, Germany
2011
Efficient Multi-Start Strategies for Local Search Algorithms
Published in:
Journal of Artificial Intelligence Research, Volume 41, pages 407-444
Affordable Supercomputing for Data Mining Applications
Published in:
Procedia Computer Science, Volume 7, 2011, Pages 136-138. Proceedings of the 2nd European Future Technologies Conference and Exhibition 2011 (FET 11). Elsevier
Temporal analysis for web spam detection: An overview
Published in:
Proc. TWAW in conjunction with WWW 2011. CEUR Workshop Proceedings. 8 pages
City Sentinel VAST 2011 Mini Challenge 1 Award: Outstanding Integration of Computational and Visual Methods
Published in:
IEEE VAST 2011 Symposium, part of VisWeek 2011. 2 pages.
Web spam classification: a few features worth more
Published in:
WebQuality '11
Entity Resolution with Heavy Indexing
Published in:
In Proceedings of the 2011 International Conference on Advances in Databases and Information Systems (ADBIS 2011), CEUR Workshop Proceedings
SZTAKI @ ImageCLEF 2011
Published in:
In Working Notes of the ImageCLEF 2011 Workshop at CLEF 2011 Conference, Amsterdam, The Netherlands
Longitudinal Analytics on Web Archive Data: It's About Time!
Published in:
5th Biennial Conference on Innovative Data Systems Research
Infrastructures and Bounds for Distributed Entity Resolution
Published in:
In Proceedings of the 9th International Workshop on Quality in Databases, in conjunction with VLDB 2011 (QDB 2011).
2010
SZTAKI @ TREC 2010
Published in:
TREC 2010 Working Notes
Interest point and segmentation-based photo annotation
Published in:
CLEF 2009 workshop. Multilingual information access evaluation II. Multimedia experiments. Corfu, 2009. (Lecture notes in computer science 6242.) (Pages 340-347.)
Geographically organized small communities and the hardness of clustering social networks
Published in:
Data mining for social network data. (Annals of information systems 12.) (Pages 177-199.)
SZTAKI @ TRECVID 2010
Published in:
TRECVID 2010 Working Notes
SZTAKI @ ImageCLEF 2010
Published in:
CLEF 2010. Conference on multilingual and multimodal information access evaluation. Notebook Papers of CLEF 2010 LABs and workshops. Padua, 2010. (Pages 1-4.)
An efficient block model for clustering sparse graphs
Published in:
MLG 2010. Proceedings of the 8th workshop on mining and learning with graphs, in conjunction with SIGKDD 2010. Washington, 2010. (Pages 62-69.)
2009
Kapcsolatok és távolságok: a hazai vezetékes hívás-szokások elemzése (Connections and Distances: An Analysis of Domestic Landline Calling Habits, in Hungarian)
Published in:
Magyar Tudomány (Volume 170, Issue 6, Pages 697-706.)
SZTAKI @ ImageCLEF 2009
Published in:
10th Workshop of the Cross-Language Evaluation Forum, CLEF 2009
SZTAKI @ TRECVID 2009
Published in:
TRECVID 2009. TREC video retrieval evaluation. Working Notes.
Web Spam Challenge Proposal for Filtering in Archives
Published in:
In Proceedings of the 5th International Workshop on Adversarial Information Retrieval on the Web (AIRWeb)
Web Spam Filtering in Internet Archives
Published in:
In Proceedings of the 5th International Workshop on Adversarial Information Retrieval on the Web (AIRWeb)
Linked Latent Dirichlet Allocation in Web Spam Filtering
Published in:
In Proceedings of the 5th International Workshop on Adversarial Information Retrieval on the Web (AIRWeb)
2008
Overview of the ImageCLEF 2007 Object Retrieval Task
Published in:
In: CLEF 2007 proceedings, Lecture Notes in Computer Science Volume 5152, Springer (2008)
Telephone Call Network Data Mining: A Survey with Experiments (Chapter 12)
Published in:
in: Handbook of Large-Scale Random Networks, Bolyai Society Mathematical Studies, Vol. 18. eds: B. Bollobás, R. Kozma, D. Miklós, Springer
Latent Dirichlet Allocation in Web Spam Filtering
Published in:
in Proc. Airweb 2008 in conjunction with WWW 2008
Annotating documents by Wikipedia concepts
Published in:
Web Intelligence 2008
Web Spam: a Survey with Vision for the Archivist
Published in:
In Proc. IWAW 2008.
Increasing cluster recall of cross-modal image retrieval
Published in:
In Working Notes of the 2008 CLEF Workshop, Aarhus, Denmark, Sept. 2008.
Web Spam Hunting @ Budapest
Published in:
in Proc. Airweb 2008 in conjunction with WWW 2008
A Comparative Analysis of Latent Variable Models for Web Page Classification
Published in:
In Proc. LA-Web 2008.
Large-Scale Principal Component Analysis on LiveJournal Friends Network
Published in:
In Proc. Workshop on Social Network Mining and Analysis, held in conjunction with the 13th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD 2008), August 24-27, 2008, Las Vegas, NV
SZTAKI @ ImageCLEF 2008 Visual Concept Detection
Published in:
in Working Notes of the 2008 CLEF Workshop, Aarhus, Denmark, Sept. 2008.
Multimodal Retrieval by Text--Segment Biclustering
Published in:
In: CLEF 2007 proceedings, Lecture Notes in Computer Science Volume 5152, Springer (2008)
2007
Performing cross-language retrieval with wikipedia
Published in:
CLEF 2007 workshop. Cross language system evaluation campaign. Budapest, 2007. (16 pages, CD.) Eds. Nardi, A.; Peters, C.; Quochi, V.
Cross-modal retrieval by text and image feature biclustering
Published in:
CLEF 2007 workshop. Cross language system evaluation campaign. Budapest, 2007. (8 pages, CD.) Eds. Nardi, A.; Peters, C.; Quochi, V.
Spectral Clustering in Telephone Call Graphs
Published in:
in Proc. WebKDD/SNAKDD Workshop 2007 in conjunction with KDD 2007.
Who Rated What: a combination of SVD, correlation and frequent sequence mining
Published in:
in Proc. KDD Cup and Workshop 2007 in conjunction with KDD 2007.
Cross-Language Retrieval with Wikipedia
Published in:
In: CLEF 2007 proceedings, Lecture Notes in Computer Science Volume 5152, Springer (2008)
Impact of non-Poissonian activity patterns on spreading processes
Published in:
Phys. Rev. Lett. 98. (15), 158702
Web Spam Detection via Commercial Intent Analysis
Published in:
in Proc. Airweb 2007 in conjunction with WWW 2007.
Methods for large scale SVD with missing values
Published in:
in Proc. KDD Cup and Workshop 2007 in conjunction with KDD 2007.
Semi-Supervised Learning: A Comparative Study for Web Spam and Telephone User Churn
Published in:
in Proc. Graph Labelling Workshop and Web Spam Challenge 2007 in conjunction with ECML/PKDD 2007.
2006
Link-Based Similarity Search to Fight Web Spam
Published in:
in Proc. Airweb 2006 in conjunction with SIGIR 2006
Detecting Nepotistic Links by Language Model Disagreement
Published in:
In Proc. WWW2006, poster section
Identifying Document Topics Using the Wikipedia Category Network
Published in:
Web Intelligence 2006: 456-462
Sociodemographic Exploration of Telecom Communities
Published in:
NSF US-Hungarian Workshop on Large Scale Random Graphs Methods for Modeling Mesoscopic Behavior in Biological and Physical Systems, 2006.
Shaping SQL-Based Frequent Pattern Mining Algorithms
Published in:
In: Knowledge Discovery in Inductive Databases (KDID'05), 2006 Springer, LNCS 3933, 188-201.
Exploiting extremely rare features in text categorization
Published in:
In Proc. ECML 2006. 759-766
Improved Approximation Algorithms for Large Matrices via Random Projections
Published in:
In Proc. 47th FOCS, 2006.
Two-Phase Data Warehouse Optimized for Data Mining
Published in:
In Proc. BIRTE workshop 2006.
The Dynamics of Information Access on the Web
Published in:
Phys. Rev. E 73, 066132 (2006)
To Randomize or Not To Randomize: Space Optimal Summaries for Hyperlink Analysis
Published in:
Technical Report, 2005. Short version in Proc. WWW2006.
PageRank és azon túl: Hiperhivatkozások szerepe a keresésben (PageRank and Beyond: The Role of Hyperlinks in Search, in Hungarian)
Published in:
Magyar Tudomány, pp. 1325-1331, November 2006.
2005
On the Feasibility of Low-rank Approximation for Personalized PageRank
Published in:
WWW2005, Poster Section.
Shaping SQL-Based Frequent Pattern Mining Algorithms
Published in:
In Proc. KDID 2005 in conjunction with ECML/PKDD2005.
Feature selection based on word-sentence relation
Published in:
In Proc. ICMLA 2005.
Scaling Link-based Similarity Search
Published in:
Technical Report, 2004. Short version: WWW 2005
SpamRank -- Fully Automatic Link Spam Detection
Published in:
AIRWeb'05 in conjunction with WWW 2005
Architecture for mining massive web logs with experiments
Published in:
In Proc. HUBUSKA: Open Workshop on Generic Issues of Knowledge Technologies, 2005.
2004
Magyar nyelvű tartalom a világhálón (Hungarian-Language Content on the World Wide Web, in Hungarian)
Published in:
Információs társadalom internet információtechnika. (Research report 26) (Pages 48-55)
Towards Scaling Fully Personalized PageRank
Published in:
Technical Report, 2004.
Towards Scaling Fully Personalized PageRank
Published in:
WAW 2004, a workshop of FOCS 2004. Appeared in LNCS 3243/2004, Springer Verlag.
A Scalable Randomized Method to Compute Link-Based Similarity Rank on the Web Graph
Published in:
ClustWeb, a workshop of EDBT 2004. Appeared in LNCS 3268, Springer Verlag.
2003
Deformable Polygon Representation and Near-Mincuts
Published in:
Building Bridges Between Mathematics and Computer Science, to be published by Springer Verlag in conjunction with the Bolyai Mathematical Society of Budapest
Formal description of a distributed location service for mobile ad hoc networks
Published in:
Lecture Notes in Computer Science (Volume 2589, Pages 204-217)
A magyar web (The Hungarian Web, in Hungarian)
Published in:
Híradástechnika 2003/3 (No. 58)
Where to Start Browsing the Web?
Published in:
I2CS, 2003. Appeared in LNCS 2877/2003, Springer Verlag.
Searching a small national domain -- a preliminary report
Published in:
WWW2003, poster section. Poster available in PowerPoint format.
Keresés a Világhálón (Searching the World Wide Web, in Hungarian)
Published in:
Híradástechnika 2003/3 (No. 58)
Algorithms on the Web Graph
Published in:
3rd Hungarian-Japanese Symposium on Discrete Mathematics and Its Applications, Tokyo, 2003.
2002
Szinguláris felbontás és alkalmazásai (Singular Value Decomposition and Its Applications, in Hungarian)
Published in:
MTA SZTAKI Technical Report, 2002.