Paper List (Tentative)

Date

Topic and Paper

Presenter

Nov 19, 2014 Locality Sensitive Hashing and Its Applications to Data Mining
A. Andoni and P. Indyk, “Near-optimal hashing algorithms for approximate nearest neighbor in high dimensions,” Comm. ACM 51:1, pp. 117–122, 2008.
Kemal Gürkan Toker
Clustering
Zhang, C.; Ouyang, D. & Ning, J. “An artificial bee colony approach for clustering”, Expert Systems with Applications, 2010, 37, 4761-4767
Selim Yılmaz
Dimensionality Reduction
Bolón-Canedo, Verónica and Sánchez-Maroño, Noelia and Alonso-Betanzos, Amparo, A review of feature selection methods on synthetic data, Knowledge and Information Systems, Springer, Volume 34, Issue 3, 2013, Pages 483-519, ISSN 0219-1377, http://dx.doi.org/10.1007/s10115-012-0487-8
Ahmet Nasser
MapReduce and Other Efficient Structures for Big Data
Özlem Gürses
Nov 26, 2014 MapReduce and Other Efficient Structures for Big Data
J. Dean and S. Ghemawat, “MapReduce: simplified data processing on large clusters,” Comm. ACM 51:1, pp. 107–113, 2008.
Osman Alper Öcal
Association Rule Mining
Huan Wu, Zhigang Lu, Lin Pan, Rongsheng Xu, Wenbao Jiang, "An Improved Apriori-based Algorithm for Association Rules Mining", Sixth International Conference on Fuzzy Systems and Knowledge Discovery, 2009.
Çağdaş Bak
Association Rule Mining
Liu G., Zhang H., and Wong L. "Finding Minimum Representative Pattern Sets" in KDD'12 Proceedings of the 18th ACM SIGKDD international conference on Knowledge discovery and data mining Pages 51-59.
Volkan Ulutaş
Mining Sequences and Temporal Data
Y. Matsubara, Y. Sakurai, C. Faloutsos, T. Iwata, and M. Yoshikawa. "Fast Mining and Forecasting of Complex Time-Stamped Events". KDD’12, August 12–16, 2012.
Melike Aytürk
Dec 3, 2014 Mining Sequences and Temporal Data
Yunfu Yin, Zhigang Zheng, and Longbing Cao. 2012. USpan: an efficient algorithm for mining high utility sequential patterns. In Proceedings of the 18th ACM SIGKDD international conference on Knowledge discovery and data mining (KDD '12).
Emre Kağan Akkaya
Mining Data Streams
Elfeky, M.G., Aref, W.G. and Elmagarmid, A.K., "STAGGER: Periodicity Mining of Data Streams Using Expanding Sliding Windows," ICDM '06, Sixth International Conference on Data Mining, Hong Kong, 2006, pp. 188-199, Print ISSN: 1550-4786
Rima Türker
Recommender Systems
Cong Yu; Lakshmanan, L.V.S.; Amer-Yahia, S., "Recommendation Diversification Using Explanations," Data Engineering, 2009. ICDE '09. IEEE 25th International Conference on, pp. 1299-1302, March 29 2009-April 2 2009, doi: 10.1109/ICDE.2009.225
Merve Şimşek
Recommender Systems
Badrul Sarwar, George Karypis, Joseph Konstan, and John Riedl, "Item-based Collaborative Filtering Recommendation Algorithms", WWW10, 2001.
Sinan Göker
Dec 10, 2014 Large Scale Data Mining
U Kang, Duen Horng Chau, and Christos Faloutsos. 2011. Mining large graphs: Algorithms, inference, and discoveries. In Proceedings of the 2011 IEEE 27th International Conference on Data Engineering (ICDE '11)
Emre Gürbüz
Link Analysis
Z. Gyöngyi, H. Garcia-Molina, and J. Pedersen, “Combating web spam with TrustRank,” Proc. 30th Intl. Conf. on Very Large Databases, pp. 576–587, 2004
Nihat Tekeli
Mining Social Network Graphs
The Louvain method (BGLL algorithm): P. Chaturvedi, M. Dhara, and D. Arora, "Community Detection in Complex Network via BGLL Algorithm", International Journal of Computer Applications, Volume 48, No. 1, 2012.
A. Melih Akça
Mining Social Network Graphs
Nitai B. Silva, Ing-Ren Tsang, George D.C. Cavalcanti, and Ing-Jyh Tsang, "A Graph-Based Friend Recommendation System Using Genetic Algorithm", Evolutionary Computation (CEC), 2010 IEEE Congress on, pp. 1-7, Barcelona, 18-23 July 2010
Ümit Uşnar
Anomaly Detection
Kazimierz Choroś, "Real anomaly detection in telecommunication multidimensional data using data mining techniques", ICCCI'10 Proceedings of the Second international conference on Computational collective intelligence: technologies and applications, Volume Part I, pp. 11-19
Emre Kurt
Dec 17, 2014 Opinion Mining
Cambria, E., Schuller, B., Xia, Y., & Havasi, C. (2013). New avenues in opinion mining and sentiment analysis. IEEE Intelligent Systems, 1.
Yasin Eşref
Opinion Mining
Calais Guerra, Pedro Henrique, et al. "From bias to opinion: a transfer-learning approach to real-time sentiment analysis." Proceedings of the 17th ACM SIGKDD international conference on Knowledge discovery and data mining. ACM, 2011.
Ahmet Çağrı Şimşek
Privacy Preserving Data Mining
Aggarwal, Charu C. and Yu, Philip S., "A general survey of privacy-preserving data mining models and algorithms", 2008
Ahmet Ömercioğlu
Privacy Preserving Data Mining
Rakesh Agrawal, Ramakrishnan Srikant, Privacy Preserving Data Mining, IBM Almaden Research Center, Lecture Notes in Computer Science Volume 2992, pp 183-199, 2004
Kevser Karakurt
Dec 24, 2014 Advertising on the Web
Ahmet Yeniçağ
Other Data Mining Applications (Tweet Mining)
Chenliang Li, Aixin Sun, and Anwitaman Datta. 2012. Twevent: segment-based event detection from tweets. In Proceedings of the 21st ACM international conference on Information and knowledge management (CIKM '12). ACM, New York, NY, USA, 155-164.
Hayrettin Erdem
Other Data Mining Applications (Document Mining)
Ian H. Witten, Gordon W. Paynter, Eibe Frank, Carl Gutwin, Craig G. Nevill-Manning, “KEA: practical automatic keyphrase extraction”, The Fourth ACM Conference, pp. 254-255, 1999.
Sinan Polat
Other Data Mining Applications (Image Mining)
Gunhee Kim, Christos Faloutsos, and Martial Hebert. Unsupervised Modeling of Object Categories Using Link Analysis Techniques. Conference on Computer Vision and Pattern Recognition (CVPR 2008). Anchorage, USA, June 24-26, 2008.
Berkan Demirel
Other Data Mining Applications

List of example papers to present

Here are some example candidate papers to present.
  • MapReduce
    • J. Dean and S. Ghemawat, “MapReduce: simplified data processing on large clusters,” Comm. ACM 51:1, pp. 107–113, 2008.
  • Locality Sensitive Hashing
    • A. Andoni and P. Indyk, “Near-optimal hashing algorithms for approximate nearest neighbor in high dimensions,” Comm. ACM 51:1, pp. 117–122, 2008.
  • Link Analysis
    • Z. Gyöngyi, H. Garcia-Molina, and J. Pedersen, “Combating web spam with TrustRank,” Proc. 30th Intl. Conf. on Very Large Databases, pp. 576–587, 2004.
  • Advertising on the Web
    • Mehta, A.; Saberi, A.; Vazirani, U.; Vazirani, V., "AdWords and generalized on-line matching," Foundations of Computer Science, 2005. FOCS 2005. 46th Annual IEEE Symposium on, pp. 264-273, 23-25 Oct. 2005