Selected Publications List of papers with Search Labs affiliation 2008 - Atish Das Sarma, Sreenivas Gollapudi, and Samuel Ieong. Bypass Rates: Reducing Query Abandonment using Negative Inferences. In KDD 2008, Las Vegas, Nevada.
- Ariel Fuxman, Panayiotis Tsaparas, Kannan Achan, Rakesh Agrawal. Using the Wisdom of the Crowds for Keyword Generation. In WWW 2008, Beijing, China.
- Atish Das Sarma, Sreenivas Gollapudi, and Rina Panigrahy. Estimating PageRank on Graph Streams. In PODS 2008, Vancouver, Canada (Best Paper Award).
- Sreenivas Gollapudi and Rina Panigrahy. The Power of Two Min-Hashes in Similarity Search among Hierarchical Data Objects. In PODS 2008, Vancouver, Canada.
2007 - Atish Das Sarma, Deeparnab Chakrabarty, and Sreenivas Gollapudi. Public Advertisement Broker Markets. In WINE ’07: 558-563.
- Rakesh Agrawal, Tyrone Grandison, Christopher Johnson, Jerry Kiernan: "Enabling 21st Century Healthcare IT Revolution", Communications of the ACM (CACM), 20(2), February 2007.
- Rakesh Agrawal, Alexandre Evfimievski, Jerry Kiernan, Raja Velu: "Auditing Disclosure by Relevance Ranking", 26th ACM SIGMOD Int'l Conference on Management of Data, Beijing, China, June 2007.
- Stelios Paparizos, Jignesh M. Patel and H. V. Jagadish. SIGOPT : Using schema to optimize XML query processing. In Proc. ICDE Conf., Istanbul, Turkey, Apr. 2007.
- Ntoulas, J. Cho. Pruning Policies for Two-Tiered Inverted Index with Correctness Guarantee. In Proceedings of the ACM International Information Retrieval (SIGIR) Conference, 2007, Amsterdam, Netherlands.
- D. Kukulenz, A. Ntoulas. Answering Bounded Continuous Search Queries in the World Wide Web. In Proceedings of the World Wide Web (WWW) Conference, 2007, Banff, Canada.
- Rakesh Agrawal. Humane Data Mining. World Wide Web (WWW) Conference, 2007, Banff, Canada (Invited Talk).
- P. Ipeirotis, A. Ntoulas, J. Cho, and L. Gravano. Modeling and Managing Content Changes in Text Databases. ACM Transactions on Database Systems (TODS), vol. 32, no. 3, September 2007.
- A. Gionis, H. Mannila, P. Tsaparas, Clustering Aggregation. ACM Transactions on Knowledge Discovery from Data (TKDD), 2007.
- A. Gionis, H. Mannila, T. Mielikäinen, P. Tsaparas. Assessing data mining results via swap randomization. ACM Transactions on Knowledge Discovery from Data (TKDD), 2007.
2006 - Rakesh Agrawal, Ralf Rantzau, Evimaria Terzi: "Context-Sensitive Ranking", 25th ACM SIGMOD Int'l Conf. On Management of Data, June 2006.
- Christopher Johnson, Rakesh Agrawal: "Intersections of Law and Technology in Balancing Privacy Rights with Free Information Flow", 4th IASTED International Conference on Law and Technology, Cambridge, MA, Oct. 2006
- Sreenivas Gollapudi and Rina Panigrahy. Exploiting Asymmetry for Hierarchical Topic Extraction. In CIKM '06, Arlington, VA, Nov. 2006.
- Sreenivas Gollapudi and Rina Panigrahy. A Dictionary for Approximate String Search and Longest Prefix Match. In CIKM '06, Arlington, VA. 2006.
. List of papers without Search Labs affiliation 2007 - Ariel Fuxman, Renée J. Miller: First-order query rewriting for inconsistent databases. Journal of Computer and System Sciences (JCSS) 73(4): 610-635 (2007)
2006 - Stelios Paparizos and H.V. Jagadish. The Importance of Algebra for XML Query Processing. In EDBT '06 Workshop on XML Data Management (DataX'06) (invited), Published in Springer-Verlag, Lecture Notes in Computer Science,Volume 4254 (2006) pp126-135.
- Sreenivas Gollapudi, Ravi Kumar, and D. Sivakumar. Programmable Clustering. In PODS '06, Chicago, IL, 2006.
- Periklis Andritsos, Ariel Fuxman, Renée J. Miller: Clean Answers over Dirty Databases: A Probabilistic Approach. ICDE 2006
- Ariel Fuxman, Mauricio A. Hernández, C. T. Howard Ho, Renée J. Miller, Paolo Papotti, Lucian Popa: Nested Mappings: Schema Mapping Reloaded. VLDB 2006: 67-78
- Ariel Fuxman, Phokion G. Kolaitis, Renée J. Miller, Wang Chiew Tan: Peer data exchange. ACM Transactions on Database Systems (TODS) 31(4): 1454-1498 (2006)
- Jennifer L. Beckmann, Alan Halverson, Rajasekar Krishnamurthy, and Jeffrey F. Naughton. Extending RDBMSs To Support Sparse Datasets Using An Interpreted Attribute Storage Format. To appear in Proceedings of the 22nd International Conference on Data Engineering, (ICDE), Atlanta, Georgia, April 2006.
- Alan Halverson, Jennifer L. Beckmann, Jeffrey F. Naughton, and David J. DeWitt. A Comparison of C-Store and Row-Store in a Common Framework. Technical Report #1570, Computer Sciences Department, University of Wisconsin-Madison. 2006.
- Ntoulas, M. Najork, M. Manasse, D. Fetterly. Detecting Spam Web Pages through Content Analysis. In Proceedings of the World Wide Web (WWW) Conference, 2006, Edinburgh, Scotland.
- S. Stamou, A. Ntoulas, V. Krikos, P. Kokosis, D. Christodoulakis. Classifying Web Data in Directory Structures. In Proceedings of the 8th Asia Pacific Web Conference (APWeb), January 16-18, Harbin, China, LNCS 3841, Springer-Verlag, pp. 238-249, 2006.
- P. Tsaparas, L. Marino-Ramirez, O. Bodenreider, E. V. Koonin, and I. K. Jordan Global similarity and local divergence in human and mouse gene co-expression networks , BMC Evolutionary Biology 2006, 6:70 .
- T. Mielikäinen, E. Terzi, P. Tsaparas, Aggregating Time Partitions, KDD, Philadelphia, 2006.
- Gionis, H. Mannila, T. Mielikäinen, P. Tsaparas, Assessing data mining results via swap randomization, KDD, Philadelphia, 2006. (Best paper award runner-up)
- E. Terzi, P. Tsaparas, Efficient algorithms for sequence segmentation, SIAM Conference on Data Mining (SDM), 2006.
- K. Kenthapadi and R. Panigrahy. Balanced Allocation on Graphs. In Proceedings of the ACM-SIAM Symposium on Discrete Algorithms (SODA), 2006.
- Dwork, K. Kenthapadi, F. McSherry, I. Mironov, and M. Naor. Our Data, Ourselves: Privacy via Distributed Noise Generation. In Proceedings of EUROCRYPT, 2006.
- S.U. Nabar, B. Marthi, K. Kenthapadi, N. Mishra, and R. Motwani. Towards Robustness in Query Auditing. In Proceedings of the International Conference on Very Large Data Bases (VLDB), 2006.
- G. Aggarwal, T. Feder, K. Kenthapadi, S. Khuller, R. Panigrahy, D. Thomas, and A. Zhu. Achieving Anonymity via Clustering. In Proceedings of the ACM Symposium on Principles of Database Systems (PODS), 2006.
2005 - Stelios Paparizos and H.V. Jagadish. Pattern tree algebras: sets or sequences? In Proc. VLDB Conf., Trondheim, Norway, Sep. 2005.
- Sreenivas Gollapudi and D. Sivakumar. Exploting Anarchy in Networks: A Game-Theoretic Approach to Combining Throughput and Fairness. In INFOCOM '05., Miami, FL, 2005.
- Ariel Fuxman, Renée J. Miller: First-Order Query Rewriting for Inconsistent Databases. ICDT 2005: 337-351
- Ariel Fuxman, Phokion G. Kolaitis, Renée J. Miller, Wang Chiew Tan: Peer data exchange. PODS 2005: 160-171
- Ariel Fuxman, Elham Fazli, Renée J. Miller: ConQuer: Efficient Management of Inconsistent Databases. SIGMOD Conference 2005: 155-166
- Ariel Fuxman, Diego Fuxman, Renée J. Miller: ConQuer: A System for Efficient Querying Over Inconsistent Databases. VLDB 2005: 1354-1357. Demonstration.
- P. Ipeirotis, A. Ntoulas, J. Cho, L. Gravano. Modeling and Managing Content Changes in Text Databases. (Best Paper Award) In Proceedings of the IEEE International Conference on Data Engineering (ICDE), 2005, Tokyo, Japan.
- Ntoulas, P. Zerfos, J. Cho. Downloading Textual Hidden Web Content through Keyword Queries. In Proceedings of the Joint Conference on Digital Libraries (JCDL), 2005, Denver, USA.
- Ntoulas, G. Chao, J. Cho. The Infocious Web Search Engine: Improving Web Searching through Linguistic Analysis. In Proceedings of the World Wide Web (WWW) Conference, 2005, Chiba, Japan.
- S. Stamou, V. Krikos, P. Kokosis, A. Ntoulas, D. Christodoulakis. Web Directory Construction using Lexical Chains. In Proceedings of the International Conference on Applications of Natural Language to Databases (NLDB), 2005.
- V. Krikos, S. Stamou, A. Ntoulas, P. Kokosis, D. Christodoulakis. DirectoryRank: Ordering Pages in Web Directories. In Proceedings of the 7th ACM International Workshop on Web Information and Data Management (WIDM05), Bremen, Germany, 2005.
- Borodin, J. S. Rosenthal, G. O. Roberts, P. Tsaparas, Link Analysis Ranking: Algorithms, Theory and Experimets , ACM Transactions on Internet Technologies (TOIT), Vol 5, No 1, February 2005.
- S. Papadimitriou, A. Gionis, P. Tsaparas, A. Väisänen, H. Mannila, C. Faloutsos, Parameter-Free Spatial Data Mining Using MDL, 5th International Conference on Data Mining (ICDM) 2005
- F. Afrati, G. Das, A. Gionis, H. Mannila, T. Mielikäinen, P. Tsaparas, Mining chains of relations, 5th International Conference on Data Mining (ICDM) 2005
- Gionis, A. Hinnenburg, S. Papadimitriou, P. Tsaparas. Dimension Induced Clustering. KDD, Chicago, 2005
- D. Donato, S. Millozzi, S. Leonardi, P. Tsaparas. Mining the Inner Structure of the Web Graph. WebDB workshop, Baltimore, 2005
- D. Donato, S. Leonardi, P. Tsaparas. Stability and Similarity of Link Analysis Ranking Algorithms. ICALP, Lisbon, Portugal, 2005.
- Gionis, H. Mannila, P.Tsaparas, Clustering Aggregation, ICDE, Japan, Tokyo, 2005
- K. Kenthapadi, N. Mishra, and K. Nissim. Simulatable Auditing. In Proceedings of the ACM Symposium on Principles of Database Systems (PODS), 2005.
- K. Kenthapadi and G.S. Manku. Decentralized Algorithms using both Local and Random Probes for P2P Load Balancing. In Proceedings of the ACM Symposium on Parallel Algorithms and Architectures (SPAA), 2005.
- G. Aggarwal, T. Feder, K. Kenthapadi, R. Motwani, R. Panigrahy, D. Thomas, and A. Zhu. Approximation Algorithms for k-Anonymity. Journal of Privacy Technology, 2005. Preliminary version: Anonymizing Tables. In Proceedings of the International Conference on Database Theory (ICDT), 2005.
- G. Aggarwal, M. Bawa, P. Ganesan, H. Garcia-Molina, K. Kenthapadi, et al. Two Can Keep a Secret: A Distributed Architecture for Secure Database Services. In Proceedings of the Conference on Innovative Data Systems Research (CIDR), 2005.
2004 - Stelios Paparizos, Yuqing Wu, Laks V.S. Lakshmanan and H.V. Jagadish. Tree Logical Classes for Efficient Evaluation of XQuery. In Proc. SIGMOD Conf., Paris, France, Jun. 2004.
- Sreenivas Gollapudi and D. Sivakumar. Framework and algorithms for trend analysis in massive temporal data sets. In CIKM '04, Arlington, VA, 2004.
- Sreenivas Gollapudi and D. Sivakumar. A mechanism for equitable bandwidth allocation under QoS and budget constraints. In IWQoS '04, Montreal, Canada, 2004.
- Sreenivas Gollapudi and D. Sivakumar. Data Stream Algorithms for Scalable Bandwidth Management. In ICC '04, Paris, France, 2004.
- Ariel Fuxman, Lin Liu, John Mylopoulos, Marco Roveri, Paolo Traverso: Specifying and analyzing early requirements in Tropos. Requir. Eng. 9(2): 132-150 (2004)
- Periklis Andritsos, Ariel Fuxman, Anastasios Kementsietsidis, Renée J. Miller, Yannis Velegrakis: Kanata: Adaptation and Evolution in Data Sharing Systems. SIGMOD Record 33(4): 32-37 (2004)
- Alan Halverson, Vanja Josifovski, Guy Lohman, Hamid Pirahesh and Mathias Moerschel. ROX: Relational Over XML. In Proceedings of the Thirtieth International Conference on Very Large Data Bases (VLDB), Toronto, Canada, September 2004.
- Ntoulas, J. Cho, C. Olston. What's New on the Web? The Evolution of the Web from a Search Engine Perspective. In Proceedings of the World Wide Web (WWW) Conference, 2004, New York, USA.
- P. Tsaparas, Using Non-Linear Dynamical Systems for Web Searching and Ranking , Principles of Database Systems (PODS), Paris, 2004
- P. Andritsos, R. J. Miller, P. Tsaparas, Information-Theoretic Tools for Mining Database Structure from Large Data Sets , SIGMOD, Paris, 2004
- P. Andritsos, P. Tsaparas, R. J. Miller, K. C. Sevcik. LIMBO: Scalable Clustering of Categorical Data , 9th International Conference on Extending DataBase Technology (EDBT), Heraklion, Greece, 2004.
- G. Aggarwal, M. Bawa, P. Ganesan, H. Garcia-Molina, K. Kenthapadi, et al. Enabling Privacy for the Paranoids. In Proceedings of the International Conference on Very Large Data Bases (VLDB), 2004.
- N. Gupta, P. Mittal, K. S. Patwardhan, S. Dutta Roy, S. Chaudhury, and S. Banerjee. On-line Predictive Appearance-based Tracking. IEEE International Conference on Image Processing (ICIP), Singapore 2004.
2003 - Zhimin Chen, H.V. Jagadish, Laks V.S. Lakshmanan and Stelios Paparizos. From Tree Patterns to Generalized Tree Patterns: On Efficient Evaluation of XQuery. In Proc. VLDB Conf., Berlin, Germany, Sep. 2003.
- Roger Barga, David Lomet, Stelios Paparizos, Haifeng Yu and Sirish Chandrasekaran. Persistent Applications via Automatic Recovery. In the 7th International Database Engineering and Application Symposium (IDEAS'03), Hong Kong, July 16-18, 2003
- Stelios Paparizos, Shurug Al-Khalifa, Adriane Chapman, H.V.Jagadish, Laks V.S. Lakshmanan, Andrew Nierman, Jignesh M. Patel, Divesh Srivastava, Nuwee Wiwatwattana, Y.Wu and C.Yu. TIMBER: A native system for querying XML. In Proc. SIGMOD Conf., San Diego, CA, Jun. 2003.
- Ariel Fuxman, Renée J. Miller: Towards Inconsistency Management in Data Integration Systems. IIWeb 2003: 143-148
- Ariel Fuxman, Lin Liu, Marco Pistore, Marco Roveri, John Mylopoulos: Specifying and Analyzing Early Requirements: Some Experimental Results. RE 2003: 105-
- Alan Halverson et al. Mixed Mode XML Query Processing. In Proceedings of the 29th International Conference on Very Large Data Bases (VLDB), Berlin, Germany, September 2003.
- P. Tsaparas, T. Palpanas, Y. Kotidis, N. Koudas, D. Srivastava. Ranked Join Indices, ICDE, Bangalore, India, March 2003.
2002 - H.V. Jagadish, Shurug Al-Khalifa, Adriane Chapman, Laks V.S. Lakshmanan, Andrew Nierman, Stelios Paparizos, Jignesh M. Patel, Divesh Srivastava, Nuwee Wiwatwattana, Y.Wu and C.Yu. TIMBER: A Native XML Database. The VLDB Journal, Volume 11 Issue 4 (2002) pp 274-291
- Stelios Paparizos, Shurug Al-Khalifa, H.V. Jagadish, Laks V.S. Lakshmanan, Andrew Nierman, Divesh Srivastava and Yuqing Wu. Grouping in XML. In EDBT '02 Workshop on XML Data Management (XMLDM'02), Published in Springer-Verlag, Lecture Notes in Computer Science, Volume 2490 (2002) pp128-147.
- Stelios Paparizos, Shurug Al-Khalifa, H.V.Jagadish, Andrew Nierman and Yuqing Wu. A Physical Algebra for XML. Technical Report, Univ. of Michigan, March 2002.
- Periklis Andritsos, Ronald Fagin, Ariel Fuxman, Laura M. Haas, Mauricio A. Hernández, C. T. Howard Ho, Anastasios Kementsietsidis, Renée J. Miller, Felix Naumann, Lucian Popa, Yannis Velegrakis, Charlotte Vilarem, Ling-Ling Yan: Schema Management. IEEE Data Eng. Bull. 25(3): 32-38 (2002)
- P. Raghavan, P. Tsaparas, Mining Significant Associations in Large Scale Text Corpora, International Conference on Data Mining (ICDM), Maebashi Japan, December 2002.
- J. Cho, A. Ntoulas. Effective Change Detection using Sampling. In Proceedings of the International Conference on Very Large Databases (VLDB), 2002, Hong Kong, China.
- S. Stamou, A. Ntoulas, M. Kyriakopoulou, D. Christodoulakis. Expanding EuroWordNet with Domain-Specific Terminology Using Common Lexical Resources: Vocabulary Completeness and Coverage Issues. In Proceedings of the 1st Global Wordnet Conference (GWC), January 21-25, Mysore, India, 2002.
- S. Stamou, A. Ntoulas, J. Hoppenbrouwers, M. Saiz-Noeda, D. Christodoulakis. EUROTERM: Extending the EuroWordNet with Domain-Specific Terminology Using an Expand Model Approach. In Proceedings of the 1st Global Wordnet Conference (GWC), January 21-25, Mysore, India, 2002.
- N. Gupta, P. Mittal, S. Dutta Roy, S. Chaudhury, S. Banerjee. CONDENSATION-based Predictive EigenTracking. IAPR-sponsored Indian Conference on Computer Vision, Graphics and Image Processing (ICVGIP), India 2002.
2001 - Aidong Zhang and Sreenivas Gollapudi. QoS Management in Educational Digital Library Environments. Multimedia Tools Appl. 10(2/3): 133-156 (2000).
- Ariel Fuxman, Paolo Giorgini, Manuel Kolp, John Mylopoulos: Information systems as social structures. FOIS 2001: 10-21
- Ariel Fuxman, John Mylopoulos, Marco Pistore, Paolo Traverso: Model Checking Early Requirements Specifications in Tropos. RE 2001: 174-181
- Michael J. Carey, Steve Kirsch, Mary Roth, Bert Van der Linden, Nicolas Adiba, Michael Blow, Daniela Florescu, David Li, Ivan Oprencak, Rajendra Panwar, Runping Qi, David Rieber, John C. Shafer, Brian Sterling, Tolga Urhan, Brian Vickery, Dan Wineman and Kuan Yee: The Propel Distributed Services Platform. VLDB 2001: 671-674
- Ntoulas, S. Stamou, M. Tzagarakis. Using a WWW Search Engine to Evaluate Normalization Performance for a Highly Inflectional Language. In Proceedings of the ACL/EACL Student Research Workshop, July 6-11, Toulouse, France, 2001.
- Borodin, J. S. Rosenthal, G. O. Roberts, P. Tsaparas, Finding Authorities and Hubs From Link Structures on the World Wide Web (extended version), 10th World Wide Web Conference, Hong Kong, 2001.
2000 - John Mylopoulos, Ariel Fuxman, Paolo Giorgini: From Entities and Relationships to Social Actors and Dependencies. ER 2000: 27-36
- John C. Shafer and Rakesh Agrawal: Continuous querying in database-centric Web applications. WWW 2000: 519-531 (2000)
- Ntoulas, S. Stamou, I. Tsakou, C. Tsalidis, M. Tzagarakis, A. Vagelatos. Use of a Morphosyntactic Lexicon for the Implementation of the Greek Wordnet. In Proceedings of the 2nd International Conference on Natural Language Processing (NLP), June 2-4, Greece, LNCS 1835, Springer-Verlag, pp. 49-56, 2000.
1999 - Erik Selberg. Towards Comprehensive Web Search. Ph. D. Thesis, University of Washington, June, 1999.
- Erik Selberg and Oren Etzioni. On the Instability of Web Search. In RIAO '00: Content-based Multimedia Access, Apr., 1999.
- Sreenivas Gollapudi and Aidong Zhang. Buffer Model and Management in Distributed MultimediPresentation Systems. Multimedia Syst. 6(3): 206-218 (1998).
1997 - Erik Selberg. The MetaCrawler Architecture for Resource Aggregation on the Web. IEEE Expert, Jan. / Feb. 1997, 12(1).
- John C. Shafer and Rakesh Agrawal: Parallel Algorithms for High-dimensional Similarity Joins for Data Mining Applications. VLDB 1997: 176-185
1996 - Sreenivas Gollapudi and Aidong Zhang. NetMedia: A Client-Server Distributed Multimedia Environment. In IW-MMDBMS 1996: 160-167.
- Rakesh Agrawal, Manish Mehta, John C. Shafer, Ramakrishnan Srikant, Andreas Arning and Toni Bollinger: The Quest Data Mining System. KDD 1996: 244-249
- John C. Shafer, Rakesh Agrawal and Manish Mehta: SPRINT: A Scalable Parallel Classifier for Data Mining. VLDB 1996: 544-555
- Rakesh Agrawal and John C. Shafer: Parallel Mining of Association Rules. IEEE Transactions on Knowledge and Data Engineering 8(6): 962-969 (1996)
- David J. DeWitt, Jeffrey F. Naughton, John C. Shafer and Shivakumar Venkataraman: Parallelising OODBMS Traversals: A Performance Evaluation. The VLDB Journal 5(1): 3-18 (1996)
1995 - Erik Selberg and Oren Etzioni. Multi-Service Search and Comparison using the MetaCrawler. In Proceedings of the 4th International World Wide Web Conference, Dec., 1995.
- Andrew Berman, Virgil Bourassa, and Erik Selberg. TRON: Process-Specific File Protection for the UNIX Operating System. In Proceedings of the 1995 Winter USENIX Conference, Jan., 1995.
1994 - David J. DeWitt, Jeffrey F. Naughton, John C. Shafer and Shivakumar Venkataraman: ParSets for Parallelizing OODBMS Traversals: Implementation and Performance. PDIS 1994: 111-120
| | |