Fingerpointing | Priya Narasimhan
PUBLICATIONS

Draco: Statistical Diagnosis of Chronic Problems in Large Distributed Systems. Soila P. Kavulya, Kaustubh Joshi (AT&T), Matti Hiltunen (AT&T), Scott Daniels (AT&T), Rajeev Gandhi and Priya Narasimhan. To appear in IEEE/IFIP International Conference on Dependable Systems and Networks (DSN), Boston, MA, June 2012.

Practical Experiences with Chronics Discovery in Large Telecommunications Systems. Soila P. Kavulya, Kaustubh Joshi (AT&T), Matti Hiltunen (AT&T), Scott Daniels (AT&T), Rajeev Gandhi and Priya Narasimhan (CMU). Best Papers from SLAML 2011 in Operating Systems Review, Volume 45, Number 3, December 2011.

Understanding and improving the Diagnostic Workflow of MapReduce Users. Jason D. Campbell (Intel), Arun B. Ganesan, Ben Gotow, Soila P. Kavulya, James Mulholland, Priya Narasimhan, Sriram Ramasubramanian, Mark Shuster, Jiaqi Tan (DSO National Laboratories, Singapore), ACM Symposium on Computer Human Interaction for Management of Information Technology (CHIMIT), Boston, MA, December 2011.

Behavior-Based Problem Localization for Parallel File Systems. Michael P. Kasick, Rajeev Gandhi and Priya Narasimhan, USENIX Workshop on Hot Topics in Dependability (HotDep), Vancouver, BC (October 2010).

ASDF: An Automated, Online Framework for Diagnosing Performance Problems K. Bare, S. Kavulya, J. Tan, X. Pan, E. Marinelli, M. Kasick, R.Gandhi, P. Narasimhan. Architecting Dependable Systems, in Lecture Notes in Computer Science, Volume 6420/2010, No. 7, Pages 201-226, 2010.

An Analysis of Traces from a Production MapReduce Cluster. Soila Kavulya, Jiaqi Tan, Rajeev Gandhi, Priya Narasimhan. IEEE/ACM International Conference on Cluster, Cloud and Grid Computing (CCGrid), Melbourne, Australia (May 2010).

Visual, Log-based Causal Tracing for Performance Debugging of MapReduce Systems. Jiaqi Tan, Soila Kavulya, Rajeev Gandhi, Priya Narasimhan. IEEE International Conference on Distributed Computing Systems (ICDCS), Genoa, Italy (June 2010).

Hardware Performance Counter-Based Problem Diagnosis for e-Commerce Systems. Keith A. Bare, Soila Kavulya and Priya Narasimhan. IEEE/IFIP Network Operations and Management Symposium (NOMS), Osaka, Japan (April 2010).

Kahuna: Problem Diagnosis for MapReduce-Based Cloud Computing Environments. Jiaqi Tan, Xinghao Pan, Soila Kavulya, Rajeev Gandhi and Priya Narasimhan, IEEE/IFIP Network Operations and Management Symposium (NOMS), Osaka, Japan (April 2010).

Black-Box Diagnosis in Parallel File Systems. Michael P. Kasick, Jiaqi Tan, Rajeev Gandhi and Priya Narasimhan. USENIX Conference on File and Storage Technologies (FAST), San Jose, CA (Feb 2010).

Blind Men and the Elephant (BLIMEy): Piecing together Hadoop for Diagnosis. Xinghao Pan, Jiaqi Tan, Soila Kavulya, Rajeev Gandhi and Priya Narasimhan. International Symposium on Software Reliability Engineering (ISSRE), Mysore, India (Dec 2009).

Ganesha: Black-Box Fault Diagnosis for MapReduce Systems. Xinghao Pan, Jiaqi Tan, Soila Kavulya, Rajeev Gandhi and Priya Narasimhan. Workshop on Hot Topics in Measurement and Modeling of Computer Systems (HotMetrics 2009), Seattle, WA (June 2009).

Mochi: Visual Log-Analysis Based Tools for Debugging Hadoop. Jiaqi Tan, Xinghao Pan, Soila Kavulya, Rajeev Gandhi and Priya Narasimhan. USENIX Workshop on Hot Topics in Cloud Computing (HotCloud), San Diego, CA (June 2009).

System Call-Based Problem Diagnosis for PVFS. Michael P. Kasick, Keith A. Bare, Eugene E. Marinelli III, Jiaqi Tan, Rajeev Gandhi and Priya Narasimhan. USENIX Workshop on Hot Topics in Dependability (HotDep), Estoril Portugal (June 2009).

SALSA: Analyzing Logs as StAte Machines. Jiaqi Tan, Xinghao Pan, Soila Kavulya, Rajeev Gandhi and Priya Narasimhan. USENIX Workshop on Tackling Computer Systems Problems with Machine Learning Techniques (SysML), San Diego, CA (December 2008).

Gumshoe: Diagnosing Performance Problems in Replicated File-Systems. Soila Pertet, Rajeev Gandhi and Priya Narasimhan. IEEE Symposium on Reliable Distributed Systems (SRDS), Naples, Italy (October 2008).

Fingerpointing Correlated Failures in Replicated Systems. Soila Pertet, Rajeev Gandhi and Priya Narasimhan. USENIX Workshop on Tackling Computer Systems Problems with Machine Learning Techniques (SysML), Cambridge, MA (April 2007).

Towards Fingerpointing in the Emulab Dynamic Distributed System. Michael P. Kasick, Priya Narasimhan, Kevin Atkinson, Jay Lepreau. USENIX Workshop on Real, Large Distributed Systems (WORLDS), Seattle, WA (November 2006).




THESES

Hyrax: Cloud Computing on Mobile Devices Using MapReduce. School of Computer Science Master's Thesis, Carnegie Mellon University, September 2009.

CPU Performance Counter-Based Diagnosis for Software Systems. School of Computer Science Master's Thesis, Carnegie Mellon University, September 2009.

Log-Based Approaches to Characterizing and Diagnosing MapReduce Systems. School of Computer Science Master's Thesis, Carnegie Mellon University, July 2009.

Diagnosing Performance Problems in Parallel File Systems. Electrical & Computer Engineering Department Master's Thesis, Carnegie Mellon University, May 2009.

The Blind Men and the Elephant: Piecing Together Hadoop for Diagnosis. School of Computer Science Master's Thesis, Carnegie Mellon University, May 2009.

RAMS and BlackSheep: Inferring White-box Application Behavior Using Black-box Techniques. Jiaqi Tan and Priya Narasimhan. School of Computer Science Senior Honors Thesis and Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-08-103, May 2008.




TECHNICAL REPORTS

ASDF: Automated, Online Fingerpointing for Hadoop. Keith Bare, Michael P. Kasick, Soila Kavulya, Eugene Marinelli, Xinghao Pan, Jiaqi Tan, Rajeev Gandhi, Priya Narasimhan. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-08-104, May 2008.

Group Communication: Helping or Obscuring Failure Diagnosis? Soila Pertet, Rajeev Gandhi and Priya Narasimhan. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-06 -107, June, 2006.

Causes of Failure in Web Applications. Soila Pertet and Priya Narasimhan. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-05-109. December 2005.

Last updated: March 2011, Priya Narasimhan