Wednesday, July 20, 2011

A Tutorial on eXtreme Software Analytics at ASE 2011

Later register the tutorial by

Dongmei Zhang (Microsoft Research Asia) and Tao Xie (North Carolina State University).
xSA: eXtreme Software Analytics - Marriage of eXtreme Computing and Software Analytics.
A tutorial at the 26th IEEE/ACM International Conference on Automated Software Engineering (ASE 2011), Tutorial, Lawrence, Kansas, November 2011.

2 full papers + 1 short paper at ASE 2011

Kai Pan, Xintao Wu, and Tao Xie.
Generating Program Inputs for Database Application Testing.
In Proceedings of the 26th IEEE/ACM International Conference on Automated Software Engineering (ASE 2011), Lawrence, Kansas, November 2011.
Download: [BibTeX]

Qian Wu, Guangtai Liang, Qianxiang Wang, Tao Xie, and Hong Mei.
Iterative Mining of Resource-Releasing Specifications.
In Proceedings of the 26th IEEE/ACM International Conference on Automated Software Engineering (ASE 2011), Lawrence, Kansas, November 2011.
Download: [BibTeX]

Wujie Zheng, Hao Ma, Michael R. Lyu, Tao Xie, and Irwin King.
Mining Test Oracles of Web Search Engines.
In Proceedings of the 26th IEEE/ACM International Conference on Automated Software Engineering (ASE 2011), Short Paper, Lawrence, Kansas, November 2011.
Download: [BibTeX]

Congratulations to Suresh Thummalapenta for his OOPSLA 2011 paper

Suresh Thummalapenta, Tao Xie, Nikolai Tillmann, Jonathan de Halleux, and Zhendong Su.
Synthesizing Method Sequences for High-Coverage Testing.
In Proceedings of ACM SIGPLAN International Conference on Object-Oriented Programming, Systems, Languages, and Applications (OOPSLA 2011), Portland, Oregon, October 2011.
Download: [BibTeX]

Congratulations to Kunal Taneja for his ESEC/FSE 2011 paper

Kunal Taneja, Mark Grechanik, Rayid Ghani, and Tao Xie.
Testing Software In Age Of Data Privacy: A Balancing Act.
In Proceedings of the 8th joint meeting of the European Software Engineering Conference and the ACM SIGSOFT Symposium on the Foundations of Software Engineering (ESEC/FSE 2011), Szeged, Hungary, September 2011.
Download: [PDF][BibTeX]

A Technical Briefing on Text Analytics for Software Engineering at ESEC/FSE 2011

Register now for Technical Briefings at ESEC/FSE 2011, including

Lin Tan and Tao Xie. Text Analytics for Software Engineering: Applications of Natural Language Processing. A Technical Briefing at the 8th joint meeting of the http://www.blogger.com/img/blank.gifEuropeanhttp://www.blogger.com/img/blank.gif Software Engineering Conference and the ACM SIGSOFT Symposium on the Foundations of Software Engineering (ESEC/FSE 2011), Szeged, Hungary, September 2011.

Abstract

Software engineering data contains a rich amount of natural language text: requirements documents, code comments, identifier names, commit logs, release notes, mailing list discussions, etc. The natural language text is essential in the software engineering process to help software engineers and researchers better understand and maintain software. Given the overwhelming amount of available natural language text, there is a high demand of text analytics including natural language processing (NLP) and text mining techniques to automatically analyze the natural language text to improve software quality and productivity. The history of applying NLP and text mining techniques to analyze software engineering data can date back to about a decade ago. In recent five years, text analytics for software engineering has become an emerging topic in the software engineering area. Various recent studies showed that automated analysis of natural language text can improve software reliability, programming productivity, software maintenance, and software quality in general.

This technical briefing (1) provides a quick overview of major text mining techniques as well as NLP techniques (e.g., Part-Of-Speech tagging, chunking, semantic labeling, semantic pattern matching, and negative-expression identification), machine learning techniques (e.g., clustering and decision-tree-based classification), and data mining techniques (e.g., frequent itemset mining); (2) introduces popular text analysis tools (e.g., WordNet and Weka); (3) summarizes major research work done in the area of text analytics for software engineering; and (4) outlines future research directions and highlights research challenges. More information on the technical briefing could be found at https://sites.google.com/site/text4se/.

The ESEC/FSE program includes a complementary technical briefing on “Management of Unstructured Information during Software Evolution: Applications of Text Retrieval”, by Andrian Marcus. We recommend attending both of them.