The proliferation of open-source projects has led to large amounts of source code and related artifacts: arguably, the rich and open resources associated with software--including open source repositories, Q/A sites, change histories, and communications between developers--are the richest and most detailed information resource for any technical area. Recently it has been discovered that “natural”, human-produced software has many interesting statistical regularities. As a consequence code corpora, just like natural language corpora, are amenable to statistical modeling, and a number of software tasks such as coding, testing, porting, bug-patching etc are potentially enhanced by the use of these statistical models.

This interdisciplinary workshop will explore issues related to the statistical modeling of software corpora, including topics such as: modeling repetitiveness in source code; use of language models for the code suggestion in IDEs; using probabilistic grammars to mine programming idioms; statistical methods for type inference in a dynamically typed languages; statistical machine translation for porting applications between programming languages, or “mini-fying”Javascript; using statistical language models to find bugs; or statistical methods for automatic code patching, code summarization, code retrieval, code annotation, or test generation.

The workshop follows several earlier workshops on this topic at Microsoft Research, Dagstuhl event, and SIGSOFT FSE.

We are delighted that the workshop will feature two invited speakers: Graham Neubig , of Carnegie-Mellon University, and Danny Tarlow , of Google Brain.


We gratefully acknowledge funding, from NSF, to support a limited number of US travellers to the workshop, especially students and members of under-represented groups, and researchers that might not normally attend AAAI.

Call for participation

We invite short position papers, of at most 4 pages in length. Submissions will be reviewed primarily for relevance, will not appear in ACM Digital Library, and may be published subsequently elsewhere. A few of the submissions will be invited for presentation.


Please submit your paper here:
Submissions should be AAAI Submission format.


Important dates

October 16, 2017
Workshop Submissions Due (AOE time)
November 9, 2017
Notifications Sent to Authors
November 21, 2017
Final Workshop Papers Due at AAAI

Program Committee

Program Chairs
Prem DevanbuUniversity of California, Davis
William CohenCarnegie-Mellon University
Program Committee
Earl BarrUniversity College, London
Jacob DevlinGoogle
Doug DowneyNorthwestern University
Aditya KanadeIndian Institute of Science
Ray MooneyUT Austin
Graham NeubigCarnegie-Mellon University
Tien NguyenUT Dallas
Charles SuttonUniversity of Edinburgh
Bogdan VasilescuCarnegie-Mellon University
Martin VechevETH, Zurich