[ICLR'17] DEEPCODER: LEARNING TO WRITE PROGRAMS

最新推荐文章于 2024-09-10 07:54:37 发布

weixin_34129145

最新推荐文章于 2024-09-10 07:54:37 发布

阅读量239

点赞数

文章标签： sketch python

DEEPCODER: LEARNING TO WRITE PROGRAMS

Basic Information

Authors: Matej Balog, Alexander L. Gaunt, Marc Brockschmidt, Sebastian Nowozin, Daniel Tarlow
Publication: ICLR'17
Description: Generate code based on input-output examples via neural network techniques

INDUCTIVE PROGRAM SYNTHESIS (IPS)

The Inductive Program Synthesis (IPS) problem is the following: given input-output examples, produce a program that has behavior consistent with the examples.

Building an IPS system requires solving two problems:

Search problem: to find consistent programs we need to search over a suitable set of possible programs. We need to define the set
(i.e., the program space) and search procedure.
Ranking problem: if there are multiple programs consistent with the input-output examples, which one do we return?

Domain Specific Languages (DSLs)

DSLs are programming languages that are suitable for a
specialized domain but are more restrictive than full-featured programming languages.
Restricted DSLs can also enable more efficient special-purpose search algorithms.
The choice of DSL also affects the difficulty of the ranking problem.

Search Techniques

Technique for searching for programs consistent with input-output examples.

Special-purpose algorithm
Satisfiability Modulo Theories (SMT) solving

Ranking

LEARNING INDUCTIVE PROGRAM SYNTHESIS (LIPS)

The components of LIPS are:

a DSL specification,

An attribute function A that maps programs P of the DSL to finite attribute vectors a = A(P). (Attribute vectors of different programs need not have equal length.) Attributes serve as the link between the machine learning and the search component of LIPS: the machine learning model predicts a distribution q(a | E), where E is the set of input-output examples, and the search procedure aims to search over programs P as ordered by q(A(P) | E). Thus an attribute is useful if it is both predictable from input-output examples, and if conditioning on its value significantly reduces the effective size of the search space.
Possible attributes are the (perhaps position-dependent) presence or absence of high-level functions (e.g., does the program contain or end in a call to SORT). Other possible attributes include control
flow templates (e.g., the number of loops and conditionals).
a data-generation procedure,
Generate a dataset ((P⁽ⁿ⁾, a⁽ⁿ⁾, E⁽ⁿ⁾))^N_n=1 of programs P⁽ⁿ⁾ in the chosen DSL, their attributes a⁽ⁿ⁾, and accompanying input-output examples E⁽ⁿ⁾).
a machine learning model that maps from input-output examples to program attributes,
Learn a distribution of attributes given input-output examples, q(a | E).
a search procedure that searches program space in an order guided by the model from (3).

Interface with an existing solver, using the predicted q(a | E) to guide the search.

DEEPCODER: Instantiation of LIPS

DSL AND ATTRIBUTES
A program in our DSL is a sequence of function calls, where the result of each call initializes a fresh variable that is either a
singleton integer or an integer array. Functions can be applied to any of the inputs or previously computed (intermediate) variables. The output of the program is the return value of the last function
call, i.e., the last variable. See Fig. 1 for an example program of length T = 4 in our DSL.
Overall, our DSL contains the first-order functions HEAD, LAST, TAKE, DROP, ACCESS, MINIMUM, MAXIMUM, REVERSE, SORT, SUM, and the higher-order functions MAP, FILTER, COUNT, ZIPWITH, SCANL1.

Example

DATA GENERATION
MACHINE LEARNING MODEL
1. an encoder: a differentiable mapping from a set of M input-output examples generated by
  a single program to a latent real-valued vector, and
2. a decoder: a differentiable mapping from the latent vector representing a set of M inputoutput
  examples to predictions of the ground truth program’s attributes.

Probability