| dc.contributor.advisor | Rinard, Martin |  | 
| dc.contributor.author | Cambronero, Jose | en_US | 
| dc.contributor.author | Rinard, Martin | en_US | 
| dc.contributor.other | Program Analysis and Compilation | en | 
| dc.date.accessioned | 2017-12-22T21:45:58Z |  | 
| dc.date.available | 2017-12-22T21:45:58Z |  | 
| dc.date.issued | 2017-12-21 |  | 
| dc.identifier.uri | http://hdl.handle.net/1721.1/112949 |  | 
| dc.description.abstract | We present CrowdLearn, a new system that processes an existing corpus of crowdsourced machine learning programs to learn how to generate effective pipelines for solving supervised machine learning problems. CrowdLearn uses a probabilistic model of program likelihood, conditioned on the current sequence of pipeline components and on the characteristics of the input data to the next component in the pipeline, to predict candidate pipelines. Our results highlight the effectiveness of this technique in leveraging existing crowdsourced programs to generate pipelines that work well on a range of supervised learning problems. | en_US | 
| dc.format.extent | 14 p. | en_US | 
| dc.relation.ispartofseries | MIT-CSAIL-TR-2017-015 |  | 
| dc.rights | Creative Commons Attribution 4.0 International | en | 
| dc.rights.uri | http://creativecommons.org/licenses/by/4.0/ |  | 
| dc.subject | program synthesis | en_US | 
| dc.subject | automated machine learning | en_US | 
| dc.subject | code mining | en_US | 
| dc.title | Generating Component-based Supervised Learning Programs From Crowdsourced Examples | en_US | 
| dc.date.updated | 2017-12-22T21:45:58Z |  |