A curated list of papers, theses, datasets, and tools related to the application of Machine Learning for Software Engineering
-
Updated
Mar 6, 2023
A curated list of papers, theses, datasets, and tools related to the application of Machine Learning for Software Engineering
PyTorch's implementation of the code2seq model.
A Tool for Mining Rich Abstract Syntax Trees from Code
Type4Py: Deep Similarity Learning-Based Type Inference for Python
Code for "PPOCoder: Execution-based Neural Code Generation using Proximal Policy Optimization"
Set of PyTorch modules for developing and evaluating different algorithms for embedding trees.
CD4Py: Code De-Duplication for Python
[ICLR 2021] "Generating Adversarial Computer Programs using Optimized Obfuscations" by Shashank Srikant, Sijia Liu, Tamara Mitrovska, Shiyu Chang, Quanfu Fan, Gaoyuan Zhang, and Una-May O'Reilly
Machine Learning for Source Code Analysis
VSCode Extension of Type4Py
Pre-training and fine-tuning GNN model on source code
This paper explores the idea of using heterogeneous graph neural networks (Het-GNN) to partition old legacy monoliths into candidate microservices. We additionally take membership constraints that come from a subject matter expert who has deep domain knowledge of the application.
Code for TeCo: an ML+Execution model for test completion
Add a description, image, and links to the ml4se topic page so that developers can more easily learn about it.
To associate your repository with the ml4se topic, visit your repo's landing page and select "manage topics."