Skip to content

oracle-samples/ml4code

Repository files navigation

Machine Learning for Source Code Knowledge Base

With the advancement in Natural Language Processing and Large Multimodal Models in general, a new field adopting the findings to source code and software development has been emerging. This knowledge base collects related papers, models, products, blogs of individuals, events, courses, tutorials, videos and many other artifacts on a best effort basis.

The aim is to generate a better understanding for everyone on how organizations can improve their software development lifecycle.

ai4code, ml4code, dl4code, bigcode, deepcode

Documentation

The subfolders contains markdown files with the knowledge-base content.

  • 01_papers: Collection and summaries of research papers.
  • 02_downstream: Collection of use cases
  • 03_dataset: Overview on publicly available source code datasets
  • 04_evaluation: Overivew on metrics and benchmarks
  • 05_models: Overview on models
  • 06_libraries: Overiew on ML libraries
  • 07_articles: Collection of web articles related to ai for source code.
  • 08_tooling: Overview of available tools and companies within the ai for code domain.
  • 09_resources: Machine Learning & Software Engineering references
  • 10_about: Maintenance

Contributing

This project welcomes contributions from the community. Before submitting a pull request, please review our contribution guide

Security

Please consult the security guide for our responsible security vulnerability disclosure process

License

Copyright (c) 2025 Oracle and/or its affiliates.

Released under the Universal Permissive License v1.0 as shown at https://oss.oracle.com/licenses/upl/.

About

AI 4 Code Knowledge Base

Resources

License

Security policy

Stars

Watchers

Forks