bigcode
/

santacoderpack

Text Generation

Eval Results (legacy)

text-generation-inference

Model card Files Files and versions

Muennighoff commited on Aug 16, 2023

Commit

fb30a58

·

1 Parent(s): ea5895e

Update README.md

Files changed (1) hide show

README.md +11 -4

README.md CHANGED Viewed

@@ -98,15 +98,14 @@ model-index:
 # Model Summary
-SantaCoderPack is an pre-trained model with the same architecture of SantaCoder on
-<th><a href=https://huggingface.co/datasets/bigcode/commitpack>CommitPack</a> using this format:
 ```
 <commit_before>code_before<commit_msg>message<commit_after>code_after
 ```
 - **Repository:** [bigcode/octopack](https://github.com/bigcode-project/octopack)
-- **Paper:** [TODO]()
 - **Languages:** Python, JavaScript, Java, C++, Go, Rust
 - **SantaCoderPack:**
 <table>
@@ -137,6 +136,7 @@ The model follows instructions provided in the input. We recommend prefacing you
 **Feel free to share your generations in the Community tab!**
 ## Generation
 ```python
 # pip install -q transformers
 from transformers import AutoModelForCausalLM, AutoTokenizer
@@ -171,4 +171,11 @@ print(tokenizer.decode(outputs[0]))
 # Citation
-TODO

 # Model Summary
+SantaCoderPack is an pre-trained model with the same architecture of SantaCoder on <th><a href=https://huggingface.co/datasets/bigcode/commitpack>CommitPack</a> using this format:
 ```
 <commit_before>code_before<commit_msg>message<commit_after>code_after
 ```
 - **Repository:** [bigcode/octopack](https://github.com/bigcode-project/octopack)
+- **Paper:** [OctoPack: Instruction Tuning Code Large Language Models](https://arxiv.org/abs/2308.07124)
 - **Languages:** Python, JavaScript, Java, C++, Go, Rust
 - **SantaCoderPack:**
 <table>
 **Feel free to share your generations in the Community tab!**
 ## Generation
 ```python
 # pip install -q transformers
 from transformers import AutoModelForCausalLM, AutoTokenizer
 # Citation
+```bibtex
+@article{muennighoff2023octopack,
+      title={OctoPack: Instruction Tuning Code Large Language Models},
+      author={Niklas Muennighoff and Qian Liu and Armel Zebaze and Qinkai Zheng and Binyuan Hui and Terry Yue Zhuo and Swayam Singh and Xiangru Tang and Leandro von Werra and Shayne Longpre},
+      journal={arXiv preprint arXiv:2308.07124},
+      year={2023}
+}
+```