Benjamin-eecs
/

Llama-3.1-8B-Instruct-NLRL-TicTacToe-Policy

Feature Extraction

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

Benjamin-eecs commited on Nov 24, 2024

Commit

abe5a38

·

verified ·

1 Parent(s): f0daef2

docs: update README.md

Files changed (1) hide show

README.md +3 -1

README.md CHANGED Viewed

@@ -3,6 +3,8 @@ library_name: transformers
 license: llama3.1
 base_model:
 - meta-llama/Llama-3.1-8B-Instruct
 ---
 # Model Card for Llama-3.1-8B-Instruct-NLRL-TicTacToe-Policy
@@ -62,4 +64,4 @@ Training data consists of state-action pairs collected through NLRL actor-critic
 ```
 ## Model Card Contact
-[email protected]

 license: llama3.1
 base_model:
 - meta-llama/Llama-3.1-8B-Instruct
+tags:
+- nlrl
 ---
 # Model Card for Llama-3.1-8B-Instruct-NLRL-TicTacToe-Policy
 ```
 ## Model Card Contact
+[email protected]