lawhy committed
Commit 15c13c4 · verified · 1 Parent(s): 5c877e2

Update README.md

Files changed (1): README.md (+8 -8)
README.md CHANGED
@@ -31,18 +31,17 @@ HiT-MiniLM-L12-WordNet is a HiT model trained on WordNet's subsumption (hypernym
  - **Developed by:** [Yuan He](https://www.yuanhe.wiki/), Zhangdie Yuan, Jiaoyan Chen, and Ian Horrocks
  - **Model type:** Hierarchy Transformer Encoder (HiT)
  - **License:** Apache license 2.0
- - **Hierarchy**: WordNet (Noun)
+ - **Hierarchy**: WordNet's subsumption (hypernym) hierarchy of noun entities.
  - **Training Dataset**: Download `wordnet-mixed.zip` from [Datasets for HiTs on Zenodo](https://zenodo.org/doi/10.5281/zenodo.10511042)
  - **Pre-trained model:** [sentence-transformers/all-MiniLM-L12-v2](https://huggingface.co/sentence-transformers/all-MiniLM-L12-v2)
- - **Training Objectives**: Jointly optimised on *hyperbolic clustering* and *hyperbolic centripetal* losses
+ - **Training Objectives**: Jointly optimised on *Hyperbolic Clustering* and *Hyperbolic Centripetal* losses (see definitions in the [paper](https://arxiv.org/abs/2401.11374))

  ### Model Versions

-
  | **Version** | **Model Revision** | **Note** |
  |------------|---------|----------|
- |v1.0 (Random Negatives)| `main` or `v1-random-negative`| The variant trained on random negatives, as detailed in the [paper](https://arxiv.org/abs/2401.11374).|
- |v1.0 (Hard Negatives)| `v1-hard-negative` | The variant trained on hard negatives, as detailed in the [paper](https://arxiv.org/abs/2401.11374). |
+ |v1.0 (Random Negatives)| `main` or `v1-random-negatives`| The variant trained on random negatives, as detailed in the [paper](https://arxiv.org/abs/2401.11374).|
+ |v1.0 (Hard Negatives)| `v1-hard-negatives` | The variant trained on hard negatives, as detailed in the [paper](https://arxiv.org/abs/2401.11374). |


  ### Model Sources
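The revision names in the table above are Git revisions of this model repository. Below is a minimal sketch of loading one variant; the repository id `Hierarchy-Transformers/HiT-MiniLM-L12-WordNet` and the assumption that `from_pretrained` forwards `revision` to the Hugging Face Hub are not stated in this diff.

```python
# Minimal sketch, assuming the `hierarchy_transformers` package from
# https://github.com/KRR-Oxford/HierarchyTransformers and that
# `from_pretrained` accepts a Hub `revision`; the repository id is assumed.
from hierarchy_transformers import HierarchyTransformer

model = HierarchyTransformer.from_pretrained(
    "Hierarchy-Transformers/HiT-MiniLM-L12-WordNet",  # assumed repo id
    revision="v1-hard-negatives",  # or "main" / "v1-random-negatives", per the table
)
```

Omitting `revision` (or passing `main`) would fetch the random-negatives variant, per the table.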
@@ -106,7 +105,8 @@ parent_norms = model.manifold.dist0(parent_entity_embeddings)
  subsumption_scores = - (dists + centri_score_weight * (parent_norms - child_norms))
  ```

- Training and evaluation scripts are available at [GitHub](https://github.com/KRR-Oxford/HierarchyTransformers).
+ Training and evaluation scripts are available at [GitHub](https://github.com/KRR-Oxford/HierarchyTransformers/tree/main/scripts). See `scripts/evaluate.py` for how we determine the hyperparameters on the validation set for subsumption prediction.
+
  Technical details are presented in the [paper](https://arxiv.org/abs/2401.11374).

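For context, the `subsumption_scores` line above is the tail of the card's probing example. Assembled into one self-contained sketch (the entity names and the `centri_score_weight` value are illustrative; per the note above, the weight is a hyperparameter tuned on the validation set, e.g. by `scripts/evaluate.py`):

```python
# Sketch assembling the card's probing snippets; entity names and
# centri_score_weight are illustrative, not values from this diff.
from hierarchy_transformers import HierarchyTransformer

model = HierarchyTransformer.from_pretrained(
    "Hierarchy-Transformers/HiT-MiniLM-L12-WordNet"  # assumed repo id
)

child_entity_names = ["computer"]
parent_entity_names = ["machine"]

# Encode entity names; embeddings live in the model's hyperbolic manifold.
child_entity_embeddings = model.encode(child_entity_names, convert_to_tensor=True)
parent_entity_embeddings = model.encode(parent_entity_names, convert_to_tensor=True)

# Hyperbolic distance between child and parent, and each embedding's
# distance from the manifold origin.
dists = model.manifold.dist(child_entity_embeddings, parent_entity_embeddings)
child_norms = model.manifold.dist0(child_entity_embeddings)
parent_norms = model.manifold.dist0(parent_entity_embeddings)

# Higher score = parent more likely to subsume child (illustrative weight).
centri_score_weight = 1.0
subsumption_scores = -(dists + centri_score_weight * (parent_norms - child_norms))
```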
 
@@ -125,7 +125,7 @@ HierarchyTransformer(

  Preprint on arxiv: https://arxiv.org/abs/2401.11374.

- *Yuan He, Zhangdie Yuan, Jiaoyan Chen, Ian Horrocks.* **Language Models as Hierarchy Encoders.** arXiv preprint arXiv:2401.11374 (2024).
+ *Yuan He, Zhangdie Yuan, Jiaoyan Chen, Ian Horrocks.* **Language Models as Hierarchy Encoders.** To Appear at NeurIPS 2024.

  ```
  @article{he2024language,
@@ -139,4 +139,4 @@ Preprint on arxiv: https://arxiv.org/abs/2401.11374.

  ## Model Card Contact

- For any queries or feedback, please contact Yuan He (yuan.he@cs.ox.ac.uk).
+ For any queries or feedback, please contact Yuan He (`yuan.he(at)cs.ox.ac.uk`).
 