File size: 723 Bytes
5c251e3
 
 
 
 
 
 
8eb9818
 
 
 
 
 
 
ed4bb7e
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
---
language:
- he 
tags:
- language model
---

Checkpoint of the alephbertgimmel-base-512 from https://github.com/Dicta-Israel-Center-for-Text-Analysis/alephbertgimmel
(for testing purpose, please use original checkpoints of the authors of this model)

AlephBertGimmel - Modern Hebrew pretrained BERT model with a 128K token vocabulary.

When using AlephBertGimmel, please reference:

```

Eylon Guetta, Avi Shmidman, Shaltiel Shmidman, Cheyn Shmuel Shmidman, Joshua Guedalia, Moshe Koppel, Dan Bareket, Amit Seker and Reut Tsarfaty, "Large Pre-Trained Models with Extra-Large Vocabularies: A Contrastive Analysis of Hebrew BERT Models and a New One to Outperform Them All", Nov 2022 [http://arxiv.org/abs/2211.15199]

```