| Model | 186 |L / H / A* | 187 |Batch size | 188 |Update steps | 189 |Corpus size | 190 |Fairseq | 191 |Transformers | 192 | 193 |
|---|---|---|---|---|---|---|
| RoBERTa (base) | 195 |12 / 768 / 12 | 196 |8k | 197 |125k | 198 |~20GB | 199 |200 | v0.9.0 201 | | 202 |203 | v3.4 204 | | 205 |
| RoBERTa‑v2 (base) | 208 |12 / 768 / 12 | 209 |8k | 210 |400k | 211 |~20GB | 212 |213 | v0.10.1 214 | | 215 |216 | v4.4 217 | | 218 |
| RoBERTa (large) | 221 |24 / 1024 / 16 | 222 |30k | 223 |50k | 224 |~135GB | 225 |226 | v0.9.0 227 | | 228 |229 | v3.4 230 | | 231 |
| RoBERTa‑v2 (large) | 234 |24 / 1024 / 16 | 235 |2k | 236 |400k | 237 |~200GB | 238 |239 | v0.10.2 240 | | 241 |242 | v4.14 243 | | 244 |
| DistilRoBERTa | 248 |6 / 768 / 12 | 249 |1k | 250 |10ep. | 251 |~20GB | 252 |253 | n/a 254 | | 255 |256 | v4.13 257 | | 258 |
| Student model | 391 |Teacher model | 392 |Download | 393 | 394 |
|---|---|---|
| polish-roberta-base-v2 | 396 |paraphrase-distilroberta-base-v2 | 397 |st-polish-paraphrase-from-distilroberta | 398 |
| polish-roberta-base-v2 | 401 |paraphrase-mpnet-base-v2 | 402 |st-polish-paraphrase-from-mpnet | 403 |
| Base models | 427 |Stage 1: Distilled models | 428 |Stage 2: Retrieval models | 429 ||||
|---|---|---|---|---|---|
| Student model | 432 |Teacher model | 433 |PL-MTEB Score |
434 | Download | 435 |PIRB NDCG@10 |
436 | Download | 437 |
| Encoders based on Polish RoBERTa | 441 ||||||
| polish-roberta-base-v2 | 444 |bge-base-en | 445 |61.05 | 446 |mmlw-roberta-base | 447 |56.38 | 448 |mmlw-retrieval-roberta-base | 449 |
| polish-roberta-large-v2 | 452 |bge-large-en | 453 |63.23 | 454 |mmlw-roberta-large | 455 |58.46 | 456 |mmlw-retrieval-roberta-large | 457 |
| Encoders based on Multilingual E5 | 460 ||||||
| multilingual-e5-small | 463 |bge-small-en | 464 |55.84 | 465 |mmlw-e5-small | 466 |52.34 | 467 |mmlw-retrieval-e5-small | 468 |
| multilingual-e5-base | 471 |bge-base-en | 472 |59.71 | 473 |mmlw-e5-base | 474 |56.09 | 475 |mmlw-retrieval-e5-base | 476 |
| multilingual-e5-large | 479 |bge-large-en | 480 |61.17 | 481 |mmlw-e5-large | 482 |58.30 | 483 |mmlw-retrieval-e5-large | 484 |
| Model | 583 |Parameters | 584 |Training method | 585 |PIRB NDCG@10 |
586 |
587 |
|---|---|---|---|
| polish-reranker-base-ranknet | 589 |124M | 590 |RankNet | 591 |60.32 | 592 |
| polish-reranker-large-ranknet | 595 |435M | 596 |RankNet | 597 |62.65 | 598 |
| polish-reranker-base-mse | 601 |124M | 602 |MSE | 603 |57.50 | 604 |
| polish-reranker-large-mse | 607 |435M | 608 |MSE | 609 |60.27 | 610 |