Tom Aarsen commited on
Commit
bcbd909
·
1 Parent(s): 56ccabc

Update README model architecture

Browse files
Files changed (1) hide show
  1. README.md +4 -4
README.md CHANGED
@@ -377,8 +377,8 @@ This is a [Asymmetric Inference-free SPLADE Sparse Encoder](https://www.sbert.ne
377
  ```
378
  SparseEncoder(
379
  (0): Router(
380
- (query_0_IDF): IDF ({'frozen': False}, dim:30522, tokenizer: DistilBertTokenizerFast)
381
- (document_0_MLMTransformer): MLMTransformer({'max_seq_length': 512, 'do_lower_case': False}) with MLMTransformer model: DistilBertForMaskedLM
382
  (document_1_SpladePooling): SpladePooling({'pooling_strategy': 'max', 'activation_function': 'relu', 'word_embedding_dimension': 30522})
383
  )
384
  )
@@ -591,7 +591,7 @@ You can finetune this model on your own dataset.
591
  - `fp16`: True
592
  - `batch_sampler`: no_duplicates
593
  - `router_mapping`: {'query': 'query', 'answer': 'document'}
594
- - `learning_rate_mapping`: {'IDF\\.weight': 0.001}
595
 
596
  #### All Hyperparameters
597
  <details><summary>Click to expand</summary>
@@ -710,7 +710,7 @@ You can finetune this model on your own dataset.
710
  - `batch_sampler`: no_duplicates
711
  - `multi_dataset_batch_sampler`: proportional
712
  - `router_mapping`: {'query': 'query', 'answer': 'document'}
713
- - `learning_rate_mapping`: {'IDF\\.weight': 0.001}
714
 
715
  </details>
716
 
 
377
  ```
378
  SparseEncoder(
379
  (0): Router(
380
+ (query_0_SparseStaticEmbedding): SparseStaticEmbedding({'frozen': False}, dim:30522, tokenizer: DistilBertTokenizerFast)
381
+ (document_0_MLMTransformer): MLMTransformer({'max_seq_length': 512, 'do_lower_case': False, 'architecture': 'DistilBertForMaskedLM'})
382
  (document_1_SpladePooling): SpladePooling({'pooling_strategy': 'max', 'activation_function': 'relu', 'word_embedding_dimension': 30522})
383
  )
384
  )
 
591
  - `fp16`: True
592
  - `batch_sampler`: no_duplicates
593
  - `router_mapping`: {'query': 'query', 'answer': 'document'}
594
+ - `learning_rate_mapping`: {'SparseStaticEmbedding\\.weight': 0.001}
595
 
596
  #### All Hyperparameters
597
  <details><summary>Click to expand</summary>
 
710
  - `batch_sampler`: no_duplicates
711
  - `multi_dataset_batch_sampler`: proportional
712
  - `router_mapping`: {'query': 'query', 'answer': 'document'}
713
+ - `learning_rate_mapping`: {'SparseStaticEmbedding\\.weight': 0.001}
714
 
715
  </details>
716