language:-enlicense:apache-2.0tags:-sentence-transformers-sentence-similarity-feature-extraction-generated_from_trainer-dataset_size:6300-loss:MatryoshkaLoss-loss:MultipleNegativesRankingLossbase_model:BAAI/bge-base-enwidget:-source_sentence:>- Employee health, safety and wellness are top priorities at Hasbro. We support our colleagues’ well-being, which includes mental, physical and financial wellness, through a number of programs, including: robust employee assistance programs, childcare solutions, and a commitment to flexible work arrangements.sentences:->- What percentage of the total annual net trade sales did the sales returns reserve represent for the company during each of the fiscal years 2023, 2022, and 2021?-HowdoesHasbrosupportthewellnessofitsemployees?->- What was the conclusion of the Company's review regarding the impact of the American Rescue Plan, the Consolidated Appropriations Act, 2021, and related tax provisions on its business for the fiscal year ended June 30, 2023?-source_sentence:>- The Company has a minority market share in the global smartphone, personal computer and tablet markets. The Company faces substantial competition in these markets from companies that have significant technical, marketing, distribution and other resources, as well as established hardware, software and digital content supplier relationships. In addition, some of the Company’s competitors have broader product lines, lower-priced products and a larger installed base of active devices. Competition has been particularly intense as competitors have aggressively cut prices and lowered product margins.sentences:->- When did The Hershey Company declare the dividend that was paid on March 15, 2023?->- What factors contribute to the Company facing substantial competition in the markets for smartphones, personal computers, and tablets?-Howisgoodwillimpairmentanalyzed?-source_sentence:>- During fiscal 2022, there were cash payments of $6.7 billion for repurchases of common stock through open market purchases.sentences:->- What was the value of cash payments for common stock repurchases through open market purchases during fiscal 2022?->- How much did the Compute & Networking segment's gross margin decrease in fiscal year 2023?-WhatdifferentmethodsdoesAmazonusetoengageandretainemployees?-source_sentence:>- Walmart Luminate provides a suite of data products for merchants and suppliers.sentences:->- What pages do the Consolidated Financial Statements and their accompanying Notes and reports appear on in the document?->- What was the percentage change in NYSE total cash handled volume from 2022 to 2023?-WhatisthefunctionofWalmartLuminate?-source_sentence:>- Item 8. Financial Statements and Supplementary Data. The Consolidated Financial Statements, together with the Notes thereto and the report thereon dated February 16, 2024, of PricewaterhouseCoopers LLP, the Firm’s independent registered public accounting firm (PCAOB ID 238).sentences:-WhattypeofdatadoesItem8inafinancialdocumentcontain?->- How did the assumptions and estimates used for assessing the fair value of reporting units potentially impact the company's financial statements?->- What factors are considered when making estimates for financial statements?pipeline_tag:sentence-similaritylibrary_name:sentence-transformersmetrics:-cosine_accuracy@1-cosine_accuracy@3-cosine_accuracy@5-cosine_accuracy@10-cosine_precision@1-cosine_precision@3-cosine_precision@5-cosine_precision@10-cosine_recall@1-cosine_recall@3-cosine_recall@5-cosine_recall@10-cosine_ndcg@10-cosine_mrr@10-cosine_map@100model-index:-name:BGEbaseFinancialMatryoshkaresults:-task:type:information-retrievalname:InformationRetrievaldataset:name:dim768type:dim_768metrics:-type:cosine_accuracy@1value:0.20411392405063292name:CosineAccuracy@1-type:cosine_accuracy@3value:0.39082278481012656name:CosineAccuracy@3-type:cosine_accuracy@5value:0.45569620253164556name:CosineAccuracy@5-type:cosine_accuracy@10value:0.5427215189873418name:CosineAccuracy@10-type:cosine_precision@1value:0.20411392405063292name:CosinePrecision@1-type:cosine_precision@3value:0.1302742616033755name:CosinePrecision@3-type:cosine_precision@5value:0.0911392405063291name:CosinePrecision@5-type:cosine_precision@10value:0.054272151898734175name:CosinePrecision@10-type:cosine_recall@1value:0.20411392405063292name:CosineRecall@1-type:cosine_recall@3value:0.39082278481012656name:CosineRecall@3-type:cosine_recall@5value:0.45569620253164556name:CosineRecall@5-type:cosine_recall@10value:0.5427215189873418name:CosineRecall@10-type:cosine_ndcg@10value:0.3712962481916349name:CosineNdcg@10-type:cosine_mrr@10value:0.31667482921438606name:CosineMrr@10-type:cosine_map@100value:0.32569334518419213name:CosineMap@100-task:type:information-retrievalname:InformationRetrievaldataset:name:dim512type:dim_512metrics:-type:cosine_accuracy@1value:0.1787974683544304name:CosineAccuracy@1-type:cosine_accuracy@3value:0.38449367088607594name:CosineAccuracy@3-type:cosine_accuracy@5value:0.44936708860759494name:CosineAccuracy@5-type:cosine_accuracy@10value:0.5221518987341772name:CosineAccuracy@10-type:cosine_precision@1value:0.1787974683544304name:CosinePrecision@1-type:cosine_precision@3value:0.1281645569620253name:CosinePrecision@3-type:cosine_precision@5value:0.08987341772151898name:CosinePrecision@5-type:cosine_precision@10value:0.05221518987341772name:CosinePrecision@10-type:cosine_recall@1value:0.1787974683544304name:CosineRecall@1-type:cosine_recall@3value:0.38449367088607594name:CosineRecall@3-type:cosine_recall@5value:0.44936708860759494name:CosineRecall@5-type:cosine_recall@10value:0.5221518987341772name:CosineRecall@10-type:cosine_ndcg@10value:0.35214780800723905name:CosineNdcg@10-type:cosine_mrr@10value:0.2974972372915411name:CosineMrr@10-type:cosine_map@100value:0.30719274754259535name:CosineMap@100-task:type:information-retrievalname:InformationRetrievaldataset:name:dim256type:dim_256metrics:-type:cosine_accuracy@1value:0.17563291139240506name:CosineAccuracy@1-type:cosine_accuracy@3value:0.33860759493670883name:CosineAccuracy@3-type:cosine_accuracy@5value:0.3924050632911392name:CosineAccuracy@5-type:cosine_accuracy@10value:0.49683544303797467name:CosineAccuracy@10-type:cosine_precision@1value:0.17563291139240506name:CosinePrecision@1-type:cosine_precision@3value:0.11286919831223628name:CosinePrecision@3-type:cosine_precision@5value:0.07848101265822786name:CosinePrecision@5-type:cosine_precision@10value:0.04968354430379747name:CosinePrecision@10-type:cosine_recall@1value:0.17563291139240506name:CosineRecall@1-type:cosine_recall@3value:0.33860759493670883name:CosineRecall@3-type:cosine_recall@5value:0.3924050632911392name:CosineRecall@5-type:cosine_recall@10value:0.49683544303797467name:CosineRecall@10-type:cosine_ndcg@10value:0.32777016757909155name:CosineNdcg@10-type:cosine_mrr@10value:0.2748675155716295name:CosineMrr@10-type:cosine_map@100value:0.2839854758498125name:CosineMap@100-task:type:information-retrievalname:InformationRetrievaldataset:name:dim128type:dim_128metrics:-type:cosine_accuracy@1value:0.13449367088607594name:CosineAccuracy@1-type:cosine_accuracy@3value:0.27689873417721517name:CosineAccuracy@3-type:cosine_accuracy@5value:0.34335443037974683name:CosineAccuracy@5-type:cosine_accuracy@10value:0.40189873417721517name:CosineAccuracy@10-type:cosine_precision@1value:0.13449367088607594name:CosinePrecision@1-type:cosine_precision@3value:0.09229957805907173name:CosinePrecision@3-type:cosine_precision@5value:0.06867088607594937name:CosinePrecision@5-type:cosine_precision@10value:0.04018987341772152name:CosinePrecision@10-type:cosine_recall@1value:0.13449367088607594name:CosineRecall@1-type:cosine_recall@3value:0.27689873417721517name:CosineRecall@3-type:cosine_recall@5value:0.34335443037974683name:CosineRecall@5-type:cosine_recall@10value:0.40189873417721517name:CosineRecall@10-type:cosine_ndcg@10value:0.2642535058721437name:CosineNdcg@10-type:cosine_mrr@10value:0.2206462226240707name:CosineMrr@10-type:cosine_map@100value:0.2315340997045677name:CosineMap@100-task:type:information-retrievalname:InformationRetrievaldataset:name:dim64type:dim_64metrics:-type:cosine_accuracy@1value:0.08544303797468354name:CosineAccuracy@1-type:cosine_accuracy@3value:0.19462025316455697name:CosineAccuracy@3-type:cosine_accuracy@5value:0.24841772151898733name:CosineAccuracy@5-type:cosine_accuracy@10value:0.31645569620253167name:CosineAccuracy@10-type:cosine_precision@1value:0.08544303797468354name:CosinePrecision@1-type:cosine_precision@3value:0.06487341772151899name:CosinePrecision@3-type:cosine_precision@5value:0.04968354430379747name:CosinePrecision@5-type:cosine_precision@10value:0.031645569620253174name:CosinePrecision@10-type:cosine_recall@1value:0.08544303797468354name:CosineRecall@1-type:cosine_recall@3value:0.19462025316455697name:CosineRecall@3-type:cosine_recall@5value:0.24841772151898733name:CosineRecall@5-type:cosine_recall@10value:0.31645569620253167name:CosineRecall@10-type:cosine_ndcg@10value:0.19364593797751115name:CosineNdcg@10-type:cosine_mrr@10value:0.15531381856540089name:CosineMrr@10-type:cosine_map@100value:0.16408720453627956name:CosineMap@100
BGE base Financial Matryoshka
This is a sentence-transformers model finetuned from BAAI/bge-base-en on the json dataset. It maps sentences & paragraphs to a 768-dimensional dense vector space and can be used for semantic textual similarity, semantic search, paraphrase mining, text classification, clustering, and more.
from sentence_transformers import SentenceTransformer
# Download from the 🤗 Hub
model = SentenceTransformer("RK-1235/bge-base-FIR-matryoshka-BASELINE-10epochs-FT")
# Run inference
sentences = [
'Item 8. Financial Statements and Supplementary Data. The Consolidated Financial Statements, together with the Notes thereto and the report thereon dated February 16, 2024, of PricewaterhouseCoopers LLP, the Firm’s independent registered public accounting firm (PCAOB ID 238).',
'What type of data does Item 8 in a financial document contain?',
"How did the assumptions and estimates used for assessing the fair value of reporting units potentially impact the company's financial statements?",
]
embeddings = model.encode(sentences)
print(embeddings.shape)
# [3, 768]# Get the similarity scores for the embeddings
similarities = model.similarity(embeddings, embeddings)
print(similarities.shape)
# [3, 3]
Approximate statistics based on the first 1000 samples:
positive
anchor
type
string
string
details
min: 6 tokens
mean: 46.06 tokens
max: 371 tokens
min: 8 tokens
mean: 20.8 tokens
max: 51 tokens
Samples:
positive
anchor
As of December 31, 2023, a 5 percent change in the contingent consideration liabilities would result in a change in income before income taxes of $5.2 million.
How would a 5% change in the contingent consideration liabilities impact income before taxes as of December 31, 2023?
NIKE, Inc.'s principal business activity involves the design, development, and worldwide marketing and selling of athletic footwear, apparel, equipment, accessories, and services.
What is the principal business activity of NIKE, Inc.?
During 2023, changes in foreign currencies relative to the U.S. dollar negatively impacted net sales by approximately $3,484, 156 basis points, compared to 2022, attributable to our Canadian and Other International operations.
What was the overall impact of foreign currencies on net sales in 2023?
@inproceedings{reimers-2019-sentence-bert,
title = "Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks",
author = "Reimers, Nils and Gurevych, Iryna",
booktitle = "Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing",
month = "11",
year = "2019",
publisher = "Association for Computational Linguistics",
url = "https://arxiv.org/abs/1908.10084",
}
MatryoshkaLoss
@misc{kusupati2024matryoshka,
title={Matryoshka Representation Learning},
author={Aditya Kusupati and Gantavya Bhatt and Aniket Rege and Matthew Wallingford and Aditya Sinha and Vivek Ramanujan and William Howard-Snyder and Kaifeng Chen and Sham Kakade and Prateek Jain and Ali Farhadi},
year={2024},
eprint={2205.13147},
archivePrefix={arXiv},
primaryClass={cs.LG}
}
MultipleNegativesRankingLoss
@misc{henderson2017efficient,
title={Efficient Natural Language Response Suggestion for Smart Reply},
author={Matthew Henderson and Rami Al-Rfou and Brian Strope and Yun-hsuan Sung and Laszlo Lukacs and Ruiqi Guo and Sanjiv Kumar and Balint Miklos and Ray Kurzweil},
year={2017},
eprint={1705.00652},
archivePrefix={arXiv},
primaryClass={cs.CL}
}