Ambika14's picture
Upload folder using huggingface_hub
7dba9d8 verified
metadata
tags:
  - sentence-transformers
  - sentence-similarity
  - feature-extraction
  - dense
  - generated_from_trainer
  - dataset_size:124
  - loss:CachedMultipleNegativesRankingLoss
base_model: BAAI/bge-base-en-v1.5
widget:
  - source_sentence: >-
      trying to get my domain but don t see any option to claim it. please help
      to claim the domain with jupitravel. issue update of contact details in
      udyam registration certificate context the user is requesting an update of
      the mobile number and email id in the udyam registration certificate.
      details - udyam registration no udyam-mh- <NUM> - <NUM>
    sentences:
      - >-
        Startup, Credit and Finance. Fund of Fund scheme of CGTMSE. the fund of
        fund scheme of cgtmse category encompasses grievances related to the
        administration of the fund of fund scheme by cgtmse which provides
        equity or quasi-equity financing to msmes indirectly through venture
        capital funds aifs or msme-focused investment funds. this category
        covers a range of issues including situations where msmes have received
        in-principle approval or are shortlisted by a cgtmse-backed fund but
        face significant delays or partial disbursement of funds due to internal
        approvals layered decision-making lack of coordination between the fund
        manager and cgtmse eligibility and interpretation disputes where msmes
        are denied funding based on turnover vintage msme definition risk
        perception despite aligning with the scheme s intended objectives
        coordination and transparency issues with vc or fund managers including
        repeated document requests unclear approval authority lack of clarity on
        valuation funding tranches investment conditions poor communication that
        leaves msmes uncertain about timelines and outcomes these grievances are
        rare but high-stakes involving large capital amounts complex
        multi-stakeholder coordination and prolonged resolution cycles that
        often require escalation through the champions portal and intervention
        at senior or ministry level. example issues include delays in
        disbursement of funds despite in-principle approval partial funding
        disbursement due to internal approvals or lack of coordination
        ineligibility under the fund of fund scheme due to turnover criteria or
        other factors repeated document clarifications and lack of transparency
        from the vc fund regarding
      - >-
        Technology, Quality and Institutions. Related to Scheme of KVIC. this
        category encompasses grievances related to schemes subsidies
        certifications and implementation processes administered by the khadi
        village industries commission kvic and its implementing authorities
        including state kvic and district industries centre dic offices. it
        specifically addresses issues that originate from kvic or its
        field-level offices excluding problems solely with banks generic msme
        schemes or non-kvic authorities. the category covers a range of issues
        including <NUM> . delays or failures in the release of pmegp margin
        money subsidies where loans have already been sanctioned and units have
        been set up but kvic has not credited the subsidy to the bank due to
        pending portal actions physical verification delays repeated document
        objections or prolonged under process status without timelines. <NUM> .
        grievances related to khadi subsidies including non-release partial
        release or unexplained reduction of admissible subsidy amounts stoppage
        of subsidy citing non-compliance without sharing inspection reports
        deviations from prescribed scheme norms in determining subsidy
        eligibility or quantum <NUM> . issues related to kvic certification and
        registration including pending or delayed issuance of khadi certificates
        cancellation of certification without prior notice or stated reasons
        inspection-related delays without clarification delayed renewal of
        certificates that directly affect eligibility for subsidies tenders and
        market access subcategories <NUM> . providing financial assistance to
        set up new enterprises under pmegp <NUM> . providing insurance cover to
        khadi artisans under aam admi bima yojana <NUM> . providing financial
        assistance to khadi institutions under mda <NUM> . workshed scheme for
        khadi artisans <NUM> . loans under interest subsidy eligibility
        certificate scheme isec <NUM> . mission solar charkha
      - >-
        UAM/Udyam Registration/Certificate related issues. Issues in Updating
        Latitude and Longitude Details (Technical). this category covers
        grievances related to technical issues encountered while entering or
        updating the latitude and longitude coordinates of the enterprise
        location in the udyam registration system. these coordinates are used to
        identify the geographic location of the enterprise and are sometimes
        required when updating address information or completing registration
        details. grievances under this category usually arise when the portal
        does not accept the latitude and longitude values entered by the user or
        when technical errors prevent the coordinates from being saved. users
        may report that the location detection feature does not function
        properly that the system repeatedly shows errors while entering
        coordinates or that the map interface does not load correctly. in some
        cases entrepreneurs may also face issues when the location selected on
        the map does not match their actual address or when the coordinates fail
        to update despite repeated attempts. these grievances are typically
        raised by msme owners proprietors partners directors or authorized
        representatives who are attempting to update enterprise location details
        in the registration system. small business owners completing
        registration updates themselves may encounter technical difficulties
        while entering location coordinates. similarly consultants accountants
        or administrative staff who assist enterprises with registration or
        profile updates may submit grievances if the portal prevents them from
        completing the required location information due to technical errors.
  - source_sentence: >-
      on behalf of miras engineers i request to update my mobile number to
      <phone_no> and email id to <email_id> as my previous mobile number is
      misplaced and email id is not being used. issue update of mobile number in
      msme udyog aadhaar certificate context the user is requesting an update of
      the mobile number in the msme udyog aadhaar certificate due to the
      original number being lost and a new number being provided. details -
      original mobile no <NUM> new mobile no <NUM>
    sentences:
      - >-
        Policy and Schemes. PM Vishwakarma. the pm vishwakarma category
        encompasses the registration skill certification and benefit disbursal
        processes for artisans and craftspeople. the system aims to provide easy
        registration skill certification toolkit incentives credit support and
        strong market linkage. however operational issues eligibility
        interpretation challenges and bank coordination failures lead to
        breakdowns at the stages of registration certification benefit disbursal
        and bank linkage. common grievance scenarios registration stuck at
        pending verification applicants may experience delays in the
        registration process with applications remaining stuck at pending
        verification for <NUM> days without any response from the local officer.
        aadhaar-based registration failures aadhaar-based registration may fail
        due to occupation mismatch despite the individual being a traditional
        carpenter for <NUM> years. non-receipt of toolkit incentives artisans
        and craftspeople may not receive the toolkit incentive despite
        completing skill training and assessment. bank refusal of pm vishwakarma
        loans banks may refuse to provide pm vishwakarma loans due to unclear
        scheme guidelines. incorrect trade listing trades eligible under the
        scheme may not be listed correctly in the portal s dropdown options.
        operational procedural policy and institutional causes operational
      - >-
        Startup, Credit and Finance. Loans from Banks. this category loans from
        banks encompasses grievances related to access to credit from banks
        where micro small and medium enterprises msmes have applied for loans
        and the bottleneck lies at the bank level. the scope of this category
        includes issues involving commercial banks regional rural banks rrbs and
        cooperative banks. it specifically addresses situations where the
        problem is neither related to rbi policy government scheme design nor
        buyer default but arises from bank-side processing handling or
        decision-making of loan applications. the category captures the
        following scenarios - msmes have submitted loan applications along with
        required documentation complied with bank procedures and followed up
        through branches or portals but the application remains pending without
        a formal decision. - banks keep applications under prolonged under
        process or pending for verification status without issuing deficiency
        letters timelines or written communication. - situations involving
        repeated or circular document demands that effectively stall credit
        access. - grievances where branch-level offices do not forward eligible
        loan applications to regional or head offices. - delays in internal
        approvals. - avoidance of issuing a clear sanction or rejection decision
        despite prolonged engagement. these cases reflect administrative
        stalling rather than informed credit rejection based on risk or
        eligibility. the category includes the following example issues - i
        applied for a term loan under the msme category and submitted all
        documents but the bank has kept the application under process for
        several months without any written update. - my loan application status
        has been showing pending for verification on the bank portal for over
        <NUM> days with no deficiency letter issued. - the bank is repeatedly
        asking for documents that were already submitted causing unnecessary
        delay in loan processing. - the branch is not forwarding
      - >-
        Others. Others. this category includes udyam uam registration grievances
        that cannot be clearly classified under the defined technical
        categories. it covers complaints where the grievance description or
        technical summary is invalid incomplete irrelevant vague or lacks
        sufficient details to identify the specific issue. examples include
        unclear statements such as udyam not working submissions without key
        identifiers like urn or pan queries unrelated to registration processes
        such as scheme eligibility or bank loan inquiries foreign language
        submissions without translation or attachments shared without proper
        explanation. the others category ensures that such unclassifiable
        grievances are not ignored or abandoned. instead they are flagged for
        manual review and preliminary assessment. during this process reviewers
        attempt to understand the issue request additional information if
        necessary and determine whether the grievance can be redirected to a
        relevant category or requires further technical attention. this approach
        helps maintain continuity in grievance handling by allowing submissions
        that do not initially meet classification standards to still enter the
        review system. it also supports data quality by encouraging
        clarification and correction of incomplete inputs. by enabling manual
        triage and follow-up the others category helps ensure that stakeholders
        receive appropriate guidance and that legitimate concerns are eventually
        directed to the correct resolution pathway reducing repeated or
        misclassified submissions.
  - source_sentence: >-
      i lost my mobile number and forgot my mail id and password. please change
      my mobile number in udyog aadhaar to the new number <phone_no> . issue
      rectification of enterprise classification and investment details in udyam
      msme registration certificate context the user on behalf of topline
      commodities private limited is reporting an issue with the enterprise
      classification and investment details in their udyam msme registration
      certificate which is showing as micro instead of medium and incorrectly
      displaying nil figures for <NUM> - <NUM> and <NUM> - <NUM> despite regular
      updates. details - udyam registration no udyam-wb- <NUM> - <NUM> incorrect
      classification micro instead of medium incorrect investment details nil
      for <NUM> - <NUM> and <NUM> - <NUM> correct investment details nil for
      <NUM> - <NUM> and <NUM> - <NUM>
    sentences:
      - >-
        Startup, Credit and Finance. Subordinate debt scheme loan to MSME
        promoters implemented by CGTMSE. the category description pertains to
        the implementation of the subordinate debt scheme loan to msme promoters
        by the credit guarantee fund trust for micro and small enterprises
        cgtmse . the purpose and scope of this category involve addressing user
        expectations for debt support to stressed msmes facilitating easy access
        for promoters and promoting cooperative behaviour from banks due to
        guarantee backing. however the breakdown typically occurs at the level
        of bank interpretation eligibility screening and risk aversion. common
        grievance scenarios include banks refusing subordinate debt loans even
        when the msme is eligible and npa stress guidelines are met branch
        officials being unaware of the scheme and asking for collateral loan
        applications remaining pending with no response for over a month cgtmse
        guarantee coverage not being properly explained by banks promoter
        eligibility being rejected without any written justification these
        grievance scenarios reflect issues related to bank behaviour awareness
        gaps and risk avoidance. the operational procedural policy
      - >-
        Technology, Quality and Institutions. Related to MSME-DFO. this category
        encompasses grievances related to field-level execution failures at msme
        development facilitation offices dfos which are responsible for
        facilitating msme schemes loans subsidies and services. the scope of
        this category includes field-level execution failures non-responsive dfo
        officers failure to provide guidance on documentation or procedures
        inaction on queries submitted through champions or physical visits
        inspection delays or inconsistencies postponed or repeatedly rescheduled
        site visits delayed inspection reports unnecessary multiple inspections
        that stall loan disbursement or subsidy release local facilitation and
        coordination failures misrouting of applications between offices lack of
        facilitation for land or utilities approvals unavailability of promised
        local support services poor coordination between dfos banks psus and
        state nodal officers resulting in projects remaining stuck despite
        eligibility or prior approvals example issues dfo officials not
        responding to phone calls or emails regarding subsidy applications with
        no guidance provided on required documents on-site inspection for msme
        projects pending for several months blocking bank loan disbursement
        inspection scheduled multiple times but cancelled without notice with
        the inspection report still not issued applications being sent from one
        local office to another by the dfo without clear instructions or
        responsibility lack of coordination between dfo and bank delaying loan
        sanction even after project verification operational procedural policy
        or institutional causes inadequate communication and coordination
        between dfos banks psus and state nodal officers inefficient
        documentation and procedure guidance inaction
      - >-
        UAM/Udyam Registration/Certificate related issues. Invalid/Incomplete
        Details Provided During Registration. this category includes grievances
        related to requests for cancellation or deactivation of an existing
        udyam registration. in some cases businesses that were previously
        registered as msmes may no longer operate may have undergone structural
        changes or may have been registered incorrectly. when such situations
        occur the enterprise owner may wish to cancel the existing udyam
        certificate to prevent incorrect records or to allow proper registration
        in the future. grievances under this category typically include requests
        to cancel a registration because the business has permanently closed the
        enterprise was registered by mistake or the registration was created
        with incorrect information. some entrepreneurs also request cancellation
        when duplicate registrations exist for the same enterprise and they want
        only one valid record to remain. another common grievance arises when
        the enterprise was registered earlier under outdated or incorrect
        details and the owner wants the registration cancelled before creating a
        new one with correct information. these grievances are usually raised by
        proprietors partners directors of companies or authorized
        representatives of msmes who are responsible for maintaining the
        official records of the enterprise. small business owners who registered
        their enterprises earlier but later discontinued operations may also
        request cancellation to avoid confusion or misuse of the registration.
        in some cases accountants consultants or compliance officers working on
        behalf of the enterprise may submit the grievance if they identify that
        the existing udyam registration is no longer valid or should be removed
        from the records.
  - source_sentence: >-
      udyam certificate download issue update of official address in udyam
      registration context the user is requesting an update of the official
      address in the udyam registration as the business has shifted from
      maharashtra to gujarat and the user is now residing in gujarat. details -
      udyam registration number udyam-mh- <NUM> - <NUM> current state
      maharashtra desired state gujarat
    sentences:
      - >-
        Startup, Credit and Finance. Schemes implemented by NCGTC for automatic
        loan. the category description outlines the schemes implemented by the
        national credit guarantee trust company ncgtc for automatic or
        guaranteed loans focusing on the grievances raised by micro and small
        business owners existing borrowers of banks or non-banking financial
        companies nbfcs and enterprises seeking emergency or collateral-free
        credit. the description categorizes the grievances into several
        subcategories including <NUM> . bank refusal despite scheme eligibility
        eligible micro small and medium enterprises msmes are denied loans due
        to - branches claiming the scheme is not being offered - lack of
        instructions from higher authorities - rejection of applications despite
        standard and compliant accounts impact msmes are denied access to credit
        despite meeting scheme eligibility criteria. <NUM> . partial or
        incorrect loan sanction enterprises receive amounts far below their
        entitlement due to - incorrect working capital limits - sanctioned
        amounts reduced without justification or explanation impact enterprises
        receive inadequate credit affecting their business operations. <NUM> .
        delays after sanction sanction letters are issued but funds are not
        credited for weeks due to - vague technical problems - disbursements
        held up despite guarantee activation impact enterprises experience
        delayed credit delivery affecting their business operations. <NUM> .
        collateral or additional conditions msmes are subjected to - demands for
        collateral or personal guarantees - forced purchase of insurance or
        third-party financial products impact msmes are burdened with additional
        costs and conditions affecting their credit accessibility. <NUM> .
        incorrect ineligibility tagging msme
      - >-
        UAM/Udyam Registration/Certificate related issues. Existing /
        Unauthorized UDYAM Registration Against PAN. this category includes
        grievances related to updating or correcting the email id or mobile
        number associated with an existing udyam registration. contact details
        provided during registration are used for communication verification and
        authentication when accessing the enterprise profile on the portal. if
        these contact details become outdated incorrect or inaccessible the
        enterprise owner may face difficulty receiving otps accessing the portal
        or managing the registration information. common grievances under this
        category include requests to change the registered mobile number or
        email address because the original number is no longer active the sim
        card has been lost the email account is no longer accessible or the
        contact details were entered incorrectly during registration. some
        complaints arise when the registered contact details belong to an
        employee or consultant who is no longer associated with the enterprise
        preventing the current owner from receiving verification messages. in
        other cases entrepreneurs report that they cannot update contact details
        because the system requires authentication through the old mobile number
        or email which they no longer have access to. these grievances are
        typically raised by msme owners proprietors partners directors of
        companies or authorized representatives responsible for managing
        business registrations. small business owners who registered their
        enterprise personally may request updates when their phone number or
        email changes. in some cases accountants consultants or administrative
        staff handling compliance activities may also submit grievances when
        they cannot access the registration due to outdated contact details.
        this category therefore represents issues related specifically to
        correcting or updating communication details associated with an existing
        udyam certificate.
      - >-
        Policy and Schemes. Related to MSME Scheme. this category encompasses
        grievances related to central sector schemes directly administered by
        the ministry of micro small and medium enterprises momsme where the
        ministry itself serves as the implementing authority. the category
        includes schemes such as zero defect zero effect zed credit linked
        capital subsidy scheme clcss lean manufacturing and other centrally
        managed msme support programs. it covers cases where msmes have applied
        for scheme benefits or subsidies received approvals or completed
        required assessments or certifications but the approved financial
        assistance has not been released or credited. the category also captures
        grievances where claims submitted under ministry-run schemes for
        incentives reimbursements or financial support remain pending for
        extended periods or are rejected without clear or consistent
        justification. this includes cases of rejection due to alleged
        documentation gaps system-generated ineligibility flags disputes over
        eligible machinery or activities and delays caused by human or
        system-level verification failures. additionally the category includes
        grievances arising from ambiguity or confusion regarding scheme
        eligibility scope or applicability such as uncertainty over mandatory
        certifications eligibility of second-hand versus new machinery
        applicability to service enterprises or inconsistent interpretations of
        scheme rules by different central or state offices. the category further
        covers portal-related issues affecting scheme access and execution
        including technical errors during registration or document upload login
        or authentication failures contradictory status messages and
        non-updating dashboards for application claim or training progress.
        these issues typically arise due to system bugs integration gaps between
        multiple portals file format or size restrictions or delays in updating
        portal logic after scheme guideline revisions.
  - source_sentence: >-
      on the udyam registration form my daughter s pan number and name were
      incorrectly updated instead of my pan number and name. the original pan
      number to be updated is <pan_no> and the name is durai singh. issue update
      of contact details in udyam registration certificate context the user is
      requesting an update of the mobile number and email id in the udyam
      registration certificate for efes process equipment pvt ltd. details -
      udyam registration no udyam-ap- <NUM> - <NUM> current mobile no <NUM>
      current email id pramod4holy@gmail.com aadhaar no <NUM> old mobile no
      <NUM> old email id irfansirfan60@gmail.com
    sentences:
      - >-
        Marketing and Skilling. National SC ST HUB. national sc-st hub nssh is a
        central sector scheme launched in <NUM> by the ministry of micro small
        and medium enterprises and implemented by the national small industries
        corporation to empower scheduled caste and scheduled tribe entrepreneurs
        and strengthen their participation in the msme ecosystem. the scheme
        focuses on capacity building market access financial facilitation and
        handholding support while also operationalizing the mandatory <NUM>
        procurement target for sc st owned mses under the public procurement
        policy for mses <NUM> . through a network of national sc-st hub offices
        across the country the hub assists eligible sc st entrepreneurs holding
        at least <NUM> ownership and control in activities such as udyam and gem
        registration participation in government tenders access to credit and
        skill upgradation. financial support is provided in the form of
        reimbursements for testing and certification charges from recognized
        laboratories bank loan processing and bank guarantee fees membership
        fees of export promotion councils onboarding costs for e-commerce and
        government procurement platforms and fees for short-term skill and
        management training programs at reputed institutions. by reducing entry
        barriers and providing structured handholding nssh aims to enhance
        competitiveness ensure inclusive growth and enable sc st entrepreneurs
        to scale up operations and integrate with formal supply chains. examples
        of grievances reported under the scheme include rejection of
        reimbursement claims where testing or certification expenses exceed the
        prescribed financial ceiling despite compliance with quality standards
        blockage of financial assistance due to delays or discrepancies in caste
        certificate verification even when enterprises are otherwise registered
        as sc st-owned instances where sc st msmes fail to secure tenders
        despite the mandated procurement quota because of non-compliance by
        procuring cpses partial reimbursement of approved training or
        capacity-building expenses owing to scheme-specific limits leading to
        out-of-pocket costs for entrepreneurs and gaps in timely support from
        local nssh offices particularly in remote or north-eastern regions
        affecting onboarding to procurement portals and access to scheme
        benefits.
      - >-
        Technology, Quality and Institutions. Manufactruing
        (Chemical/Food/Electrical & Electronics). manufacturing in the chemical
        food electrical and electrical electronics sectors under msme refers to
        sector-focused support provided by the ministry of msme through a
        combination of specialized infrastructure technology upgradation and
        competitiveness schemes. this includes dedicated technology centres for
        activities such as fragrance and flavour development in the chemical
        sector tooling and process development for electrical measuring
        instruments and electronics and esdm-focused prototyping and testing
        facilities under programmes like the technology centre systems programme
        and clcss. food processing msmes are supported through cluster-based
        common facility centres offering shared infrastructure for testing r d
        packaging cold chains and effluent treatment under the mse cluster
        development framework. these sectoral interventions are complemented by
        horizontal schemes such as lean manufacturing zed certification and
        digital msme which help units improve quality sustainability
        productivity and market readiness. together these measures aim to enable
        value-added manufacturing reduce individual investment burdens promote
        compliance with quality and environmental standards and enhance domestic
        as well as export competitiveness across these msme-intensive sectors.
        examples of grievances include technology centre access denial an
        electronics msme seeking advanced esdm testing is denied access at a
        specialized technology centre because available slots are prioritized
        for chemical or fragrance units delaying product validation. clcss
        machinery rejection a food processing unit s modern packaging or
        processing machine is not included in the approved sub-sector or
        machinery list resulting in rejection of the <NUM> capital subsidy
        claim. common facility centre shortfall a chemical manufacturing cluster
        s approved cfc does not include the promised effluent treatment facility
        forcing individual msmes to incur high compliance and disposal costs.
        zed certification scoring dispute a food msme implementing lean
        practices and waste reduction measures receives lower-than-expected
        scores during audit missing bronze certification despite documented
        improvements. lean cluster exclusion a small electrical and electronics
        group with fewer than the required number of units is excluded from lean
        manufacturing cluster support even though the cluster has clear process
        improvement potential.
      - >-
        UAM/Udyam Registration/Certificate related issues. After Cancellation,
        Unable to Register with PAN Details (Technical). this category refers to
        grievances where an entrepreneur is unable to create a new udyam
        registration using their pan after an earlier registration has already
        been cancelled. in such situations the system may continue to recognize
        the pan as already associated with an existing registration preventing
        the user from completing a new registration. grievances under this
        category generally occur when an enterprise previously cancelled its
        registration due to closure incorrect details or duplication and later
        attempts to register again using the same pan. users may report that the
        system still displays a message indicating that a registration already
        exists for that pan even though the earlier registration was cancelled.
        some entrepreneurs also encounter errors where the portal does not allow
        them to proceed with registration because the pan remains linked to the
        previous record. these grievances are commonly raised by business owners
        proprietors partners or company directors attempting to register their
        enterprise again after cancelling an earlier registration. the issue may
        also be reported by authorized representatives compliance managers or
        consultants responsible for completing the msme registration process on
        behalf of the enterprise. such grievances typically arise when the
        system does not update the cancellation status correctly or when
        residual records associated with the pan prevent the new registration
        from being completed.
pipeline_tag: sentence-similarity
library_name: sentence-transformers
metrics:
  - pearson_cosine
  - spearman_cosine
model-index:
  - name: SentenceTransformer based on BAAI/bge-base-en-v1.5
    results:
      - task:
          type: semantic-similarity
          name: Semantic Similarity
        dataset:
          name: Unknown
          type: unknown
        metrics:
          - type: pearson_cosine
            value: .nan
            name: Pearson Cosine
          - type: spearman_cosine
            value: .nan
            name: Spearman Cosine

SentenceTransformer based on BAAI/bge-base-en-v1.5

This is a sentence-transformers model finetuned from BAAI/bge-base-en-v1.5. It maps sentences & paragraphs to a 768-dimensional dense vector space and can be used for semantic textual similarity, semantic search, paraphrase mining, text classification, clustering, and more.

Model Details

Model Description

  • Model Type: Sentence Transformer
  • Base model: BAAI/bge-base-en-v1.5
  • Maximum Sequence Length: 256 tokens
  • Output Dimensionality: 768 dimensions
  • Similarity Function: Cosine Similarity

Model Sources

Full Model Architecture

SentenceTransformer(
  (0): Transformer({'max_seq_length': 256, 'do_lower_case': True, 'architecture': 'BertModel'})
  (1): Pooling({'word_embedding_dimension': 768, 'pooling_mode_cls_token': True, 'pooling_mode_mean_tokens': False, 'pooling_mode_max_tokens': False, 'pooling_mode_mean_sqrt_len_tokens': False, 'pooling_mode_weightedmean_tokens': False, 'pooling_mode_lasttoken': False, 'include_prompt': True})
  (2): Normalize()
)

Usage

Direct Usage (Sentence Transformers)

First install the Sentence Transformers library:

pip install -U sentence-transformers

Then you can load this model and run inference.

from sentence_transformers import SentenceTransformer

# Download from the 🤗 Hub
model = SentenceTransformer("sentence_transformers_model_id")
# Run inference
sentences = [
    'on the udyam registration form my daughter s pan number and name were incorrectly updated instead of my pan number and name. the original pan number to be updated is <pan_no> and the name is durai singh. issue update of contact details in udyam registration certificate context the user is requesting an update of the mobile number and email id in the udyam registration certificate for efes process equipment pvt ltd. details - udyam registration no udyam-ap- <NUM> - <NUM> current mobile no <NUM> current email id pramod4holy@gmail.com aadhaar no <NUM> old mobile no <NUM> old email id irfansirfan60@gmail.com',
    'UAM/Udyam Registration/Certificate related issues. After Cancellation, Unable to Register with PAN Details (Technical). this category refers to grievances where an entrepreneur is unable to create a new udyam registration using their pan after an earlier registration has already been cancelled. in such situations the system may continue to recognize the pan as already associated with an existing registration preventing the user from completing a new registration. grievances under this category generally occur when an enterprise previously cancelled its registration due to closure incorrect details or duplication and later attempts to register again using the same pan. users may report that the system still displays a message indicating that a registration already exists for that pan even though the earlier registration was cancelled. some entrepreneurs also encounter errors where the portal does not allow them to proceed with registration because the pan remains linked to the previous record. these grievances are commonly raised by business owners proprietors partners or company directors attempting to register their enterprise again after cancelling an earlier registration. the issue may also be reported by authorized representatives compliance managers or consultants responsible for completing the msme registration process on behalf of the enterprise. such grievances typically arise when the system does not update the cancellation status correctly or when residual records associated with the pan prevent the new registration from being completed.',
    'Technology, Quality and Institutions. Manufactruing (Chemical/Food/Electrical & Electronics). manufacturing in the chemical food electrical and electrical electronics sectors under msme refers to sector-focused support provided by the ministry of msme through a combination of specialized infrastructure technology upgradation and competitiveness schemes. this includes dedicated technology centres for activities such as fragrance and flavour development in the chemical sector tooling and process development for electrical measuring instruments and electronics and esdm-focused prototyping and testing facilities under programmes like the technology centre systems programme and clcss. food processing msmes are supported through cluster-based common facility centres offering shared infrastructure for testing r d packaging cold chains and effluent treatment under the mse cluster development framework. these sectoral interventions are complemented by horizontal schemes such as lean manufacturing zed certification and digital msme which help units improve quality sustainability productivity and market readiness. together these measures aim to enable value-added manufacturing reduce individual investment burdens promote compliance with quality and environmental standards and enhance domestic as well as export competitiveness across these msme-intensive sectors. examples of grievances include technology centre access denial an electronics msme seeking advanced esdm testing is denied access at a specialized technology centre because available slots are prioritized for chemical or fragrance units delaying product validation. clcss machinery rejection a food processing unit s modern packaging or processing machine is not included in the approved sub-sector or machinery list resulting in rejection of the <NUM> capital subsidy claim. common facility centre shortfall a chemical manufacturing cluster s approved cfc does not include the promised effluent treatment facility forcing individual msmes to incur high compliance and disposal costs. zed certification scoring dispute a food msme implementing lean practices and waste reduction measures receives lower-than-expected scores during audit missing bronze certification despite documented improvements. lean cluster exclusion a small electrical and electronics group with fewer than the required number of units is excluded from lean manufacturing cluster support even though the cluster has clear process improvement potential.',
]
embeddings = model.encode(sentences)
print(embeddings.shape)
# [3, 768]

# Get the similarity scores for the embeddings
similarities = model.similarity(embeddings, embeddings)
print(similarities)
# tensor([[1.0000, 0.7657, 0.4355],
#         [0.7657, 1.0000, 0.5440],
#         [0.4355, 0.5440, 1.0000]])

Evaluation

Metrics

Semantic Similarity

Metric Value
pearson_cosine nan
spearman_cosine nan

Training Details

Training Dataset

Unnamed Dataset

  • Size: 124 training samples
  • Columns: sentence_0 and sentence_1
  • Approximate statistics based on the first 124 samples:
    sentence_0 sentence_1
    type string string
    details
    • min: 51 tokens
    • mean: 143.99 tokens
    • max: 256 tokens
    • min: 181 tokens
    • mean: 252.81 tokens
    • max: 256 tokens
  • Samples:
    sentence_0 sentence_1
    not having register mobile no and email id need to update the mobile number and email id issue invalid incomplete grievance context the grievance text does not contain sufficient or meaningful information to identify an issue related to the msme scheme. details - Technology, Quality and Institutions. Related to Tool Rooms. this category encompasses grievances related to the operational and technical services provided by government-supported msme tool rooms. the scope includes issues with access to machinery prototyping facilities manufacturing support and skill-development or training programs. key areas of concern include unavailability of machine time despite confirmed bookings equipment under maintenance or frequent breakdowns high-demand machines consistently overbooked infrastructure promised for msme production support not accessible when required delays cancellations or poor execution of technical training programs non-availability of trainers or technical experts mismatch between published and actual service fees lack of transparency during machine usage or training delivery these grievances directly impact production timelines project execution and workforce upskilling. they arise from service delivery and operational failures rather t...
    i never applied for udyam registration before but it is showing that it has already been done through my pan. kindly look into this. issue retrieval of udyam registration number and contact details context the user is requesting the udyam registration number and contact details associated with the existing udyam registration in order to obtain the udyam certificate or update the details. details - pan no agtpj3178r aadhar no UAM/Udyam Registration/Certificate related issues. Migration from UAM to UDYAM. this category refers to grievances related to the migration of enterprises registered under the earlier udyog aadhaar memorandum uam system to the current udyam registration system. the uam registration system was used earlier for msme registration but enterprises registered under that system were required to migrate their registration details to the newer udyam portal to maintain updated records. during this migration process some enterprises encounter difficulties in transferring or verifying their existing registration details. grievances under this category typically include issues where business owners are unable to complete the migration process from uam to udyam due to errors or system restrictions. entrepreneurs may report that their uam number is not being recognized by the portal or that the migration process stops due to validation errors related to aadhaar pan or enterprise details. some users a...
    my team needs incubator support for mentoring workspace and early funding to grow our innovative product but this delay is forcing us to shut down. please check my application and release the support fast to save my business.got no response or funding approval past months issue delayed incubator support under nmcp scheme context the user is reporting that the application for incubator support under the nmcp scheme has not been processed or approved within the expected timeframe and is requesting urgent assistance to prevent business shutdown. details - incubator support required mentoring workspace early funding application status no response or funding approval past months Technology, Quality and Institutions. Support for entrepreneurial and managerial development of SMEs through incubators- an NMCP Scheme. the support for entrepreneurial and managerial development of smes through incubators scheme under the national manufacturing competitiveness programme nmcp is an initiative of the ministry of msme designed to nurture innovative technology-driven and knowledge-based ideas by providing structured incubation support through approved business incubators hosted in technical academic or research institutions. under the scheme financial assistance of up to lakh is provided per idea or incubated unit for product development testing validation and commercialisation with an overall ceiling of . lakh per incubator to support up to ventures. in addition host institutions may receive up to . lakh for minor infrastructure and facility upgrades to strengthen incubation capabilities. the scheme follows a tripartite arrangement amo...
  • Loss: CachedMultipleNegativesRankingLoss with these parameters:
    {
        "scale": 20.0,
        "similarity_fct": "cos_sim",
        "mini_batch_size": 32,
        "gather_across_devices": false,
        "directions": [
            "query_to_doc"
        ],
        "partition_mode": "joint",
        "hardness_mode": null,
        "hardness_strength": 0.0
    }
    

Training Hyperparameters

Non-Default Hyperparameters

  • per_device_train_batch_size: 64
  • per_device_eval_batch_size: 64
  • num_train_epochs: 6
  • fp16: True
  • multi_dataset_batch_sampler: round_robin

All Hyperparameters

Click to expand
  • do_predict: False
  • eval_strategy: no
  • prediction_loss_only: True
  • per_device_train_batch_size: 64
  • per_device_eval_batch_size: 64
  • gradient_accumulation_steps: 1
  • eval_accumulation_steps: None
  • torch_empty_cache_steps: None
  • learning_rate: 5e-05
  • weight_decay: 0.0
  • adam_beta1: 0.9
  • adam_beta2: 0.999
  • adam_epsilon: 1e-08
  • max_grad_norm: 1
  • num_train_epochs: 6
  • max_steps: -1
  • lr_scheduler_type: linear
  • lr_scheduler_kwargs: None
  • warmup_ratio: None
  • warmup_steps: 0
  • log_level: passive
  • log_level_replica: warning
  • log_on_each_node: True
  • logging_nan_inf_filter: True
  • enable_jit_checkpoint: False
  • save_on_each_node: False
  • save_only_model: False
  • restore_callback_states_from_checkpoint: False
  • use_cpu: False
  • seed: 42
  • data_seed: None
  • bf16: False
  • fp16: True
  • bf16_full_eval: False
  • fp16_full_eval: False
  • tf32: None
  • local_rank: -1
  • ddp_backend: None
  • debug: []
  • dataloader_drop_last: False
  • dataloader_num_workers: 0
  • dataloader_prefetch_factor: None
  • disable_tqdm: False
  • remove_unused_columns: True
  • label_names: None
  • load_best_model_at_end: False
  • ignore_data_skip: False
  • fsdp: []
  • fsdp_config: {'min_num_params': 0, 'xla': False, 'xla_fsdp_v2': False, 'xla_fsdp_grad_ckpt': False}
  • accelerator_config: {'split_batches': False, 'dispatch_batches': None, 'even_batches': True, 'use_seedable_sampler': True, 'non_blocking': False, 'gradient_accumulation_kwargs': None}
  • parallelism_config: None
  • deepspeed: None
  • label_smoothing_factor: 0.0
  • optim: adamw_torch_fused
  • optim_args: None
  • group_by_length: False
  • length_column_name: length
  • project: huggingface
  • trackio_space_id: trackio
  • ddp_find_unused_parameters: None
  • ddp_bucket_cap_mb: None
  • ddp_broadcast_buffers: False
  • dataloader_pin_memory: True
  • dataloader_persistent_workers: False
  • skip_memory_metrics: True
  • push_to_hub: False
  • resume_from_checkpoint: None
  • hub_model_id: None
  • hub_strategy: every_save
  • hub_private_repo: None
  • hub_always_push: False
  • hub_revision: None
  • gradient_checkpointing: False
  • gradient_checkpointing_kwargs: None
  • include_for_metrics: []
  • eval_do_concat_batches: True
  • auto_find_batch_size: False
  • full_determinism: False
  • ddp_timeout: 1800
  • torch_compile: False
  • torch_compile_backend: None
  • torch_compile_mode: None
  • include_num_input_tokens_seen: no
  • neftune_noise_alpha: None
  • optim_target_modules: None
  • batch_eval_metrics: False
  • eval_on_start: False
  • use_liger_kernel: False
  • liger_kernel_config: None
  • eval_use_gather_object: False
  • average_tokens_across_devices: True
  • use_cache: False
  • prompts: None
  • batch_sampler: batch_sampler
  • multi_dataset_batch_sampler: round_robin
  • router_mapping: {}
  • learning_rate_mapping: {}

Training Logs

Epoch Step spearman_cosine
1.0 2 nan
2.0 4 nan
3.0 6 nan
4.0 8 nan
5.0 10 nan
6.0 12 nan

Framework Versions

  • Python: 3.12.12
  • Sentence Transformers: 5.3.0
  • Transformers: 5.0.0
  • PyTorch: 2.10.0+cu128
  • Accelerate: 1.13.0
  • Datasets: 4.0.0
  • Tokenizers: 0.22.2

Citation

BibTeX

Sentence Transformers

@inproceedings{reimers-2019-sentence-bert,
    title = "Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks",
    author = "Reimers, Nils and Gurevych, Iryna",
    booktitle = "Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing",
    month = "11",
    year = "2019",
    publisher = "Association for Computational Linguistics",
    url = "https://arxiv.org/abs/1908.10084",
}

CachedMultipleNegativesRankingLoss

@misc{gao2021scaling,
    title={Scaling Deep Contrastive Learning Batch Size under Memory Limited Setup},
    author={Luyu Gao and Yunyi Zhang and Jiawei Han and Jamie Callan},
    year={2021},
    eprint={2101.06983},
    archivePrefix={arXiv},
    primaryClass={cs.LG}
}