rkosti commited on
Commit
3c43d5f
·
verified ·
1 Parent(s): 2ae8161

Upload folder using huggingface_hub

Browse files
Files changed (6) hide show
  1. LICENSE.md +66 -0
  2. README.md +105 -5
  3. TERMS_OF_USE.md +99 -0
  4. config.json +32 -0
  5. model.safetensors +3 -0
  6. preprocessor_config.json +31 -0
LICENSE.md ADDED
@@ -0,0 +1,66 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ # DINOv3 License
2
+
3
+ *Last Updated: August 19, 2025*
4
+
5
+ **“Agreement”** means the terms and conditions for use, reproduction, distribution and modification of the DINO Materials set forth herein.
6
+
7
+ **“DINO Materials”** means, collectively, Documentation and the models, software and algorithms, including machine-learning model code, trained model weights, inference-enabling code, training-enabling code, fine-tuning enabling code, and other elements of the foregoing distributed by Meta and made available under this Agreement.
8
+
9
+ **“Documentation”** means the specifications, manuals and documentation accompanying
10
+ DINO Materials distributed by Meta.
11
+
12
+ **“Licensee”** or **“you”** means you, or your employer or any other person or entity (if you are entering into this Agreement on such person or entity’s behalf), of the age required under applicable laws, rules or regulations to provide legal consent and that has legal authority to bind your employer or such other person or entity if you are entering in this Agreement on their behalf.
13
+
14
+ **“Meta”** or **“we”** means Meta Platforms Ireland Limited (if you are located in or, if you are an entity, your principal place of business is in the EEA or Switzerland) or Meta Platforms, Inc. (if you are located outside of the EEA or Switzerland).
15
+
16
+ **“Sanctions”** means any economic or trade sanctions or restrictions administered or enforced by the United States (including the Office of Foreign Assets Control of the U.S. Department of the Treasury (“OFAC”), the U.S. Department of State and the U.S. Department of Commerce), the United Nations, the European Union, or the United Kingdom.
17
+
18
+ **“Trade Controls”** means any of the following: Sanctions and applicable export and import controls.
19
+
20
+ By clicking “I Accept” below or by using or distributing any portion or element of the DINO Materials, you agree to be bound by this Agreement.
21
+
22
+ ## 1. License Rights and Redistribution.
23
+
24
+ a. <ins>Grant of Rights</ins>. You are granted a non-exclusive, worldwide, non-transferable and royalty-free limited license under Meta’s intellectual property or other rights owned by Meta embodied in the DINO Materials to use, reproduce, distribute, copy, create derivative works of, and make modifications to the DINO Materials.
25
+
26
+ b. <ins>Redistribution and Use</ins>.
27
+
28
+ i. Distribution of DINO Materials, and any derivative works thereof, are subject to the terms of this Agreement. If you distribute or make the DINO Materials, or any derivative works thereof, available to a third party, you may only do so under the terms of this Agreement and you shall provide a copy of this Agreement with any such DINO Materials.
29
+
30
+ ii. If you submit for publication the results of research you perform on, using, or otherwise in connection with DINO Materials, you must acknowledge the use of DINO Materials in your publication.
31
+
32
+ iii. Your use of the DINO Materials must comply with applicable laws and regulations, including Trade Control Laws and applicable privacy and data protection laws.
33
+
34
+ iv. Your use of the DINO Materials will not involve or encourage others to reverse engineer, decompile or discover the underlying components of the DINO Materials.
35
+
36
+ v. You are not the target of Trade Controls and your use of DINO Materials must comply with Trade Controls. You agree not to use, or permit others to use, DINO Materials for any activities subject to the International Traffic in Arms Regulations (ITAR) or end uses prohibited by Trade Controls, including those related to military or warfare purposes, nuclear industries or applications, espionage, or the development or use of guns or illegal weapons.
37
+
38
+ ## 2. User Support.
39
+
40
+ Your use of the DINO Materials is done at your own discretion; Meta does not process any information nor provide any service in relation to such use. Meta is under no obligation to provide any support services for the DINO Materials. Any support provided is “as is”, “with all faults”, and without warranty of any kind.
41
+
42
+ ## 3. Disclaimer of Warranty.
43
+
44
+ UNLESS REQUIRED BY APPLICABLE LAW, THE DINO MATERIALS AND ANY OUTPUT AND RESULTS THEREFROM ARE PROVIDED ON AN “AS IS” BASIS, WITHOUT WARRANTIES OF ANY KIND, AND META DISCLAIMS ALL WARRANTIES OF ANY KIND, BOTH EXPRESS AND IMPLIED, INCLUDING, WITHOUT LIMITATION, ANY WARRANTIES OF TITLE, NON-INFRINGEMENT, MERCHANTABILITY, OR FITNESS FOR A PARTICULAR PURPOSE. YOU ARE SOLELY RESPONSIBLE FOR DETERMINING THE APPROPRIATENESS OF USING OR REDISTRIBUTING THE DINO MATERIALS AND ASSUME ANY RISKS ASSOCIATED WITH YOUR USE OF THE DINO MATERIALS AND ANY OUTPUT AND RESULTS.
45
+
46
+ ## 4. Limitation of Liability.
47
+
48
+ IN NO EVENT WILL META OR ITS AFFILIATES BE LIABLE UNDER ANY THEORY OF LIABILITY, WHETHER IN CONTRACT, TORT, NEGLIGENCE, PRODUCTS LIABILITY, OR OTHERWISE, ARISING OUT OF THIS AGREEMENT, FOR ANY LOST PROFITS OR ANY DIRECT OR INDIRECT, SPECIAL, CONSEQUENTIAL, INCIDENTAL, EXEMPLARY OR PUNITIVE DAMAGES, EVEN IF META OR ITS AFFILIATES HAVE BEEN ADVISED OF THE POSSIBILITY OF ANY OF THE FOREGOING.
49
+
50
+ ## 5. Intellectual Property.
51
+
52
+ a. Subject to Meta’s ownership of DINO Materials and derivatives made by or for Meta, with respect to any derivative works and modifications of the DINO Materials that are made by you, as between you and Meta, you are and will be the owner of such derivative works and modifications.
53
+
54
+ b. If you institute litigation or other proceedings against Meta or any entity (including a cross-claim or counterclaim in a lawsuit) alleging that the DINO Materials, outputs or results, or any portion of any of the foregoing, constitutes infringement of intellectual property or other rights owned or licensable by you, then any licenses granted to you under this Agreement shall terminate as of the date such litigation or claim is filed or instituted. You will indemnify and hold harmless Meta from and against any claim by any third party arising out of or related to your use or distribution of the DINO Materials.
55
+
56
+ ## 6. Term and Termination.
57
+
58
+ The term of this Agreement will commence upon your acceptance of this Agreement or access to the DINO Materials and will continue in full force and effect until terminated in accordance with the terms and conditions herein. Meta may terminate this Agreement if you are in breach of any term or condition of this Agreement. Upon termination of this Agreement, you shall delete and cease use of the DINO Materials. Sections 3, 4 and 7 shall survive the termination of this Agreement.
59
+
60
+ ## 7. Governing Law and Jurisdiction.
61
+
62
+ This Agreement will be governed and construed under the laws of the State of California without regard to choice of law principles, and the UN Convention on Contracts for the International Sale of Goods does not apply to this Agreement. The courts of California shall have exclusive jurisdiction of any dispute arising out of this Agreement.
63
+
64
+ ## 8. Modifications and Amendments.
65
+
66
+ Meta may modify this Agreement from time to time; provided that they are similar in spirit to the current version of the Agreement, but may differ in detail to address new problems or concerns. All such changes will be effective immediately. Your continued use of the DINO Materials after any modification to this Agreement constitutes your agreement to such modification. Except as provided in this Agreement, no modification or addition to any provision of this Agreement will be binding unless it is in writing and signed by an authorized representative of both you and Meta.
README.md CHANGED
@@ -1,5 +1,105 @@
1
- ---
2
- license: other
3
- license_name: dinov3-license
4
- license_link: https://ai.meta.com/resources/models-and-libraries/dinov3-license/
5
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ language:
3
+ - en
4
+ tags:
5
+ - dino
6
+ - dinov3
7
+ - arxiv:2508.10104
8
+ license: other
9
+ license_name: dinov3-license
10
+ license_link: https://ai.meta.com/resources/models-and-libraries/dinov3-license
11
+ base_model: dinov3-vitl16-pretrain-lvd1689m
12
+ pipeline_tag: image-feature-extraction
13
+ library_name: transformers
14
+ ---
15
+ ## License and commercial use
16
+
17
+ This model redistributes DINO Materials under the [DINOv3 License Agreement](LICENSE.md). Commercial use is permitted provided you comply with that agreement and with applicable export and trade control laws. Full terms: [LICENSE.md](LICENSE.md), [TERMS_OF_USE.md](TERMS_OF_USE.md).
18
+
19
+ # DINOv3 ViT-L/16
20
+
21
+ Vision backbone for dense visual features (ViT-L, patch 16). Built with DINOv3.
22
+
23
+
24
+ # Model Card
25
+
26
+ This repository hosts **DINOv3 ViT-L/16** pretrained on LVD-1689M: a Vision Transformer (ViT-L, patch size 16) distilled from the DINOv3 ViT-7B teacher. It produces dense visual features suitable for classification, retrieval, segmentation, and other vision tasks without fine-tuning.
27
+
28
+ ## Model Details
29
+
30
+ This model takes an image as input and returns a class token, patch tokens, and register tokens. For a 224×224 image: 1 class token + 4 register tokens + 196 patch tokens = 201 tokens. Inputs can be larger provided dimensions are multiples of 16; otherwise the image is cropped to the nearest smaller multiple.
31
+
32
+ ### Model Description
33
+
34
+ - **Original model:** Meta AI (DINOv3)
35
+ - **Model type:** Vision Transformer (ViT-L/16)
36
+ - **License:** [DINOv3 License](LICENSE.md)
37
+
38
+ ### Model Sources
39
+
40
+ - **Repository:** [https://github.com/facebookresearch/dinov3](https://github.com/facebookresearch/dinov3)
41
+ - **Paper:** [https://arxiv.org/abs/2508.10104](https://arxiv.org/abs/2508.10104)
42
+
43
+ ## Uses
44
+
45
+ This model is a vision backbone providing multi-purpose features for downstream tasks.
46
+
47
+ ### Direct Use
48
+
49
+ The model can be used without fine-tuning, with downstream classifiers as simple as linear layers, to obtain competitive results:
50
+
51
+ - on image classification, using k-NN classifiers on the class token
52
+ - on image classification, with logistic regression classifiers applied on the class token
53
+ - on image classification, with a linear layer applied on the class token and the average of the patch tokens
54
+ - on image retrieval using nearest neighbors
55
+ - on geometric and semantic 3D keypoint correspondances
56
+ - on depth estimation, semantic segmentation, using linear layers
57
+ - on unsupervised object discovery
58
+ - on video segmentation tracking
59
+ - on video classification, using a small 4-layer attentive probe
60
+
61
+ ### Downstream Use
62
+
63
+ Fine-tuning can yield additional gains but is optional; frozen features are typically strong out-of-the-box.
64
+
65
+ ## Bias, Risks, and Limitations
66
+
67
+ Compared to DINOv2 and SEERv2, DINOv3 delivers somewhat consistent performance across income categories on geographical fairness and diversity, although with a notable performance drop in the low-income bucket compared to the highest-income bucket.
68
+
69
+ DINOv3 also achieves relatively good scores across different regions, improving over its predecessor DINOv2. However, a relative difference is still observed between Europe and Africa.
70
+
71
+ ## Evaluation
72
+
73
+ Representative results for **DINOv3 ViT-L/16** (LVD-1689M) from the paper:
74
+
75
+ | Model | IN-ReaL | IN-R | Obj.Net | Ox.-H | ADE20k | NYU↓ | DAVIS | NAVI | SPair |
76
+ |-------|---------|------|---------|-------|--------|------|-------|------|-------|
77
+ | DINOv3 ViT-L/16 | 90.2 | 88.1 | 74.8 | 63.1 | 54.9 | 0.352 | 79.9 | 62.3 | 61.3 |
78
+
79
+ See the [paper](https://arxiv.org/abs/2508.10104) for evaluation protocols and full benchmarks.
80
+
81
+ ## Technical Specifications
82
+
83
+ - **Architecture:** ViT-L (300M parameters), patch size 16, embedding dimension 1024, 4 register tokens, 16 heads, MLP FFN, RoPE
84
+
85
+ ## More Information
86
+
87
+ More on DINOv3: [blog](https://ai.meta.com/blog/dinov3-self-supervised-vision-model/), [project page](https://ai.meta.com/dinov3/).
88
+
89
+ ## Citation
90
+
91
+ **BibTeX**
92
+
93
+ ```
94
+ @misc{simeoni2025dinov3,
95
+ title={{DINOv3}},
96
+ author={Sim{\'e}oni, Oriane and Vo, Huy V. and Seitzer, Maximilian and Baldassarre, Federico and Oquab, Maxime and Jose, Cijo and Khalidov, Vasil and Szafraniec, Marc and Yi, Seungeun and Ramamonjisoa, Micha{\"e}l and Massa, Francisco and Haziza, Daniel and Wehrstedt, Luca and Wang, Jianyuan and Darcet, Timoth{\'e}e and Moutakanni, Th{\'e}o and Sentana, Leonel and Roberts, Claire and Vedaldi, Andrea and Tolan, Jamie and Brandt, John and Couprie, Camille and Mairal, Julien and J{\'e}gou, Herv{\'e} and Labatut, Patrick and Bojanowski, Piotr},
97
+ year={2025},
98
+ eprint={2508.10104},
99
+ archivePrefix={arXiv},
100
+ primaryClass={cs.CV},
101
+ url={https://arxiv.org/abs/2508.10104},
102
+ }
103
+ ```
104
+
105
+ DINOv3 by Meta. Use subject to the [DINOv3 License](LICENSE.md).
TERMS_OF_USE.md ADDED
@@ -0,0 +1,99 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ # Terms of Use
2
+
3
+ **Effective Date:** 13.Feb.2026
4
+
5
+ These Terms of Use (“Terms”) govern access to and use of this Hugging Face model repository (the “Service”), including access to and download of the DINOv3 ViT-L/16 model weights and related materials.
6
+
7
+ By accessing or using this Service, you agree to these Terms.
8
+
9
+ ---
10
+
11
+ ## 1. Relationship to DINO License
12
+
13
+ This Service redistributes DINO Materials under the DINOv3 License Agreement.
14
+
15
+ Use of the DINO Materials is subject to the DINOv3 License Agreement included in the LICENSE file.
16
+
17
+ In the event of conflict between these Terms and the DINOv3 License, the DINOv3 License governs with respect to DINO Materials.
18
+
19
+ ---
20
+
21
+ ## 2. Attribution
22
+
23
+ This repository hosts the DINOv3 ViT-L/16 model. The DINOv3 model and training methodology are from Meta.
24
+
25
+ ---
26
+
27
+ ## 3. Permitted Use
28
+
29
+ You may use the Service and DINO Materials for lawful commercial and research purposes, subject to:
30
+
31
+ * Compliance with the DINOv3 License Agreement
32
+ * Compliance with all applicable laws and regulations
33
+ * Compliance with export control and trade control laws
34
+
35
+ ---
36
+
37
+ ## 4. Prohibited Uses
38
+
39
+ You may not use the Service or DINO Materials:
40
+
41
+ * For military or warfare applications
42
+ * In connection with weapons systems
43
+ * For nuclear applications
44
+ * For activities subject to ITAR
45
+ * In violation of export control or sanctions laws
46
+ * If you are a person or entity subject to applicable sanctions
47
+
48
+ You are solely responsible for ensuring compliance with all applicable trade and export laws.
49
+
50
+ ---
51
+
52
+ ## 5. No Warranty
53
+
54
+ THE SERVICE AND DINO MATERIALS ARE PROVIDED “AS IS” AND “AS AVAILABLE,” WITHOUT WARRANTIES OF ANY KIND, WHETHER EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE, NON-INFRINGEMENT, OR ACCURACY.
55
+
56
+ You assume all risks associated with use of the Service and any outputs generated.
57
+
58
+ ---
59
+
60
+ ## 6. Limitation of Liability
61
+
62
+ TO THE MAXIMUM EXTENT PERMITTED BY LAW, THE PROVIDER OF THIS SERVICE SHALL NOT BE LIABLE FOR ANY INDIRECT, INCIDENTAL, SPECIAL, CONSEQUENTIAL, EXEMPLARY, OR PUNITIVE DAMAGES, INCLUDING LOSS OF PROFITS, DATA, OR BUSINESS INTERRUPTION.
63
+
64
+ TOTAL LIABILITY SHALL NOT EXCEED ONE HUNDRED U.S. DOLLARS (USD $100).
65
+
66
+ ---
67
+
68
+ ## 7. User Responsibility
69
+
70
+ You are solely responsible for:
71
+
72
+ * Ensuring lawful use of the Service
73
+ * Compliance with export and trade regulations
74
+ * Compliance with intellectual property laws
75
+ * Any downstream redistribution of model weights
76
+
77
+ Once model weights are downloaded, you assume full responsibility for their use and further distribution.
78
+
79
+ ---
80
+
81
+ ## 8. Indemnification
82
+
83
+ You agree to indemnify and hold harmless the provider of this Service from any claims, damages, losses, liabilities, and expenses arising from:
84
+
85
+ * Your use of the Service
86
+ * Your violation of these Terms
87
+ * Your violation of applicable laws or regulations
88
+
89
+ ---
90
+
91
+ ## 9. Termination
92
+
93
+ Access may be revoked at any time for violation of these Terms or applicable laws.
94
+
95
+ ---
96
+
97
+ ## 10. Governing Law
98
+
99
+ These Terms shall be governed by the laws of [Insert Jurisdiction], without regard to conflict of law principles.
config.json ADDED
@@ -0,0 +1,32 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "architectures": [
3
+ "DINOv3ViTModel"
4
+ ],
5
+ "attention_dropout": 0.0,
6
+ "drop_path_rate": 0.0,
7
+ "hidden_act": "gelu",
8
+ "hidden_size": 1024,
9
+ "image_size": 224,
10
+ "initializer_range": 0.02,
11
+ "intermediate_size": 4096,
12
+ "key_bias": false,
13
+ "layer_norm_eps": 1e-05,
14
+ "layerscale_value": 1.0,
15
+ "mlp_bias": true,
16
+ "model_type": "dinov3_vit",
17
+ "num_attention_heads": 16,
18
+ "num_channels": 3,
19
+ "num_hidden_layers": 24,
20
+ "num_register_tokens": 4,
21
+ "patch_size": 16,
22
+ "pos_embed_jitter": null,
23
+ "pos_embed_rescale": 2.0,
24
+ "pos_embed_shift": null,
25
+ "proj_bias": true,
26
+ "query_bias": true,
27
+ "rope_theta": 100.0,
28
+ "torch_dtype": "float32",
29
+ "transformers_version": "4.56.0.dev0",
30
+ "use_gated_mlp": false,
31
+ "value_bias": true
32
+ }
model.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:dcb2e45127cccbf1601e5f42fef165eea275c8e5213197e8dcf3f48822718179
3
+ size 1212559808
preprocessor_config.json ADDED
@@ -0,0 +1,31 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "crop_size": null,
3
+ "data_format": "channels_first",
4
+ "default_to_square": true,
5
+ "device": null,
6
+ "disable_grouping": null,
7
+ "do_center_crop": null,
8
+ "do_convert_rgb": null,
9
+ "do_normalize": true,
10
+ "do_rescale": true,
11
+ "do_resize": true,
12
+ "image_mean": [
13
+ 0.485,
14
+ 0.456,
15
+ 0.406
16
+ ],
17
+ "image_processor_type": "DINOv3ViTImageProcessorFast",
18
+ "image_std": [
19
+ 0.229,
20
+ 0.224,
21
+ 0.225
22
+ ],
23
+ "input_data_format": null,
24
+ "resample": 2,
25
+ "rescale_factor": 0.00392156862745098,
26
+ "return_tensors": null,
27
+ "size": {
28
+ "height": 224,
29
+ "width": 224
30
+ }
31
+ }