Instructions to use tanganke/convnext-base-224_mnist_sgd_batch-size-64_lr-0.01_steps-4000 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use tanganke/convnext-base-224_mnist_sgd_batch-size-64_lr-0.01_steps-4000 with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("image-classification", model="tanganke/convnext-base-224_mnist_sgd_batch-size-64_lr-0.01_steps-4000") pipe("https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/hub/parrots.png")# Load model directly from transformers import AutoImageProcessor, AutoModelForImageClassification processor = AutoImageProcessor.from_pretrained("tanganke/convnext-base-224_mnist_sgd_batch-size-64_lr-0.01_steps-4000") model = AutoModelForImageClassification.from_pretrained("tanganke/convnext-base-224_mnist_sgd_batch-size-64_lr-0.01_steps-4000") - Notebooks
- Google Colab
- Kaggle
| path: | |
| root_dir: ${oc.env:FUSION_BENCH_PROJECT_ROOT,"."} | |
| output_dir: ${.root_dir}/outputs | |
| data_dir: ${oc.env:FUSION_BENCH_DATA_DIR,${.root_dir}/data} | |
| cache_dir: ${oc.env:FUSION_BENCH_CACHE_DIR,${.output_dir}/cache} | |
| log_dir: outputs/convnext-base-224/mnist/batch_size=64,lr=0.01 | |
| work_dir: ${hydra:runtime.cwd} | |
| modelpool: | |
| _target_: fusion_bench.modelpool.ConvNextForImageClassificationPool | |
| _recursive_: false | |
| models: | |
| _pretrained_: | |
| config_path: facebook/convnext-base-224 | |
| pretrained: true | |
| dataset_name: mnist | |
| train_datasets: | |
| mnist: | |
| _target_: datasets.load_dataset | |
| path: mnist | |
| split: train | |
| val_datasets: | |
| mnist: | |
| _target_: datasets.load_dataset | |
| path: mnist | |
| split: test | |
| test_datasets: null | |
| method: | |
| _target_: fusion_bench.method.classification.ImageClassificationFineTuning | |
| max_epochs: -1 | |
| max_steps: 4000 | |
| save_top_k: -1 | |
| save_interval: 1000 | |
| save_on_train_epoch_end: false | |
| training_data_ratio: null | |
| label_smoothing: 0 | |
| optimizer: | |
| _target_: torch.optim.SGD | |
| lr: 0.01 | |
| momentum: 0.9 | |
| weight_decay: 0.0001 | |
| lr_scheduler: null | |
| dataloader_kwargs: | |
| batch_size: 64 | |
| num_workers: 8 | |
| pin_memory: true | |
| taskpool: | |
| _target_: fusion_bench.taskpool.DummyTaskPool | |
| model_save_path: null | |
| _target_: fusion_bench.programs.ModelFusionProgram | |
| _recursive_: false | |
| seed: 0 | |
| fast_dev_run: false | |
| dry_run: false | |
| print_config: true | |
| print_function_call: true | |
| merged_model_save_path: null | |
| merged_model_save_kwargs: null | |
| report_save_path: '{log_dir}/program_report.json' | |