Shail Shah
shail-2512
AI & ML interests
None yet
Organizations
LLMs
Coder
-
Qwen/Qwen2.5-Coder-32B-Instruct
Text Generation • 33B • Updated • 1.11M • • 2k -
Qwen/Qwen2.5-Coder-7B-Instruct
Text Generation • 8B • Updated • 2.23M • • 684 -
unsloth/Qwen2.5-Coder-32B-Instruct-128K-GGUF
33B • Updated • 2.78k • 75 -
deepseek-ai/DeepSeek-Coder-V2-Instruct
Text Generation • 236B • Updated • 6.64k • 682
Image Generation
3D
Speech Recognition
-
nvidia/canary-1b
Automatic Speech Recognition • Updated • 1.68k • 457 -
facebook/seamless-m4t-v2-large
Automatic Speech Recognition • 2B • Updated • 65.7k • 968 -
nyrahealth/CrisperWhisper
Automatic Speech Recognition • 2B • Updated • 27.7k • 325 -
openai/whisper-large-v3-turbo
Automatic Speech Recognition • 0.8B • Updated • 6.37M • • 2.92k
Reranking Models
ALMs (Audio Language Models)
TTS
Reasoning (LRMs)
VLMs
-
HuggingFaceTB/SmolVLM-Instruct
Image-Text-to-Text • 2B • Updated • 29.6k • 583 -
microsoft/OmniParser
Image-Text-to-Text • Updated • 231 • 1.71k -
vidore/colsmolvlm-v0.1
Visual Document Retrieval • Updated • 61 • 55 -
meta-llama/Llama-3.2-11B-Vision-Instruct
Image-Text-to-Text • 11B • Updated • 204k • 1.58k
Video Generation
Dataset to fine-tune Embeddings
Embedding Models
MultiModal (Any-to-Any)
ALMs (Audio Language Models)
LLMs
TTS
Coder
-
Qwen/Qwen2.5-Coder-32B-Instruct
Text Generation • 33B • Updated • 1.11M • • 2k -
Qwen/Qwen2.5-Coder-7B-Instruct
Text Generation • 8B • Updated • 2.23M • • 684 -
unsloth/Qwen2.5-Coder-32B-Instruct-128K-GGUF
33B • Updated • 2.78k • 75 -
deepseek-ai/DeepSeek-Coder-V2-Instruct
Text Generation • 236B • Updated • 6.64k • 682
Reasoning (LRMs)
Image Generation
VLMs
-
HuggingFaceTB/SmolVLM-Instruct
Image-Text-to-Text • 2B • Updated • 29.6k • 583 -
microsoft/OmniParser
Image-Text-to-Text • Updated • 231 • 1.71k -
vidore/colsmolvlm-v0.1
Visual Document Retrieval • Updated • 61 • 55 -
meta-llama/Llama-3.2-11B-Vision-Instruct
Image-Text-to-Text • 11B • Updated • 204k • 1.58k
3D
Video Generation
Speech Recognition
-
nvidia/canary-1b
Automatic Speech Recognition • Updated • 1.68k • 457 -
facebook/seamless-m4t-v2-large
Automatic Speech Recognition • 2B • Updated • 65.7k • 968 -
nyrahealth/CrisperWhisper
Automatic Speech Recognition • 2B • Updated • 27.7k • 325 -
openai/whisper-large-v3-turbo
Automatic Speech Recognition • 0.8B • Updated • 6.37M • • 2.92k
Dataset to fine-tune Embeddings
Reranking Models
Embedding Models