Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
z's picture
17 114

z

Huye2023
·

AI & ML interests

None yet

Recent Activity

upvoted a collection 2 days ago
Nemotron-Pre-Training-Datasets
liked a dataset 2 days ago
nvidia/Nemotron-Pretraining-Dataset-sample
liked a model about 2 months ago
Qwen/Qwen3-Next-80B-A3B-Instruct
View all activity

Organizations

None yet

Collections 2

Transformer模型改进
  • Sigma: Differential Rescaling of Query, Key and Value for Efficient Language Models

    Paper • 2501.13629 • Published Jan 23, 2025 • 48
finance
  • MIGA: Mixture-of-Experts with Group Aggregation for Stock Market Prediction

    Paper • 2410.02241 • Published Oct 3, 2024 • 11
Transformer模型改进
  • Sigma: Differential Rescaling of Query, Key and Value for Efficient Language Models

    Paper • 2501.13629 • Published Jan 23, 2025 • 48
finance
  • MIGA: Mixture-of-Experts with Group Aggregation for Stock Market Prediction

    Paper • 2410.02241 • Published Oct 3, 2024 • 11

models 0

None public yet

datasets 0

None public yet
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs