Baseline CNN RGB + Depth + Side-angle Carbohydrate Regression
This model predicts dish-level carbohydrate content from:
- overhead RGB image
- overhead depth image
- one side-angle RGB image at a time
All available side-angle images are used during training by expanding each dish into multiple samples.
Dataset
Based on Nutrition5K compact preprocessing.
Split counts (rows after side-view expansion)
- Train rows: 17348
- Validation rows: 2652
- Test rows: 3596
Unique dishes
- Train dishes: 2303
- Validation dishes: 334
- Test dishes: 474
Architecture
- RGB overhead CNN branch
- Depth overhead CNN branch
- Side-angle RGB CNN branch
- Feature fusion via concatenation
- Dense regression head
- Target:
total_carb
Training
- Loss: MSE
- Metrics: MAE, MSE
- Optimizer: Adam
- Best checkpoint selected by validation loss
Notes
Each available side-angle image becomes a separate sample while keeping the same overhead RGB and depth image for the corresponding dish.
- Downloads last month
- 6
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support