Interpreted Prediction
DeepSeek's V3 model was trained in about 55 days on roughly 2,000 H800 GPUs at a reported cost of about $5.6 million, whereas training OpenAI's GPT-4 is estimated to have cost between $100 million and several hundred million dollars.