Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Ultralytics Cloud Issue #1034

Open
1 task done
webdevmatt07 opened this issue Feb 28, 2025 · 3 comments
Open
1 task done

Ultralytics Cloud Issue #1034

webdevmatt07 opened this issue Feb 28, 2025 · 3 comments
Labels
HUB Ultralytics HUB issues info needed More information is required to proceed question Further information is requested

Comments

@webdevmatt07
Copy link

Search before asking

Question

I'm having issues with the Ultralytics Cloud. When training my model I get disconnected often. Sometimes I get billed when it disconnects, sometimes I don't. I would really like to have it consistently train and I'm wondering if I'm doing something wrong.

Here are a few screen shots of what I"m seeing

Image

Image

Image

On other occasions I'll get partially through the training on one server, it will disconnect many times and I'll try a new server and sometimes it completes and sometimes it doesn't.

Thank you for your help.

Additional

No response

@webdevmatt07 webdevmatt07 added the question Further information is requested label Feb 28, 2025
@UltralyticsAssistant UltralyticsAssistant added HUB Ultralytics HUB issues info needed More information is required to proceed labels Feb 28, 2025
@UltralyticsAssistant
Copy link
Member

👋 Hello @webdevmatt07, thank you for reporting this issue with Ultralytics HUB 🚀! We're sorry to hear about the disconnection troubles you're experiencing while training your models on the Ultralytics Cloud. Please take a look at our HUB Docs, which can offer helpful information:

  • Quickstart. Get started training and deploying YOLO models with HUB effortlessly.
  • Models: Training and Exporting. Explore detailed steps for training YOLOv5 and YOLOv8 models on custom datasets and exporting them for deployment.
  • Projects: Creating and Managing. Organize your models into projects for better team collaboration.
  • Inference API. Discover how to use the Inference API to generate predictions in the cloud.

If this is a 🐛 Bug Report, could you please provide the following additional details to help us investigate the issue more effectively?

  1. A detailed description of the steps leading up to the disconnection.
  2. Screenshots of any relevant error messages, logs, or events during the disconnections.
  3. A minimum reproducible example (MRE) if possible. This would allow us to replicate the issue and work on a quicker resolution.

If this is a ❓ Question about configuration or usage, please provide details about your model, dataset, training settings, and any customizations you might have applied.

An Ultralytics engineer will review this and assist you further soon. We try to address all issues as promptly as possible. Thank you for your patience and understanding! 😊

@webdevmatt07
Copy link
Author

I have done the "Bring your own" training on my linux box with success. I wanted to use the Ultralytics cloud for convenience and speed.

There have not been any errors that I can see on the UI, is there a place to see a log that I could share with you.

@pderrenger
Copy link
Member

Thank you for sharing these details about your Ultralytics Cloud Training experience. Let's address this systematically:

  1. Disconnections: Our Cloud Training docs note that training sessions may stop if your account balance drops below $5 during epoch-based training. However, you should always be able to resume from the last checkpoint. The screenshots suggest potential connectivity issues - could you confirm if these disconnections occur at consistent intervals or after specific epochs?

  2. Billing: Charges should only apply for completed epochs (epoch-based training) or elapsed time (timed training). If you're seeing inconsistent deductions, please check your Billing tab for detailed cost breakdowns per session.

  3. Logs: While full training logs aren't currently exposed in the UI, our team can investigate server-side logs. Please email support @webdevmatt07.com with:

    • Exact timestamps of failed sessions
    • Training instance regions used
    • Any patterns you noticed (e.g., specific epoch numbers)
  4. Stability Tip: We recommend using the same instance region for resume operations when possible, as cross-region checkpoints can occasionally introduce latency issues.

The team will prioritize investigating your case - your detailed report with timestamps will help us identify if this is an isolated infrastructure issue or something more systemic. We appreciate your patience as we work to improve Cloud Training reliability. 🚀

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
HUB Ultralytics HUB issues info needed More information is required to proceed question Further information is requested
Projects
None yet
Development

No branches or pull requests

3 participants