Anyone using Tensordock GPU instances and having problems with failing VM’s [D]
A researcher’s GPU VM failed, support vanished, and data is at risk.
A Tensordock user, u/Separate-Noise-2589, detailed a frustrating experience with a GPU distributed instance VM (3-tier data center specified in server info). The VM failed to start 2 days ago, despite the user paying monthly for storage to preserve primary storage and valuable research work. Support has been completely unresponsive, with no replies or solutions offered. The user discovered online that even if the disk image exists, there's no option to mount it to a new VM, making data recovery impossible. The only compensation mentioned is a bot promising 40x credits in case of data loss, but the user is unclear on what that means. This has led to anger and a sense of being ripped off, as the service was expected to be reliable but has resulted in potential total data loss.
This incident underscores significant risks for professionals relying on cloud GPU services for critical work. Tensordock's lack of support and inability to recover data from failed VMs could deter users who need consistent uptime and data integrity. The vague credit compensation scheme offers little reassurance, especially for valuable research. Users may want to consider backup strategies or more reliable providers for mission-critical workloads, as this case highlights the dangers of depending on a single cloud service without failover options.
- User's GPU VM failed to start 2 days ago despite ongoing storage payments for research data.
- Support is completely unresponsive, with no replies or solutions offered.
- No option exists to mount disk images to new VMs, risking total data loss; vague 40x credits promised.
- User expresses anger and warns of unreliable service after paying monthly automatically.
Why It Matters
Highlights risks of cloud GPU services for critical research, urging users to prioritize data backup and reliable providers.