WD MyCloud Python Recovery (Legacy - Maintenance Mode)

⚠️ This tool is in maintenance mode. For new projects, use wd-mycloud-rsync-recovery

Recover and transfer files from a Western Digital (WD) MyCloud device using a Python-based REST SDK approach.

📢 This is the legacy Python-based recovery tool. It remains available for existing users but is no longer actively developed and receives critical bug fixes only.

⚠️ Platform Support: macOS and Linux. Windows has limited support via WSL2 only.

⚠️ Disclaimer: This software is provided "as is" without warranty of any kind. The authors are not responsible for any data loss, corruption, or other issues that may occur. Always maintain backups of your original data before attempting recovery. Use at your own risk.

☕ Support This Project

If this tool saved your data, consider supporting continued development:

GitHub Sponsors: Sponsor @ericchapman80
Buy Me a Coffee: buymeacoffee.com/ericchapman80

🚨 Migration Notice

Recommended: Switch to the modern rsync-based tool for:

✅ Automatic timestamp preservation (no sync_mtime.py needed)
✅ Better resume capability
✅ Lower memory usage (~50 MB vs 2-10 GB)
✅ Real-time progress tracking
✅ Active development and new features

👉 Get the modern tool →

Quick Start (Legacy Tool)

macOS users (install dependencies first):

brew install python@3.12

Setup:

# Standard setup
./setup.sh

# Activate virtual environment
source venv/bin/activate

# Run preflight analysis
python preflight.py /path/to/source /path/to/dest

# Run recovery
python restsdk_public.py --db index.db --filedir /source --dumpdir /dest

# For --low-memory mode, sync timestamps afterward
python sync_mtime.py --db index.db --filedir /source --dumpdir /dest

Features

Multi-threaded file recovery via WD MyCloud REST SDK
Memory-optimized mode (--low-memory)
Resume capability with path-based matching
Symlink-based deduplication
Metadata validation tools
Preflight system analysis with thread recommendations

Key CLI Options

| Option | Description |
| --- | --- |
| `--resume` | Resume a previous run (regenerates log from destination) |
| `--low-memory` | Reduce RAM usage ~40% (disables mtime preservation) |
| `--thread-count N` | Number of threads (default: CPU count) |
| `--preserve-mtime` | Set destination mtime from DB timestamps (default: on) |
| `--sanitize-pipes` | Replace `\|` with `-` for Windows/NTFS/SMB targets |
| `--io-buffer-size N` | Buffer size for manual buffered copies (default: 0) |
| `--io-max-concurrency N` | Limit concurrent disk I/O (default: 0 = no cap) |
| `--preflight` | Run system analysis before copying |

Low-Memory Mode

For systems with limited RAM (< 16GB) or very large file databases (500K+ files):

python restsdk_public.py \
    --db=/path/to/index.db \
    --filedir=/path/to/source \
    --dumpdir=/path/to/dest \
    --log_file=copied_file.log \
    --low-memory \
    --thread-count=2 \
    --resume

Memory comparison (500K files):

| Mode | RAM Usage | Preserve mtime |
| --- | --- | --- |
| Normal | ~11GB | ✅ Yes |
| `--low-memory` | ~6-7GB | ❌ No |
| `--low-memory --thread-count=2` | ~5-6GB | ❌ No |

Tools

restsdk_public.py - Main recovery script (Python/REST SDK)
sync_mtime.py - Post-recovery timestamp sync (required for --low-memory)
preflight.py - System analysis and thread recommendations
create_symlink_farm.py - Symlink-based deduplication
mtime_check.py - Metadata validation utility
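
To illustrate what a post-recovery timestamp sync involves, here is a minimal sketch that sets a recovered file's modification time with the standard library's `os.utime`. It is not the code of sync_mtime.py. The function name `apply_mtime` and the assumption that the database stores timestamps as milliseconds since the Unix epoch are both hypothetical; check your own index.db before relying on the unit.

```python
import os
import tempfile


def apply_mtime(path, mtime_ms):
    """Set the file's mtime from a millisecond epoch value (assumed unit).

    The access time is left as it was, so only the modification time changes.
    """
    mtime_s = mtime_ms / 1000
    atime_s = os.stat(path).st_atime
    os.utime(path, (atime_s, mtime_s))


# Demo on a throwaway file so nothing real is touched.
with tempfile.TemporaryDirectory() as tmp:
    demo_file = os.path.join(tmp, "beach.jpg")
    with open(demo_file, "wb") as fh:
        fh.write(b"demo")
    apply_mtime(demo_file, 1_546_300_800_000)  # 2019-01-01 00:00:00 UTC
    restored = int(os.stat(demo_file).st_mtime)

print(restored)
```

A real sync would look up each destination file's timestamp in the database and call something like this once per file.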

Testing

# Run all legacy tests
./run_tests.sh

# Run with coverage
./run_tests.sh html

Test Coverage: 63% (stable baseline)

Why Maintenance Mode?

The modern rsync-based approach (wd-mycloud-rsync-recovery) offers:

Simpler operation (fewer manual steps)
No separate timestamp sync needed
Better performance and reliability
Lower resource usage
Active feature development

This Python tool remains available for:

Existing users with established workflows
Environments where rsync is not available
Specific use cases requiring Python API access

Support

Active Development: wd-mycloud-rsync-recovery
Critical Bugs: Open issues in this repository
Questions: See modern tool documentation

Running Over SSH

For long-running recoveries over SSH, use tmux or screen:

# Start a detachable session
tmux new -s recovery

# Run recovery inside the session
source venv/bin/activate
python restsdk_public.py --resume --db index.db --filedir /source --dumpdir /dest --log_file copied_file.log

# Detach: Ctrl+B then D
# Reattach later: tmux attach -t recovery

Monitoring

While the script runs, monitor progress in another terminal:

# Follow the log
tail -f summary_*.log

# Check copied files count
sqlite3 /path/to/index.db "SELECT COUNT(*) FROM copied_files"

# Check skipped files
sqlite3 /path/to/index.db "SELECT COUNT(*) FROM skipped_files"

# Run the monitor script
nohup ./monitor.sh /path/to/monitor.log 30 > /dev/null 2>&1 &

FAQ

Why do I see "File not found in database" errors?

Files may be missing from the database due to corruption or interrupted operations on the MyCloud device. These are skipped and reported.

How is the database structured?

Main table: Files
contentID: On-disk filename (e.g., a22236cwsmelmd4on2qs2jdf)
name: Original human-readable filename
parentID: Reference to the parent directory, used to reconstruct the full path
Files are stored in directories sharded by the first character of contentID: /files/a/a22236..., /files/b/b12345...
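
The structure above can be sketched in Python as follows. This is an illustration, not the logic of restsdk_public.py: the `id` primary-key column and the helper names `build_path` and `sharded_location` are assumptions, and the in-memory rows are made-up sample data.

```python
import sqlite3


def build_path(conn, file_id):
    """Walk parentID links upward and join the names into a relative path."""
    parts = []
    seen = set()  # guards against cycles in a damaged database
    current = file_id
    while current is not None and current not in seen:
        seen.add(current)
        row = conn.execute(
            "SELECT name, parentID FROM Files WHERE id = ?", (current,)
        ).fetchone()
        if row is None:  # parent missing from the database
            break
        name, parent = row
        parts.append(name)
        current = parent
    return "/".join(reversed(parts))


def sharded_location(content_id):
    """Map a contentID to its shard directory (first character)."""
    return f"/files/{content_id[0]}/{content_id}"


# Demo with an in-memory database and hypothetical rows.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE Files (id TEXT PRIMARY KEY, name TEXT, "
    "parentID TEXT, contentID TEXT)"
)
conn.executemany(
    "INSERT INTO Files VALUES (?, ?, ?, ?)",
    [
        ("root", "Photos", None, None),
        ("d1", "2019", "root", None),
        ("f1", "beach.jpg", "d1", "a22236cwsmelmd4on2qs2jdf"),
    ],
)

print(build_path(conn, "f1"))
print(sharded_location("a22236cwsmelmd4on2qs2jdf"))
```

The `seen` set is there because a corrupted database could contain a parentID loop, which would otherwise keep the walk from ending.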

When to use --sanitize-pipes?

Use it when the destination disallows | in filenames (Windows NTFS/FAT and many SMB shares). Leave it off for Linux and macOS destinations such as ext4 and APFS.
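
As a rough sketch of the substitution this option describes, the function below replaces each pipe character with a hyphen. The name `sanitize_pipes` and the sample path are hypothetical, and the tool's own implementation may handle further cases.

```python
def sanitize_pipes(relative_path: str) -> str:
    """Replace every '|' with '-' so the path is valid on NTFS/FAT/SMB."""
    return relative_path.replace("|", "-")


print(sanitize_pipes("Reports/2020|Q1.pdf"))
```

Two source names that differ only by `|` versus `-` would map to the same destination name, so it is worth checking for collisions when this option is enabled.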

Documentation

Modern Tool (Recommended): wd-mycloud-rsync-recovery

License

See LICENSE file.

Credits

Original script by springfielddatarecovery

Legacy Python approach maintained by @ericchapman80

Modern rsync approach: wd-mycloud-rsync-recovery
