$ datalad clone https://github.com/datalad-datasets/machinelearning-books.git
$ cd machinelearning-books
$ datalad get A.Shashua-Introduction_to_Machine_Learning.pdf
$ datalad drop A.Shashua-Introduction_to_Machine_Learning.pdf
drop(ok): /tmp/machinelearning-books/A.Shashua-Introduction_to_Machine_Learning.pdf (file)
[checking https://arxiv.org/pdf/0904.3664v1.pdf...]
$ datalad clone https://github.com/datalad-datasets/human-connectome-project-openaccess.git
$ cd human-connectome-project-openaccess
$ datalad get -n HCP1200/996782
$ datalad drop --what all HCP1200/996782
# The command operates outside of the to-be-removed dataset!
$ datalad remove -d . machinelearning-books
uninstall(ok): /tmp/machinelearning-books (dataset)
$ datalad create local-dataset
$ cd local-dataset
$ echo "This file content will only exist locally" > local-file.txt
$ datalad save -m "Added a file without remote content availability"
$ datalad drop local-file.txt
drop(error): local-file.txt (file) [unsafe; Could only verify the existence of 0 out of 1 necessary copy;
(Note that these git remotes have annex-ignore set: origin upstream);
(Use --reckless availability to override this check, or adjust numcopies.)]
$ datalad drop local-file.txt --reckless availability
$ datalad remove -d local-dataset
uninstall(error): . (dataset) [to-be-dropped dataset has revisions that are not available at any known
sibling. Use `datalad push --to ...` to push these before dropping the local dataset,
or ignore via `--reckless availability`. Unique revisions: ['main']]
$ datalad remove -d local-dataset --reckless availability
$ datalad create local-dataset
$ cd local-dataset
$ echo "This file content will only exist locally" > local-file.txt
$ datalad save -m "Added a file without remote content availability"
$ rm -rf local-dataset
rm: cannot remove 'local-dataset/.git/annex/objects/Kj/44/MD5E-s42--8f008874ab52d0ff02a5bbd0174ac95e.txt/
MD5E-s42--8f008874ab52d0ff02a5bbd0174ac95e.txt': Permission denied
$ chmod +w -R local-dataset
$ rm -rf local-dataset
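The "Permission denied" error above comes from write protection on the directories that hold annexed content. The following is a minimal sketch in plain shell, without DataLad or git-annex, that imitates this protection and shows why restoring write permission lets the removal succeed. The directory layout (`objects`, `file.txt`) is a hypothetical stand-in, not the real annex structure.

```shell
# Hypothetical stand-in for a write-protected annex object directory.
demo=$(mktemp -d)
mkdir -p "$demo/objects"
echo "content" > "$demo/objects/file.txt"

# Imitate the write protection: drop write permission on directory and file.
chmod -R a-w "$demo/objects"

# As a regular user this removal is refused (root bypasses permission checks).
rm -rf "$demo/objects" 2>/dev/null || echo "removal refused"

# Restore write permission for the owner, then the removal goes through.
chmod -R u+w "$demo"
rm -rf "$demo"
```

For real datasets, `datalad remove` remains the safer route, because it checks content availability before deleting anything.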
How to share your work with others
Repository hosting services, siblings, and datalad push
Local path: ../my-projects/experiment_data
Remote path: myuser@myinstitutes.hcp.system:/home/myuser/my-projects/experiment_data
Hosting service: git@github.com:myuser/experiment_data.git
External special remotes: osf://my-osf-project-id
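Siblings that point to Git repositories are stored as named Git remotes, so the local-path and hosting-service forms can be tried out with plain Git. This sketch assumes a throwaway repository and the hypothetical sibling names `local-sibling` and `github`. It only registers the URLs and does not contact any server. The `osf://` form is left out because it needs an additional helper from a DataLad extension.

```shell
# Throwaway repository to hold the example remotes.
repo=$(mktemp -d)
git init -q "$repo"

# A sibling given as a relative local path.
git -C "$repo" remote add local-sibling ../my-projects/experiment_data

# A sibling on a hosting service, given as an SSH-style URL.
git -C "$repo" remote add github git@github.com:myuser/experiment_data.git

# List the registered remotes together with their URLs.
git -C "$repo" remote -v
```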
See the chapter "Third party infrastructure" for walk-throughs of different services
$ git config --local remote.github.datalad-publish-depends gdrive
# or
$ datalad siblings add --name origin --url git@git.jugit.fzj.de:adswa/experiment-data.git --publish-depends s3
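A publication dependency is an ordinary entry in the repository's Git configuration, so it can be inspected with Git alone. The sketch below assumes a throwaway repository and reuses the sibling names `github` and `gdrive` from the first command above. It only writes and reads the setting; an actual `datalad push` would additionally need both siblings to be configured.

```shell
# Throwaway repository; no sibling is actually contacted.
repo=$(mktemp -d)
git init -q "$repo"

# Record that pushes to "github" should first publish to "gdrive".
git -C "$repo" config --local remote.github.datalad-publish-depends gdrive

# Read the setting back from the repository's .git/config.
depends=$(git -C "$repo" config --local --get remote.github.datalad-publish-depends)
echo "github depends on: $depends"
```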
Special case 1: repositories with annex support
[LIVE DEMO GIN]
Special case 2: Special remotes with repositories
Requires the DataLad extension datalad-osf
Special case 2: Special remotes with repositories
Requires the DataLad extension datalad-next
Special case 3: RIA stores for dataset hosting/backup
Tutorial for large scale, reproducible computation: github.com/psychoinformatics-de/fairly-big-processing-workflow
datalad create-sibling-github
datalad create-sibling-gitlab
datalad create-sibling-gin
datalad create-sibling-gogs
datalad create-sibling
datalad create-sibling-ria
datalad create-sibling-osf
datalad create-sibling-webdav


Next: Let's collaborate!
datalad clone ...
datalad save ...
datalad create-sibling-gin ...
datalad push ...
datalad drop ...
datalad clone ...
datalad save ...
datalad push ...
datalad siblings add ...
datalad update ...
datalad push ...