[{"data":1,"prerenderedAt":-1},["ShallowReactive",2],{"project-9612":3},{"id":4,"name":5,"fullName":6,"owner":7,"repo":5,"description":8,"homepage":9,"htmlUrl":10,"language":11,"languages":10,"totalLinesOfCode":10,"stars":12,"forks":13,"watchers":14,"openIssues":15,"contributorsCount":16,"subscribersCount":16,"size":16,"stars1d":16,"stars7d":17,"stars30d":18,"stars90d":16,"forks30d":16,"starsTrendScore":19,"compositeScore":20,"rankGlobal":10,"rankLanguage":10,"license":21,"archived":22,"fork":22,"defaultBranch":23,"hasWiki":24,"hasPages":22,"topics":25,"createdAt":10,"pushedAt":10,"updatedAt":33,"readmeContent":34,"aiSummary":35,"trendingCount":16,"starSnapshotCount":16,"syncStatus":36,"lastSyncTime":37,"discoverSource":38},9612,"dvc","treeverse\u002Fdvc","treeverse","🦉 Data Versioning and ML Experiments","https:\u002F\u002Fdvc.org",null,"Python",15667,1303,129,168,0,14,76,5,83.95,"Apache License 2.0",false,"main",true,[26,27,28,29,30,31,32],"ai","data-science","data-version-control","developer-tools","machine-learning","reproducibility","unstructured-data","2026-06-12 04:00:46","|Banner|\n\n`Website \u003Chttps:\u002F\u002Fdvc.org>`_\n• `Docs \u003Chttps:\u002F\u002Fdvc.org\u002Fdoc>`_\n• `Blog \u003Chttp:\u002F\u002Fblog.dataversioncontrol.com>`_\n• `Tutorial \u003Chttps:\u002F\u002Fdvc.org\u002Fdoc\u002Fget-started>`_\n• `Related Technologies \u003Chttps:\u002F\u002Fdvc.org\u002Fdoc\u002Fuser-guide\u002Frelated-technologies>`_\n• `How DVC works`_\n• `VS Code Extension`_\n• `Installation`_\n• `Contributing`_\n• `Community and Support`_\n\n|CI| |Python Version| |Coverage| |VS Code| |DOI|\n\n|PyPI| |PyPI Downloads| |Packages| |Brew| |Conda| |Choco| |Snap|\n\n|\n\n**Data Version Control** or **DVC** is a command line tool and `VS Code Extension`_ to help you develop reproducible machine learning projects:\n\n#. **Version** your data and models.\n   Store them in your cloud storage but keep their version info in your Git repo.\n\n#. **Iterate** fast with lightweight pipelines.\n   When you make changes, only run the steps impacted by those changes.\n\n#. **Track** experiments in your local Git repo (no servers needed).\n\n#. **Compare** any data, code, parameters, model, or performance plots.\n\n#. **Share** experiments and automatically reproduce anyone's experiment.\n\nQuick start\n===========\n\n    Please read our `Command Reference \u003Chttps:\u002F\u002Fdvc.org\u002Fdoc\u002Fcommand-reference>`_ for a complete list.\n\nA common CLI workflow includes:\n\n\n+-----------------------------------+----------------------------------------------------------------------------------------------------+\n| Task                              | Terminal                                                                                           |\n+===================================+====================================================================================================+\n| Track data                        | | ``$ git add train.py params.yaml``                                                               |\n|                                   | | ``$ dvc add images\u002F``                                                                            |\n+-----------------------------------+----------------------------------------------------------------------------------------------------+\n| Connect code and data             | | ``$ dvc stage add -n featurize -d images\u002F -o features\u002F python featurize.py``                     |\n|                                   | | ``$ dvc stage add -n train -d features\u002F -d train.py -o model.p -M metrics.json python train.py`` |\n+-----------------------------------+----------------------------------------------------------------------------------------------------+\n| Make changes and experiment       | | ``$ dvc exp run -n exp-baseline``                                                                |\n|                                   | | ``$ vi train.py``                                                                                |\n|                                   | | ``$ dvc exp run -n exp-code-change``                                                             |\n+-----------------------------------+----------------------------------------------------------------------------------------------------+\n| Compare and select experiments    | | ``$ dvc exp show``                                                                               |\n|                                   | | ``$ dvc exp apply exp-baseline``                                                                 |\n+-----------------------------------+----------------------------------------------------------------------------------------------------+\n| Share code                        | | ``$ git add .``                                                                                  |\n|                                   | | ``$ git commit -m 'The baseline model'``                                                         |\n|                                   | | ``$ git push``                                                                                   |\n+-----------------------------------+----------------------------------------------------------------------------------------------------+\n| Share data and ML models          | | ``$ dvc remote add myremote -d s3:\u002F\u002Fmybucket\u002Fimage_cnn``                                         |\n|                                   | | ``$ dvc push``                                                                                   |\n+-----------------------------------+----------------------------------------------------------------------------------------------------+\n\nHow DVC works\n=============\n\n    We encourage you to read our `Get Started\n    \u003Chttps:\u002F\u002Fdvc.org\u002Fdoc\u002Fget-started>`_ docs to better understand what DVC\n    does and how it can fit your scenarios.\n\nThe closest *analogies* to describe the main DVC features are these:\n\n#. **Git for data**: Store and share data artifacts (like Git-LFS but without a server) and models, connecting them with a Git repository. Data management meets GitOps!\n#. **Makefiles** for ML: Describes how data or model artifacts are built from other data and code in a standard format. Now you can version your data pipelines with Git.\n#. Local **experiment tracking**: Turn your machine into an ML experiment management platform, and collaborate with others using existing Git hosting (Github, Gitlab, etc.).\n\nGit is employed as usual to store and version code (including DVC meta-files as placeholders for data).\nDVC `stores data and model files \u003Chttps:\u002F\u002Fdvc.org\u002Fdoc\u002Fstart\u002Fdata-management>`_ seamlessly in a cache outside of Git, while preserving almost the same user experience as if they were in the repo.\nTo share and back up the *data cache*, DVC supports multiple remote storage platforms - any cloud (S3, Azure, Google Cloud, etc.) or on-premise network storage (via SSH, for example).\n\n|Flowchart|\n\n`DVC pipelines \u003Chttps:\u002F\u002Fdvc.org\u002Fdoc\u002Fstart\u002Fdata-management\u002Fdata-pipelines>`_ (computational graphs) connect code and data together.\nThey specify all steps required to produce a model: input dependencies including code, data, commands to run; and output information to be saved.\n\nLast but not least, `DVC Experiment Versioning \u003Chttps:\u002F\u002Fdvc.org\u002Fdoc\u002Fstart\u002Fexperiments>`_ lets you prepare and run a large number of experiments.\nTheir results can be filtered and compared based on hyperparameters and metrics, and visualized with multiple plots.\n\n.. _`VS Code Extension`:\n\nVS Code Extension\n=================\n\n|VS Code|\n\nTo use DVC as a GUI right from your VS Code IDE, install the `DVC Extension \u003Chttps:\u002F\u002Fmarketplace.visualstudio.com\u002Fitems?itemName=Iterative.dvc>`_ from the Marketplace.\nIt currently features experiment tracking and data management, and more features (data pipeline support, etc.) are coming soon!\n\n|VS Code Extension Overview|\n\n    Note: You'll have to install core DVC on your system separately (as detailed\n    below). The Extension will guide you if needed.\n\nInstallation\n============\n\nThere are several ways to install DVC: in VS Code; using ``snap``, ``choco``, ``brew``, ``conda``, ``pip``; or with an OS-specific package.\nFull instructions are `available here \u003Chttps:\u002F\u002Fdvc.org\u002Fdoc\u002Fget-started\u002Finstall>`_.\n\nSnapcraft (Linux)\n-----------------\n\n|Snap|\n\n.. code-block:: bash\n\n   snap install dvc --classic\n\nThis corresponds to the latest tagged release.\nAdd ``--beta`` for the latest tagged release candidate, or ``--edge`` for the latest ``main`` version.\n\nChocolatey (Windows)\n--------------------\n\n|Choco|\n\n.. code-block:: bash\n\n   choco install dvc\n\nBrew (mac OS)\n-------------\n\n|Brew|\n\n.. code-block:: bash\n\n   brew install dvc\n\nAnaconda (Any platform)\n-----------------------\n\n|Conda|\n\n.. code-block:: bash\n\n   conda install -c conda-forge mamba # installs much faster than conda\n   mamba install -c conda-forge dvc\n\nDepending on the remote storage type you plan to use to keep and share your data, you might need to install optional dependencies: `dvc-s3`, `dvc-azure`, `dvc-gdrive`, `dvc-gs`, `dvc-oss`, `dvc-ssh`.\n\nPyPI (Python)\n-------------\n\n|PyPI|\n\n.. code-block:: bash\n\n   pip install dvc\n\nDepending on the remote storage type you plan to use to keep and share your data, you might need to specify one of the optional dependencies: ``s3``, ``gs``, ``azure``, ``oss``, ``ssh``. Or ``all`` to include them all.\nThe command should look like this: ``pip install 'dvc[s3]'`` (in this case AWS S3 dependencies such as ``boto3`` will be installed automatically).\n\nTo install the development version, run:\n\n.. code-block:: bash\n\n   pip install \"dvc @ git+https:\u002F\u002Fgithub.com\u002Ftreeverse\u002Fdvc\"\n\nPackage (Platform-specific)\n---------------------------\n\n|Packages|\n\nSelf-contained packages for Linux, Windows, and Mac are available.\nThe latest version of the packages can be found on the GitHub `releases page \u003Chttps:\u002F\u002Fgithub.com\u002Ftreeverse\u002Fdvc\u002Freleases>`_.\n\nUbuntu \u002F Debian (deb)\n^^^^^^^^^^^^^^^^^^^^^\n.. code-block:: bash\n\n   sudo wget https:\u002F\u002Fdvc.org\u002Fdeb\u002Fdvc.list -O \u002Fetc\u002Fapt\u002Fsources.list.d\u002Fdvc.list\n   wget -qO - https:\u002F\u002Fdvc.org\u002Fdeb\u002Fiterative.asc | sudo apt-key add -\n   sudo apt update\n   sudo apt install dvc\n\nFedora \u002F CentOS (rpm)\n^^^^^^^^^^^^^^^^^^^^^\n.. code-block:: bash\n\n   sudo wget https:\u002F\u002Fdvc.org\u002Frpm\u002Fdvc.repo -O \u002Fetc\u002Fyum.repos.d\u002Fdvc.repo\n   sudo rpm --import https:\u002F\u002Fdvc.org\u002Frpm\u002Fiterative.asc\n   sudo yum update\n   sudo yum install dvc\n\nContributing\n============\n\nContributions are welcome!\nPlease see our `Contributing Guide \u003Chttps:\u002F\u002Fdvc.org\u002Fdoc\u002Fuser-guide\u002Fcontributing\u002Fcore>`_ for more details.\nThanks to all our contributors!\n\n|Contribs|\n\nCommunity and Support\n=====================\n\n* `Twitter \u003Chttps:\u002F\u002Ftwitter.com\u002FDVCorg>`_\n* `Forum \u003Chttps:\u002F\u002Fdiscuss.dvc.org\u002F>`_\n* `Discord Chat \u003Chttps:\u002F\u002Fdvc.org\u002Fchat>`_\n* `Email \u003Cmailto:support@dvc.org>`_\n* `Mailing List \u003Chttps:\u002F\u002Fdvc.org\u002Fcommunity#subscribe>`_\n\nCopyright\n=========\n\nThis project is distributed under the Apache license version 2.0 (see the LICENSE file in the project root).\n\nBy submitting a pull request to this project, you agree to license your contribution under the Apache license version 2.0 to this project.\n\nCitation\n========\n\n|DOI|\n\nIterative, *DVC: Data Version Control - Git for Data & Models* (2020)\n`DOI:10.5281\u002Fzenodo.012345 \u003Chttps:\u002F\u002Fdoi.org\u002F10.5281\u002Fzenodo.3677553>`_.\n\nBarrak, A., Eghan, E.E. and Adams, B. `On the Co-evolution of ML Pipelines and Source Code - Empirical Study of DVC Projects \u003Chttps:\u002F\u002Fmcis.cs.queensu.ca\u002Fpublications\u002F2021\u002Fsaner.pdf>`_ , in Proceedings of the 28th IEEE International Conference on Software Analysis, Evolution, and Reengineering, SANER 2021. Hawaii, USA.\n\n\n.. |Banner| image:: https:\u002F\u002Fdvc.org\u002Fimg\u002Flogo-github-readme.png\n   :target: https:\u002F\u002Fdvc.org\n   :alt: DVC logo\n\n.. |VS Code Extension Overview| image:: https:\u002F\u002Fraw.githubusercontent.com\u002Ftreeverse\u002Fvscode-dvc\u002Fmain\u002Fextension\u002Fdocs\u002Foverview.gif\n   :alt: DVC Extension for VS Code\n\n.. |CI| image:: https:\u002F\u002Fgithub.com\u002Ftreeverse\u002Fdvc\u002Factions\u002Fworkflows\u002Ftests.yaml\u002Fbadge.svg\n   :target: https:\u002F\u002Fgithub.com\u002Ftreeverse\u002Fdvc\u002Factions\u002Fworkflows\u002Ftests.yaml\n   :alt: GHA Tests\n\n.. |Python Version| image:: https:\u002F\u002Fimg.shields.io\u002Fpypi\u002Fpyversions\u002Fdvc\n   :target: https:\u002F\u002Fpypi.org\u002Fproject\u002Fdvc\n   :alt: Python Version\n\n.. |Coverage| image:: https:\u002F\u002Fcodecov.io\u002Fgh\u002Ftreeverse\u002Fdvc\u002Fbranch\u002Fmain\u002Fgraph\u002Fbadge.svg\n   :target: https:\u002F\u002Fcodecov.io\u002Fgh\u002Ftreeverse\u002Fdvc\n   :alt: Codecov\n\n.. |Snap| image:: https:\u002F\u002Fimg.shields.io\u002Fbadge\u002Fsnap-install-82BEA0.svg?logo=snapcraft\n   :target: https:\u002F\u002Fsnapcraft.io\u002Fdvc\n   :alt: Snapcraft\n\n.. |Choco| image:: https:\u002F\u002Fimg.shields.io\u002Fchocolatey\u002Fv\u002Fdvc?label=choco\n   :target: https:\u002F\u002Fchocolatey.org\u002Fpackages\u002Fdvc\n   :alt: Chocolatey\n\n.. |Brew| image:: https:\u002F\u002Fimg.shields.io\u002Fhomebrew\u002Fv\u002Fdvc?label=brew\n   :target: https:\u002F\u002Fformulae.brew.sh\u002Fformula\u002Fdvc\n   :alt: Homebrew\n\n.. |Conda| image:: https:\u002F\u002Fanaconda.org\u002Fconda-forge\u002Fdvc\u002Fbadges\u002Fversion.svg\n   :target: https:\u002F\u002Fanaconda.org\u002Fconda-forge\u002Fdvc\n   :alt: Conda-forge\n\n.. |PyPI| image:: https:\u002F\u002Fimg.shields.io\u002Fpypi\u002Fv\u002Fdvc.svg?label=pip&logo=PyPI&logoColor=white\n   :target: https:\u002F\u002Fpypi.org\u002Fproject\u002Fdvc\n   :alt: PyPI\n\n.. |PyPI Downloads| image:: https:\u002F\u002Fimg.shields.io\u002Fpypi\u002Fdm\u002Fdvc.svg?color=blue&label=Downloads&logo=pypi&logoColor=gold\n   :target: https:\u002F\u002Fpypi.org\u002Fproject\u002Fdvc\n   :alt: PyPI Downloads\n\n.. |Packages| image:: https:\u002F\u002Fimg.shields.io\u002Fbadge\u002Fdeb|pkg|rpm|exe-blue\n   :target: https:\u002F\u002Fdvc.org\u002Fdoc\u002Finstall\n   :alt: deb|pkg|rpm|exe\n\n.. |DOI| image:: https:\u002F\u002Fimg.shields.io\u002Fbadge\u002FDOI-10.5281\u002Fzenodo.3677553-blue.svg\n   :target: https:\u002F\u002Fdoi.org\u002F10.5281\u002Fzenodo.3677553\n   :alt: DOI\n\n.. |Flowchart| image:: https:\u002F\u002Fdvc.org\u002Fimg\u002Fflow.gif\n   :target: https:\u002F\u002Fdvc.org\u002Fimg\u002Fflow.gif\n   :alt: how_dvc_works\n\n.. |Contribs| image:: https:\u002F\u002Fcontrib.rocks\u002Fimage?repo=treeverse\u002Fdvc\n   :target: https:\u002F\u002Fgithub.com\u002Ftreeverse\u002Fdvc\u002Fgraphs\u002Fcontributors\n   :alt: Contributors\n\n.. |VS Code| image:: https:\u002F\u002Fimg.shields.io\u002Fvisual-studio-marketplace\u002Fv\u002FIterative.dvc?color=blue&label=VSCode&logo=visualstudiocode&logoColor=blue\n   :target: https:\u002F\u002Fmarketplace.visualstudio.com\u002Fitems?itemName=Iterative.dvc\n   :alt: VS Code Extension\n","DVC（Data Version Control）是一个用于开发可重现机器学习项目的命令行工具及VS Code扩展。它支持数据和模型版本控制，允许用户将这些资源存储于云端的同时，在Git仓库中保留版本信息；通过轻量级管道快速迭代，仅重新运行受更改影响的部分步骤；在本地Git仓库中跟踪实验而无需额外服务器；比较不同数据、代码、参数、模型或性能图表；并分享实验结果以自动重现他人实验。适用于需要高效管理数据版本、构建可复现的机器学习流程以及团队协作共享研究成果的场景。",2,"2026-06-11 03:23:46","top_topic"]