[{"data":1,"prerenderedAt":-1},["ShallowReactive",2],{"project-71944":3},{"id":4,"name":5,"fullName":6,"owner":7,"repo":5,"description":8,"homepage":9,"htmlUrl":10,"language":11,"languages":10,"totalLinesOfCode":10,"stars":12,"forks":13,"watchers":14,"openIssues":15,"contributorsCount":16,"subscribersCount":16,"size":16,"stars1d":16,"stars7d":17,"stars30d":18,"stars90d":16,"forks30d":16,"starsTrendScore":16,"compositeScore":19,"rankGlobal":10,"rankLanguage":10,"license":20,"archived":21,"fork":21,"defaultBranch":22,"hasWiki":23,"hasPages":21,"topics":24,"createdAt":10,"pushedAt":10,"updatedAt":25,"readmeContent":26,"aiSummary":27,"trendingCount":16,"starSnapshotCount":16,"syncStatus":17,"lastSyncTime":28,"discoverSource":29},71944,"yolov10","THU-MIG\u002Fyolov10","THU-MIG","YOLOv10: Real-Time End-to-End Object Detection [NeurIPS 2024] ","https:\u002F\u002Farxiv.org\u002Fabs\u002F2405.14458",null,"Python",11319,1179,59,277,0,2,23,72.52,"GNU Affero General Public License v3.0",false,"main",true,[],"2026-06-12 04:01:02","## Latest Updates -- [YOLOE: Real-Time Seeing Anything](https:\u002F\u002Fgithub.com\u002FTHU-MIG\u002Fyoloe)\n\nPlease check out our new release on [**YOLOE**](https:\u002F\u002Fgithub.com\u002FTHU-MIG\u002Fyoloe).\n\n* YOLOE code: https:\u002F\u002Fgithub.com\u002FTHU-MIG\u002Fyoloe\n* YOLOE paper: https:\u002F\u002Farxiv.org\u002Fabs\u002F2503.07465\n\n \u003Cp align=\"center\">\n  \u003Cimg src=\"https:\u002F\u002Fgithub.com\u002FTHU-MIG\u002Fyoloe\u002Fblob\u002Fmain\u002Ffigures\u002Fcomparison.svg\" width=70%> \u003Cbr>\n  Comparison of performance, training cost, and inference efficiency between YOLOE (Ours) and YOLO-Worldv2 in terms of open text prompts.\n\u003C\u002Fp>\n\n**YOLOE(ye)** is a highly **efficient**, **unified**, and **open** object detection and segmentation model for real-time seeing anything, like human eye, under different prompt mechanisms, like *texts*, *visual inputs*, and *prompt-free paradigm*, with **zero inference and transferring overhead** compared with closed-set YOLOs.\n\n\u003Cp align=\"center\">\n  \u003Cimg src=\"https:\u002F\u002Fgithub.com\u002FTHU-MIG\u002Fyoloe\u002Fblob\u002Fmain\u002Ffigures\u002Fvisualization.svg\" width=96%> \u003Cbr>\n\u003C\u002Fp>\n\n\u003Cdetails>\n  \u003Csummary>\n  \u003Cfont size=\"+1\">Abstract\u003C\u002Ffont>\n  \u003C\u002Fsummary>\nObject detection and segmentation are widely employed in computer vision applications, yet conventional models like YOLO series, while efficient and accurate, are limited by predefined categories, hindering adaptability in open scenarios. Recent open-set methods leverage text prompts, visual cues, or prompt-free paradigm to overcome this, but often compromise between performance and efficiency due to high computational demands or deployment complexity. In this work, we introduce YOLOE, which integrates detection and segmentation across diverse open prompt mechanisms within a single highly efficient model, achieving real-time seeing anything. For text prompts, we propose Re-parameterizable Region-Text Alignment (RepRTA) strategy. It refines pretrained textual embeddings via a re-parameterizable lightweight auxiliary network and enhances visual-textual alignment with zero inference and transferring overhead. For visual prompts, we present Semantic-Activated Visual Prompt Encoder (SAVPE). It employs decoupled semantic and activation branches to bring improved visual embedding and accuracy with minimal complexity. For prompt-free scenario, we introduce Lazy Region-Prompt Contrast (LRPC) strategy. It utilizes a built-in large vocabulary and specialized embedding to identify all objects, avoiding costly language model dependency. Extensive experiments show YOLOE's exceptional zero-shot performance and transferability with high inference efficiency and low training cost. Notably, on LVIS, with $3\\times$ less training cost and $1.4\\times$ inference speedup, YOLOE-v8-S surpasses YOLO-Worldv2-S by 3.5 AP. When transferring to COCO, YOLOE-v8-L achieves 0.6 $AP^b$ and 0.4 $AP^m$ gains over closed-set YOLOv8-L with nearly $4\\times$ less training time.\n\u003C\u002Fdetails>\n\u003Cp>\u003C\u002Fp>\n\u003Cp align=\"center\">\n  \u003Cimg src=\"https:\u002F\u002Fgithub.com\u002FTHU-MIG\u002Fyoloe\u002Fblob\u002Fmain\u002Ffigures\u002Fpipeline.svg\" width=96%> \u003Cbr>\n\u003C\u002Fp>\n\n\n# [YOLOv10: Real-Time End-to-End Object Detection](https:\u002F\u002Farxiv.org\u002Fabs\u002F2405.14458)\n\n\nOfficial PyTorch implementation of **YOLOv10**. NeurIPS 2024.\n\n\u003Cp align=\"center\">\n  \u003Cimg src=\"figures\u002Flatency.svg\" width=48%>\n  \u003Cimg src=\"figures\u002Fparams.svg\" width=48%> \u003Cbr>\n  Comparisons with others in terms of latency-accuracy (left) and size-accuracy (right) trade-offs.\n\u003C\u002Fp>\n\n[YOLOv10: Real-Time End-to-End Object Detection](https:\u002F\u002Farxiv.org\u002Fabs\u002F2405.14458).\\\nAo Wang, Hui Chen, Lihao Liu, Kai Chen, Zijia Lin, Jungong Han, and Guiguang Ding\\\n[![arXiv](https:\u002F\u002Fimg.shields.io\u002Fbadge\u002FarXiv-2405.14458-b31b1b.svg)](https:\u002F\u002Farxiv.org\u002Fabs\u002F2405.14458) \u003Ca href=\"https:\u002F\u002Fcolab.research.google.com\u002Fgithub\u002Froboflow-ai\u002Fnotebooks\u002Fblob\u002Fmain\u002Fnotebooks\u002Ftrain-yolov10-object-detection-on-custom-dataset.ipynb#scrollTo=SaKTSzSWnG7s\">\u003Cimg src=\"https:\u002F\u002Fcolab.research.google.com\u002Fassets\u002Fcolab-badge.svg\" alt=\"Open In Colab\">\u003C\u002Fa> [![Hugging Face Spaces](https:\u002F\u002Fimg.shields.io\u002Fbadge\u002F%F0%9F%A4%97%20Hugging%20Face-Models-blue)](https:\u002F\u002Fhuggingface.co\u002Fcollections\u002Fjameslahm\u002Fyolov10-665b0d90b0b5bb85129460c2) [![Hugging Face Spaces](https:\u002F\u002Fimg.shields.io\u002Fbadge\u002F%F0%9F%A4%97%20Hugging%20Face-Spaces-blue)](https:\u002F\u002Fhuggingface.co\u002Fspaces\u002Fjameslahm\u002FYOLOv10)  [![Hugging Face Spaces](https:\u002F\u002Fimg.shields.io\u002Fbadge\u002F%F0%9F%A4%97%20Hugging%20Face-Spaces-blue)](https:\u002F\u002Fhuggingface.co\u002Fspaces\u002Fkadirnar\u002FYolov10)  [![Transformers.js Demo](https:\u002F\u002Fimg.shields.io\u002Fbadge\u002F%F0%9F%A4%97%20Hugging%20Face-Transformers.js-blue)](https:\u002F\u002Fhuggingface.co\u002Fspaces\u002FXenova\u002Fyolov10-web) [![LearnOpenCV](https:\u002F\u002Fimg.shields.io\u002Fbadge\u002FBlogPost-blue?logo=data%3Aimage%2Fpng%3Bbase64%2CiVBORw0KGgoAAAANSUhEUgAAAAoAAAAKCAMAAAC67D%2BPAAAALVBMVEX%2F%2F%2F%2F%2F%2F%2F%2F%2F%2F%2F%2F%2F%2F%2F%2F%2F%2F%2F%2F%2F%2F%2F%2F%2F%2F%2F%2F%2F%2F%2F%2F6%2Bfn6%2Bvq3y%2BJ8rOFSne9Jm%2FQcOlr5DJ7GAAAAB3RSTlMAB2LM94H1yMxlvwAAADNJREFUCFtjZGAEAob%2FQMDIyAJl%2FmFkYmEGM%2F%2F%2BYWRmYWYCMv8BmSxYmUgKkLQhGYawAgApySgfFDPqowAAAABJRU5ErkJggg%3D%3D&logoColor=black&labelColor=gray)](https:\u002F\u002Flearnopencv.com\u002Fyolov10\u002F) [![Openbayes Demo](https:\u002F\u002Fimg.shields.io\u002Fstatic\u002Fv1?label=Demo&message=OpenBayes%E8%B4%9D%E5%BC%8F%E8%AE%A1%E7%AE%97&color=green)](https:\u002F\u002Fopenbayes.com\u002Fconsole\u002Fpublic\u002Ftutorials\u002Fim29uYrnIoz) \n\n\n\u003Cdetails>\n  \u003Csummary>\n  \u003Cfont size=\"+1\">Abstract\u003C\u002Ffont>\n  \u003C\u002Fsummary>\nOver the past years, YOLOs have emerged as the predominant paradigm in the field of real-time object detection owing to their effective balance between computational cost and detection performance. Researchers have explored the architectural designs, optimization objectives, data augmentation strategies, and others for YOLOs, achieving notable progress. However, the reliance on the non-maximum suppression (NMS) for post-processing hampers the end-to-end deployment of YOLOs and adversely impacts the inference latency. Besides, the design of various components in YOLOs lacks the comprehensive and thorough inspection, resulting in noticeable computational redundancy and limiting the model's capability. It renders the suboptimal efficiency, along with considerable potential for performance improvements. In this work, we aim to further advance the performance-efficiency boundary of YOLOs from both the post-processing and the model architecture. To this end, we first present the consistent dual assignments for NMS-free training of YOLOs, which brings the competitive performance and low inference latency simultaneously. Moreover, we introduce the holistic efficiency-accuracy driven model design strategy for YOLOs. We comprehensively optimize various components of YOLOs from both the efficiency and accuracy perspectives, which greatly reduces the computational overhead and enhances the capability. The outcome of our effort is a new generation of YOLO series for real-time end-to-end object detection, dubbed YOLOv10. Extensive experiments show that YOLOv10 achieves the state-of-the-art performance and efficiency across various model scales. For example, our YOLOv10-S is 1.8$\\times$ faster than RT-DETR-R18 under the similar AP on COCO, meanwhile enjoying 2.8$\\times$ smaller number of parameters and FLOPs. Compared with YOLOv9-C, YOLOv10-B has 46\\% less latency and 25\\% fewer parameters for the same performance.\n\u003C\u002Fdetails>\n\n## Notes\n- 2024\u002F05\u002F31: Please use the [exported format](https:\u002F\u002Fgithub.com\u002FTHU-MIG\u002Fyolov10?tab=readme-ov-file#export) for benchmark. In the non-exported format, e.g., pytorch, the speed of YOLOv10 is biased because the unnecessary `cv2` and `cv3` operations in the `v10Detect` are executed during inference.\n- 2024\u002F05\u002F30: We provide [some clarifications and suggestions](https:\u002F\u002Fgithub.com\u002FTHU-MIG\u002Fyolov10\u002Fissues\u002F136) for detecting smaller objects or objects in the distance with YOLOv10. Thanks to [SkalskiP](https:\u002F\u002Fgithub.com\u002FSkalskiP)!\n- 2024\u002F05\u002F27: We have updated the [checkpoints](https:\u002F\u002Fhuggingface.co\u002Fcollections\u002Fjameslahm\u002Fyolov10-665b0d90b0b5bb85129460c2) with class names, for ease of use.\n\n## UPDATES 🔥\n- 2024\u002F06\u002F01: Thanks to [ErlanggaYudiPradana](https:\u002F\u002Fgithub.com\u002Frlggyp) for the integration with [C++ | OpenVINO | OpenCV](https:\u002F\u002Fgithub.com\u002Frlggyp\u002FYOLOv10-OpenVINO-CPP-Inference)\n- 2024\u002F06\u002F01: Thanks to [NielsRogge](https:\u002F\u002Fgithub.com\u002FNielsRogge) and [AK](https:\u002F\u002Fx.com\u002F_akhaliq) for hosting the models on the HuggingFace Hub!\n- 2024\u002F05\u002F31: Build [yolov10-jetson](https:\u002F\u002Fgithub.com\u002FSeeed-Projects\u002Fjetson-examples\u002Fblob\u002Fmain\u002FreComputer\u002Fscripts\u002Fyolov10\u002FREADME.md) docker image by [youjiang](https:\u002F\u002Fgithub.com\u002Fyuyoujiang)!\n- 2024\u002F05\u002F31: Thanks to [mohamedsamirx](https:\u002F\u002Fgithub.com\u002Fmohamedsamirx) for the integration with [BoTSORT, DeepOCSORT, OCSORT, HybridSORT, ByteTrack, StrongSORT using BoxMOT library](https:\u002F\u002Fcolab.research.google.com\u002Fdrive\u002F1-QV2TNfqaMsh14w5VxieEyanugVBG14V?usp=sharing)!\n- 2024\u002F05\u002F31: Thanks to [kaylorchen](https:\u002F\u002Fgithub.com\u002Fkaylorchen) for the integration with [rk3588](https:\u002F\u002Fgithub.com\u002Fkaylorchen\u002Frk3588-yolo-demo)!\n- 2024\u002F05\u002F30: Thanks to [eaidova](https:\u002F\u002Fgithub.com\u002Feaidova) for the integration with [OpenVINO™](https:\u002F\u002Fgithub.com\u002Fopenvinotoolkit\u002Fopenvino_notebooks\u002Fblob\u002F0ba3c0211bcd49aa860369feddffdf7273a73c64\u002Fnotebooks\u002Fyolov10-optimization\u002Fyolov10-optimization.ipynb)!\n- 2024\u002F05\u002F29: Add the gradio demo for running the models locally. Thanks to [AK](https:\u002F\u002Fx.com\u002F_akhaliq)!\n- 2024\u002F05\u002F27: Thanks to [sujanshresstha](sujanshresstha) for the integration with [DeepSORT](https:\u002F\u002Fgithub.com\u002Fsujanshresstha\u002FYOLOv10_DeepSORT.git)!\n- 2024\u002F05\u002F26: Thanks to [CVHub520](https:\u002F\u002Fgithub.com\u002FCVHub520) for the integration into [X-AnyLabeling](https:\u002F\u002Fgithub.com\u002FCVHub520\u002FX-AnyLabeling)!\n- 2024\u002F05\u002F26: Thanks to [DanielSarmiento04](https:\u002F\u002Fgithub.com\u002FDanielSarmiento04) for integrate in [c++ | ONNX | OPENCV](https:\u002F\u002Fgithub.com\u002FDanielSarmiento04\u002Fyolov10cpp)!\n- 2024\u002F05\u002F25: Add [Transformers.js demo](https:\u002F\u002Fhuggingface.co\u002Fspaces\u002FXenova\u002Fyolov10-web) and onnx weights(yolov10[n](https:\u002F\u002Fhuggingface.co\u002Fonnx-community\u002Fyolov10n)\u002F[s](https:\u002F\u002Fhuggingface.co\u002Fonnx-community\u002Fyolov10s)\u002F[m](https:\u002F\u002Fhuggingface.co\u002Fonnx-community\u002Fyolov10m)\u002F[b](https:\u002F\u002Fhuggingface.co\u002Fonnx-community\u002Fyolov10b)\u002F[l](https:\u002F\u002Fhuggingface.co\u002Fonnx-community\u002Fyolov10l)\u002F[x](https:\u002F\u002Fhuggingface.co\u002Fonnx-community\u002Fyolov10x)). Thanks to [xenova](https:\u002F\u002Fgithub.com\u002Fxenova)!\n- 2024\u002F05\u002F25: Add [colab demo](https:\u002F\u002Fcolab.research.google.com\u002Fgithub\u002Froboflow-ai\u002Fnotebooks\u002Fblob\u002Fmain\u002Fnotebooks\u002Ftrain-yolov10-object-detection-on-custom-dataset.ipynb#scrollTo=SaKTSzSWnG7s), [HuggingFace Demo](https:\u002F\u002Fhuggingface.co\u002Fspaces\u002Fkadirnar\u002FYolov10), and [HuggingFace Model Page](https:\u002F\u002Fhuggingface.co\u002Fkadirnar\u002FYolov10). Thanks to [SkalskiP](https:\u002F\u002Fgithub.com\u002FSkalskiP) and [kadirnar](https:\u002F\u002Fgithub.com\u002Fkadirnar)! \n\n## Performance\nCOCO\n\n| Model | Test Size | #Params | FLOPs | AP\u003Csup>val\u003C\u002Fsup> | Latency |\n|:---------------|:----:|:---:|:--:|:--:|:--:|\n| [YOLOv10-N](https:\u002F\u002Fhuggingface.co\u002Fjameslahm\u002Fyolov10n) |   640  |     2.3M    |   6.7G   |     38.5%     | 1.84ms |\n| [YOLOv10-S](https:\u002F\u002Fhuggingface.co\u002Fjameslahm\u002Fyolov10s) |   640  |     7.2M    |   21.6G  |     46.3%     | 2.49ms |\n| [YOLOv10-M](https:\u002F\u002Fhuggingface.co\u002Fjameslahm\u002Fyolov10m) |   640  |     15.4M   |   59.1G  |     51.1%     | 4.74ms |\n| [YOLOv10-B](https:\u002F\u002Fhuggingface.co\u002Fjameslahm\u002Fyolov10b) |   640  |     19.1M   |  92.0G |     52.5%     | 5.74ms |\n| [YOLOv10-L](https:\u002F\u002Fhuggingface.co\u002Fjameslahm\u002Fyolov10l) |   640  |     24.4M   |  120.3G   |     53.2%     | 7.28ms |\n| [YOLOv10-X](https:\u002F\u002Fhuggingface.co\u002Fjameslahm\u002Fyolov10x) |   640  |     29.5M    |   160.4G   |     54.4%     | 10.70ms |\n\n## Installation\n`conda` virtual environment is recommended. \n```\nconda create -n yolov10 python=3.9\nconda activate yolov10\npip install -r requirements.txt\npip install -e .\n```\n## Demo\n```\npython app.py\n# Please visit http:\u002F\u002F127.0.0.1:7860\n```\n\n## Validation\n[`yolov10n`](https:\u002F\u002Fhuggingface.co\u002Fjameslahm\u002Fyolov10n)  [`yolov10s`](https:\u002F\u002Fhuggingface.co\u002Fjameslahm\u002Fyolov10s)  [`yolov10m`](https:\u002F\u002Fhuggingface.co\u002Fjameslahm\u002Fyolov10m)  [`yolov10b`](https:\u002F\u002Fhuggingface.co\u002Fjameslahm\u002Fyolov10b)  [`yolov10l`](https:\u002F\u002Fhuggingface.co\u002Fjameslahm\u002Fyolov10l)  [`yolov10x`](https:\u002F\u002Fhuggingface.co\u002Fjameslahm\u002Fyolov10x)  \n```\nyolo val model=jameslahm\u002Fyolov10{n\u002Fs\u002Fm\u002Fb\u002Fl\u002Fx} data=coco.yaml batch=256\n```\n\nOr\n```python\nfrom ultralytics import YOLOv10\n\nmodel = YOLOv10.from_pretrained('jameslahm\u002Fyolov10{n\u002Fs\u002Fm\u002Fb\u002Fl\u002Fx}')\n# or\n# wget https:\u002F\u002Fgithub.com\u002FTHU-MIG\u002Fyolov10\u002Freleases\u002Fdownload\u002Fv1.1\u002Fyolov10{n\u002Fs\u002Fm\u002Fb\u002Fl\u002Fx}.pt\nmodel = YOLOv10('yolov10{n\u002Fs\u002Fm\u002Fb\u002Fl\u002Fx}.pt')\n\nmodel.val(data='coco.yaml', batch=256)\n```\n\n\n## Training \n```\nyolo detect train data=coco.yaml model=yolov10n\u002Fs\u002Fm\u002Fb\u002Fl\u002Fx.yaml epochs=500 batch=256 imgsz=640 device=0,1,2,3,4,5,6,7\n```\n\nOr\n```python\nfrom ultralytics import YOLOv10\n\nmodel = YOLOv10()\n# If you want to finetune the model with pretrained weights, you could load the \n# pretrained weights like below\n# model = YOLOv10.from_pretrained('jameslahm\u002Fyolov10{n\u002Fs\u002Fm\u002Fb\u002Fl\u002Fx}')\n# or\n# wget https:\u002F\u002Fgithub.com\u002FTHU-MIG\u002Fyolov10\u002Freleases\u002Fdownload\u002Fv1.1\u002Fyolov10{n\u002Fs\u002Fm\u002Fb\u002Fl\u002Fx}.pt\n# model = YOLOv10('yolov10{n\u002Fs\u002Fm\u002Fb\u002Fl\u002Fx}.pt')\n\nmodel.train(data='coco.yaml', epochs=500, batch=256, imgsz=640)\n```\n\n## Push to hub to 🤗\n\nOptionally, you can push your fine-tuned model to the [Hugging Face hub](https:\u002F\u002Fhuggingface.co\u002F) as a public or private model:\n\n```python\n# let's say you have fine-tuned a model for crop detection\nmodel.push_to_hub(\"\u003Cyour-hf-username-or-organization\u002Fyolov10-finetuned-crop-detection\")\n\n# you can also pass `private=True` if you don't want everyone to see your model\nmodel.push_to_hub(\"\u003Cyour-hf-username-or-organization\u002Fyolov10-finetuned-crop-detection\", private=True)\n```\n\n## Prediction\nNote that a smaller confidence threshold can be set to detect smaller objects or objects in the distance. Please refer to [here](https:\u002F\u002Fgithub.com\u002FTHU-MIG\u002Fyolov10\u002Fissues\u002F136) for details.\n```\nyolo predict model=jameslahm\u002Fyolov10{n\u002Fs\u002Fm\u002Fb\u002Fl\u002Fx}\n```\n\nOr\n```python\nfrom ultralytics import YOLOv10\n\nmodel = YOLOv10.from_pretrained('jameslahm\u002Fyolov10{n\u002Fs\u002Fm\u002Fb\u002Fl\u002Fx}')\n# or\n# wget https:\u002F\u002Fgithub.com\u002FTHU-MIG\u002Fyolov10\u002Freleases\u002Fdownload\u002Fv1.1\u002Fyolov10{n\u002Fs\u002Fm\u002Fb\u002Fl\u002Fx}.pt\nmodel = YOLOv10('yolov10{n\u002Fs\u002Fm\u002Fb\u002Fl\u002Fx}.pt')\n\nmodel.predict()\n```\n\n## Export\n```\n# End-to-End ONNX\nyolo export model=jameslahm\u002Fyolov10{n\u002Fs\u002Fm\u002Fb\u002Fl\u002Fx} format=onnx opset=13 simplify\n# Predict with ONNX\nyolo predict model=yolov10n\u002Fs\u002Fm\u002Fb\u002Fl\u002Fx.onnx\n\n# End-to-End TensorRT\nyolo export model=jameslahm\u002Fyolov10{n\u002Fs\u002Fm\u002Fb\u002Fl\u002Fx} format=engine half=True simplify opset=13 workspace=16\n# or\ntrtexec --onnx=yolov10n\u002Fs\u002Fm\u002Fb\u002Fl\u002Fx.onnx --saveEngine=yolov10n\u002Fs\u002Fm\u002Fb\u002Fl\u002Fx.engine --fp16\n# Predict with TensorRT\nyolo predict model=yolov10n\u002Fs\u002Fm\u002Fb\u002Fl\u002Fx.engine\n```\n\nOr\n```python\nfrom ultralytics import YOLOv10\n\nmodel = YOLOv10.from_pretrained('jameslahm\u002Fyolov10{n\u002Fs\u002Fm\u002Fb\u002Fl\u002Fx}')\n# or\n# wget https:\u002F\u002Fgithub.com\u002FTHU-MIG\u002Fyolov10\u002Freleases\u002Fdownload\u002Fv1.1\u002Fyolov10{n\u002Fs\u002Fm\u002Fb\u002Fl\u002Fx}.pt\nmodel = YOLOv10('yolov10{n\u002Fs\u002Fm\u002Fb\u002Fl\u002Fx}.pt')\n\nmodel.export(...)\n```\n\n## Acknowledgement\n\nThe code base is built with [ultralytics](https:\u002F\u002Fgithub.com\u002Fultralytics\u002Fultralytics) and [RT-DETR](https:\u002F\u002Fgithub.com\u002Flyuwenyu\u002FRT-DETR).\n\nThanks for the great implementations! \n\n## Citation\n\nIf our code or models help your work, please cite our paper:\n```BibTeX\n@article{wang2024yolov10,\n  title={YOLOv10: Real-Time End-to-End Object Detection},\n  author={Wang, Ao and Chen, Hui and Liu, Lihao and Chen, Kai and Lin, Zijia and Han, Jungong and Ding, Guiguang},\n  journal={arXiv preprint arXiv:2405.14458},\n  year={2024}\n}\n```\n","YOLOv10是一个面向实时端到端目标检测的模型，其最新版本YOLOE在开放场景下实现了高效、统一且开放的目标检测与分割。核心功能包括通过文本、视觉输入及无提示机制实现零推理和迁移开销的对象识别，其中针对文本提示提出了可重参数化的区域-文本对齐策略（RepRTA），对于视觉提示则设计了语义激活的视觉提示编码器（SAVPE），而在无提示情况下引入了懒惰区域-提示对比策略（LRPC）。这些技术使得YOLOE能够在保持高性能的同时大幅降低训练成本并提高推理效率，特别适用于需要快速响应多种类型对象识别的应用场景，如自动驾驶、安防监控等。","2026-06-11 03:39:38","high_star"]