EMOTR：一种高效端到端多目标跟踪框架

doi:10.7690/bgzdh.2026.05.020

首页 > 过刊浏览>2026年第45卷第05期 >. DOI:10.7690/bgzdh.2026.05.020

EMOTR：一种高效端到端多目标跟踪框架
DOI:
                        10.7690/bgzdh.2026.05.020
                    
CSTR:
                        
                    
作者:
                        
                        
                    
作者单位:
作者简介:
通讯作者:
中图分类号:
基金项目:

EMOTR: An End-to-end Framework for Efficient and Accurate Multi-object Tracking

Author:

Affiliation:

Fund Project:

摘要

图/表

访问统计

参考文献

相似文献

引证文献

资源附件

文章评论

摘要:

为解决Transformer端到端多目标跟踪(end-to-end multi-object tracking with transformer，MOTR)自注意力计算量随分辨率二次增长，导致训练慢、显存占用高、批量小等问题，提出一种高效端到端多目标跟踪(efficient MOTR，EMOTR)方法。在不改变MOTR范式的前提下，快速多尺度注意力(fast multi-scale attention，FMA)以I/O友好的注意力内核配合多尺度特征融合，降低计算与存储开销，同时提升小目标分辨；时空批处理与解码器权重共享，实现变长序列同批训练并减少约15%参数；自动混合精度(automatic mixed precision，AMP)结合动态Loss Scaling，充分释放Tensor Core吞吐。基于VisDrone2019的实验结果表明：相较原始MOTR，训练时间下降80.4%、参数下降15.5%，MOTA提升2.1至24.9且IDF1保持稳定，验证了在不牺牲精度的前提下显著提升端到端MOT实用性的可能性。

Abstract:

To address the issues of slow training, high memory usage, and small batch sizes caused by the quadratic growth of self-attention computation with resolution in Transformer-based end-to-end multi-object tracking (MOTR), an efficient MOTR (EMOTR) method is proposed. Without altering the MOTR paradigm, fast multi-scale attention (FMA) utilizes an I/O-friendly attention kernel combined with multi-scale feature fusion to reduce computational and storage overhead while enhancing small object resolution. Spatio-temporal batch processing and decoder weight sharing enable variable-length sequence training within the same batch and reduce approximately 15% of parameters. Automatic mixed precision (AMP) combined with dynamic Loss Scaling fully utilizes Tensor Core throughput. Experimental results based on VisDrone2019 show that compared to the original MOTR, training time is reduced by 80.4%, parameters are decreased by 15.5%, MOTA is improved by 2.1 to 24.9, and IDF1 remains stable, verifying the possibility of significantly enhancing the practicality of end-to-end MOT without sacrificing accuracy.

参考文献

相似文献

引证文献

引用本文

傅文鸿. EMOTR：一种高效端到端多目标跟踪框架[J].,2026,45(05).

复制

文章指标

点击次数:
下载次数:
HTML阅读次数:
引用次数:

历史

收稿日期:2024-12-11
最后修改日期:2025-01-25
录用日期:
在线发布日期: 2026-05-26
出版日期:

首页

期刊介绍

新闻公告

相关下载

投稿指南

刊庆动态

期刊订阅

广告指南

English

引用本文

分享

相关视频

文章指标

历史

文章二维码