T R A C K       P A P E R
Close

Login Panel

Close tab

Password Meter

Volume 12 Issue 03 (March 2025)

S.No. Title & Authors Page No View
1

Title : An Improved Remote Sensing Object Detection Algorithm Based on YOLOv11

Authors : ZiJian Lin

Click Here For Abstract

Download Certificate
Abstract :

To address the multiple challenges in existing remote sensing images detection methods, including insufficient localization accuracy, imprecise category recognition, and high false positive and false negative rates, this paper proposes RS-YOLOv11 (Remote Sensing-YOLOv11), an improved object detection algorithm specifically designed for remote sensing applications based on the YOLOv11 framework.This study introduces the fine-grained SPD-Conv module to optimize backbone network downsampling, effectively preserving feature information and enhancing small object detection. The detection head employs Dynamic Head architecture with integrated multi-dimensional attention mechanisms, significantly improving model performance.To reduce network complexity, Faster_Block replaces the Bottleneck design, decreasing C3K2 module computational cost and addressing YOLOv11 deployment challenges. This improvement achieves lightweight design while maintaining performance and balancing Dynamic Head overhead. Additionally, WIoU loss function replaces CIoU to suppress gradient issues from low-quality images.Experiments on the VisDrone2021 dataset demonstrate that our improved model achieves a 3.9% increase in mAP50 compared to the YOLOv11n baseline, while maintaining comparable computational complexity and parameter efficiency.

1-5
2

Title : CASG:A Conditional Autoregressive Co-Speech Gesture Generation Network for Semantics

Authors : Wangyang Tuo

Click Here For Abstract

Download Certificate
Abstract :

Co-speech gesture generation, a subset of 3D motion generation, aims to generate appropriate motion sequences from audio or other conditions. While many existing methods focus on the rhythm between motion and audio, they often neglect the semantics of gestures. Furthermore, approaches based on diffusion models or Transformer require significant time for training and inference, making them unsuitable for real-time applications. In this paper, we proposed CASG, a network based on the conditional autoregressive model, which effectively enhances the semantics of generated gestures through a semantic enhancement module inspired by VQ-VAE. Additionally, the loss function is improved for 3D rotational and translational transformations of motion sequences, addressing the instability issue in generated models. Extensive experiments demonstrate that our model outperforms competing methods in terms of semantics, rhythm and stability.

6-10
3

Title : An Autism Classification Method based on Deep Neural Network and Attention Mechanism

Authors : XiaoXu Ma, Xiang Guo

Click Here For Abstract

Download Certificate
Abstract :

Autism Spectrum Disorder (ASD) is a developmental disorder whose incidence has been increasing, significantly impacting patients' daily lives. Recent research trends focus on leveraging large-scale, multi-center neuroimaging datasets to enhance clinical applicability and statistical validity. However, the lack of reliable biomarkers and data heterogeneity across datasets limit classification effectiveness. This paper proposes a resting-state functional MRI (rs-fMRI) based approach to improve classification accuracy for ASD by integrating multi-site data.

In this study, a MultiModal Deep Attention (MMDA) model is introduced for ASD identification, which effectively combines rs-fMRI features with demographic characteristics through three modules: feature extraction, feature learning, and deep perception. The feature extraction module uses autoencoders to clean rs-fMRI time series data; the feature learning module employs multi-head attention mechanisms and convolutional neural networks to uncover intrinsic data structures; and the deep perception module integrates multimodal features to produce final ASD classifications. Simulation results demonstrate that the MMDA model outperforms benchmark algorithms in ASD identification.

In summary, this research advances ASD diagnostic accuracy by integrating multimodal data with neural networks, offering a promising tool for objective auxiliary diagnosis and providing new insights into ASD pathomechanisms.

11-15
4

Title : Boundary Adjustment Algorithm Based On Element Triangulation

Authors : Hongchen Zhu, Kaifang Chen

Click Here For Abstract

Download Certificate
Abstract :

To address the issue of edge thickness affecting surface quality and color in 3D printing, particularly when dealing with variations in the thickness of upper and lower surfaces, a surface isometric offset transformation method is employed to obtain the optimal subsurface structure. Based on these offset surfaces, adaptive offset calculations are performed for the exterior surface structure using different offset distances for different surfaces. Subsequently, each surface is iteratively optimized, redesigning the surface structure of the printed part to meet the requirements for surface quality and color depth. The results show that the adaptive offset method for surfaces is superior to the traditional edge line offset method in 3D printing. It provides an effective surface treatment method for color printing, especially under requirements for translucency and light colors, allowing for customized surface thickness settings based on parameters such as surface hardness and brightness, thereby enabling adaptive surface thickness.

16-20