👀 About Me

Hi, I’m Xiaorui Jiang, a third-year master’s student at University of Science and Technology of China.

My research interests focus on cross-modal information representation and fusion, with emphasis on the following aspects:

Multimodal Information Fusion: Investigating the representation and fusion of multimodal data in unsupervised environments,
with a focus on methods such as FL and MVL, aiming to achieve efficient information fusion without relying on labeled data.
Multimodal Large Language Models: Concentrating on improving the inference performance of MLLMs, particularly focusing
on strategies that do not require additional training, striving to optimize their reasoning ability and generalization.

Currently, my papers under review or in progress include: (a) Unsupervised federated multi-view data representation;
(b) Learning dynamics in multi-view data representation; (c) Research on attention mechanisms in MLLMs and training-free
strategies for performance enhancement; (d) Training-free token compression methods for MLLMs.

If you find any of these topics of interest, I would be truly grateful for your insights.

👇 Please feel free to reach out to me (Last updated: Sept. 20, 2025)

📖 Google Scholar 📧 Email