Research Blog


Training LLMs Towards Holistic Learning
Github, June, 2023.
Training Language Models From Fragmentation Learning To Holistic Learning.

Publications


A Multimodal In-Context Tuning Approach for E-Commerce Product Description Generation
LREC-COLING, 2024.
Towards Vision Enhancing LLMs: Empowering Multimodal Knowledge Storage and Sharing in LLMs
arXive, 2023.
A Comprehensive Evaluation of GPT-4V on Knowledge-Intensive Visual Question Answering
arXive, 2023.
LMEye: An Interactive Perception Network for Large Language Models
arXive, 2023.
Training Multimedia Event Extraction With Generated Images and Captions
ACM on Multimedia (ACM MM), 2023.
A Neural Divide-and-Conquer Reasoning Framework for Image Retrieval from Linguistically Complex Text
ACL 2023 Main Conference.
A Multi-Modal Context Reasoning Approach for Conditional Inference on Joint Textual and Visual Clues
ACL 2023 Main Conference.
Chunk-aware Alignment and Lexical Constraint for Visual Entailment with Natural Language Explanations
ACM on Multimedia (ACM MM), 2022.
Medical Dialogue Response Generation with Pivotal Information Recalling
SIGKDD, 2022.
Fast and Robust Online Handwritten Chinese Character Recognition with Deep Spatial & Contextual Information Fusion Network
IEEE Transactions on Multimedia (TMM), 2022.

Service

I serve as the reviewer for ARR (2023-), ACM MM (2023-), IEEE TMM (2023-), and Neural Networks (2024-).
Assist with editorial board work of Neural Networks.