Search Results for author: Xiao Xiao

Found 10 papers, 5 papers with code

Hunyuan-DiT: A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding

1 code implementation • 14 May 2024 • Zhimin Li, Jianwei Zhang, Qin Lin, Jiangfeng Xiong, Yanxin Long, Xinchi Deng, Yingfang Zhang, Xingchao Liu, Minbin Huang, Zedong Xiao, Dayou Chen, Jiajun He, Jiahao Li, Wenyue Li, Chen Zhang, Rongwei Quan, Jianxiang Lu, Jiabin Huang, Xiaoyan Yuan, Xiaoxiao Zheng, Yixuan Li, Jihong Zhang, Chao Zhang, Meng Chen, Jie Liu, Zheng Fang, Weiyan Wang, Jinbao Xue, Yangyu Tao, Jianchen Zhu, Kai Liu, Sihuan Lin, Yifu Sun, Yun Li, Dongdong Wang, Mingtao Chen, Zhichao Hu, Xiao Xiao, Yan Chen, Yuhong Liu, Wei Liu, Di Wang, Yong Yang, Jie Jiang, Qinglin Lu

For fine-grained language understanding, we train a Multimodal Large Language Model to refine the captions of the images.

Image Generation Language Modelling +2

1,565

Paper
Code

Android in the Zoo: Chain-of-Action-Thought for GUI Agents

1 code implementation • 5 Mar 2024 • Jiwen Zhang, Jihao Wu, Yihua Teng, Minghui Liao, Nuo Xu, Xiao Xiao, Zhongyu Wei, Duyu Tang

To address this, this work presents Chain-of-Action-Thought (dubbed CoAT), which takes the description of the previous actions, the current screen, and more importantly the action thinking of what actions should be performed and the outcomes led by the chosen action.

Language Modelling Large Language Model

Paper
Code

Mitigating Pooling Bias in E-commerce Search via False Negative Estimation

no code implementations • 11 Nov 2023 • Xiaochen Wang, Xiao Xiao, Ruhan Zhang, Xuan Zhang, Taesik Na, Tejaswi Tenneti, Haixun Wang, Fenglong Ma

Efficient and accurate product relevance assessment is critical for user experiences and business success.

Paper
Add Code

BPNet: Bézier Primitive Segmentation on 3D Point Clouds

1 code implementation • 8 Jul 2023 • Rao Fu, Cheng Wen, Qian Li, Xiao Xiao, Pierre Alliez

This paper proposes BPNet, a novel end-to-end deep learning framework to learn B\'ezier primitive segmentation on 3D point clouds.

Point Cloud Segmentation Segmentation

Paper
Code

Dynamic Graph Neural Network with Adaptive Edge Attributes for Air Quality Predictions

no code implementations • 20 Feb 2023 • Jing Xu, Shuo Wang, Na Ying, Xiao Xiao, Jiang Zhang, Yun Cheng, Zhiling Jin, Gangfeng Zhang

Previous GCNs-based methods usually require providing spatial correlation graph structure of observation sites in advance.

Decision Making Time Series +1

Paper
Add Code

An Embedding-Based Grocery Search Model at Instacart

no code implementations • 12 Sep 2022 • Yuqing Xie, Taesik Na, Xiao Xiao, Saurav Manchanda, Young Rao, Zhihong Xu, Guanghua Shu, Esther Vasiete, Tejaswi Tenneti, Haixun Wang

To train the model efficiently on noisy data, we propose a self-adversarial learning method and a cascade training method.

Paper
Add Code

Long-term Spatio-temporal Forecasting via Dynamic Multiple-Graph Attention

1 code implementation • 23 Apr 2022 • Wei Shao, Zhiling Jin, Shuo Wang, Yufan Kang, Xiao Xiao, Hamid Menouar, Zhaofeng Zhang, Junshan Zhang, Flora Salim

To address these issues, we construct new graph models to represent the contextual information of each node and the long-term spatio-temporal data dependency structure.

Graph Attention Spatio-Temporal Forecasting

Paper
Code

Computational modelling and data-driven homogenisation of knitted membranes

no code implementations • 12 Jul 2021 • Sumudu Herath, Xiao Xiao, Fehmi Cirak

The trained GPR model encodes the nonlinearities and anisotropies present in the microscale and serves as a material model for the membrane response of the macroscale shell.

GPR

Paper
Add Code

Unknown-box Approximation to Improve Optical Character Recognition Performance

1 code implementation • 17 May 2021 • Ayantha Randika, Nilanjan Ray, Xiao Xiao, Allegra Latimer

Unlike the previous OCR agnostic preprocessing techniques, the proposed approach approximates the gradient of a particular OCR engine to train a preprocessor module.

Optical Character Recognition Optical Character Recognition (OCR)

Paper
Code

MSDU-net: A Multi-Scale Dilated U-net for Blur Detection

no code implementations • 5 Jun 2020 • Fan Yang, Xiao Xiao

Blur detection is the separation of blurred and clear regions of an image, which is an important and challenging task in computer vision.

Image Segmentation Segmentation +1

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.