Alert button

"Text": models, code, and papers
Alert button

LogicMP: A Neuro-symbolic Approach for Encoding First-order Logic Constraints

Sep 29, 2023
Weidi Xu, Jingwei Wang, Lele Xie, Jianshan He, Hongting Zhou, Taifeng Wang, Xiaopei Wan, Jingdong Chen, Chao Qu, Wei Chu

Figure 1 for LogicMP: A Neuro-symbolic Approach for Encoding First-order Logic Constraints
Figure 2 for LogicMP: A Neuro-symbolic Approach for Encoding First-order Logic Constraints
Figure 3 for LogicMP: A Neuro-symbolic Approach for Encoding First-order Logic Constraints
Figure 4 for LogicMP: A Neuro-symbolic Approach for Encoding First-order Logic Constraints
Viaarxiv icon

A Large-scale Dataset for Audio-Language Representation Learning

Sep 20, 2023
Luoyi Sun, Xuenan Xu, Mengyue Wu, Weidi Xie

Figure 1 for A Large-scale Dataset for Audio-Language Representation Learning
Figure 2 for A Large-scale Dataset for Audio-Language Representation Learning
Figure 3 for A Large-scale Dataset for Audio-Language Representation Learning
Figure 4 for A Large-scale Dataset for Audio-Language Representation Learning
Viaarxiv icon

Language-driven Object Fusion into Neural Radiance Fields with Pose-Conditioned Dataset Updates

Sep 25, 2023
Ka Chun Shum, Jaeyeon Kim, Binh-Son Hua, Duc Thanh Nguyen, Sai-Kit Yeung

Figure 1 for Language-driven Object Fusion into Neural Radiance Fields with Pose-Conditioned Dataset Updates
Figure 2 for Language-driven Object Fusion into Neural Radiance Fields with Pose-Conditioned Dataset Updates
Figure 3 for Language-driven Object Fusion into Neural Radiance Fields with Pose-Conditioned Dataset Updates
Figure 4 for Language-driven Object Fusion into Neural Radiance Fields with Pose-Conditioned Dataset Updates
Viaarxiv icon

HiREN: Towards Higher Supervision Quality for Better Scene Text Image Super-Resolution

Jul 31, 2023
Minyi Zhao, Yi Xu, Bingjia Li, Jie Wang, Jihong Guan, Shuigeng Zhou

Figure 1 for HiREN: Towards Higher Supervision Quality for Better Scene Text Image Super-Resolution
Figure 2 for HiREN: Towards Higher Supervision Quality for Better Scene Text Image Super-Resolution
Figure 3 for HiREN: Towards Higher Supervision Quality for Better Scene Text Image Super-Resolution
Figure 4 for HiREN: Towards Higher Supervision Quality for Better Scene Text Image Super-Resolution
Viaarxiv icon

RedPenNet for Grammatical Error Correction: Outputs to Tokens, Attentions to Spans

Sep 19, 2023
Bohdan Didenko, Andrii Sameliuk

Viaarxiv icon

Does Video Summarization Require Videos? Quantifying the Effectiveness of Language in Video Summarization

Sep 18, 2023
Yoonsoo Nam, Adam Lehavi, Daniel Yang, Digbalay Bose, Swabha Swayamdipta, Shrikanth Narayanan

Figure 1 for Does Video Summarization Require Videos? Quantifying the Effectiveness of Language in Video Summarization
Figure 2 for Does Video Summarization Require Videos? Quantifying the Effectiveness of Language in Video Summarization
Figure 3 for Does Video Summarization Require Videos? Quantifying the Effectiveness of Language in Video Summarization
Viaarxiv icon

LayoutLLM-T2I: Eliciting Layout Guidance from LLM for Text-to-Image Generation

Aug 12, 2023
Leigang Qu, Shengqiong Wu, Hao Fei, Liqiang Nie, Tat-Seng Chua

Figure 1 for LayoutLLM-T2I: Eliciting Layout Guidance from LLM for Text-to-Image Generation
Figure 2 for LayoutLLM-T2I: Eliciting Layout Guidance from LLM for Text-to-Image Generation
Figure 3 for LayoutLLM-T2I: Eliciting Layout Guidance from LLM for Text-to-Image Generation
Figure 4 for LayoutLLM-T2I: Eliciting Layout Guidance from LLM for Text-to-Image Generation
Viaarxiv icon

MosaicFusion: Diffusion Models as Data Augmenters for Large Vocabulary Instance Segmentation

Sep 22, 2023
Jiahao Xie, Wei Li, Xiangtai Li, Ziwei Liu, Yew Soon Ong, Chen Change Loy

Figure 1 for MosaicFusion: Diffusion Models as Data Augmenters for Large Vocabulary Instance Segmentation
Figure 2 for MosaicFusion: Diffusion Models as Data Augmenters for Large Vocabulary Instance Segmentation
Figure 3 for MosaicFusion: Diffusion Models as Data Augmenters for Large Vocabulary Instance Segmentation
Figure 4 for MosaicFusion: Diffusion Models as Data Augmenters for Large Vocabulary Instance Segmentation
Viaarxiv icon

Lexical Squad@Multimodal Hate Speech Event Detection 2023: Multimodal Hate Speech Detection using Fused Ensemble Approach

Sep 23, 2023
Mohammad Kashif, Mohammad Zohair, Saquib Ali

Figure 1 for Lexical Squad@Multimodal Hate Speech Event Detection 2023: Multimodal Hate Speech Detection using Fused Ensemble Approach
Figure 2 for Lexical Squad@Multimodal Hate Speech Event Detection 2023: Multimodal Hate Speech Detection using Fused Ensemble Approach
Figure 3 for Lexical Squad@Multimodal Hate Speech Event Detection 2023: Multimodal Hate Speech Detection using Fused Ensemble Approach
Figure 4 for Lexical Squad@Multimodal Hate Speech Event Detection 2023: Multimodal Hate Speech Detection using Fused Ensemble Approach
Viaarxiv icon

BoxDiff: Text-to-Image Synthesis with Training-Free Box-Constrained Diffusion

Aug 10, 2023
Jinheng Xie, Yuexiang Li, Yawen Huang, Haozhe Liu, Wentian Zhang, Yefeng Zheng, Mike Zheng Shou

Figure 1 for BoxDiff: Text-to-Image Synthesis with Training-Free Box-Constrained Diffusion
Figure 2 for BoxDiff: Text-to-Image Synthesis with Training-Free Box-Constrained Diffusion
Figure 3 for BoxDiff: Text-to-Image Synthesis with Training-Free Box-Constrained Diffusion
Figure 4 for BoxDiff: Text-to-Image Synthesis with Training-Free Box-Constrained Diffusion
Viaarxiv icon