Alert button
Picture for Yixin Nie

Yixin Nie

Alert button

Scene-LLM: Extending Language Model for 3D Visual Understanding and Reasoning

Add code
Bookmark button
Alert button
Mar 22, 2024
Rao Fu, Jingyu Liu, Xilun Chen, Yixin Nie, Wenhan Xiong

Figure 1 for Scene-LLM: Extending Language Model for 3D Visual Understanding and Reasoning
Figure 2 for Scene-LLM: Extending Language Model for 3D Visual Understanding and Reasoning
Figure 3 for Scene-LLM: Extending Language Model for 3D Visual Understanding and Reasoning
Figure 4 for Scene-LLM: Extending Language Model for 3D Visual Understanding and Reasoning
Viaarxiv icon

The Role of Chain-of-Thought in Complex Vision-Language Reasoning Task

Add code
Bookmark button
Alert button
Nov 15, 2023
Yifan Wu, Pengchuan Zhang, Wenhan Xiong, Barlas Oguz, James C. Gee, Yixin Nie

Viaarxiv icon

Jointly Training Large Autoregressive Multimodal Models

Add code
Bookmark button
Alert button
Sep 28, 2023
Emanuele Aiello, Lili Yu, Yixin Nie, Armen Aghajanyan, Barlas Oguz

Figure 1 for Jointly Training Large Autoregressive Multimodal Models
Figure 2 for Jointly Training Large Autoregressive Multimodal Models
Figure 3 for Jointly Training Large Autoregressive Multimodal Models
Figure 4 for Jointly Training Large Autoregressive Multimodal Models
Viaarxiv icon

Llama 2: Open Foundation and Fine-Tuned Chat Models

Add code
Bookmark button
Alert button
Jul 19, 2023
Hugo Touvron, Louis Martin, Kevin Stone, Peter Albert, Amjad Almahairi, Yasmine Babaei, Nikolay Bashlykov, Soumya Batra, Prajjwal Bhargava, Shruti Bhosale, Dan Bikel, Lukas Blecher, Cristian Canton Ferrer, Moya Chen, Guillem Cucurull, David Esiobu, Jude Fernandes, Jeremy Fu, Wenyin Fu, Brian Fuller, Cynthia Gao, Vedanuj Goswami, Naman Goyal, Anthony Hartshorn, Saghar Hosseini, Rui Hou, Hakan Inan, Marcin Kardas, Viktor Kerkez, Madian Khabsa, Isabel Kloumann, Artem Korenev, Punit Singh Koura, Marie-Anne Lachaux, Thibaut Lavril, Jenya Lee, Diana Liskovich, Yinghai Lu, Yuning Mao, Xavier Martinet, Todor Mihaylov, Pushkar Mishra, Igor Molybog, Yixin Nie, Andrew Poulton, Jeremy Reizenstein, Rashi Rungta, Kalyan Saladi, Alan Schelten, Ruan Silva, Eric Michael Smith, Ranjan Subramanian, Xiaoqing Ellen Tan, Binh Tang, Ross Taylor, Adina Williams, Jian Xiang Kuan, Puxin Xu, Zheng Yan, Iliyan Zarov, Yuchen Zhang, Angela Fan, Melanie Kambadur, Sharan Narang, Aurelien Rodriguez, Robert Stojnic, Sergey Edunov, Thomas Scialom

Figure 1 for Llama 2: Open Foundation and Fine-Tuned Chat Models
Figure 2 for Llama 2: Open Foundation and Fine-Tuned Chat Models
Figure 3 for Llama 2: Open Foundation and Fine-Tuned Chat Models
Figure 4 for Llama 2: Open Foundation and Fine-Tuned Chat Models
Viaarxiv icon

Text-guided 3D Human Generation from 2D Collections

Add code
Bookmark button
Alert button
May 23, 2023
Tsu-Jui Fu, Wenhan Xiong, Yixin Nie, Jingyu Liu, Barlas Oğuz, William Yang Wang

Figure 1 for Text-guided 3D Human Generation from 2D Collections
Figure 2 for Text-guided 3D Human Generation from 2D Collections
Figure 3 for Text-guided 3D Human Generation from 2D Collections
Figure 4 for Text-guided 3D Human Generation from 2D Collections
Viaarxiv icon

TTIDA: Controllable Generative Data Augmentation via Text-to-Text and Text-to-Image Models

Add code
Bookmark button
Alert button
Apr 18, 2023
Yuwei Yin, Jean Kaddour, Xiang Zhang, Yixin Nie, Zhenguang Liu, Lingpeng Kong, Qi Liu

Figure 1 for TTIDA: Controllable Generative Data Augmentation via Text-to-Text and Text-to-Image Models
Figure 2 for TTIDA: Controllable Generative Data Augmentation via Text-to-Text and Text-to-Image Models
Figure 3 for TTIDA: Controllable Generative Data Augmentation via Text-to-Text and Text-to-Image Models
Figure 4 for TTIDA: Controllable Generative Data Augmentation via Text-to-Text and Text-to-Image Models
Viaarxiv icon

3DGen: Triplane Latent Diffusion for Textured Mesh Generation

Add code
Bookmark button
Alert button
Mar 09, 2023
Anchit Gupta, Wenhan Xiong, Yixin Nie, Ian Jones, Barlas Oğuz

Figure 1 for 3DGen: Triplane Latent Diffusion for Textured Mesh Generation
Figure 2 for 3DGen: Triplane Latent Diffusion for Textured Mesh Generation
Figure 3 for 3DGen: Triplane Latent Diffusion for Textured Mesh Generation
Figure 4 for 3DGen: Triplane Latent Diffusion for Textured Mesh Generation
Viaarxiv icon

CLIP-Layout: Style-Consistent Indoor Scene Synthesis with Semantic Furniture Embedding

Add code
Bookmark button
Alert button
Mar 07, 2023
Jingyu Liu, Wenhan Xiong, Ian Jones, Yixin Nie, Anchit Gupta, Barlas Oğuz

Figure 1 for CLIP-Layout: Style-Consistent Indoor Scene Synthesis with Semantic Furniture Embedding
Figure 2 for CLIP-Layout: Style-Consistent Indoor Scene Synthesis with Semantic Furniture Embedding
Figure 3 for CLIP-Layout: Style-Consistent Indoor Scene Synthesis with Semantic Furniture Embedding
Figure 4 for CLIP-Layout: Style-Consistent Indoor Scene Synthesis with Semantic Furniture Embedding
Viaarxiv icon

TVLT: Textless Vision-Language Transformer

Add code
Bookmark button
Alert button
Sep 28, 2022
Zineng Tang, Jaemin Cho, Yixin Nie, Mohit Bansal

Figure 1 for TVLT: Textless Vision-Language Transformer
Figure 2 for TVLT: Textless Vision-Language Transformer
Figure 3 for TVLT: Textless Vision-Language Transformer
Figure 4 for TVLT: Textless Vision-Language Transformer
Viaarxiv icon