
Dahua Lin

Eric

Utilize the Flow before Stepping into the Same River Twice: Certainty Represented Knowledge Flow for Refusal-Aware Instruction Tuning

Oct 09, 2024

BroadWay: Boost Your Text-to-Video Generation Model in a Training-free Way

Oct 08, 2024

MinerU: An Open-Source Solution for Precise Document Content Extraction

Sep 27, 2024

Omni6D: Large-Vocabulary 3D Object Dataset for Category-Level 6D Object Pose Estimation

Sep 26, 2024

Scaling Behavior for Large Language Models regarding Numeral Systems: An Example using Pythia

Sep 25, 2024

What are the Essential Factors in Crafting Effective Long Context Multi-Hop Instruction Datasets? Insights and Best Practices

Sep 03, 2024

UrBench: A Comprehensive Benchmark for Evaluating Large Multimodal Models in Multi-View Urban Scenarios

Aug 30, 2024

CrossViewDiff: A Cross-View Diffusion Model for Satellite-to-Street View Synthesis

Aug 27, 2024

LayerPano3D: Layered 3D Panorama for Hyper-Immersive Scene Generation

Aug 23, 2024

HumanVid: Demystifying Training Data for Camera-controllable Human Image Animation

Jul 28, 2024