Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Cheng Li

Paul C. Lauterbur Research Center for Biomedical Imaging, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China

Generalizable Learning Reconstruction for Accelerating MR Imaging via Federated Neural Architecture Search

Aug 27, 2023

Ruoyou Wu, Cheng Li, Juan Zou, Shanshan Wang

Figure 1 for Generalizable Learning Reconstruction for Accelerating MR Imaging via Federated Neural Architecture Search

Figure 2 for Generalizable Learning Reconstruction for Accelerating MR Imaging via Federated Neural Architecture Search

Figure 3 for Generalizable Learning Reconstruction for Accelerating MR Imaging via Federated Neural Architecture Search

Figure 4 for Generalizable Learning Reconstruction for Accelerating MR Imaging via Federated Neural Architecture Search

Abstract:Heterogeneous data captured by different scanning devices and imaging protocols can affect the generalization performance of the deep learning magnetic resonance (MR) reconstruction model. While a centralized training model is effective in mitigating this problem, it raises concerns about privacy protection. Federated learning is a distributed training paradigm that can utilize multi-institutional data for collaborative training without sharing data. However, existing federated learning MR image reconstruction methods rely on models designed manually by experts, which are complex and computational expensive, suffering from performance degradation when facing heterogeneous data distributions. In addition, these methods give inadequate consideration to fairness issues, namely, ensuring that the model's training does not introduce bias towards any specific dataset's distribution. To this end, this paper proposes a generalizable federated neural architecture search framework for accelerating MR imaging (GAutoMRI). Specifically, automatic neural architecture search is investigated for effective and efficient neural network representation learning of MR images from different centers. Furthermore, we design a fairness adjustment approach that can enable the model to learn features fairly from inconsistent distributions of different devices and centers, and thus enforce the model generalize to the unseen center. Extensive experiments show that our proposed GAutoMRI has better performances and generalization ability compared with six state-of-the-art federated learning methods. Moreover, the GAutoMRI model is significantly more lightweight, making it an efficient choice for MR image reconstruction tasks. The code will be made available at https://github.com/ternencewu123/GAutoMRI.

* 10 pages

Via

Access Paper or Ask Questions

Illumination strategies for space-bandwidth-time product improvement in Fourier ptychography

Aug 26, 2023

Haibo Xu, Cheng Li, Mingzhe Wei, Ziwen Zhou, Longqian Huang

Abstract:Fourier ptychography (FP) is a promising technique for high-throughput imaging. Reconstruction algorithms and illumination paradigm are two key aspects of FP. In this review, we mainly focus on illumination strategies in FP. We derive the space-bandwidth-time product (SBP-T) for the characterization of FP performance. Based on the analysis of SBP-T, we categorize the illumination strategy in FP effectively and discuss each category

Via

Access Paper or Ask Questions

Self-distillation Regularized Connectionist Temporal Classification Loss for Text Recognition: A Simple Yet Effective Approach

Aug 21, 2023

Ziyin Zhang, Ning Lu, Minghui Liao, Yongshuai Huang, Cheng Li, Min Wang, Wei Peng

Abstract:Text recognition methods are gaining rapid development. Some advanced techniques, e.g., powerful modules, language models, and un- and semi-supervised learning schemes, consecutively push the performance on public benchmarks forward. However, the problem of how to better optimize a text recognition model from the perspective of loss functions is largely overlooked. CTC-based methods, widely used in practice due to their good balance between performance and inference speed, still grapple with accuracy degradation. This is because CTC loss emphasizes the optimization of the entire sequence target while neglecting to learn individual characters. We propose a self-distillation scheme for CTC-based model to address this issue. It incorporates a framewise regularization term in CTC loss to emphasize individual supervision, and leverages the maximizing-a-posteriori of latent alignment to solve the inconsistency problem that arises in distillation between CTC-based models. We refer to the regularized CTC loss as Distillation Connectionist Temporal Classification (DCTC) loss. DCTC loss is module-free, requiring no extra parameters, longer inference lag, or additional training data or phases. Extensive experiments on public benchmarks demonstrate that DCTC can boost text recognition model accuracy by up to 2.6%, without any of these drawbacks.

* Ziyin Zhang and Ning Lu are co-first authors

Via

Access Paper or Ask Questions

ChatHaruhi: Reviving Anime Character in Reality via Large Language Model

Aug 18, 2023

Cheng Li, Ziang Leng, Chenxi Yan, Junyi Shen, Hao Wang, Weishi MI, Yaying Fei, Xiaoyang Feng, Song Yan, HaoSheng Wang(+4 more)

Figure 1 for ChatHaruhi: Reviving Anime Character in Reality via Large Language Model

Figure 2 for ChatHaruhi: Reviving Anime Character in Reality via Large Language Model

Figure 3 for ChatHaruhi: Reviving Anime Character in Reality via Large Language Model

Figure 4 for ChatHaruhi: Reviving Anime Character in Reality via Large Language Model

Abstract:Role-playing chatbots built on large language models have drawn interest, but better techniques are needed to enable mimicking specific fictional characters. We propose an algorithm that controls language models via an improved prompt and memories of the character extracted from scripts. We construct ChatHaruhi, a dataset covering 32 Chinese / English TV / anime characters with over 54k simulated dialogues. Both automatic and human evaluations show our approach improves role-playing ability over baselines. Code and data are available at https://github.com/LC1332/Chat-Haruhi-Suzumiya .

* v1 - First version of techique report

Via

Access Paper or Ask Questions

Teach LLMs to Personalize -- An Approach inspired by Writing Education

Aug 15, 2023

Cheng Li, Mingyang Zhang, Qiaozhu Mei, Yaqing Wang, Spurthi Amba Hombaiah, Yi Liang, Michael Bendersky

Figure 1 for Teach LLMs to Personalize -- An Approach inspired by Writing Education

Figure 2 for Teach LLMs to Personalize -- An Approach inspired by Writing Education

Figure 3 for Teach LLMs to Personalize -- An Approach inspired by Writing Education

Figure 4 for Teach LLMs to Personalize -- An Approach inspired by Writing Education

Abstract:Personalized text generation is an emerging research area that has attracted much attention in recent years. Most studies in this direction focus on a particular domain by designing bespoke features or models. In this work, we propose a general approach for personalized text generation using large language models (LLMs). Inspired by the practice of writing education, we develop a multistage and multitask framework to teach LLMs for personalized generation. In writing instruction, the task of writing from sources is often decomposed into multiple steps that involve finding, evaluating, summarizing, synthesizing, and integrating information. Analogously, our approach to personalized text generation consists of multiple stages: retrieval, ranking, summarization, synthesis, and generation. In addition, we introduce a multitask setting that helps the model improve its generation ability further, which is inspired by the observation in education that a student's reading proficiency and writing ability are often correlated. We evaluate our approach on three public datasets, each of which covers a different and representative domain. Our results show significant improvements over a variety of baselines.

Via

Access Paper or Ask Questions

EmotionPrompt: Leveraging Psychology for Large Language Models Enhancement via Emotional Stimulus

Aug 01, 2023

Cheng Li, Jindong Wang, Kaijie Zhu, Yixuan Zhang, Wenxin Hou, Jianxun Lian, Xing Xie

Figure 1 for EmotionPrompt: Leveraging Psychology for Large Language Models Enhancement via Emotional Stimulus

Figure 2 for EmotionPrompt: Leveraging Psychology for Large Language Models Enhancement via Emotional Stimulus

Figure 3 for EmotionPrompt: Leveraging Psychology for Large Language Models Enhancement via Emotional Stimulus

Figure 4 for EmotionPrompt: Leveraging Psychology for Large Language Models Enhancement via Emotional Stimulus

Abstract:Large language models (LLMs) have achieved significant performance in many fields such as reasoning, language understanding, and math problem-solving, and are regarded as a crucial step to artificial general intelligence (AGI). However, the sensitivity of LLMs to prompts remains a major bottleneck for their daily adoption. In this paper, we take inspiration from psychology and propose EmotionPrompt to explore emotional intelligence to enhance the performance of LLMs. EmotionPrompt operates on a remarkably straightforward principle: the incorporation of emotional stimulus into prompts. Experimental results demonstrate that our EmotionPrompt, using the same single prompt templates, significantly outperforms original zero-shot prompt and Zero-shot-CoT on 8 tasks with diverse models: ChatGPT, Vicuna-13b, Bloom, and T5. Further, EmotionPrompt was observed to improve both truthfulness and informativeness. We believe that EmotionPrompt heralds a novel avenue for exploring interdisciplinary knowledge for humans-LLMs interaction.

* Work in progress; 9 pages

Via

Access Paper or Ask Questions

FedAutoMRI: Federated Neural Architecture Search for MR Image Reconstruction

Jul 21, 2023

Ruoyou Wu, Cheng Li, Juan Zou, Shanshan Wang

Figure 1 for FedAutoMRI: Federated Neural Architecture Search for MR Image Reconstruction

Figure 2 for FedAutoMRI: Federated Neural Architecture Search for MR Image Reconstruction

Figure 3 for FedAutoMRI: Federated Neural Architecture Search for MR Image Reconstruction

Figure 4 for FedAutoMRI: Federated Neural Architecture Search for MR Image Reconstruction

Abstract:Centralized training methods have shown promising results in MR image reconstruction, but privacy concerns arise when gathering data from multiple institutions. Federated learning, a distributed collaborative training scheme, can utilize multi-center data without the need to transfer data between institutions. However, existing federated learning MR image reconstruction methods rely on manually designed models which have extensive parameters and suffer from performance degradation when facing heterogeneous data distributions. To this end, this paper proposes a novel FederAted neUral archiTecture search approach fOr MR Image reconstruction (FedAutoMRI). The proposed method utilizes differentiable architecture search to automatically find the optimal network architecture. In addition, an exponential moving average method is introduced to improve the robustness of the client model to address the data heterogeneity issue. To the best of our knowledge, this is the first work to use federated neural architecture search for MR image reconstruction. Experimental results demonstrate that our proposed FedAutoMRI can achieve promising performances while utilizing a lightweight model with only a small number of model parameters compared to the classical federated learning methods.

* 10 pages

Via

Access Paper or Ask Questions

RestGPT: Connecting Large Language Models with Real-World Applications via RESTful APIs

Jun 11, 2023

Yifan Song, Weimin Xiong, Dawei Zhu, Cheng Li, Ke Wang, Ye Tian, Sujian Li

Figure 1 for RestGPT: Connecting Large Language Models with Real-World Applications via RESTful APIs

Figure 2 for RestGPT: Connecting Large Language Models with Real-World Applications via RESTful APIs

Figure 3 for RestGPT: Connecting Large Language Models with Real-World Applications via RESTful APIs

Figure 4 for RestGPT: Connecting Large Language Models with Real-World Applications via RESTful APIs

Abstract:Tool-augmented large language models (LLMs) have achieved remarkable progress in tackling a broad range of queries. However, existing work are still in the experimental stage and has limitations in extensibility and robustness, especially facing the real-world applications. In this paper, we consider a more realistic scenario, connecting LLMs with RESTful APIs, which use the commonly adopted REST software architectural style for web service development. To address the practical challenges of planning and API usage, we introduce RestGPT, which leverages LLMs to solve user requests by connecting with RESTful APIs. Specifically, we propose a coarse-to-fine online planning mechanism to enhance the ability of planning and API selection. For the complex scenario of calling RESTful APIs, we also specially designed an API executor to formulate parameters and parse API responses. Experiments show that RestGPT is able to achieve impressive results in complex tasks and has strong robustness, which paves a new way towards AGI.

* Work in progress

Via

Access Paper or Ask Questions

Machine learning-based characterization of hydrochar from biomass: Implications for sustainable energy and material production

May 24, 2023

Alireza Shafizadeh, Hossein Shahbeik, Shahin Rafiee, Aysooda Moradi, Mohammadreza Shahbaz, Meysam Madadi, Cheng Li, Wanxi Peng, Meisam Tabatabaei, Mortaza Aghbashlo

Figure 1 for Machine learning-based characterization of hydrochar from biomass: Implications for sustainable energy and material production

Figure 2 for Machine learning-based characterization of hydrochar from biomass: Implications for sustainable energy and material production

Figure 3 for Machine learning-based characterization of hydrochar from biomass: Implications for sustainable energy and material production

Figure 4 for Machine learning-based characterization of hydrochar from biomass: Implications for sustainable energy and material production

Abstract:Hydrothermal carbonization (HTC) is a process that converts biomass into versatile hydrochar without the need for prior drying. The physicochemical properties of hydrochar are influenced by biomass properties and processing parameters, making it challenging to optimize for specific applications through trial-and-error experiments. To save time and money, machine learning can be used to develop a model that characterizes hydrochar produced from different biomass sources under varying reaction processing parameters. Thus, this study aims to develop an inclusive model to characterize hydrochar using a database covering a range of biomass types and reaction processing parameters. The quality and quantity of hydrochar are predicted using two models (decision tree regression and support vector regression). The decision tree regression model outperforms the support vector regression model in terms of forecast accuracy (R2 > 0.88, RMSE < 6.848, and MAE < 4.718). Using an evolutionary algorithm, optimum inputs are identified based on cost functions provided by the selected model to optimize hydrochar for energy production, soil amendment, and pollutant adsorption, resulting in hydrochar yields of 84.31%, 84.91%, and 80.40%, respectively. The feature importance analysis reveals that biomass ash/carbon content and operating temperature are the primary factors affecting hydrochar production in the HTC process.

* Fuel 347, 1 September 2023, 128467

Via

Access Paper or Ask Questions

Self-Supervised Federated Learning for Fast MR Imaging

May 10, 2023

Juan Zou, Cheng Li, Ruoyou Wu, Tingrui Pei, Hairong Zheng, Shanshan Wang

Figure 1 for Self-Supervised Federated Learning for Fast MR Imaging

Figure 2 for Self-Supervised Federated Learning for Fast MR Imaging

Figure 3 for Self-Supervised Federated Learning for Fast MR Imaging

Figure 4 for Self-Supervised Federated Learning for Fast MR Imaging

Abstract:Federated learning (FL) based magnetic resonance (MR) image reconstruction can facilitate learning valuable priors from multi-site institutions without violating patient's privacy for accelerating MR imaging. However, existing methods rely on fully sampled data for collaborative training of the model. The client that only possesses undersampled data can neither participate in FL nor benefit from other clients. Furthermore, heterogeneous data distributions hinder FL from training an effective deep learning reconstruction model and thus cause performance degradation. To address these issues, we propose a Self-Supervised Federated Learning method (SSFedMRI). SSFedMRI explores the physics-based contrastive reconstruction networks in each client to realize cross-site collaborative training in the absence of fully sampled data. Furthermore, a personalized soft update scheme is designed to simultaneously capture the global shared representations among different centers and maintain the specific data distribution of each client. The proposed method is evaluated on four datasets and compared to the latest state-of-the-art approaches. Experimental results demonstrate that SSFedMRI possesses strong capability in reconstructing accurate MR images both visually and quantitatively on both in-distribution and out-of-distribution datasets.

* 10 pages,4 figures

Via

Access Paper or Ask Questions