Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Changtong Luo

Block building programming for symbolic regression

Oct 28, 2017

Chen Chen, Changtong Luo, Zonglin Jiang

Figure 1 for Block building programming for symbolic regression

Figure 2 for Block building programming for symbolic regression

Figure 3 for Block building programming for symbolic regression

Figure 4 for Block building programming for symbolic regression

Abstract:Symbolic regression that aims to detect underlying data-driven models has become increasingly important for industrial data analysis. For most existing algorithms such as genetic programming (GP), the convergence speed might be too slow for large-scale problems with a large number of variables. This situation may become even worse with increasing problem size. The aforementioned difficulty makes symbolic regression limited in practical applications. Fortunately, in many engineering problems, the independent variables in target models are separable or partially separable. This feature inspires us to develop a new approach, block building programming (BBP). BBP divides the original target function into several blocks, and further into factors. The factors are then modeled by an optimization engine (e.g. GP). Under such circumstances, BBP can make large reductions to the search space. The partition of separability is based on a special method, block and factor detection. Two different optimization engines are applied to test the performance of BBP on a set of symbolic regression problems. Numerical results show that BBP has a good capability of structure and coefficient optimization with high computational efficiency.

* Accepted for publication in Neurocomputing

Via

Access Paper or Ask Questions

Fast Modeling Methods for Complex System with Separable Features

Aug 05, 2017

Chen Chen, Changtong Luo, Zonglin Jiang

Figure 1 for Fast Modeling Methods for Complex System with Separable Features

Figure 2 for Fast Modeling Methods for Complex System with Separable Features

Figure 3 for Fast Modeling Methods for Complex System with Separable Features

Abstract:Data-driven modeling plays an increasingly important role in different areas of engineering. For most of existing methods, such as genetic programming (GP), the convergence speed might be too slow for large scale problems with a large number of variables. Fortunately, in many applications, the target models are separable in some sense. In this paper, we analyze different types of separability of some real-world engineering equations and establish a mathematical model of generalized separable system (GS system). In order to get the structure of the GS system, two concepts, namely block and factor are introduced, and a special method, block and factor detection is also proposed, in which the target model is decomposed into a number of blocks, further into minimal blocks and factors. Compare to the conventional GP, the new method can make large reductions to the search space. The minimal blocks and factors are optimized and assembled with a global optimization search engine, low dimensional simplex evolution (LDSE). An extensive study between the proposed method and a state-of-the-art data-driven fitting tool, Eureqa, has been presented with several man-made problems. Test results indicate that the proposed method is more effective and efficient under all the investigated cases.

* arXiv admin note: substantial text overlap with arXiv:1706.02281

Via

Access Paper or Ask Questions

A divide and conquer method for symbolic regression

Jun 27, 2017

Changtong Luo, Chen Chen, Zonglin Jiang

Figure 1 for A divide and conquer method for symbolic regression

Figure 2 for A divide and conquer method for symbolic regression

Figure 3 for A divide and conquer method for symbolic regression

Figure 4 for A divide and conquer method for symbolic regression

Abstract:Symbolic regression aims to find a function that best explains the relationship between independent variables and the objective value based on a given set of sample data. Genetic programming (GP) is usually considered as an appropriate method for the problem since it can optimize functional structure and coefficients simultaneously. However, the convergence speed of GP might be too slow for large scale problems that involve a large number of variables. Fortunately, in many applications, the target function is separable or partially separable. This feature motivated us to develop a new method, divide and conquer (D&C), for symbolic regression, in which the target function is divided into a number of sub-functions and the sub-functions are then determined by any of a GP algorithm. The separability is probed by a new proposed technique, Bi-Correlation test (BiCT). D&C powered GP has been tested on some real-world applications, and the study shows that D&C can help GP to get the target function much more rapidly.

Via

Access Paper or Ask Questions

Elite Bases Regression: A Real-time Algorithm for Symbolic Regression

May 15, 2017

Chen Chen, Changtong Luo, Zonglin Jiang

Figure 1 for Elite Bases Regression: A Real-time Algorithm for Symbolic Regression

Figure 2 for Elite Bases Regression: A Real-time Algorithm for Symbolic Regression

Figure 3 for Elite Bases Regression: A Real-time Algorithm for Symbolic Regression

Figure 4 for Elite Bases Regression: A Real-time Algorithm for Symbolic Regression

Abstract:Symbolic regression is an important but challenging research topic in data mining. It can detect the underlying mathematical models. Genetic programming (GP) is one of the most popular methods for symbolic regression. However, its convergence speed might be too slow for large scale problems with a large number of variables. This drawback has become a bottleneck in practical applications. In this paper, a new non-evolutionary real-time algorithm for symbolic regression, Elite Bases Regression (EBR), is proposed. EBR generates a set of candidate basis functions coded with parse-matrix in specific mapping rules. Meanwhile, a certain number of elite bases are preserved and updated iteratively according to the correlation coefficients with respect to the target model. The regression model is then spanned by the elite bases. A comparative study between EBR and a recent proposed machine learning method for symbolic regression, Fast Function eXtraction (FFX), are conducted. Numerical results indicate that EBR can solve symbolic regression problems more effectively.

* The 2017 13th International Conference on Natural Computation, Fuzzy Systems and Knowledge Discovery (ICNC-FSKD 2017)

Via

Access Paper or Ask Questions