Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Chao-Lin Liu

Matrix and Graph Operations for Relationship Inference: An Illustration with the Kinship Inference in the China Biographical Database

Sep 09, 2017

Chao-Lin Liu, Hongsu Wang

Figure 1 for Matrix and Graph Operations for Relationship Inference: An Illustration with the Kinship Inference in the China Biographical Database

Abstract:Biographical databases contain diverse information about individuals. Person names, birth information, career, friends, family and special achievements are some possible items in the record for an individual. The relationships between individuals, such as kinship and friendship, provide invaluable insights about hidden communities which are not directly recorded in databases. We show that some simple matrix and graph-based operations are effective for inferring relationships among individuals, and illustrate the main ideas with the China Biographical Database (CBDB).

* 3 pages, 3 figures, 2017 Annual Meeting of the Japanese Association for Digital Humanities

Via

Access Paper or Ask Questions

Quantitative Analyses of Chinese Poetry of Tang and Song Dynasties: Using Changing Colors and Innovative Terms as Examples

Aug 28, 2016

Chao-Lin Liu

Abstract:Tang (618-907 AD) and Song (960-1279) dynasties are two very important periods in the development of Chinese literary. The most influential forms of the poetry in Tang and Song were Shi and Ci, respectively. Tang Shi and Song Ci established crucial foundations of the Chinese literature, and their influences in both literary works and daily lives of the Chinese communities last until today. We can analyze and compare the Complete Tang Shi and the Complete Song Ci from various viewpoints. In this presentation, we report our findings about the differences in their vocabularies. Interesting new words that started to appear in Song Ci and continue to be used in modern Chinese were identified. Colors are an important ingredient of the imagery in poetry, and we discuss the most frequent color words that appeared in Tang Shi and Song Ci.

* 2016 International Conference on Digital Humanities

Via

Access Paper or Ask Questions

Color Aesthetics and Social Networks in Complete Tang Poems: Explorations and Discoveries

Nov 05, 2015

Chao-Lin Liu, Hongsu Wang, Wen-Huei Cheng, Chu-Ting Hsu, Wei-Yun Chiu

Figure 1 for Color Aesthetics and Social Networks in Complete Tang Poems: Explorations and Discoveries

Figure 2 for Color Aesthetics and Social Networks in Complete Tang Poems: Explorations and Discoveries

Figure 3 for Color Aesthetics and Social Networks in Complete Tang Poems: Explorations and Discoveries

Figure 4 for Color Aesthetics and Social Networks in Complete Tang Poems: Explorations and Discoveries

Abstract:The Complete Tang Poems (CTP) is the most important source to study Tang poems. We look into CTP with computational tools from specific linguistic perspectives, including distributional semantics and collocational analysis. From such quantitative viewpoints, we compare the usage of "wind" and "moon" in the poems of Li Bai and Du Fu. Colors in poems function like sounds in movies, and play a crucial role in the imageries of poems. Thus, words for colors are studied, and "white" is the main focus because it is the most frequent color in CTP. We also explore some cases of using colored words in antithesis pairs that were central for fostering the imageries of the poems. CTP also contains useful historical information, and we extract person names in CTP to study the social networks of the Tang poets. Such information can then be integrated with the China Biographical Database of Harvard University.

* 10 pages, 1 figure, 8 tables, The 29th Pacific Asia Conference on Language, Information and Computation (PACLIC 29), The 27th Conference on Computational Linguistics and Speech Analysis (ROCLING XXVII, Chinese version)

Via

Access Paper or Ask Questions

Mining Local Gazetteers of Literary Chinese with CRF and Pattern based Methods for Biographical Information in Chinese History

Nov 04, 2015

Chao-Lin Liu, Chih-Kai Huang, Hongsu Wang, Peter K. Bol

Figure 1 for Mining Local Gazetteers of Literary Chinese with CRF and Pattern based Methods for Biographical Information in Chinese History

Figure 2 for Mining Local Gazetteers of Literary Chinese with CRF and Pattern based Methods for Biographical Information in Chinese History

Figure 3 for Mining Local Gazetteers of Literary Chinese with CRF and Pattern based Methods for Biographical Information in Chinese History

Figure 4 for Mining Local Gazetteers of Literary Chinese with CRF and Pattern based Methods for Biographical Information in Chinese History

Abstract:Person names and location names are essential building blocks for identifying events and social networks in historical documents that were written in literary Chinese. We take the lead to explore the research on algorithmically recognizing named entities in literary Chinese for historical studies with language-model based and conditional-random-field based methods, and extend our work to mining the document structures in historical documents. Practical evaluations were conducted with texts that were extracted from more than 220 volumes of local gazetteers (Difangzhi). Difangzhi is a huge and the single most important collection that contains information about officers who served in local government in Chinese history. Our methods performed very well on these realistic tests. Thousands of names and addresses were identified from the texts. A good portion of the extracted names match the biographical information currently recorded in the China Biographical Database (CBDB) of Harvard University, and many others can be verified by historians and will become as new additions to CBDB.

* 11 pages, 5 figures, 5 tables, the Third Workshop on Big Humanities Data (2015 IEEE BigData), the 29th Pacific Asia Conference on Language, Information and Computation (PACLIC 29)

Via

Access Paper or Ask Questions

Textual Analysis for Studying Chinese Historical Documents and Literary Novels

Oct 11, 2015

Chao-Lin Liu, Guan-Tao Jin, Hongsu Wang, Qing-Feng Liu, Wen-Huei Cheng, Wei-Yun Chiu, Richard Tzong-Han Tsai, Yu-Chun Wang

Figure 1 for Textual Analysis for Studying Chinese Historical Documents and Literary Novels

Figure 2 for Textual Analysis for Studying Chinese Historical Documents and Literary Novels

Figure 3 for Textual Analysis for Studying Chinese Historical Documents and Literary Novels

Figure 4 for Textual Analysis for Studying Chinese Historical Documents and Literary Novels

Abstract:We analyzed historical and literary documents in Chinese to gain insights into research issues, and overview our studies which utilized four different sources of text materials in this paper. We investigated the history of concepts and transliterated words in China with the Database for the Study of Modern China Thought and Literature, which contains historical documents about China between 1830 and 1930. We also attempted to disambiguate names that were shared by multiple government officers who served between 618 and 1912 and were recorded in Chinese local gazetteers. To showcase the potentials and challenges of computer-assisted analysis of Chinese literatures, we explored some interesting yet non-trivial questions about two of the Four Great Classical Novels of China: (1) Which monsters attempted to consume the Buddhist monk Xuanzang in the Journey to the West (JTTW), which was published in the 16th century, (2) Which was the most powerful monster in JTTW, and (3) Which major role smiled the most in the Dream of the Red Chamber, which was published in the 18th century. Similar approaches can be applied to the analysis and study of modern documents, such as the newspaper articles published about the 228 incident that occurred in 1947 in Taiwan.

* 11 pages, 7 figures, 2 tables, The Fourth ASE International Conference on Social Informatics

Via

Access Paper or Ask Questions

Exploring Lexical, Syntactic, and Semantic Features for Chinese Textual Entailment in NTCIR RITE Evaluation Tasks

Apr 08, 2015

Wei-Jie Huang, Chao-Lin Liu

Figure 1 for Exploring Lexical, Syntactic, and Semantic Features for Chinese Textual Entailment in NTCIR RITE Evaluation Tasks

Figure 2 for Exploring Lexical, Syntactic, and Semantic Features for Chinese Textual Entailment in NTCIR RITE Evaluation Tasks

Figure 3 for Exploring Lexical, Syntactic, and Semantic Features for Chinese Textual Entailment in NTCIR RITE Evaluation Tasks

Figure 4 for Exploring Lexical, Syntactic, and Semantic Features for Chinese Textual Entailment in NTCIR RITE Evaluation Tasks

Abstract:We computed linguistic information at the lexical, syntactic, and semantic levels for Recognizing Inference in Text (RITE) tasks for both traditional and simplified Chinese in NTCIR-9 and NTCIR-10. Techniques for syntactic parsing, named-entity recognition, and near synonym recognition were employed, and features like counts of common words, statement lengths, negation words, and antonyms were considered to judge the entailment relationships of two statements, while we explored both heuristics-based functions and machine-learning approaches. The reported systems showed robustness by simultaneously achieving second positions in the binary-classification subtasks for both simplified and traditional Chinese in NTCIR-10 RITE-2. We conducted more experiments with the test data of NTCIR-9 RITE, with good results. We also extended our work to search for better configurations of our classifiers and investigated contributions of individual features. This extended work showed interesting results and should encourage further discussion.

* 20 pages, 1 figure, 26 tables, Journal article in Soft Computing (Spinger). Soft Computing, online. Springer, Germany, 2015

Via

Access Paper or Ask Questions

Mining and discovering biographical information in Difangzhi with a language-model-based approach

Apr 08, 2015

Peter K. Bol, Chao-Lin Liu, Hongsu Wang

Figure 1 for Mining and discovering biographical information in Difangzhi with a language-model-based approach

Abstract:We present results of expanding the contents of the China Biographical Database by text mining historical local gazetteers, difangzhi. The goal of the database is to see how people are connected together, through kinship, social connections, and the places and offices in which they served. The gazetteers are the single most important collection of names and offices covering the Song through Qing periods. Although we begin with local officials we shall eventually include lists of local examination candidates, people from the locality who served in government, and notable local figures with biographies. The more data we collect the more connections emerge. The value of doing systematic text mining work is that we can identify relevant connections that are either directly informative or can become useful without deep historical research. Academia Sinica is developing a name database for officials in the central governments of the Ming and Qing dynasties.

* 6 pages, 4 figures, 1 table, 2015 International Conference on Digital Humanities. in Proceedings of the 2015 International Conference on Digital Humanities (DH 2015). July 2015

Via

Access Paper or Ask Questions

State-space Abstraction for Anytime Evaluation of Probabilistic Networks

Feb 27, 2013

Michael P. Wellman, Chao-Lin Liu

Figure 1 for State-space Abstraction for Anytime Evaluation of Probabilistic Networks

Figure 2 for State-space Abstraction for Anytime Evaluation of Probabilistic Networks

Figure 3 for State-space Abstraction for Anytime Evaluation of Probabilistic Networks

Abstract:One important factor determining the computational complexity of evaluating a probabilistic network is the cardinality of the state spaces of the nodes. By varying the granularity of the state spaces, one can trade off accuracy in the result for computational efficiency. We present an anytime procedure for approximate evaluation of probabilistic networks based on this idea. On application to some simple networks, the procedure exhibits a smooth improvement in approximation quality as computation time increases. This suggests that state-space abstraction is one more useful control parameter for designing real-time probabilistic reasoners.

* Appears in Proceedings of the Tenth Conference on Uncertainty in Artificial Intelligence (UAI1994)

Via

Access Paper or Ask Questions

Using Qualitative Relationships for Bounding Probability Distributions

Jan 30, 2013

Chao-Lin Liu, Michael P. Wellman

Figure 1 for Using Qualitative Relationships for Bounding Probability Distributions

Figure 2 for Using Qualitative Relationships for Bounding Probability Distributions

Figure 3 for Using Qualitative Relationships for Bounding Probability Distributions

Abstract:We exploit qualitative probabilistic relationships among variables for computing bounds of conditional probability distributions of interest in Bayesian networks. Using the signs of qualitative relationships, we can implement abstraction operations that are guaranteed to bound the distributions of interest in the desired direction. By evaluating incrementally improved approximate networks, our algorithm obtains monotonically tightening bounds that converge to exact distributions. For supermodular utility functions, the tightening bounds monotonically reduce the set of admissible decision alternatives as well.

* Appears in Proceedings of the Fourteenth Conference on Uncertainty in Artificial Intelligence (UAI1998)

Via

Access Paper or Ask Questions

Incremental Tradeoff Resolution in Qualitative Probabilistic Networks

Jan 30, 2013

Chao-Lin Liu, Michael P. Wellman

Figure 1 for Incremental Tradeoff Resolution in Qualitative Probabilistic Networks

Figure 2 for Incremental Tradeoff Resolution in Qualitative Probabilistic Networks

Figure 3 for Incremental Tradeoff Resolution in Qualitative Probabilistic Networks

Figure 4 for Incremental Tradeoff Resolution in Qualitative Probabilistic Networks

Abstract:Qualitative probabilistic reasoning in a Bayesian network often reveals tradeoffs: relationships that are ambiguous due to competing qualitative influences. We present two techniques that combine qualitative and numeric probabilistic reasoning to resolve such tradeoffs, inferring the qualitative relationship between nodes in a Bayesian network. The first approach incrementally marginalizes nodes that contribute to the ambiguous qualitative relationships. The second approach evaluates approximate Bayesian networks for bounds of probability distributions, and uses these bounds to determinate qualitative relationships in question. This approach is also incremental in that the algorithm refines the state spaces of random variables for tighter bounds until the qualitative relationships are resolved. Both approaches provide systematic methods for tradeoff resolution at potentially lower computational cost than application of purely numeric methods.

* Appears in Proceedings of the Fourteenth Conference on Uncertainty in Artificial Intelligence (UAI1998)

Via

Access Paper or Ask Questions