Computer Science and Communication Engineering : [69]

Collection's Items (Sorted by Submit Date in Descending order): 1 to 20 of 69
  • Article


  • Authors: Vu, Thi Ngoc Anh; Nguyen, Trong Dong; Nguyen, Vu Hoang Vuong; Dang, Thanh Hai; Do, Duc Dong (2019)

  • Aligning protein-protein interaction networks from different species is a useful way to identify orthologous proteins, predict or verify unknown protein functions, or construct evolutionary relationships. The network alignment problem has been proven NP-hard, so known exact algorithms require exponential time, which is infeasible given the fast growth of biological data. In this paper, we present a novel global protein-protein interaction network alignment algorithm, which is enhanced with an extended large neighborhood search heuristic. Evaluated on benchmark datasets of yeast, fly, human and worm, the proposed algorithm outperforms state-of-the-art algorithms. Furthermore,...

  • Article


  • Authors: Hoang, Van Xiem; Duong, Thi Hang; Trinh, Anh Vu; Vu, Xuan Thang (2019)

  • Caching has received much attention as a promising technique to meet the high data rate and stringent latency requirements of future wireless networks. The premise of caching is to prefetch the most popular content closer to end users, in the local cache of edge nodes, e.g., a base station (BS). When a user requests content that is available in the cache, it can be served directly without being sent from the core network. In this paper, we investigate the performance of hierarchical caching systems, in which both the BS and end users are equipped with storage memory. In particular, we propose a novel cooperative caching scheme that jointly optimizes the content placement at the ...

  • Article


  • Authors: Nguyen, Hoai Son; Tan, Yasuo (2019)

  • In this paper, we propose a simple model predictive control (MPC) scheme for heating, ventilation, and air conditioning (HVAC) systems in residential houses. Our control scheme uses a fitted thermal simulation model for each house to achieve precise prediction of room temperature and energy consumption in each prediction period. The set points for each control step of the HVAC system are selected to minimize energy consumption while keeping room temperature within a desirable range to satisfy user comfort. Our control system is simple enough to implement in residential houses and is more efficient than rule-based control methods.
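
The set-point selection described in this abstract can be illustrated with a minimal sketch. It is not the authors' controller: the first-order room model, its coefficients (`leak`, `gain`, `max_power`), the comfort range, and the function names are all hypothetical, and each candidate set point is simply held constant over the prediction horizon. The sketch simulates every candidate, discards those that leave the comfort range, and returns the feasible one with the lowest predicted energy.

```python
def simulate(temp, outdoor, setpoint, steps, leak=0.1, gain=0.5, max_power=4.0):
    """Hypothetical first-order room model; returns (temperatures, energy)."""
    temps, energy = [], 0.0
    for _ in range(steps):
        # Proportional heater: power grows with the gap to the set point,
        # clipped to the range [0, max_power].
        power = min(max(setpoint - temp, 0.0), max_power)
        # Heat leaks toward the outdoor temperature; the heater adds heat.
        temp = temp + leak * (outdoor - temp) + gain * power
        temps.append(temp)
        energy += power
    return temps, energy


def choose_setpoint(temp, outdoor, candidates, comfort=(20.0, 24.0), horizon=6):
    """Pick the candidate with the lowest predicted energy that keeps every
    predicted temperature inside the comfort range; None if none qualifies."""
    best = None
    for setpoint in candidates:
        temps, energy = simulate(temp, outdoor, setpoint, horizon)
        if all(comfort[0] <= t <= comfort[1] for t in temps):
            if best is None or energy < best[1]:
                best = (setpoint, energy)
    return best[0] if best else None


if __name__ == "__main__":
    print(choose_setpoint(21.0, 5.0, [21, 22, 23, 24, 25]))
```

In a receding-horizon loop, only the chosen set point for the current step would be applied, and the selection would be repeated at the next step with the newly measured room temperature.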

  • Article


  • Authors: Dang, Khanh N.; Tran, Xuan Tu (2019)

  • The per-bit soft error rate due to alpha particles in sub-micron technology is expected to decrease as the feature size shrinks. On the other hand, the complexity and density of integrated systems are increasing, which demands efficient soft error protection mechanisms, especially for on-chip communication. A soft error protection method has to satisfy tight area and energy requirements; therefore, a low-complexity, low-redundancy coding method is necessary. In this work, we propose a method to enhance Parity Product Code (PPC) and provide adaptation methods for this code. First, PPC is improved as forward error correcting using transposable retran...

  • Article


  • Authors: Do, Khac Phong; Nguyen, Xuan Thanh; Yu, Hongchuan (2019)

  • Motion style transfer is a primary problem in computer animation, allowing us to convert the motion of one actor to that of another. Myriad approaches have been developed to perform this task; however, the majority of them are data-driven and require a large dataset and a time-consuming training period to achieve good results. In contrast, we propose a novel method that is applied successfully to this task on a small dataset. It exploits Sparse PCA to decompose the original motions into smaller components, which are learned with particular constraints. The synthesized motions are highly precise and smooth and retain their emotion, as shown in our experiments.

  • Article


  • Authors: Vo, Chau; Cao, Tru; Ho, Bao (2018)

  • Abbreviations are widely used in clinical notes because the notes are often written under high pressure, with little writing time and a need to simplify medical records. These abbreviations limit the clarity and understanding of the records and greatly affect all computer-based data processing tasks. In this paper, we propose a solution to the abbreviation identification task on clinical notes in a practical context where only a few clinical notes have been labeled while many more remain to be labeled. Our solution is defined with a semi-supervised learning approach that uses level-wise feature engineering to construct an abbreviation identifier, from using...

  • Article


  • Authors: Nguyen, Hung D.; Cao, Tru H. (2018)

  • Electronic medical records (EMRs) have emerged as an important source of data for research in medicine and information technology, as they contain much valuable medical knowledge about healthcare and patient treatment. This paper tackles the problem of coreference resolution in Vietnamese EMRs. Unlike in English clinical texts, verbs are often used in Vietnamese clinical texts to describe disease symptoms. We therefore first define rules to annotate verbs as mentions and allow coreference between verbs and other noun or adjective mentions. Then we propose a support vector machine classifier on bag-of-words vector representation of mentions that takes into account the special characte...

  • Article


  • Authors: Hoang, Viet Tran; Pham, Ngoc Hung (2018)

  • Assume-guarantee reasoning, a well-known approach in component-based software (CBS) verification, is in fact a language containment problem whose computational cost depends on the sizes of the languages of the software components under checking and of the assumption to be generated. Therefore, the smaller the language of the assumption, the lower the computational cost of software verification. Moreover, strong assumptions are more important in CBS verification in the context of software evolution because they can be reused many times in the verification process. For this reason, this paper presents a method for generating locally strongest assumptions with locally smallest languages during C...

  • Article


  • Authors: Nguyen, Thi Thanh Nhan; Do, Thanh Binh; Nguyen, Huy Hoang; Vu, Hai; Tran, Thi Thanh Hai; Le, Thi Lan (2018)

  • This paper describes several fusion techniques for achieving high-accuracy species identification from images of different plant organs. Given a series of images of different organs, such as branch, entire plant, flower, or leaf, we first extract confidence scores for each single organ using a deep convolutional neural network. Then, various late fusion approaches, including conventional transformation-based approaches (sum rule, max rule, product rule), a classification-based approach (support vector machine), and our proposed hybrid fusion model, are deployed to determine the identity of the plant of interest. For single organ identification, two schemes are proposed. The first scheme uses one ...
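
The three transformation-based rules named in this abstract (sum, max, product) can be sketched as follows. This is only an illustration of the generic rules, not the authors' hybrid model; the function name `fuse`, the species labels, and the confidence scores are made up, and every per-organ score dictionary is assumed to cover the same set of species.

```python
from math import prod


def fuse(scores_per_organ, rule="sum"):
    """Late fusion of per-organ confidence scores.

    scores_per_organ: list of dicts mapping species -> confidence score,
    one dict per organ image (assumed to share the same species keys).
    Returns (best_species, fused_scores).
    """
    combine = {"sum": sum, "max": max, "product": prod}[rule]
    species = sorted(scores_per_organ[0])
    fused = {
        sp: combine(scores[sp] for scores in scores_per_organ)
        for sp in species
    }
    return max(fused, key=fused.get), fused


if __name__ == "__main__":
    # Hypothetical scores from a leaf image and a flower image.
    leaf = {"A": 0.45, "B": 0.55, "C": 0.0}
    flower = {"A": 0.45, "B": 0.05, "C": 0.5}
    for rule in ("sum", "max", "product"):
        print(rule, fuse([leaf, flower], rule)[0])
```

With these made-up scores the sum and product rules favor species A, which both organs support moderately, while the max rule favors species B, which a single organ supports strongly. This illustrates why the choice of fusion rule matters.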

  • Article


  • Authors: Nguyen, Van Hao; Nguyen, Duc Minh; Pham, Nguyen Thanh Loan (2018)

  • In this paper, an adaptive, wide-range output DC-DC converter designed for a lithium-ion (Li-Ion) battery charger circuit is proposed. The converter operates in continuous conduction mode (CCM) to provide an output voltage that follows the battery voltage and a wide-range output current to ensure that circuit requirements are met. The circuit is designed in Cadence using 0.35-µm BCD technology. Simulation results show that the circuit operates fully in CCM with a load current from 50 mA to 1000 mA and that the output voltage ripple factor is less than 1%. Furthermore, the current supplied to the load circuit responds to three types of Li-Ion rechargeable currents. The output vo...

  • Article


  • Authors: Dao, Van Lan; Nguyen, Anh Thai; Hoang, Van Phuc (2018)

  • This paper presents a low-area, low-power AES-CCM authenticated encryption IP core with silicon demonstration in a 180 nm standard CMOS process. The proposed AES-CCM core combines a low-area 8-bit single S-box AES encryption core, an improved iterative structure and other optimized circuits. The implementation results show that the proposed AES-CCM core achieves very high resource efficiency with 6.5 kgates GE and a low power consumption of 11.6 µW/MHz while meeting the operation speed requirements of many applications, including IEEE 802.15.6 WBANs. The detailed implementation and optimization results are also presented and discussed.

  • Article


  • Authors: Do, Dac Thiem; Ho, Van Khuong (2018)

  • A spectrum sharing environment creates cross-interference between the licensed network and the unlicensed network. Most existing works consider unlicensed interference (i.e., interference from the unlicensed network to the licensed network) while ignoring licensed interference (i.e., interference from the licensed network to the unlicensed network). Moreover, existing channel estimation algorithms cannot estimate channel information exactly. In this paper, the impacts of licensed interference and inaccurate channel information on information security in the spectrum sharing environment are analyzed under a peak transmit power bound, a peak interference power bound, and Rayleigh fading. Toward this end, a secrecy outage...

  • Article


  • Authors: Le, Dao Thi Hue; Luong, Pham Van; Duong, Dinh Trieu; Xiem, HoangVan (2018)

  • Video surveillance has played an important role in public safety and privacy protection in recent years thanks to its capability for activity monitoring and content analysis. However, the amount of data associated with long hours of surveillance video is huge, making it less attractive for practical applications. In this paper, we propose a low-complexity yet efficient scalable video coding solution for video surveillance systems. The proposed surveillance video compression scheme provides quality scalability by following a layered coding structure that consists of one or several enhancement layers on top of a base layer. In addition, to maintain the b...

  • Article


  • Authors: Tran, Ngoc Ha; Le, Nhu Hien; Hoang, Xuan Huan (2018)

  • One of the main tasks of structural biology is comparing the structure of proteins. Comparisons of protein structure can determine their functional similarities. Multigraph alignment is a useful tool for identifying functional similarities based on structural analysis. This article proposes a new algorithm for aligning protein binding sites called ACOTS-MGA. This algorithm is based on the memetic scheme. It uses the ant colony optimization (ACO) method to construct a set of solutions, then selects the best solution for implementing Tabu Search to improve the solution quality. Experimental results have shown that ACOTS-MGA outperforms state-of-the-art algorithms while producing al...

  • Article


  • Authors: Vu, Ngoc Cham; Nguyen, Tuan Anh (2018)

  • A federation is usually an alliance of organisations where users from one organisation are trusted to access resources in another organisation. The membership of federations is diverse and continually changing. Federations require distributed and dynamic security policy management to meet these challenges. We propose an authorisation policy management model, FABACD, which simplifies the management of collaborations between organisations. It allows distributed and trusted administrators to adjust the authorisation policies in a resource holding organisation, whilst ensuring that the latter remains in ultimate control. The net result is that a resource’s authorisation system is able t...

  • Article


  • Authors: Vo, Thi Ngoc Chau; Nguyen, Hua Phung (2018)

  • Educational data clustering on the student data collected in a program can find groups of students who share similar characteristics in their behavior and study performance. For some programs, it is not trivial to prepare enough data for the clustering task. Data shortage might then reduce the effectiveness of the clustering process, so that the true clusters cannot be discovered appropriately. On the other hand, there are other programs that have been well examined, with much larger data sets available for the task. Therefore, the question is whether we can exploit the larger data sets from other source programs to enhance the educational data clustering task on t...

  • Article


  • Authors: Vu, Thi Ngoc Anh; Dinh, Phuc Thai; Nguyen, Hoang Duc; Dang, Thanh Hai; Do, Duc Dong (2018)

  • Reconstructing a set of genetic sequences (founders) that can combine to form the given genetic sequences (e.g. DNA) of the individuals of a population is an important problem in evolutionary biology. Such reconstruction can be modeled as a combinatorial optimization problem, in which we have to find a set of founders from which the genetic sequences of the population can be generated using the smallest number of recombinations. In this paper we propose an ant colony optimization (ACO) based method, equipped with some important improvements, for the founder DNA sequence reconstruction problem. The proposed method yields excellent performance when validated on 108 test sets fro...

  • Article


  • Authors: Le, Hong Phuong; Pham, Thai Hoang; Pham, Xuan Khoai; Nguyen, Thi Minh Huyen; Nguyen, Thi Luong; Nguyen, Minh Hiep (2018)

  • In this paper, we study semantic role labelling (SRL), a subtask of semantic parsing of natural language sentences, and its application to the Vietnamese language. We present our effort in building Vietnamese PropBank, the first Vietnamese SRL corpus, and a software system for labelling the semantic roles of Vietnamese texts. In particular, we present a novel constituent extraction algorithm for the argument candidate identification step, which is more suitable and more accurate than the common node-mapping method. In the machine learning part, our system integrates distributed word features produced by two recent unsupervised learning models in two learned statistical classifiers and makes...

  • Article


  • Authors: Dinh, Ngoc Thi (2018)

  • Search-based test data generation is a very popular domain in the field of automatic test data generation. However, existing search-based test data generators suffer from some problems. By combining static program analysis and search-based testing, our proposed approach overcomes one of these problems. Taking automation capability and path coverage as the test adequacy criterion, this paper proposes using Particle Swarm Optimization, an alternative search technique, to automate the generation of test data for evolutionary structural testing. Experimental results demonstrate that our test data generator can generate suitable test data with higher path coverage than the previou...

  • Article


  • Authors: Tran, Hong Viet; Nguyen, Van Vinh; Vu, Thuong Huyen; Nguyen, Le Minh (2018)

  • Reordering is a major challenge in machine translation (MT) between two languages with significant differences in word order. In this paper, we present a pre-processing approach for phrase-based statistical machine translation (SMT) that uses a dependency parser to learn automatic and manual reordering rules from English to Vietnamese. The dependency parse trees and transformation rules are used to reorder the source sentences, which are then passed to systems translating from English to Vietnamese. We evaluated our approach on English-Vietnamese machine translation tasks and showed that it outperforms the baseline phrase-based SMT system.
