Persons
Ing. Jan Drchal, Ph.D.
All publications
Pipeline and dataset generation for automated fact-checking in almost any language
- Authors: Ing. Jan Drchal, Ph.D., Ing. Herbert Ullrich, Mlynář, T., Moravec, V.
- Publication: Neural Computing and Applications. 2024, 36(30), 19023-19054. ISSN 0941-0643.
- Year: 2024
- DOI: 10.1007/s00521-024-10113-5
- Link: https://doi.org/10.1007/s00521-024-10113-5
- Department: Department of Computer Science, Artificial Intelligence Center
-
Annotation:
This article presents a pipeline for automated fact-checking leveraging publicly available Language Models and data. The objective is to assess the accuracy of textual claims using evidence from a ground-truth evidence corpus. The pipeline consists of two main modules − the evidence retrieval and the claim veracity evaluation. Our primary focus is on the ease of deployment in various languages that remain unexplored in the field of automated fact-checking. Unlike most similar pipelines, which work with evidence sentences, our pipeline processes data on a paragraph level, simplifying the overall architecture and data requirements. Given the high cost of annotating language-specific fact-checking training data, our solution builds on the Question Answering for Claim Generation (QACG) method, which we adapt and use to generate the data for all models of the pipeline. Our strategy enables the introduction of new languages through machine translation of only two fixed datasets of moderate size. Subsequently, any number of training samples can be generated based on an evidence corpus in the target language. We provide open access to all data and fine-tuned models for Czech, English, Polish, and Slovak pipelines, as well as to our codebase that may be used to reproduce the results. We comprehensively evaluate the pipelines for all four languages, including human annotations and per-sample difficulty assessment using Pointwise V-information. The presented experiments are based on full Wikipedia snapshots to promote reproducibility. To facilitate implementation and user interaction, we develop the FactSearch application featuring the proposed pipeline and the preliminary feedback on its performance.
AI Life Cycle and Human Rights: Risks and Remedies
- Authors: Šmuclerová, M., Ing. Luboš Král, Ph.D., Ing. Jan Drchal, Ph.D.,
- Publication: Artificial Intelligence and Human Rights. Oxford: Oxford University Press, 2023. p. 16-41. ISBN 978-0-19-288248-6.
- Year: 2023
- Department: Artificial Intelligence Center
-
Annotation:
This interdisciplinary chapter, based on the collaboration of international lawyers and artificial intelligence (AI) experts, introduces various phases of the AI life cycle in light of possible human rights violations that may arise from each of them. It identifies the root cause of the risks to human rights and analyses the possible remedies that are common to all AI systems despite their great diversity and domain of use today. The risk of human rights violation arises notably due to unbalanced or biased data, insufficiently identified system boundary conditions or modified context, the existence of the black box, malicious use, or abuse of AI. The gist is to introduce the human rights risk assessment throughout the whole AI life cycle and integrate it into the user requirements and the system specifications in the initial phase. This ensures, inter alia, that the AI system will be developed, tested, and monitored in light of the applicable human rights limitations. Requirements related to transparency, explainability, certification, or the selection of development data are all highly relevant for protection of human rights
CsFEVER and CTKFacts: Acquiring Czech Data for Fact Verification
- Authors: Ing. Herbert Ullrich, Ing. Jan Drchal, Ph.D., Rýpar, M., Vincourová, H., Moravec, V.
- Publication: Language Resources and Evaluation. 2023, 57(4), 1571-1605. ISSN 1574-020X.
- Year: 2023
- DOI: 10.1007/s10579-023-09654-3
- Link: https://doi.org/10.1007/s10579-023-09654-3
- Department: Department of Computer Science, Artificial Intelligence Center
-
Annotation:
In this paper, we examine several methods of acquiring Czech data for automated fact-checking, which is a task commonly modeled as a classification of textual claim veracity w.r.t. a corpus of trusted ground truths. We attempt to collect sets of data in form of a factual claim, evidence within the ground truth corpus, and its veracity label (supported, refuted or not enough info). As a first attempt, we generate a Czech version of the large-scale FEVER dataset built on top of Wikipedia corpus. We take a hybrid approach of machine translation and document alignment; the approach and the tools we provide can be easily applied to other languages. We discuss its weaknesses, propose a future strategy for their mitigation and publish the 127k resulting translations, as well as a version of such dataset reliably applicable for the Natural Language Inference task—the CsFEVER-NLI. Furthermore, we collect a novel dataset of 3,097 claims, which is annotated using the corpus of 2.2 M articles of Czech News Agency. We present an extended dataset annotation methodology based on the FEVER approach, and, as the underlying corpus is proprietary, we also publish a standalone version of the dataset for the task of Natural Language Inference we call CTKFactsNLI. We analyze both acquired datasets for spurious cues—annotation patterns leading to model overfitting. CTKFacts is further examined for inter-annotator agreement, thoroughly cleaned, and a typology of common annotator errors is extracted. Finally, we provide baseline models for all stages of the fact-checking pipeline and publish the NLI datasets, as well as our annotation platform and other experimental data.
Continually trained life-long classification
- Authors: Ing. Rudolf Jakub Szadkowski, Ing. Jan Drchal, Ph.D., prof. Ing. Jan Faigl, Ph.D.,
- Publication: Neural Computing and Applications. 2022, 34(1), 135-152. ISSN 0941-0643.
- Year: 2022
- DOI: 10.1007/s00521-021-06154-9
- Link: https://doi.org/10.1007/s00521-021-06154-9
- Department: Artificial Intelligence Center
-
Annotation:
Two challenges can be found in a life-long classifier that learns continually: the concept drift, when the probability distribution of data is changing in time, and catastrophic forgetting when the earlier learned knowledge is lost. There are many proposed solutions to each challenge, but very little research is done to solve both challenges simultaneously. We show that both the concept drift and catastrophic forgetting are closely related to our proposed description of the life-long continual classification. We describe the process of continual learning as a wrap modification, where a wrap is a manifold that can be trained to cover or uncover a given set of samples. The notion of wraps and their cover/uncover modifiers are theoretical building blocks of a novel general life-long learning scheme, implemented as an ensemble of variational autoencoders. The proposed algorithm is examined on evaluation scenarios for continual learning and compared to state-of-the-art algorithms demonstrating the robustness to catastrophic forgetting and adaptability to concept drift but also showing the new challenges of the life-long classification.
WiSM: Windowing Surrogate Model for Evaluation of Curvature-Constrained Tours With Dubins Vehicle
- Authors: Ing. Jan Drchal, Ph.D., prof. Ing. Jan Faigl, Ph.D., Váňa, P.
- Publication: IEEE Transactions on Cybernetics. 2022, 52(2), 1302-1311. ISSN 2168-2267.
- Year: 2022
- DOI: 10.1109/TCYB.2020.3000465
- Link: https://doi.org/10.1109/TCYB.2020.3000465
- Department: Artificial Intelligence Center
-
Annotation:
Dubins tours represent a solution of the Dubins traveling salesman problem (DTSP) that is a variant of the optimization routing problem to determine a curvature-constrained shortest path to visit a set of locations such that the path is feasible for Dubins vehicle, which moves only forward and has a limited turning radius. The DTSP combines the NP-hard combinatorial optimization to determine the optimal sequence of visits to the locations, as in the regular TSP, with the continuous optimization of the heading angles at the locations, where the optimal heading values depend on the sequence of visits and vice versa. We address the computationally challenging DTSP by fast evaluation of the sequence of visits by the proposed windowing surrogate model (WiSM), which estimates the length of the optimal Dubins path connecting a sequence of locations in a Dubins tour. The estimation is sped up by a regression model trained using close to optimum solutions of small Dubins tours that are generalized for large-scale instances of the addressed DTSP utilizing the sliding-window technique and a cache for already computed results. The reported results support that the proposed WiSM enables fast convergence of a relatively simple evolutionary algorithm to high-quality solutions of the DTSP. We show that with an increasing number of locations, our algorithm scales significantly better than other state-of-the-art DTSP solvers.
Fast Sequence Rejection for Multi-Goal Planning with Dubins Vehicle
- Authors: prof. Ing. Jan Faigl, Ph.D., Váňa, P., Ing. Jan Drchal, Ph.D.,
- Publication: 2020 IEEE/RSJ International Conference on Intelligent Robots and Systems. Piscataway: IEEE Robotics and Automation Society, 2020. p. 6773-6780. ISSN 2153-0866. ISBN 978-1-7281-6212-6.
- Year: 2020
- DOI: 10.1109/IROS45743.2020.9340644
- Link: https://doi.org/10.1109/IROS45743.2020.9340644
- Department: Artificial Intelligence Center
-
Annotation:
Multi-goal curvature-constrained planning such as the Dubins Traveling Salesman Problem (DTSP) combines NP-hard combinatorial routing with continuous optimization to determine the optimal vehicle heading angle for each target location. The problem can be addressed as combinatorial routing using a finite set of heading samples at target locations. In such a case, optimal heading samples can be determined for a sequence of targets in polynomial time, and the DTSP can be solved as searching for a sequence with the minimal cost. However, the examination of sequences can be computationally demanding for high numbers of heading samples and target locations. A fast rejection schema is proposed to quickly examine unfavorable sequences using lower bound estimation of Dubins tour cost based on windowing technique that evaluates short subtours of the sequences. Furthermore, the computation using small problem instances can benefit from reusing stored results and thus speed up the search. The reported results indicate that the computational burden is decreased about two orders of magnitude, and the proposed approach supports finding high-quality solutions of routing problems with Dubins vehicle.
Autoencoders Covering Space as a Life-Long Classifier
- Authors: Ing. Rudolf Jakub Szadkowski, Ing. Jan Drchal, Ph.D., prof. Ing. Jan Faigl, Ph.D.,
- Publication: Advances in Self-Organizing Maps, Learning Vector Quantization, Clustering and Data Visualization. Düsseldorf: Springer-VDI-Verlag, 2019. p. 271-281. ISSN 2194-5357. ISBN 978-3-030-19641-7.
- Year: 2019
- DOI: 10.1007/978-3-030-19642-4_27
- Link: https://doi.org/10.1007/978-3-030-19642-4_27
- Department: Department of Computer Science, Artificial Intelligence Center
-
Annotation:
A life-long classifier that learns incrementally has many challenges such as concept drift, when the class changes in time, and catastrophic forgetting when the earlier learned knowledge is lost. Many successful connectionist solutions are based on an idea that new data are learned only in a part of a network that is relevant to the new data. We leverage this idea and propose a novel method for learning an ensemble of specialized autoencoders. We interpret autoencoders as manifolds that can be trained to contain or exclude given points from the input space. This manifold manipulation allows us to implement a classifier that can suppress catastrophic forgetting and adapt to concept drift. The proposed algorithm is evaluated on an incremental version of the XOR problem and on an incremental version of the MNIST classification where we achieved 0.9 accuracy which is a significant improvement over the previously published results
Basic Evaluation Scenarios for Incrementally Trained Classifiers
- Authors: Ing. Rudolf Jakub Szadkowski, Ing. Jan Drchal, Ph.D., prof. Ing. Jan Faigl, Ph.D.,
- Publication: Artificial Neural Networks and Machine Learning – ICANN 2019: Deep Learning. Basel: Springer, 2019. p. 507-517. Lecture Notes in Computer Science. vol. 11728. ISSN 0302-9743. ISBN 978-3-030-30483-6.
- Year: 2019
- DOI: 10.1007/978-3-030-30484-3_41
- Link: https://doi.org/10.1007/978-3-030-30484-3_41
- Department: Artificial Intelligence Center
-
Annotation:
Evaluation of incremental classification algorithms is a complex task because there are many aspects to evaluate. Besides the aspects such as accuracy and generalization that are usually evaluated in the context of classification, we also need to assess how the algorithm handles two main challenges of the incremental learning: the concept drift and the catastrophic forgetting. However, only catastrophic forgetting is evaluated by the current methodology, where the classifier is evaluated in two scenarios for class addition and expansion. We generalize the methodology by proposing two new scenarios of incremental learning for class inclusion and separation that evaluate the handling of the concept drift. We demonstrate the proposed methodology on the evaluation of three different incremental classifiers, where we show that the proposed methodology provides a more complete and finer evaluation.
Data-driven Activity Scheduler for Agent-based Mobility Models
- Authors: Ing. Jan Drchal, Ph.D., Čertický, M., doc. Ing. Michal Jakob, Ph.D.,
- Publication: Transportation Research Part C: Emerging Technologies. 2019, 98 370-390. ISSN 0968-090X.
- Year: 2019
- DOI: 10.1016/j.trc.2018.12.002
- Link: https://doi.org/10.1016/j.trc.2018.12.002
- Department: Department of Computer Science, Artificial Intelligence Center
-
Annotation:
Activity-based modelling is a modern agent-based approach to travel demand modelling, in which the transport demand is derived from the agent’s needs to perform certain activities at specific places and times. The agent’s mobility is considered in a broader context, which allows the activity-based models to produce more realistic trip chains, compared to traditional trip-based models. The core of any activity-based model is an activity scheduler – a software component producing sequences of agent’s daily activities interconnected by trips, called activity schedules. Traditionally, activity schedulers used to rely heavily on hard-coded knowledge of transport behaviour experts. We introduce the concept of a Data-Driven Activity Scheduler (DDAS), which replaces numerous expert-designed components and their intricately engineered interactions with a collection of machine learning models. Its architecture is significantly simpler, making it easier to deploy and maintain. This shift towards data-driven, machine learning based approach is possible due to increased availability of mobility-related data. We demonstrate DDAS concept using our own proof-of-concept implementation, perform a rigorous analysis and compare the validity of the resulting model to one of the rule-based alternatives using the Validation Framework for Activity-Based Models (VALFRAM).
Terrain Classification with Crawling Robot Using Long Short-Term Memory Network
- Authors: Ing. Rudolf Jakub Szadkowski, Ing. Jan Drchal, Ph.D., prof. Ing. Jan Faigl, Ph.D.,
- Publication: Artificial Neural Networks and Machine Learning – ICANN 2018. Basel: Springer, 2018. p. 771-780. ISSN 0302-9743. ISBN 978-3-030-01423-0.
- Year: 2018
- DOI: 10.1007/978-3-030-01424-7_75
- Link: https://doi.org/10.1007/978-3-030-01424-7_75
- Department: Department of Computer Science, Artificial Intelligence Center
-
Annotation:
Terrain classification is a crucial feature for mobile robots operating across multiple terrains. One way to learn a terrain classifier is to use a stream of labeled proprioceptive data recorded during a terrain traversal. In this paper, we propose a new terrain classifier that combines a feature extraction from a data stream with the long short-term memory (LSTM) network. Features are extracted from the information-sparse data stream by applying a sliding window computing three central moments. The feature sequence is continuously classified by the LSTM network into multiple terrain classes. Furthermore, a modified bagging method is used to deal with a limited and unbalanced training set. In comparison to the previous work on terrain classifiers for a hexapod crawling robot using only servo-drive feedback, the proposed classifier provides continuous classification with the F1 score up to 0.88, and thus provide better results than SVM classifier learned on the same input data.
Towards Data-Driven on-Demand Transport
- Authors: Egan, M., Ing. Jan Drchal, Ph.D., Ing. Jan Mrkos, Ph.D., doc. Ing. Michal Jakob, Ph.D.,
- Publication: EAI Endorsed Transaction on Industrial Networks and Intelligent Systems. 2018, 5(14), 1-10. ISSN 2410-0218.
- Year: 2018
- DOI: 10.4108/eai.27-6-2018.154835
- Link: https://doi.org/10.4108/eai.27-6-2018.154835
- Department: Department of Computer Science, Artificial Intelligence Center
-
Annotation:
On-demand transport has been disrupted by Uber and other providers, which are challenging the traditional approach adopted by taxi services. Instead of using fixed passenger pricing and driver payments, there is now the possibility of adaptation to changes in demand and supply. Properly designed, this new approach can lead to desirable tradeoffs between passenger prices, individual driver profits and provider revenue. However, pricing and allocations known as mechanisms are challenging problems falling in the intersection of economics and computer science. In this paper, we develop a general framework to classify mechanisms in ondemand transport. Moreover, we show that data is key to optimizing each mechanism and analyze a dataset provided by a real-world on-demand transport provider. This analysis provides valuable new insights into efficient pricing and allocation in on-demand transport.
Data Driven Validation Framework for Multi-agent Activity-based Models
- Authors: Ing. Jan Drchal, Ph.D., Čertický, M., doc. Ing. Michal Jakob, Ph.D.,
- Publication: Proceedings of Multi-Agent-Based Simulation Workshop. New York: ACM, 2016, ISBN 978-3-319-31446-4. Available from: http://agents.fel.cvut.cz/~certicky/files/publications/mabs2015.pdf
- Year: 2016
- DOI: 10.1007/978-3-319-31447-1_4
- Link: https://doi.org/10.1007/978-3-319-31447-1_4
- Department: Department of Computer Science
-
Annotation:
Activity-based models, as a specific instance of agent-based models, deal with agents that structure their activity in terms of (daily) activity schedules. An activity schedule consists of a sequence of activity instances, each with its assigned start time, duration and location, together with transport modes used for travel between subsequent activity locations. A critical step in the development of simulation models is validation. Despite the growing importance of activity-based models in modelling transport and mobility, there has been so far no work focusing specifically on statistical validation of such models. In this paper, we propose a six-step Validation Framework for Activity-based Models (VALFRAM) that allows exploiting historical real-world data to assess the validity of activity-based models. The framework compares temporal and spatial properties and the structure of activity schedules against real-world travel diaries and origin-destination matrices. We confirm the usefulness of the framework on three real-world activity-based transport models.
VALFRAM: Validation Framework for Activity-Based Models
- Authors: Ing. Jan Drchal, Ph.D., Čertický, M., doc. Ing. Michal Jakob, Ph.D.,
- Publication: Journal of Artificial Societies and Social Simulation. 2016, 5(19(3)), ISSN 1460-7425.
- Year: 2016
- DOI: 10.18564/jasss.3127
- Link: https://doi.org/10.18564/jasss.3127
- Department: Artificial Intelligence Center
-
Annotation:
Activity-based models are a specific type of agent-based models widely used in transport and urban planning to generate and study travel demand. They deal with agents that structure their behaviour in terms of daily activity schedules: sequences of activity instances (such as work, sleep or shopping) with assigned start times, durations and locations, and interconnected by trips with assigned transport modes and routes. Despite growing importance of activity-based models in transport modelling, there has been no work focusing specifically on statistical validation of such models so far. In this paper, we propose a six-step Validation Framework for Activity-based Models (VALFRAM) that exploits historical real-world data to quantify the model's validity in terms of a set of numeric metrics. The framework compares the temporal and spatial properties and the structure of modelled activity schedules against real-world origin-destination matrices and travel diaries. We demonstrate the usefulness of the framework on a set of six different activity-based transport models.
CaffeLink: Mathematica binding for Caffe Deep Learning Framework
- Authors: Kerhart, M., Ing. Jan Drchal, Ph.D.,
- Publication: Proceedings of the 12th International Mathematica Symposium. Praha: České vysoké učení technické v Praze, Fakulta elektrotechnická, 2015. ISBN 978-80-01-05623-3.
- Year: 2015
- Department: Department of Computer Science
-
Annotation:
In this paper we present CaffeLink an open-source library for Mathematica which is a binding of a well-established Caffe deep learning framework. Caffe is a highly-optimized CUDA accelerated library with focus on convolutional neural networks written in C++ with Python and Matlab bindings. CaffeLink is based upon Mathematica's LibraryLink. It makes accessible most features of Caffe directly from Mathematica environment which includes work with datasets, building networks, training them as well as evaluating them. Here we present an overview of the CaffeLink library with examples on MNIST and ImageNet datasets.
Fully Agent-based Simulation Model of Multimodal Mobility in European Cities
- Authors: Čertický, M., Ing. Jan Drchal, Ph.D., Ing. Marek Cuchý, Ph.D., doc. Ing. Michal Jakob, Ph.D.,
- Publication: Proceedings of the 4th International Conference on Models and Technologies for Intelligent Transportation Systems. Budapest: Budapest University of Technology and Economics, 2015. p. 229-236. ISBN 978-963-313-142-8.
- Year: 2015
- DOI: 10.1109/MTITS.2015.7223261
- Link: https://doi.org/10.1109/MTITS.2015.7223261
- Department: Department of Computer Science
-
Annotation:
Even though the agent-based simulation modelling has become a standard tool in transport research, current imple- mentations still treat travellers as passive data structures, updated synchronously at infrequent, predefined points in time, thus failing to cover within-the-day decision making and negotiation necessary for cooperative behaviour in a dynamic transport system. Leveraging the fully agent-based modelling approach, we have built large-scale activity-based models of multimodal mobility covering areas up to thousands of square kilometres and simulating populations of up to millions of inhabitants of several European cities. Citizens are represented by autonomous, self-interested agents which schedule and execute their activities (work, shopping, leisure, etc.) and trips in time and space. Indi- vidual decisions are influenced by agent’s demographic attributes and modelled using the data from mobility surveys. The model is statistically validated against origin-destination matrices and travel diary data sets.
Genetic Programming of Augmenting Topologies for Hypercube-Based Indirect Encoding of Artificial Neural Networks
- Authors: Ing. Jan Drchal, Ph.D., Šnorek, M.
- Publication: Soft Computing Models in Industrial and Environmental Applications. Heidelberg: Springer, 2013. p. 63-72. ISSN 2194-5357. ISBN 978-3-642-32921-0.
- Year: 2013
- DOI: 10.1007/978-3-642-32922-7_7
- Link: https://doi.org/10.1007/978-3-642-32922-7_7
- Department: Department of Computer Science
-
Annotation:
In this paper we present a novel algorithm called GPAT (Genetic Programming of Augmenting Topologies) which evolves Genetic Programming (GP) trees in a similar way as a well-established neuro-evolutionary algorithm NEAT (NeuroEvolution of Augmenting Topologies) does. The evolution starts from a minimal form and gradually adds structure as needed. A niching evolutionary algorithm is used to protect individuals of a variable complexity in a single population. Although GPAT is a general approach we employ it mainly to evolve artificial neural networks by means of Hypercube-based indirect encoding which is an approach allowing for evolution of large-scale neural networks having theoretically unlimited size. We perform also experiments for directly encoded problems. The results show that GPAT outperforms both GP and NEAT taking the best of both.
Distance Measures for HyperGP with Fitness Sharing
- Authors: Ing. Jan Drchal, Ph.D., Šnorek, M.
- Publication: Proceedings of the fourteenth international conference on Genetic and evolutionary computation conference companion. New York: ACM, 2012. p. 545-552. ISBN 978-1-4503-1177-9.
- Year: 2012
- DOI: 10.1145/2330163.2330241
- Link: https://doi.org/10.1145/2330163.2330241
- Department: Department of Computer Science
-
Annotation:
In this paper we propose a new algorithm called HyperGPEFS (HyperGP with Explicit Fitness Sharing). It is based on a HyperNEAT, which is a well-established evolutionary method employing indirect encoding of artificial neural networks. Indirect encoding in HyperNEAT is realized via special function called Compositional and Pattern Producing Network (CPPN), able to describe a neural network of arbitrary size. CPPNs are represented by network structures, which are evolved by means of a slightly modified version of another, well-known algorithm NEAT (NeuroEvolution of Augmenting Topologies). HyperGP is a variant of HyperNEAT, where the CPPNs are optimized by Genetic Programming (GP). Published results reported promising improvement in the speed of convergence. Our approach further extends HyperGP by using fitness sharing to promote a diversity of a population. Here, we thoroughly compare all three algorithms on six different tasks. Fitness sharing demands a definition of a tree distance measure. Among other five, we propose a generalized distance measure which, in conjunction with HyperGPEFS, significantly outperforms HyperNEAT and HyperGP on all, but one testing problems. Although this paper focuses on indirect encoding, the proposed distance measures are generally applicable.
Meta-learning approach to neural network optimization
- Authors: Kordík, P., Koutník, J., Ing. Jan Drchal, Ph.D., Kovářík, O., Čepek, M., Šnorek, M.
- Publication: Neural Networks. 2010, 2010 (23)(4), 568-582. ISSN 0893-6080.
- Year: 2010
- DOI: 10.1016/j.neunet.2010.02.003
- Link: https://doi.org/10.1016/j.neunet.2010.02.003
- Department: Department of Computer Science
-
Annotation:
Optimization of neural network topology, weights and neuron transfer functions for given data set and problem is not an easy task. In this article, we focus primarily on building optimal feed-forward neural network classifier for i.i.d. data sets. We apply metalearning principles to the neural network structure and function optimization. We show that diversity promotion, ensembling, self-organization and induction are beneficial for the problem. We combine several different neuron types trained by various optimization algorithms to build a supervised feedforward neural network called Group of Adaptive Models Evolution (GAME). The approach was tested on wide number of benchmark data sets. The experiments show that the combination of different optimization algorithms in the network is the best choice when the performance is averaged over several real-world problems.
Combining Multiple Inputs in HyperNEAT Mobile Agent Controller
- Authors: Ing. Jan Drchal, Ph.D., Koutník, J., Šnorek, M.
- Publication: Artificial Neural Networks - ICANN 2009 19th International Conference, Limassol, Cyprus, September 14-17, 2009, Proceedings, Part II. Berlin: Springer, 2009. pp. 775-783. Lecture Notes in Computer Science. ISSN 0302-9743. ISBN 978-3-642-04276-8.
- Year: 2009
- DOI: 10.1007/978-3-642-04277-5
- Link: https://doi.org/10.1007/978-3-642-04277-5
- Department: Department of Computer Science
-
Annotation:
In this paper we present neuro-evolution of neural network controllers for mobile agents in a simulated environment. The controller is obtained through evolution of hypercube encoded weights of recurrent neural networks (HyperNEAT). The simulated agent's goal is to find a target in a shortest time interval. The generated neural network processes three different inputs - surface quality, obstacles and distance to the target. A behavior emerged in agents features ability of driving on roads, obstacle avoidance and provides an efficient way of the target search.
HyperNEAT Controlled Robots Learn How to Drive on Roads in Simulated Environment
- Authors: Ing. Jan Drchal, Ph.D., Koutník, J., Šnorek, M.
- Publication: 2009 IEEE Congress on Evolutionary Computation. Singapore: Research Publishing Services, 2009. ISBN 978-1-4244-2959-2.
- Year: 2009
- DOI: 10.1109/CEC.2009.4983067
- Link: https://doi.org/10.1109/CEC.2009.4983067
- Department: Department of Computer Science
-
Annotation:
In this paper we describe simulation of autonomous robots controlled by recurrent neural networks, which are evolved through indirect encoding using HyperNEAT algorithm. The robots utilize 180 degree wide sensor array. Thanks to the scalability of the neural network generated by HyperNEAT, the sensor array can have various resolution. This would allow to use camera as an input for neural network controller used in real robot. The robots were simulated using software simulation environment. In the experiments the robots were trained to drive with imaximum average speed. Such fitness forces them to learn how to drive on roads and avoid collisions. Evolved neural networks show excellent scalability. Scaling of the sensory input breaks performance of the robots, which should be gained back with re-training of the robot with a different sensory input resolution.
Meta-optimization survey and possible applications of Inductive Approach in this field
- Authors: Kordík, P., Ing. Jan Drchal, Ph.D.,
- Publication: Proceedings of the International Conference ISDMCI 2009. Kyjev: National Academy of Sciences of Ukraine, 2009. pp. 123-132.
- Year: 2009
- Department: Department of Computer Science
-
Annotation:
In this paper we present state of the art in the field of meta-optimization. We are looking for possible applications of Inductive Approach in this field. We propose an inductive based optimization strategy combining several optimization algorithms to solve increasingly complex problems.
Algorithm for Distance-Based Visualization of Data in Computational Intelligence
- Authors: Ing. Jan Drchal, Ph.D., Šnorek, M.
- Publication: Proceedings of Workshop 2008. Praha: Czech Technical University in Prague, 2008. pp. 134-135. ISBN 978-80-01-04016-4.
- Year: 2008
- Department: Department of Computer Science
-
Annotation:
The visualization of multi-dimensional data is a very important task in computational intelligence methods. This article describes a new algorithm which was originally designed to visualize diversity of population in Evolutionary Algorithms (EAs). Later, we realized its abilities for visualization of any general data sets which makes it suitable for data mining purposes.
Grammatical Evolution for Development of Neural Networks with Real-valued Weights Using Cellular Encoding
- Authors: Ing. Jan Drchal, Ph.D., Šnorek, M.
- Publication: European Simulation and Modelling Conference 2008. Ghent: EUROSIS - ETI, 2008. pp. 191-195. ISBN 978-90-77381-44-1.
- Year: 2008
- Department: Department of Computer Science
-
Annotation:
This paper focuses on so-called TWEANNs (Toppology and Weight Evolving Artificial Neural Networks). Here, we concentrate on a use of an indirect developmental encoding which is an approach inspired by multi-cellular organisms' development from a single cell. We examine multiple modifications of a known tree-based indirect developmental encoding: the Cellular Encoding. Grammatical Evolution (GE) is employed instead of Genetic Programming (GP) to optimize program trees. GE is advantageous mainly in the way it can handle constraints (as it evolves program trees which conform to a grammar prespecified using a BNF notation). Moreover, we employ GE's inner mechanisms to efficiently encode Neural Network parameters (weights and biases). In this work, we compare three different link selection schemes. The results of our investigations show that our modifications of Cellular Encoding improve the ability to evolve real-valued Artificial Neural Networks.
Dataset Visualization Based on a Simulation of Intermolecular Forces
- Authors: Ing. Jan Drchal, Ph.D., Kordík, P., Šnorek, M.
- Publication: IWIM 2007 - International Workshop on Inductive Modelling. Praha: Czech Technical University in Prague, 2007. pp. 246-253. ISBN 978-80-01-03881-9.
- Year: 2007
- Department: Department of Computer Science
-
Annotation:
The visualization is an important technique used in many stages of data mining process. This article deals mostly with visualization for preprocessing purposes. The aim of our approach is to visualize distances (Euclidean or others) between data samples. This can be helpful when taking picture of data clustering. In classification tasks it can be used to select outlayer for removal. In this paper we present a novel way of such visualization which is based on a physical system simulation. It is inspired by intermolecular forces and employs overall energy minimization. This minimization is done via known unconstrained optimization numerical methods such as Steepest Descent, Conjugated Gradients or Quasi-Newton. The proposed algorithm was originally designed and was found useful when interpretting diversity in evolutionary algorithms. Here, we show its properties on well-known datasets Iris and Ecoli.
Diversity visualization in evolutionary algorithms
- Authors: Ing. Jan Drchal, Ph.D., Šnorek, M.
- Publication: Proceedings of 41th Spring International Conference MOSIS'07. Ostrava: MARQ, 2007. pp. 77-84. ISBN 978-80-86840-30-7.
- Year: 2007
- Department: Department of Computer Science
-
Annotation:
Evolutionary Algorithms (EAs) are well-known nature-inspired optimization methods. Diversity is an essenial aspect of each EA. It describes the variability of organisms in population. The lack of diversity is common problem - diversity should be preserved in order to evade local extremes (premature convergence). Niching algorithms are modifications of classical EAs. Niching is based on dividing the population into separate subpopulations - it spreads the organisms effectively all over the search space and hence making the overall population diverse. Using niching methods also requires setting of their parameters, which can be very difficult. This paper presents a novel way of diversity visualization based on physical system simulation. This visualization is helpful when designing and tuning niching algorithms but it has also other uses. The visualization will be presented on NEAT - the evolutionary algorithm which optimizes both the topology and the parameters of neural networks.
Diversity Visualization in Niching Evolutionary Algorithms Based on Simulation of Intermolecular Forces
- Authors: Ing. Jan Drchal, Ph.D., Šnorek, M.
- Publication: Počítačové architektury a diagnostika 2007. Plzeň: Západočeská universita, Fakulta aplikovaných věd, 2007. pp. 7-12. ISBN 978-80-7043-605-9.
- Year: 2007
- Department: Department of Computer Science
-
Annotation:
This article deals with visualization of diversity in Evolutionary Algorithms (EAs) Diversity is an essenial aspect of each EA. It describes the variability of individuals in population. The lack of diversity is common problem - diversity should be preserved in order to evade local extremes (premature convergence). Niching algorithms are modifications of classical EAs. Niching is based on dividing the population into separate subpopulations - it spreads the individuals effectively all over the search space and hence making the overall population diverse. Using niching methods also requires setting of their parameters, which can be very difficult. This paper presents a novel way of diversity visualization based on simulation of intermolecular forces. This visualization is helpful when designing and tuning niching algorithms but it can also be used to visualize any data on which a similarity measure is defined.
On Visualization of Diversity in Evolutionary Algorithms Based on Simulation of Physical System
- Authors: Ing. Jan Drchal, Ph.D., Šnorek, M.
- Publication: Proceedings of the 6th EUROSIM Congress on Modelling and Simulation. Vienna: ARGESIM, 2007. ISBN 978-3-901608-32-2.
- Year: 2007
- Department: Department of Computer Science
-
Annotation:
EAs can be seen as algorithms which traverse the search-space in a parallel way. Diversity is an essential aspect of each EA. The lack of diversity is a common problem. Diversity should be preserved in order to evade local extremes (premature convergence). Niching EA is based on dividing the population into separate subpopulations - it spreads the organisms effectively all over the search-space and hence making the overall population diverse. Using niching methods also requires setting of their parameters, which can be very difficult. This paper presents a novel way of diversity visualization based on physical system simulation. It is inspired by intermolecular forces and employs overall energy minimization. This minimization is done via known unconstrained optimization numerical methods. The visualization is helpful when designing and tuning niching algorithms, but it has also other uses.
Visualization of Diversity in Computational Intelligence Methods
- Authors: Ing. Jan Drchal, Ph.D., Kordík, P., Koutník, J.
- Publication: Proceedings of 2nd ISGI 2007 Interational CODATA Symposium on Generalization of Information. Berlin: CODATA - Germany e.V., 2007. pp. 20-34. ISBN 978-3-00-022382-2.
- Year: 2007
- Department: Department of Computer Science
-
Annotation:
Diversity is one of the most important factors that influences quality of results of computational intelligence methods. Study of diversity is not straightforward, since the diversity usually arises from a multidimensional environment. In this paper we present several methods how to visualize diversity in computational intelligence, namely evolutionary algorithms and artificial neural networks. In evolutionary algorithms, there is a need to visualize diversity of individuals in populations. In neural networks a diversity of neurons is being studied. We designed and used several techniques for visualization of diversities.
Maintaining Diversity in Population of Evolved Models
- Authors: Ing. Jan Drchal, Ph.D., Šnorek, M., Kordík, P.
- Publication: Proceedings of 40th Spring International Conference MOSIS 06, Modelling and Simulation of Systems. Ostrava: MARQ, 2006. pp. 113-120. ISBN 80-86840-21-2.
- Year: 2006
- Department: Department of Computer Science
-
Annotation:
This paper deals with creation of models by means of evolutionary algorithms, particularly with maintaining diversity of population using niching methods. Niching algorithms are known for their ability to search for more optima simultaneously. This is done by splitting the population of models into separate species. Species protect promising but yet not fully developed models. Search for more optima at the same time helps to avoid a premature convergence and therefore deals effectively with local optima. Efficiency of two different niching methods is compared on NEAT applied to the neuro-evolution of models.