doc. RNDr. Daniel Průša, Ph.D.

SCOD: From Heuristics to Theory

Autoři: Ing. Vojtěch Franc, Ph.D., Ing. Jakub Paplhám, doc. RNDr. Daniel Průša, Ph.D.,
Publikace: Computer Vision – ECCV 2024, Part LXXXIV. Springer, Cham, 2025. p. 424-441. LNCS. vol. 15142. ISSN 0302-9743. ISBN 978-3-031-72906-5.
Rok: 2025

DOI: 10.1007/978-3-031-72907-2_25
Odkaz: https://doi.org/10.1007/978-3-031-72907-2_25
Pracoviště: Skupina vizuálního rozpoznávání, Strojové učení
Anotace:
This paper addresses the problem of designing reliable prediction models that abstain from predictions when faced with uncertain or out-of-distribution samples - a recently proposed problem known as Selective Classification in the presence of Out-of-Distribution data (SCOD). We make three key contributions to SCOD. Firstly, we demonstrate that the optimal SCOD strategy involves a Bayes classifier for in-distribution (ID) data and a selector represented as a stochastic linear classifier in a 2D space, using i) the conditional risk of the ID classifier, and ii) the likelihood ratio of ID and out-of-distribution (OOD) data as input. This contrasts with suboptimal strategies from current OOD detection methods and the Softmax Information Retaining Combination (SIRC), specifically developed for SCOD. Secondly, we establish that in a distribution-free setting, the SCOD problem is not Probably Approximately Correct learnable when relying solely on an ID data sample. Third, we introduce POSCOD, a simple method for learning a plugin estimate of the optimal SCOD strategy from both an ID data sample and an unlabeled mixture of ID and OOD data. Our empirical results confirm the theoretical findings and demonstrate that our proposed method, POSCOD, outperforms existing OOD methods in effectively addressing the SCOD problem.

Consistent and Tractable Algorithm for Markov Network Learning

Autoři: Ing. Vojtěch Franc, Ph.D., doc. RNDr. Daniel Průša, Ph.D., Ing. Andrii Yermakov,
Publikace: Machine Learning and Knowledge Discovery in Databases, Part IV. Cham: Springer, 2023. p. 435-451. Lecture Notes in Computer Science. vol. 13716. ISSN 0302-9743. ISBN 978-3-031-26411-5.
Rok: 2023

DOI: 10.1007/978-3-031-26412-2_27
Odkaz: https://doi.org/10.1007/978-3-031-26412-2_27
Pracoviště: Strojové učení
Anotace:
Markov network (MN) structured output classifiers provide a transparent and powerful way to model dependencies between output labels. The MN classifiers can be learned using the M3N algorithm, which, however, is not statistically consistent and requires expensive fully annotated examples. We propose an algorithm to learn MN classifiers that is based on Fisher-consistent adversarial loss minimization. Learning is transformed into a tractable convex optimization that is amenable to standard gradient methods. We also extend the algorithm to learn from examples with missing labels. We show that the extended algorithm remains convex, tractable, and statistically consistent.

One-Side Pumping and Two-Side Pumping by Complete CF(¢, $)-grammars

Autoři: Mráz, F., Plátek, M., Pardubská, D., doc. RNDr. Daniel Průša, Ph.D.,
Publikace: Proceedings of the 23rd Conference Information Technologies – Applications and Theory (ITAT 2023). Aachen: CEUR Workshop Proceedings, 2023. p. 110-120. ISSN 1613-0073.
Rok: 2023

Pracoviště: Strojové učení
Anotace:
We introduce context-free grammar with sentinels, CF(¢, $)-grammar, as a generalization of recently introduced LR(¢, $)grammar. Original LR(¢, $)-grammars can be used to construct deterministic pumping restarting automata performing correctness and error preserving pumping analysis by reduction on each word over its input alphabet. Pumping analysis by reduction involves step-wise simplification of an input word by removing at most two continuous parts of the current word while preserving the correctness or incorrectness of the word. Each such simplification step corresponds to removing portions of the current word that can be “pumped” according to the pumping lemma for context-free languages, and thus, it does not use any nonterminals. One-side pumping grammars are CF(¢, $)-grammars that allow removing just one continuous part in each step of pumping reduction. A complete CF(¢, $)-grammar generates both a language and its complement with sentinels. We show that complete one-side pumping CF(¢, $)-grammars characterize the class of regular languages, while LR(¢, $)-grammars that allow two-side pumping reductions characterize the class of deterministic context-free languages. LR(¢, $)-grammars that do not allow any one-side pumping reduction generate non-regular languages only.

Optimal Strategies for Reject Option Classifiers

Autoři: Ing. Vojtěch Franc, Ph.D., doc. RNDr. Daniel Průša, Ph.D., Voráček, V.
Publikace: Journal of Machine Learning Research. 2023, 24 ISSN 1532-4435.
Rok: 2023

Pracoviště: Strojové učení
Anotace:
In classification with a reject option, the classifier is allowed in uncertain cases to abstain from prediction. The classical cost-based model of a reject option classifier requires the rejection cost to be defined explicitly. The alternative bounded-improvement model and the bounded-abstention model avoid the notion of the reject cost. The bounded-improvement model seeks a classifier with a guaranteed selective risk and maximal cover. The bounded-abstention model seeks a classifier with guaranteed cover and minimal selective risk. We prove that despite their different formulations the three rejection models lead to the same prediction strategy: the Bayes classifier endowed with a randomized Bayes selection function. We define the notion of a proper uncertainty score as a scalar summary of the prediction uncertainty sufficient to construct the randomized Bayes selection function. We propose two algorithms to learn the proper uncertainty score from examples for an arbitrary black-box classifier. We prove that both algorithms provide Fisher consistent estimates of the proper uncertainty score and demonstrate their efficiency in different prediction problems, including classification, ordinal regression, and structured output classification.

Two-Dimensional Rank-Reducing Grammars and Their Complexity

Autoři: doc. RNDr. Daniel Průša, Ph.D.,
Publikace: Journal of Automata, Languages and Combinatorics. 2023, 28(1-3), 143-166. ISSN 2567-3785.
Rok: 2023

DOI: 10.25596/jalc-2023-143
Odkaz: https://doi.org/10.25596/jalc-2023-143
Pracoviště: Strojové učení
Anotace:
We study properties of a two-dimensional grammar introduced recently for use in document analysis and recognition. The grammar is obtained from the two-dimensional context-free grammar by limiting the form of productions. Variants (ranks) of the grammar with regard to productions complexity are defined. If the grammar is restricted to produce one-row pictures only, it generates regular languages. We suggest that the lowest-rank variant can be considered as a natural generalization of the regular matrix grammar, which in addition shares some of its good properties. Namely, it can be parsed in time linear in the input area and the emptiness problem is still decidable for the grammar. However, we also show that the higher-rank variants do not loosen complexity of the context-free grammar too much. There is a conditional lower bound preventing to propose a linear-time parsing algorithm. Moreover, the grammar is able to simulate the 2-counter Minsky machine, which results in non-recursive trade-offs between grammars of different ranks and also in undecidability of the emptiness problem.

Weight-reducing Turing machines

Autoři: Guillon, B., Pighizzini, G., Prigioniero, L., doc. RNDr. Daniel Průša, Ph.D.,
Publikace: Information and Computation. 2023, 292 ISSN 0890-5401.
Rok: 2023

DOI: 10.1016/j.ic.2023.105030
Odkaz: https://doi.org/10.1016/j.ic.2023.105030
Pracoviště: Strojové učení
Anotace:
It is well known that one-tape Turing machines running in linear time are no more powerful than finite automata; namely they recognize exactly the class of regular languages. We prove that it is not decidable if a one-tape machine runs in linear time, even if it is deterministic and restricted to use only the portion of the tape that initially contains the input. This motivates the introduction of a constructive variant of one-tape machines, called a weight-reducing machine, and the investigation of its properties. We focus on the deterministic case. In particular, we show that, paying a polynomial size increase only, each weight-reducing machine can be turned into a halting one that runs in linear time. Furthermore each weight-reducing machine can be converted into equivalent nondeterministic and deterministic finite automata by paying an exponential and doubly-exponential increase in size, respectively. These costs cannot be reduced in the worst case.

Converting nondeterministic two-way automata into small deterministic linear-time machines

Autoři: Guillon, B., Pighizzini, G., Prigioniero, L., doc. RNDr. Daniel Průša, Ph.D.,
Publikace: Information and Computation. 2022, 289 ISSN 0890-5401.
Rok: 2022

DOI: 10.1016/j.ic.2022.104938
Odkaz: https://doi.org/10.1016/j.ic.2022.104938
Pracoviště: Strojové učení
Anotace:
In 1978 Sakoda and Sipser raised the question of the cost, in terms of size of representations, of the transformation of two-way and one-way nondeterministic automata into equivalent two-way deterministic automata. Despite all the attempts, the question has been answered only for particular cases, while it remains open in general, the best upper bound currently known being exponential. We present a new approach in which unrestricted nondeterministic automata are simulated by deterministic models extending two-way deterministic automata, paying only a polynomial increase of size. Indeed, we study the costs of the conversions of nondeterministic automata into some variants of one-tape deterministic Turing machines working in linear time; namely Hennie machines, weight-reducing Turing machines, and weight-reducing Hennie machines. All these variants are known to share the same computational power: they characterize the class of regular languages.

On Pumping RP-automata Controlled by Complete LR(¢, $)-grammars

Autoři: Plátek, M., Mráz, F., Pardubská, D., doc. RNDr. Daniel Průša, Ph.D.,
Publikace: Proceedings of the 22nd Conference Information Technologies – Applications and Theory (ITAT 2022). CEUR-WS.org, 2022. p. 111-121. CEUR Workshop Proceedings. vol. 3226. ISSN 1613-0073.
Rok: 2022

Pracoviště: Strojové učení
Anotace:
We introduce complete LR(0)-grammars with sentinels (called complete LR(¢,$)-grammars) to prepare tools for the study of pumping restarting automata controlled by this type of grammars. A complete LR(¢,$)-grammar generates both a language and the complement of the language with sentinels. Based on a complete LR(¢,$)- grammar, we can construct a deterministic pumping restarting automaton performing pumping analysis by reduction on each word over its input alphabet. A pumping reduction analysis is a method where an input word is stepwise simplified by removing at most two continuous parts of the current word in a way that preserves (in)correctness of the word. Each such simplification corresponds to removing parts of the current word that could be “pumped” in the sense of a pumping lemma for context-free languages. The computation of a pumping restarting automaton ends when the original word is shortened under a given length, and then it is decided about the correctness or incorrectness of the original input word. This means that pumping restarting automata can analyze both correct and incorrect inputs with respect to a deterministic context-free language (DCFL). That gives an excellent formal basis for the error localization and the error recovery of DCFL.

On separations of LR(0)-grammars by two types of pumping patterns

Autoři: Plátek, M., Mráz, F., Pardubská, D., doc. RNDr. Daniel Průša, Ph.D., Šíma, J.
Publikace: Proceedings of the 21st Conference Information Technologies – Applications and Theory (ITAT 2021). Aachen: CEUR Workshop Proceedings, 2021. p. 140-146. vol. 2962. ISSN 1613-0073.
Rok: 2021

Pracoviště: Strojové učení
Anotace:
We present two types of pumping patterns that allow a total separation inside the class of LR(0)-grammars. Using the same type of pumping patterns, we obtain a total separation inside of linear LR(0)-grammars. This type of study has a long-term motivation from computational linguistics and the area of syntactic error localization. A recent motivation also comes from the field of formal models of neural networks.

Two-dimensional pattern matching against local and regular-like picture languages

Autoři: Mráz, F., doc. RNDr. Daniel Průša, Ph.D., Wehar, M.
Publikace: Theoretical Computer Science. 2021, 870 137-152. ISSN 0304-3975.
Rok: 2021

DOI: 10.1016/j.tcs.2020.12.026
Odkaz: https://doi.org/10.1016/j.tcs.2020.12.026
Pracoviště: Strojové učení
Anotace:
Given a two-dimensional array of symbols and a picture language over a finite alphabet, we investigate how to find rectangular subarrays that belong to the picture language. Two-dimensional pattern matching problems can be formulated by interpreting subarrays as matches and picture languages as patterns. We formulate four particular problems – finding maximum, minimum, any or all match(es) – and describe algorithms solving the problems for basic classes of picture languages, which include local picture languages and picture languages accepted by various types of deterministic two-dimensional finite automata such as four-way, three-way and two-way automata, on-line tessellation automata, and returning finite automata. We also prove that the pattern matching problems cannot be solved for the class of local picture languages in time linear in the input area unless the well known problem of triangle finding in a graph is solvable in quadratic time. This shows a fundamental difference between the complexity of one-dimensional and two-dimensional pattern matching.

Complexity of Searching for 2 by 2 Submatrices in Boolean Matrices

Autoři: doc. RNDr. Daniel Průša, Ph.D., Wehar, M.
Publikace: International Conference on Developments in Language Theory. Springer, Cham, 2020. p. 266-279. Lecture Notes in Computer Science. vol. 12086. ISSN 0302-9743. ISBN 978-3-030-48515-3.
Rok: 2020

DOI: 10.1007/978-3-030-48516-0_20
Odkaz: https://doi.org/10.1007/978-3-030-48516-0_20
Pracoviště: Strojové učení
Anotace:
We study the problem of finding a given 2x2 matrix as a submatrix of a given Boolean matrix. Three variants are considered: search for a matching submatrix of any area, of minimum area, or of maximum area. The problem relates to 2D pattern matching, and to fields such as data mining, where the search for submatrices plays an important role. Besides these connections, the problem itself is very natural and its investigation helps to demonstrate differences between search tasks in one-dimensional and multidimensional topologies.

Complexity of Two-Dimensional Rank-Reducing Grammars

Autoři: doc. RNDr. Daniel Průša, Ph.D.,
Publikace: 22nd International Conference, DCFS 2020, Vienna, Austria, August 24–26, 2020, Proceedings. Springer, Cham, 2020. p. 155-166. Lecture Notes in Computer Science. vol. 12442. ISSN 0302-9743. ISBN 978-3-030-62535-1.
Rok: 2020

DOI: 10.1007/978-3-030-62536-8_13
Odkaz: https://doi.org/10.1007/978-3-030-62536-8_13
Pracoviště: Strojové učení
Anotace:
We study properties of a two-dimensional grammar introduced recently for use in document analysis and recognition. The grammar is obtained from the two-dimensional context-free grammar by restricting the form of productions. Variants (ranks) of the grammar with regard to productions complexity are defined. It is suggested that the lowest-rank variant can be considered as a natural generalization of the regular matrix grammar, which in addition has good properties with respect to the membership and emptiness problems. However, it is also showed that the higher-rank variants do not loosen complexity of the context-free grammar too much. There is a conditional lower bound preventing to propose a linear-time parsing algorithm. Moreover, the grammar is able to simulate the 2-counter Minsky machine, which results in non-recursive trade-offs and undecidability of the emptiness problem.

Relative Interior Rule in Block-Coordinate Descent

Autoři: doc. Ing. Tomáš Werner, Ph.D., doc. RNDr. Daniel Průša, Ph.D., Dlask, T.
Publikace: 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). USA: IEEE Computer Society, 2020. p. 7556-7564. ISSN 1063-6919. ISBN 978-1-7281-7169-2.
Rok: 2020

DOI: 10.1109/CVPR42600.2020.00758
Odkaz: https://doi.org/10.1109/CVPR42600.2020.00758
Pracoviště: Strojové učení
Anotace:
It is well-known that for general convex optimization problems, block-coordinate descent can get stuck in poor local optima. Despite that, versions of this method known as convergent message passing are very successful to approximately solve the dual LP relaxation of the MAP inference problem in graphical models. In attempt to identify the reason why these methods often achieve good local minima, we argue that if in block-coordinate descent the set of minimizers over a variable block has multiple elements, one should choose an element from the relative interior of this set. We show that this rule is not worse than any other rule for choosing block-minimizers. Based on this observation, we develop a theoretical framework for block-coordinate descent applied to general convex problems. We illustrate this theory on convergent message-passing methods.

Solvers for Mathematical Word Problems in Czech

Autoři: Kadlec, J., doc. RNDr. Daniel Průša, Ph.D.,
Publikace: Proceedings of the 20th Conference Information Technologies - Applications and Theory (ITAT 2020). Aachen: CEUR Workshop Proceedings, 2020. p. 18-25. ISSN 1613-0073.
Rok: 2020

Pracoviště: Strojové učení
Anotace:
We study the task of an automatic evaluation of mathematical word problems, which belongs to the category of natural language processing and has become popular in recent years. Since all the so far published methods were developed for inputs in English, our goal is to review them and propose solutions able to cope with inputs in the Czech language. We face the question whether we can achieve a competitive accuracy for a natural language with flexible word order, and with the assumption that only a relatively small dataset of training and testing data is available. We propose and evaluate two methods. One relies on a rule-based processing of dependency trees computed by UDPipe. The other method builds on machine learning. It transforms word problems into numeric vectors and trains SVM to classify them. We also show that it improves in a combination with a search for predefined sequences of words and word classes, achieving 75% accuracy on our dataset of 500 Czech word problems.

A Simple Extension to Finite Tree Automata for Defining Sets of Labeled, Connected Graphs

Autoři: Fujiyoshi, A., doc. RNDr. Daniel Průša, Ph.D.,
Publikace: Implementation and Application of Automata. Berlin, Heidelberg: Springer, 2019. p. 121-132. Lecture Notes in Computer Science. vol. 11601. ISSN 0302-9743. ISBN 978-3-030-23678-6.
Rok: 2019

DOI: 10.1007/978-3-030-23679-3_10
Odkaz: https://doi.org/10.1007/978-3-030-23679-3_10
Pracoviště: Strojové učení
Anotace:
This paper introduces spanning tree automata (ST automata) usable for defining sets of labeled, connected graphs. The automata are simply obtained by extending ordinary top-down finite tree automata for labeled, ordered trees. It is shown that ST automata can define any finite set of labeled, connected graphs, and also some subclasses of infinite sets of graphs that can represent the structure of chemical molecules. Although the membership problem for ST automata is NP-complete, an efficient software was developed which supports a practical use of ST automata in chemoinformatics as well as in other fields.

On Discriminative Learning of Prediction Uncertainty

Autoři: Ing. Vojtěch Franc, Ph.D., doc. RNDr. Daniel Průša, Ph.D.,
Publikace: International Conference on Machine Learning. Proceedings of Machine Learning Research, 2019. p. 3465-3480. ISSN 2640-3498. ISBN 978-1-5108-8698-8.
Rok: 2019

Pracoviště: Strojové učení
Anotace:
In classification with a reject option, the classifier is allowed in uncertain cases to abstain from prediction. The classical cost based model of an optimal classifier with a reject option requires the cost of rejection to be defined explicitly. An alternative bounded-improvement model, avoiding the notion of the reject cost, seeks for a classifier with a guaranteed selective risk and maximal cover. We prove that both models share the same class of optimal strategies, and we provide an explicit relation between the reject cost and the target risk being the parameters of the two models. An optimal rejection strategy for both models is based on thresholding the conditional risk defined by posterior probabilities which are usually unavailable. We propose a discriminative algorithm learning an uncertainty function which preserves ordering of the input space induced by the conditional risk, and hence can be used to construct optimal rejection strategies.

Solving LP Relaxations of Some NP-Hard Problems Is As Hard As Solving Any Linear Program

Autoři: doc. RNDr. Daniel Průša, Ph.D., doc. Ing. Tomáš Werner, Ph.D.,
Publikace: SIAM JOURNAL ON OPTIMIZATION. 2019, 29(3), 1745-1771. ISSN 1052-6234.
Rok: 2019

DOI: 10.1137/17M1142922
Odkaz: https://doi.org/10.1137/17M1142922
Pracoviště: Strojové učení
Anotace:
We show that the general linear programming (LP) problem reduces in nearly linear time to the LP relaxations of many classical NP-hard combinatorial problems, assuming sparse encoding of instances. We distinguish two types of such reductions. In the first type (shown for set cover/packing, facility location, maximum satisfiability, maximum independent set, and multiway cut), the input linear program is feasible and bounded iff the optimum value of the LP relaxation attains a threshold, and then optimal solutions to the input linear program correspond to optimal solutions to the LP relaxation. In the second type (shown for exact set cover, three-dimensional matching, and constraint satisfaction), feasible solutions to the input linear program correspond to feasible solutions to the LP relaxations. Thus, the reduction preserves objective values of all (not only optimal) solutions. In polyhedral terms, every polytope in standard form is a scaled coordinate projection of the optimal or feasible set of the LP relaxation. Besides nearly linear-time reductions, we show that the considered LP relaxations are P-complete under log-space reductions, and therefore also hard to parallelize. These results pose a limitation on designing algorithms to compute exact or even approximate solutions to the LP relaxations, as any lower bound on the complexity of solving the general LP problem is inherited by the LP relaxations.

Two-Dimensional Pattern Matching Against Basic Picture Languages

Autoři: Mráz, F., doc. RNDr. Daniel Průša, Ph.D., Wehar, M.
Publikace: Implementation and Application of Automata. Berlin, Heidelberg: Springer, 2019. p. 209-221. Lecture Notes in Computer Science. vol. 11601. ISSN 0302-9743. ISBN 978-3-030-23678-6.
Rok: 2019

DOI: 10.1007/978-3-030-23679-3_17
Odkaz: https://doi.org/10.1007/978-3-030-23679-3_17
Pracoviště: Strojové učení
Anotace:
Given a two-dimensional array of symbols and a picture language over a finite alphabet, we study the problem of finding rectangular subarrays of the array that belong to the picture language. We formulate four particular problems – finding maximum, minimum, any or all match(es) – and describe algorithms solving them for basic classes of picture languages, including local picture languages and picture languages accepted by deterministic on-line tessellation automata or deterministic four-way finite automata. We also prove that the matching problems cannot be solved for the class of local picture languages in linear time unless the problem of triangle finding is solvable in quadratic time. This shows there is a fundamental difference in the pattern matching complexity regarding the one-dimensional and two-dimensional setting.

Dynamics of the Independence Number and Automata Synchronization

Autoři: Gusev, V.V., Jungers, R.M., doc. RNDr. Daniel Průša, Ph.D.,
Publikace: International Conference on Developments in Language Theory. Cham: Springer International Publishing, 2018. p. 379-391. Lecture Notes in Computer Science. vol. 11088. ISSN 0302-9743. ISBN 978-3-319-98653-1.
Rok: 2018

DOI: 10.1007/978-3-319-98654-8_31
Odkaz: https://doi.org/10.1007/978-3-319-98654-8_31
Pracoviště: Strojové učení
Anotace:
We study the lengths of synchronizing words produced by the classical greedy compression algorithm. We associate a sequence of graphs with every synchronizing automaton and rely on evolution of the independence number to bound the lengths of produced words. By leveraging graph theoretical results we show that in many cases automata with good extension properties have good compression properties as well. More precisely, we show that the compression algorithm will produce a synchronizing word of length O(n^2log(n)) on cyclic, regular and strongly-transitive automata with n states, which is not far from the best possible bound of (n−1)^2. Furthermore, we provide a relatively simple proof that every n-state automaton has a synchronizing word of length at most (1/4)n^3+O(n^2) .

Rank-Reducing Two-Dimensional Grammars for Document Layout Analysis

Autoři: doc. RNDr. Daniel Průša, Ph.D., Fujiyoshi, A.
Publikace: 14th IAPR International Conference on Document Analysis and Recognition (ICDAR). Los Alamitos: IEEE Computer Society, 2018. p. 1120-1125. ISSN 1520-5363. ISBN 978-1-5386-3586-5.
Rok: 2018

DOI: 10.1109/ICDAR.2017.185
Odkaz: https://doi.org/10.1109/ICDAR.2017.185
Pracoviště: Strojové učení
Anotace:
We study the task of document layout analysis based on two-dimensional context-free grammars. We first identify a subclass of the grammars sufficient for a document structure description where productions follow a mechanism inducing regular languages in the case of one-dimensional productions. We then show that properties of such grammars can be conveniently utilized to implement a very fast top-down parser. Experimental results are reported for PDF documents, which are chosen as a test domain since we are motivated by a development of digital document access methods for people with disabilities in which a retrieval of structural information plays an important role.

Two-Way Automata and One-Tape Machines: Read Only Versus Linear Time

Autoři: Guillon, B., Pighizzini, G., Prigioniero, L., doc. RNDr. Daniel Průša, Ph.D.,
Publikace: International Conference on Developments in Language Theory. Cham: Springer International Publishing, 2018. p. 366-378. Lecture Notes in Computer Science. vol. 11088. ISSN 0302-9743. ISBN 978-3-319-98653-1.
Rok: 2018

DOI: 10.1007/978-3-319-98654-8_30
Odkaz: https://doi.org/10.1007/978-3-319-98654-8_30
Pracoviště: Strojové učení
Anotace:
It is well-known that one-tape Turing machines working in linear time are no more powerful than finite automata, namely they recognize exactly the class of regular languages. We study the costs, in terms of description sizes, of the conversion of nondeterministic finite automata into equivalent linear-time one-tape deterministic machines. We prove a polynomial blowup from two-way nondeterministic finite automata into equivalent weight-reducing one-tape deterministic machines that work in linear time. The blowup remains polynomial if the tape in the resulting machines is restricted to the portion which initially contains the input. However, in this case the machines resulting from our construction are not weight reducing, unless the input alphabet is unary.

Complexity of Matching Sets of Two-Dimensional Patterns by Two-Dimensional On-Line Tessellation Automaton

Autoři: doc. RNDr. Daniel Průša, Ph.D.,
Publikace: International Journal of Foundations of Computer Science. 2017, 28(5), 623-640. ISSN 0129-0541.
Rok: 2017

DOI: 10.1142/S012905411740010X
Odkaz: https://doi.org/10.1142/S012905411740010X
Pracoviště: Strojové učení
Anotace:
We study the two-dimensional pattern matching implemented using the deterministic two-dimensional on-line tessellation automaton. This restricted two-dimensional cellular automaton is able to simulate the Baker-Bird algorithm, which was proposed as the first algorithm for the two-dimensional pattern matching. We explore capabilities of this automaton to carry out the matching task against an arbitrary set of equal-sized patterns. To measure amount of resources needed to accomplish it, we introduce the pattern complexity of a picture language. We show that this complexity ranges from a constant to exponential one. All of these are illustrated by giving examples of two-dimensional on-line tessellation automata matching sets of patterns, describing general techniques of how to construct them and proving lower bounds on the pattern complexity of some picture languages as well as operations over them.

LP Relaxation of the Potts Labeling Problem Is as Hard as Any Linear Program

Autoři: doc. RNDr. Daniel Průša, Ph.D., doc. Ing. Tomáš Werner, Ph.D.,
Publikace: IEEE Transactions on Pattern Analysis and Machine Intelligence. 2017, 39(7), 1469-1475. ISSN 0162-8828.
Rok: 2017

DOI: 10.1109/TPAMI.2016.2582165
Odkaz: https://doi.org/10.1109/TPAMI.2016.2582165
Pracoviště: Strojové učení
Anotace:
In our recent work, we showed that solving the LP relaxation of the pairwise min-sum labeling problem (also known as MAP inference in graphical models or discrete energy minimization) is not much easier than solving any linear program. Precisely, the general linear program reduces in linear time (assuming the Turing model of computation) to the LP relaxation of the min-sum labeling problem. The reduction is possible, though in quadratic time, even to the min-sum labeling problem with planar structure. Here we prove similar results for the pairwise min-sum labeling problem with attractive Potts interactions (also known as the uniform metric labeling problem).

LP Relaxations of Some NP-Hard Problems Are as Hard as Any LP

Autoři: doc. RNDr. Daniel Průša, Ph.D., doc. Ing. Tomáš Werner, Ph.D.,
Publikace: Proceedings of the Twenty-Eighth Annual ACM-SIAM Symposium on Discrete Algorithms. Philadelphia, PA: Society for Industrial and Applied Mathematics, 2017. p. 1372-1382. ISBN 978-1-61197-478-2.
Rok: 2017

DOI: 10.1137/1.9781611974782.89
Odkaz: https://doi.org/10.1137/1.9781611974782.89
Pracoviště: Strojové učení
Anotace:
We show that solving linear programming (LP) relaxations of many classical NP-hard combinatorial optimization problems is as hard as solving the general LP problem. Precisely, the general LP can be reduced in linear time to the LP relaxation of each of these problems. This result poses a fundamental limitation for designing efficient algorithms to solve the LP relaxations, because finding such an algorithm might improve the complexity of best known algorithms for the general LP. Besides linear-time reductions, we show that the LP relaxations of the considered problems are P-complete under log-space reduction, therefore also hard to parallelize.

Recognizing Off-Line Flowcharts by Reconstructing Strokes and Using On-Line Recognition Techniques

Autoři: Bresler, M., doc. RNDr. Daniel Průša, Ph.D., Hlaváč, V.
Publikace: Proceedings of the 15th International Conference on Frontiers in Handwriting Recognition. Los Alamitos: IEEE Computer Society, 2017. p. 48-53. ISSN 2167-6445. ISBN 978-1-5090-0981-7.
Rok: 2017

DOI: 10.1109/ICFHR.2016.0022
Odkaz: https://doi.org/10.1109/ICFHR.2016.0022
Pracoviště: Strojové učení
Anotace:
We experiment with off-line recognition of handwritten flowcharts based on strokes reconstruction and our state-of-the-art on-line diagram recognizer. A simple baseline algorithm for strokes reconstruction is presented and necessary modifications of the original recognizer are identified. We achieve very promising results on a flowcharts database created as an extension of our previously published on-line database.

Template-Based Pattern Matching in Two-Dimensional Arrays

Autoři: Han, Y. S., doc. RNDr. Daniel Průša, Ph.D.,
Publikace: Combinatorial Image Analysis. Cham: Springer, 2017. p. 79-92. Lecture Notes in Computer Science. vol. 10256. ISSN 0302-9743. ISBN 978-3-319-59107-0.
Rok: 2017

DOI: 10.1007/978-3-319-59108-7_7
Odkaz: https://doi.org/10.1007/978-3-319-59108-7_7
Pracoviště: Strojové učení
Anotace:
We propose a framework for pattern matching in twodimensional arrays of symbols where the patterns are described by an extended version of the regular matrix grammar and the size of desired matches is prescribed. We then demonstrate how to reformulate the 2D pattern matching as the one-dimensional pattern matching (string pattern matching), and study the efficiency of the string pattern matching algorithm based on pattern complexity with respect to two finite automaton models: (1) the classical finite automaton and (2) the finite automaton equipped with two scanning heads placed in a fixed distance. We also identify several subclasses of the considered templates for which the framework yields a more efficient matching than the naive algorithm.

Undecidability of the emptiness problem for context-free picture languages

Autoři: doc. RNDr. Daniel Průša, Ph.D., Reinhardt, K.
Publikace: Theoretical Computer Science. 2017, 679 118-125. ISSN 0304-3975.
Rok: 2017

DOI: 10.1016/j.tcs.2016.03.025
Odkaz: https://doi.org/10.1016/j.tcs.2016.03.025
Pracoviště: Strojové učení
Anotace:
A two-dimensional Kolam grammar as defined by Siromoney et al. in 1972 and independently by Matz in 1997 and Schlesinger in 1989 allows context-free productions of the form A→a, A→BC, A→B/C, and S→λ which concatenate sub-pictures produced by B and C horizontally respectively vertically if their height respectively width fits. We demonstrate that this grammar is quite powerful by proving undecidability of the emptiness problem. We further analyze consequences of this finding and give additional characteristics related to size of generated pictures.

Complexity of sets of two-dimensional patterns

Autoři: doc. RNDr. Daniel Průša, Ph.D.,
Publikace: CIAA 2016: Proceedings of the 21st International Conference on Implementation and Application of Automata. Heidelberg: Springer-Verlag, GmbH, 2016. pp. 236-247. Lecture Notes in Computer Science. ISSN 0302-9743. ISBN 978-3-319-40945-0.
Rok: 2016

DOI: 10.1007/978-3-319-40946-7_20
Odkaz: https://doi.org/10.1007/978-3-319-40946-7_20
Pracoviště: Strojové učení
Anotace:
We study the two-dimensional pattern matching implemented using the two-dimensional on-line tessellation automaton, which is a restricted type of the cellular automaton able to simulate the Baker-Bird algorithm, proposed as the first algorithm for the two-dimensional pattern matching. We further explore capabilities of this automaton to carry out the matching task against an arbitrary set of equally sized patterns. To measure amount of resources needed to accomplish it, we introduce the pattern complexity of a picture language. We show that this complexity spreads in a wide range. It is demonstrated by giving examples, deriving general techniques and proving some lower bounds.

Non-recursive trade-offs between two-dimensional automata and grammars

Autoři: doc. RNDr. Daniel Průša, Ph.D.,
Publikace: Theoretical Computer Science. 2016, 610 121-132. ISSN 0304-3975.
Rok: 2016

DOI: 10.1016/j.tcs.2015.05.033
Odkaz: https://doi.org/10.1016/j.tcs.2015.05.033
Pracoviště: Strojové učení
Anotace:
We study succinctness of descriptional systems for picture languages. Basic models of two-dimensional finite automata and generalizations of context-free grammars are considered. They include the four-way automaton of Blum and Hewitt, the two-dimensional online tessellation automaton of Inoue and Nakamura and the context-free Kolam grammar of Siromoney et al. We show that non-recursive trade-offs between the systems are very common. Basically, each separation result proving that one system describes a picture language which cannot be described by another system can usually be turned into a non-recursive trade-off result between the systems. These findings are strongly based on the ability of the systems to simulate Turing machines.

Online recognition of sketched arrow-connected diagrams

Autoři: Bresler, M., doc. RNDr. Daniel Průša, Ph.D., Hlaváč, V.
Publikace: International Journal on Document Analysis and Recognition. 2016, 19(3), 253-267. ISSN 1433-2833.
Rok: 2016

DOI: 10.1007/s10032-016-0269-z
Odkaz: https://doi.org/10.1007/s10032-016-0269-z
Pracoviště: Strojové učení
Anotace:
We introduce a new, online, stroke-based recognition system for hand-drawn diagrams which belong to a group of documents with an explicit structure obvious to humans but only loosely defined from the machine point of view. We propose a model for recognition by selection of symbol candidates, based on evaluation of relations between candidates using a set of predicates. It is suitable for simpler structures where the relations are explicitly given by symbols, arrows in the case of diagrams. Knowledge of a specific diagram domain is used—the two domains are flowcharts and finite automata. Although the individual pipeline steps are tailored for these, the system can readily be adapted for other domains. Our entire diagram recognition pipeline is outlined. Its core parts are text/non-text separation, symbol segmentation, their classification and structural analysis. Individual parts have been published by the authors previously and so are described briefly and referenced. Thorough evaluation on benchmark databases shows the accuracy of the system reaches the state of the art and is ready for practical use. The paper brings several contributions: (a) the entire system and its state-of-the-art performance; (b) the methodology exploring document structure when it is loosely defined; (c) the thorough experimental evaluation; (d) the new annotated database for online sketched flowcharts and finite automata diagrams.

Some Classes of Rational Functions for Pictures

Autoři: Mraz, F., Otto, F., doc. RNDr. Daniel Průša, Ph.D.,
Publikace: RAIRO - Theoretical Informatics and Applications. 2016, 50(4), 351-369. ISSN 0988-3754.
Rok: 2016

DOI: 10.1051/ita/2016025
Odkaz: https://doi.org/10.1051/ita/2016025
Pracoviště: Strojové učení
Anotace:
With the aid of homogeneous morphisms, we turn the deterministic two-dimensional twoway ordered restarting automaton and its extended variant into devices that compute transductions of pictures, and we study the resulting classes of transductions in detail.

(Un)decidability of the Emptiness Problem for Multi-dimensional Context-free Grammars

Autoři: doc. RNDr. Daniel Průša, Ph.D.,
Publikace: CIAA 2015: Proceedings of the 20th International Conference on Implementation and Application of Automata. Cham: Springer International Publishing AG, 2015, pp. 250-262. Lecture Notes in Computer Science. ISSN 0302-9743. ISBN 978-3-319-22359-9. Available from: http://dx.doi.org/10.1007/978-3-319-22360-5_21
Rok: 2015

DOI: 10.1007/978-3-319-22360-5_21
Odkaz: https://doi.org/10.1007/978-3-319-22360-5_21
Pracoviště: Katedra kybernetiky
Anotace:
We study how dimensionality and form of context-free productions affect the power of multi-dimensional context-free grammars over unary alphabets. Attention is paid to the emptiness decision problem. It is an open question whether or not it is decidable for two-dimensional Kolam type context-free grammars of Siromoney. We show that the undecidability can be proved in the three-dimensional setting. For the two-dimensional variant, we present several results revealing that the process of generating is still much more complex than that one of the classical one-dimensional context-free grammar.

Detection of Arrows in On-line Sketched Diagrams using Relative Stroke Positioning

Autoři: Bresler, M., doc. RNDr. Daniel Průša, Ph.D., Hlaváč, V.
Publikace: WACV 2015: IEEE Winter Conference on Applications of Computer Vision. Los Alamitos: IEEE Computer Society Press, 2015. p. 610-617. ISBN 978-1-4799-6682-0.
Rok: 2015

DOI: 10.1109/WACV.2015.87
Odkaz: https://doi.org/10.1109/WACV.2015.87
Pracoviště: Katedra kybernetiky
Anotace:
This paper deals with recognition of arrows in online sketched diagrams. Arrows have varying appearance and thus it is a difficult task to recognize them directly. It is beneficial to detect arrows after other symbols (easier to detect) are already found. We proposed [4] an arrow detector which searches for arrows as arbitrarily shaped connectors between already found symbols. The detection is done two steps: a) a search for a shaft of the arrow, b) a search for its head. The first step is relatively easy. However, it might be quite difficult to find the head reliably. This paper brings two contributions. The first contribution is a design of an arrow recognizer where the head is detected using relative strokes positioning. We embedded this recognizer into the diagram recognition pipeline proposed earlier [4] and increased the overall accuracy. The second contribution is an introduction of a new approach to evaluate the relative position of two given strokes with neural networks (LSTM). This approach is an alternative to the fuzzy relative positioning proposed by Bouteruche et al. [2]. We made a comparison between the two methods through experiments performed on two datasets for two different tasks. First, we used a benchmark database of hand-drawn finite automata to evaluate detection of arrows. Second, we used a database presented in the paper by Bouteruche et al. containing pairs of reference and argument strokes, where argument strokes are classified into 18 classes. Our method gave significantly better results for the first task and comparable results for the second task.

Graph-based Simplex Method for Pairwise Energy Minimization with Binary Variables

Autoři: doc. RNDr. Daniel Průša, Ph.D.,
Publikace: CVPR 2015: Proceedings of the 2015 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. New York: IEEE Computer Society Press, 2015. pp. 475-483. ISSN 1063-6919. ISBN 978-1-4673-6964-0.
Rok: 2015

DOI: 10.1109/CVPR.2015.7298645
Odkaz: https://doi.org/10.1109/CVPR.2015.7298645
Pracoviště: Katedra kybernetiky
Anotace:
We show how the simplex algorithm can be tailored to the linear programming relaxation of pairwise energy minimization with binary variables. A special structure formed by basic and nonbasic variables in each stage of the algorithm is identified and utilized to perform the whole iterative process combinatorially over the input energy minimization graph rather than algebraically over the simplex tableau. This leads to a new efficient solver. We demonstrate that for some computer vision instances it performs even better than methods reducing binary energy minimization to finding maximum flow in a network.

How Hard Is the LP Relaxation of the Potts Min-Sum Labeling Problem?

Autoři: doc. RNDr. Daniel Průša, Ph.D., doc. Ing. Tomáš Werner, Ph.D.,
Publikace: EMMCVPR 2015: Proceedings of the 10th International Conference on Energy Minimization Methods in Computer Vision and Pattern Recognition. Heidelberg: Springer, 2015. p. 57-70. Lecture Notes in Computer Science. ISSN 0302-9743. ISBN 978-3-319-14611-9.
Rok: 2015

DOI: 10.1007/978-3-319-14612-6_5
Odkaz: https://doi.org/10.1007/978-3-319-14612-6_5
Pracoviště: Katedra kybernetiky
Anotace:
An important subclass of the min-sum labeling problem (also known as discrete energy minimization or valued constraint satisfaction) is the pairwise min-sum problem with arbitrary unary costs and attractive Potts pairwise costs (also known as the uniform metric labeling problem). In analogy with our recent result, we show that solving the LP relaxation of the Potts min-sum problem is not significantly easier than that of the general min-sum problem and thus, in turn, the general linear program. This suggests that trying to find an efficient algorithm to solve the LP relaxation of the Potts min-sum problem has a fundamental limitation. Our constructions apply also to integral solutions, yielding novel reductions of the (non-relaxed) general min-sum problem to the Potts min-sum problem.

On a class of rational functions for pictures

Autoři: Mráz, F., Otto, F., doc. RNDr. Daniel Průša, Ph.D.,
Publikace: Proceedings of the 7th Workshop on Non-Classical Models of Automata and Applications (NCMA 2015). Wien: Österreichische Computer Gesellschaft, 2015. pp. 159-176. vol. 318. ISBN 978-3-903035-07-2.
Rok: 2015

Pracoviště: Strojové učení
Anotace:
With the aid of homogeneous morphisms,we extend the deterministic two-dimensional two-way ordered restarting automaton into a device that computes transductions of pictures, and we study the resulting class of transductions in detail.

Universality of the local marginal polytope

Autoři: doc. RNDr. Daniel Průša, Ph.D., doc. Ing. Tomáš Werner, Ph.D.,
Publikace: IEEE Transactions on Pattern Analysis and Machine Intelligence. 2015, 37(4), 898-904. ISSN 0162-8828.
Rok: 2015

DOI: 10.1109/TPAMI.2014.2353626
Odkaz: https://doi.org/10.1109/TPAMI.2014.2353626
Pracoviště: Katedra kybernetiky
Anotace:
We show that solving the LP relaxation of the min-sum labeling problem (also known as MAP inference problem in graphical models, discrete energy minimization, or valued constraint satisfaction) is not easier than solving any linear program. Precisely, every polytope is linear-time representable by a local marginal polytope and every LP can be reduced in linear time to a linear optimization (allowing infinite costs) over a local marginal polytope. The reduction can be done (though with a higher time complexity) even if the local marginal polytope is restricted to have a planar structure.

Using Agglomerative Clustering of Strokes to Perform Symbols Over-segmentation within a Diagram Recognition System

Autoři: Bresler, M., doc. RNDr. Daniel Průša, Ph.D., Hlaváč, V.
Publikace: CVWW 2015: Proceedings of the 20th Computer Vision Winter Workshop. Graz: Graz University of Technology, 2015, pp. 67-74. ISBN 978-3-85125-388-7.
Rok: 2015

Pracoviště: Katedra kybernetiky
Anotace:
Symbol segmentation is a critical part of handwriting recognition. Any mistake done in this step is propagating further through the recognition pipeline. It forces researchers to consider methods generating multiple hypotheses for symbol segmentation-over-segmentation. Simple approaches which takes all reasonable combinations of strokes are applied very often, because they allow to achieve high recall rates very easily. However, they generate too much hypotheses. It makes a recognizer considerably slow. This paper presents our experimentation with an alternative method based on a single linkage agglomerative clustering of strokes with trainable distance metric. We embed the method into the state-of-the-art recognizer for on-line sketched diagrams. We show that it results in a decrease in the number of generated hypotheses while still reaching high recall rates. A problem emerges, since the number of bad hypotheses is still significantly higher than the number of symbols and it leads to unbalanced training datasets. To deal with it, we propose to train symbol classifiers with synthesized artificial samples. We show that the combination of these two improvements make the recognizer significantly faster and very precise.

Combining Structural and Statistical Approach to Online Recognition of Handwritten Mathematical Formulas

Autoři: Stria, J., doc. RNDr. Daniel Průša, Ph.D., Hlaváč, V.
Publikace: CVWW2014: Proceedings of the 19th Computer Vision Winter Workshop. Praha: Czech Society for Cybernetics and Informatics, 2014, pp. 103-109. ISBN 978-80-260-5641-6.
Rok: 2014

Pracoviště: Katedra kybernetiky
Anotace:
This paper introduces a novel method for on-line recognition of handwritten mathematical formulas. The method is based on the combination of a structural analy sis with a statistical model specifying relations of individual symbols. A description o f all recognition phases is given, focusing mostly on the structural analysis stage. The recognition process following a bottom-up manner is driven by a spatial grammar, which expresses mathematical formulas unambiguously. As the grammar describes the relative pos itions only roughly, a statistical model learned from a database of annotated formulas i s employed to refine the description. Several experimental results are also reported.

Garment Perception and its Folding Using a Dual-arm Robot

Autoři: Stria, J., doc. RNDr. Daniel Průša, Ph.D., Hlaváč, V., Wagner, L., Petrík, V., Krsek, P., Smutný, V.
Publikace: Proc. IEEE/RSJ International Conference on Inteligent Robots and Systems, IROS 2014. Los Alamitos: IEEE Computer Society Press, 2014. p. 61-67. ISSN 2153-0858. ISBN 978-1-4799-6934-0.
Rok: 2014

DOI: 10.1109/IROS.2014.6942541
Odkaz: https://doi.org/10.1109/IROS.2014.6942541
Pracoviště: Katedra kybernetiky
Anotace:
The work addresses the problem of clothing perception and manipulation by a two armed industrial robot aiming at a real-time automated folding of a piece of garment spread out on a flat surface. A complete solution combining vision sensing, garment segmentation and understanding, planning of the manipulation and its real execution on a robot is proposed. A new polygonal model of a garment is introduced. Fitting the model into a segmented garment contour is used to detect garment landmark points. It is shown how folded variants of the unfolded model can be derived automatically. Universality and usefulness of the model is demonstrated by its favorable performance within the whole folding procedure which is applicable to a variety of garments categories (towel, pants, shirt, etc.) and evaluated experimentally using the two armed robot. The principal novelty with respect to the state of the art is in the new garment polygonal model and its manipulation planning algorithm which leads to the speed up by two orders of magnitude.

Non-recursive Trade-offs between Two-Dimensional Automata and Grammars

Autoři: doc. RNDr. Daniel Průša, Ph.D.,
Publikace: DCFS 2014: Proceedings of the 16th International Workshop on Descriptional Complexity of Formal Systems. Berlin: Springer, 2014. pp. 352-363. Lecture Notes in Computer Science. ISSN 0302-9743. ISBN 978-3-319-09703-9.
Rok: 2014

DOI: 10.1007/978-3-319-09704-6_31
Odkaz: https://doi.org/10.1007/978-3-319-09704-6_31
Pracoviště: Katedra kybernetiky
Anotace:
We study succinctness of descriptional systems for picture languages. Basic models of two-dimensional finite automata and generalizations of context-free grammars are considered. It is shown t hat non-recursive trade-offs between the systems are very common. The results are based on the ability of th e systems to simulate Turing machines.

Polygonal Models for Clothing

Autoři: Stria, J., doc. RNDr. Daniel Průša, Ph.D., Hlaváč, V.
Publikace: Advances in Autonomous Robotics Systems, 15th Annual Conference, TAROS 2014. New York: Springer, 2014. p. 173-184. Lecture Notes in Artificial Intelligence. ISSN 0302-9743. ISBN 978-3-319-10400-3.
Rok: 2014

DOI: 10.1007/978-3-319-10401-0_16
Odkaz: https://doi.org/10.1007/978-3-319-10401-0_16
Pracoviště: Katedra kybernetiky
Anotace:
We address the problem of recognizing a configuration of a piece of garment fairly spread out on a flat surface. We suppose that the background surface is invariant and that its color is sufficiently dissimilar from the color of a piece of garment. This assumption enables quite reliable segmentation followed by extraction of the garment contour. The contour is approximated by a polygon which is then fitted to a polygonal garment model. The model is specific for each category of garment (e.g. towel, pants, shirt) and its parameters are learned from training data. The fitting procedure is based on minimization of the energy function expressing dissimilarities between observed and expected data. The fitted model provides reliable estimation of garment landmark points which can be utilized for an automated folding using a pair of robotic arms. The proposed method was experimentally verified on a dataset of images. It was also deployed to a robot and tested in a real-time automated folding.

Recognition System for On-line Sketched Diagrams

Autoři: Bresler, M., Van Phan, T., doc. RNDr. Daniel Průša, Ph.D., Nakagawa, M., Hlaváč, V.
Publikace: ICFHR 2014: Proceedings of the 14th International Conference on Frontiers in Handwriting Recognition. Los Alamitos: IEEE Computer Society Press, 2014. p. 563-568. ISSN 2167-6445. ISBN 978-1-4799-4334-0.
Rok: 2014

DOI: 10.1109/ICFHR.2014.100
Odkaz: https://doi.org/10.1109/ICFHR.2014.100
Pracoviště: Katedra kybernetiky
Anotace:
We present our recent model of a diagram recognition engine. It extends our previous work which approaches the structural recognition as an optimization problem of choosing the best subset of symbol candidates. The main improvement is the integration of our own text separator into the pipeline to deal with text blocks occurring in diagrams. Second improvement is splitting the symbol candidates detection into two stages: uniform symbols detection and arrows detection. Text recognition is left for postprocessing when the diagram structure is already known. Training and testing of the engine was done on a freely available benchmark database of flowcharts. We correctly segmented and recognized 93.0% of the symbols having 55.1% of the diagrams recognized without any error. Considering correct stroke labeling, we achieved the precision of 95.7%. This result is superior to the state-of-the-art method with the precision of 92.4 %. Additionally, we demonstrate the generality of the proposed method by adapting the system to finite automata domain and evaluating it on own database of such diagrams.

The Power of LP Relaxation for MAP Inference

Autoři: Živný, S., doc. Ing. Tomáš Werner, Ph.D., doc. RNDr. Daniel Průša, Ph.D.,
Publikace: Advanced Structured Prediction. Boston: MIT PRESS, 2014. p. 19-42. ISBN 978-0-262-02837-0.
Rok: 2014

Pracoviště: Katedra kybernetiky
Anotace:
Minimization of a partially separable function of many discrete variables is ubiquitous in machine learning and computer vision, in tasks like maximum a posteriori (MAP) inference in graphical models, or structured prediction. Among successful approaches to this problem is linear programming (LP) relaxation. We discuss this LP relaxation from two aspects. First, we review recent results which characterize languages (classes of functions permitted to form the objective function) for which the problem is solved by the relaxation exactly. Second, we show that solving the LP relaxation is not easier than solving any linear program, which makes a discovery of an efficient algorithm for the LP relaxation unlikely.

Two-dimensional Sgraffito automata

Autoři: doc. RNDr. Daniel Průša, Ph.D., Mráz, F., Otto, F.
Publikace: RAIRO - Theoretical Informatics and Applications. 2014, 48(5), 505-539. ISSN 0988-3754.
Rok: 2014

DOI: 10.1051/ita/2014023
Odkaz: https://doi.org/10.1051/ita/2014023
Pracoviště: Katedra kybernetiky
Anotace:
We present a new model of a two-dimensional computing device called Sgraffito automaton. In general, the model is quite simple, which allows a clear design of computations. When restricted to one-dimensional inputs, that is, strings, the Sgraffito automaton does not exceed the power of finite-state automata. On the other hand, for two-dimensional inputs, it yields a family of picture languages with good closure properties that strictly includes the class REC of recognizable picture languages. The deterministic Sgraffito automata define a class of picture languages that includes the class of deterministic recognizable picture languages DREC, the class of picture languages that are accepted by four-way alternating automata, those that are accepted by deterministic one-marker automata, and the sudoku-deterministically recognizable picture languages, but the membership problem for the accepted languages is still decidable in polynomial time. In addition, the deterministic Sgraffito automata accept some unary picture languages that are outside of the class REC.

Weight-Reducing Hennie Machines and Their Descriptional Complexity

Autoři: doc. RNDr. Daniel Průša, Ph.D.,
Publikace: LATA 2014: Proceedings of the 8th International Conference on Language an d Automata Theory and Applications. Berlin: Springer, 2014. p. 553-564. Lecture Notes in Computer Science. ISSN 0302-9743. ISBN 978-3-319-04920-5.
Rok: 2014

DOI: 10.1007/978-3-319-04921-2_45
Odkaz: https://doi.org/10.1007/978-3-319-04921-2_45
Pracoviště: Katedra kybernetiky
Anotace:
We present a constructive variant of the Hennie machine. It is demonstrated how it can facilitate the design of finite-state machines. We focus on the d eterministic version of the model and study its descriptional complexity. The model's suc cinctness is compared with common devices that include the nondeterministic finite automa ton, two-way finite automaton and pebble automaton.

Comparing Two-Dimensional One-Marker Automata to Sgraffito Automata

Autoři: doc. RNDr. Daniel Průša, Ph.D., Mráz, F., Otto, F.
Publikace: CIAA 2013: Proceedings of the 18th International Conference on Implementation and Application of Automata. Heidelberg: Springer, 2013, pp. 268-279. Lecture Notes in Computer Science. ISSN 0302-9743. ISBN 978-3-642-39273-3. Available from: http://dx.doi.org/10.1007/978-3-642-39274-0_24
Rok: 2013

DOI: 10.1007/978-3-642-39274-0_24
Odkaz: https://doi.org/10.1007/978-3-642-39274-0_24
Pracoviště: Katedra kybernetiky
Anotace:
We compare two types of automata for accepting picture languages to each other: the two-dimensional one-marker automaton and the sgraffito automaton. On the one hand, it is shown that deterministic sgraffito automata are strictly more powerful than deterministic two-dimensional one-marker automata. On the other hand, nondeterministic two-dimensional one-marker automata accept some picture languages that cannot be accepted by sgraffito automata. However, if nondeterministic two-dimensional one-marker automata were to accept all picture languages that are accepted by (deterministic) sgraffito automata, then the complexity classes NL (nondeterministic logarithmic space) and P (deterministic polynomial time) would coincide. Accordingly, it is likely that the classes of picture languages accepted by these two types of nondeterministic automata are incomparable under inclusion.

Modeling Flowchart Structure Recognition as a Max-Sum Problem

Autoři: Bresler, M., doc. RNDr. Daniel Průša, Ph.D., Hlaváč, V.
Publikace: ICDAR 2013: Proceedings of the 12th International Conference on Document Analysis and Recognition. Los Alamitos: IEEE Computer Society, 2013. p. 1215-1219. ISSN 1520-5363.
Rok: 2013

DOI: 10.1109/ICDAR.2013.246
Odkaz: https://doi.org/10.1109/ICDAR.2013.246
Pracoviště: Katedra kybernetiky
Anotace:
This work deals with the on-line recognition of hand-drawn graphical sketches with structure. We present a novel approach, in which the search for a suitable interpretation of the input is formulated as a combinatorial optimization task -- the max-sum problem. The recognition pipeline consists of two main stages. First, groups of strokes possibly representing symbols of a sketch (symbol candidates) are segmented and relations between them are detected. Second, a combination of symbol candidates best fitting the input is chosen by solving the optimization problem. We focused on flowchart recognition. Training and testing of our method was done on a freely available benchmark database. We correctly segmented and recognized 82.7 prec. of the symbols having 31.5 prec. of the diagrams recognized without any error. It indicates that our approach has promising potential and can compete with the state-of-the-art methods.

New Results on Deterministic Sgraffito Automata

Autoři: doc. RNDr. Daniel Průša, Ph.D., Mráz, F., Otto, F.
Publikace: DLT 2013: Proceedings of the 17th International Conference on Developments in Language Theory. Heidelberg: Springer, 2013, pp. 409-419. Lecture Notes in Computer Science. ISSN 0302-9743. ISBN 978-3-642-38770-8. Available from: http://dx.doi.org/10.1007/978-3-642-38771-5_36
Rok: 2013

DOI: 10.1007/978-3-642-38771-5_36
Odkaz: https://doi.org/10.1007/978-3-642-38771-5_36
Pracoviště: Katedra kybernetiky
Anotace:
The deterministic sgraffito automaton is a two-dimensional computing device that allows a clear and simple design of important computations. The family of picture languages it accepts has many nice closure properties, but when restricted to one-row inputs (that is, strings), this family collapses to the class of regular languages. Here we compare the deterministic sgraffito automaton to some other two-dimensional models: the two-dimensional deterministic forgetting automaton, the four-way alternating automaton and the sudoku-deterministically recognizable picture languages. In addition, we prove that deterministic sgraffito automata accept some unary picture languages that are outside the class REC of recognizable picture languages.

Restarting Tiling Automata

Autoři: doc. RNDr. Daniel Průša, Ph.D., Mráz, F.
Publikace: International Journal of Foundations of Computer Science. 2013, 24(06), 863-878. ISSN 0129-0541.
Rok: 2013

DOI: 10.1142/S0129054113400236
Odkaz: https://doi.org/10.1142/S0129054113400236
Pracoviště: Katedra kybernetiky
Anotace:
We present a new model of a two-dimensional computing device called restarting tiling automaton. The automaton defines a set of tile-rewriting, weight-reducing rules and a scanning strategy by which a tile to rewrite is being searched. We investigate properties of the induced families of picture languages. Special attention is paid to picture languages that can be accepted independently of the scanning strategy. We show that this family strictly includes REC and exhibits similar closure properties. Moreover, we prove that its intersection with the set of one-row languages coincides with the regular languages.

Simultaneous Segmentation and Recognition of Graphical Symbols using a Composite Descriptor

Autoři: Bresler, M., doc. RNDr. Daniel Průša, Ph.D., Hlaváč, V.
Publikace: CVWW 2013: Proceedings of the 18th Computer Vision Winter Workshop. Vienna: Vienna University of Technology, 2013. p. 16-23. ISBN 978-3-200-02943-9.
Rok: 2013

Pracoviště: Katedra kybernetiky
Anotace:
This work deals with recognition of hand-drawn graphical symbols in diagrams. We present two contributions. First, we designed a new composite descriptor expressing overall appearance of symbols. We achieved rather favorable accuracy in classification of segmented symbols on benchmark databases, which is 98.93 prec. for a database of flow charts, 98.33 prec. for a database of crisis management icons, and 92.94 perc. for a database of digits. Second, we used the descriptor in the task of simultaneous segmentation and recognition of graphical symbols. Our method creates symbol candidates by grouping spatially close strokes. Symbol candidates are classified by a multiclass SVM classifier learned on a dataset with negative examples. Thus, some portion of the candidates is filtered out. The joint segmentation and classification was tested on diagrams from the flowchart database. We were able to find 91.85 prec. of symbols while generating 8.8 times more symbol candidates than is the number of true symbols per diagram in average.

Universality of the Local Marginal Polytope

Autoři: doc. RNDr. Daniel Průša, Ph.D., doc. Ing. Tomáš Werner, Ph.D.,
Publikace: CVPR: 2013 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. Los Alamitos: IEEE Computer Society Press, 2013. p. 1738-1743. ISSN 1063-6919.
Rok: 2013

DOI: 10.1109/CVPR.2013.227
Odkaz: https://doi.org/10.1109/CVPR.2013.227
Pracoviště: Katedra kybernetiky
Anotace:
We show that solving the LP relaxation of the MAP inference problem in graphical models (also known as the min-sum problem, energy minimization, or weighted constraint satisfaction) is not easier than solving any LP. More precisely, any polytope is linear-time representable by a local marginal polytope and any LP can be reduced in linear time to a linear optimization (allowing infinite weights) over a local marginal polytope.

MfrDB: Database of Annotated On-line Mathematical Formulae

Autoři: Stria, J., Bresler, M., doc. RNDr. Daniel Průša, Ph.D., Hlaváč, V.
Publikace: ICFHR 2012: Proceedings of the 13th International Conference on Frontiers in Handwriting Recognition. Los Alamitos: IEEE Computer Society, 2012. p. 540-545. ISBN 978-0-7695-4774-9.
Rok: 2012

Pracoviště: Katedra kybernetiky
Anotace:
This paper announces a ground truthed database of on-line handwritten mathematical formulae. It have recently been collected in our group in connection with the research on methods for structural pattern recognition. Unlike the availability of handwritten characters or texts, collections of structural objects are rather scarce, thus we would like to provide them to the community. We also present the methodology and tools used for data acquisition. Finally, we report on our experiment with the automatic generation of additional samples. The process utilizes the dataset to extract statistical descriptions of symbols alignments and relative sizes.

Nao Robot Navigation Based on a Single VGA Camera

Autoři: Fojtů, Š., Bresler, M., doc. RNDr. Daniel Průša, Ph.D.,
Publikace: CVWW 2012: Proceedings of the 17th Computer Vision Winter Workshop. Ljubljana: Slovenian Pattern Recognition Society, 2012, pp. 121-128. ISBN 978-961-90901-6-9.
Rok: 2012

Pracoviště: Katedra kybernetiky
Anotace:
We introduce a localization and mapping approach suitable for a humanoid robot Nao. The proposed method is using data acquired from a single VGA camera, mounted on the head of the robot. It combines two different approaches together to create a detailed map and accurately locate the robot. Firstly, we present a Simultaneous Localization and Mapping technique (SLAM) based on a sparse 3D point cloud reconstruction. Secondly, we investigate the possibilities of enriching the created map by the detection of special markers. Moreover, we describe the options of using the special markers to directly navigate the robot. The main benefit of our method is in the fact we are using only basic equipment of the robot Nao (no extra devices are needed e.g. external cameras) and we do not need high number of special markers in the scene. We use them to specify the created map. Thus, the method will not fail when no marker is detected. The accuracy of our method is tested using a robotic simulation environment Webots and the robustness is verified on a real robot Nao. The results show that our approach makes it possible to safely navigate the robot Nao.

Restarting Tiling Automata

Autoři: doc. RNDr. Daniel Průša, Ph.D., Mráz, F.
Publikace: Implementation and Application of Automata. Heidelberg: Springer, 2012, pp. 289-300. Lecture Notes in Computer Science. ISBN 978-3-642-31605-0.
Rok: 2012

Pracoviště: Katedra kybernetiky
Anotace:
We present a new model of a two-dimensional computing device called restarting tiling automaton. The automaton defines a set of tile-rewriting, weight-reducing rules and a scanning strategy by which a tile to rewrite is being searched. We investigate properties of the induced families of picture languages. Special attention is paid to picture languages that can be accepted independently of the scanning strategy. We show that this family strictly includes REC and exhibits similar closure properties. Moreover, we prove that its intersection with the set of one-row languages coincides with the regular languages.',NULL); keyword('two-dimensional languages

Two-dimensional sgraffito automata

Autoři: doc. RNDr. Daniel Průša, Ph.D., Mráz, F.
Publikace: Proceedings of the 16th international conference on Developments in Language Theory. Heidelberg: Springer, 2012. p. 251-262. Lecture Notes in Computer Science. ISSN 0302-9743. ISBN 978-3-642-31652-4.
Rok: 2012

DOI: 10.1007/978-3-642-31653-1_23
Odkaz: https://doi.org/10.1007/978-3-642-31653-1_23
Pracoviště: Katedra kybernetiky
Anotace:
We present a new model of a two-dimensional computing device called sgraffito automaton and demonstrate its significance. In general, the model is simple, allows a clear design of important computations and defines families exhibiting good properties. It does not exceed the power of finite-state automata when working over one-dimensional inputs. On the other hand, it induces a family of picture languages that strictly includes REC and the deterministic variant recognizes languages in DREC as well as those accepted by four-way automata.

Hand-drawn Objects with Structure as Means of Communication with Machines

Autoři: doc. RNDr. Daniel Průša, Ph.D., Hlaváč, V.
Publikace: Beyond AI 2011: Proceedings of Interdisciplinary Aspects of Artificial Intelligence. Plzeň: Západočeská univerzita, 2011, pp. 97-102.
Rok: 2011

Pracoviště: Katedra kybernetiky
Anotace:
We study hand-drawn objects with structure as a way of human-machine communication. The objects include various schemas like building plans, electronic circuits, filled forms or notations like mathematical or chemical formulae. They usually carry a quite complex information in a form which is intelligible to the addressed side, thus we consider them as important means of information representation and transmission. We identify the main principles relating to the structure recognition as it is performed by humans and we outline how the principles can be simulated by machines.

Web Application for Recognition of Mathematical Formulas

Autoři: Stria, J., doc. RNDr. Daniel Průša, Ph.D.,
Publikace: ITAT 2011: Proceedings of the Conference on Theory and Practice of Information Technologies. Tilburg: CEUR Workshop Proceedings, 2011, pp. 47-54.
Rok: 2011

Pracoviště: Katedra kybernetiky
Anotace:
We present a system for on-line mathematical formulas recognition. The main principles of the method we applied for recognition are explained. It is based on the structural construction paradigm and utilizes a sort of two-dimensional grammars called coordinate grammars. Grammar productions are used to model spatial relationships among mathematical symbols. The system itself has been developed as a web application in HTML5. We give details on its client-server architecture and user interface. We also discuss advantages of the chosen approach.

Mathematical Formulae Recognition

Autoři: doc. RNDr. Daniel Průša, Ph.D., Hlaváč, V.
Publikace: DML 2008: Towards digital mathematics library. Brno: Masarykova univerzita, 2008. pp. 69-73. ISBN 978-80-210-4658-0.
Rok: 2008

Pracoviště: Katedra kybernetiky
Anotace:
We present a summary of our work in progress related to mathematical formulae recognition. Our approach is based on the structural construction paradigm and two-dimensional grammars. It is a general framework and can be successfully used in the analysis of images containing objects exhibiting rich structural relations. In contrast to most of all other known approaches, the method does not treat symbols segmentation and structural analysis as two separate processes. This allows the system to solve arising ambiguities more reliably. We have already implemented pilot studies for the off-line as well as on-line mathematical formulae recognition showing that the proposed method can be effectively implemented and practically used.

Structural Construction for On-Line Mathematical Formulae Recognition

Autoři: doc. RNDr. Daniel Průša, Ph.D., Hlaváč, V.
Publikace: Proceedings of the 13th Iberoamerican Congress on Pattern Recognition. Berlin: Springer, 2008. pp. 317-324. ISSN 0302-9743. ISBN 978-3-540-85919-2.
Rok: 2008

Pracoviště: Katedra kybernetiky
Anotace:
We present a method for on-line mathematical formulae recognition based on the structural construction paradigm and two-dimensional grammars. In general, this approach can be successfully used in the analysis of images containing objects that exhibit rich structural relations. An important benefit of the structural construction is in not treating symbols segmentation and structural analysis as two separate processes which allows the system to perform segmentation in the context of the whole formula structure - this can help to solve arising ambiguities more reliably. We show that the proposed method can be effectively implemented and practically used.

2D Context-free Grammars: Mathematical Formulae Recognition

Autoři: doc. RNDr. Daniel Průša, Ph.D., Hlaváč, V.
Publikace: Proceedings of the Prague Stringology Conference '06. Praha: Vydavatelství ČVUT, 2006. pp. 77-89. ISBN 80-01-03533-6.
Rok: 2006

Pracoviště: Katedra kybernetiky
Anotace:
This contribution advocates that two-dimensional context-free grammars can be successfully used in the analysis of images containing objects that exhibit structural relations. The idea of structural construction is further developed. The approach can be made computationally efficient, practical and be able to cope with noise. We have developed and tested the method on a pilot study aiming at recognition of off-line mathematical formulae. The other novelty is not treating symbol segmentation in the image and structural analysis as two separate processes. This allows the system to recover from errors made in initial symbol segmentation.

doc. RNDr. Daniel Průša, Ph.D.

Všechny publikace

SCOD: From Heuristics to Theory

Consistent and Tractable Algorithm for Markov Network Learning

One-Side Pumping and Two-Side Pumping by Complete CF(¢, $)-grammars

Optimal Strategies for Reject Option Classifiers

Two-Dimensional Rank-Reducing Grammars and Their Complexity

Weight-reducing Turing machines

Converting nondeterministic two-way automata into small deterministic linear-time machines

On Pumping RP-automata Controlled by Complete LR(¢, $)-grammars

On separations of LR(0)-grammars by two types of pumping patterns

Two-dimensional pattern matching against local and regular-like picture languages

Complexity of Searching for 2 by 2 Submatrices in Boolean Matrices

Complexity of Two-Dimensional Rank-Reducing Grammars

Relative Interior Rule in Block-Coordinate Descent

Solvers for Mathematical Word Problems in Czech

A Simple Extension to Finite Tree Automata for Defining Sets of Labeled, Connected Graphs

On Discriminative Learning of Prediction Uncertainty

Solving LP Relaxations of Some NP-Hard Problems Is As Hard As Solving Any Linear Program

Two-Dimensional Pattern Matching Against Basic Picture Languages

Dynamics of the Independence Number and Automata Synchronization

Rank-Reducing Two-Dimensional Grammars for Document Layout Analysis

Two-Way Automata and One-Tape Machines: Read Only Versus Linear Time

Complexity of Matching Sets of Two-Dimensional Patterns by Two-Dimensional On-Line Tessellation Automaton

LP Relaxation of the Potts Labeling Problem Is as Hard as Any Linear Program

LP Relaxations of Some NP-Hard Problems Are as Hard as Any LP

Recognizing Off-Line Flowcharts by Reconstructing Strokes and Using On-Line Recognition Techniques

Template-Based Pattern Matching in Two-Dimensional Arrays

Undecidability of the emptiness problem for context-free picture languages

Complexity of sets of two-dimensional patterns

Non-recursive trade-offs between two-dimensional automata and grammars

Online recognition of sketched arrow-connected diagrams

Some Classes of Rational Functions for Pictures

(Un)decidability of the Emptiness Problem for Multi-dimensional Context-free Grammars

Detection of Arrows in On-line Sketched Diagrams using Relative Stroke Positioning

Graph-based Simplex Method for Pairwise Energy Minimization with Binary Variables

How Hard Is the LP Relaxation of the Potts Min-Sum Labeling Problem?

On a class of rational functions for pictures

Universality of the local marginal polytope

Using Agglomerative Clustering of Strokes to Perform Symbols Over-segmentation within a Diagram Recognition System

Combining Structural and Statistical Approach to Online Recognition of Handwritten Mathematical Formulas

Garment Perception and its Folding Using a Dual-arm Robot

Non-recursive Trade-offs between Two-Dimensional Automata and Grammars

Polygonal Models for Clothing

Recognition System for On-line Sketched Diagrams

The Power of LP Relaxation for MAP Inference

Two-dimensional Sgraffito automata

Weight-Reducing Hennie Machines and Their Descriptional Complexity

Comparing Two-Dimensional One-Marker Automata to Sgraffito Automata

Modeling Flowchart Structure Recognition as a Max-Sum Problem

New Results on Deterministic Sgraffito Automata

Restarting Tiling Automata

Simultaneous Segmentation and Recognition of Graphical Symbols using a Composite Descriptor

Universality of the Local Marginal Polytope

MfrDB: Database of Annotated On-line Mathematical Formulae

Nao Robot Navigation Based on a Single VGA Camera

Restarting Tiling Automata

Two-dimensional sgraffito automata

Hand-drawn Objects with Structure as Means of Communication with Machines

Web Application for Recognition of Mathematical Formulas

Mathematical Formulae Recognition

Structural Construction for On-Line Mathematical Formulae Recognition

2D Context-free Grammars: Mathematical Formulae Recognition

Mějte přehled