News:
Mar'26:Veronika's preprint: Why phylogenies compress so well: combinatorial guarantees under the Infinite Sites Model
Ján's and Ondřej's preprint: Pareto optimization of masked superstrings improves compression of pan-genome k-mer sets
Karel's interview for Radio Prague International
Feb'26:Tam presented at DSB'26: Phylogenetic ordering and batching of million-genome collections
Jan'26:Paper on prediction of genetic relatedness of E. coli using neighbor typing published in Antimicrobial Agents and Chemotherapy
Arya's preprint: Novel genes arise from genomic deletions across the bacterial tree of life
Karel's invited talk at Sapienza University of Rome: From bacteria to bits and back again: towards faster, more accurate, and smarter diagnostics of antibiotic resistance
Francesca defended her Ph.D. thesis: Assessment of genetic determinants in E. coli uropathogenic lifestyle and intracellular persistence via optimized k-mer matching of million-genome collections. Congratulations!
Veronika defended her MSc thesis: Mathematical modeling of phylogenetic compression. Congratulations!
Nov'25:Paper on FMSI for k-mer indexing using superstrings and the BWT published in Bioinformatics Advances
Francesca's preprint: Optimized k-mer search across millions of bacterial genomes on laptops
Tam presented at SeqBIM'25: Phylogenetic batching of million-genome collections for reduced storage and faster retrieval
Oct'25:Karel gave a keynote at the Czech-French Science Meetup 2025 at the Czech Embassy in Paris: From bacteria to bits and back again: faster, more accurate, and smarter diagnostics of antibiotic resistance
Loren started a permanent position at Limoges University Hospital. Congratulations!
Marie joined us to work on phylogenetically compressed sketches. Welcome!
Sep'25:Preprint: Neighbour Typing Using LR Sequencing Provides Rapid Prediction of ST and Antimicrobial Susceptibility of K. pneumoniae
Francesca presented at the National Congress of the Italian Society of Microbiology: Rapid search across million-genome bacterial collections on laptops
Aug'25:Paper on k-mer set operations via masked superstrings published in the Proceedings of PSC'25
Jul'25:Ondřej presented at ISMB/ECCB 2025 in Liverpool on indexing k-mer sets using masked superstrings
Apr'25:Paper on phylogenetic compression published in Nature Methods
Karel Břinda, Ph.D.

Karel Břinda, Ph.D.

Permanent Researcher (Inria Starting Faculty)

Inria, Irisa, Univ. Rennes

About

I hold a permanent PI position at Inria, the French National Institute for Research in Digital Science and Technology. I’m based at the Inria Center at Rennes University, where I’m part of the GenScale project team. I’m also associated with the Irisa Research Institute of Computer Science and Random Systems. Prior to joining Inria in 2022, I was a postdoctoral fellow and a research associate in biomedical informatics and epidemiology at Harvard Medical School and the Harvard T.H. Chan School of Public Health. I received my Ph.D. in computer science from the University of Paris-Est in 2016 and a master’s degree in mathematical computer science from the Czech Technical University in Prague in 2013.

Our research focuses on the intersection of computer science and biology, with applications in epidemiology and medicine. We develop new bioinformatics methods to analyze big genomic datasets, with the ultimate goals of enabling the rapid diagnosis of antibiotic resistance, monitoring of the spread of pathogens worldwide, and enabling instant searches across all publicly deposited DNA sequence data. Our research questions range from the theoretical aspects of computer science to the practical end-to-end protocols used in laboratories. In this context, we are particularly excited about the potential of emerging rapid and portable genomic technologies, such as nanopore sequencing or CRISPR tests.

Our work is made possible thanks to funding from Inria, the French National Research Agency (ANR), the University of Rennes, Campus France, and Rennes Métropole.

Download my CV .

People

Opportunities available at all levels: postdoc / PhD student / intern / research engineer. Don't hesitate to get in touch!

Portrait of Léo Ackermann

Léo Ackermann

PhD student, Inria, co-supervised with Pierre Peterlongo

Portrait of Tam Truong

Tam Truong

PhD student, Irisa, co-supervised with Pierre Peterlongo and Dominique Lavenier

Portrait of Marie Picard

Marie Picard

MSc student, École normale supérieure de Rennes

Portrait of Ján Plachý

Ján Plachý

MSc student, Charles University, co-supervised with Pavel Veselý

Alumni:

Postdocs

Loren Dejoies, postdoctoral researcher, Inria, 2023–2025

PhD Students

Francesca Brunetti, PhD student, Sapienza University of Rome, 2022–2026

Arya Kaul, visiting PhD student, Harvard Medical School, 2023–2024

Chateaubriand Fellowship

Bachelor's and Master's Students

Veronika Hendrychová, BSc student, Czech Technical University in Prague, 2022–2023; MSc student, Czech Technical University in Prague, 2023–2026

Stanislav Hanzel Prize, 2024

Rektorys Competition, 2024 – 2nd place

Amory Antao, MSc student, ENS Rennes, 2024–2025

Ondřej Sladký, BSc student, Charles University, 2022–2024

Dean's Prize – best Computer Science BSc thesis

SVOČ, 2024 – 1st place in the national TCS/ML category

Ulysse McConnell, summer intern, ETH Zurich, 2024

Léo Laffeach, MSc student, ENS Rennes, 2022–2023

Katya Milyutina, BSc student, Charles University, 2022–2023

Publications

Selected publications

  1. K. Břinda, L. Lima, S. Pignotti, N. Quinones-Olvera, K. Salikhov, R. Chikhi, G. Kucherov, Z. Iqbal, and M. Baym.

    Efficient and robust search of microbial genomes via phylogenetic compression.

    Nature Methods 22, 692–697, 2025.

  2. O. Sladký, P. Veselý, and K. Břinda.

    From superstring to indexing: a space-efficient index for unconstrained k-mer sets using the Masked Burrows-Wheeler Transform (MBWT).

    Bioinformatics Advances 6(1), vbaf290, 2026.

  3. K. Břinda, M. Baym, and G. Kucherov.

    Simplitigs as an efficient and scalable representation of de Bruijn graphs.

    Genome Biology 22:96, 2021.

  4. K. Břinda, A. Callendrello, K. C. Ma, D. R. MacFadden, T. Charalampous, R. S. Lee, L. Cowley, C. B. Wadsworth, Y. H. Grad, G. Kucherov, J. O’Grady, M. Baym, and W. P. Hanage.

    Rapid inference of antibiotic resistance and susceptibility by genomic neighbour typing.

    Nature Microbiology 5, 455–464, 2020.

  5. K. Břinda, M. Sykulski, and G. Kucherov.

    Spaced seeds improve k-mer-based metagenomic classification.

    Bioinformatics 31(22), 3584–3592, 2015.


Complete list

See the complete publications page or Google Scholar.

Methods

A. Phylogenetic compression

Framework implementing phylogenetic compression, a technique using evolutionary history to guide compression and efficiently search large collections of microbial genomes using existing algorithms and data structures. This improves the compression ratios of assemblies, de Bruijn graphs, and k-mer indexes by one to two orders of magnitude.

Associated tools:

  • Phylign – BLAST-like search across all pre-2019 bacteria on standard desktops and laptops.
  • MiniPhy – Phylogenetic compression of extremely large genome collections.

B. Metagenomic classification

  • ProPhyle - Accurate, resource-frugal, and deterministic metagenomic classification, based on k-mer propagation, simplitigs, and k-mer indexing using the Burrows-Wheeler Transform.

C. Genomic Neighbor Typing

A proof-of-concept framework for Genomic Neighbor Typing for within-minutes predictions of antibiotic resistance during nanopore sequencing. Pipeline, library, two species databases (S. pneumoniae and N. gonorrhoeae), and demonstrations of within-minutes diagnostic from isolates and metagenomes.


D. Simplitigs and masked superstrings

Novel generation of text representations for k-mer sets. Simplitigs provide compact representations of k-mer sets by covering de Bruijn graphs with vertex-disjoint paths that spell all original k-mers while drastically reducing sequence count and total length. Masked superstrings generalize this idea to overlap graphs, allowing arbitrary overlaps and using binary masks to eliminate unintended k-mers, which yields even shorter representations and unifies the handling of diverse k-mer sets. Building on these concepts, KmerCamel constructs near-optimal masked superstrings at large scale, and FMSI turns them into highly compressed BWT-based indexes that support fast membership queries and efficient set operations. ProphAsm and ProPhex provide the corresponding practical toolchain for generating these representations and using them in downstream k-mer indexing workflows.

  • FMSI - memory-efficient k-mer set index based on masked superstrings and the Burrows–Wheeler Transform.

  • KmerCamel - compressing k-mer sets using masked superstrings.

  • ProphAsm - rapid and memory-efficient computation of simplitigs (spectrum-preserving string sets) and set operations with k-mer sets.

  • ProPhex An efficient k-mer index based on the Burrows-Wheeler Transform.


E. Other tools

  • RNFtools – A generic format for naming simulated sequencing reads using arbitrary tools and the associated toolkit and pipeline for read simulation and read mapper evaluation.

  • SAMsift – Advanced filtering and tagging of SAM/BAM alignments using Python expressions.

  • Disty McMatrixFace – Tool for computing a distance matrix from a core genome alignment.

  • NanoSim-H – An easy-to-use simulator of nanopore reads, a fork of NanoSim.

For a full software list, see the GitHub page.

Teaching

École normale supérieure de Rennes

  • Experimental Bioinformatics; BOX (Fall 2023, 2024, and 2025; Lecturer)

    Master’s-level course (M1), focused on computational methods for genomics. Taught jointly with Léo Ackermann (before 2025 with Riccardo Vicedomini).

Harvard Medical School

  • Concepts in Genome Analysis; BMIF 201 (Fall 2019; TA)

    PhD-level course, focused on quantitative aspects of genetics and genomics, including computational and statistical methods of genomic analysis. Close collaboration with Profs. Shamil R. Sunyaev, Michael Baym, Cheng-Zhong Zhang, and Heng Li.

Czech Technical University in Prague

  • Assistive Technology; 01ASTE (Fall 2010–2012; Instructor)

    Master’s-level course.

  • Software Project; 01SWP1, 01SWP2 (Fall and spring semesters, 2010–2012; Supervisor)

    Master’s-level course.

Media coverage

2026/03Radio Prague International: Science without Borders
Interview about my academic career and research.
2025/09Sciences Ouest (n.431): Mieux caractériser l’antibiorésistance
Article about our research on antibiotic resistance.
2025/06Inria Emergence: Diagnostiquer plus vite la résistance aux antibiotiques
Article about our research on faster resistance diagnostics.
2020/06Massive Science: Rapid DNA sequencing of unknown bacteria helps doctors choose which antibiotics to treat it with
Article about how our rapid diagnostic method could help doctors prescribe antibiotics more effectively.
2020/02The Bioinformatics Chat: Spectrum-preserving string sets and simplitigs
Interview about our paper on simplitigs.
2020/02BBC World Service: Science in Action
Interview about our paper on rapid diagnostics of antibiotic resistance by Genomic Neighbor Typing. Listen to the segment starting at 8:10.

Contact