Registration Open for Bioc2024 July 24-26


kallisto | bustools R utilities

Bioconductor version: Release (3.19)

The kallisto | bustools pipeline is a fast and modular set of tools to convert single cell RNA-seq reads in fastq files into gene count or transcript compatibility counts (TCC) matrices for downstream analysis. Central to this pipeline is the barcode, UMI, and set (BUS) file format. This package serves the following purposes: First, this package allows users to manipulate BUS format files as data frames in R and then convert them into gene count or TCC matrices. Furthermore, since R and Rcpp code is easier to handle than pure C++ code, users are encouraged to tweak the source code of this package to experiment with new uses of BUS format and different ways to convert the BUS file into gene count matrix. Second, this package can conveniently generate files required to generate gene count matrices for spliced and unspliced transcripts for RNA velocity. Here biotypes can be filtered and scaffolds and haplotypes can be removed, and the filtered transcriptome can be extracted and written to disk. Third, this package implements utility functions to get transcripts and associated genes required to convert BUS files to gene count matrices, to write the transcript to gene information in the format required by bustools, and to read output of bustools into R as sparses matrices.

Author: Lambda Moses [aut, cre] , Lior Pachter [aut, ths]

Maintainer: Lambda Moses <dlu2 at>

Citation (from within R, enter citation("BUSpaRse")):


To install this package, start R (version "4.4") and enter:

if (!require("BiocManager", quietly = TRUE))


For older versions of R, please refer to the appropriate Bioconductor release.


To view documentation for the version of this package installed in your system, start R and enter:

Converting BUS format into sparse matrix HTML R Script
Transcript to gene HTML R Script
Reference Manual PDF


biocViews RNASeq, SingleCell, Software, WorkflowStep
Version 1.18.0
In Bioconductor since BioC 3.10 (R-3.6) (4.5 years)
License BSD_2_clause + file LICENSE
Depends R (>= 3.6)
Imports AnnotationDbi, AnnotationFilter, biomaRt, BiocGenerics, Biostrings, BSgenome, dplyr, ensembldb, GenomeInfoDb, GenomicFeatures, GenomicRanges, ggplot2, IRanges, magrittr, Matrix, methods, plyranges, Rcpp, S4Vectors, stats, stringr, tibble, tidyr, utils, zeallot
System Requirements GNU make
Bug Reports
See More
Suggests knitr, rmarkdown, testthat, BiocStyle, txdbmaker, TENxBUSData, TxDb.Hsapiens.UCSC.hg38.knownGene, BSgenome.Hsapiens.UCSC.hg38, EnsDb.Hsapiens.v86
Linking To Rcpp, RcppArmadillo, RcppProgress, BH
Depends On Me
Imports Me
Suggests Me
Links To Me
Build Report Build Report

Package Archives

Follow Installation instructions to use this package in your R session.

Source Package BUSpaRse_1.18.0.tar.gz
Windows Binary
macOS Binary (x86_64) BUSpaRse_1.18.0.tgz
macOS Binary (arm64) BUSpaRse_1.18.0.tgz
Source Repository git clone
Source Repository (Developer Access) git clone
Bioc Package Browser
Package Short Url
Package Downloads Report Download Stats