Skip to content

A Python library of biclustering algorithms, evaluation measures and datasets.

License

Notifications You must be signed in to change notification settings

padilha/biclustlib

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

biclustlib

biclustlib is a Python library of biclustering algorithms, evaluation measures and datasets distributed under the GPLv3 license.

This library is under constant update. We expect to review its code and release a first version soon.

Installation

First you need to ensure that all packages have been installed.

  • See requirements.txt
  • R >= 3.5;
  • biclust R package;
  • isa2 R package;
  • Other specific libraries may be required by third party implementations that are wrapped in this package;

If you miss something you can simply type:

  • pip install -r requirements.txt

If you have all dependencies installed:

  • python setup.py install

Citing us

If you use biclustlib in a scientific publication, we would appreciate citations of our paper where this library was first mentioned and used.

To cite biclustlib use: Padilha, V. A. & Campello, R. J. G. B. (2017). A systematic comparative evaluation of biclustering techniques. BMC Bioinformatics, 18(1):55.

For TeX/LaTeX:

@article{padilha2017,
  title={A systematic comparative evaluation of biclustering techniques},
  author={Padilha, Victor A and Campello, Ricardo J G B},
  journal={BMC Bioinformatics},
  volume={18},
  number={1},
  pages={55},
  year={2017},
  publisher={BioMed Central}
}

Available algorithms

Python implementations

Wrappers for publicly available software

Python packages

R packages

Other implementations

All the binaries are available with biclustlib and are compiled for the x86_64 architecture.

Available data collections

Available external evaluation measures

Example of use

import numpy as np

from biclustlib.algorithms import ChengChurchAlgorithm
from biclustlib.datasets import load_yeast_tavazoie

# load yeast data used in the original Cheng and Church's paper
data = load_yeast_tavazoie().values

# missing value imputation suggested by Cheng and Church
missing = np.where(data < 0.0)
data[missing] = np.random.randint(low=0, high=800, size=len(missing[0]))

# creating an instance of the ChengChurchAlgorithm class and running with the parameters of the original study
cca = ChengChurchAlgorithm(num_biclusters=100, msr_threshold=300.0, multiple_node_deletion_threshold=1.2)
biclustering = cca.run(data)
print(biclustering)

License (GPLv3)

biclustlib: A Python library of biclustering algorithms and evaluation measures.
Copyright (C) 2017  Victor Alexandre Padilha

This program is free software: you can redistribute it and/or modify
it under the terms of the GNU General Public License as published by
the Free Software Foundation, either version 3 of the License, or
(at your option) any later version.

This program is distributed in the hope that it will be useful,
but WITHOUT ANY WARRANTY; without even the implied warranty of
MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the
GNU General Public License for more details.

You should have received a copy of the GNU General Public License
along with this program.  If not, see <http://www.gnu.org/licenses/>.

About

A Python library of biclustering algorithms, evaluation measures and datasets.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 3

  •  
  •  
  •  

Languages