Skip to content

Latest commit

 

History

History
93 lines (61 loc) · 2.91 KB

README.md

File metadata and controls

93 lines (61 loc) · 2.91 KB

Maximum Cardinality Bipartite Matching

Synopsis

A somewhat efficient, purely functional (Haskell) algorithm to find maximum cardinality matchings in bipartite graphs (MCBM).

This project contains a library, command line tool, tests and benchmark.

Usage

Module Data.Graph.MaxBipartiteMatching exports the function

matching :: (Ord a, Ord b) => S.Set (a,b) -> M.Map b a

which calculates a maximum cardinality matching on the given bipartite graph. It directly works on Data.Set.Sets of pairs, and returns a backward mapping as Data.Map.Strict.Map.

> matching $ fromList [(1,'a'),(1,'b'),(2,'a'),(2,'c'),(3,'b')]
fromList [('a',1),('b',3),('c',2)]

The small command line tool matcher demonstrates the use of the matching library. See build instructions for more.

Performance & Testing

The implementation is quite compact with the core functions accounting for only 21 lines. The source file contains extensive information about the workings of the algorithm. There is no correctness proof, but a test suite is available.

$ sed -rn '/\{-/,/-\}/d; /^>/p' src/Data/Graph/MaxBipartiteMatching.lhs | wc -l
25

Despite its brevity it seems rather efficient.

There are very few other purely functional MCBM implementations around. AFAIK there is none in FGL (June 2016), but they have a MaxFlow algorithm which is a much more general approach of course. However, if you only need MCBM, then my implementation scales better than using FGL.

Update: Recently I came across an implementation by Nikita Danilenko: While my algorithm uses slightly less memory, CPU usage is more dispersed with larger graphs. For graphs in the region 50k < #V + #E < 150k, both implementations are on par, for smaller graphs mine is faster, for larger graphs Danilenko's implementation has a more reliable, and with increasing size better, runtime complexity.

cpu/size

memory/size

sample distribution

For better comparison, the same plots without showing th FGL results:

cpu/size

memory/size

Scripts to run the comparison are contained in this repository, except that I do not distribute Danilenko's code.

Bugs

See the BUGS/open subdirectory.

History

The implementation was originally announced on Mon, 22 Oct 2012 on the haskell-cafe at haskell.org mailing list. Since then I use this toy project to play with other tools, e.g., GitHub.