-- This file has been generated from package.yaml by hpack version 0.20.0.
--
-- see: https://github.com/sol/hpack
--
-- hash: 37959dbba392a83a40b4dee8a683dee3cea6356d178efa08473c4106121501ed
name: hdsk
version: 0.1.0.0
synopsis: Unified data science toolkit for Haskell and more
description: = Haskell Data Science Kit
    .
    ![Build status](https://travis-ci.org/wbadart/hdsk.svg?branch=master)
    .
    The Haskell Data Science Kit (/HDSK/) project is an attempt to create a
    well-documented, well-tested, and performant data science library implemented
    in the Haskell language.
    .
    Sources suggest that in spite of huge potential for performance gains over
    current de facto methods [1], adoption of Haskell in the data science
    community lags for a variety of reasons, the greatest of which seems to be the
    dearth [2] of easy-to-use data science libraries (indeed, searching for
    "/data science/" on GitHub yields __14__ Haskell-language repositories and
    __5,807__ Python-language repositories [3]). This project seeks to remedy
    that issue by presenting a /unified/ (though modular) library of data science
    utilities which support the entire life-cycle of a data science project.
    .
    __Disclaimer:__ At the time of writing, I am still a beginner in Haskell, and
    this project is as much about the above stated goal as it is about me learning
    and practicing Haskell itself and the software development ecosystem around it.
    So, I make no guarantees that I will give the optimal or most idiomatic
    solution for any given function (and where I don't, pull requests are
    gladly welcomed!).
    .
    1. https://izbicki.me/blog/hlearn-cross-validates-400x-faster-than-weka
    2. https://www.linkedin.com/pulse/haskell-data-science-good-bad-ugly-tom-hutchins
    3. https://github.com/search?q=data+science+language%3AHaskell&type=Repositories
    .
    .
    == Installation
    .
    To use HDSK within your __stack__ project, you must add this repository to the
    @extra-deps@ list in @stack.yaml@. NOTE: this step will change once HDSK is
    released on Hackage.
    .
    @
    extra-deps:
    - git: git@github.com:wbadart/hdsk.git
      commit: a52bed4216f607628e71594256dafd550ffe2d3e
    @
    .
    The commit hash listed above is the most recent commit at the time of this writing.
    Be sure that the value you use is recent enough to contain the features you
    need.
    .
    The cabal file generated by stack (via hpack) has been checked in, so if you
    aren't using stack and are using only cabal, the library can be installed from
    a fresh clone of the repository.
    .
    .
    == Usage
    .
    Please see [willbadart.com/hdsk](https://willbadart.com/hdsk) for library documentation.
    Further project info, such as planned features, is made available on the
    wiki: https://github.com/wbadart/hdsk/wiki.
    .
    .
    == License
    .
    You'll notice a key theme in this document has been __promoting adoption.__ As
    such, I'm developing and eventually releasing this project under the
    BSD-3-Clause [1] license, due to its general permissiveness. This is also
    one of the more popular licenses among the Haskell community [2].
    .
    Please see [LICENSE](./LICENSE) for the full text.
    .
    1. https://opensource.org/licenses/BSD-3-Clause
    2. https://wiki.haskell.org/How_to_write_a_Haskell_program
category: Math
homepage: https://github.com/wbadart/hdsk#readme
bug-reports: https://github.com/wbadart/hdsk/issues
author: Will Badart
maintainer: wbadart (at) pm (dot) me
copyright: 2018 Will Badart
license: BSD3
license-file: LICENSE
build-type: Simple
cabal-version: >= 1.10
extra-source-files:
    CHANGELOG.md
    README.md

source-repository head
  type: git
  location: https://github.com/wbadart/hdsk

library
  hs-source-dirs:
      src
  ghc-options: -Wall
  build-depends:
      base >=4.7 && <5
    , containers <0.6
    , hashable >=1.2 && <2
    , hashtables >=1.2
    , heap >=1.0.4 && <2
    , matrix >=0.3
    , vector ==0.12.*
  exposed-modules:
      Hdsk.Bins
      Hdsk.Cluster
      Hdsk.DecisionTree
      Hdsk.DecisionTree.Information
      Hdsk.Description
      Hdsk.Metrics
      Hdsk.NaiveBayes
      Hdsk.NearestNeighbors
      Hdsk.Numerical
      Hdsk.Util
  other-modules:
      Paths_hdsk
  default-language: Haskell2010

test-suite spec
  type: exitcode-stdio-1.0
  main-is: Spec.hs
  hs-source-dirs:
      test
  ghc-options: -threaded -rtsopts -with-rtsopts=-N -Wall -Wno-type-defaults
  build-depends:
      QuickCheck ==2.11.*
    , base >=4.7 && <5
    , containers <0.6
    , hashable >=1.2 && <2
    , hashtables >=1.2
    , hdsk
    , heap >=1.0.4 && <2
    , hspec ==2.*
    , matrix >=0.3
    , vector ==0.12.*
  other-modules:
      Hdsk.BinsSpec
      Hdsk.ClusterSpec
      Hdsk.DecisionTree.InformationSpec
      Hdsk.DecisionTreeSpec
      Hdsk.DescriptionSpec
      Hdsk.MetricsSpec
      Hdsk.NaiveBayesSpec
      Hdsk.NearestNeighborsSpec
      Hdsk.NumericalSpec
      Hdsk.UtilSpec
      Test.Util
      Paths_hdsk
  default-language: Haskell2010

benchmark metrics
  type: exitcode-stdio-1.0
  main-is: benchmark/Metrics.hs
  build-depends:
      base >=4.7 && <5
    , containers <0.6
    , criterion >=1.4 && <2
    , hashable >=1.2 && <2
    , hashtables >=1.2
    , hdsk
    , heap >=1.0.4 && <2
    , matrix >=0.3
    , vector ==0.12.*
  other-modules:
      Paths_hdsk
  default-language: Haskell2010

benchmark uniq
  type: exitcode-stdio-1.0
  main-is: benchmark/Uniq.hs
  build-depends:
      base >=4.7 && <5
    , containers <0.6
    , criterion >=1.4 && <2
    , hashable >=1.2 && <2
    , hashtables >=1.2
    , hdsk
    , heap >=1.0.4 && <2
    , matrix >=0.3
    , vector ==0.12.*
  other-modules:
      Paths_hdsk
  default-language: Haskell2010