Falkor

A web service for turning HTML pages into traversable JSON documents

Very early stage development. If you have any feature requests just create an issue on the project

Getting started

Running the server locally

lein uberjar
docker build -t falkor .
docker run -t falkor

# Visit http://localhost:5000

Comming soon

Better error handling
CORS
Query filtering (return only certain attributes)
Fetching multiple elements in a single request ( e.g [h1 > a, .subtitle] )

Usage

Get all the title links from the Reddit.com home page

https://falkor-api.herokuapp.com/api/query?url=http://reddit.com&query=a.title

Grab all the news stories from Digg.com

https://falkor-api.herokuapp.com/api/query?url=http://digg.com&query=.story-title%20a

Extract all the images from Digg.com

https://falkor-api.herokuapp.com/api/query?url=http://digg.com&query=img[src]

TODO

Filters to remove some of the attribute cruft

For example if we just want to extract the text for an element and ignore the other attributes

&filter=[text]

License

Distributed under the Eclipse Public License either version 1.0 or (at your option) any later version.

Name		Name	Last commit message	Last commit date
Latest commit History 26 Commits
resources/public		resources/public
src/falkor		src/falkor
test/falkor		test/falkor
.gitignore		.gitignore
Dockerfile		Dockerfile
LICENSE		LICENSE
Procfile		Procfile
README.md		README.md
logs.txt		logs.txt
project.clj		project.clj
swagger.yml		swagger.yml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Falkor

Getting started

Comming soon

Usage

TODO

License

About

Releases

Packages

Languages

License

owainlewis/falkor

Folders and files

Latest commit

History

Repository files navigation

Falkor

Getting started

Comming soon

Usage

TODO

License

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages