models tiiuae falcon 40b

github-actions[bot] edited this page Jul 18, 2023 · 24 revisions

tiiuae-falcon-40b

Overview

Description: Falcon-40B is a causal decoder-only large language model (LLM) with 40 billion parameters, developed by the Technology Innovation Institute (TII). It was trained on 1 trillion tokens from the RefinedWeb dataset, enhanced with curated corpora. Falcon-40B supports English, German, Spanish, and French, with limited capabilities in Italian, Portuguese, Polish, Dutch, Romanian, Czech, and Swedish. It is available under the Apache 2.0 license.

At the time of writing, Falcon-40B was considered the best open-source model available. Its architecture is optimized for inference, with features such as FlashAttention and multiquery attention. Training used 384 A100 40GB GPUs and took two months.

The model carries the biases and stereotypes commonly encountered online, so production use requires appropriate precautions. Fine-tuning the model for the specific use case and adding guardrails are recommended. Technical specifications, training details, and evaluation results are available in the original model card.

> The above summary was generated using ChatGPT. Review the original model card to understand the data used to train the model, evaluation metrics, license, intended uses, limitations and bias before using the model.

### Inference samples

Inference type|Python sample (Notebook)|CLI with YAML
|--|--|--|
Real time|text-generation-online-endpoint-dolly.ipynb|text-generation-online-endpoint-dolly.sh
Batch|text-generation-batch-endpoint.ipynb|coming soon

### Sample inputs and outputs (for real-time inference)

#### Sample input

```json
{
  "input_data": {
    "input_string": ["The meaning of the life is"]
  }
}
```

#### Sample output

```json
[
  {
    "0": "The meaning of the life is to find your gift. The purpose of life is to give it away"
  }
]
```
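The sample request and response above can be produced and consumed with a few lines of standard-library Python. This is a minimal sketch, not the official sample: the helper names `build_payload` and `extract_generations` are hypothetical, and the response is assumed to always be a JSON list of objects keyed by the stringified prompt index, as in the single sample shown. Sending the request to a deployed endpoint (URL, key, headers) is not covered on this page and is left out.

```python
import json


def build_payload(prompts):
    """Wrap prompts in the request shape shown in the sample input."""
    return {"input_data": {"input_string": list(prompts)}}


def extract_generations(response_body):
    """Return generated strings from a response shaped like the sample output.

    Assumption: the body is a JSON list of objects whose keys are
    stringified integer indices ("0", "1", ...).
    """
    rows = json.loads(response_body)
    texts = []
    for row in rows:
        # Sort keys numerically so outputs follow prompt order.
        for key in sorted(row, key=int):
            texts.append(row[key])
    return texts


# Build the request body for the prompt used in the sample input.
payload = build_payload(["The meaning of the life is"])
request_body = json.dumps(payload)

# Stand-in for an endpoint reply: the sample output from this page.
sample_response = json.dumps(
    [
        {
            "0": "The meaning of the life is to find your gift. "
            "The purpose of life is to give it away"
        }
    ]
)
generations = extract_generations(sample_response)

print(request_body)
print(generations[0])
```

Keeping payload construction and response parsing in separate functions makes it easy to swap in a real HTTP call between them once an endpoint is deployed.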

Version: 1

Tags

Featured

Preview

license : apache-2.0

task : text-generation

author : tiiuae

inference_compute_allow_list : ['Standard_NC24rs_v3', 'Standard_ND40rs_v2', 'Standard_ND96asr_v4']

View in Studio: https://ml.azure.com/registries/azureml/models/tiiuae-falcon-40b/version/1

License: apache-2.0

Properties

SHA: c47b371b31a68349c233104050ac76680b8485db

datasets: tiiuae/falcon-refinedweb

languages: en, de, es, fr

inference-min-sku-spec: 24|4|448|1344
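This page does not explain the four pipe-separated fields of `inference-min-sku-spec`. The sketch below assumes they are, in order, CPU cores, GPU count, memory in GB, and storage in GB. That ordering is an assumption: the first three values line up with the 24 vCPUs, 4 GPUs, and 448 GiB of memory of `Standard_NC24rs_v3`, but the field meanings are not confirmed by the source. `MinSkuSpec` and `parse_min_sku_spec` are hypothetical names.

```python
from typing import NamedTuple


class MinSkuSpec(NamedTuple):
    # Field meanings are assumed, not documented on this page.
    cpu_cores: int
    gpus: int
    memory_gb: int
    storage_gb: int


def parse_min_sku_spec(raw: str) -> MinSkuSpec:
    """Split a 'a|b|c|d' spec string into four integers."""
    parts = raw.strip().split("|")
    if len(parts) != 4:
        raise ValueError(f"expected 4 pipe-separated fields, got {len(parts)}")
    return MinSkuSpec(*(int(p) for p in parts))


spec = parse_min_sku_spec("24|4|448|1344")
print(spec)
```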

inference-recommended-sku: Standard_ND40rs_v2
