Skip to content

models tiiuae falcon 7b

github-actions[bot] edited this page Jul 15, 2023 · 25 revisions

tiiuae-falcon-7b

Overview

Description: Falcon-7B is a large language model with 7 billion parameters. It is a causal decoder-only model developed by TII and trained on 1,500 billion tokens of RefinedWeb dataset, which was enhanced with curated corpora. The model is available under the Apache 2.0 license. It outperforms comparable open-source models and features an architecture optimized for inference. However, it is a raw, pretrained model that should be further finetuned for most use cases. The model is recommended for research on large language models and as a foundation for further specialization and finetuning for specific tasks. It should not be used in production without adequate assessment of risks and mitigation. The model carries biases commonly encountered online and is trained on English and French data only. The training details of Falcon-7B include information about the training data, training procedure, and hyperparameters used. It was trained on 384 A100 40GB GPUs using a 2D parallelism strategy combined with ZeRO. The model description mentions the architectural adaptations from the GPT-3 model, such as rotary positional embeddings, multiquery attention, and FlashAttention. > The above summary was generated using ChatGPT. Review the original model card to understand the data used to train the model, evaluation metrics, license, intended uses, limitations and bias before using the model. ### Inference samples Inference type|Python sample (Notebook)|CLI with YAML |--|--|--| Real time|text-generation-online-endpoint-dolly.ipynb|text-generation-online-endpoint-dolly.sh Batch |text-generation-batch-endpoint.ipynb| coming soon ### Sample inputs and outputs (for real-time inference) json { "input_data": { "input_string":["the meaning of life is"] } } #### Sample output json [ { "0": "the meaning of life is to find your gift. the purpose of life is to give it away." } ]

Version: 1

Tags

Featured Preview license : apache-2.0 task : text-generation inference_compute_allow_list : ['Standard_E8s_v3', 'Standard_E16s_v3', 'Standard_E32s_v3', 'Standard_E48s_v3', 'Standard_E64s_v3', 'Standard_F16s_v2', 'Standard_F32s_v2', 'Standard_F48s_v2', 'Standard_F64s_v2', 'Standard_F64s_v2', 'Standard_NC6s_v3', 'Standard_NC12s_v3', 'Standard_NC24s_v3', 'Standard_NC24rs_v3', 'Standard_ND40rs_v2', 'Standard_ND96asr_v4']

View in Studio: https://ml.azure.com/registries/azureml/models/tiiuae-falcon-7b/version/1

License: apache-2.0

Properties

SHA: 2f5c3cd4eace6be6c0f12981f377fb35e5bf6ee5

datasets: tiiuae/falcon-refinedweb

languages: en

inference-min-sku-spec: 8|0|64|128

inference-recommended-sku: Standard_NC12s_v3

Clone this wiki locally