components Retrieval Augmented Generation documentation

Retrieval Augmented Generation

Components in this category

llm_autoprompt_qna

A command component for Zero/Few Shot Learning with GPT based models for Question Answering tasks.
llm_dbcopilot_create_promptflow
llm_dbcopilot_deploy_endpoint
llm_dbcopilot_grounding
llm_dbcopilot_grounding_ground_samples
llm_ingest_dataset_to_acs

Single job pipeline to chunk data from AzureML data asset, and create ACS embeddings index
llm_ingest_dataset_to_acs_basic

Single job pipeline to chunk data from AzureML data asset, and create ACS embeddings index
llm_ingest_dataset_to_acs_qa_only

Single job pipeline to chunk data from AzureML data asset, and create ACS embeddings index
llm_ingest_dataset_to_faiss

Single job pipeline to chunk data from AzureML data asset, and create FAISS embeddings index
llm_ingest_dataset_to_faiss_basic

Single job pipeline to chunk data from AzureML data asset, and create FAISS embeddings index
llm_ingest_dataset_to_faiss_qa_only

Single job pipeline to chunk data from AzureML data asset, and create FAISS embeddings index
llm_ingest_db_to_acs

Single job pipeline to chunk data from AzureML sql data store, and create ACS embeddings index
llm_ingest_db_to_faiss

Single job pipeline to chunk data from AzureML sql data store, and create FAISS embeddings index
llm_ingest_dbcopilot_acs_e2e

Single job pipeline to chunk data from AzureML DB Datastore and create acs embeddings index
llm_ingest_dbcopilot_faiss_e2e

Single job pipeline to chunk data from AzureML DB Datastore and create faiss embeddings index
llm_ingest_existing_acs

Single job pipeline to import embedded data from ACS index, and create MlIndex, generate test/prompt data, and create PF
llm_ingest_existing_acs_basic

Single job pipeline to import embedded data from ACS index, and create MlIndex, generate test/prompt data, and create PF
llm_ingest_existing_acs_qa_only

Single job pipeline to import embedded data from ACS index, and create MlIndex, generate test/prompt data, and create PF
llm_ingest_git_to_acs

Single job pipeline to import data from Github, chunk, and create embeddings index
llm_ingest_git_to_acs_basic

Single job pipeline to import data from Github, chunk, and create embeddings index
llm_ingest_git_to_acs_qa_only

Single job pipeline to import data from Github, chunk, and create embeddings index
llm_ingest_git_to_faiss

Single job pipeline to import data from Github, chunk, and create FAISS embeddings index
llm_ingest_git_to_faiss_basic

Single job pipeline to import data from Github, chunk, and create FAISS embeddings index
llm_ingest_git_to_faiss_qa_only

Single job pipeline to import data from Github, chunk, and create FAISS embeddings index
llm_rag_crack_and_chunk

Creates chunks no larger than chunk_size from input_data, extracted document titles are prepended to each chunk

LLM models have token limits for the prompts passed to them, this is a limiting factor at embedding time and even more limiting at prompt completion time as only so much context ca...

llm_rag_crack_and_chunk_and_embed

Creates chunks no larger than chunk_size from input_data, extracted document titles are prepended to each chunk

LLM models have token limits for the prompts passed to them, this is a limiting factor at embedding time and even more limiting at prompt completion time as only so much context ca...

llm_rag_create_faiss_index

Creates a FAISS index from embeddings. The index will be saved to the output folder. The index will be registered as a Data Asset named asset_name if register_output is set to True.
llm_rag_create_promptflow

This component is used to create a RAG flow based on your mlindex data and best prompts. The flow will look into your indexed data and give answers based on your own data context. The flow also provides the capability to bulk test with any built-in or custom evaluation flows.
llm_rag_data_import_acs

Collects documents from Azure Cognitive Search Index, extracts their contents, saves them to a uri folder, and creates an MLIndex yaml file to represent the search index.

Documents collected can then be used in other components without having to query the ACS index again, allowing for a consiste...

llm_rag_generate_embeddings

Generates embeddings vectors for data chunks read from chunks_source.

chunks_source is expected to contain csv files containing two columns:

"Chunk" - Chunk of text to be embedded
"Metadata" - JSON object containing metadata for the chunk

If embeddings_container is supplied, input c...

llm_rag_generate_embeddings_parallel

Generates embeddings vectors for data chunks read from chunks_source.

chunks_source is expected to contain csv files containing two columns:

"Chunk" - Chunk of text to be embedded
"Metadata" - JSON object containing metadata for the chunk

If previous_embeddings is supplied, input ch...

llm_rag_git_clone

Clones a git repository to output_data path
llm_rag_qa_data_generation

Generates a test dataset of questions and answers based on the input documents.

A chunk of text is read from each input document and sent to the specified LLM with a prompt to create a question and answer based on that text. These question, answer, and context sets are saved as either a csv or j...

llm_rag_register_autoprompt_data_asset

Registers a QA data csv or json and supporting files as an AzureML data asset
llm_rag_register_mlindex_asset

Registers a MLIndex yaml and supporting files as an AzureML data asset
llm_rag_register_qa_data_asset

Registers a QA data csv or json and supporting files as an AzureML data asset
llm_rag_update_acs_index

Uploads embeddings into Azure Cognitive Search instance specified in acs_config. The Index will be created if it doesn't exist.

The Index will have the following fields populated:

"id", String, key=True
"content", String,
"content_vector_(open_ai|hugging_face)", Collection(Single)
"c...
llm_rag_validate_deployments

Validates that completion model, embedding model, and Azure Cognitive Search resource deployments is successful and connections works. For default AOAI, it attempts to create the deployments if not valid or present. This validation is done only if customer is using Azure Open AI models or creatin...

Wiki menu

Home
Reference Documentation
- Components
- Data
- Environments
- Models
Contributing

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

components Retrieval Augmented Generation documentation

Retrieval Augmented Generation

Components in this category

Wiki menu

Clone this wiki locally