fix: session state and save fix #30

Open
wants to merge 75 commits into
base: main
cccea65
Update .gitignore
Kacper-W-Kozdon May 4, 2024
3649768
.gitignore
Kacper-W-Kozdon May 4, 2024
802a022
.gitignore + docker
Kacper-W-Kozdon May 4, 2024
342cf3a
Merge branch 'samthakur587:main' into main
Kacper-W-Kozdon May 5, 2024
2e18241
Merge branch 'main' of https://github.com/Kacper-W-Kozdon/LLM_playground
Kacper-W-Kozdon May 5, 2024
a6521ee
minor leaderboard changes
Kacper-W-Kozdon May 7, 2024
d3d0742
minor leaderboard changes
Kacper-W-Kozdon May 7, 2024
eacf6a5
minor leaderboard changes
Kacper-W-Kozdon May 7, 2024
759279a
minor leaderboard changes
Kacper-W-Kozdon May 7, 2024
3425a9a
minor leaderboard changes
Kacper-W-Kozdon May 7, 2024
750d136
minor leaderboard changes
Kacper-W-Kozdon May 7, 2024
2b4b465
minor leaderboard changes
Kacper-W-Kozdon May 7, 2024
d6acd57
minor leaderboard changes
Kacper-W-Kozdon May 7, 2024
5fe54a7
minor leaderboard changes
Kacper-W-Kozdon May 7, 2024
cfc37e2
minor leaderboard changes
Kacper-W-Kozdon May 7, 2024
eaedfb8
minor leaderboard changes
Kacper-W-Kozdon May 7, 2024
3688d1c
minor leaderboard changes
Kacper-W-Kozdon May 7, 2024
9d836e7
minor leaderboard changes
Kacper-W-Kozdon May 7, 2024
5e0557e
minor leaderboard changes
Kacper-W-Kozdon May 7, 2024
6c11a20
minor leaderboard changes
Kacper-W-Kozdon May 8, 2024
1e27c8d
main test merge
Kacper-W-Kozdon May 8, 2024
84e75d8
main test merge
Kacper-W-Kozdon May 8, 2024
3134a17
Merge branch 'main' of https://github.com/Kacper-W-Kozdon/LLM_playground
Kacper-W-Kozdon May 8, 2024
aa78f6e
Merge branch 'samthakur587:main' into main
Kacper-W-Kozdon May 8, 2024
2b7cc9a
reset temp leaderboards on save
Kacper-W-Kozdon May 8, 2024
d6b3ceb
save added to the main page, fixed session state keys
Kacper-W-Kozdon May 10, 2024
a341c1a
names
Kacper-W-Kozdon May 10, 2024
6764052
names
Kacper-W-Kozdon May 10, 2024
809c429
Added light/dark mode switch
Kacper-W-Kozdon May 11, 2024
f31b4df
Added light/dark mode switch
Kacper-W-Kozdon May 11, 2024
e27d1ab
working light/dark switch
Kacper-W-Kozdon May 11, 2024
607cd93
working light/dark switch
Kacper-W-Kozdon May 11, 2024
c3dd56b
Added extra voting options
Kacper-W-Kozdon May 11, 2024
aab291b
Added hyperlink to google sheets
Kacper-W-Kozdon May 11, 2024
63fbc8e
Minor save fix
Kacper-W-Kozdon May 11, 2024
8567b42
added st.rerun() in model selection to increase responsiveness to cha…
Kacper-W-Kozdon May 11, 2024
e24c652
Minor fixes in save
Kacper-W-Kozdon May 11, 2024
2752c7b
Minor fixes in save
Kacper-W-Kozdon May 11, 2024
882e3b0
Minor fixes in save
Kacper-W-Kozdon May 11, 2024
712e33c
Minor fixes in save
Kacper-W-Kozdon May 11, 2024
6b011ab
FIXED ALL OF THE KNOWN SAVE ISSUES
Kacper-W-Kozdon May 11, 2024
5123ed0
control flow changes in voting
Kacper-W-Kozdon May 11, 2024
15dc059
control flow changes in voting
Kacper-W-Kozdon May 11, 2024
6666bc8
Minor fixes in save offline
Kacper-W-Kozdon May 11, 2024
463f558
Minor fixes in save offline
Kacper-W-Kozdon May 11, 2024
690087c
Minor fixes in save offline
Kacper-W-Kozdon May 11, 2024
f7a1c7b
Minor fixes in save offline
Kacper-W-Kozdon May 11, 2024
0cb7fcd
Minor fixes in save offline
Kacper-W-Kozdon May 12, 2024
1c15b51
Minor fixes in save offline
Kacper-W-Kozdon May 12, 2024
272b017
Minor fixes in save offline
Kacper-W-Kozdon May 12, 2024
ec156eb
Minor fixes in save offline
Kacper-W-Kozdon May 12, 2024
06b21db
Minor fixes in save offline
Kacper-W-Kozdon May 12, 2024
920d665
Minor fixes in save offline
Kacper-W-Kozdon May 12, 2024
f7af268
Minor fixes in save offline
Kacper-W-Kozdon May 12, 2024
b8e9248
Minor fixes in save offline
Kacper-W-Kozdon May 12, 2024
c395244
Minor fixes in save offline
Kacper-W-Kozdon May 12, 2024
2fbd160
Minor fixes in save offline
Kacper-W-Kozdon May 12, 2024
c0c2fb1
Minor fixes in save offline
Kacper-W-Kozdon May 12, 2024
456fe3d
Minor fixes in save offline
Kacper-W-Kozdon May 12, 2024
25bc860
Minor fixes in save offline
Kacper-W-Kozdon May 12, 2024
446c525
Minor fixes in save offline
Kacper-W-Kozdon May 12, 2024
364d717
Minor fixes in save offline
Kacper-W-Kozdon May 12, 2024
def71e7
Minor fixes in save offline
Kacper-W-Kozdon May 12, 2024
9c5d296
leaderboards display
Kacper-W-Kozdon May 12, 2024
2c71cb5
detailed save online fix
Kacper-W-Kozdon May 12, 2024
f7bd080
removed 'scores'
Kacper-W-Kozdon May 12, 2024
e6eaf96
removed 'scores'
Kacper-W-Kozdon May 12, 2024
5d88d18
fully fixed detailed view
Kacper-W-Kozdon May 12, 2024
8913af5
fully fixed detailed view
Kacper-W-Kozdon May 12, 2024
bd9032c
fully fixed detailed view
Kacper-W-Kozdon May 12, 2024
c7f4929
added Demo.mp4
Kacper-W-Kozdon May 12, 2024
4f920ee
Rename chatbot_arena.py to Chatbot_arena.py
Kacper-W-Kozdon May 12, 2024
27aacae
Rename 1_leaderboards.py to 1_Leaderboards.py
Kacper-W-Kozdon May 12, 2024
9917eaa
Rename 2_import_model.py to 2_Import_model.py
Kacper-W-Kozdon May 12, 2024
c1e9d37
Update README.md, add demo vid
Kacper-W-Kozdon May 12, 2024
3 changes: 2 additions & 1 deletion .gitignore
@@ -1,5 +1,6 @@
.env
code/
__pycache__/
.mypy_cache/
.pytest_cache/
.streamlit
.streamlit/secrets.toml
166 changes: 71 additions & 95 deletions chatbot_arena.py → Chatbot_arena.py
@@ -40,19 +40,23 @@ def select_model(api_key: str = "", authenticated: bool = False) -> None:
model1_other_disabled = True
model2_other_disabled = True

st.selectbox(
st.session_state.model1_selectbox = st.selectbox(
"Select the first model's endpoint:",
all_models,
options=all_models + ["I'm feeling lucky"],
disabled=disabled,
index=st.session_state.index_model1,
on_change=lambda: (
setattr(st.session_state, "chat_history1", []),
setattr(st.session_state, "chat_history2", []),
setattr(st.session_state, "winner_selected", False),
setattr(st.session_state, "winner_selected", True),
setattr(st.session_state, "prompt_provided", False),
setattr(st.session_state, "new_models_selected", True),
),
key="model1_selectbox",
)
if st.session_state.model1_selectbox == "I'm feeling lucky":
st.session_state.model1_selectbox = random.choice(all_models[:-1])
st.session_state.index_model1 = all_models.index(st.session_state.model1_selectbox)

if st.session_state.model1_selectbox == "other":
model1_other_disabled = False
st.text_input(
@@ -63,24 +67,29 @@ def select_model(api_key: str = "", authenticated: bool = False) -> None:
on_change=lambda: (
setattr(st.session_state, "chat_history1", []),
setattr(st.session_state, "chat_history2", []),
setattr(st.session_state, "winner_selected", False),
setattr(st.session_state, "winner_selected", True),
setattr(st.session_state, "prompt_provided", False),
setattr(st.session_state, "new_models_selected", True),
),
key="model1_other",
)
st.selectbox(
st.session_state.model2_selectbox = st.selectbox(
"Select the second model's endpoint:",
all_models,
options=all_models + ["I'm feeling lucky"],
disabled=disabled,
index=st.session_state.index_model2,
on_change=lambda: (
setattr(st.session_state, "chat_history1", []),
setattr(st.session_state, "chat_history2", []),
setattr(st.session_state, "winner_selected", False),
setattr(st.session_state, "winner_selected", True),
setattr(st.session_state, "prompt_provided", False),
setattr(st.session_state, "new_models_selected", True),
),
key="model2_selectbox",
)
if st.session_state.model2_selectbox == "I'm feeling lucky":
st.session_state.model2_selectbox = random.choice(all_models[:-1])
st.session_state.index_model2 = all_models.index(st.session_state.model2_selectbox)

if st.session_state.model2_selectbox == "other":
model2_other_disabled = False
st.text_input(
@@ -91,7 +100,8 @@ def select_model(api_key: str = "", authenticated: bool = False) -> None:
on_change=lambda: (
setattr(st.session_state, "chat_history1", []),
setattr(st.session_state, "chat_history2", []),
setattr(st.session_state, "winner_selected", False),
setattr(st.session_state, "winner_selected", True),
setattr(st.session_state, "prompt_provided", False),
setattr(st.session_state, "new_models_selected", True),
),
key="model2_other",
@@ -107,8 +117,8 @@ def select_model(api_key: str = "", authenticated: bool = False) -> None:
else st.session_state.model2_other
)

st.session_state.index_model1 = all_models.index(st.session_state.model1_selectbox)
st.session_state.index_model2 = all_models.index(st.session_state.model2_selectbox)
# st.session_state.index_model1 = all_models.index(st.session_state.model1_selectbox)
# st.session_state.index_model2 = all_models.index(st.session_state.model2_selectbox)
if st.session_state.model1_selectbox == "other":
st.session_state.value_model1_other = selected_model1
if st.session_state.model2_selectbox == "other":
@@ -121,6 +131,7 @@ def select_model(api_key: str = "", authenticated: bool = False) -> None:
st.session_state["model1"] = selected_models.pop(0)
st.session_state["model2"] = selected_models.pop(0)
st.session_state.new_models_selected = False
st.rerun()


def history(model: str = "model1", output: str = "") -> None:
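Review note: the "I'm feeling lucky" branch in the hunks above swaps the sentinel option for a random entry of `all_models[:-1]` and then stores that entry's index. A minimal sketch of the same resolution as a pure function, so it can be tested without Streamlit. The names `resolve_choice` and `LUCKY` are hypothetical, and the sketch assumes (the diff does not confirm it) that the last entry of `all_models` is the free-text `"other"` option, which would explain the `[:-1]` slice.

```python
import random
from typing import Optional, Sequence, Tuple

LUCKY = "I'm feeling lucky"  # sentinel option appended to the selectbox


def resolve_choice(
    choice: str,
    all_models: Sequence[str],
    rng: Optional[random.Random] = None,
) -> Tuple[str, int]:
    """Return (model, index into all_models) for a selectbox choice.

    Assumption: the last entry of `all_models` is the free-text "other"
    option, so a random pick is drawn from all_models[:-1] only.
    """
    rng = rng or random.Random()
    if choice == LUCKY:
        choice = rng.choice(list(all_models[:-1]))
    # list.index raises ValueError if the choice is not a known option.
    return choice, list(all_models).index(choice)
```

Returning the pair instead of writing to session state directly leaves it to the caller to decide under which keys the model and its index are stored.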
@@ -193,32 +204,6 @@ def input_api_key(api_key: str = " ") -> None:
st.sidebar.write(f"{r['error']}")


def print_history(contain: st.container) -> None:
"""Print the chat history in a streamlit split container.

Parameters
----------
contain
streamlit container to print the chat history into.

Returns
-------
None
"""

cont1, cont2 = contain
for i in st.session_state["chat_history1"]:
if i["role"] == "user":
cont1.write("🧑‍💻" + " " + i["content"])
else:
cont1.write(i["content"])
for i in st.session_state["chat_history2"]:
if i["role"] == "user":
cont2.write("🧑‍💻" + " " + i["content"])
else:
cont2.write(i["content"])


def call_model(Endpoint: str) -> AsyncUnify:
"""Prepare the Unify model to which the prompts will be sent.

@@ -260,6 +245,10 @@ async def main() -> None:
source = "offline"
helpers.init_session(source)

_, theme_col = st.columns([7, 1])
with theme_col:
helpers.Buttons.change_theme_button()

all_models = list(st.session_state.models)
json_data = st.session_state.leaderboard

@@ -294,7 +283,7 @@ async def main() -> None:
col11, col21 = st.columns(2)
# Display chat UI
with col11:
if st.session_state.winner_selected is True:
if all([st.session_state.winner_selected, st.session_state.prompt_provided]):
st.markdown(
"<span style='font-size:20px; color:blue;'>Model 1: "
+ st.session_state["model1"]
@@ -307,7 +296,7 @@ async def main() -> None:
unsafe_allow_html=True,
)
with col21:
if st.session_state.winner_selected is True:
if all([st.session_state.winner_selected, st.session_state.prompt_provided]):
st.markdown(
"<span style='font-size:20px; color:blue;'>Model 2: "
+ st.session_state["model2"]
@@ -331,7 +320,10 @@ async def main() -> None:
if prompt := st.chat_input(
"Say something",
disabled=False if st.session_state.api_key_provided is True else True,
on_submit=lambda: setattr(st.session_state, "winner_selected", False),
on_submit=lambda: (
setattr(st.session_state, "winner_selected", False),
setattr(st.session_state, "prompt_provided", True),
),
):
st.session_state["chat_input"] = prompt
st.session_state.code_input = prompt
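Review note: across these hunks, two flags drive the UI. Changing a model sets `winner_selected` to True and `prompt_provided` to False, submitting a prompt flips both, and every vote button sets `winner_selected` back to True. The model-name headers are gated on both flags being true, and the vote buttons are disabled while `winner_selected` is true. A minimal sketch of that flow as named transitions on a plain object, which could replace the lambdas that return tuples of `setattr` calls. `ArenaFlags` and its method names are hypothetical, and the default values are an assumption because `helpers.init_session` is not part of this diff.

```python
from dataclasses import dataclass


@dataclass
class ArenaFlags:
    """Stand-in for the two st.session_state flags used in the diff."""

    # Defaults are assumed; the real initial values live in helpers.init_session.
    winner_selected: bool = True
    prompt_provided: bool = False

    def on_models_changed(self) -> None:
        # New pair selected: nothing to vote on yet.
        self.winner_selected = True
        self.prompt_provided = False

    def on_prompt_submitted(self) -> None:
        # A prompt was sent: voting opens.
        self.winner_selected = False
        self.prompt_provided = True

    def on_vote(self) -> None:
        # Any of the vote buttons closes the round.
        self.winner_selected = True

    @property
    def vote_disabled(self) -> bool:
        # Same truth value as `True if all([winner_selected]) else False`.
        return bool(self.winner_selected)

    @property
    def names_revealed(self) -> bool:
        # Mirrors `all([winner_selected, prompt_provided])`.
        return self.winner_selected and self.prompt_provided
```

With Streamlit, each transition could be passed as a callback, for example `on_click=flags.on_vote`, assuming the object is kept in session state.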
@@ -343,7 +335,7 @@ async def main() -> None:
)
message1 = st.session_state["chat_history1"]
message2 = st.session_state["chat_history2"]
print_history(contain=(cont1, cont2))
helpers.print_history(contain=(cont1, cont2))
u1 = None
u2 = None
try:
@@ -360,7 +352,7 @@ async def main() -> None:
model1_to_add := st.session_state["model1"][
: st.session_state["model1"].find("@")
]
) not in data.keys():
) not in data["Model Name"]:
st.session_state["vote_counts"].at[f"{model1_to_add}", "Wins ⭐"] = 0
st.session_state["vote_counts"].at[f"{model1_to_add}", "Losses ❌"] = 0
st.session_state["vote_counts"].at[
@@ -392,7 +384,7 @@ async def main() -> None:
model2_to_add := st.session_state["model2"][
: st.session_state["model2"].find("@")
]
) not in data.keys():
) not in data["Model Name"]:
st.session_state["vote_counts"].at[f"{model2_to_add}", "Wins ⭐"] = 0
st.session_state["vote_counts"].at[f"{model2_to_add}", "Losses ❌"] = 0
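Review note: both hunks change the membership test from `data.keys()` to `data["Model Name"]`. Whether `in` then compares against model names depends on the shape of `data`, which this diff does not show. If `data["Model Name"]` is a list, `in` tests the names; if it is a mapping keyed by row index, `in` tests the keys, and a pandas Series likewise tests its index labels. A minimal sketch of a helper that makes the intent explicit (`known_model_names` is a hypothetical name):

```python
from collections.abc import Mapping
from typing import Any, Set


def known_model_names(column: Any) -> Set[str]:
    """Collect the values of a "Model Name" column as a set of strings.

    Accepts a list of names or a {row_index: name} mapping. For the
    mapping, a bare `name in column` would test the keys, not the names.
    """
    values = column.values() if isinstance(column, Mapping) else column
    return {str(v) for v in values}


# Illustrative data only; the real leaderboard structure may differ.
as_list = ["gpt-4", "mistral-small"]
as_mapping = {"0": "gpt-4", "1": "mistral-small"}
```

Here `"gpt-4" in as_mapping` is False because only the keys `"0"` and `"1"` are tested, while `"gpt-4" in known_model_names(as_mapping)` is True.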

@@ -447,74 +439,58 @@ async def call(unify_obj, model, contain, message):
call(u2, model="model2", contain=cont2, message=message2),
)

c1, c2 = st.columns(2)
c1, c2, c3, c4 = st.columns([3, 1, 3, 1])
# Display the vote buttons
vote_disabled = True if st.session_state.winner_selected in [None, True] else False

vote_disabled = (
True
if all([
st.session_state.winner_selected,
])
else False
)
with c1:
left_button_clicked = st.button(
"👍 Vote First Model",
"👍 Vote 1st Model",
disabled=vote_disabled,
on_click=lambda: setattr(st.session_state, "winner_selected", True),
on_click=lambda: (setattr(st.session_state, "winner_selected", True),),
)
if left_button_clicked:
st.balloons()
# Increase the vote count for the selected model by 1 when the button is clicked
model1 = st.session_state["model1"].split("@")[0]
model2 = st.session_state["model2"].split("@")[0]

st.session_state["vote_counts"].at[model1, "Wins ⭐"] += 1
st.session_state["vote_counts"].at[
st.session_state["model2"].split("@")[0], "Losses ❌"
] += 1
if (
model1 not in st.session_state.detailed_leaderboards["scores"].keys()
or model1 not in st.session_state.detailed_leaderboards["scores"].keys()
):
st.session_state.detailed_leaderboards["scores"].at[model1, model2] = 0
st.session_state.detailed_leaderboards["scores"].at[model1, model2] += 1
helpers.Buttons.left_button_clicked(cont1, cont2)

print_history(contain=(cont1, cont2))
try:
st.session_state.code_input = st.session_state["chat_history1"][-2][
"content"
]
except IndexError:
st.session_state.code_input = " "
with c2:
tie_button_clicked = st.button(
"👔 Vote Tie (1:1)",
disabled=vote_disabled,
on_click=lambda: (setattr(st.session_state, "winner_selected", True),),
)
if tie_button_clicked:
helpers.Buttons.tie_button(cont1, cont2)

with c3:
no_win_button_clicked = st.button(
"❌ No Winners (0:0)",
disabled=vote_disabled,
on_click=lambda: (setattr(st.session_state, "winner_selected", True),),
)
if no_win_button_clicked:
helpers.Buttons.no_win_button(cont1, cont2)

with c4:
right_button_clicked = st.button(
"👍 Vote Second Model",
"👍 Vote 2nd Model",
disabled=vote_disabled,
on_click=lambda: setattr(st.session_state, "winner_selected", True),
on_click=lambda: (setattr(st.session_state, "winner_selected", True),),
)
if right_button_clicked:
st.balloons()
# Increase the vote count for the selected model by 1 when the button is clicked
model1 = st.session_state["model1"].split("@")[0]
model2 = st.session_state["model2"].split("@")[0]

st.session_state["vote_counts"].at[model2, "Wins ⭐"] += 1
st.session_state["vote_counts"].at[
st.session_state["model1"].split("@")[0], "Losses ❌"
] += 1
if (
model2 not in st.session_state.detailed_leaderboards["scores"].keys()
or model1 not in st.session_state.detailed_leaderboards["scores"].keys()
):
st.session_state.detailed_leaderboards["scores"].at[model2, model1] = 0
st.session_state.detailed_leaderboards["scores"].at[model2, model1] += 1

print_history(contain=(cont1, cont2))
try:
st.session_state.code_input = st.session_state["chat_history2"][-2][
"content"
]
except IndexError:
st.session_state.code_input = " "
helpers.Buttons.right_button_clicked(cont1, cont2)
# Add custom CSS for the buttons
history_button_clicked = st.button("Clear History")
if history_button_clicked:
st.session_state["chat_history1"] = []
st.session_state["chat_history2"] = []
with st.sidebar:
helpers.Buttons.save_button()


if __name__ == "__main__":
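Review note: the vote handling moves from inline code into `helpers.Buttons`, and two new outcomes appear, "Vote Tie (1:1)" and "No Winners (0:0)". The helper bodies are not part of this diff, so the following is only a sketch of how the four outcomes could be recorded in a pairwise table. The point values are read off the button labels, `record_vote` and `OUTCOMES` are hypothetical names, and a nested dict stands in for the DataFrame that the removed code updated with `.at[model1, model2] += 1`.

```python
from collections import defaultdict
from typing import DefaultDict, Dict

Scores = DefaultDict[str, Dict[str, float]]

# Points awarded to (first model, second model) for each button.
OUTCOMES = {
    "first": (1.0, 0.0),      # "Vote 1st Model"
    "tie": (1.0, 1.0),        # "Vote Tie (1:1)"
    "no_winner": (0.0, 0.0),  # "No Winners (0:0)"
    "second": (0.0, 1.0),     # "Vote 2nd Model"
}


def strip_provider(endpoint: str) -> str:
    """Drop the "@provider" suffix, as the removed code did with split("@")[0]."""
    return endpoint.split("@")[0]


def record_vote(scores: Scores, endpoint1: str, endpoint2: str, outcome: str) -> None:
    """Add the outcome's points to scores[row][column] in both directions."""
    model1, model2 = strip_provider(endpoint1), strip_provider(endpoint2)
    points1, points2 = OUTCOMES[outcome]
    scores[model1][model2] = scores[model1].get(model2, 0.0) + points1
    scores[model2][model1] = scores[model2].get(model1, 0.0) + points2
```

Keeping the outcome table in one place means a fifth button would only need a new `OUTCOMES` entry.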
Binary file added Demo.mp4
Binary file not shown.
6 changes: 5 additions & 1 deletion README.md
@@ -1,4 +1,4 @@
<a href="https://unify-llmplayground.streamlit.app/" target="_blank">
<a href="https://llm-playground-unify.streamlit.app/" target="_blank">
<img src="app_screenshot_from_Kacper.png" alt="Screenshot">
</a>

@@ -61,6 +61,10 @@ This Streamlit application provides a user interface for interacting with Unify
---
## Usage


https://github.com/Kacper-W-Kozdon/LLM_playground/assets/102428159/077d3a5a-6025-4c5f-bfa9-a72f82190763


1. Input Unify API Key: Enter your Unify API key in the provided text input box on the sidebar.

2. Select endpoints: Choose the models and providers from the sidebar dropdown menus.
52 changes: 26 additions & 26 deletions detail_leaderboards.csv
@@ -1,26 +1,26 @@
,mixtral-8x7b-instruct-v0.1,llama-2-70b-chat,gpt-4-turbo,mistral-large,llama-2-7b-chat,gemma-2b-it,mistral-7b-instruct-v0.1,gemma-7b-it,llama-2-13b-chat,codellama-13b-instruct,yi-34b-chat,gpt-3.5-turbo,deepseek-coder-33b-instruct,llama-3-70b-chat,mistral-medium,mixtral-8x22b-instruct-v0.1,other,codellama-34b-instruct,llama-3-8b-chat,pplx-7b-chat,mistral-7b-instruct-v0.2,mistral-small,gpt-4,pplx-70b-chat,codellama-7b-instruct
mixtral-8x7b-instruct-v0.1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0
llama-2-70b-chat,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0
gpt-4-turbo,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0
mistral-large,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0
llama-2-7b-chat,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0
gemma-2b-it,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0
mistral-7b-instruct-v0.1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0
gemma-7b-it,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0
llama-2-13b-chat,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0
codellama-13b-instruct,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0
yi-34b-chat,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0
gpt-3.5-turbo,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0
deepseek-coder-33b-instruct,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0
llama-3-70b-chat,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0
mistral-medium,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0
mixtral-8x22b-instruct-v0.1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0
other,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0
codellama-34b-instruct,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0
llama-3-8b-chat,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0
pplx-7b-chat,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0
mistral-7b-instruct-v0.2,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0
mistral-small,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0
gpt-4,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0
pplx-70b-chat,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0
codellama-7b-instruct,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0
,codellama-13b-instruct,codellama-34b-instruct,codellama-7b-instruct,deepseek-coder-33b-instruct,gemma-2b-it,gemma-7b-it,gpt-3.5-turbo,gpt-4,gpt-4-turbo,llama-2-13b-chat,llama-2-70b-chat,llama-2-7b-chat,llama-3-70b-chat,llama-3-8b-chat,mistral-7b-instruct-v0.1,mistral-7b-instruct-v0.2,mistral-large,mistral-medium,mistral-small,mixtral-8x22b-instruct-v0.1,mixtral-8x7b-instruct-v0.1,other,pplx-70b-chat,pplx-7b-chat,yi-34b-chat
codellama-13b-instruct,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0
codellama-34b-instruct,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0
codellama-7b-instruct,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0
deepseek-coder-33b-instruct,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0
gemma-2b-it,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0
gemma-7b-it,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0
gpt-3.5-turbo,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0
gpt-4,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0
gpt-4-turbo,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,3.0,0.0,0.0,0.0,0.0
llama-2-13b-chat,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0
llama-2-70b-chat,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0
llama-2-7b-chat,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0
llama-3-70b-chat,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0
llama-3-8b-chat,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0
mistral-7b-instruct-v0.1,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0
mistral-7b-instruct-v0.2,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0
mistral-large,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0
mistral-medium,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0
mistral-small,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0
mixtral-8x22b-instruct-v0.1,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0
mixtral-8x7b-instruct-v0.1,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0
other,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0
pplx-70b-chat,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0
pplx-7b-chat,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0
yi-34b-chat,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0
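Review note: the rewritten `detail_leaderboards.csv` keeps the same square layout but sorts the labels alphabetically and stores floats instead of integers. A minimal sketch of a loader that reads such a file into a nested dict and checks that row and column labels match in order, using only the standard library. `load_pairwise` is a hypothetical name, and reading `table[row][column]` as the score of the row model against the column model is an assumption based on the removed `.at[model1, model2] += 1` code.

```python
import csv
import io
from typing import Dict


def load_pairwise(text: str) -> Dict[str, Dict[str, float]]:
    """Parse a square pairwise-score CSV whose first header cell is empty."""
    rows = [row for row in csv.reader(io.StringIO(text)) if row]
    columns = rows[0][1:]
    table = {
        row[0]: dict(zip(columns, (float(cell) for cell in row[1:])))
        for row in rows[1:]
    }
    if list(table) != columns:
        raise ValueError("row labels and column labels differ")
    return table


# Tiny illustrative input, not taken from the repository.
sample = ",a,b\na,0.0,3.0\nb,0.0,0.0\n"
table = load_pairwise(sample)
```

Because the cells are parsed with `float`, the same loader accepts both the old integer file and the new float file.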