-
Notifications
You must be signed in to change notification settings - Fork 0
Open
Description
Approach:
-
use a larger context window
-
repeat and concatenate, with higher temperature
-
multiply programmatically (post-process)
LLM - Mistral 7B
Context size: a sliding 4K window - see https://huggingface.co/mistralai/Mistral-7B-v0.1/discussions/4
There are many flavours of Mistral-7B:
- regular mistral-7B, quantized
- TheBloke/Mistral-7B-v0.1-GGUF = https://huggingface.co/TheBloke/Mistral-7B-v0.1-GGUF
- cognitivecomputations/dolphin-2.6-mistral-7b-dpo
- good for coding, has 16K context (note: is not 'ethically trained'
- https://huggingface.co/cognitivecomputations/dolphin-2.6-mistral-7b-dpo
- quantized = https://huggingface.co/TheBloke/dolphin-2.6-mistral-7B-dpo-GPTQ
-
- mistral-7B with larger context window
- TheBloke_Writing_Partner_Mistral_7B-GPTQ_gptq-8bit-32g-actorder_True = https://huggingface.co/TheBloke/Writing_Partner_Mistral_7B-GPTQ/tree/gptq-8bit-32g-actorder_True
4 - instruct prompting, and newer:
- a base version, suitable for FT
- hermes-2.5 - mistral-7B
- https://huggingface.co/mlabonne/NeuralHermes-2.5-Mistral-7B
- quantized = https://huggingface.co/TheBloke/NeuralHermes-2.5-Mistral-7B-AWQ
- phi from Microsoft
- phi-1.5 (from Microsoft) - Open LLM leaderboard
= https://huggingface.co/microsoft/phi-1_5 - phi-2 = https://huggingface.co/microsoft/phi-2
Metadata
Metadata
Assignees
Labels
No labels