llm/qwen/README.md
32 additions & 5 deletions
@@ -1,14 +1,22 @@
-# Serving Qwen2 on Your Own Kubernetes or Cloud
+# Serving Qwen3/Qwen2 on Your Own Kubernetes or Cloud
 
 [Qwen2](https://github.com/QwenLM/Qwen2) is one of the top open LLMs.
 As of Jun 2024, Qwen1.5-110B-Chat is ranked higher than GPT-4-0613 on the [LMSYS Chatbot Arena Leaderboard](https://chat.lmsys.org/?leaderboard).
 
-**Update (Sep 18, 2024) -** SkyPilot now supports the [**Qwen2.5**](https://qwenlm.github.io/blog/qwen2.5/) model!
+**Update (Apr 28, 2025) -** SkyPilot now supports the [**Qwen3**](https://qwenlm.github.io/blog/qwen3/) model!
+
+📰 **Update (Sep 18, 2024) -** SkyPilot now supports the [**Qwen2.5**](https://qwenlm.github.io/blog/qwen2.5/) model!
 
 📰 **Update (Jun 6, 2024) -** SkyPilot now also supports the [**Qwen2**](https://qwenlm.github.io/blog/qwen2/) model! It further improves the competitive model, Qwen1.5.
 
 📰 **Update (April 26, 2024) -** SkyPilot now also supports the [**Qwen1.5-110B**](https://qwenlm.github.io/blog/qwen1.5-110b/) model! It performs competitively with Llama-3-70B across a [series of evaluations](https://qwenlm.github.io/blog/qwen1.5-110b/#model-quality). Use [qwen15-110b.yaml](https://github.com/skypilot-org/skypilot/blob/master/llm/qwen/qwen15-110b.yaml) to serve the 110B model.
@@ -32,7 +40,7 @@ After [installing SkyPilot](https://docs.skypilot.co/en/latest/getting-started/i
 1. Start serving Qwen 110B on a single instance with any available GPU in the list specified in [qwen15-110b.yaml](https://github.com/skypilot-org/skypilot/blob/master/llm/qwen/qwen15-110b.yaml) with a vLLM powered OpenAI-compatible endpoint (You can also switch to [qwen25-72b.yaml](https://github.com/skypilot-org/skypilot/blob/master/llm/qwen/qwen25-72b.yaml) or [qwen25-7b.yaml](https://github.com/skypilot-org/skypilot/blob/master/llm/qwen/qwen25-7b.yaml) for a smaller model):
 
 ```console
-sky launch -c qwen qwen15-110b.yaml
+sky launch -c qwen qwen3-235b.yaml
 ```
 2. Send a request to the endpoint for completion:
 ```bash
@@ -41,7 +49,7 @@ ENDPOINT=$(sky status --endpoint 8000 qwen)
The concept of "the best food" is highly subjective and depends on personal preferences, cultural background, dietary needs, and even mood! For example:
+
+- **Some crave comfort foods** like macaroni and cheese, ramen, or dumplings.
+- **Others prioritize health** and might highlight dishes like quinoa bowls, grilled salmon, or fresh salads.
+- **Global favorites** often include pizza, sushi, tacos, or curry.
+- **Unique or adventurous eaters** might argue for dishes like insects, fermented foods, or molecular gastronomy creations.
+
+Could you clarify what you mean by "best"? For instance:
+- Are you asking about taste, health benefits, cultural significance, or something else?
+- Are you looking for a specific dish, ingredient, or cuisine?