-
Notifications
You must be signed in to change notification settings - Fork 0
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
- Loading branch information
1 parent
dd40dd8
commit 06cca70
Showing
1 changed file
with
29 additions
and
0 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,29 @@ | ||
--- | ||
title: Mistral Large | ||
authors: | ||
- Patrik Björklund | ||
share: true | ||
date: 2024-02-29 07:35:00 +0200 | ||
summary: Mistral releases a new model. Mistral large. Does not seem to beat GPT-4 | ||
tags: | ||
- AI | ||
series: | ||
- AI | ||
--- | ||
|
||
Mistral has launched its latest AI model, Mistral Large, boasting enhanced reasoning and multilingual capabilities. | ||
|
||
My key question: Is Mistral Large more adept at writing semi-complicated Python scripts than its competitor, GPT-4? | ||
|
||
I did a straightforward, non-scientific test (have two chats with it) to determine the models' proficiency in generating Python code with specific instructions. | ||
|
||
Verdict: | ||
> GPT-4 maintains its edge in accurately writing Python scripts that adhere to the given instructions. | ||
Mistral's introduction of both Mistral Large and Mistral Small models marks a significant step in their commitment to providing versatile AI solutions and provides healthy competition. | ||
|
||
Mistral Large's performance in various benchmarks highlights its potential in multiple domains, even as it falls short in direct comparison with GPT-4 for most things. | ||
|
||
The company has also announced partnerships and plans for future enhancements, including improvements in function calling and JSON formatting, promising a comprehensive toolset for developers and organizations. | ||
|
||
Good to see - but not a true shakeup. |