With the rapid advancement in Large Language Models (LLMs), Deepseek-r1 has emerged as a top contender. Matching the performance of the GPT-o1 mode...
DeepSeek's cut-off date is the end of 2023, which is ancient compared to Gemini 2.0 (August 2024).
A recent cut-off date is imperative for front-end work or when querying anything about recent things.
This is why I will NOT use it.
On the other hand, Gemini is rubbish, so there's that.
Agreed, based on personal experience.
What can you expect? It's free lol
Gemini is BERT, which is a bidirectional model and not as good for Q&A usage as decoder-only models, e.g. GPT, Llama, etc.
So how do you explain that thousands of people have rated it so highly (for coding)?
I don't think they are all Google workers who are trying to boost their own product by rating it.
r1 comes with a lot of variants: The 1.5b variant could very easily run on a weak or older device, and the 8b variant works fine on my device.
I am running deepseek-r1 on my old Intel MacBook
Which variant? I doubt the full 671b variant works on a macbook. And everything under 32b is really only useful with fine tuning for specialized tasks.
Yes
Okay, I just checked: 'deepseek-r1:latest' is actually 'only' the 7b model. So, yeah, there's that.
For Windows, follow these steps:
Setting Up Deepseek-r1 (Using Windows CMD)
Install Ollama
Download the Deepseek-r1 Model
To test if the model is working, use this command:
If you see output in the terminal, Deepseek-r1 is ready to go!
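The "test" step can also be done from code instead of the terminal, by sending a prompt to the HTTP API that Ollama serves locally. This is only a minimal sketch: it assumes Ollama is running on its default port (11434) and that the model is available under the tag `deepseek-r1`. The helper names `build_request` and `ask` are made up for this example and are not part of Ollama.

```python
import json
import urllib.request

# Assumption: Ollama is running locally on its default port.
OLLAMA_URL = "http://localhost:11434/api/generate"


def build_request(prompt: str, model: str = "deepseek-r1") -> urllib.request.Request:
    """Build a non-streaming generate request for a local Ollama server."""
    payload = {"model": model, "prompt": prompt, "stream": False}
    return urllib.request.Request(
        OLLAMA_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def ask(prompt: str, model: str = "deepseek-r1", timeout: float = 120.0) -> str:
    """Send the prompt and return the generated text (needs a running server)."""
    with urllib.request.urlopen(build_request(prompt, model), timeout=timeout) as resp:
        reply = json.loads(resp.read().decode("utf-8"))
    # With "stream": False the whole answer arrives in the "response" field.
    return reply["response"]
```

Calling `ask("Say hello in one sentence.")` should return text if the model is installed and the server is up; a connection error usually means Ollama is not running.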
Setting Up the Continue.dev Extension
Install Visual Studio Code (VS Code)
Install the Continue.dev Extension
Connect Deepseek-r1 to Continue.dev
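For the "connect" step, the extension needs a model entry that points at Ollama. The sketch below generates such an entry as JSON. It assumes the JSON-based `config.json` layout that Continue has used (a `models` list with `title`, `provider` and `model` keys); the configuration format has changed between Continue releases, so check the extension's own docs before copying it. The function names and the title string are illustrative only.

```python
import json


def continue_ollama_config(model: str = "deepseek-r1",
                           title: str = "DeepSeek R1 (local)") -> dict:
    """Return a config fragment with one Ollama-backed model entry.

    Assumption: Continue reads a "models" list whose entries name a
    provider ("ollama") and the model tag known to that provider.
    """
    return {"models": [{"title": title, "provider": "ollama", "model": model}]}


def render_config(config: dict) -> str:
    """Serialize the fragment as indented JSON, ready to paste into a file."""
    return json.dumps(config, indent=2)
```

Printing `render_config(continue_ollama_config())` gives a fragment to merge into an existing configuration rather than a complete file.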
Using Deepseek-r1
Now you can use Deepseek-r1's features directly in VS Code:
Why Choose Deepseek-r1?
WhatsApp Status Message
"Deepseek-r1: Smarter than GPT, excelling in reasoning and code generation.
#DeepseekR1 #AI #Coding #WindowsCMD"
Start using Deepseek-r1 in your workflow and enjoy smarter, faster coding!
(hashnode.com/@askpravinjadhav)
Hey Varnit, I had a question for you after seeing this. How good is DeepSeek in terms of hallucinations? I ask because last night I was reading about LLM hallucination detection.
What I don't understand is how a small startup was able to build an LLM that beats ChatGPT. Even tech giants like Google, or Elon Musk, didn't build an LLM that can beat ChatGPT. All of a sudden a small Chinese company comes up with an LLM superior to OpenAI's ChatGPT. Did they use an LLM recipe that OpenAI does not know? Guys, if there is something I missed here, please enlighten me.
This is what you get when enthusiastic smart people try a different angle instead of just running after investors' money. Yes, DeepSeek R1 is trained differently. They used a rule/heuristics-based reward system and automated benchmarking during the training process... nothing new, but apparently nobody else thought of combining this with LLM training.
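To make "rule-based reward" a bit more concrete, here is a toy sketch of the idea: the score comes from simple checks on the model's output (is it formatted as asked, is the final answer right) instead of from a learned reward model. The tag names, the exact-match check and the weight are assumptions made for illustration; this is not DeepSeek's actual training code.

```python
import re

# Assumed output template: reasoning in <think>, final result in <answer>.
TEMPLATE = re.compile(
    r"^<think>(.+?)</think>\s*<answer>(.+?)</answer>$", re.DOTALL
)


def format_reward(completion: str) -> float:
    """1.0 if the completion follows the assumed template, else 0.0."""
    return 1.0 if TEMPLATE.match(completion.strip()) else 0.0


def accuracy_reward(completion: str, expected: str) -> float:
    """1.0 if the extracted answer equals the reference exactly, else 0.0."""
    match = TEMPLATE.match(completion.strip())
    if match is None:
        return 0.0
    return 1.0 if match.group(2).strip() == expected.strip() else 0.0


def total_reward(completion: str, expected: str, format_weight: float = 0.5) -> float:
    """Combine both rule checks; the weight is a hypothetical choice."""
    return accuracy_reward(completion, expected) + format_weight * format_reward(completion)
```

Because every check is a plain rule, the reward can be computed automatically for millions of samples, which is what makes this kind of setup easy to combine with automated benchmarking.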
Most obvious question: why would you use this instead of Copilot? Or am I comparing apples and oranges ;-)
Maybe to avoid sending data to a company you may or may not trust
"A company you cannot trust" - DeepSeek is Chinese, does your "data" go straight to the Communist Party? ;-)
Not if you run their models locally...
There was a better guide a few days ago already:
dev.to/shayy/run-deepseek-locally-...
Don't you need a PC with decent memory and processing capabilities to do this?
Not really. There are variants with fewer than 16B parameters, which most home machines can handle.
Huh... I did not know that.
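A rough way to see why the smaller variants fit on ordinary machines is to estimate the memory needed just for the weights: parameter count times bits per weight. The sketch below assumes 4-bit quantized weights and a made-up 20% overhead factor for runtime buffers; real usage depends on the quantization, context length and runtime, so treat the numbers as ballpark only.

```python
def estimate_weights_gib(params_billion: float, bits_per_weight: int = 4) -> float:
    """Approximate size of the weights alone, in GiB."""
    total_bytes = params_billion * 1e9 * bits_per_weight / 8
    return total_bytes / 2**30


def fits_in_ram(params_billion: float, ram_gib: float,
                bits_per_weight: int = 4, overhead: float = 1.2) -> bool:
    """Check the estimate against available RAM.

    The overhead factor is a hypothetical allowance, not a measured value.
    """
    return estimate_weights_gib(params_billion, bits_per_weight) * overhead <= ram_gib


def report(ram_gib: float, variants=(1.5, 7, 8, 32, 671)) -> dict:
    """Map each variant size (billions of parameters) to a fits/does-not-fit flag."""
    return {size: fits_in_ram(size, ram_gib) for size in variants}
```

Under these assumptions, `report(16)` marks the 1.5b, 7b and 8b variants as fitting in 16 GiB, while the 32b and 671b variants do not.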
Deepseek R1 in VS Code for free? That's a game-changer! Have you tried it yet? Curious to know how it compares to other AI coding assistants like Copilot or Codeium. firekirinus.xyz/
thank u
Is it completely free?
Yes
Nice
How can we use it???
For me, "Cursor" is still the better option for implementing software.
That autocomplete experience is wonderful.
But if I build some new software from scratch, then maybe this can work better than "Cursor".
I mean, reactivity is needed when working on existing software.
Considering the article below, I think Cursor is a hybrid AI agent, both reactive and deliberative.
geeksforgeeks.org/reactive-vs-deli...